Latest news

Publication

C2MS: Dynamic Monitoring and Management of Cloud Infrastructures

9 years 2 months ago
Presentation

Ad hoc Cloud Computing

9 years 2 months ago
Publication

Evolutionary Computation and Constraint Satisfaction

9 years 3 months ago
Publication

Ad hoc Cloud Computing

9 years 5 months ago
Software release

DICOM Confidential 1.4.4 released

9 years 7 months ago
Story

Congratulations to Gary McGilvary on his PhD

10 years 1 week ago
Publication

Ad hoc Cloud Computing (PhD Thesis)

10 years 1 week ago
Publication

Quantification of Ultra-Widefield Retinal Images

10 years 1 month ago
Publication

Precise montaging and metric quantification of retinal surface area from ultra-widefield fundus photography and fluorescein angiography

10 years 1 month ago
Software release

New DICOM Confidential Release

10 years 2 months ago

Historical Interest Only

This is a static HTML version of an old Drupal site. The site is no longer maintained and could be deleted at any point. It is only here for historical interest.

A Generic Parallel Processing Model for Facilitating Data Mining and Integration

4 March 2011 - 4:17pm — Chee.Sun.Liew

Title	A Generic Parallel Processing Model for Facilitating Data Mining and Integration
Publication Type	Journal Article
Year of Publication	2011
Authors	Han, L, Liew, CS, van Hemert, J, Atkinson, M
Journal Title	Parallel Computing
Volume	37
Issue	3
Pages	157 - 171
Keywords	Data Mining and Data Integration (DMI); Life Sciences; OGSA-DAI; Parallelism; Pipeline Streaming; workflow
Abstract	To facilitate Data Mining and Integration (DMI) processes in a generic way, we investigate a parallel pipeline streaming model. We model a DMI task as a streaming data-flow graph: a directed acyclic graph (DAG) of Processing Elements PEs. The composition mechanism links PEs via data streams, which may be in memory, buffered via disks or inter-computer data-flows. This makes it possible to build arbitrary DAGs with pipelining and both data and task parallelisms, which provides room for performance enhancement. We have applied this approach to a real DMI case in the Life Sciences and implemented a prototype. To demonstrate feasibility of the modelled DMI task and assess the efficiency of the prototype, we have also built a performance evaluation model. The experimental evaluation results show that a linear speedup has been achieved with the increase of the number of distributed computing nodes in this case study.
DOI	10.1016/j.parco.2011.02.006
Full Text

Main menu

Latest news

Pages

You are here

Historical Interest Only

A Generic Parallel Processing Model for Facilitating Data Mining and Integration

Search form

Main menu

Latest news

Pages

You are here

Historical Interest Only

A Generic Parallel Processing Model for Facilitating Data Mining and Integration