Latest news

Publication

C2MS: Dynamic Monitoring and Management of Cloud Infrastructures

9 years 2 months ago
Presentation

Ad hoc Cloud Computing

9 years 2 months ago
Publication

Evolutionary Computation and Constraint Satisfaction

9 years 3 months ago
Publication

Ad hoc Cloud Computing

9 years 4 months ago
Software release

DICOM Confidential 1.4.4 released

9 years 7 months ago
Story

Congratulations to Gary McGilvary on his PhD

10 years 1 week ago
Publication

Ad hoc Cloud Computing (PhD Thesis)

10 years 1 week ago
Publication

Quantification of Ultra-Widefield Retinal Images

10 years 1 month ago
Publication

Precise montaging and metric quantification of retinal surface area from ultra-widefield fundus photography and fluorescein angiography

10 years 1 month ago
Software release

New DICOM Confidential Release

10 years 2 months ago

Historical Interest Only

This is a static HTML version of an old Drupal site. The site is no longer maintained and could be deleted at any point. It is only here for historical interest.

Create Parallel Data Mining Algorithms for Cloud Computing

13 January 2009 - 9:22am — Jano.van.Hemert

Student:

Tantana Saengngam

Grade:

first

Principle goal: to take an existing algorithm and to make it parallel in a cloud computing environment following the Map and Reduce approach of Google.

Research at Google in combination with their vast computational resources have led to interesting ways of making algorithms parallel with the aim to make them faster for problems with large amount of input data [1]. Data mining is such an area where this same principle can apply, assuming algorithms can be run in parallel in a similar fashion. Important to note is that not only the algorithm itself, but also the processes in which it is embedded are distributed. For example, the data may need to be integrated, cleaned and transformed before supplied to the data mining algorithm.

In this project, you will take an algorithm used in a specific project where the aim is to automatically classify anatomical components that exhibit gene expression patterns. These patterns are taken from images taken from stained embryo sections. It is then your task to make the data mining algorithm parallel using the map and reduce principle and then execute your implementation of it on a cloud computing infrastructure, such as Eucalyptus [2].

Project status:

Finished

Degree level:

MSc

Supervisors @ NeSC:

Jano.van.Hemert

Liangxiu.Han

Subject areas:

Algorithm Design

Computer Architecture

Distributed Systems

Machine Learning/Neural Networks/Connectionist Computing

Student project type:

MSc student project

References:

[1] C.-T. Chu, S. K. Kim, Y.-A. Lin, Y. Yu, G. R. Bradski, A. Y. Ng, and K. Olukotun. Map-reduce for machine learning on multicore. In B. Schölkopf, J. C. Platt, and T. Hoffman, editors, NIPS, pages 281–288. MIT Press, 2006. [2] http://eucalyptus.cs.ucsb.edu/

Main menu

Latest news

Pages

You are here

Historical Interest Only

Create Parallel Data Mining Algorithms for Cloud Computing

Search form

Main menu

Latest news

Pages

You are here

Historical Interest Only

Create Parallel Data Mining Algorithms for Cloud Computing