TY  - JOUR
T1  - Data-Intensive Architecture for Scientific Knowledge Discovery
JF  - Distributed and Parallel Databases
Y1  - 2012
A1  - Atkinson, Malcolm P.
A1  - Chee Sun Liew
A1  - Michelle Galea
A1  - Paul Martin
A1  - Krause, Amrey
A1  - Adrian Mouat
A1  - Oscar Corcho
A1  - Snelling, David
KW  - Knowledge discovery, workflow management system
AB  - This paper presents a data-intensive architecture that demonstrates the ability to support applications from a wide range of application domains, and support the different types of users involved in defining, designing and executing data-intensive processing tasks. The prototype architecture is introduced, and the pivotal role of DISPEL as a canonical language is explained. The architecture promotes the exploration and exploitation of distributed and heterogeneous data and spans the complete knowledge discovery process, from data preparation, to analysis, to evaluation and reiteration. The architecture evaluation included large-scale applications from astronomy, cosmology, hydrology, functional genetics, imaging processing and seismology.
VL  - 30
UR  - http://dx.doi.org/10.1007/s10619-012-7105-3
IS  - 5
ER  -