TY  - BOOK
T1  - The DATA Bonanza: Improving Knowledge Discovery in Science, Engineering, and Business
T2  - Wiley Series on Parallel and Distributed Computing (Editor: Albert Y. Zomaya)
Y1  - 2013
A1  - Atkinson, Malcolm P.
A1  - Baxter, Robert M.
A1  - Peter Brezany
A1  - Oscar Corcho
A1  - Michelle Galea
A1  - Parsons, Mark
A1  - Snelling, David
A1  - van Hemert, Jano
KW  - Big Data
KW  - Data Intensive
KW  - data mining
KW  - Data Streaming
KW  - Databases
KW  - Dispel
KW  - Distributed Computing
KW  - Knowledge Discovery
KW  - Workflows
AB  - With the digital revolution opening up tremendous opportunities in many fields, there is a growing need for skilled professionals who can develop data-intensive systems and extract information and knowledge from them. This book frames for the first time a new systematic approach for tackling the challenges of data-intensive computing, providing decision makers and technical experts alike with practical tools for dealing with our exploding data collections.  Emphasising data-intensive thinking and interdisciplinary collaboration,  The DATA Bonanza: Improving Knowledge Discovery in Science, Engineering, and Business examines the essential components of knowledge discovery, surveys many of the current research efforts worldwide, and points to new areas for innovation. Complete with a wealth of examples and DISPEL-based methods demonstrating how to gain more from data in real-world systems, the book:  * Outlines the concepts and rationale for implementing data-intensive computing in organisations  * Covers from the ground up problem-solving strategies for data analysis in a data-rich world  * Introduces techniques for data-intensive engineering using the Data-Intensive Systems Process Engineering Language DISPEL  * Features in-depth case studies in customer relations, environmental hazards, seismology, and more  * Showcases successful applications in areas ranging from astronomy and the humanities to transport engineering  * Includes sample program snippets throughout the text as well as additional materials on a companion website  The DATA Bonanza is a must-have guide for information strategists, data analysts, and engineers in business, research, and government, and for anyone wishing to be on the cutting edge of data mining, machine learning, databases, distributed systems, or large-scale computing.
JF  - Wiley Series on Parallel and Distributed Computing (Editor: Albert Y. Zomaya)
PB  - John Wiley & Sons Inc.
SN  - 978-1-118-39864-7
ER  - 

TY  - CHAP
T1  - The Data-Intensive Survival Guide
T2  - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1  - 2013
A1  - Malcolm Atkinson
ED  - Malcolm Atkinson
ED  - Rob Baxter
ED  - Peter Brezany
ED  - Oscar Corcho
ED  - Michelle Galea
ED  - Parsons, Mark
ED  - Snelling, David
ED  - van Hemert, Jano
KW  - Data-Analysis Experts
KW  - Data-Intensive Architecture
KW  - Data-intensive Computing
KW  - Data-Intensive Engineers
KW  - Datascopes
KW  - Dispel
KW  - Domain Experts
KW  - Intellectual Ramps
KW  - Knowledge Discovery
KW  - Workflows
AB  - Chapter 3: "The data-intensive survival guide", presents an overview of all of the elements of the proposed data-intensive strategy. Sufficient detail is presented for readers to understand the principles and practice that we recommend. It should also provide a good preparation for readers who choose to sample later chapters. It introduces three professional viewpoints: domain experts, data-analysis experts, and data-intensive engineers. Success depends on a balanced approach that develops the capacity of all three groups. A data-intensive architecture provides a flexible framework for that balanced approach. This enables the three groups to build and exploit data-intensive processes that incrementally step from data to results. A language is introduced to describe these incremental data processes from all three points of view. The chapter introduces ‘datascopes’ as the productized data-handling environments and ‘intellectual ramps’ as the ‘on ramps’  for the highways from data to knowledge.
JF  - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB  - John Wiley & Sons Ltd.
ER  - 

TY  - CHAP
T1  - Data-Intensive Thinking with DISPEL
T2  - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1  - 2013
A1  - Malcolm Atkinson
ED  - Malcolm Atkinson
ED  - Rob Baxter
ED  - Peter Brezany
ED  - Oscar Corcho
ED  - Michelle Galea
ED  - Parsons, Mark
ED  - Snelling, David
ED  - van Hemert, Jano
KW  - Data-Intensive Machines
KW  - Data-Intensive Thinking, Data-intensive Computing
KW  - Dispel
KW  - Distributed Computing
KW  - Knowledge Discovery
AB  - Chapter 4: "Data-intensive thinking with DISPEL", engages the reader with technical issues and solutions, by working through a sequence of examples, building up from a sketch of a solution to a large-scale data challenge. It uses the DISPEL language extensively, introducing its concepts and constructs. It shows how DISPEL may help designers, data-analysts, and engineers develop  solutions to the requirements emerging in any data-intensive application domain. The reader is taken through simple steps initially, this then builds to conceptually complex steps that are necessary to cope with the realities of real data providers, real data, real distributed systems, and long-running processes.
JF  - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB  - John Wiley & Sons Inc.
ER  - 

TY  - CHAP
T1  - Definition of the DISPEL Language
T2  - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1  - 2013
A1  - Paul Martin
A1  - Yaikhom, Gagarine
ED  - Malcolm Atkinson
ED  - Rob Baxter
ED  - Peter Brezany
ED  - Oscar Corcho
ED  - Michelle Galea
ED  - Parsons, Mark
ED  - Snelling, David
ED  - van Hemert, Jano
KW  - Data Streaming
KW  - Data-intensive Computing
KW  - Dispel
AB  - Chapter 10: "Definition of the DISPEL language", describes the novel aspects of the DISPEL language: its constructs, capabilities, and anticipated programming style.
JF  - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
T3  - {Parallel and Distributed Computing, series editor Albert Y. Zomaya}
PB  - John Wiley & Sons Inc.
ER  - 

TY  - CHAP
T1  - DISPEL Development
T2  - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1  - 2013
A1  - Adrian Mouat
A1  - Snelling, David
ED  - Malcolm Atkinson
ED  - Rob Baxter
ED  - Peter Brezany
ED  - Oscar Corcho
ED  - Michelle Galea
ED  - Parsons, Mark
ED  - Snelling, David
ED  - van Hemert, Jano
KW  - Diagnostics
KW  - Dispel
KW  - IDE
KW  - Libraries
KW  - Processing Elements
AB  - Chapter 11: "DISPEL development", describes the tools and libraries that a DISPEL developer might expect to use. The tools include those needed during process definition, those required to organize enactment, and diagnostic aids for developers of applications and platforms.
JF  - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB  - John Wiley & Sons Inc.
ER  - 

TY  - CHAP
T1  - DISPEL Enactment
T2  - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1  - 2013
A1  - Chee Sun Liew
A1  - Krause, Amrey
A1  - Snelling, David
ED  - Malcolm Atkinson
ED  - Rob Baxter
ED  - Peter Brezany
ED  - Oscar Corcho
ED  - Michelle Galea
ED  - Parsons, Mark
ED  - Snelling, David
ED  - van Hemert, Jano
KW  - Data Streaming
KW  - Data-Intensive Engineering
KW  - Dispel
KW  - Workflow Enactment
AB  - Chapter 12: "DISPEL enactment", describes the four stages of DISPEL enactment. It is targeted at the data-intensive engineers who implement enactment services.
JF  - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB  - John Wiley & Sons Inc.
ER  - 

TY  - CHAP
T1  - Platforms for Data-Intensive Analysis
T2  - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1  - 2013
A1  - Snelling, David
ED  - Malcolm Atkinson
ED  - Baxter, Robert M.
ED  - Peter Brezany
ED  - Oscar Corcho
ED  - Michelle Galea
ED  - Parsons, Mark
ED  - Snelling, David
ED  - van Hemert, Jano
KW  - Data-Intensive Engineering
KW  - Data-Intensive Systems
KW  - Dispel
KW  - Distributed Systems
AB  - Part III: "Data-intensive engineering", is targeted at technical experts who will develop complex applications, new components, or data-intensive platforms.  The techniques introduced may be applied very widely; for example, to any data-intensive distributed application, such as index generation, image processing, sequence comparison, text analysis, and sensor-stream monitoring. The challenges, methods, and implementation requirements are illustrated by making extensive use of DISPEL.    Chapter 9: "Platforms for data-intensive analysis", gives a reprise of data-intensive architectures, examines the business case for investing in them, and introduces the stages of data-intensive workflow enactment.
JF  - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB  - John Wiley & Sons Ltd.
ER  - 

TY  - RPRT
T1  - Dispel Tutorial
Y1  - 2012
A1  - Paul Martin
KW  - Dispel
AB  - Dispel is a strongly-typed imperative language for generating executable workflows for data-intensive distributed applications, particularly (but not exclusively) for use in computational sciences such as bioinformatics, astronomy and seismology — it has been designed to be a portable lingua franca by which researchers can interact with complex distributed research infrastructures without detailed knowledge of the underlying computational middleware, all in order to more easily conduct experiments in data integration, simulation and data-intensive modelling.    This document is a tutorial for Dispel.
ER  -