TY - BOOK
T1 - The DATA Bonanza: Improving Knowledge Discovery in Science, Engineering, and Business
T2 - Wiley Series on Parallel and Distributed Computing (Editor: Albert Y. Zomaya)
Y1 - 2013
A1 - Atkinson, Malcolm P.
A1 - Baxter, Robert M.
A1 - Peter Brezany
A1 - Oscar Corcho
A1 - Michelle Galea
A1 - Parsons, Mark
A1 - Snelling, David
A1 - van Hemert, Jano
KW - Big Data
KW - Data Intensive
KW - data mining
KW - Data Streaming
KW - Databases
KW - Dispel
KW - Distributed Computing
KW - Knowledge Discovery
KW - Workflows
AB - With the digital revolution opening up tremendous opportunities in many fields, there is a growing need for skilled professionals who can develop data-intensive systems and extract information and knowledge from them. This book frames for the first time a new systematic approach for tackling the challenges of data-intensive computing, providing decision makers and technical experts alike with practical tools for dealing with our exploding data collections. Emphasising data-intensive thinking and interdisciplinary collaboration, The DATA Bonanza: Improving Knowledge Discovery in Science, Engineering, and Business examines the essential components of knowledge discovery, surveys many of the current research efforts worldwide, and points to new areas for innovation. Complete with a wealth of examples and DISPEL-based methods demonstrating how to gain more from data in real-world systems, the book: * Outlines the concepts and rationale for implementing data-intensive computing in organisations * Covers from the ground up problem-solving strategies for data analysis in a data-rich world * Introduces techniques for data-intensive engineering using the Data-Intensive Systems Process Engineering Language DISPEL * Features in-depth case studies in customer relations, environmental hazards, seismology, and more * Showcases successful applications in areas ranging from astronomy and the humanities to transport engineering * Includes sample program snippets throughout the text as well as additional materials on a companion website The DATA Bonanza is a must-have guide for information strategists, data analysts, and engineers in business, research, and government, and for anyone wishing to be on the cutting edge of data mining, machine learning, databases, distributed systems, or large-scale computing.
JF - Wiley Series on Parallel and Distributed Computing (Editor: Albert Y. Zomaya)
PB - John Wiley & Sons Inc.
SN - 978-1-118-39864-7
ER -
TY - CHAP
T1 - Data-Intensive Analysis
T2 - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1 - 2013
A1 - Oscar Corcho
A1 - van Hemert, Jano
ED - Malcolm Atkinson
ED - Rob Baxter
ED - Peter Brezany
ED - Oscar Corcho
ED - Michelle Galea
ED - Parsons, Mark
ED - Snelling, David
ED - van Hemert, Jano
KW - data mining
KW - Data-Analysis Experts
KW - Data-Intensive Analysis
KW - Knowledge Discovery
AB - Part II: "Data-intensive Knowledge Discovery", focuses on the needs of data-analysis experts. It illustrates the problem-solving strategies appropriate for a data-rich world, without delving into the details of underlying technologies. It should engage and inform data-analysis specialists, such as statisticians, data miners, image analysts, bio-informaticians or chemo-informaticians, and generate ideas pertinent to their application areas. Chapter 5: "Data-intensive Analysis", introduces a set of common problems that data-analysis experts often encounter, by means of a set of scenarios of increasing levels of complexity. The scenarios typify knowledge discovery challenges and the presented solutions provide practical methods; a starting point for readers addressing their own data challenges.
JF - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB - John Wiley & Sons Ltd.
ER -
TY - CHAP
T1 - Data-Intensive Components and Usage Patterns
T2 - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1 - 2013
A1 - Oscar Corcho
ED - Malcolm Atkinson
ED - Rob Baxter
ED - Peter Brezany
ED - Oscar Corcho
ED - Michelle Galea
ED - Parsons, Mark
ED - Snelling, David
ED - van Hemert, Jano
KW - Data Analysis
KW - data mining
KW - Data-Intensive Components
KW - Registry
KW - Workflow Libraries
KW - Workflow Sharing
AB - Chapter 7: "Data-intensive components and usage patterns", provides a systematic review of the components that are commonly used in knowledge discovery tasks as well as common patterns of component composition. That is, it introduces the processing elements from which knowledge discovery solutions are built and common composition patterns for delivering trustworthy information. It reflects on how these components and patterns are evolving in a data-intensive context.
JF - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB - John Wiley & Sons Ltd.
ER -
TY - CHAP
T1 - The Data-Intensive Survival Guide
T2 - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1 - 2013
A1 - Malcolm Atkinson
ED - Malcolm Atkinson
ED - Rob Baxter
ED - Peter Brezany
ED - Oscar Corcho
ED - Michelle Galea
ED - Parsons, Mark
ED - Snelling, David
ED - van Hemert, Jano
KW - Data-Analysis Experts
KW - Data-Intensive Architecture
KW - Data-intensive Computing
KW - Data-Intensive Engineers
KW - Datascopes
KW - Dispel
KW - Domain Experts
KW - Intellectual Ramps
KW - Knowledge Discovery
KW - Workflows
AB - Chapter 3: "The data-intensive survival guide", presents an overview of all of the elements of the proposed data-intensive strategy. Sufficient detail is presented for readers to understand the principles and practice that we recommend. It should also provide a good preparation for readers who choose to sample later chapters. It introduces three professional viewpoints: domain experts, data-analysis experts, and data-intensive engineers. Success depends on a balanced approach that develops the capacity of all three groups. A data-intensive architecture provides a flexible framework for that balanced approach. This enables the three groups to build and exploit data-intensive processes that incrementally step from data to results. A language is introduced to describe these incremental data processes from all three points of view. The chapter introduces ‘datascopes’ as the productized data-handling environments and ‘intellectual ramps’ as the ‘on ramps’ for the highways from data to knowledge.
JF - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB - John Wiley & Sons Ltd.
ER -
TY - CHAP
T1 - Data-Intensive Thinking with DISPEL
T2 - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1 - 2013
A1 - Malcolm Atkinson
ED - Malcolm Atkinson
ED - Rob Baxter
ED - Peter Brezany
ED - Oscar Corcho
ED - Michelle Galea
ED - Parsons, Mark
ED - Snelling, David
ED - van Hemert, Jano
KW - Data-Intensive Machines
KW - Data-Intensive Thinking, Data-intensive Computing
KW - Dispel
KW - Distributed Computing
KW - Knowledge Discovery
AB - Chapter 4: "Data-intensive thinking with DISPEL", engages the reader with technical issues and solutions, by working through a sequence of examples, building up from a sketch of a solution to a large-scale data challenge. It uses the DISPEL language extensively, introducing its concepts and constructs. It shows how DISPEL may help designers, data-analysts, and engineers develop solutions to the requirements emerging in any data-intensive application domain. The reader is taken through simple steps initially, this then builds to conceptually complex steps that are necessary to cope with the realities of real data providers, real data, real distributed systems, and long-running processes.
JF - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB - John Wiley & Sons Inc.
ER -
TY - CHAP
T1 - Definition of the DISPEL Language
T2 - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1 - 2013
A1 - Paul Martin
A1 - Yaikhom, Gagarine
ED - Malcolm Atkinson
ED - Rob Baxter
ED - Peter Brezany
ED - Oscar Corcho
ED - Michelle Galea
ED - Parsons, Mark
ED - Snelling, David
ED - van Hemert, Jano
KW - Data Streaming
KW - Data-intensive Computing
KW - Dispel
AB - Chapter 10: "Definition of the DISPEL language", describes the novel aspects of the DISPEL language: its constructs, capabilities, and anticipated programming style.
JF - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
T3 - {Parallel and Distributed Computing, series editor Albert Y. Zomaya}
PB - John Wiley & Sons Inc.
ER -
TY - CONF
T1 - The demand for consistent web-based workflow editors
T2 - Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science
Y1 - 2013
A1 - Gesing, Sandra
A1 - Atkinson, Malcolm
A1 - Klampanos, Iraklis
A1 - Galea, Michelle
A1 - Berthold, Michael R.
A1 - Barbera, Roberto
A1 - Scardaci, Diego
A1 - Terstyanszky, Gabor
A1 - Kiss, Tamas
A1 - Kacsuk, Peter
KW - web-based workflow editors
KW - workflow composition
KW - workflow interoperability
KW - workflow languages and concepts
JF - Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science
PB - ACM
CY - New York, NY, USA
SN - 978-1-4503-2502-8
UR - http://doi.acm.org/10.1145/2534248.2534260
ER -
TY - CHAP
T1 - The Digital-Data Challenge
T2 - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1 - 2013
A1 - Malcolm Atkinson
A1 - Parsons, Mark
ED - Malcolm Atkinson
ED - Rob Baxter
ED - Peter Brezany
ED - Oscar Corcho
ED - Michelle Galea
ED - Parsons, Mark
ED - Snelling, David
ED - van Hemert, Jano
KW - Big Data
KW - Data-intensive Computing, Knowledge Discovery
KW - Digital Data
KW - Digital-Data Revolution
AB - Part I: Strategies for success in the digital-data revolution, provides an executive summary of the whole book to convince strategists, politicians, managers, and educators that our future data-intensive society requires new thinking, new behavior, new culture, and new distribution of investment and effort. This part will introduce the major concepts so that readers are equipped to discuss and steer their organization’s response to the opportunities and obligations brought by the growing wealth of data. It will help readers understand the changing context brought about by advances in digital devices, digital communication, and ubiquitous computing. Chapter 1: The digital-data challenge, will help readers to understand the challenges ahead in making good use of the data and introduce ideas that will lead to helpful strategies. A global digital-data revolution is catalyzing change in the ways in which we live, work, relax, govern, and organize. This is a significant change in society, as important as the invention of printing or the industrial revolution, but more challenging because it is happening globally at lnternet speed. Becoming agile in adapting to this new world is essential.
JF - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB - John Wiley & Sons Ltd.
ER -
TY - CHAP
T1 - The Digital-Data Revolution
T2 - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1 - 2013
A1 - Malcolm Atkinson
ED - Malcolm Atkinson
ED - Rob Baxter
ED - Peter Brezany
ED - Oscar Corcho
ED - Michelle Galea
ED - Parsons, Mark
ED - Snelling, David
ED - van Hemert, Jano
KW - Data
KW - Information
KW - Knowledge
KW - Knowledge Discovery
KW - Social Impact of Digital Data
KW - Wisdom, Data-intensive Computing
AB - Chapter 2: "The digital-data revolution", reviews the relationships between data, information, knowledge, and wisdom. It analyses and quantifies the changes in technology and society that are delivering the data bonanza, and then reviews the consequential changes via representative examples in biology, Earth sciences, social sciences, leisure activity, and business. It exposes quantitative details and shows the complexity and diversity of the growing wealth of data, introducing some of its potential benefits and examples of the impediments to successfully realizing those benefits.
JF - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB - John Wiley & Sons Ltd.
ER -
TY - CHAP
T1 - DISPEL Development
T2 - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1 - 2013
A1 - Adrian Mouat
A1 - Snelling, David
ED - Malcolm Atkinson
ED - Rob Baxter
ED - Peter Brezany
ED - Oscar Corcho
ED - Michelle Galea
ED - Parsons, Mark
ED - Snelling, David
ED - van Hemert, Jano
KW - Diagnostics
KW - Dispel
KW - IDE
KW - Libraries
KW - Processing Elements
AB - Chapter 11: "DISPEL development", describes the tools and libraries that a DISPEL developer might expect to use. The tools include those needed during process definition, those required to organize enactment, and diagnostic aids for developers of applications and platforms.
JF - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB - John Wiley & Sons Inc.
ER -
TY - CHAP
T1 - DISPEL Enactment
T2 - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
Y1 - 2013
A1 - Chee Sun Liew
A1 - Krause, Amrey
A1 - Snelling, David
ED - Malcolm Atkinson
ED - Rob Baxter
ED - Peter Brezany
ED - Oscar Corcho
ED - Michelle Galea
ED - Parsons, Mark
ED - Snelling, David
ED - van Hemert, Jano
KW - Data Streaming
KW - Data-Intensive Engineering
KW - Dispel
KW - Workflow Enactment
AB - Chapter 12: "DISPEL enactment", describes the four stages of DISPEL enactment. It is targeted at the data-intensive engineers who implement enactment services.
JF - THE DATA BONANZA: Improving Knowledge Discovery for Science, Engineering and Business
PB - John Wiley & Sons Inc.
ER -
TY - CONF
T1 - A Data Driven Science Gateway for Computational Workflows
T2 - UNICORE Summit 2012
Y1 - 2012
A1 - Grunzke, Richard
A1 - Birkenheuer, G.
A1 - Blunk, D.
A1 - Breuers, S.
A1 - Brinkmann, A.
A1 - Gesing, Sandra
A1 - Herres-Pawlis, S
A1 - Kohlbacher, O.
A1 - Krüger, J.
A1 - Kruse, M.
A1 - Müller-Pfefferkorn, R.
A1 - Schäfer, P.
A1 - Schuller, B.
A1 - Steinke, T.
A1 - Zink, A.
JF - UNICORE Summit 2012
ER -
TY - CONF
T1 - A databank, rather than statistical, model of normal ageing brain structure to indicate pathology
T2 - OHBM 2012
Y1 - 2012
A1 - Dickie, David Alexander
A1 - Dominic Job
A1 - Rodríguez, David
A1 - Shenkin, Susan
A1 - Wardlaw, Joanna
JF - OHBM 2012
UR - http://ww4.aievolution.com/hbm1201/index.cfm?do=abs.viewAbs&abs=5102
ER -
TY - JOUR
T1 - Data-Intensive Architecture for Scientific Knowledge Discovery
JF - Distributed and Parallel Databases
Y1 - 2012
A1 - Atkinson, Malcolm P.
A1 - Chee Sun Liew
A1 - Michelle Galea
A1 - Paul Martin
A1 - Krause, Amrey
A1 - Adrian Mouat
A1 - Oscar Corcho
A1 - Snelling, David
KW - Knowledge discovery, workflow management system
AB - This paper presents a data-intensive architecture that demonstrates the ability to support applications from a wide range of application domains, and support the different types of users involved in defining, designing and executing data-intensive processing tasks. The prototype architecture is introduced, and the pivotal role of DISPEL as a canonical language is explained. The architecture promotes the exploration and exploitation of distributed and heterogeneous data and spans the complete knowledge discovery process, from data preparation, to analysis, to evaluation and reiteration. The architecture evaluation included large-scale applications from astronomy, cosmology, hydrology, functional genetics, imaging processing and seismology.
VL - 30
UR - http://dx.doi.org/10.1007/s10619-012-7105-3
IS - 5
ER -
TY - CONF
T1 - Dimensioning Scientific Computing Systems to Improve Performance of Map-Reduce based Applications
T2 - Procedia Computer Science, Proceedings of the International Conference on Computational Science, ICCS 2012
Y1 - 2012
A1 - Gabriel G. Castañè
A1 - Alberto Nuñez
A1 - Rosa Filgueira
A1 - Jesus Carretero
JF - Procedia Computer Science, Proceedings of the International Conference on Computational Science, ICCS 2012
ER -
TY - JOUR
T1 - Dimensioning Scientific Computing Systems to Improve Performance of Map-Reduce based Applications
JF - Procedia CS
Y1 - 2012
A1 - Gabriel G. Castañè
A1 - Alberto Nuñez
A1 - Rosa Filgueira
A1 - Jesus Carretero
VL - 9
ER -
TY - RPRT
T1 - Dispel Tutorial
Y1 - 2012
A1 - Paul Martin
KW - Dispel
AB - Dispel is a strongly-typed imperative language for generating executable workflows for data-intensive distributed applications, particularly (but not exclusively) for use in computational sciences such as bioinformatics, astronomy and seismology — it has been designed to be a portable lingua franca by which researchers can interact with complex distributed research infrastructures without detailed knowledge of the underlying computational middleware, all in order to more easily conduct experiments in data integration, simulation and data-intensive modelling. This document is a tutorial for Dispel.
ER -
TY - JOUR
T1 - Discovering the suitability of optimisation algorithms by learning from evolved instances
JF - Annals of Mathematics and Artificial Intelligence
Y1 - 2011
A1 - K. Smith-Miles
A1 - {van Hemert}, J. I.
KW - problem evolving
VL - Online Fir
UR - http://www.springerlink.com/content/6x83q3201gg71554/
ER -
TY - CHAP
T1 - DISPEL Reference Manual
T2 - Advanced Data Mining and Integration Research for Europe (ADMIRE)
Y1 - 2011
A1 - Paul Martin
A1 - Yaikhom, Gagarine
KW - DISPEL, ADMIRE
AB - Reference manual for the Data Intensive Systems Process Engineering Language (DISPEL).
JF - Advanced Data Mining and Integration Research for Europe (ADMIRE)
UR - www.admire-project.eu
ER -
TY - RPRT
T1 - Data-Intensive Research Workshop (15-19 March 2010) Report
Y1 - 2010
A1 - Malcolm Atkinson
A1 - Roure, David De
A1 - van Hemert, Jano
A1 - Shantenu Jha
A1 - Ruth McNally
A1 - Robert Mann
A1 - Stratis Viglas
A1 - Chris Williams
KW - Data-intensive Computing
KW - Data-Intensive Machines
KW - Machine Learning
KW - Scientific Databases
AB - We met at the National e-Science Institute in Edinburgh on 15-19 March 2010 to develop our understanding of DIR. Approximately 100 participants (see Appendix A) worked together to develop their own understanding, and we are offering this report as the first step in communicating that to a wider community. We present this in turns of our developing/emerging understanding of "What is DIR?" and "Why it is important?'". We then review the status of the field, report what the workshop achieved and what remains as open questions.
JF - National e-Science Centre
PB - Data-Intensive Research Group, School of Informatics, University of Edinburgh
CY - Edinburgh
ER -
TY - JOUR
T1 - Dynamic-CoMPI: Dynamic optimization techniques for MPI parallel applications.
JF - The Journal of Supercomputing.
Y1 - 2010
A1 - Rosa Filgueira
A1 - Jesús Carretero
A1 - David E. Singh
A1 - Alejandro Calderón
A1 - Alberto Nunez
KW - Adaptive systems
KW - Clusters architectures
KW - Collective I/O
KW - Compression algorithms
KW - Heuristics
KW - MPI library - Parallel techniques
AB - This work presents an optimization of MPI communications, called Dynamic-CoMPI, which uses two techniques in order to reduce the impact of communications and non-contiguous I/O requests in parallel applications. These techniques are independent of the application and complementaries to each other. The first technique is an optimization of the Two-Phase collective I/O technique from ROMIO, called Locality aware strategy for Two-Phase I/O (LA-Two-Phase I/O). In order to increase the locality of the file accesses, LA-Two-Phase I/O employs the Linear Assignment Problem (LAP) for finding an optimal I/O data communication schedule. The main purpose of this technique is the reduction of the number of communications involved in the I/O collective operation. The second technique, called Adaptive-CoMPI, is based on run-time compression of MPI messages exchanged by applications. Both techniques can be applied on every application, because both of them are transparent for the users. Dynamic-CoMPI has been validated by using several MPI benchmarks and real HPC applications. The results show that, for many of the considered scenarios, important reductions in the execution time are achieved by reducing the size and the number of the messages. Additional benefits of our approach are the reduction of the total communication time and the network contention, thus enhancing, not only performance, but also scalability.
PB - Springer
ER -
TY - JOUR
T1 - Design and Optimization of Reverse-Transcription Quantitative PCR Experiments
JF - Clin Chem
Y1 - 2009
A1 - Tichopad, Ales
A1 - Kitchen, Rob
A1 - Riedmaier, Irmgard
A1 - Becker, Christiane
A1 - Stahlberg, Anders
A1 - Kubista, Mikael
AB - BACKGROUND: Quantitative PCR (qPCR) is a valuable technique for accurately and reliably profiling and quantifying gene expression. Typically, samples obtained from the organism of study have to be processed via several preparative steps before qPCR. METHOD: We estimated the errors of sample withdrawal and extraction, reverse transcription (RT), and qPCR that are introduced into measurements of mRNA concentrations. We performed hierarchically arranged experiments with 3 animals, 3 samples, 3 RT reactions, and 3 qPCRs and quantified the expression of several genes in solid tissue, blood, cell culture, and single cells. RESULTS: A nested ANOVA design was used to model the experiments, and relative and absolute errors were calculated with this model for each processing level in the hierarchical design. We found that intersubject differences became easily confounded by sample heterogeneity for single cells and solid tissue. In cell cultures and blood, the noise from the RT and qPCR steps contributed substantially to the overall error because the sampling noise was less pronounced. CONCLUSIONS: We recommend the use of sample replicates preferentially to any other replicates when working with solid tissue, cell cultures, and single cells, and we recommend the use of RT replicates when working with blood. We show how an optimal sampling plan can be calculated for a limited budget.
UR - http://www.clinchem.org/cgi/content/abstract/clinchem.2009.126201v1
ER -
TY - CONF
T1 - A Distributed Architecture for Data Mining and Integration
T2 - Data-Aware Distributed Computing (DADC'09), in conjunction with the 18th International Symposium on High Performance Distributed Computing
Y1 - 2009
A1 - Atkinson, Malcolm P.
A1 - van Hemert, Jano
A1 - Liangxiu Han
A1 - Ally Hume
A1 - Chee Sun Liew
AB - This paper presents the rationale for a new architecture to support a significant increase in the scale of data integration and data mining. It proposes the composition into one framework of (1) data mining and (2) data access and integration. We name the combined activity “DMI”. It supports enactment of DMI processes across heterogeneous and distributed data resources and data mining services. It posits that a useful division can be made between the facilities established to support the definition of DMI processes and the computational infrastructure provided to enact DMI processes. Communication between those two divisions is restricted to requests submitted to gateway services in a canonical DMI language. Larger-scale processes are enabled by incremental refinement of DMI-process definitions often by recomposition of lower-level definitions. Autonomous types and descriptions which will support detection of inconsistencies and semi-automatic insertion of adaptations.These architectural ideas are being evaluated in a feasibility study that involves an application scenario and representatives of the community.
JF - Data-Aware Distributed Computing (DADC'09), in conjunction with the 18th International Symposium on High Performance Distributed Computing
PB - ACM
ER -
TY - CONF
T1 - Data Locality Aware Strategy for Two-Phase Collective I/O
T2 - VECPAR
Y1 - 2008
A1 - Rosa Filgueira
A1 - David E. Singh
A1 - Juan Carlos Pichel
A1 - Florin Isaila
A1 - Jesús Carretero
JF - VECPAR
ER -
TY - JOUR
T1 - Distributed Computing Education, Part 1: A Special Case?
JF - IEEE Distributed Systems Online
Y1 - 2008
A1 - Fergusson, D.
A1 - Hopkins, R.
A1 - Romano, D.
A1 - Vander Meer, E.
A1 - Atkinson, M.
VL - 9
UR - http://dsonline.computer.org/portal/site/dsonline/menuitem.9ed3d9924aeb0dcd82ccc6716bbe36ec/index.jsp?&pName=dso_level1&path=dsonline/2008/06&file=o6002edu.xml&xsl=article.xsl&;jsessionid=LZ5zjySvc2xPnVv4qTYJXhlvwSnRGGj7S7WvPtrPyv23rJGQdjJr!982319602
IS - 6
ER -
TY - JOUR
T1 - Distributed Computing Education, Part 2: International Summer Schools
JF - IEEE Distributed Systems Online
Y1 - 2008
A1 - Fergusson, D.
A1 - Hopkins, R.
A1 - Romano, D.
A1 - Vander Meer, E.
A1 - Atkinson, M.
VL - 9
UR - http://dsonline.computer.org/portal/site/dsonline/menuitem.9ed3d9924aeb0dcd82ccc6716bbe36ec/index.jsp?&pName=dso_level1&path=dsonline/2008/07&file=o7002edu.xml&xsl=article.xsl&
IS - 7
ER -
TY - JOUR
T1 - Distributed Computing Education, Part 3: The Winter School Online Experience
JF - Distributed Systems Online
Y1 - 2008
A1 - Low, B.
A1 - Cassidy, K.
A1 - Fergusson, D.
A1 - Atkinson, M.
A1 - Vander Meer, E.
A1 - McGeever, M.
AB - The International Summer Schools in Grid Computing (ISSGC) have provided numerous international students with the opportunity to learn grid systems, as detailed in part 2 of this series (http://doi.ieeecomputersociety.org/10.1109/MDSO.2008.20). The International Winter School on Grid Computing 2008 (IWSGC 08) followed the successful summer schools, opening up the ISSGC experience to a wider range of students because of its online format. The previous summer schools made it clear that many students found the registration and travel costs and the time requirements prohibitive. The EU FP6 ICEAGE project held the first winter school from 6 February to 12 March 2008. The winter school repurposed summer school materials and added resources such as the ICEAGE digital library and summer-school-tested t-Infrastructures such as GILDA (Grid INFN Laboratory for Dissemination Activities). The winter schools shared the goals of the summer school, which emphasized disseminating grid knowledge. The students act as multipliers, spreading the skills and knowledge they acquired at the winter school to their colleagues to build strong and enthusiastic local grid communities.
PB - IEEE Computer Society
VL - 9
UR - http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4659260
IS - 9
ER -
TY - JOUR
T1 - Distributed Computing Education, Part 4: Training Infrastructure
JF - Distributed Systems Online
Y1 - 2008
A1 - Fergusson, D.
A1 - Barbera, R.
A1 - Giorgio, E.
A1 - Fargetta, M.
A1 - Sipos, G.
A1 - Romano, D.
A1 - Atkinson, M.
A1 - Vander Meer, E.
AB - In the first article of this series (see http://doi.ieeecomputersociety.org/10.1109/MDSO.2008.16), we identified the need for teaching environments that provide infrastructure to support education and training in distributed computing. Training infrastructure, or t-infrastructure, is analogous to the teaching laboratory in biology and is a vital tool for educators and students. In practice, t-infrastructure includes the computing equipment, digital communications, software, data, and support staff necessary to teach a course. The International Summer Schools in Grid Computing (ISSGC) series and the first International Winter School on Grid Computing (IWSGC 08) used the Grid INFN Laboratory of Dissemination Activities (GILDA) infrastructure so students could gain hands-on experience with middleware. Here, we describe GILDA, related summer and winter school experiences, multimiddleware integration, t-infrastructure, and academic courses, concluding with an analysis and recommendations.
PB - IEEE Computer Society
VL - 9
UR - http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4752926
IS - 10
ER -
TY - JOUR
T1 - Distributed Computing Education, Part 5: Coming to Terms with Intellectual Property Rights
JF - Distributed Systems Online
Y1 - 2008
A1 - Boon Low
A1 - Kathryn Cassidy
A1 - Fergusson, David
A1 - Malcolm Atkinson
A1 - Elizabeth Vander Meer
A1 - Mags McGeever
AB - In part 1 of this series on distributed computing education, we introduced a list of components important for teaching environments. We outlined the first three components, which included development of materials for education, education for educators and teaching infrastructures, identifying current practice, challenges, and opportunities for provision. The final component, a supportive policy framework that encourages cooperation and sharing, includes the need to manage intellectual property rights (IPR).
PB - IEEE Computer Society
VL - 9
UR - http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4755177
IS - 12
ER -
TY - CONF
T1 - Data Integration in eHealth: A Domain/Disease Specific Roadmap
T2 - Studies in Health Technology and Informatics
Y1 - 2007
A1 - Ure, J.
A1 - Proctor, R.
A1 - Martone, M.
A1 - Porteous, D.
A1 - Lloyd, S.
A1 - Lawrie, S.
A1 - Job, D.
A1 - Baldock, R.
A1 - Philp, A.
A1 - Liewald, D.
A1 - Rakebrand, F.
A1 - Blaikie, A.
A1 - McKay, C.
A1 - Anderson, S.
A1 - Ainsworth, J.
A1 - van Hemert, J.
A1 - Blanquer, I.
A1 - Sinno
ED - N. Jacq
ED - Y. Legr{\'e}
ED - H. Muller
ED - I. Blanquer
ED - V. Breton
ED - D. Hausser
ED - V. Hern{\'a}ndez
ED - T. Solomonides
ED - M. Hofman-Apitius
KW - e-Science
AB - The paper documents a series of data integration workshops held in 2006 at the UK National e-Science Centre, summarizing a range of the problem/solution scenarios in multi-site and multi-scale data integration with six HealthGrid projects using schizophrenia as a domain-specific test case. It outlines emerging strategies, recommendations and objectives for collaboration on shared ontology-building and harmonization of data for multi-site trials in this domain.
JF - Studies in Health Technology and Informatics
PB - IOPress
VL - 126
SN - 978-1-58603-738-3
ER -
TY - JOUR
T1 - The design and implementation of Grid database services in OGSA-DAI
JF - Concurrency - Practice and Experience
Y1 - 2005
A1 - Antonioletti, Mario
A1 - Atkinson, Malcolm P.
A1 - Baxter, Robert M.
A1 - Borley, Andrew
A1 - Hong, Neil P. Chue
A1 - Collins, Brian
A1 - Hardman, Neil
A1 - Hume, Alastair C.
A1 - Knox, Alan
A1 - Mike Jackson
A1 - Krause, Amrey
A1 - Laws, Simon
A1 - Magowan, James
A1 - Pato
VL - 17
ER -
TY - CONF
T1 - The Digital Curation Centre: a vision for digital curation
T2 - 2005 IEEE International Symposium on Mass Storage Systems and Technology
Y1 - 2005
A1 - Rusbridge, C.
A1 - P. Burnhill
A1 - S. Ross
A1 - P. Buneman
A1 - D. Giaretta
A1 - Lyon, L.
A1 - Atkinson, M.
AB - We describe the aims and aspirations for the Digital Curation Centre (DCC), the UK response to the realisation that digital information is both essential and fragile. We recognise the equivalence of preservation as "interoperability with the future", asserting that digital curation is concerned with "communication across time". We see the DCC as having relevance for present day data curation and for continuing data access for generations to come. We describe the structure and plans of the DCC, designed to support these aspirations and based on a view of world class research being developed into curation services, all of which are underpinned by outreach to the broadest community.
JF - 2005 IEEE International Symposium on Mass Storage Systems and Technology
PB - IEEE Computer Society
CY - Sardinia, Italy
SN - 0-7803-9228-0
ER -
TY - Generic
T1 - Development of a Grid Infrastructure for Functional Genomics
T2 - Life Science Grid Conference (LSGrid 2004)
Y1 - 2004
A1 - Sinnott, R. O.
A1 - Bayer, M.
A1 - Houghton, D.
A1 - D. Berry
A1 - Ferrier, M.
JF - Life Science Grid Conference (LSGrid 2004)
T3 - LNCS
PB - Springer Verlag
CY - Kanazawa, Japan
ER -
TY - CONF
T1 - Dynamic Routing Problems with Fruitful Regions: Models and Evolutionary Computation
T2 - LNCS
Y1 - 2004
A1 - van Hemert, J. I.
A1 - la Poutré, J. A.
ED - Xin Yao
ED - Edmund Burke
ED - Jose A. Lozano
ED - Jim Smith
ED - Juan J. Merelo-Guerv\'os
ED - John A. Bullinaria
ED - Jonathan Rowe
ED - Peter Ti\v{n}o Ata Kab\'an
ED - Hans-Paul Schwefel
KW - dynamic problems
KW - evolutionary computation
KW - vehicle routing
AB - We introduce the concept of fruitful regions in a dynamic routing context: regions that have a high potential of generating loads to be transported. The objective is to maximise the number of loads transported, while keeping to capacity and time constraints. Loads arrive while the problem is being solved, which makes it a real-time routing problem. The solver is a self-adaptive evolutionary algorithm that ensures feasible solutions at all times. We investigate under what conditions the exploration of fruitful regions improves the effectiveness of the evolutionary algorithm.
JF - LNCS
PB - Springer-Verlag
CY - Birmingham, UK
VL - 3242
SN - 3-540-23092-0
ER -
TY - CHAP
T1 - Data Access, Integration, and Management
T2 - The Grid 2: Blueprint for a New Computing Infrastructure (2nd edition),
Y1 - 2003
A1 - Atkinson. M.
A1 - Chervenak, A. L.
A1 - Kunszt, P.
A1 - Narang, I.
A1 - Paton, N. W.
A1 - Pearson, D.
A1 - Shoshani, A.
A1 - Watson, P.
ED - Foster, I.
ED - Kesselman, C
JF - The Grid 2: Blueprint for a New Computing Infrastructure (2nd edition),
PB - Morgan Kaufmann
SN - 1-55860-933-4
ER -
TY - CONF
T1 - Databases and the Grid: Who Challenges Whom?
T2 - BNCOD
Y1 - 2003
A1 - Atkinson, Malcolm P.
JF - BNCOD
ER -
TY - CONF
T1 - Dependable Grid Services
T2 - UK e-Science All Hands Meeting 2003, 2-4th September, Nottingham, UK
Y1 - 2003
A1 - Stuart Anderson
A1 - Yin Chen
A1 - Glen Dobson
A1 - Stephen Hall
A1 - Conrad Hughes
A1 - Yong Li
A1 - Sheng Qu
A1 - Ed Smith
A1 - Ian Sommerville
A1 - Ma Tiejun
ED - Proceedings of UK e-Science All Hands Meeting 2003
AB - The provision of dependable computer systems by deploying diverse, redundant components in order to mask or provide recovery from component failures has mostly been restricted to systems with very high criticality. In this paper we present an architecture and prototype implementation of an approach to providing such redundancy at low cost in service-based infrastructures. In particular we consider services that are supplied by composing a number of component services and consider how service discovery, automatic monitoring and failure detection have the potential to create composed services that are more dependable than might be possible using a straightforward approach. The work is still in its early stages and so far no evaluation of the approach has been carried out.
JF - UK e-Science All Hands Meeting 2003, 2-4th September, Nottingham, UK
CY - Nottingham, UK
ER -
TY - CONF
T1 - The Design and Implementation of Grid Database Services in OGSA-DAI
T2 - All Hands Meeting 2003
Y1 - 2003
A1 - Ali Anjomshoaa
A1 - Antonioletti, Mario
A1 - Malcolm Atkinson
A1 - Rob Baxter
A1 - Borley, Andrew
A1 - Hong, Neil P. Chue
A1 - Collins, Brian
A1 - Hardman, Neil
A1 - George Hicken
A1 - Ally Hume
A1 - Knox, Alan
A1 - Mike Jackson
A1 - Krause, Amrey
A1 - Laws, Simon
A1 - Magowan, James
A1 - Charaka Palansuriya
A1 - Paton, Norman W.
AB - This paper presents a high-level overview of the design and implementation of the core components of the OGSA-DAI project. It describes the design decisions made, the project’s interaction with the Data Access and Integration Working Group of the Global Grid Forum and provides an overview of implementation characteristics. Further details of the implementation are provided in the extensive documentation available from the project web site.
JF - All Hands Meeting 2003
CY - Nottingham, UK
ER -
TY - JOUR
T1 - The DWMM network traffic model
JF - Journal of Communication
Y1 - 2003
A1 - Cong Suo
A1 - Liangxiu Han
VL - 24
IS - 5
ER -
TY - JOUR
T1 - Database indexing for large DNA and protein sequence collections
JF - VLDB J.
Y1 - 2002
A1 - Hunt, Ela
A1 - Atkinson, Malcolm P.
A1 - Irving, Robert W.
VL - 11
ER -
TY - CONF
T1 - A Database Index to Large Biological Sequences
T2 - VLDB
Y1 - 2001
A1 - Hunt, Ela
A1 - Atkinson, Malcolm P.
A1 - Irving, Robert W.
JF - VLDB
ER -
TY - JOUR
T1 - De Creatieve Computer
JF - AIgg Kennisgeving
Y1 - 2000
A1 - van Hemert, J. I.
KW - evolutionary art
AB - Here we show an application that generates images resembling art as it was produced by Mondriaan, a Dutch artist, well known for his minimalistic and pure abstract pieces of art. The current version generates images using a linear chromosome and a recursive function as a decoder.
PB - Artifici{\"e}le Intelligentie gebruikers groep
VL - 13
N1 - invited article (in Dutch)
ER -
TY - CONF
T1 - Defining and Handling Transient Fields in PJama
T2 - DBPL
Y1 - 1999
A1 - Printezis, Tony
A1 - Atkinson, Malcolm P.
A1 - Jordan, Mick J.
JF - DBPL
ER -