TY - JOUR T1 - Automatically Identifying and Annotating Mouse Embryo Gene Expression Patterns JF - Bioinformatics Y1 - 2011 A1 - Liangxiu Han A1 - van Hemert, Jano A1 - Richard Baldock KW - classification KW - e-Science AB - Motivation: Deciphering the regulatory and developmental mechanisms for multicellular organisms requires detailed knowledge of gene interactions and gene expressions. The availability of large datasets with both spatial and ontological annotation of the spatio-temporal patterns of gene-expression in mouse embryo provides a powerful resource to discover the biological function of embryo organisation. Ontological annotation of gene expressions consists of labelling images with terms from the anatomy ontology for mouse development. If the spatial genes of an anatomical component are expressed in an image, the image is then tagged with a term of that anatomical component. The current annotation is done manually by domain experts, which is both time consuming and costly. In addition, the level of detail is variable and inevitably, errors arise from the tedious nature of the task. In this paper, we present a new method to automatically identify and annotate gene expression patterns in the mouse embryo with anatomical terms. Results: The method takes images from in situ hybridisation studies and the ontology for the developing mouse embryo, it then combines machine learning and image processing techniques to produce classifiers that automatically identify and annotate gene expression patterns in these images.We evaluate our method on image data from the EURExpress-II study where we use it to automatically classify nine anatomical terms: humerus, handplate, fibula, tibia, femur, ribs, petrous part, scapula and head mesenchyme. The accuracy of our method lies between 70–80% with few exceptions. Conclusions: We show that other known methods have lower classification performance than ours. We have investigated the images misclassified by our method and found several cases where the original annotation was not correct. This shows our method is robust against this kind of noise. Availability: The annotation result and the experimental dataset in the paper can be freely accessed at http://www2.docm.mmu.ac.uk/STAFF/L.Han/geneannotation/ Contact: l.han@mmu.ac.uk, j.vanhemert@ed.ac.uk and Richard.Baldock@hgu.mrc.ac.uk VL - 27 UR - http://bioinformatics.oxfordjournals.org/content/early/2011/02/25/bioinformatics.btr105.abstract ER - TY - JOUR T1 - A Generic Parallel Processing Model for Facilitating Data Mining and Integration JF - Parallel Computing Y1 - 2011 A1 - Liangxiu Han A1 - Chee Sun Liew A1 - van Hemert, Jano A1 - Malcolm Atkinson KW - Data Mining and Data Integration (DMI) KW - Life Sciences KW - OGSA-DAI KW - Parallelism KW - Pipeline Streaming KW - workflow AB - To facilitate Data Mining and Integration (DMI) processes in a generic way, we investigate a parallel pipeline streaming model. We model a DMI task as a streaming data-flow graph: a directed acyclic graph (DAG) of Processing Elements PEs. The composition mechanism links PEs via data streams, which may be in memory, buffered via disks or inter-computer data-flows. This makes it possible to build arbitrary DAGs with pipelining and both data and task parallelisms, which provides room for performance enhancement. We have applied this approach to a real DMI case in the Life Sciences and implemented a prototype. To demonstrate feasibility of the modelled DMI task and assess the efficiency of the prototype, we have also built a performance evaluation model. The experimental evaluation results show that a linear speedup has been achieved with the increase of the number of distributed computing nodes in this case study. PB - Elsevier VL - 37 IS - 3 ER - TY - JOUR T1 - Performance database: capturing data for optimizing distributed streaming workflows JF - Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences Y1 - 2011 A1 - Chee Sun Liew A1 - Atkinson, Malcolm P. A1 - Radoslaw Ostrowski A1 - Murray Cole A1 - van Hemert, Jano I. A1 - Liangxiu Han KW - measurement framework KW - performance data KW - streaming workflows AB - The performance database (PDB) stores performance-related data gathered during workflow enactment. We argue that by carefully understanding and manipulating this data, we can improve efficiency when enacting workflows. This paper describes the rationale behind the PDB, and proposes a systematic way to implement it. The prototype is built as part of the Advanced Data Mining and Integration Research for Europe project. We use workflows from real-world experiments to demonstrate the usage of PDB. VL - 369 IS - 1949 ER - TY - CHAP T1 - Exploiting P2P and Grid Computing Technologies for Resource Sharing to support High Performance Distributed System T2 - Handbook of Research on P2P and Grid Systems for Service-Oriented Computing: Models, Methodologies and Applications Y1 - 2010 A1 - Liangxiu Han ED - Nick Antonopoulos ED - Georgios Exarchakos ED - Maozhen Li ED - Antonio Liottac JF - Handbook of Research on P2P and Grid Systems for Service-Oriented Computing: Models, Methodologies and Applications PB - IGI Global publishing VL - 1 ER - TY - Generic T1 - Federated Enactment of Workflow Patterns T2 - Lecture Notes in Computer Science Y1 - 2010 A1 - Yaikhom, Gagarine A1 - Liew, Chee A1 - Liangxiu Han A1 - van Hemert, Jano A1 - Malcolm Atkinson A1 - Krause, Amy ED - D’Ambra, Pasqua ED - Guarracino, Mario ED - Talia, Domenico AB - In this paper we address two research questions concerning workflows: 1) how do we abstract and catalogue recurring workflow patterns?; and 2) how do we facilitate optimisation of the mapping from workflow patterns to actual resources at runtime? Our aim here is to explore techniques that are applicable to large-scale workflow compositions, where the resources could change dynamically during the lifetime of an application. We achieve this by introducing a registry-based mechanism where pattern abstractions are catalogued and stored. In conjunction with an enactment engine, which communicates with this registry, concrete computational implementations and resources are assigned to these patterns, conditional to the execution parameters. Using a data mining application from the life sciences, we demonstrate this new approach. JF - Lecture Notes in Computer Science PB - Springer Berlin / Heidelberg VL - 6271 UR - http://dx.doi.org/10.1007/978-3-642-15277-1_31 N1 - 10.1007/978-3-642-15277-1_31 ER - TY - CONF T1 - Towards Optimising Distributed Data Streaming Graphs using Parallel Streams T2 - Data Intensive Distributed Computing (DIDC'10), in conjunction with the 19th International Symposium on High Performance Distributed Computing Y1 - 2010 A1 - Chee Sun Liew A1 - Atkinson, Malcolm P. A1 - van Hemert, Jano A1 - Liangxiu Han KW - Data-intensive Computing KW - Distributed Computing KW - Optimisation KW - Parallel Stream KW - Scientific Workflows AB - Modern scientific collaborations have opened up the opportunity of solving complex problems that involve multi- disciplinary expertise and large-scale computational experiments. These experiments usually involve large amounts of data that are located in distributed data repositories running various software systems, and managed by different organisations. A common strategy to make the experiments more manageable is executing the processing steps as a workflow. In this paper, we look into the implementation of fine-grained data-flow between computational elements in a scientific workflow as streams. We model the distributed computation as a directed acyclic graph where the nodes represent the processing elements that incrementally implement specific subtasks. The processing elements are connected in a pipelined streaming manner, which allows task executions to overlap. We further optimise the execution by splitting pipelines across processes and by introducing extra parallel streams. We identify performance metrics and design a measurement tool to evaluate each enactment. We conducted ex- periments to evaluate our optimisation strategies with a real world problem in the Life Sciences—EURExpress-II. The paper presents our distributed data-handling model, the optimisation and instrumentation strategies and the evaluation experiments. We demonstrate linear speed up and argue that this use of data-streaming to enable both overlapped pipeline and parallelised enactment is a generally applicable optimisation strategy. JF - Data Intensive Distributed Computing (DIDC'10), in conjunction with the 19th International Symposium on High Performance Distributed Computing PB - ACM CY - Chicago, Illinois UR - http://www.cct.lsu.edu/~kosar/didc10/index.php ER - TY - CONF T1 - Automating Gene Expression Annotation for Mouse Embryo T2 - Lecture Notes in Computer Science (Advanced Data Mining and Applications, 5th International Conference) Y1 - 2009 A1 - Liangxiu Han A1 - van Hemert, Jano A1 - Richard Baldock A1 - Atkinson, Malcolm P. ED - Ronghuai Huang ED - Qiang Yang ED - Jian Pei ED - et al JF - Lecture Notes in Computer Science (Advanced Data Mining and Applications, 5th International Conference) PB - Springer VL - LNAI 5678 ER - TY - CONF T1 - A Distributed Architecture for Data Mining and Integration T2 - Data-Aware Distributed Computing (DADC'09), in conjunction with the 18th International Symposium on High Performance Distributed Computing Y1 - 2009 A1 - Atkinson, Malcolm P. A1 - van Hemert, Jano A1 - Liangxiu Han A1 - Ally Hume A1 - Chee Sun Liew AB - This paper presents the rationale for a new architecture to support a signiﬁcant increase in the scale of data integration and data mining. It proposes the composition into one framework of (1) data mining and (2) data access and integration. We name the combined activity “DMI”. It supports enactment of DMI processes across heterogeneous and distributed data resources and data mining services. It posits that a useful division can be made between the facilities established to support the deﬁnition of DMI processes and the computational infrastructure provided to enact DMI processes. Communication between those two divisions is restricted to requests submitted to gateway services in a canonical DMI language. Larger-scale processes are enabled by incremental reﬁnement of DMI-process deﬁnitions often by recomposition of lower-level deﬁnitions. Autonomous types and descriptions which will support detection of inconsistencies and semi-automatic insertion of adaptations.These architectural ideas are being evaluated in a feasibility study that involves an application scenario and representatives of the community. JF - Data-Aware Distributed Computing (DADC'09), in conjunction with the 18th International Symposium on High Performance Distributed Computing PB - ACM ER - TY - JOUR T1 - The performance model of dynamic Virtual Organization (VO) formations within grid computing context JF - Chaos, Solitons & Fractals Y1 - 2009 A1 - Liangxiu Han KW - complex network KW - graph theory KW - grid computing KW - virtual organization formation PB - Elsevier Science VL - 40 IS - 4 N1 - In press ER - TY - CONF T1 - Using architectural simulation models to aid the design of data intensive application T2 - The Third International Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP 2009) Y1 - 2009 A1 - Javier Fernández A1 - Liangxiu Han A1 - Alberto Nuñez A1 - Jesus Carretero A1 - van Hemert, Jano JF - The Third International Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP 2009) PB - IEEE Computer Society CY - Sliema, Malta ER - TY - CONF T1 - Using Simulation for Decision Support: Lessons Learned from FireGrid T2 - Proceedings of the 6th International Conference on Information Systems for Crisis Response and Management (ISCRAM 2009) Y1 - 2009 A1 - Gerhard Wickler A1 - George Beckett A1 - Liangxiu Han A1 - Sung Han Koo A1 - Stephen Potter A1 - Gavin Pringle A1 - Austin Tate ED - J. Landgren, U. Nulden ED - B. Van de Walle JF - Proceedings of the 6th International Conference on Information Systems for Crisis Response and Management (ISCRAM 2009) CY - Gothenburg, Sweden ER - TY - CONF T1 - An Architecture for an Integrated Fire Emergency Response System for the Built Environment T2 - 9th Symposium of the International Association for Fire Safety Science (IAFSS) Y1 - 2008 A1 - Rochan Upadhyay A1 - Galvin Pringle A1 - George Beckett A1 - Stephen Potter A1 - Liangxiu Han A1 - Stephen Welch A1 - Asif Usmani A1 - Jose Torero KW - emergency response system KW - FireGrid KW - system architecture KW - technology integration AB - FireGrid is a modern concept that aims to leverage a number of modern technologies to aid fire emergency response. In this paper we provide a brief introduction to the FireGrid project. A number of different technologies such as wireless sensor networks, grid-enabled High Performance Computing (HPC) implementation of fire models, and artificial intelligence tools need to be integrated to build up a modern fire emergency response system. We propose a system architecture that provides the framework for integration of the various technologies. We describe the components of the generic FireGrid system architecture in detail. Finally we present a small-scale demonstration experiment which has been completed to highlight the concept and application of the FireGrid system to an actual fire. Although our proposed system architecture provides a versatile framework for integration, a number of new and interesting research problems need to be solved before actual deployment of the system. We outline some of the challenges involved which require significant interdisciplinary collaborations. JF - 9th Symposium of the International Association for Fire Safety Science (IAFSS) PB - IAFSS CY - Karlsruhe, GERMANY ER - TY - CONF T1 - A novel visual discriminator on network traffic pattern T2 - The Second International Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP 2008) Y1 - 2008 A1 - Liangxiu Han A1 - J. Hemert AB - The wavelet transform has been shown to be a powerful tool for characterising network traffic. However, the resulting decomposition of a wavelet transform typically forms a high-dimension space. This is obviously problematic on compact representations, visualizations, and modelling approaches that are based on these high-dimensional data. In this study, we show how data projection techniques can represent the high-dimensional wavelet decomposition in a low dimensional space to facilitate visual analysis. A low-dimensional representation can significantly reduce the model complexity. Hence, features in the data can be presented with a small number of parameters. We demonstrate these projections in the context of network traffic pattern analysis. The experimental results show that the proposed method can effectively discriminate between different application flows, such as FTP and P2P. JF - The Second International Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP 2008) PB - IEEE Computer Society CY - Valencia, Spain ER - TY - JOUR T1 - The Self-adaptation to dynamic failures for efficient Virtual Organization formation in Grid computing context JF - Chaos, Solitons and Fractals Y1 - 2008 A1 - Liangxiu Han KW - complex network system KW - failure recovery KW - graph theory KW - grid computing KW - virtual organization formation AB - Grid computing aims to enable “resource sharing and coordinated problem solving in dynamic, multi-institutional Virtual Organizations (VOs)”. However, due to the nature of heterogeneous and dynamic resources, dynamic failures in the distributed grid environment usually occur more than in traditional computation platforms, which cause failed VO formations. In this paper, we develop a novel self-adaptive mechanism to dynamic failures during VO formations. Such a self-adaptive scheme allows an individual and member of VOs to automatically find other available or replaceable one once a failure happens and therefore makes systems automatically recover from dynamic failures. We define dynamic failure situations of a system by using two standard indicators: Mean Time between Failures (MTBF) and Mean Time to Recover (MTTR). We model both MTBF and MTTR as Poisson distributions. We investigate and analyze the efficiency of the proposed self-adaptation mechanism to dynamic failures by comparing the success probability of VO formations before and after adopting it in three different cases: 1) different failure situations; 2) different organizational structures and scales; 3) different task complexities. The experimental results show that the proposed scheme can automatically adapt to dynamic failures and effectively improve the dynamic VO formation performance in the event of node failures, which provide a valuable addition to the field. PB - Elsevier Science ER - TY - JOUR T1 - Semantic-Supported and Agent-Based Decentralized Grid Resource Discovery JF - Future Generation Computer Systems Y1 - 2008 A1 - Liangxiu Han A1 - Dave Berry KW - grid resource discovery, decentralization, agent, semantic similarity, ontology AB - One of open issues in grid computing is efficient resource discovery. In this paper, we propose a novel semantic-supported and agent-based decentralized grid resource discovery mechanism. Without overhead of negotiation, the algorithm allows individual resource agents to semantically interact with neighbour agents based on local knowledge and dynamically form a resource service chain to complete a task. The algorithm ensures resource agent’s ability to cooperate and coordinate on neighbour knowledge requisition for flexible problem solving. The developed algorithm is evaluated by investigating the relationship between the success probability of resource discovery and semantic similarity under different factors. The experiments show the algorithm could flexibly and dynamically discover resources and therefore provide a valuable addition to the field. PB - ScienceDirect VL - 24 IS - 8 ER - TY - CONF T1 - Towards a Grid-Enabled Simulation Framework for Nano-CMOS Electronics T2 - 3rd IEEE International Conference on eScience and Grid Computing Y1 - 2007 A1 - Liangxiu Han A1 - Asen Asenov A1 - Dave Berry A1 - Campbell Millar A1 - Gareth Roy A1 - Scott Roy A1 - Richard Sinnott A1 - Gordon Stewart AB - The electronics design industry is facing major challenges as transistors continue to decrease in size. The next generation of devices will be so small that the position of individual atoms will affect their behaviour. This will cause the transistors on a chip to have highly variable characteristics, which in turn will impact circuit and system design tools. The EPSRC project “Meeting the Design Challenges of Nano-CMOS Electronics” (Nano-CMOS) has been funded to explore this area. In this paper, we describe the distributed data-management and computing framework under development within Nano-CMOS. A key aspect of this framework is the need for robust and reliable security mechanisms that support distributed electronics design groups who wish to collaborate by sharing designs, simulations, workflows, datasets and computation resources. This paper presents the system design, and an early prototype of the project which hasbeen useful in helping us to understand the benefits of such a grid infrastructure. In particular, we also present two typical use cases: user authentication, and execution of large-scale device simulations. JF - 3rd IEEE International Conference on eScience and Grid Computing PB - IEEE Computer Society CY - Bangalore, India ER - TY - CONF T1 - Cross-Layer Peer-to-Peer Traffic Identification and Optimization Based on Active Networking T2 - 7th International Working Conference on Active and Programmable Networks Y1 - 2005 A1 - Dedinski, I. A1 - De Meer, H. A1 - Liangxiu Han A1 - Mathy, L. A1 - Pezaros, D. A1 - P. , Sventek, J. S. A1 - Xiaoying, Z. JF - 7th International Working Conference on Active and Programmable Networks CY - Sophia Antipolis, French Riviera, La Cote d'Azur, France, November 21-23, 2005. ER - TY - JOUR T1 - The Research of Relationship between Self-similar of TCP and Network Performance JF - Journal on communications Y1 - 2004 A1 - yan Liu A1 - Liangxiu Han VL - 25 IS - 4 ER - TY - JOUR T1 - The DWMM network traffic model JF - Journal of Communication Y1 - 2003 A1 - Cong Suo A1 - Liangxiu Han VL - 24 IS - 5 ER - TY - JOUR T1 - Application of the methodology combined formal method with Object-Oriented technology in E-commerce JF - Journal of Computer Engineering Y1 - 2002 A1 - Liangxiu Han VL - 29 IS - z1 ER - TY - JOUR T1 - A network traffic model based on the cascade process JF - Journal of Mini-Micro Computer System Y1 - 2002 A1 - Liangxiu Han A1 - yan Liu A1 - Zhiwei Cen VL - 23 IS - 12 ER - TY - JOUR T1 - A new multifractal network traffic model JF - Journal of Chaos, solitons & fractals Y1 - 2002 A1 - Liangxiu Han A1 - Zhiwei Ceng A1 - Chuanshan Gao PB - Elsevier Science VL - 13 IS - 7 ER - TY - JOUR T1 - The characterizing network traffic based on the wavelet technique JF - Journal of Mini-Micro Computer System Y1 - 2001 A1 - Liangxiu Han A1 - Cong Suo VL - 22 IS - 9 ER - TY - CONF T1 - A new multifractal traffic model based on the wavelet transform T2 - ISCA 14th International Conference on Parallel and Distributed Computing systems Y1 - 2001 A1 - Chuanshan Gao A1 - Liangxiu Han JF - ISCA 14th International Conference on Parallel and Distributed Computing systems CY - Texas, USA ER - TY - JOUR T1 - Neural network applied to the prediction of the failure stress for pressurized cylinders containing defects JF - International Journal of Pressure Vessels and Piping Y1 - 1999 A1 - Lianghao Han A1 - Liangxiu Han A1 - zengdian Liu PB - Elsevier VL - 76 IS - 4 ER - TY - JOUR T1 - Questions considered in object-oriented software quality metrics based on Java environment JF - Journal of East China University of Science and Technology Y1 - 1999 A1 - Liangxiu Han PB - East China University of Science and Technology ER - TY - JOUR T1 - On Zero-symmetric BZ-algebras JF - Journal of East China University of Science and Technology Y1 - 1997 A1 - Wang, Y. A1 - Liangxiu Han PB - East China University of Science and Technology VL - 23 IS - 6 ER -