Email updates

Keep up to date with the latest news and content from Health Information Science and Systems and BioMed Central.

Open Access Methodology

Dynamic integration of biological data sources using the data concierge

Peng Gong

Author Affiliations

Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies, the University of Sydney, Sydney, NSW, 2006, Australia

Department of PET and Nuclear Medicine, RPA Hospital, Camperdown, NSW, 2050, Australia

Health Information Science and Systems 2013, 1:7  doi:10.1186/2047-2501-1-7

Published: 4 February 2013

Abstract

Background

The ever-changing landscape of large-scale network environments and innovative biology technologies require dynamic mechanisms to rapidly integrate previously unknown bioinformatics sources at runtime. However, existing integration technologies lack sufficient flexibility to adapt to these changes, because the techniques used for integration are static, and sensitive to new or changing bioinformatics source implementations and evolutionary biologist requirements.

Methods

To address this challenge, in this paper we propose a new semantics-based adaptive middleware, the Data Concierge, which is able to dynamically integrate heterogeneous biological data sources without the need for wrappers. Along with the architecture necessary to facilitate dynamic integration, API description mechanism is proposed to dynamically classify, recognize, locate, and invoke newly added biological data source functionalities. Based on the unified semantic metadata, XML-based state machines are able to provide flexible configurations to execute biologist's abstract and complex operations.

Results and discussion

Experimental results demonstrate that for obtaining dynamic features, the Data Concierge sacrifices reasonable performance on reasoning knowledge models and dynamically doing data source API invocations. The overall costs to integrate new biological data sources are significantly lower when using the Data Concierge.

Conclusions

The Data Concierge facilitates the rapid integration of new biological data sources in existing applications with no repetitive software development required, and hence, this mechanism would provide a cost-effective solution to the labor-intensive software engineering tasks.

Keywords:
Biology; Middleware; Data integration; Ontology; State machine