Bioengineering
Data Integration for Dynamic and Sustainable Systems Biology Resources: Challenges and Lessons Learned
Article first published online: 20 MAY 2010
DOI: 10.1002/cbdv.200900317
Copyright © 2010 Verlag Helvetica Chimica Acta AG, Zürich
Issue
1612-1880/asset/cover.gif?v=1&s=cc68b12e733d21d6415eeec10551f83cb03996eb)
Chemistry & Biodiversity
Special Issue: Systems Biology in the Microbial World and Beyond
Volume 7, Issue 5, pages 1124–1141, May 2010
Additional Information
How to Cite
Sullivan, Daniel E., Gabbard, Joseph L., Shukla, M. and Sobral, B. (2010), Data Integration for Dynamic and Sustainable Systems Biology Resources: Challenges and Lessons Learned. Chemistry & Biodiversity, 7: 1124–1141. doi: 10.1002/cbdv.200900317
Publication History
- Issue published online: 20 MAY 2010
- Article first published online: 20 MAY 2010
- Manuscript Received: 1 SEP 2009
- Abstract
- References
- Cited By
Keywords:
- Systems biology;
- Infectious diseases;
- Data integration;
- Bioinformatics
Abstract
Systems-biology and infectious-disease (host–pathogen–environment) research and development is becoming increasingly dependent on integrating data from diverse and dynamic sources. Maintaining integrated resources over long periods of time presents distinct challenges. This review describes experiences and lessons learned from integrating data in two five-year projects focused on pathosystems biology: the Pathosystems Resource Integration Center (PATRIC, http://patric.vbi.vt.edu/), with a goal of developing bioinformatics resources for the research and countermeasures-development communities based on genomics data, and the Resource Center for Biodefense Proteomics Research (RCBPR, http://www.proteomicsresource.org/), with a goal of developing resources based on the experiment data such as microarray and proteomics data from diverse sources and technologies. Some challenges include integrating genomic sequence and experiment data, data synchronization, data quality control, and usability engineering. We present examples of a variety of data-integration problems drawn from our experiences with PATRIC and RBPRC, as well as open research questions related to long-term sustainability, and describe the next steps to meeting these challenges. Novel contributions of this work include 1) an approach for addressing discrepancies between experiment results and interpreted results, and 2) expanding the range of data-integration techniques to include usability engineering at the presentation level.

1612-1880/asset/olbannerleft.gif?v=1&s=4a84f8e2a477e35281af9c4f448bba416086b2cd)
1612-1880/asset/olbannerright.gif?v=1&s=4b6196b7678c53479419a28e69a5ed821fc2a749)