Skip to main content

IBM and Indigo BioSystems Create Research Repository for Data Mining

"TrueBlue Archive" Will Store Raw Life Sciences Data for Proteomics and Drug Testing

SOMERS, NY & INDIANAPOLIS, IN - 17 May 2005: IBM and Indigo BioSystems, Inc. today announced a collaboration to host a global warehouse capable of storing raw human proteomics data. The warehouse, based on Indigo BioSystems' True Blue Archive technology, is specifically designed to reliably capture and store Life Sciences research data from a wide variety of instruments. The result is a public database of information designed to help global researchers reduce time spent and costs by re-mining existing data to develop new conclusions. This work is part of a worldwide community effort.

"This repository will test one way to meet an important requirement for the Human Proteome Project: accessibility of raw data to enable global collaborative analysis," said Indigo chief scientific officer, Randall K. Julian, Ph.D. "The final implementation of such a repository will make raw proteomics data available to the international research community for the first time."

The joint effort between IBM and Indigo BioSystems, Inc. to test a model for global implementation of a proteome database was constructed using open standards developed by the Human Proteome Organization - Proteomics Standards Initiative (HUPO-PSI). The Archive will house instrument data from proprietary formats, which has been migrated into new open standards, like the HUPO-PSI mzData standard.

"Open standards are vitally important to continued progress in this field," said Michael Niemi, Solutions Architect in the IBM Healthcare and Life Sciences Organization. "This project uses open standards from HUPO-PSI to represent the proteomics raw instrument data, along with Web services to access the data. Web services can then be integrated into workflows using Service Orientated Architectures and broad industry standards such as BPEL."

Julian said that he expects the final HUPO repository to be a widely distributed system. "There is now a general acknowledgement that everything from collaborations to peer review can be dramatically enhanced when the raw instrument data collected in proteomic experiments is easily shared. For several years now, we have worked with industrial groups on data interchange and raw data repositories and I think our systems can help large scale projects like HUPO. We announced at the HUPO World Congress in China our intent to provide our technology for use in this important project."

Indigo BioSystems, Inc. first began to develop the True Blue archive to meet the data management challenges of a large pharma Drug Disposition group. The Archive was then applied to proteomics, which seeks to identify and to characterize all the proteins synthesized in biological systems. Based on this information, researchers can then try to understand how individual proteins or protein groups function within an organism. Experimenters hope that access to raw data will accelerate proteomics research and speed the understanding of the human proteome and its relationship to disease processes.

Utilizing the successful IBM Linux on POWER(TM) hardware, the DB2® database, and IBM® WebSphere® Application Server, the repository makes use of Indigo's archival storage technology to help ensure data integrity during storage and retrieval of data.

Contact(s) information

Jorge Alberni
IBM Media Relations
(914) 766-3336
alberni@us.ibm.com

Brian Straight
Indigo BioSystems, Inc.
(317) 536-2736
brian_straight@indigobio.com

Related XML feeds
Topics XML feeds
Research
Chemistry, computer science, electrical engineering, materials and mathematical sciences, physics and services science
Technologies

Document options