Research Article
Multiple approaches to data-mining of proteomic data based on statistical and pattern classification methods
Article first published online: 9 SEP 2003
DOI: 10.1002/pmic.200300512
Copyright © 2003 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
Issue
1615-9861/asset/cover.gif?v=1&s=463b401ca5e74069f6be8214ba1f0e643cf2e9ae)
PROTEOMICS
Special Issue: Mining MALDI-TOF Data
Volume 3, Issue 9, pages 1704–1709, September 2003
Additional Information
How to Cite
Tatay, J. W., Feng, X., Sobczak, N., Jiang, H., Chen, C.-F., Kirova, R., Struble, C., Wang, N. J. and Tonellato, P. J. (2003), Multiple approaches to data-mining of proteomic data based on statistical and pattern classification methods. PROTEOMICS, 3: 1704–1709. doi: 10.1002/pmic.200300512
Publication History
- Issue published online: 9 SEP 2003
- Article first published online: 9 SEP 2003
- Manuscript Received: 16 OCT 2002
- Abstract
- References
- Cited By
Keywords:
- Artificial neural networks;
- Fuzzy logic;
- Principle component analysis;
- Support vector machines
Abstract
The data-mining challenge presented is composed of two fundamental problems. Problem one is the separation of forty-one subjects into two classifications based on the data produced by the mass spectrometry of protein samples from each subject. Problem two is to find the specific differences between protein expression data of two sets of subjects. In each problem, one group of subjects has a disease, while the other group is nondiseased. Each problem was approached with the intent to introduce a new and potentially useful tool to analyze protein expression from mass spectrometry data. A variety of methodologies, both conventional and nonconventional were used in the analysis of these problems. The results presented show both overlap and discrepancies. What is important is the breadth of the techniques and the future direction this analysis will create.

1615-9861/asset/olbannerleft.gif?v=1&s=5e7e0f1cdb0951c5b1ba024be31918c1f138c065)