Research Article
NMR metabolic analysis of samples using fuzzy K-means clustering
Article first published online: 4 SEP 2009
DOI: 10.1002/mrc.2502
Copyright © 2009 Crown in the right of Canada. Published by John Wiley & Sons, Ltd.
Issue
1097-458Xa/asset/cover.gif?v=1&s=d5f2b3f6d127fcc37c26f1dbdb0c8887da653c0f)
Magnetic Resonance in Chemistry
Supplement: NMR-based mixture analysis – metabolomics and beyond
Volume 47, Issue S1, pages S96–S104, December 2009
Additional Information
How to Cite
Čuperlović-Culf, M., Belacel, N., Culf, A. S., Chute, I. C., Ouellette, R. J., Burton, I. W., Karakach, T. K. and Walter, J. A. (2009), NMR metabolic analysis of samples using fuzzy K-means clustering. Magnetic Resonance in Chemistry, 47: S96–S104. doi: 10.1002/mrc.2502
Publication History
- Issue published online: 6 NOV 2009
- Article first published online: 4 SEP 2009
- Manuscript Accepted: 22 JUL 2009
- Manuscript Revised: 22 FEB 2009
- Manuscript Received: 27 OCT 2008
Funded by
- National Research Council Atlantic Initiative
- Atlantic Innovation Fund
- Abstract
- References
- Cited By
Keywords:
- fuzzy clustering;
- sample classification;
- metabolomics;
- metabolic profiling;
- mixture analysis;
- sample subtypes;
- 1H NMR;
- phenotype analysis
Graphical Abstract

The application of fuzzy K-means (FKM) clustering method for the analysis of 1D 1H NMR metabolomics measurements of different types of cell lines as well as urine samples is presented. In FKM clustering, each sample is assigned to all clusters with varying memberships. The FKM clustering allowed more accurate sample classification relative to principal component analysis, hierarchical clustering and K-means clustering. Fuzzy clustering method provided clear separation of sample types, and the analysis of membership values allowed assignment of subtypes.
Abstract
The global analysis of metabolites can be used to define the phenotypes of cells, tissues or organisms. Classifying groups of samples based on their metabolic profile is one of the main topics of metabolomics research. Crisp clustering methods assign each feature to one cluster, thereby omitting information about the multiplicity of sample subtypes. Here, we present the application of fuzzy K-means clustering method for the classification of samples based on metabolomics 1D 1H NMR fingerprints. The sample classification was performed on NMR spectra of cancer cell line extracts and of urine samples of type 2 diabetes patients and animal models. The cell line dataset included NMR spectra of lipophilic cell extracts for two normal and three cancer cell lines with cancer cell lines including two invasive and one non-invasive cancers. The second dataset included previously published NMR spectra of urine samples of human type 2 diabetics and healthy controls, mouse wild type and diabetes model and rat obese and lean phenotypes. The fuzzy K-means clustering method allowed more accurate sample classification in both datasets relative to the other tested methods including principal component analysis (PCA), hierarchical clustering (HCL) and K-means clustering. In the cell line samples, fuzzy clustering provided a clear separation of individual cell lines, groups of cancer and normal cell lines as well as non-invasive and invasive tumour cell lines. In the diabetes dataset, clear separation of healthy controls and diabetics in all three models was possible only by using the fuzzy clustering method. Copyright © 2009 Crown in the right of Canada. Published by John Wiley & Sons, Ltd.

1097-458Xa/asset/MRC_left.gif?v=1&s=0f1e59ed442848dec17e962f55cca7913850ed0a)
1097-458Xa/asset/MRC_right.gif?v=1&s=3c66d51ecc5485b25b34d7e6c299cf538bc0f5c6)