NMR metabolic analysis of samples using fuzzy K-means clustering

The global analysis of metabolites can be used to define the phenotypes of cells, tissues or organisms. Classifying groups of samples based on their metabolic profile is one of the main topics of metabolomics research. Crisp clustering methods assign each feature to one cluster, thereby omitting inf...

Full description

Saved in:
Bibliographic Details
Published in:Magnetic resonance in chemistry Vol. 47; no. S1; pp. S96 - S104
Main Authors: Čuperlović-Culf, Miroslava, Belacel, Nabil, Culf, Adrian S., Chute, Ian C., Ouellette, Rodney J., Burton, Ian W., Karakach, Tobias K., Walter, John A.
Format: Journal Article
Language:English
Published: Chichester, UK John Wiley & Sons, Ltd 01-12-2009
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The global analysis of metabolites can be used to define the phenotypes of cells, tissues or organisms. Classifying groups of samples based on their metabolic profile is one of the main topics of metabolomics research. Crisp clustering methods assign each feature to one cluster, thereby omitting information about the multiplicity of sample subtypes. Here, we present the application of fuzzy K‐means clustering method for the classification of samples based on metabolomics 1D 1H NMR fingerprints. The sample classification was performed on NMR spectra of cancer cell line extracts and of urine samples of type 2 diabetes patients and animal models. The cell line dataset included NMR spectra of lipophilic cell extracts for two normal and three cancer cell lines with cancer cell lines including two invasive and one non‐invasive cancers. The second dataset included previously published NMR spectra of urine samples of human type 2 diabetics and healthy controls, mouse wild type and diabetes model and rat obese and lean phenotypes. The fuzzy K‐means clustering method allowed more accurate sample classification in both datasets relative to the other tested methods including principal component analysis (PCA), hierarchical clustering (HCL) and K‐means clustering. In the cell line samples, fuzzy clustering provided a clear separation of individual cell lines, groups of cancer and normal cell lines as well as non‐invasive and invasive tumour cell lines. In the diabetes dataset, clear separation of healthy controls and diabetics in all three models was possible only by using the fuzzy clustering method. Copyright © 2009 Crown in the right of Canada. Published by John Wiley & Sons, Ltd. The application of fuzzy K‐means (FKM) clustering method for the analysis of 1D 1H NMR metabolomics measurements of different types of cell lines as well as urine samples is presented. In FKM clustering, each sample is assigned to all clusters with varying memberships. The FKM clustering allowed more accurate sample classification relative to principal component analysis, hierarchical clustering and K‐means clustering. Fuzzy clustering method provided clear separation of sample types, and the analysis of membership values allowed assignment of subtypes.
Bibliography:Atlantic Innovation Fund
National Research Council Atlantic Initiative
ArticleID:MRC2502
istex:4D3D520A710DE6060514ED205906FB06A97C6C8A
ark:/67375/WNG-3FC3RP45-F
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
ISSN:0749-1581
1097-458X
1097-458X
DOI:10.1002/mrc.2502