GTI: a novel algorithm for identifying outlier gene expression profiles from integrated microarray datasets

Meta-analysis of gene expression microarray datasets presents significant challenges for statistical analysis. We developed and validated a new bioinformatic method for the identification of genes upregulated in subsets of samples of a given tumour type ('outlier genes'), a hallmark of pot...

Full description

Saved in:
Bibliographic Details
Published in:PloS one Vol. 6; no. 2; p. e17259
Main Authors: Mpindi, John Patrick, Sara, Henri, Haapa-Paananen, Saija, Kilpinen, Sami, Pisto, Tommi, Bucher, Elmar, Ojala, Kalle, Iljin, Kristiina, Vainio, Paula, Björkman, Mari, Gupta, Santosh, Kohonen, Pekka, Nees, Matthias, Kallioniemi, Olli
Format: Journal Article
Language:English
Published: United States Public Library of Science 18-02-2011
Public Library of Science (PLoS)
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Meta-analysis of gene expression microarray datasets presents significant challenges for statistical analysis. We developed and validated a new bioinformatic method for the identification of genes upregulated in subsets of samples of a given tumour type ('outlier genes'), a hallmark of potential oncogenes. A new statistical method (the gene tissue index, GTI) was developed by modifying and adapting algorithms originally developed for statistical problems in economics. We compared the potential of the GTI to detect outlier genes in meta-datasets with four previously defined statistical methods, COPA, the OS statistic, the t-test and ORT, using simulated data. We demonstrated that the GTI performed equally well to existing methods in a single study simulation. Next, we evaluated the performance of the GTI in the analysis of combined Affymetrix gene expression data from several published studies covering 392 normal samples of tissue from the central nervous system, 74 astrocytomas, and 353 glioblastomas. According to the results, the GTI was better able than most of the previous methods to identify known oncogenic outlier genes. In addition, the GTI identified 29 novel outlier genes in glioblastomas, including TYMS and CDKN2A. The over-expression of these genes was validated in vivo by immunohistochemical staining data from clinical glioblastoma samples. Immunohistochemical data were available for 65% (19 of 29) of these genes, and 17 of these 19 genes (90%) showed a typical outlier staining pattern. Furthermore, raltitrexed, a specific inhibitor of TYMS used in the therapy of tumour types other than glioblastoma, also effectively blocked cell proliferation in glioblastoma cell lines, thus highlighting this outlier gene candidate as a potential therapeutic target. Taken together, these results support the GTI as a novel approach to identify potential oncogene outliers and drug targets. The algorithm is implemented in an R package (Text S1).
Bibliography:Conceived and designed the experiments: JPM SH-P OK. Performed the experiments: SH-P. Analyzed the data: JPM SH-P. Contributed reagents/materials/analysis tools: SK TP EB KI PV MB SG PK. Wrote the paper: JPM SH-P MN OK. Helped in the development of GeneSapiens: HS. Helped perform method simulations testing: HS. Helped in development of GeneSapiens database: KO.
ISSN:1932-6203
1932-6203
DOI:10.1371/journal.pone.0017259