Use of keyword hierarchies to interpret gene expression patterns

Motivation: High-density microarray technology permits the quantitative and simultaneous monitoring of thousands of genes. The interpretation challenge is to extract relevant information from this large amount of data. A growing variety of statistical analysis approaches are available to identify cl...

Full description

Saved in:

Bibliographic Details
Published in:	Bioinformatics Vol. 17; no. 4; pp. 319 - 326
Main Authors:	Masys, Daniel R., Welsh, John B., Lynn Fink, J., Gribskov, Michael, Klacansky, Igor, Corbeil, Jacques
Format:	Journal Article
Language:	English
Published:	Oxford Oxford University Press 01-04-2001 Oxford Publishing Limited (England)
Subjects:	Abstracting and Indexing as Topic Biological and medical sciences Databases, Factual DNA microarrays Fundamental and applied biological sciences. Psychology Gene Expression Profiling General aspects Information Storage and Retrieval Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) MEDLINE Oligonucleotide Array Sequence Analysis Data analysis Automatic classification Statistical analysis Keyword Hierarchical classification Genomics Gene expression Data mining Gene Database Bioinformatics Indexing Information interpretation
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Motivation: High-density microarray technology permits the quantitative and simultaneous monitoring of thousands of genes. The interpretation challenge is to extract relevant information from this large amount of data. A growing variety of statistical analysis approaches are available to identify clusters of genes that share common expression characteristics, but provide no information regarding the biological similarities of genes within clusters. The published literature provides a potential source of information to assist in interpretation of clustering results. Results: We describe a data mining method that uses indexing terms (‘keywords’) from the published literature linked to specific genes to present a view of the conceptual similarity of genes within a cluster or group of interest. The method takes advantage of the hierarchical nature of Medical Subject Headings used to index citations in the MEDLINE database, and the registry numbers applied to enzymes. Availability: We have created a publicly accessible website that provides this form of gene expression interpretation at http://www.array.ucsd.edu. Contact: dmasys@ucsd.edu 8 To whom correspondence should be addressed at: University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, CA 92093-0602, USA. 7 Present address: Genomics Institute of the Novartis Research Foundation, 3115 Merryfield Row, San Diego, CA 92121, USA.
Bibliography:	ark:/67375/HXZ-40314BLQ-T istex:659D126F62B47A05C469165C8367AF246B51F63E local:170319 PII:1460-2059 ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 ObjectType-Article-1 ObjectType-Feature-2
ISSN:	1367-4803 1460-2059 1367-4811
DOI:	10.1093/bioinformatics/17.4.319