Use of keyword hierarchies to interpret gene expression patterns

Motivation: High-density microarray technology permits the quantitative and simultaneous monitoring of thousands of genes. The interpretation challenge is to extract relevant information from this large amount of data. A growing variety of statistical analysis approaches are available to identify cl...

Full description

Saved in:
Bibliographic Details
Published in:Bioinformatics Vol. 17; no. 4; pp. 319 - 326
Main Authors: Masys, Daniel R., Welsh, John B., Lynn Fink, J., Gribskov, Michael, Klacansky, Igor, Corbeil, Jacques
Format: Journal Article
Language:English
Published: Oxford Oxford University Press 01-04-2001
Oxford Publishing Limited (England)
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Motivation: High-density microarray technology permits the quantitative and simultaneous monitoring of thousands of genes. The interpretation challenge is to extract relevant information from this large amount of data. A growing variety of statistical analysis approaches are available to identify clusters of genes that share common expression characteristics, but provide no information regarding the biological similarities of genes within clusters. The published literature provides a potential source of information to assist in interpretation of clustering results. Results: We describe a data mining method that uses indexing terms (‘keywords’) from the published literature linked to specific genes to present a view of the conceptual similarity of genes within a cluster or group of interest. The method takes advantage of the hierarchical nature of Medical Subject Headings used to index citations in the MEDLINE database, and the registry numbers applied to enzymes. Availability: We have created a publicly accessible website that provides this form of gene expression interpretation at http://www.array.ucsd.edu. Contact: dmasys@ucsd.edu 8 To whom correspondence should be addressed at: University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, CA 92093-0602, USA. 7 Present address: Genomics Institute of the Novartis Research Foundation, 3115 Merryfield Row, San Diego, CA 92121, USA.
Bibliography:ark:/67375/HXZ-40314BLQ-T
istex:659D126F62B47A05C469165C8367AF246B51F63E
local:170319
PII:1460-2059
ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ObjectType-Article-1
ObjectType-Feature-2
ISSN:1367-4803
1460-2059
1367-4811
DOI:10.1093/bioinformatics/17.4.319