Use of keyword hierarchies to interpret gene expression patterns
Motivation: High-density microarray technology permits the quantitative and simultaneous monitoring of thousands of genes. The interpretation challenge is to extract relevant information from this large amount of data. A growing variety of statistical analysis approaches are available to identify cl...
Saved in:
Published in: | Bioinformatics Vol. 17; no. 4; pp. 319 - 326 |
---|---|
Main Authors: | , , , , , |
Format: | Journal Article |
Language: | English |
Published: |
Oxford
Oxford University Press
01-04-2001
Oxford Publishing Limited (England) |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Motivation: High-density microarray technology permits the quantitative and simultaneous monitoring of thousands of genes. The interpretation challenge is to extract relevant information from this large amount of data. A growing variety of statistical analysis approaches are available to identify clusters of genes that share common expression characteristics, but provide no information regarding the biological similarities of genes within clusters. The published literature provides a potential source of information to assist in interpretation of clustering results. Results: We describe a data mining method that uses indexing terms (‘keywords’) from the published literature linked to specific genes to present a view of the conceptual similarity of genes within a cluster or group of interest. The method takes advantage of the hierarchical nature of Medical Subject Headings used to index citations in the MEDLINE database, and the registry numbers applied to enzymes. Availability: We have created a publicly accessible website that provides this form of gene expression interpretation at http://www.array.ucsd.edu. Contact: dmasys@ucsd.edu 8 To whom correspondence should be addressed at: University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, CA 92093-0602, USA. 7 Present address: Genomics Institute of the Novartis Research Foundation, 3115 Merryfield Row, San Diego, CA 92121, USA. |
---|---|
Bibliography: | ark:/67375/HXZ-40314BLQ-T istex:659D126F62B47A05C469165C8367AF246B51F63E local:170319 PII:1460-2059 ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 ObjectType-Article-1 ObjectType-Feature-2 |
ISSN: | 1367-4803 1460-2059 1367-4811 |
DOI: | 10.1093/bioinformatics/17.4.319 |