Chemical compound navigator: A web-based chem-BLAST, chemical taxonomy-based search engine for browsing compounds
A novel technique to annotate, query, and analyze chemical compounds has been developed and is illustrated by using the inhibitor data on HIV protease‐inhibitor complexes. In this method, all chemical compounds are annotated in terms of standard chemical structural fragments. These standard fragment...
Saved in:
Published in: | Proteins, structure, function, and bioinformatics Vol. 63; no. 4; pp. 907 - 917 |
---|---|
Main Authors: | , , , , |
Format: | Journal Article |
Language: | English |
Published: |
Hoboken
Wiley Subscription Services, Inc., A Wiley Company
01-06-2006
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | A novel technique to annotate, query, and analyze chemical compounds has been developed and is illustrated by using the inhibitor data on HIV protease‐inhibitor complexes. In this method, all chemical compounds are annotated in terms of standard chemical structural fragments. These standard fragments are defined by using criteria, such as chemical classification; structural, chemical, or functional groups; and commercial, scientific or common names or synonyms. These fragments are then organized into a data tree based on their chemical substructures. Search engines have been developed to use this data tree to enable query on inhibitors of HIV protease (http://xpdb.nist.gov/hivsdb/hivsdb.html). These search engines use a new novel technique, Chemical Block Layered Alignment of Substructure Technique (Chem‐BLAST) to search on the fragments of an inhibitor to look for its chemical structural neighbors. This novel technique to annotate and query compounds lays the foundation for the use of the Semantic Web concept on chemical compounds to allow end users to group, sort, and search structural neighbors accurately and efficiently. During annotation, it enables the attachment of “meaning” (i.e., semantics) to data in a manner that far exceeds the current practice of associating “metadata” with data by creating a knowledge base (or ontology) associated with compounds. Intended users of the technique are the research community and pharmaceutical industry, for which it will provide a new tool to better identify novel chemical structural neighbors to aid drug discovery. Proteins 2006. © 2006 Wiley‐Liss, Inc. |
---|---|
Bibliography: | Intramural Research Program of the NIH, National Cancer Institute, Center for Cancer Research Systems Integration for Manufacturing Applications (SIMA) Certain trade and company products are identified in this article to specify adequately the computer products needed to develop this data system. In no case, does such identification imply endorsement by the National Institute of Standards and Technology (NIST), nor does it imply that the products are necessarily the best available for the purpose. Exploratory Research Award for the year 2004 ark:/67375/WNG-FG6PM180-D ArticleID:PROT20914 This article is a US Government work and, as such, is in the public domain in the United States of America. istex:106BCADDFD6E7DD6C98CA8ED6B707B7DF171E09C ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 0887-3585 1097-0134 |
DOI: | 10.1002/prot.20914 |