Assessing the reliability of a QSAR model's predictions

Quantitative structure activity relationships (QSAR) are one of the well-developed areas in computational chemistry. In this field, many successful predictive models have been developed for various property, activity or toxicity predictions. However, the predictive power of models for new query comp...

Full description

Saved in:

Bibliographic Details
Published in:	Journal of molecular graphics & modelling Vol. 23; no. 6; pp. 503 - 523
Main Authors:	He, Linnan, Jurs, Peter C.
Format:	Journal Article
Language:	English
Published:	United States Elsevier Inc 01-06-2005
Subjects:	Fathead minnow acute aquatic toxicity Hierarchical clustering Models, Chemical Organic Chemicals - chemistry Organic Chemicals - toxicity QSAR model Quantitative Structure-Activity Relationship Toxicity Tests Hierarchical clustering Fathead minnow acute aquatic toxicity QSAR model
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Quantitative structure activity relationships (QSAR) are one of the well-developed areas in computational chemistry. In this field, many successful predictive models have been developed for various property, activity or toxicity predictions. However, the predictive power of models for new query compounds is often not well characterized. The breadth of applicability of models is often not characterized. In other words, with a given QSAR model and a specific query compound to be predicted, can the model be used reliably for the desired prediction? In this study, we assessed the reliability of QSAR models’ prediction on query compounds. Our approach, employing hierarchical clustering, was developed and tested using a test dataset containing 322 organic compounds with fathead minnow acute aquatic toxicity as the activity of interest. The hypothesis of the approach was that if a query compound is more similar to the compounds used to generate the QSAR model, it should be predicted more accurately. Thus, the core of the approach is to determine the relationship between the similarity of query compounds to the training set compounds of the QSAR model and the prediction accuracy given by that model. This relationship determination was achieved by comparing the results given by the two major components of the approach: objects clustering and activity prediction. With the resultant information from the two steps, a direct relationship was shown.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1093-3263 1873-4243
DOI:	10.1016/j.jmgm.2005.03.003