Data Mining Meets Machine Learning: A Novel ANN‐based Multi‐body Interaction Docking Scoring Function (MBI‐score) Based on Utilizing Frequent Geometric and Chemical Patterns of Interfacial Atoms in Native Protein‐ligand Complexes
Accurate prediction of binding poses is crucial to structure‐based drug design. We employ two powerful artificial intelligence (AI) approaches, data‐mining and machine‐learning, to design artificial neural network (ANN) based pose‐scoring function. It is a simple machine‐learning‐based statistical f...
Saved in:
Published in: | Molecular informatics Vol. 41; no. 8; pp. e2100248 - n/a |
---|---|
Main Authors: | , , |
Format: | Journal Article |
Language: | English |
Published: |
Germany
Wiley Subscription Services, Inc
01-08-2022
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Accurate prediction of binding poses is crucial to structure‐based drug design. We employ two powerful artificial intelligence (AI) approaches, data‐mining and machine‐learning, to design artificial neural network (ANN) based pose‐scoring function. It is a simple machine‐learning‐based statistical function that employs frequent geometric and chemical patterns of interacting atoms at protein‐ligand interfaces. The patterns are derived by mining interfaces of “native” protein‐ligand complexes. Each interface is represented by a graph where nodes are atoms and edges connect protein‐ligand interfacial atoms located within certain cutoff distance of each other. Applying frequent subgraph mining to these interfaces provides “native” frequent patterns of interacting atoms. Subsequently, given a pose for a protein‐ligand complex of interest, the pose‐scoring function (the information‐processing unit or neuron) calculates the degree of matching between the interaction patterns present at the pose's interface and the native frequent patterns. The pose‐scoring function takes into account the frequency of occurrence of the matching native patterns, the size of the match, and the degree of geometrical similarity between pose‐specific and matching native frequent patterns. This novel “multi‐body interaction” pose‐scoring function (MBI‐Score) was validated using two databases, PDBbind and Astex‐85, and it outperformed seven commonly used commercial scoring functions. MBI‐Score is available at www.khashanlab.org/mbi‐score. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 1868-1743 1868-1751 |
DOI: | 10.1002/minf.202100248 |