SAINT: self-attention augmented inception-inside-inception network improves protein secondary structure prediction

Abstract Motivation Protein structures provide basic insight into how they can interact with other proteins, their functions and biological roles in an organism. Experimental methods (e.g. X-ray crystallography and nuclear magnetic resonance spectroscopy) for predicting the secondary structure (SS)...

Full description

Saved in:

Bibliographic Details
Published in:	Bioinformatics Vol. 36; no. 17; pp. 4599 - 4608
Main Authors:	Uddin, Mostofa Rafid, Mahbub, Sazan, Rahman, M Saifur, Bayzid, Md Shamsuzzoha
Format:	Journal Article
Language:	English
Published:	England Oxford University Press 01-11-2020
Subjects:	Databases, Protein Deep Learning Neural Networks, Computer Protein Structure, Secondary Proteins
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Abstract Motivation Protein structures provide basic insight into how they can interact with other proteins, their functions and biological roles in an organism. Experimental methods (e.g. X-ray crystallography and nuclear magnetic resonance spectroscopy) for predicting the secondary structure (SS) of proteins are very expensive and time consuming. Therefore, developing efficient computational approaches for predicting the SS of protein is of utmost importance. Advances in developing highly accurate SS prediction methods have mostly been focused on 3-class (Q3) structure prediction. However, 8-class (Q8) resolution of SS contains more useful information and is much more challenging than the Q3 prediction. Results We present SAINT, a highly accurate method for Q8 structure prediction, which incorporates self-attention mechanism (a concept from natural language processing) with the Deep Inception-Inside-Inception network in order to effectively capture both the short- and long-range interactions among the amino acid residues. SAINT offers a more interpretable framework than the typical black-box deep neural network methods. Through an extensive evaluation study, we report the performance of SAINT in comparison with the existing best methods on a collection of benchmark datasets, namely, TEST2016, TEST2018, CASP12 and CASP13. Our results suggest that self-attention mechanism improves the prediction accuracy and outperforms the existing best alternate methods. SAINT is the first of its kind and offers the best known Q8 accuracy. Thus, we believe SAINT represents a major step toward the accurate and reliable prediction of SSs of proteins. Availability and implementation SAINT is freely available as an open-source project at https://github.com/SAINTProtein/SAINT.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1367-4803 1460-2059 1367-4811
DOI:	10.1093/bioinformatics/btaa531