Decoding transaminase motifs: Tracing the unknown patterns for enhancing the accuracy of computational screening methodologies

•A dedicated 246-sequence dataset of transaminase sequences was systematicallly constructed.•Phylogenetics and AlphaFold reveal conserved motifs, mapping crucial residues for transaminase activity.•Identified 5, 7, 10, and 2 novel motifs in ADCL, BCAT, DATA, and RATA transaminase subfamilies. Transa...

Full description

Saved in:
Bibliographic Details
Published in:Gene Vol. 936; p. 149091
Main Authors: Runthala, Ashish, Satya Sri, Pulla Sai, Nair, Aayush Sasikumar, Puttagunta, Murali Krishna, Sekhar Rao, T. Chandra, Sreya, Vajrala, Sowmya, Ganugapati Reshma, Reddy, G. Koteswara
Format: Journal Article
Language:English
Published: Elsevier B.V 05-02-2025
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•A dedicated 246-sequence dataset of transaminase sequences was systematicallly constructed.•Phylogenetics and AlphaFold reveal conserved motifs, mapping crucial residues for transaminase activity.•Identified 5, 7, 10, and 2 novel motifs in ADCL, BCAT, DATA, and RATA transaminase subfamilies. Transaminases, enzymes known for their amino group transfer capabilities, encompass four distinct subfamilies: D-alanine transaminase (DATA), L-selective Branched chain aminotransferase (BCAT), and 4-amino-4-deoxychorismate lyase (ADCL) and R-selective aminotransferase (RATA). RATA enzymes are particularly valuable in biocatalysis for synthesizing chiral amines and resolving racemic mixtures, yet their identification in sequence databases is challenging due to the lack of robust motif-based screening methods. Constructing a sequence dataset of transaminases, and categorizing them to various subfamilies, the conserved motifs are screened over the experimentally known ones, and the novel motifs are explored. Phylogenetic clustering of these subfamilies and structural localization of the identified motifs on the Alphafold-predicted protein models of the representative sequences validate their functional importance. For the ADCL, BCAT, DATA, and RATA datasets, we identified 5, 7, 10, and 2 novel motifs, with 3, 5, 7, and 2 motifs localized on secondary structures, confirming their structural importance. Furthermore, the analysis revealed 1, 3, 2, and 1 unique residue patterns of 293-KxxxR-297; 336-KxxxxY-341, 379-ExxxxNxF-386, and 453-ExFxxGT-459; 187-HxxRL-191, and 284-DxRWxxCDIK-293; and 191-HxxRL-195, integrating of which in the known computational tools would improve their accuracy. The conserved residue pattern or motif-based computational approach for robustly screening the transaminases holds promise for unveiling the novel RATA enzymes, facilitating their exploitation in biocatalytic applications.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0378-1119
1879-0038
1879-0038
DOI:10.1016/j.gene.2024.149091