Decoding transaminase motifs: Tracing the unknown patterns for enhancing the accuracy of computational screening methodologies
•A dedicated 246-sequence dataset of transaminase sequences was systematicallly constructed.•Phylogenetics and AlphaFold reveal conserved motifs, mapping crucial residues for transaminase activity.•Identified 5, 7, 10, and 2 novel motifs in ADCL, BCAT, DATA, and RATA transaminase subfamilies. Transa...
Saved in:
Published in: | Gene Vol. 936; p. 149091 |
---|---|
Main Authors: | , , , , , , , |
Format: | Journal Article |
Language: | English |
Published: |
Elsevier B.V
05-02-2025
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •A dedicated 246-sequence dataset of transaminase sequences was systematicallly constructed.•Phylogenetics and AlphaFold reveal conserved motifs, mapping crucial residues for transaminase activity.•Identified 5, 7, 10, and 2 novel motifs in ADCL, BCAT, DATA, and RATA transaminase subfamilies.
Transaminases, enzymes known for their amino group transfer capabilities, encompass four distinct subfamilies: D-alanine transaminase (DATA), L-selective Branched chain aminotransferase (BCAT), and 4-amino-4-deoxychorismate lyase (ADCL) and R-selective aminotransferase (RATA). RATA enzymes are particularly valuable in biocatalysis for synthesizing chiral amines and resolving racemic mixtures, yet their identification in sequence databases is challenging due to the lack of robust motif-based screening methods. Constructing a sequence dataset of transaminases, and categorizing them to various subfamilies, the conserved motifs are screened over the experimentally known ones, and the novel motifs are explored. Phylogenetic clustering of these subfamilies and structural localization of the identified motifs on the Alphafold-predicted protein models of the representative sequences validate their functional importance. For the ADCL, BCAT, DATA, and RATA datasets, we identified 5, 7, 10, and 2 novel motifs, with 3, 5, 7, and 2 motifs localized on secondary structures, confirming their structural importance. Furthermore, the analysis revealed 1, 3, 2, and 1 unique residue patterns of 293-KxxxR-297; 336-KxxxxY-341, 379-ExxxxNxF-386, and 453-ExFxxGT-459; 187-HxxRL-191, and 284-DxRWxxCDIK-293; and 191-HxxRL-195, integrating of which in the known computational tools would improve their accuracy. The conserved residue pattern or motif-based computational approach for robustly screening the transaminases holds promise for unveiling the novel RATA enzymes, facilitating their exploitation in biocatalytic applications. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 0378-1119 1879-0038 1879-0038 |
DOI: | 10.1016/j.gene.2024.149091 |