Extracting protein alignment models from the sequence database
Biologists often gain structural and functional insights into a protein sequence by constructing a multiple alignment model of the family. Here a program called PROBE fully automates this process of model construction starting from a single sequence. Central to this program is a powerful new method...
Saved in:
Published in: | Nucleic acids research Vol. 25; no. 9; pp. 1665 - 1677 |
---|---|
Main Authors: | , , , |
Format: | Journal Article |
Language: | English |
Published: |
England
Oxford University Press
01-05-1997
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Biologists often gain structural and functional insights into a protein sequence by constructing a multiple alignment model of the family. Here a program called PROBE fully automates this process of model construction starting from a single sequence. Central to this program is a powerful new method to locate and align only those, often subtly, conserved patterns essential to the family as a whole. When applied to randomly chosen proteins, PROBE found on average about four times as many relationships as a pairwise search and yielded many new discoveries. These include: an obscure subfamily of globins in the roundworm Caenorhabditis elegans; two new superfamilies of metallohydrolases; a lipoyl/biotin swinging arm domain in bacterial membrane fusion proteins; and a DH domain in the yeast Bud3 and Fus2 proteins. By identifying distant relationships and merging families into superfamilies in this way, this analysis further confirms the notion that proteins evolved from relatively few ancient sequences. Moreover, this method automatically generates models of these ancient conserved regions for rapid and sensitive screening of sequences. |
---|---|
Bibliography: | istex:0F076D676E70F72742D1AC6B668F20D0DE022284 ark:/67375/HXZ-GBFHJGJX-P ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 0305-1048 1362-4962 |
DOI: | 10.1093/nar/25.9.1665 |