Shotgun Sequencing of the Human Transcriptome with ORF Expressed Sequence Tags

Theoretical considerations predict that amplification of expressed gene transcripts by reverse transcription-PCR using arbitrarily chosen primers will result in the preferential amplification of the central portion of the transcript. Systematic, high-throughput sequencing of such products would resu...

Full description

Saved in:
Bibliographic Details
Published in:Proceedings of the National Academy of Sciences - PNAS Vol. 97; no. 7; pp. 3491 - 3496
Main Authors: Neto, Emmanuel Dias, Correa, Ricardo Garcia, Verjovski-Almeida, Sergio, Marcelo R. S. Briones, Nagai, Maria Aparecida, da Silva, Wilson, Zago, Marco Antonio, Bordin, Silvana, Costa, Fernando Ferreira, Goldman, Gustavo Henrique, Carvalho, Alex F., Matsukuma, Adriana, Baia, Gilson S., Simpson, David H., Brunstein, Adriana, Paulo S. L. de Oliveira, Bucher, Philipp, Jongeneel, C. Victor, O'Hare, Michael J., Soares, Fernando, Brentani, Ricardo R., Luis F. L. Reis, de Souza, Sandro J., Andrew J. G. Simpson
Format: Journal Article
Language:English
Published: United States National Academy of Sciences of the United States of America 28-03-2000
National Acad Sciences
National Academy of Sciences
The National Academy of Sciences
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Theoretical considerations predict that amplification of expressed gene transcripts by reverse transcription-PCR using arbitrarily chosen primers will result in the preferential amplification of the central portion of the transcript. Systematic, high-throughput sequencing of such products would result in an expressed sequence tag (EST) database consisting of central, generally coding regions of expressed genes. Such a database would add significant value to existing public EST databases, which consist mostly of sequences derived from the extremities of cDNAs, and facilitate the construction of contigs of transcript sequences. We tested our predictions, creating a database of 10,000 sequences from human breast tumors. The data confirmed the central distribution of the sequences, the significant normalization of the sequence population, the frequent extension of contigs composed of existing human ESTs, and the identification of a series of potentially important homologues of known genes. This approach should make a significant contribution to the early identification of important human genes, the deciphering of the draft human genome sequence currently being compiled, and the shotgun sequencing of the human transcriptome.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ObjectType-Article-1
ObjectType-Feature-2
To whom reprint requests should be addressed. E-mail: asimpson@node1.com.br.
Communicated by Phillip A. Sharp, Massachusetts Institute of Technology, Cambridge, MA
ISSN:0027-8424
1091-6490
DOI:10.1073/pnas.97.7.3491