Using the miraEST assembler for reliable and automated mRNA transcript assembly and SNP detection in sequenced ESTs

We present an EST sequence assembler that specializes in reconstruction of pristine mRNA transcripts, while at the same time detecting and classifying single nucleotide polymorphisms (SNPs) occuring in different variations thereof. The assembler uses iterative multipass strategies centered on high-c...

Full description

Saved in:
Bibliographic Details
Published in:Genome research Vol. 14; no. 6; pp. 1147 - 1159
Main Authors: Chevreux, Bastien, Pfisterer, Thomas, Drescher, Bernd, Driesel, Albert J, Müller, Werner E G, Wetter, Thomas, Suhai, Sándor
Format: Journal Article
Language:English
Published: United States Cold Spring Harbor Laboratory Press 01-06-2004
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We present an EST sequence assembler that specializes in reconstruction of pristine mRNA transcripts, while at the same time detecting and classifying single nucleotide polymorphisms (SNPs) occuring in different variations thereof. The assembler uses iterative multipass strategies centered on high-confidence regions within sequences and has a fallback strategy for using low-confidence regions when needed. It features special functions to assemble high numbers of highly similar sequences without prior masking, an automatic editor that edits and analyzes alignments by inspecting the underlying traces, and detection and classification of sequence properties like SNPs with a high specificity and a sensitivity down to one mutation per sequence. In addition, it includes possibilities to use incorrectly preprocessed sequences, routines to make use of additional sequencing information such as base-error probabilities, template insert sizes, strain information, etc., and functions to detect and resolve possible misassemblies. The assembler is routinely used for such various tasks as mutation detection in different cell types, similarity analysis of transcripts between organisms, and pristine assembly of sequences from various sources for oligo design in clinical microarray experiments.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1917404. Article published online before print in May 2004.
Corresponding author.…E-MAIL bastien@chevreux.org; FAX +49 6227 422333.
ISSN:1088-9051
DOI:10.1101/gr.1917404