ABRA: improved coding indel detection via assembly-based realignment

Variant detection from next-generation sequencing (NGS) data is an increasingly vital aspect of disease diagnosis, treatment and research. Commonly used NGS-variant analysis tools generally rely on accurately mapped short reads to identify somatic variants and germ-line genotypes. Existing NGS read...

Full description

Saved in:
Bibliographic Details
Published in:Bioinformatics (Oxford, England) Vol. 30; no. 19; pp. 2813 - 2815
Main Authors: Mose, Lisle E, Wilkerson, Matthew D, Hayes, D Neil, Perou, Charles M, Parker, Joel S
Format: Journal Article
Language:English
Published: England Oxford University Press 01-10-2014
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Variant detection from next-generation sequencing (NGS) data is an increasingly vital aspect of disease diagnosis, treatment and research. Commonly used NGS-variant analysis tools generally rely on accurately mapped short reads to identify somatic variants and germ-line genotypes. Existing NGS read mappers have difficulty accurately mapping short reads containing complex variation (i.e. more than a single base change), thus making identification of such variants difficult or impossible. Insertions and deletions (indels) in particular have been an area of great difficulty. Indels are frequent and can have substantial impact on function, which makes their detection all the more imperative. We present ABRA, an assembly-based realigner, which uses an efficient and flexible localized de novo assembly followed by global realignment to more accurately remap reads. This results in enhanced performance for indel detection as well as improved accuracy in variant allele frequency estimation. ABRA is implemented in a combination of Java and C/C++ and is freely available for download at https://github.com/mozack/abra.
Bibliography:Associate Editor: Michael Brudno
ISSN:1367-4803
1367-4811
DOI:10.1093/bioinformatics/btu376