Haplotype-Phased Synthetic Long Reads from Short-Read Sequencing

Next-generation DNA sequencing has revolutionized the study of biology. However, the short read lengths of the dominant instruments complicate assembly of complex genomes and haplotype phasing of mixtures of similar sequences. Here we demonstrate a method to reconstruct the sequences of individual n...

Full description

Saved in:
Bibliographic Details
Published in:PloS one Vol. 11; no. 1; p. e0147229
Main Authors: Stapleton, James A, Kim, Jeongwoon, Hamilton, John P, Wu, Ming, Irber, Luiz C, Maddamsetti, Rohan, Briney, Bryan, Newton, Linsey, Burton, Dennis R, Brown, C Titus, Chan, Christina, Buell, C Robin, Whitehead, Timothy A
Format: Journal Article
Language:English
Published: United States Public Library of Science 20-01-2016
Public Library of Science (PLoS)
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Next-generation DNA sequencing has revolutionized the study of biology. However, the short read lengths of the dominant instruments complicate assembly of complex genomes and haplotype phasing of mixtures of similar sequences. Here we demonstrate a method to reconstruct the sequences of individual nucleic acid molecules up to 11.6 kilobases in length from short (150-bp) reads. We show that our method can construct 99.97%-accurate synthetic reads from bacterial, plant, and animal genomic samples, full-length mRNA sequences from human cancer cell lines, and individual HIV env gene variants from a mixture. The preparation of multiple samples can be multiplexed into a single tube, further reducing effort and cost relative to competing approaches. Our approach generates sequencing libraries in three days from less than one microgram of DNA in a single-tube format without custom equipment or specialized expertise.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Conceived and designed the experiments: JAS JK JPH RM BB DRB CC CRB TAW. Performed the experiments: JAS JK JPH LN. Analyzed the data: JAS JK JPH MW LCI RM CTB CRB TAW. Contributed reagents/materials/analysis tools: DRB CC CRB. Wrote the paper: JAS JK JPH CRB TAW.
Competing Interests: The authors of this manuscript have read the journal's policy and have the following competing interests: Concepts described in this publication have been protected by US Patent Application number 14947988, “Methods for assembling and reading nucleic acid sequences from mixed populations.” This does not alter the authors' adherence to PLOS ONE policies on sharing data and materials.
ISSN:1932-6203
1932-6203
DOI:10.1371/journal.pone.0147229