PRICE: software for the targeted assembly of components of (Meta) genomic sequence data

Low-cost DNA sequencing technologies have expanded the role for direct nucleic acid sequencing in the analysis of genomes, transcriptomes, and the metagenomes of whole ecosystems. Human and machine comprehension of such large datasets can be simplified via synthesis of sequence fragments into long,...

Full description

Saved in:
Bibliographic Details
Published in:G3 : genes - genomes - genetics Vol. 3; no. 5; pp. 865 - 880
Main Authors: Ruby, J Graham, Bellare, Priya, Derisi, Joseph L
Format: Journal Article
Language:English
Published: United States Genetics Society of America 01-05-2013
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Low-cost DNA sequencing technologies have expanded the role for direct nucleic acid sequencing in the analysis of genomes, transcriptomes, and the metagenomes of whole ecosystems. Human and machine comprehension of such large datasets can be simplified via synthesis of sequence fragments into long, contiguous blocks of sequence (contigs), but most of the progress in the field of assembly has focused on genomes in isolation rather than metagenomes. Here, we present software for paired-read iterative contig extension (PRICE), a strategy for focused assembly of particular nucleic acid species using complex metagenomic data as input. We describe the assembly strategy implemented by PRICE and provide examples of its application to the sequence of particular genes, transcripts, and virus genomes from complex multicomponent datasets, including an assembly of the BCBL-1 strain of Kaposi's sarcoma-associated herpesvirus. PRICE is open-source and available for free download (derisilab.ucsf.edu/software/price/ or sourceforge.net/projects/pricedenovo/).
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Sequence data from this article have been deposited in the NCBI SRA (NCBI Sequence Read Archive) under accession no. SRS367470.
Supporting information is available online at http://www.g3journal.org/lookup/suppl/doi:10.1534/g3.113.005967/-/DC1
Present address: Novartis Institutes for Biomedical Research, Emeryville, CA.
ISSN:2160-1836
2160-1836
DOI:10.1534/g3.113.005967