EnTAP: Bringing faster and smarter functional annotation to non‐model eukaryotic transcriptomes

EnTAP (Eukaryotic Non‐Model Transcriptome Annotation Pipeline) was designed to improve the accuracy, speed, and flexibility of functional gene annotation for de novo assembled transcriptomes in non‐model eukaryotes. This software package addresses the fragmentation and related assembly issues that r...

Full description

Saved in:
Bibliographic Details
Published in:Molecular ecology resources Vol. 20; no. 2; pp. 591 - 604
Main Authors: Hart, Alexander J., Ginzburg, Samuel, Xu, Muyang (Sam), Fisher, Cera R., Rahmatpour, Nasim, Mitton, Jeffry B., Paul, Robin, Wegrzyn, Jill L.
Format: Journal Article
Language:English
Published: England Wiley Subscription Services, Inc 01-03-2020
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:EnTAP (Eukaryotic Non‐Model Transcriptome Annotation Pipeline) was designed to improve the accuracy, speed, and flexibility of functional gene annotation for de novo assembled transcriptomes in non‐model eukaryotes. This software package addresses the fragmentation and related assembly issues that result in inflated transcript estimates and poor annotation rates of protein‐coding transcripts. Following filters applied through assessment of true expression and frame selection, open‐source tools are leveraged to functionally annotate the reduced set of translated proteins. Downstream features include fast similarity search across five repositories, protein domain assignment, orthologous gene family assessment, and Gene Ontology (GO) term assignment. The final annotation integrates across multiple databases and selects an optimal assignment from a combination of weighted metrics describing similarity search score, taxonomic relationship, and informativeness. Researchers have the option to include additional filters to identify and remove contaminants, identify associated pathways, and prepare the transcripts for enrichment analysis. This fully featured pipeline is easy to install, configure, and runs significantly faster than comparable annotation packages. EnTAP is optimized to generate extensive functional information for the gene space of organisms with limited or poorly characterized genomic resources.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Undefined-1
ObjectType-Feature-3
content type line 23
ISSN:1755-098X
1755-0998
DOI:10.1111/1755-0998.13106