Structural variant-based pangenome construction has low sensitivity to variability of haplotype-resolved bovine assemblies


Bibliographic Details
Published in: Nature Communications Vol. 13; no. 1; pp. 3012 - 13
Main Authors: Leonard, Alexander S., Crysnanto, Danang, Fang, Zih-Hua, Heaton, Michael P., Vander Ley, Brian L., Herrera, Carolina, Bollwein, Heinrich, Bickhart, Derek M., Kuhn, Kristen L., Smith, Timothy P. L., Rosen, Benjamin D., Pausch, Hubert
Format: Journal Article
Language: English
Published: London: Nature Publishing Group UK, 31-05-2022
Nature Publishing Group
Nature Portfolio
Description
Summary: Advantages of pangenomes over linear reference assemblies for genome research have recently been established. However, potential effects of sequence platform and assembly approach, or of combining assemblies created by different approaches, on pangenome construction have not been investigated. Here we generate haplotype-resolved assemblies from the offspring of three bovine trios representing increasing levels of heterozygosity that each demonstrate a substantial improvement in contiguity, completeness, and accuracy over the current Bos taurus reference genome. Diploid coverage as low as 20x for HiFi or 60x for ONT is sufficient to produce two haplotype-resolved assemblies meeting standards set by the Vertebrate Genomes Project. Structural variant-based pangenomes created from the haplotype-resolved assemblies demonstrate significant consensus regardless of sequence platform, assembler algorithm, or coverage. Inspecting pangenome topologies identifies 90 thousand structural variants including 931 overlapping with coding sequences; this approach reveals variants affecting QRICH2, PRDM9, HSPA1A, TAS2R46, and GC that have potential to affect phenotype.
Pangenomes have a number of advantages over linear reference assemblies. Here the authors use bovine haplotype-resolved assemblies to show that structural variant-based pangenomes are consistent regardless of sequence platform, assembler, or coverage, suggesting that rigid protocols may not be required.
ISSN: 2041-1723
DOI: 10.1038/s41467-022-30680-2