Phylogenomic and population genetics analyses of extant tomato yellow leaf curl virus strains on a global scale

Tomato yellow leaf curl virus (TYLCV) is a monopartite DNA virus with a genome size of ~ 2,800 base pairs. The virus belongs to the genus Begomovirus within the family Geminiviridae . Extant TYLCV strains are differentiated based on an established threshold of 94% genome-wide pairwise nucleotide ide...

Full description

Saved in:
Bibliographic Details
Published in:Frontiers in virology (online) Vol. 3
Main Authors: Marchant, Wendy G., Mugerwa, Habibu, Gautam, Saurabh, Al-Aqeel, Hamed, Polston, Jane E., Rennberger, Gabriel, Smith, Hugh, Turechek, Bill, Adkins, Scott, Brown, Judith K., Srinivasan, Rajagopalbabu
Format: Journal Article
Language:English
Published: Frontiers Media S.A 25-07-2023
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Tomato yellow leaf curl virus (TYLCV) is a monopartite DNA virus with a genome size of ~ 2,800 base pairs. The virus belongs to the genus Begomovirus within the family Geminiviridae . Extant TYLCV strains are differentiated based on an established threshold of 94% genome-wide pairwise nucleotide identity. The phylogenetic relationships, diversification mechanisms, including recombination, and extent of spread within and from the center of origin for TYLCV have been reported in previous studies. However, the evolutionary relationships among strains, strains’ distribution and genomic diversification, and genetic mechanisms shaping TYLCV strains’ evolution have not been re-evaluated to consider globally representative genome sequences in publicly available sequence database, including herein newly sequenced genomes from the U.S. and Middle East, respectively. In this study, full-length genome sequences for the extant strains and isolates of TYLCV (n=818) were downloaded from the GenBank database. All previously published genome sequences, and newly sequenced TYLCV genomes of TYLCV isolates from Kuwait and USA, determined herein (n=834), were subjected to recombination analysis. To remove the ‘phylogenetic noise’ imparted by interspecific recombination, the recombinant genomes were removed from the data set, and the remaining non-recombinant genome sequences (n=423) were subjected to population genetics and Bayesian analyses. Results of the phylogeographical analysis indicated that the type strain, TYLCV-Israel, and TYLCV-Mild strain, were globally distributed, spanning Africa, America, Asia, Australia/Oceania, Europe, and New Caledonia, while the other TYLCV strains were prevalent only throughout the Middle East. The results of Bayesian evolutionary (ancestral) analysis predicted that TYLCV-Israel represents the oldest, most recent common ancestor (MRCA) (41,795 years), followed by TYLCV-Mild at 39,808 years. These were closely followed by two Iranian strains viz., TYLCV-Kerman and TYLCV-Iran at 37,529 and 36,420 years, respectively. In contrast, the most recently evolving strains were TYLCV-Kuwait and TYLCV-Kahnooj at 12,445 and 298 years, respectively. Results of the neutrality test indicated that TYLCV-Israel and TYLCV-Mild populations are undergoing purifying selection and/or population expansion, although statistically significant selection was documented for only TYLCV-Israel, based on positive selection acting on five codons.
ISSN:2673-818X
2673-818X
DOI:10.3389/fviro.2023.1221156