Viral dark matter and virus-host interactions resolved from publicly available microbial genomes
Abstract
The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus-host interactions precludes accurate prediction of their roles and impacts. Here we mined publicly available bacterial and archaeal genomic datasets to identify 12,498 high‑confidence viral genomes linked to their microbial hosts. These data augment public datasets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7-38% of 'unknown' sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 new viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and co‑infection prevalences, as well as evaluation of in silico virus-host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes.
Article and author information
Author details
Reviewing Editor
- Richard A Neher, Max Planck Institute for Developmental Biology, Germany
Version history
- Received: May 2, 2015
- Accepted: July 22, 2015
- Accepted Manuscript published: July 22, 2015 (version 1)
- Accepted Manuscript updated: August 4, 2015 (version 2)
- Version of Record published: August 12, 2015 (version 3)
Copyright
© 2015, Roux et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 11,461
- Page views
-
- 2,275
- Downloads
-
- 287
- Citations
Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Computational and Systems Biology
- Ecology
High proportions of gut bacteria that produce their own food can be an indicator for poor gut health.
-
- Ecology
- Epidemiology and Global Health
Previously unknown pathogens often emerge from primary ecosystems, but there is little knowledge on the mechanisms of emergence. Most studies analyzing the influence of land-use change on pathogen emergence focus on a single host-pathogen system and often observe contradictory effects. Here, we studied virus diversity and prevalence patterns in natural and disturbed ecosystems using a multi-host and multi-taxa approach. Mosquitoes sampled along a disturbance gradient in Côte d’Ivoire were tested by generic RT-PCR assays established for all major arbovirus and insect-specific virus taxa including novel viruses previously discovered in these samples based on cell culture isolates enabling an unbiased and comprehensive approach. The taxonomic composition of detected viruses was characterized and viral infection rates according to habitat and host were analyzed. We detected 331 viral sequences pertaining to 34 novel and 15 previously identified viruses of the families Flavi-, Rhabdo-, Reo-, Toga-, Mesoni- and Iflaviridae and the order Bunyavirales. Highest host and virus diversity was observed in pristine and intermediately disturbed habitats. The majority of the 49 viruses was detected with low prevalence. However, nine viruses were found frequently across different habitats of which five viruses increased in prevalence towards disturbed habitats, in congruence with the dilution effect hypothesis. These viruses were mainly associated with one specific mosquito species (Culex nebulosus), that increased in relative abundance from pristine (3%) to disturbed habitats (38%). Interestingly, the observed increased prevalence of these five viruses in disturbed habitats was not caused by higher host infection rates but by increased host abundance, an effect tentatively named abundance effect. Our data show that host species composition is critical for virus abundance. Environmental changes that lead to an uneven host community composition and to more individuals of a single species is a key driver of virus emergence.