Viral dark matter and virus-host interactions resolved from publicly available microbial genomes

  1. Simon Roux
  2. Steven J Hallam
  3. Tanja Woyke
  4. Matthew B Sullivan  Is a corresponding author
  1. The Ohio State University, United States
  2. University of British Columbia, Canada
  3. U.S Department of Energy Joint Genome Institute, United States

Abstract

The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus-host interactions precludes accurate prediction of their roles and impacts. Here we mined publicly available bacterial and archaeal genomic datasets to identify 12,498 high‑confidence viral genomes linked to their microbial hosts. These data augment public datasets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7-38% of 'unknown' sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 new viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and co‑infection prevalences, as well as evaluation of in silico virus-host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes.

Article and author information

Author details

  1. Simon Roux

    Department of Microbiology, The Ohio State University, Columbus, United States
    Competing interests
    The authors declare that no competing interests exist.
  2. Steven J Hallam

    Department of Microbiology and Immunology, University of British Columbia, Vancouver, Canada
    Competing interests
    The authors declare that no competing interests exist.
  3. Tanja Woyke

    U.S Department of Energy Joint Genome Institute, Walnut Creek, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Matthew B Sullivan

    Department of Microbiology, The Ohio State University, Columbus, United States
    For correspondence
    mbsulli@email.arizona.edu
    Competing interests
    The authors declare that no competing interests exist.

Copyright

© 2015, Roux et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 11,627
    views
  • 2,351
    downloads
  • 397
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Simon Roux
  2. Steven J Hallam
  3. Tanja Woyke
  4. Matthew B Sullivan
(2015)
Viral dark matter and virus-host interactions resolved from publicly available microbial genomes
eLife 4:e08490.
https://doi.org/10.7554/eLife.08490

Share this article

https://doi.org/10.7554/eLife.08490

Further reading

    1. Ecology
    Mercury Shitindo
    Insight

    Tracking wild pigs with GPS devices reveals how their social interactions could influence the spread of disease, offering new strategies for protecting agriculture, wildlife, and human health.

    1. Ecology
    2. Neuroscience
    Ralph E Peterson, Aman Choudhri ... Dan H Sanes
    Research Article

    In nature, animal vocalizations can provide crucial information about identity, including kinship and hierarchy. However, lab-based vocal behavior is typically studied during brief interactions between animals with no prior social relationship, and under environmental conditions with limited ethological relevance. Here, we address this gap by establishing long-term acoustic recordings from Mongolian gerbil families, a core social group that uses an array of sonic and ultrasonic vocalizations. Three separate gerbil families were transferred to an enlarged environment and continuous 20-day audio recordings were obtained. Using a variational autoencoder (VAE) to quantify 583,237 vocalizations, we show that gerbils exhibit a more elaborate vocal repertoire than has been previously reported and that vocal repertoire usage differs significantly by family. By performing gaussian mixture model clustering on the VAE latent space, we show that families preferentially use characteristic sets of vocal clusters and that these usage preferences remain stable over weeks. Furthermore, gerbils displayed family-specific transitions between vocal clusters. Since gerbils live naturally as extended families in complex underground burrows that are adjacent to other families, these results suggest the presence of a vocal dialect which could be exploited by animals to represent kinship. These findings position the Mongolian gerbil as a compelling animal model to study the neural basis of vocal communication and demonstrates the potential for using unsupervised machine learning with uninterrupted acoustic recordings to gain insights into naturalistic animal behavior.