Ecology and evolution of viruses infecting uncultivated SUP05 bacteria as revealed by single-cell- and meta-genomics

Version of Record: September 16, 2014
Accepted Manuscript: August 29, 2014

Download
Cite
Share
CommentOpen annotations (there are currently 0 annotations on this page).

Altmetric provides a collated score for online attention across various platforms and media.
See more details

1. Related to
Viruses: Gene swapping in the dead zone

Jillian Petersen, Nicole Dubilier

Insight Oct 13, 2014

Abstract
eLife digest
Introduction
Results and discussion
Materials and methods
References
Article and author information
Metrics

Abstract

Viruses modulate microbial communities and alter ecosystem functions. However, due to cultivation bottlenecks, specific virus–host interaction dynamics remain cryptic. In this study, we examined 127 single-cell amplified genomes (SAGs) from uncultivated SUP05 bacteria isolated from a model marine oxygen minimum zone (OMZ) to identify 69 viral contigs representing five new genera within dsDNA Caudovirales and ssDNA Microviridae. Infection frequencies suggest that ∼1/3 of SUP05 bacteria is viral-infected, with higher infection frequency where oxygen-deficiency was most severe. Observed Microviridae clonality suggests recovery of bloom-terminating viruses, while systematic co-infection between dsDNA and ssDNA viruses posits previously unrecognized cooperation modes. Analyses of 186 microbial and viral metagenomes revealed that SUP05 viruses persisted for years, but remained endemic to the OMZ. Finally, identification of virus-encoded dissimilatory sulfite reductase suggests SUP05 viruses reprogram their host's energy metabolism. Together, these results demonstrate closely coupled SUP05 virus–host co-evolutionary dynamics with the potential to modulate biogeochemical cycling in climate-critical and expanding OMZs.

https://doi.org/10.7554/eLife.03125.001

eLife digest

Microorganisms help to drive a number of processes that recycle energy and nutrients, including elements such as carbon, nitrogen, and sulfur, around the Earth's ecosystems. Viruses that infect microbes can also affect these cycles by killing and breaking open microbial cells, or by reprogramming the cell's metabolism. However, as there are many different species of microbes and viruses —the vast majority of which cannot easily be grown in the laboratory— little is known about most virus–host interactions in natural ecosystems, especially in the oceans.

In the world's oceans, the concentration of oxygen dissolved in the water changes in different regions and at different depths. ‘Oxygen minimum zones’ occur globally throughout the oceans at depths of 200–1000 meters, and climate change is causing these zones to expand and intensify. Although a lack of oxygen is sometimes considered detrimental to living organisms, oxygen minimum zones appear to be rich with microbial life that is adapted to thrive under oxygen-starved conditions.

Sulfur-oxidizing bacteria are one of the most abundant groups of microbes in these oxygen minimum zones, and several of these bacteria are known to influence the recycling of chemical substances. Now, Roux et al. introduce a new method to identify viruses that infect the microbes in this environment, including those microbes that cannot be grown in the laboratory and which have previously remained largely unexplored.

The genomes of 127 individual bacterial cells —collected from an oxygen minimum zone in western Canada— were examined. Roux et al. estimate that about a third of the sulfur-oxidizing bacterial cells are infected by at least one virus, but often multiple viruses infected the same bacterium. Five new genera (groups of one or more species) of viruses were also discovered and found to infect these bacteria. Looking for these new viral sequences in the DNA of this oxygen minimum zone's microbial community revealed that these newly discovered viruses persist in this region over several years. It also revealed that these viruses appear to only be found within the oxygen minimum zone. Roux et al. uncovered that these viruses carry genes that could manipulate how an infected bacterium processes sulfur-containing compounds; this is similar to previous observations showing that other viruses also influence cellular process (such as photosynthesis) in infected bacteria. As such, these newly discovered viruses might also influence the recycling of chemical elements within oxygen minimum zones.

Together, Roux et al.'s findings provide an unprecedented look into a wild virus community using a method that can be generalized to uncover viruses in a data type that is quickly becoming more widespread: single cell genomes. This effort to understand virus–host interactions by looking in the genomes of individual cells now sets the stage for future efforts aimed to uncover the impact of viruses on bacteria in other environments across the globe.

https://doi.org/10.7554/eLife.03125.002

Introduction

Microbial communities are critical drivers of nutrient and energy conversion process in natural and engineered ecosystems (Falkowski et al., 2008). In the last two decades, it has progressively become clear that viral-mediated predation, gene transfer, and metabolic reprogramming modulate the structure, function, and evolutionary trajectory of these microbial communities (Suttle, 2007; Abedon, 2009; Rodriguez-Valera et al., 2009; Hurwitz et al., 2013). At the same time, the vast majority of microbes and viruses remain uncultivated and their diversity is extensive, so that model system-based measurements rarely reflect the network properties of natural microbial communities. While culture-independent methods, such as metagenomics and metatranscriptomics, can illuminate latent and expressed metabolic potential of microbial (Frias-Lopez et al., 2008; Venter et al., 2004; Stewart et al., 2012; DeLong et al., 2006) or viral communities (Angly et al., 2006; Hurwitz et al., 2013; Mizuno et al., 2013), interactions between community members remain difficult to resolve.

Clustered regularly interspaced short palindromic repeats (CRISPRs) containing short stretches of viral or plasmid DNA separated between repeat sequences can provide a record of past infections in uncultivated microbial communities. Together with associated Cas (CRISPR-associated) genes, CRISPRs function as an adaptive immune system in prokaryotes with the potential to suppress viral replication or horizontal gene transfer (Sorek et al., 2008). However, an application of CRISPR-based virus–host association to both uncultivated hosts and viruses require the assembly of complete or near-complete genomes of both entities, limiting their utility to lower diversity ecosystems (Andersson and Banfield, 2008; Anderson et al., 2011). Alternatively, single-cell amplified genome (SAG) sequencing is emerging as a more direct method to chart metabolic potential of individual cells within microbial communities with special emphasis on candidate phyla that have no cultured representatives (Yoon et al., 2011; Martinez-Garcia et al., 2012; Rinke et al., 2013; Swan et al., 2013). Here, we combine metagenomic and single-cell genomic sequencing to explore virus–host interactions within uncultivated bacteria inhabiting a marine oxygen minimum zone (OMZ).

Marine OMZs, defined by dissolved oxygen concentrations <20 μmol kg⁻¹, are oceanographic features that arise from elevated demand for respiratory oxygen in poorly ventilated, highly stratified waters. OMZs are crucial for biogeochemical cycles in the global ocean, as they represent hotspots for microbial-driven carbon, nitrogen, and sulfur transformations (Ulloa et al., 2012; Wright et al., 2012) and play a disproportionate role in nitrogen loss processes and greenhouse gas cycling (Lam et al., 2009; Ward et al., 2009). Moreover, these zones are expanding due to changing ocean water temperatures and circulation patterns (Stramma et al., 2008; Whitney et al., 2007). Given these changing physical and chemical conditions and the importance of OMZs to ocean-atmosphere functioning, a clearer understanding of biological responses is critical to develop a much-needed predictive modeling capacity for OMZs.

In OMZs, microbial communities drive matter and energy transformations and are typically dominated by sulfur-oxidizing Gammaproteobacteria related to the chemoautotrophic gill symbionts of deep-sea clams and mussels (Stewart et al., 2012; Wright et al., 2012). Phylogenetic analysis indicates that these bacteria are comprised of two primary lineages; one consisting of sequences affiliated with SUP05 and clam and mussel symbionts, and the other consisting of sequences affiliated with Arctic96BD-19 (Walsh et al., 2009; Wright et al., 2012). Both groups partition along gradients of oxygen and sulfide, with Arctic96BD-19 most prevalent in oxygenated waters and SUP05 most prevalent in anoxic or anoxic/sulfidic waters (Wright et al., 2012). Niche partitioning between SUP05 and Arctic96BD-19 is driven by complementary modes of carbon and energy metabolism that harness alternative terminal electron acceptors. While both Arctic96BD-19 and SUP05 use reduced sulfur compounds as electron donors to drive inorganic carbon fixation, SUP05 manifests a more versatile energy metabolism linking carbon, nitrogen, and sulfur cycling within OMZ and hydrothermal vent waters (Canfield et al., 2010; Zaikova et al., 2010; Swan et al., 2011; Stewart et al., 2012; Anantharaman et al., 2013; Mattes et al., 2013; Anantharaman et al., 2014; Hawley et al., 2014).

Ocean viruses, predominantly investigated in the sunlit or photic zone, are abundant, dynamic, and diverse (Suttle, 2005) with growing evidence for direct roles in metabolic reprogramming of microbial photosynthesis, central carbon metabolism, and sulfur cycling (Mann et al., 2003; Lindell et al., 2005; Clokie et al., 2006; Breitbart et al., 2007; Dammeyer et al., 2008; Sharon et al., 2009, 2011; Thompson et al., 2011; Hurwitz et al., 2013). Preliminary studies suggest that similar patterns are emerging in OMZ waters. In the Eastern Tropical South Pacific, a metagenomic survey revealed specific viral populations endemic to OMZ waters (Cassman et al., 2012). Consistent with most viral metagenome surveys, approximately 3% of sequences were affiliated with functionally annotated genes in public databases. From a nitrogen and sulfur cycling perspective, viromes from the oxycline contained genes encoding components of nitric oxide synthase, nitrate and nitrite ammonification, and ammonia assimilation pathways as well as inorganic sulfur assimilation (Cassman et al., 2012). In anoxic waters, viromes contained genes encoding components of denitrification, nitrate and nitrite ammonification, and ammonia assimilation pathways as well as sulfate reduction, thioredoxin-disulfide reductase, and inorganic sulfur assimilation (Cassman et al., 2012). More recently, metagenomic analyses of hydrothermal vent plume microbial communities dominated by SUP05 bacteria-enabled phage genome assemblies presumed to infect SUP05 (Anantharaman et al., 2014). Consistent with viruses encoding auxiliary metabolic genes (AMGs, Breitbart et al., 2007) enabling viral reprogramming of microbial metabolic pathways (Lindell et al., 2005; Thompson et al., 2011), putative SUP05 phage contained genes encoding reverse dissimilatory sulfite reductase A and C positing a role for viruses in modulating the marine sulfur cycle (Anantharaman et al., 2014).

Given that SUP05 and Arctic96BD-19 play key roles in OMZ ecology and biogeochemistry, we designed an approach to target SUP05-associated viruses in a model OMZ ecosystem, Saanich Inlet a seasonally anoxic fjord on the coast of Vancouver Island, British Columbia, Canada. We obtained a SUP05 single-cell genomic data set spanning defined redox gradients in the Saanich Inlet water column, identified SUP05-associated viruses infecting SAGs, and used resulting virus–host pairs as recruitment platforms to estimate viral diversity, activity, dispersion, and potential impact on SUP05 population dynamics and metabolic capacity. The resulting data sets open an unprecedented window on uncultivated virus–host dynamics in OMZs and provide an analytical approach extensible to other natural or engineered ecosystems.

Results and discussion

Generating a SUP05 bacterial genomic data set

SUP05 SAGs were generated at the Bigelow Laboratory for Ocean Sciences (http://scgc.bigelow.org, [Stepanauskas and Sieracki, 2007; Swan et al., 2013]). Briefly, fluorescence-activated cell sorting was used to separate individual cells <10 µm in diameter from 100, 150, and 185 meters water depth, spanning water column gradients of oxygen and sulfide in Saanich Inlet (Figure 1—figure supplement 1). Water column redox conditions were typical for stratified summer months when SUP05 populations bloom in deep basin waters. A total of 315 anonymously sorted cells (discriminated solely using fluorescence and size for sorting) per depth interval were subjected to multiple displacement amplification (MDA), and the taxonomic identity of single amplified genomes (SAGs) was determined by directly sequencing bacterial small subunit ribosomal RNA (SSU rRNA) gene amplicons. SAGs affiliated with SUP05 (n = 127) and Arctic96BD-19 (n = 9) populations were subsequently whole genome shotgun sequenced on the Illumina HiSeq platform. Most (113/127) SUP05 SAGs fell into two major operational taxonomic units (OTUs) or subclades, based on SSU rRNA gene sequence clustering at the 97% identity threshold—SUP05_01 (n = 65) and SUP05_03 (n = 48) (Figure 1—figure supplement 2). SUP05_01 SAGs were recovered at 100, 150, and 185 meters, peaking at 150 meters, while SUP05_03 SAGs were more evenly distributed between 150 and 185 meters. A number of SUP05 SAG assemblies contained viral contigs consistent with sampling infected cells across the redoxcline.

New SUP05-associated phage genomes

50 bona fide viral contigs (Supplementary file 1, ‘Materials and methods’) were identified in 30 SUP05 SAGs using viral marker genes, hereafter termed ‘hallmark genes’ (Abrescia et al., 2012). SUP05 viral contigs were affiliated with known families of Caudovirales (dsDNA) and Microviridae (ssDNA) bacteriophages. The presence of Caudovirales is not surprising as they are commonly observed in oceanic samples (Williamson et al., 2012; Hurwitz and Sullivan, 2013), including the ETSP OMZ and SUP05-dominated hydrothermal vent plumes (Cassman et al., 2012; Anantharaman et al., 2014). Microviridae, however, are usually observed in surface seawater or deep-sea sediments and have not been previously associated with OMZs (Angly et al., 2009; Tucker et al., 2011; Yoshida et al., 2013; Labonté and Suttle, 2013b). Given the SUP05 lineages described above, we note that viral contigs recovered from SUP05_01 SAGs were exclusively Caudovirales, whereas SUP05_03 SAGs contained both Caudovirales and Microviridae. Using non-reference-based methods, an additional 19 contigs were identified as putative viral sequences. These sequences did not encode hallmark genes, but displayed genomic characteristics consistent with novel viral genomes including a low ratio of characterized genes (i.e., most genes predicted on these contigs do not match any sequences from the reference databases), a high number of short genes, and a low number of strand changes between two consecutive genes (i.e., gene sets tend to be coded on the same strand; ‘Materials and methods’, Figure 1—figure supplement 3). In total, 69 viral contigs encoding 898 predicted open reading frames over 529 kb were recovered from SUP05 SAGs representing current viral infections.

Viral infection of SUP05 cells in nature

Forty-two out of 127 SUP05 SAGs sequenced contained one or more viral contigs (Figure 1—source data 1), indicating that ∼1/3 of SUP05 cells inhabiting the Saanich Inlet water column were infected by viruses. Such lineage-specific infection frequency determination is unprecedented in uncultivated or cultivated host cells and is largely consistent with community-averaged estimates for marine bacteria (Suttle, 2007). As with all the other means to estimate infection frequency and viral-induced microbial mortality (Brum et al., 2014), there are caveats to these numbers including underestimation linked to incomplete identification of viruses in the SAG data sets. Such an underestimation could result from (i) lack of reference genomes, (ii) incomplete SAG genomes, (iii) early infections not being detected prior to genome insertion and replication, or (iv) late infections not being detected due to phage-directed degradation of host DNA preventing 16S identification during the SAG selection process. Since the infection frequency estimates are largely consistent with community-based measurements, we expect that these biases are small.

SUP05 viral infections showed strong depth partitioning along defined gradients of oxygen and sulfide (Figure 1). At 100 meters a single SUP05 SAG (of 12) displayed current viral infection, while the percentage of infected SUP05 SAGs increased to 28% and 47% at 150 and 185 meters (Figure 1—source data 1). Consistent with previous studies evaluating community-averaged lytic viral activity (Weinbauer et al., 2003), cell-specific lytic viral infection estimates peaked where SUP05 is typically most abundant and metabolically active in the Saanich Inlet water column (Hawley et al., 2014). Additionally, remnants of past infections were detected in SUP05 and Arctic96BD-19 SAGs, including 13 putative prophages and 25 CRISPR sequences (Supplementary file 2). None of these ‘past infection’ sequences match the detected ‘current infection’ viral contigs.

Figure 1 with 3 supplements see all

Download asset Open asset

Saanich Inlet water column characteristics and SUP05 infection frequency on the SAG sampling date (August 2011).

Key abiotic measurements are represented as background coloring (oxygen levels) and black lined graphs at left (hydrogen sulfide and temperature). SUP05 viral infections determined from 127 SAGs are indicated at right by black slices in pie charts where current infections were delineated from intact viral contigs and past infections were inferred from identification of defective prophages and CRISPR loci.

https://doi.org/10.7554/eLife.03125.003

Figure 1—source data 1 Number of SUP05 viral sequences detected at the three different depths sampled. For each depth, the count of SAG where viral sequence were detected (‘infected’ SAG) is indicated, alongside the number of SAGs for which two different viruses were retrieved, the number of SAGs with CRISPR spacer detected and the number of SAGs with a defective prophage identified.: https://doi.org/10.7554/eLife.03125.004
Download elife-03125-fig1-data1-v2.xls

Patterns of co-infection between SUP05 ssDNA and dsDNA viruses

To better understand the ecological and evolutionary forces shaping SUP05 virus–host interactions in Saanich Inlet, we focused on 12 viral reference contigs including 4 Caudovirales contigs longer than 15 kb (from 3 Podoviridae and 1 Siphoviridae) and 8 complete genomes of Microviridae. Genome organization (Figure 2) and phylogenetic analysis (Figure 2—figure supplement 1) revealed that all four Caudovirales contigs represent new genera (share <40% of their genes, Lavigne et al., 2008, Figure 2—source data 1) even when considering the viruses recently assembled from SUP05-dominated microbial metagenomes (Anantharaman et al., 2014). All 8 Microviridae contigs shared 100% nucleotide identity, despite their recovery from different SUP05_03 SAGs (Supplementary file 3), and represent a new genus within the subfamily Gokushovirinae (Figure 2—figure supplements 2 and 3). These identical Microviridae genomes could represent a lineage-specific viral bloom, targeting the SUP05_03 subclade. SUP05 infection by Gokushovirinae extends the known host range from small parasitic bacteria (namely Chlamydia, Bdellovibrio and Spiroplasma) to include free-living Gammaproteobacteria, the first marine host identified for this subfamily of viruses (Labonté and Suttle, 2013a).

Figure 2 with 3 supplements see all

Download asset Open asset

Genetic map and synteny plots for the four references SUP05 *Caudovirales* contigs M8F6_0 (A), C22_13 (B), K04_0 (C) and G10_6 (D) (highlighted in bold).

Viral hallmark genes are underlined and identified on plots (MCP: major capsid protein, Sc: scaffolding protein, H-T conn.: head-tail connector). Sequence similarities were deduced from a tBLASTx comparison. For clarity sake, several sequences including SUP05 viral contig M8F6_0, K04_0, and G10_6 are reverse-complemented (noted RC).

https://doi.org/10.7554/eLife.03125.008

Figure 2—source data 1 Summary of best BLAST hit affiliation for the predicted genes of the five SUP05 reference viral contigs. For each contig, taxonomic and functional affiliation are indicated with the group or category and the number of genes affiliated to this group. The category ‘virion formation’ includes all genes associated to the formation of the capsid and the genome encapsidation.: https://doi.org/10.7554/eLife.03125.009
Download elife-03125-fig2-data1-v2.xls

Curiously, most (11 of 12) Microviridae-infected SUP05_03 SAGs also contained Podoviridae contigs (Supplementary file 4). While previously postulated based on comparative genomics, lineage-specific co-infection between the ssDNA Microviridae and dsDNA phages has not been observed (Roux et al., 2012). Such highly correlated co-occurrence in SUP05 SAGs (Fisher exact test p-value = 2e⁻¹⁵) is consistent with non-random co-infection. This could be linked to cooperative infection modes between viruses or opportunistic infection of cells already infected by the other virus type, as seen in the case of satellite viruses and virophages (Murant and Mayo, 1982; La Scola et al., 2008). It is worth noting that the exact nature of interaction between satellite and helper viruses, or between virophages and their associated viruses, is still a matter of debate, and this association between two phages previously thought to be autonomous and independent (Microviridae and Caudovirales) presents a new variation on this theme (Desnues and Raoult, 2012; Krupovic and Cvirkaite-Krupovic, 2012; Fischer, 2012). Because the modular theory of phage evolution postulates that phage genomes consist of collections of gene modules, exchanged through proximity-enhanced recombination (Hendrix et al., 2000) such co-infection of a single host by ssDNA and dsDNA phages provides evidence for how such chimeric ssDNA–dsDNA viral genomes may come into existence (Diemer and Stedman, 2012; Roux et al., 2013).

SUP05 viruses endemic to Saanich Inlet are stable over time

To extend our analysis of SUP05 virus–host interactions beyond individual SAGs, we used the 12 reference viral contigs (i.e., the 4 Caudovirales and 8 Microviridae) as platforms to recruit 3 years of Saanich Inlet microbial metagenome sequences spanning the redoxcline (Figure 3, Supplementary file 5). SUP05 Microviridae contigs were inconsistently detected due to known methodological biases associated with linker-amplified metagenome library construction (‘Materials and methods’), so we focused on dsDNA viral contigs. All 4 SUP05 Caudovirales contigs were absent from surface waters, but repeatedly detected within and below the oxycline, consistent with SUP05 water column disposition (Figure 3A—figure supplement 1). Within the Caudovirales, recruited microbial metagenome sequences were more similar to the reference genome for Podoviridae contigs C22_13 and K04_0 (96% average amino-acid identity), than for Siphoviridae G10_6 and Podoviridae M8F6_0 (92% average amino-acid identity, Figure 2B). Beyond sequence variation, metagenome coverage in one region of M8F6_0 (3 hypothetical open reading frames) was absent in 2009, minimal in 2010, and as abundant as surrounding genomic regions in 2011 (Figure 3C), suggesting a selective sweep within this population. Contig-derived abundances of SUP05-Caudovirales were in sync with host distributions, but at virus-to-host ratios of 0.01 to 0.3 (Figure 4). While tightly choreographed virus–host abundance dynamics parallels that of cultured virus–host systems (e.g., cyanophages—[Waterbury and Valois, 1993]), the systematically lower (orders of magnitude lower than typical community measurements) virus-to-host ratios observed here indicates that a greater diversity of SUP05 viruses remains to be uncovered in the Saanich Inlet water column.

Figure 3 with 3 supplements see all

Download asset Open asset

Spatiotemporal dynamics of SUP05 viral reference genomes in Saanich Inlet.

(A) SUP05 viral presence in Saanich Inlet microbial metagenomes with OMZ sample names bolded. Four categories indicate the SUP05 virus was detected (>75% of viral genes detected at >80% amino-acid identity; light blue), a SUP05 viral relative was detected (>75% of viral genes detected at 60–80% amino-acid identity; light green), no SUP05 virus was detected (red) or detection was inconclusive (e.g., *Microviridae* in HiSeq Illumina data sets that strongly select against ssDNA sequences; gray). (B) SUP05 viral reference genomes had differing sequence conservation among recruited metagenomic reads. Upper and lower ‘hinges’ correspond to the first and third quartiles (the 25th and 75th percentiles), while outliers are displayed as points (values beyond 1.5 * Inter-Quartile Range of the hinge). (C) One SUP05 viral reference genome with low sequence conservation revealed evolution in action whereby a genomic region (see ∼21–30 kb) appears to sweep through the population.

https://doi.org/10.7554/eLife.03125.013

Figure 4

Download asset Open asset

Uncultivated SUP05 lineage-specific virus–host ecology.

Fragment recruitment from Saanich Inlet microbial metagenomes to microbial (95% nucleotide identity) and viral (100% amino-acid identity) reference contigs normalized by contig and metagenome size was used as a proxy for abundance. Hence, the relative abundance of microbial and viral genome is indicated as number of metagenomic bases recruited by contig(s) base pairs (bp) by megabase (Mb) of metagenome. Upper and lower ‘hinges’ of the relative abundance distribution correspond to the first and third quartiles (the 25th and 75th percentiles), while outliers are displayed as points (values beyond 1.5 * Inter-Quartile Range of the hinge). A virus-to-host ratio was then calculated for each SAG (i.e., each virus-host pair) as the ratio of relative abundance of viral contigs to the relative abundance of microbial contigs from the same SAG.

https://doi.org/10.7554/eLife.03125.017

To determine SUP05 viral biogeography, we interrogated 74 viromes and 112 microbial metagenomes sourced from Pacific Ocean waters (Supplementary file 5). Despite consistently recovering SUP05 viral sequences in Saanich Inlet, these sequences were extremely uncommon in other locales (22 instances out of 803 possibilities; Figure 3—figure supplements 2 and 3), even when proximal to Saanich Inlet (e.g., northeastern subarctic Pacific [NESAP] coastal and open ocean waters along the LineP transect) or when sourced from similar water column conditions (e.g., Eastern Tropical South Pacific OMZ, ETSP). Of the 22 SUP05-related viruses detected, all but two were recovered below 500 meters in NESAP OMZ samples, in which SUP05 bacteria were also detected with similar abundance as in Saanich Inlet samples. The remaining two detections derived from an ETSP OMZ virome and a hydrothermal vent plume microbial metagenome from the Guaymas basin. Taken together, these observations point to endemic SUP05 viral populations with the potential to modulate SUP05-mediated biogeochemical cycling via lysis or metabolic reprogramming.

Potential impact of SUP05 phages on sulfur metabolism

Recent studies have highlighted the role of viruses in metabolic reprogramming, from global photosynthesis (Mann et al., 2003; Lindell et al., 2005; Clokie et al., 2006; Sullivan et al., 2006; Sharon et al., 2009) to central carbon metabolism (Sharon et al., 2011; Thompson et al., 2011; Hurwitz et al., 2013) via auxiliary metabolism genes (AMGs). Additionally, viruses assembled from microbial metagenomes from SUP05 dominated hydrothermal vent samples contain sulfur cycling genes (Anantharaman et al., 2014). Therefore, we looked for AMGs encoded on SUP05 viral contigs in the Saanich Inlet water column.

Four putative AMGs were detected in 12 of the 69 viral contigs, predominantly from SUP05_01 SAGs recovered from 150 meters (Supplementary file 6). One AMG identified on a bona fide viral contig, phosphate-related phoH, is common among marine phages, but remains functionally uncharacterized (Sullivan et al., 2010; Goldsmith et al., 2011). The remaining 3 AMGs including 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily (2OG-FeII oxygenase), tripartite tricarboxylate transporter (tctA, protein domain hit only), and dissimilatory sulfite reductase subunit C (dsrC) were encoded on contigs identified by non-reference-based methods. In marine cyanophages, 2OG-FeII oxygenase-encoding genes are common where they are thought to modulate host nitrogen metabolism during infection (Sullivan et al., 2010). However, the precise metabolic role of tctA and dsrC-like genes during viral infection remains unknown.

Given that dsrC was found on 7 SUP05_01 viral contigs (Supplementary file 7) and DsrC is critical in SUP05 energy metabolism (Walsh et al., 2009), we focused on this gene. Although dsrC genes were only present on contigs identified by non-reference-based methods they were closely related to dsrC-like genes encoded on the hydrothermal vent plume phages (Anantharaman et al., 2014). Indeed, conceptually translated sequence alignment of these viral dsrC genes including putative viral and bacterial genes from microbial metagenomic data sets indicate that the Saanich Inlet 'viral' sequences belong to one dsrC subgroup (dsrC_1 according to the classification of Anantharaman et al., 2014). In addition to high sequence similarity viral dsrC genes from SUP05 SAGs co-localized on contigs with viral homologs (e.g., 2OG-FeII oxygenase, chaperonin), and occurred in genomic context that was completely different to the conserved and well-characterized dsrC region in SUP05 genomes (Figure 5A,B).

Figure 5 with 2 supplements see all

Download asset Open asset

Maps of DsrC-containing contigs.

(A) Seven contigs including *dsrC*-like gene detected as viral based on non-reference metrics (ratio of uncharacterized genes, strand coding bias). (B) Genomic context in which *dsrC*-like genes are retrieved in SUP05 microbial contigs from SAG. All contigs above 50 kb containing a *dsrC*-like gene were selected and compared to get a summary of the different regions in which *dsrC*-like genes are found in SUP05 genomes. (C) Map of *dsrC*-containing Contigs assembled from Saanich Inlet metagenomes. One viral-like contig from SAG (020_11) is included for comparison.

https://doi.org/10.7554/eLife.03125.018

The dsrC_1 group encodes a protein retaining 15 conserved residues across known DsrC subunits. However, the second C-terminal cysteine and a 7–8 residue insertion thought to be required for DsrC function based on structural analysis of Desulfovibrio vulgaris and Archaeoglobus fulgidus proteins are missing from the viral protein (Figure 5—figure supplement 1; Mander et al., 2005; Oliveira et al., 2008). These differences suggest that either the viral encoded dsrC is non-functional or has a modified function. Given that genes shared between different viral genomes rarely represent nonfunctional genes, it is likely that viral-encoded dsrC plays a biological role in SUP05. Indeed, there is precedent for divergent viral AMGs serving as modified functional counterparts to host-encoded homologues. Specifically, a highly divergent viral ‘pebA’ (Sullivan et al., 2005) was experimentally demonstrated to perform the functions of two host enzymes' (pebA and pebB) as a bifunctional enzyme, phycoerythrobilin synthetase (pebS) (Dammeyer et al., 2008).

Given that viral dsrC genes were abundant in the Saanich Inlet water column over a 3-year-time interval (Figure 5C) with peaked recovery consistent with blooming SUP05 populations (Figure 5—figure supplement 2; Hawley et al., 2014), we posit that this viral gene is functional in SUP05 sulfur cycling. Future functional characterization of viral DsrC is needed to constrain viral roles in modulating SUP05 electron transfer reactions during viral infection in the environment.

Conclusion

While new methods and model systems for identifying virus–host interactions continue to emerge (Tadmor et al., 2011; Allers et al., 2013; Mizuno et al., 2013; Deng et al., 2014), viral ecology remains predominantly community focused in nature. This is because most hosts are uncultivated (Rappé and Giovannoni, 2003), and culture-independent viral metagenomes are dominated by ‘unknown’ sequences (Hurwitz and Sullivan, 2013), which inhibits developing a mechanism- and population-based viral ecology. Here, we use single-cell genomics to directly link SUP05 viruses and their hosts across defined gradients of oxygen and sulfide over a 3-year-time interval in a model OMZ ecosystem. This spatiotemporal resolution revealed endemic patterns of co-infection between ssDNA and dsDNA viruses and the occurrence of AMGs with the potential to modulate electron transfer reactions essential to SUP05 energy metabolism. Together, these findings offer novel perspectives on the ecology and evolution of viruses infecting uncultivated bacterial populations. While the capacity to formulate such linkages between cultured virus–host systems in nature is recognized (e.g., cyanophages and pelagiphages), the use of single-cell genomics to explore such linkages in uncultivated microbial communities represents a watershed moment in illuminating viral dark matter and its role in modulating microbial interaction networks in natural and engineered ecosystems.

Materials and methods

Sample collection, sequencing, and assembly

Request a detailed protocol

Samples were collected in Saanich Inlet on Vancouver Island, British Columbia, on the 09th of August 2011. Sample collection and biochemical measurements were performed as previously described (Zaikova et al., 2010). Water column redox conditions were typical for stratified summer months when SUP05 populations bloom in deep basin waters. Individual cells <10 µm in diameter from 100, 150, and 185 meter depth samples were subjected to fluorescence-activated cell sorting, multiple displacement amplification (MDA), and taxonomic identification at the Bigelow Laboratory Single Cell Genomics Center (SCGC; http://scgc.bigelow.org), following previously described procedures (Stepanauskas and Sieracki, 2007; Swan et al., 2013). A total of 315 single amplified genomes (SAGs) per sample were subjected to multiple displacement amplification (MDA), and the taxonomic identity of single amplified genomes (SAG) was determined by directly sequencing bacterial small subunit ribosomal RNA (SSU rRNA) gene amplicons. A total of 136 SAGs affiliated with SUP05 or Arctic96BD-19 were selected for genome sequencing. Between 1 and 3 µg of MDA product was sent to Canada's Michael Smith Genome Sciences Center (Vancouver, BC) to create shotgun libraries. Briefly, the DNA was sheared to 350–450 bp fragments using a Covaris E210 and purified using AMPure XP Beads according to the manufacturer's instructions. The sheared DNA was end-repaired and A-tailed according to the Illumina standard PE protocol and purified again using AMPure XP Beads, generating paired-end 100-bp reads. Indexed libraries were amplified by PCR for six cycles, gel-purified, pooled (11–12 samples per lane), and QC assessed on a Bioanalyzer DNA Series II High Sensitivity chip (Agilent, Santa Clara, CA, USA), and then sequenced using an Illumina HiSeq2000 sequencer.

All raw Illumina sequence data were passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun, Copeland, and Han, Unpublished). Artifact filtered sequence data were then screened and trimmed according to the k-mers present in the data set (Mingkun and Kmernorm, Unpublished). High-depth k-mers, presumably derived from MDA amplification bias, cause problems in the assembly, especially if the k-mer depth varies in orders of magnitude for different regions of the genome. Reads with high k-mer coverage (>30× average k–mer depth) were normalized to an average depth of 30×. Reads with an average k-mer depth of less than 2× were removed. Following steps were then performed for assembly: (i) normalized Illumina reads were assembled using IDBA–UD version 1.0.9 (Peng et al., 2012); (ii) 1–3 kb simulated paired end reads were created from IDBA–UD contigs using wgsim (https://github.com/lh3/wgsim); (iii) normalized Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r42328) (Gnerre et al., 2011); (iv) Parameters for assembly steps were: (i) IDBA–UD (––no local), (ii) wgsim (–e 0 –1 100 –2 100 –r 0 –R 0 –X 0), (iii) Allpaths–LG (PrepareAllpathsInputs: PHRED 64=1 PLOIDY=1 FRAG COVERAGE=125 JUMP COVERAGE=25 LONG JUMP COV=50, RunAllpathsLG: THREADS=8 RUN=std shredpairs TARGETS=standard VAPI WARN ONLY=True OVERWRITE=True MIN CONTIG=2000).

SAG taxonomic assignment

Request a detailed protocol

SAG taxonomy was verified using the assembled contigs in two ways using MetaPathways 1.0 (Konwar et al., 2013). First, the assemblies were blasted against the SILVA (v.111) database to confirm the taxonomy based on SSU rRNA. Next, MEGAN5 was used to carry out taxonomic binning of all ORFs from the MetaPathways BLAST output using the Lowest Common Ancestor (LCA) approach (Huson et al., 2007).

A total of 2711 SSU rRNA sequences previously taxonomically assigned to SUP05 and Arctic96BD-19 lineages were aligned and clustered using mothur v.1.27.0 (Schloss et al., 2009), and 20 representative sequences for the most abundant clusters (cutoff = 6) at 97% similarity were selected. These representative sequences were used to build the phylogenetic tree differentiating between SUP05 and Arctic96BD-19. Reference SUP05 and Arctic96BD-19 sequences from different environments and symbionts and cluster representative sequences were aligned using the SILVA aligner tool (http://www.arb-silva.de/aligner/) and imported into an in-house ARB database for SUP05. Aligned sequences were exported from ARB into Mesquite for manual alignment refinement. The final phylogenetic tree was inferred from manually refined Mesquite alignment of sequences using maximum likelihood implemented in PHYML using a GTR model with estimated values for the α parameter of the Γ distribution and the proportion of invariable sites. The confidence of each node was determined by assembling a consensus tree of 1000 bootstrap replicates.

Microbial and viral metagenomes

Request a detailed protocol

The protocols used to generate the POV (Hurwitz and Sullivan, 2013), ETSP OMZ viromes (Cassman et al., 2012), ETSP microbial metagenomes and metatranscriptomes (Stewart et al., 2012; Ganesh et al., 2014), and Guaymas basin metagenome (Anantharaman et al., 2013) are described in their respective publications. All these data sets were sequenced with Roche 454 GL FLX Titanium systems, and quality controlled reads were used in the different analysis computed in this study.

LineP and Malaspina viral metagenomes (viromes) were obtained from samples collected during LineP (http://www.pac.dfo-mpo.gc.ca/science/oceans/data-donnees/line-p/index-eng.html) and Malaspina (http://scientific.expedicionmalaspina.es/) cruises. Particles were precipitated with Iron–Chloride from 0.2 µm filtrates, and resuspended in EDTA-Mg-Ascorbate buffer (John et al., 2011) before the DNA was extracted using Promega's Wizard Prep kit. Assembly and gene prediction were conducted through the IMG/M ER pipeline (Markowitz et al., 2014). Microbial metagenome samples at Saanich Inlet and along the LineP transect were also collected during LineP cruises (http://www.pac.dfo-mpo.gc.ca/science/oceans/data-donnees/line-p/index-eng.html). Sequencing and assembly of these data sets was conducted at the JGI. A list of the different web servers and accession numbers for these publicly available data sets is displayed in Supplementary file 5.

Detection of viral contigs in SUP05/Arctic SAG

Request a detailed protocol

SUP05 SAG contigs were annotated with the Metavir web server (Roux et al., 2014). Briefly, ORFs were predicted with MetaGeneAnnotator (Noguchi et al., 2008) and compared to the RefseqVirus database with BLASTp (Altschul et al., 1997). In order to select viral-associated contigs, we looked for viral-specific genes, that is, genes associated with the formation of the capsid and encapsidation of the genome (designated as ‘hallmark viral genes’). Thus, we searched for all genes annotated as ‘virion structure’, ‘capsid’, ‘portal’, ‘tail’, or ‘terminase’, and selected contigs including at least one of these hallmark genes (Supplementary file 1). Among the 50 viral contigs detected, we highlighted a set of 12 long (>15 kb) or circular contigs as the best references available for SUP05 phages (Supplementary file 1). We then compared the reference sequences retrieved in this first screening round to all the SUP05/Arctic96BD-19 SAG contigs, in order to extract more viral-related sequences (Supplementary file 1). At this step, all contigs with at least 50% of their genes similar to a previously detected SUP05 viral contigs were retained (sequence similarity between predicted genes assessed through BLASTp, thresholds of 0.001 for e-value and 50 for bit score).

Alternatively, we compared the SUP05/Arctic96BD-19 SAG contigs to a set of ocean viromes (Supplementary file 5) and looked for every contig which was covered by virome reads (for 454-sequenced viromes) or predicted genes (for HiSeq-sequenced viromes) on at least three genes with at least 90% of identity (protein sequences). However, this comparison to viromes only highlighted contigs already identified as viral from the hallmark gene analysis. Finally, we looked for every sequence which could come from a new type of phage, based on two known properties of phage genomes: most of their genes are not similar to anything in the current databases, and they tend to be mostly coded on the same strand (by block, or module) (Akhter et al., 2012). We thus looked for all regions in SAG contigs composed of at least 50% of uncharacterized genes, with at least 80% of them on the same coding strand. 19 new short viral contigs were highlighted through this detection (Supplementary file 1), which displayed characteristics close to the viral hallmark contigs (Figure 1—figure supplement 3).

A set of regions of putative viral origin within bacterial contigs also stood out. These sequences were manually curated to check if they could indeed be of viral origin, notably by checking if these regions were conserved between closely related bacterial contigs, and 13 putative defective prophages were eventually identified among them. CRISPR regions were detected with the CRISPR recognition tool (Bland et al., 2007). All spacers were extracted and compared to all SUP05/Arctic96BD-19 SAG contigs with BLASTn.

Annotation of viral contigs

Request a detailed protocol

The annotations of selected contigs were extracted from the Metavir web server (Roux et al., 2014) and manually curated. Taxonomic affiliations were based on a BLAST comparison to RefseqVirus and NR databases from NCBI, with a bit score threshold of 50 and e-value threshold of 0.001. A tBLASTx comparison of larger contigs (>15 kb) against WGS (Whole-Genome shotgun), HTGS (High-Throughput Genomic Shotgun), and GSS (Genomic Survey Sequences) from the NCBI was used to add the most closely related sequence to the analysis, which could have not been included in the NR and Refseq database yet. This screening notably lead to the detection of two contigs from a Gammaproteobacteria single-cell amplified genome (Gamma proteobacterium SCGC AAA160-D02) similar to SUP05 phage genome and was therefore included in the phylogenetic and genome comparison analysis. The affiliation of SUP05 viruses to new or existing genera was based on the criteria of 40% of genes shared within a genus previously defined for Caudovirales (Lavigne et al., 2008). Map comparison figures were created with Easyfig (Sullivan et al., 2011).

Functional annotation was achieved through a domain search against the PFAM database (Punta et al., 2012) (hmmscan [Eddy, 2011], using a threshold of 0.001 for e-value and 30 for score). When looking for putative AMGs, defective prophages were not considered since these regions are likely to be subject to rearrangement and gene transfer, and the origin of single genes within these regions is uncertain. A set of microbial dsrC sequences were selected as references for SUP05 viral-encoded dsrC genes in genomic context (Figure 5B). Briefly, all contigs in SUP05 SAGs longer than 50 kb and containing a DsrC-like gene were compared through BLASTn and displayed with Easyfig (Sullivan et al., 2011).

Phage multiple alignments and phylogenetic trees

Request a detailed protocol

Maximum-likelihood trees were computed with PhyML (Guindon and Gascuel, 2003) using a LG model, a CAT approximation for Gamma parameter, and computing SH-like scores for node supports. All SUP05 contigs affiliated to Podoviridae and including the major capsid protein gene were added in a single tree alongside reference sequences from Autographivirinae and N4-like viruses. The most closely related sequences to each SUP05 Podoviridae, as detected from the genome comparison analysis, were also included in the tree. SUP05 Microviridae were included in a phylogenetic tree based on the Major Capsid protein and centered around the Gokushovirinae sub-family, with sequences from Pichovirinae used as outgroup. Gokushovirinae reference sequences were taken from Roux et al. (2012) and Labonté and Suttle (2013b). In order to include more aquatic sequences, complete Microviridae genomes were assembled from two sets of viromes sampled from a freshwater subtropical reservoir (Tseng et al., 2013) and deep-sea sediments (Yoshida et al., 2013) and annotated as previously described (Roux et al., 2012). Tree figures were drawn with Itol (Letunic and Bork, 2007). DsrC-like predicted protein sequences were aligned with Muscle v3.8.31 (Edgar, 2004), and the multiple alignment was displayed with Jalview (Waterhouse et al., 2009).

Recruitment of metagenomic sequences to SUP05 viral genomes

Request a detailed protocol

A set of oceanic viromes and microbial metagenomes were used for comparison with SUP05 viral genomes (Supplementary file 5). Similarities between SUP05 viral genomes and published viromes were assessed through BLAST comparison, BLASTx for 454-sequenced viromes (POV data set [Hurwitz and Sullivan, 2013], ETSP OMZ viromes [Cassman et al., 2012], ETSP microbial metagenomes and metatranscriptomes [Ganesh et al., 2014; Stewart et al., 2012], and Guaymas basin metagenome [Anantharaman et al., 2013]) and BLASTp from predicted protein for HiSeq-sequenced viromes (LineP and Malaspina viromes, Saanich Inlet and LineP microbial metagenomes), with similar thresholds of 0.001 for e-value and 50 for bit score. Each metagenome—viral genome association was classified based on the number of viral genes detected and the amino-acid percentage identity of the BLAST hits associated: when more than 75% of the genes were detected at more than 80% identity in the metagenome, the viral genome was thought to be in the sample. The same ratio of genes detected at lower percentage (60 to 80%) indicates the presence of a related but distinct virus. We considered that less than 75% of the genes detected meant that this virus was likely absent from the sample. The results of Microviridae detection with the HiSeq Illumina data sets have to be carefully considered, as the linker amplification used in the preparation of samples for HiSeq Illumina sequencing displays a strong bias against ssDNA templates such as Microviridae genomes (Kim and Bae, 2011). Hence, if the detection of SUP05 Microviridae in HiSeq Illumina data sets undoubtedly testifies for the presence of these viruses in the samples, an absence of detection is not a strong indicator of their absence in the sample.

In order to detect the host of SUP05 viruses in the same data sets, a mapping of all sequences from each metagenome to non-viral SAG contigs was computed with mummer (Delcher et al., 2003) (minimum cluster length of 100, maximum gap between two matches in a cluster of 500). The Saanich Inlet SUP05 bacteria is considered present in the metagenome when more than 75% of genes are covered by metagenomic sequences with average nucleotide identity above 95%. Viral-encoded dsrC was computed with a threshold of 95% on average nucleotide identity, as no similarity beyond 80% average nucleotide identity was detected between viral and microbial homologues, whether from public database or from the SUP05 SAG microbial contigs. All recruitment and coverage plots were drawn with the ggplot2 module of R software (Wickham, 2009).

Abundance and variability of SUP05 viral and microbial genomes

Request a detailed protocol

Assessment of variability in the populations associated with each SUP05 virus was based on a BLASTp between all sequences from Saanich Inlet metagenomes recruited by each SUP05 viral contig (thresholds of 50 for bit score, 0.001 for e-value, and 80% for amino-acid identity). The relative abundance of SUP05 viral and microbial genomes was assessed from the recruitment of Saanich Inlet metagenomic reads to each viral contig and set of microbial contigs (all contigs greater than 5 kb and not identified as viral) for each ‘reference’ SAG (i.e., the 4 SAG in which a SUP05 reference Caudovirales was detected: AB-750C22AB-904 for C22_13, AB-750K04AB-904 for K04_0, AB-751_G10AB-905 for G10_6, and AB-755_M08F06 for M8F6_0, Figure 2—source data 1). For each metagenome, a normalized ratio of nucleotides recruited by each contig or set of contigs was calculated as the number of bases recruited (sum of the length of recruited reads) divided by the total number of bases in the (set of) contig(s) and the total number of bases in the metagenome. The ratio of viral genomes to host genomes was then calculated for each metagenome as the relative abundance of viral contig divided by the relative abundance of bacterial contig from the same SAG. The plots of genetic variability and relative abundance distributions were generated with the ggplot2 module of R software (Wickham, 2009). The perl scripts used in the different part of the bioinformatics analyses are available online at http://tmpl.arizona.edu/dokuwiki/doku.php?id=bioinformatics:scripts:sup05 and as Source code 1.

References

1. Abedon ST
(2009)
Advances in Applied Microbiology

Advances in Applied Microbiology, Elsevier, 1st edition, Vol. 67, 10.1016/S0065-2164(08)01001-0.
- Google Scholar
(2012) Structure unifies the viral universe
Annual Review of Biochemistry 81:795–822.

https://doi.org/10.1146/annurev-biochem-060910-095130
- Google Scholar
(2012) PhiSpy: a novel algorithm for finding prophages in bacterial genomes that combines similarity- and composition-based strategies
Nucleic Acids Research 40:1–13.

https://doi.org/10.1093/nar/gks406
- Google Scholar
1. Allers E
2. Moraru C
3. Duhaime MB
4. Beneze E
5. Solonenko N
6. Canosa JB
7. Amann R
8. Sullivan MB
(2013) Single-cell and population level viral infection dynamics revealed by phageFISH, a method to visualize intracellular and free viruses
Environmental Microbiology 15:2306–2318.

https://doi.org/10.1111/1462-2920.12100
- Google Scholar
1. Altschul SF
2. Madden TL
3. Schäffer AA
4. Zhang J
5. Zhang Z
6. Miller W
7. Lipman DJ
(1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
Nucleic Acids Research 25:3389–3402.

https://doi.org/10.1093/nar/25.17.3389
- Google Scholar
(2013) Evidence for hydrogen oxidation and metabolic plasticity in widespread deep-sea sulfur-oxidizing bacteria
Proceedings of the National Academy of Sciences of USA 110:330–335.

https://doi.org/10.1073/pnas.1215340110
- Google Scholar
1. Anantharaman K
2. Duhaime MB
3. Breier JA
4. Wendt K
5. Toner BM
6. Dick GJ
(2014) Sulfur oxidation genes in diverse deep-sea viruses
Science 344:757–760.

https://doi.org/10.1126/science.1252229
- Google Scholar
1. Andersson AF
2. Banfield JF
(2008) Virus population dynamics and acquired virus resistance in natural microbial communities
Science 320:1047–1050.

https://doi.org/10.1126/science.1157358
- Google Scholar
(2011) Using CRISPRs as a metagenomic tool to identify microbial hosts of a diffuse flow hydrothermal vent viral assemblage
FEMS Microbiology Ecology 77:120–133.

https://doi.org/10.1111/j.1574-6941.2011.01090.x
- Google Scholar
1. Angly FE
2. Felts B
3. Breitbart M
4. Salamon P
5. Edwards RA
6. Carlson C
7. Chan AM
8. Haynes M
9. Kelley S
10. Liu H
11. Mahaffy JM
12. Mueller JE
13. Nulton J
14. Olson R
15. Parsons R
16. Rayhawk S
17. Suttle CA
18. Rohwer F
(2006) The marine viromes of four oceanic regions
PLOS Biology 4:e368.

https://doi.org/10.1371/journal.pbio.0040368
- Google Scholar
1. Angly FE
2. Willner D
3. Prieto-Davó A
4. Edwards RA
5. Schmieder R
6. Vega-Thurber R
7. Antonopoulos DA
8. Barott K
9. Cottrell MT
10. Desnues C
11. Dinsdale EA
12. Furlan M
13. Haynes M
14. Henn MR
15. Hu Y
16. Kirchman DL
17. McDole T
18. McPherson JD
19. Meyer F
20. Miller RM
21. Mundt E
22. Naviaux RK
23. Rodriguez-Mueller B
24. Stevens R
25. Wegley L
26. Zhang L
27. Zhu B
28. Rohwer F
(2009) The GAAS metagenomic tool and its estimations of viral and microbial average genome size in four major biomes
PLOS Computational Biology 5:e1000593.

https://doi.org/10.1371/journal.pcbi.1000593
- Google Scholar
1. Bland C
2. Ramsey TL
3. Sabree F
4. Lowe M
5. Brown K
6. Kyrpides NC
7. Hugenholtz P
(2007) CRISPR recognition tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats
BMC Bioinformatics 8:209.

https://doi.org/10.1186/1471-2105-8-209
- Google Scholar
(2007) Exploring the vast diversity of marine viruses
Oceanography 20:135–139.

https://doi.org/10.5670/oceanog.2007.58
- Google Scholar
1. Brum JR
2. Morris JJ
3. Décima M
4. Stukel MR
(2014)
Association for the Sciences of Limnology and Oceanography

16–48, Association for the Sciences of Limnology and Oceanography, Eco-Das IX, 10.4319/ecodas.2014.978-0-9845591-3-8.16.
- Google Scholar
(2010) A cryptic sulfur cycle in oxygen-minimum-zone waters off the Chilean coast
Science 330:1375–1378.

https://doi.org/10.1126/science.1196889
- Google Scholar
1. Cassman N
2. Prieto-Davó A
3. Walsh K
4. Silva GG
5. Angly F
6. Akhter S
7. Barott K
8. Busch J
9. McDole T
10. Haggerty JM
11. Willner D
12. Alarcón G
13. Ulloa O
14. DeLong EF
15. Dutilh BE
16. Rohwer F
17. Dinsdale EA
(2012) Oxygen minimum zones harbour novel viral communities with low diversity
Environmental Microbiology 14:3043–3065.

https://doi.org/10.1111/j.1462-2920.2012.02891.x
- Google Scholar
1. Clokie MR
2. Shan J
3. Bailey S
4. Jia Y
5. Krisch HM
6. West S
7. Mann NH
(2006) Transcription of a ‘photosynthetic’ T4-type phage during infection of a marine cyanobacterium
Environmental Microbiology 8:827–835.

https://doi.org/10.1111/j.1462-2920.2005.00969.x
- Google Scholar
(2008) Efficient phage-mediated pigment biosynthesis in oceanic cyanobacteria
Current Biology 18:442–448.

https://doi.org/10.1016/j.cub.2008.02.067
- Google Scholar
(2003)
Using MUMmer to identify similar regions in large sequence sets

Chapter 10: Unit 10.3, 10.1002/0471250953.bi1003s00.
- Google Scholar
1. DeLong EF
2. Preston CM
3. Mincer T
4. Rich V
5. Hallam SJ
6. Frigaard NU
7. Martinez A
8. Sullivan MB
9. Edwards R
10. Brito BR
11. Chisholm SW
12. Karl DM
(2006) Community genomics among stratified microbial assemblages in the ocean's interior
Science 311:496–503.

https://doi.org/10.1126/science.1120250
- Google Scholar
(2014) Viral tagging reveals discrete populations in Synechococcus viral genome sequence space
Nature In press.

https://doi.org/10.1038/nature13459
- Google Scholar
1. Desnues C
2. Raoult D
(2012)
Virophages question the existence of satellites

234, Nature Reviews Microbiology, 10, author reply 234, 10.1038/nrmicro2676-c3.
- Google Scholar
1. Diemer GS
2. Stedman KM
(2012) A novel virus genome discovered in an extreme environment suggests recombination between unrelated groups of RNA and DNA viruses
Biology Direct 7:13.

https://doi.org/10.1186/1745-6150-7-13
- Google Scholar
1. Eddy SR
(2011) Accelerated profile HMM searches
PLOS Computational Biology 7:e1002195.

https://doi.org/10.1371/journal.pcbi.1002195
- Google Scholar
1. Edgar RC
(2004) MUSCLE: a multiple sequence alignment method with reduced time and space complexity
BMC Bioinformatics 5:113.

https://doi.org/10.1186/1471-2105-5-113
- Google Scholar
(2008) The microbial engines that drive earth's biogeochemical cycles
Science 320:1034–1039.

https://doi.org/10.1126/science.1153213
- Google Scholar
1. Fischer MG
(2012)
Sputnik and Mavirus: more than just satellite viruses

78, Nature Reviews Microbiology, 10, author reply 78, 10.1038/nrmicro2676-c1.
- Google Scholar
1. Frias-Lopez J
2. Shi Y
3. Tyson GW
4. Coleman ML
5. Schuster SC
6. Chisholm SW
7. Delong EF
(2008) Microbial community gene expression in ocean surface waters
Proceedings of the National Academy of Sciences of USA 105:3805–3810.

https://doi.org/10.1073/pnas.0708897105
- Google Scholar
(2014) Metagenomic analysis of size-fractionated picoplankton in a marine oxygen minimum zone
The ISME Journal 8:187–211.

https://doi.org/10.1038/ismej.2013.144
- Google Scholar
1. Gnerre S
2. Maccallum I
3. Przybylski D
4. Ribeiro FJ
5. Burton JN
6. Walker BJ
7. Sharpe T
8. Hall G
9. Shea TP
10. Sykes S
11. Berlin AM
12. Aird D
13. Costello M
14. Daza R
15. Williams L
16. Nicol R
17. Gnirke A
18. Nusbaum C
19. Lander ES
20. Jaffe DB
(2011) High-quality draft assemblies of mammalian genomes from massively parallel sequence data
Proceedings of the National Academy of Sciences of USA 108:1513–1518.

https://doi.org/10.1073/pnas.1017351108
- Google Scholar
1. Goldsmith DB
2. Crosti G
3. Dwivedi B
4. McDaniel LD
5. Varsani A
6. Suttle CA
7. Weinbauer MG
8. Sandaa RA
9. Breitbart M
(2011) Development of phoH as a novel signature gene for assessing marine phage diversity
Applied and Environmental Microbiology 77:7730–7739.

https://doi.org/10.1128/AEM.05531-11
- Google Scholar
1. Guindon S
2. Gascuel O
(2003) A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood
Systematic Biology 52:696–704.

https://doi.org/10.1080/10635150390235520
- Google Scholar
(2014) Metaproteomics reveals differential modes of metabolic coupling among ubiquitous oxygen minimum zone microbes
Proceedings of the National Academy of Sciences of USA 111:11395–11140.

https://doi.org/10.1073/pnas.1322132111
- Google Scholar
(2000) The origins and ongoing evolution of viruses
Trends in Microbiology 8:504–508.

https://doi.org/10.1016/S0966-842X(00)01863-1
- Google Scholar
(2013) Metabolic reprogramming by viruses in the sunlit and dark ocean
Genome Biology 14:R123.

https://doi.org/10.1186/gb-2013-14-11-r123
- Google Scholar
1. Hurwitz BL
2. Sullivan MB
(2013) The Pacific Ocean virome (POV): a marine viral metagenomic dataset and associated protein clusters for quantitative viral ecology
PLOS ONE 8:e57355.

https://doi.org/10.1371/journal.pone.0057355
- Google Scholar
1. Huson DH
2. Auch AF
3. Qi J
4. Schuster SC
(2007) MEGAN analysis of metagenomic data
Genome Research 17:377–386.

https://doi.org/10.1101/gr.5969107
- Google Scholar
1. John SG
2. Mendez CB
3. Deng L
4. Poulos B
5. Kauffman AK
6. Kern S
7. Brum J
8. Polz MF
9. Boyle EA
10. Sullivan MB
(2011) A simple and efficient method for concentration of ocean viruses by chemical flocculation
Environmental Microbiology Reports 3:195–202.

https://doi.org/10.1111/j.1758-2229.2010.00208.x
- Google Scholar
1. Kim KH
2. Bae JW
(2011) Amplification methods bias metagenomic libraries of uncultured single-stranded and double-stranded DNA viruses
Applied and Environmental Microbiology 77:7663–7668.

https://doi.org/10.1128/AEM.00289-11
- Google Scholar
(2013) MetaPathways: a modular pipeline for constructing pathway/genome databases from environmental sequence information
BMC Bioinformatics 14:202.

https://doi.org/10.1186/1471-2105-14-202
- Google Scholar
1. Krupovic M
2. Cvirkaite-Krupovic V
(2012) Towards a more comprehensive classification of satellite viruses
Nature Reviews Microbiology 10:234.

https://doi.org/10.1038/nrmicro2676-c4
- Google Scholar
1. La Scola B
2. Desnues C
3. Pagnier I
4. Robert C
5. Barrassi L
6. Fournous G
7. Merchat M
8. Suzan-Monti M
9. Forterre P
10. Koonin E
11. Raoult D
(2008) The virophage as a unique parasite of the giant mimivirus
Nature 455:100–104.

https://doi.org/10.1038/nature07218
- Google Scholar
1. Labonté JM
2. Suttle CA
(2013a) Metagenomic and whole-genome analysis reveals new lineages of gokushoviruses and biogeographic separation in the sea
Frontiers in Microbiology 4:404.

https://doi.org/10.3389/fmicb.2013.00404
- Google Scholar
1. Labonté JM
2. Suttle CA
(2013b) Previously unknown and highly divergent ssDNA viruses populate the oceans
The ISME Journal 7:2169–2177.

https://doi.org/10.1038/ismej.2013.110
- Google Scholar
1. Lam P
2. Lavik G
3. Jensen MM
4. van de Vossenberg J
5. Schmid M
6. Woebken D
7. Gutiérrez D
8. Amann R
9. Jetten MS
10. Kuypers MM
(2009) Revising the nitrogen cycle in the Peruvian oxygen minimum zone
Proceedings of the National Academy of Sciences of USA 106:4752–4757.

https://doi.org/10.1073/pnas.0812444106
- Google Scholar
(2008) Unifying classical and molecular taxonomic classification: analysis of the Podoviridae using BLASTP-based tools
Research in Microbiology 159:406–414.

https://doi.org/10.1016/j.resmic.2008.03.005
- Google Scholar
1. Letunic I
2. Bork P
(2007) Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotation
Bioinformatics 23:127–128.

https://doi.org/10.1093/bioinformatics/btl529
- Google Scholar
(2005) Photosynthesis genes in marine viruses yield proteins during host infection
Nature 438:86–89.

https://doi.org/10.1038/nature04111
- Google Scholar
1. Mander GJ
2. Weiss MS
3. Hedderich R
4. Kahnt J
5. Ermler U
6. Warkentin E
(2005) X-ray structure of the gamma-subunit of a dissimilatory sulfite reductase: fixed and flexible C-terminal arms
FEBS Letters 579:4600–4604.

https://doi.org/10.1016/j.febslet.2005.07.029
- Google Scholar
1. Mann NH
2. Cook A
3. Millard A
4. Bailey S
5. Clokie M
(2003) Bacterial photosynthesis genes in a virus
Nature 424:741.

https://doi.org/10.1038/424741a
- Google Scholar
1. Markowitz VM
2. Chen IM
3. Chu K
4. Szeto E
5. Palaniappan K
6. Pillay M
7. Ratner A
8. Huang J
9. Pagani I
10. Tringe S
11. Huntemann M
12. Billis K
13. Varghese N
14. Tennessen K
15. Mavromatis K
16. Pati A
17. Ivanova NN
18. Kyrpides NC
(2014) IMG/M 4 version of the integrated metagenome comparative analysis system
Nucleic Acids Research 42:D568–D573.

https://doi.org/10.1093/nar/gkt919
- Google Scholar
(2012) Unveiling in situ interactions between marine protists and bacteria through single cell sequencing
The ISME Journal 6:703–707.

https://doi.org/10.1038/ismej.2011.126
- Google Scholar
1. Mattes TE
2. Nunn BL
3. Marshall KT
4. Proskurowski G
5. Kelley DS
6. Kawka OE
7. Goodlett DR
8. Hansell DA
9. Morris RM
(2013) Sulfur oxidizers dominate carbon fixation at a biogeochemical hot spot in the dark ocean
The ISME Journal 7:2349–2360.

https://doi.org/10.1038/ismej.2013.113
- Google Scholar
(2013) Expanding the marine virosphere using metagenomics
PLOS Genetics 9:e1003987.

https://doi.org/10.1371/journal.pgen.1003987
- Google Scholar
1. Murant AF
2. Mayo MA
(1982) Satellites of plant viruses
Annual Review of Phytopathology 20:49–70.

https://doi.org/10.1146/annurev.py.20.090182.000405
- Google Scholar
(2008) MetaGeneAnnotator: detecting species-specific patterns of ribosomal binding site for precise gene prediction in anonymous prokaryotic and phage genomes
DNA Research 15:387–396.

https://doi.org/10.1093/dnares/dsn027
- Google Scholar
(2008) The crystal structure of Desulfovibrio vulgaris dissimilatory sulfite reductase bound to DsrC provides novel insights into the mechanism of sulfate respiration
The Journal of Biological Chemistry 283:34141–34149.

https://doi.org/10.1074/jbc.M805643200
- Google Scholar
1. Peng Y
2. Leung HC
3. Yiu SM
4. Chin FY
(2012) IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth
Bioinformatics 28:1420–1428.

https://doi.org/10.1093/bioinformatics/bts174
- Google Scholar
1. Punta M
2. Coggill PC
3. Eberhardt RY
4. Mistry J
5. Tate J
6. Boursnell C
7. Pang N
8. Forslund K
9. Ceric G
10. Clements J
11. Heger A
12. Holm L
13. Sonnhammer EL
14. Eddy SR
15. Bateman A
16. Finn RD
(2012) The Pfam protein families database
Nucleic Acids Research 40:D290–D301.

https://doi.org/10.1093/nar/gkr1065
- Google Scholar
1. Rappé MS
2. Giovannoni SJ
(2003) The uncultured microbial majority
Annual Review of Microbiology 57:369–394.

https://doi.org/10.1146/annurev.micro.57.030502.090759
- Google Scholar
1. Rinke C
2. Schwientek P
3. Sczyrba A
4. Ivanova NN
5. Anderson IJ
6. Cheng JF
7. Darling A
8. Malfatti S
9. Swan BK
10. Gies EA
11. Dodsworth JA
12. Hedlund BP
13. Tsiamis G
14. Sievert SM
15. Liu WT
16. Eisen JA
17. Hallam SJ
18. Kyrpides NC
19. Stepanauskas R
20. Rubin EM
21. Hugenholtz P
22. Woyke T
(2013) Insights into the phylogeny and coding potential of microbial dark matter
Nature 499:431–437.

https://doi.org/10.1038/nature12352
- Google Scholar
(2009) Explaining microbial population genomics through phage predation
Nature Reviews Microbiology 7:828–836.

https://doi.org/10.1038/nrmicro2235
- Google Scholar
1. Roux S
2. Enault F
3. Bronner G
4. Vaulot D
5. Forterre P
6. Krupovic M
(2013) Chimeric viruses blur the borders between the major groups of eukaryotic single-stranded DNA viruses
Nature Communications 4:2700.

https://doi.org/10.1038/ncomms3700
- Google Scholar
1. Roux S
2. Krupovic M
3. Poulet A
4. Debroas D
5. Enault F
(2012) Evolution and diversity of the Microviridae viral family through a collection of 81 new complete genomes assembled from virome reads
PLOS ONE 7:e40418.

https://doi.org/10.1371/journal.pone.0040418
- Google Scholar
1. Roux S
2. Tournayre J
3. Mahul A
4. Debroas D
5. Enault F
(2014) Metavir 2: virome comparative analysis and annotation of assembled genomic fragments
BMC Bioinformatics 15:76.

https://doi.org/10.1186/1471-2105-15-76
- Google Scholar
1. Schloss PD
2. Westcott SL
3. Ryabin T
4. Hall JR
5. Hartmann M
6. Hollister EB
7. Lesniewski RA
8. Oakley BB
9. Parks DH
10. Robinson CJ
11. Sahl JW
12. Stres B
13. Thallinger GG
14. Van Horn DJ
15. Weber CF
(2009) Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities
Applied and Environmental Microbiology 75:7537–7541.

https://doi.org/10.1128/AEM.01541-09
- Google Scholar
1. Sharon I
2. Alperovitch A
3. Rohwer F
4. Haynes M
5. Glaser F
6. Atamna-Ismaeel N
7. Pinter RY
8. Partensky F
9. Koonin EV
10. Wolf YI
11. Nelson N
12. Béjà O
(2009) Photosystem I Gene cassettes are present in marine virus genomes
Nature 461:258–262.

https://doi.org/10.1038/nature08284
- Google Scholar
1. Sharon I
2. Battchikova N
3. Aro EM
4. Giglione C
5. Meinnel T
6. Glaser F
7. Pinter RY
8. Breitbart M
9. Rohwer F
10. Béjà O
(2011) Comparative metagenomics of microbial traits within oceanic viral communities
The ISME Journal 5:1178–1190.

https://doi.org/10.1038/ismej.2011.2
- Google Scholar
(2008) CRISPR–a widespread system that provides acquired resistance against phages in bacteria and archaea
Nature Reviews Microbiology 6:181–186.

https://doi.org/10.1038/nrmicro1793
- Google Scholar
1. Stepanauskas R
2. Sieracki ME
(2007) Matching phylogeny and metabolism in the uncultured marine bacteria, one cell at a time
Proceedings of the National Academy of Sciences of USA 104:9052–9057.

https://doi.org/10.1073/pnas.0700496104
- Google Scholar
(2012) Microbial metatranscriptomics in a permanent marine oxygen minimum zone
Environmental Microbiology 14:23–40.

https://doi.org/10.1111/j.1462-2920.2010.02400.x
- Google Scholar
(2008) Expanding oxygen-minimum zones in the tropical oceans
Science 320:655–658.

https://doi.org/10.1126/science.1153847
- Google Scholar
(2005) Three Prochlorococcus cyanophage genomes: signature features and ecological interpretations
PLOS Biology 3:e144.

https://doi.org/10.1371/journal.pbio.0030144
- Google Scholar
1. Sullivan MB
2. Huang KH
3. Ignacio-Espinoza JC
4. Berlin AM
5. Kelly L
6. Weigele PR
7. DeFrancesco AS
8. Kern SE
9. Thompson LR
10. Young S
11. Yandava C
12. Fu R
13. Krastins B
14. Chase M
15. Sarracino D
16. Osburne MS
17. Henn MR
18. Chisholm SW
(2010) Genomic analysis of oceanic cyanobacterial myoviruses compared with T4-like myoviruses from diverse hosts and environments
Environmental Microbiology 12:3035–3056.

https://doi.org/10.1111/j.1462-2920.2010.02280.x
- Google Scholar
(2006) Prevalence and evolution of core photosystem II genes in marine cyanobacterial viruses and their hosts
PLOS Biology 4:e234.

https://doi.org/10.1371/journal.pbio.0040234
- Google Scholar
(2011) Easyfig: a genome comparison visualizer
Bioinformatics 27:1009–1010.

https://doi.org/10.1093/bioinformatics/btr039
- Google Scholar
1. Suttle CA
(2005) Viruses in the sea
Nature 437:356–361.

https://doi.org/10.1038/nature04160
- Google Scholar
1. Suttle CA
(2007) Marine viruses–major players in the global ecosystem
Nature Reviews Microbiology 5:801–812.

https://doi.org/10.1038/nrmicro1750
- Google Scholar
1. Swan BK
2. Martinez-Garcia M
3. Preston CM
4. Sczyrba A
5. Woyke T
6. Lamy D
7. Reinthaler T
8. Poulton NJ
9. Masland ED
10. Gomez ML
11. Sieracki ME
12. DeLong EF
13. Herndl GJ
14. Stepanauskas R
(2011) Potential for chemolithoautotrophy among ubiquitous bacteria lineages in the dark ocean
Science 333:1296–1300.

https://doi.org/10.1126/science.1203690
- Google Scholar
1. Swan BK
2. Tupper B
3. Sczyrba A
4. Lauro FM
5. Martinez-Garcia M
6. González JM
7. Luo H
8. Wright JJ
9. Landry ZC
10. Hanson NW
11. Thompson BP
12. Poulton NJ
13. Schwientek P
14. Acinas SG
15. Giovannoni SJ
16. Moran MA
17. Hallam SJ
18. Cavicchioli R
19. Woyke T
20. Stepanauskas R
(2013) Prevalent genome streamlining and latitudinal divergence of planktonic bacteria in the surface ocean
Proceedings of the National Academy of Sciences of USA 110:11463–11468.

https://doi.org/10.1073/pnas.1304246110
- Google Scholar
(2011) Probing individual environmental bacteria for viruses by using microfluidic digital PCR
Science 333:58–62.

https://doi.org/10.1126/science.1200758
- Google Scholar
1. Thompson LR
2. Zeng Q
3. Kelly L
4. Huang KH
5. Singer AU
6. Stubbe J
7. Chisholm SW
(2011) Phage auxiliary metabolic genes and the rRedirection of cyanobacterial host carbon metabolism
Proceedings of the National Academy of Sciences of USA 108:E757–E764.

https://doi.org/10.1073/pnas.1102164108
- Google Scholar
1. Tseng CH
2. Chiang PW
3. Shiah FK
4. Chen YL
5. Liou JR
6. Hsu TC
7. Maheswararajah S
8. Saeed I
9. Halgamuge S
10. Tang SL
(2013) Microbial and viral metagenomes of a subtropical freshwater reservoir subject to climatic disturbances
The ISME Journal 7:2374–2386.

https://doi.org/10.1038/ismej.2013.118
- Google Scholar
(2011) Diversity and distribution of single-stranded DNA phages in the North Atlantic Ocean
The ISME Journal 5:822–830.

https://doi.org/10.1038/ismej.2010.188
- Google Scholar
(2012) Microbial oceanography of anoxic oxygen minimum zones
Proceedings of the National Academy of Sciences of USA 109:15996–16003.

https://doi.org/10.1073/pnas.1205009109
- Google Scholar
1. Venter JC
2. Remington K
3. Heidelberg JF
4. Rusch D
5. Eisen JA
6. Wu D
7. Paulsen I
8. Nelson KE
9. Nelson W
10. Fouts DE
11. Levy S
12. Knap AH
13. Lomas MW
14. Nealson K
15. White O
16. Peterson J
17. Hoffman J
18. Parsons R
19. Baden-Tillson H
20. Pfannkoch C
21. Rogers YH
22. Smith HO
(2004) Environmental genome shotgun sequencing of the Sargasso sea
Science 304:66–74.

https://doi.org/10.1126/science.1093857
- Google Scholar
1. Walsh DA
2. Zaikova E
3. Howes CG
4. Song YC
5. Wright JJ
6. Tringe SG
7. Tortell PD
8. Hallam SJ
(2009) Metagenome of a versatile chemolithoautotroph from expanding oceanic dead zones
Science 326:578–582.

https://doi.org/10.1126/science.1175309
- Google Scholar
1. Ward BB
2. Devol AH
3. Rich JJ
4. Chang BX
5. Bulow SE
6. Naik H
7. Pratihary A
8. Jayakumar A
(2009) Denitrification as the dominant nitrogen loss process in the Arabian sea
Nature 461:78–81.

https://doi.org/10.1038/nature08276
- Google Scholar
1. Waterbury JB
2. Valois FW
(1993)
Resistance to co-occurring phages enables marine synechococcus communities to coexist with cyanophages abundant in seawater

Applied and Environmental Microbiology 59:3393–3399.
- Google Scholar
(2009) Jalview version 2–a multiple sequence alignment editor and analysis workbench
Bioinformatics 25:1189–1191.

https://doi.org/10.1093/bioinformatics/btp033
- Google Scholar
(2003) Lysogeny and virus-induced mortality of bacterioplankton in surface, deep, and anoxic marine waters
Limnology and Oceanography 48:1457–1465.

https://doi.org/10.4319/lo.2003.48.4.1457
- Google Scholar
(2007) Persistently declining oxygen levels in the interior waters of the eastern subarctic Pacific
Progress in Oceanography 75:179–199.

https://doi.org/10.1016/j.pocean.2007.08.007
- Google Scholar
1. Wickham H
(2009)
ggplot2: elegant graphics for data analysis

ggplot2: elegant graphics for data analysis, Springer Publishing Company.
- Google Scholar
(2012) Metagenomic Exploration of Viruses throughout the Indian Ocean
PLOS ONE 7:e42047.

https://doi.org/10.1371/journal.pone.0042047
- Google Scholar
(2012) Microbial ecology of expanding oxygen minimum zones
Nature Reviews Microbiology 10:381–394.

https://doi.org/10.1038/nrmicro2778
- Google Scholar
1. Yoon HS
2. Price DC
3. Stepanauskas R
4. Rajah VD
5. Sieracki ME
6. Wilson WH
7. Yang EC
8. Duffy S
9. Bhattacharya D
(2011) Single-cell genomics reveals organismal interactions in uncultivated marine protists
Science 332:714–717.

https://doi.org/10.1126/science.1203163
- Google Scholar
1. Yoshida M
2. Takaki Y
3. Eitoku M
4. Nunoura T
5. Takai K
(2013) Metagenomic analysis of viral communities in (hado) pelagic sediments
PLOS ONE 8:e57271.

https://doi.org/10.1371/journal.pone.0057271
- Google Scholar
1. Zaikova E
2. Walsh DA
3. Stilwell CP
4. Mohn WW
5. Tortell PD
6. Hallam SJ
(2010) Microbial community dynamics in a seasonally anoxic fjord: Saanich Inlet, British Columbia
Environmental Microbiology 12:172–191.

https://doi.org/10.1111/j.1462-2920.2009.02058.x
- Google Scholar

Article and author information

Author details

Simon Roux

Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, United States

Contribution
SR, Conception and design, Analysis and interpretation of data, Drafting or revising the article

Competing interests
The authors declare that no competing interests exist.
Alyse K Hawley

Department of Microbiology and Immunology, University of British Columbia, Vancouver, Canada

Contribution
AKH, Acquisition of data, Analysis and interpretation of data

Competing interests
The authors declare that no competing interests exist.
Monica Torres Beltran

Department of Microbiology and Immunology, University of British Columbia, Vancouver, Canada

Contribution
MTB, Acquisition of data, Analysis and interpretation of data

Competing interests
The authors declare that no competing interests exist.
Melanie Scofield

Department of Microbiology and Immunology, University of British Columbia, Vancouver, Canada

Contribution
MS, Acquisition of data, Drafting or revising the article

Competing interests
The authors declare that no competing interests exist.
Patrick Schwientek

U.S Department of Energy Joint Genome Institute, Walnut Creek, United States

Contribution
PS, Acquisition of data

Competing interests
The authors declare that no competing interests exist.
Ramunas Stepanauskas

Bigelow Laboratory for Ocean Sciences, East Boothbay, United States

Contribution
RS, Conception and design, Acquisition of data, Drafting or revising the article

Competing interests
The authors declare that no competing interests exist.
Tanja Woyke

U.S Department of Energy Joint Genome Institute, Walnut Creek, United States

Contribution
TW, Conception and design, Acquisition of data, Drafting or revising the article

Competing interests
The authors declare that no competing interests exist.
Steven J Hallam
1. Department of Microbiology and Immunology, University of British Columbia, Vancouver, Canada
2. Graduate Program in Bioinformatics, University of British Columbia, Vancouver, Canada
Contribution
SJH, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article

For correspondence
shallam@mail.ubc.ca

Competing interests
The authors declare that no competing interests exist.
Matthew B Sullivan

Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, United States

Contribution
MBS, Conception and design, Analysis and interpretation of data, Drafting or revising the article

For correspondence
mbsulli@email.arizona.edu

Competing interests
The authors declare that no competing interests exist.

Funding

Office of Science (DE-AC02-05CH1123)

Ramunas Stepanauskas
Tanja Woyke
Steven J Hallam
Matthew B Sullivan

Ambrose Monell Foundation

Steven J Hallam

Tula Foundation

Steven J Hallam

Canadian Network for Research and Innovation in Machining Technology, Natural Sciences and Engineering Research Council of Canada

Steven J Hallam

Canada Foundation for Innovation

Steven J Hallam

Canadian Institute for Advanced Research

Steven J Hallam

Gordon and Betty Moore Foundation (3790)

Matthew B Sullivan

National Science Foundation (OCE-0961947)

Matthew B Sullivan

Bio5 Institute

Matthew B Sullivan

G. Unger Vetlesen Foundation and Ambrose Monell Foundation

Steven J Hallam

University of Arizona, Technology and Research Initiative Fund (Water, Environmental and Energy Solutions Inititative)

Matthew B Sullivan

National Science Foundation (OCE-821374, OCE-1019242)

Ramunas Stepanauskas

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We thank the crew aboard the MSV John Strickland for logistical and sampling support in Saanich Inlet and Melanie Scofield, Jody Wright, Evan Durno, and Elena Zaikova in the Hallam lab for technical assistance. We also thank the Joint Genome Institute, including IMG and GOLD teams and Sussanah Tringe, Stephanie Malfatti, and Tijana Glavina del Rio for technical and project management assistance. This work was performed under the auspices of the U.S. Department of Energy Joint Genome Institute supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231; the G Unger Vetlesen and Ambrose Monell Foundations and the Tula Foundation funded Centre for Microbial Diversity and Evolution, Natural Sciences and Engineering Research Council (NSERC) of Canada, Canada Foundation for Innovation (CFI), and the Canadian Institute for Advanced Research (CIFAR) through grants awarded to SJH; and BIO5, NSF (OCE-0961947) and the Gordon and Betty Moore Foundation (#3790) through grants awarded to MBS. This work was supported by the University of Arizona, Technology and Research Initiative Fund, through the Water, Environmental and Energy Solutions Initiative. Single cell genomics instrumentation at Bigelow Laboratory for Ocean Sciences was supported by NSF grants OCE-821374 and OCE-1019242 to RS and by the State of Maine Technology Institute. The single cell genome sequences and annotations can be accessed via IMG (img.jgi.doe.gov, SAG Ids are listed in Supplementary file 4). Viral contigs and defective prophages identified in the SUP05 SAG are available on the Metavir webserver (http://metavir-meb.univ-bpclermont.fr/), as virome ‘SUP05_viral_sequences’ in project ‘SUP05_SAGs’. The web servers hosting viral and microbial metagenome sequences used here are listed in Supplementary file 5.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.