Research Article

Genomic evidence for global ocean plankton biogeography shaped by large-scale current systems

Sorbonne Université, CNRS, Station Biologique de Roscoff, UMR7144, ECOMAP, France
Institut de Biologia Evolutiva (CSIC‐Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta, Spain
Stazione Zoologica Anton Dohrn, Villa Comunale, Italy
CEA, DAM, DIF, F‐91297, France
Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, CEA, CNRS, Université Evry, Université Paris‐Saclay, France
Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, France
Aix Marseille Univ., Université de Toulon, CNRS, IRD, MIO UM, France
Changing Ocean Research Unit, Institute for the Oceans and Fisheries, University of British Columbia. Aquatic Ecosystems Research Lab, Canada
Ecology and Evolutionary Biology, Yale University, United States
Institut pasteur, Université Paris Cité, Bioinformatics and Biostatistics Hub, France
Univ Rennes, CNRS, Inria, IRISA-UMR 6074, France
Sorbonne Universités, CNRS, Laboratoire d’Oceanographie de Villefranche, LOV, France
Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, Denmark
MARUM, Center for Marine Environmental Sciences, University of Bremen, Germany
Max Planck Institute for Marine Microbiology, Germany
Dipartimento di Fisica e Astronomia ‘G. Galilei’ & CNISM, INFN, Università di Padova, Italy
MaIAGE, INRAE, Université Paris‐Saclay, France
Nantes Université, Ecole Centrale Nantes, CNRS, LS2N, France
Genoscope, Institut de biologie François‐Jacob, Commissariat à l'Energie Atomique (CEA), Université Paris‐Saclay, France
Department of Microbiology, The Ohio State University, United States
Institut de Biologie de l’Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, France
Structural and Computational Biology, European Molecular Biology Laboratory, Germany
Directors’ Research European Molecular Biology Laboratory, Germany
PANGAEA, Data Publisher for Earth and Environmental Science, University of Bremen, Germany
Department of Oceanography and Coastal Sciences, Louisiana State University, United States
Yonsei Frontier Lab, Yonsei University, Republic of Korea
Department of Bioinformatics, Biocenter, University of Würzburg, Germany
Institute of Microbiology, Department of Biology, ETH Zurich, Vladimir‐Prelog‐Weg, Switzerland
Institut Universitaire de France (IUF), France
School of Marine Sciences, University of Maine, United States
EMERGE Biology Integration Institute, The Ohio State University, United States
Center of Microbiome Science, The Ohio State University, United States
Department of Civil, Environmental and Geodetic Engineering, The Ohio State University, United States

Aug 3, 2022

Open access
Copyright information

Abstract
Editor's evaluation
eLife digest
Introduction
Results
Discussion
Materials and methods
Appendix 1
Appendix 2
Appendix 3
Appendix 4
Appendix 5
Appendix 6
Appendix 7
Data availability
References
Article and author information
Metrics

Abstract

Biogeographical studies have traditionally focused on readily visible organisms, but recent technological advances are enabling analyses of the large-scale distribution of microscopic organisms, whose biogeographical patterns have long been debated. Here we assessed the global structure of plankton geography and its relation to the biological, chemical, and physical context of the ocean (the ‘seascape’) by analyzing metagenomes of plankton communities sampled across oceans during the Tara Oceans expedition, in light of environmental data and ocean current transport. Using a consistent approach across organismal sizes that provides unprecedented resolution to measure changes in genomic composition between communities, we report a pan-ocean, size-dependent plankton biogeography overlying regional heterogeneity. We found robust evidence for a basin-scale impact of transport by ocean currents on plankton biogeography, and on a characteristic timescale of community dynamics going beyond simple seasonality or life history transitions of plankton.

Editor's evaluation

Richter and colleagues present an impressive analysis of metagenomic, OTU and imaging data collected from >100 ocean locations worldwide, with the purpose of elucidating the role of large-scale currents on global-scale marine plankton biogeography. The topic is exciting and timely.

https://doi.org/10.7554/eLife.78129.sa0

eLife digest

Oceans are brimming with life invisible to our eyes, a myriad of species of bacteria, viruses and other microscopic organisms essential for the health of the planet. These ‘marine plankton’ are unable to swim against currents and should therefore be constantly on the move, yet previous studies have suggested that distinct species of plankton may in fact inhabit different oceanic regions. However, proving this theory has been challenging; collecting plankton is logistically difficult, and it is often impossible to distinguish between species simply by examining them under a microscope. However, within the last decade, a research schooner called Tara has travelled the globe to gather thousands of plankton samples. At the same time, advances in genomics have made it possible to identify species based only on fragments of their DNA sequence.

To understand the hidden geography of plankton communities in Earth’s oceans, Richter et al. pored over DNA from the Tara Oceans expedition. This revealed that, despite being unable to resist the flow of water, various planktonic species which live close to the surface manage to occupy distinct, stable provinces shaped by currents. Different sizes of plankton are distributed in different sized provinces, with the smallest organisms tending to inhabit the smallest areas. Comparing DNA similarities and speeds of currents at the ocean surface revealed how these might stretch and mix plankton communities.

Plankton play a critical role in the health of the ocean and the chemical cycles of planet Earth. These results could allow deeper investigation by marine modellers, ecologists, and evolutionary biologists. Meanwhile, work is already underway to investigate how climate change might impact this hidden geography.

Introduction

Plankton communities are constantly on the move, transported by ocean currents. Transport involves both advection and mixing. While being advected by currents, plankton can be influenced by multiple processes, both physicochemical (fluxes of heat, light,and nutrients Moore et al., 2013) and biological (species interactions, life cycles, behavior, and acclimation/adaptation [Armbrust, 2009; Flynn et al., 2015]), which act across various spatial and temporal scales. In turn, plankton impact seawater physicochemistry while they are being advected (Moore et al., 2013). The community composition and biogeochemical properties of a water mass at a given site are also partially dependent on its history of mixing with neighboring water masses during transport. These intertwined processes occurring along transport by currents form the pelagic seascape (Pittman, 2017; Figure 1—figure supplement 1a). Due to logistical and analytical constraints, previous studies on plankton distribution have tended to be geographically or taxonomically restricted (Hanson et al., 2012; Martiny et al., 2006; McGowan and Walker, 1979; Reygondeau and Dunn, 2019; Roux et al., 2016), to focus on individual factors such as nutrient or light availability (Longhurst, 2006; Tagliabue et al., 2017), or have investigated the influence of transport on specific nutrients (Letscher et al., 2016) or types of planktonic organisms (Hellweger et al., 2014; Villarino et al., 2018; Wilkins et al., 2013). We set out to test for the first time at genomic resolution the hypotheses that a global-scale plankton biogeography exists and that it is closely linked to transport via large-scale ocean currents. To do this, we integrated metagenomic data from epipelagic samples collected during the Tara Oceans expedition (Karsenti et al., 2011) with in situ and satellite environmental metadata and large-scale ocean circulation simulations. Our sampling largely focused on open ocean sites located in the main gyres, but also included other areas with distinct oceanographic features, such as coastal upwelling zones and lagoons (Figure 1—figure supplement 1b). We chose to study biogeographic patterns along large-scale currents in the principal oceanic gyres, with counterpoints to other oceanographic features in which the influence of ocean transport by the main currents is likely to be relatively weaker, such as upwellings. Our analyses focus on the sunlit (epipelagic) layer of the ocean (subsurface and deep chlorophyll maximum [DCM] samples); at lower depths (the mesopelagic and below), the relationship between plankton community composition and ocean transport may be different than at the surface. The use of DNA as a primary proxy for global plankton diversity has several important advantages over classical morphology-based analyses, notably because methods can be standardized and applied across the entire range of plankton sizes, from viruses through prokaryotes and protists to animals.

Results

DNA sequence data was obtained from samples collected at 113 worldwide stations during the Tara Oceans expedition. Each plankton community sample was sequenced for up to six operational size fractions: one virus-enriched (0–0.22 μm; Roux et al., 2016), one prokaryote-enriched (either 0.22–1.6 or 0.22–3 μm; Sunagawa et al., 2015), and four eukaryote-enriched (0.8–5 μm, 5–20 μm, 20–180 μm, and 180–2000 μm; de Vargas et al., 2015; Figure 1—figure supplement 1b). These size fractions are operational in that each contains the organisms captured between two physical filters of a given size (either filters or nets, depending on size fraction [Pesant et al., 2015]). We estimated the average percentage of metagenomic sequence reads in samples from the prokaryote-enriched 0.22–1.6/3 µm size fractions that were of eukaryotic origin to be 12%, and the average percentage of reads in eukaryote-enriched size fractions that were of prokaryote origin as follows: 0.8–5 μm: 39%, 5–20 μm: 23%, 20–180 μm: 3%, 180–2000 μm: 5% (see Materials and methods). The Tara Oceans project produced a total of 24.2 terabases of metagenomic sequence reads (Supplementary Table 1). To account for uneven sequencing depth among samples, we analyzed a subset of 11.9 terabases, after testing that this subset accurately represented the complete data set (see Materials and methods). We also analyzed operational taxonomic units (OTUs, representing groups of genetically related organisms), consisting of previously published viral populations (Brum et al., 2015) previously derived bacterial 16S miTAGs (Sunagawa et al., 2015), and 738 million 18S V9 ribosomal DNA marker sequences in the eukaryote-enriched size fractions, enlarging a previously described Tara Oceans data set (de Vargas et al., 2015). We used metagenomic data and OTUs independently to compute pairwise comparisons of plankton community dissimilarity (as proxies for β-diversity). Metagenomic dissimilarity highlighted, at species and subspecies resolution, differences in the genomic identity of organisms between stations. Our metagenomic sampling resulted in pairwise metagenomic dissimilarities that likely represent an overestimate of β-diversity (Appendix 1). However, we applied an identical procedure to compute metagenomic dissimilarity for all size fractions (correlations among fractions ranged Spearman’s ρ 0.6–0.9, p≤10⁻⁴, Figure 1—figure supplement 2). The more thoroughly sampled OTU dissimilarity, in contrast, incorporated more numerous rare taxa within the plankton, but at genus or higher-level taxonomic resolution (de Vargas et al., 2015). Metagenomic and OTU dissimilarities were correlated for all size fractions (Spearman’s ρ 0.53–0.97, p≤10⁻⁴, Figure 1—figure supplement 2), indicating that both proxies, although characterized by different sampling levels and taxonomic resolution, provided coherent and complementary estimates of β-diversity (Appendix 1). We performed subsequent analyses using both measures, which produced consistent results. The taxonomic composition of these Tara Oceans samples, not discussed here, is instead presented in a parallel analysis of the spatial dynamics of planktonic eukaryotes, based on the same environmental data and large-scale ocean circulation simulations (Sommeria-Klein et al., 2021).

We focus on analyses of metagenomic dissimilarity here, with accompanying results for OTU dissimilarity presented in Supplementary Figures, and validation by comparison to abundance differences among metagenome-assembled genomes (MAGs; Delmont et al., 2022a) and to more traditional imaging data presented independently below.

Globally, we observed substantial metagenomic dissimilarities (average pairwise dissimilarity >80%) between sampled stations (including adjacent sites) across all size fractions (Figure 1—figure supplement 3a, Appendix 1). The resulting portrait is of a heterogeneous oceanic ecosystem at all scales separating Tara Oceans sampling sites (even those separated by only a few kilometers), dominated by a small number of abundant and cosmopolitan taxa, with a much larger number of less abundant taxa found at fewer sampling sites (Figure 1—figure supplement 3b-e), corroborating other studies (de Vargas et al., 2015).

Overlying this heterogeneity, we found robust evidence for the existence of large-scale biogeographical patterns within all plankton size classes using two complementary analyses of dissimilarity among samples (Figure 1a, Figure 1—figure supplement 4a-f, Figure 1—figure supplement 5, Appendix 2). First, we grouped metagenomic samples within each size fraction into ‘genomic provinces’ via hierarchical clustering (Figure 1—figure supplement 6). Second, we derived colors for each sample based on a principal coordinates analysis (PCoA-RGB; see Materials and methods) in order to visualize transitions in community composition within and between genomic provinces. Genomic provinces were mostly composed of geographically clustered stations (consistent with previous studies documenting patterns in plankton biogeography [Hanson et al., 2012; Martiny et al., 2006; McGowan and Walker, 1979; Roux et al., 2016; Figure 1a, Figure 1—figure supplement 4a-f]). Although the large majority of our samples were located in oceanic gyres, samples located in physically distant zones but with shared environmental conditions, such as oceanic upwellings, also grouped together (e.g. genomic province B6 in the bacterial-enriched size fraction). Genomic provinces of smaller plankton (viruses, bacteria, and eukaryotes <20 µm), with some exceptions (e.g. genomic province B5), tended to be limited to a single ocean basin and to approximately correspond to Longhurst biogeochemical provinces (BGCPs; Longhurst, 2006; Figure 1—figure supplement 4a-d; Appendix 3). In contrast, provinces of larger plankton (micro- and mesoplankton, >20 µm) spanned multiple basins (Figure 1—figure supplement 4e-f, Appendix 4).

Figure 1 with 7 supplements see all

Download asset Open asset

Plankton biogeography, environmental variation, and ocean transport among *Tara* Oceans stations.

Major currents are represented by solid arrows. (a) Genomic provinces of *Tara* Oceans surface samples for the 0.8–5 µm size fraction, each labeled with a letter prefix (‘C’ represents the 0.8–5 µm size fraction) and a number; samples not assigned to a genomic province are labeled with ‘-’. Maps of all six size fractions and including deep chlorophyll maximum samples are available in Figure 1—figure supplement 4. Station colors are derived from an ordination of metagenomic dissimilarities; more dissimilar colors indicate more dissimilar communities (see Materials and methods). (b) Stations colored based on an ordination of temperature and the ratio of NO₃ + NO₂ to PO₄ (replaced by 10⁻⁶ for three stations where the measurement of PO₄ was 0) and of NO₃ + NO₂ to Fe. Colors do not correspond directly between maps; however, the geographical partitioning among stations is similar between the two maps. (c) Simulated trajectories corresponding to the minimum travel time (T_min) for pairs of stations (black dots) connected by T_min <1.5 years. Directionality of trajectories is not represented.

These large-scale biogeographical patterns derived from metagenomes were linked to environmental parameters including nutrients and temperature. Seawater surface temperature was significantly different among genomic provinces for all plankton size classes (Kruskal-Wallis test, p<10⁻⁵), corroborating previous results for prokaryotes (Sunagawa et al., 2015), whereas other environmental conditions were significantly different only with respect to specific size classes (Figure 1—figure supplement 7). The geography of combined nutrient and temperature variations resembled the biogeography of smaller plankton size classes (Figure 1a–b, Figure 1—figure supplement 4a-d,h), whereas temperature alone more closely matched the distribution of larger plankton (Figure 1—figure supplement 4e,f,i), potentially reflecting different ecological constraints.

Plankton biogeographical patterns suggested a particular role for large-scale surface transport (a core component of the seascape) in the emergence of spatial patterns of plankton community composition (as previously proposed [Clayton et al., 2013]), as many genomic provinces were spatially consistent with ocean basin-scale circulation patterns (such as western boundary currents or major subtropical gyres [Talley et al., 2011; Figure 1a, Figure 1—figure supplement 4a-f]). To investigate whether plankton dynamics are related to ocean current timescales, we analyzed community metagenomic composition differences between sampled stations in light of the corresponding transit time, which has previously been suggested as the relevant factor for studying dispersal mechanisms (Wilkins et al., 2013). We inferred the characteristic timescale of main transport paths between stations from trajectories computed with the physically well-constrained MITgcm ocean model (see Materials and methods), which takes into account directionalities (Watson et al., 2010) and meso- to large-scale circulation, potential dispersal barriers, and mixing effects (Goetze et al., 2017; Mousing et al., 2016). For this we used the minimum travel time (Jönsson and Watson, 2016; T_min) between pairs of Tara stations. These trajectories corresponded to the dominant paths that transport the majority of water volume and its contents (e.g. heat, nutrients, and plankton; Figure 1c). For all plankton size classes, community composition differences between stations were significantly correlated to travel time (Figure 2—figure supplement 1).

Because the relationships between metagenomic dissimilarities and T_min are complex (Figure 2—figure supplement 1), global correlations do not necessarily accurately summarize the relationship between communities and currents. To provide more detail on the relationship, we examined cumulative correlation, namely, correlations between community dissimilarity and T_min computed for an increasing range of T_min, which can directly reveal the time window during which plankton dynamics are strongly correlated to ocean current timescales. Cumulative correlation values were maximal for pairs of stations separated by T_min <~1.5 years for all size classes, with correlation values (Spearman’s ρ 0.45–0.71 depending on size class, p≤10^–4; Figure 2a, Figure 3—figure supplement 1) far exceeding those based on previous studies of morphological and/or metabarcode data (Villarino et al., 2018) or considering geographic distance rather than travel time (Louca et al., 2016). These high correlations between metagenomic dissimilarity and T_min for travel times up to 1.5 years, which correspond well with the average time to travel across a basin or gyre (Lumpkin and Johnson, 2013), hence reveal measurable plankton community dynamics on time scales far longer than typical plankton growth rates or life cycles. In contrast, no such unimodal pattern was found for correlations between metagenomic dissimilarity and geographic distance (without traversing land; Figure 3—figure supplement 1f).

Figure 2 with 2 supplements see all

Download asset Open asset

Metagenomic dissimilarity and travel time of plankton are maximally correlated up to ~1.5 years.

(a) Spearman rank-based correlation by size fraction between metagenomic dissimilarity and minimum travel time along ocean currents (T_min) for pairs of *Tara* Oceans samples separated by a minimum travel time less than the value of T_min on the x-axis. Brown line: 0–0.2 µm size fraction, red: 0.22–1.6/3 µm, blue: 0.8–5 µm, green: 5–20 µm, purple: 20–180 µm, orange: 180–2000 µm. Shaded colored areas represent 95% CI. T_min >1.5 years is shaded in gray. See plots for operational taxonomic unit (OTU) dissimilarity in Figure 3—figure supplement 1. (b) Pairs of *Tara* stations connected by T_min <1.5 years in blue/black and >1.5 years in gray. Shading reflects metagenomic similarity from the 0.8–5 μm size fraction. (c) The relationship of metagenomic similarity to T_min with an exponential fit (black line, gray 95% CI), for pairs of surface samples in the 0.8–5 μm size fraction within the North Atlantic and Mediterranean current system (see map and plots for other size fractions and OTUs in Figure 2—figure supplement 2, and Appendix 1 for a discussion of metagenomic similarity).

We compared our analyses of metagenomic data to those based on more traditional zooplankton imaging data collected for the same Tara Oceans samples. β-diversity calculated from zooplankton imaging was correlated with metagenomic dissimilarity (Spearman’s ρ between 0.32 and 0.60; Figure 1—figure supplement 2), indicating that the two data sources provide concordant measurements of variation in plankton community composition. However, correlations with ocean transport time were far weaker for zooplankton imaging data than for metagenomic data from all organismal size fractions (Figure 3—figure supplement 1). We interpret this as being a result of the expected significantly lower resolution in imaging data as compared to metagenomic data (a similar difference of resolution in OTU data versus metagenomic data is discussed in Appendix 1). Finally, we also confirmed our metagenome sequence read comparison-based results by comparing them to β-diversity among sampling sites using a collection of MAGs, which are likely to represent the most abundant genomes, from the 20–180 µm size fraction (the size fraction in which the largest proportion of metagenomic reads were mapped to MAGs, 18.4%; Delmont et al., 2022a). Metagenomic and MAG β-diversity were highly correlated (Spearman’s ρ 0.94) and consequently they displayed similar biogeographical patterns (Figure 1—figure supplement 4e,g).

Up to ~1.5 years of travel time, the timescale of large-scale transport is therefore the appropriate framework for studying differences in plankton genomic community composition (Figure 2b). The fact that simulated transport times and metagenomic dissimilarity were correlated despite a 3-year pan-season sampling campaign, which could be considered to weaken our inference, suggests instead that a large-scale impact of the seascape promotes the existence of a biogeographical structure at a large spatial scale that is resilient to seasonal or other smaller spatiotemporal variations (across all size fractions, genomic provinces consist of stations sampled over an average of 4.7±2.8 different months and 2.7±1.2 different seasons, adjusted for hemisphere). Consistent with our results, seasonal variations have previously been shown to have minor effects on the boundary positions of BGCPs based on satellite data, but not enough to affect the overall pattern of ocean regionalization (Reygondeau et al., 2013).

Differences in environmental conditions for pairs of stations also covaried (although less strongly) with transit time for T_min <~1.5 years (Figure 3). This indicates that changes in environmental conditions and plankton community composition are concurrent along large-scale oceanic current systems. In our data, beyond ~1.5 years of transport, correlations of T_min with metagenomic dissimilarity decreased (Figures 2a and 3, Figure 3—figure supplement 1a-e), meaning the signature of transport in the timescale of large-scale diversity changes weakened and travel time therefore becomes a less appropriate context to study β-diversity. A similar trend was observed for the correlation between T_min and nutrient concentrations, whereas temperature, the gradients of which are mostly dictated by Earth-scale processes that are unaffected by plankton communities, remained well correlated for longer transit times (Figure 3).

Figure 3 with 1 supplement see all

Download asset Open asset

Plankton travel time, metagenomic dissimilarity, and environmental differences show different temporal patterns of pairwise correlation.

Spearman rank-based correlations between metagenomic dissimilarity and minimum travel time (T_min, blue), metagenomic dissimilarity and differences in NO₃ + NO₂, PO₄, and Fe (purple), metagenomic dissimilarity and differences in temperature (turquoise), T_min and differences in NO₃ + NO₂, PO₄, and Fe (purple, dashed), and T_min and differences in temperature (turquoise, dashed) for pairs of *Tara* Oceans samples separated by a minimum travel time less than the value of T_min on the x-axis. Shaded regions represent SEM. Correlations represent averages across four of six size fractions represented in Figure 2a; the 0–0.2 µm and 5–20 µm size fractions are excluded due to a lack of samples at the global level. Individual size fractions, partial correlations, and correlations with operational taxonomic unit data are in Figure 3—figure supplement 1. Color palette is from microshades (https://github.com/KarstensLab/microshades) (Dahl et al., 2022).

Together, these analyses suggest the existence in the seascape of biogeochemical continua stretched by currents on the basin scale with predictable, interlinked changes in environmental conditions and plankton community composition (Appendix 5). It has previously been posited that transport could generate continuous transitions between niches based on physical processes (Lévy et al., 2014), but it was not anticipated that plankton dynamics would be governed on the time and length scales of main ocean currents. Moreover, beyond ~1.5 years, the correlation of metagenomic dissimilarity with differences in temperature increased while that with differences in nutrients decreased (Figure 3, Figure 3—figure supplement 1a-e), although both of these correlations with metagenomic dissimilarity remained strong on these time scales. This might be related to distant Tara Oceans stations experiencing similar oceanographic phenomena (notably temperature), e.g., upwelling zones (stations 67, 92, and 135; Figure 1—figure supplement 1), producing generally similar environmental conditions.

Discussion

We present the following hypothesis as a potential mechanism for the partitioning of the global ocean into genomic provinces. The relatively large separation (in terms of transport time and season) among sampling stations allowed us to detect the large-scale effects of ocean circulation, which are superimposed on smaller-scale effects such as local patchiness and seasonality (as previously observed Longhurst, 2006; Reygondeau et al., 2013). Within ocean basins, as the intertwined dynamics of plankton and chemistry continuously occur along transport, smooth variations have emerged due to the periodic recirculation of within-basin currents. This leads to stable, continuous patterns of changes in community structure and nutrient concentrations, and also explains how temporally stable genomic provinces can exist in the face of ocean currents. Among ocean basins, depending on the sensitivity to the environment of a plankton community, higher heterogeneity in environmental conditions across different circulation patterns can disrupt the equilibrium of seascape processes within a given continuum, leading to a global delimitation into distinguishable ecological continua among different large-scale current systems, resulting in the genomic provinces that we detected.

The existence of a size-class-dependent (smaller or larger than 20 µm) structure of plankton geography indicates that the continua that we observe vary among size fractions because of different reactions of organisms within the seascape (i.e. the interplay among organismal biology, nutrients, and local environmental conditions), in agreement with a parallel survey based on taxonomic groups (Sommeria-Klein et al., 2021). In the case of the North Atlantic current system (including the Mediterranean Sea), a simple exponential fit of metagenomic dissimilarity along T_min for T_min <~1.5 years (Figure 2c) revealed that the smaller size classes (<20 µm) had a shorter metagenomic turnover time (ca. 1 year) than larger plankton (ca. 2 years; Figure 2—figure supplement 2, Appendix 6). At global geographical scales, the genomic provinces of small size classes, which are enriched in phytoplankton (de Vargas et al., 2015; Frémont et al., 2022; Sommeria-Klein et al., 2021; Sunagawa et al., 2015), corresponded in our data with differences in environmental parameters such as nutrient levels (Figure 1b, Figure 1—figure supplement 7) that are often constrained by regional oceanographic processes (Sarmiento and Gruber, 2006). On the other hand, genomic provinces of larger plankton, enriched in heterotrophic and symbiotic organisms (de Vargas et al., 2015; Frémont et al., 2022; Sommeria-Klein et al., 2021), were less coupled with geochemical parameters and were more related to global scale gradients and circulation patterns, notably major latitudinal temperature zones (Martin et al., 2021) or the separation between Atlantic and Indo-Pacific large-scale surface circulations (Figure 1—figure supplement 4e,f,i). These divergent effects were also evident in comparisons of metagenomic dissimilarity with variations in either nutrient concentrations or temperature (Figure 3—figure supplement 1b). For smaller plankton, correlations with differences in nutrient concentrations were strongest, at T_min up to ~1.5 years, while for larger plankton, correlations were strongest with temperature variations, for T_min beyond ~1.5 years. Larger plankton are dominated by eukaryotes, often multicellular, with much longer life cycles, potentially leading to slower community turnover. Organisms with long life cycles, on the order of several months or years, can be transported through basins spanning multiple biogeochemical niches in which they may encounter strong environmental variability; this trend was also detected in a taxonomy-based analysis accounting for differences in both body size and ecology among groups (Sommeria-Klein et al., 2021). As observed here, their biogeography is less affected by nutrient limitation and rather depends on large-scale temperature gradients among basins. This dependence may be linked to the known correlation between body size and organismal metabolic rate (Ikeda, 1985). Conversely, variants within populations of organisms with short life cycles have the capacity to increase their relative abundance within restricted ecological niches to which they are adapted. This difference, detectable at genomic resolution, may not be picked up in analyses performed using biological traits with less resolution. These results indicate a significant size-based decoupling within planktonic food webs. For example, large size predators will encounter different prey when transiting through the genomic provinces of small sized organisms (see Appendix 4).

In this study, we provide genomic evidence for an organism size-dependent global-scale open ocean plankton biogeography shaped by currents. Using analysis of standardized metagenomic data together with environmental and physical data, we reveal that, in a background of significant local patchiness, the integration of seascape physical, chemical, and biological processes over time and space produces a quasi-stationary biological partitioning of the oceans that supersedes short-term variability and seasonal cycles, ultimately generating global biogeographical patterns. Although the strong cross-coupling among our metagenomic, environmental, and physical data prevents a systematic disentangling of their various influences, we hypothesize that transport by ocean currents acts essentially as a conveyor on which interacting environmental and biological effects are layered. In this hypothesis, direct effects of currents (such as turbulent diffusivity) on plankton composition are secondary, and instead environmental and biological effects occurring during transport result in the emergence of a global plankton biogeography in the surface ocean. Future studies both on smaller spatiotemporal scales or specific oceanographic features (e.g. coastal regions) and on the global-scale constraints and influences on the seascape itself could lead to a more detailed understanding of plankton dynamics. The ocean is a three-dimensional system in which the primary axis of variation is depth. Our metagenomic data and simulations were limited to the sunlit layer of the ocean and therefore capture one part of seascape dynamics. At greater ocean depths (i.e. in the mesopelagic and below), the relationship between ocean transport and plankton community composition may differ from the one we describe at the surface. In addition, an understanding of plankton biogeography is a key component of future studies on the function of the genes they express; analyses that synergize both characterizations will refine the definition and ecological interpretation of plankton communities within genomic provinces. Overall, our work shows that studies of the dynamics of plankton communities must consider the critical influence of ocean currents in stretching, on the scale of basins, the distribution of both planktonic organisms and the physicochemical nature of the water mass in which they reside. We also demonstrate that the combination of ocean circulation modeling with the use of metagenomic DNA as a tracer of plankton communities provides a resolution above the minimum necessary for assessing the role of transport in community turnover over time and space. The open ocean planktonic ecosystem is fundamentally different in many ways from other major planetary ecosystems, and this study provides a basis to understand and potentially predict the structuring of the ocean ecosystem in a scenario of rapid environmental and current system changes (Beaugrand et al., 2002; Caesar et al., 2018; Frémont et al., 2022).

Materials and methods

Sampling, sequencing, and environmental parameters

Request a detailed protocol

Sampling, size fractionation, measurement of environmental parameters and associated metadata, DNA extraction, and metagenomic sequencing were conducted as described previously (Alberti et al., 2017; Pesant et al., 2015). Samples were collected at 113 Tara Oceans stations for up to six size fractions (0–0.2, 0.22–1.6/3, 0.8–5, 5–20, 20–180, 180–2000 µm; Figure 1—figure supplement 1b; Supplementary Table 1) and two depths (subsurface and DCM). The prokaryote-enriched size fraction was collected either a 0.22–1.6 µm or 0.22–3 µm filter (Pesant et al., 2015; Sunagawa et al., 2015). For technical reasons, not all size fractions were sequenced for all stations (see Appendix 7 for a summary of why this does not affect our principal conclusions).

We used physicochemical data measured in situ during the Tara Oceans expedition (depth of sampling, temperature, chlorophyll a, phosphate, nitrate + nitrite concentrations), supplemented with simulated values for iron and ammonium (using the MITgcm Darwin model described below in ‘Ocean circulation simulations’), day length, and 8-day averages calculated for photosynthetically active radiation (PAR) in surface waters (AMODIS, https://modis.gsfc.nasa.gov). In order to obtain PAR values at the DCM, we used the following formula (Morel et al., 2007):

PAR(Z) = PAR(0) × exp(−k × Z),

x = log(Chl),

log(Z) = 1.524 – 0.426x − 0.0145x^2 + 0.0186x^3,

k = –ln (0.01)/Z,

in which k is the attenuation coefficient, and Z is the depth of the DCM (in meters). Other data, such as silicate and the (nitrate + nitrite)/phosphate ratio, were extracted from the World Ocean Atlas 2013 (WOA13 version 2, https://www.nodc.noaa.gov/OC5/woa13/), by retrieving the annual mean values at the closest available geographical coordinates and depths to Tara sampling stations. For temperature and nitrate + nitrite, we calculated seasonality indexes (SI) from monthly WOA13 data. For each sample, the index is the annual variation of the parameter (max - min) at this location divided by the highest variation value among all samples.

A list of samples, metagenomic and metabarcode sequencing information, and associated environmental data are available in Supplementary Tables 1–2.

Calculation of metagenomic community dissimilarity

Request a detailed protocol

Metagenomic community distance between pairs of samples was estimated using whole shotgun metagenomes for all six size fractions. We used a metagenomic comparison method (Simka Benoit et al., 2016) that computes standard ecological distances by replacing species counts by counts of DNA sequence k-mers (segments of length k). Within Simka, we filtered regions of low complexity with read-shannon-index set to 1.5. K-mers of 31 base pairs (bp) derived from the first 100 million reads sequenced in each sample (or the first 30 million reads for the 0–0.2 µm size fraction) were used to compute a similarity measure between all pairs of samples within each organismal size fraction. Based on a benchmark of Simka, we selected 100 million reads per sample (or 30 million for the 0–0.2 µm fraction) because increasing this number did not produce a qualitatively different set of results, and to ensure that the same number of reads was used in each pairwise comparison within a size fraction. Nearly all samples in our data set had at least 100 million reads (or at least 30 million for the 0–0.2 µm fraction; Supplementary Table 1).

We estimated β-diversity for metagenomic reads with the following equation within Simka:

Metagenomic β-diversity = (b + c)/(2a + b + c)

where a is the number of distinct k-mers shared between two samples, and b and c are the number of distinct k-mers specific to each sample. We represented the distance between each pair of samples on a heatmap using the heatmap.2 function of the R-package (R Core Team T, 2017) gplots_2.17.0 (Warnes et al., 2015). The dissimilarity matrices we produced for each plankton size fraction (on a scale of 0 = identical to 100 = completely dissimilar) are available as Supplementary Tables 3–8.

Calculation of OTU-based community dissimilarity

Request a detailed protocol

Within the 0–0.2 µm size fraction, we used previously published viral populations (equivalent to OTUs; Brum et al., 2015) and viral clusters (analogous to higher taxonomic levels; Roux et al., 2016) based on clustering of protein content. For the 0.22–1.6/3 µm size fraction, we used previously derived miTAGs based on metagenomic matches to 16S ribosomal DNA loci and processed them as described (Sunagawa et al., 2015). For the four eukaryotic size fractions, we added additional samples to a previously published Tara Oceans metabarcoding data set and processed them using the same methods (de Vargas et al., 2015; also described at DOI: 10.5281/zenodo.15600).

We calculated OTU-based community dissimilarity for all size fractions as the Jaccard index based on presence/absence data using the vegdist function implemented in vegan 2.4–0 (Oksanen et al., 2019) in the software package R. The dissimilarity matrices we produced for each plankton size fraction (on a scale of 0 = identical to 100 = completely dissimilar) are available as Supplementary Tables 9–14.

Calculating distances of environmental parameters

Request a detailed protocol

We calculated Euclidean distances (Legendre and Legendre, 2012) for physicochemical parameters. Each were scaled individually to have a mean of 0 and a variance of 1 and thus to contribute equally to the distances. Then the Euclidean distance between two stations i and j for parameters P was computed as follows:

E D (i, j, P) = \sqrt{\sum_{p \in P} {(x_{i p} - x_{j p})}^{2}}

RGB encoding of environmental positions

Request a detailed protocol

We color-coded the position of stations in environmental space for Figure 1b and Figure 1—figure supplement 4h as follows. First, environmental variables were power-transformed using the Box-Cox transformation to have Gaussian-like distributions to mitigate the effect of outliers and scaled to have zero mean and unit variance. We then performed a principal components analysis (PCA) with the R command prcomp from the package stats 3.2.1 (R Core Team T, 2017) on the matrix of transformed environmental variables and kept only the first three principal components. Finally, we rescaled the scores in each component to have unit variance and decorrelated them using the Mahalanobis transformation. Each component was mapped to a color channel (red, green, or blue) and the channels were combined to attribute a single composite color to each station. The components (x, y, z) were mapped to color channel values (r, g, b) between 0 and 255 as r = 128 × (1 + x/max[abs(x)]), and similarly for g and b. This map ensures that the global dispersion is equally distributed across the three components and composite colors span the whole color space.

Definition of genomic provinces

Request a detailed protocol

We used a hierarchical clustering method on the metagenomic pairwise dissimilarities produced by Simka for all surface and DCM samples, and multiscale bootstrap resampling for assessing the uncertainty in hierarchical cluster analysis. We focused on metagenomic dissimilarity due to its higher resolution, and confirmed that the patterns found in metagenomic data were consistent when using OTU data (Figure 1—figure supplement 5). We used UPGMA (unweighted pair-group method using arithmetic averages) clustering, as it has been shown to have the best performance to describe clustering of regions for organismal biogeography (Kreft and Jetz, 2010). The R-package pvclust_1.3–2 (Suzuki and Shimodaira, 2006), with average linkage clustering and 1000 bootstrap replications, was used to construct dendrograms with the approximately unbiased p-value for each cluster (Figure 1—figure supplement 6). Because the number of genomic provinces by size fraction was not known apriori, we applied a combination of visualization and statistical methods to compare and determine the consistency within clusters of samples. First, the Silhouette method (Rousseeuw, 1987) was used to measure how similar a sample was within its own cluster compared to other clusters using the R package cluster_2.0.1 (Maechler et al., 2015). The Silhouette coefficient s for a single sample is given as:

s = (b - a) / m a x (a, b)

where a is the mean distance between a sample and all other points in the same class and b is the mean distance between a sample and all other points in the next nearest cluster. We used the value of s, in addition to bootstrap values, to partition each tree into genomic provinces (see Appendix 2 for further details on statistical validation of genomic provinces). Additionally, we used the Radial Reingold-Tilford Tree representation from the JavaScript library D3.js (https://d3js.org/) (Bostock et al., 2011) to visualize sample partitions from the dendrogram. Single samples were not considered as genomic provinces.

In a complementary approach, we performed a PCoA with the R command cmdscale (eig = TRUE, add = TRUE) from the package stats 3.2.1 (R Core Team T, 2017) on the matrices of pairwise metagenomic dissimilarities calculated by Simka (or OTU dissimilarity measured with the Jaccard index) within each size fraction and kept only the first three principal coordinates. We then converted those coordinates to a color using the RGB encoding described above, with one modification: scaling factors λ_r, λ_g, and λ_b were calculated as the ratios of the second and third eigenvalues to the first (dominant) eigenvalue to ensure that the dispersion of stations along each color channel reproduced the dispersion of the stations along the corresponding principal component (the ratio for the color corresponding to the dominant eigenvalue is 1). The components (x, y, z) were then mapped to color channel values (r, g, b) between 0 and 255 as r = 128 × (1 + λ_cx/max[abs(x)]), where λ_c is the ratio of the eigenvalue of color c to the dominant eigenvalue.

We represented number and PCoA-RGB color of genomic provinces for each sample on a world map (Figure 1, Figure 1—figure supplement 4a-f) generated with the R packages maps_3.0.0.2 (Becker et al., 2018), mapproj 1.2–4 (McIlroy et al., 2015), gplots_2.17.0 (Warnes et al., 2015), and mapplots_1.5 (Gerritsen, 2014). We also plotted phosphate and temperature (Figure 1—figure supplement 4a-f) obtained from the Csiro Atlas of Regional Seas (CARS2009, http://www.cmar.csiro.au/cars) using the phosphate_cars2009.nc and temprerature_cars2009a.nc files and the R package RNetCDF (Ridgway et al., 2002).

Comparison of genomic provinces to previous ocean divisions

Request a detailed protocol

To evaluate the spatial similarity between the clusters obtained in our study for each size fraction and previous biogeographic divisions, we performed an analysis of similarity (ANOSIM, Fathom toolbox, MATLAB). First, we collected coordinates for three spatial divisions at a resolution of 0.5 × 0.5°: biomes, BGCPs (Longhurst, 2006; Reygondeau et al., 2013), and objective global ocean biogeographic provinces (OGOBPs; Oliver and Irwin, 2008). Second, we assigned Tara Oceans stations to biomes, BGCPs, and OGOBPs based on their GPS coordinates. Third, for each size fraction we performed an ANOSIM with the metagenomic dissimilarity matrix calculated by Simka, using biogeographic clusters (biome, BGCP, and OGOBP) as group membership for each station. Each ANOSIM was bootstrapped 1000 times to evaluate the interval of confidence around the strength of the relationships we detected (Figure 1—figure supplement 4a-f).

Environmental differences among genomic provinces

Request a detailed protocol

For each size fraction, we tested which environmental parameters significantly discriminated among genomic provinces (Figure 1—figure supplement 7). A total of 12 parameters characterizing each sample, grouped by genomic provinces, were evaluated with a Kruskal-Wallis test within each size fraction with a significance threshold of p<10^–5. One such parameter, sunshine duration (day length) does not map unambiguously to season, as day lengths coincide in spring and autumn. Ocean biology, chemistry, and stratification often differ between spring and autumn. As such, we provide seasonality indices for temperature and for nitrate + nitrite (described above in the Methods, ‘Sampling, sequencing, and environmental parameters’), which represent annual variation in these environmental parameters, and can help interpret the effects of seasonality on genomic provinces. Selected parameters for each size fraction were then used to perform a PCA of the samples using the R package vegan_1.17–11 (Oksanen et al., 2019). Samples were plotted with the same PCoA-RGB colors used in the genomic province maps above and each genomic province surrounded by a gray polygon. In analyses where Southern Ocean (including Antarctic) stations were considered independently from other stations, the following were considered Southern Ocean stations: 82, 83, 84, 85, 86, 87, 88, and 89.

Ocean circulation simulations

Request a detailed protocol

We derived travel times from the MITgcm Darwin simulation (Clayton et al., 2016) based on an optimized global ocean circulation model from the ECCO2 group (Menemenlis et al., 2008). The horizontal resolution of the model was approximately 18 km, with 1,103,735 total ocean cells. We ran the model for six continuous years in order to smooth anomalies that might occur during any single year. We used surface velocity simulation data to compute trajectories of floats originating in ocean cells containing all Tara Oceans stations and applied the following stitching procedure to generate a large number of trajectories for each initial position. (The use of surface velocity data implies that Ekman transport also influences trajectories within the simulation.)

First, we precomputed a set of monthly trajectories: for each of the 72 months in the data set, we released floats in every ocean cell of the model grid and simulated transport for 1 month. We used a fourth-order Runge-Kutta method with trilinearly interpolated velocities and a diffusion of 100 m²/s.

Second, following previous studies (Hellweger et al., 2014), we stitched together monthly trajectories to create 10,000-year trajectories: for each float released within a 200 km radius of a Tara station, we constructed 1000 trajectories, each 10,000 years long. To avoid seasonal effects, we began by selecting a random starting month. We followed the trajectory of a float released within that month to the grid cell containing its end point at the end of the month. Next, we randomly selected a trajectory starting on the following month (e.g. February would follow January) from that grid cell and repeated until reaching a 10,000-year trajectory.

We searched the resulting 50.8 million trajectories for those that connected pairs of Tara Oceans stations. To ensure robustness of our results, we only included pairs of stations that were connected by more than 1000 trajectories. For each pair of stations, T_min was defined as the minimum travel time of all trajectories (if any) connecting the two stations.

The source code for ocean circulation simulations can be found at https://mitgcm.org/source-code/, which contains installation instructions for the GitHub repository available at https://github.com/MITgcm/MITgcm; Losch, 2022. General configuration information for the high-resolution global calculation can be found in the CVS directory of user contributions: http://wwwcvs.mitgcm.org/viewvc/MITgcm/MITgcm_contrib/ under ‘hi_res_cube’. The specific simulation configuration we used followed (Clayton et al., 2013) and is available at http://wwwcvs.mitgcm.org/viewvc/MITgcm/MITgcm_contrib/high_res_cube/README.cs510?view=log. The travel time matrix we produced (measured in years) is available as Supplementary Table 15. Standard minimum geographic distance without traversing land (Rattray et al., 2016) is available as Supplementary Table 16.

Correlations of β-diversity, T_min, and environmental parameters

Request a detailed protocol

Our correlation analyses were restricted to Tara Oceans samples collected at the surface and did not include DCM samples. We excluded stations that were not from open ocean locations from correlation analyses to avoid sites impacted by coastal processes (those numbered 54, 61, 62, 79, 113, 114, 115, 116, 117, 118, 119, 120, and 121). In analyses where Southern Ocean (including Antarctic) stations were considered independently from other stations, the following were considered Southern Ocean (including Antarctic) stations: 82, 83, 84, 85, 86, 87, 88, and 89. We calculated rank-based Spearman correlations between β-diversity, T_min, and environmental parameters (either differences in temperature or the Euclidean distance composed of differences in NO₃ + NO₂, PO₄, and Fe, see above) for surface samples with a Mantel test with 1000 permutations and a nominal significance threshold of p<0.01. For the correlations presented in Figures 2a and 3 and Figure 3—figure supplement 1, correlation values were derived from pairs of stations connected by T_min up to the value on the x-axis. We calculated partial correlations of metagenomic and OTU dissimilarity and T_min by controlling for differences in temperature and for differences in nutrient concentrations, and partial correlations of dissimilarity with temperature or nutrient variation by controlling for T_min. We calculated rank-based Spearman partial correlations using the standard formula for two variables x1 and x2 and a controlling variable x3: (cor[x1,x2] − cor[x1,x3] × cor[x1,x3])/(sqrt[1 − cor(x1,x3)^2] × sqrt[1 − cor(x2,x3)^2]).

Community turnover in the North Atlantic

Request a detailed protocol

Tara Oceans stations numbered 72, 76, 142, 143, 144, and all stations from 146 to 151 were located along the main current system connecting South Atlantic and North Atlantic oceans and continuing to the strait of Gibraltar. In addition, we included stations 4, 7, 18, and 30 located on the main current system in the Mediterranean Sea (Figure 2—figure supplement 2). As the Tara Oceans samples within the subtropical gyre of the North Atlantic and in the Mediterranean Sea were all collected in winter, seasonal variations should not play a role in the variability in community composition that we observed (see Supplementary Table 2). We calculated genomic e-folding times (the time after which the detected genomic similarity between plankton communities changes by 63%) over scales from months to years based on an exponential fit of metagenomic dissimilarity to T_min with the form y = C₀ e^−x/τ (where C₀ is a constant and τ is the folding time). Exponential fits for size fractions 0–0.2 µm and 5–20 µm were not calculated due to an insufficient number of sampled stations in the North Atlantic (Appendix 6).

The synthetic map (Figure 2—figure supplement 2a) was generated with the R packages maps_3.0.0.2, mapproj 1.2.4, gplots_2.17.0, and mapplots_1.5. We derived dynamic sea surface height from the Csiro Atlas of Regional Seas (CARS2009, http://www.cmar.csiro.au/cars) using the hgt2000_cars2009a.nc file and plotted with the R package RNetCDF.

Imaging methods

Request a detailed protocol

Plankton were also collected using WP2 (200 µm mesh) nets, using vertical tows (0–100 m), and preserved with borax-buffered formaldehyde. Taxonomic classification was performed using the ZooScan imaging system (Gorsky et al., 2010) and identified with an automatic recognition algorithm to the finest possible taxonomic resolution using Ecotaxa (Picheral et al., 2017) . The resulting identifications were manually visualized by taxonomic specialists and either validated or corrected. Resolution of the taxonomic identifications depended on morphological heterogeneity within taxonomic groups. Hence, identifications reached different taxonomic levels, from species to phylum, and most of them reached family level. All images and their taxonomic assignation are accessible within Ecotaxa (https://ecotaxa.obs-vlfr.fr/prj/377). Since all genomic data were collected during day time, we restricted our analysis on day-collected samples. We also discarded non-living objects in our analyses. We estimated β-diversity by calculating Bray-Curtis dissimilarities between pairs of stations based on the relative abundances of each annotated taxonomic unit. Bray-Curtis dissimilarities are available as Supplementary Table 17.

MAGs analysis

Request a detailed protocol

MAG relative abundances in metagenomic samples were retrieved from Delmont et al., 2022a. β-diversity was estimated by calculating the Bray-Curtis dissimilarities between pairs of stations based on the relative abundances of each of the 713 MAGs calculated by read mapping in the metagenomes of size fraction 20–180 µm (the size fraction in which MAGs recruit the largest relative share of all reads). We represented PCoA-RGB color of the Bray-Curtis dissimilarity matrix for each sample on a world map (Figure 1—figure supplement 4g) following the methodology described above. The Spearman ρ correlation coefficient was calculated between MAG-based β-diversity and metagenomic-based β-diversity from the size fraction 20–180 μm. MAG abundances for the 20–180 µm size fraction are available as Dataset 4. MAG-derived Bray-Curtis dissimilarities for the 20–180 µm size fraction are available as Supplementary Table 18.

Estimates of percentages of prokaryote reads in eukaryote-enriched size fractions, and vice versa

Request a detailed protocol

We used MAG read mappings from Delmont et al., 2022a, Delmont et al., 2022b to calculate percentages of prokaryote reads in eukaryote-enriched size fractions, and vice versa. For each eukaryote and prokaryote MAG and for each sample, we used the proportion of reads unambiguously mapped to the MAG. For each sample, we next obtained estimates of the percentages of prokaryote or eukaryote reads by summing these relative counts for all prokaryote (bacterial or archaeal) or all eukaryote MAGs. We discarded samples for which both prokaryote and eukaryote relative counts had a zero or negligible sum (<1% of total reads). For the remaining 538 samples, the average percentage represents the average across all samples within a given size fraction.

Appendix 1

Comparison of metagenomes and OTUs

Metagenomic comparisons reflect fine-scale differences in genome content at the community level as a function of diversity, genome size, and organismal abundance, and also depend on the rate of evolution of each specific lineage. With exhaustive sampling, metagenomic dissimilarity could theoretically distinguish among genomes in a sample separated by a single mutation. However, our metagenomic sequencing level was likely not able to reach saturation due to the number of genomes per sample and their putative large size (metatranscriptomes, which contain fewer sequences per species than do metagenomes, did not reach saturation within Tara Oceans samples Carradec et al., 2018). For example, if for a pair of samples we sequence 50% of the total amount of the unique genomic DNA present, we expect the maximum similarity of the two samples to be roughly 25% (0.5 × 0.5). Therefore, the pairwise metagenomic dissimilarities we calculated between samples probably reflected a combination of genomic differences weighted toward more abundant organisms. In contrast, OTUs, obtained by sequencing single marker genes, approach biodiversity saturation (de Vargas et al., 2015; Roux et al., 2016; Sunagawa et al., 2015). However, OTU resolution depends on the choice of the marker to be used, the threshold of similarity for the marker, and its lineage-specific substitution rate, and may therefore confound evolutionarily and/or ecologically distant organisms (Piganeau et al., 2011; Seeleuthner et al., 2018; Vannier et al., 2016; Worden et al., 2009; Wu et al., 2015). We observed a significant agreement between the two proxies (Figure 1—figure supplement 2), although dissimilarities based on OTUs were generally lower than those computed from metagenomic data (Figure 1—figure supplement 3 a).

Analyses of plankton biogeography produced consistent results based on metagenomic and OTU data (Figure 1—figure supplement 4, Figure 1—figure supplement 5, Figure 2—figure supplement 1, Figure 3—figure supplement 1). For simplicity, in the main text, we chose to highlight results based on metagenomes rather than on OTUs for three reasons. First, the metagenomic sequencing protocol and subsequent measurement of dissimilarity were uniform across size fractions, whereas OTUs were defined differently for the viral-enriched, bacterial-enriched and eukaryote-enriched size fractions (Materials and methods). Second, the biogeographical patterns we obtained (see below) may be more evident in comparisons among metagenomic sequences (our data source in identifying genomic provinces), as genomes accumulate single-base changes and other variants more quickly than a single ribosomal gene marker. Third, β-diversity estimated by metagenomic dissimilarity generally displayed higher correlation values with minimum travel time (T_min; Figure 2—figure supplement 1).

Appendix 2

Robustness of genomic provinces

We assessed the robustness of genomic provinces in five separate ways. First, we tested five different hierarchical clustering algorithms from the R-package pvclust_1.3–2 (Suzuki and Shimodaira, 2006; UPGMA; McQuitty’s method; Complete linkage; Ward’s method; Single linkage) on the metagenomic pairwise dissimilarities produced by Simka separately for the six organismal size fractions, followed by multiscale bootstrap resampling. We used the cophenetic correlation coefficient from the R-package dendextend_1.5.2 (Galili, 2015) to measure how accurately the dendrograms produced by each method preserved the pairwise distances within the input dissimilarity matrices (Sneath and Sokal, 1973; Sokal and Rohlf, 1962). The ranking of the cophenetic correlation coefficient for different clustering methods within each size fraction (Supplementary Table 19) was consistent with a published large-scale methodological comparison of clustering methods for biogeography, which considered UPGMA agglomerative hierarchical clustering to have consistently the best performance (Kreft and Jetz, 2010). Second, we compared clustering results among all size fractions using Baker’s Gamma Index (Baker and Hubert, 1975) from the R-package corrplot_0.77 (Wei and Simko, 2016), which is a measure of association (similarity) between two trees based on hierarchical clustering (dendrograms). The Baker’s Gamma Index is defined as the rank correlation between the stages at which pairs of objects combine in each of the two trees. For each type of correlation, the UPGMA was consistently the most correlated with other clustering methods (Supplementary Table 20). This allowed us to conclude, in agreement with previous results (Kreft and Jetz, 2010), that the UPGMA method is likely more robust than the other methods we tested.

Third, we compared the genomic provinces found by our UPGMA hierarchical clustering approach to those found by two different non-hierarchical methods: K-means on the positions found by multidimensional scaling and spectral clustering on the nearest-neighbor graph. Both methods rely on (i) a dissimilarity matrix and (ii) a tuning parameter (dimension of the projection space for K-means, and number of neighbors for spectral clustering). K-means uses the numeric values of the dissimilarities, whereas spectral relies only on their ordering (e.g. community A is closer to B than to C). We compared the genomic provinces to clusters found by K-means and spectral clustering for all values of the tuning parameter using the Rand Index (RI; from the GARI function of the loe R package version 1.1 Terada and Luxburg, 2016), a score of agreement between partitions. Results are reported as mean ± SD of the RI: 1 means perfect agreement and 0 complete disagreement. Fourth, in order to assess the significance of the genomic provinces, we performed a multivariate ANOVA to partition metagenomic dissimilarity across regions, using the adonis function of the vegan R package version 2.5–4 (Oksanen et al., 2019). Note, however, that since the same data were used both to construct the genomic provinces and to assess their significance, the p-values estimated by ADONIS might be anti-conservative. The results of the third and fourth analyses are presented in Supplementary Table 21.

Fifth, we found that clustering of samples in genomic provinces was consistent with a complementary visualization based on the same data: RGB colors derived from the first three axes of a principal coordinates analysis (PCoA-RGB) of β-diversity, in which similar colors represent similar communities (Figure 1—figure supplement 4; see Methods). Samples within the same genomic province generally shared the same range of PCoA-RGB colors. Because the clustering approach was hierarchical, samples sharing some similarity could have been assigned to different genomic provinces due to binary decisions during the clustering process. This was also reflected in the PCoA-RGB colors, where the boundaries of genomic provinces did not indicate a complete change of communities among genomic provinces (and, conversely, belonging to the same genomic province did not imply identical communities). Nonetheless, samples with similar PCoA-RGB colors were generally situated in closely-related branches in the UPGMA tree (Figure 1—figure supplement 6). An illustrative example is genomic province F5 (of the 180–2000 µm size fraction; Figure 1—figure supplement 4f), which encompassed stations in the Atlantic, Mediterranean Sea and some subtropical stations in the Indo-Pacific. In this wide region, the PCoA-RGB colors indicate the variation in community composition within the genomic province, and also reflect the relatedness of F5 to its adjacent samples, in particular those in the subtropical Atlantic/Pacific region F4, its neighbor in the UPGMA tree (Figure 1—figure supplement 6f).

Appendix 3

Comparison of genomic provinces to previous biogeographical divisions

Current approaches in biogeographic theory divide the ocean into regions based either on expert knowledge applied to satellite data, as in the hierarchical nesting by Longhurst, 2006 into biomes (macroscale, essentially representing a division of the world’s oceans into cold and warm waters, and coastal upwelling zones) and biogeochemical provinces (BGCPs, areas within biomes defined by observable boundaries and predicted ecological characteristics), or, alternatively, into the objective provinces of Oliver and Irwin, 2008, which are based solely on statistical analyses. Longhurst BGCPs are based upon, primarily, monthly variations of chlorophyll a, the geography of the seasonal cycle of physical factors (such as the depth of the upper ocean mixed layer) and surface temperatures. In turn, these ocean properties are strongly modulated by oceanic currents (e.g. moderate to large mixed layer depths are observed generally on the poleward side of the subtropical gyres). In contrast, the objective global ocean biogeographic provinces proposed by Oliver and Irwin, 2008 were based upon clustering temporal variability of chlorophyll concentration and surface temperatures, both measured from satellite data. They combined a proxy for the intensity of primary productivity with water temperature, therefore emphasizing regions similar in their temporal variability for both properties (which essentially corresponds to the seasonal cycle). None of these ocean partitions directly considered organismal community composition.

We tested whether genomic provinces were comparable with these partitions by performing an analysis of similarity (ANOSIM; Figure 1—figure supplement 4a-f, insets; Methods). The four small size classes, 0–0.2 µm, 0.22–1.6/3 µm, 0.8–5 µm, and 5–20 µm (Figure 1—figure supplement 4a-d) were more consistent with Longhurst BGCPs. In contrast, for the two larger size fractions 20–180 µm and 180–2000 µm, the three biogeographical divisions were not strongly different within the ANOSIM (Figure 1—figure supplement 4e-f).

From an oceanographic point of view, plankton should be quasi-neutrally redistributed (i.e. homogenized) by currents and their biogeography should follow the structure of the main recirculations, within a range of physiologically compatible temperatures. In this point of view, our results are consistent with the large-scale geographic distributions found by Hellweger et al., 2014 using a neutral model.

Although our analyses of the influence of transport and other environmental conditions on plankton biogeography are focused on surface samples (the depth at which our simulations of transport was conducted), we included samples collected at the DCM in our genomic provinces in order to produce a broader description of plankton biogeography. In general, samples at the surface and DCM from the same station clustered together in the same genomic provinces, although we observed a minority of cases in which surface and DCM samples from the same station were found in different genomic provinces, and even a few genomic provinces composed only of DCM samples (e.g. C6 in the 0.8–5 µm size fraction; Figure 1—figure supplement 4).

Appendix 4

Differences in genomic province sizes among organismal size fractions

Globally, we obtained more numerous, smaller genomic provinces in the smaller size fractions and fewer, larger genomic provinces in the larger size fractions (Figure 1—figure supplement 4, Figure 1—figure supplement 7). We observed a similar pattern using OTU data (Figure 1—figure supplement 5). Whereas smaller size fractions generally lacked geographically widespread genomic provinces containing numerous Tara Oceans samples, the two largest size fractions were both characterized by two very widespread genomic provinces: F5 and F8 for the 180–2000 µm size fraction, and E5 and E6 for the 20–180 µm size fraction. These large genomic provinces were latitudinally limited by the boundary between the subtropics and subpolar regions, and spanned different oceanic basins. Notably, in the southern hemisphere the subtropical gyres actually form a single supergyre (Speich et al., 2007) and there are almost no metabolic (mainly temperature) barriers between the northern and southern subtropical gyres (see Figure 1—figure supplement 4), potentially explaining genomic provinces in the 20–180 µm and 180–2000 µm size fraction that contain samples from the North and South Atlantic. For example, in the 180–2000 µm size fraction, F5 mostly covered the North and South Atlantic Oceans and adjacent systems, and F8 covered the Indo-Pacific low- and mid-latitudes. No clear correspondence existed with biogeochemical patterns (e.g. nutrient ratios), except for the clusters coinciding with upwelling systems (F3 for the California upwelling, F7 for the Chile-Peru upwelling, and F2 for the Benguela upwelling system) and for the samples collected at the DCM in the Pacific subtropical gyres (F5); this is consistent with the comparison of genomic provinces to previous biographical divisions, in which the genomic provinces of smaller size fractions were more consistent with Longhurst BGCPs, but those of larger size fractions were not (Appendix 3). A bimodal zooplankton species distribution (split into subtropical and subpolar communities, with ubiquitous warm water species) was also detected by a recent study on copepod population dynamics that used alternative approaches to analyze the same metagenomic data set (Madoui et al., 2017) (see their Figure 2). More locally, within the North Atlantic (see also Appendix 6), along the northern boundary of the subtropical gyre, cold and warm copepod species overlapped because of cross-current dispersal. Nonetheless, although both cold and warm species appeared to be able to travel long distances, mixing among them was not sufficient to create a local genomic province in our data.

We interpret the difference in genomic province sizes between smaller and larger size fractions as the result of various factors. Plankton smaller than 20 µm (femto-, pico-, and nanoplankton), which represent most of the prokaryotic and eukaryotic phototrophs (de Vargas et al., 2015; Sunagawa et al., 2015), are sensitive to a suite of environmental factors (i.e. temperature [Eppley, 1972], nutrients, and trace elements [Moore et al., 2013]; see also Figure 1—figure supplement 7) and generally have a shorter life cycle, together leading to faster fluctuations in their relative abundance in the communities we sampled. In contrast, larger plankton have longer life cycles and, if they are predators that are not strongly selective in their feeding, or are photosymbiotic hosts capable of partnering with multiple different symbionts, may cope with local fluctuations in environmental conditions. Therefore, they should be affected primarily by large scale, mostly latitudinal, variations in the environment, leading to larger genomic provinces, whereas smaller plankton are grouped into smaller provinces more influenced by local environmental conditions. Overall, this difference in biogeography suggests a size-based decoupling between smaller and larger plankton (which may also extend to nekton such as tuna and billfish; Reygondeau et al., 2012), with implications for the structure and function of oceanic food webs and other types of biotic interactions.

Appendix 5

Genomic provinces as stable ecological continua

As plankton communities are transported by ocean currents, they change over time due to the various processes that occur in the context of the seascape: variations in temperature, light and nutrients (where changes in the latter may also be induced by plankton communities), intra- and interindividual and species biological interactions, and mixing with neighboring water masses. Thus, a continuum of composition among nearby samples is expected as a natural consequence of community turnover within the seascape over time. We observed the effects of continuous turnover in our biogeographical analyses (Figure 1 a, Figure 1—figure supplement 4, Figure 1—figure supplement 5, Appendix 2) in which nearby samples often reflected gradual, but not complete changes in community composition.

We measured the time window of transport by currents separating two samples during which the changes in their community composition were maximally correlated with travel time, resulting in a global average of T_min less than roughly 1.5 years. This represents the travel time during which predictable continuous turnover occurs in our data set. This time scale of 1.5 years is probably an underestimate, since our sparse sampling did not cover all current systems. Notably, T_min does not necessarily define the turnover rate itself, which depends on how strongly different seascape processes affect communities with differing biological characteristics (see Appendix 6).

Appendix 6

Community turnover in the North Atlantic

In order to characterize the impact of physical and biological processes on changes in metagenomic composition during travel along currents, we focused on the well-known current systems crossing the North Atlantic into the Mediterranean Sea (the Gulf Stream and other currents around the subtropical gyre Dornelas et al., 2014; Fofonoff, 1981; Franklin, 1786; Talley et al., 2011; Figure 2—figure supplement 2a). Across this region, the piconanoplankton (0.8–5 µm) were split into three genomic provinces, C5, C8, and C3, each less than 5000 km wide (~11 months of travel time; Figure 1—figure supplement 4c). In contrast, mesoplankton (180–2000 µm) biogeography corresponded to a single province, F5, spanning from the Caribbean to Cyprus (>9700 km or ~18 months of travel time; Figure 1—figure supplement 4f; see also Appendix 4). Metagenomic dissimilarity and T_min were strongly correlated within the region (Spearman’s ρ between 0.44 and 0.86 depending on size fraction, Figure 2—figure supplement 2b-e), which allowed us to explore the relationship of genomic province size, ocean transport, and plankton community turnover over scales from months to years. We calculated metagenomic turnover times as e-folding times based on an exponential fit of metagenomic dissimilarity to T_min (ranging from a few months to a few years, Methods). The metagenomic turnover time of smaller plankton (<20 µm) was approximately 1 year. In contrast, for the larger size fractions, the metagenomic turnover time was approximately 2 years, suggesting that a lower turnover rate for larger plankton may explain their geographically larger genomic provinces.

We note that our results on metagenomic turnover time appear different from a recently published study that also calculated turnover rates for plankton, which found faster rates for larger organisms (Villarino et al., 2018). This may be explained by two significant differences between our approach and theirs: first, their measurements of β-diversity were based on presence/absence (Jaccard) comparisons among either morphological species or OTUs, whereas our calculations of turnover time above were based on metagenomic sequences. As described above (Appendix 1), there are significant differences in resolution between OTU-based and metagenomic data, and we would expect similar differences in resolution between organismal observation data and metagenomic sequences. In fact, due to these differences in resolution, our estimates of metagenomic turnover based on OTU rather than metagenomic data show a similar trend to those of (Villarino et al., 2018; Figure 2—figure supplement 2f-i). Second, their turnover rates were calculated separately for individual plankton groups (the nine main groups were prokaryotes, coccolithophores, dinoflagellates, diatoms, all microbial eukaryotes, gelatinous zooplankton, mesozooplankton, macrozooplankton, and myctophids), whereas our metagenomic data represent samples of the full plankton community within each size fraction. Among these, several groups (e.g. dinoflagellates or mesozooplankton) would be expected to be found across multiple Tara Oceans size fractions, blurring potential comparisons. Thus, our study and Villarino et al. calculated rates of change using broadly similar approaches, but based on very different underlying biological substrates.

Appendix 7

Plankton biogeography is robust to missing samples

Although many individual Tara Oceans stations are missing metagenomic or metabarcode sequencing data for a subset of size fractions (Figure 1—figure supplement 1b), all oceanic regions have broad coverage for each size fraction, with the exception of the viral-enriched size fraction in the North Atlantic. In fact, the largest source of missing data in our study is due to limited sampling of the viral-enriched size fraction in this region. Nevertheless, we found a pattern for organismal biogeography and for its relation with transport time that is not dependent on the size fraction, and therefore also does not depend on the particular size fractions sampled at specific set of sampling sites. In our analyses, we found a consistently similar patterns across the four smaller size fractions (each fraction was sampled and analyzed independently from the others) as opposed to the two larger ones. In addition, for our results relating to ocean transport time, the fact that the sampling sites are not exactly the same among size fractions actually lends robustness to our results, since it means that the dynamics we found are not overly dependent on any one selected site, region, or subset of sampling stations.

Data availability

The authors declare that all data reported herein are fully and freely available from the date of publication, with no restrictions, and that all of the samples, analyses, publications, and ownership of data are free from legal entanglement or restriction of any sort by the various nations in whose waters the Tara Oceans expedition sampled. Metagenomic and metabarcoding sequencing reads have been deposited at the European Nucleotide Archive under accession numbers provided in Supplementary Table 1. Contextual metadata of Tara Oceans stations are available in Supplementary Table 2. Metagenomic dissimilarity, OTU community dissimilarity, imaging community dissimilarity, simulated travel times, geographic distances and MAG dissimilarity are provided in Supplementary Tables 3-18. All Supplementary Tables, in addition to Datasets 1-4 (tables of 18S V9 barcodes and OTUs, the V9 reference database and MAG abundances) are available on FigShare at the following URL: https://doi.org/10.6084/m9.figshare.11303177. Images and their taxonomic assignations are accessible within Ecotaxa (https://ecotaxa.obs-vlfr.fr/prj/377).

The following data sets were generated

1. Richter D
2. Watteaux R
3. Vannier T
4. Leconte J
5. Frémont P
6. Reygondeau G
7. Maillet N
8. Henry N
9. Benoit G
10. Ophélie DS
11. delmont t
12. Fernàndez-Guerra A
13. Suweis S
14. Narci R
15. Berney C
16. Eveillard D
17. Gavory F
18. Guidi L
19. Labadie K
20. Mahieu E
21. Poulain J
22. Romac S
23. Roux S
24. Dimier C
25. Kandels S
26. Picheral M
27. Searson S
28. Coordinators TO
29. Pesant S
30. Aury JM
31. Brum JR
32. Lemaitre C
33. Pelletier E
34. Bork P
35. Sunagawa S
36. Lombard F
37. Karp-Boss L
38. Bowler C
39. Sullivan MB
40. Karsenti E
41. Mariadassou M
42. Probert I
43. Peterlongo P
44. Wincker P
45. Vargas Cd
46. d'Alcalà MR
47. Iudicone D
48. Jaillon O
(2021) figshare
Data from: Genomic evidence for global ocean plankton biogeography shaped by large-scale current systems.

https://doi.org/10.6084/m9.figshare.11303177

The following previously published data sets were used

1. Tara Oceans Consortium
(2016) NCBI Sequence Read Archive
ID PRJEB16766. sequence data corresponding to the V9 loop of the 18S rDNA in 850 size-fractionnated plankton communities sampled at 123 location including samples from the mesopelagic zone.

https://www.ncbi.nlm.nih.gov/sra/?term=PRJEB16766
1. Tara Oceans Consortium
(2013) NCBI Sequence Read Archive
ID PRJEB4352. Shotgun Sequencing of Tara Oceans DNA samples corresponding to size fractions for protist.

https://www.ncbi.nlm.nih.gov/sra/?term=PRJEB4352
1. Tara Oceans Consortium
(2013) NCBI Sequence Read Archive
ID PRJEB1787. Shotgun Sequencing of Tara Oceans DNA samples corresponding to size fractions for prokaryotes.

https://www.ncbi.nlm.nih.gov/sra/?term=PRJEB1787
1. Tara Oceans Consortium
(2013) NCBI Sequence Read Archive
ID PRJEB4419. Shotgun Sequencing of Tara Oceans DNA samples corresponding to size fractions for viruses.

https://www.ncbi.nlm.nih.gov/sra/?term=PRJEB4419

References

1. Alberti A
2. Poulain J
3. Engelen S
4. Labadie K
5. Romac S
6. Ferrera I
7. Albini G
8. Aury JM
9. Belser C
10. Bertrand A
11. Cruaud C
12. Da Silva C
13. Dossat C
14. Gavory F
15. Gas S
16. Guy J
17. Haquelle M
18. Jacoby E
19. Jaillon O
20. Lemainque A
21. Pelletier E
22. Samson G
23. Wessner M
24. Acinas SG
25. Royo-Llonch M
26. Cornejo-Castillo FM
27. Logares R
28. Fernández-Gómez B
29. Bowler C
30. Cochrane G
31. Amid C
32. Hoopen PT
33. De Vargas C
34. Grimsley N
35. Desgranges E
36. Kandels-Lewis S
37. Ogata H
38. Poulton N
39. Sieracki ME
40. Stepanauskas R
41. Sullivan MB
42. Brum JR
43. Duhaime MB
44. Poulos BT
45. Hurwitz BL
46. Pesant S
47. Karsenti E
48. Wincker P
49. Genoscope Technical Team
50. Tara Oceans Consortium Coordinators
(2017) Viral to metazoan marine plankton nucleotide sequences from the Tara Oceans expedition
Scientific Data 4:170093.

https://doi.org/10.1038/sdata.2017.93
- PubMed
- Google Scholar
1. Armbrust EV
(2009) The life of diatoms in the world’s oceans
Nature 459:185–192.

https://doi.org/10.1038/nature08057
- PubMed
- Google Scholar
1. Baker FB
2. Hubert LJ
(1975) Measuring the power of hierarchical cluster analysis
Journal of the American Statistical Association 70:31–38.

https://doi.org/10.1080/01621459.1975.10480256
- Google Scholar
(2002) Reorganization of North Atlantic marine copepod biodiversity and climate
Science 296:1692–1694.

https://doi.org/10.1126/science.1071329
- PubMed
- Google Scholar
Software
1. Becker RA
2. Wilks AR
3. Brownrigg R
4. Minka TP
5. Deckmyn A
(2018) Draw Geographical Maps, version 3.4.0
R Package Maps.

https://cran.r-project.org/web/packages/maps/maps.pdf
(2016) Multiple comparative metagenomics using multiset k -mer counting
PeerJ Computer Science 2:e94.

https://doi.org/10.7717/peerj-cs.94
- Google Scholar
(2011) D
IEEE Transactions on Visualization and Computer Graphics 17:2301–2309.

https://doi.org/10.1109/TVCG.2011.185
- PubMed
- Google Scholar
1. Brum JR
2. Ignacio-Espinoza JC
3. Roux S
4. Doulcier G
5. Acinas SG
6. Alberti A
7. Chaffron S
8. Cruaud C
9. de Vargas C
10. Gasol JM
11. Gorsky G
12. Gregory AC
13. Guidi L
14. Hingamp P
15. Iudicone D
16. Not F
17. Ogata H
18. Pesant S
19. Poulos BT
20. Schwenck SM
21. Speich S
22. Dimier C
23. Kandels-Lewis S
24. Picheral M
25. Searson S
26. Bork P
27. Bowler C
28. Sunagawa S
29. Wincker P
30. Karsenti E
31. Sullivan MB
32. Tara Oceans Coordinators
(2015) Ocean plankton: Patterns and ecological drivers of ocean viral communities
Science 348:1261498.

https://doi.org/10.1126/science.1261498
- PubMed
- Google Scholar
1. Caesar L
2. Rahmstorf S
3. Robinson A
4. Feulner G
5. Saba V
(2018) Observed fingerprint of a weakening Atlantic Ocean overturning circulation
Nature 556:191–196.

https://doi.org/10.1038/s41586-018-0006-5
- PubMed
- Google Scholar
1. Carradec Q
2. Pelletier E
3. Da Silva C
4. Alberti A
5. Seeleuthner Y
6. Blanc-Mathieu R
7. Lima-Mendez G
8. Rocha F
9. Tirichine L
10. Labadie K
11. Kirilovsky A
12. Bertrand A
13. Engelen S
14. Madoui M-A
15. Méheust R
16. Poulain J
17. Romac S
18. Richter DJ
19. Yoshikawa G
20. Dimier C
21. Kandels-Lewis S
22. Picheral M
23. Searson S
24. Tara Oceans Coordinators
25. Jaillon O
26. Aury J-M
27. Karsenti E
28. Sullivan MB
29. Sunagawa S
30. Bork P
31. Not F
32. Hingamp P
33. Raes J
34. Guidi L
35. Ogata H
36. de Vargas C
37. Iudicone D
38. Bowler C
39. Wincker P
(2018) A global ocean atlas of eukaryotic genes
Nature Communications 9:373.

https://doi.org/10.1038/s41467-017-02342-1
- PubMed
- Google Scholar
(2013) Dispersal, eddies, and the diversity of marine phytoplankton
Limnology and Oceanography 3:182–197.

https://doi.org/10.1215/21573689-2373515
- Google Scholar
1. Clayton S
2. Dutkiewicz S
3. Jahn O
4. Hill C
5. Heimbach P
6. Follows MJ
(2016) Biogeochemical versus ecological consequences of modeled ocean
Biogeophysics 10:1–20.

https://doi.org/10.5194/bg-2016-337
- Google Scholar
Software
1. Dahl E
2. Neer E
3. Karstens L
(2022) microshades: A custom color palette for improving data visualization
Microshades.

https://karstenslab.github.io/microshades
1. de Vargas C
2. Audic S
3. Henry N
4. Decelle J
5. Mahé F
6. Logares R
7. Lara E
8. Berney C
9. Le Bescot N
10. Probert I
11. Carmichael M
12. Poulain J
13. Romac S
14. Colin S
15. Aury JM
16. Bittner L
17. Chaffron S
18. Dunthorn M
19. Engelen S
20. Flegontova O
21. Guidi L
22. Horák A
23. Jaillon O
24. Lima-Mendez G
25. Lukeš J
26. Malviya S
27. Morard R
28. Mulot M
29. Scalco E
30. Siano R
31. Vincent F
32. Zingone A
33. Dimier C
34. Picheral M
35. Searson S
36. Kandels-Lewis S
37. Acinas SG
38. Bork P
39. Bowler C
40. Gorsky G
41. Grimsley N
42. Hingamp P
43. Iudicone D
44. Not F
45. Ogata H
46. Pesant S
47. Raes J
48. Sieracki ME
49. Speich S
50. Stemmann L
51. Sunagawa S
52. Weissenbach J
53. Wincker P
54. Karsenti E
55. Tara Oceans Coordinators
(2015) Ocean plankton - Eukaryotic plankton diversity in the sunlit ocean
Science 348:1261605.

https://doi.org/10.1126/science.1261605
- PubMed
- Google Scholar
1. Delmont TO
2. Gaia M
3. Hinsinger DD
4. Frémont P
5. Vanni C
6. Fernandez-Guerra A
7. Eren AM
8. Kourlaiev A
9. d’Agata L
10. Clayssen Q
11. Villar E
12. Labadie K
13. Cruaud C
14. Poulain J
15. Da Silva C
16. Wessner M
17. Noel B
18. Aury JM
19. de Vargas C
20. Bowler C
21. Karsenti E
22. Pelletier E
23. Wincker P
24. Jaillon O
25. Sunagawa S
26. Acinas SG
27. Bork P
28. Karsenti E
29. Bowler C
30. Sardet C
31. Stemmann L
32. de Vargas C
33. Wincker P
34. Lescot M
35. Babin M
36. Gorsky G
37. Grimsley N
38. Guidi L
39. Hingamp P
40. Jaillon O
41. Kandels S
42. Iudicone D
43. Ogata H
44. Pesant S
45. Sullivan MB
46. Not F
47. Lee KB
48. Boss E
49. Cochrane G
50. Follows M
51. Poulton N
52. Raes J
53. Sieracki M
54. Speich S
(2022a) Functional repertoire convergence of distantly related eukaryotic plankton lineages abundant in the sunlit ocean
Cell Genomics 2:100123.

https://doi.org/10.1016/j.xgen.2022.100123
- Google Scholar
(2022b) Heterotrophic bacterial diazotrophs are more abundant than their cyanobacterial counterparts in metagenomes covering most of the sunlit ocean
The ISME Journal 16:927–936.

https://doi.org/10.1038/s41396-021-01135-1
- PubMed
- Google Scholar
1. Dornelas M
2. Gotelli NJ
3. McGill B
4. Shimadzu H
5. Moyes F
6. Sievers C
7. Magurran AE
(2014) Assemblage time series reveal biodiversity change but not systematic loss
Science 344:296–299.

https://doi.org/10.1126/science.1248484
- PubMed
- Google Scholar
1. Eppley RW
(1972)
Temperature and phytoplankton growth in the sea

Fish Bull 70:1063–1085.
- Google Scholar
1. Flynn KJ
2. St John M
3. Raven JA
4. Skibinski DOF
5. Allen JI
6. Mitra A
7. Hofmann EE
(2015) Acclimation, adaptation, traits and trade-offs in plankton functional type models: reconciling terminology for biology and modelling
Journal of Plankton Research 37:683–691.

https://doi.org/10.1093/plankt/fbv036
- Google Scholar
Book
1. Fofonoff NP.
(1981)
The Gulf Stream In

In: Warren BA, Wunsch C, editors. Evolution of Physical Oceanography: Scientific Surveys in Honor of Henry Stommel. Cambridge, MA: MIT Press. pp. 112–139.
- Google Scholar
1. Franklin B
(1786) A letter from dr. Benjamin franklin, to mr. Alphonsus le roy, member of several academies, at paris: Containing sundry maritime observations
Transactions of the American Philosophical Society 2:294.

https://doi.org/10.2307/1005200
- Google Scholar
1. Frémont P
2. Gehlen M
3. Vrac M
4. Leconte J
5. Delmont TO
6. Wincker P
7. Iudicone D
8. Jaillon O
(2022) Restructuring of plankton genomic biogeography in the surface ocean under climate change
Nature Climate Change 12:393–401.

https://doi.org/10.1038/s41558-022-01314-8
- Google Scholar
1. Galili T
(2015) dendextend: an R package for visualizing, adjusting and comparing trees of hierarchical clustering
Bioinformatics 31:3718–3720.

https://doi.org/10.1093/bioinformatics/btv428
- PubMed
- Google Scholar
Software
1. Gerritsen H
(2014) Data Visualization on Maps, version 1.5.1
R Package Mapplots.

https://cran.r-project.org/web/packages/mapplots/mapplots.pdf
(2017) Ecological dispersal barrier across the equatorial Atlantic in a migratory planktonic copepod
Progress in Oceanography 158:203–212.

https://doi.org/10.1016/j.pocean.2016.07.001
- Google Scholar
1. Gorsky G
2. Ohman MD
3. Picheral M
4. Gasparini S
5. Stemmann L
6. Romagnan JB
7. Cawood A
8. Pesant S
9. Garcia-Comas C
10. Prejger F
(2010) Digital zooplankton image analysis using the ZooScan integrated system
Journal of Plankton Research 32:285–303.

https://doi.org/10.1093/plankt/fbp124
- Google Scholar
(2012) Beyond biogeographic patterns: processes shaping the microbial landscape
Nature Reviews. Microbiology 10:497–506.

https://doi.org/10.1038/nrmicro2795
- PubMed
- Google Scholar
(2014) Biogeographic patterns in ocean microbes emerge in a neutral agent-based model
Science 345:1346–1349.

https://doi.org/10.1126/science.1254421
- PubMed
- Google Scholar
1. Ikeda T
(1985) Metabolic rates of epipelagic marine zooplankton as a function of body mass and temperature
Marine Biology 85:1–11.

https://doi.org/10.1007/BF00396409
- Google Scholar
1. Jönsson BF
2. Watson JR
(2016) The timescales of global surface-ocean connectivity
Nature Communications 7:11239.

https://doi.org/10.1038/ncomms11239
- PubMed
- Google Scholar
1. Karsenti E
2. Acinas SG
3. Bork P
4. Bowler C
5. De Vargas C
6. Raes J
7. Sullivan M
8. Arendt D
9. Benzoni F
10. Claverie JM
11. Follows M
12. Gorsky G
13. Hingamp P
14. Iudicone D
15. Jaillon O
16. Kandels-Lewis S
17. Krzic U
18. Not F
19. Ogata H
20. Pesant S
21. Reynaud EG
22. Sardet C
23. Sieracki ME
24. Speich S
25. Velayoudon D
26. Weissenbach J
27. Wincker P
28. Tara Oceans Consortium
(2011) A holistic approach to marine eco-systems biology
PLOS Biology 9:e1001177.

https://doi.org/10.1371/journal.pbio.1001177
- PubMed
- Google Scholar
1. Kreft H
2. Jetz W
(2010) A framework for delineating biogeographical regions based on species distributions
Journal of Biogeography 37:2029–2053.

https://doi.org/10.1111/j.1365-2699.2010.02375.x
- Google Scholar
Book
1. Legendre P
2. Legendre L.
(2012)
Numerical Ecology

Elsevier.
- Google Scholar
(2016) Nutrient budgets in the subtropical ocean gyres dominated by lateral transport
Nature Geoscience 9:815–819.

https://doi.org/10.1038/ngeo2812
- Google Scholar
(2014) Phytoplankton diversity and community structure affected by oceanic dispersal and mesoscale turbulence
Limnology and Oceanography 4:67–84.

https://doi.org/10.1215/21573689-2768549
- Google Scholar
Book
1. Longhurst A.
(2006)
Ecological Geography of the Sea (2nd ed)

London: Academic Press.
- Google Scholar
Software
1. Losch M
(2022) MITgcm, version swh:1:rev:fe1862e69b370b599f9e14fcd7b7f1174abf21c9
GitHub.

https://github.com/MITgcm/MITgcm
(2016) Decoupling function and taxonomy in the global ocean microbiome
Science 353:1272–1277.

https://doi.org/10.1126/science.aaf4507
- PubMed
- Google Scholar
1. Lumpkin R
2. Johnson GC
(2013) Global ocean surface velocities from drifters: Mean, variance, El Niño-Southern Oscillation response, and seasonal cycle
Journal of Geophysical Research 118:2992–3006.

https://doi.org/10.1002/jgrc.20210
- Google Scholar
1. Madoui MA
2. Poulain J
3. Sugier K
4. Wessner M
5. Noel B
6. Berline L
7. Labadie K
8. Cornils A
9. Blanco-Bercial L
10. Stemmann L
11. Jamet JL
12. Wincker P
(2017) New insights into global biogeography, population structure and natural selection from the genome of the epipelagic copepod Oithona
Molecular Ecology 26:4467–4482.

https://doi.org/10.1111/mec.14214
- PubMed
- Google Scholar
Software
(2015) Cluster Analysis Basics and Extensions, version 2.1.3
R Package Cluster.

https://cran.r-project.org/web/packages/cluster/cluster.pdf
1. Martin K
2. Schmidt K
3. Toseland A
4. Boulton CA
5. Barry K
6. Beszteri B
7. Brussaard CPD
8. Clum A
9. Daum CG
10. Eloe-Fadrosh E
11. Fong A
12. Foster B
13. Foster B
14. Ginzburg M
15. Huntemann M
16. Ivanova NN
17. Kyrpides NC
18. Lindquist E
19. Mukherjee S
20. Palaniappan K
21. Reddy TBK
22. Rizkallah MR
23. Roux S
24. Timmermans K
25. Tringe SG
26. van de Poll WH
27. Varghese N
28. Valentin KU
29. Lenton TM
30. Grigoriev IV
31. Leggett RM
32. Moulton V
33. Mock T
(2021) The biogeographic differentiation of algal microbiomes in the upper ocean from pole to pole
Nature Communications 12:5483.

https://doi.org/10.1038/s41467-021-25646-9
- PubMed
- Google Scholar
1. Martiny JBH
2. Bohannan BJM
3. Brown JH
4. Colwell RK
5. Fuhrman JA
6. Green JL
7. Horner-Devine MC
8. Kane M
9. Krumins JA
10. Kuske CR
11. Morin PJ
12. Naeem S
13. Ovreås L
14. Reysenbach A-L
15. Smith VH
16. Staley JT
(2006) Microbial biogeography: putting microorganisms on the MAP
Nature Reviews Microbiology 4:102–112.

https://doi.org/10.1038/nrmicro1341
- PubMed
- Google Scholar
1. McGowan JA
2. Walker PW
(1979) Structure in the copepod community of the north pacific central gyre
Ecological Monographs 49:195–226.

https://doi.org/10.2307/1942513
- Google Scholar
Software
(2015) Map Projections, version 1.2.8
R Package Mapproj.

https://cran.r-project.org/web/packages/mapproj/mapproj.pdf
1. Menemenlis D
2. Campin J
3. Heimbach P
4. Hill C
5. Lee T
6. Nguyen A
7. Schodlok M
8. Zhang H
(2008)
ECCO2: High resolution global ocean and sea ice data synthesis

Mercat Ocean Q Newsl 31:13–21.
- Google Scholar
1. Moore CM
2. Mills MM
3. Arrigo KR
4. Berman-Frank I
5. Bopp L
6. Boyd PW
7. Galbraith ED
8. Geider RJ
9. Guieu C
10. Jaccard SL
11. Jickells TD
12. La Roche J
13. Lenton TM
14. Mahowald NM
15. Marañón E
16. Marinov I
17. Moore JK
18. Nakatsuka T
19. Oschlies A
20. Saito MA
21. Thingstad TF
22. Tsuda A
23. Ulloa O
(2013) Processes and patterns of oceanic nutrient limitation
Nature Geoscience 6:701–710.

https://doi.org/10.1038/ngeo1765
- Google Scholar
1. Morel A
2. Huot Y
3. Gentili B
4. Werdell PJ
5. Hooker SB
6. Franz BA
(2007) Examining the consistency of products derived from various ocean color sensors in open ocean (Case 1) waters in the perspective of a multi-sensor approach
Remote Sensing of Environment 111:69–88.

https://doi.org/10.1016/j.rse.2007.03.012
- Google Scholar
(2016) Evidence of small‐scale spatial structuring of phytoplankton alpha‐ and beta‐diversity in the open ocean
Journal of Ecology 104:1682–1695.

https://doi.org/10.1111/1365-2745.12634
- Google Scholar
Software
1. Oksanen J
2. Blanchet FG
3. Friendly M
4. Kindt R
5. Legendre P
6. McGlinn D
7. Minchin PR
8. O’Hara RB
9. Simpson GL
10. Solymos P
11. Stevens MHH
12. Szoecs E
13. Wagner H
(2019) R Package Vegan: Community Ecology Package
Package ‘Vegan.’.

https://cran.r-project.org/web/packages/vegan/vegan.pdf
1. Oliver MJ
2. Irwin AJ
(2008) Objective global ocean biogeographic provinces
Geophysical Research Letters 35:GL034238.

https://doi.org/10.1029/2008GL034238
- Google Scholar
1. Pesant S
2. Not F
3. Picheral M
4. Kandels-Lewis S
5. Le Bescot N
6. Gorsky G
7. Iudicone D
8. Karsenti E
9. Speich S
10. Troublé R
11. Dimier C
12. Searson S
13. Acinas SG
14. Bork P
15. Boss E
16. Bowler C
17. De Vargas C
18. Follows M
19. Gorsky G
20. Grimsley N
21. Hingamp P
22. Iudicone D
23. Jaillon O
24. Kandels-Lewis S
25. Karp-Boss L
26. Karsenti E
27. Krzic U
28. Not F
29. Ogata H
30. Pesant S
31. Raes J
32. Reynaud EG
33. Sardet C
34. Sieracki M
35. Speich S
36. Stemmann L
37. Sullivan MB
38. Sunagawa S
39. Velayoudon D
40. Weissenbach J
41. Wincker P
(2015) Open science resources for the discovery and analysis of Tara Oceans data
Scientific Data 2:150023.

https://doi.org/10.1038/sdata.2015.23
- PubMed
- Google Scholar
Website
(2017) EcoTaxa, a tool for the taxonomic classification of images
Accessed June 2, 2020.

http://ecotaxa.obs-vlfr.fr
(2011) How and why DNA barcodes underestimate the diversity of microbial eukaryotes
PLOS ONE 6:e16342.

https://doi.org/10.1371/journal.pone.0016342
- PubMed
- Google Scholar
Book
1. Pittman SJ
(editors) (2017)
Seascape Ecology

Wiley-Blackwell.
- Google Scholar
Book
1. R Core Team T
(2017)
R: A Language and Environment for Statistical Computing

Vienna, Austria: R Foundation for Statistical Computing.
- Google Scholar
1. Rattray A
2. Andrello M
3. Asnaghi V
4. Bevilacqua S
5. Bulleri F
6. Cebrian E
7. Chiantore M
8. Claudet J
9. Deudero S
10. Evans J
11. Fraschetti S
12. Guarnieri G
13. Mangialajo L
14. Schembri PJ
15. Terlizzi A
16. Benedetti-Cecchi L
(2016) Geographic distance, water circulation and environmental conditions shape the biodiversity of Mediterranean rocky coasts
Marine Ecology Progress Series 553:1–11.

https://doi.org/10.3354/meps11783
- Google Scholar
(2012) Biogeography of tuna and billfish communities
Journal of Biogeography 39:114–129.

https://doi.org/10.1111/j.1365-2699.2011.02582.x
- Google Scholar
(2013) Dynamic biogeochemical provinces in the global ocean
Global Biogeochemical Cycles 27:1046–1058.

https://doi.org/10.1002/gbc.20089
- Google Scholar
Book
1. Reygondeau G
2. Dunn D
(2019) Pelagic BiogeographyEncyclopedia of Ocean Sciences
Elsevier.

https://doi.org/10.1016/B978-0-12-409548-9.11633-1
- Google Scholar
(2002) Ocean interpolation by four-dimensional weighted least squares—application to the waters around australasia
Journal of Atmospheric and Oceanic Technology 19:1357–1375.

https://doi.org/10.1175/1520-0426(2002)019<1357:OIBFDW>2.0.CO;2
- Google Scholar
1. Rousseeuw PJ
(1987) Silhouettes: A graphical aid to the interpretation and validation of cluster analysis
Journal of Computational and Applied Mathematics 20:53–65.

https://doi.org/10.1016/0377-0427(87)90125-7
- Google Scholar
1. Roux S
2. Brum JR
3. Dutilh BE
4. Sunagawa S
5. Duhaime MB
6. Loy A
7. Poulos BT
8. Solonenko N
9. Lara E
10. Poulain J
11. Pesant S
12. Kandels-Lewis S
13. Dimier C
14. Picheral M
15. Searson S
16. Cruaud C
17. Alberti A
18. Duarte CM
19. Gasol JM
20. Vaqué D
21. Tara Oceans Coordinators
22. Bork P
23. Acinas SG
24. Wincker P
25. Sullivan MB
(2016) Ecogenomics and potential biogeochemical impacts of globally abundant ocean viruses
Nature 537:689–693.

https://doi.org/10.1038/nature19366
- PubMed
- Google Scholar
Book
1. Sarmiento JL
2. Gruber N
(2006) Ocean Biogeochemical Dynamics
Princeton, New Jersey: Princeton University Press.

https://doi.org/10.1515/9781400849079
- Google Scholar
1. Seeleuthner Y
2. Mondy S
3. Lombard V
4. Carradec Q
5. Pelletier E
6. Wessner M
7. Leconte J
8. Mangot J-F
9. Poulain J
10. Labadie K
11. Logares R
12. Sunagawa S
13. de Berardinis V
14. Salanoubat M
15. Dimier C
16. Kandels-Lewis S
17. Picheral M
18. Searson S
19. Pesant S
20. Poulton N
21. Stepanauskas R
22. Bork P
23. Bowler C
24. Hingamp P
25. Sullivan MB
26. Iudicone D
27. Massana R
28. Aury J-M
29. Henrissat B
30. Karsenti E
31. Jaillon O
32. Sieracki M
33. de Vargas C
34. Wincker P
35. Tara Oceans Coordinators
(2018) Single-cell genomics of multiple uncultured stramenopiles reveals underestimated functional diversity across oceans
Nature Communications 9:310.

https://doi.org/10.1038/s41467-017-02235-3
- PubMed
- Google Scholar
Book
1. Sneath PHA
2. Sokal RR.
(1973)
Numerical taxonomy. The principles and practice of numerical classification

San Francisco: W.H. Freeman and Company.
- Google Scholar
1. Sokal RR
2. Rohlf FJ
(1962) The comparison of dendrograms by objective methods
TAXON 11:33–40.

https://doi.org/10.2307/1217208
- Google Scholar
(2021) Global drivers of eukaryotic plankton biogeography in the sunlit ocean
Science 374:594–599.

https://doi.org/10.1126/science.abb3717
- PubMed
- Google Scholar
1. Speich S
2. Blanke B
3. Cai W
(2007) Atlantic meridional overturning circulation and the Southern Hemisphere supergyre
Geophysical Research Letters 34:n.

https://doi.org/10.1029/2007GL031583
- Google Scholar
1. Sunagawa S
2. Coelho LP
3. Chaffron S
4. Kultima JR
5. Labadie K
6. Salazar G
7. Djahanschiri B
8. Zeller G
9. Mende DR
10. Alberti A
11. Cornejo-Castillo FM
12. Costea PI
13. Cruaud C
14. d’Ovidio F
15. Engelen S
16. Ferrera I
17. Gasol JM
18. Guidi L
19. Hildebrand F
20. Kokoszka F
21. Lepoivre C
22. Lima-Mendez G
23. Poulain J
24. Poulos BT
25. Royo-Llonch M
26. Sarmento H
27. Vieira-Silva S
28. Dimier C
29. Picheral M
30. Searson S
31. Kandels-Lewis S
32. Bowler C
33. de Vargas C
34. Gorsky G
35. Grimsley N
36. Hingamp P
37. Iudicone D
38. Jaillon O
39. Not F
40. Ogata H
41. Pesant S
42. Speich S
43. Stemmann L
44. Sullivan MB
45. Weissenbach J
46. Wincker P
47. Karsenti E
48. Raes J
49. Acinas SG
50. Bork P
51. Tara Oceans coordinators
(2015) Ocean plankton: Structure and function of the global ocean microbiome
Science 348:1261359.

https://doi.org/10.1126/science.1261359
- PubMed
- Google Scholar
1. Suzuki R
2. Shimodaira H
(2006) Pvclust: an R package for assessing the uncertainty in hierarchical clustering
Bioinformatics 22:1540–1542.

https://doi.org/10.1093/bioinformatics/btl117
- PubMed
- Google Scholar
1. Tagliabue A
2. Bowie AR
3. Boyd PW
4. Buck KN
5. Johnson KS
6. Saito MA
(2017) The integral role of iron in ocean biogeochemistry
Nature 543:51–59.

https://doi.org/10.1038/nature21058
- PubMed
- Google Scholar
Book
(2011)
Descriptive Physical Oceanography: An Introduction (6th ed)

Boston: Elsevier.
- Google Scholar
Software
1. Terada Y
2. Luxburg U
(2016) R Package Loe: Local Ordinal Embedding
Loe.

https://cran.r-project.org/web/packages/loe/index.html
1. Vannier T
2. Leconte J
3. Seeleuthner Y
4. Mondy S
5. Pelletier E
6. Aury J-M
7. de Vargas C
8. Sieracki M
9. Iudicone D
10. Vaulot D
11. Wincker P
12. Jaillon O
(2016) Survey of the green picoalga Bathycoccus genomes in the global ocean
Scientific Reports 6:37900.

https://doi.org/10.1038/srep37900
- PubMed
- Google Scholar
1. Villarino E
2. Watson JR
3. Jönsson B
4. Gasol JM
5. Salazar G
6. Acinas SG
7. Estrada M
8. Massana R
9. Logares R
10. Giner CR
11. Pernice MC
12. Olivar MP
13. Citores L
14. Corell J
15. Rodríguez-Ezpeleta N
16. Acuña JL
17. Molina-Ramírez A
18. González-Gordillo JI
19. Cózar A
20. Martí E
21. Cuesta JA
22. Agustí S
23. Fraile-Nuez E
24. Duarte CM
25. Irigoien X
26. Chust G
(2018) Large-scale ocean connectivity and planktonic body size
Nature Communications 9:142.

https://doi.org/10.1038/s41467-017-02535-8
- PubMed
- Google Scholar
Software
1. Warnes GR
2. Bolker B
3. Bonebakker L
4. Gentleman R
5. Huber W
6. Liaw A
7. Lumley T
8. Maechler M
9. Magnusson A
10. Moeller S
11. Schwartz M
12. Venables B
(2015) Various R Programming Tools for Plotting Data, version 3.1.3
R Package Gplots.

https://cran.r-project.org/web/packages/gplots/gplots.pdf
1. Watson JR
2. Mitarai S
3. Siegel DA
4. Caselle JE
5. Dong C
6. McWilliams JC
(2010) Realized and potential larval connectivity in the Southern California Bight
Marine Ecology Progress Series 401:31–48.

https://doi.org/10.3354/meps08376
- Google Scholar
Software
1. Wei T
2. Simko V
(2016) Visualization of a Correlation Matrix, version 0.92
R Package Corrplot.

https://CRAN.R-project.org/package=corrplot
(2013) Advection shapes Southern Ocean microbial assemblages independent of distance and environment effects
Nature Communications 4:2457.

https://doi.org/10.1038/ncomms3457
- PubMed
- Google Scholar
1. Worden AZ
2. Lee JH
3. Mock T
4. Rouzé P
5. Simmons MP
6. Aerts AL
7. Allen AE
8. Cuvelier ML
9. Derelle E
10. Everett MV
11. Foulon E
12. Grimwood J
13. Gundlach H
14. Henrissat B
15. Napoli C
16. McDonald SM
17. Parker MS
18. Rombauts S
19. Salamov A
20. Von Dassow P
21. Badger JH
22. Coutinho PM
23. Demir E
24. Dubchak I
25. Gentemann C
26. Eikrem W
27. Gready JE
28. John U
29. Lanier W
30. Lindquist EA
31. Lucas S
32. Mayer KFX
33. Moreau H
34. Not F
35. Otillar R
36. Panaud O
37. Pangilinan J
38. Paulsen I
39. Piegu B
40. Poliakov A
41. Robbens S
42. Schmutz J
43. Toulza E
44. Wyss T
45. Zelensky A
46. Zhou K
47. Armbrust EV
48. Bhattacharya D
49. Goodenough UW
50. Van de Peer Y
51. Grigoriev IV
(2009) Green evolution and dynamic adaptations revealed by genomes of the marine picoeukaryotes Micromonas
Science 324:268–272.

https://doi.org/10.1126/science.1167222
- PubMed
- Google Scholar
1. Wu S
2. Xiong J
3. Yu Y
4. Fontaneto D
(2015) Taxonomic resolutions based on 18s rrna genes: A case study of subclass copepoda
PLOS ONE 10:e0131498.

https://doi.org/10.1371/journal.pone.0131498
- Google Scholar

Article and author information

Author details

Daniel J Richter
1. Sorbonne Université, CNRS, Station Biologique de Roscoff, UMR7144, ECOMAP, Roscoff, France
2. Institut de Biologia Evolutiva (CSIC‐Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta, Barcelona, Spain
Contribution
Conceptualization, Data curation, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review and editing, Investigation, Software

Contributed equally with
Romain Watteaux and Thomas Vannier

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-9238-5571
Romain Watteaux
1. Stazione Zoologica Anton Dohrn, Villa Comunale, Naples, Italy
2. CEA, DAM, DIF, F‐91297, Arpajon Cedex, France
Contribution
Conceptualization, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review and editing, Data curation, Investigation, Software

Contributed equally with
Daniel J Richter and Thomas Vannier

Competing interests
No competing interests declared
Thomas Vannier
1. Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, CEA, CNRS, Université Evry, Université Paris‐Saclay, Evry, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
3. Aix Marseille Univ., Université de Toulon, CNRS, IRD, MIO UM, Marseille, France
Contribution
Conceptualization, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review and editing, Data curation, Investigation, Software

Contributed equally with
Daniel J Richter and Romain Watteaux

Competing interests
No competing interests declared
Jade Leconte
1. Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, CEA, CNRS, Université Evry, Université Paris‐Saclay, Evry, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
Contribution
Formal analysis, Visualization, Writing – review and editing, Investigation

Competing interests
No competing interests declared
Paul Frémont
1. Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, CEA, CNRS, Université Evry, Université Paris‐Saclay, Evry, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
Contribution
Formal analysis, Visualization, Writing – review and editing, Investigation, Software

Competing interests
No competing interests declared
Gabriel Reygondeau
1. Changing Ocean Research Unit, Institute for the Oceans and Fisheries, University of British Columbia. Aquatic Ecosystems Research Lab, Vancouver, Canada
2. Ecology and Evolutionary Biology, Yale University, New Haven, CT, United States
Contribution
Formal analysis, Writing – review and editing

Competing interests
No competing interests declared
Nicolas Maillet

Institut pasteur, Université Paris Cité, Bioinformatics and Biostatistics Hub, Paris, France

Contribution
Formal analysis, Visualization

Competing interests
No competing interests declared
Nicolas Henry
1. Sorbonne Université, CNRS, Station Biologique de Roscoff, UMR7144, ECOMAP, Roscoff, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
Contribution
Formal analysis, Visualization

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-7702-1382
Gaëtan Benoit

Univ Rennes, CNRS, Inria, IRISA-UMR 6074, Rennes, France

Contribution
Methodology, Investigation

Competing interests
No competing interests declared
Ophélie Da Silva
1. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
2. Sorbonne Universités, CNRS, Laboratoire d’Oceanographie de Villefranche, LOV, Villefranche‐sur‐Mer, France
Contribution
Formal analysis, Investigation, Writing – review and editing

Competing interests
No competing interests declared
Tom O Delmont
1. Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, CEA, CNRS, Université Evry, Université Paris‐Saclay, Evry, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
Contribution
Investigation, Resources, Writing – review and editing

Competing interests
No competing interests declared
Antonio Fernàndez-Guerra
1. Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
2. MARUM, Center for Marine Environmental Sciences, University of Bremen, Bremen, Germany
3. Max Planck Institute for Marine Microbiology, Bremen, Germany
Contribution
Formal analysis, Visualization, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-8679-490X
Samir Suweis

Dipartimento di Fisica e Astronomia ‘G. Galilei’ & CNISM, INFN, Università di Padova, Padova, Italy

Contribution
Formal analysis, Visualization, Writing – review and editing

Competing interests
No competing interests declared
Romain Narci

MaIAGE, INRAE, Université Paris‐Saclay, Jouy‐en‐Josas, France

Contribution
Funding acquisition, Writing – review and editing

Competing interests
No competing interests declared
Cédric Berney
1. Sorbonne Université, CNRS, Station Biologique de Roscoff, UMR7144, ECOMAP, Roscoff, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
Contribution
Formal analysis

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0001-8689-9907
Damien Eveillard
1. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
2. Nantes Université, Ecole Centrale Nantes, CNRS, LS2N, Nantes, France
Contribution
Formal analysis, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-8162-7360
Frederick Gavory

Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, CEA, CNRS, Université Evry, Université Paris‐Saclay, Evry, France

Contribution
Investigation

Competing interests
No competing interests declared
Lionel Guidi
1. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
2. Sorbonne Universités, CNRS, Laboratoire d’Oceanographie de Villefranche, LOV, Villefranche‐sur‐Mer, France
Contribution
Data curation, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-6669-5744
Karine Labadie

Genoscope, Institut de biologie François‐Jacob, Commissariat à l'Energie Atomique (CEA), Université Paris‐Saclay, Evry, France

Contribution
Investigation, Methodology, Resources

Competing interests
No competing interests declared
Eric Mahieu

Genoscope, Institut de biologie François‐Jacob, Commissariat à l'Energie Atomique (CEA), Université Paris‐Saclay, Evry, France

Contribution
Investigation

Competing interests
No competing interests declared
Julie Poulain
1. Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, CEA, CNRS, Université Evry, Université Paris‐Saclay, Evry, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
Contribution
Data curation, Investigation, Project administration, Resources

Competing interests
No competing interests declared
Sarah Romac
1. Sorbonne Université, CNRS, Station Biologique de Roscoff, UMR7144, ECOMAP, Roscoff, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
Contribution
Resources

Competing interests
No competing interests declared
Simon Roux

Department of Microbiology, The Ohio State University, Columbus, United States

Contribution
Resources, Writing – review and editing

Competing interests
No competing interests declared
Céline Dimier
1. Sorbonne Université, CNRS, Station Biologique de Roscoff, UMR7144, ECOMAP, Roscoff, France
2. Institut de Biologie de l’Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
Contribution
Resources

Competing interests
No competing interests declared
Stefanie Kandels
1. Structural and Computational Biology, European Molecular Biology Laboratory, Heidelberg, Germany
2. Directors’ Research European Molecular Biology Laboratory, Heidelberg, Germany
Contribution
Project administration

Competing interests
No competing interests declared
Marc Picheral
1. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
2. Sorbonne Universités, CNRS, Laboratoire d’Oceanographie de Villefranche, LOV, Villefranche‐sur‐Mer, France
Contribution
Data curation, Investigation, Resources

Competing interests
No competing interests declared
Sarah Searson
1. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
2. Sorbonne Universités, CNRS, Laboratoire d’Oceanographie de Villefranche, LOV, Villefranche‐sur‐Mer, France
Contribution
Resources

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-4721-0027
Tara Oceans Coordinators

Contribution
Funding acquisition, Supervision, Writing – review and editing

Competing interests
No competing interests declared
1. Silvia G Acinas, Department of Marine Biology and Oceanography, Institut de Ciències del Mar (ICM), CSIC, Barcelona, Spain
2. Peer Bork, Structural and Computational Biology, European Molecular Biology Laboratory, Heidelberg, Germany
3. Emmanuel Boss, School of Marine Sciences, University of Maine, Orono, United States
4. Chris Bowler, Research Federation for the study of Global Ocean systems ecology and evolution, FR2022/Tara GOSEE, Paris, France
5. Guy Cochrane, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL‐EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
6. Colomban de Vargas, Research Federation for the study of Global Ocean systems ecology and evolution, FR2022/Tara GOSEE, Paris, France
7. Gabriel Gorsky, Research Federation for the study of Global Ocean systems ecology and evolution, FR2022/Tara GOSEE, Paris, France
8. Nigel Grimsley, CNRS, UMR 7232, BIOM, Avenue Pierre Fabre, Banyuls‐sur‐Mer, France
9. Lionel Guidi, Research Federation for the study of Global Ocean systems ecology and evolution, FR2022/Tara GOSEE, Paris, France
10. Pascal Hingamp, Aix Marseille Univ., Université de Toulon, CNRS, IRD, MIO UM 110, 13288, Marseille, France
11. Daniele Iudicone, Stazione Zoologica Anton Dohrn, Villa Comunale, Naples, Italy
12. Olivier Jaillon, Research Federation for the study of Global Ocean systems ecology and evolution, FR2022/Tara GOSEE, Paris, France
13. Stefanie Kandels, Structural and Computational Biology, European Molecular Biology Laboratory, Heidelberg, Germany
14. Lee Karp-Boss, School of Marine Sciences, University of Maine, Orono, United States
15. Eric Karsenti, Research Federation for the study of Global Ocean systems ecology and evolution, FR2022/Tara GOSEE, Paris, France
16. Fabrice Not, Research Federation for the study of Global Ocean systems ecology and evolution, FR2022/Tara GOSEE, Paris, France
17. Hiroyuki Ogata, Institute for Chemical Research, Kyoto University, Gokasho, Kyoto, Japan
18. Stéphane Pesant, MARUM, Center for Marine Environmental Sciences, University of Bremen, Bremen, Germany
19. Jeroen Raes, Department of Microbiology and Immunology, Rega Institute, KU Leuven, Leuven, Belgium
20. Christian Sardet, Research Federation for the study of Global Ocean systems ecology and evolution, FR2022/Tara GOSEE, Paris, France
21. Mike Sieracki, National Science Foundation, Arlington, United States
22. Sabrina Speich, Laboratoire de Physique des Océans, UBO‐IUEM, Place Copernic, Plouzané, France
23. Lars Stemmann, Research Federation for the study of Global Ocean systems ecology and evolution, FR2022/Tara GOSEE, Paris, France
24. Matthew B Sullivan, Department of Microbiology, The Ohio State University, Columbus, United States
25. Shinichi Sunagawa, Structural and Computational Biology, European Molecular Biology Laboratory, Heidelberg, Germany
26. Patrick Wincker, Research Federation for the study of Global Ocean systems ecology and evolution, FR2022/Tara GOSEE, Paris, France
Stéphane Pesant
1. MARUM, Center for Marine Environmental Sciences, University of Bremen, Bremen, Germany
2. PANGAEA, Data Publisher for Earth and Environmental Science, University of Bremen, Bremen, Germany
Contribution
Data curation

Competing interests
No competing interests declared
Jean-Marc Aury

Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, CEA, CNRS, Université Evry, Université Paris‐Saclay, Evry, France

Contribution
Data curation, Investigation, Software

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0003-1718-3010
Jennifer R Brum
1. Department of Microbiology, The Ohio State University, Columbus, United States
2. Department of Oceanography and Coastal Sciences, Louisiana State University, Baton Rouge, United States
Contribution
Resources

Competing interests
No competing interests declared
Claire Lemaitre

Univ Rennes, CNRS, Inria, IRISA-UMR 6074, Rennes, France

Contribution
Methodology, Investigation

Competing interests
No competing interests declared
Eric Pelletier
1. Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, CEA, CNRS, Université Evry, Université Paris‐Saclay, Evry, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
Contribution
Investigation, Writing – review and editing

Competing interests
No competing interests declared
Peer Bork
1. Structural and Computational Biology, European Molecular Biology Laboratory, Heidelberg, Germany
2. Yonsei Frontier Lab, Yonsei University, Seoul, Republic of Korea
3. Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany
Contribution
Writing – review and editing

Competing interests
No competing interests declared
Shinichi Sunagawa
1. Structural and Computational Biology, European Molecular Biology Laboratory, Heidelberg, Germany
2. Institute of Microbiology, Department of Biology, ETH Zurich, Vladimir‐Prelog‐Weg, Zurich, Switzerland
Contribution
Formal analysis, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0003-3065-0314
Fabien Lombard
1. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
2. Sorbonne Universités, CNRS, Laboratoire d’Oceanographie de Villefranche, LOV, Villefranche‐sur‐Mer, France
3. Institut Universitaire de France (IUF), Paris, France
Contribution
Data curation, Formal analysis, Investigation, Writing – review and editing, Resources

Competing interests
No competing interests declared
Lee Karp-Boss

School of Marine Sciences, University of Maine, Orono, United States

Contribution
Writing – review and editing

Competing interests
No competing interests declared
Chris Bowler
1. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
2. Institut de Biologie de l’Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
3. Institut de Biologie de l’Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
Contribution
Investigation, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0003-3835-6187
Matthew B Sullivan
1. Department of Microbiology, The Ohio State University, Columbus, United States
2. EMERGE Biology Integration Institute, The Ohio State University, Columbus, United States
3. Center of Microbiome Science, The Ohio State University, Columbus, United States
4. Department of Civil, Environmental and Geodetic Engineering, The Ohio State University, Columbus, United States
Contribution
Writing – review and editing

Competing interests
No competing interests declared
Eric Karsenti
1. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
2. Institut de Biologie de l’Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
3. Directors’ Research European Molecular Biology Laboratory, Heidelberg, Germany
Contribution
Supervision, Writing – review and editing

Competing interests
No competing interests declared
Mahendra Mariadassou

MaIAGE, INRAE, Université Paris‐Saclay, Jouy‐en‐Josas, France

Contribution
Funding acquisition, Writing – review and editing

Competing interests
No competing interests declared
Ian Probert
1. Sorbonne Université, CNRS, Station Biologique de Roscoff, UMR7144, ECOMAP, Roscoff, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
Contribution
Writing – original draft, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-1643-1759
Pierre Peterlongo

Univ Rennes, CNRS, Inria, IRISA-UMR 6074, Rennes, France

Contribution
Methodology, Investigation, Writing – review and editing

Competing interests
No competing interests declared
Patrick Wincker
1. Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, CEA, CNRS, Université Evry, Université Paris‐Saclay, Evry, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
Contribution
Supervision, Writing – review and editing, Funding acquisition

Competing interests
No competing interests declared
Colomban de Vargas
1. Sorbonne Université, CNRS, Station Biologique de Roscoff, UMR7144, ECOMAP, Roscoff, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
Contribution
Conceptualization, Software, Writing – original draft, Writing – review and editing

For correspondence
vargas@sb-roscoff.fr

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-6476-6019
Maurizio Ribera d'Alcalà

Stazione Zoologica Anton Dohrn, Villa Comunale, Naples, Italy

Contribution
Conceptualization, Formal analysis, Supervision, Writing – original draft, Writing – review and editing

For correspondence
maurizio@szn.it

Competing interests
No competing interests declared
Daniele Iudicone

Stazione Zoologica Anton Dohrn, Villa Comunale, Naples, Italy

Contribution
Conceptualization, Formal analysis, Methodology, Supervision, Writing – original draft, Writing – review and editing, Funding acquisition, Investigation

For correspondence
iudicone@szn.it

Competing interests
No competing interests declared
Olivier Jaillon
1. Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, CEA, CNRS, Université Evry, Université Paris‐Saclay, Evry, France
2. Research Federation for the study of Global Ocean systems ecology and evolution, FR2O22/Tara GOSEE, Paris, France
Contribution
Conceptualization, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review and editing, Funding acquisition, Investigation, Supervision

For correspondence
ojaillon@genoscope.cns.fr

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-7237-9596

Funding

Agence Nationale de la Recherche (HYDROGEN/ANR-14-CE23-0001)

Gaëtan Benoit
Tom O Delmont
Romain Narci

Agence Nationale de la Recherche (OCEANOMICS/ANR-11-BTBR-0008)

Nicolas Henry
Cédric Berney
Sarah Romac
Colomban de Vargas

National Science Foundation (OCE-1536989)

Jennifer R Brum
Matthew B Sullivan
Simon Roux

European Commission (MicroB3/287589)

Antonio Fernàndez-Guerra

European Research Council (INMARE/634486)

Antonio Fernàndez-Guerra

Commissariat à l'Énergie Atomique et aux Énergies Alternatives (Graduate Student Fellowship)

Paul Frémont

Graphene Flagship (RITMARE)

Daniele Iudicone
Maurizio Ribera d'Alcalà

Premiale (MIUR NEMO)

Maurizio Ribera d'Alcalà
Daniele Iudicone

Conseil Régional de Bretagne (Postdoctoral Fellowship)

Daniel J Richter

Institute for Bioengineering of Catalonia (Beatriu de Pinós Postdoctoral Fellowship)

Daniel J Richter

“la Caixa” Foundation (LCF/BQ/PI19/11690008)

Daniel J Richter

European Research Council (949745)

Daniel J Richter

National Science Foundation (OCE-1829831)

Matthew B Sullivan

Gordon and Betty Moore Foundation (3709)

Matthew B Sullivan

Ohio Super Computer Center (HPC support)

Matthew B Sullivan

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We acknowledge Oliver Jahn and Mick Follows for providing numerical simulations of particle trajectories from Tara Oceans stations. We also acknowledge Stéphane Audic for assistance with metabarcoding analyses, Claude Scarpelli for support in high-performance computing, Mathieu Raffinot and Dominique Lavenier for discussions on sequence comparison algorithms, Samuel Chaffron for help with sample contextual data, Noan Le Bescot (Ternog Design) for assistance in preparing figures, and Marion Gehlen. We thank all members of the Tara Oceans consortium for maintaining a creative environment and for their constructive criticism.

We thank the commitment of the following people and sponsors who made this expedition possible: CNRS (in particular Groupement de Recherche GDR3280), European Molecular Biology Laboratory (EMBL), Genoscope/CEA, Fund for Scientific Research – Flanders, VIB, Stazione Zoologica Anton Dohrn, UNIMIB, Paris Sciences et Lettres (PSL) Research University (ANR-11-IDEX-0001–02), the French Government ANR (projects FRANCE GENOMIQUE/ANR-10-INBS-09, MEMO LIFE/ANR-10-LABX-54, POSEIDON/ANR-09-BLAN-0348, PROMETHEUS/ANR-09-PCS-GENM-217, MAPPI/ANR-2010-COSI-004, TARA-GIRUS/ANR-09-PCS-GENM-218), US NSF grant DEB-1031049, FWO, BIO5, Biosphere 2, Agnès b., the Veolia Environment Foundation, Région Bretagne, World Courier, Illumina, Cap L’Orient, the EDF Foundation EDF Diversiterre, FRB, the Prince Albert II de Monaco Foundation, Etienne Bourgois, the Tara schooner and its captain and crew. We thank MERCATOR-CORIOLIS and ACRI-ST for providing daily satellite data during the expedition. The bulk of genomic computations were performed using the Airain HPC machine provided through GENCI- [TGCC/CINES/IDRIS] (grants t2011076389, t2012076389, t2013036389, t2014036389, t2015036389 and t2016036389). We are also grateful to the French Ministry of Foreign Affairs for supporting the expedition and to the countries who granted us sampling permissions. Tara Oceans would not exist without continuous support from 23 institutes (http://oceans.taraexpeditions.org).

This article is contribution number 136 of Tara Oceans.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

3,902

views
746

downloads
65

citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Citations by DOI

65

citations for umbrella DOI https://doi.org/10.7554/eLife.78129

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Mendeley

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Daniel J Richter
Romain Watteaux
Thomas Vannier
Jade Leconte
Paul Frémont
Gabriel Reygondeau
Nicolas Maillet
Nicolas Henry
Gaëtan Benoit
Ophélie Da Silva
Tom O Delmont
Antonio Fernàndez-Guerra
Samir Suweis
Romain Narci
Cédric Berney
Damien Eveillard
Frederick Gavory
Lionel Guidi
Karine Labadie
Eric Mahieu
Julie Poulain
Sarah Romac
Simon Roux
Céline Dimier
Stefanie Kandels
Marc Picheral
Sarah Searson
Tara Oceans Coordinators
Stéphane Pesant
Jean-Marc Aury
Jennifer R Brum
Claire Lemaitre
Eric Pelletier
Peer Bork
Shinichi Sunagawa
Fabien Lombard
Lee Karp-Boss
Chris Bowler
Matthew B Sullivan
Eric Karsenti
Mahendra Mariadassou
Ian Probert
Pierre Peterlongo
Patrick Wincker
Colomban de Vargas
Maurizio Ribera d'Alcalà
Daniele Iudicone
Olivier Jaillon

(2022)

Genomic evidence for global ocean plankton biogeography shaped by large-scale current systems

eLife 11:e78129.

https://doi.org/10.7554/eLife.78129

Share this article

Cite this article

Plankton biogeography, environmental variation, and ocean transport among Tara Oceans stations.

Metagenomic dissimilarity and travel time of plankton are maximally correlated up to ~1.5 years.

Plankton travel time, metagenomic dissimilarity, and environmental differences show different temporal patterns of pairwise correlation.

Author details

Daniel J Richter

Contribution

Contributed equally with

Competing interests

Romain Watteaux

Contribution

Contributed equally with

Competing interests

Thomas Vannier

Contribution

Contributed equally with

Competing interests

Jade Leconte

Contribution

Competing interests

Paul Frémont

Contribution

Competing interests

Gabriel Reygondeau

Contribution

Competing interests

Nicolas Maillet

Contribution

Competing interests

Nicolas Henry

Contribution

Competing interests

Gaëtan Benoit

Contribution

Competing interests

Ophélie Da Silva

Contribution

Competing interests

Tom O Delmont

Contribution

Competing interests

Antonio Fernàndez-Guerra

Contribution

Competing interests

Samir Suweis

Contribution

Competing interests

Romain Narci

Contribution

Competing interests

Cédric Berney

Contribution

Competing interests

Damien Eveillard

Contribution

Competing interests

Frederick Gavory

Contribution

Competing interests

Lionel Guidi

Contribution

Competing interests

Karine Labadie

Contribution

Competing interests

Eric Mahieu

Contribution

Competing interests

Julie Poulain

Contribution

Competing interests

Sarah Romac

Contribution

Competing interests

Simon Roux

Contribution

Competing interests

Céline Dimier

Contribution