Intra-species differences in population size shape life history and genome evolution

Version of Record: November 20, 2020
Version of Record: September 1, 2020

Download
Cite
Share
CommentOpen annotations (there are currently 0 annotations on this page).

Altmetric provides a collated score for online attention across various platforms and media.
See more details

1. Part of Collection
Aging, Geroscience and Longevity: A Special Issue

Edited by Matt Kaeberlien et al.

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

The evolutionary forces shaping life history divergence within species are largely unknown. Turquoise killifish display differences in lifespan among wild populations, representing an ideal natural experiment in evolution and diversification of life history. By combining genome sequencing and population genetics, we investigate the evolutionary forces shaping lifespan among wild turquoise killifish populations. We generate an improved reference genome assembly and identify genes under positive and purifying selection, as well as those evolving neutrally. Short-lived populations from the outer margin of the species range have small population size and accumulate deleterious mutations in genes significantly enriched in the WNT signaling pathway, neurodegeneration, cancer and the mTOR pathway. We propose that limited population size due to habitat fragmentation and repeated population bottlenecks, by increasing the genome-wide mutation load, exacerbates the effects of mutation accumulation and cumulatively contribute to the short adult lifespan.

Introduction

The extent to which drift and selection shape life history trait evolution across species in nature is a fundamental question in evolutionary biology. Variations in population size among natural populations is expected to affect the rate of accumulation of advantageous and slightly deleterious gene variants, hence impacting the relative contribution of selection and drift to genetic polymorphisms (Lanfear et al., 2014). Populations living in fragmented habitats, subjected to continuous and severe bottlenecks, are expected to undergo dramatic population size reduction and drift, which can significantly impact the accumulation of genetic polymorphisms in genes affecting important life history traits (Nonaka et al., 2019). The two main evolutionary theories of aging explain aging as the consequence of two fundamentally different processes. The mutation accumulation theory of aging (MA) attributes the evolution of aging to germline-encoded genetic variants accumulating in populations due to the age-dependent weakening of purifying selection, which becomes less efficient to remove from the gene pool gene variants that negatively impact fitness in late life (Charlesworth, 2000). The antagonistic pleiotropy (AP) theory of aging, instead, states that positive selection could favor gene variants that, while overall beneficial for individual fitness, may have detrimental effects in late life (Charlesworth, 2000; Williams, 1957). Although the two theories are not mutually exclusive and can both in principle explain the evolution of aging-related variants across species, their genetic traces in the genome should be distinguishable. In fact, while aging-determining gene variants occurring due to mutation accumulation evolve as nearly-neutral variants, aging-determining gene variants emerging via antagonistic pleiotropism evolve as positive selected variants.

Among vertebrates, killifish represent a unique system, as they repeatedly and independently colonized highly fragmented habitats, characterized by cycles of rainfalls and drought (Furness, 2016). While on the one hand intermittent precipitation and periodic drought pose strong selective pressures leading to the evolution of embryonic diapause, an adaptation that enables killifish to survive in absence of water (Cellerino et al., 2016; Hu and Brunet, 2018), on the other hand they cause habitat and population fragmentation, promoting inbreeding and genetic drift. The co-occurrence of strong selective pressure for early-life on the one hand and population size decline leading to genetic drift on the other hand characterizes life history evolution in African annual killifishes (Cui et al., 2019).

The turquoise killifish (Nothobranchius furzeri) is the shortest-lived vertebrate with a thoroughly documented post-embryonic life, which, in the shortest-lived strains, amounts to four months (Cellerino et al., 2016; Hu and Brunet, 2018; Kim et al., 2016; Blazek, 2017). Turquoise killifish has recently emerged as a powerful new laboratory model to study experimental biology of aging due to its short lifespan and to its wide range of aging-related changes, which include neoplasias (Di Cicco et al., 2011), decreased regenerative capacity (Wendler et al., 2015), cellular senescence (Ahuja et al., 2019; Valenzano et al., 2006), and loss of microbial diversity (Smith et al., 2017). At the same time, while sharing physiological adaptations that enable embryonic diapause and rapid sexual maturation, different wild turquoise killifish populations display differences in lifespan, both in the wild and in captivity (Terzibasi et al., 2008; Valenzano et al., 2015; Vrtílek et al., 2018), making this species an ideal evolutionary model to study the genetic basis underlying life history trait divergence within species.

Characterization of life history traits in wild-derived laboratory strains of turquoise killifish revealed that while different populations have similar rates of sexual maturation (Blazek, 2017), populations from arid regions exhibit the shortest lifespans, while populations from more semi-arid regions exhibit longer lifespans (Blazek, 2017; Terzibasi et al., 2008). Hence, speed of sexual maturation and adult lifespan appear to be independent in turquoise killifish populations. The evolutionary mechanisms responsible for the lifespan differences among turquoise killifish populations are not yet clearly understood. Mapping genetic loci associated with lifespan differences among turquoise killifish populations showed that adult survival has a complex genetic architecture (Valenzano et al., 2015; Kirschner et al., 2012). Here, combining genome sequencing and population genetics, we investigate to what extent genomic divergence in natural turquoise killifish populations that differ in lifespan is driven by adaptive or neutral evolution, compatible with either the antagonistic pleiotropy (AP) theory of aging or with the mutation accumulation (MA) theory of aging, respectively.

Results

Genome assembly improvement and gene annotation

To identify the genomic mechanism that led to the evolution of differences in lifespan between natural populations of the turquoise killifish (Nothobranchius furzeri), we combined the currently available reference genomes (Valenzano et al., 2015; Reichwald et al., 2015) into an improved reference turquoise killifish genome assembly. Due to the high repeat content, genome assembly from short reads required a highly integrated and multi-platform approach. We ran Allpaths-LG with all the available pair-end sequences, producing a combined assembly with a contig N50 of 7.8 kb, corresponding to a ~ 2 kb improvement from the previous versions. Two newly obtained 10X Genomics linked read libraries were used to correct and link scaffolds, resulting in a scaffold N50 of 1.5 Mb, that is a three-fold improvement from the best previous assembly. With the improved continuity, we assigned 92.2% of assembled bases to the 19 linkage groups using two RAD-tag maps (Valenzano et al., 2015). Gene content assessment using the BUSCO method improved “complete” BUSCOs from 91.43% (Valenzano et al., 2015) and 94.59% (Reichwald et al., 2015) to 95.20%. We mapped Genbank N. furzeri RefSeq RNA to the new assembly to predict gene models. The predicted gene model set is 96.1% for ‘complete’ BUSCOs. The overall size of repeated regions (masked regions) is 1.003 Gb, accounting for 66% of the entire genome, that is 20% higher than a previous estimate (Reichwald et al., 2009).

Population genetics of natural turquoise killifish populations

Natural populations of turquoise killifish occur along an aridity gradient in Zimbabwe and Mozambique and populations from more arid regions are associated with shorter captive lifespan (Blazek, 2017; Terzibasi et al., 2008). A QTL study performed between short-lived and long-lived turquoise killifish populations showed a complex genetic architecture of lifespan (measured as age at death), with several genome-wide loci associated with lifespan differences among long-lived and short-lived populations (Valenzano et al., 2015). To further investigate the evolutionary forces shaping genetic differentiation in the loci associated with lifespan among wild turquoise killifish populations, we performed pooled whole-genome-sequencing (WGS) of killifish collected from four sampling sites within the natural turquoise killifish species distribution, which vary in altitude, annual precipitation and aridity (Figure 1—figure supplement 1, Supplementary file 1A). Population GNP is located within the Gonarezhou National Park at high altitude and in an arid climate (Koeppen-Geiger classification ‘BWh’, Figure 1—figure supplement 1), in a region at the outer edge of the turquoise killifish distribution (Figure 1—figure supplement 1; Dorn et al., 2011; Bartáková et al., 2013; Bartáková et al., 2015), which corresponds to the place of origin of the ‘GRZ’ laboratory strain, which has the shortest lifespan of all laboratory strains of turquoise killifish (Terzibasi et al., 2008; Valenzano et al., 2015). Population NF414 (MZCS 414) is located in an arid area in the center of the Chefu river drainage in Mozambique (‘BWh’, Figure 1—figure supplement 1; Dorn et al., 2011; Bartáková et al., 2013; Bartáková et al., 2015), and population NF303 (MZCS 303) is located in a semi-arid area in transition to more humid climate zones in the center of the Limpopo river drainage system (Koeppen-Geiger classification ‘BSh’, Figure 1—figure supplement 1; Dorn et al., 2011; Bartáková et al., 2013; Bartáková et al., 2015). Altitude among localities ranges from 344 m (GNP) to 68 m (NF303, Figure 1—figure supplement 1a and Supplementary file 1A). The temporary habitat of turquoise killifish populations differs in terms of altitude and aridity, as the ephemeral pools at higher altitude are drained earlier and persist for shorter time, while water bodies in habitats at lower altitude last longer (Terzibasi et al., 2008). Population GNP is therefore named ‘dry’, population NF414 is named ‘intermediate’ and population NF303 ‘wet’ throughout the manuscript. The populations used in this study are from localities that belong to the same drainage system as those used in the previous QTL study and their relative position is included in Figure 1—figure supplement 2.

High genetic differentiation and contrasting population demography in dry and wet populations

We asked whether populations from dry, intermediate and wet areas, corresponding to shorter and progressively longer lifespan, differ in genetic variability. We calculated genome-wide estimates of average pairwise difference (π) and genetic diversity (θ_Watterson) based on 50kb-non-overlapping sliding windows using PoPoolation (Kofler et al., 2011a). We found that π and θ_Watterson decrease from wet to dry population (θ_{Watterson GNP}: 0.0011, θ_{Watterson NF414}: 0.0036, θ_{Watterson NF303}: 0.0072; π_GNP: 0.0009, π_NF414: 0.0031, and π_NF303: 0.0054). Hence, dry populations have overall smaller genetic diversity than populations from less dry regions. To infer the genetic distance between the populations, we computed the genome-wide pairwise genetic differentiation between populations using F_ST (Kofler et al., 2011b). Overall, the genetic differentiation between populations ranged between 0.14 and 0.26 and was the highest between the more geographically distant population GNP (dry) and population NF303 (wet) (Figure 1a).

Figure 1 with 2 supplements see all

Download asset Open asset

Demography and natural occurrence of turquoise killifish populations.

(a) Inferred ancestral effective population size (*N_e*) (using PSMC’) on y-axis and past generations on x-axis in GNP (red, orange), NF414 (black, grey) and NF303 (blue). Inset: unrooted neighbor joining tree based on pairwise genetic differentiation (F_ST) values. (b) Geographical locations of sampled natural population of turquoise killifish (*Nothobranchius furzeri)*. The area of the colored circles represents the estimated effective population size (*N_e*) based on θ_Watterson. (c) Natural environment of turquoise killifish and schematic of the annual life cycle. Figure 1 was partly made with Biorender.

Next, we inferred the demographic history of the populations using pairwise sequentially Markovian coalescent (PSMC) by resequencing at high-coverage single individuals for each population (Schiffels and Durbin, 2014). The population GNP (dry) experienced a strong population decline starting approximately 150 k generations ago, a result consistent for both the sequenced individuals from the two sampling sites (GNP-G1-3 and GNP-G4, Figure 1a). In contrast to the demographic history in GNP, we found indications for recent population expansions in populations from the center of the Chefu and Limpopo basins clades. Analysis of population NF414 (intermediate) (Figure 1a, NF414-Y and NF414-R) and NF303 (wet) (Figure 1a, blue line) shows population expansion until recent time (~50 k generations ago). To infer the effective population size (N_e) of the populations, we used the published mutational rate of 2.6321e−9 per base pair per generation for Nothobranchius, computed via dated phylogeny and θ_Watterson (Cui et al., 2019). In line with the decrease in genetic diversity from wet to dry population, we found a decrease in N_e estimates (107221.8, 338849.48 and 683693.25 for GNP, NF414 and NF303, respectively; Figure 1b). Hence, our findings show that dry populations from the outer edge of the species distribution show lower genetic diversity and smaller effective population size compared to population from intermediate and more wet regions.

Genetic differentiation among turquoise killifish populations

To test whether regions underlying longevity QTL in turquoise killifish (Valenzano et al., 2015; Kirschner et al., 2012) display a genetic signature for positive or purifying selection in these wild populations, we took advantage of the improved turquoise killifish genome assembly and the newly sequenced wild turquoise killifish populations (Figure 2). The strongest QTL for lifespan differences among long-lived and short-lived populations mapped on the sex chromosome (Valenzano et al., 2015; Kirschner et al., 2012), in proximity to the sex determining locus (Valenzano et al., 2015).

Figure 2

Download asset Open asset

Genomic regions of high and low genetic divergence between pairs of turquoise killifish populations.

Left) Genomic regions with high or low genetic differentiation between turquoise killifish populations identified with an F_ST outlier approach. Z-transformed F_ST values of all pairwise comparisons in solid lines, with ‘NF303vsNF414’ in yellow, ‘NF303vsGNP’ in blue, and ‘NF414vsGNP’ in green. The significance thresholds of upper and lower 5‰ are shown as dotted lines with same color coding. Center) Circos plot of Z-transformed F_ST values between all pairwise comparisons with ‘NF303vsNF414’ in the inner circle (yellow), ‘NF414vsGNP’ in the middle circle (green), and ‘NF303vsGNP’ in the outer circle (blue). Right) Pairwise genetic differentiation based on F_ST in the four main clusters associated with lifespan (QTL from Valenzano et al., 2015).

To identify a genomic signature of strong selection, we performed an outlier approach based on the pairwise genetic differentiation index (F_ST). To find highly differentiated regions that may underlie positive selection in natural turquoise killifish populations, we scanned for regions with elevated genetic differentiation between pairs of populations, that is exceeding the 0.995 quantile of Z-transformed non-overlapping 50 kb sliding windows of F_ST. To find regions under purifying selection, we scanned for regions with lowered genetic differentiation among populations, that is below the 0.005 quantile of Z-transformed non-overlapping 50 kb sliding windows of F_ST (Supplementary file 1G).

The outlier approach did not reveal clear signatures of positive or purifying selection based on genetic differentiation in the four main chromosomal clusters associated with lifespan in experimental strains of turquoise killifish (Figure 2).

We then analyzed genomic regions carrying signatures of positive and purifying selection in the natural turquoise killifish populations irrespective of the QTL regions (Figure 2). The F_ST outlier approach led to the identification of several potential regions under strong selection between populations, in particular between the intermediate and wet populations (Supplementary file 1D) and only two between the dry and wet populations (Supplementary file 1E). Genes significantly different and within regions of larger genetic differentiation based on Z-transformed non-overlapping sliding windows of F_ST were located on chromosomes 6 and 10. The region on chromosome six includes the gene slc8a1, which contains mutations with significant difference in allele frequencies between the wet and intermediate population (Fisher’s exact test implemented in PoPoolation; adjusted p value < 0.001). The region on chromosome 10 contains four genes: XM_015941868, XM_015941869, lss and hibch. All genes under the major F_ST peak on chromosome 10 showed significant difference in allele frequencies between the intermediate and wet population (Fisher’s exact test; adjusted p value < 0.001) and additionally, hibch had significantly different allele frequencies between the dry and wet population (Fisher’s exact test; adjusted p value < 0.001).

Age-specific changes in genes with sequence divergence between populations

Genes under F_ST peaks between populations that differ in lifespan, are not necessarily causally involved in lifespan differences between populations, as sequence differences could segregate in populations due to population structure and drift. However, to test whether the genes located in genomic regions that are significantly divergent between populations could be functionally involved in age-related phenotypes, we investigated whether gene expression in these genes varied as a function of age. Analyzing available turquoise killifish longitudinal RNA-Seq datasets generated in liver, brain and skin (Baumgart et al., 2017), we found that hibch, lss and slc8a1 are differentially expressed between adult and old killifish (Supplementary file 1J, adjusted p value < 0.01). hibch, lss and slc8a1 are involved in amino acid metabolism (Ferdinandusse et al., 2013), biosynthesis of cholesterol (Huff and Telford, 2005), and proton-mediated accelerated aging (Osanai et al., 2018), respectively. Gene XM_015956265 (ZBTB14) is the only gene that is an F_ST outlier and that is differentially expressed in adult vs. old individuals between at least two populations in all tissues (liver, brain and skin). XM_015956265 encodes a transcriptional modulator with ubiquitous functions, ranging from activation of dopamine transporter to repression of myc, fmr1 and thymidine kinase promoters (Orlov et al., 2007). However, although genomic regions that have sequence divergence between turquoise killifish populations contain genes that are differentially expressed during aging in different tissues, whether any of these genes are causally involved in modulating aging-related changes between turquoise killifish wild populations still remains to be assessed. We could not find enrichment of significant differentially expressed genes within the F_ST outlier regions (Fisher’s exact test p value > 0.05).

Genomic regions of low genetic differentiation among populations

Based on the outlier approach, we found two genomic regions with low genetic differentiation between all pairs of populations, suggesting strong purifying selection. The first region is located on the sex chromosome and contains the putative sex determining gene gdf6 (Reichwald et al., 2015), which is hence conserved among these populations. This same region also contains sybu, a maternal-effect gene associated with the establishment of embryo polarity (Nojima et al., 2010). The second region under low genetic differentiation is located on chromosome nine and harbors the genes XM_015965812 (abi2-like), cnot11 and lcp1, which are involved in phagocytosis (Ulvila et al., 2011), mRNA degradation (Mauxion et al., 2013) and cell motility (Kell et al., 2018), respectively. Signatures of low and high genetic differentiation between populations can be the result of purifying or positive selection. However, balancing selection, a mechanism that could maintain polymorphism above the expected genetic diversity, could also in part result in genetic differentiation (Brandt et al., 2018). To account for balancing selection, we compared the pairwise genetic diversity (π) among populations and we could not find signatures of elevated genetic diversity within the investigated regions under strong selection.

Hence, we could not find a clear evidence of positive or purifying selection in correspondence with the survival QTL previously identified, suggesting that genomic regions associated with natural lifespan differences may have not evolved due to positive selection or have being maintained under purifying selection. However, we cannot exclude that we could not detect positive selection at the QTL regions due to statistical power or that the populations used in this study and those used for the QTL analysis had a different genetic architecture of lifespan.

Evolutionary origin of the sex chromosome

Since we found reduced genetic differentiation among populations in the chromosomal region containing the putative sex-determining gene in the sex chromosome, we used synteny analysis and the new genome assembly to investigate the genomic events that led to evolution of this chromosomal region (Figure 3). We found that the structure of the turquoise killifish sex chromosome is compatible with a chromosomal translocation within an ancestral chromosome and a fusion event between two chromosomes. The translocation event within an ancestral chromosome corresponding to medaka´s chromosome 16 and platyfish´s linkage group three led to a repositioning of a chromosomal region containing the putative sex-determining gene gdf6 (Figure 3b). The fusion of the translocated chromosome with a chromosome corresponding to medaka´s chromosome eight and platyfish´s linkage group 16, possibly led to the origin of turquoise killifish's sex chromosome. We could hence reconstruct a model for the origin of the turquoise killifish sex chromosome (Figure 3c), which parsimoniously places a translocation event before a fusion event. The occurrence of two major chromosomal rearrangements could have then contributed to suppressing recombination around the sex-determining region (Valenzano et al., 2015; Valenzano et al., 2009).

Figure 3

Download asset Open asset

Synteny and sex chromosome evolution in turquoise killifish.

(a) Synteny circos plots based on 1-to-1 orthologous gene location between the new turquoise killifish assembly (black chromosomes) and platyfish (*Xiphophorus maculatus*, colored chromosomes, left circos plot) and between the new turquoise killifish assembly (black chromosomes) and medaka (*Oryzias latipes*, colored chromosomes, right circos plot). Orthologous genes in concordant order are visualized as one syntenic block. Synteny regions are connected via color-coded ribbons, based on their chromosomal location in platyfish or medaka. If the direction of the syntenic sequence is inverted compared to the compared species, the ribbon is twisted. Outer data plot shows –log(q-value) of survival quantitative trait loci (QTL, ordinate value between 0 and 3.5, every value above 3.5 is visualized at 3.5 [Valenzano et al., 2015]) and the inner data plot shows –log(q-value) of the sex QTL (ordinate value between 0 and 3.5, every value above 3.5 is visualized at 3.5). Boxes between the two circos plots show genes within the peak regions of the four highest –log(q-value) of survival QTL on independent chromosomes (red box) and the highest association to sex (black box). (b) High resolution synteny map between the sex-chromosome of the turquoise killifish (Chr3) with platyfish chromosome 16 and 3 in the upper plot, and between the turquoise killifish and medaka chromosome 8 and 16 (lower plot). The middle plot shows the QTLs for survival and sex along the turquoise killifish sex chromosome. (c) Model of sex chromosome evolution in the turquoise killifish. A translocation event within one ancestral autosome led to the emergence of a chromosomal region harboring a new sex-determining-gene (SDG). The fusion of a second autosome led to the formation of the current structure of the turquoise killifish sex chromosome.

Relaxed selection in turquoise killifish populations

Since we could not identify specific signatures of genetic differentiation in the genomic regions associated with longevity from previous QTL mapping, we asked whether other evolutionary forces than directional selection may underlie differences in survival among wild turquoise killifish populations. The difference in the recent and past demography between populations (Figure 1) led us to ask whether demography could have led to evolutionary changes on genome-wide scale between natural populations. For each population, we calculated the fraction of substitutions driven to fixation by positive selection since divergence from the outgroup species Nothobranchius orthonotus (NOR) using the asymptotic McDonald-Kreitman α (Messer and Petrov, 2013). The original McDonald-Kreitman α (which ranges from – ∞ to 1) was designed to calculated the rate of adaptation by comparing the polymorphisms (within species) and divergence (between species) at neutral and functional sites (McDonald and Kreitman, 1991). While McDonald-Kreitman α = 0 indicates neutrality, larger and positive values of α mean that a given population has an elevated proportion of genetic variants driven by natural selection, while negative values of α can be an indication of deleterious variants. The asymptotic McDonald-Kreitman α accounts for a range of derived allele frequencies, enabling to identify slightly deleterious mutations as those segregating at lower derived allele frequencies (Messer and Petrov, 2013). Variants at low derived allele frequency are either neutral (if they were beneficial they would have higher frequency) or are slightly deleterious. Hence, negative McDonald-Kreitman α values at low derived allele frequency bins likely reflect slightly deleterious gene variants. Additionally, negative α values at intermediate to higher frequency bins may indicate drift of deleterious variants. We set out to adopt this method to assess the genetic variants for each population, compared to two different outgroup species.

Using Nothobranchius orthonotus (NOR) as an outgroup, we inferred the fraction of positive selection by pooling all coding sites (Figure 4a). SNPs were called with the program SNAPE (Raineri et al., 2012), which specifically deals with pooled sequencing. We only included SNPs with a derived frequency between 0.05–0.95 and performed stringent filtering. The asymptotic McDonald-Kreitman α ranged from −0.21 to −0.01 in comparison to the very closely related sister species N. orthonotus, confirming limited genome-wide positive selection since divergence from N. orthonotus (Figure 4a). The population GNP, located in an arid region at higher altitude and associated with the shortest recorded lifespan, shows the lowest asymptotic McDonald-Kreitman α, as well as lower McDonald-Kreitman α values throughout all derived frequency bins, potentially suggesting a higher load of slightly deleterious mutations segregating in this population (Figure 4a). Using as an outgroup species another annual killifish species, Nothobranchius rachovii (NRC), we confirmed the lowest asymptotic McDonald-Kreitman α value in the dry population GNP (Figure 4b). Additionally, using Nothobranchius rachovii (NRC) as outgroup species, the asymptotic McDonald-Kreitman α ranged from −0.06 to 0.23 among populations, indicating that more alleles were driven to fixation by positive selection in the ancestral lineage leading to Nothobranchius furzeri and Nothobranchius orthonotus. In particular, the wet population NF303 had the highest asymptotic McDonald-Kreitman α value (Figure 4b). Using both N. orthonotus and N. rachovii as outgroups, we found that the dry GNP population had the lowest McDonald-Kreitman α values at the low derived frequency bins, potentially consistent with a genome-wide accumulation of slightly deleterious mutations in these isolated populations.

Figure 4 with 2 supplements see all

Download asset Open asset

Genome-wide signatures of natural and relaxed selection in turquoise killifish populations.

Asymptotic McDonald-Kreitman alpha (MK α) analysis based on derived frequency bins using as outgroups (a) *Nothobranchius orthonotus* and (b) *Nothobranchius rachovii*. Population GNP is shown in red, NF414 in black, and NF303 in blue. (c) Proportion of non-synonymous SNPs binned in allele frequencies of non-reference (alternative) alleles for GNP (red), NF414 (black) and NF303 (blue). (d) Negative distribution of fitness effects of populations GNP (red), NF414 (black) and NF303 (blue) with cumulative proportion of deleterious SNPs on y-axis and the compound measure of *4N_es* on x-axis. (e) Proportion of different effect types of SNPs in coding sequences of all populations. The effect on amino acid sequence for each genetic variant is represented by colors (legend). Significance is based on ratio between synonymous effects to non-synonymous effects (significance based on Chi-square test).

Estimating the distribution of fitness effect across populations

To directly estimate the fitness effect of gene variants associated with each population, we analyzed population-specific genetic polymorphisms to assign mutations as beneficial, neutral or detrimental, and determine the distribution of fitness effect (DFE) (Tataru et al., 2017) of new mutations. Consistently with the overall lower McDonald-Kreitman α values throughout all derived frequency bins, we found more new mutations assigned as the slightly deleterious category in the dry GNP population, compared to the other two populations (indicated by the higher number of deleterious SNPs in proximity to 4N_eS ~ 0 in the GNP population, Figure 4d, Supplementary file 1H). To independently validate our findings, we ran a simulation using SLiM3, which recapitulated the population divergence from an ancestral population, followed with diverging population size as inferred from the PSMC’ analysis (Figure 4—figure supplement 1). Analyzing the distribution of fitness effect in these simulated populations, we confirmed that populations with smaller effective population size have a higher proportion of new slightly deleterious variants, compared to larger populations, which have relatively more newly arising gene variants that are highly deleterious, indicating that purifying selection in the large population is more efficient in removing mutations with deleterious effects. To further infer the effect of the putative deleterious mutations on protein function, we used the new turquoise killifish genome assembly as a reference and adopted an approach that, by analyzing sequence polymorphism among populations, predicts functional consequences at the protein level (Cingolani et al., 2012). We found that the proportion of mutations causing a change in protein function is significantly larger in the GNP population compared to populations NF414 and NF303 (Chi-square test: P_GNP-NF303 <1.87e-119, P_GNP-NF414 <4.96e-57, P_NF303-NF414< 3.51e-35, Figure 4e). Additionally, the mutations with predicted deleterious effects on protein function reached also higher frequencies in the dry population GNP (Figure 4c).

Distribution of mutations at conserved sites

To further investigate the impact of mutations on protein function, we calculated the Consurf (Pupko et al., 2002; Mayrose et al., 2004; Glaser et al., 2003; Ashkenazy et al., 2016) score, which determines the evolutionary constraint on an amino acid, based on sequence conservation. Mutations at amino acid positions with high Consurf score (i.e. otherwise highly conserved) are considered to be more deleterious. We found that the dry population GNP had a significantly higher mean Consurf score for mutations at non-synonymous sites in frequency bins from 5–20% up to 40–60%, compared to populations NF414 (intermediate) and NF303 (wet) (Figure 4—figure supplement 2). The mutations in the dry GNP population had significantly higher Consurf scores than the other populations using both outgroup species N. orthonotus and N. rachovii (Figure S3). Upon exclusion of potential mutations at neighboring sites (CMD: codons with multiple differences), CpG hypermutation and genes containing mutations with highly detrimental effect on protein function based on SnpEFF analysis, the dry population GNP had higher mean Consurf score at the low frequency bin (Figure 4—figure supplement 2, Supplementary file 1K-L). To note, we also found a significantly higher average Consurf score at synonymous sites in GNP at low derived frequencies (Figure 4—figure supplement 1, Supplementary file 1K-L), possibly suggestive of an overall higher mutational rate in GNP.

Relaxation of selection in age-related disease pathways

To further test whether populations from dry environments accumulated a higher load of deleterious gene variants, we computed the gene-wise direction of selection (DoS) (Stoletzki and Eyre-Walker, 2011) index, which measures the strength of selection based on the count of mutations in non-synonymous and synonymous sites. Indeed, we found support to the hypothesis that the dry, short-lived population GNP has significantly more slightly deleterious mutations segregating in the population, compared to the populations NF414 and NF303 (Figure 5a, Median NOR: GNP: −0.17, NF414: −0.02, NF303: −0.01; Median NRC: GNP: −0.14, NF414: 0.00, NF303: 0.00; Wilcoxon rank sum test: NOR: P_GNPNF303 <2.21e-105, P_GNP-NF414 <1.19e-76, P_NF303-NF414< 1.39e-06; NRC: P_GNP-NF303 <4.61e-179, P_GNP-NF414 <1.42e-100, P_NF303-NF414< 5.96e-22), indicating that purifying selection is relaxed in GNP. We calculated DoS in all populations using independently as outgroup species N. orthonotus and N. rachovii (Figure 5a).

Figure 5

Download asset Open asset

Pathway enrichment in genes under adaptive and neutral evolution in turquoise killifish populations.

(a) Distribution of direction of selection (DoS) represented with median of distribution for population GNP (red), NF414 (grey) and NF303 (blue). Left panel shows DoS distribution computed using *Nothobranchius orthonotus* as outgroup and right panel shows DoS distribution computed using *Nothobranchius rachovii* as outgroup. Significance based on Wilcoxon-Rank-Sum test. (b) Pathway over-representation analysis of genes below the 2.5% level of gene-wise DoS values are shown with red background and above the 97.5% level of gene-wise DoS values are shown with green background. Only pathway terms with significance level of FDR corrected q-value <0.05 are shown (in -log(q-value)). Terms enriched in population GNP have red dots, enriched in population NF414 have black dots, and enriched in population NF303 have blue dots, respectively.

To assess whether specific biological pathways were significantly more impacted by the accumulation of slightly deleterious mutations, we performed pathway overrepresentation analysis. We found a significant overrepresentation in the lower 2.5^th DoS quantile (i.e. genes under relaxation of selection) in the GNP population for pathways associated with age-related diseases, including gastric cancer, breast cancer, neurodegenerative disease, mTOR signaling and WNT signaling (q-value <0.05, Figure 5b, Supplementary file 1I). Overall, relaxed selection in the dry GNP population affected accumulation of deleterious mutations in age-related and in the WNT pathway. Analyzing the pathways affected by genes within the upper 2.5^th DoS values – corresponding to genes undergoing adaptive evolution – we found a significant enrichment for mitochondrial pathways – potentially compensatory (Cui et al., 2019) – in population NF303 (Figure 5b, Supplementary file 1I). Overall, our results show that differences in effective population size among wild turquoise killifish are associated with an extensive relaxation of purifying selection, significantly affecting genes involved in age-related diseases, and which could have cumulatively contributed to reducing individual survival.

Discussion

The turquoise killifish (Nothobranchius furzeri) is the shortest-lived known vertebrate and while its natural populations show similar timing for sexual maturation, exhibit differences in lifespan along a cline of altitude and aridity in south-eastern Africa (Blazek, 2017; Terzibasi et al., 2008). Here we generate an improved genome assembly (NFZ v2.0) in turquoise killifish (Nothobranchius furzeri) and study the evolutionary forces shaping genome evolution among natural populations.

Using the new turquoise killifish genome assembly and synteny analysis with medaka and platyfish, we reconstructed the origin of the turquoise killifish sex chromosome, which appears to have evolved through two independent chromosomal events, that is a translocation and a fusion event.

Using the new genome assembly and pooled sequencing of natural turquoise killifish populations, we found that genetic differentiation among populations of the short-lived turquoise killifish is consistent with differences in demographic constraints. While we found that strong purifying selection maintains low genetic diversity among populations at genomic regions underlying key species-specific traits, such as in proximity to the sex-determining region, demography and genetic drift largely shape genome evolution, leading to relaxation of selection and the accumulation of deleterious mutations. We showed that isolated populations from an arid region, dwelling at higher altitude and characterized by shorter lifespan, experienced extensive population bottlenecking and a sharp decline in effective population size. Populations from dryer regions at higher altitudes experience genetic isolation and possibly steady decline in population size due to limited incoming gene flow and possibly more severe bottlenecks due to recent founder effect. However, populations from more wet regions likely undergo extensive gene flow, maintaining larger population size. We found that relaxation of selection in more drifted populations significantly affected the accumulation of deleterious gene variants in pathways associated with neurodegenerative diseases and WNT-signaling (Figure 5). While simple traits, such as male tail color and sex have a simple genetic architecture among turquoise killifish populations (Valenzano et al., 2015; Valenzano et al., 2009), we find that the complex genetic architecture of lifespan differences among killifish populations (Valenzano et al., 2015) is entirely compatible with genome-wide relaxation of selection. Additionally, the absence of genomic signature of positive selection in genomic regions underlying survival QTL in killifish suggest that, rather than directional selection, the neutral accumulation of deleterious mutations may be the evolutionary mechanism underlying survival differences among turquoise killifish populations, in line with the mutation accumulation theory of aging. The antagonistic pleiotropy theory of aging states that positive selection could lead to the fixation of gene variants that, while overall beneficial for fitness, could reduce survival and reproductive capacity in late life (Williams, 1957). However, the lack of genomic signatures of positive selection at the genomic regions underlying survival QTL in turquoise killifish rather suggests that the accumulation of deleterious mutations due to neutral drift may have played a key role in shaping genome and phenotype differences among natural turquoise killifish populations. One of the deductions of the antagonistic pleiotropy theory is that a reduction in speed of maturation should be associated with increased lifespan (Williams, 1957). However, different wild populations of turquoise killifish have similar time to sexual maturation and yet different lifespan (Blazek, 2017). Hence, the uncoupling of age of sexual maturation from adult lifespan in different turquoise killifish populations is more compatible with the mutation accumulation theory of aging. However, although we did not find evidence for it, our results do not exclude a priori the possibility that genes under strong selection may in part contribute to lifespan differences among different turquoise killifish populations, hence acting compatibly with the antagonistic pleiotropy theory of aging. However, historical fluctuations in the size of natural turquoise killifish populations, especially in isolated populations living in more arid and elevated habitats, weakened the strength of natural selection, ultimately contributing to increased load of deleterious gene variants, preferentially in genes associated with aging-related diseases and in the WNT pathway. We hypothesize that small effective population size leads to the accumulation of aging-causing mutations that together contribute to the genetic architecture of lifespan. Overall, our findings highlight the role of demographic constraints in shaping life history within species.

Share this article

Cite this article

Demography and natural occurrence of turquoise killifish populations.

Genomic regions of high and low genetic divergence between pairs of turquoise killifish populations.

Synteny and sex chromosome evolution in turquoise killifish.

Genome-wide signatures of natural and relaxed selection in turquoise killifish populations.

Pathway enrichment in genes under adaptive and neutral evolution in turquoise killifish populations.

Author details

David Willemsen

Contribution

Competing interests

Rongfeng Cui

Contribution

Competing interests

Martin Reichard

Contribution

Competing interests

Dario Riccardo Valenzano

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Further reading