Abstract
Meiotic recombination is an important evolutionary force that acts by breaking up genomic linkage, thereby increasing the efficacy of selection. Meiotic recombination is initiated with a double-strand break which is resolved via a crossover, which involves the reciprocal exchange of genetic material between homologous chromosomes, or a non-crossover, which results in small tracts of non-reciprocal exchange of genetic material. While the meiotic process is largely conserved, crossover and non-crossover rates vary between species, populations, individuals, and across the genome. In recent years, recombination is observed to be positively associated with the distribution of ancestry derived from past interspecific hybridization (introgression) in a variety of species. This trend has been interpreted to signify that introgression carries genetic incompatibilities that are selected against, such that introgression is enriched in regions of high recombination. However, recombination is well known to be suppressed in divergent sequence to prevent non-homologous recombination. Since introgressed DNA is often divergent, we sought to explore this interaction of recombination and introgression by sequencing spores and detecting crossover and non-crossover events from two crosses of the budding yeast Saccharomyces uvarum. One cross is between strains isolated from natural environments, and the other cross is between strains from fermentation environments, in which each strain contains introgression from their sister species, S. eubayanus. We find that the recombination landscape is significantly different between S. uvarum crosses, and that most of these differences can be explained by the presence of heterozygous introgression in the fermentation cross. Crossovers are significantly reduced and non-crossovers are increased in heterozygous introgression compared to syntenic regions in the natural cross without introgression. This translates to reduced allele shuffling within introgressed regions, and an overall reduction of shuffling on most chromosomes with introgression compared to the syntenic regions and chromosomes without introgression. Our results indicate that recent hybridization can significantly influence the recombination landscape, and suggest that the reduction in allele shuffling contributes to the initial purging of introgressed ancestry in the generations following a hybridization event.
Introduction
Recombination is the exchange of genetic material between homologous chromosomes during meiosis and is a staple of eukaryotic sexual reproduction. While the processes involved in recombination are largely conserved (Arter & Keeney, 2023), recombination rates vary between sexes, populations, and species (Smukowski & Noor, 2011; Stapley et al., 2017). Recombination rates also vary along the genome, with conflicting patterns of enriched or depleted recombination in promoter regions and punctate or dispersed recombination depending on the species (Auton et al., 2013; Rockman & Kruglyak, 2009; Singhal et al., 2015; Smukowski Heil et al., 2015). These patterns in recombination can affect pairing of alleles after meiosis–in other words, the shuffling of alleles–in a population. Much of the evolutionary advantage of recombination is understood to originate from its role in shuffling alleles, which increases the number of different allele combinations segregating in a population. The increase in allele combinations can reduce selection interference–the effect that genetically linked sites have on the evolutionary fate of either beneficial or deleterious alleles (Felsenstein, 1974; Hill & Robertson, 1966; McDonald et al., 2016; McGaugh et al., 2012).
How much allele decoupling is produced by recombination will depend on the type of recombination event. Each recombination event begins with the severing of both strands of a sister chromatid of one of the homologous chromosomes in what is referred to as a double-strand break (DSB) (Keeney, 2001). The distribution of DSBs is influenced by a variety of factors, many of which are organism-dependent, but often include decreases of DSBs near telomeres and centromeres as well as increases in genomic regions enriched for GC content, CpG sites, and depleted of methylation (CpG islands) and near promoter regions (de Massy, 2013; Lam & Keeney, 2015; Pan et al., 2011; Zelkowski et al., 2019). When a DSB occurs, the homologous strand is recruited to repair the break, and during this process genetic information is exchanged. The most evident and widely studied resolution of a DSB is a crossover (CO), where all the genetic information on one side of the DSB from one homologous chromosome is spliced with all the genetic information on the other side of the DSB from the other homologous chromosome. DSB resolution can also involve gene conversions–or non-crossovers (NCOs)– which result in one small segment (typically 100-2000 bp) of a homologous chromosome’s genetic information being copied onto the other (Chovnick et al., 1971; Hilliker et al., 1994; Jeffreys & May, 2004; Judd & Petes, 1988). DSBs can be resolved through both of these processes in a single instance, and even more than one NCO can occur at a single break point. Each possible resolution can produce a variety of genetic patterns at the site of a DSB. COs generally produce more allele shuffling, and therefore degrade linkage faster than NCOs, because they recombine all loci from one side of the event with all loci on the other side. However, NCOs can occur in regions where COs are typically suppressed, like centromeres and inversions (Korunes & Noor, 2019; Mancera et al., 2008; Miller et al., 2016; Schaeffer & Anderson, 2005; Shi et al., 2010; Talbert & Henikoff, 2010; Wijnker et al., 2013). NCOs are also crucial to reducing linkage within coding regions and, unlike COs, result in 3:1 allele ratio in the meiotic product at heterozygous sites, potentially changing allele frequencies (Korunes & Noor, 2017).
The number of DSBs that occur per meiosis varies by species, as does the number of DSBs that are repaired as COs or NCOs (de Massy, 2013; Korunes & Noor, 2017). In Arabidopsis and maize, several hundred DSBs are resolved into only a handful of COs, whereas in Saccharomyces cerevisiae the CO:NCO ratio is close to 2:1 (Choi et al., 2018; He et al., 2017; Mancera et al., 2008). How CO:NCO resolution may be evolving is not well understood, but evidence from Saccharomyces suggests that the DSB landscape is well conserved across species, while recombination rates significantly differ between sister species S. cerevisiae and S. paradoxus (Lam & Keeney, 2015; Liu et al., 2019). This suggests that alternate resolution of DSBs may be important in altering recombination rates between closely related species. This is further supported by the finding that several genes involved in the CO vs. NCO decision of DSB resolution show evidence of rapid evolution and directional selection in mammals (Dapper & Payseur, 2019), and that some of these genes are linked to variation in recombination rates within and between species (Brand et al., 2018; Kong et al., 2008; Murdoch et al., 2010; Yang et al., 2015).
Variation in the number and distribution of COs and NCOs, and their respective associated effects on linkage, have important implications for molecular evolution. Recombination has long been appreciated to play a role in the distribution of various genomic features including nucleotide diversity. Nucleotide diversity has a positive correlation with recombination rate in a number of species, interpreted to result from selective sweeps and background selection removing genetic variation in regions of low recombination (Begun & Aquadro, 1992; Charlesworth et al., 1993; Smith & Haigh, 1974). Similarly, recombination breaking up genetic associations is particularly notable in the context of interspecific hybridization. In first-generation (F1) hybrids, each set of homologous chromosomes is composed of sister chromatids carrying genetic information from one of the parents. Since no recombination has physically separated any alleles on the chromosome at the hybrid F1 stage, each parent’s genetic contribution is perfectly linked. If the hybrids then back-cross to one of the parental populations, recombination will produce genomes that are a mosaic of genetic information from the two populations (introgression) (Aguillon et al., 2022). Recombination drives the pattern of fragmentation in the introgressed regions over time, and therefore plays an important role in the distribution of introgressed DNA in a population (Barton & Bengtsson, 1986; Butlin, 2005; Moran et al., 2021; Nachman Michael W. & Payseur Bret A., 2012; Schumer et al., 2018; Veller et al., 2023). When each population has evolved alleles that are deleterious when present in the background of the other population (the Dobzhansky-Muller hybrid incompatibility model) we expect introgressed regions with low rates of recombination to be quickly purged from the population, as the accumulation of incompatible alleles incurs a steep fitness cost. In contrast, when introgressed regions have high recombination rates, the break up of genetic associations will reduce selective interference between the incompatible alleles and their surrounding haplotypes, allowing for neutral and beneficial alleles brought in with the introgression to escape the fate of neighboring incompatibilities (Moran et al., 2021; Schumer et al., 2018). This theory is supported empirically through enrichment of introgressed segments in regions of higher recombination in a number of organisms including Mimulus, maize, butterflies, swordtail fish, stickleback, and humans (Brandvain et al., 2014; Calfee et al., 2021; Edelman et al., 2019; Martin et al., 2019; Ravinet et al., 2018; Schumer et al., 2018).
This positive correlation between introgressed ancestry and recombination is emerging as a nearly ubiquitous pattern (though see (Dagilis & Matute, 2023; Duranton & Pool, 2022; Pool, 2015)), however, it is unclear how these observations relate to the known effect of sequence divergence on DSB resolution. Introgression, particularly between highly diverged species, can have low sequence homology with the genomic region it is replacing. A DSB in a region of low homology will recruit mismatch repair proteins, which ensure COs are occurring between homologous chromosomes and at equivalent positions to prevent ectopic recombination (Harfe & Jinks-Robertson, 2000; Hunter et al., 1996). Mismatch repair proteins reduce the frequency of CO events as sequence divergence increases (Chen & Jinks-Robertson, 1999; Cooper et al., 2021; L. Li et al., 2006; Welz-Voegele & Jinks-Robertson, 2008). Given that heterozygous introgression will have divergent sequences, we expect a decrease in COs, and possibly an increase in NCOs as DSBs fail to be resolved as COs in introgressed regions.
To help us understand this interaction of introgression and recombination, and identify patterns in CO and NCO in closely related populations, we utilized the budding yeast Saccharomyces. Yeasts provide an excellent opportunity to study DSB resolution, as we can readily isolate and collect all four meiotic products of a given meiosis and detect both CO and more elusive NCO events (Figure 1A) (Brion et al., 2017; Gerton et al., 2000; Liu et al., 2018, 2019; Mancera et al., 2008). Recombination rates vary between strains of S. cerevisiae (Cubillos et al., 2011; Raffoux et al., 2018) and between S. cerevisiae and its sister species S. paradoxus (Liu et al., 2019; Tsai et al., 2010). Strains of different Saccharomyces species have often hybridized with other species and carry introgressed DNA from these events (Albertin et al., 2018; Almeida et al., 2014; Bendixsen et al., 2022; D’Angiolo et al., 2020; Langdon et al., 2019; Stelkens & Bendixsen, 2022; Tellini et al., 2023).
In this study, we look at patterns of recombination and introgression at the population level by crossing two pairs of Holarctic Saccharomyces uvarum strains. One pair of strains was isolated from natural environments in North America and the other pair was isolated from European fermentation environments (Almeida et al., 2014). The S. uvarum strains isolated from European fermentation environments each carry introgression from their sister species, Saccharomyces eubayanus, which is approximately 6% divergent from S. uvarum (Almeida et al., 2014; Langdon et al., 2020; Nespolo et al., 2020). The diploid F1 genome of these strains is heterozygous for nine different introgressions which make up approximately 10% of the genome (Figure 1B). The strains from the North American cross do not carry S. eubayanus introgression, thus allowing us to assess the impact of introgression on the recombination landscape. We obtained whole genome sequencing data from individual meiotic events from the first offspring generation of each cross and used this data to detect CO and NCO events along the genome (Figure 1A). From these maps, we aim to understand (i) how patterns of CO and NCO differ between closely related strains, (ii) how regions of introgression differ in their CO and NCO patterns, and (iii) how these different patterns affect shuffling of alleles locally and at the chromosome level. Understanding these objectives will provide us novel insights into how introgression impacts the recombination landscape.
Methods
Strain and library construction
S. uvarum strains (UCD61-137, yHCT78, GM14, and DBVPG7787) were obtained from the Portuguese Yeast Culture Collection and from Chris Hittinger (Table S1) (Almeida et al., 2014). All four S. uvarum strains had their HO locus replaced with a kanMX marker using a modified version of the high-efficiency yeast transformation using the LiAc/SS carrier DNA/PEG method. Briefly, the kanMX marker was amplified from plasmid pCSH2 with homology to genomic DNA flanking the HO ORF with primers CSH239 (GGTGGAAAACCACGAAAAGTTAGAACTACGTTCAGGCAAAgacatggaggcccagaatac) and CSH241 (GTGACCGTATTGGTACTTTTTTTGTTACCTGTTTTAGTAGcagtatagcgaccagcattc). For each strain, overnight cultures were inoculated in 25 mL of YPD at an OD of ∼ 0.0005 and incubated at room temperature on a shaker for ∼24 hours until the cultures reached an OD between 0.6 and 0.9. Subsequently, 1 ug of the template DNA was transformed with a heat shock temperature of 37°C for 45 minutes. The transformed cells were allowed to recover in liquid YPD for 4 hours before being plated onto G418 selective plates and incubated at room temperature for 2 days.
Single colonies were selected from the transformation plates, restreaked onto G418 plates and allowed to grow at room temperature for 2 days. Single colonies from those plates were then inoculated into 2 mL of YPD + G418 and incubated in a roller drum at room temperature overnight. From those cultures, 250 uL was used to inoculate 2 mL of sporulation media (1% potassium acetate, 0.1 % yeast extract, 0.05% dextrose) and incubated at room temperature for 3 to 5 days. Strains were confirmed to have the ho::KanMX via tetrad dissection on a Singer SporPlay+ microscope (Singer Instruments). Plates with tetrads were incubated at room temperature for 2 days and then replica plated to test for proper segregation of the kanMX marker and mating type within individual tetrads.
Crosses between strains UCD61-137 and yHCT78 (natural cross), and between strains GM14 and DBVPG7787 (fermentation cross) were set up by micromanipulation of single MATa and MATx cells using a Singer SporPlay+. The plates were incubated at room temperature for 2 days and then replica plated to mating type tester strains to test for potential diploids. Identified diploids were then sporulated by growing a culture of the cross in 2 mL YPD + G418 at room temperature overnight. From those cultures, 250 uL were used to inoculate 2 mL of sporulation media and incubated at room temperature for 3 to 5 days. Sporulated cultures were dissected on 3 YPD plates (24 tetrads per plate) using a Singer SporPlay+. Fifty of the fully viable tetrads were selected and had all their spores inoculated into YPD (200 spores total) and incubated at room temperature. The DNA was extracted from these cultures using a modified version of the Hoffman-Winston DNA Prep (Hoffman & Winston, 1987). The DNA concentration was then measured using SYBR green, and 150 ng of each sample’s DNA was used to prepare a sequencing library using an Illumina DNA Prep Kit, modified to use half the normal amounts of reagents. Libraries were pooled and run on an Illumina NovaSeq 500 with 150bp paired end reads.
Calling SNPs
We scored SNPs from parents and offspring using the S. uvarum reference genome (Scannell et al., 2011) and custom scripts that invoked bwa (v0.7.17), samtools (v1.12), bcftools (v1.13), picardtools (v2.25.6), and gatk (v4.2.0.0) (Danecek et al., 2021; H. Li & Durbin, 2009; McKenna et al., 2010). The custom scripts are available in the github repository: ejschwarzkopf/CO-NCO. We joint genotyped parents and offspring with default filters for gatk with the exception of the QUAL filter, which was set as < 100 for parents and < 30 for offspring. We further filtered variants by requiring they be fixed differences between the two parental strains. We kept a total of 24,574 markers for the natural cross and 74,619 markers for the fermentation cross. We utilized LUMPY to identify structural variants in the parent strains that were greater than 5000 bp and verified calls using the Integrative Genomics Viewer (Layer et al., 2014; Robinson et al., 2011). We identified three amplifications in strain GM14 (one of the fermentation cross parents) that were absent in other strains (Table S2).
Generating CO/NCO maps
We generated “seg” files by coding tetrad variants by their parental origin. These seg files were the input for CrossOver (v6.3) from the ReCombine suite of programs, which we used to detect COs and NCOs (Anderson et al., 2011). We then filtered to remove non-crossovers with fewer than three associated markers and split the genome into 20kb windows. In each window we counted crossovers, non-crossovers, and markers. We established regions of introgressions through visual inspection of marker density in the fermentation cross (introgressions showed more divergence between fermentation strains) and confirmed them using the findings of Almeida et al. (2014). We found nine heterozygous introgressions on chromosomes 4, 6, 7, 9, 10, 10, 13, 14, and 15 respectively that we included in further analyses (Table S3). We excluded two additional introgressions due to poor mapping (chromosome 13:0-17,000; chromosome 16: 642,000-648,000). To account for the difference in number of markers in introgressed vs non-introgressed windows and their effect on NCO detection, we applied a previously published simulation-based method (Liu et al., 2019; Wijnker et al., 2013). Choosing an average NCO tract length of 2kb, we simulated 1000 NCO events per window to establish our expected probability of detecting an NCO event in that window. We then divided our observed NCO count by our probability of detecting an NCO event. COs that occurred in large regions devoid of markers would be called in the middle of the empty windows. We decided to deal with this by splitting CO counts in regions with multiple consecutive windows without markers evenly between the empty windows. With these corrected maps, we calculated spearman correlations between crosses using R (v4.1.0, R Core Team 2021). Additionally, we modeled NCO and CO count as a function of introgression, introgression by cross, and GC content using a gaussian generalized linear model in R (v4.1.0, R Core Team 2021).
Homology
We calculated homology between the two fermentation cross strains in 51bp windows with 25bp overlaps. At each nucleotide position in the window, we counted fixed differences as zero homology, invariant sites between strains as full homology (1), and polymorphic sites in either or both strains as half homology (0.5). We then averaged these homology values across the window. This measure represents the probability that both strains will have the same nucleotide base at a given position. We used this measure of fine-scale homology to determine how homology related to NCO counts in introgressed regions. For this, we used Loess regressions and Spearman’s correlations on each of the introgressed regions comparing homology to NCO count, both implemented in R (v4.1.0, R Core Team 2021). We then focused on each recombination event (CO or NCO) and compared the homology 100bp up and downstream of CO breakpoints and 100bp up and downstream of NCO tracts. We then used Welch’s two sample t-tests to compare CO and NCO homology in each introgression.
We use , a measure genetic shuffling defined in Veller et al. (2019) to measure how much shuffling occurs in each chromosome for each cross. Our data provides parental origin for each fixed difference between parental strains. We assume that all loci between pairs of markers that come from the same parent are also from that parent. We also assume that when a pair of successive markers come from different parents, the location of the change from one parental origin to the other happens at the midpoint between our markers. With this in mind, we counted the number of bases that come from one parent and divided by the chromosome size to obtain the proportion of the chromosome that was inherited from said parent (p) and used the formula from Veller et al. (2019): = 2p(1 − p). We calculated for each full chromosome and each introgressed regions in every gamete from both crosses. We then averaged across gametes to obtain average values. We then compared average between crosses in each chromosome or introgressed region using Welch two sample t-tests and correcting for multiple tests using a Bonferroni correction in R (v4.1.0, R Core Team 2021).
Data availability
Sequences for the parental strains can be found on NCBI SRA (SRR1119189, SRR1119180 SRR1119199, SRR1119200) (Almeida et al., 2014). Sequencing of the tetrads is deposited at NCBI SRA under Project PRJNA1061120.
Results
The recombination landscape differs dramatically between closely related crosses
We isolated and sequenced products of 48 meioses (192 haploid spores) for two crosses of S. uvarum, a cross between strains isolated from North America (natural cross) and a cross between strains isolated from Europe (fermentation cross). We detected COs and NCOs across the 16 nuclear chromosomes of S. uvarum. Genomewide, we found significantly more COs on average in the natural cross (82.54 COs/meiosis, SE 1.5; 0.72 cM/kb) than in the fermentation cross (63.66 COs/meiosis, SE 1.9; 0.55 cM/kb) but no significant difference when comparing the average number of NCOs between the natural cross (60.44 NCOs/meiosis, SE 2.7) and the fermentation cross (68.1 NCOs/meiosis, SE 12.47). The number of COs per meiosis in the natural and fermentation crosses are slightly higher than those of S. paradoxus (54.8) and S. cerevisiae (76.5) respectively, but both crosses have NCO averages higher than those of S. cerevisiae (46.4) and S. paradoxus (26.9) (Figure 2A; Tables S4 & S5; Liu et al. 2019). The significant difference in COs per meiosis between our fermentation and natural crosses was unexpected, as we predicted that the recombination landscape would be more similar within a species (S. uvaum) than between species S. cerevisiae and S. paradoxus.
To further explore the differences in recombination landscapes between our crosses, we split the genome into 20kb, non-overlapping windows, and obtained CO, NCO, and marker counts for each region (Figure 2D). We corrected for differences in marker resolution between crosses (see Methods). We found modest, but significant genomewide correlation between our crosses for both COs (Spearman’s correlation: 0.273; p<0.0001) and NCOs (Spearman’s correlation: 0.1456; p<0.001). Again, these correlations were unexpectedly lower than those reported by Liu et al. (2019) between S. cerevisiae and S. paradoxus (0.48 for COs and 0.17 for NCOs).
We hypothesized that the low correlations between crosses might be impacted by the presence of heterozygous introgression from S. eubayanus in the fermentation cross. To explore this possibility, we separated the 20kb windows into introgressed and non-introgressed windows (based on whether they overlapped with an introgressed region). We will refer to introgression in the fermentation cross as “introgression” and use the term “introgressed region” to refer generally to the syntenic region, regardless of which cross we are focusing on. We calculated Spearman’s correlations of COs, NCOs, and marker counts between the two crosses for each chromosome. We find positive correlations between CO counts for all chromosomes when looking at non-introgressed regions (though some were not significant; Table S6). For introgressed regions, we found no significant CO correlations between crosses, which is consistent with the hypothesis that CO landscapes are changed in introgression (Table S7). There was one significant, highly positive correlation among NCOs (chromosome 1), but all other correlations were not significant. This was likely affected by the fact that markers used to detect NCOs are very differently distributed between the crosses, and the small size of NCO tracts means regions with more markers tend to reveal more NCOs.
We find that introgressions tend to have lower CO counts and higher NCO counts in the fermentation cross when compared to syntenic regions in the natural cross. The fermentation cross has fewer COs than the natural cross overall, but the difference is greater in the introgressed regions (Figure 2B-C). To further explore and test possible explanations for the patterns of COs and NCOs in introgressed regions, we constructed linear models for CO and NCO counts. Our model of CO counts showed a significant positive effect of the interactions between natural cross and introgression (whether a genomic window is from the natural cross and whether that window is in an introgressed region), as well as a significant negative effect of introgression on the number of COs, and a significant positive effect of GC content on CO count (Table 3). These results are consistent with increased COs in GC-rich regions and reduced COs in introgressions. Our linear model of NCO counts showed a similar positive effect of GC content on NCO counts, but showed opposite significant coefficients for the other two explanatory variables (Table 4). This indicates that GC content still plays an important role in localizing NCOs, and supports our findings that patterns of NCOs in introgressed regions are opposite to those of COs.
Reduced sequence homology helps explain non-crossover resolution of DSBs in introgressions
One possible explanation for an increase of NCOs in introgressions is that the reduced homology is biasing DSBs in the region to be resolved as NCOs rather than COs. We would therefore expect to see NCOs to be negatively correlated with homology. To evaluate the relationship of homology to the CO and NCO landscapes in introgressions, we measured homology, NCO depth, and CO count in 101 bp sliding windows along each of the introgressions. Mismatch repair proteins in Saccharomyces seem to suppress COs with very little mismatch in small regions (∼350 bp), which informed our window size (Chen & Jinks-Robertson, 1999; Datta et al., 1997). We counted the number of NCO tracts that intersect with each window as a measurement of NCO depth, and simply counted the CO events in a given window. We measured homology in the sliding windows by measuring the proportion of bases that are expected to match between the two parent strains (Figure 3). We then ran Spearman’s correlations and a loess regression along each introgression and found a weak, but often significant (p<<0.001) correlation between NCOs and homology in the introgression (Table 1), suggesting that resolution of double strand breaks is biased towards NCOs when homology is low. From the loess regression, we can observe an increase in NCO as homology reduces until about 0.9-0.8 homology, at which point NCOs level out or reduce. However, this effect is very weak with respect to the NCO counts, and at low levels of homology the uncertainty of the regression line is very large. This is primarily driven by the number of windows with no NCOs (Figure 4).
The low CO count in introgressed regions leaves us unable to investigate effects of homology on CO counts except on chromosomes 7 and 14, where we found significantly higher homology around COs than around NCOs (Table 2). The introgressions on these two chromosomes are unique in that they contain a highly homologous portion of sequence and therefore contain enough COs for us to have power to detect differences between CO and NCO neighborhoods (Figure 3).
Introgression decreases allele shuffling locally and at the chromosome level
Because NCOs still play a small role in shuffling alleles along the chromosome, we were interested in whether the increase in NCOs of the fermentation cross would supplement the lost shuffling from the suppression of COs in the introgressions. To test this hypothesis, we used the measure , which accounts for the number and positioning of recombination events to estimate the probability that a randomly chosen pair of loci shuffles their alleles in a gamete (Veller et al., 2019). We calculated the average per chromosome and for each introgressed region for each of the two crosses. We observed high levels of shuffling at the chromosome level when compared to humans. The intra-chromosomal component of in humans is 0.0135 in females and 0.0177 in males (Veller et al., 2019), while our measurements for chromosomes varied between 0.216 and 0.426. We find that most chromosomes do not have a significantly different amount of allele shuffling between the two crosses, even though the natural cross generally has more COs (Table S8; Bonferroni-adjusted α = 0.00313). However, of the six chromosomes with significantly different values, all of them showed more shuffling in the natural cross, and five of the six (chromosomes 4, 9, 10, 14, and 15) contained introgressed regions (Figure 5).
Chromosome 12 was the only chromosome without an introgressed region to have significantly different shuffling between crosses, and it also showed more shuffling in the natural cross. All of the introgressed regions showed significantly more shuffling in the natural cross, indicating that the large increase of NCOs in the introgressions does not make up for the loss of shuffling from the depletion of COs (Table S9; Bonferroni-adjusted α = 0.00556). This finding indicates that an introgression that is segregating in a population will incur a shuffling cost in heterozygous individuals on top of any other evolutionary effects the introgression may have.
Discussion
Our study is motivated by understanding recombination rate variation within a species and uncovering potential genetic factors underlying this variation. To investigate this question, we crossed two pairs of S. uvarum strains, one pair isolated from natural environments and one pair from fermentation environments, and explored the distribution of CO and NCO events from both crosses. We found a similar range of COs per meiosis as previous studies in S. cerevisiae and S. paradoxus (Liu et al., 2019; Mancera et al., 2008). The number of NCOs detected, however, was considerably higher, despite having less sequence differences between strains in each of our crosses and therefore lower genomic resolution for detection. We also detected more COs and fewer NCOs in our natural cross when compared to our fermentation cross.
We hypothesized that these differences in the recombination landscape were influenced by introgression, given that heterozygous introgression creates sequence divergence, and that COs in regions of lower homology are known to be curtailed (Chen & Jinks-Robertson 1999, Li et al. 2006, Weltz-Voegele & Jinks-Robertson 2008, Cooper et al. 2021). We therefore explored the relationship between introgressions and the differences in CO and NCO counts between crosses. Our correlations between crosses, though mostly not significant, pointed to more CO distribution differences in introgressed regions than in the rest of the genome. When we then modeled CO and NCO locations, correcting for GC content (a well characterized driver of recombination events (Kiktev et al., 2018; Marsolier-Kergoat & Yeramian, 2009)), we found that the distribution of COs and NCOs we observed was well explained by introgressions. While we are limited in our interpretations by only comparing two crosses (one cross with heterozygous introgression and one without introgression), these results are in line with findings in inversions, where heterozygotes show sharp decreases in COs and an increase in NCOs in the inverted region (Crown et al., 2018; Korunes & Noor, 2019). However, unlike heterozygous inversions where an increase in COs is observed on freely recombining chromosomes (the inter-chromosomal effect), we do not see an increase in COs outside of introgression or on chromosomes without introgression.
One likely effect of this CO reduction is a reduction in shuffling at the regional and chromosomal level. While NCOs can increase local shuffling, they likely have a much weaker effect on the likelihood of two random alleles being shuffled than COs do. We find this is the case for our two crosses, where despite a large number of NCOs in introgressions, the amount of shuffling (as measured by ) is significantly lower in the fermentation cross. This loss of shuffling translates to frequently lower in the fermentation cross at the chromosome level for chromosomes containing introgressions. Lower is not observed when introgressions are small and near telomeres, while even a small introgression near the center of the chromosome can lead to a large reduction in (as is the case for chromosome 15). This is consistent with the expectation that COs near the center of chromosomes generate much more shuffling of alleles than terminal COs (Veller et al., 2019). Our findings indicate that reducing COs, especially near the center of chromosomes, has a cost to shuffling that is not compensated by the increase of NCOs that we observe. If the benefit of recombination is its ability to generate new combinations of alleles, then the loss of shuffling resulting from being heterozygous for divergent DNA sequences may come at an additional cost beyond the possibilities of genetic incompatibilities between hybridizing species. This cost is likely higher as divergence increases and as the length of divergent sequences is greater, as is the case with early generation hybrids (Dagilis & Matute, 2023). Ultimately, if sequence divergence is too high, the resultant failure to recombine can become a postzygotic reproductive barrier (Bozdag et al., 2021; Hunter et al., 1996; Rogers et al., 2018).
The shuffling cost to introgression that we identify in our crosses may play an important role in the fate of introgression in the generations following hybridization. When heterozygotes for an introgression are formed, the reduction in shuffling inside the introgression will increase the likelihood that the introgression is purged from the population. This is because it will likely be inherited in its entirety and will carry the fitness cost of incompatibilities combined with a cost of shuffling. This cost is incurred because the reduction of COs in the introgression will reduce shuffling of alleles on either side of it and will vary in its intensity depending on the location and size of the introgression. In generations immediately following hybridization, introgressions will be much larger and are therefore expected to be more costly (although this likely depends on a number of factors including time since divergence). These predictions are consistent with modeling and empirical data on the purging of introgression in Drosophila and humans in the first generations following hybridization (Veller et al., 2023).
As to longer term dynamics of recombination and selection, we predict that the excess NCOs detected in heterozygous introgressions should begin to erode the divergence between the sequences, increasing homology and slowly reducing the cost of the introgression. This hypothesis posits that recombination can act to remove the larger, more deleterious regions of an introgression quickly while whittling away slightly deleterious alleles that may be linked to any beneficial regions of an introgression. However, in our current study, we are limited by only observing one generation of sexual reproduction between two pairs of strains. This means that we don’t capture longer term patterns of recombination or the landscape of recombination in introgressions that are segregating in a population. Consequently, our findings do not reflect the effects of long-term selection (and hypothesized degradation of sequence divergence through NCO) that would lead to introgressions preferentially remaining in high-CO regions, as has been observed in other organisms. Furthermore, Saccharomyces typically reproduce asexually, with only infrequent sexual cycles (Magwene et al., 2011; Ruderfer et al., 2006; Zeyl & Otto, 2007). When they do mate, they often mate within a tetrad resulting in increased homozygosity. For example, each diploid progenitor of the parents of our fermentation cross was homozygous for introgression across the genome, meaning that recombination would neither break up nor aid in purging the introgression in isolated populations of each parent. This suggests that the fate of introgressions in this species is perhaps more loosely tied to recombination patterns than it would be in an obligately sexually reproducing species.
Despite some limitations to interpretation, this study provides a unique view of the early dynamics of hybridization and the role of recombination in the presence of introgression. By focusing not only on the distribution of recombination events but on their specific role in shuffling alleles, we can more closely connect the physical process of recombination to its role among other evolutionary forces.
Acknowledgements
We are grateful to members of the Heil lab, Mohamed Noor, Nathan Layman, and Mark Smithson for comments on this manuscript. We thank Chris Hittinger and the Portuguese Yeast Culture Collection for S. uvarum strains. This work was supported by NIH R35GM142849 to CSH.
References
- IntrogressionCurrent Biology 32:R865–R868https://doi.org/10.1016/j.cub.2022.07.004
- Many interspecific chromosomal introgressions are highly prevalent in Holarctic Saccharomyces uvarum strains found in human-related fermentationsYeast 35:141–156https://doi.org/10.1002/yea.3248
- A Gondwanan imprint on global diversity and domestication of wine and cider yeast Saccharomyces uvarumNature Communications 5https://doi.org/10.1038/ncomms5044
- ReCombine: A Suite of Programs for Detection and Analysis of Meiotic Recombination in Whole-Genome DatasetsPLoS ONE 6https://doi.org/10.1371/journal.pone.0025509
- Divergence and conservation of the meiotic recombination machineryNature Reviews Genetics :1–17https://doi.org/10.1038/s41576-023-00669-8
- Genetic Recombination Is Targeted towards Gene Promoter Regions in DogsPLOS Genetics 9https://doi.org/10.1371/journal.pgen.1003984
- The barrier to genetic exchange between hybridising populationsHeredity 57https://doi.org/10.1038/hdy.1986.135
- Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogasterNature 356:519–520https://doi.org/10.1038/356519a0
- Saccharomyces yeast hybrids on the riseYeast 39:40–54https://doi.org/10.1002/yea.3684
- Breaking a species barrier by enabling hybrid recombinationCurrent Biology 31:R180–R181https://doi.org/10.1016/j.cub.2020.12.038
- Molecular Evolution at a Meiosis Gene Mediates Species Differences in the Rate and Patterning of RecombinationCurrent Biology: CB 28:1289–1295https://doi.org/10.1016/j.cub.2018.02.056
- Speciation and Introgression between Mimulus nasutus and Mimulus guttatusPLOS Genetics 10https://doi.org/10.1371/journal.pgen.1004410
- Variation of the meiotic recombination landscape and properties over a broad evolutionary distance in yeastsPLOS Genetics 13https://doi.org/10.1371/journal.pgen.1006917
- Recombination and speciationMolecular Ecology 14:2621–2635https://doi.org/10.1111/j.1365-294X.2005.02617.x
- Selective sorting of ancestral introgression in maize and teosinte along an elevational clinebioRxiv 2021:3–5https://doi.org/10.1101/2021.03.05.434040
- The effect of deleterious mutations on neutral molecular variationGenetics 134:1289–1303https://doi.org/10.1093/genetics/134.4.1289
- The role of the mismatch repair machinery in regulating mitotic and meiotic recombination between diverged sequences in yeastGenetics 151:1299–1313
- Nucleosomes and DNA methylation shape meiotic DSB frequency in Arabidopsis thaliana transposons and gene regulatory regionsGenome Research 28:532–546https://doi.org/10.1101/gr.225599.117
- Studies on gene conversion and its relationship to linked exchange in Drosophila melanogasterGenetics 69:179–209https://doi.org/10.1093/genetics/69.2.179
- Mismatch repair disturbs meiotic class I crossover controlbioRxiv https://doi.org/10.1101/480418
- Local Inversion Heterozygosity Alters Recombination throughout the GenomeCurrent Biology: CB 28:2984–2990https://doi.org/10.1016/j.cub.2018.07.004
- Assessing the complex architecture of polygenic traits in diverged yeast populationsMolecular Ecology 20:1401–1413https://doi.org/10.1111/j.1365-294X.2011.05005.x
- The fitness of an introgressing haplotype changes over the course of divergence and depends on its size and genomic locationPLOS Biology 21https://doi.org/10.1371/journal.pbio.3002185
- Twelve years of SAMtools and BCFtoolsGigaScience 10https://doi.org/10.1093/gigascience/giab008
- A yeast living ancestor reveals the origin of genomic introgressionsNature 587https://doi.org/10.1038/s41586-020-2889-1
- Molecular evolution of the meiotic recombination pathway in mammalsEvolution 73https://doi.org/10.1111/evo.13850
- Dual roles for DNA sequence identity and the mismatch repair system in the regulation of mitotic crossing-over in yeastProceedings of the National Academy of Sciences of the United States of America 94:9757–9762
- Initiation of Meiotic Recombination: How and Where? Conservation and Specificities Among EukaryotesAnnual Review of Genetics 47:563–599https://doi.org/10.1146/annurev-genet-110711-155423
- Interactions Between Natural Selection and Recombination Shape the Genomic Landscape of IntrogressionMolecular Biology and Evolution 39https://doi.org/10.1093/molbev/msac122
- Genomic architecture and introgression shape a butterfly radiationScience 366:594–599https://doi.org/10.1126/science.aaw2090
- The evolutionary advantage of recombinationGenetics 78:737–756
- Global mapping of meiotic recombination hotspots and coldspots in the yeast Saccharomyces cerevisiaeProceedings of the National Academy of Sciences 97:11383–11390https://doi.org/10.1073/pnas.97.21.11383
- Dna Mismatch Repair and Genetic InstabilityAnnual Review of Genetics 34:359–399https://doi.org/10.1146/annurev.genet.34.1.359
- Genomic features shaping the landscape of meiotic double-strand-break hotspots in maizeProceedings of the National Academy of Sciences 114:12231–12236https://doi.org/10.1073/pnas.1713225114
- The effect of linkage on limits to artificial selectionGenetical Research 8:269–294
- Meiotic gene conversion tract length distribution within the rosy locus of Drosophila melanogasterGenetics 137:1019–1026https://doi.org/10.1093/genetics/137.4.1019
- A ten-minute DNA preparation from yeast efficiently releases autonomous plasmids for transformation of Escherichia coliGene 57:267–272https://doi.org/10.1016/0378-1119(87)90131-4
- The mismatch repair system contributes to meiotic sterility in an interspecific yeast hybridThe EMBO Journal 15:1726–1733https://doi.org/10.1002/j.1460-2075.1996.tb00518.x
- Intense and highly localized gene conversion activity in human meiotic crossover hot spotsNature Genetics 36https://doi.org/10.1038/ng1287
- Physical Lengths of Meiotic and Mitotic Gene Conversion Tracts in Saccharomyces CerevisiaeGenetics 118https://doi.org/10.1093/genetics/118.3.401
- Mechanism and control of meiotic recombination initiationCurrent Topics in Developmental Biology 52:1–53https://doi.org/10.1016/s0070-2153(01)52008-6
- GC content elevates mutation and recombination rates in the yeast Saccharomyces cerevisiaeProceedings of the National Academy of Sciences 115:E7109–E7118https://doi.org/10.1073/pnas.1807334115
- Sequence Variants in the RNF212 Gene Associate with Genome-Wide Recombination RateScience 319:1398–1401https://doi.org/10.1126/science.1152422
- Gene conversion and linkage: Effects on genome evolution and speciationMolecular Ecology 26:351–364https://doi.org/10.1111/mec.13736
- Pervasive gene conversion in chromosomal inversion heterozygotesMolecular Ecology 28:1302–1315https://doi.org/10.1111/mec.14921
- Nonparadoxical evolutionary stability of the recombination initiation landscape in yeastScience 350:932–937https://doi.org/10.1126/science.aad0814
- Fermentation innovation through complex hybridization of wild and domesticated yeastsNature Ecology & Evolution :1–11https://doi.org/10.1038/s41559-019-0998-8
- Postglacial migration shaped the genomic diversity and global distribution of the wild ancestor of lager-brewing hybridsPLOS Genetics 16https://doi.org/10.1371/journal.pgen.1008680
- LUMPY: A probabilistic framework for structural variant discoveryGenome Biology 15https://doi.org/10.1186/gb-2014-15-6-r84
- Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics (OxfordEngland 25:1754–1760https://doi.org/10.1093/bioinformatics/btp324
- The impact of sequence divergence and DNA mismatch repair on homeologous recombination in ArabidopsisThe Plant Journal: For Cell and Molecular Biology 45:908–916https://doi.org/10.1111/j.1365-313X.2006.02657.x
- Tetrad analysis in plants and fungi finds large differences in gene conversion rates but no GC biasNature Ecology & Evolution 2https://doi.org/10.1038/s41559-017-0372-7
- Evolution of the Yeast Recombination LandscapeMolecular Biology and Evolution 36:412–422https://doi.org/10.1093/molbev/msy233
- Outcrossing, mitotic recombination, and life-history trade-offs shape genome evolution in Saccharomyces cerevisiaeProceedings of the National Academy of Sciences 108:1987–1992https://doi.org/10.1073/pnas.1012544108
- High-resolution mapping of meiotic crossovers and non-crossovers in yeastNature 454:479–485https://doi.org/10.1038/nature07135
- GC Content and Recombination: Reassessing the Causal Effects for the Saccharomyces cerevisiae GenomeGenetics 183:31–38https://doi.org/10.1534/genetics.109.105049
- Recombination rate variation shapes barriers to introgression across butterfly genomesPLOS Biology 17https://doi.org/10.1371/journal.pbio.2006288
- Sex speeds adaptation by altering the dynamics of molecular evolutionNature 531:233–236https://doi.org/10.1038/nature17143
- Recombination Modulates How Selection Affects Linked Sites in DrosophilaPLOS Biology 10https://doi.org/10.1371/journal.pbio.1001422
- The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing dataGenome Research 20:1297–1303https://doi.org/10.1101/gr.107524.110
- Whole-Genome Analysis of Individual Meiotic Events in Drosophila melanogaster Reveals That Noncrossover Gene Conversions Are Insensitive to Interference and the Centromere EffectGenetics 203:159–171https://doi.org/10.1534/genetics.115.186486
- The genomic consequences of hybridizationeLife 10https://doi.org/10.7554/eLife.69016
- Multiple loci contribute to genome-wide recombination levels in male miceMammalian Genome 21:550–555https://doi.org/10.1007/s00335-010-9303-5
- Recombination rate variation and speciation: Theoretical predictions and empirical results from rabbits and micePhilosophical Transactions of the Royal Society B: Biological Sciences 367:409–421https://doi.org/10.1098/rstb.2011.0249
- An Out-of-Patagonia migration explains the worldwide diversity and distribution of Saccharomyces eubayanus lineagesPLOS Genetics 16https://doi.org/10.1371/journal.pgen.1008777
- A Hierarchical Combination of Factors Shapes the Genome-wide Topography of Yeast Meiotic Recombination InitiationCell 144:719–731https://doi.org/10.1016/j.cell.2011.02.009
- The Mosaic Ancestry of the Drosophila Genetic Reference Panel and the D. melanogaster Reference Genome Reveals a Network of Epistatic Fitness InteractionsMolecular Biology and Evolution 32:3236–3251https://doi.org/10.1093/molbev/msv194
- Role of Cis, Trans, and Inbreeding Effects on Meiotic Recombination in Saccharomyces cerevisiaeGenetics 210:1213–1226https://doi.org/10.1534/genetics.118.301644
- The genomic landscape at a late stage of stickleback speciation: High genomic divergence interspersed by small localized regions of introgressionPLOS Genetics 14https://doi.org/10.1371/journal.pgen.1007358
- Integrative genomics viewerNature Biotechnology 29:24–26https://doi.org/10.1038/nbt.1754
- Recombinational Landscape and Population Genomics of Caenorhabditis elegansPLOS Genetics 5https://doi.org/10.1371/journal.pgen.1000419
- Spore-autonomous fluorescent protein expression identifies meiotic chromosome mis-segregation as the principal cause of hybrid sterility in yeastPLOS Biology 16https://doi.org/10.1371/journal.pbio.2005066
- Population genomic analysis of outcrossing and recombination in yeastNature Genetics 38https://doi.org/10.1038/ng1859
- The Awesome Power of Yeast Evolutionary Genetics: New Genome Sequences and Strain Resources for the Saccharomyces sensu stricto GenusG3: Genes, Genomes, Genetics 1:11–25https://doi.org/10.1534/g3.111.000273
- Mechanisms of Genetic Exchange Within the Chromosomal Inversions of Drosophila pseudoobscuraGenetics 171:1729–1739https://doi.org/10.1534/genetics.105.041947
- Natural selection interacts with recombination to shape the evolution of hybrid genomesScience 360:656–660https://doi.org/10.1126/science.aar3684
- Widespread Gene Conversion in Centromere CoresPLoS Biology 8https://doi.org/10.1371/journal.pbio.1000327
- Stable recombination hotspots in birds. Science (New YorkN.Y 350:928–932https://doi.org/10.1126/science.aad0843
- The hitch-hiking effect of a favourable geneGenetical Research 23:23–35
- Recombination rate variation in closely related speciesHeredity 107:496–508https://doi.org/10.1038/hdy.2011.44
- Recombining without Hotspots: A Comprehensive Evolutionary Portrait of Recombination in Two Closely Related Species of DrosophilaGenome Biology and Evolution 7:2829–2842https://doi.org/10.1093/gbe/evv182
- Variation in recombination frequency and distribution across eukaryotes: Patterns and processesPhilosophical Transactions of the Royal Society B: Biological Sciences 372https://doi.org/10.1098/rstb.2016.0455
- The evolutionary and ecological potential of yeast hybridsCurrent Opinion in Genetics & Development 76https://doi.org/10.1016/j.gde.2022.101958
- Centromeres Convert but Don’t CrossPLOS Biology 8https://doi.org/10.1371/journal.pbio.1000326
- Tellini, N., De Chiara, M., Mozzachiodi, S., Tattini, L., Vischioni, C., Naumova, E., Warringer, J., Bergström, A., & Liti, G. (2023). Ancient and recent origins of shared polymorphisms in yeas. 10.21203/rs.3.rs-2573222/v1Ancient and recent origins of shared polymorphisms in yeas https://doi.org/10.21203/rs.3.rs-2573222/v1
- Conservation of recombination hotspots in yeastProceedings of the National Academy of Sciences https://doi.org/10.1073/pnas.0908774107
- Recombination and selection against introgressed DNAEvolution 77:1131–1144https://doi.org/10.1093/evolut/qpad021
- A rigorous measure of genome-wide genetic shuffling that takes into account crossover positions and Mendel’s second lawProceedings of the National Academy of Sciences of the United States of America 116:1659–1668https://doi.org/10.1073/pnas.1817482116
- Sequence Divergence Impedes Crossover More Than Noncrossover Events During Mitotic Gap Repair in YeastGenetics 179:1251–1262https://doi.org/10.1534/genetics.108.090233
- The genomic landscape of meiotic crossovers and gene conversions in Arabidopsis thalianaeLife 2https://doi.org/10.7554/eLife.01426
- TEX11 is mutated in infertile men with azoospermia and regulates genome-wide recombination rates in mouseEMBO Molecular Medicine 7:1198–1210https://doi.org/10.15252/emmm.201404967
- Diversity and Determinants of Meiotic Recombination LandscapesTrends in Genetics 35:359–370https://doi.org/10.1016/j.tig.2019.02.002
- A short history of recombination in yeastTrends in Ecology & Evolution 22:223–225https://doi.org/10.1016/j.tree.2007.02.005
Article and author information
Author information
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
Copyright
© 2024, Schwarzkopf et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 294
- downloads
- 17
- citations
- 0
Views, downloads and citations are aggregated across all versions of this paper published by eLife.