The genetic architecture of the load linked to dominant and recessive self-incompatibility alleles in Arabidopsis halleri and A. lyrata

Audrey Le Veve; Mathieu Genete; Christelle Lepers-Blassiau; Chloé Ponitzki; Céline Poux; Xavier Vekemans; Eleonore Durand; Vincent Castric

doi:10.7554/eLife.94972.2

eLife assessment

This study presents valuable empirical work and simulations that are relevant for the evolution of genetic load linked to self-incompatibility alleles in two Arabidopsis species. The evidence supporting the findings is solid, although it remains to be seen how generalizable the conclusions are beyond the specific system investigated here, not least because the statistical significance varied between the two species. The work will be of relevance to geneticists interested in the evolution of allelic diversity in similar systems.

https://doi.org/10.7554/eLife.94972.2.sa2

Significance of findings

valuable: Findings that have theoretical or practical implications for a subfield

landmark
fundamental
important
valuable
useful

Strength of evidence

solid: Methods, data and analyses broadly support the claims with only minor weaknesses

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

The long-term balancing selection acting on mating-types or sex determining genes is expected to lead to the accumulation of deleterious mutations in the tightly linked chromosomal segments that are locally “sheltered” from purifying selection. However, the factors determining the extent of this accumulation are poorly understood. Here, we took advantage of variations in the intensity of balancing selection along a dominance hierarchy formed by alleles at the sporophytic self-incompatibility system of the Brassicaceae to compare the pace at which linked deleterious mutations accumulate among them. We first experimentally measured the phenotypic manifestation of the linked load at three different levels of the dominance hierarchy. We then sequenced and phased polymorphisms in the chromosomal regions linked to 126 distinct copies of S-alleles in two populations of Arabidopsis halleri and three populations of A. lyrata. We find that linkage to the S-locus locally distorts phylogenies over about 10-30kb along the chromosome. The more intense balancing selection on dominant S-alleles results in greater fixation of linked deleterious mutations, while recessive S-alleles accumulate more linked deleterious mutations that are segregating. Hence, the structure rather than the overall magnitude of the linked genetic load differs between dominant and recessive S-alleles. Our results have consequences for the long-term evolution of new S-alleles, the evolution of dominance modifiers between them, and raise the question of why the non-recombining regions of some sex and mating type chromosomes expand over evolutionary times while others, such as that the S-locus of the Brassicaceae, remain restricted to small chromosomal regions.

Introduction

The existence of sexes or mating-types leads to one the strongest forms of long-term balancing selection, and is often associated with clusters of polymorphisms around sex/mating-type controlling regions kept together by structural rearrangements. In some cases, such rearrangements can span almost entire chromosomes, e.g. sex chromosomes in mammals (Katsura et al., 2012) or mating-type chromosomes in ascomycete fungi (Hartmann et al., 2021), while in others they remain limited to relatively small genomic regions, e.g. chromosomal inversions controlling male reproductive morphs in the ruff (Lamichhaney et al., 2016), mating-type loci in some basidiomycete fungi, segregating indels controlling pin vs. thrum floral morphs in Primula (Cocker et al., 2018). The long-term balancing selection acting on these systems is expected to lead to the accumulation of deleterious mutations in the tightly linked chromosomal segments that are “sheltered” from purifying selection by the presence of the balanced polymorphism (Uyenoyama, 1997; Uyenoyama, 2005). These deleterious mutations can have drastic short- and long-term consequences for the evolution of the species, and determining the processes by which they accumulate is crucial to understand how the rearranged regions can either expand along the chromosomes or conversely remain restricted to limited genomic tracts (Jay et al,. 2021; Jay et al,. 2022).

Self-incompatibility (SI) is a genetic mechanism allowing recognition and rejection of self-pollen by hermaphrodite individuals, thereby preventing inbreeding and promoting outcrossing in hermaphroditic plant species (Nettancourt, 2001). In the Brassicaceae family, SI is controlled by a single non-recombining chromosomal region, the S-locus (Schopfer et al., 1999; Kusaba et al., 2001). SI is one of the most prominent examples of long-term balancing selection (Uyenoyama, 2003), and as such deleterious mutations are expected to accumulate in very close genetic linkage to the S-alleles because of the indirect effects of linked selection (Uyenoyama, 2005). Population genetics models predict that recessive deleterious variants should accumulate within specific S-allele lineages (Uyenoyama, 2003; Llaurens et al., 2009a), and should then be reshuffled among them by recombination. However, due to the technical difficulty of phasing polymorphisms, this process has rarely been characterised in detail (Castric and Vekemans, 2004).

A key feature of sporophytic SI systems, also shared by sex chromosomes, is the existence of dominance interactions between S-alleles. While most individuals are heterozygous at the S-locus and thus carry two different S-alleles, only one of them is generally expressed at the phenotypic level. This is especially true for the pollen specificity, where S-alleles follow a complex genetic dominance hierarchy (Llaurens et al., 2008; Durand et al., 2014). In Arabidopsis lyrata and A. halleri, the S-alleles are distributed into four classes of dominance, with increasing dominance from class I to class IV and linear dominance also observed inside some of the classes. Similar patterns of dominance between S-alleles occur in Brassica species, but with only two classes (Hatakeyama et al., 1998; Kakizaki et al., 2003; Yasuda et al., 2017). This system is genetically determined and controlled by small RNAs molecules produced by dominant S-alleles that are able to target and repress expression of the recessive S-alleles in pollen (Durand et al., 2014; Yasuda et al., 2017). The evolutionary properties of S-alleles are expected to vary in a predictable manner along the dominance hierarchy because balancing selection acts more strongly on dominant than on recessive S-alleles, as the latter are often masked at the phenotypic level (Billiard et al., 2007). As a result, the dynamics of accumulation of deleterious variation may differ in close linkage with dominant vs recessive S-alleles. Specifically, recessive S-alleles can form homozygous combinations in natural populations more often than dominant S-alleles (Billiard et al., 2007), such that recombination can occur occasionally between distinct gene copies of the same recessive S-allele, providing the opportunity for the linked recessive deleterious mutations to be purged from within the S-locus itself. In addition, because recessive S-alleles reach higher population frequencies (Billiard et al., 2006; Llaurens et al., 2008), purifying selection on linked deleterious variants is expected to have higher efficacy among gene copies of recessive than dominant S-alleles. This is expected to result in a higher fixation probability of deleterious variants linked to the class of dominant S-alleles than to the class of recessive S-alleles (Llaurens et al., 2009a). Thus, the level of dominance of S-alleles determines the intensity of purifying selection acting upon them. This situation closely resembles that of the differential evolution of sex chromosomes, where Y chromosomes (similar to dominant S-alleles) tend to accumulate more deleterious variation than X chromosomes (similar to recessive S-alleles, Llaurens et al. 2009a; Goubet et al. 2012). Empirical support for this simple prediction has been conflicting, though. Based on phenotypic measurements in A. halleri, Llaurens et al. (2009a) observed a decrease of fitness associated by enforced homozygosity for one of the most dominant S-alleles (Ah15) but not for the most recessive S-allele of the allelic series (Ah01). In contrast, Stift et al. (2013) observed no effect of dominance on the genetic load linked to three dominant vs. recessive S-alleles in a natural population of the closely related A. lyrata. Hence, the data available so far are inconclusive, but are restricted to very small numbers of S-alleles. They are also based on inherently limited phenotypic measurements, seriously limiting the power of the comparisons, and preventing proper generalisation of the effect of the intensity of balancing selection on the accumulation of linked deleterious variation.

In this study, we combined phenotypic, genomic and theoretical approaches to finely dissect the patterns of accumulation of deleterious variation linked to the S-locus supergene in A. halleri and A. lyrata, depending on dominance levels of S-allele. We first extended the phenotypic approach of Llaurens et al. (2009a) to a series of additional S-alleles from the same local A. halleri population to evaluate the effect of S-allele dominance on the sheltered load. We then used parent-offspring trios and targeted genome re-sequencing to directly quantify the accumulation of putative deleterious mutations linked to phased dominant vs. recessive S-alleles in two A. halleri and three A. lyrata natural populations. Finally, we used stochastic simulations to refine the theoretical predictions about the patterns of accumulation of recessive deleterious mutations linked to dominant vs. recessive S-alleles. Overall, our results provide a more nuanced view of the effect of the intensity of balancing selection on the sheltered load, in which the structure of the sheltered load rather than its magnitude differs among S-alleles from different dominance classes.

Results

The genetic load linked to the S-locus varies among S-alleles, but is not correlated with dominance

We first expanded the experimental approach of Llaurens et al. (2009a) to phenotypically evaluate the effect of S-allele dominance on the intensity of the sheltered load. The previous study focused on three S-alleles (Ah01, Ah02 and Ah15; Llaurens et al., 2009a). Here we included two S-alleles from the same local population (Nivelle, France): Ah03 and Ah04, and included Ah01 again for comparative purposes. In the Arabidopsis genus, S-alleles have been shown to form a complex dominance hierarchy (Llaurens et al., 2008; Durand et al., 2014). This hierarchy is largely associated with the phylogeny of S-alleles (Durand et al., 2014), and at least four phylogenetic classes (I, II, III and IV) have been described, from the most recessive (class I) to the most dominant of S-alleles (class IV). Dominance interactions also exist among S-alleles within classes, such that these five S-alleles form the following dominance hierarchy (Llaurens et al., 2008; Durand et al., 2014): Ah01<Ah03<Ah02<Ah04<Ah15, from the most recessive (Ah01) to the most dominant (Ah15). To reveal the linked load, we enforced homozygosity at the S-locus using controlled crosses between parental individuals sharing a given S-allele that was masked by different dominant S-alleles (e.g., to obtain Ah_xAh_x homozygotes we deposited pollen from a Ah_xAh_y plant, where Ah_y>Ah_x, on pistils of a Ah_xAh_z plant where z≠y, or on a Ah_xAh_x pistil when available; table S1). We obtained 399 offspring from a total of six such crosses. Note that our experimental procedure differs slightly from that of Llaurens et al. (2009a) in that their procedure required a CO₂ treatment to bypass the SI system and obtain selfed offspring, while here we took advantage of the dominance interactions to obtain outcrossed S-locus homozygous individuals that we phenotypically compared to their full-sibs with S-locus heterozygous genotypes. Note also that the S-locus homozygous offspring we obtained contain distinct gene copies of a given S-allele lineage. Hence, they could in principle carry distinct suites of linked deleterious mutations in case these mutations segregate within S-allele lineages.

We first tested whether homozygosity at the S-locus affected survival by measuring for each cross the proportion of homozygotes at the S-locus reaching the reproductive stage for three S-alleles (in two replicate families per S-allele, table S1). The proportion of Ah01/Ah01 and Ah04/Ah04 homozygotes surviving to the reproductive stage was consistent with mendelian expectations in their respective families. However, we observed a significant decrease of Ah03/Ah03 homozygotes at the reproductive stage compared with Mendelian expectations (Table 1), whereas the observed proportion of the Ah03 S-allele among heterozygous individuals did not depart from expectations (2/3=0.67; Table 1). Thus, the increased mortality is associated with Ah03 homozygosity, rather than with a lower performance of individuals carrying the Ah03 S-allele itself. Overall, a genetic load was thus observed linked to the Ah03 S-alleles, which is at an intermediate level of dominance, but neither to the most dominant (Ah04) nor to the most recessive (Ah01) S-allele. Hence, these observations do not support a positive correlation between S-allele dominance and the magnitude of the sheltered load.

Proportion of S-locus homozygous offspring having reached the reproductive stage for three different S-alleles.
The test is performed relative to the expected proportion of homozygous genotypes in the offspring (25% when both parents are heterozygous; 50% when one of the parents is homozygous and the other heterozygous).

Next we measured thirteen vegetative and reproductive traits in the resulting families and compared offspring that were homozygous for their S-alleles with their full sibs that were heterozygous (Fig. 1). We first used permutations to test whether the mean trait value of homozygotes differed from that in heterozygotes. Overall, with a single exception, we found no effect of homozygosity at the S-locus on variation of the traits measured (Fig. 1; table S2). The maximum length of flowering stems was the exception to this general pattern, with longer reproductive stems for S-locus homozygous than heterozygous genotypes, hence in the opposite direction from our expectation of lower fitness in homozygotes. For this trait, there was significant variation among replicate families for homozygotes of the recessive allele Ah01 but not of the dominant allele Ah04 (table S3). We then used generalised linear models (GLM) to evaluate the effect of dominance (considered as a continuous variable with fixed effect) on the mean phenotypic value of homozygotes compared to heterozygotes for each trait (table S4; treating family of origin, attacks by phytopathogens, phytophagous and oxidative stress as random effects whenever necessary). We also observed no effect of S-allele dominance on the contrast between S-locus homozygotes and heterozygotes for any of these traits. A single of the thirteen traits was an exception to this general pattern, but again the effect was in the opposite direction from our expectation, with an earlier rather than delayed appearance of the first leaf for homozygotes of more dominant S-alleles; table S4). Overall, our phenotypic results confirmed the presence of a detectable linked load on some phenotypic traits (survival; time to produce the first leaf), but we could not replicate the observation of Llaurens et al. (2009a) that dominant S-alleles carry a more severe deleterious load than recessive S-alleles, even though our samples were obtained from the same local population.

Effect of homozygosity at the S-locus on 13 phenotypic traits compared to heterozygotes.
For each trait, the phenotypic values in homozygotes (in grey) were normalised relative to the mean phenotypic values in heterozygotes (in black). The distributions were obtained by 10,000 random permutations. SD: Standard Deviation.

S-alleles are associated with specific sets of tightly linked mutations

The model of the sheltered load assumes that distinct S-allele lineages carry specific sets of linked deleterious mutations, but to our knowledge this prediction was never tested directly. We combined a parent-offspring trio approach with sequencing of the S-locus flanking regions to phase the mutations segregating in the S-locus flanking regions with their respective S-alleles. Briefly, we used a previously developed sequence capture protocol specifically targeting the nucleotide sequences over 75 kb on each side of the S-locus along with a series of 100 control regions from throughout the genome (Le Veve et al., 2023), and we analysed nucleotide sequence polymorphism (including only invariant and biallelic SNPs), based on the A. lyrata reference genome (Hu et al., 2011). We define a haplotype as a unique combination of mutations along the phased chromosome, and a S-allele lineage as the collection of gene copies of a given functional S-allele (different functional S-alleles are distinguished based on their strong sequence divergence at the S-locus pollen and pistil genes). Different gene copies within an S-allele lineage can thus be associated with distinct linked haplotypes in the flanking regions. The S-alleles were identified based on short reads sequences according to a previously published method (Genete et al., 2020). We analysed two closely related A. halleri populations from Europe (Nivelle and Mortagne) and three allogamous A. lyrata populations from North America (IND, PIN and TSS; Foxe et al., 2010). Overall, we were able to reconstruct 34 haplotypes linked to a total of 12 distinct S-allele lineages in Nivelle, 38 haplotypes linked to 11 distinct S-allele lineages in Mortagne and 16, 22 and 16 haplotypes associated with 6, 7 and 5 distinct S-allele lineages in populations IND, PIN and TSS, respectively (table S5). Nine of the S-alleles were shared between the two A. halleri populations (Ah01, Ah03, Ah04, Ah05, Ah12, Ah20, Ah24, Ah25 and Ah59). In the populations of A. lyrata, four S-alleles were shared between PIN and TSS (Ah01*, Ah03*, Ah18* and Ah63*), five S-alleles were shared between PIN and IND (Ah01*, Ah03*, Ah46* and Ah63*), four S-alleles were shared between IND and TSS (Ah01*, Ah03*, Ah31* and Ah63*), and three were shared across all three (Ah01*, Ah03* and Ah63*). Note that for convenience, we used A. halleri notations (with the addition of a *) to refer to the trans-specifically shared A. lyrata S-alleles. Altogether, we were able to obtain the phased flanking sequences of 126 S-locus haplotypes, comprising a total of 4,854 variable sites. This provides considerable power to evaluate the local accumulation of linked mutations across S-alleles of different levels of dominance and to examine their patterns of conservation between populations and between species.

Mutations in the S-locus flanking regions can be exchanged between S-alleles by recombination, and between local populations by migration (Charlesworth, 2006). The relative time scale of these two processes (recombination vs. migration) determines the distribution of the linked mutations. To capture the chromosomal extent of this effect of linkage to S-alleles, we developed a new phylogenetic method comparing the likelihood of two contrasted topologies of interest in overlapping windows along the chromosome: (1) the topology clustering haplotypes by the populations where they came from vs. (2) the topology clustering them by the S-allele to which they are linked (Fig 2A). This allowed us to evaluate the progressive shift from a predominant topology by S-alleles close to the S-locus to a topology by populations further along the chromosome and in unlinked control regions (Fig. 2B). The difference in log likelihood between the two topologies decreased significantly with distance to the S-locus (Pearson coefficient = −0.015 and −0.010 for A. halleri and A. lyrata respectively; p-values <2^e-16). In A. halleri, the topology grouping haplotypes by populations became more likely than the topology grouping them by S-alleles at a distance of around 30kb from the S-locus, but even at a distance of 50kb the phylogenetic structure was still different from that in regions unlinked to the S-locus used as controls for the genomic background (Le Veve et al., 2023; Fig 2B). In A. lyrata, the shift was even more rapid (within 10-15kb), although we note that the phylogenetic structure of the control regions was less resolved (Fig 2B). To evaluate these patterns more directly we first examined the data using a Major Component Analysis (MCA, a modified version of PCA adapted to binary data, Fig S1 and S2) and using simple phylogenetic reconstructions (Fig S3, S4, S5 and S6). We confirmed that haplotypes linked to a given S-allele tended to cluster together in the most tightly linked region, and that this grouping by S-alleles was progressively lost in favour of a grouping by population of origin in the most distant regions. Following Kamau et al. (2007), we compared the fixation index F_ST among local populations and among S-alleles in A. lyrata and A. halleri. In both A. halleri and A. lyrata, F_ST values among S-alleles were high in regions close to the S-locus and quickly decreased to reach the background level (Fig. S7) as the distance from the S-locus increased. In parallel, the differentiation among populations followed roughly the opposite pattern, i.e. it was initially low in regions close to the S-locus (as expected under strong balancing selection) and increased up to background level within the first few kilobases (Fig. S7). In line with our phylogenetic analysis, differentiation between populations started to exceed differentiation between S-alleles much closer to the S-locus in the A. lyrata than in the A. halleri populations (Fig S3, S4, S5 and S6). Finally, we explored the fine-scale patterns of association within populations between individual S-alleles and SNP in the linked and the control regions (Fig S8). As expected, the vast majority of significant associations were found for the most closely linked SNPs. With a single exception, all S-alleles were associated with unique SNPs in the 50kb region around the S-locus, albeit with substantial heterogeneity among S-alleles in the patterns and extent of associations that they show (Fig S8). Overall, our results indicate that due to limited recombination, the S-alleles carry a specific set of polymorphic sites in the linked region. This association fades away for more distant sites over a few kilobases, where population structure becomes predominant, as in the rest of the genome. Hence, different S-alleles are associated with specific sets of tightly linked mutations, but only within 10-30kb.

Linkage to the S-locus locally distorts the phylogenetic relationships.
A. The two topologies of interest cluster haplotypes either by the S-allele to which they are linked (top) or by the populations where they came (bottom). Different S-alleles are represented by symbols of different colours, different populations of origin are represented by symbols of different shapes. B. Difference in log likelihood of the two topologies of interest. Dots correspond to the difference in log likelihood for overlapping series of 50 SNPs around the S-locus for A. halleri (top panel) and A. lyrata (bottom panel). Positive values correspond to chromosomal positions where the topology by S-alleles explains the phylogeny of haplotypes better than the topology by populations. The right panels show the difference in log likelihood in the control regions. 2.5 and 97.5 percentiles of the distribution in the control regions are indicated by dashed lines.

No overall evidence that dominant S-alleles accumulate more linked deleterious mutations

Llaurens et al. (2009a) predicted that recessive deleterious mutations should fix more readily when linked to dominant S-alleles than when linked to recessive S-alleles. To test this prediction, we investigated the correlation between the level of dominance of the S-alleles and their total number of 0-fold degenerate mutations (S_0f) or the ratio of 0-fold to 4-fold mutations (S_0f/S_4f) for the phased haplotypes, assuming that the vast majority of 0-fold degenerate mutations are deleterious. Based on the results presented above and the results of our previous study (Le Veve et al., 2023), for the rest of our analyses we focused on the phased haplotypes over the first 25 kb on either side of the S-locus. We found no overall effect of dominance on S_0f (p-values= 0.54 and 0.07 for A. halleri and A. lyrata respectively; Fig. 3; table S6) or S_0f/S_4f (p-values= 0.54 and 0.07 for A. halleri and A. lyrata respectively; table S6). Extending the analysis to all non-synonymous mutations, or to deleterious mutations predicted by SIFT4G and by SNPeff led to identical conclusions (table S6). Overall, our genomic results did not confirm the prediction that dominant S-alleles accumulate a larger number of putatively deleterious mutations in their linked regions. We note that the particular S-allele whose sheltered load was quantified in Llaurens et al. (2009a) (Ah15, red arrow on Fig 3A) appears to be one of the S-alleles associated with the highest number of 0-fold degenerate mutations among all S-alleles of the most dominant class (class IV).

No overall effect of S-allele dominance on the total number of 0-fold degenerate mutations (S_0f) in the linked genomic regions within 25kb.
Each dot represents the mean number of mutations observed among haplotypes linked to one S-allele in one population. The correlations evaluated by a GLM are represented by lines, with confidence intervals represented in grey. The dominance was considered a continuous variable. A: A. halleri. Black dots correspond to the Nivelle population, grey dots to the Mortagne population. The red arrow points to the copy of Ah15, corresponding to the S-allele whose sheltered load was phenotypically characterised by Llaurens et al. (2009a) in Nivelle. B: A. lyrata. Red dots correspond to the IND population, black dots to the PIN population and blue dots to the TSS population.

The structure of the linked genetic load differs between dominant and recessive S-alleles

Theory predicts that dominant S-alleles should fix linked recessive deleterious mutations with a higher probability than recessive S-alleles (Llaurens et al., 2009a), but in natural populations we observed no difference in the total number of putatively deleterious linked to dominant vs. recessive S-alleles. To clarify this discrepancy, we took advantage of our sequencing of multiple copies of S-alleles to consider separately the fixed and the segregating mutations linked to each of the S-allele lineages. For each population, we included only mutations that were segregating, and excluded those that were locally fixed. In agreement with the prediction of Llaurens et al. (2009a), we observed that lineages of dominant S-alleles do indeed tend to fix deleterious mutations more readily (Fig. 4; table S6). This conclusion held true when extending the analysis to all non-synonymous mutations and to the lowly and the moderately deleterious mutations predicted by SNPeff (table S6). This was also true using SIFT4G to identify deleterious mutations, with the only exception of a non-significant correlation for A. halleri, which might be due to the low number of nucleotide sites included in the SIFT4G database, resulting in low power to detect differences (table S6). The fact that dominant S-alleles tend to fix deleterious mutations more readily but do not accumulate a larger total number of deleterious mutations is explained by the fact that the structure of the genetic load differs between dominant and recessive S-alleles: the dominant S-alleles tend to have more fixed deleterious mutations, but the recessive S-alleles compensate by accumulating a larger number of segregating mutations, resulting in similar numbers of deleterious mutations overall in most of the populations.

The number of 0-fold degenerate mutations fixed in the 25kb regions flanking the S-locus increases with dominance of the S-allele associated.
Each dot represents the value obtained for haplotypes linked to one S-allele in one population. The correlations evaluated by a GLM are represented by lines, with confidence intervals represented in grey. The dominance was considered as a continuous variable. A: A. halleri. Black dots correspond to the Nivelle population, grey dots to the Mortagne population. B: A. lyrata. Red dots correspond to the IND population, black dots to the PIN population, blue dots to the TSS population.

Motivated by these empirical observations, we built upon the model by Llaurens et al. (2009a), who showed that linked deleterious mutations (especially fully recessive ones) are expected to fix within dominant S-allele lineages more readily than within recessive S-allele lineages. Here, we adapted the model to focus not only on fixed deleterious mutations, but also on those that are segregating within allelic lineages. Our stochastic simulations confirmed that, at equilibrium, dominant S-alleles tend to accumulate a larger number of recessive deleterious mutations that are fixed among gene copies within S-allele lineages (Fig. 5A). In contrast, the number of segregating linked mutations was higher for recessive than for dominant S-alleles (Fig. 5B). These two effects eventually compensate each other, such that in the end the mean number of linked deleterious mutations per copy of S-allele was not expected to change between dominant and recessive S-alleles (Fig. 5C). These predictions are in line with our genomic observations and suggest that the dominance level of S-alleles modifies the structure of the genetic load they shelter: dominant S-alleles accumulate more fixed deleterious mutations, but recessive S-alleles accumulate more segregating mutations, resulting in an equivalent load overall.

Stochastic simulations confirm the contrasted architecture of the load of deleterious mutations linked to dominant vs. recessive S-alleles.
Number of fixed (A), segregating (B) and total (C) deleterious mutations linked to S-alleles at four different levels of dominance (I<II<III<IV). The means (bold lines) were estimated per S-allele dominance classes over 100 replicate simulations after discarding an initial burn-in of 100,000 generations. h=0. s=0.01.

Discussion

The genetic load linked to the S-locus is detectable and is manifested on different phenotypes

Our results contribute to a growing body of evidence confirming that the accumulation of deleterious mutations linked to strongly balanced allelic lines can be substantial, and that their effect can be detected at the phenotypic level (Lane and Lawrence, 1995; Stone, 2004; Llaurens et al., 2009a; Mena-Ali et al., 2009; Stift et al., 2013; Vieira et al., 2021). An interesting observation is that the phenotypes on which the load was revealed varied among these studies. Here, the effect of homozygosity at the S-locus was apparent on juvenile survival and on the length of the longest flowering stem, but we detected no effect on any other morphological measurements, including leaf and rosette traits. In the same population of A. halleri, Llaurens et al. (2009a) detected an effect on juvenile survival and on leaf size. A study in North American outcrossing populations of A. lyrate (Stift et al., 2013) detected an effect on juvenile survival, but not on any other traits that they measured. In the horsenettle Solanum carolinense, the load was associated with reduced seed viability, flower number and germination (Stone, 2004; Mena-Ali et al., 2009). Hence, the most consistent pattern seems to be a decrease of overall juvenile survival, possibly because it is a highly integrative measurement of fitness, whereas other morphological or life history traits can be associated with more specific components of overall fitness.

A unique genetic load associated with each allele in each population

The model of the sheltered load posits that each S-allele should be associated with a specific set of linked mutations (Llaurens et al., 2009a). In line with this prediction, the magnitude of the S-linked load varied among S-alleles, as the load linked to some S-alleles was phenotypically detectable, while for others it was not. This variation of the genetic load is expected, since deleterious mutations associated with the different alleles are likely to hit different linked genes, and affect different phenotypic traits with different effects on fitness. Also in line with the model of the sheltered load, our phasing of a large number of variants linked to S-haplotypes in several natural populations revealed that the same suite of linked mutations was consistently associated among different copies of a given S-allele when sampled from within the same population, in particular for the dominant S-allele lineages under more intense balancing selection. As expected for outcrossing populations with short-scale linkage disequilibrium, this association was lost when examining sites at increasing genetic distances from the S-locus along the chromosome (see also Le Veve et al., 2023). Finally, the association with linked sites was further lost when comparing gene copies of S-alleles sampled from different local populations, suggesting that recombination within populations decouples alleles from their linked sites faster than migration can homogenise the genetic composition among these natural populations. We note that the patterns of association and phylogenetic structure differed among populations, possibly due to their contrasted demographic histories. Indeed, the A. lyrata populations colonised North America from ancestral European populations about 20-30.000 years ago (Clauss et al., 2006; Ross-Ibarra et al., 2008), and are less diverse overall than the A. halleri populations we studied, who colonised the north of France during the last century from ancestral German populations (Pauwels et al., 2005). The progressive decoupling between alleles and their linked sites leads to the simple prediction that S-locus homozygous genotypes formed by crossing individuals carrying identical alleles from distinct populations should not reveal as much load as when they are formed by crossing individuals within populations. Hence, the S-locus region could contribute to overall hybrid vigour. Testing this prediction will be an interesting next step.

Different properties of the linked load according to S-allele dominance

The question of whether variations in the intensity of balancing selection, mediated by S-allele dominance, could explain variation of the linked load has received conflicting support in the literature. In line with Stift et al. (2013), but in contradiction with Llaurens et al. (2009a), we observed no overall effect of dominance on the magnitude of the load. Several technical and biological reasons could explain the contrasted results obtained in these different studies. First, phenotypic quantification of the linked load is experimentally demanding, such that these studies relied on the comparison of a limited number of alleles (three S-alleles in each of the studies) and therefore each of them had inherently low power. Second, the experimental procedures to reveal the load varied slightly. Llaurens et al. (2009a) used CO₂ treatment to by-pass the SI system and obtain homozygous progenies from crosses that would otherwise have been incompatible, whereas we used the “natural” masking by dominant S-alleles to enable the obtention of recessive homozygous genotypes. Our approach is experimentally simpler and avoids the possible contamination by offspring obtained by selfing, which may confound the effect of the sheltered load with that of genome-wide inbreeding depression (see Stift et al., 2013 for a detailed discussion of this caveat). Third, a limitation of our approach is that it is restricted to S-alleles that are recessive or intermediate along the dominance hierarchy, and is thus not applicable to quantify the load associated with the most dominant S-alleles under more intense balancing selection. It is therefore possible that the S-alleles we examined did not exhibit sufficiently contrasted levels of dominance, in particular if only the most dominant ones are generating a substantial load, as suggested for fully linked recessive deleterious mutations (Llaurens et al., 2009a). In addition, since the homozygous S-allele genotypes we created correspond to different gene copies from the population, they may carry distinct sets of linked variants, especially for the more recessive S-alleles. The variation we observed in the phenotypic magnitude of the load among families confirms that linked deleterious variants are unlikely to be fixed within all allele lineages. Finally, we note that our genomic analysis of the genetic load shows that the dominant allele Ah15 previously associated with reduced fitness in homozygotes (Llaurens et al., 2009a), is indeed unusual in terms of the number of mutations it carries. In fact, it is one of the most “loaded” alleles among all the dominant S-alleles present in this population, possibly explaining why Llaurens et al. (2009a) observed a significant effect despite the inherently limited experimental power of their analysis.

A possible caveat of the population genetics approach we used is that simply counting up the number of putatively deleterious linked mutations is a very crude estimate of the genetic load. We note that our conclusions are robust to differences in the way we define deleterious mutations: as variants at 0-fold degenerate sites, at non-synonymous sites, or using methods to quantify the severity of mutations such as SNPeff of SIFT4G. An obvious limitation is that none of these approaches allow for evaluation of recessivity - a concept critical to ideas concerning the sheltered load. Our stochastic simulations could be improved in several ways. First, we examined the accumulation of linked deleterious mutations that were assumed to be fully recessive. This choice was guided by the observation by Llaurens et al. (2009a) that fully recessive mutations accumulate more substantially, but it remains possible that the dynamics of deleterious mutations that are only partially recessive may involve a complex interaction with dominance of the S-alleles to which they are linked. Second, allowing for partial recombination between S-alleles and their linked deleterious mutations in the simulations would also be necessary to predict the length of the chromosomal haplotypes associated with dominant vs. recessive S-alleles. In spite of these limitations, our stochastic simulations and genomic analyses concur to the conclusion that the variation of the intensity of balancing selection among S-alleles affect the genetic architecture of the linked load: a larger proportion of putatively deleterious mutations are fixed among gene copies of the dominant as compared to the recessive S-alleles, while gene copies of the recessive S-alleles tend to accumulate more segregating deleterious variation. While these two processes eventually compensate one another, they may have distinct consequences for the evolution of S-alleles. Uyenoyama (2003) showed that the existence of a sheltered load should influence the evolutionary dynamics of new S-alleles through self-compatible intermediates. Specifically, antagonistic interactions are expected between ancestral and derived functional specificities because they would initially share their linked deleterious mutations, slowing down the establishment of new S-alleles. Our observation that partially different sets of linked mutations are associated with S-alleles from the different populations raises the question of whether the (short) time scale at which recombination decouples S-alleles from their sets of linked mutation is sufficiently fast to impede such antagonistic interactions to take place. In other words, the effect of the load on the diversification dynamics should be most important if the two mutational steps required for the emergence of new S-alleles under this model take place within local populations, rather than involving a metapopulation-scale process. As shown by Stetsenko et al. (2023), this is expected to occur under very low dispersal only. In addition, the observation that the architecture of the sheltered load differs between dominant and recessive S-alleles suggests that their diversification dynamics may also differ. Specifically, the self-compatible intermediates required for the formation of new S-alleles (Gervais et al., 2011; Bergero and Charlesworth, 2009) are expected to be capable of selfing as well as forming homozygous genotypes that would otherwise be prevented. While the consequences of selfing may be equivalent for all alleles (because the overall number of mutations to which they are linked are equivalent), the consequences of the formation of homozygotes allowed by the crossing of separate individuals sharing a given S-allele are expected to be more severe for dominant S-alleles. The segregation of distinct deleterious variants linked to different gene copies of recessive S-alleles implies that linked recessive deleterious mutations are likely to remain masked when two distinct gene copies of a given recessive S-allele are brought together. Hence, our results lead to the prediction that in natural populations self-compatible mutants may segregate more readily for the more recessive than for the more dominant S-alleles, and more generally for allelic lineages under lower intensity of balancing selection. Considering that self-compatible mutants are a necessary intermediate stage in the formation of new S-alleles, one may predict that the diversification dynamics should be more efficient for lineages of recessive than dominant S-alleles. This prediction is in line with the observation in Arabidopsis that the most dominant S-alleles exhibit the deepest phylogenetic divergence among them (Durand et al., 2014). Detailed quantification of the presence of self-compatible variants in natural populations will now be necessary to test this hypothesis. At this stage, however, a proper model of allelic diversification taking into account dominance interactions among S-alleles is still missing.

Variations of the genetic load among balanced allelic lines is a general phenomenon. The classical case of Y or W sex chromosomes are indeed examples where one balanced line accumulates a greater genetic load than the other (X or Z, respectively), eventually leading to substantial genetic degeneration (Wright et al., 2016; Ponnikas et al., 2018). Another example is the supergene controlling variation in male plumage phenotypes of the ruff, where the genetic load on the derived “Satellite” haplotype is higher than on the ancestral “Independent” haplotype (Lamichhaney et al., 2016; Hill et al., 2022). Similarly, in the butterfly Heliconius numata, the inverted haplotypes conferring mimetic wing patterns tend to accumulate a greater load than the non-inverted haplotypes (Rosser et al., 2022). Interestingly, in all these cases, the haplotypes with the greatest load also act genetically in a dominant manner, establishing a clear parallel with our observations.

It is clear from our results that S-allele dominance affects the linked load, but in turn the differences in structure of the linked load may affect the conditions under which dominance can evolve. The Brassicaceae S-locus is a unique system, where dominance is controlled by “dominance modifiers” (Durand et al., 2014; Durand et al. 2020). The presence of deleterious mutations linked to S-alleles has been shown to affect the evolution of dominance modifiers, favouring evolution towards greater dominance than towards greater recessivity (Llaurens et al., 2009b). This asymmetry arises from the fact that S-alleles that become recessive (e.g. following acquisition of a recessivity modifier such as a small RNA target) will start forming homozygous genotypes, leading to expression of their linked load, while S-alleles that become dominant will not. Our observation that many deleterious mutations linked to recessive S-alleles are indeed segregating, suggests that expression of the load will be less severe for recessive than for dominant S-alleles, hence decreasing this predicted asymmetry. It will now be essential to modify models for the evolution of dominance to allow for such differential load among S-alleles.

Material and Methods

Source plant material

We worked on natural accessions from two closely related species, A. halleri and A. lyrata, represented by two population samples named Mortagne (50°47’N, 3°47’E, France, n=60) and Nivelle (50°47’N, 3°47’E, France, n=61) for A. halleri, and three highly outcrossing population samples from the North American Great Lakes, named IND (Indiana Dunes National Lakeshore in Michigan, n=9), PIN (Pinery Provincial Park in Ontario, n=11) and TSS (Tobermory Provincial Park in Ontario, n=8; Foxe et al., 2010) for A. lyrata (Fig. S9). The A. lyrata populations colonised North America from ancestral European populations about 20-30.000 years ago (Clauss and Mitchell-Olds, 2006; Ross-Ibarra et al., 2008) and the A. halleri populations are peripheral and likely colonised the north of France during the last century from ancestral German populations (Pauwels et al., 2005).

We performed 92, 91, 40, 43 and 21 controlled crosses between randomly chosen individuals within the Nivelle, Mortagne, IND, PIN and TSS populations, respectively. We successfully obtained seeds from 60, 66, 21, 21 and 10 of these crosses, respectively. Because we were not interested in estimating population frequencies of S-alleles, we instead tried to maximise the number of reconstructed haplotypes and avoid over representing the most recessive S-allele (Ah01) that tends to segregate at very high frequencies in natural populations (Llaurens et al., 2008). To do this, we performed PCR with S-allele-specific primers (Llaurens et al., 2008; Goubet et al., 2012) to screen the parents of the crosses and we removed from the experiment offspring with two parents carrying allele Ah01. For A. halleri, we selected 19 individuals from the Nivelle population and 19 individuals from the Mortagne population, based on their genotype at the S-locus (Fig. S9; table S7). We also selected one offspring of 9, 11, 5, 6 and 5 pairs of selected individuals in the Nivelle, Mortagne, IND, PIN and TSS populations respectively for the phasing of S-haplotypes (table S8; Fig. S9). To increase sample size for the phenotypic measurements, we included offspring from five additional crosses from the Nivelle population (table S8).

Library preparation, capture and sequencing

We used a previously developed sequence capture approach to specifically sequence genomic regions of interest (Le Veve et al., 2023). Briefly, indexed genomic libraries were constructed for each individual and libraries were pooled in equimolar proportions. Fragments matching a series of regions of interest (including in particular the 75kb upstream and downstream of the non-recombining S-locus region as well as a series of 100 unlinked 25kb regions used as genomic controls; Le Veve et al., 2023), were then enriched using synthetic 120bp RNA probes and sequenced by Illumina MiSeq (a total of 159 million paired-end reads).

For six individuals (table S7, S8), we completed the sequencing with genome-wide resequencing (WGS) in order to distinguish the homozygous and heterozygous genotypes at the S-locus based on read depth (Genete et al., 2020), which is not possible using data from the capture protocol. The prepared libraries were sequenced by Illumina NovaSeq (2x 150pb, paired-end) from the GenoScreen platform (Lille, France).

Determination of the S-locus genotypes and dominance of S-alleles

We used a dedicated pipeline for genotyping the S-locus based on short reads sequencing (Genete et al., 2020) obtained from each individual (table S7 and S8). The level of dominance of S-alleles found in our study was determined based on either previous assessment of dominance in A. lyrata and A. helleri (Schierup et al., 2001; Mable et al., 2003; Bechsgaard,et al., 2004; Llaurens et al., 2008; Goubet et al., 2012) or indirectly inferred based on the observed association between the phylogeny of S-alleles and levels of dominance (Prigoda et al., 2005).

Read mapping and variant calling in A. halleri and A. lyrata populations

Raw reads were mapped on the complete A. lyrata reference genome V1.0.23 (Hu et al., 2011) using Bowtie2 v2.4.1 (Langmead and Salzberg, 2012), as described in Le Veve et al. (2023). File formats were then converted to BAM using samtools v1.3.1 (Li et al., 2009) and duplicated reads were removed with the MarkDuplicates program of picard-tools v1.119 (http://broadinstitute.github.io/picard). These steps were performed by the custom Python script sequencing_genome_vcf.py available at https://github.com/leveveaudrey/analysis-of-polymorphism-S-locus.

We obtained an average of 620 million properly mapped paired-end 300bp reads per population sample. For consistency, we conserved only reads which mapped to the S-locus flanking or control regions, even for samples sequenced by WGS, using the targetintercept option of bedtool v2.25.0 (Quinlan and Hall, 2010). We called all SNPs within the chromosomal segment comprising 50 kb upstream from the first base of the gene Ubox in 3’ and 50 kb downstream from the last base of the gene ARK3 in 5’ of the S-locus using the Genome Analysis Toolkit v. 3.8 (GATK; DePristo et al., 2011) with the option GVCF and a quality score threshold of 60 using vcftool v0.1.15 (Danecek et al., 2011). This region contains 20 annotated protein-coding genes. In this study we excluded the genes inside the S-locus itself (SCR, SRK). For each sample independently, we computed the distribution of coverage depth across control regions using samtools depth (Li et al., 2009). We excluded sites with either less than 15 reads aligned or coverage depth above the 97.5 % percentile, as the latter are likely to correspond to repeated sequences (e.g. transposable elements or paralogs). Finally, we removed SNPs fixed in each population using the script 1_fix_pos_vcf.py (https://github.com/leveveaudrey/dominance_and_sheltered_load), thus retaining only nucleotide sites that were variable in the population.

Quantifying the sheltered load of deleterious mutations

We examined deleterious mutations based on the accumulation of either 1) mutations on 0-fold degenerate sites, 2) all non-synonymous mutations, 3) mutations predicted to be deleterious based on the SIFT4G database (Vaser et al,. 2016) or 4) mutations predicted to be lowly, moderately and highly deleterious by SNPeff (Cingolani et al,. 2012). The 0-fold and 4-fold degenerate sites were identified and extracted from the reference genome and the gene annotation using the script NewAnnotateRef.py (Williamson et al., 2014). None of the tools used to predict deleterious mutations are able to determine dominance levels of the mutation. Thus, all the deleterious mutations were considered as recessive. Details of the number of deleterious for each type is presented in table S6.

Phasing S-haplotypes

For each of the 9, 11, 5, 6 and 5 trios analysed in the Nivelle, Mortagne, IND, PIN and TSS populations respectively, we phased mutations in the flanking regions, resulting in 130 phased haplotypes (Fig. S9). Briefly, we used sites that were heterozygous in the offspring to resolve parental haplotypes by assuming no recombination between parent and offspring, thus attributing the allelic state that was shared between a parent and its offspring to their shared S-allele, and the allelic state that was not shared to the other (untransmitted) haplotype of the parent. Twelve of the parents had been used in more than one cross, and in these cases we phased their haplotypes only once (table S8). We implemented the phasing procedure in the script 3_phase_S_allele.py available at https://github.com/leveveaudrey/dominance_and_sheltered_load.

Study of the structure of S-haplotypes

We first developed a new method to evaluate the distortion of the phylogenetic patterns caused by linkage to S-alleles. To do this, we used phyml v.3.3 (Guindon et al., 2010) to calculate the likelihood of two contrasted topologies of interest: (1) the topology clustering haplotypes by the populations where they came from vs. (2) the topology clustering them by the S-allele to which they are linked (Fig 2A). We used sliding windows of sequences with 50 SNPs to obtain the variation of the difference in log-likelihood between these two topologies along the chromosome. We then compared these values to their distribution throughout the genome obtained by random draws of sequences with 50 SNPs from the control regions. Second, we visualised the relationships among the phased haplotypes using maximum likelihood phylogenies based on the Tamura-Nei model (Tamura and Nei, 1993), with 1,000 replicates in MEGA X (Kumar et al., 2018). Third, we followed Kamau et al.’s (2007) approach and examined the variation of F_ST among populations within each species (Nivelle and Mortagne for A. halleri and IND, PIN and TSS for A. lyrata) along the flanking region in non-overlapping windows of 5kb. We also examined the variation of F_ST along the flanking region obtained by grouping haplotypes by their linked S-allele rather than by population of origin. Then, we compared these F_ST values computed in the S-locus flanking regions with their genomic distribution as determined from the 100 control regions. The F_ST values were estimated with the DNAsp 6 software (Rozas et al., 2017). Fourth, we performed a major component analysis (MCA) based on SNPs in the first 5kb, SNPs between 5 and 25kb and SNPs between 25 and 50kb around the S-locus, using the R packages ‘ggplot2’ (version 3.4.0), ‘factoextra’ (version 1.0.7) and ‘FactoMiner’ (version 2.7). We compared the patterns obtained by these MCAs with those obtained from identical numbers of SNP (+/- 1%) from the control regions. Finally, we analysed genetic association in each population independently between each of the locally segregating variants and the S-alleles considered as phenotypes, using STRAT V1.1 (Pritchard et al., 2000) combined with Structure V2.3 (Pritchard et al., 2010). We examined the distribution of the top 0.1% most significant associations detected specifically for each S-allele in each population.

Estimation of the number of fixed and segregating deleterious mutations within S-allele lineages

For each variable position considered in the phased haplotypes, we estimated the number of mutations on 0-fold (S_0f) and 4-fold degenerate sites (S_4f) compared with the reference genome. We distinguished SNPs that were fixed from those that were segregating within each of the allelic lines. We used GLM with a Poisson distribution to test whether the number of fixed and segregating mutations were associated with S-allele dominance, considering populations as random effects. The dominance of the S-allele was considered a continuous variable. We reiterated the GLM analysis with the number of non-synonymous (S_NS), synonymous (S_S), lowly and moderately deleterious mutations predicted by SNPeff and deleterious mutations predicted by SIFT4G mutations.

Estimation of the phenotypic impact of homozygosity at the S-locus for three S-alleles

To determine if the genetic sheltered load putatively linked to the S-locus has a detectable phenotypic impact, we performed 45 crosses (table S1) between offspring of the Nivelle individuals that we chose so that they shared one S-allele (Fig. S9). Based on the dominance hierarchy in pollen (Durand et al., 2014; table S1), these crosses should correspond to compatible partners. The general principle of the experiment was to take advantage of the dominance hierarchy to mask recessive S-alleles and generate full sibs that were either homozygous (because they inherited the S-allele that was shared by their two parents) or heterozygous at the S-locus, and thus isolate the effect of homozygosity at the S-locus. Note that all offspring in our experiments were thus “naturally” outcrossed, whereas Llaurens et al. (2009a) based their comparisons on outcrossed progenies obtained by enforced incompatible crosses and Stift et al.¹⁹ based their comparisons on enforced selfed progenies. These crosses generated 399 seeds overall, with homozygous genotypes expected for the S-alleles Ah01, Ah03 and Ah04 forming the following dominance relationship: Ah01<Ah03<Ah04.

Seedlings were grown in a greenhouse between 14.5 and 23.1°C and a photoperiod of 16 hr day/8 hr night. Offspring from the six families were placed on tables, and their position randomised every three days. After three months of growing, all the germinated plants were vernalised under a temperature between 6 and 8°C and a natural photoperiod for two months (January-February). Then, all surviving plants began reproduction in a greenhouse under temperature between 10.6 and 25.3°C and a natural photoperiod. The genotypes at the S-locus were determined in surviving plants by a PCR approach, using S-allele-specific primers for the pistil-expressed SRK gene. We assessed the reproductive success of offspring from the different crosses on the basis of fourteen phenotypic traits (detailed below) and computed the mean difference for the trait between homozygotes and heterozygotes within each family. We also tested for departures from mendelian proportions of each S-locus genotypic category in the family after the apparition of the first stem. Significant departures were interpreted as reflecting differences in survival between homozygous and heterozygous S-locus genotypes. We performed 10,000 replicate simulations of mendelian segregation based on the S-locus genotype of the parents. We used GLM to test whether the phenotypic impact of homozygosity at the S-locus increased with dominance of the S-alleles, considered as a continuous variable. The models used for GLM depended on the type of trait analysed (poisson for the counts like the number of leaves, flowers by stems or days; gaussian for continue traits like the lengths, widths and areas).

We measured the following fourteen phenotypic traits: the time (days) to the first leaf measured by visual control every day during seven weeks after sowing the seeds, the number of leaves, the area of the rosette (cm²), the mean length and width of leaves (cm), the standard deviation of length and width of leaves (cm) and the mean area of leaves (cm²) measured by ImageJ (Schneider et al., 2012) based on photographs taken seven weeks (+/− five days) after the first leaf. At reproduction, we measured the time to the first flower bud for the end of vernalisation (day), scored by visual control every three days during nine weeks, the number of flower buds per flower stem produced during four week after the appearance of the first bud, the number of flower stems, the length of the highest flower stem produced four weeks after the appearance of the first bud (cm), and finally the total duration of buds production (days), scored by visual control every three days during eleven weeks after the appearance of the first bud. The last trait we measured was the proportion of homozygotes per family that survived until reproduction assuming mendelian proportions in the seeds. During the whole experiment, the presence of phytophagous insects, pathogens and stress markers were scored as binary variables. The presence of phytophagous insects and pathogen attacks were detected by the occurrence of gaps in leaves. Oxidative stress was scored qualitatively based on the occurrence of purple leaves. We also controlled the effect of the family on the phenotypic trait. These effects were controlled by redistributing 10,000 times the values observed in groups of the same size observed for each effect (for example, presence or absence of pathogen attack) and comparing the difference for the trait observed with the distribution of the differences obtained in the permutations. We considered the impact of the effect on the trait if the observed difference between groups was higher than the 95% percentile of the distribution obtained randomly (table S9). When the test was significant, the effect was implemented as a random effect in the GLM. We used the same method to control for the family effect, which was included as a random effect in GLM if necessary (table S10). The general experimental procedure is summarised in Fig. S9 and all data analyses were done in R ver. 3.1.2 (R Development Core Team 2014).

Simulations

Finally, we refined the model of Llaurens et al. (2009a), in several ways. We simulated a panmictic population of N diploid individuals with non-overlapping generations. Each individual was defined by its genotype in a non-recombining genomic region. This region contains the S-locus, and a D locus where deleterious mutations accumulated. For the S-locus, we used a simple model of sporophytic SI, with four dominance classes, as observed in A. halleri (Genete et al., 2020; only three classes were considered before in Llaurens et al., 2009a), and fourteen S-alleles (eight alleles in the class IV, three in the class III, two in the class II and one allele in the class I). This distribution mirrors that of the Nivelle population (table S7), with the exception that a class II allele has been added because its presence has been reported in previous studies (Llaurens et al., 2008). Alleles within classes were assumed to be codominant with each other, and dominant over all alleles of the more recessive classes, with the following linear hierarchy between classes : classI<classII<classIII<classIV). We also assumed that no new S-allele could appear by mutation during the simulations. The population size was 10,000 diploid individuals, so as to be large enough to avoid S-allele loss by drift during the simulations (previously it was 1,000). The “D locus” comprised one hundred fully linked biallelic positions (versus a single one in Llaurens et al., 2009a). Fully recessive deleterious mutations were recurrently introduced (at a rate 10^-4), and reverse mutations were possible (at a rate 10^-5). We ignored partially recessive deleterious mutations because these mutations were predicted to be effectively eliminated by natural selection in Llaurens et al. (2009a). The survival probability p of a zygote depended on its genotype at the D locus: p = (1 − s)ⁿ with s the selection coefficient and n the number of positions homozygous for the mutated allele. We explored different values of the selection coefficient (0.1, 0.05, 0.03, 0.01 and 0.005). Under strong selection (s=0.1, 0.05 and 0.03), the combined effect of multiple mutations led to low-fitness individuals, eventually causing population extinction. Under weak selection, (s=0.005), we observed near fixation of the deleterious mutations under the influence of asymmetrical mutation. Hence, we focused on the intermediate value of the selection coefficient (s=0.01), where deleterious mutations segregated stably in the simulations.

We first ran simulations without deleterious mutations until a deterministic equilibrium for S-allele frequencies was reached, which was considered to be attained when allelic frequencies changed by less than 10^-3 between generations. Recessive deleterious mutations were then allowed to accumulate at the positions within the D locus. Each simulation was performed with 100 independent replicates of 100,000 generations, and the frequency of the deleterious alleles was recorded every 1,000 generations. At the end of the simulation runs, we estimated the number of deleterious mutations found in each haplotype associated with each S-allele to determine the expected patterns of association between the sheltered load and dominance at the S-locus.

The code of the program of simulations developed in Llaurens et al. (2009a) and used in our study is available in Github (https://github.com/leveveaudrey/model_ssi_Llaurens).

Data Availability

All sequence data are available in the NCBI Short Read Archive (SRA; https://www.ncbi.nlm.nih.gov/sra) with accession codes: PRJNA744343, PRJNA755829.

All scripts developed are available in Github (https://github.com/leveveaudrey/dominance_and_sheltered_load https://github.com/leveveaudrey/analysis-of-polymorphism-S-locus).

The code of the program of simulations developed in Llaurens et al. (2009a) and used in our study is available in Github (https://github.com/leveveaudrey/model_ssi_Llaurens).

Acknowledgements

This work was funded by the European Research Council (NOVEL project, grant #648321) and ANR TE-MoMa (grant ANR-18-CE02-0020-01). AL’s PhD thesis was funded by the ERC and the University of Lille. The authors thank Barbara Mable for sharing seeds of A. lyrata and Camille Roux for discussions. This work was performed using infrastructure and technical support of the Plateforme Serre, cultures et terrains expérimentaux - Université de Lille for the greenhouse/field facilities. The authors thank the UMR 8199 LIGAN-MP Genomics platform (Lille, France) which belongs to the ‘Federation de Recherche’ 3508 Labex EGID (European Genomics Institute for Diabetes; ANR-10-LABX-46) and was supported by the ANR Equipex 2010 session (ANR-10-EQPX-07-01; ‘LIGAN-MP’). The LIGAN-MP Genomics platform (Lille, France) is also supported by the FEDER and the Region des Hauts-de-France. The authors thank the GenoScreen platform (Lille, France).

Author contributions

AL, ED, VC and XV developed and designed the experiments for the study. AL, CBL and CP performed the experiments. AL and MG analysed and interpreted the data. AL, ED, VC and XV wrote the manuscript. All authors edited the manuscript.

Declaration of interests

The authors declare no competing interests.

Supplementary data

Analysis of Major Components obtained for haplotypes of A. halleri of the Nivelle (black dots) and Mortagne (grey dots) populations) based on SNPs in the first 5kb, between 5 and 25kb, and between 25kb and 50kb away from the S-locus.
The S-allele to which the SNPs are linked are represented by different symbols. The right panels show the analysis on control regions, each time matching the number of SNPs with that of the corresponding linked regions (left panels).

Analysis of major components (AMC) obtained for haplotypes of A. lyrata (of the PIN: grey dots, IND: red dots and TSS: blue dots) populations based on the SNPs in the first 5kb, between 5 and 25kb, and between 25kb and 50kb away from the S-locus.
The S-allele to which the SNPs are linked are represented by different symbols. The right panels show the analysis on control regions, each time matching the number of SNPs with that of the corresponding linked regions (left panels).

Phylogenetic tree obtained by Maximum Likelihood for haplotypes of A. halleri (populations Nivelle and Mortagne) across the first 25kb flanking the S-locus.
The Tamura-Nei model was used and the percentage of trees in which the associated haplotypes clustered together is shown next to the branches. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The black braces indicate haplotypes clustering by populations. The overall pattern shows a structure by S-alleles, with exceptions highlighted in yellow.

Phylogenetic tree obtained by maximum likelihood for haplotypes of A. halleri (populations Nivelle and Mortagne) based on the nucleotide positions between 25kb and 50kb away from the S-locus.
The Tamura-Nei model was used and the percentage of trees in which the associated haplotypes clustered together is shown next to the branches. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The black braces indicate haplotypes clustering by populations. The overall pattern shows a structure by S-alleles, with exceptions highlighted in yellow.

Phylogenetic tree obtained by maximum likelihood for haplotypes of A. lyrata (populations PIN, IND, TSS) across the first 5kb flanking the S-locus.
The Tamura-Nei model was used and the percentage of trees in which the associated haplotypes clustered together is shown next to the branches. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The black braces indicate haplotypes clustering by populations. The overall pattern shows a structure by S-alleles, with exceptions highlighted in yellow.

Phylogenetic tree obtained by maximum likelihood for haplotypes of A. lyrata (populations PIN, IND, TSS) based on the nucleotide positions between 5kb and 10kb away from the S-locus.
The Tamura-Nei model was used and the percentage of trees in which the associated haplotypes clustered together is shown next to the branches. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The black braces indicate haplotypes clustering by populations. The overall pattern shows a structure by S-alleles, with exceptions highlighted in yellow.

The genetic structure of SNPs in the S-locus flanking regions in A. halleri and A. lyrata.
Left: Mean F_ST (lines) and F_ST by pair (point) analysed among S-alleles (grey) or among populations (black) in 5 kb windows in the S-locus flanking regions. Right: Distribution (count) of F_ST in the control regions analysed among S-alleles (grey bars) or among populations (black bars). The 95% percentiles of the distributions are represented by dotted lines and the medians by solid lines.

Patterns of genetic associations between S-alleles and SNPs across the genome.
Each dot corresponds to a SNP showing statistically significant association (top 0.1%) with a given S-allele. The left panel confirms that SNPs physically linked to the S-locus (in red) are considerably more likely to show statistical associations with S-alleles. The chromosomes were signified by alternance of black and grey. The middle panel shows a zoom on the 50kb regions flanking the S-locus and shows that the statistical association extends over long distances, at least for some S-alleles. Each region of 25 kb was delimited by vertical lines. The right panel shows that the observed number of associated SNPs in the linked regions far exceeds that in regions of identical size from control regions. The histogram shows the distribution across control regions of the mean number of significantly associated SNPs per S-allele. The vertical lines correspond to the mean number of significantly associated SNPs in the first 25kb (solid lines) and the 25-50kb interval (dashed lines) away from S-locus.).

Experimental protocol.
A) We randomly crossed A. lyrata individuals from the PIN, TSS and IND populations in North America (left) and A. halleri from the Nivelle (middle) and Mortagne (right) populations. Individuals were sequenced by a capture protocol. Numbers between parentheses represent the number of individuals per dataset. B) One offspring from each cross was sequenced along with its two parents for trio haplotyping. Offspring from the Nivelle population (black circle) were conserved for the study of the impact of homozygosity at the S-locus on fitness (G1 population). C)Individuals sequenced in A and B were used to reconstruct haplotypes linked to each copy of S-allele, assuming no recombination between the S-locus and its flanking regions between parents and offspring. D) We used the dominance hierarchy between S-alleles expressed in pollen to cross G1 individuals from the Nivelle population and obtained six G2 families constituted of heterozygous and homozygous individuals for the alleles Ah01, Ah03 and Ah04. E) Description of the traits measured, and the methods used to estimate the impact of homozygosity at the S-locus in homozygotes. Traits 1-8 are related to biomass and traits 10-14 are related to reproductive success.

Supplemental items

Crosses performed to obtain homozygotes for three S-alleles.
For each S-allele, we obtained homozygous and heterozygous plants within two separate families (listed as separate rows).

Trait variation in S-locus homozygous individuals.

Trait variation in homozygous at the S-locus for the S-alleles Ah01 and Ah04 between families.

Effect of dominance on variation of phenotypic traits in S-locus homozygous individuals.

Number of phased haplotypes linked to S-alleles in each sample.

Effect of dominance on the accumulation of genetic load in S-flanking regions.

S-locus genotypes of individuals sequenced using the capture protocol.

S-locus genotypes of the offspring selected for haplotype phasing and the crosses for the study of phenotypic traits.

Effect of phytopathogen, phytophagous attacks and oxidative stress on the phenotypic

Difference on the phenotypic trait variation between the two families for each allele tested.

References

1. Bechsgaard J
2. Bataillon T
3. Schierup M.H
2004Uneven segregation of sporophytic self-incompatibility alleles in Arabidopsis lyrataJournal of Evolutionary Biology 17:554–561https://doi.org/10.1111/j.1420-9101.2004.00699.x Google Scholar
1. Bergero R
2. Charlesworth D
2009The evolution of restricted recombination in sex chromosomesTrends in Ecology & Evolution 24:94–102https://doi.org/10.1016/j.tree.2008.09.010 Google Scholar
1. Billiard S
2. Castric V
3. Vekemans X
2007A general model to explore complex dominance patterns in plant sporophytic self-incompatibility systemsGenetics 175:1351–1369https://doi.org/10.1534/genetics.105.055095 Google Scholar
1. Castric V
2. Vekemans X
2004Plant self-incompatibility in natural populations: a critical assessment of recent theoretical and empirical advancesMolecular Ecology 13:2873–2889https://doi.org/10.1111/j.1365-294X.2004.02267.x Google Scholar
1. Charlesworth D
2006Balancing selection and its effects on sequences in nearby genome regionsPLOS Genetics 2:e64https://doi.org/10.1371/journal.pgen.0020064 Google Scholar
1. Cingolani P.
2. Platts A.
3. Wang L.L.
4. Coon M.
5. Nguyen T.
6. Wang L.
7. Land S.J.
8. Lu X.
9. Ruden D.M
2012A program for annotating and predicting the effects of single nucleotide polymorphismsSnpEff. Fly (Austin) 6:80–92https://doi.org/10.4161/fly.19695 Google Scholar
1. Clauss M.J
2. Mitchell-Olds T
2006Population genetic structure of Arabidopsis lyrata in EuropeMolecular Ecology 15:2753–2766https://doi.org/10.1111/j.1365-294X.2006.02973.x Google Scholar
1. Cocker J.M
2. Wright J
3. Li J
4. Swarbreck D
5. Dyer S
6. Caccamo M
7. Gilmartin P.M
2018Primula vulgaris (primrose) genome assembly, annotation and gene expression, with comparative genomics on the heterostyly supergeneSci Rep 8:17942https://doi.org/10.1038/s41598-018-36304-4 Google Scholar
1. Danecek P
2. Auton A
3. Abecasis G
4. Albers C.A
5. Banks E
6. DePristo M.A
7. Handsaker R.E
8. Lunter G
9. Marth G.T
10. Sherry S.T
11. et al
2011The variant call format and VCFtoolsBioinformatics 27:2156–2158https://doi.org/10.1093/bioinformatics/btr330 Google Scholar
1. DePristo M.A
2. Banks E
3. Poplin R.E
4. Garimella K.V
5. Maguire J.R
6. Hartl C
7. Philippakis A.A
8. del Angel G
9. Rivas M.A
10. Hanna M
11. et al
2011A framework for variation discovery and genotyping using next-generation DNA sequencing dataNat Genet 43:491–498https://doi.org/10.1038/ng.806 Google Scholar
1. Durand E
2. Méheust R
3. Soucaze M
4. Goubet P.M
5. Gallina S
6. Poux C
7. Fobis-Loisy I
8. Guillon E
9. Gaude T
10. Sarazin A
11. et al
2014Dominance hierarchy arising from the evolution of a complex small RNA regulatory networkScience 346:1200–1205https://doi.org/10.1111/eva.12933 Google Scholar
1. Durand E
2. Chantreau M
3. Le Veve A
4. et al
2020Evolution of self-incompatibility in the Brassicaceae: Lessons from a textbook example of natural selectionEvolutionary Applications https://doi.org/10.1111/eva.12933 Google Scholar
1. Foxe J.P
2. Stift M
3. Tedder A
4. Haudry A
5. Wright S.I
6. Mable B.K
2010Reconstructing origins of loss of self-incompatibility and selfing in North american Arabidopsis lyrata: a population genetic contextEvolution 64:3495–3510https://doi.org/10.1111/j.1558-5646.2010.01094.x Google Scholar
1. Genete M
2. Castric V
3. Vekemans X
2020Genotyping and de novo discovery of allelic variants at the Brassicaceae self-incompatibility locus from short read sequencing dataMol Biol Evol https://doi.org/10.1093/molbev/msz258 Google Scholar
1. Gervais C.E
2. Castric V
3. Ressayre A
4. Billiard S
2011Origin and diversification dynamics of self-Incompatibility haplotypesGenetics 188:625–636https://doi.org/10.1534/genetics.111.127399 Google Scholar
1. Goubet P.M
2. Bergès H
3. Bellec A
4. Prat E
5. Helmstetter N
6. Mangenot S
7. Gallina S
8. Holl A.-C
9. Fobis-Loisy I
10. Vekemans X
11. et al
2012Contrasted patterns of molecular evolution in dominant and recessive self-incompatibility haplotypes in ArabidopsisPLoS Genetics 8:e1002495https://doi.org/10.1371/journal.pgen.1002495 Google Scholar
1. Guindon S
2. Dufayard J-F
3. Lefort V
4. et al
2010New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0Syst Biol 59:307–321https://doi.org/10.1093/sysbio/syq010 Google Scholar
1. Hartmann F.E
2. Duhamel M
3. Carpentier F
4. et al
2021Recombination suppression and evolutionary strata around mating-type loci in fungi: documenting patterns and understanding evolutionary and mechanistic causesNew Phytologist 229:2470–2491https://doi.org/10.1111/nph.17039 Google Scholar
1. Hatakeyama K
2. Watanabe M
3. Takasaki T
4. Ojima K
5. Hinata K
1998Dominance relationships between S -alleles in self-incompatible Brassica campestris LHeredity 80:241–247https://doi.org/10.1046/j.1365-2540.1998.00295.x Google Scholar
1. Hill J
2. Enbody E
3. Bi H
4. Lamichhaney S
5. Schwochow D
6. Younis S
7. Widemo F
8. Andersson L
2022Low mutation load in a supergene underpinning alternative male mating strategies in ruffMol Biol Evol https://doi.org/10.1101/2022.04.27.489720 Google Scholar
1. Hu T.T
2. Pattyn P
3. Bakker E.G
4. Cao J
5. Cheng J.-F
6. Clark R.M
7. Fahlgren N
8. Fawcett J.A
9. Grimwood J
10. Gundlach H
11. et al
2011The Arabidopsis lyrata genome sequence and the basis of rapid genome size changeNat Genet 43:476–481https://doi.org/10.1038/ng.807 Google Scholar
1. Jay P
2. Chouteau M
3. Whibley A
4. Bastide H
5. Parrinello H
6. Llaurens V
7. Joron M
2021Mutation load at a mimicry supergene sheds new light on the evolution of inversion polymorphismsNat Genet 53:288–293https://doi.org/10.1038/s41588-020-00771-1 Google Scholar
1. Jay P
2. Tezenas E
3. Véber A
4. Giraud T
2022Sheltering of deleterious mutations explains the stepwise extension of recombination suppression on sex chromosomes and other supergenesPLOS Biology 20:e3001698https://doi.org/10.1371/journal.pbio.3001698 Google Scholar
1. Kakizaki T
2. Takada Y
3. Ito A
4. Suzuki G
5. Shiba H
6. Takayama S
7. Isogai A
8. Watanabe M
2003Linear Dominance Relationship among Four Class-II S Haplotypes in Pollen is Determined by the Expression of SP11 in Brassica Self-IncompatibilityPlant and Cell Physiology 44:70–75https://doi.org/10.1093/pcp/pcg009 Google Scholar
1. Kamau E
2. Charlesworth B
3. Charlesworth D
2007Linkage Disequilibrium and Recombination Rate Estimates in the Self-Incompatibility Region of Arabidopsis lyrataGenetics 176:2357–2369https://doi.org/10.1534/genetics.107.072231 Google Scholar
1. Katsura Y
2. Iwase M
3. Satta Y
2012Evolution of genomic structures on mammalian sex chromosomesCurrent Genomics 13:115–123https://doi.org/10.2174/138920212799860625 Google Scholar
1. Kumar S
2. Stecher G
3. Li M
4. Knyaz C
5. Tamura K
2018MEGA X: Molecular evolutionary genetics analysis across computing platformsMolecular Biology and Evolution 35:1547–1549https://doi.org/10.1093/molbev/msy096 Google Scholar
1. Kusaba M
2. Dwyer K
3. Hendershot J
4. Vrebalov J
5. Nasrallah J.B
6. Nasrallah M.E
2001Self-incompatibility in the genus Arabidopsis: characterization of the S locus in the outcrossing A. lyrata and its autogamous relative A. thalianaPlant Cell 13:627–643Google Scholar
1. Lamichhaney S
2. Fan G
3. Widemo F
4. Gunnarsson U
5. Thalmann D.S
6. Hoeppner M.P
7. Kerje S
8. Gustafson U
9. Shi C
10. Zhang H
11. Chen W
12. Liang X
13. Huang L
14. Wang J
15. Liang E
16. Wu Q
17. Lee S.M.-Y
18. Xu X
19. Höglund J
20. Liu X
21. Andersson L
2016Structural genomic changes underlie alternative reproductive strategies in the ruff (Philomachus pugnax)Nat Genet 48:84–88https://doi.org/10.1038/ng.3430 Google Scholar
1. Lane M.D
2. Lawrence M.J
1995The population genetics of the self-incompatibility polymorphism in Papaver rhoeas. X. An association between incompatibility genotype and seed dormancyHeredity 75:92–97https://doi.org/10.1038/hdy.1995.108 Google Scholar
1. Langmead B
2. Salzberg S.L
2012Fast gapped-read alignment with Bowtie 2Nat Methods 9:357–359https://doi.org/10.1038/nmeth.1923 Google Scholar
1. Le Veve A
2. Burghgraeve N
3. Genete M
4. et al
2023Long-term balancing selection and the genetic load linked to the self-incompatibility locus in Arabidopsis halleri and A. lyrataMolecular Biology and Evolution msad 120https://doi.org/10.1093/molbev/msad120 Google Scholar
1. Li H
2. Handsaker B
3. Wysoker A
4. Fennell T
5. Ruan J
6. Homer N
7. Marth G
8. Abecasis G
9. Durbin R
10. 1000 Genome Project Data Processing Subgroup
2009The Sequence Alignment/Map format and SAMtoolsBioinformatics 25:2078–2079https://doi.org/10.1093/bioinformatics/btp352 Google Scholar
1. Llaurens V
2. Billiard S
3. Leducq J.-B
4. Castric V
5. Klein E.K
6. Vekemans X
2008Does frequency-dependent selection with complex dominance interactions accurately predict allelic frequencies at the self-incompatibility locus in Arabidopsis halleriEvolution 62:2545–2557https://doi.org/10.1111/j.1558-5646.2008.00469.x Google Scholar
1. Llaurens V
2. Gonthier L
3. Billiard S
2009aThe sheltered genetic load linked to the S locus in plants: new insights from theoretical and empirical approaches in sporophytic self-incompatibilityGenetics 183:1105–1118https://doi.org/10.1534/genetics.109.102707 Google Scholar
1. Llaurens V
2. Billiard S
3. Castric V
4. Vekemans X
2009bEvolution of dominance in sporophytic self-incompatibility systems: I. Genetic load and coevolution of levels of dominance in pollen and pistilEvolution 63:2427–2437https://doi.org/10.1111/j.1558-5646.2009.00709.x Google Scholar
1. Mable B.K
2. Schierup M.H
3. Charlesworth D
2003Estimating the number, frequency, and dominance of S -alleles in a natural population of Arabidopsis lyrata (Brassicaceae) with sporophytic control of self-incompatibilityHeredity 90:422–431https://doi.org/10.1038/sj.hdy.6800261 Google Scholar
1. Mattila T.M
2. Laenen B
3. Horvath R
4. Hämälä T
5. Savolainen O
6. Slotte T
2019Impact of demography on linked selection in two outcrossing Brassicaceae speciesEcology and Evolution 9:9532–9545https://doi.org/10.1002/ece3.5463 Google Scholar
1. Mena-Ali J. I
2. Keser L. H
3. Stephenson A. G
2009The effect of sheltered load on reproduction in Solanum carolinense, a species with variable self-incompatibilitySex. Plant Reprod 22:63–67https://doi.org/10.1007/s00497-008-0092-x Google Scholar
1. Nettancourt D
2001Incompatibility and incongruity in wild and cultivated plantsBerlin Heidelberg: Springer-Verlag Google Scholar
1. Pauwels M
2. Saumitou-Laprade P
3. Holl A.C
4. Petit D
5. Bonnin I
2005Multiple origin of metallicolous populations of the pseudometallophyte Arabidopsis halleri (Brassicaceae) in central Europe: the cpDNA testimonyMolecular Ecology 14:4403–4414https://doi.org/10.1111/j.1365-294X.2005.02739.x Google Scholar
1. Ponnikas S
2. Sigeman H
3. Abbott J.K
4. Hansson B
2018Why do sex chromosomes stop recombining?Trends Genet 34:492–503https://doi.org/10.1016/j.tig.2018.04.001 Google Scholar
1. Prigoda N.L
2. Nassuth A
3. Mable B.K
2005Phenotypic and genotypic expression of self-incompatibility haplotypes in Arabidopsis lyrata suggests unique origin of alleles in different dominance classesMolecular Biology and Evolution 22:1609–1620https://doi.org/10.1093/molbev/msi153 Google Scholar
1. Pritchard J. K
2. Stephens M
3. Rosenberg N. A
4. Donnelly P
2000Association mapping in structured populationsThe American Journal of Human Genetics 67:170–181https://doi.org/10.1086/302959 Google Scholar
1. Pritchard J. K
2. Wen X
3. Falush D
2010Documentation for structure software: Version 2.3Chicago, IL: University of Chicago :1–37Google Scholar
1. Quinlan A.R
2. Hall I.M
2010BEDTools: a flexible suite of utilities for comparing genomic featuresBioinformatics 26:841–842https://doi.org/10.1093/bioinformatics/btq033 Google Scholar
1. Ross-Ibarra J
2. Wright S.I
3. Foxe J.P
4. Kawabe A
5. DeRose-Wilson L
6. Gos G
7. Charlesworth D
8. Gaut B.S
2008Patterns of polymorphism and demographic history in natural populations of Arabidopsis lyrataPLOS ONE 3:e2411https://doi.org/10.1371/journal.pone.0002411 Google Scholar
1. Rosser N
2. Edelman N.B
3. Queste L.M
4. et al
2022Complex basis of hybrid female sterility and Haldane’s rule in Heliconius butterflies: Z-linkage and epistasisMolecular Ecology 31:959–977https://doi.org/10.1111/mec.16272 Google Scholar
1. Rozas J
2. Ferrer-Mata A
3. Sánchez-DelBarrio J.C
4. Guirao-Rico S
5. Librado P
6. Ramos-Onsins S.E
7. Sánchez-Gracia A
2017DnaSP 6: DNA sequence polymorphism analysis of large data setsMolecular Biology and Evolution 34:3299–3302https://doi.org/10.1093/molbev/msx248 Google Scholar
1. Schierup M.H
2. Mikkelsen A.M
3. Hein J
2001Recombination, balancing selection and phylogenies in MHC and self-incompatibility genesGenetics 159:1833–1844https://doi.org/10.1093/genetics/159.4.1833 Google Scholar
1. Schneider C.A
2. Rasband W.S
3. Eliceiri K.W
2012NIH Image to ImageJ: 25 years of image analysisNat Methods 9:671–675https://doi.org/10.1038/nmeth.2089 Google Scholar
1. Schopfer C.R
2. Nasrallah M.E
3. Nasrallah J.B
1999The male determinant of self-incompatibility in BrassicaScience 286:1697–1700https://doi.org/10.1126/science.286.5445.1697 Google Scholar
1. Stetsenko R
2. Brom T
3. Castric V
4. Billiard S
2023Balancing selection and the crossing of fitness valleys in structured populations: diversification in the gametophytic self-incompatibility systemEvolution 77:907–920https://doi.org/10.1093/evolut/qpac065 Google Scholar
1. Stift M
2. Hunter B.D
3. Shaw B
4. Adam A
5. Hoebe P.N
6. Mable B.K
2013Inbreeding depression in self-incompatible North-American Arabidopsis lyrata: disentangling genomic and S-locus-specific genetic loadHeredity 110:19–28https://doi.org/10.1038/hdy.2012.49 Google Scholar
1. Stone J.L
2004Sheltered load associated with S-alleles in Solanum carolinenseHeredity 92:335–342https://doi.org/10.1038/sj.hdy.6800425 Google Scholar
1. Tamura K
2. Nei M
1993Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzeesMolecular Biology and Evolution 10:512–526https://doi.org/10.1093/oxfordjournals.molbev.a040023 Google Scholar
1. Uyenoyama M.K
1997Genealogical structure among alleles regulating self-incompatibility in natural populations of flowering plantsGenetics 147:1389–1400https://doi.org/10.1093/genetics/147.3.1389 Google Scholar
1. Uyenoyama M.K
2003Genealogy-dependent variation in viability among self-incompatibility genotypesTheoretical Population Biology 63:281–293https://doi.org/10.1016/S0040-5809(03)00020-0 Google Scholar
1. Uyenoyama M.K
2005Evolution under tight linkage to mating typeNew Phytol 165:63–70https://doi.org/10.1111/j.1469-8137.2004.01246.x Google Scholar
1. Vaser R
2. Adusumalli S
3. Leng S.N
4. Sikic M
5. Ng P.C
2016SIFT missense predictions for genomesNat Protoc 11:1–9https://doi.org/10.1038/nprot.2015.123 Google Scholar
1. Vieira J
2. Pimenta J
3. Gomes A
4. Laia J
5. Rocha S
6. Heitzler P
7. Vieira C.P
2021The identification of the Rosa S-locus and implications on the evolution of the Rosaceae gametophytic self-incompatibility systemsSci Rep 11:3710https://doi.org/10.1038/s41598-021-83243-8 Google Scholar
1. Williamson R.J
2. Josephs E.B
3. Platts A.E
4. Hazzouri K.M
5. Haudry A
6. Blanchette M
7. Wright S.I
2014Evidence for widespread positive and negative selection in coding and conserved noncoding regions of Capsella grandifloraPLOS Genetics 10:e1004622https://doi.org/10.1371/journal.pgen.1004622 Google Scholar
1. Wright A.E
2. Dean R
3. Zimmer F
4. Mank J.E
2016How to make a sex chromosomeNat Commun 7:12087https://doi.org/10.1038/ncomms12087 Google Scholar
1. Yasuda S
2. Wada Y
3. Kakizaki T
4. Tarutani Y
5. Miura-Uno E
6. Murase K
7. Fujii S
8. Hioki T
9. Shimoda T
10. Takada Y
11. Shiba H
12. Takasaki-Yasuda T
13. Suzuki G
14. Watanabe M
15. Takayama S
2017A complex dominance hierarchy is controlled by polymorphism of small RNAs and their targetsNature Plants 3https://doi.org/10.1038/nplants.2016.206 Google Scholar

Article and author information

Author information

Audrey Le Veve
Univ. Lille, CNRS, UMR 8198 – Evo-Eco-Paleo, F-59000 Lille, France, Department of Botany, Faculty of Science, Charles University, Benátská 2, CZ-128 01 Prague, Czechia
ORCID iD: 0000-0001-5895-370X
Mathieu Genete
Univ. Lille, CNRS, UMR 8198 – Evo-Eco-Paleo, F-59000 Lille, France
ORCID iD: 0000-0002-9640-8793
Christelle Lepers-Blassiau
Univ. Lille, CNRS, UMR 8198 – Evo-Eco-Paleo, F-59000 Lille, France
Chloé Ponitzki
Univ. Lille, CNRS, UMR 8198 – Evo-Eco-Paleo, F-59000 Lille, France
Céline Poux
Univ. Lille, CNRS, UMR 8198 – Evo-Eco-Paleo, F-59000 Lille, France
ORCID iD: 0000-0002-9379-2769
Xavier Vekemans
Univ. Lille, CNRS, UMR 8198 – Evo-Eco-Paleo, F-59000 Lille, France
ORCID iD: 0000-0002-4836-4394
Eleonore Durand
Univ. Lille, CNRS, UMR 8198 – Evo-Eco-Paleo, F-59000 Lille, France
ORCID iD: 0000-0002-0912-9252
Vincent Castric
Univ. Lille, CNRS, UMR 8198 – Evo-Eco-Paleo, F-59000 Lille, France
ORCID iD: 0000-0002-4461-4915
- Correspondence : vincent.castric@univ-lille.fr

Version history

Preprint posted: November 15, 2023
Sent for peer review: December 8, 2023
Reviewed Preprint version 1: March 5, 2024
Reviewed Preprint version 2: July 10, 2024
Version of Record published: September 2, 2024

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.94972. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Revised: This Reviewed Preprint has been revised by the authors in response to the previous round of peer review; the eLife assessment and the public reviews have been updated where necessary by the editors and peer reviewers.

Reviewing Editor
Detlef Weigel
Max Planck Institute for Biology Tübingen, Tübingen, Germany
Senior Editor
Detlef Weigel
Max Planck Institute for Biology Tübingen, Tübingen, Germany

Joint Public Review:

An outside expert evaluated your responses to the original reviewers and offered the following comments:

The main criticism was whether deleterious variants were appropriately classified in the work. The authors use two different methods to characterize the effect of alleles to satisfy these comments. The result is somewhat complex. The authors do replicate the effect of dominance on fixation and segregation of deleterious alleles by classifying polymorphisms as synonymous or synonymous with SNPeff. This is not entirely surprising as it is approximately equivalent to classifying based on fold degeneracy (but it includes sites that have other than 0 or 4 fold degeneracy). However, the authors do not mention in the text that their observation of increased segregating deleterious mutations in recessive alleles was only statistically significant in A. halleri (for both analyses). Using SIFT, the authors only find an effect of dominance in A. lyrata. So in reality, while the trends are the same across the analyses, the statistical significance of the effects of dominance was not consistent.

Reviewer 2 had several more detailed criticisms of the manuscript. The first was that the authors should explore the dominance of linked deleterious mutations themselves. I agree that this would be interesting, but it is very difficult to accomplish, and I agree with the author's reluctance to do much more here. The reviewer also criticized the authors simulation approach. The authors provided their simulation script as requested, but declined to do additional simulations under varied selection coefficients. I felt this was a minimally adequate response to the reviewers concerns, but the authors could have reasonably conducted a few additional simulations under varied selection coefficients.

I think that the scope of the findings described in the assessment was reasonable. This is interesting work, but despite the author's arguments, the system is somewhat unique if for no other reason than that balancing selection at S-loci is uniquely strong

https://doi.org/10.7554/eLife.94972.2.sa1

Author response:

The following is the authors’ response to the original reviews.

Reviewer #1 (Public Review):

Summary:

The paper combines phenotypic and genomic analyses of the "sheltered load" (i.e. the accumulation of deleterious mutations linked to S-loci that are hidden from selection in the homozygous state) in Arabidopsis. The authors compare results to previous theoretical predictions concerning the extent of the load in dominant vs recessive S-alleles, and further develop exciting theory to reconcile differences between previous theory and observed results.

Strengths:

This is a very nice combination of theory and data to address a classical question in the field.

We thank the reviewer for this positive feedback.

Weaknesses:

The "genetic load" is a poorly defined concept in general, and its quantification via the number of putatively deleterious mutations is quite difficult. Furthermore counting up the number of derived mutations at fully constrained nucleotides may not be a great estimate of the load, and certainly does not allow for evaluation of recessivity -- a concept critical to ideas concerning the sheltered load. Alternative approaches - including estimating the severity of mutations - could be helpful as well. This imperfection in available approaches to test theory must be acknowledged more strongly by the authors.

As suggested by the reviewer, we implemented alternative approaches to estimate the severity of deleterious mutations and now report the results of SNPeff and

SIFT4G analyses in Table S6. The results we obtained with these other metrics were overall very similar to those based on our previous counting of mutations at 0-fold and 4-fold degenerate sites. More generally, we tried to improve the presentation of our strategy to estimate the genetic load (clarified in lines 262-268, 271, 292-295, 297. In particular, we made it clear that our population genetic analysis cannot assess the recessivity of the observed mutations (lines 428-434).

Reviewer #2 (Public Review):

Summary:

This study looks into the complex dominance patterns of S-allele incompatibilities in Brassicaceae, through which it attempts to learn more about the sheltering of deleterious load. I found several weak points in the analyses that diminished my excitement about the results. In particular, the way in which deleterious mutations were classified lacked the ability to distinguish the severity of the mutations and thus their expected associated dominance.

First, we would like to clarify that our goal with this study is NOT to learn something about dominance of the linked deleterious mutations (we can not). Instead, we compare the accumulation of deleterious mutations linked to dominant vs recessive S-ALLELES, but are agnostic regarding the dominance level of the LINKED mutations themselves. The rationale is that the different intensities of natural selection between dominant vs recessive S-alleles provide a powerful way to examine the process by which deleterious mutations are sheltered in general. We further clarified this aspect on lines 70-73 and 399-401.

Second, as mentioned above in response to Reviewer 1, we complemented the analysis by predicting the severity of the deleterious mutations by SIFT4G and SNPeff. The results were largely consistent, with the exception that the number of sites included in SIFT4G was low, such that the statistical power was reduced (lines 296-300).

Furthermore, the simulation approach could have provided this exact sort of insight but was not designed to do so, making this comparison to the empirical data also less than exciting for me.

As explained above, studying dominance of the linked mutations we observed is an interesting research question (albeit a difficult one), but it was not our goal here. Instead, our study was designed as an empirical test of the predictions presented in Llaurens et al (2009), and we re-analysed some aspects of the model outcome to illustrate our points.

We now better explain that we based our choice of parameters on the fact that in the theoretical study by Llaurens et al (2009), recessive deleterious mutations are predicted to accumulate in a much more straightforward manner (line 316-318).

We now dedicate a paragraph of the discussion to explain how our stochastic simulations could be improved, and acknowledge that a full exploration of the interaction between dominance of the S-alleles and dominance of the linked deleterious mutations would be an interesting follow-up - albeit beyond the scope of our study (line 437-441).

Major and minor comments:

I think the introduction (or somewhere before we dive into it in the results) of the dominance hierarchy for the S-alleles needs a more in-depth explanation. Not being familiar with this beforehand really made this paper inaccessible to me until I then went to find out more before continuing. I would expect this paper to be broad enough that self-contained information makes it accessible to all readers. For example, lines 110-115 could be in the Introduction.

We thank the reviewer for this useful remark. We now give a more comprehensive description of the dominance hierarchy and introduce the classes of dominance in A. lyrata already in the introduction, on lines 64-70.

Along with my above comment, perhaps it is not my place to comment, but I find the paper not of a broad enough scope to be of interest to a broad readership. This S-allele dominance system is more than simple balancing selection, it is a very complex and specific form of dominance between several haplotypes, and the mechanism of dominance does not seem to be genetic. I am not sure that it thus extrapolates to broad comments on general dominance and balancing selection, e.g. it would not be the same as considering inversions and this form of balancing selection where we also expect recessive deleterious mutations to accumulate.

We disagree with these interpretations by the reviewer, for two reasons:

First, the mechanism of dominance is actually entirely genetic. In fact, we uncovered some years ago that it is based on the molecular interaction between small non-coding RNAs from dominant alleles and their target sites on recessive alleles (Durand et al. Science 2014, see lines 68-70). If there is something specific with this system, it is that the dominance phenomenon is better understood at the mechanistic level than in most other cases, but the resulting phenomenon in itself (a dominance hierarchy) is rather common.

Second, the kind of variation in the intensity of linked selection created by this mechanism is actually a general phenomenon, so our results have broad relevance beyond our particular study system. We modified the introduction to explain this point

more clearly, highlighting in particular the fact that the situation we study closely resembles the case of sex chromosomes, where X (or Z) chromosomes are genetically recessive and Y (or W) chromosomes are genetically dominant. We cite this example in lines 83-87 of the introduction and also several well-studied other examples on lines 480-489 of the discussion.

It would have been particularly interesting, or a nice addition, to see deleterious mutations classed by something like SNPeff or GERP where you can have different classes of moderate to severe deleterious variants, which we would expect also to be more recessive the more deleterious they are. In line with my next comment on the simulations, I think relative differences between mutations expected to be more or less dominant may be even more insightful into the process of sheltering which may or may not be going on here.

We agree with the reviewer, and as detailed above we have now integrated such analyses with SNPeff and SIFT4G (Table S6). These new results reinforce our conclusion that while S-allele dominance influences the fixation of deleterious mutations, it has no effect on their total number. See lines 270-272 and 296-300.

In the simulations, h=0 and s=0.01 (as in Figure 5) for all deleterious mutations seems overly simplistic, and at the convenient end for realistic dominance. I think besides recessive lethals which we expect to be close to h=0 would have a much larger selection coefficient, and other deleterious mutations would only be partially recessive at such an s value. I expect this would change some of the simulation results seen, though to what degree I am not certain. It would be nice to at least check the same exact results for h=0.3 or 0.2 (or additionally also for recessive lethals, e.g. h=0 and s=-0.9). I would also disagree with the statement in line 677, many studies have shown, particularly those on balancing selection, that partially recessive deleterious mutations are not eliminated by natural selection and do play a role in population genetic dynamics. I am also not surprised that extinction was found for higher s values when the mutation rate for such mutations was very high and the distribution of s values was constant. An influx of such highly deleterious mutations is unlikely to ever let a population survive, yet that does NOT mean that in nature, the rare influx of such mutations does lead to them being sheltered. I find overall that the simulation results contribute very little, to none, to this paper, as without something more realistic, like a simultaneous distribution of s and h values, you cannot say which, if any class of these mutations are the ones expected to accumulate because of S-allele dominance.

We understand that the previous version of our manuscript was confusing between dominance of the S-alleles and dominance of the linked deleterious mutations. We clarified that our study focuses on the effect of the former only (lines 99, 263-264 and 581-583).

We agree that a complete exploration of the interaction between dominance of the S-alleles and dominance of the linked mutations being sheltered would have been an asset, but as explained above this is not the focus of our study. The previous work by Llaurens et al (2009) has already established that deleterious mutations can fix within S-allele lineages, especially when linked to dominant S-alleles, and when the number of S-alleles is large. Under the conditions they examined, deleterious mutations were much more strongly eliminated if not fully recessive (h=0 vs h=0.2), so for the present study we decided to simulate fully recessive mutations only. We now formally acknowledge the possibility that some complex interaction may take place between dominance of the S-alleles and dominance of the linked deleterious mutations (lines 440-442). However, as explained above we feel that fully exploring this complex interaction would require a detailed investigation, which is clearly beyond the scope of the present study.

Rather they only show the disappointing or less exciting result that fully recessive, weakly deleterious mutations (which I again think do not even exist in nature as I said above) have minor, to no effect across the classes of S-allele dominance. They provide no insight into whether any type of recessive deleterious mutation can accumulate under the S-allele dominance hierarchy, and that is the interesting question at hand. I would either remove these simulations or redo them in another approach. The authors never mention what simulation approach was used, so I can only assume this is custom, in-house code. Yet I do not find that code provided on the github page. I do not know if the lack of a distribution for h and s values is then a choice or a programming limitation, but I see it as one that should be overcome if these simulations are meant to be meaningful to the results of the study.

The code we used (in C) was adapted from the previous study by Llaurens et al. (2009), which at the time was not deposited in a data repertory, unfortunately. With the agreement of the authors of that study, this code is now available on Github:

(https://github.com/leveveaudrey/model_ssi_Llaurens; line 723).

It is correct that our simulations were not aimed at determining whether “any type of recessive deleterious mutation can accumulate”, but we strongly believe that they help interpreting the observations made in the genomic data.

Recommendations for the authors:

Notes from the editor:

I found Table 1 confusing, with column headings of observed proportion but perhaps numbers reflecting counts.

Thank you for pointing out this confusion. There was indeed an error in the last column, which we have now corrected.

I found Figure 2 a bit hard to parse, with the vertical lines being unclear and the x-axis ticks of insufficient resolution to evaluate the physical extent of the signals.

We increased the size of the label on the x-axis and detailed it on the Figure 2, which is now hopefully more clear. Moreover, we increase the size of the vertical lines.

Finally, I wonder, given the rapid decay of signal in lyrata, whether 25kb is the right choice for evaluating load and whether the pattern may look different on a smaller scale.

It is true that the signal decays rapidly in A. lyrata, as can be seen in the haplotype structure analysis and in line with our previous analysis of the same populations Le Veve et al (MBE 2023; in this study we explored the effect of the choice of the size of the chromosomal region analyzed; lines 266-269). However, for the sake of comparison, we prefer to stick to the same window size. The fact that we still see an effect of dominance in spite of the lower statistical power associated with the more rapid decay (because a smaller number of genes is expected to be impacted) actually reinforces our conclusions.

Reviewer #1 (Recommendations For The Authors):

I have a few additional suggestions to improve the manuscript.

(1) How does the load linked to the S-locus compare to that observed in other genomic regions? It would be useful to provide a comparison of the results quantified in Figures three and four to comparable genomic regions unlinked to the S-locus. How severe is the linked load?

This comparison to the genomic background was actually the core of our previous study (Le Veve et al MBE 2023), which was based on the same populations. This analysis revealed that polymorphism of the 0-fold degenerate sites was more than twice higher in the 25kb immediately flanking the S-locus than in a series of 100 unlinked control regions. Here, the main focus of the present study is on the effect of linkage to particular S-alleles (which was not possible previously because haplotypes had to be phased).

(2) Details of the GLM for data underlying Figures 3 and 4 are somewhat unclear. Is the key explanatory variable (Dominance) treated as continuous? Categorical? Ordinal etc…

Dominance is considered as a continuous variable. We specify this in line 162 of the results, in the legends of Figures 3 and 4, in the Material and Method (lines 627 and 660) and in the legend of Table S4.

(3) I had some trouble understanding the two different p-values in columns five and six of table one. Please provide more detail.

We understand that the two p-values in Table 1 were confusing. The first was related to the binomial test and the second to the permutation test. To be consistent with the rest of the manuscript, we conserved only the p-value of the permutation test.

(4) As mentioned in the "weaknesses" above, the authors should be more clear about what they are quantifying. They are explicitly counting the number of variants at 0-fold degenerate sites as a proxy for the genetic load. How good this proxy is is unclear. The most egregious misstatement here was on line 314 in which they make reference to the "total load." However, this limitation should be acknowledged throughout the manuscript and deserves more attention in the methods and discussion.

As mentioned above, we now integrate additional methods to define and quantify the load (SIFT4G and SNPeff), which reinforced our previous conclusions (lines 271-272, 297-302).

We clarified our wording and replaced the mention of “total load” by “mean number of linked deleterious mutations per copy of S-allele” (line 324-325). In the discussion we tried to better explain the limitations of approaches to estimate the genetic load (line 431-437).

Reviewer #2 (Recommendations For The Authors):

Line 60, it should be specified that this is only for recessive deleterious mutations.

Non-recessive deleterious mutations would certainly not be expected to accumulate.

As explained in details above, the question of whether and how non-recessive deleterious mutations can accumulate when linked to the S-locus is difficult and would in itself deserve a full treatment, which is clearly beyond the scope of the present study. We clarified this point on line 56.

https://doi.org/10.7554/eLife.94972.2.sa0

Significance of findings

Strength of evidence

Abstract

Introduction

Results

The genetic load linked to the S-locus varies among S-alleles, but is not correlated with dominance

Proportion of S-locus homozygous offspring having reached the reproductive stage for three different S-alleles.

Effect of homozygosity at the S-locus on 13 phenotypic traits compared to heterozygotes.

S-alleles are associated with specific sets of tightly linked mutations

Linkage to the S-locus locally distorts the phylogenetic relationships.

No overall evidence that dominant S-alleles accumulate more linked deleterious mutations

No overall effect of S-allele dominance on the total number of 0-fold degenerate mutations (S0f) in the linked genomic regions within 25kb.

The structure of the linked genetic load differs between dominant and recessive S-alleles

The number of 0-fold degenerate mutations fixed in the 25kb regions flanking the S-locus increases with dominance of the S-allele associated.

Stochastic simulations confirm the contrasted architecture of the load of deleterious mutations linked to dominant vs. recessive S-alleles.

Discussion

The genetic load linked to the S-locus is detectable and is manifested on different phenotypes

A unique genetic load associated with each allele in each population

Different properties of the linked load according to S-allele dominance

Material and Methods

Source plant material

Library preparation, capture and sequencing

Determination of the S-locus genotypes and dominance of S-alleles

Read mapping and variant calling in A. halleri and A. lyrata populations

Quantifying the sheltered load of deleterious mutations

Phasing S-haplotypes

Study of the structure of S-haplotypes

Estimation of the number of fixed and segregating deleterious mutations within S-allele lineages

Estimation of the phenotypic impact of homozygosity at the S-locus for three S-alleles

Simulations

Data Availability

Acknowledgements

Author contributions

Declaration of interests

Supplementary data

Analysis of Major Components obtained for haplotypes of A. halleri of the Nivelle (black dots) and Mortagne (grey dots) populations) based on SNPs in the first 5kb, between 5 and 25kb, and between 25kb and 50kb away from the S-locus.

Analysis of major components (AMC) obtained for haplotypes of A. lyrata (of the PIN: grey dots, IND: red dots and TSS: blue dots) populations based on the SNPs in the first 5kb, between 5 and 25kb, and between 25kb and 50kb away from the S-locus.

Phylogenetic tree obtained by Maximum Likelihood for haplotypes of A. halleri (populations Nivelle and Mortagne) across the first 25kb flanking the S-locus.

Phylogenetic tree obtained by maximum likelihood for haplotypes of A. halleri (populations Nivelle and Mortagne) based on the nucleotide positions between 25kb and 50kb away from the S-locus.

Phylogenetic tree obtained by maximum likelihood for haplotypes of A. lyrata (populations PIN, IND, TSS) across the first 5kb flanking the S-locus.

Phylogenetic tree obtained by maximum likelihood for haplotypes of A. lyrata (populations PIN, IND, TSS) based on the nucleotide positions between 5kb and 10kb away from the S-locus.

The genetic structure of SNPs in the S-locus flanking regions in A. halleri and A. lyrata.

Patterns of genetic associations between S-alleles and SNPs across the genome.

Experimental protocol.

Supplemental items

Crosses performed to obtain homozygotes for three S-alleles.

Trait variation in S-locus homozygous individuals.

Trait variation in homozygous at the S-locus for the S-alleles Ah01 and Ah04 between families.

Effect of dominance on variation of phenotypic traits in S-locus homozygous individuals.

Number of phased haplotypes linked to S-alleles in each sample.

Effect of dominance on the accumulation of genetic load in S-flanking regions.

S-locus genotypes of individuals sequenced using the capture protocol.

S-locus genotypes of the offspring selected for haplotype phasing and the crosses for the study of phenotypic traits.

Effect of phytopathogen, phytophagous attacks and oxidative stress on the phenotypic

Difference on the phenotypic trait variation between the two families for each allele tested.

References

Article and author information

Author information

Audrey Le Veve

Mathieu Genete

Christelle Lepers-Blassiau

Chloé Ponitzki

Céline Poux

Xavier Vekemans

Eleonore Durand

Vincent Castric

Version history

Cite all versions

Copyright

Peer review process

Editors

No overall effect of S-allele dominance on the total number of 0-fold degenerate mutations (S_0f) in the linked genomic regions within 25kb.