Single-cell eQTL mapping in yeast reveals a tradeoff between growth and reproduction

eLife assessment

This manuscript describes the mapping of natural DNA sequence variants that affect gene expression and its noise, as well as cell cycle timing, using as input single-cell RNA-sequencing of progeny from crosses between wild yeast strains. The method represents an important advance in the study of natural genetic variation. The findings, especially given the follow-up validation of the phenotypic impact of a mapped locus of major effect, provide convincing support for the rigor and utility of the method.

https://doi.org/10.7554/eLife.95566.3.sa0

Significance of the findings:

Important: Findings that have theoretical or practical implications beyond a single subfield

Landmark
Fundamental
Important
Valuable
Useful

Strength of evidence:

Convincing: Appropriate and validated methodology in line with current state-of-the-art

Exceptional
Compelling
Convincing
Solid
Incomplete
Inadequate

During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife Assessments

Abstract
Introduction
Results
Discussion
Materials and methods
Appendix 1
Data availability
References
Article and author information
Metrics

Abstract

Expression quantitative trait loci (eQTLs) provide a key bridge between noncoding DNA sequence variants and organismal traits. The effects of eQTLs can differ among tissues, cell types, and cellular states, but these differences are obscured by gene expression measurements in bulk populations. We developed a one-pot approach to map eQTLs in Saccharomyces cerevisiae by single-cell RNA sequencing (scRNA-seq) and applied it to over 100,000 single cells from three crosses. We used scRNA-seq data to genotype each cell, measure gene expression, and classify the cells by cell-cycle stage. We mapped thousands of local and distant eQTLs and identified interactions between eQTL effects and cell-cycle stages. We took advantage of single-cell expression information to identify hundreds of genes with allele-specific effects on expression noise. We used cell-cycle stage classification to map 20 loci that influence cell-cycle progression. One of these loci influenced the expression of genes involved in the mating response. We showed that the effects of this locus arise from a common variant (W82R) in the gene GPA1, which encodes a signaling protein that negatively regulates the mating pathway. The 82R allele increases mating efficiency at the cost of slower cell-cycle progression and is associated with a higher rate of outcrossing in nature. Our results provide a more granular picture of the effects of genetic variants on gene expression and downstream traits.

Introduction

Genome-wide studies have identified thousands of loci that influence gene expression; these loci are known as expression quantitative trait loci or eQTLs (Albert and Kruglyak, 2015; Kang et al., 2023). eQTLs serve as an important bridge between DNA sequence variation and organismal phenotypes and provide a mechanism by which noncoding variants can underlie complex traits (Finucane et al., 2015; Umans et al., 2021; Gusev et al., 2016). The vast majority of eQTL studies to date have relied on measurements of average gene expression levels in bulk populations of cells (Aguet et al., 2020; Albert et al., 2018). This approach, while experimentally tractable, can lose information about known differences in genetic effects among tissues (Aguet et al., 2020), cell types (Westra et al., 2015; Chen et al., 2016; Kim-Hellmuth et al., 2020; Ota et al., 2021), and cellular states (Strober et al., 2019). Recently, studies in humans have leveraged single-cell RNA sequencing (scRNA-seq) to more flexibly investigate how eQTL effects are altered in different contexts (Elorbany et al., 2022; Cuomo et al., 2020; Neavin et al., 2021; Jerber et al., 2021; Yazar et al., 2022; van der Wijst et al., 2018), including cellular states that are difficult to access with bulk approaches (Nathan et al., 2022). However, obtaining these more granular eQTL maps with either bulk or single-cell approaches comes at the cost of substantial increases in the numbers of samples that must be obtained and analyzed one at a time.

In model organisms, such as the nematode Caenorhabditis elegans and the budding yeast Saccharomyces cerevisiae, mapping populations of millions of recombinant progeny can be generated in a single flask (Ehrenreich et al., 2010; Burga et al., 2019). Such populations can be combined with scRNA-seq in a ‘one-pot’ eQTL mapping design in which the same single-cell data enables measurement of gene expression, cell type classification, and genotyping of transcribed variants in each cell (Ben-David et al., 2021). This design has two major advantages. First, it retains information about tissues, cell types, and cellular states. Second, by replacing expensive and labor-intensive genotyping and expression profiling of many samples with a single scRNA-seq experiment, it enables facile exploration of genetics of gene expression in many different genetic backgrounds and in response to many environmental perturbations. Here, we implement this design in yeast (Figure 1A), which presents additional challenges due to small cell size and the presence of a cell wall (Gasch et al., 2017; Jariani et al., 2020; Nadal-Ribelles et al., 2019; Jackson et al., 2020; Brettner et al., 2022; N’Guessan et al., 2023), and use it to identify eQTLs in different genetic backgrounds, study interactions between eQTL effects and stages of the cell cycle, search for allele-specific effects on gene expression noise, and uncover a connection between a common variant in the gene GPA1, gene expression, progression through the cell cycle, and mating efficiency.

Figure 1 with 8 supplements see all

Download asset Open asset

One-pot eQTL mapping is feasible in yeast.

(A) One-pot eQTL mapping workflow. A large population of hybrid diploid cells is sporulated, and MATa haploid yeast progeny cells (segregants) are isolated by fluorescence-activated cell sorting. Cells are captured and processed with the 10× Chromium device. The resulting barcoded library of single-cell transcriptomes is sequenced by Illumina short-read sequencing. Unique molecular identifier (UMI) counts are tallied for each transcript in each segregant. The number of supporting molecules for each parental allele is identified at every transcribed sequence position that differs between the parental strains, and a hidden Markov model is used to infer the genotype of each segregant. In the cartoon example of an eQTL shown on the top right, segregants with the C allele have higher expression of the gene than those with the A allele. (B) Representative Uniform Manifold Approximation and Projection for Dimension Reduction (UMAP) plot of cells colored by their assigned cell-cycle stage. (C) Scatter plot of local eQTL effects from the one-pot experiment in the cross between BY and RM (x-axis) against local eQTL effects based on expression measurements from bulk RNA-seq in the same cross (y-axis) (Albert et al., 2018). Green dots denote one-pot eQTL effects that were significant at a false-discovery rate (FDR) of 0.05; yellow dots denote those that were not. The x- and y-axis were truncated at –1 and 1 for ease of visualization, which left out 67 of 4044 data points.

Results

scRNA-seq enables simultaneous expression profiling, cell-cycle stage determination, and genotyping in a segregating yeast population

eQTL mapping requires tracking the inheritance of genetic variants and measuring gene expression in the same individuals. scRNA-seq captures the transcriptomes of individual cells, and genotypes of expressed single-nucleotide polymorphisms (SNPs) in transcribed sequences can be used to track inheritance in these same cells. We previously showed that this approach enables single-cell eQTL mapping in C. elegans (Ben-David et al., 2021). To test the feasibility of the approach in yeast, we pooled 393 previously genotyped haploid segregants from a cross between a lab strain (BY) and a wine strain (RM) (Albert et al., 2018; Bloom et al., 2013; Figure 1—figure supplement 1; Supplementary file 1, tables S1–S3) and used scRNA-seq to obtain the transcriptomes of 7124 cells (Methods, Supplementary file 1, table S4). We captured a median of 1514 unique RNA molecules (unique molecular identifiers; henceforth UMIs) and a median of 1091 expressed SNPs per cell (Supplementary file 1, table S4).

The expression of hundreds of yeast genes varies during progression through the stages of the cell cycle (Spellman et al., 1998). We classified individual haploid yeast cells into five different cell-cycle stages (M/G1, G1, G1/S, S, and G2/M) via unsupervised clustering of the expression of 787 cell-cycle-regulated genes (Spellman et al., 1998) in combination with 22 cell-cycle-informative marker genes (Figure 1B, Figure 1—figure supplements 2–5). Using this classification approach, we found that expression of 2139 genes displayed significant variation by cell-cycle stage (likelihood ratio test, false-discovery rate [FDR] <0.05; Supplementary file 1, table S5). To account for the observed widespread effects of the cell cycle on gene expression, we incorporated the cell-cycle stage into subsequent eQTL analyses, unless otherwise stated.

We used a hidden Markov model (HMM) to reconstruct the patterns of inheritance of parental alleles in each cell based on the observed genotypes at expressed SNPs. This allowed us to match each cell to one of the 393 segregants. We observed a median of 17 cells per segregant, with 277 of the segregants sampled more than ten times (Figure 1—figure supplement 6). The genotypes measured from scRNA-seq data were in high agreement with those obtained from whole-genome sequencing of the same strains (median genotype agreement 92.5%). The agreement was higher in cells with more UMIs (Figure 1—figure supplement 7), and we leveraged higher yields of UMIs per cell in subsequent experiments to ensure better genotyping accuracy.

We used the two sets of genotypes to map local eQTLs—those that influence the expression of nearby genes, most commonly in cis. We modeled the genetic effects of the closest marker to each transcript on single-cell gene expression with a count-based model that did not include a cell-cycle term. We mapped 770 local eQTLs at an FDR of 5% with the HMM-based genotypes, and 697 with the matched genotypes obtained from whole-genome sequencing of the segregants; 611 eQTLs were detected in both analyses (Supplementary file 1, table S6). We further compared the local eQTL effects for all 4901 tested transcripts, regardless of statistical significance, and found that they were highly correlated between the two sets of genotypes (Spearman’s ⍴ = 0.93, p < 10^–15; Figure 1—figure supplement 8). A single-cell eQTL study on a different set of previously genotyped segregants from the same cross reached a similar conclusion (N’Guessan et al., 2023), providing further evidence that genotypes obtained from scRNA-seq data at transcribed SNPs are of sufficient quality for eQTL mapping.

One-pot eQTL mapping in de novo yeast segregants

One-pot eQTL mapping is an attractive experimental design compared to bulk RNA sequencing and genotyping because it lowers cost, eliminates individual sample preparation, and reduces other sources of technical variation. To compare one-pot eQTL mapping with the traditional bulk design, we generated segregants de novo from a cross between BY and RM (Albert et al., 2018). We used scRNA-seq to measure the expression of 5435 transcripts in 27,744 single cells (Supplementary file 1, table S4). We mapped 1031 local eQTLs at an FDR of 5%. We compared these results to those from bulk RNA-seq and genotyping in the same cross (Albert et al., 2018) and found that 717 (69.5%) of the 1031 local eQTLs were also detected as statistically significant in that study, with an additional 108 local eQTLs showing effects in the same direction (Figures 1C and 2A; Supplementary file 2, table S1). Thus, 825 (80%) of the local eQTLs detected with the one-pot approach were supported by the bulk results, despite differences in growth conditions and experimental procedures between the two studies.

Figure 2

Download asset Open asset

Single-cell eQTL map recapitulates bulk *trans*-eQTL hotspots and identifies new hotspots.

(A) Map of local and distant eQTLs. Each point denotes an eQTL, with the genomic position of the peak marker on the x-axis and the genomic location of the gene with the expression difference on the y-axis. The high density of points on the diagonal line with a slope of one indicates that many genes have local eQTLs. The dense vertical bands correspond to *trans*-eQTL hotspots. (B) Histogram showing the number of distant eQTLs in 50 kb windows top: one-pot eQTL map; bottom: bulk eQTL map (Albert et al., 2018). Red lines show statistical eQTL enrichment thresholds for a window to be designated a hotspot. Text labels highlight known and putative causal genes underlying hotspots, as well as loci that meet hotspot criteria only in the current study.

We next broadened our analysis to the trans-acting (distant) eQTLs, here defined as those that influence the expression of genes on a different chromosome. We mapped 1562 distant eQTLs at an FDR of 5% (Figure 2A; Supplementary file 3, table S1). As in previous studies (Albert et al., 2018; Brem et al., 2002), distant eQTLs were not uniformly distributed throughout the genome, but rather clustered at a number of hotspot loci that influence the expression of many genes. We identified 12 distant eQTL hotspots in the one-pot eQTL experiment (Supplementary file 3, table S1). When we applied the same criteria for defining a hotspot, we identified 21 hotspots in the bulk eQTL experiment in the same cross (Figure 2B). Five regions met the hotspot criteria in both studies, including the well-characterized hotspots driven by variants in the genes MKT1 (Zhu et al., 2008), GPA1 (Yvert et al., 2003), IRA2 (Smith and Kruglyak, 2008), and HAP1 (Brem et al., 2002). One hotspot on chromosome XIV in the bulk experiment was not observed here because it is caused by a de novo variant in the gene KRE33 that arose in the RM parent used in the construction of the bulk eQTL mapping panel (Albert et al., 2018; Jerison et al., 2017). The other hotspots from the bulk experiment generally affected the expression of fewer genes, and the fact that they did not meet hotspot criteria here can be explained by a combination of statistical power and different experimental conditions.

To learn more about the seven regions that met hotspot criteria only in the single-cell experiment but not in the bulk experiment, we performed a functional enrichment analysis of the genes they influence (Figure 2B; Supplementary file 3, table S1). The hotspot on chromosome X at position 323,158 changed the expression of 36 genes that were enriched for gene ontology (GO) terms related to zinc ion transmembrane transporter activity and transition metal ion transmembrane transporter activity. The hotspot region contains the gene ZAP1, which encodes a zinc-regulated transcription factor (Zhao and Eide, 1997). This gene contains nine missense variants between BY and RM, and we predict that ZAP1 is the causal gene underlying this hotspot (see also Weith et al., 2023). The hotspot on chromosome XIII at position 24,326 changed the expression of 26 genes that were enriched for GO terms related to acid phosphatase activity. The hotspot region contains the gene PHO84, which encodes an inorganic phosphate transporter (Bun-Ya et al., 1991). BY harbors a rare coding variant P259L in PHO84 (L allele frequency = 0.3%) (Peter et al., 2018) that has been shown to affect resistance to polychlorinated phenols (Perlstein et al., 2007) and is the likely causal variant for this hotspot. The other five hotspots were enriched for GO terms broadly related to growth. We grew the yeast segregants for scRNA-seq in a medium containing sheath fluid, a phosphate-buffered saline (PBS) solution with a pH of 7.4, whereas unbuffered minimal medium was used in the bulk eQTL study. Gene–environment interactions in gene expression are common in yeast (Smith and Kruglyak, 2008), especially for distant eQTLs, and subtle differences in the growth conditions between the two studies may explain why these new loci met the hotspot criteria only in the single-cell study.

We took advantage of the convenience of one-pot eQTL mapping and applied it to two additional yeast crosses, one between a clinical strain (YJM145) and a soil strain (YPS163) both isolated in the United States (44,784 cells, 5556 transcripts; Supplementary file 1, table S4), and another between a soil strain isolated in South Africa (CBS2888) and a clinical strain isolated in Italy (YJM981) (6595 cells, 4696 transcripts; Supplementary file 1,table S4). Hereafter, we refer to the BY × RM cross as cross A and the two new crosses as crosses B and C, respectively. We mapped a total of 1914 local eQTLs in the new crosses (1193 in cross B and 721 in cross C; Supplementary file 2, tables S2 and S3), as well as 1626 distant eQTLs (550 in cross B and 1126 in cross C; Figure 3A, B; Supplementary file 3, tables S2 and S3). These distant eQTLs clustered into 13 hotspots (6 in cross B and 7 in cross C; Figure 3C, D). Of the 25 hotspots detected in the three crosses, 14 (56%) were unique to a single cross (7/12 in cross A, 2/6 in cross B, and 5/7 in cross C). This observation is consistent with prior work suggesting that variants with widespread effects on gene expression are likely to be deleterious, and that purifying selection should reduce their allele frequencies, making them more likely to be strain specific (Ronald and Akey, 2007). We used functional annotations to identify candidate genes for two of the new hotspots: GPA1 for the hotspot on chromosome VIII in cross B (Supplementary file 4—Cross B chrVIII:46887–140660) and CYR1 for the hotspot on chromosome X in cross C (Supplementary file 4—Cross C chrX:397734–497167); the biological effects of these hotspots are discussed below.

Figure 3

Download asset Open asset

Single-cell eQTL maps in two new crosses.

(A) eQTL map for the YJM145 × YPS163 cross (cross B). (B) eQTL map for the CBS2888 × YJM981 cross (cross C). (C) Histogram of distal eQTLs showing hotspots in cross B. (D) Histogram of distal eQTLs showing hotspots in cross C. The y-axis has been truncated to have a maximum value 100 for ease of visualization purposes. The hotspot on chromosome X near the gene *CYR1* influences the expression of 175 genes, and the hotspot on chromosome XI influences the expression of 386 genes.

Distant eQTLs effects are more dependent on cell-cycle stage than local eQTLs effects

We asked whether the effects of eQTLs varied across the different stages of the cell cycle. Of the 2945 total local eQTLs detected in the three crosses, only 116 (4%) showed significant interactions between the eQTL effect and the cell-cycle stage at an FDR of 5%. In contrast, 790 (24.4%) of 3238 distant eQTLs showed significant interactions with the cell-cycle stage (OR = 7.8, Fisher’s exact test, p < 10^–15), which suggests that the effects of distant eQTLs depend more on the state of a cell than those of local eQTLs (Supplementary files 2 and 3). This observation is consistent with prior work which showed that the effects of distant eQTLs are often dependent on the environment (Smith and Kruglyak, 2008), tissue (Aguet et al., 2020; Battle et al., 2017), and cell type (Ben-David et al., 2021), while those of local eQTLs tend to be less affected by these factors, perhaps because their effects on expression are more direct. Our results extend this notion beyond external environments, tissues and cell types to internal cellular states in a single-cell type.

Identification of hundreds of genetic effects on expression noise

An outstanding question in genetics is whether, and to what extent, genetic variation influences noise in gene expression—that is, do some genetic variants alter the variability in the expression level of specific genes, separately from their effects on the average expression levels? Measurement of expression in single cells with different genotypes is uniquely suited to exploring this question, but separating the effects on noise from those on average expression is not trivial, and previously identified genetic effects on expression noise in scRNA-seq data could be explained by their effects on average expression (Sarkar et al., 2019). In mapping panels, apparent allelic effects on intrinsic expression variability can instead reflect extrinsic sources of expression variability that differ between cells, such as cell-cycle stage and genetic differences in trans-acting factors. To overcome these issues of interpretation, we investigated the genetic contribution to intrinsic noise in gene expression in scRNA-seq data we generated for F1 diploid hybrids of the parental strains. The F1 diploid yeast cells are isogenic and share all trans-acting factors, allowing us to exclude extrinsic genetic sources of expression variability and focus on allele-specific contributions to gene expression noise.

We obtained a total of 13,973 single-cell transcriptomes from F1 diploids used to generate the segregants for the three crosses (5890 for cross A, 2864 for cross B, and 5219 for cross C; Supplementary file 1, table S4). We classified each cell into one of four cell-cycle stages relevant for diploids (M/G1, G1/S, S, and G2/M). We found 3406 genes with allele-specific effects on average expression levels (668 for cross A, 996 for cross B, and 1742 for cross C; Supplementary file 5). These allele-specific effects were well correlated with local eQTL effects from the eQTL mapping experiments described above (Figure 4—figure supplement 1). We observed 160 genes with significant interactions between allele-specific expression and cell-cycle stage (Supplementary file 5).

We next looked for allele-specific effects on gene expression noise. We used an approach that tests for significant differences in gene expression noise between the two alleles in the F1 diploid hybrids after accounting for average differences in gene expression due to genotype, cell-cycle stage, and their interactions (Figure 4A, B, Figure 4—figure supplement 2; Methods). Using this approach, we found a total of 874 genes with allele-specific effects on expression noise at an FDR of 5%, independent of any effects on average expression (Figure 4C; Supplementary file 6).

Figure 4 with 3 supplements see all

Download asset Open asset

Genetic effects on expression noise.

(A) Cumulative distribution of simulated allele-specific counts for two alleles with different average expression but the same expression noise. (B) Cumulative distribution of simulated allele-specific counts for two alleles with different expression noise but the same average expression. These simulated distributions are shown to illustrate allele-specific effects on average expression and on expression noise, respectively. (C) Log–log scatter plot of change in expression noise between alleles (x-axis) against change in average expression between alleles (y-axis); points correspond to all 1487 genes with significant allele-specific effects on expression noise and/or average expression. Black line shows the predicted change in noise given a change in expression, with the 95% confidence interval for the trend shown in gray. The 377 genes with allele-specific effects on expression noise that cannot be accounted for by the overall trend are shown in red. The x and y axes have been truncated at –5 and 5 for ease of visualization purposes, which left out 30 of 1487 data points.

An additional consideration for these analyses is that prior work has revealed an empirical negative correlation between gene expression noise and average gene expression, even when noise is estimated while accounting for average expression (Antolović et al., 2017; Love et al., 2014). We observed this global trend in our data—across all genes, noise was negatively correlated with expression level (Spearman’s ⍴ = −0.42, p < 10^–15; Figure 4—figure supplement 3). To test whether the observed allele-specific effects on expression noise could arise from this trend, we asked whether the confidence interval (CI) of each significant allele-specific effect on noise overlapped the CI of the global trend line (Methods). Of the 874 genes with allele-specific effects on noise, these CIs did not overlap for 377, suggesting that these allele-specific effects on noise cannot be explained by the empirical global relationship between expression noise and average expression (Figure 4C; Supplementary file 6).

An illustrative example of an allele-specific effect on expression noise was found in crosses A and C for the gene HSP12, which encodes an intrinsically unstructured protein that improves membrane stability (Supplementary file 6; Welker et al., 2010). HSP12 is a member of the general stress response pathway regulated by Msn2/4 (Kuang et al., 2017) and helps yeast cells survive high-temperature shocks (Welker et al., 2010). In cross A, the RM allele did not significantly change the expression of HSP12 compared to the BY allele, but it did significantly increase the noise, whereas in cross C, the YJM981 allele decreased the expression of HSP12 compared to the CBS2888 allele and increased the noise. Previous work found that HSP12 has high extrinsic expression noise relative to other genes (Stewart-Ornstein et al., 2012), and it was proposed that the high noise arises from variability in the activity of the Msn2/4 stress response pathway and subsequent activation of Msn2/4 targets such as HSP12 (Gasch et al., 2017). Our experiments in F1 hybrids control for sources of extrinsic noise, such as Msn2/4 activity, and our results suggest that the RM and YJM981 alleles of HSP12 are intrinsically more variable than the BY and CBS2888 alleles, and that genetic differences acting in cis are responsible. We hypothesize that the RM and YJM981 allele of HSP12 may provide a fitness advantage during periods of extreme stress via a bet-hedging strategy in which noisy expression of HSP12 creates a subpopulation of cells with very high HSP12 expression that can better survive environmental shocks.

Natural genetic variants affect cell-cycle occupancy

Because the single-cell expression data allowed us to assign each genotyped cell to a stage of the cell cycle, we next moved beyond gene expression and searched for genetic effects on cell-cycle progression. Specifically, we looked for loci at which one allele is overrepresented in cells assigned to a particular cell-cycle stage. Because cell-cycle occupancy represents the proportion of cells assigned to a given stage, changes in the proportion of cells in each stage are correlated, potentially leading to QTLs with effects on occupancy of multiple cell-cycle stages. We found a total of 20 unique cell-cycle occupancy QTLs in the three crosses (4 for cross A, 10 for cross B, and 6 for cross C; Figure 5A; Supplementary file 7). One of the QTLs identified in cross A contained the gene MKT1, variation in which is known to affect dozens of growth traits and thousands of molecular traits, including gene expression (Figure 5B; Zhu et al., 2008). Segregants inheriting the RM allele of MKT1 are overrepresented in the G1 stage of the cell cycle and underrepresented in the G2/M stage. This observation suggests that some of the previously described cellular impacts of MKT1 variation may arise as a consequence of its effect on progression of yeast cells through the cell cycle.

Figure 5 with 1 supplement see all

Download asset Open asset

Natural genetic variants affect cell-cycle occupancy.

(A) Cell-cycle occupancy QTL map for three different crosses. LOD score for linkage with cell-cycle occupancy (y-axis) is plotted against the genomic location of genetic markers (x-axis). Colored lines show results for different cell-cycle stages as denoted in the legend. Horizontal line corresponds to a family-wise error rate (FWER) threshold of 0.05. Text labels highlight genes with QTL effects shown in panels B–D. Cell-cycle occupancy mapping was not performed on chromosome III. (B) Variation in *MKT1* increases G1 occupancy and decreases G2/M occupancy in the BY × RM cross. (C) Variation in *GPA1* decreases G1 occupancy and increases S and G2/M occupancy in the YJM145 × YPS163 cross. (D) Variation in *CYR1* decreases G1 occupancy and increases S and G2/M occupancy in the CBS2888 × YJM981 cross. Error bars in B–D represent 95% confidence intervals.

We identified a cell-cycle occupancy QTL on chromosome X in cross C whose location coincided with a distant eQTL hotspot (Figures 3D and 5D). This hotspot affected the expression of 224 genes that were enriched for GO terms related to oxidative phosphorylation and the citric acid cycle. We combined the eQTL mapping results with growth QTL (Bloom et al., 2019a) from the same cross and predicted that the likely causal gene underlying this hotspot is CYR1 (Supplementary file 4—Cross C chrX:397734–497167). CYR1 encodes adenylate cyclase, an enzyme which catalyzes the reaction that produces cyclic AMP (Matsumoto et al., 1982). Segregants which carry the CBS2888 allele of CYR1 more frequently occupy the G1 phase of the cell cycle and show improved growth in eight stressful conditions (Figure 5D). The CBS2888 allele of CYR1 contains multiple variants with predicted large deleterious effects on the gene, and these variants may act individually or together to compromise the function of CYR1, with the result that cells with this natural allele may mimic the G1 arrest phenotype observed in temperature-sensitive mutants of CYR1 (Matsumoto et al., 1983). Mutations in CYR1 are known to alter stress tolerance in yeast (Versele et al., 2004; Vanhalewyn et al., 1999; Vianna et al., 2010), providing additional support for our hypothesis that CYR1 is the causal gene underlying this hotspot.

The W82R variant of GPA1 alters gene expression and cell-cycle occupancy

We mapped a cell-cycle occupancy QTL in cross B to a region on chromosome VIII that contains the gene GPA1 (Figure 5C). GPA1 encodes the GTP-binding alpha subunit of a heterotrimeric G protein that mediates the response to mating pheromone (Nakafuku et al., 1987). Segregants carrying the YJM145 allele of GPA1 more frequently occupied G1, the cell-cycle stage during which the mating pathway is active (Figure 5A; Lang et al., 2009). This locus is also a distant eQTL hotspot that influenced the expression of 51 genes (Figure 2C). These genes are enriched for GO terms related to sexual reproduction and cellular response to pheromone (Supplementary file 4—Cross B chrVIII:46887–140660). Variants in the coding sequence of GPA1 are known to alter the expression of genes involved in mating (Yvert et al., 2003). The YJM145 allele of GPA1 contains a variant that changes a tryptophan to an arginine at position 82 of the Gpa1 protein. An evolutionary analysis revealed that this residue has been conserved as tryptophan for ~400 million years in the budding yeasts (Saccharomycotina) (Shen et al., 2018), and is commonly found as a aromatic amino acid (phenylalanine, tyrosine, or tryptophan) across the tree of life (Supplementary file 8). The conservation of this tryptophan is reflected in the prediction that the 82R allele is highly deleterious to the function of Gpa1 (Provean score of –13.915) (Choi and Chan, 2015). We thus hypothesized that this variant in GPA1 is responsible for the observed effects of the chromosome VIII locus on both gene expression and cell-cycle occupancy.

To test this hypothesis, we used CRISPR–Cas9 to engineer each allele of the W82R variant into a common genetic background (Supplementary file 1, table S1; Sadhu et al., 2018). We performed scRNA-seq on 26,859 cells from these isogenic strains that differed only in whether they carried the 82R (N = 11,695) or the 82W allele (N = 15,164) of GPA1. We observed that for 36 of the 50 genes affected by the hotspot and detected in our single-cell validation dataset, the sign of the expression difference was consistent between the eQTL effect and the W82R validation experiment (binomial test, p = 0.0026; Supplementary file 9). Importantly, the gene expression difference in the W82R experiment was statistically significant and concordant with the eQTL effect for all six mating-related genes affected by this hotspot (AGA1, AGA2, MFA1, STE2, FUS3, and PRM5), showing that the 82R allele isolated from other segregating genetic variation increases the expression of genes involved in the mating response. Consistent with the QTL effect, cells with the 82R allele were overrepresented in G1 (46.2% in G1 vs. 42.8% in other stages, logistic regression, p < 10^–15; Figure 5—figure supplement 1). We conclude that the W82R variant is responsible for the observed effects of this chromosome VIII QTL on gene expression and cell-cycle occupancy.

The 82R allele of GPA1 increases mating efficiency at the cost of growth rate

One possible consequence of increasing the proportion of cells in G1 is slowing progression through the cell cycle, with a corresponding decrease in the cell doubling rate. Previous work has shown that strains which carry a different variant in GPA1 (a G to T substitution at position 1406 in the coding sequence of the gene, which results in a serine to isoleucine substitution at position 469 of the protein) have decreased growth rates (Lang et al., 2009). We measured growth rates of the engineered strains and found that cells with the 82R allele grew slower than those with the 82W allele (relative growth rate = 0.993, T = −2.592, p = 0.0268), but that this effect was smaller than that observed for the S469I variant (relative growth rate of the 469I allele = 0.984, T = −5.291, p < 0.001; Figure 6A). The 469I allele is known to improve the efficiency of mating, a difference we successfully replicated (relative 469I mating efficiency = 110%, T = 9.73, p < 0.001). We observed that the 82R allele also increased mating efficiency (relative mating efficiency = 107%, T = 6.75, p < 0.001), but to a lesser extent than the 469I allele (82R mating efficiency compared to 469I = 97.2%, T = −2.98, p < 0.001; Figure 6B, Figure 6—figure supplement 1). These results show that natural variants in GPA1 increase mating efficiency at the cost of slower growth. Both effects may be explained by the impact of these variants on the mating pathway—enhanced activity of the pathway facilitates mating in the presence of partners, while inappropriate pathway activation in the absence of partners slows down the G1 phase of the cell cycle, as we have shown for the 82R allele, thereby decreasing growth rate.

Figure 6 with 3 supplements see all

Download asset Open asset

The 82R allele of *GPA1* increases mating efficiency at the cost of growth rate and is associated with increased outbreeding in natural populations.

(A) Boxplots show growth of allele replacement strains grown in glucose. Points represent replicate measurements of the doublings per hour for each strain. Tukey’s HSD adjusted p-values of pairwise comparisons of allele replacement strains are shown. (B) Boxplots show mating efficiency of allele replacement strains; details as in A. (C) Genome-wide neighbor-joining tree of 1011 sequenced yeast isolates. Strains in which only the 82R allele is present are denoted in blue; strains with support for both 82R and 82W alleles are denoted in red; and strains in which only 82W allele is present are denoted in gray. We observed that the 82R allele is enriched in mosaic strains (allele frequency = 45.3%, permutation test p = 0.007). Other clades mentioned in the text are labeled on the tree.

The W82R allele of GPA1 is common in the yeast population and is associated with increased outbreeding in natural populations

We searched for the 82R and 469I alleles in a worldwide collection of 1011 S. cerevisiae isolates (Peter et al., 2018) and found that the 469I allele is rare in the population (1.9%), whereas the 82R allele is common (20.5%) (Figure 6C). The 82R allele is fixed in a clade of strains isolated from Brazilian bioethanol (82R allele frequency = 100%) and is found at high frequency (58%) in a clade of strains isolated from Asian fermentation products such as rice wine (Figure 6—figure supplement 2). Peter et al. identified four groups of mosaic strains, which are characterized by admixture of two or more different lineages through outbreeding, and we observed that the 82R allele is enriched in these mosaic strains (allele frequency = 45.3%, permutation test p = 0.007). Strains derived from outbreeding events between genetically distinct parents are expected to show higher rates of heterozygosity than strains resulting from inbreeding or clonal propagation. We compared homozygous (<5% heterozygous sites) and heterozygous (>5% heterozygous sites) strains, as defined by Peter et al., and found that the 82R allele is enriched in heterozygous strains (OR = 3.3, Fisher’s exact test, p < 10^–7) and associated with higher rates of heterozygosity (Wilcoxon rank sum test, p < 10^–15; Figure 6—figure supplement 3). We have shown that the 82R allele increases mating efficiency in the lab, and these observations suggest that this increased mating efficiency may translate into higher outcrossing rates in nature.

Discussion

We used a one-pot single-cell eQTL mapping design, in which tens of thousands of cells from a segregating population are subjected to scRNA-seq, to map thousands of eQTLs in three different yeast crosses. We identified both local and distant eQTLs and showed that distant eQTLs in all three crosses cluster at hotspot loci that affect the expression of many genes, recapitulating and extending previous observations made with bulk eQTL mapping in the widely studied BY–RM cross. Notably, most of these hotspots are not shared between crosses, which suggests that they are caused by alleles unique to one of the six parent strains. This observation is consistent with the idea that alleles that alter the expression of many genes are likely to be selectively disfavored and therefore present at lower frequencies in the yeast population (Ronald and Akey, 2007; Bloom et al., 2019a).

Prior research has leveraged scRNA-seq data to detect genetic loci that alter gene expression noise, but their effects could not be separated from those on average expression levels and may reflect other sources of extrinsic cell-to-cell variability (Sarkar et al., 2019). To account for extrinsic factors, we obtained thousands of transcriptomes from single cells of hybrid diploid F1 yeast and tested for allele-specific differences in intrinsic gene expression noise. We employed an approach that accounts for average changes in gene expression and identified 874 genes with an allele-specific effect on gene expression noise. For 377 of these genes, the effects on noise could not be accounted for by the empirically observed negative correlation across genes between estimated gene expression noise and average gene expression (Love et al., 2014). We observed allele-specific effects on HSP12 expression noise in two separate crosses. HSP12 plays a role in protection against high-temperature shocks, and the high-noise alleles may provide a fitness advantage during high-temperature stress by creating a subpopulation of cells with very high HSP12 expression that can survive under these conditions. This observation adds to previous reports showing that noise mediated by promoter variants can provide a fitness advantage in times of environmental stress (Liu et al., 2015) and may constrain variation in promoter evolution (Metzger et al., 2015).

Single-cell RNA-seq data allowed us to assign each cell to a cell-cycle stage and explore genetic effects on expression during different stages of the cell cycle. We detected hundreds of eQTLs whose effects differed across cell-cycle stages. Distant eQTLs were more likely than local eQTLs to be cell cycle dependent, perhaps because the effects of distant eQTLs are more indirect and mediated by cellular regulatory networks that are affected by the cell cycle (Albert et al., 2018). Previous work has shown that effects of distant eQTLs are more sensitive than those of local eQTLs to tissue type (Battle et al., 2017) and external environment (Smith and Kruglyak, 2008), and our results extend these findings to show that they are more sensitive to internal cellular states within a single-cell type.

We used the ability to classify genotyped cells by their cell-cycle stage to identify 20 loci that altered the occupancy of different cell-cycle stages, one of which overlapped an eQTL hotspot. We used fine mapping and allele replacement with CRISPR–Cas9 to show that a common variant (W82R) in the gene GPA1 is responsible for the effects of this locus on cell-cycle occupancy and gene expression. We further showed that the 82R allele increases yeast mating efficiency at the cost of slower growth. Natural yeast isolates vary in their propensity to mate or enter the cell cycle upon germination (McClure et al., 2018), leading us to ask whether the 82R allele alters mating efficiency outside the lab. We searched for this allele in a collection of 1011 sequenced yeast isolates (Peter et al., 2018) and found that it is common (20.5%) and occurs more frequently in isolates that show evidence of recent outcrossing, suggesting that the observed increase in mating efficiency in the lab translates into more frequent mating in nature. Outcrossing rate has a major impact on the genetic structure of a population and its response to natural selection (Hartfield et al., 2017), and our results suggest that common variants can alter this key evolutionary parameter.

Studies of genetic effects on gene expression provide a molecular lens into the genetic basis of complex traits. One-pot single-cell eQTL mapping makes such studies cheaper, more efficient, and more flexible. This approach will power broader explorations of how genetic variants influence gene expression in different genetic backgrounds and under different experimental conditions. It also enables integration of information across multiple levels, as shown here for the case of gene expression, cell-cycle occupancy, and mating efficiency. The results of this study have the potential to inform the design, execution, and analysis of other one-pot studies of the effects of genetic variation on gene expression, such as human ‘cell villages’ (Wells et al., 2023; Neavin et al., 2023).

Materials and methods

Unless otherwise specified, computational analyses were performed in the R (v4.4.0) programming language (R Development Core Team, 2022) and visualizations were created using the ggplot2 package (v3.5.1) (Wickham, 2009).

Strains, plasmids, and primers used in this study are listed in Supplementary file 1, tables S1-S3.

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information
Strain, strain background (Saccharomyces cerevisiae)	YLK3301	Bloom et al., 2019a		YPS163 MatA ho∆::HphMX flo8∆::NatMX (YLK2438) x YJM145 MatAlpha ho∆::HphMX flo8∆::NatMX (YLK2436)
Strain, strain background (Saccharomyces cerevisiae)	YLK3004	Bloom et al., 2019a		YJM981 MatAlpha ho∆::HphMX (yST191) x CBS2888 MatA ho∆::KanMX (Box A1 C3)
Strain, strain background (Saccharomyces cerevisiae)	YLK3051	Bloom et al., 2019a		BY MatA (YLK1879) x RM MatAlpha AMN1-BY ho∆::HphMX flo8∆::NatMX (YLK1950)
Strain, strain background (Saccharomyces cerevisiae)	YLK1993	Albert et al., 2018		BY MatA (YLK1879) x RM MatAlpha AMN1-BY ho∆::HphMX flo8∆::NatMX (YLK1950)
Strain, strain background (Saccharomyces cerevisiae)	YLK3221	Sadhu et al., 2018		Mata met15Δ his3Δ1 leu2Δ0 ura3Δ0 nej1Δ::KanMX Gpa1-82W,469S [p415 GalL-Cas9-Cyc1t]
Strain, strain background (Saccharomyces cerevisiae)	YLK3302	This paper		Mata chrVII:113512_C chrVIII:113496_C Gpa1-469S [p415 GalL-Cas9-Cyc1t]
Strain, strain background (Saccharomyces cerevisiae)	YLK3303	This paper		Mata chrVII:113512_C chrVIII:113496_C Gpa1-469S [p415 GalL-Cas9-Cyc1t]
Strain, strain background (Saccharomyces cerevisiae)	YLK3304	This paper		Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W [p415 GalL-Cas9-Cyc1t]
Strain, strain background (Saccharomyces cerevisiae)	YLK3305	This paper		Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W [p415 GalL-Cas9-Cyc1t]
Strain, strain background (Saccharomyces cerevisiae)	YLK3306	This paper		Mata chrVII:113512_C chrVIII:113496_C Gpa1-469S
Strain, strain background (Saccharomyces cerevisiae)	YLK3307	This paper		Mata chrVII:113512_C chrVIII:113496_C Gpa1-469S
Strain, strain background (Saccharomyces cerevisiae)	YLK3308	This paper		Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W
Strain, strain background (Saccharomyces cerevisiae)	YLK3309	This paper		Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W
Strain, strain background (Saccharomyces cerevisiae)	YLK3310	This paper		Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W
Strain, strain background (Saccharomyces cerevisiae)	YLK3311	This paper		Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W
Strain, strain background (Saccharomyces cerevisiae)	YLK3312	This paper		Mata met15Δ his3Δ1 leu2Δ0 ura3Δ0 nej1Δ::KanMX Gpa1-82W, Gpa1-469I
Strain, strain background (Saccharomyces cerevisiae)	YLK3313	This paper		Mata met15Δ his3Δ1 leu2Δ0 ura3Δ0 nej1Δ::KanMX Gpa1-82W Gpa1-469I
Strain, strain background (Saccharomyces cerevisiae)	YLK3314	This paper		Mata chrVII:113512_C chrVIII:113496_C Gpa1-469S [PLK127]
Strain, strain background (Saccharomyces cerevisiae)	YLK3315	This paper		Mata chrVII:113512_C chrVIII:113496_C Gpa1-469S [PLK128]
Strain, strain background (Saccharomyces cerevisiae)	YLK3316	This paper		Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W [PLK127]
Strain, strain background (Saccharomyces cerevisiae)	YLK3317	This paper		Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W [PLK128]
Strain, strain background (Saccharomyces cerevisiae)	YLK3318	This paper		Mata met15Δ his3Δ1 leu2Δ0 ura3Δ0 nej1Δ::KanMX Gpa1-82W, Gpa1-469S [PLK127]
Strain, strain background (Saccharomyces cerevisiae)	YLK3319	This paper		Mata met15Δ his3Δ1 leu2Δ0 ura3Δ0 nej1Δ::KanMX Gpa1-82W, Gpa1-469S [PLK128]
Recombinant DNA reagent	MF2 p41 neo (plasmid)	Treusch et al., 2015	RRID:Addgene_58564	Flourescent magic marker plasmid with KanMX resistant cassette
Recombinant DNA reagent	MF2 p41 nat (plasmid)	Treusch et al., 2015	RRID:Addgene_58546	Flourescent magic marker plasmid with NatMX resistant cassette
Recombinant DNA reagent	p415 GalL-Cas9-Cyc1t (plasmid)	DiCarlo et al., 2013	RRID:Addgene_43804	Gal inducible CAS9 with LEU cassette
Recombinant DNA reagent	SNR52p-gRNA(BstEII/SphI). CAN1.Y-SUP4t (plasmid)	DiCarlo et al., 2013	RRID:Addgene_98814	Guide RNA expression plasmid with URA resistance
Recombinant DNA reagent	plk88+GPA1 novel variant (plasmid)	This paper		Guide RNA and coupled repair template to change 82 W to 82 R in Gpa1
Recombinant DNA reagent	plk88+GPA1 reversion (plasmid)	This paper		Guide RNA and coupled repair template to change 82I to 82 S in Gpa1
Recombinant DNA reagent	HIS3 2 um with ruby2 (plasmid)	This paper		ConLS-pTef1-mRuby2-tEno1-ConR1-His3-2micron-AmpR
Recombinant DNA reagent	HIS3 2 um with mTurquoise (plasmid)	This paper		ConLS-pTef1-mTurquoise-tEno1-ConR1-His3-2micron-AmpR
Commercial assay or kit	Chromium Single Cell 3' v3	10 x Genomics	10 X:CG000201
Software, algorithm	HMM, eQTL mapping, and noise analysis code	This paper		Avaliable at https://github.com/joshsbloom/single_cell_eQTL, archived at: https://doi.org/10.5281/zenodo.14834926
Software, algorithm	3' UTR extension script for cell ranger	This paper		Available at https://gist.github.com/theboocock/aacf72277a572ee3fe589c430bfd496e
Software, algorithm	Figure creation code	This paper		Avaliable at https://github.com/theboocock/yeast_single_cell_post_mapping_analysis, archived at: https://doi.org/10.5281/zenodo.14834916

Share this article

Cite this article

One-pot eQTL mapping is feasible in yeast.

Single-cell eQTL map recapitulates bulk trans-eQTL hotspots and identifies new hotspots.

Single-cell eQTL maps in two new crosses.

Genetic effects on expression noise.

Natural genetic variants affect cell-cycle occupancy.

The 82R allele of GPA1 increases mating efficiency at the cost of growth rate and is associated with increased outbreeding in natural populations.

Author details

James Boocock

Contribution

Competing interests

Noah Alexander

Contribution

Competing interests

Leslie Alamo Tapia

Contribution

Competing interests

Laura Walter-McNeill

Contribution

Competing interests

Shivani Prashant Patel

Contribution

Competing interests

Chetan Munugala

Contribution

Competing interests

Joshua S Bloom

Contribution

For correspondence

Competing interests

Leonid Kruglyak

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism

Further reading