Fitness effects of altering gene expression noise in Saccharomyces cerevisiae

Abstract
eLife digest
Introduction
Results and discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

Gene expression noise is an evolvable property of biological systems that describes differences in expression among genetically identical cells in the same environment. Prior work has shown that expression noise is heritable and can be shaped by selection, but the impact of variation in expression noise on organismal fitness has proven difficult to measure. Here, we quantify the fitness effects of altering expression noise for the TDH3 gene in Saccharomyces cerevisiae. We show that increases in expression noise can be deleterious or beneficial depending on the difference between the average expression level of a genotype and the expression level maximizing fitness. We also show that a simple model relating single-cell expression levels to population growth produces patterns consistent with our empirical data. We use this model to explore a broad range of average expression levels and expression noise, providing additional insight into the fitness effects of variation in expression noise.

https://doi.org/10.7554/eLife.37272.001

eLife digest

Single-celled organisms that reproduce by dividing, like yeast, can create whole populations of genetically identical cells. However, some differences will exist among such cells, even when they have all experienced the same environment. These differences are known as “noise”. By definition, noise is not caused by differences in DNA sequence, but some DNA sequences are noisier than others (i.e. they cause more differences among cells). Because the amount of noise can be under genetic control, noise could evolve due to natural selection.

Scientists often study noise at the level of gene expression – in other words, how many RNA or protein molecules are produced from each gene within each cell. Prior work has suggested that this type of noise can affect how often individual cells divide in a population, which is a component of that population’s fitness. Yet directly measuring these effects has proven challenging. Different studies have in the past reached opposite conclusions about whether a change in gene expression noise would increase or decrease fitness.

One major reason for the lack of clear results is that most mutations that alter gene expression noise also alter the average level of expression of that gene. To find DNA sequences that produced the same average amount of protein but different levels of expression noise, Duveau et al. compared the effects of hundreds of mutations in the DNA sequence regulating the expression of a gene in baker’s yeast. Experiments focused on 43 DNA sequences then showed that increased expression noise could either speed up or slow down the growth of the population by affecting how long it took each cell to divide. More specifically, the effects of increasing expression noise depended on the average amount of protein produced among the cells in the population. If the average expression level was close to the optimum amount at which cells divided as fast as possible, increasing expression noise reduced the growth of the whole population. If, however, the average protein level caused cells to divide slower than their maximum rate, increasing expression noise resulted in faster growth of the population as a whole.

Duveau et al. explain their results as follows: more expression noise in a population that is already making the optimal amount of protein can reduce fitness because it increases the fraction of that population making a suboptimal amount of the protein. However, when the average expression level is not optimal, more expression noise would mean more cells producing an amount of protein that is closer to the optimum and thus having higher fitness.

These findings provide conceptual tools needed to understand how genetic variation affecting expression noise evolves. They could also help understand how expression noise might contribute to biological processes that depend upon cell division, such as diseases like cancer.

https://doi.org/10.7554/eLife.37272.002

Introduction

Gene expression is a dynamic process that results from a succession of stochastic biochemical events, including availability of transcription factors, binding of transcription factors to promoter sequences, recruitment of transcriptional machinery, transcriptional elongation, mRNA degradation, protein synthesis, and proteolysis. These events cause the expression level of a gene product to differ even among genetically identical cells grown in the same environment (Elowitz et al., 2002; Chong et al., 2015). This variability in gene expression is known as ‘expression noise’ and is under genetic control (Raser et al., 2004; Sanchez and Golding, 2013), with heritable variation causing differences in noise among genes (Newman et al., 2006) and genotypes (Murphy et al., 2007; Hornung et al., 2012; Fehrmann et al., 2013; Sharon et al., 2014; Liu et al., 2015).

Because gene expression noise is heritable and variable within populations, it can evolve in response to natural selection if it affects fitness. Indeed, prior studies have suggested that expression noise can be either beneficial or deleterious depending on the context (reviewed in Viney and Reece, 2013; Richard and Yvert, 2014; Liu et al., 2016). For example, Metzger et al. (2015) provides evidence that increased expression noise can be selected against in natural populations, presumably because elevated noise increases the probability that a given cell produces a suboptimal level of protein expression (Wang and Zhang, 2011; Duveau et al., 2017a). Consistent with this hypothesis, a negative correlation exists at the genomic scale between the expression noise of genes and their dosage sensitivity (Fraser et al., 2004; Batada and Hurst, 2007; Lehner, 2008; Keren et al., 2016). However, because the optimal level of gene expression can vary among environments, high gene expression noise has been suggested to be beneficial if it can produce individuals with phenotypes that are better adapted to a new environment than individuals produced with low gene expression noise. For instance, noise in gene expression can allow a small fraction of cells to survive when confronted with stressful environmental conditions (Blake et al., 2006; Fraser and Kaern, 2009; Ito et al., 2009; Levy et al., 2012; Viney and Reece, 2013; Liu et al., 2015; Wolf et al., 2015). Consistent with this idea, a genomic screen in yeast found that plasma-membrane transporters involved in cell-environment interactions displayed elevated expression noise in yeast (Zhang et al., 2009). Theoretical work also suggests the existence of cost-benefit tradeoffs that can make expression noise either beneficial or deleterious under different circumstances (Tănase-Nicola and ten Wolde, 2008).

Despite a growing body of evidence that selection has acted on expression noise for many genes, direct measurements of how changes in expression noise impact fitness remain scarce (Liu et al., 2016). A major reason for this scarcity is that most mutations that alter gene expression noise also alter average expression level (Newman et al., 2006; Hornung et al., 2012; Carey et al., 2013; Sharon et al., 2014), making it difficult to disentangle the fitness effects of changing expression noise and average expression level. Here, we directly estimate the effects of changing expression noise on fitness independently from changes in average expression level for the TDH3 gene of Saccharomyces cerevisiae.

TDH3 encodes an isozyme of the yeast glyceraldehyde-3-phosphate dehydrogenase (GAPDH) involved in glycolysis and gluconeogenesis (McAlister and Holland, 1985) as well as transcriptional silencing (Ringel et al., 2013), RNA-binding (Shen et al., 2014) and possibly antimicrobial defense (Branco et al., 2014). Variation in this gene’s promoter affecting expression noise has previously been shown to be a target of selection in natural populations (Metzger et al., 2015). To assess the impact of changes in expression noise on fitness at different expression levels, we generated mutant alleles of the TDH3 promoter that covered a broad range of average expression levels and expression noise. We find that increases in expression noise are detrimental when the average expression level of a genotype is close to the fitness optimum, but beneficial when the average expression level of a genotype is further from this optimum. This pattern was reproduced by a simple computational model that links expression in single cells to their doubling time to predict population fitness. We used this individual-based model to explore the fitness effects of a broader combination of average expression levels and expression noise than were explored empirically, showing that not only do the fitness effects of changing expression noise depend on the average expression level, but that the fitness effects of changing average expression level also depend upon the amount of expression noise.

Results and discussion

Generating variation in expression noise independent of average expression level

To disentangle the effects of changes in average expression level and expression noise on fitness, we examined a set of TDH3 promoter (P_TDH3) alleles with a broad range of activities. For each allele, we measured the average expression level and expression noise by cloning the allele upstream of a yellow fluorescent protein (YFP) coding sequence, integrating this reporter gene (P_TDH3-YFP) into the HO locus, and quantifying fluorescence in living cells using flow cytometry in six replicate populations per genotype (Figure 1A). The fluorescence value of each cell was transformed into an estimated mRNA level (Figure 1A) based on the relationship between fluorescence and YFP mRNA abundance (Figure 1B,C). The average expression level of a genotype was then calculated by averaging the median values from the six replicates (Figure 1A) and expressing this value as a percent change from the wild type allele. Expression noise was calculated for each replicate as the variance divided by the median expression among cells, a measure of noise strength similar to the Fano factor (Sanchez and Golding, 2013). The expression noise of each genotype was then calculated by averaging the noise strength from the six replicate populations, and this value was expressed as a percent change from the wild type allele. The main conclusions of this study are robust to the choice of noise metric, as shown in supplementary figures using three alternative metrics of noise.

Figure 1 with 1 supplement see all

Download asset Open asset

A collection of *TDH3* promoter alleles with incompletely correlated effects on average expression level and expression noise.

(A) Overview of experimental design used to quantify expression. The transcriptional activity of 171 different variants of the *TDH3* promoter (*P_TDH3*) inserted upstream of the *YFP* coding sequence was quantified using flow cytometry. After growth of six independent samples in rich medium (YPD) for each promoter variant, fluorescence intensity relative to cell size (forward scatter) was measured for ~10,000 individual cells and transformed into *YFP* mRNA estimates using the function shown in (B), allowing characterization of both the median and the standard deviation of expression of the reporter gene. (B) Non-linear relationship between *YFP* mRNA level and fluorescence intensity divided by cell size measured on a BD Accuri C6 flow cytometer. (C) Linear relationship between the logarithm of *YFP* mRNA level and the logarithm of fluorescence intensity divided by cell size. (**B–C**) *YFP* mRNA level was quantified by pyrosequencing and fluorescence intensity by flow cytometry in three biological replicates of eight strains expressing *YFP* under different variants of *P_TDH3*. Fluorescence intensity was normalized by cell size as described in the Materials and methods section. The red line is the best fit of a function of shape $\log (y) = a \times \log (x) + b$ to the data, with $a = 10.469$ and $b = - 9.586$ . The blue dot represents a strain with two copies of the wild type *P_TDH3-YFP* reporter. Data are available in Figure 1 – source data 1. (D) Median expression level and expression noise (noise strength: variance divided by median expression) for 43 *P_TDH3* alleles. These alleles were chosen to cover a broad range of median expression level and expression noise with an incomplete correlation between these two parameters. Colors represent different types of promoter mutations. Data are available in Source data 1. (**B–D**) Dotted lines show the activity of the wild type *TDH3* promoter. Error bars are 95% confidence intervals calculated from at least four replicates for each genotype and are only visible when larger than dots representing data.

https://doi.org/10.7554/eLife.37272.003

Figure 1—source data 1 Parallel measurements of fluorescence levels by flow cytometry and of YFP mRNA levels by pyrosequencing. Pyrosequencing data were analyzed with the R script provided in Supplementary file 3. Data used to make Figure 1B–C.: https://doi.org/10.7554/eLife.37272.006
Download elife-37272-fig1-data1-v2.xls

Effects of 236 point mutations in the TDH3 promoter, including mutations in RAP1 and GCR1 transcription factor binding sites (TFBS), have previously been described that cause a wide range of average expression levels and expression noise values (Metzger et al., 2015). But average expression level and expression noise strongly co-vary among these alleles (Metzger et al., 2015), making them insufficient for separating the effects of changes in average expression level and expression noise on fitness. We therefore sought to construct additional promoter alleles that showed a different relationship between average expression level and expression noise. First, we inserted a recognition motif for the GCN4 transcription factor at ten different positions in the TDH3 promoter because this TFBS was previously found to affect the relationship between expression level and expression noise (Sharon et al., 2014). However, the insertion of GCN4 binding sites into P_TDH3 did not show the expected departure from the relationship between expression level and expression noise observed for mutations in GCR1 and RAP1 TFBS (Figure 1—figure supplement 1). We next mutated the P_TDH3 TATA box because previous studies showed that TATA box mutations confer lower expression noise for a given expression level when compared to other types of promoter alterations (Blake et al., 2006; Mogno et al., 2010; Hornung et al., 2012). We generated 112 alleles of the TDH3 promoter that had between one and five random mutations in the TATA box sequence, which caused the expected lower levels of expression noise than TFBS mutant alleles with similar average expression levels (Figure 1D). We then combined mutations in the TATA box, GCR1 TFBS and/or RAP1 TFBS to further increase the range of expression phenotypes. Finally, we constructed alleles containing two tandem copies of the P_TDH3-YFP reporter gene with or without mutations in the P_TDH3 sequence to sample expression levels closer to and above the wild-type allele. These mutant alleles captured a much greater range of mean expression and expression noise than TDH3 promoter alleles segregating in natural populations (Metzger et al., 2015; Duveau et al., 2017a) and allowed us to more fully explore the relationship between mean, noise and fitness than would be possible using naturally occurring variation alone.

From this collection of 171 TDH3 promoter alleles (Figure 1—figure supplement 1, Figure 1—figure supplement 1—source data 1), we selected 43 alleles (Source data 1) to study the fitness effects of changes in average expression level and expression noise of the native TDH3 gene. The average expression level conferred by these 43 P_TDH3 alleles (including the wild type allele of P_TDH3) ranged from 0% to 207% of the wild type allele and the expression noise ranged from 3% to 371% of the wild type allele (Figure 1D). Most importantly, this set of alleles showed variation in expression noise independent of expression level at expression levels between 0% and 125% of the wild type allele (Figure 1D).

Figure 2 with 2 supplements see all

Download asset Open asset

Fitness consequences of variation in *TDH3* expression level.

(A) Overview of competition assays used to quantify fitness. The 43 *P_TDH3* alleles whose activity was described in Figure 1D were introduced upstream of the native *TDH3* coding sequence in a genetic background expressing YFP under control of the wild type *TDH3* promoter. A minimum of six replicate populations of the 43 strains were competed for ~20 generations in rich medium (YPD) against a common reference strain expressing GFP under control of the wild type *TDH3* promoter. The relative frequency of cells expressing YFP or GFP was measured every ~7 generations using flow cytometry. (B) Competitive fitness was calculated from the change in genotype frequency over time. The relative fitness of each strain was calculated as the mean competitive fitness of that strain across replicates divided by the mean competitive fitness of the strain carrying the wild type allele of *TDH3*. (C) Relationship between median expression level of *TDH3* and fitness in rich medium (YPD). Dots show the average median expression and average relative fitness measured among at least four replicates for each of the 43 *P_TDH3* alleles. Colors represent different types of promoter mutation. Error bars are 95% confidence intervals and are only visible when larger than dots. The dotted curve is the best fit of a LOESS regression of fitness on median expression, using a value of 2/3 for the parameter α controlling the degree of smoothing. The shaded area shows the 99% confidence interval of the LOESS fit. Data are available in Source data 1. Panels A and B were originally published as Figure 2A in Duveau et al. (2017a) and are reproduced here by permission of Oxford University Press [http://global.oup.com/academic].

https://doi.org/10.7554/eLife.37272.007

Figure 3 with 8 supplements see all

Download asset Open asset

Effect of *TDH3* expression noise on fitness.

(A) Separation of the 43 *P_TDH3* alleles into two categories based on their effects on median expression level and expression noise (noise strength). The gray curve shows the LOESS regression of noise on median expression using a value of 2/3 for the smoothing parameter. Data points falling below the curve (green) correspond to *P_TDH3* alleles with low noise given their median level of activity. Data points above the curve (orange) correspond to *P_TDH3* alleles with high noise given their median activity. The residual of the LOESS regression (‘Δ Noise’) is a measure of noise independent of median expression. (B) Relationships between median expression level and fitness for strains with low noise (green, Δ Noise < −1%) and high noise (orange, Δ Noise >+1%). The two LOESS regressions were performed with smoothing parameter α equal to 2/3. (C) Partition of *P_TDH3* alleles into two groups based on the distance of their median activity to the optimal level of *TDH3* expression. The expression optimum (vertical gray dotted line) corresponds to the expression level predicted to maximize fitness from the LOESS regression of fitness on median expression (gray curve). The expression level at which the predicted fitness is 0.005 below the maximal fitness was chosen as the threshold (vertical black dotted line) separating promoters with median activity ‘close to optimum’ from promoters with median activity ‘far from optimum’. The residual of the LOESS regression (‘Δ Fitness’) is a measure of fitness independent of the median *TDH3* expression level. Dots are colored as in (B). (D) Relationship between Δ Noise and Δ Fitness when median expression is far from optimum. (E) Relationship between Δ Noise and Δ Fitness when median expression is close to optimum. (**D–E**) The best linear fit between Δ Noise and Δ Fitness is shown as a gray line, with the coefficient of determination (‘R²’) and the significance of the Pearson’s correlation coefficient (‘P’) indicated in the upper left of each panel. Dots are colored based on median expression levels of the corresponding *P_TDH3* alleles as indicated by color gradient. (**A–E**) Error bars show 95% confidence intervals calculated from at least four replicate samples and are only visible when larger than symbols representing data points. (F) Comparison of Δ Fitness between genotypes with low noise strength (green, Δ Noise < −1%) and genotypes with high noise strength (red, Δ Noise >+1%). Thick horizontal lines represent the median Δ Fitness among genotypes and notches display the 95% confidence interval of the median. Bottom and top lines of each box represent 25^th and 75^th percentiles. Width of boxes is proportional to the square root of the number of genotypes included in each box. Permutation tests were used to assess the significance of the difference in median Δ Fitness between genotypes with low and high noise and P-values are shown in lower right corners. For each test, the values of ΔNoise were randomly shuffled among genotypes 100,000 times. The P values shown below each plot represent the proportion of permutations for which the absolute difference in median phenotype between genotypes with low and high ΔNoise was greater than the observed absolute difference in median phenotype between genotypes with low and high ΔNoise. Data are available in Source data 1.

https://doi.org/10.7554/eLife.37272.011

Fitness effects of changing average TDH3 expression level

To measure the fitness effects of changing TDH3 expression, we introduced each of these 43 P_TDH3 alleles upstream of the TDH3 coding sequence at the native TDH3 locus and performed competitive growth assays similar to those described in Duveau et al. (2017a) (Figure 2A). For each of the eight P_TDH3 alleles that contained a duplication of the P_TDH3-YFP reporter gene, we created a duplication of the entire TDH3 gene with the two corresponding P_TDH3 alleles. We also included a strain with a deletion of the promoter and coding sequence of TDH3 to sample a TDH3 expression level of 0%. Prior studies have found that deletion of TDH3 causes a moderate decrease in fitness in glucose-based media: −5% in Pierce et al. (2007) and −6.8% in Duveau et al. (2017a). Each strain tested was marked with YFP and grown competitively for 30 hr (~21 generations) with a reference genotype marked with a green fluorescent protein (GFP) (Figure 2A). Competitive fitness was determined from the rate of change in genotype frequencies over time and averaged across at least six replicate populations for each genotype tested (Figure 2B). The relative fitness of each strain was then calculated by dividing the competitive fitness of that strain by the competitive fitness of the strain with the wild type allele of TDH3 (Source data 1). This protocol provided a measure of fitness with an average 95% confidence interval of 0.2%. We then related these measures of relative fitness to the expression of the reporter gene driven by these P_TDH3 alleles at the HO locus. Expression of this reporter gene provided a reliable readout of average expression level and expression noise driven by the same P_TDH3 promoters at the native TDH3 locus, as measured using Tdh3-YFP fusion proteins (Figure 2—figure supplement 1A,B). These fusion protein alleles were not used for comparing fitness effects among TDH3 promoter alleles because the YFP fusion reduced fitness by 2.5% relative to a strain expressing TDH3 and YFP from independent promoters (Figure 2—figure supplement 1C).

A local regression (LOESS) of average expression level on fitness for the 43 TDH3 alleles and the TDH3 deletion showed a non-linear relationship with a plateau of maximal fitness near the wild type expression level (Figure 2C) similar to that described in Duveau et al. (2017a). Deletion of TDH3 (expression level of 0% in Figure 2C) caused a statistically significant decrease in fitness of 6.1% relative to the wild type allele (t-test, p=6.4×10⁻¹⁰). The minimum change in TDH3 expression level that significantly impacted fitness was a 14.6% decrease in average expression relative to wild type, which reduced fitness by 0.19% (t-test, p=0.00045). Overexpressing TDH3 up to 175% did not significantly impact growth rate, but the 207% expression level of the strain carrying a duplication of the wild type TDH3 gene caused a 0.92% reduction in fitness (Figure 2C; t-test, p=1.4×10⁻⁷). Notably, none of the 42 mutant alleles of TDH3 conferred a significantly higher fitness than the wild type allele (one-sided t-tests, p>0.05), indicating that the wild type expression level of TDH3 is near an optimum for growth in the environment assayed. We expect these differences in fitness among genotypes with different TDH3 promoter alleles to arise primarily from differences in TDH3 expression; however, differences in pleiotropy among promoter alleles might also contribute to differences in fitness.

Disentangling the effects of TDH3 expression level and expression noise on fitness

Residual variation was observed around the LOESS fitted line relating expression level to fitness (Figure 2C) that we hypothesized might be explained by differences in noise among genotypes. To examine the effects of differences in expression noise on fitness independent of differences in average expression level, we used the residuals from a local regression of expression noise on expression level for the alleles with average expression levels between 0% and 125% of the wild type allele to define a metric called ΔNoise (Figure 3A; Figure 3—figure supplement 1A–D). This metric was not significantly correlated with expression level (Figure 3—figure supplement 2). TDH3 alleles with positive ΔNoise values had a higher level of noise than expected based on their expression level and were classified as ‘high noise’, whereas TDH3 alleles with negative ΔNoise values had lower levels of noise than expected given their expression level and were classified as ‘low noise’.

We then compared the relationship between expression level and fitness for genotypes in the high noise and low noise classes (Figure 3B). We found that promoter alleles with positive ΔNoise tended to show a higher fitness than strains with negative ΔNoise (Figure 3B, Figure 3—figure supplement 1E–H). This beneficial effect of noise on fitness was surprising given prior evidence that selection favored alleles of P_TDH3 with low expression noise in natural populations (Metzger et al., 2015). We noticed, however, that the fitness benefit of increasing expression noise was limited to a particular range of average expression levels. Specifically, positive ΔNoise was associated with higher fitness only for average expression levels between 2% and 80% of the wild type expression level (Figure 3B). Above 80% of expression, no clear difference in fitness was detected between strains with positive and negative ΔNoise (Figure 3B). These same trends were also observed for the other metrics of noise (Figure 3—figure supplement 1E–H).

Based on these observations and prior theoretical work (Tănase-Nicola and ten Wolde, 2008), we hypothesized that the distance between the average expression level of a P_TDH3 allele and the optimal level of TDH3 expression influenced how a change in expression noise impacted fitness. To test this hypothesis, the 43 promoter alleles were split into two groups depending on the distance of their average expression level from the optimal expression level of TDH3. Using a local regression of fitness on average expression level, we inferred the value of average expression that would confer a fitness reduction of 0.5% from maximal fitness. Promoter alleles for which the median activity was below this threshold were considered to be ‘far from optimum’ and promoter alleles with median activity above the threshold were considered to be ‘close to optimum’ (Figure 3C). A metric called ΔFitness, corresponding to the residuals of a local regression of fitness on average expression, was calculated to remove the confounding effect of average expression levels on fitness (Figure 3C, Figure 3—figure supplement 1I–L, Figure 3—figure supplement 3). We found that changes in noise (ΔNoise) and changes in fitness (ΔFitness) were positively correlated for genotypes classified as far from the optimum (Pearson correlation coefficient: r = 0.74, p=9.36×10⁻⁷, Figure 3D, Figure 3—figure supplement 4A–D), but not for genotypes classified as close to the optimum (r = −0.08, p=0.84, Figure 3E, Figure 3—figure supplement 4E–H). This result was robust to variation in the choice of the smoothing parameter used for the local regression of noise on average expression, the choice of the smoothing parameter used for the local regression of fitness on average expression, and the fitness threshold used to separate strains with expression levels close and far from optimum (Figure 3—figure supplements 5, 6 and 7). We note, however, that the smaller number of genotypes with mean expression close to the optimum provided less power to detect a significant relationship than genotypes with mean expression far from the optimum.

As an alternative way to test for the impact of expression noise on fitness, we compared ΔFitness for genotypes with positive and negative values of ΔNoise. Permutation tests were used to assess the significance of differences in ΔFitness by randomly shuffling values of ΔNoise among genotypes. Consistent with the correlation analyses, genotypes with positive ΔNoise showed a significantly greater median value of ΔFitness than genotypes with negative ΔNoise at expression levels far from optimum (10⁵ permutations, P_ΔNoise ≤ 10⁻⁵; Figure 3F). Among genotypes with average expression close to optimum, no significant difference in median ΔFitness was detected between the positive and negative ΔNoise groups (10⁵ permutations, P_ΔNoise = 0.6442) (Figure 3F). The same pattern was observed for all metrics of noise and was not driven by differences in average expression levels between the two ΔNoise groups (Figure 3—figure supplement 8).

Direct measurements of the effect of expression noise on relative fitness

The results presented in the preceding section provide strong evidence that variation in TDH3 expression can directly affect fitness, but the methods used have at least two limitations. First, ΔNoise and ΔFitness values can be influenced by the set of P_TDH3 alleles included in the analyses since they are regression residuals. Second, comparisons of fitness among P_TDH3 genotypes rely upon the assumption that fitness effects are transitive, i.e. that differences in fitness between two strains are accurately reported by competitive growth against a third reference strain. Even though such transitivity has often been verified (de Visser and Lenski, 2002; Elena and Lenski, 2003; Gallet et al., 2012), intransitive competition has been observed in several organisms, including yeast (Paquin and Adams, 1983). To test whether differences in TDH3 expression noise affect fitness without calculating regression residuals and without assuming transitivity, we performed direct competition assays between strains with P_TDH3 promoter alleles that showed similar average expression levels but different levels of expression noise.

Five pairs of TDH3 alleles for which (i) the median level of activity was similar between the two promoters of each pair, (ii) the level of noise was different between the two promoters of each pair, and (iii) the median level of activity varied among different pairs were chosen from the full set of 171 alleles described above (Figure 4A; Figure 4—source data 1). The promoter variants of four of these pairs were included in the indirect competition assays and showed the general pattern of increased fitness with increased expression noise when average expression was far from optimum and no significant difference in fitness despite differences in expression noise when average expression was close to optimum (Figure 4B). Promoters in the fifth pair were not among the 43 alleles included in the indirect competition experiment but were selected for the direct competition assays because they showed variation in expression noise at average expression levels close to wild type (purple points in Figure 4A).

Figure 4 with 1 supplement see all

Download asset Open asset

Direct competition between genotypes with different levels of noise but similar median levels of *TDH3* expression in glucose.

(**A–C**) Different colors are used to distinguish pairs of genotypes (*P_TDH3* alleles) with different median expression levels. (A) Median expression level and expression noise (noise strength) for five pairs of genotypes that were competed against each other. Each pair comprises one genotype with low expression noise (circle) and one genotype with high expression noise (triangle). (B) Relative fitness for four pairs of genotypes measured in competition assays against the common GFP reference strain. One pair is missing (purple in (A)) because the corresponding *P_TDH3* alleles were not part of the 43 alleles included in the initial competition experiment. (**A–B**) Error bars show 95% confidence intervals obtained from at least three replicates. (C) Competitive fitness of high noise strains relative to low noise strains measured from direct competition assays. Each box represents fitness data from 16 replicate samples. The average median expression level of the two genotypes compared is shown below each box along with the difference in expression noise between these two genotypes (ΔNoise). Thick horizontal lines represent the median fitness across replicates and notches display the 95% confidence interval of the median. The bottom and top lines of each box represent 25^th and 75^th percentiles. Statistical difference from a fitness of 1 (same fitness between the two genotypes) was determined using t-tests (*: 0.01 < P < 0.05; **: 0.001 < P < 0.01; ***: p<0.001). Data are available in Figure 4—source data 1.

https://doi.org/10.7554/eLife.37272.020

Figure 4—source data 1 Fitness measured in direct competition assays between strains with low and high values of expression noise. Competitive fitness was measured by pyrosequencing and analyzed using the R script provided in Supplementary file 3. Data used to make Figure 4.: https://doi.org/10.7554/eLife.37272.022
Download elife-37272-fig4-data1-v2.xlsx

For each of the five pairs, the low noise genotype and the high noise genotype were directly competed against each other under the same conditions used in the competitive growth fitness assay described above except that we doubled the number of generations and the number of replicates to increase the sensitivity of our fitness estimates. In addition, we used pyrosequencing (Neve et al., 2002) instead of flow cytometry to determine the relative frequency of the two genotypes at each time point because the two strains competed against each other could not be distinguished based on fluorescence. Relative fitness of the high and low noise genotypes in each pair was calculated based on the changes in relative allele frequency during competitive growth.

For the three pairs of genotypes with an average expression level far from optimum (12%, 19%, and 59% average expression relative to wild type), fitness of the high noise genotype relative to the low noise genotype was significantly greater than 1 (Figure 4C), indicating that the high noise genotype grew faster than the low noise genotype. This result was consistent with the differences in fitness measured from the indirect competition assays (Figure 4B). By contrast, both pairs of strains with an average expression level closer to the fitness optimum (93% and 102% relative to wild type expression levels) showed slightly but significantly lower fitness of the high noise genotype than the low noise genotype (Figure 4C). In these cases, higher expression noise resulted in a ~ 0.1% decrease in fitness relative to lower noise. This difference was detectable with the direct competition assay because the average span of the 95% confidence intervals of fitness estimates was 0.1%, which is half of the 0.2% average 95% confidence intervals from the indirect competition assay described above.

Taken together, our empirical measures of relative fitness show that higher expression noise for TDH3 is beneficial when average expression level is far from the optimum, but deleterious when average expression is close to the optimum. An intuitive explanation of this phenomenon is that when the average expression level is close to the optimum, increasing expression noise can result in enough cells with suboptimal expression to decrease fitness of the population. Conversely, when the average expression level is far from the optimum, increasing expression noise can result in enough cells with expression closer to the optimum to increase fitness of the population. These effects of expression noise on population fitness can result from differences in expression level among cells causing differences in the cell division rate (a.k.a. doubling time) among cells (Kiviet et al., 2014). To better understand the interplay among average expression level, expression noise, and fitness, we developed a simple computational model that allowed us to (1) vary the expression mean and noise independently while holding all other parameters constant, (2) track the resulting single cell growth dynamics, and (3) evaluate the consequences for population fitness.

Simulating population growth reveals fitness effects of noise

To further investigate how the distribution of expression levels among genetically identical cells influences population fitness, we modeled the growth of clonal cell populations that differed in the mean expression level and expression noise for a single gene. In this model, we specified a function defining the relationship between the expression level of a cell and the doubling time of that cell. Following each cell division, the expression levels of mother and daughter cells were sampled independently from an expression distribution characterized by its mean and noise (Figure 5A). This independent sampling ignores any inheritance of expression noise, which is a conservative choice for detecting differences in fitness among genotypes due to differences in noise. The doubling time of each cell was then calculated from its expression level (Figure 5B), and each clonal population was allowed to expand for the same amount of time, increasing in size at a rate determined by the doubling times of the cells sampled (Figure 5C). Empirical measures of single-cell division rates were consistent with these elements of the model, showing more variable cell division times in genotypes with greater TDH3 expression noise and shorter cell division times in genotypes with mean TDH3 expression closer to the fitness optimum (Figure 5—figure supplement 1). Competitive fitness was ultimately determined in the model by comparing the population size obtained at the end of each simulation experiment to the population size obtain for a constant ‘wild type’ competitor (Figure 5D, Figure 6—source data 1). 100 independent simulations were performed for each unique combination of mean expression level and expression noise. Three metrics of expression noise were used for this work: noise strength (similar to Fano factor, Figure 6), standard deviation (Figure 6—figure supplement 1A,C) and coefficient of variation (Figure 6—figure supplement 1B,D).

Figure 5 with 1 supplement see all

Download asset Open asset

A simple model linking single cell expression levels to population fitness.

(A) In our model, the expression level $E$ of individual cells is randomly drawn from a normal distribution $𝒩 (μ_{E}, {σ_{E}}^{2})$ . $σ_{E}$ is lower for a genotype with low expression noise (top, green line) and higher for a genotype with high expression noise (bottom, orange line). (B) The doubling time $D T$ of individual cells is directly determined from their expression level using a function $D T = f (E)$ . (C) The growth of a cell population is simulated by drawing new values of expression converted into doubling time after each cell division. In this example, doubling time is more variable among cells for the population showing the highest level of expression noise. (D) Population growth is stopped after a certain amount of time (1000 minutes in our simulations) and competitive fitness is calculated from the total number of cells produced by the tested genotype relative to the number of cells in a reference genotype with $μ_{E} = 1$ and $σ_{E} = 0.1$ . In this example, fitness is lower for the genotype with higher expression noise (bottom) because it produced less cells than the genotype with lower expression noise (top).

https://doi.org/10.7554/eLife.37272.023

Figure 6 with 2 supplements see all

Download asset Open asset

Simulating the effect of expression noise on fitness at different median expression levels.

(A) The linear function $D T = - 40 \times E + 160$ relating the expression level of single cells to their doubling time used for the first set of simulations. (B) Relationship between mean expression ( $μ_{E}$ ) and fitness at nine values of expression noise (noise strength: ${σ_{E}}^{2} / μ_{E}$ ) ranging from 50% to 2000% using the linear function shown in (A). (C) Gaussian function $D T = - 160 \times \exp (- {(E - 1)}^{2} / 0.18) + 240$ relating the expression level of single cells to their doubling time used in the second set of simulations. This function shows an optimal expression level at $E = 1$ , where doubling time is minimal (*i.e.*, fastest growth rate). (D) Relationship between mean expression ( $μ_{E}$ ) and fitness at 11 values of expression noise (noise strength: ${σ_{E}}^{2} / μ_{E}$ ) ranging from 50% to 1400% using the Gaussian function shown in (C). (**B,D**) Error bars show 95% confidence intervals of mean fitness calculated from 100 replicate simulations for each combination of mean expression and expression noise values. Data are available in Figure 6—source data 1.

https://doi.org/10.7554/eLife.37272.027

Figure 6—source data 1 Fitness data obtained by modeling the growth of cell populations with different levels of mean expression and expression noise. Data used to make Figure 6B and D and generated with the Matlab code provided in Supplementary file 5.: https://doi.org/10.7554/eLife.37272.030
Download elife-37272-fig6-data1-v2.xlsx

To calculate doubling times from single cell expression levels, we first used a linear function akin to directional selection in which increases in expression level resulted in shorter doubling times (faster growth) (Figure 6A). With this relationship, higher levels of expression noise conferred higher population fitness for a given mean expression level (Figure 6B), a pattern more pronounced for high values of mean expression and observed for all metrics of noise (Figure 6—figure supplement 1A,B). This finding is consistent with prior work demonstrating that an increased variability of doubling time among individual cells is sufficient to increase fitness at the population level (Tănase-Nicola and ten Wolde, 2008; Cerulus et al., 2016; Hashimoto et al., 2016; Nozoe et al., 2017). This is because the doubling time of a population tends to be dominated by the doubling time of the fastest dividing cells in the population, i.e. population doubling time is higher than the mean doubling time among all cells in the population.

Next, we used a Gaussian function akin to stabilizing selection in which an intermediate expression level produced the shortest doubling time (faster growth), while lower or higher expression than this optimum would increase doubling time (slower growth) (Figure 6C). With this function, we found that the fitness effects of increasing expression noise depended on the mean expression level. Specifically, increasing expression noise increased fitness when the average expression level was far from the optimal expression level and it decreased fitness when the average expression level was close to the optimum (Figure 6D), similar to the pattern we observed with our empirical fitness data and in agreement with theoretical work by Tănase-Nicola and ten Wolde (2008). This result was observed for all three metrics of noise, suggesting it is robust to the different scaling relationships between the mean expression level and variability around the mean captured by different metrics of noise (Figure 6—figure supplement 1C,D).

These in silico analyses not only provide a plausible mechanistic explanation for our empirical finding that increasing noise can be both beneficial and deleterious in a single environment but they also show that increasing expression noise can alter the effects of changes in mean expression level on fitness. Specifically, when expression noise is high (red lines on Figure 6D and Figure 6—figure supplement 1C,D), changes in mean expression level are predicted to have much smaller impacts on fitness than equivalent changes when expression noise is low (blue lines on Figure 6 and Figure 6—figure supplement 1C,D). This pattern is also readily apparent when changes in expression noise, instead of changes in mean expression level, are plotted as a function of population fitness (Figure 6—figure supplement 2). These observations are consistent with a previously published population genetic model showing that increasing expression noise can reduce the efficacy of natural selection acting on mean expression level (Wang and Zhang, 2011).

Conclusions

Despite many studies providing evidence that natural selection can (Tănase-Nicola and ten Wolde, 2008; Wang and Zhang, 2011; Barroso et al., 2018) and has (Fraser et al., 2004; Lehner, 2008; Zhang et al., 2009; Metzger et al., 2015) acted on expression noise, the precise effects of expression noise on fitness have proven difficult to measure empirically. This difficulty arises from the facts that (1) most mutations that alter expression noise also alter mean expression in a correlated fashion, making it difficult to isolate the effects of changes in expression noise on fitness (Hornung et al., 2012; Keren et al., 2016; Liu et al., 2016), and (2) the magnitude of fitness effects resulting from changes in expression noise is expected to be smaller than that resulting from changes in mean expression level (Zhang et al., 2009). In this study, we overcame these challenges by surveying a broad range of mutant promoter alleles for their effects on mean expression level and expression noise, measuring the fitness effects of a subset of these alleles with reduced dependency between effects on mean expression level and expression noise, and using an assay for fitness with power to detect changes as small as 0.1%. We found that the fitness effects of changes in expression noise are indeed generally much smaller than changes in expression level, although they are large enough to be acted on by natural selection in wild populations of S. cerevisiae (Wagner, 2005; Metzger et al., 2015).

We also show that changes in expression noise can be beneficial or deleterious depending on the distance between the mean expression level and the expression level conferring optimal fitness in the environment examined, with increases in expression noise deleterious near the optimal expression level, consistent with data for TDH3 in Metzger et al. (2015). Although our empirical work focused solely on the TDH3 gene, the small number of parameters in our simulation model producing the same pattern as these empirical data suggests that the observed relationship among fitness, average expression level and expression noise are likely generalizable to other genes. That said, the precise relationship between expression noise and fitness at the population level is expected to be shaped by the relationship between average expression level and doubling time of single cells as well as the temporal dynamics of expression in single cells (Blake et al., 2006; Tănase-Nicola and ten Wolde, 2008). We provide some experimental measures of single-cell division rates here (Figure 5—figure supplement 1), but studies that more directly compare expression levels and division times in individual cells are needed to fully address this issue.

Assuming that the average expression level of a population is near the fitness optimum in a stable environment, but further from the optimum following a change in the environment, our results unify studies showing that increasing expression noise tends to be deleterious in a constant environment but beneficial in a fluctuating one (Fraser et al., 2004; Blake et al., 2006; Batada and Hurst, 2007; Lehner, 2008; Tănase-Nicola and ten Wolde, 2008; Zhang et al., 2009; Fraser and Kaern, 2009; Ito et al., 2009; Wang and Zhang, 2011; Levy et al., 2012; Wolf et al., 2015; Liu et al., 2015; Keren et al., 2016). Expression noise may be particularly important in the early phase of adaptation to a fluctuating environment, when a new expression optimum makes an increase in noise beneficial and before expression plasticity evolves as a more optimal strategy (Wolf et al., 2015). Such plasticity in expression level seems to have already evolved for TDH3 (Duveau et al., 2017b). Our data suggest that high levels of expression noise can also be beneficial in a stable environment when the mean expression level is far from optimal. For example, if an allele driving suboptimally low expression were to be fixed in a population, selection should initially favor alleles that increase mean expression and/or expression noise. After alleles driving mean expression close to the optimum are fixed, selection should then favor alleles with lower levels of expression noise. The relative frequency by which evolution proceeds through these two paths will depend on both the relative frequency of alleles that increase mean expression and expression noise, as well as the fitness differences between these alleles. We note, however, that the often correlated effects of promoter mutations on mean expression level and expression noise (Hornung et al., 2012; Carey et al., 2013; Sharon et al., 2014; Vallania et al., 2014) may limit the ability of natural selection to optimize both mean expression level and expression noise. Future work investigating the effects of other types of mutations on mean expression level, expression noise, and fitness in multiple environments is needed to more fully define the range of variation affecting gene expression upon which natural selection can act.

Share this article

Cite this article

A collection of TDH3 promoter alleles with incompletely correlated effects on average expression level and expression noise.

Figure 1—source data 1

Fitness consequences of variation in TDH3 expression level.

Effect of TDH3 expression noise on fitness.

Direct competition between genotypes with different levels of noise but similar median levels of TDH3 expression in glucose.

Figure 4—source data 1

A simple model linking single cell expression levels to population fitness.

Simulating the effect of expression noise on fitness at different median expression levels.

Figure 6—source data 1

Author details

Fabien Duveau

Contribution

Competing interests

Andrea Hodgins-Davis

Contribution

Competing interests

Brian PH Metzger

Contribution

Competing interests

Bing Yang

Contribution

Competing interests

Stephen Tryban

Contribution

Competing interests

Elizabeth A Walker

Contribution

Competing interests

Tricia Lybrook

Contribution

Competing interests

Patricia J Wittkopp

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism