Figures and data

Overview of the choice of encoding and the choice of time scale for quantifying relative fitness.
(A) Example trajectory of relative abundance x (top panel) for a mutant invading and eventually replacing a wild-type population. The same trajectory is plotted under the encoding log x (middle panel) and the encoding logit x = log(x/(1 − x)) (bottom panel). (B) Flowchart to predict the future relative abundance of a mutant given a relative fitness value sm = dm/dt for some encoding m. The current relative abundance xt is transformed into the new variable mt = m(xt), then projected into the future through a linear extrapolation using sm (upper horizontal arrow) and finally converted back into a frequency xt+Δt using the decoding function m−1. (C) Four scenarios for a mutant with the same relative fitness per-cycle but with different underlying population dynamics. For each scenario, we show an example trajectory of absolute abundance (stacked) for the wild-type (dark grey) and mutant population (light grey). Each scenario is mapped as a single-dot onto the fold-change diagram (center plot) and colored areas indicated positive (green area) and negative relative fitness per-cycle (blue area; compare Eq. (9)). Fitness per-cycle has isoclines that are parallel to the identity in the LFC diagram. (D) Scenario for rank discrepancy between relative fitness per-cycle 




Comparison of mutant fitness rankings with different time scales using empirical trait variation.
(A) Overview of pairwise coculture simulations. For each mutant strain (grey), we simulate a competition growth cycle against a reference wild-type strain (blue) using the estimated traits and laboratory parameters for the initial condition (N0 = 0.05 OD, R0 = 111 mM glucose, x = 0.5; Methods and see Figure for the underlying trait distribution). (B) Rank discrepancy between relative fitness per-cycle 




Discrepancy between fitness per-generation and per-cycle in epistasis detection and long-term trend.
(A) Schematic detection of epistasis for two single mutants (light and dark circle). Epistasis occurs if the corresponding double mutant (two-component circles ) deviates from the additive fitness (parallelogram shape in solid lines; Methods). (B) Correlation between epistasis in relative fitness per-cycle scycle and per-generation sgen. Each dot corresponds to a hypothetical double mutant that combines pairs of mutations systematically perturbing each of the three growth traits (Figure 2A) in our minimal model of population dynamics (Methods). (C) Heatmap for epistasis in the per-cycle fitness scycle for all double mutants in our simulated data set, same data as in panel B. (D) Heatmap for epistasis in the per-generation fitness sgen, same data as in panel B. All epistasis plots are based on 1:1 competition growth cycles with the wild-type (Figures S9–S14). See Figure for epistasis patterns at low mutant relative abundance. (E) Hypothetical scenario for trait evolution in an evolution experiment. An evolving population decreases lag time to zero (orange line), increases growth rate until saturation (blue line), and gradually decreases biomass yield (green line). This trend is similar to initial observations from the LTEE [98]. (F) Corresponding long-term trend in relative fitness based on the trait evolution in panel E. We estimate relative fitness per-cycle 


Rank discrepancies between fitness statistics in LTEE competition data.
(A) Rank difference between relative fitness per-cycle and per-generation as a function of time in the LTEE. For each of the 12 lines in the LTEE, we compute the relative fitness per-cycle 














The choice of library abundance and reference group in bulk competition experiments.
(A) Overview of a pairwise competition experiment with one mutant (top row) and scenarios for bulk competition experiments with many different mutants (middle and bottom rows). The two bulk competition scenarios differ in their initial fraction of the mutant library (colored ovals) in the inoculum (open box). For each scenario, we show a schematic growth cycle (log absolute abundance) in the inset on the right. (B) Schematic relative abundance trajectories for a mutant compared to two alternative subpopulations. We distinguish between the total relative abundance xi with respect to the population as a whole (height of green band in the top box) and the pairwise relative abundance xi,wt with respect to the wild-type (height of green band in the bottom box; Eq. (20)). We indicate the sign of total relative fitness (Eq. (24)) and pairwise relative fitness (Eq. (25)) on the right. (C) The absolute error between bulk and pairwise competition experiments. The total relative fitness (dark grey dots; Eq. (24)) and the pairwise relative fitness (orange dots; Eq. (25)) for mutants in the minimal population dynamics model (Figure 2A, Methods) parameterized by the yeast knockout data (Figure S4) in bulk competition growth cycle with low mutant library abundance (panel A, case II; Methods). The absolute error is defined as the bulk fitness statistic minus the relative fitness in pairwise competition (Eq. (S92) in Sec. S15). In the inset, the absolute error for pairwise relative fitness (Eq. (25)) for a bulk competition growth cycle with 99.9% library abundance (light grey dots; case III). This still includes a wild-type reference at small percentage to be able to compute a pairwise relative fitness. The x-axis and the orange dots in the inset are identical to the main plot. (D) The relative error in bulk competition experiments as a function of mutant library abundance in the inoculum. Each line corresponds to a knockout mutant in our data set, and represents the relative error between the pairwise relative fitness in bulk competition and the relative fitness in pairwise competition (Eq. (S102) in Sec. S16). The black vertical lines show the recommended mutant library abundance for our data set based on Eq. (11) (xlib ≈ 24.6%) and based on the more conservative Eq. (S118) (xlib ≈ 0.02%, Sec. S16). Note these bounds are calculated a priori and are not the result of a fitting procedure.

Empirical benchmarks of bulk fitness against pairwise competition experiments.
(A) Fitness measurements as reported by Opijnen et al. [103] for knockouts of S. pneumoniae. Pairwise fitness was measured in 1:1 competitions of knockouts strains with the wild-type over a single growth cycle using the per-generation statistic 




Parameter settings for Gaussian Process optimisation.

Predicting the absolute and relative abundance of microbial populations.
(A) Example time series of absolute abundance for a single microbial population (light grey). The observer (eye symbol) of the time series at time t can ask about the future trend in absolute abundance (arrows). (B) Same as panel (A) but for two microbial strains, such as a wild-type and a mutant. The absolute abundance of the wild-type strain (light grey) is stacked on top of the absolute abundance of the mutant strain (dark grey). (C) The relative abundance for the mutant strain corresponding to panel (B). We sketch the mean relative abundance x over time (green line) inferred from multiple replicate cocultures. An error band (light green) around the time series demonstrates the variation between replicates.

The effect of encodings on a non-logistic relative abundance trajectory.
Example trajectory of relative abundance x (top panel) for a mutant invading and eventually replacing a wild-type population, simulated with the Gompertz equation dx/dt = rx log(1/x). Below, we show the Gompertz trajectory under the encodings log x and logit x = log(x/(1 − x)). Compare to Figure 1A for a trajectory simulated with the logistic equation.

The variation of wild-type and mutant log fold-change under resource consumption constraints.
(A) Schematic fold-change variation under a perfect resource consumption constraint between wild-type and mutant. Each dot corresponds to a mutant strain in a 1:1 competition growth cycle with the wild-type strain (Sec. S5). There is negative covariation between mutant and wild-type LFCs because the resource constraint forces one to decrease if the other increases. For our model of population dynamics, we can calculate this constraint exactly (black line: Eq. (S36) in Sec. S5). For two mutants (red points) we highlight the bow tie-shaped areas (red shading; compare Figure 1D) where other competitions must lie to have a rank discrepancy between relative fitness per-cycle 




Empirical trait distribution for single gene-knockouts of yeast.
(A) Overview of the growth curve data set and the estimated growth traits for the knockout library of Saccharomyces cerevisiae (Methods) [118]. (B) Covariation between estimated maximum growth rate g and lag time λ across all mutant strains (grey dots; Pearson correlation coefficient r = −0.17, p = 7 × 10−30) as well as wild-type replicates (orange dots; r = −0.16, p = 0.002). The reference wild-type strain for our pairwise coculture simulations is defined by the median trait values (black cross) of all wild-type replicates. (C) Covariation between measured maximum growth rate g and biomass yield Y across all mutant strains (grey dots; r = 0.21, p = 8 × 10−44) as well as wild-type replicates (orange dots; r = −0.06, p = 0.25). Histograms on the top and right sides of (B) and (C) are marginal distributions along each axis, with colors matching the scatter points and the vertical black line marking the median wild-type trait value.

Replicate measurements for growth rate, lag time, and yield in our empirical trait data set.
(A) Co-variation of growth rate between replicate measurements of the knockouts (grey dots; Pearson correlation coefficient r = 0.94, p = 0.00). Each dot represents a mutant genotype from the single-gene knockout collection in Saccharomyces cerevisiae [118]. For the vast majority of genotypes in our data set (4163 out of 4492 knockouts) we are able to fit two traits from two independent growth curve measurements (Figure S4A; Methods). (B) Covariation of lag time between replicate measurements of the knockouts (grey dots; r = 0.90, p = 0.00) (C) Covariation of biomass yield between replicate measurements of the knockouts (grey dots; r = 0.43, p = 4.81 × 10−188).

The distribution of fitness effects under relative fitness per-cycle and per-generation.
(A) Distribution of relative fitness per-cycle 




Exploring alternative conditions for rank discrepancy between fitness statistics in yeast gene-deletion data.
Plots columns (A) and (B) have the same format as Figure S4B,C, and columns (C) and (D) match Figure 2B,C (in reverse order). Row (A): Low mutant frequency x = 0.01. Row (B): Standard mutant frequency x = 0.5, but where we artificially eliminate variation in lag time to test the effect on the fitness rank discrepancy. Row (C): Same as row (B) but no variation in growth rate. Row (D): Same as row (B) but no variation in biomass yield. Row (E): Same as row (B), but yields of mutants lower than the median wild-type yield are set to the wild-type value.

Hypothetical case where relative fitness per-cycle and per-generation are anticorrelated across mutants.
(A) Covariation between growth rate and lag time for a synthetically generated set of mutants (grey dots) and a single wild-type (orange dots). The variation here may represent the standing variation in an evolved population with improved growth rate and lag time over the wild-type ancestor. The graphic overlap between points means they appear as a single line. (B) Covariation between growth rate and biomass yield for the mutants (grey dots) and wild-type (orange dot) in this synthetic data set. (C) Covariation between relative fitness per-cycle 




Competition growth cycles for double mutants — first column of Figure 3C,D.
Each row corresponds to a pair of mutations that perturb growth traits in the minimal model of population dynamics (Figure 2A, Methods). This figure shows all combinations where the first mutant increases growth rate (first column of Figure 3C,D). Column (A): Perturbations of growth traits for single and double mutants. Column (B): Growth curves of wild-type (grey) and mutant 1 (blue). Column (C): Growth curves of wild-type (grey) and mutant 2 (red). Column (D): Growth curves of wild-type (grey) and double mutant (purple).

Competition growth cycles for double mutants — second column of Figure 3C,D.
Same as Figure S9 but for mutant pairs where the first mutant decreases growth rate.

Competition growth cycles for double mutants — third column of Figure 3C,D.
Same as Figure S9 but for mutant pairs where the first mutant increases lag time.

Competition growth cycles for double mutants — fourth column of Figure 3C,D.
Same as Figure S9 but for mutant pairs where the first mutant decreases lag time.

Competition growth cycles for double mutants — fifth column of Figure 3C,D.
Same as Figure S9 but for mutant pairs where the first mutant increases biomass yield.

Competition growth cycles for double mutants — sixth column of Figure 3C,D.
Same as Figure but for mutant pairs where the first mutant decreases biomass yield. For the double mutant where the second mutant also decreases yield, we cannot use the additive mutation effects (Methods) because that would result in a negative yield for the double mutant. Here we have defined the double mutant to have a yeield value of Ymut = 0.1. This choice does not affect the fitness or epistasis, since biomass yield is a neutral trait with fitness zero in our model of population dynamics.

Epistasis patterns at low mutant relative abundance.
Same as Figure 3B–D but where the mutant starts each competition at low relative abundance (x0 = 0.01; Sec. S8), compared to the 1:1 initial conditions in Figure (x0 = 0.5).

Long-term fitness trends in the LTEE under relative fitness per-cycle and per-generation.
(A) Fit of a hyperbolic model (pink line; Eq. (S56) in Sec. S9) and a power-law model (cyan line; Eq. (S57) in Sec. S9) to a pooled time-series of relative fitness per-generation 




Comparison of relative fitness per-generation and per-cycle across the complete LTEE competition data set.
(A) Rank discrepancy between relative fitness per-cycle 






Comparison of log-encoding and logit-encoding for relative fitness per-cycle in the LTEE competition data set.
(A) Same as Figure S17A but for rank discrepancy between logit-encoded relative fitness per-cycle 


Predicting relative fitness with monoculture proxies under different scenarios of trait variation.
Plots in columns (A) and (B) show growth trait variation across mutants, with the same format as Figure S4B,C. Plots in column (C) show quality of prediction for different monoculture proxies under the trait distribution in columns (A) and (B). As the ground truth, we estimate the relative fitness per-cycle 


The choice of the cutoff time for evaluating the area under the curve (AUC).
(A) Distribution of saturation times in monoculture for the knockouts (grey bars) and wild-type replicates (orange bars) in our empirical data set (Figure S4). The saturation time tsat is defined as the time when the limiting resource is depleted (R(tsat) = 0) and can be estimated numerically from the trait data (Methods). Three vertical lines indicate example choices for the cutoff time teval of the AUC: the most frequent saturation time (teval = 13 hours; red line), a saturation time halfway in the decay of the distribution (teval = 16 hours; black line), and an external time scale (teval = 24 hours; blue line). (B) Covariation between relative fitness per-cycle 



The error in mutant fitness rankings between bulk and pairwise competition experiments.
(A) Rank difference between total relative fitness and pairwise relative fitness in a bulk competition growth cycle with low mutant library abundance (Figure 5A, case II). Based on the fitness data in Figure 5C, we calculate a mutant ranking for total relative fitness in bulk (Eq. (24)) and a ranking for pairwise relative fitness in bulk (Eq. (25)) (higher rank means higher fitness and mutants with equal fitness are assigned the lowest rank in the group). The rank difference is defined as the rank in total relative fitness 


A decomposition for the error from higher-order interactions in bulk competition experiments.
For a bulk competition growth cycle with low mutant library abundance (Figure 5A, case II), we calculate the pairwise relative fitness (Eq. (25)) for the knockouts in our empirical data set (Figure S4A) using a previously established approximation [122] (Sec. S14). The error from higher-order interactions is defined as the pairwise fitness in bulk minus the fitness in pairwise competition. Based on the approximation for our model of population dynamics (Sec. (S14)), we derive a decomposition that separates the absolute error into two terms that we call the fitness-dependent error term (dark orange dots) and the fitness-independent error term (light orange dots). For full details on the decomposition, see Sec. (S15).

The relative error between bulk and pairwise competition experiments.
(A) Same as Figure 5C but showing relative rather than absolute error. The relative error of each bulk statistic is defined as the absolute error (Figure 5C), divided by the fitness in pairwise competition (Eq. (S102) in Sec. S16). A dashed grey line indicates the threshold of 1% relative error. (B) Same as Figure but showing relative rather than absolute error. (C) Same as the inset of Figure 5C but showing relative rather than absolute error. On the x-axis, we plot the absolute value of relative fitness per-cycle in the pairwise competition.

The bulk fitness error based on measurements by van Opijnen et al. [155].
(A) Absolute error 




The bulk fitness error based on measurements by Levy et al.
[32]. Same as Figure but for the set of evolved genotypes of S. cerevisiae shown in Figure 2d in the original publication. In (A), we calculate the Pearson correlation coefficient r and the p-value for the null hypothesis of zero correlation, and indicate the linear regression between the two variables with a red line. In (B) we define a set of knockouts with non-neutral fitness (blue dots; 