Abstract
Few concepts are as central to evolution as is fitness, and yet the quantification of fitness is often ambiguous. In particular, high-throughput experiments to measure mutant fitness in microbes are increasingly common but vary widely in their definitions of fitness, which makes their results difficult to compare. What are the consequences of these different fitness statistics, and is there a best way to quantify fitness in a given context? Here we systematize the set of possible fitness statistics according to the following three choices: 1) the encoding of relative abundance (e.g., transforming by a log or logit function), 2) the time scale over which to measure the change in relative abundance, and 3) the choice of a reference subpopulation for calculating relative fitness in bulk competition experiments, such as those using DNA-barcoded mutants. We show that these different choices can lead to significantly different interpretations of mutant fitness, affecting the magnitude of fitness effects, the apparent presence of epistasis, and even the fitness ranking across mutants. This can confound predictions for evolutionary dynamics and gene functions. Altogether our results demonstrate the importance of consistent fitness definitions for reproducible results across experiments.
Introduction
The concept of fitness is supposed to describe the fate of mutations as they arise in a population [1] and defines other core evolutionary concepts such as tradeoffs [2]. At the same time, fitness is often a confusing term in evolutionary biology [3–5]. It is possible to classify fitness measures by their role in theory [6] and their scale of measurement [7–9], but these arguments offer little guidance on how to measure fitness in practice. Empirical fitness measurements have become widespread in the microbial sciences — to test evolutionary theory [10–15], detect microbial interactions [16, 17], annotate gene functions [18, 19], and understand the spread of antibiotic resistance genes [20, 21] — but the approaches to quantifying fitness in these studies vary widely.
The classic approach to measuring the fitness effects of mutations uses pairwise competition experiments, where the mutant competes with its ancestral wild-type in co-culture [22–24]. This approach is ideal because it mimics the dynamics of spontaneous mutations, and thus is typically used in experimental evolution [10, 12, 25–27]. For example, the Long-Term Evolution Experiment (LTEE) has evolved Escherichia coli over tens of thousands of generations and measured the fitness of each evolved lineage in competition with the ancestor [11, 26, 28]. However, pairwise competition experiments require a way to distinguish mutants from the wild-type in cocultures (e.g., fluorescent labeling or selection markers), which is not always feasible to engineer. A second approach is thus to measure properties of genotypes in monocultures, like the growth rate, and estimate relative fitness between genotypes from these measurements. Monoculture growth curves are straightforward to measure, even for large numbers of mutants [29–31], but empirical tests have shown that the monoculture growth rate is insufficient to predict the relative fitness between genotypes [32–34]. Summary statistics of whole growth curves, like the area under the curve (AUC), perform better [35–37]. However, this approach breaks down for strains that engage in cross-feeding or toxin production, and attempts to improve monoculture-based fitness estimates require additional experiments [34, 37] and do not go beyond qualitative fitness rankings [37].
A third and more recent approach is to measure relative fitness in bulk competition experiments using high-throughput sequencing, often of DNA barcodes [13, 38]. For example, transposon-insertion mutagenesis generates libraries of gene knockouts that can be tracked as they grow in a single batch culture [39]. Estimating the fitness of each mutant allows us to identify genes that are particularly important in the growth environment [18, 19, 40], in the given genetic background [14, 15, 41, 42], and in the context of an ecological community [16, 17, 42, 43]. Bulk fitness estimation also plays a major role in identifying beneficial lineages in experimental evolution [13, 44] (not unlike the tracking of viral pathogens in public health [45–47]) and especially testing their fitness in new environments [48–52]. Many of these studies make different choices about the design of experiments and the calculation of fitness, without explaining why a certain fitness measure was used or how to compare between data sets.
The inconsistent choices for quantifying microbial fitness make it is difficult to compare fitness across experiments, especially because different groups tend to use different definitions. For example, the LTEE reports relative fitness per-generation [11, 26], whereas other evolution experiments exclusively report relative fitness percycle of batch growth [12, 53]. New techniques, like barcode sequencing, often spark their own fitness metrics [18, 40, 54–56]. A different choice of fitness metric might lead to a different ranking of mutants in a genetic screen (and the importance we assign to those genes) or change statistical properties of the distribution of mutant fitness effects, which is essential to predict the probability of mutant fixation [57], the speed of adaptation [58], and the effect size in statistical tests of gene function [18, 19]. This raises three key questions: 1) How do these fitness statistics differ? 2) Can we say which choices are optimal, in the light of first principles or some practical considerations? 3) Does this tell us how we should design these experiments? Previous work has addressed aspects of these questions [6–9, 59] but the arguments are scattered across disciplines and based on generic models of population growth [6, 7, 9, 60], rather than incorporating the specific dynamics of microbes in laboratory experiments. Here we address these questions from a unified framework, using realistic microbial population dynamics with empirical traits to systematically test different fitness metrics.
Results
Predictive power of fitness depends on the encoding of relative abundance
We distinguish between three related but distinct notions of fitness [6], all based on predicting dynamics of a population [61–64]. The first type of fitness is absolute fitness, which is a property of a single genotype by itself and serves to predict the change in the genotype’s absolute abundance N (t) over a future time window Δt (Figure S1A). This is important for questions about extinction and evolutionary rescue [65]. The second type of fitness is relative fitness, which is a property of two genotypes as it describes how the relative abundance x(t) of one genotype changes compared to the other over a time Δt (Figure S1B). This is important to determine the fixation probability of new mutations [57, 66, 67]. In general these dynamics are stochastic [66, 68, 69], but throughout this paper we focus on their average behavior across replicate cultures (as sketched in Figure S1C).
A practical challenge of working with relative fitness is that it must be measured between all pairs of genotypes in coculture competitions. Therefore it is common to infer relative fitness of two genotypes based on some individual properties of the genotypes [70–73]. We denote this third notion of fitness as the fitness potential. Like absolute fitness, fitness potential is a property of an individual genotype, but unlike the absolute fitness, it has no meaning by itself. In analogy with the potential energy of physical systems, fitness potential is intended only to be compared between two genotypes (e.g., by taking the difference or ratio) to estimate the relative fitness between those genotypes [36, 37]. The collection of fitness potential values across a large set of genotypes forms a fitness landscape [74, 75]. We note that fitness as defined here gives information about short-time dynamics but not necessarily the long-term outcome (compare Smith et al. [6]). For example, this excludes the ratio of growth rates [7, 73] or the resource concentration R∗ in chemostat equilibrium [76] since these quantities cannot tell you how fast the absolute or relative abundance is changing. We also note that in all these definitions, the fitness of a genotype is specific to one environment, including all abiotic and biotic factors. How fitness in one environment is related to fitness in other environments is important [14, 19, 42, 43, 49, 50, 73, 77, 78] but beyond the scope of this article, as we focus on specific laboratory measurements of fitness rather than abstract concepts.
In this article we will focus on relative fitness as this is the most common object of high-throughput laboratory experiments in microbes [18, 26, 40, 55, 56] and also has the most straightforward connection to key evolutionary outcomes such as fixation [57, 66, 67]. We define a genotype’s relative fitness as any number that is sufficient to predict the genotype’s relative abundance x(t) over a short time period (Figure S1). The simplest approach is to take a linear expansion of x(t) and use its slope at time t to predict the change over a time Δt (e.g., Figure 1A, top panel):

The slope

qualifies as a fitness statistic but often will incorrectly estimate the actual change in relative abundance because x(t) is nonlinear (Figure 1A, top panel).

Overview of the choice of encoding and the choice of time scale for quantifying relative fitness.
(A) Example trajectory of relative abundance x (top panel) for a mutant invading and eventually replacing a wild-type population. The same trajectory is plotted under the encoding log x (middle panel) and the encoding logit x = log(x/(1 − x)) (bottom panel). (B) Flowchart to predict the future relative abundance of a mutant given a relative fitness value sm = dm/dt for some encoding m. The current relative abundance xt is transformed into the new variable mt = m(xt), then projected into the future through a linear extrapolation using sm (upper horizontal arrow) and finally converted back into a frequency xt+Δt using the decoding function m−1. (C) Four scenarios for a mutant with the same relative fitness per-cycle but with different underlying population dynamics. For each scenario, we show an example trajectory of absolute abundance (stacked) for the wild-type (dark grey) and mutant population (light grey). Each scenario is mapped as a single-dot onto the fold-change diagram (center plot) and colored areas indicated positive (green area) and negative relative fitness per-cycle (blue area; compare Eq. (9)). Fitness per-cycle has isoclines that are parallel to the identity in the LFC diagram. (D) Scenario for rank discrepancy between relative fitness per-cycle 



Alternatively, we can encode the relative abundance x in a new variable m(x) (any smooth, strictly-increasing function) whose dynamics may be easier to linearly approximate (Figure 1B; compare Denniston and Crow [79]):

where

is the relative fitness under the encoding m. In Sec. S1 we show examples of the encoded relative fitness for an explicit model of population dynamics, and in Sec. S2 we show how this concept generalizes to absolute fitness as well.
The ideal encoding transforms the relative abundance x(t) into a linear trajectory that can be fully predicted from the slope sm. While the log-encoding m(x) = log x is common in many TnSeq studies [17–19, 43, 77, 78] (and is implicit in plots of relative abundance on a logarithmic scale [44]), the log-encoded fitness slog tends to overestimate the change at high relative abundance (Figure 1A, middle panel). Instead, the best encoding in most cases is the logit function (Figure 1A, bottom panel):

This is because the relative abundance x(t) typically follows logistic dynamics (Sec. S1), at least as a first-order approximation (Sec. S3, Figure S2), and the logit function is linearly related to the inverse function of logistic x(t). The logit-encoded relative fitness of a mutant is equivalent to the standard definition of its selection coefficient in population genetics [80]. For this reason we focus on the logit-encoded relative fitness for the remainder of this work. Note that the logit encoding and logistic dynamics apply to relative abundance and are not to be confused with the logistic growth model used with microbial growth curves, where it describes the absolute abundance [81].
Estimating relative fitness requires a choice of time scale
Since the relative abundance of a genotype can only be measured at discrete times, we must estimate the instantaneous relative fitness of Eq. (4) using a finite difference between consecutive time points (Figure S1C, Sec. S4):

For the logit-encoding this becomes

In the case of a single mutant competing against a wild-type strain, we can rewrite Eq. (7) in terms of the log fold-changes of the mutant and wild-type strains:

where the log fold-change of each strain is LFC = log N (t)/N (t − Δt) and N is the biomass of the strain. The LFC is sometimes also referred to as a Malthusian parameter [9, 26, 65].
Therefore a key element of estimating relative fitness from discrete time data is the choice of time interval Δt. There are two common choices in empirical measurements of microbial fitness. The first is to use a time scale extrinsic to the population. In particular, microbial populations in laboratory experiments [26, 82] but also in some natural environments [83–85] grow in discrete growth cycles dictated by external pulses of nutrients (e.g., batch culture). In this case, it is convenient to quantify relative fitness per-cycle, by choosing time units in Eq. (8) such that Δt = 1:

The second common choice for Δt is a time scale intrinsic to the population, typically its generation time. The relative fitness per-cycle may depend on the number of generations in the growth cycle, and it can be valuable to normalize for this dependence, especially when comparing across environments [26]. However, defining the number of generations for a population with multiple genotypes growing at different rates is ambiguous. In the case of a single mutant competing with a wild-type, it is common to consider only the number of generations experienced by the wild-type strain, which is estimated by its log fold-change LFCwt. It is convenient to express the generations in base e to match the natural logarithm in the logit-encoding, but this could equivalently be done by converting all logarithms in Eq. (8) to base 2. Thus the relative fitness of a mutant per-generation is defined by choosing Δt = LFCwt in Eq. (8):

This quantity is equivalent to the fitness statistic 
Relative fitness per-cycle and per-generation can rank mutants differently when wild-type growth depends on mutant
Although relative fitness of a mutant per growth-cycle (Eq. (9)) and per-generation (Eq. (10)) differ only by choice of time scale, they are not just equivalent quantities in different units. First, while fitness per-cycle is well-defined regardless of whether the population undergoes net growth or death (Figure 1C), fitness per-generation is not defined when the wild-type undergoes no change or net death (denominator of Eq. (10) is not positive) [5, 91]. This can occur for microbes under high drug concentrations, as well as for microbial populations under harsh environmental conditions as found in sediments or wastewater [92, 93].
However, even when the wild-type LFC is positive, the relative fitness per-cycle and per-generation are not necessarily proportional to each other across a set of mutants competed against the same wild-type. The reason is that while the growth cycle time scale is extrinsic to a population and therefore the same across all pairs of genotypes, the generation time scale depends on the growth of the wild-type, which may be affected by the specific mutant it is competing against. This discrepancy can be quantitative as well as qualitative, producing different rankings of mutants. For example, Figure 1D shows two mutant genotypes (red and blue) that have opposite rankings under these fitness statistics: the blue mutant has higher relative fitness per-cycle 

Basic features of microbial population dynamics are sufficient to cause rank discrepancy between fitness statistics
Since the discrepancy between fitness per-cycle 

We use a model that approximates microbial population dynamics as three growth phases — a lag phase, an exponential growth phase, and a stationary phase (Figure 2A) — where strains compete for a single limiting resource with no other interactions [94, 95], since resource competition is likely to be the main interaction between a wild-type and a mutant with only small genetic differences (e.g., a point mutation or gene knockout). This model captures the minimal dynamics of microbial growth under laboratory conditions, and therefore serves as a baseline for possible outcomes. We parameterize the model using growth curves of single-gene knockout mutants in Saccharomyces cerevisiae [29, 96] (Figures and S5, Sec. S6). We simulate a pairwise competition for each knockout against the wild-type (Figure 2A) and calculate relative fitness per-cycle 


Comparison of mutant fitness rankings with different time scales using empirical trait variation.
(A) Overview of pairwise coculture simulations. For each mutant strain (grey), we simulate a competition growth cycle against a reference wild-type strain (blue) using the estimated traits and laboratory parameters for the initial condition (N0 = 0.05 OD, R0 = 111 mM glucose, x = 0.5; Methods and see Figure for the underlying trait distribution). (B) Rank discrepancy between relative fitness per-cycle 



This model therefore shows that the basic features of microbial population dynamics and resource competition with realistic parameters are sufficient to cause substantial disagreement between fitness per-cycle and per-generation, underscoring the importance of this analysis choice. Furthermore, the model reveals a basic mechanism that causes this discrepancy: mutants must have diminished biomass yield plus variation in one additional trait (lag time or growth rate), and a high mutant relative abundance (e.g., 50%) at the start of the competition (Figure S7). These conditions are necessary because they allow mutants both to have variation in fitness and to alter the wild-type LFC in pairwise competition. Even though the wild-type growth rate is unaffected by mutants, the wild-type LFC will change in the presence of mutants that have lower yield because the population on average consumes more resources per cell division (Sec. S7). By this mechanism, it is possible to concoct a set of growth traits where relative fitness per-generation and per-cycle deliver completely opposite rankings (Figure S8).
Fitness statistics can disagree on epistasis and long-term fitness trends
Since the relative fitness of single mutants differs depending on whether fitness is quantified per-cycle or pergeneration, how does that choice impact the apparent epistatic interactions between multiple mutations and the long-term evolutionary trend in fitness? Using the same minimal model of population dynamics as in Figure 2A, we simulate a set of six mutants, each one having an increase or decrease in each of the three quantitative traits in the model (lag time, growth rate, and yield), and calculate the relative fitness per-cycle and per-generation for all pairs of those mutations (assuming the mutation effects on the traits themselves are additive; Methods). We quantify epistasis as the fitness difference (in 1:1 competition with the wild-type; Methods) between the double mutant and the sum of the single mutants (Figure 3A) [97].

Discrepancy between fitness per-generation and per-cycle in epistasis detection and long-term trend.
(A) Schematic detection of epistasis for two single mutants (light and dark circle). Epistasis occurs if the corresponding double mutant (two-component circles ) deviates from the additive fitness (parallelogram shape in solid lines; Methods). (B) Correlation between epistasis in relative fitness per-cycle scycle and per-generation sgen. Each dot corresponds to a hypothetical double mutant that combines pairs of mutations systematically perturbing each of the three growth traits (Figure 2A) in our minimal model of population dynamics (Methods). (C) Heatmap for epistasis in the per-cycle fitness scycle for all double mutants in our simulated data set, same data as in panel B. (D) Heatmap for epistasis in the per-generation fitness sgen, same data as in panel B. All epistasis plots are based on 1:1 competition growth cycles with the wild-type (Figures S9–S14). See Figure for epistasis patterns at low mutant relative abundance. (E) Hypothetical scenario for trait evolution in an evolution experiment. An evolving population decreases lag time to zero (orange line), increases growth rate until saturation (blue line), and gradually decreases biomass yield (green line). This trend is similar to initial observations from the LTEE [98]. (F) Corresponding long-term trend in relative fitness based on the trait evolution in panel E. We estimate relative fitness per-cycle 

Figure 3B shows that almost all the double mutants differ in their apparent epistasis per-cycle vs. per-generation. Epistasis per-cycle occurs when one of the mutations affects growth rate (Figure 3C), while epistasis per-generation occurs when one of the mutations affects lag time (Figure 3D). This is because the effect of a change in growth rate accrues per-generation (and hence is non-epistatic when measured over that time scale; Sec. S8), while the effect of a change in lag time accrues once per-cycle (and hence is non-epistatic in the per-cycle statistic). See Figures S9–S14 for growth curves in all cases. As an example, the double mutant in Figure S12, row D has two lag time-reducing mutations that are perfectly additive in the per-cycle statistic. However, the lag time mutations also reduce the saturation time of the growth cycle such that the fitness of the double mutant is spread over a fewer number of wild-type generations LFCwt, creating super-additive epistasis in the per-generation statistic. Note that this depends on measuring fitness for a mutant at relatively high abundance in the pairwise competition with the wild-type; when the mutant is rare, the wild-type fold-change LFCwt is effectively constant across mutants and epistasis measured per-cycle is proportional to its value measured per-generation (Figure S15). Differences in apparent epistasis can also lead to differences in apparent long-term trends of fitness per-cycle and per-generation in an evolution experiment. For example, if growth traits of a population in the minimal model evolve according to Figure 3E, then fitness per-cycle will saturate at an early time while fitness per-generation will continue to increase (Figure 3F). This is because the long-term decrease in yield reduces the wild-type LFC in 1:1 competitions between an evolved mutant and the wild-type, which causes fitness per-generation to increase (Eq. (10)).
Different fitness time scales agree on LTEE trend but not on mutant ranking
As an empirical test for discrepancies between relative fitness per-generation and per-cycle, we test how the choice of fitness time scale affects conclusions from the LTEE (Sec. S9). We reanalyze competition data from the LTEE [11] using the relative fitness per-generation 




Rank discrepancies between fitness statistics in LTEE competition data.
(A) Rank difference between relative fitness per-cycle and per-generation as a function of time in the LTEE. For each of the 12 lines in the LTEE, we compute the relative fitness per-cycle 













While we have focused on the logit encoding so far since logistic dynamics are usually the best null model of mutant frequency dynamics (Figure 1A), the log encoding is common in many TnSeq studies [17–19, 43, 77, 78]. The LTEE competition data is a valuable opportunity to empirically test the consequences of this choice as well. Figure 4C shows that the relative fitness per-cycle under the log-encoding 

Fitness in bulk competitions is distorted by higher-order effects depending on reference subpopulation
So far we have focused on relative fitness in pairwise competition experiments, but this is often not practical for large numbers of mutants due to the required strain labeling (e.g., with fluorescent proteins or selection markers). One alternative is to measure a fitness potential (as defined at the beginning of the Results section), such as growth rate or area under the growth curve (AUC) [32, 34, 36, 37], from monoculture experiments, under the assumption that true relative fitness is proportional to the difference in this quantity between genotypes. Using our minimal model of population dynamics parameterized by single-gene knockouts in yeast (Figure 2A), we find that common fitness potentials perform poorly overall (Figure S19; see refs. [32, 34, 36, 37] for empirical tests), with some success for the AUC but strongly depending on the choice of time window (Figure S20, Sec. S10).
Another approach to measuring relative fitness for a large number of mutants is to use bulk competition experiments, where many mutant genotypes compete simultaneously in a single culture and each genotype is tracked through high-throughput sequencing [39, 44]. This raises the question of how well relative fitness of a mutant in bulk competition corresponds to its relative fitness in a pairwise competition with the wild-type, since the growth in bulk might be influenced by the presence of other mutant genotypes, a phenomenon known as a higher-order interaction [78, 95, 100].
An important choice in bulk competition experiments is the relative abundance of the library of all mutant genotypes compared to the wild-type. The invasion of a spontaneous mutation into an existing population is best captured by pairwise competitions with low initial relative abundance of the mutant (Case I, Figure 5A), and one way to recreate this scenario in a bulk competition is to use a low relative abundance of the mutant library overall (Case II, Figure 5A) [40, 48–52]. A practical problem with Case II is that individual genotypes in the library will have low absolute abundances, which leads to stochasticity in the population dynamics and the sequencing process [13, 101, 102]. Therefore, it is common to compete the mutant library by itself (Case III, Figure 5A) [13, 15, 17–19, 77].

The choice of library abundance and reference group in bulk competition experiments.
(A) Overview of a pairwise competition experiment with one mutant (top row) and scenarios for bulk competition experiments with many different mutants (middle and bottom rows). The two bulk competition scenarios differ in their initial fraction of the mutant library (colored ovals) in the inoculum (open box). For each scenario, we show a schematic growth cycle (log absolute abundance) in the inset on the right. (B) Schematic relative abundance trajectories for a mutant compared to two alternative subpopulations. We distinguish between the total relative abundance xi with respect to the population as a whole (height of green band in the top box) and the pairwise relative abundance xi,wt with respect to the wild-type (height of green band in the bottom box; Eq. (20)). We indicate the sign of total relative fitness (Eq. (24)) and pairwise relative fitness (Eq. (25)) on the right. (C) The absolute error between bulk and pairwise competition experiments. The total relative fitness (dark grey dots; Eq. (24)) and the pairwise relative fitness (orange dots; Eq. (25)) for mutants in the minimal population dynamics model (Figure 2A, Methods) parameterized by the yeast knockout data (Figure S4) in bulk competition growth cycle with low mutant library abundance (panel A, case II; Methods). The absolute error is defined as the bulk fitness statistic minus the relative fitness in pairwise competition (Eq. (S92) in Sec. S15). In the inset, the absolute error for pairwise relative fitness (Eq. (25)) for a bulk competition growth cycle with 99.9% library abundance (light grey dots; case III). This still includes a wild-type reference at small percentage to be able to compute a pairwise relative fitness. The x-axis and the orange dots in the inset are identical to the main plot. (D) The relative error in bulk competition experiments as a function of mutant library abundance in the inoculum. Each line corresponds to a knockout mutant in our data set, and represents the relative error between the pairwise relative fitness in bulk competition and the relative fitness in pairwise competition (Eq. (S102) in Sec. S16). The black vertical lines show the recommended mutant library abundance for our data set based on Eq. (11) (xlib ≈ 24.6%) and based on the more conservative Eq. (S118) (xlib ≈ 0.02%, Sec. S16). Note these bounds are calculated a priori and are not the result of a fitting procedure.
A second important choice is whether to quantify fitness of a mutant relative to the whole population (“total relative fitness”) [18, 19, 77] or relative to another specific genotype, like the wild-type (“pairwise relative fitness”; see Figure 5B) [40, 48–52]. In the case of a mutant library growing by itself (Case III), it is still possible to estimate a pairwise fitness by using a few known neutral mutants as the reference population [15, 17] and calculating fitness relative to this group (Methods; Secs. and S12).
To test the consequences of these choices, we use the minimal population dynamics model (Figure 2A) parameterized by yeast single-gene knockouts and compare the estimates of total and pairwise relative fitness in bulk competitions to fitness measured in pairwise competition, which we use as the ground truth (Methods). Figure 5C shows that in bulk competitions, the error in total relative fitness (compared to fitness in pairwise competitions) is systematically offset from the error in pairwise relative fitness. For example, a mutant that grows identically to the wild-type (neutral phenotype) does not change in abundance relative to the wild-type, but it may have a net increase in its total relative abundance due to the poor growth of other mutants (compare top and bottom panel in Figure 5B). This offset affects the total relative fitness of all mutants in a uniform way (Figure 5C), such that total and pairwise relative fitness completely agree in the ranking of genotypes (Figure S21). Note that this off-set between total and relative fitness is general and does not depend on our minimal population dynamics model (Sec. S13).
While the pairwise relative fitness is the preferred method for estimating fitness in bulk competitions, the presence of higher-order interactions means it still deviates from the fitness in pairwise competitions. Figure 5C shows the absolute error for pairwise fitness has a positive trend with fitness and increases with the relative abundance of the mutant (Case III, inset in Figure 5C). For the specific population dynamics in our simulation, we decompose the higher-order interaction into two terms (Secs. and S15): a fitness-independent term and a fitness-dependent term that acts as an amplifier for fitness values (Figure S22) and drives the positive error trend in Figure 5C. Intuitively, the population in bulk competition consumes resources more slowly and gives more time for growth rate differences to accrue. The amplification of mutant pairwise fitness in bulk is important because it widens the apparent distribution of fitness effects (leading to, for example, higher estimates for the speed of adaptation) than if fitness was measured in pairwise competition.
In order to control the error from higher-order inter-actions, we derive an explicit error bound. Figure 5D shows the relative error from higher-order interactions for each mutant across a range of mutant library abundances. Using the specific population dynamics of our model, we derive the following estimate: Assuming that the mutants in the library only have variation in growth rate, the mutant library abundance should satisfy

where ε is the desired threshold on the relative error (Figure S23), gwt is the wild-type growth rate, and glib is the growth rate of the library as a whole (Sec. S16). In the case of our single-gene knockout library, Eq. (11) predicts a maximum library abundance of 24.6% based on the wild-type and library growth rate (gwt = 0.406, glib = 0.389) for a relative error ε = 1%. Figure 5D shows that this maximum library abundance keeps the relative error below 1% for high-fitness mutants (bright yellow), because they are dominated by growth rate effects, but fails for mutants close to neutrality because they have a tradeoff between growth rate and lag time, which the estimate in Eq. (11) neglects. It is possible to derive a more precise bound that keeps the relative error below 1% for all mutant genotypes (in this case xlib = 0.01%, see vertical line in Figure 5D), but this requires prior knowledge of the trait covariation in the mutant library (Sec. S16).
Previous benchmark experiments show deviation of bulk fitness from pairwise competition
Our models predict that higher-order effects amplify pairwise fitness measured in bulk compared to the fitness in a pairwise competition experiment. To test this prediction, we reanalyze two published benchmark experiments of bulk fitness measurements. The first data set by van Opijnen et al. [103] measured pairwise (1:1) and bulk competitions (xlib ≈ 99.9%) of 46 single-gene knockouts in Streptococcus pneumoniae using logit-encoded fitness per-generation and neutral strains as the reference group. Figure 6A shows that the fitness in bulk deviates from the diagonal, and the absolute error has a positive correlation with the underlying true fitness from pairwise competition (Figure S24A; Pearson r = 0.40,p = 0.006), similar to the pattern observed in our simulations (compare Figure S24A with Figure 5C case III, and Figure S24B with Figure S23C case III).

Empirical benchmarks of bulk fitness against pairwise competition experiments.
(A) Fitness measurements as reported by Opijnen et al. [103] for knockouts of S. pneumoniae. Pairwise fitness was measured in 1:1 competitions of knockouts strains with the wild-type over a single growth cycle using the per-generation statistic 



The second data set by Levy et al. [13] measured pairwise competitions (1:1) and relative abundances for 33 lab-evolved genotypes of S. cerevisiae with beneficial fitness. The bulk fitness is inferred under the log-encoding and per-cycle, with neutral strains as the reference group. Figure 6B shows that the fitness of strains in the bulk evolution environment is typically higher than in pair-wise competition, but the absolute error has no positive correlation (Figure S25A; Pearson r = −0.26,p = 0.140) and thus does not support the pattern we see our simulations (compare Figure S25A with Figure Figure case III, and S25B with Figure S23C case III).
Discussion
Significance of fitness quantification choices for microbial ecology and evolution
In this work, we have introduced a conceptual framework to derive common statistics of relative fitness from three essential choices:
The choice of the state variable for relative abundance (encoding m).
The choice of the time scale for the change in relative abundance (Δt).
In bulk competition experiments, the choice of subpopulation that acts as the reference for relative abundance of a chosen mutant genotype (Figure 5B).
The combination of these choices leads to a range of fitness statistics, including those commonly used in population genetics [80], experimental evolution [11, 12, 26], and gene knockout screens [17–19, 42, 43, 77, 78, 103].
However, as we compare these statistics in simulated competition experiments, we find that the choice of time scale can lead to different mutant rankings (Figures 2B and 4A,B), confound apparent epistasis between mutations (Figure 3B), and alter the apparent long-term trend in fitness (Figure 3F), while the choice of the reference subpopulation can lead to a systematic shift in inferred fitness values (Figure 5C). The quantitative differences between relative fitness statistics affect our ability to make evolutionary predictions. In particular, the fixation probability of a spontaneous mutation depends on the magnitude of relative fitness compared to other time scales like mutation and drift [66, 67], and in microbial populations with clonal interference, it depends on the entire distribution of fitness effects [58, 104]. The choice of statistic may also affect the outcome of a significance test in the context of gene essentiality tests (e.g., using log or logit with the test statistic in Wetmore et al. [18]). Since these measurements often serve as a first screen to narrow down the investigation to the top set of genes [17, 40], a difference in mutant ranking means the investigation might miss relevant genes because of the choice of fitness statistic.
Best practices for quantifying mutant fitness in high-throughput experiments
Based on these insights, we recommend the following choices for quantifying relative fitness of a mutant:
Use the logit encoding of relative abundance, because under the null model of logistic dynamics this linearizes the trajectory of relative abundance. Only use a different encoding if alternate, non-logistic dynamics are known.
For batch growth or any experiment with non-steady state growth, use a fixed extrinsic time scale (e.g., a single growth cycle), rather than an intrinsic time scale (e.g., the number of generations), which introduces additional variation between competitions. Only quantify fitness per generation for experiments that are known to have steady-state growth (e.g., in a chemostat).
In bulk competition experiments, always use the pairwise fitness relative to the wild-type (by including a barcoded wild-type or by grouping neutral mutants into a virtual wild-type) because this more closely matches the relative fitness in pairwise competition.
To control the error from higher-order interactions, we recommend minimizing relative abundance of the mutant library as practically feasible. For a given error tolerance, Eq. (11) gives an estimate for the maximum library abundance.
Our recommendations agree with previous criticisms of relative fitness per-generation [5, 8, 9, 91] and total relative fitness [59, 105] but differ from the standard practice in high-throughput genetic screens [18, 19, 77] and many evolution experiments [33, 88], including the LTEE [11, 26]. The choice of the reference group is a well-recognized issue in high-throughput evolutionary studies, in which one first estimates a total relative fitness and then subtracts a correction based on the fitness of the wild-type [12, 50, 52] or a mean population fitness [13, 41, 42, 51, 105]. In contrast, we recommend to choose the reference group at the level of relative abundance (see Methods for a step-by-step demonstration). While researchers must ultimately choose how to quantify fitness based on the specific goals and conditions of their experiment, we believe that a more consistent adherence to the recommendations above would be valuable to the field as a whole so that fitness data from one study can be quantitatively compared to fitness from another.
How general is the disagreement between fitness statistics in pairwise competition?
The discrepancy in mutant ranking for relative fitness per-cycle 

Our model focused on the effect of resource competition on fitness discrepancies since that interaction is almost always present between microbial genotypes, providing a baseline expectation. Moreover, resource competition is likely to be the only interaction in high-throughput fitness measurements of mutants with only small genetic differences (e.g., point mutations or gene knockouts) relative to a wild-type. We expect that in cases with additional interactions between mutants and the wild-type, there will be further variation in wild-type LFCs across competitions and thus even more discrepancies betwen fitness statistics. For example, mutants that stimulate wild-type growth via the secretion of nutrients should create positive LFC covariation across mutants and thus be ranked differently per-cycle and per-generation. Indeed, our reanalysis of LTEE competition data shows that fitness per-cycle 


How general is the deviation of bulk fitness statistics from pairwise competition experiments?
The offset in total relative fitness (between grey and orange points in Figure 5C), equivalent to the mean library fitness (Eq. (S88) in Sec. S13), is independent from the population dynamics (Figure 5B, Sec. S13). In contrast, the error between pairwise fitness in bulk competition and in pairwise competition (orange points in Figure 5C) is specific to the population dynamics studied here, although it points to more general mechanisms. Specifically, we expect pairwise fitness in bulk competition to deviate from pairwise competition as soon as the mutant library is able to influence the chemical environment of the competition, for example, when a large fraction of mutants fails to complete respiration of the main carbon source and instead turns to fermentation [106–108], with the release of pathway intermediates like acetate that indirectly lower the pH in the growth medium [109, 110].
The choice of fitness statistic in the light of other sources of discrepancy
Besides the conceptual choices of fitness quantification discussed in this article, experimental limitations can create discrepancies between replicate fitness measurements of the same mutant [36, 50, 111]. For example, relative abundance measurements entail sampling uncertainties (when we sample liquid for colony counting or DNA sequencing) [44, 59, 60], copy number variation [18, 44], as well as PCR jackpots and sequencing read errors [13, 101, 102]. Furthermore, there will inevitably be some variation in initial condition between replicates and fluctuations during the fitness assay [36, 102, 111].
Methods
Inferring growth traits for the single-gene knockout collection in yeast
We use a previously published data set [29] for the single-gene knockout collection in Saccharomyces cerevisiae [96], where the authors track growth of each genotype for 47 hours in monoculture, using microwell plates with defined growth medium. We download this growth curve data set from the PROPHECY database [112], choosing specifically the data set measured in Synthetic Defined medium. We last accessed the PROPHECY website (http://prophecy.lundberg.gu.se/) on March 30, 2020, but as of November 15, 2025 the website is no longer accessible. We have included a reformatted version of the data with our code repository (https://github.com/justuswfink/24FitnessQuantification). From the raw time series of optical density, the authors subtracted a background correction based on blank wells, applied another correction for nonlinearity at high optical densities, and smoothed the growth curve to remove electrical noise [29].
We start with the curves in the published data set (9951 curves) and apply further trimming and smoothing steps (Sec. S6). From this data we calculate the time series of instantaneous per-capita growth rate N−1dN/dt (where N is the optical density) and identify time windows where the rate is approximately constant, which we interpret as distinct growth phases (Sec. S6). We only include curves that have a single phase of constant exponential growth followed by a stationary phase of approximately zero growth (9424 curves).
We quantify the biomass yield, maximum growth rate, and the lag time from each remaining curve as follows. First, we estimate the initial abundance Ninitial (average optical density over first three time points) and the final abundance Nfinal (average optical density over stationary phase) from the growth curve. Then we calculate the biomass yield as Y = (Nfinal − Ninitial)/R(0), where R(0) = 111 mM (20 g/L) is the initial concentration of glucose (assuming glucose is the single limiting resource). To estimate the maximum growth rate, we average the instantaneous growth rate over the exponential phase. Finally, we estimate the lag time from the intersection of the log initial abundance (log N0) with the slope of the maximum growth rate during the exponential growth phase [81]. We excluded curves with negative initial OD, curves with negative inferred lag times, and curves with a low quality of fit between the measured time series and a simulated curve based on the inferred trait values (R2 < 0.95; see below for the model). Our final database includes trait estimates for 9195 curves, which represents 92.4% of the original data set [29].
Each single-gene deletion strain was measured in two technical replicate growth curves, using a second plate with identical layout in the same plate reader. Some genotypes have only one estimate of the growth traits because the other replicate did not pass our filters (273 genotypes), but for most genotypes we retain two replicate estimates in our final data set (4163 genotypes); a few genotypes even have three (2 genotypes) or four replicate estimates (54 genotypes) because these geno-types were included multiple times by the original authors [29]. From these replicates we finally calculate the average yield, growth rate, and lag time for each geno-type. The data set also contained many replicate growth curves of the wild-type strain, 374 of which passed our filters. Since wild-type traits inferred from these replicates had large variation (potentially due to measuring the wild-type across many different plates and days), we define the wild-type trait as the median, not the mean, of these replicates. Although the wild-type measurements included in this data set have large trait variation (orange dots in Figure S4B,C), the fact that the variation across knockouts is significantly greater than for the wild-type replicates in terms of growth rate (Levene’s test p = 1.3 × 10−9) and lag time (Levene’s test p = 0.025), and that the replicate measurements of gene knockouts are correlated (Figure S5), suggests that these trait measurements do capture true genetic variation. In contrast, the variation in biomass yield across knockouts is not significantly different from the variation across wild-type replicates (Levene’s test p = 0.44), suggesting that the variation in biomass yield is driven by non-genetic factors like the abundance or batch of resources used for each growth measurement.
Simulating population dynamics in competition experiments
Throughout this work, we model the competition between a set of genotypes during a single batch culture growth cycle using the following model of population dynamics [94, 95]:


where for each genotype j, Nj is the absolute abundance, xj(0) the initial relative abundance, gj(t, R) the growth rate, and Yj the biomass yield, where the genotypes include a wild-type as well as one or more mutants. The initial absolute abundance of all genotypes together is N0, and the concentration of the single limiting resource is R with initial value R0. Note that Eq. (12b) assumes that cells consume resources only for biomass growth and not for maintenance of existing biomass.
To capture the sigmoidal shape of typical growth curves with a lag phase, exponential growth phase, and saturation phase (e.g., Figure S4A), we model the dependence of growth rate on time and resource concentration as

where Θ is the step function:

In this model, a genotype has an initial lag phase of time λj where no growth occurs, followed by a phase of constant exponential growth at rate gj, and ending when the resource concentration R reaches zero. The time to resource depletion, which we also call the saturation time, is defined by the implicit equation R(tsat) = 0. The simple form of the growth response (Eq. (13)) means that the absolute abundance of genotype j at saturation is

and its log fold-change is

To calculate relative fitness per-cycle (Eq. (9) and per-generation (Eq. (10)), we numerically determine the saturation time tsat by integrating the biomass and resource dynamics (Eq. (12)) as described in previous work [94, 95]. We note that it also possible to derive an approximate expression for the saturation time tsat which we use for our analytical calculations (Sec. S7).
Testing epistasis between mutations in population dynamics model
Consider a wild-type strain with growth rate gwt = 1, lag time λwt = 2, and biomass yield Ywt = 1. We define six basic mutation that represent the six directions of the trait space in our model: a mutation that increases growth rate (gmut = 1.2; symbol +g), a mutation that decreases growth rate (gmut = 0.8; symbol −g), a mutation that increases lag time (λmut = 3; symbol +λ), a mutation that decreases lag time (λmut = 1; symbol −λ), a mutation that increases yield (Ymut = 4; symbol +Y ), and a mutation that decreases yield (Ymut = 0.25; symbol −Y ). We combine these mutations into double mutants as follows: mutations are additive at the level of traits (e.g., the double mutant (+g, −g) is neutral) and mutations in different traits are independent. For a given pair of mutations, epistasis is defined as [97]

where s(A) is the relative fitness of the single mutant with mutation A, s(B) is the relative fitness of the single mutant with mutation B, and s(A, B) is the relative fitness of the double mutant with both A and B. We estimate the fitness of each mutant by simulating one growth cycle of competition with the wild-type (at initially equal abundance) using an initial biomass of N0 = 0.01 and initial resource concentration R0 = 0.5. These conditions are chosen such that mutations that increase biomass yield reach a maximum density of up to N (tsat) = 2, on the same scale as the optical density measured for the empirical knockouts of S. cerevisiaea (Figure S4C).
Quantifying total and pairwise relative fitness in bulk competition experiments
We define total and pairwise relative fitness,two alternative measures of fitness in bulk competition experiments, as follows. Consider a population with multiple genotypes such that each genotype i has absolute abundance Ni(t) at time t. The total relative abundance of the genotype is

The “total” here refers to the fact that this is the abundance of genotype i relative to all other genotypes in the population. Similarly, we can think of the definition of relative fitness discussed in the main text (Eq. (4)) as the total relative fitness of this genotype under an encoding

However, sometimes we want to track the dynamics of a genotype relative to another specific genotype; for example, to follow the fate of a mutant against the wild-type in a bulk competition experiment. For a pair of genotypes i ≠ j, we thus define the pairwise relative abundance

which is the relative abundance of genotype i in a subpopulation of only genotypes i and j. To predict the change in the pairwise relative abundance, we can use the slope

which defines the pairwise relative fitness of genotype i with respect to genotype j (see Sec. for a generalize definition that covers the case i = j). In the special case of a population with only two strains, the pair-wise relative fitness and total relative fitness are identical 
It is also possible to define the total and pairwise relative fitness statistics as finite differences over a time interval, rather than as instantaneous derivatives. For example, we can define them over a growth cycle starting at t = 0 and ending at t = tsat:


For these fitness statistics in the bulk competitions, we use the logit encoding


where we have rewritten 
Unlike the comparison between per-cycle and per-generation relative fitness where we focused on rank differences (Figure 2B), here we can evaluate absolute differences in fitness estimates because the total and pairwise fitness in bulk fitness are measured in the same units as the relative fitness in pairwise competition. For all mutant genotypes, we calculate the absolute error between these bulk fitness estimates (Eqs. (24) and (25)) and the relative fitness in pairwise competitions (Eq. (9)), which we take as the ground truth (Figure 5C). Note that the pairwise competition could depend on the initial relative abundance of the mutant; we have chosen a very low relative abundance (10−6) that mimics a mutant arising de novo and where this dependence is very weak.
Clearly, minimizing the abundance of the mutant library minimizes the strength of higher-order interactions (also reducing ranking disagreement, Figure S21B,C). As low mutant abundances create practical problems for experiments, it is valuable to identify a maximum abundance for the mutant library that keeps the error from bulk fitness estimates below a desired threshold (Eq. (11)). It is convenient to express this threshold in relative rather than absolute terms (compare Figure S23 to Figure 5C).
A guide to calculate pairwise relative fitness under the logit encoding from bulk competition data
Here ri(t) represents the best estimate of read numbers for lineage i after correcting for experimental artifacts, like jackpot events in the sequencing (e.g., see discussion in Levy et al. [13]).
Calculate the total relative abundance xi of each lineage i by dividing with the total number of reads:

where the sum is over all n lineages in the experiment.
Define a group of lineages i = 1, . .., k as the reference (e.g., a group of neutral mutants) and sum their total relative abundance

Calculate the pairwise relative abundance xi,ref of each lineage by

Calculate the pairwise fitness per-cycle for each strain i under the logit encoding

where Δt is the number of growth cycles between time points t1 and t2.
Supplementary Information
S1. Different types of fitness under example models of population dynamics
In this section we explicitly calculate relative fitness of a mutant under a few example models of population dynamics, using the different encodings of relative abundance as described in the main text (main text Figure 1A). Consider a competition coculture between a wild-type genotype with absolute abundance Nwt(t) and a mutant genotype with absolute abundance Nmut(t). We can describe their dynamics according to the ordinary differential equations (ODEs)

Note that the per-capita growth rates gwt and gmut of each genotype can depend on both genotypes to reflect competition or other interactions. The relative abundance of the mutant genotype at time t is

The dynamics of the mutant relative abundance are therefore described by

As defined in the main text (Eq. (4)), the relative fitness of a mutant is sm = dm/dt for an encoding m(x) of the relative abundance x. Under the trivial linear encoding (m(x) = x), the relative fitness is therefore just the right-hand side of the relative abundance ODE (Eq. (S3)):

For the log encoding m(x) = log x, we use the identity d log x/dt = x−1dx/dt to obtain the log-encoded relative fitness:

Finally, for the logit encoding m(x) = logit x = log(x/(1 x)), we use the identity d logit x/dt = x−1(1 x)−1dx/dt to obtain the logit-encoded relative fitness:

By comparing relative fitness values under the linear encoding (Eq. (S4)) and under the logit encoding (Eq. (S6)), we see how the logit encoding has removed the explicit dependence on the mutant relative abundance (factors of x and 1 − x), although there can be implicit dependence on the mutant relative abundance within the per-capita growth rates of each strain (gwt and gmut) due to density-dependent growth rates.
If the per-capita growth rates gwt and gmut (Eq. (S1)) are constants, then these constant growth rates also act as fitness potentials since they each depend only on a single genotype but their difference determines relative fitness (under the logit encoding, Eq. (S6)) between the genotypes. The growth rate is not a fitness potential under more complex dynamics, however. For example, consider a competition model with explicit density dependence:

where the growth rates decrease as the genotype abundances reach their carrying capacities Kwt and Kmut, and the maximum growth rates at low abundances are rwt and rmut. In this case, the relative fitness under the logit encoding is (from Eq. (S6))

In this case, there is no fitness potential because it is not possible to separate Eq. (S8) into a difference between terms that only depend on each genotype separately.
S2. Definition of absolute fitness for a genotype
Here we give an explicit definition of a genotype’s absolute fitness, analogous to the definition of relative fitness in the main text (Eqs. (1)–(4)). Conceptually, absolute fitness is any number that is sufficient to predict a geno-type’s absolute abundance N over a short time window. Let an encoding m(N ) be any smooth, strictly-increasing function of the absolute abundance N . We can then predict the absolute abundance over a time window Δt using a linear expansion of the encoded abundance (analogous to main text Eq. (3) for relative fitness):

where

is defined as the absolute fitness of the genotype under the encoding m (analogous to main text Eq. (4) for relative fitness; see also main text Figure 1B). For example, the absolute fitness of a genotype under the log encoding m(N ) = log N is the per-capita growth rate:

In general, the ideal encoding of absolute abundance is the inverse function of the absolute abundance trajectory N (t) (up to a shift and rescaling), so that the first-order expansion in Eq. (S9) is exact and the absolute fitness am is sufficient to determine changes in absolute abundance up to any future time. The log encoding m(N ) = log N is therefore ideal when absolute abundance grows or decays exponentially at a constant rate, while the logit encoding m(N ) = logit N is ideal for a population that grows with logistic density dependence (Eq. (S7) in case of a single genotype).
Absolute fitness and relative fitness are related, since the relative abundance of a genotype is determined by normalizing its absolute abundance by the absolute abundance of all genotypes in the population. Specifically, the relative fitness of a genotype is determined by the absolute fitnesses for all genotypes in the population. For example, in the case of two genotypes, the relative fitness under the logit encoding (Eq. (S6)) is the difference between the genotypes’ absolute fitness under the log encoding (Eq. (S11)). In the case of constant per-capita growth rates, these log-encoded absolute fitnesses also act as fitness potentials.
S3. The role of logistic population dynamics in logit-encoded relative fitness
Here we show how the logit encoding of relative abundance is related to the logistic model of population dynamics. For a relative abundance x, logistic dynamics are

where r is the exponential rate at which relative abundance increases from low values. This form emerges from the general dynamics of relative abundance (Eq. (S3)) when the difference in per-capita growth rates gmut and gwt is constant. Once can also interpret the logistic model as a lowest-order approximation for more complex dynamics. That is, consider a general equation for relative abundance:

for an arbitrary function f (x). Since this function must obey the boundary conditions f (0) = 0 and f (1) = 0 (the relative abundance must stop changing when it either goes extinct or fixes), a polynomial expansion of f (x) must have roots at these values:

Thus the logistic model in Eq. (S12) is a lowest-order approximation even when the true dynamics are more complex.
The logistic differential equation in Eq. (S12) has the solution

The logit encoding of the logistic relative abundance has linear dependence on time:

This is another way to see why the relative fitness under the logit encoding (the time derivative of Eq. (S16)) is constant under logistic dynamics. Mathematically, this occurs because the logit function is the inverse of the logistic dynamics (Eq. (S15)), up to a shift and rescaling. Thus if the relative abundance dynamics are different from logistic (Eq. (S12)), the logit encoding no longer exactly linearizes the trajectory of relative abundance and thus is no longer the optimal encoding for relative fitness (see Figure S2 and the mathematical notes by Mallet [113] for more examples).
More generally, the ideal encoding for a given relative abundance dynamics x(t) is any linear function of the inverse m(x(t)) = at(x) + b of those dynamics. Note that the ambiguous scale a includes log and logit encodings with a different base of the logarithm (discussed by Chevin [114]). The inversion of x(t) is equivalent to removing frequency (relative abundance) dependence from the relative fitness statistic sm [115].
S4. Relative fitness predictions in discrete time: Additive vs. multiplicative form
In our framework, relative fitness is a statistic that predicts relative abundance in an additive equation (main text Eq. (3)) but sometimes the dynamics of individual genotypes are modeled using a multiplicative form of fitness. In this section, we show how these multiplicative fitness statistics are related to the additive fitness statistics, in particular for the logit encoding.
We consider a population of a wild-type and a mutant genotype, where we track the mutant’s relative abundance over multiple rounds of competition (e.g., growth cycles)

and the variable x changes each round according to some underlying population dynamics.
In a modeling approach typical to many studies in population genetics [116], these population dynamics are captured in the genotype-specific growth factors

which determine the update equation for the mutant relative abundance

which is the discrete-time analogue to a differential equation (Eq. (S1)). We divide Eq. (S19) by 1 − x(r + 1) to obtain the form

which allows us to recognize the ratio of growth factors fmut/fwt as a relative fitness statistic, since it is sufficient to predict the relative abundance of the mutant genotype (under the encoding m(x) = x/(1 − x)). But the statistic fmut/fwt acts as a multiplying factor in Eq. (S20), whereas the general form of relative fitness sm (main text Eq. (3)) acts as an additive factor. What is the relationship between the multiplicative and the additive form of discrete-time relative fitness statistics?
To describe discrete rounds of population dynamics in our framework, we treat the relative abundance x(r) as samples from a continuous time series, separated by a time-gap Δt = t(r + 1) − t(r). For a chosen encoding m, The relative abundance of the mutant genotype in the future round is predicted by

since this is how we defined relative fitness sm in main text Eq. (3). This relative fitness acts as additive factor, but we apply the exponential function on both sides of Eq. (S21) to obtain the updated equation

where the additive fitness slogit together with the time scale Δt acts as a multiplicative factor. Specifically for the logit encoding m(x) = logit x we have

We compare Eq. (S23) to Eq. (S20) and solve for the relative fitness of the mutant genotype

as a function of the growth factors fmut, fwt. Using the definition of the growth factors (Eq. (S18)), we see that Eq. (S24) is simply the discrete-time relative fitness for the logit encoding in terms of the mutant and wild-type LFC (main text Eq. (8)). As a general point, we note that the growth-factor fmut qualifies as an absolute fitness (since it is sufficient to predict the absolute abundance N (r +1)), but does not constitute a relative fitness statistic (since we also need to know fwt, see Eq. (S20) or Eq. (S24)).
More generally, for a given encoding function m(x) we define the mutant’s multiplicative relative fitness over a discrete growth cycle as

where sm is the additive relative fitness for this growth-cycle under the chosen encoding (main text Eq. (4)) and Δt is the duration of the growth-cycle in time units that match sm. Equation (S25) shows that the additive fitness sm has time units, but the multiplicative fitness does not and is formally a dimensionless quantity. These dimensionless units are preserved in the approximate formula

which is the first-order expansion of Eq. (S25) in the limit of weak selection ( | smΔt | ≪1). For example, if we choose to measure relative fitness with the logit encoding on the time scale per-cycle (Δt = 1 cycle), we get

which is the same multiplicative fitness as if we choose to measure the relative fitness on the time-scale per-generation (Δt = LFCwt):

In both cases, the time-units for the discrete-time additive fitness cancel in the product term.
So how would a mutant ranking in the multiplicative fitness wlogit rank a set of mutant genotypes compared to the fitness statistics 


Finally, we address the question how the relative fitness 

By comparing Eq. (S29) to the multiplicative fitness in Eq. (S26), we see that the LTEE fitness statistic W does not derive from the general form of the multiplicative fitness (Eq. (S25)) as all statistics derived from this approximation have a multiplying factor Δt that cancels the units of time in sm. The fact that the LTEE fitness statistic W (Eq. (S29)) misses the term LFCwt compared to the logit-based multiplicative fitness wlogit (Eq. (S28)) means that the objects have different units of time and different rankings.
S5. Derivation of mismatch conditions for relative fitness per-cycle and per-generation
In this section, we derive the conditions for a ranking mismatch between the relative fitness per-cycle and per-generation (for the logit-encoded relative abundance) across a set of competition experiments. Consider a batch culture where a competing wild-type and mutant genotype have log fold-changes LFCwt and LFCmut over a single growth cycle. The LFCs are convenient variables to describe these dynamics since we can express the mutant’s relative fitness per-cycle as (main text Eq. (9))

and the mutant’s relative fitness per-generation as (main text Eq. (10))

Here we assume both LFCs are nonzero (so that measuring fitness per-generation is meaningful; see discussion in main text).
To determine how these two fitness statistics lead to different rankings, we consider two competition experiments A and B, which may represent two different mutants competing against the same wild-type or the same mutant tested in two different environments. A mismatch in ranking occurs when the fitness per-cycle in competition is higher in B, while the fitness per-generation is higher in A:

We insert the expressions for relative fitness per-cycle (Eq. (S30)) and per-generation (Eq. (S31)) into Eq. (S32) to rewrite the condition for ranking mismatch in terms of the LFCs:


For a given competition A, Eq. (S33) defines an area in the space of LFCs where competition B can lie such that competitions’ fitness is ranked differently per-cycle versus per-generation (main text Figure 1D shows an example as the red-shaded area). Biologically, these constraints describe a situation where the mutant and wild-type LFCs are both higher or both lower in competition B than in competition A (i.e., so that they grey point is up and to the right of the red point in Figure 1D, or down and to the left), but the LFC differences must be sufficiently balanced between the mutant and wild-type (i.e., so that the point lies within the red area in Figure 1D).
Typically, however, the LFCs of the wild-type and the mutant are not independent, since these LFCs are jointly constrained by the fact both strains compete for the same finite resources. For example, assume that a single limiting resource with concentration R is consumed in proportion to the growth of each genotype’s biomass according to

where Ywt and Ymut are the wild-type and mutant biomass yields (stoichiometry of biomass to resource). We can integrate Eq. (S34) to obtain

The growth cycle stops when no resource remains (R(t) = 0), such that

where we have expressed the genotype abundances at the end of the growth cycle in terms of their LFCs. Equation (S36) thus entails a constraint between the wild-type and mutant LFCs. For a set of a mutant competitions with the same initial resource concentration R(0), initial abundances Nwt(0) and Nmut(0), and yields Ywt and Ymut, the mutant and wild-type LFCs are constrained by Eq. (S36) to fall along a one-dimensional curve (black line in Figure S3A). Geometrically, we see that this constraint on LFCs is incompatible with the requirements for a ranking mismatch between fitness per-cycle and per-generation for a pair of mutant competitions (compare black line and red shaded areas in Figure S3A). However, if some mutant competitions deviate from this constraint, for example by having different yields Ymut or initial conditions Nwt(0) and Nmut(0), then ranking mismatches may be possible (Figure S3B).
S6. Analysis of growth curves to identify growth phases
In this section, we describe in more detail how we identify growth phases from the original data set of growth curves and use this to choose a subset of curves that matches the simplified growth dynamics of our population dynamics model (Figure S4A). As mentioned in the main text (Methods), we downloaded this original data from the PROPHECY database (http://prophecy.lundberg.gu.se/), downloading specifically the data set for growth in Synthetic Defined medium as first analysed and reported by Warringer et al. [118]. We last accessed the PROPHECY website (http://prophecy.lundberg.gu.se/) on March 30, 2020, but as of November 15, 2025 the website is no longer accessible. We have included a reformatted version of the data with our code repository (https://github.com/justuswfink/24FitnessQuantification). The original growth curve data is already corrected for background and instrument non-linearities [118] (summarized in Methods), but we decided to apply additional corrections as follows: To begin with, we concatenate the original 51 data files (for different plate reader runs) into a single, consecutive dataframe and manually handle a duplication in one of the files (Experiment NO. 18). This file has no measurements for the first time point (t = 0) due to technical error in the original data export [118] and for curves from this experimental run, we set the initial time point to NAN value in Python, meaning that these points will be ignored for any subsequent calculations of averages. More generally, we decided to trim the first four time points of all growth curves (equivalent to 1h20min from 47h total) and remove OD measurement below a noise threshold (OD = 0.001) as this improves the quality of the fit later on.
After pre-processing, we estimate a smooth time series for the instantaneous growth rate in each growth curve, using a previously published script gaussianprocess.py by Swain et al. [119] that implements the Gaussian Process approach to smoothing (downloaded from https://swainlab.bio.ed.ac.uk/software.html). We apply this script to the logarithmic absolute abundance log OD, and reconstruct a smoothed trajectory f (t) log OD(t) as well as the first derivative df/dt and the second derivative d2f/dt2 [119]. Effectively, the script estimates three hyperparameters that capture the shape of each curve and we find that the estimation works best if we constrain the parameter ranges as outline in Table S1.

Parameter settings for Gaussian Process optimisation.
With a smoothed time series at hand, we now identify “plateaus” of constant growth rate using the functions available in Scipy [120]. For each growth curve, we start by identifying so-called “plateau seeds,” which are small intervals where the second derivative is below a chosen threshold d2f/dt2 < 5 10−6. For each plateau seed k in the growth curve, we calculate the average growth rate ĝk in the time-window of the plateau. Due to some noise in the second derivative, we find many plateau seeds that are adjacent and need to be merged. To do so, we iterate over the plateau seeds in the growth curve and merge the current candidate k with the previous plateau k 1, except one of the following conditions is true:
Both plateaus seeds have a duration that is too long (equal or greater than 100 minutes).
The transition time between the plateau seeds is too large (equal or greater than 200 minutes).
The first plateau has a significantly different growth rate that the second one, such that the following equation is satisfied

where gk, gk−1 are the average growth rate in each plateau (per minute). Empirically, we find that the duration of these merged plateaus is shorter than what one would expect from visually inspecting the growth curve. Therefore we extend the remaining plateaus in each growth curve as follows: For each plateau, we estimate the lower and upper growth rate df /dt in the time window and take this as a growth rate corridor. We extend the plateau to the left, until df /dt leaves that growth rate corridor, and similarly extend to the right. By definition, the resulting plateau is equal or larger to the original time-window and we recal the average growth rate over the time window.
From this analysis, we obtain a list of growth phases for each curve that allows us to choose a subset of curves that match our model of population dynamics (Methods). We only choose curves that have two plateaus, where the first plateau has significant growth (exponential phase), and the second plateau has no growth (stationary phase). Here we define significant growth as the average growth rate in the plateau time window is larger or equal to 0.0011 per minute. This forms the set of growth curves that we use to estimate growth traits (9424 curves).
S7. The saturation time in our model of population dynamics
In this section, we restate an explicit expression for the saturation time tsat in pairwise competition that was derived in earlier work and allows us to see how mutants can influence resource depletion and the wild-type LFC (main text Eq. (16)). Using the same model of population dynamics (Methods, main text Eq (12)), previous work [121, 122] derived an approximate formula for tsat that shows how it depends on the underlying parameters:

where

is the effective lag time of the population,

is the effective e-fold growth time (reciprocal exponential growth rate) of the population,

is the effective yield of the population, and

is the log fold-change of the total biomass, which depends on the initial absolute abundance N0 of all genotypes and the initial concentration of resources R0.
Equation (S38) shows how each competing genotype influences the saturation time tsat and thus the LFCs of all other genotypes via main text Eq. (16). For example, adding a mutant with slow growth rate increases the effective doubling time (Eq. (S40)), while a mutant with long lag time will increase the effective lag time (Eq. (S39)). Genotypes also influence the saturation time (Eq. (S38)) through the effective yield 
For bulk competition experiments, the saturation time of the culture can be expressed as an interpolation between the traits of the wild-type and the mutant library. We rewrite Eq. (S39) and Eq. (S40) as function of mutant library traits



where gwt is the wild-type growth rate, λwt is the wild-type lag time,

is the mutant library abundance at the beginning of the bulk competition,

is the mutant library lag time,

is the mutant library growth rate, and

is the mutant library yield. Note that the effective traits in Eq. (S47)–(S49) are independent under a rescaling of the mutant library abundance xlib as long as the relative proportion of the individual lineages xj/xlib stays constant.
To simplify calculations, we sometimes assume that the mutants have variation in lag time and growth rate, but no variation in biomass yield (Yi = Yj for all i and j). Under this assumptions, we can rewrite Eq. (S43) and Eq. (S44) as


S8. Epistasis detection for double mutants using relative fitness per-cycle and per-generation
In this section, we discuss the patterns of epistasis observed in relative fitness per-cycle and relative fitness pergeneration for the systematic scan of double mutants in our model of population dynamics (Methods). For relative fitness per-cycle, main text Figure 3C shows that epistasis occurs if and only if one of the mutations involves a change in growth rate. This is because the selection on growth rate in the per-cycle statistic depends on the duration of the growth cycle [121]

where tsat is the saturation time, tsat − λmut is the time of growth for the mutant strain and tsat − λwt is the time of growth for the wild-type strain. At high relative abundance of the mutant, the saturation time tsat depends on all three mutant traits (Eq. (S38)), leading to quadratic mutant trait terms in Eq. (S52) and non-zero epistasis in main text Eq. (17). For example, a double mutant in growth rate (+g, +g) has lower than expected relative fitness per-cycle (negative epistasis in Eq. (17)) because the second growth rate mutation shortens the growth window tsat − λmut in Eq. (S52) (see growth curves in Figure S9; row A). Intuitively, there is no epistasis between lag time and yield because changes to lag time play out early in the growth cycle and changes in yield only change the final time point of growth (see Figure S11). Note that here we have chosen examples where both mutants exit the lag phase before saturation – for deleterious mutations with extremely long lag time, a double mutant (+λ, +λ) eventually shows epistasis in relative fitness per-cycle since the wild-type growth advantage is bounded from above due to finite resources.
Using relative fitness per-generation sgen, main text Figure 3D shows that epistasis occurs if and only one of the mutations involves lag time, a pattern that is distinct from the per-cycle statistic (compare Figure 3C and D). This is because the relative fitness per-generation uses the wild-type LFC as a normalizing time scale (main text Eq. (10)) and nonlinearities in this time scale with respect to the mutant traits can modify or create new epistasis. For example, epistasis is modified in the per-generation statistic for a double mutant in growth rate and lag time (e.g., (+g, − λ); Figure S9, row D). There is positive epistasis in relative fitness per-cycle, since the shorter lag time leads to a longer mutant growth time which amplifies selection on growth rate (compare Eq. (S52)), but there is even higher positive epistasis in the per-generation statistic, because the double mutant growth is spread over less wild-type generations compared to the single mutant growth. Sometimes, the nonlinearity in the time-scale LFCwt works to remove epistasis. For example, a double-mutant of growth rate (+g, +g) (Figure S9, row A) is epistatic in the per-cycle statistic but non-epistatic in the per-generation statistic

where the mutant growth time tsat − λmut and the wild-type tsat − λwt are both reduced but happen to cancel. Finally, the mutant-dependent time-scale LFCwt of the per-generation statistic also introduces new epistasis between lag time and yield (Figure S11), as well as for mutants with two lag time mutations (Figure S11, row C and Figure S12, row D). These mutations have no epistasis in relative fitness per-cycle but the normalization by the wild-type LFC in main text Eq. (10) introduces an additional dependence on tsat that renders fitness a quadratic function of lag time.
We repeat the fitness estimates using a 1% mutant relative abundance in the simulated competition experiments (Figure S15). At this low mutant abundance, Figure S15B shows the epistasis in relative fitness per-cycle scycle is limited to combinations of mutants that perturb growth rate and lag time. Note that in this low abundance limit, the relative fitness per-generation sgen agrees on the presence and absence of epistasis (Figure S15C), simply because the wild-type LFC is approximately constant across mutants.
S9. Analysis of fitness trajectories from the long-term evolution experiment
A previous analysis of the Long-Term Evolution Experiment (LTEE) performed by Wiser et al. [123] found that evolved populations of Escherichia coli increased in relative fitness over 50,000 generations without converging to a maximum fitness. Here we re-analyze the same data by directly comparing the relative fitness (under the logit encoding) per-cycle 

Previous work performed competition experiments between the ancestral population and each evolved population (every 500–2000 generations) by combining them in equal proportions and growing them over a single batch culture growth cycle, with measurements of their initial and final absolute abundances taken by colony counting [123]. This data has been prepared in a convenient format by Good et al. [125] and is available for download at https://github.com/benjaminhgood/LTEE-metagenomic/blob/master/additional_data/Concatenated.LTEE.data.all.csv. The total dataset lists n = 928 competition experiments. We remove n = 6 entries because either the initial or final abundances are missing.
For a few of the 12 populations, the time series is truncated: population Ara+6 has competition measurements up to generation 4000, population Ara-2 has competition measurements up to generation 30,000, and population Ara+2 has competition measurements up to generation 32,000 [123]. Note that the evolved population tested in these competitions is not a single genotype, but a sample of many genotypes that were present in that evolving population at that time. From these values of absolute abundance, we compute the log fold-change (LFC) of the evolved and ancestral populations in each competition and then calculate the evolved population’s relative fitness per-cycle (main text Eq. (9)) and per-generation (main text Eq. (10)). The original data set has two to four replicate measurements for each evolved sample, corresponding to a repeat of the competition experiment at a different day [123]. Initially, we collect all competition experiments into a single data set (n = 928 competitions) and find that relative fitness per-generation and per-cycle differ in the ranking of these competitions (main text Figure 4B, Figure S17A). For example, one competition is ranked 261 positions lower in relative fitness pergeneration than per-cycle (where higher rank indicates higher fitness). The scatter occurs because the biomass yield evolves downward over time [126, 127]. We can understand the ranking mismatch from the underlying LFCs in the competition experiment (Figure S17B), that show considerable scatter, with some mutant-wild-type pairs in a positive covariation (compare Figure S17B and main text Figure 1D). The scatter occurs because the biomass yield evolves downward over time [126, 127], shifting the wild-type LFCs downward in 1:1 competitions with the evolved populations (horizontal trend across time points in Figure S17B).
We can also construct a time series of relative fitness for each population and compare fitness rankings at a individual time points. For each of the 12 populations in the LTEE, we define the relative fitness per-generation 


A key result from previous analysis by Wiser et al. [123] on this data set is that the evolving populations increase indefinitely in relative fitness, rather than leveling off at some maximum fitness value. The empirical analysis by Wiser et al. [123] tested the long-term trend by fitting the statistic 

where t is the evolutionary time point at which the evolved population is measured against its ancestor, and f (t) is the fitness statistic measured from that competition (here fitted to the measured values of W ). The important feature of the hyperbolic model is that it assumes that relative fitness saturates at a maximum fitness over long times (limt→∞ f (t) = 1 + a). To contrast this model, they also tested a power law

under which fitness increases without bound over long times (limt→∞ f (t) = ). To repeat this analysis with the fitness statistics 



Thus we use

as the hyperbolic model and

as the power law model.
As a control against the original analysis of Wiser et al. [123], we first perform our own fit of the time series of the relative fitness per-generation 




We next perform a new analysis by calculating the relative fitness per-cycle 


We note that the quality of fit R reported in the main text of Wiser et al. [123] differs from the quality of fit we show on our plots. While both studies performed the fits of the models on all 12 populations simultaneously, Wiser et al. [123] evaluated the quality of fit by calculating R for the fitted model (blue and pink lines in Figure S16A) against the fitness time series averaged over replicates (grey points in Figure S16C). That is, they fit the model to the data without averaging over population but calculate quality of fit using the population-averaged data. Since we believe this was inconsistent, we have followed a more standard approach of calculating the quality of fit on the same input data used for the fit (i.e., correlating the blue and pink lines in Figure S16A with grey points in the same plot). As a consequence, the correlations between the fitted models and data reported in the original publication (hyperbolic R = 0.969, power law R = 0.986 [123]) are systematically higher than the values we find in our re-analysis (hyperbolic R = 0.826, power law: R = 0.837; take square root of the values in Figure S16A). Although the long-term fitness dynamics in the case of the LTEE are robust to the choice of fitness statistic, we note that it is possible to construct scenarios of microbial growth trait evolution where relative fitness saturates in one statistic but not in the other (main text Figure 3E-F). For theory work that on the long-term trend and other models than the power law or hyperbolic model tested here, see refs. [128–132].
S10. Testing AUC and other fitness potentials using simulated competition experiments
In this section we explain our tests of estimated fitness potentials against true relative fitness in simulated competition experiments. We first focus on the area under the growth curve (AUC), which is defined as

where N (t) is a growth curve of absolute abundance (or a proxy such as optical density) and teval is a cutoff time for evaluating the area. Many previous studies [133–137] and growth curve analysis packages [138] have used this definition. The idea of AUC is that, unlike estimated fitness potentials that only account for individual traits of a genotype’s growth (e.g., growth rate or lag time; see below), the AUC literally integrates the whole growth dynamics into a single number. For example, both fast growth rate and short lag time are manifested in greater AUC. Note that while one can attempt to use the AUC as an approximate fitness potential (as we investigate here), it is not a measure of absolute fitness (Sec. S2) since it is insufficient to predict changes in absolute abundance (i.e., the area under the growth curve does not determine the change in absolute abundance from beginning to end).
To compute the AUC for the set of single-gene deletion genotypes in our data set, we first simulate a growth curve for each genotype under the population dynamics model in main text Eq. (12) with the traits estimated from the yeast single-gene knockout data (Methods). We use simulated growth curves, with trait values inferred from the measured growth curves, rather than the actual measured growth curves since we are comparing the AUCs to relative fitness also from simulations of competitions; we do not have actual competition data for these genotypes, so it would be an apples-to-oranges comparison if we used AUCs from the actual growth curve data. Furthermore, in empirical growth curves, the AUC is also influenced by technical variation in the initial biomass N0 and the initial concentration of resources R0, so using simulations removes these effects.
Figure S20A shows the distribution of saturation times tsat (as defined for the population dynamics model in Methods) numerically calculated for all simulated growth curves of the deletion mutants. We calculate the AUC for each growth curve using Eq. (S58) with an evaluation time of teval = 16 hours, since that includes the stationary phase in the vast majority of our simulated growth curves (Figure S20A). From the mutant’s AUC we calculate the AUC-based estimator of the mutant’s relative fitness

where AUCwt is the AUC for a simulated growth curve of the wild-type (using the median wild-type traits in our database; see Methods). We then simulate a single growth cycle of the mutant competing against the wild-type (Eq. (12) with equal initial abundance of the mutant and wild-type) to calculate the mutant’s “true” (under the model assumptions) relative fitness per-cycle with the logit encoding 

The success of the AUC depends on the evaluation time teval, which sets the time window from which information is captured from the growth curve by the integral (Eq. (S58)). Figure S20B–D shows the correlation between the relative fitness in coculture 
Besides AUC it is possible to use other features of the monoculture growth curve as approximate fitness potentials. For example, one can use the monoculture growth rates alone as fitness potentials:

or the monoculture lag times:

Another possible fitness potential is the absolute abundance at saturation, which in our model of population dynamics (Eq. (12)) is proportional to the biomass yield:

Finally, one can also use the difference in the monoculture log fold-changes (LFCs):

This looks similar to the definition of relative fitness per-cycle (main text Eq. (9)) but is distinct because it uses the LFCs in monoculture rather than the true LFCs in coculture (which may be different).
We test each of these fitness potentials against the relative fitness in pairwise competition, using different input data sets of trait variation and the “GNU Parallel” command to speed up the simulation process [139]. Figure S19 rows B and C show that growth rate and lag time can act as perfect fitness potentials if that trait is the only trait with variation across mutants. This is because the relative fitness is proportional to differences in each of these traits when they are the only source of variation (see Sec. S14). Section S14 also shows that differences in biomass yield (Ymut − Ywt) have no effect on fitness by themselves, which is why the biomass yield of the strains in monoculture is a poor fitness potential to estimate relative fitness. This large but neutral variation in the biomass yield across mutants in our data sets means that the LFC is also a poor fitness potential.
All of these trait-based fitness potentials are outper-formed by the AUC, which provides the best approximation of the mutant fitness ranking in coculture under realistic trait variation (Figure S19, row A). More broadly, it is important to treat absolute and relative fitness, as well as fitness potentials, as distinct concepts, serving different purposes [140, 141]. As we show here in simulation (Figure S19), and others have shown in experiments [134, 135, 142, 143], measuring fitness potentials is not enough to demonstrate that a mutant genotype will outcompete the wild-type.
S11. Coarse-graining pairwise relative fitness in multi-genotype populations
In this section, we point out the specific advantages of the logit-encoding for coarse-graining pairwise relative fitness in bulk competition experiments. While the pairwise relative fitness is defined for any encoding m, the logit encoding endows it with some convenient mathematical properties not shared by other encodings (e.g., the log encoding). The logit encoding of the pairwise relative abundance has the property

meaning that it is antisymmetric under exchange of the indices i and j (logit xij = − logit xji) and additive across pairs of indices (logit xij = logit xik + logit xkj). Since the logit-encoded pairwise relative fitness is just the time derivative of the logit function (Eq. (21)), it carries equivalent properties of antisymmetry and additivity:


We also note that the logit encoding of the pairwise relative abundance has the property:

Rescaling the relative abundances of either genotype thus does not change the pairwise relative fitness (since it only shifts the logit by a constant, which does not affect its derivative). This means that pairwise relative fitness is an “intensive” property of a genotype, analogous to intensive properties in statistical mechanics (such as temperature) that do not scale with system size. For example, if we split a mutant genotype into two subgroups (e.g., differentiated by a neutral marker), the pairwise relative fitness of each mutant subgroup with respect to the wild-type will be the same as the pairwise relative fitness of the mutant genotype as a whole compared to the wild-type. In contrast, the logit-encoded total relative fitness does not satisfy this property since logit(axi) ≠ logit xi+ constant.
When the encoding m is the logit function, the pairwise relative fitness per-cycle still satisfies the above properties (antisymmetry and additivity with respect to indices i, j, and invariance under relative abundance rescaling) since those are properties of the underlying logit encoding. This is also apparent from interpreting the per-cycle fitness statistic as an integral of the instantaneous statistic:

S12. The relative fitness between coarse-grained groups of genotypes
In this section, we generalize the concept of pairwise relative fitness to pairs of genotype groups rather than pairs of individual genotypes. In a multi-genotype population with subsets of genotypes 𝒜 and ℬ, define

as the relative abundance of 𝒜 genotypes compared to ℬ genotypes at time t, where A ⋃ B is the union of the two index sets. Analogous with main text Eq. (21), we define the fitness of group 𝒜 relative to group ℬ as

for an encoding m. Under the logit encoding, it turns out that the fitness between these two groups can be conveniently expressed as a weighted average of the pairwise fitness between the member genotypes in each group:

To prove Eq. (S71), we first note that

Thus we can rewrite the logit-encoded relative pairwise fitness between 𝒜 and ℬ as

where we have invoked Eq. (S72) on the second line. We can expand each term on the right-hand side of Eq. (S73) as

and then insert Eq. (S74) into Eq. (S73) to obtain

where we have collected the normalization factors (sums over relative abundances in 𝒜 and ℬ) as a single pre-factor. We then rewrite each product of sums in Eq. (S75) as

to finally obtain

Identifying the term in the inner parentheses as the pairwise selection coefficient 
Equation (S71) establishes that the relative fitness between a pair of genotype groups is a weighted sum of relative fitnesses between individual pairs of genotypes in those groups, but this holds for relative fitness defined at an instant in time (since it is based on derivatives). For relative fitness defined over a finite time interval (e.g., a growth cycle in batch culture), an analogous but approximate result holds. We first write the relative fitness over a growth cycle time interval as an integral over the instantaneous relative fitness (inserting Eq. (S71) into Eq. (S68)):

where tsat is the end time of the growth cycle (Methods, Sec. S7). The integral in Eq. (S78) is difficult to calculate as the relative abundance trajectories xi(t), xj(t) depend on the relative fitness of the genotypes 

Conceptually, assuming the relative abundances are approximately constant over the growth cycle is equivalent to assuming selection is weak; one can also show this mathematically by expressing the relative abundances xi(t) in terms of the pairwise relative fitnesses 

We finally note that the total relative fitness (instantaneous Eq. (19) and per-cycle Eq. (22) in the main text) is a special case of the relative fitness between groups (Eqs. (S71) and (S79)) where 𝒜 is the single genotype i and ℬ is all other genotypes besides i:


S13. Fitness error from the frame of reference in bulk competition experiments
Here we calculate the error that arises from measuring the total relative fitness of each mutant in a bulk competition experiment of a mutant library, rather than the pairwise relative fitness between each mutant and the wild-type. We call this difference the error from the frame of reference, the frame of reference being either the whole population in total fitness or the wild-type in pairwise fitness. Note that this is an error between two different fitness quantifications of the same bulk competition experiment; Sec. S15 addresses the error (arising from higher-order interactions) between fitness quantifications in bulk versus pairwise competition experiments. Here we only consider relative fitness under the logit encoding and measured per growth cycle, so we drop these labels to simplify notation.
Consider a bulk competition experiment of a wild-type and a library of large number of mutants over a single batch growth cycle of time tsat (Methods). The total relative fitness of mutant i is (main text Eq. (22))

while its pairwise relative fitness compared to the wild-type is (main text Eq. (23))

Using the coarse-graining rules from Sec. S12 (namely Eqs. (S79) and (S81)), we can express the total relative fitness of mutant i as a weighted sum of the pairwise relative fitnesses between the mutant and the wild-type and between the mutant and the rest of the mutant library:

where the notation lib \ i refers to the mutant library excluding the mutant i. The approximation here is due to our assumption in Eq. (S79) that selection is weak enough that the relative abundances of genotypes do not change too much over the growth cycle. Since we can rewrite the pairwise relative fitness between mutant i and the rest of the library as a difference between i and the wild-type and the rest of the library and the wild-type (using Eq. (S66))

we insert this into Eq. (S84) to obtain

The difference between the total and pairwise relative fitness is therefore

Since mutant libraries in these experiments typically contain hundreds or thousands of mutants, the contribution of a single mutant i is small and thus we can assume that the properties of the mutant library excluding mutant i (lib \ i) are approximately the same as the library as a whole (so that xlib\i ≈ xlib, slib\i,wt ≈ slib,wt). This allows us to further simplify the error as

Equation (S88) shows why this offset between the total and pairwise relative fitnesses is approximately independent of the focal mutant i (hence the constant shift for all mutant points in main text Figure 5C). The sign of the error from the frame of reference depends on the mean fitness of the mutant library: a mutant library that is overall deleterious relative to the wild-type (slib,wt < 0) causes the total relative fitness for a mutant σi to overestimate the mutant’s pairwise relative fitness si,wt. Intuitively, this is because the total relative fitness is comparing the mutant to a mixed population of wild-type and other mutants, which are on average worse competitors than the wild-type, which thus makes the mutant appear to be better than if it is just compared to the wild-type alone (compare top and bottom panel in main text Figure 5B). Equation (S88) also shows that the error from the frame of reference can be reduced if the mutant library is neutral relative to the wild-type (slib,wt = 0) or the mutant library has small relative abundance in the culture biomass (xlib 1). In bulk competition experiments with barcoded mutant libraries, these assumptions are often not met since the mutant libraries tend to be overall deleterious (as in our simulated bulk competition for the yeast single-gene knockouts, see Figure S6) and are inoculated at a high relative abundance [144–148] Since this makes the error in Eq. (S88) significant, we instead recommend including barcoded wild-type strains as references in the bulk competition, so that pairwise fitness can be quantified relative to that (Eq. (S83)) rather than using the total relative fitness. By using a mix of bar-coded wild-type cells and non-barcoded wild-type cells it is further possible to optimize this protocol and save on sequencing investment [149].
Finally, we want to point out a difference between the best practice we recommend here (Discussion) and a wide-spread practice in estimating fitness estimates in bulk competition experiments. In practice, transposon-seq experiments that grow the mutant library by itself start with an estimate of total relative fitness, and then subtract the median total relative fitness of the knockouts [33–35, 38] or a mean total relative fitness [144, 151, 152]. However, these corrections are not explicitly founded in the choice of a reference group (like a set of neutral genotypes or a wild-type), making the correction appear ad-hoc [145, 147, 150], or the reference is a strain that is not part of the culture, like in the fitness estimates for barcoded lineages that are evaluated against the initial ancestor without that ancestor actually being present [144, 151, 152]. To make things more confusing, even those studies that do include a wild-type then describe their method as an estimate of total relative fitness of the mutant under the log encoding, subtracted by the total relative fitness of the wild-type under the log-encoding [149, 153, 154]. The result is a pairwise relative fitness under the logit-encoding (Eq. (25)) but presenting it this way obscures that choice of encoding and the relationship to the classic, logit-based selection coefficient used in pairwise competition experiments (Eq. (7)). We hope that our framework can provide more clarity: The choice of the reference group happens at the level of relative abundance, by calculating a pairwise relative abundance (Eq. (20) or Eq. (S69)), and this removes the need for any correction on the fitness values themselves.
S14. Pairwise relative fitness using an explicit model of population dynamics
In the model of population dynamics (main text Methods, Eq. (12)), we can calculate the pairwise relative fitness of genotypes based on the approximation of the saturation time (Sec. S7; [121, 122]). The pairwise relative fitness of genotype i relative to genotype j (per-cycle and under the logit encoding) is

where



The e-fold growth time for genotype j is λj = 1/gj, and the terms Δλij = λi − λj and Δλij = λi − λj are the differences between the two genotypes lag times and growth times. Since the terms in Eq. (S89d) depend on the covariation between growth and lag, we interpret these terms as couplings between the growth and lag phases; they are zero if only two genotypes are present.
S15. Fitness error from higher-order interactions between pairwise and bulk competition experiments
In this section, we calculate the error in relative fitness of a mutant arising from higher-order interactions in bulk competition experiments with large mutant libraries, compared to the “true” relative fitness in pairwise competitions between just the focal mutant and wild-type alone. Here we only consider relative fitness under the logit encoding and measured per growth cycle, so we drop these labels to simplify notation. Let the pairwise relative fitness of mutant i compared to the wild-type (main text Eq. (23)) be

in the pairwise competition with the wild-type alone, and

in the bulk competition with all other mutants in the library. The superscripts pair and bulk indicate that the dynamics of xi,wt(t) and the saturation time tsat may be different in the two competitions. The difference between these two measurements of relative fitness is

Since the difference between the bulk and pairwise competitions is the presence of the other mutants in the library, we interpret this difference as the fitness error from higher-order interactions among the mutants.
We now calculate how this error depends on the underlying growth traits of the genotypes using the population dynamics model (main text Methods, Eq. (12)); and the explicit expression for relative fitness from Sec. S14. Based on the approximate pairwise relative fitness in this model (Eq. (S89)), the relative fitness in the pairwise competition is the sum of two terms

while the relative fitness in the bulk competition is the sum of three terms

where the third term represents the coupling between growth and lag phases present only in populations with more than two genotypes (Eq. (S89d)). We define the higher-order effects on the selection for lag time as

Using Eq. (S89b), we can express this in terms of the underlying traits as

where 
We similarly define the higher-order effects in the selection on growth rate as

and calculate 


where LFCbulk is the log fold-change (Eq. (S42)) of the total biomass in the bulk competition and LFCpair,i is the log fold-change of the total biomass in the pairwise competition between mutant i and the wild-type. We note that total biomass growth in the bulk competition LFCbulk depends on the mutant library through the effective biomass yield (Eq. (S41)), but this dependence is weak because the yield enters only logarithmically into Eq. (S42). We thus assume that the bulk and pairwise competition have equal biomass growth (LFCbulk ≈ LFCpair,i) for all mutants i such that the higher-order effect on the growth rate selection (Eq. (S98)) is proportional to the increase in the mean doubling time

Since this has the same form as the higher-order effect on lag time selection in Eq. (S96), we can combine Eq. (S99) and Eq. (S96)) into

Because the effective growth time in the pairwise competition 

That is, the relative error of lag and growth selection from higher-order interactions is approximately independent of the individual mutant, but rather depends on properties of the wild-type (τwt) and the whole mutant library (through 
Finally, we note that the growth-lag coupling terms 


S16. Choosing the mutant library abundance in bulk competition experiments
Section showed that measuring relative fitness of a mutant in a bulk competition (with a library of other mutants also present) entails an error due to higher-order interactions among the mutants, compared to measuring relative fitness in a pairwise competition consisting of just the mutant and wild-type. Here we show how this error depends on the relative abundance of the mutant library in the bulk competition, so that we can estimate the range of library abundances that keep the error below a desired threshold.
Calculating the relative error on fitness
Let the absolute error in relative fitness from higher-order interactions for a mutant i be Δsi,wt (Eq. (S92)). Let the relative fitness of this mutant in a pairwise competition be 

In Sec. we calculated the dependence of Δsi,wt on the underlying growth traits. However, since here we are mainly concerned with the dependence on the library abundance xlib, we present an alternative calculation that better captures that dependence.
We start by pointing out that we can express the relative fitness in pairwise (Eq. (S90)) and bulk competitions (Eq. (S91)) in terms of the saturation times for these competitions, using the explicit solution to the population dynamics model in main text Eq. (15):


We can thus express the error from higher-order interactions (Eq. (S92)) as

where Δgi,wt = gi gwt. Equation (S105) shows that the mutant library affects the fitness of individual mutants 


According to Eq. (S38), the saturation time tsat depends on the effective lag time 


Note that this means that the difference in saturation times is independent of the specific mutant i; the only dependence of the mutant i on the overall error from higher-order interactions is through the difference in growth rates in Eq. (S105).
We can thus write the fitness error as

Equation (S107) depends on the mutant library abundance xlib through the effective traits 


where Δλlib,wt = λlib − λwt is the difference in lag times, and Δτlib,wt = τlib − τwt the difference in e-fold growth times (reciprocal growth rates), between the library (defined according to Eq. (S47)–(S48)) and the wild-type. All the dependence on the library relative abundance xlib is now contained in the fraction outside the square brackets in Eq. (S108).
Since the relative fitness of the mutant library as a whole compared to the wild-type in bulk competition is (using Eq. (S89))

we can rewrite Eq. (S108) as

This shows that a mutant library that is neutral relative to the wild-type in the bulk competition 
Calculating the upper bound on relative abundance of the mutant library
To calculate the maximum relative library abundance that keeps the relative error below the threshold ϵ, we input the absolute error in Eq. (S108) into the relative error bound of Eq. (S102) and rearrange to isolate the dependence on the library abundance xlib:

where

depends on the focal mutant i but does not depend on the library abundance xlib. The left-hand side of Eq. (S111) varies between zero and one as a function of xlib. This means that if ϵAi > 1, any value of xlib will satisfy Eq. (S111), meaning that the relative error on mutant i from higher-order interactions will always be less than ϵ. For example, this holds when the mutant i has the same growth rate as the wild-type (Δgi,wt = 0, which causes Ai →∞), even if it varies in lag time and/or other mutants have variation in growth rates.
We thus next consider the case where ϵAi < 1. We multiply both sides of Eq. (S111) by the denominator on the left-hand side (always positive) to obtain

and then collect all the terms involving xlib on the left hand side:

Since ϵAi < 1, the factor multiplying xlib on the left-hand side of Eq. (S114) must be positive and thus we can divide both sides to obtain an upper bound on the library abundance xlib such that the relative error on fitness is less than ϵ:

Since ϵ will typically be small, we can simplify the right-hand side of Eq. (S115) by approximating it to first-order in ϵ:

This approximation holds as long as ϵAi(glib − gwt)/glib ≪1.
The maximum mutant library abundance determined by Eq. (S115) or (S116) is specific to a single mutant genotype i, meaning that the relative fitness errors for other mutants may exceed ϵ even if the inequality for mutant i is satisfied. To keep the relative fitness errors for all mutants less than ϵ, we need to choose a library abundance 

This means that Eq. (S116) must be satisfied for all mutants j:

Thus we must determine the minimum value of Aj over all mutants j. Using the definition in Eq. (S112), Aj is minimized for the mutant with minimum vale of

where we have calculated the pairwise relative fitness in Eq. (S103) using 


Special case of variation in growth rates only
To determine an even simpler estimate on the maximum mutant library abundance, we consider the special case where genotypes vary only in growth rates and not lag times. Using the definition in Eq. (S112), Ai = glib/(gwt − glib) for all mutants i, and thus there is a single bound on library abundance for all mutants (using Eq. (S116)):

This is the same as Eq. (11) in the main text.
We test Eq. (11) using our simulated competition data for the yeast single-gene deletion library (Methods) with variation in all three traits: lag time, growth rate, and biomass yield (Figure S4B,C). We compute the growth rates of the mutant library (glib = 0.389 h−1 using Eq. (S40)) and the wild-type (gwt = 0.406 h−1). Using a relative error threshold of ϵ = 0.01, the maximum mutant library relative abundance according to main text Eq. (11) is xlib = 24.6%. Figure 5D in the main text shows that this mutant library abundance indeed is able to keep the relative error below the threshold for mutants with high relative fitness, but, as expected, this mutant library abundance fails for mutants close to neutrality.
We furthermore compute the maximum library abundance with Eq. (S118), based on the precise trait variation in our data set. The mutant library has an effective lag time λlib = 1.95 h (compared to wild-type lag time λwt = 1.92 h) and an effective e-fold growth time τlib = 2.57 h (compared to the wild-type τwt = 2.46 h), leading to an overall negative relative fitness 

Supplementary figures

Predicting the absolute and relative abundance of microbial populations.
(A) Example time series of absolute abundance for a single microbial population (light grey). The observer (eye symbol) of the time series at time t can ask about the future trend in absolute abundance (arrows). (B) Same as panel (A) but for two microbial strains, such as a wild-type and a mutant. The absolute abundance of the wild-type strain (light grey) is stacked on top of the absolute abundance of the mutant strain (dark grey). (C) The relative abundance for the mutant strain corresponding to panel (B). We sketch the mean relative abundance x over time (green line) inferred from multiple replicate cocultures. An error band (light green) around the time series demonstrates the variation between replicates.

The effect of encodings on a non-logistic relative abundance trajectory.
Example trajectory of relative abundance x (top panel) for a mutant invading and eventually replacing a wild-type population, simulated with the Gompertz equation dx/dt = rx log(1/x). Below, we show the Gompertz trajectory under the encodings log x and logit x = log(x/(1 − x)). Compare to Figure 1A for a trajectory simulated with the logistic equation.

The variation of wild-type and mutant log fold-change under resource consumption constraints.
(A) Schematic fold-change variation under a perfect resource consumption constraint between wild-type and mutant. Each dot corresponds to a mutant strain in a 1:1 competition growth cycle with the wild-type strain (Sec. S5). There is negative covariation between mutant and wild-type LFCs because the resource constraint forces one to decrease if the other increases. For our model of population dynamics, we can calculate this constraint exactly (black line: Eq. (S36) in Sec. S5). For two mutants (red points) we highlight the bow tie-shaped areas (red shading; compare Figure 1D) where other competitions must lie to have a rank discrepancy between relative fitness per-cycle 




Empirical trait distribution for single gene-knockouts of yeast.
(A) Overview of the growth curve data set and the estimated growth traits for the knockout library of Saccharomyces cerevisiae (Methods) [118]. (B) Covariation between estimated maximum growth rate g and lag time λ across all mutant strains (grey dots; Pearson correlation coefficient r = −0.17, p = 7 × 10−30) as well as wild-type replicates (orange dots; r = −0.16, p = 0.002). The reference wild-type strain for our pairwise coculture simulations is defined by the median trait values (black cross) of all wild-type replicates. (C) Covariation between measured maximum growth rate g and biomass yield Y across all mutant strains (grey dots; r = 0.21, p = 8 × 10−44) as well as wild-type replicates (orange dots; r = −0.06, p = 0.25). Histograms on the top and right sides of (B) and (C) are marginal distributions along each axis, with colors matching the scatter points and the vertical black line marking the median wild-type trait value.

Replicate measurements for growth rate, lag time, and yield in our empirical trait data set.
(A) Co-variation of growth rate between replicate measurements of the knockouts (grey dots; Pearson correlation coefficient r = 0.94, p = 0.00). Each dot represents a mutant genotype from the single-gene knockout collection in Saccharomyces cerevisiae [118]. For the vast majority of genotypes in our data set (4163 out of 4492 knockouts) we are able to fit two traits from two independent growth curve measurements (Figure S4A; Methods). (B) Covariation of lag time between replicate measurements of the knockouts (grey dots; r = 0.90, p = 0.00) (C) Covariation of biomass yield between replicate measurements of the knockouts (grey dots; r = 0.43, p = 4.81 × 10−188).

The distribution of fitness effects under relative fitness per-cycle and per-generation.
(A) Distribution of relative fitness per-cycle 




Exploring alternative conditions for rank discrepancy between fitness statistics in yeast gene-deletion data.
Plots columns (A) and (B) have the same format as Figure S4B,C, and columns (C) and (D) match Figure 2B,C (in reverse order). Row (A): Low mutant frequency x = 0.01. Row (B): Standard mutant frequency x = 0.5, but where we artificially eliminate variation in lag time to test the effect on the fitness rank discrepancy. Row (C): Same as row (B) but no variation in growth rate. Row (D): Same as row (B) but no variation in biomass yield. Row (E): Same as row (B), but yields of mutants lower than the median wild-type yield are set to the wild-type value.

Hypothetical case where relative fitness per-cycle and per-generation are anticorrelated across mutants.
(A) Covariation between growth rate and lag time for a synthetically generated set of mutants (grey dots) and a single wild-type (orange dots). The variation here may represent the standing variation in an evolved population with improved growth rate and lag time over the wild-type ancestor. The graphic overlap between points means they appear as a single line. (B) Covariation between growth rate and biomass yield for the mutants (grey dots) and wild-type (orange dot) in this synthetic data set. (C) Covariation between relative fitness per-cycle 




Competition growth cycles for double mutants — first column of Figure 3C,D.
Each row corresponds to a pair of mutations that perturb growth traits in the minimal model of population dynamics (Figure 2A, Methods). This figure shows all combinations where the first mutant increases growth rate (first column of Figure 3C,D). Column (A): Perturbations of growth traits for single and double mutants. Column (B): Growth curves of wild-type (grey) and mutant 1 (blue). Column (C): Growth curves of wild-type (grey) and mutant 2 (red). Column (D): Growth curves of wild-type (grey) and double mutant (purple).

Competition growth cycles for double mutants — second column of Figure 3C,D.
Same as Figure S9 but for mutant pairs where the first mutant decreases growth rate.

Competition growth cycles for double mutants — third column of Figure 3C,D.
Same as Figure S9 but for mutant pairs where the first mutant increases lag time.

Competition growth cycles for double mutants — fourth column of Figure 3C,D.
Same as Figure S9 but for mutant pairs where the first mutant decreases lag time.

Competition growth cycles for double mutants — fifth column of Figure 3C,D.
Same as Figure S9 but for mutant pairs where the first mutant increases biomass yield.

Competition growth cycles for double mutants — sixth column of Figure 3C,D.
Same as Figure but for mutant pairs where the first mutant decreases biomass yield. For the double mutant where the second mutant also decreases yield, we cannot use the additive mutation effects (Methods) because that would result in a negative yield for the double mutant. Here we have defined the double mutant to have a yeield value of Ymut = 0.1. This choice does not affect the fitness or epistasis, since biomass yield is a neutral trait with fitness zero in our model of population dynamics.

Epistasis patterns at low mutant relative abundance.
Same as Figure 3B–D but where the mutant starts each competition at low relative abundance (x0 = 0.01; Sec. S8), compared to the 1:1 initial conditions in Figure (x0 = 0.5).

Long-term fitness trends in the LTEE under relative fitness per-cycle and per-generation.
(A) Fit of a hyperbolic model (pink line; Eq. (S56) in Sec. S9) and a power-law model (cyan line; Eq. (S57) in Sec. S9) to a pooled time-series of relative fitness per-generation 




Comparison of relative fitness per-generation and per-cycle across the complete LTEE competition data set.
(A) Rank discrepancy between relative fitness per-cycle 






Comparison of log-encoding and logit-encoding for relative fitness per-cycle in the LTEE competition data set.
(A) Same as Figure S17A but for rank discrepancy between logit-encoded relative fitness per-cycle 


Predicting relative fitness with monoculture proxies under different scenarios of trait variation.
Plots in columns (A) and (B) show growth trait variation across mutants, with the same format as Figure S4B,C. Plots in column (C) show quality of prediction for different monoculture proxies under the trait distribution in columns (A) and (B). As the ground truth, we estimate the relative fitness per-cycle 


The choice of the cutoff time for evaluating the area under the curve (AUC).
(A) Distribution of saturation times in monoculture for the knockouts (grey bars) and wild-type replicates (orange bars) in our empirical data set (Figure S4). The saturation time tsat is defined as the time when the limiting resource is depleted (R(tsat) = 0) and can be estimated numerically from the trait data (Methods). Three vertical lines indicate example choices for the cutoff time teval of the AUC: the most frequent saturation time (teval = 13 hours; red line), a saturation time halfway in the decay of the distribution (teval = 16 hours; black line), and an external time scale (teval = 24 hours; blue line). (B) Covariation between relative fitness per-cycle 



The error in mutant fitness rankings between bulk and pairwise competition experiments.
(A) Rank difference between total relative fitness and pairwise relative fitness in a bulk competition growth cycle with low mutant library abundance (Figure 5A, case II). Based on the fitness data in Figure 5C, we calculate a mutant ranking for total relative fitness in bulk (Eq. (24)) and a ranking for pairwise relative fitness in bulk (Eq. (25)) (higher rank means higher fitness and mutants with equal fitness are assigned the lowest rank in the group). The rank difference is defined as the rank in total relative fitness 


A decomposition for the error from higher-order interactions in bulk competition experiments.
For a bulk competition growth cycle with low mutant library abundance (Figure 5A, case II), we calculate the pairwise relative fitness (Eq. (25)) for the knockouts in our empirical data set (Figure S4A) using a previously established approximation [122] (Sec. S14). The error from higher-order interactions is defined as the pairwise fitness in bulk minus the fitness in pairwise competition. Based on the approximation for our model of population dynamics (Sec. (S14)), we derive a decomposition that separates the absolute error into two terms that we call the fitness-dependent error term (dark orange dots) and the fitness-independent error term (light orange dots). For full details on the decomposition, see Sec. (S15).

The relative error between bulk and pairwise competition experiments.
(A) Same as Figure 5C but showing relative rather than absolute error. The relative error of each bulk statistic is defined as the absolute error (Figure 5C), divided by the fitness in pairwise competition (Eq. (S102) in Sec. S16). A dashed grey line indicates the threshold of 1% relative error. (B) Same as Figure but showing relative rather than absolute error. (C) Same as the inset of Figure 5C but showing relative rather than absolute error. On the x-axis, we plot the absolute value of relative fitness per-cycle in the pairwise competition.

The bulk fitness error based on measurements by van Opijnen et al. [155].
(A) Absolute error 




The bulk fitness error based on measurements by Levy et al.
[32]. Same as Figure but for the set of evolved genotypes of S. cerevisiae shown in Figure 2d in the original publication. In (A), we calculate the Pearson correlation coefficient r and the p-value for the null hypothesis of zero correlation, and indicate the linear regression between the two variables with a red line. In (B) we define a set of knockouts with non-neutral fitness (blue dots; 
Data availability
All data and code are available at https://github.com/justuswfink/24FitnessQuantification.
Acknowledgements
JWF wishes to thank Luis-Miguel Chevin, Olivier Tenaillon, Henrique Teotónio, and the participants of the 2021 ENS autumn course on Experimental Evolution for valuable discussions early on. The authors are grateful to Jeremy Schreier, Benjamin Raach, and Gatwa Tshinsele-Van Bellingen for a critical reading of the manuscript. JWF and MM were supported by an Ambizione grant from the Swiss National Science Foundation (PZ00P3 180147).
Additional information
Funding
Swiss National Science Foundation (PZ00P3_180147)
Justus Wilhelm Fink
Michael Manhart
References
- [1]Population Genetics: A Concise GuideBaltimore: JHU Press Google Scholar
- [2]Trade-offs in life-history evolutionFunctional Ecology 3:259–268https://doi.org/10.2307/2389364Google Scholar
- [3]The fitness of fitness concepts and the description of natural selectionQ Rev Biol 69:3–29https://doi.org/10.1086/418431Google Scholar
- [4]The confusions of fitnessBritish J Philos Sci 55:347–363https://doi.org/10.1093/bjps/55.2.347Google Scholar
- [5]The stagnation paradox: The ever-improving but (more or less) stationary population fitnessProc R Soc B 288:20212145https://doi.org/10.1098/rspb.2021.2145PubMedGoogle Scholar
- [6]Why there are so many definitions of fitness in modelsEcoEvoRxiv https://doi.org/10.32942/x2v61tGoogle Scholar
- [7]On measuring selection in experimental evolutionBiol Lett 7:210–213https://doi.org/10.1098/rsbl.2010.0580PubMedGoogle Scholar
- [8]Measurement and meaning in biologyQ Rev Biol 86:3–34https://doi.org/10.1086/658408PubMedGoogle Scholar
- [9]The measurement theory of fitnessEvolution 64:1358–1376https://doi.org/10.1111/j.1558-5646.2009.00909.xPubMedGoogle Scholar
- [10]Relative fitness can decrease in evolving asexual populations of S. cerevisiaeNature 306:368–371https://doi.org/10.1038/306368a0PubMedGoogle Scholar
- [11]Long-term dynamics of adaptation in asexual populationsScience 342:1364–1367https://doi.org/10.1126/science.1243357PubMedGoogle Scholar
- [12]Global epistasis makes adaptation predictable despite sequence-level stochasticityScience 344:1519–1522https://doi.org/10.1126/science.1250939PubMedGoogle Scholar
- [13]Quantitative evolutionary dynamics using high-resolution lineage trackingNature 519:181–186https://doi.org/10.1038/nature14279PubMedGoogle Scholar
- [14]High-resolution lineage tracking reveals travelling wave of adaptation in laboratory yeastNature 575:494–499https://doi.org/10.1038/s41586-019-1749-3PubMedGoogle Scholar
- [15]Changing fitness effects of mutations through long-term bacterial evolutionScience 383:eadd1417https://doi.org/10.1126/science.add1417PubMedGoogle Scholar
- [16]A mutant fitness assay identifies bacterial interactions in a model ocean hot spotProc Natl Acad Sci USA 120:e2217200120https://doi.org/10.1073/pnas.2217200120PubMedGoogle Scholar
- [17]Bacterial–fungal interactions revealed by genome-wide analysis of bacterial mutant fitnessNat Microbiol 6:87–102https://doi.org/10.1038/s41564-020-00800-zPubMedGoogle Scholar
- [18]Rapid quantification of mutant fitness in diverse bacteria by sequencing randomly bar-coded transposonsmBio 6:e00306–15https://doi.org/10.1128/mbio.00306-15PubMedGoogle Scholar
- [19]Mutant phenotypes for thousands of bacterial genes of unknown functionNature 557:503–509https://doi.org/10.1038/s41586-018-0124-0PubMedGoogle Scholar
- [20]The biological cost of mutational antibiotic resistance: Any practical conclusions?Curr Opin Microbiol 9:461–465https://doi.org/10.1016/j.mib.2006.07.002PubMedGoogle Scholar
- [21]Fitness costs of plasmids: A limit to plasmid transmissionMicrobiol Spectrum 5https://doi.org/10.1128/microbiolspec.mtbp-0016-2017PubMedGoogle Scholar
- [22]Distribution of fitness effects caused by random insertion mutations in Escherichia coliGenetica 102:349https://doi.org/10.1023/a:1017031008316PubMedGoogle Scholar
- [23]Estimates of the rate and distribution of fitness effects of spontaneous mutation in Saccharomyces cerevisiaeGenetics 157:53–61https://doi.org/10.1093/genetics/157.1.53PubMedGoogle Scholar
- [24]Fitness effects of mutations in bacteriaJ Mol Microbiol Biotechnol 21:20–35https://doi.org/10.1159/000332747PubMedGoogle Scholar
- [25]Experimental studies of natural selection in bacteriaAnnu Rev Ecol Syst 21:373–398https://doi.org/10.1146/annurev.es.21.110190.002105Google Scholar
- [26]Long-term experimental evolution in Escherichia coli. I. Adaptation and divergence during 2,000 generationsAm Nat 138:1315–1341https://doi.org/10.1086/285289Google Scholar
- [27]Adaptive divergence in experimental populations of Pseudomonas fluorescens. I. Genetic and phenotypic bases of wrinkly spreader fitnessGenetics 161:33–46https://doi.org/10.1093/genetics/161.1.33PubMedGoogle Scholar
- [28]Daily transfers, archiving populations, and measuring fitness in the long-term evolution experiment with Escherichia coliJoVE 198https://doi.org/10.3791/65342PubMedGoogle Scholar
- [29]High-resolution yeast phenomics resolves different physiological features in the saline responseProc Natl Acad Sci USA 100:15724–15729https://doi.org/10.1073/pnas.2435976100PubMedGoogle Scholar
- [30]Quantifying the determinants of evolutionary dynamics leading to drug resistancePLOS Biol 13https://doi.org/10.1371/journal.pbio.1002299PubMedGoogle Scholar
- [31]Genomewide phenotypic analysis of growth, cell morphogenesis, and cell cycle events in Escherichia coliMol Syst Biol 14:e7573https://doi.org/10.15252/msb.20177573PubMedGoogle Scholar
- [32]Malthusian parameters as estimators of the fitness of microbes: A cautionary tale about the low side of high throughputPLOS One 10https://doi.org/10.1371/journal.pone.0126915PubMedGoogle Scholar
- [33]Competition dynamics in long-term propagations of Schizosaccharomyces pombe strain communitiesEcol Evol 11:15085–15097https://doi.org/10.1002/ece3.8191PubMedGoogle Scholar
- [34]Predicting microbial growth in a mixed culture from growth curve dataProc Natl Acad Sci USA 116:14698–14707https://doi.org/10.1073/pnas.1902217116PubMedGoogle Scholar
- [35]Growthcurver: An R package for obtaining interpretable metrics from microbial growth curvesBMC Bioinform 17:172https://doi.org/10.1186/s12859-016-1016-7PubMedGoogle Scholar
- [36]Case studies in the assessment of microbial fitness: Seemingly subtle changes can have major effects on phenotypic outcomesJ Mol Evol 91:311–324https://doi.org/10.1007/s00239-022-10087-9PubMedGoogle Scholar
- [37]Predicting bacterial interaction outcomes from monoculture growth and supernatant assaysISME Commun 4:iycae045https://doi.org/10.1093/ismeco/ycae045PubMedGoogle Scholar
- [38]Best practices in designing, sequencing, and identifying random DNA barcodesJ Mol Evol 91:263–280https://doi.org/10.1007/s00239-022-10083-zPubMedGoogle Scholar
- [39]Transposon insertion sequencing: A new tool for systems-level analysis of microorganismsNat Rev Microbiol 11:435–442https://doi.org/10.1038/nrmicro3033PubMedGoogle Scholar
- [40]Import of aspartate and malate by DcuABC drives H2/fumarate respiration to promote initial Salmonella gut-lumen colonization in miceCell Host Microbe 27:922–936https://doi.org/10.1016/j.chom.2020.04.013PubMedGoogle Scholar
- [41]Higher-fitness yeast genotypes are less robust to deleterious mutationsScience 366:490–493https://doi.org/10.1126/science.aay4199PubMedGoogle Scholar
- [42]Quantifying the local adaptive landscape of a nascent bacterial communityNat Commun 14https://doi.org/10.1038/s41467-022-35677-5PubMedGoogle Scholar
- [43]Mutualism reduces the severity of gene disruptions in predictable ways across microbial communitiesISME J 17:2270–2278https://doi.org/10.1038/s41396-023-01534-6PubMedGoogle Scholar
- [44]Beyond genome sequencing: Lineage tracking with barcodes to study the dynamics of evolution, infection, and cancerGenomics 104:417–430https://doi.org/10.1016/j.ygeno.2014.09.005PubMedGoogle Scholar
- [45]Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in EnglandScience 372https://doi.org/10.1126/science.abg3055PubMedGoogle Scholar
- [46]Inferring epistasis from genetic time-series dataMol Biol Evol 39https://doi.org/10.1093/molbev/msac199PubMedGoogle Scholar
- [47]Population immunity predicts evolutionary trajectories of SARS-CoV-2Cell 186:5151–5164https://doi.org/10.1016/j.cell.2023.09.022PubMedGoogle Scholar
- [48]Development of a comprehensive genotype-to-fitness map of adaptation-driving mutations in yeastCell 166:1585–1596https://doi.org/10.1016/j.cell.2016.08.002PubMedGoogle Scholar
- [49]Single nucleotide mapping of trait space reveals Pareto fronts that constrain adaptationNat Ecol Evol 3:1539–1551https://doi.org/10.1038/s41559-019-0993-0PubMedGoogle Scholar
- [50]Fitness variation across subtle environmental perturbations reveals local modularity and global pleiotropy of adaptationeLife 9:e61271https://doi.org/10.7554/eLife.61271PubMedGoogle Scholar
- [51]Dynamics and variability in the pleiotropic effects of adaptation in laboratory budding yeast populationseLife 10:e70918https://doi.org/10.7554/eLife.70918PubMedGoogle Scholar
- [52]Distinguishing mutants that resist drugs via different mechanisms by examining fitness tradeoffseLife 13:RP94144https://doi.org/10.7554/eLife.94144PubMedGoogle Scholar
- [53]The distribution of fitness effects of new beneficial mutations in Pseudomonas fluorescensBiol Lett 7:98–100https://doi.org/10.1098/rsbl.2010.0547PubMedGoogle Scholar
- [54]The fitness landscape of a tRNA geneScience 352:837–840https://doi.org/10.1126/science.aae0568PubMedGoogle Scholar
- [55]Quantifying variation in bacterial reproductive fitness: A high-throughput methodmSystems 6:e01323–20https://doi.org/10.1128/msystems.01323-20PubMedGoogle Scholar
- [56]Fitness land-scape analysis of a tRNA gene reveals that the wild type allele is sub-optimal, yet mutationally robustMol Biol Evol 39:msac178https://doi.org/10.1093/molbev/msac178PubMedGoogle Scholar
- [57]Distribution of fixed beneficial mutations and the rate of adaptation in asexual populationsProc Natl Acad Sci USA 109:4950–4955https://doi.org/10.1073/pnas.1119910109PubMedGoogle Scholar
- [58]Beneficial mutation–selection balance and the effect of linkage on positive selectionGenetics 176:1759–1798https://doi.org/10.1534/genetics.106.067678PubMedGoogle Scholar
- [59]A statistical guide to the design of deep mutational scanning experimentsGenetics 204:77–87https://doi.org/10.1534/genetics.116.190462PubMedGoogle Scholar
- [60]Resolving deleterious and near-neutral effects requires different pooled fitness assay designsJ Mol Evol 91:325–333https://doi.org/10.1007/s00239-023-10110-7PubMedGoogle Scholar
- [61]The propensity interpretation of fitnessPhil Sci 46:263–286https://doi.org/10.1086/288865Google Scholar
- [62]The arithmetic mean of what? A cautionary tale about the use of the geometric mean as a measure of fitnessBiol Phil 37https://doi.org/10.1007/s10539-022-09843-4Google Scholar
- [63]Book review: Two new books on life-history evolutionQ Rev Biol 68:262–264Google Scholar
- [64]Evolution in Age-Structured PopulationsCambridge: Cambridge University Press Google Scholar
- [65]Evolutionary rescue: Linking theory for conservation and medicineEvol Appl 7:1161–1179https://doi.org/10.1111/eva.12221PubMedGoogle Scholar
- [66]On the probability of fixation of mutant genes in a populationGenetics 47:713–719https://doi.org/10.1093/genetics/47.6.713PubMedGoogle Scholar
- [67]The fixation probability of beneficial mutationsJ R Soc Interface 5:1279–1289https://doi.org/10.1098/rsif.2008.0248PubMedGoogle Scholar
- [68]On the definition and measurement of fitness in finite populationsJ Theor Biol 419:36–43https://doi.org/10.1016/j.jtbi.2016.12.024PubMedGoogle Scholar
- [69]Towards a mechanistic foundation of evolutionary theoryeLife 6:e23804https://doi.org/10.7554/eLife.23804PubMedGoogle Scholar
- [70]The mutant selection window and antimicrobial resistanceJ Antimicrob Chemother 52:11–17https://doi.org/10.1093/jac/dkg269PubMedGoogle Scholar
- [71]High mutation rates limit evolutionary adaptation in Escherichia coliPLOS Genet 14:e1007324https://doi.org/10.1371/journal.pgen.1007324PubMedGoogle Scholar
- [72]Metabolic fitness landscapes predict the evolution of antibiotic resistanceNat Ecol Evol 5:677–687https://doi.org/10.1038/s41559-021-01397-0PubMedGoogle Scholar
- [73]Environmental modulation of global epistasis in a drug resistance fitness landscapeNat Commun 14https://doi.org/10.1038/s41467-023-43806-xPubMedGoogle Scholar
- [74]Empirical fitness landscapes and the predictability of evolutionNat Rev Genet 15:480–490https://doi.org/10.1038/nrg3744PubMedGoogle Scholar
- [75]Epistasis and adaptation on fitness landscapesAnnu Rev Ecol Evol Syst 53:457–479https://doi.org/10.1146/annurev-ecolsys-102320-112153Google Scholar
- [76]Resource CompetitionBoston: Springer Google Scholar
- [77]Changes in the genetic requirements for microbial interactions with increasing community complexityeLife 7:e37072https://doi.org/10.7554/eLife.37072PubMedGoogle Scholar
- [78]Higher-order interactions shape microbial interactions as microbial community complexity increasesSci Rep 12https://doi.org/10.1038/s41598-022-25303-1PubMedGoogle Scholar
- [79]Alternative fitness models with the same allele frequency dynamicsGenetics 125:201–205https://doi.org/10.1093/genetics/125.1.201PubMedGoogle Scholar
- [80]The Genetical Theory of Natural SelectionOxford: Clarendon Press Google Scholar
- [81]Modeling of the bacterial growth curveAppl Environ Microbiol 56:1875–1881https://doi.org/10.1128/aem.56.6.1875-1881.1990PubMedGoogle Scholar
- [82]Experimental evolution as a high-throughput screen for genetic adaptationsmSphere 3:e00121–18https://doi.org/10.1128/msphere.00121-18PubMedGoogle Scholar
- [83]Transkingdom control of microbiota diurnal oscillations promotes metabolic homeostasisCell 159:514–529https://doi.org/10.1016/j.cell.2014.09.048PubMedGoogle Scholar
- [84]High resolution time series reveals cohesive but short-lived communities in coastal planktonNat Commun 9https://doi.org/10.1038/s41467-017-02571-4PubMedGoogle Scholar
- [85]Complex marine microbial communities partition metabolism of scarce resources over the diel cycleNat Ecol Evol 6:218–229https://doi.org/10.1038/s41559-021-01606-wPubMedGoogle Scholar
- [86]Methods to Study Fitness and Compensatory Adaptation in Plasmid-Carrying BacteriaIn:
- De La Cruz F.
- [87]Transforming yeast into a facultative photo-heterotroph via expression of vacuolar rhodopsinCurr Biol 34:648–654https://doi.org/10.1016/j.cub.2023.12.044PubMedGoogle Scholar
- [88]Evolution of a minimal cellNature 620:122–127https://doi.org/10.1038/s41586-023-06288-xPubMedGoogle Scholar
- [89]Modelling and simulating Lenski’s long-term evolution experimentTheor Popul Biol 127:58–74https://doi.org/10.1016/j.tpb.2019.03.006PubMedGoogle Scholar
- [90]Community interactions drive the evolution of antibiotic tolerance in bacteriaProc Natl Acad Sci USA 120:e2209043119https://doi.org/10.1073/pnas.2209043119PubMedGoogle Scholar
- [91]On the use of selection rate versus relative fitnesshttps://lenski.mmg.msu.edu/ecoli/srvsrf.html
- [92]Estimating taxon-specific population dynamics in diverse microbial communitiesEcosphere 9:e02090https://doi.org/10.1002/ecs2.2090Google Scholar
- [93]Absolute quantification and genome-centric analyses elucidate the dynamics of microbial populations in anaerobic digestersWater Res 224https://doi.org/10.1016/j.watres.2022.119049PubMedGoogle Scholar
- [94]Trade-offs between microbial growth phases lead to frequency-dependent and non-transitive selectionProc R Soc B 285:20172459https://doi.org/10.1098/rspb.2017.2459PubMedGoogle Scholar
- [95]Growth tradeoffs produce complex microbial communities on a single limiting resourceNat Commun 9:3214https://doi.org/10.1038/s41467-018-05703-6PubMedGoogle Scholar
- [96]Functional profiling of the Saccharomyces cerevisiae genomeNature 418:387–391https://doi.org/10.1038/nature00935PubMedGoogle Scholar
- [97]Population genetics and evolutionarXiv https://doi.org/10.48550/arxiv.1803.08474Google Scholar
- [98]Long-term experimental evolution in Escherichia coli. II. Changes in life-history traits during adaptation to a seasonal environmentAm Nat 144:432–456Google Scholar
- [99]Changes in cell size and shape during 50,000 generations of experimental evolution with Escherichia coliJ Bacteriol 203:e00469–20https://doi.org/10.1128/jb.00469-20PubMedGoogle Scholar
- [100]Community structure follows simple assembly rules in microbial microcosmsNat Ecol Evol 1:0109https://doi.org/10.1038/s41559-017-0109PubMedGoogle Scholar
- [101]Improving the accuracy of bulk fitness assays by correcting barcode processing biasesMol Biol Evol 41:msae152https://doi.org/10.1093/molbev/msae152PubMedGoogle Scholar
- [102]Extreme sensitivity of fitness to environmental conditions: Lessons from #1BigBatchJ Mol Evol 91:293–310https://doi.org/10.1007/s00239-023-10114-3PubMedGoogle Scholar
- [103]Tn-seq: High-throughput parallel sequencing for fitness and genetic interaction studies in microorganismsNat Methods 6:767–772https://doi.org/10.1038/nmeth.1377PubMedGoogle Scholar
- [104]The dynamics of molecular evolution over 60,000 generationsNature 551:45–50https://doi.org/10.1038/nature24287PubMedGoogle Scholar
- [105]Unbiased fitness estimation of pooled barcode or amplicon sequencing studiesCell Syst 7:521–525https://doi.org/10.1016/j.cels.2018.09.004PubMedGoogle Scholar
- [106]Shifts in growth strategies reflect trade-offs in cellular economicsMol Syst Biol 5:MSB200982https://doi.org/10.1038/msb.2009.82PubMedGoogle Scholar
- [107]Control of key metabolic intersections in Bacillus subtilisNat Rev Microbiol 5:917–927https://doi.org/10.1038/nrmicro1772PubMedGoogle Scholar
- [108]Mitochondrial efficiency determines Crabtree effect across yeastsbioRxiv https://doi.org/10.1101/2024.11.01.621473Google Scholar
- [109]The acetate switchMicrobiol Mol Biol Rev 69:12–50https://doi.org/10.1128/mmbr.69.1.12-50.2005PubMedGoogle Scholar
- [110]Active pH regulation facilitates Bacillus subtilis biofilm development in a minimally buffered environmentmBio 15:e03387–23https://doi.org/10.1128/mbio.03387-23PubMedGoogle Scholar
- [111]Experimental and analytical approaches for improving the resolution of randomly barcoded transposon insertion sequencing (RB-TnSeq) studiesACS Synth Biol 11:2015–2021https://doi.org/10.1021/acssynbio.2c00119PubMedGoogle Scholar
- [112]PROPHECY—A database for high-resolution phenomicsNucl Acids Res 33:D369–D373https://doi.org/10.1093/nar/gki126PubMedGoogle Scholar
- [113]The struggle for existence. How the notion of carrying capacity obscures the links between demography, Darwinian evolution and speciationEvol Ecol Res 14:627–665Google Scholar
- [114]On measuring selection in experimental evolutionBiol Lett 7:210–213https://doi.org/10.1098/rsbl.2010.0580PubMedGoogle Scholar
- [115]Frequency-dependent selection in bacterial populationsPhil Trans R Soc B 319:459–472https://doi.org/10.1098/rstb.1988.0059PubMedGoogle Scholar
- [116]Population genetics and evolutionarXiv https://doi.org/10.48550/arxiv.1803.08474Google Scholar
- [117]Long-term experimental evolution in Escherichia coli. I. Adaptation and divergence during 2,000 generationsAm Nat 138:1315–1341https://doi.org/10.1086/285289Google Scholar
- [118]High-resolution yeast phenomics resolves different physiological features in the saline responseProc Natl Acad Sci USA 100:15724–15729https://doi.org/10.1073/pnas.2435976100PubMedGoogle Scholar
- [119]Inferring time derivatives including cell growth rates using Gaussian processesNat Commun 7https://doi.org/10.1038/ncomms13766PubMedGoogle Scholar
- [120]SciPy 1.0: Fundamental algorithms for scientific computing in PythonNat Methods 17:261–272https://doi.org/10.1038/s41592-020-0772-5Google Scholar
- [121]Trade-offs between microbial growth phases lead to frequency-dependent and non-transitive selectionProc R Soc B 285:20172459https://doi.org/10.1098/rspb.2017.2459PubMedGoogle Scholar
- [122]Growth tradeoffs produce complex microbial communities on a single limiting resourceNat Commun 9:3214https://doi.org/10.1038/s41467-018-05703-6PubMedGoogle Scholar
- [123]Long-term dynamics of adaptation in asexual populationsScience 342:1364–1367https://doi.org/10.1126/science.1243357PubMedGoogle Scholar
- [124]Daily transfers, archiving populations, and measuring fitness in the long-term evolution experiment with Escherichia coliJoVE 198:e65342https://doi.org/10.3791/65342PubMedGoogle Scholar
- [125]The dynamics of molecular evolution over 60,000 generationsNature 551:45–50https://doi.org/10.1038/nature24287PubMedGoogle Scholar
- [126]Long-term experimental evolution in Escherichia coli. II. Changes in life-history traits during adaptation to a seasonal environmentAm Nat 144:432–456Google Scholar
- [127]Changes in cell size and shape during 50,000 generations of experimental evolution with Escherichia coliJ Bacteriol 203:e00469–20https://doi.org/10.1128/jb.00469-20PubMedGoogle Scholar
- [128]Evolution and extinction dynamics in rugged fitness landscapesInt J Mod Phys B 12:361–391https://doi.org/10.1142/s0217979298000259Google Scholar
- [129]Evolution in random fitness landscapes: The infinite sites modelJ Stat Mech 2008https://doi.org/10.1088/1742-5468/2008/04/p04014Google Scholar
- [130]Stochastic tunneling across fitness valleys can give rise to a logarithmic long-term fitness trajectorySci Adv 5:eaav3842https://doi.org/10.1126/sciadv.aav3842PubMedGoogle Scholar
- [131]The effect of weak clonal interference on average fitness trajectories in the presence of macroscopic epistasisGenetics 220:iyac028https://doi.org/10.1093/genetics/iyac028PubMedGoogle Scholar
- [132]How microscopic epistasis and clonal interference shape the fitness trajectory in a spin glass model of microbial long-term evolutioneLife 12:RP87895https://doi.org/10.7554/eLife.87895PubMedGoogle Scholar
- [133]High mutation rates limit evolutionary adaptation in Escherichia coliPLOS Genet 14:e1007324https://doi.org/10.1371/journal.pgen.1007324PubMedGoogle Scholar
- [134]Case studies in the assessment of microbial fitness: Seemingly subtle changes can have major effects on phenotypic outcomesJ Mol Evol 91:311–324https://doi.org/10.1007/s00239-022-10087-9PubMedGoogle Scholar
- [135]Predicting bacterial interaction outcomes from monoculture growth and supernatant assaysISME Commun 4:ycae045https://doi.org/10.1093/ismeco/ycae045PubMedGoogle Scholar
- [136]Toxicity drives facilitation between 4 bacterial speciesProc Natl Acad Sci USA 116:15979–15984https://doi.org/10.1073/pnas.1906172116PubMedGoogle Scholar
- [137]Classifying interactions in a synthetic bacterial community is hindered by inhibitory growth mediummSystems 7:e00239–22https://doi.org/10.1128/msystems.00239-22PubMedGoogle Scholar
- [138]Growthcurver: An R package for obtaining interpretable metrics from microbial growth curvesBMC Bioinform 17:172https://doi.org/10.1186/s12859-016-1016-7PubMedGoogle Scholar
- [139]GNU ParallelZenodo https://doi.org/10.5281/zenodo.1146014
- [140]Fitness effects of mutations in bacteriaJ Mol Microbiol Biotechnol 21:20–35https://doi.org/10.1159/000332747PubMedGoogle Scholar
- [141]Why there are so many definitions of fitness in modelsEcoEvoRxiv https://doi.org/10.32942/x2v61tGoogle Scholar
- [142]Malthusian parameters as estimators of the fitness of microbes: A cautionary tale about the low side of high throughputPLOS One 10:e0126915https://doi.org/10.1371/journal.pone.0126915PubMedGoogle Scholar
- [143]Competition dynamics in long-term propagations of Schizosaccharomyces pombe strain communitiesEcol Evol 11:15085–15097https://doi.org/10.1002/ece3.8191PubMedGoogle Scholar
- [144]Quantitative evolutionary dynamics using high-resolution lineage trackingNature 519:181–186https://doi.org/10.1038/nature14279PubMedGoogle Scholar
- [145]Rapid quantification of mutant fitness in diverse bacteria by sequencing randomly bar-coded transposonsmBio 6:e00306–15https://doi.org/10.1128/mbio.00306-15PubMedGoogle Scholar
- [146]Mutant phenotypes for thousands of bacterial genes of unknown functionNature 557:503–509https://doi.org/10.1038/s41586-018-0124-0PubMedGoogle Scholar
- [147]Changes in the genetic requirements for microbial interactions with increasing community complexityeLife 7:e37072https://doi.org/10.7554/eLife.37072PubMedGoogle Scholar
- [148]Bacterial–fungal interactions revealed by genome-wide analysis of bacterial mutant fitnessNat Microbiol 6:87–102https://doi.org/10.1038/s41564-020-00800-zPubMedGoogle Scholar
- [149]Distinguishing mutants that resist drugs via different mechanisms by examining fitness tradeoffseLife 13:RP94144https://doi.org/10.7554/eLife.94144PubMedGoogle Scholar
- [150]Mutualism reduces the severity of gene disruptions in predictable ways across microbial communitiesISME J 17:2270–2278https://doi.org/10.1038/s41396-023-01534-6PubMedGoogle Scholar
- [151]Higher-fitness yeast genotypes are less robust to deleterious mutationsScience 366:490–493https://doi.org/10.1126/science.aay4199PubMedGoogle Scholar
- [152]High-resolution lineage tracking reveals travelling wave of adaptation in laboratory yeastNature 575:494–499https://doi.org/10.1038/s41586-019-1749-3PubMedGoogle Scholar
- [153]Dynamics and variability in the pleiotropic effects of adaptation in laboratory budding yeast populationseLife 10https://doi.org/10.7554/elife.70918PubMedGoogle Scholar
- [154]Changing fitness effects of mutations through long-term bacterial evolutionScience 383https://doi.org/10.1126/science.add1417PubMedGoogle Scholar
- [155]Tn-seq: High-throughput parallel sequencing for fitness and genetic interaction studies in microorganismsNat Methods 6:767–772https://doi.org/10.1038/nmeth.1377PubMedGoogle Scholar
Article and author information
Author information
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
Cite all versions
You can cite all versions using the DOI https://doi.org/10.7554/eLife.102635. This DOI represents all versions, and will always resolve to the latest one.
Copyright
© 2024, Justus Wilhelm Fink & Michael Manhart
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 967
- downloads
- 54
- citations
- 2
Views, downloads and citations are aggregated across all versions of this paper published by eLife.