Abstract
Microbial collectives, capable of functions beyond the reach of individual populations, can be enhanced through artificial selection. However, this process presents unique challenges. Here, we explore the ‘waterfall’ phenomenon, a metaphor describing how the success in achieving a desired genotype or species composition in microbial collectives can depend on both the target characteristics and initial conditions. We focus on collectives comprising fast-growing (F) and slow-growing (S) types, aiming to achieve specific S frequencies. Through simulations and analytical calculations, we show that intermediate target S frequencies might be elusive, akin to maintaining a raft’s position within a waterfall, rather than above or below it. This challenge arises because intra-collective selection, favoring F during growth, is the strongest at intermediate S frequencies, which can overpower counteracting inter-collective selection effects. Achieving low target S frequencies is consistently possible as expected, but high target S frequencies require an initially high S frequency — similar to a raft that can descend but not ascend a waterfall. The range of attainable target frequencies is significantly influenced by the initial population size of the collectives, while the number of collectives under selection plays a less critical role. In scenarios involving more than two types, the evolutionary trajectory must navigate entirely away from the metaphorical ‘waterfall drop.’ Our findings illustrate that the strength of intra-collective evolution is frequency-dependent, with implications in experimental planning.
Introduction
Microbial collectives can carry out functions that arise from interactions among member species. These functions, such as waste degradation (1, 2), probiotics (3), and vitamin production (4), can be useful for human health and biotechnology. To improve collective functions, one can perform artificial selection (directed evolution) on collectives: Low-density “Newborn” collectives are allowed to “mature” during which cells proliferate and possibly mutate, and community function develops. “Adult” collectives with high functions are then chosen to reproduce, each seeding multiple offspring Newborns. Artificial selection of collectives have been attempted both in experiments (5–19) and in simulations (20–30), often with unimpressive outcomes.
One of the major challenges in selecting collectives is to ensure the inheritance of a collective function (31, 32). Inheritance can be compromised when genotype and species compositions change from a parent collective to offspring collectives. During maturation of a collective, genotype compositions within each species can be altered by intra-collective evolution, while species compositions can be shifted by ecological interactions. Furthermore, during reproduction of a collective, genotype and species compositions of offspring can vary stochastically from those of the parent.
Here, we consider the selection of collectives of two types with different growth rates to achieve a taget composition in the Adult collective. Earlier work has demonstrated that nearly any target species composition can be achieved when selecting communities of two competing species with unequal growth rates (24, 33), so long as the shared resource is depleted during collective maturation (24). In this case, both species initially evolved to grow faster, and the slower-growing species was preserved due to stochastic fluctuations in species composition during collective reproduction. Eventually, both species evolved to grow sufficiently fast to deplete the shared resource during collective maturation, and evolution in competition coefficients then acted to stabilize species ratio to the target value (24).
Here, we mathematically examine the selection of target type compositions in two-type collectives when nutrient is always in excess. This allows us to analytically examine the evolutionary tipping point between intra-collective and inter-collective selection. We show that this tipping point creates a “waterfall” effect which restricts not only which target compositions are achievable, but also the initial composition required to achieve the target. We also investigate how the range of achievable target composition is affected by the population size in Newborns and the total number of collectives under selection. Finally, we show that the waterfall phenomenon extends to systems with more than 2 types.
Results and Discussions
We start with selecting for a target composition in collectives of fast-growing F and slow-growing S types. We show that intermediate F frequencies or equivalently, intermediate S frequencies, are the hardest targets to achieve. We then show that similar conclusions hold when selecting for a target composition in collectives of more than two types. We mainly describe in terms of F frequency f = F/(S + F), which is related to S frequency s = 1 − f .
A selection cycle
A selection cycle (Fig. 1) starts with a total of g Newborn collectives. We consider two genotypes - slow-growing type (S) and fast-growing type (F), with S mutating to F at a rate μ. At the beginning of cycle k (t = 0), a collective has a fixed total cell number

Schematic for artificial selection on collectives.
Each selection cycle begins with a total of g Newborn collectives (black open circles), each with N0 total cells of slow-growing type (S, red-colored dots) and fast-growing type (F, blue-colored dots). During maturation (over time τ), S and F cells divide at rates r and r + ω (ω > 0), respectively. S mutates to F at rate μ. In the selection stage, the Adult collective with F frequency f closest to the target composition
Collectives are allowed to grow for time τ (‘Maturation’ in Fig. 1). During maturation, S and F grow at rates r and r + ω (ω > 0), respectively. If maturation time τ is too small, matured collectives (“Adults”) do not have enough cells to reproduce g Newborn collectives with N0 cells. On the other hand, if maturation time τ is too long, fast-growing F will take over. Hence we set the maturation time τ = ln(g + 1)/r, which guarantees sufficient cells to produce g Newborn collectives from a single Adult collective.
At the end of a cycle, the Adult with the highest function is chosen to reproduce g Newborn collectives each with N0 cells (‘Selection’ and ‘Reproduction’ in Fig. 1). This allows us to employ the simplest version of the extreme value theory, and in our case, choosing top 1 outperforms a less stringent “choosing top 5%” selection regime. Collective function is dictated by the F frequency f, and is maximized at the target value
The success of collective selection is constrained by the target composition, and sometimes also by the initial composition
Since intra-collective selection favors F, we expect that a higher target
We fixed N0,the total population size of a Newborn, to 1000, and obtained selection dynamics for various initial and target F frequencies by implementing stochastic simulations (Sec. 1 in Supplementary Information). If the target value is high (e.g.

Initial and target compositions determine the success of artificial selection on collectives.
(a-c) Mutant frequency of the selected Adult collective (f ∗) over cycles. The target frequency
In contrast, an intermediate target frequency (e.g.
If the target frequency is low (e.g.
In summary, two regions of target frequencies are “accessible” in the sense that a target value in the region can be maintained once reached; gold in Fig. 2d, e): (1) target frequencies above
Intra-collective evolution is the fastest at intermediate F frequencies, creating the “waterfall” phenomenon
To understand what gives rise to the two accessible regions, we calculated how the distribution of F frequency f might change after one cycle. We consider the case where
Mathematically, the conditional probability distribution Ψ of
Equation 1 can be broken down into two parts: (i)
When we approximate the Adults’ f in cycle k + 1 as Guassian with mean
where
and
e is Euler’s number (2.71828…), and Φ−1(…) is an inverse cumulative function of standard normal distribution. For those unfamiliar with Φ−1(…), y = Φ−1(x) (where 0 ≤ x ≤ 1) is the number such that the probability of a standard normal random variable being less than or equal to y is x (e.g. 0 = Φ−1(0.5)). Eq. [2] contains all experimental parameters, including g (the total number of collectives under selection), τ (maturation time), μ (mutation rate), r (growth rate of the slow-growing S type), and ω (growth advantage of F over S). Equation [2] has the same shape as results from numerically integrating Eq. [1] and from stochastic simulations (Fig. 3a).

Intra-collective selection and inter-collective selection jointly set the boundaries for selection success.
a The change in F frequency over one cycle. When
F frequency can be reduced if the selected
Not surprisingly, similar conclusions are derived where S and F are slow-growing and fast-growing species which cannot be converted through mutations (Sec. 3 and Fig. S7 in Supplementary Infomation).
Overall, inter-collective selection is akin to a raftman, rowing the raft to a target, while intra-collective selection is akin to a waterfall. This metaphore is best understood in terms of S frequency s = 1 − f . Then, the lower-threshold f L corresponds to higher-threshold in sH = 1 − f L, while higher-threshold f H corresponds to lower-threshold in sL = 1 − f H . Intra-collective selection is akin to a waterfall, driving the S frequency s from high to low (Fig. 2f). Intra-collective selection acts the strongest when s is intermediate (sL < s < sH), similar to the vertical drop of the fall. Intra-collective selection acts weakly at high (> sH) or low (< sL) s, similar to the gentle sloped upper and lower pools of the fall (regions 1 and 2 of Fig. 2d, f and Fig. 3a). Thus, an intermediate target frequency can be impossible to achieve - a raft starting from the upper pool will be flushed down to sL (f H), while a raft starting from the lower pool cannot go beyond sL (f H). In contrast, a low target S frequency (in the lower pool) is always achievable. Finally, a high target S frequency (in the upper pool) can only be achieved if starting from the upper pool (as the raft can not jump to the upper pool if starting from below).
Manipulating experimental setups to expand the achievable target region
Variation among collectives is a key element for successful selection. Here, variation among collectives originates from the stochastic nature of growth and mutation during collective maturation, and of sampling effects during collective reproduction. From Eq. [4], the variance of f at time τ

Expanding the success region for artificial collective selection.
a Reducing the population size in Newborn N0 expands the region of success. In gold area, the probability that
The number of collectives g also affects the selection outcomes. With more collectives, the chance to find a f closer to the target is more likely. Thus, a larger number of collectives broadens the region of success (Fig. 4b). However, the effect of g is not dramatic. To see why, we note that the only place that g appears is Eq. [2] in the form of
The waterfall phenomenon in a higher dimension
In contrast to the above two-type case where the evolutionary trajectory travels along a one-dimensional line from the initial frequency to the final frequency, in a three-type case the trajectory travels in a two-dimensional plane. To examine the waterfall effect in a higher dimension, we investigate a three-type system where a faster-growing type (FF) grows faster than the fast-growing type (F) which grows faster than the slow-growing type (S) (Fig. 5a and Sec. 6 in Supplementary Information). In this system, a target type composition can be achieved if inter-collective selection reduces the frequencies of F as well as FF (accessible region, gold in Fig. 5b). Strikingly, a target composition in an accessible region may not be achievable even when the initial composition is within the same region: once the composition escapes the accessible region, the trajectory cannot return back to the accessible region (Fig. 5biii, the leftmost initial condition). However, if the initial position is closer to the target in the accessible region, the target becomes achievable (Fig. 5biii, initial condition near the bottom).

In higher dimensions, the success of artificial selection requires the entire evolutionary trajectory remaining in the accessible region.
a During collective maturation, a slow-growing type (S) (with growth rate r; dark red) can mutate to a fast-growing type (F) (with growth rate r + ω; blue), which can mutate further into a faster-growing type (FF) (with growth rate r + 2ω; purple). Here, the rates of both mutational steps are μ, and ω > 0. b Evolutionary trajectories from various initial compositions (open circles) to various targets. Intra-collective evolution favors FF over F (vertical blue arrow) over S (horizontal blue arrow). The accessible regions are marked gold (see Sec. 1 in Supplementary Information). We obtain final compositions starting from several initial compositions while aiming for different target compositions in i, ii, and iii. The evolutionary trajectories are shown in dots with color gradients from the initial to final time. (i) A target composition with a high FF frequency is always achievable. (ii) A target composition with intermediate FF frequency is never achievable. (iii) A target composition with low FF frequency is achievable only if starting from an appropriate initial composition such that the entire trajectory never meanders away from the accessible region. The figures are drawn using mpltern package (35).
In conclusion, we have investigated the evolutionary trajectories of type composition in collectives under selection, which are governed by intra-collective selection (which favors fast-growing types) and inter-collective selection (which, in our case, strives to counter fast-growing types). Intra-collective selection has the strongest effect at intermediate frequencies of faster-growing types, potentially creating an inaccessible region analogous to the vertical drop of a waterfall. High and low target frequencies are both accessible, analogous to the lower- and the upper-pools of a waterfall, respectively. A less challenging target (low ŝ)is achievable from any initial position. In contrast, a more challenging target (high ŝ) is only achievable if the entire trajectory is contained within the region, similar to a raft striving to reach a point in the upper-pool must start at and remain in the upper pool. Our work suggests that the strength of intra-collective selection is not constant, and that strategically choosing an appropriate starting point can be essential for successful collective selection.
Materials and Methods
Stochastic simulations
A selection cycle is composed of three steps: maturation, selection, and reproduction. At the beginning of the cycle k, a collective i has
Analytical approach to the conditional probability
The conditional probability distribution Ψ
After the reproduction step, the Newborn collectives grow for time τ . The frequency is changed from the given frequency ζ to f by division and mutation processes. We assume that the frequency f of an Adult is also approximated by a Gaussian random variable
The conditional probability distribution of an Adult collective in cycle k + 1 (fk+1,τ) to have frequency f at a given
Since we select the minimum frequency
We assume that the conditional probability distribution in Eq. [7] follows a normal distribution, whose mean and variances are describe by Eq. [3] and Eq. [4]. Then, the extreme value theory (38) estimates the median of the selected Adult by
The F frequency difference in Eq. [2] is obtained by substracting
Data and source code availability
Data and source code of stochastic simulations are available in https://github.com/schwarzg/artificial_selection_collective_composition
Acknowledgements
J. Lee and H.J. Park were supported by the National Research Foundation of Korea grant funded by the Korea government (MSIT), Grant No. RS-2023-00214071 (H.J.P.) and by an appointment to the JRG Program at the APCTP through the Science and Technology Promotion Fund and Lottery Fund of the Korean Government. W. Shou was supported by an Academy of Medical Sciences Professorship and a Royal Society Wolfson Fellowship. This was also supported by the Korean Local Governments– Gyeongsangbuk-do Province and Pohang City and INHA UNIVERSITY Research Grant. We thank Su-Chan Park, Li Xie, and Alex Yuan for constructive comments and discussions.
Supplementary Information
1. Stochastic simulation for the evolution of collective function
In the main text, we design a simple model of artificial selection on collectives. The collectives are composed of two genotypes, slow-growing (S) and fast-growing (F) genotypes. We start with g ‘Newborn’ collectives and each collective i (i = 1, …, g) has
Maturation
In this subsection, we ignore the cycle number index k and collective index (i) in convenience. So we denote S(t) and F (t) as representative notations of
We consider the population growth of S(t) and F (t) as a stochastic process. We use P (S, F, t) to denote the probability to have S S cells and F F at time t from the beginning of a cycle. Birth and mutation events stochastically happen which are represented by the three chemical reaction rules:
where S and F denote individual cells of a S type and a F type, respectively. The reaction rules can be simulated via tau-leaping algorithm (1, 2), but the individual-based simulation requires long simulation times to get enough results. Hence we go over probability distribution P (S, F, t) to sample compositions instead of tracking all events that occurred in individual collectives. First, we calculate the distribution P (S, F, t) analytically, and then sample numbers of S and F cells from this distribution at the end of cycle t = τ . Below, we describe the derivation of P (S, F, t).
The master equation of P (S, F, t), which follows reaction rules Eq. [1-3], is given by
Here, dt is small such that at most one cell birth or mutation event could occur. The first term describes the scenario where a single birth event of a S cell happens during time interval [t, t + dt), which changes the collective’s composition from (S −1, F) to (S, F). Similarly, the second term comes from a birth event of a F cell. The third term indicates the mutation event. The last term corresponds to the outflow of probability density by birth and mutation processes, which induces the changes in composition from (S, F) to others.
Calculating the exact form of P (S, F, t) is not simple. Instead, we consider the probability distributions of S(t) and F (t) at time t. We assume that the mutation rate is much smaller than the growth rates, and hence the correlation between S and F is sufficiently small. Then, P (S, F, t) ∼ P (S, t)P (F, t). The distributions of S and F can be approximated as Gaussian (𝒩), which is supported by the Central Limit Theorem and Figure S1 (note the concordance between the blue and green distributions). That is, the probability distribution
The means are defined by
We assume that the mutation rate μ is very smaller than r and ω. By solving Eq. [5] and Eq. [6], the means
where
We define the second momenta of S and F as
Then, corresponding differential equations are given by
The solution of Eq. [11] is
where
where
The solution of Eq. [16] is given by
By using Eq. [17], the solution of Eq. [12] is given by
where
Using Eq. [7,8,15,19], we construct Gaussian distribution functions for S(t) and F (t),

Comparison between the calculated Gaussian distribution (“Gauss”, with the mean and variances computed from Eq. [7,8,15,19]) and simulations using tau-leaping (“tau”) and sampling (“samp”) methods. The simulations run 500 times. The initial number of cells are S0 = 200 and F0 = 800. The parameters are r = 0.5, ω = 0.03, μ = 0.0001, and τ = 4.8.
Selection
After the maturation step is finished in cycle k, we compute the frequencies of the F cells in each collectives
Reproduction
We divide the selected collective into g Newborn collectives. We sample
After the sampling, the numbers of S cells are set to be
Simulation Result
In the main text, we simulate artificial selection by using stochastic simulation. In Fig. S3, we draw the absolute error d between the target frequency
Figure S4 presents the composition trajectories of all collectives using tau-leaping algorithm. The selected collectives have the closest compotisition to the target composition
2. Conditional probability distribution of the selected collective frequency f *
Since maturation and reproduction are stochastic processes, the frequency
We focus on the situation where the change of frequency in the selection step has the opposite direction of the maturation step. That means the target frequency
Let us start the calculation from the reproduction step at the end of cycle k. The probability distribution of the F cell numbers in Newborn collectives is given in Eq. [21]. If the total number of cells in a Newborn collective N0 is large enough, Eq. [21] is approximated by the Gaussian distribution
The Newborn collective i maturates in cycle k + 1 with the initial cell numbers
where GS (t) and GF (t) are random variable following the standard distribution 𝒩 (0, 1). Here we ignore cycle index k + 1 in subscription for convenience. Then, we can approximately write f (t) as

Comparison between consecutive sampling and independent binomial sampling. A parent collective is divided into 10 collectives. The histogram labeled with ‘MHG’ is the probability mass function of F of the fifth collective sampled via multivariate hypergeometric distribution. The independent binomial sampling is labeled with ‘BN’. The initial numbers of cells are S = 8000 and F = 2000 for the left panel, and S = 20 and F = 5 for the right panel. 10000 samples are drawn for each distribution.

Color map of the absolute error

a Trajectories of F frequency for 10 collectives (g = 10) over time. The collective whose frequency is closest to the target value is selected in every cycle (black lines). The gray lines denote the other collectives. For parameters, we used S growth rate r = 0.5, F growth advantage ω = 0.03, mutation rate μ = 0.0001, maturation time τ ≈ 4.8, and N0 = 1000. b Comparison between frequency trajectories with (black) and without (blue) selection clearly shows the effect of artificial selection. The black line indicates F frequency of the selected collective
The mean of f is given by
and the variance is
The average size of collectives is
With indices of cycle k and k + 1, the conditional probability that a offspring collective has frequency f after the maturation in cycle k + 1 from the parent frequency
After maturation in cycle k + 1, the collective with the smallest frequency
The above Eq. [30] simply states that
Then, the probability density function
We compute the probability density function [32] by using numerical integration and compare it with the stochastic simulation results in Fig. S5. Both distributions agree with each other.
To get the analytic approximation of the median of Eq. [32], we assume that the F frequency distribution of Adult collective is Gaussian form. Thus, instead of calculating marginal probability with ζ variable as in Eq. [29], we only calculate the mean
which are Eq. [3] and Eq.[4] in the main text, respectively.

The probability density functions of the selected collective’s frequency
We note that
The median (Median
where Φ−1(y) is an inverse cumulative density function (CDF) of the normal distribution with the mean
Further, we get an asymptotic expression of Φ−1(ln 2/g) when g is large (or Φ−1(y) with small y). We start from the CDF of the standard normal distribution,
Replacing x2 in the righthand side in Eq. [36] into the expression itself, we get continued logarithmic form of
Inserting = erfc 1(y) (square root of Eq. [37]) into the inverse CDF
3. Selection without mutation
When the mutation rate is zero, two genotypes behave as two distinct species. So the S corresponds to the slower-growing species and the F corresponds to the faster-growing species. Then, our problem is generalized to the selection of two-species collective. The compositional change is provided by Eq. [35] with setting μ = 0. Corresponding
Equations [39] and [40] suggest that when a community consists of two competing species, we obtain similar conclusions on the accessible region for target composition. The stochastic simulation results are present in Fig. S7.
4. Selecting more than one collective
In the main text, we choose one collective which have the closest frequency value to the target among g collectives. Such ‘Top 1’ stratedgy has an advantage on applying extreme value theory while it has potentially lose the ‘unlucky’ Adults (6). Such ‘unlucky’ Adults are further to the target value than the selected one, but have possiblity to be selected in next cycle. The ‘Top tier’ stratedgy may secure the ‘unlucky’ Adult by choosing more than one collective (6). So we test the ‘top-tier’ stratedgy by choosing 5 among 100 Adults with the distance from the target value in Fig. S8. The top-tier stratedgy is shown to be inefficient in the two-genotype system. This is because the system is too simple, so lower ranked Adult is lower again in next cycle.

Scaling relation of F frequency variance (Eq. [27]) with Newborn collective size N0. The initial F frequency is 0.3. The parameters are r = 0.5, ω = 0.03, μ = 0.0001, and τ ≈ 4.8.

Color map of the absolute error

Comparison of selecting Top-tier 5 with Top 1. We breed 100 collectives and choose 5 collectives with the closest to the target value.
5. Deleterious mutation
In the main text, we show the target composition can be achieved in some ranges of initial and target values when the mutation is beneficial to growth. The same analogy can be applied even when the mutation is deleterious. Since the F cells grow slower than the S cells (ω < 0), the F frequency naturally decreases in the maturation step. Then, the conditional probability distribution
If all frequencies are independent and identical distributed random variables, the cumulative distribution function becomes
Likewise in the previous section, we get the conditional probability density function by differentiating Eq. [42] with respect to f and replacing
The distribution in Eq. [43] is evaluated for various
6. Extension to the system including double-mutation
In the main text, we categorize the target frequencies based on whether the frequency change in the maturation step can be compensated in the selection step. Here, we extend the idea into more complex systems in order to suggest a possible generalization. We assume that collectives consist of three genotypes with slow-growing(S), fast-growing(F), and faster-growing(FF) types. The growth rate of S is r. Each mutation adds the growth rate with ω. So the F and FF type have growth rates with r + ω and r + 2ω, respectively. The mutation rate is μ. So, the birth and mutation events are written by the chemical reactions:
Here, FF is an individual of a F type. We write a master equation of the processes for P (S, F, FF, t) which is the probability to have S S cells, F F cells, and FF FF cells at time t.
The composition of collective i in cycle k is now represented with two frequencies
At the reproduction step in cycle k, we choose N0 cells from the selected Adult whose composition is
where the number of S Sk+1,0 is automatically set to be Sk+1,0 = N0 − Fk+1,0 − FFk+1,0. Then, the approximated multivariate normal distribution is

Artificial selection also works for deleterious mutation. a Conditional Probability density functions of
Then a Newborn’s composition ρ(ζ, η) follows the multivariate Gaussian distribution
At the beginning of cycle k, a newborn collective starts to maturate from (S0, F0, FF0) cells (for convenience, cycle index k is dropped.) In terms of (ζ, η), each initial numbers are S0 = N0(1 − ζ− η), F0 = N0ζ, and FF0 = N0η. Their initial covariance matrix is
The initial conditions of the system in coupled Eqs. [53-61] are obtained by the mean and (co)variances of Eq. [50]. By solving equations numerically, we obtain a set of mean cell numbers
Then, the F frequency becomes
where
where
With cycle index k, we get conditional probability of matured collectives
We select the Adult collective among g Adult collectives such that the change in frequencies during maturation could be compensate. During maturation, a freuquency distribution moves different direction in (f, h) space depending on the initial composition
If the mean
The probability
where
Similarly, the probabilities
Then, the conditional probability of the selected collective is given by
where
If the mean
The probability
Thus, the joint cumulative distribution function is given by
In this case, the conditional probability distribution function is given by
By replacing (f, h) to
Using Eq. [79], we get the mean values of f * and h* as
We define the accessible region in frequency space where the signs of the changes in both F frequency and FF frequency after a cycle are opposite to that of maturation (see Fig. S10),
where

a The schematic procedure to determine accessible regions by collective selection. From the given parent composition (blue dot), the probability distribution of offspring Adults (Eq. [68]) is computed (marked in the orange-colored area). From Adult compositions, the probability distribution of the selected collective (Eq.[79]) is computed (marked in the green area). If the signs of the changes in both F frequency and FF frequency after the selection (from blue dot to green dot) are opposite to that of maturation (from blue dot to orange dot), the given composition is accessible. Otherwise, the composition is not accessible and will change after cycles. b The accessible regions are marked by the gray area. The vector field is the flow of compositions during maturation. The length and color of the arrows indicate the speed of composition changes. The figures are drawn using mpltern package (7).
7. Derivation of equations
In this section, we go over the derivation of Eq. [7-27] for readers not equipped with advanced mathematics training.
Assumptions: μ ≪ ω, r
Equations [7] and [8]. Equation [7] is straightforwardly solved by integrating Eq. [5]. Equation [8] is obtained from Eq. [6] using integration factor e−(r+ω)t:
Integrating both sides, we get
Equations [11] and [12]. Applying Eq. [4], we have
We collect the two violet-colored terms and change the order of summation. Note that the first violet-colored term does not change regardless of whether S starts from 0 or 1 because the term is zero for S = 0. Thus, the first violet-colored term is equivalent to
We collect the two green terms, and similarly obtain:
Finally, we collect the two orange terms. For the first orange term, the sum is the same regardless of whether we start from S = 0 or −1. Let S start from −1, and we have
Now, add the three parts together, and we have
which is Eq. [11]. Likewise,
which is Eq. [12].
Equation [13] and [15]. Using integration factor e−2(r−μ)t and Eq. [11], we have:
Since μ ≪ r,we have
where
Equation [16] and [17]. Since
We can solve this, again using the integration factor technique above:
Thus, we have
which results in
Equation [18]. From Eq. [12], we have
The right-hand side becomes
Note that we have checked that the second and third terms of
Then, we have
Equations [25]-[27]. To derive this equation, we use the fact that 1/(1 + x) ∼1 − x for small x. We will omit (t) for simplicity. Also note that we are considering relatively large populations so that the standard deviation is much smaller than the mean.
Recall that if A ∼ 𝒩 (μA, σA), B ∼ 𝒩 (μB, σB), then A − B ∼ 𝒩 (μA − μB,
Note that the initial value of mean
References
- 1.Fast and facile biodegradation of polystyrene by the gut microbial flora of Plesiophthalmus davidis larvaeAppl. Environ. Microbiol 86:e01361–20
- 2.Corrigendum: Insights into plastic biodegradation: community composition and functional capabilities of the superworm (Zophobas morio) microbiome in styrofoam feeding trialsMicrob. Genomics 8
- 3.Synthetic biology approaches to engineer probiotics and members of the human microbiota for biomedical applicationsAnnu. review biomedical engineering 20:277–300
- 4.Reorganization of a synthetic microbial consortium for one-step vitamin c fermentationMicrob. Cell Factories 15:21
- 5.Experimental studies of community evolution i: The response to selection at the community levelEvolution 44:1614–1624
- 6.Experimental studies of community evolution ii: The ecological basis of the response to community selectionEvolution 44:1625–1636
- 7.Artificial ecosystem selectionProc. Natl. Acad. Sci 97:9110–9114
- 8.Artificial selection of microbial ecosystems for 3-chloroaniline biodegradationEnviron. Microbiol 2:564–571
- 9.Selection on soil microbiomes reveals reproducible impacts on plant functionThe ISME journal 9:980–989
- 10.Levels and limits in artificial selection of communitiesEcol. Lett 18:1040–1048
- 11.Cultivated sub-populations of soil microbiomes retain early flowering plant traitMicrob. Ecol 73:394–403
- 12.Understanding microbial community dynamics to improve optimal microbiome selectionMicrobiome 7:1–14
- 13.Host-mediated microbiome engineering (hmme) of drought tolerance in the wheat rhizospherePLoS One 14:e0225933
- 14.Effect of the reproduction method in an artificial selection experiment at the community levelFront. Ecol. Evol 7:416
- 15.Artificially selecting bacterial communities using propagule strategiesEvolution 74:2392–2403
- 16.Effects of microbial evolution dominate those of experimental host-mediated indirect selectionPeerJ 8:e9350
- 17.Artificial selection on microbiomes to breed microbiomes that confer salt tolerance to plantsmSystems 6:e01125–21
- 18.Community diversity determines the evolution of synthetic bacterial communities under artificial selectionEvolution 76:1883–1895
- 19.Artificial selection of stable rhizosphere microbiota leads to heritable plant phenotype changesEcol. Lett 25:189–201
- 20.Modelling artificial ecosystem selection: A preliminary investigationAdvances in Artificial Life Berlin, Heidelberg: Springer Berlin Heidelberg :659–666
- 21.The role of non-genetic change in the heritability, variation, and response to selection of artificially selected ecosystemsArtificial Life IX: Proceedings of the Ninth International Conference on the Simulation and Synthesis of Artificial Life MIT Press :352
- 22.Artificial selection of simulated microbial ecosystemsProc. Natl. Acad. Sci 104:8918
- 23.Simulations reveal challenges to artificial community selection and possible strategies for successPLoS Biol 17:e3000295
- 24.Eco-evolutionary dynamics of nested darwinian populations and the emergence of community-level heredityElife 9:e53433
- 25.Steering ecological-evolutionary dynamics to improve artificial selection of microbial communitiesNat. Commun 12:6799
- 26.Engineering complex communities by directed evolutionNat. Ecol. & Evol 5:1011
- 27.Artificial selection of communities drives the emergence of structured interactionsJ. Theor. Biol 571:111557
- 28.Artificial selection methods from evolutionary computing show promise for directed evolution of microbesElife 11:e79665
- 29.Partner-assisted artificial selection of a secondary function for efficient bioremediationIscience 26:107632
- 30.Novel artificial selection method improves function of simulated microbial communitiesbioRxiv [preprint] https://doi.org/10.1101/2023.01.08.523165
- 31.A quantitative genetics framework for understanding the selection response of microbial communitiesbioRxiv [preprint] https://doi.org/10.1101/2023.10.24.563725
- 32.Artificial selection of microbial communities: what have we learnt and how can we improve?Curr. Opin. Microbiol 77:102400
- 33.Major evolutionary transitions in individuality between humans and aiPhilos. Transactions Royal Soc. B 378:20210408
- 34.The function inverfc thetaAust. J. Phys 13:13
- 35.mpltern 0.3.0: ternary plots as projections of Matplotlib
- 36.Approximate accelerated stochastic simulation of chemically reacting systemsThe J. Chem. Phys 115:1716–1733
- 37.Efficient step size selection for the tau-leaping simulation methodThe J. Chem. Phys 124:044109
- 38.Statistics of extremesColumbia university press
- 1.Approximate accelerated stochastic simulation of chemically reacting systemsThe J. Chem. Phys 115:1716–1733
- 2.Efficient step size selection for the tau-leaping simulation methodThe J. Chem. Phys 124:044109
- 3.Progress of a half century in the study of the luria–delbrück distributionMath. Biosci 162:1–32
- 4.Statistics of extremesColumbia university press
- 5.The function inverfc thetaAust. J. Phys 13:13
- 6.Simulations reveal challenges to artificial community selection and possible strategies for successPLoS Biol 17:e3000295
- 7.mpltern 0.3.0: ternary plots as projections of Matplotlib
Article and author information
Author information
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
Copyright
© 2024, Lee et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 177
- downloads
- 3
- citation
- 1
Views, downloads and citations are aggregated across all versions of this paper published by eLife.