Figures and data

Schematic for artificial selection on collectives.
Each selection cycle begins with a total of g Newborn collectives, each with N0 total cells of slow-growing S population (light gray dots) and fast-growing F population (dark gray dots). During maturation (over time τ), S and F cells divide at rates rS and rS + ω (ω > 0), respectively, and S mutates to F at rate µ. In the selection stage, the Adult collective with F frequency f closest to the target composition

Nomenclature

Initial and target compositions determine the success of artificial selection on collectives.
(a-c) F frequency of the selected Adult collective (f *) over cycles at different target

Intra-collective selection and inter-collective selection jointly set the boundaries for selection success.
a The change in F frequency over one cycle. When

Expanding the success region for artificial collective selection.
a Reducing the population size in Newborn N0 expands the region of success. In gold area, the probability that

In higher dimensions, the success of artificial selection requires the entire evolutionary trajectory remaining in the accessible region.
a During collective maturation, a slow-growing population (S) (with growth rate rS ; light gray) can mutate to a fast-growing population (F) (with growth rate rS + ω; medium gray), which can mutate further into a faster-growing population (FF) (with growth rate rS + 2ω; dark gray). Here, the rates of both mutational steps are µ, and ω > 0. b Evolutionary trajectories from various initial compositions (open circles) to various targets. Intra-collective evolution favors FF over F (vertical blue arrow) over S (horizontal blue arrow). The accessible regions are marked gold (see Sec. I in Supplementary Information). We obtain final compositions starting from several initial compositions while aiming for different target compositions in i, ii, and iii. The evolutionary trajectories are shown in dots with color gradients from the initial to final time. (i) A target composition with a high FF frequency is always achievable. (ii) A target composition with intermediate FF frequency is never achievable. (iii) A target composition with low FF frequency is achievable only if starting from an appropriate initial composition such that the entire trajectory never meanders away from the accessible region. The figures are drawn using mpltern package [36]. c The accessible region in the three-population problem is interpreted as an extension of the two-population problem. First the accessible region between FF and S+F is given, and then the S+F region is stretched into S and F.

Comparison between the calculated Gaussian distribution (“Gauss”, with the mean and variances computed from Eqs. (10),(11),(20),(25)) and simulations using tau-leaping (“tau”). The simulations run 3000 times. The initial number of cells are (S0, F0) = (990, 10), (500, 500), and (10, 990) for each column. The parameters r = 0.5, ω = 0.03, µ = 0.0001, and τ = 4.8 are used.

Congruence between consecutive sampling (MHG for multivariate hypergeometric distribution) and independent binomial (BN) sampling. The initial numbers of cells are S = 8000 and F = 2000 for the left panel, and S = 20 and F = 5 for the right panel. 10000 samples are drawn for each distribution. Here, a parent collective is divided into 10 collectives.

a Trajectories of F frequency for 10 collectives (g = 10) over time. The collective whose frequency is closest to the target value is selected in every cycle (black lines). The gray lines denote the other collectives. For parameters, we used S growth rate rS = 0.5, F growth advantage ω = 0.03, mutation rate µ = 0.0001, maturation time τ ≈ 4.8, and N0 = 1000. b Comparison between frequency trajectories with selection (the chosen one Adult producing all offspring; black) and without selection (each Adult producing one offspring; blue) clearly shows the effect of artificial selection. The black line indicates F frequency of the selected collective

Color map of the absolute error

The probability density functions of the selected Adult’s F frequency

a Mean (Eq. (33)) and variance (Eq. (34)) of f values of Adult collectives with respect to the Newborn frequency f0. b Scaling relation of F frequency variance (Eq. (41)) with Newborn collective size N0. The initial F frequency is 0.5. The parameters are rS = 0.5, ω = 0.03, µ = 0.0001, and τ ≈ 4.8. c Relation of F frequency variance (Eq. (41)) with maturation time τ. Other parameters are the same as b.

Median (orange) and mean (violet) have similar distributions. We performed 1000 simulations to get probability density. a g = 10, b g = 100, and c g = 1000. Initial F frequency is

Simulation with zero mutation rate. Color map of the absolute error

Change of success region in varying selective advantage ω. rS = 0.5, ω = 0.03, µ = 0.0001, N0 = 1000, g = 10 and τ ≈ 4.8.

Artificial selection also works for deleterious mutation. a Conditional probability density functions of

Selecting Top-5% outperforms selecting Top 1. We bred 100 collectives and chose either top-1 collective (solid line) or top-5 collectives (dashed line) with f closest to the target value

a The flow of composition change in F and FF frequencies at each composition (f, h). Top corner indicates that FF cells fix in the collective. Right bottom corner means collectives with only F cells while collectives contain S cells only at left bottom corner. Arrow length means the speed of change. b The accessible regions are marked by the gold area. If the signs of changes in both F frequency and FF frequency after inter-collective selection are opposite to those during maturation, then the given composition is accessible. Otherwise, the composition is not accessible and will change after cycles. Dashed lines are the boundary of accessible region by projecting the collective into a two-population problem (FF vs. S+F). The figures are drawn using mpltern package [7].