A unified framework for measuring selection on cellular lineages and traits

  1. Shunpei Yamauchi
  2. Takashi Nozoe
  3. Reiko Okura
  4. Edo Kussell
  5. Yuichi Wakamoto  Is a corresponding author
  1. Department of Basic Science, Graduate School of Arts and Sciences, The University of Tokyo, Japan
  2. Department of Biology, New York University, United States
  3. Department of Physics, New York University, United States
  4. Research Center for Complex Systems Biology, The University of Tokyo, Japan
  5. Universal Biology Institute, The University of Tokyo, Japan
8 figures, 4 tables and 1 additional file

Figures

Representative single-cell lineage trees.

(A) Time-lapse images of a growing microcolony of Escherichia coli expressing green fluorescent protein (GFP) from plasmids. Scale bars, 5 μm. (B) Cellular lineage trees for the microcolony in A. Bifurcations in the trees represent cell divisions. σ denotes cell lineage labels. D(σ) shows the number of cell divisions in each lineage. Pcl(σ) and Prs(σ) are chronological and retrospective probabilities defined in the main text.

Conceptual illustration of the relationships between fitness landscapes, trait distributions, and selection strength.

(A) Non-uniform fitness landscape and broad trait distribution. The gray distribution represents a chronological distribution of lineage trait x; the cyan distribution represents a retrospective distribution of lineage trait x; and the black dashed line represents a fitness landscape. Due to the non-uniform fitness landscape and the broad chronological distribution, there is trait fitness heterogeneity for selection to act on. The retrospective distribution therefore shifts significantly from the chronological distribution, and the selection strength is large (S[X]>0). (B) Non-uniform fitness landscape and narrow trait distribution. Due to the lack of trait heterogeneity, there is little fitness heterogeneity for selection to act on. The retrospective distribution shifts only slightly from the chronological distribution, and the selection strength is small (S[X]0). (C) Uniform fitness landscape. When the fitness landscape is constant (=τΛ) across the lineage trait state x, there can be no trait fitness heterogeneity regardless of whether the trait distribution itself is narrow or broad. The selection strength is therefore zero (S[X]=0).

Relationships among chronological distributions’ shape and selection strength measures.

(A) Graphical representation of various fitness and selection strength measures by KX(ξ)-plot. Blue curve represents KX(ξ). The area between the horizontal axis and KX(ξ) in the interval 0ξ1 outlined in red corresponds to population growth τΛ. The gray and hatched regions correspond to h(X)cl and SKL(1)[X], respectively. The area between KX(ξ) and y=h(X)rs corresponds to SKL(2)[X]. (B-D) Representative shapes of KX(ξ) depending on κ3(X). Assuming that the contributions from fourth or higher-order cumulants are negligible, KX(ξ) becomes convex downward when κ3(X)>0 (B); a straight line when κ3(X)=0 (C); and convex upward when κ3(X)<0 (D). (E-G) Relationships between third-order fitness cumulant and skewness of chronological distribution Qcl(h).

Figure 4 with 1 supplement
Population growth rate response to cell removal perturbation.

(A) Scheme of random cell removal. Here, we consider the situation where cells were removed probabilistically after each cell division. Red crosses represent cell removal positions in the tree. The lineages after cell removal points disappear from the tree. Consequently, the number of cells at the end time point decreases. (B) Generation time distributions used in the simulation. We assumed that cellular generation time follows gamma distributions in the simulation. We set the shape parameter to either 1 (g1(x)), 2 (g2(x)), or 5 (g3(x)). (C-E) Population growth rate changes by cell removal perturbation. Gray points show the relative changes in population growth rate ΔΛ/Λ:=(Λ(ϵ)Λ(0))/Λ(0). Cell removal probability was set to 1-2-ϵ in each condition of perturbation strength ϵ. Broken red lines represent the theoretical prediction ΔΛ/Λ-h~(D)rsτΛϵ=-(1+SKL(2)[D]τΛ)ϵ. The lines of ΔΛ/Λ=-ϵ (blue) and -h~(D)clτΛϵ (green) are shown for reference. The generation time distributions used in the simulation are g1(x) for C, g2(x) for D, and g3(x) for E.

Figure 4—figure supplement 1
Response of population growth rate to cell removal perturbation with positive mother-daughter correlations of generation time.

We conducted the simulations of cell population growth with positive mother-daughter correlations (Gray points). We considered the generation time distribution that follows a gamma distribution with shape parameter 2 (g2(x) in Figure 4B). We generated correlated generation time of individual cells based on the mother cell’s value with the algorithm in Materials and methods. We randomly removed the individual cells from the population with the probability 1-2-ϵ after each cell division. The broken red lines represent the theoretical prediction of the changes ΔΛ/Λ=-h~(D)rsτΛϵ. The lines of ΔΛ/Λ=-ϵ (blue) and ΔΛ/Λ=-h~(D)clτΛϵ (green) were also shown for reference. (A) The result with no mother-daughter correlation of generation time (rmd=0). (B) rmd=0.2. (C) rmd=0.4. (D) rmd=0.6.

Figure 5 with 2 supplements
Application of cell lineage statistics to experimental data.

(A) Contributions of the cumulants of a fitness landscape to population growth. W1(D) and W2(D) were evaluated for the experimental cell lineage data from E. coli (red), M. smegmatis (blue), S. pombe (green), and L1210 mouse leukemia cells (yellow). The E. coli rpoS-mcherry data were newly obtained in this study (see Materials and methods). The other data were taken from literature: E. coli f3nw and f3ptn001 from Nozoe et al., 2017; M. smegmatis from Wakamoto et al., 2013; S. pombe from Nakaoka and Wakamoto, 2017; and L1210 from Seita et al., 2021. Circles and triangles represent W1(D) and W2(D), respectively. Error bars represent the two standard deviation ranges estimated by resampling the cellular lineages (see Materials and methods). (B) Relationship between SKL(1)[D]/τΛ and SKL(2)[D]/τΛ. Colors correspond to the cellular species as in A. The S. pombe data were further categorized into two groups: Circles for the EMM conditions; and triangles for the YE conditions. (C-E) Representative chronological distributions of division count, Qcl(D). (F-H) Graphical representation of KD(ξ). F for S. pombe EMM30; G for L1210 RMPI-1640; and H for E. coli f3nw-sm.

Figure 5—figure supplement 1
Chronological distributions of division count, Qcl(D).

(A-G) E. coli. (H). M. smegmatis. (I-O) S. pombe. P. L1210 mouse leukemia cells.

Figure 5—figure supplement 2
Graphical representation of KD(ξ).

A-G. E. coli. H. M. smegmatis. I-O. S. pombe. P. L1210 mouse leukemia cells.

Strong selection in the E.coli population regrowing from a late stationary phase.

(A) Time-lapse images. Cellular regrowing dynamics from early and late stationary phases were observed by time-lapse microscopy. Cells were enclosed in the microchambers etched on coverslips. The top three images show representative images of the cells from an early stationary phase. The bottom three images show the cells from a late stationary phase. Scale bar, 5 μm. (B) Population dynamics. The number of cells at each time point normalized by the initial cell number (N(t)/N(0)) was plotted against time t was 307 for the early stationary sample and 295 for the late stationary sample. (C, D) Representative cellular lineage trees in the regrowing kineics from the early stationary phase (C) and the late stationary phase (D). The trees correspond to the time-lapse images in A. (E) Cumulative contributions of the cumulants of the fitness landscape h(D) to population growth. Error bars represent the two standard deviation ranges estimated by resampling the cellular lineages (see Materials and methods). (F) Chronological distributions of division count Qcl(D). (G) Relationships between SKL(1)[D] and SKL(2)[D]. The blue and black points show the results for the early stationary phase sample and the late stationary phase sample, respectively. Gray points represent the results for the cell populations growing at approximately constant growth rates shown in Figure 5B.

Figure 7 with 1 supplement
Fitness landscapes and selection strength for RpoS expression levels.

(A) Fitness landscapes for the time-averaged concentration (mean fluorescent intensity) for RpoS-mCherry. The time-averaged mean fluorescent intensity of RpoS-mCherry was adoped as a lineage trait X and changes in fitness were plotted against the trait values x. Fitness landscapes were scaled by the lineage length (observation duration) τ. Error bars represent the two standard deviation ranges estimated by resampling the cellular lineages. (B) Fitness landscapes for the time-averaged concentration for GFP. The time-averaged mean fluorescent intensity of GFP was adoped as a lineage trait X and changes in fitness were plotted against the trait values x. (C) Relative selection strength for the time-averaged concentrations of RpoS-mCherry (red) and GFP (green). (D, E) Chronological distributions Qcl(x) for the time-averaged concentrations of RpoS-mCherry (D) and GFP (E). (F-H) Cumulative contributions of fitness cumulants to population growth, Wn(X), assuming that X is either time-averaged concentration of RpoS-mCherry (red) or time-averaged concentration of GFP (green). Error bars represent the two standard deviation ranges estimated by resampling the cellular lineages. Panel F is for the Glucose-37°C condition; Panel G for the Glucose-30°C condition; and Panel H for the Glycerol-37°C condition.

Figure 7—figure supplement 1
The relative selection strength values for time-averaged RpoS-mCherry and GFP fluorescence intensity compared with the randomized data.

The blue histograms show the distributions of the relative selection strength values calculated from the lineage data in which the correspondences between division counts and trait values are shuffled. The vertical lines indicate the relative selection strength values for the experimental data. A. GFP under the Glucose-37°C condition. B. RpoS-mCherry under the Glucose-37°C condition. C. GFP under the Glucose-30°C condition. D. RpoS-mCherry under the Glucose-30°C condition. E. GFP under the Glycerol-37°C condition. F. RpoS-mCherry under the Glycerol-37°C condition.

Appendix 1—figure 1
Analytical calculations of KD(ξ) and related relations given specific form of division count distributions.

(A) Chronological division count distributions. ϕ=0.3 and ϕ=ϕ0(=0.5857) are binomial, ϕ=1 is Poisson and ϕ=1.6 is negative binomial. D¯=20(1-ϕ0) is fixed. (B) Cumulative contributions of fitness cumulants. Parameter values are given in panel A legend. (C) The relation between two selection strength measures. Binomial (blue curve), Poisson (closed black circle) and negative binomial (orange curve) are indicated on the single curve plotted using Equations 37 and 38 within the range of 0<ϕ<2. The point where SKL(1)[D]=SKL(2)[D] (ϕ=ϕ0) is indicated by the open black circle. The grey dotted line corresponds to SKL(1)[D]=SKL(2)[D]. (D) Convexity of KD(ξ). Y-axis shows a rescaling of KD(ξ) according to (KD(ξ)-KD(0))/(KD(1)-KD(0)). The same values of ϕ as in A are used; ϕ=0.3 (blue), ϕ=ϕ0 (orange), ϕ=1 (green) and ϕ=1.6 (red). The grey dotted line indicates the case that KD(ξ) is a linear function of ξ

Tables

Table 1
Relationships between KX(ξ) and quantities in cellular lineage statistics.
Quantities in lineage statisticsSymbolCorrespondence to KX(ξ)
FitnessPopulation growthτΛKX(1)
Chronological mean fitnessh(X)clKX(0)
Retrospective mean fitnessh(X)rsKX(1)
Chronological fitness varianceVar[h(X)]clKX′′(0)
Retrospective fitness varianceVar[h(X)]rsKX′′(1)
Selection strengthJeffreys divergence bet. Qcl(X)
and Qrs(X)
SJF[X]KX(1)-KX(0)
KL divergence of Qcl(X) from 
Qrs(X)
SKL(1)[X]KX(1)-KX(0)
KL divergence of Qrs(X) from 
Qcl(X)
SKL(2)[X]KX(1)-KX(1)
Growth rate gain/lossGrowth rate gainSKL(1)[D]/τΛ1-KD(0)/KD(1)
Additional growth rate loss upon perturbation-SKL(2)[D]/τΛ1-KD(1)/KD(1)
Table 2
Summary of cellular species, culture conditions, and observation setup used in the experiments in Figure 5.
SpeciesLabelStrainMediumTemperature (°C)Device
E. colirpoS-mcherry glucose_30°CMG1655 F3 rpoS-mcherry /pUA66-PrpsL-gfpM9 minimal medium +0.2%(w/v) glucose +1/2 MEM amino acids solution (Sigma)30Microchamber arrayThis study
E. colirpoS-mcherry glucose_37°CMG1655 F3 rpoS-mcherry /pUA66-PrpsL-gfpM9 minimal medium +0.2%(w/v) glucose +1/2 MEM amino acids solution (Sigma)37Microchamber arrayThis study
E. colirpoS-mcherry glycerol_37°CMG1655 F3 rpoS-mcherry /pUA66-PrpsL-gfpM9 minimal medium +0.1%(v/v) glycerol +1/2 MEM amino acids solution (Sigma)37Microchamber arrayThis study
E. colif3nw -smF3NWM9 minimal medium +0.2%(w/v) glucose +1/2 MEM amino acids solution (Sigma)+0.1mM Isopropyl β-D-1 thiogalactopyranoside (IPTG)37Agar padNozoe et al., 2017
E. colif3nw +smF3NWM9 minimal medium +0.2%(w/v) glucose +1/2 MEM amino acids solution (Sigma)+0.1 mM Isopropylβ-D-1 thiogalactopyranoside (IPTG)+100
μg/ml streptomycin
37Agar padNozoe et al., 2017
E. colif3ptn001 -smF3/pTN001M9 minimal medium +0.2%(w/v) glucose +1/2 MEM amino acids solution (Sigma)+0.1 mM Isopropylβ-D-1 thiogalactopyranoside (IPTG)37Agar padNozoe et al., 2017
E. colif3ptn001+smF3/pTN001M9 minimal medium +0.2%(w/v) glucose +1/2 MEM amino acids solution (Sigma)+0.1 mM Isopropylβ-D-1 thiogalactopyranoside (IPTG)+200
μg/ml streptomycin
37Agar padNozoe et al., 2017
M. smegmatismc2155 7H9mc2155Middlebrook 7H9 medium +0.5% albumin +0.2% glucose +0.085% NaCl+0.5% glycerol +0.05% Tween-8037Membrane coverWakamoto et al., 2013
S. pombeEMM28HN0025Edinburgh minimal medium +2% (w/v) glucose28Mother machineNakaoka and Wakamoto, 2017
S. pombeEMM30HN0025Edinburgh minimal medium +2%(w/v) glucose30Mother machineNakaoka and Wakamoto, 2017
S. pombeEMM32HN0025Edinburgh minimal medium +2%(w/v) glucose32Mother machineNakaoka and Wakamoto, 2017
S. pombeEMM34HN0025Edinburgh minimal medium +2%(w/v) glucose34Mother machineNakaoka and Wakamoto, 2017
S. pombeYE28HN0025Yeast extract medium +3%(w/v) glucose28Mother machineNakaoka and Wakamoto, 2017
S. pombeYE30HN0025Yeast extract medium +3%(w/v) glucose30Mother machineNakaoka and Wakamoto, 2017
S. pombeYE34HN0025Yeast extract medium +3%(w/v) glucose34Mother machineNakaoka and Wakamoto, 2017
L1210 mouse leukemia cellL1210 RPMI-1640L1210 (ATCC CCL-219)RPMI-1640 medium (Wako)+10% fetal bovine serum (Biosera) under 5% CO2 atmosphere37Mother machineSeita et al., 2021
Table 3
Summary of the data used in the analysis in Figure 5.

tstart and tend are the start and end times for the analysis time window τ.

Specieslabelτ(hr)tstart(hr)tend(hr)N0Nτ
E. colirpoS-mcherry glucose_37°C50.955.951633989
E. colirpoS-mcherry glucose_30°C80.958.951976173
E. colirpoS-mcherry glycerol_37°C6.50.957.452535825
E. colif3nw-sm5053054343
E. colif3nw +sm5052913164
E. colif3ptn001-sm5059849229
E. colif3ptn001+sm5059777429
M. smegmatismc2155 7H9101.7511.7539311
S. pombeEMM2816701671148-
S. pombeEMM301310131963-
S. pombeEMM32123.50123.5883-
S. pombeEMM3415201521078-
S. pombeYE2810801081177-
S. pombeYE3090090866-
S. pombeYE3478078863-
L1210 mouse leukemia cellL1210 RPMI-164060060474-
Key resources table
Reagent type (species) or resourceDesignationSource or referenceIdentifiersAdditional information
Recombinant DNA reagentpUA66-PrpsL-gfp (plasmid)Zaslaver et al., 2006
Strain, strain background (Escherichia coli)MG1655 F3Wakamoto labMG1655ΔfliCΔfimAΔflu
Strain, strain background (Escherichia coli)MG1655 F3 rpoS-
mcherry /pUA66-P rplS-gfp
Wakamoto labMG1655ΔfliCΔfimAΔflu rpoS-mcherry /pUA66-PrplS-gfp

Additional files

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Shunpei Yamauchi
  2. Takashi Nozoe
  3. Reiko Okura
  4. Edo Kussell
  5. Yuichi Wakamoto
(2022)
A unified framework for measuring selection on cellular lineages and traits
eLife 11:e72299.
https://doi.org/10.7554/eLife.72299