Overflow metabolism originates from growth optimization and cell heterogeneity

eLife Assessment

This valuable study tackles the well-established overflow metabolism issue by applying a coarse-grained metabolic flux model to predict how individual cells execute various energy strategies, such as respiration versus fermentation. The model's population average is convincing enough to align with experimental observations on overflow metabolism. The potential source of metabolic or proteomic heterogeneity of individual cells remains an open question to be studied. How individual cells adjust their metabolic strategies also requires future study of the underlying mechanisms. Overall, this work provides a key aspect on cell-to-cell variability on general metabolic response.

https://doi.org/10.7554/eLife.94586.4.sa0

Significance of the findings:

Valuable: Findings that have theoretical or practical implications for a subfield

Landmark
Fundamental
Important
Valuable
Useful

Strength of evidence:

Convincing: Appropriate and validated methodology in line with current state-of-the-art

Exceptional
Compelling
Convincing
Solid
Incomplete
Inadequate

During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife Assessments

Abstract
Introduction
Results
Discussion
Appendix 1
Appendix 2
Appendix 3
Appendix 4
Appendix 5
Appendix 6
Appendix 7
Appendix 8
Appendix 9
Appendix 10
Appendix 11
Data availability
References
Article and author information
Metrics

Abstract

A classic problem in metabolism is that fast-proliferating cells use seemingly wasteful fermentation for energy biogenesis in the presence of sufficient oxygen. This counterintuitive phenomenon, known as overflow metabolism or the Warburg effect, is universal across various organisms. Despite extensive research, its origin and function remain unclear. Here, we show that overflow metabolism can be understood through growth optimization combined with cell heterogeneity. A model of optimal protein allocation, coupled with heterogeneity in enzyme catalytic rates among cells, quantitatively explains why and how cells choose between respiration and fermentation under different nutrient conditions. Our model quantitatively illustrates the growth rate dependence of fermentation flux and enzyme allocation under various perturbations and is fully validated by experimental results in Escherichia coli. Our work provides a quantitative explanation for the Crabtree effect in yeast and the Warburg effect in cancer cells and can be broadly used to address heterogeneity-related challenges in metabolism.

Introduction

A prominent feature of cancer metabolism is that tumor cells excrete large quantities of fermentation products in the presence of sufficient oxygen (Hanahan and Weinberg, 2011; Liberti and Locasale, 2016; Vander Heiden et al., 2009). This process, discovered by Otto Warburg in the 1920s (Warburg, 1924) and known as the Warburg effect, aerobic glycolysis, or overflow metabolism (Basan et al., 2015; Hanahan and Weinberg, 2011; Liberti and Locasale, 2016; Vander Heiden et al., 2009), is ubiquitous among fast-proliferating cells across a broad spectrum of organisms (Vander Heiden et al., 2009), ranging from bacteria (Basan et al., 2015; Holms, 1996; Meyer et al., 1984; Nanchen et al., 2006; Neidhardt et al., 1990) and fungi (De Deken, 1966) to mammalian cells (Hanahan and Weinberg, 2011; Liberti and Locasale, 2016; Vander Heiden et al., 2009). For microbes, cells use standard respiration when nutrients are scarce, while they use the counterintuitive aerobic glycolysis when nutrients are adequate, just analogous to normal tissues and cancer cells, respectively (Vander Heiden et al., 2009).

Over the past century, and particularly through extensive studies in the last two decades (Liberti and Locasale, 2016), various rationales for overflow metabolism have been proposed (Basan et al., 2015; Chen and Nielsen, 2019; Majewski and Domach, 1990; Molenaar et al., 2009; Niebel et al., 2019; Peebo et al., 2015; Pfeiffer et al., 2001; Shlomi et al., 2011; Vander Heiden et al., 2009; Varma and Palsson, 1994; Vazquez et al., 2010; Vazquez and Oltvai, 2016; Zhuang et al., 2011). Notably, Basan et al., 2015 provided a systematic characterization of this process, including various types of experimental perturbations. Currently, prevalent explanations (Basan et al., 2015; Chen and Nielsen, 2019) hold that overflow metabolism arises from the proteome efficiency in fermentation being consistently higher than that in respiration. However, recent studies have shown that the measured proteome efficiency in respiration is actually higher than in fermentation for many yeast and cancer cells (Shen et al., 2024), even though these cells generate fermentation products through aerobic glycolysis. This finding (Shen et al., 2024) apparently contradicts the prevalent explanations (Basan et al., 2015; Chen and Nielsen, 2019). Furthermore, most explanations (Basan et al., 2015; Chen and Nielsen, 2019; Majewski and Domach, 1990; Shlomi et al., 2011; Varma and Palsson, 1994; Vazquez et al., 2010; Vazquez and Oltvai, 2016; Zhuang et al., 2011) rely on the assumption that cells optimize their growth rate for a given rate of carbon influx (i.e. nutrient uptake rate) under each nutrient condition (or its equivalents). However, this assumption remains open to further scrutiny, as the given factors in a nutrient condition are the identities and concentrations of the carbon sources (Molenaar et al., 2009; Scott et al., 2010; Wang et al., 2019), rather than the carbon influx. Therefore, the origin and function of overflow metabolism still remain unclear (DeBerardinis and Chandel, 2020; Hanahan and Weinberg, 2011; Liberti and Locasale, 2016; Vander Heiden et al., 2009).

Why have microbes and cancer cells evolved to possess the seemingly wasteful strategy of aerobic glycolysis? For unicellular organisms, there is evolutionary pressure (Vander Heiden et al., 2009) to optimize cellular resources for rapid growth (Dekel and Alon, 2005; Edwards et al., 2001; Hui et al., 2015; Li et al., 2018; Scott et al., 2010; Towbin et al., 2017; Wang et al., 2019; You et al., 2013). In particular, it has been shown that cells allocate protein resources for optimal growth (Hui et al., 2015; Scott et al., 2010; Wang et al., 2019; You et al., 2013), and the most efficient protein allocation corresponds to elementary flux mode (Müller et al., 2014; Wortel et al., 2014). For cancer cells, disrupting the growth control system and evading immune destruction from the host are prominent hallmarks of their survival (Hanahan and Weinberg, 2011), which in certain ways mimic the evolutionary pressure on microbes to optimize cell growth rate. In this study, we apply the optimal growth principle of microbes, which also roughly holds for cancer cells, to a heterogeneous framework to address the puzzle of aerobic glycolysis. We use Escherichia coli as a typical example to show that overflow metabolism can be understood from optimal protein allocation combined with heterogeneity in enzyme catalytic rates. The optimal growth strategy varies between respiration and fermentation depending on the concentration and type of the nutrient, and the combination with cell heterogeneity results in the standard picture (Basan et al., 2015; Holms, 1996; Meyer et al., 1984; Nanchen et al., 2006; van Hoek et al., 1998) of overflow metabolism. Our model quantitatively illustrates the growth rate dependence of fermentation/respiration flux and enzyme allocation under various types of perturbations in E. coli. Furthermore, it provides a quantitative explanation for the data on the Crabtree effect in yeast and the Warburg effect in cancer cells (Bartman et al., 2023; Shen et al., 2024).

Results

Coarse-grained model

Based on the topology of the metabolic network (Neidhardt et al., 1990; Nelson and Cox, 2008) (see Figure 1A), we classify the carbon sources that enter from the upper part of glycolysis into Group A (Wang et al., 2019) and the precursors of biomass components (such as amino acids) into five pools. Specifically, each pool is designated according to its entry point (see Figure 1A and Appendix 2.2 for details): a1 (entry point: G6P/F6P), a2 (entry point: GA3P/3PG/PEP), b (entry point: pyruvate/acetyl-CoA), c (entry point: $α$ -ketoglutarate), and d (entry point: oxaloacetate). Pools a1 and a2 are also combined as Pool a due to the joint synthesis of precursors. Then, the metabolic network for Group A carbon source utilization (see Figure 1A) can be coarse-grained into a model shown in Figure 1B (see Appendix 3.1 for details), where node $A$ represents an arbitrary carbon source of Group A. Evidently, Figure 1B is topologically identical to Figure 1A. Each coarse-grained arrow in Figure 1B represents a stoichiometric flux $J_{i}$ , which delivers carbon flux and may be accompanied by energy consumption or biogenesis (e.g. $J_{1}$ , $J_{a 1}$ ; see Figure 1A–B and Appendix 1—figure 1A).

Figure 1

Download asset Open asset

Model and results of overflow metabolism in *E. coli*.

(A) The central metabolic network of carbon source utilization. The Group A carbon sources (Wang et al., 2019) are labeled with green squares. (B) Coarse-grained model for Group A carbon source utilization. (C) Model predictions (see Equations S47 and S160) and experimental results (Basan et al., 2015; Holms, 1996) of overflow metabolism, covering the data for all the Group A carbon sources shown in (A). (D) Growth rate dependence of respiration and fermentation fluxes (see Equations S47 and S160). (E) The proteome efficiencies for energy biogenesis in the respiration and fermentation pathways vary with growth rate as functions of the nutrient quality of a Group A carbon source (see Equations S31 and S36). See Appendices 9 and 11 for model parameter settings and experimental data sources (Basan et al., 2015; Holms, 1996; Hui et al., 2015) for Figures 1—4 of *E. coli*.

In fact, the stoichiometric flux $J_{i}$ scales with the cell population. For comparison with experiments, we define the normalized flux $J_{i}^{(N)} \equiv J_{i} \cdot m_{0} / M_{c a r b o n}$ , which can be regarded as the flux per unit of biomass (the superscript ‘(N)’ stands for normalized; see Appendix 2.3–2.4 for details). Here, $M_{c a r b o n}$ represents the carbon mass of the cell population, and $m_{0}$ is the weighted average carbon mass of metabolite molecules at the entry of precursor pools (see Equation S17). Then, the cell growth rate $λ$ can be represented by the total outflow of the normalized fluxes: $λ = \sum_{i}^{a 1, a 2, b, c, d} J_{i}^{(N)}$ (see Appendix 2.4). The normalized fluxes of respiration and fermentation are $J_{r}^{(N)} \equiv J_{4}^{(N)}$ and $J_{f}^{(N)} \equiv J_{6}^{(N)}$ , respectively (see Figure 1A and B). In practice, each $J_{i}^{(N)}$ is characterized by two quantities: the proteomic mass fraction $ϕ_{i}$ of the enzyme dedicated to carrying the flux and the substrate quality $κ_{i}$ , such that $J_{i}^{(N)} = ϕ_{i} \cdot κ_{i}$ . We take the Michaelis-Menten form for the enzyme kinetics (Nelson and Cox, 2008), and then $κ_{i} \equiv k_{i} \cdot \frac{[S_{i}]}{[S_{i}] + K_{i}}$ (see Equation S12 and Appendix 2.4 for details), where $[S_{i}]$ is the concentration of substrate $S_{i}$ , and $K_{i}$ is the Michaelis constant. For each intermediate node and reaction along the pathway (e.g. node $M_{1}$ in $J_{a 1}$ ), the substrate quality $κ_{i}$ can be approximated as a constant (see Appendix 2.5): $κ_{i} \equiv k_{i} \cdot \frac{[S_{i}]}{[S_{i}] + K_{i}} \approx k_{i}$ , where $[S_{i}] \geq K_{i}$ generally holds true in bacteria (Bennett et al., 2009; Park et al., 2016). However, the nutrient quality $κ_{A}$ is a variable that depends on the nutrient type and concentration of a Group A carbon source (see Equation S27).

Generally, there are three independent fates for a Group A carbon source in the metabolic network (Chen and Nielsen, 2019): fermentation, respiration, and biomass generation (see Appendix 1—figure 1C-E). Each draws a distinct proteome fraction of $ϕ_{f}$ , $ϕ_{r}$ and $ϕ_{B M}$ , with no overlap between them (see Appendix 3.1). The net effect of the first two fates is energy biogenesis, while the last one generates precursors for biomass, accompanied by energy biogenesis. By applying the proteomic constraint that there is a maximum fraction, $ϕ_{max}$ , for proteome allocation: $ϕ_{max} \approx 0.48$ (Scott et al., 2010), we have:

ϕ_{f} + ϕ_{r} + ϕ_{B M} = ϕ_{max} .

In fact, Equation 1 is equivalent to $ϕ_{R} + ϕ_{A} + \sum_{j = 1}^{6} ϕ_{j} + \sum_{i}^{a 1, a 2, b, c, d} ϕ_{i} = ϕ_{max}$ (see Appendix 3.1 for derivation details), where $ϕ_{R}$ and $ϕ_{A}$ represent the proteomic mass fractions of the active ribosome-affiliated proteins and the cargo proteins responsible for the uptake of the Group A carbon source, respectively. During cell proliferation, ribosomes serve as the factories for protein synthesis and are primarily composed of proteins (Neidhardt et al., 1990; Nelson and Cox, 2008), while other biomass components, such as RNA, are optimally produced (Kostinski and Reuveni, 2020) in accordance with the growth rate determined by protein synthesis. Thus, the cell growth rate is proportional to $ϕ_{R} : λ = ϕ_{R} \cdot κ_{t}$ , where $κ_{t}$ is a parameter set by the translation rate (Scott et al., 2010) (see Appendix 2.1 for details), which can be approximated as a constant within the growth rate range of interest (Dai et al., 2017).

For balanced cell growth in bacteria, the energy demand $J_{E}$ , expressed as the stoichiometric energy flux in ATP, is generally proportional to the biomass production rate (Ebenhöh et al., 2024), since the proportion of maintenance energy is roughly negligible (Locasale and Cantley, 2010) (see Appendix 10 for the cases of yeast and tumor cells). Thus, the normalized flux of energy demand in ATP, denoted as $J_{E}^{(N)}$ , representing the energy demand per unit of biomass, is proportional to the growth rate $λ$ (see Appendix 3.1 for details):

J_{E}^{(N)} = η_{E} \cdot λ,

where $η_{E}$ is an energy coefficient (see Equations S25 and S26 for details). By converting all energy currencies (such as NADH, FADH2, etc.) into ATP, the normalized energy fluxes for respiration and fermentation are given by $J_{r}^{(E)} = β_{r}^{(A)} \cdot J_{r}^{(N)} / 2$ and $J_{f}^{(E)} = β_{f}^{(A)} \cdot J_{f}^{(N)} / 2$ , where $β_{r}^{(A)}$ and $β_{f}^{(A)}$ are the stoichiometric coefficients of ATP production per glucose in each pathway (see Appendix 1—figure 1C-E and Appendix 3.1 for details). The denominator coefficient of ‘2’ is derived from the stoichiometry of the coarse-grained reaction $M_{1} \to 2 M_{2}$ (see Figure 1A and B). Applying the criteria of flux balance (i.e. mass conservation; see Appendix 2.3) at each intermediate node ( $M_{i}$ , $i$ = 1, …, 5) and precursor pool (Pool $i$ , $i =$ a1, a2, b, c, d), along with the constraints of proteome allocation (see Equation 1) and energy demand (see Equation 2), we obtain the relations between normalized energy fluxes and growth rate for a given nutrient condition with a fixed $κ_{A}$ (see Appendix 3.1 for details):

{\begin{cases} J_{r}^{(E)} + J_{f}^{(E)} = φ \cdot λ, \\ \frac{J_{r}^{(E)}}{ε_{r}} + \frac{J_{f}^{(E)}}{ε_{f}} = ϕ_{max} - ψ \cdot λ, \end{cases}

where $φ$ is a constant coefficient primarily determined by the coefficient $η_{E}$ (see Equation S33), and $φ \cdot λ$ represents the normalized flux of energy demand, excluding energy biogenesis from the biomass synthesis pathway. The coefficients $ψ$ , $ε_{r}$ , and $ε_{f}$ are functions of $κ_{A}$ , such that their values are highly dependent on nutrient conditions. $ψ^{- 1}$ denotes the proteome efficiency for biomass generation in the biomass synthesis pathway (see Equation S32), defined as $ψ^{- 1} \equiv λ / ϕ_{BM}$ (see Appendix 3.1). $ε_{r}$ and $ε_{f}$ represent the proteome efficiencies for energy biogenesis in the respiration and fermentation pathways, respectively, defined as the normalized energy fluxes expressed in ATP generated per proteomic mass fraction, with $ε_{r} \equiv J_{r}^{(E)} / ϕ_{r}$ and $ε_{f} \equiv J_{f}^{(E)} / ϕ_{f}$ . Hence,

{\begin{cases} ε_{r} = \frac{β_{r}^{(A)}}{1 / κ_{A} + 1 / κ_{r}^{(A)}}, \\ ε_{f} = \frac{β_{f}^{(A)}}{1 / κ_{A} + 1 / κ_{f}^{(A)}}, \end{cases}

where both $κ_{r}^{(A)}$ and $κ_{f}^{(A)}$ are composite parameters that can be approximated as constants, with $1 / κ_{r}^{(A)} \equiv 1 / κ_{1} + 2 / κ_{2} + 2 / κ_{3} + 2 / κ_{4}$ and $1 / κ_{f}^{(A)} \equiv 1 / κ_{1} + 2 / κ_{2} + 2 / κ_{6}$ (see Appendices 2.5 and 3.1 for details).

Origin of overflow metabolism

The standard picture of overflow metabolism (Basan et al., 2015; Holms, 1996; Meyer et al., 1984; Nanchen et al., 2006; van Hoek et al., 1998) is exemplified by the experimental data (Basan et al., 2015) presented in Figure 1C, where the fermentation flux exhibits a threshold-analog dependence on the growth rate $λ$ . It is well established that respiration is significantly more efficient than fermentation in terms of energy biogenesis per unit of carbon (i.e. $β_{r}^{(A)} > β_{f}^{(A)}$ ) (Nelson and Cox, 2008; Vander Heiden et al., 2009). Then, why do cells bother to use the seemingly wasteful fermentation pathway? We proceed to address this issue by applying optimal protein allocation (Scott et al., 2010; Wang et al., 2019) within the framework of optimal growth.

For cell proliferation in a given nutrient condition (i.e. with a fixed $κ_{A}$ ), the values of $ε_{r}$ , $ε_{f}$ , and $ψ$ are determined (see Equations 4 and S32). However, the growth rate $λ$ can be influenced by protein allocation between respiration and fermentation, specifically $ϕ_{r}$ and $ϕ_{f}$ , according to the governing equation (Equation 3). If $ε_{r} > ε_{f}$ , that is, if the proteome efficiency in respiration is higher than that in fermentation, then $λ = \frac{ϕ_{max} - J_{f}^{(E)} (1 / ε_{f} - 1 / ε_{r})}{ψ + φ / ε_{r}} \leq \frac{ϕ_{max}}{ψ + φ / ε_{r}}$ . The optimal growth strategy is $ϕ_{f} = J_{f}^{(E)} = 0$ , meaning that the cell exclusively uses respiration. Conversely, if $ε_{f} > ε_{r}$ , then $ϕ_{r} = J_{r}^{(E)} = 0$ is optimal, and the cell solely uses fermentation. In either case, the choice between respiration and fermentation for growth optimization is determined by comparing their proteome efficiencies.

In practice, both proteome efficiencies $ε_{r}$ and $ε_{f}$ are functions of nutrient quality $κ_{A}$ , which can be significantly influenced by the nutrient type and concentration of the carbon source (see Equations 4 and S27). Therefore, the optimal growth strategy may vary depending on the nutrient conditions. In nutrient-poor conditions where $κ_{A} ≪ κ_{r}^{(A)}$ and $κ_{A} ≪ κ_{f}^{(A)}$ , the proteome efficiencies can be approximated by $ε_{r} \approx β_{r}^{(A)} \cdot κ_{A}$ and $ε_{f} \approx β_{f}^{(A)} \cdot κ_{A}$ (see Equation 4), and hence $ε_{r} (κ_{A}) > ε_{f} (κ_{A})$ (since $β_{r}^{(A)} > β_{f}^{(A)}$ ), meaning that the proteome efficiency of respiration is higher than that of fermentation under these conditions. In contrast, in rich media, using parameters for $κ_{i}$ derived from in vivo/in vitro experimental data for E. coli (see Appendix 1—table 1, Appendix 1—table 2 and Appendix 7.1–7.2), we obtain $ε_{r} (κ_{g l u c o s e}^{(S T)}) < ε_{f} (κ_{g l u c o s e}^{(S T)})$ with Equation 4 (see also Equations S39-S40), where $κ_{g l u c o s e}^{(S T)}$ represents the substrate quality of glucose at saturated concentration (abbreviated as ‘ST’ in the superscript). This indicates that the proteome efficiency in fermentation is higher than that in respiration for bacteria in rich media. Indeed, recent studies have validated that the measured proteome efficiency in fermentation is higher than in respiration for E. coli in lactose at saturated concentration (Basan et al., 2015), i.e., $ε_{r} (κ_{l a c t o s e}^{(S T)}) < ε_{f} (κ_{l a c t o s e}^{(S T)})$ . In Figure 1E, we present the growth rate dependence of proteome efficiencies $ε_{r}$ and $ε_{f}$ in a three-dimensional (3D) format using the collected data shown in Appendix 1—table 1, where $ε_{r}$ , $ε_{f}$ and the growth rate $λ$ all vary as functions of nutrient quality $κ_{A}$ . Furthermore, the ratio $Δ$ (defined as $Δ (κ_{A}) \equiv ε_{f} (κ_{A}) / ε_{r} (κ_{A})$ ) is a monotonically increasing function of $κ_{A}$ , and there exists a critical value of $κ_{A}$ (denoted as $κ_{A}^{(C)}$ ; see Appendix 3.2 for details) satisfying $Δ (κ_{A}^{(C)}) = 1$ . Below $κ_{A}^{(C)}$ , where the nutrient is poorer and the cell grows slowly, the proteome efficiency of fermentation is lower than that of respiration (i.e. $ε_{f} < ε_{r}$ ), hence respiration is the optimal choice (with $λ = ϕ_{max} \cdot {(ψ + φ / ε_{r})}^{- 1}$ ). Above $κ_{A}^{(C)}$ , where the nutrient is richer and the cell grows faster, fermentation is more efficient than respiration in terms of proteome efficiency (i.e. $ε_{f} > ε_{r}$ ) and becomes the optimal growth strategy (with $λ = ϕ_{max} \cdot {(ψ + φ / ε_{f})}^{- 1}$ ). This analysis qualitatively explains the phenomenon of aerobic glycolysis.

For a quantitative understanding of overflow metabolism, let us first consider the homogeneous case, where all cells share identical biochemical parameters. For optimal protein allocation, the relation between fermentation flux and growth rate under nutrient variation (with significantly varying $κ_{A}$ ) is given by $J_{f}^{(E)} = φ \cdot λ \cdot θ (λ - λ_{C})$ , where ‘ $θ$ ’ represents the Heaviside step function, and $λ_{C}$ denotes the critical growth rate corresponding to the nutrient condition with nutrient quality $κ_{A}^{(C)}$ (i.e. $λ_{C} \equiv λ (κ_{A}^{(C)})$ ). Similarly, the growth rate dependence of respiration flux is $J_{r}^{(E)} = φ \cdot λ \cdot [1 - θ (λ - λ_{C})]$ . These digital response outcomes are consistent with the numerical simulation findings of Molenaar et al., 2009. However, they are clearly incompatible with the threshold-analog response observed in the standard picture of overflow metabolism (Basan et al., 2015; Holms, 1996; Meyer et al., 1984; Nanchen et al., 2006; van Hoek et al., 1998).

To address this issue, we take into account cell heterogeneity, which is ubiquitous in both microbes (Ackermann, 2015; Bagamery et al., 2020; Balaban et al., 2004; Nikolic et al., 2013; Solopova et al., 2014; Wallden et al., 2016; Yaginuma et al., 2014; Zhang et al., 2018) and tumor cells (Duraj et al., 2021; Shibao et al., 2018; Hanahan and Weinberg, 2011; Hensley et al., 2016). In the context of the Warburg effect or overflow metabolism, experimental studies have reported significant metabolic heterogeneity in the choice between respiration and fermentation within a cell population (Bagamery et al., 2020; Duraj et al., 2021; Shibao et al., 2018; Hensley et al., 2016; Nikolic et al., 2013). Motivated by the observation that the turnover number ( $k_{c a t}$ value) of a catalytic enzyme varies considerably between in vitro and in vivo measurements (Davidi et al., 2016; García-Contreras et al., 2012), we note that the concentrations of potassium and phosphate, which vary from cell to cell, have a significant impact on the $k_{c a t}$ values of metabolic enzymes (García-Contreras et al., 2012). Therefore, within a cell population, there is a distribution of $k_{c a t}$ values for a catalytic enzyme, commonly referred to as extrinsic noise (Elowitz et al., 2002). For simplicity, we assume that the $k_{c a t}$ values for each enzyme follow a Gaussian distribution. Consequently, the proteome efficiencies $ε_{r}$ and $ε_{f}$ , which are crucial for determining the choice between respiration and fermentation, also follow Gaussian distributions (see Appendix 8 for details). This variability leads to diverse distributions of single-cell growth rates across different carbon sources (see Equations S155-S157 and S163-S165), which has been fully verified by recent experiments using isogenic E. coli at single-cell resolution (Wallden et al., 2016; see Appendix 1—figure 2B). Accordingly, the critical growth rate $λ_{C}$ is expected to follow a Gaussian distribution $N (μ_{λ_{C}}, σ_{λ_{C}}^{2})$ within a cell population (see Appendix 8 for details), where $μ_{λ_{C}}$ is approximated by the deterministic result of $λ_{C}$ (Equation S43). Assuming the coefficient of variation (CV) of $λ_{C}$ is $σ_{λ_{C}} / μ_{C} = 12 %$ , or equivalently that the CV for the catalytic rate of each metabolic enzyme is 25%, we derive the growth rate dependence of fermentation and respiration fluxes (see Appendix 3.3 for details):

{\begin{cases} J_{f}^{(N)} (λ) = \frac{φ \cdot λ}{β_{f}^{(A)}} \cdot [erf (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}}) + 1], \\ J_{r}^{(N)} (λ) = \frac{φ \cdot λ}{β_{r}^{(A)}} \cdot [1 - erf (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}})], \end{cases}

where ‘erf’ represents the error function. The fermentation flux exhibits a threshold-analog relation with the growth rate (the red curves in Figures 1C–D—3B, D and F), while the respiration flux (the blue curve in Figure 1D) decreases as the fermentation flux increases. In Figure 1C–D, we observe that the model results (see Equation 5 and Appendix 9 for details; parameters are set based on the experimental data shown in Appendix 1—table 1) quantitatively agree with the experimental data from E. coli (Basan et al., 2015; Holms, 1996). The fermentation flux is represented by the acetate secretion rate $J_{a c t a t e}^{(M)} = 2 J_{f}^{(N)}$ , and the respiration flux is exemplified by the carbon dioxide flux $J_{C O_{2}, r}^{(M)} = 6 J_{r}^{(N)}$ (the superscript ‘(M)’ represents the measurable flux in the unit of mM/OD600/h; see Appendix 9.1 for details). By incorporating cell heterogeneity, our model of optimal protein allocation quantitatively explains overflow metabolism.

Testing the model through perturbations

To further test our model, we systematically investigate its predictions under various types of perturbations and compare them with experimental data from existing studies (Basan et al., 2015; Holms, 1996) (see Appendices 4 and 5.1 for details).

First, we consider the proteomic perturbation caused by overexpression of useless proteins encoded by the lacZ gene (i.e. $ϕ_{Z}$ perturbation) in E. coli. The net effect of the $ϕ_{Z}$ perturbation is that the maximum fraction of the proteome available for resource allocation changes from $ϕ_{max}$ to $ϕ_{max} - ϕ_{Z}$ (Basan et al., 2015), where $ϕ_{Z}$ is the proteomic mass fraction of useless proteins. In a cell population, the critical growth rate $λ_{C} (ϕ_{Z})$ still follows a Gaussian distribution $N (μ_{λ_{C}} (ϕ_{Z}), σ_{λ_{C}} {(ϕ_{Z})}^{2})$ , where the CV of $λ_{C} (ϕ_{Z})$ remains unchanged. Consequently, the growth rate dependence of fermentation flux changes to $J_{f}^{(N)} = \frac{φ \cdot λ}{β_{f}^{(A)}} \cdot [e r f (\frac{λ - μ_{λ_{C}} (ϕ_{Z})}{\sqrt{2} σ_{λ_{C}} (ϕ_{Z})}) + 1]$ (see Appendix 4 for model perturbation results regarding respiration flux), where both the growth rate $λ (κ_{A}, ϕ_{Z})$ and the normalized fermentation flux $J_{f}^{(N)} (κ_{A}, ϕ_{Z})$ are bivariate functions of $κ_{A}$ and $ϕ_{Z}$ (see Equations S49, S56 and S57). For each degree of LacZ expression (with fixed $ϕ_{Z}$ ), similar to wild-type strains, the fermentation flux exhibits a threshold-analog response to growth rate as $κ_{A}$ varies (see Figure 2C), which agrees quantitatively with experimental results (Basan et al., 2015). The shifts in the critical growth rate $λ_{C} (ϕ_{Z})$ are fully captured by $μ_{λ_{C}} (ϕ_{Z}) = μ_{λ_{C}} (0) (1 - ϕ_{Z} / ϕ_{max})$ . In contrast, for nutrient conditions with each fixed $κ_{A}$ , since the growth rate changes with $ϕ_{Z}$ just like $λ_{C} (ϕ_{Z}) : λ (κ_{A}, ϕ_{Z}) = λ (κ_{A}, 0) (1 - ϕ_{Z} / ϕ_{m a x})$ , the fermentation flux is then proportional to the growth rate for the varying levels of LacZ expression: $J_{f}^{(N)} = \frac{φ}{β_{f}^{(A)}} \cdot [e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)}) + 1] \cdot λ$ , where the slope is a monotonically increasing function of the substrate quality $κ_{A}$ . These scaling relations are well validated by the experimental data (Basan et al., 2015) shown in Figure 2B. Finally, in the case where both $κ_{A}$ and $ϕ_{Z}$ are free to vary, the growth rate dependence of fermentation flux presents a threshold-analog response surface in a 3D plot, where $ϕ_{Z}$ appears explicitly as the $y$ -axis (see Figure 2A). Experimental data points (Basan et al., 2015) lie right on this surface, which is highly consistent with the model predictions.

Figure 2

Download asset Open asset

Influence of protein overexpression on overflow metabolism in *E. coli*.

(A) A 3D plot of the relations among fermentation flux, growth rate, and the expression level of useless proteins. In this plot, both the acetate excretion rate and growth rate vary as bivariate functions of the nutrient quality of a Group A carbon source (denoted as $κ_{A}$ ) and the useless protein expression encoded by *lacZ* gene (denoted as $ϕ_{Z}$ perturbation; see Equations S57 and S160). (B) Growth rate dependence of the acetate excretion rate upon $ϕ_{Z}$ perturbation for each fixed nutrient condition (see Equations S58 and S160). (C) Growth rate dependence of the acetate excretion rate as $κ_{A}$ varies (see Equations S58 and S160), with each fixed expression level of LacZ.

Next, we study the influence of energy dissipation, which introduces an energy dissipation coefficient $w$ to Equation 2: $J_{E}^{(N)} = η_{E} \cdot λ + w$ . Similarly, the critical growth rate in this case, $λ_{C} (w)$ , follows a Gaussian distribution $N (μ_{λ_{C}} (w), σ_{λ_{C}} {(w)}^{2})$ in a cell population. The relation between the growth rate and fermentation flux can be characterized by: $J_{f}^{(N)} = \frac{φ \cdot λ + w}{β_{f}^{(A)}} \cdot [e r f (\frac{λ - μ_{λ_{C}} (w)}{\sqrt{2} σ_{λ_{C}} {(w)}^{2}}) + 1]$ (see Appendix 4.2 for details). In Figure 3A–B, we present a comparison between the model results and experimental data (Basan et al., 2015) in 3D and 2D plots, which demonstrate good agreement. A notable characteristic of energy dissipation, as distinguished from $ϕ_{Z}$ perturbation, is that the fermentation flux increases despite a decrease in the growth rate when $κ_{A}$ is fixed.

Figure 3

Download asset Open asset

Influence of energy dissipation, translation inhibition, and carbon source category alteration on overflow metabolism in *E. coli*.

(A) A 3D plot of the relations among fermentation flux, growth rate, and the energy dissipation coefficient (see Equations S70 and S160). (B) Growth rate dependence of the acetate excretion rate as the nutrient quality $κ_{A}$ varies, with each fixed energy dissipation coefficient determined by or fitted from experimental data. (C) A 3D plot of the relations among fermentation flux, growth rate, and the translation efficiency (see Equations 85 and S160). Here, the translation efficiency is adjusted by the dose of chloramphenicol (Cm). (D) Growth rate dependence of the acetate excretion rate as $κ_{A}$ varies, with each fixed dose of Cm. (E) Coarse-grained model for pyruvate utilization. (F) The growth rate dependence of fermentation flux in pyruvate (see Equations 105 and S160) significantly differs from that of the Group A carbon sources (see Equations 47 and S160).

We proceed to analyze the impact of translation inhibition with different sub-lethal doses of chloramphenicol on E. coli. This type of perturbation introduces an inhibition coefficient $ι$ to the translation rate, thus turning $κ_{t}$ into $κ_{t} / ι + 1$ . Still, the critical growth rate $λ_{C} (ι)$ follows a Gaussian distribution $N (μ_{λ_{C}} (ι), σ_{λ_{C}} {(ι)}^{2})$ , and then, the growth rate dependence of fermentation flux is given by: $J_{f}^{(N)} = \frac{φ \cdot λ}{β_{f}^{(A)}} \cdot [e r f (\frac{λ - μ_{λ_{C}} (ι)}{\sqrt{2} σ_{λ_{C}} (ι)}) + 1]$ (see Appendix 4.3 for details). In Appendix 1—figure 2D and E, we observe that the model predictions are generally consistent with the experimental data (Basan et al., 2015). However, a noticeable systematic discrepancy arises when the translation rate is low. Therefore, we consider maintenance energy, which is typically tiny and generally negligible for bacteria over the growth rate range of interest (Basan et al., 2015; Locasale and Cantley, 2010; Neidhardt, 1996). Encouragingly, by assigning a very small value to the maintenance energy coefficient $w_{0}$ (where $w_{0} = 2.5 (h^{- 1})$ ), the model results for the growth rate-fermentation flux relation $J_{f}^{(N)} = \frac{φ \cdot λ + w_{0}}{β_{f}^{(A)}} \cdot [e r f (\frac{λ - μ_{λ_{C}} (ι)}{\sqrt{2} σ_{λ_{C}} (ι)}) + 1]$ quantitatively agree with experiments (Basan et al., 2015) (see Figure 3C–D and Appendix 4.3 for details).

Finally, we consider the alteration of nutrient categories by switching to a non-Group A carbon source: pyruvate, which enters the metabolic network from the endpoint of glycolysis (Neidhardt et al., 1990; Nelson and Cox, 2008). The coarse-grained model for pyruvate utilization is shown in Figure 3E (see also Figure 1A), which shares identical precursor pools with those for Group A carbon sources, yet has several differences in the coarse-grained reactions. The growth rate dependencies of both the proteome efficiencies (see Appendix 1—figure 2H) and energy fluxes (see Figure 3F) are qualitatively similar to those of Group A carbon source utilization, while there are quantitative differences in the coarse-grained parameters (see Appendices 5.1 and 9 for derivation details). Most notably, the critical growth rate $λ_{C}^{(p y)}$ and the ATP production per glucose in the fermentation pathway $β_{f}^{(p y)}$ for pyruvate utilization are noticeably smaller than those for Group A sources (i.e. $λ_{C}$ and $β_{f}^{(A)}$ , respectively). Consequently, the growth rate dependence of fermentation flux in pyruvate should present a distinctly different curve from that of Group A carbon sources (see Equations 5 and S105), which is fully validated by experimental results (Holms, 1996; see Figure 3F).

Enzyme allocation under perturbations

As mentioned above, our coarse-grained model is topologically identical to the central metabolic network (see Figure 1A) and can thus predict enzyme allocation for each gene in glycolysis and the TCA cycle (see Appendix 1—figure 1B and Appendix 1—table 1) under various types of perturbations. In Figure 1B, the intermediate nodes $M_{1}$ , $M_{2}$ , $M_{3}$ , $M_{4}$ , and $M_{5}$ represent G6P, PEP, acetyl-CoA, $α$ -ketoglutarate, and oxaloacetate, respectively. Therefore, $ϕ_{1}$ and $ϕ_{2}$ correspond to enzymes involved in glycolysis (or at the junction of glycolysis and the TCA cycle), while $ϕ_{3}$ and $ϕ_{4}$ correspond to enzymes in the TCA cycle (see Figure 1A–B and Appendix 3.1).

We first consider enzyme allocation under carbon limitation by varying the nutrient type and concentration of a Group A carbon source (i.e. $κ_{A}$ perturbation). This has been extensively studied in more simplified models (Hui et al., 2015; You et al., 2013), where the growth rate dependence of enzyme allocation under $κ_{A}$ perturbation is generally described by a C-line response (Hui et al., 2015; You et al., 2013). Specifically, the genes responsible for digesting carbon compounds exhibit a linear increase in gene expression as the growth rate decreases (Hui et al., 2015; You et al., 2013). However, when it comes to enzymes catalyzing reactions between intermediate nodes, we gathered experimental data from existing studies (Hui et al., 2015) and found that the enzymes in glycolysis exhibit a completely different response pattern compared to those in the TCA cycle (see Appendix 1—figure 3A and B). This discrepancy cannot be explained by the C-line response. To address this issue, we apply the coarse-grained model described above (see Figure 1B) to calculate the growth rate dependence of enzyme allocation for each $ϕ_{i}$ ( $i = 1, 2, 3, 4$ ) using model settings for wild-type strains, with no fitting parameters influencing the shape (see Equations S118-S119 and Appendix 9). In Figure 4A–B and Appendix 1—figure 3C-D, we see that the model predictions overall match with the experimental data (Hui et al., 2015) for representative genes from either glycolysis or the TCA cycle, and maintenance energy (with $w_{0} = 2.5 (h^{- 1})$ ) has a negligible effect on this process. Still, there are minor discrepancies that arise from the basal expression of metabolic genes, which may be attributed to the fact that our model deals with relatively stable growth conditions while microbes need to be prepared for fluctuating environments (Basan et al., 2020; Kussell and Leibler, 2005; Mori et al., 2017).

We proceed to analyze the influence of $ϕ_{Z}$ perturbation and energy dissipation. In both cases, our model predicts a linear response to growth rate reduction for all genes in either glycolysis or the TCA cycle (see Appendix 6.2–6.3 for details). For $ϕ_{Z}$ perturbation, all predicted slopes are positive, and there are no fitting parameters involved (Equations S120-S121). In Figure 4C–D and Appendix 1—figure 3E-J, we show that our model quantitatively illustrates the experimental data (Basan et al., 2015) for representative genes in the central metabolic network, and there is a better agreement with experiments (Basan et al., 2015) by incorporating the maintenance energy (with $w_{0} = 2.5 (h^{- 1})$ as aforementioned). For energy dissipation, however, the predicted slopes of the enzymes corresponding to $ϕ_{4}$ are negative, and there is a constraint that the slope signs of the enzymes corresponding to the same $ϕ_{i}$ ( $i = 1, 2, 3$ ) should be the same. In Appendix 1—figure 3K-N, we see that the model results (Equations S127 and S123) are consistent with experiments (Basan et al., 2015).

Figure 4

Download asset Open asset

Relative protein expression of central metabolic enzymes in *E. coli* under carbon limitation and proteomic perturbation.

(**A, C**) Relative protein expression of representative genes from glycolysis. (**B, D**) Relative protein expression of representative genes from the TCA cycle. (**A, B**) Results of the perturbation through changes in nutrient quality $κ_{A}$ (see Equation S119). (**C, D**) Results of proteomic perturbation via varied levels of expression of the useless protein LacZ (i.e. $ϕ_{Z}$ perturbation; see Equation S121).

Explanation of the Crabtree effect in yeast and the Warburg effect in cancer cells

We proceed to apply our model to explain the Crabtree effect in yeast (Bagamery et al., 2020; De Deken, 1966; Shen et al., 2024) and the Warburg effect in tumors (Bartman et al., 2023; Duraj et al., 2021; Hanahan and Weinberg, 2011; Shen et al., 2024; Vander Heiden et al., 2009) with slight modifications using the optimal growth principle combined with cell heterogeneity (see Appendix 10 and Appendix 1—figure 5). For yeast and tumors, similar to the case of E. coli, the proteome efficiencies $ε_{r}$ and $ε_{f}$ are both increasing functions of nutrient quality $κ_{A}$ (see Equation S170). Under poor nutrient conditions (i.e. $κ_{A}$ is small), the proteome efficiency in respiration is higher than that in fermentation: $ε_{r} > ε_{f}$ (see Equations S174-S175), making respiration the optimal choice for growth optimization (see Equation S171). Conversely, when nutrients are abundant and $ε_{f} > ε_{r}$ , aerobic glycolysis (i.e. fermentation) becomes the optimal growth strategy (see Equation S172). Further combination with cell heterogeneity results in the standard picture of overflow metabolism, which has indeed been observed in yeast (van Hoek et al., 1998). However, it remains challenging to tune the growth rate of cancer cells in vivo.

Recently, Shen et al., 2024 discovered that the proteome efficiency measured at the cell population level in respiration (i.e. $⟨ ε_{r} ⟩$ ; where ‘ $⟨ ⟩$ ’ denotes the population average) is higher than that in fermentation (i.e. $⟨ ε_{f} ⟩$ ) for many yeast and cancer cells, despite the presence of fermentation fluxes through aerobic glycolysis. Evidently, this finding (Shen et al., 2024) contradicts prevalent explanations (Basan et al., 2015; Chen and Nielsen, 2019), which hold that overflow metabolism arises because the proteome efficiency in fermentation is consistently higher than in respiration. Nevertheless, our model may resolve this puzzle due to the incorporation of two important features. First, our model predicts that the proteome efficiency in respiration is larger than that in fermentation when nutrient quality is low (see Equations S174-S175). Second, and crucially, by accounting for cell heterogeneity, our model allows a proportion of cells to have a higher proteome efficiency in fermentation than in respiration, even when the overall proteome efficiency in respiration at the cell population level is greater than that in fermentation (i.e. $⟨ ε_{r} ⟩ > ⟨ ε_{f} ⟩$ ).

To compare our model results quantitatively with experimental data on yeast and tumors (Shen et al., 2024), we define ${Pr}_{f} \equiv \frac{J_{f}^{(E)}}{J_{f}^{(E)} + J_{r}^{(E)}}$ as the fraction of ATP produced through fermentation. To account for cell heterogeneity, we apply Gaussian distributions to enzyme turnover numbers, as described above. This yields the relationship between ${Pr}_{f}$ (i.e. $\frac{J_{f}^{(E)}}{J_{f}^{(E)} + J_{r}^{(E)}}$ ) and $⟨ ε_{r} ⟩$ and $⟨ ε_{f} ⟩$ through derivations (see Equations S180-S190 and Appendix 10 for details):

\frac{J_{f}^{(E)}}{J_{f}^{(E)} + J_{r}^{(E)}} = \frac{1}{2} [erf (\frac{1 - ⟨ ε_{r} ⟩ / ⟨ ε_{f} ⟩}{\sqrt{2} \cdot \sqrt{χ_{ε_{r}}^{2} + χ_{ε_{f}}^{2} \cdot {(⟨ ε_{r} ⟩ / ⟨ ε_{f} ⟩)}^{2}}}) + 1],

where $χ_{ε_{r}}$ and $χ_{ε_{f}}$ represent the CVs of proteome efficiencies $ε_{r}$ and $ε_{f}$ , respectively. Due to the higher levels of cell heterogeneity in yeast (Bagamery et al., 2020) and cancer cells (Duraj et al., 2021; Shibao et al., 2018; Hanahan and Weinberg, 2011; Hensley et al., 2016), the CVs of $ε_{r}$ and $ε_{f}$ (i.e. $χ_{ε_{r}}$ and $χ_{ε_{f}}$ ) in these cells are expected to be significantly higher than those in E. coli, although their precise values are unknown. The values for the variables shown in Equation 6 can be obtained from experiments. Therefore, we plot the theoretical results from Equation 6 using $χ_{ε_{r}}$ and $χ_{ε_{f}}$ values of 0.25, 0.40, and 0.58 to compare with experimental data from yeast and in vivo mouse tumors (Bartman et al., 2023; Shen et al., 2024). As shown in Figure 5A–B, the theoretical results with $χ_{ε_{r}} = χ_{ε_{f}} = 0.58$ align quantitatively with the experimental data (Bartman et al., 2023; Shen et al., 2024) on both logarithmic and linear scales, demonstrating that our model has the potential to quantitatively explain the Crabtree effect in yeast and the Warburg effect in cancer cells.

Figure 5

Download asset Open asset

Model comparison with data on the Crabtree effect in yeast and the Warburg effect in tumors.

(A) A linear scale representation on the $y$ -axis. (B) A log scale representation on the $y$ -axis. In (**A–B**), $⟨ ε_{r} ⟩$ and $⟨ ε_{f} ⟩$ represent the population averages of $ε_{r}$ and $ε_{f}$ , while $χ_{ε_{r}}$ and $χ_{ε_{f}}$ are the coefficients of variation (CVs) of $ε_{r}$ and $ε_{f} \cdot ⟨ ε_{r} ⟩ / ⟨ ε_{f} ⟩$ represents the ratio of proteome efficiency between respiration and fermentation at the population-averaged level, while $J_{f}^{(E)} / (J_{f}^{(E)} + J_{r}^{(E)})$ stands for the fraction of energy flux generated by the fermentation pathway (see Equation 6). The data for yeast in batch culture and chemostat were calculated from experimental data of *S. cerevisiae* and *I. orientalis* (Shen et al., 2024). The data for mouse tumors were calculated from in vivo experimental data of pancreatic ductal adenocarcinoma (PDAC) and leukemic spleen of mice (Bartman et al., 2023; Shen et al., 2024). See Appendix 11 for detailed information on the experimental data sources (Bartman et al., 2023; Shen et al., 2024).

Discussion

The phenomenon of overflow metabolism, or the Warburg effect, has been a long-standing puzzle in cell metabolism. Although many rationales have been proposed over the past century (Basan et al., 2015; Chen and Nielsen, 2019; Majewski and Domach, 1990; Molenaar et al., 2009; Niebel et al., 2019; Peebo et al., 2015; Pfeiffer et al., 2001; Shlomi et al., 2011; Vander Heiden et al., 2009; Varma and Palsson, 1994; Vazquez et al., 2010; Zhuang et al., 2011), contradictions persist (Shen et al., 2024), leaving the origin and function of this phenomenon unclear (DeBerardinis and Chandel, 2020; Hanahan and Weinberg, 2011; Vander Heiden et al., 2009). In this study, we use E. coli as a typical example and demonstrate that overflow metabolism can be understood through optimal protein allocation combined with cell heterogeneity. Under nutrient-poor conditions, the proteome efficiency of respiration is higher than that of fermentation (see Figure 1E), and thus the cell uses respiration to optimize growth. In rich media, however, the proteome efficiency of fermentation increases more rapidly and surpasses that of respiration (see Figure 1E), leading the cell to adopt fermentation as the optimal growth strategy. In further combination with cell heterogeneity in enzyme catalytic rates (Davidi et al., 2016; García-Contreras et al., 2012), our model quantitatively illustrates the threshold-analog response (Basan et al., 2015; Holms, 1996) in overflow metabolism (see Figure 1C). Furthermore, it quantitatively explains the data on the Crabtree effect in yeast and the Warburg effect in cancer cells (Bartman et al., 2023; Shen et al., 2024).

Mechanistically, the optimal growth strategy for the binary choice between respiration and fermentation can be facilitated by the direct sensing and comparison of proteome efficiencies between the two pathways (see Appendix 3.4). A growing body of evidence suggests that the cyclic AMP (cAMP)-cAMP receptor protein (CRP) system plays a crucial role in sensing proteome efficiency and executing the optimal strategy (Basan et al., 2015; Towbin et al., 2017; Valgepea et al., 2010; Wehrens et al., 2023). However, it has also been suggested that the cAMP-CRP system alone is insufficient, and that additional regulators remain to be identified to fully elucidate this mechanism (Basan et al., 2015; Valgepea et al., 2010). Furthermore, since the binary choice between respiration and fermentation is driven by the comparison of proteome efficiencies, the optimal growth principle in our model can be relaxed to the case where efficient protein allocation is required only for enzymes, rather than ribosomes. This allows our model to remain applicable under suboptimal growth conditions (see Appendix 3.4 for details), where recent experimental studies have shown that the inactive portion of ribosomes (i.e. ribosomes not bound to mRNAs) may vary with culturing conditions (Dai et al., 2017; Li et al., 2018) and between individual cells within the same culture (Pavlou et al., 2025), despite an overall trend toward growth optimization.

In existing rationales (Basan et al., 2015; Chen and Nielsen, 2019; Majewski and Domach, 1990; Shlomi et al., 2011; Varma and Palsson, 1994; Vazquez et al., 2010; Vazquez and Oltvai, 2016), the standard picture of overflow metabolism (Basan et al., 2015; Holms, 1996; Meyer et al., 1984; Nanchen et al., 2006; van Hoek et al., 1998) has primarily been illustrated by a threshold-linear response, which largely relies on the assumption that cells optimize their growth rate for a given rate of carbon influx under each nutrient condition (or similar equivalents; see Appendix 7.3). However, in practice, for microbes or tumor cells grown in vitro or in vivo, the given factors are the identity and concentration of the nutrient (Molenaar et al., 2009; Scott et al., 2010; Wang et al., 2019), rather than the rate of carbon influx. Additionally, prevalent explanations (Basan et al., 2015; Chen and Nielsen, 2019) suggest that overflow metabolism originates from the proteome efficiency in fermentation always being higher than that in respiration (see Appendix 7.3 for details). While it has been observed in E. coli that proteome efficiency in fermentation is higher than that in respiration for cells cultured in lactose at saturated concentration (Basan et al., 2015), Shen et al., 2024 reported that for many yeast and cancer cells, the proteome efficiency in fermentation is noticeably lower than that in respiration, despite the presence of aerobic glycolytic fermentation flux. This observation (Shen et al., 2024) evidently contradicts the prevalent explanations (Basan et al., 2015; Chen and Nielsen, 2019). Our model resolves this puzzle by significantly differing from existing rationales in its optimization principle, where we optimize cell growth rate purely through protein allocation without imposing a special constraint on carbon influx (see Appendix 7.3 for details). More importantly, our model incorporates cell heterogeneity, which is crucial for both explaining the threshold-analog response in overflow metabolism and for resolving this puzzle raised by Shen et al., 2024.

In the homogeneous case, the optimal growth strategy for growth rate dependent fermentation flux results in a digital response (see Equation S44), corresponding to an elementary flux mode (Müller et al., 2014; Wortel et al., 2014), which aligns with the numerical study by Molenaar et al., 2009 but is incompatible with the standard picture of overflow metabolism (Basan et al., 2015; Holms, 1996; Meyer et al., 1984; Nanchen et al., 2006; van Hoek et al., 1998). Furthermore, in this case, cells would not generate fermentation flux if the proteome efficiency in fermentation were lower than that in respiration, under the optimal growth framework. By incorporating heterogeneity in enzyme catalytic rates (Davidi et al., 2016; García-Contreras et al., 2012), the critical growth rate (i.e. threshold) shifts from a single value to a Gaussian distribution (see Equation S45 and Appendix 8 for details; see also Appendix 1—figure 4) across a cell population, thereby turning a digital response into the threshold-analog response observed in overflow metabolism (see Figure 1C). Moreover, cell heterogeneity allows a fraction of cells to possess a larger proteome efficiency in fermentation than in respiration despite the overall proteome efficiency in respiration at the cell population level is higher than in fermentation. This mechanism facilitates the fermentation flux in yeast and cancer cells observed by Shen et al., 2024 (see Figure 5A–B).

Our model results, based on cell heterogeneity, are further supported by observed distributions of single-cell growth rates in E. coli (Wallden et al., 2016) (see Appendix 1—figure 2B), as well as by experiments involving various types of perturbations (Basan et al., 2015; Holms, 1996; Hui et al., 2015), both in terms of acetate secretion patterns and gene expression in the central metabolic network (see Figures 2—4, Appendix 1—figures 2D and E and 3). Furthermore, the heterogeneity patterns predicted by our model for fermentation and respiration modes in an isogenic cell population under the same culturing conditions are highly consistent with the non-genetic heterogeneity observed in single-cell experiments with E. coli (Nikolic et al., 2013) and S. cerevisiae (Bagamery et al., 2020), and align with experiments on intra-tumor heterogeneity in glioblastoma (Duraj et al., 2021; Shibao et al., 2018). Finally, our model can be broadly applied to address heterogeneity-related challenges in metabolism on a quantitative basis, including diverse metabolic strategies of cells in various environments (Bagamery et al., 2020; Duraj et al., 2021; Escalante-Chong et al., 2015; Hensley et al., 2016; Liu et al., 2015; Solopova et al., 2014; Wang et al., 2019).

Appendix 1

Appendix 1—table 1

Molecular weight (MW) and in vivo/in vitro k_cat data for E. coli.

No.^*	Reaction	Enzyme	Gene name	EC	MW (kDa)	In vitro k_cat (s^-1)	References	In vivo^†k_cat (s^-1)	Selected k_cat (s^-1)
J₁	Glucose-6P ↔ Fructose-6P	Glucose-6-phosphate isomerase	pgi	EC:5.3.1.9	1.2×10²	2.6×10²	PMID:7004378; DOI:https://doi.org/10.1016/j.ijms.2004.09.017	8.7×10²	8.7×10²
	Fructose-6P → Fructose-1,6P	Phosphofructokin-ase	pfkA^‡	EC:2.7.1.11	1.4×10²	4.4×10²	PMID:6218375; 70226	1.7×10³	1.7×10³
	Fructose-1,6P ↔ Glyceraldehyde 3-phosphate+Dihydroxyacetone phosphate	Fructose-bisphosphate aldolase	fbaA^†	EC:4.1.2.13	7.8×10	1.4×10	PMID:8939754; 15531627	1.6×10²	1.6×10²
	Dihydroxyacetone phosphate ↔ Glyceraldehyde 3-phosphate	Triosephosphate Isomerase	tpiA	EC:5.3.1.1	5.4×10	4.3×10²	PMID:3887397; 6092857	2.7×10²	2.7×10²
	Glyceraldehyde 3-phosphate ↔ 1,3-Bisphosphoglycerate	Glyceraldehyde-3-phosphate dehydrogenase	gapA	EC:1.2.1.12	1.4×10²	9.5×10	PMID:4932978; 2200929	1.5×10²	1.5×10²
	1,3-Bisphosphoglycerate ↔ 3-Phosphoglycerate	Phosphoglycerate kinase	pgk	EC:2.7.2.3	4.4×10	3.5×10²	PMID:367367; 166274	1.9×10²	1.9×10²
	3-Phosphoglycerate ↔ 2-Phosphoglycerate	Phosphoglycerate mutase	gpmA^‡	EC:5.4.2.11	4.9×10	3.3×10²	PMID:10437801	4.5×10²	4.5×10²
	2-Phosphoglycerate ↔ Phosphoenolpyruvate	Enolase	eno	EC:4.2.1.11	9.0×10	2.2×10²	PMID:1094232; 4942326	1.7×10²	1.7×10²
J₂	Phosphoenolpyruvate → Pyruvate	Pyruvate kinase	pykF^‡	EC:2.7.1.40	2.4×10²	5.0×10²	PMID:6759852	1.6×10³	1.6×10³
J₂	Pyruvate → Acetyl-CoA	Pyruvate dehydrogenase	aceE^‡	EC:1.2.4.1	1.0×10²	1.2×10²	PMID:23088422	3.4×10²	3.4×10²
J₃	Oxaloacetate +Acetyl CoA → Citrate	Citrate synthase	gltA	EC:2.3.3.1	9.7×10	2.4×10²	PMID:4900996; 23954305	7.1×10	7.1×10
	Citrate ↔ Isocitrate	Aconitate hydratase	acnB^‡	EC:4.2.1.3	9.4×10	7.0×10	PMID:15963579; 15963579	6.3×10	6.3×10
	Isocitrate→ α-Ketoglutarate	Isocitrate dehydrogenase	icd	EC:1.1.1.42	9.5×10	2.0×10²	PMID:8141; 36923; 2200929	3.3×10	3.3×10
J₄	α-Ketoglutarate → Succinyl-CoA	α-Ketoglutarate dehydrogenase complex E1 component	suc A suc B^‡	EC:1.2.4.2, EC:2.3.1.61	1.9×10²	1.5×10²	PMID:6380583; 4588679	1.3×10²	1.3×10²
	Succinyl-CoA ↔ Succinate	Succinyl-CoA synthetase	suc C suc D	EC:6.2.1.5	1.6×10²	9.1×10	PMID:5338130	1.0×10²	1.0×10²
	Succinate → Fumarate	Succinate dehydrogenase	sdh A sdh B^‡	EC:1.3.5.1	1.0×10²	1.1×10²	PMID:4334990; 16484232	1.1×10²	1.1×10²
	Fumarate ↔ Malate	Fumarase	fumA^‡	EC:4.2.1.2	2.0×10²	1.2×10³	PMID:3282546; 12021453	4.9×10²	4.9×10²
	Malate ↔ Oxaloacetate	Malate dehydrogenase	mdh	EC:1.1.1.37	6.1×10	5.5×10²	doi:https://doi.org/10.1016/0076-6879(69)13029-3	6.6×10	6.6×10
J₅	Phosphoenolpyruvate →Oxaloacetate	Phosphoenolpyru-vate carboxylase	ppc	EC:4.1.1.31	4.0×10²	1.5×10²	PMID:9927652; 4932977	/	1.5×10²
J₆	Acetyl-CoA ↔ Acetyl phosphate	Phosphate acetyltransferase	pta	EC:2.3.1.8	7.7×10	3.0×10	PMID:20236319	3.7×10²	3.7×10²
	Acetyl phosphate↔ Acetate	Acetate kinase	ackA	EC:2.7.2.1	4.3×10	3.6×10³	EcoCyc: EG10027; PMID:24801996	3.3×10²	3.3×10²
	Acetate (intracellular) ↔ Acetate (extracellular)	Acetate transporter	actP	/	2×10	4.7×10²	PMID:31405984 (Estimated)	/	4.7×10²
J₇	Pyruvate → Phosphoenolpyruvate	Pyruvate, water dikinase	ppsA	EC:2.7.9.2	2.5×10²	3.5×10	PMID:4319237	/	3.5×10
J_A	Glucose-6P (extracellular) → Glucose-6P (intracellular)	Glucose-6-phosphate transporter	UhpT	/	5×10	2×10²	PMID:3283129; 2197272; 20018695 (Estimated)	/	2×10²
	Glucose (extracellular) → Glucose-6P	Glucose-specific PTS enzyme	ptsG	EC: 2.7.1.199	5×10	1×10²	PMID:9575173; 20018695; 12146972	/	1×10²
	Lactose (extracellular) → Lactose (intracellular)	Lactose transporter	lacY	/	4.6×10	6×10	PMID:6444453; 20018695	/	6×10
	Lactose →Glucose +Galactose	β-galactosidase	lacZ	EC:3.2.1.23	4.6×10²	6.4×10²	PMID:8008071; 23011886 (Estimated)	/	6.4×10²
*J_py*	Pyruvate (extracellular) → Pyruvate (intracellular)	Pyruvate transporter	btsT CstA	/	8×10	6×10	PMID:20018695; 33260635; EcoCyc: G7942; EG10167 (Estimated)	/	6×10

*

The classification of J_i follows the coarse-grained models shown in Figures 1B and 3E.
†

In vivo k_cat values were obtained using the experimental data shown in Appendix 1—table 2, combined with Equations S134-S135.
‡

See Appendix 1—figure 1B for additional genes that may play a secondary role.

Appendix 1—table 2

Proteome and flux data (Basan et al., 2015) used to calculate the in vivo k_cat of E. coli.

	Culture 1	Culture 2	Culture 3	Culture 4
Growth rate λ (h^–1)*	0.82	0.87	0.97	1.03
J_acetate (mM OD₆₀₀^–1 h^–1)^†	0.39	1.18	2.68	2.84
J_{CO2, r} (mM OD₆₀₀^–1 h^–1) ^†	7.44	6.05	4.30	3.04
Gene name	Proteomic mass fractions obtained using absolute abundance (ϕ_i)
pgi	0.09%	0.09%	0.10%	0.11%
pfkA	0.06%	0.06%	0.06%	0.06%
fbaA	0.32%	0.35%	0.35%	0.39%
tpiA	0.12%	0.15%	0.13%	0.18%
gapA	1.19%	1.29%	1.33%	1.47%
pgk	0.30%	0.31%	0.32%	0.36%
gpmA	0.15%	0.15%	0.15%	0.16%
eno	0.63%	0.70%	0.75%	0.83%
pykF	0.15%	0.15%	0.18%	0.21%
aceE	0.30%	0.32%	0.34%	0.41%
gltA	0.88%	0.80%	0.61%	0.48%
acnB	0.92%	0.84%	0.66%	0.57%
icd	1.55%	1.55%	1.31%	1.39%
suc A suc B	0.71%	0.75%	0.64%	0.55%
suc C suc D	0.88%	0.84%	0.66%	0.52%
sdh A sdh B	0.49%	0.45%	0.42%	0.35%
fumA	0.24%	0.21%	0.17%	0.13%
mdh	0.45%	0.45%	0.41%	0.39%
pta	0.10%	0.10%	0.10%	0.10%
ackA	0.06%	0.07%	0.06%	0.06%

*

For calibration purposes, a factor of 1.03/0.97 was multiplied by the reference data (Basan et al., 2015)^‡.
†

For calibration purposes, a factor of 2.84/3.24 was multiplied by the reference data (Basan et al., 2015)^‡.
‡

Here, (1.03, 2.84) and (0.97, 3.24) are both the data points for (λ h^-1, J_acetate mM OD₆₀₀^-1 h^-1) for E. coli strain NCM3722 cultured with lactose in the same reference (Basan et al., 2015). The former is specified in the source data of the reference’s figure 1 (Basan et al., 2015), while the latter is recorded in the reference’s extended data figure 3a (Basan et al., 2015). With the calibrations above, the data for the $J_{a c e t a t e}^{(M)} - λ$ relation shown here align with the curve depicted in Figure 1C.

Appendix 1—table 3

Illustrations of symbols in this manuscript.

Symbols	Illustrations/Definitions	Model variable/parameter settings for E. coli^*
A (in the figures)	A Group A carbon source joining the metabolic network from the upper part of glycolysis.	NA ^†
*M_i* (in the figures)	A metabolite in the metabolic network that serve as intermediate node.	NA
*J_i* (in the figures)	The stoichiometric flux delivering carbon flux, an extensive variable‡; see Equation S7.	see Equations S7-S8.
*r_i* (in the figures)	The mass fraction of carbon flux drawn from a precursor pool.	r_a1=24%; r_a2=24%; r_b = 28%; r_c = 12%; r_d = 12% (Nelson and Cox, 2008).
λ	Growth rate of the cell population; see Equation S36 for the optimal model solution.	see Equations S4 and S36.
*J_r, J_f*	J_r and J_f are stoichiometric fluxes of respiration and fermentation, extensive variables.	J_r = J₄; J_f = J₆ (see Equation S22)
$m_{0}$	The weighted average carbon mass of metabolite molecules at the entrance of precursor pools.	See Equation S17.
M_carbon	The carbon mass of the cell population, an extensive variable.	NA
M_protein	The protein mass of the cell population; an extensive variable.	NA
$M_{Q}^{(P)}$ , $M_{R}^{(P)}$ , $M_{C}^{(P)}$	The mass of Q-class, R-class, or C-class proteome.	See Equation S2.
$f_{Q}$ , $f_{R}$ , $f_{C}$	The ribosome allocation fraction for protein synthesis of Q-class, R-class, or C-class.	$f_{Q}$ = $ϕ_{Q}$ .
$m_{A A}$	The average molecular weight of amino acids.	A reducible parameter for the results.
$k_{T}$	Translation speed of ribosomes.	$k_{T}$ =20.1 aa/s (Scott et al., 2010).
$ϕ_{Q}$ , $ϕ_{R}$ , $ϕ_{C}$	The mass fraction of Q-class, R-class, or C-class proteome; see Appendix 2.1.	$ϕ_{Q}$ =52% (Scott et al., 2010).
$ϕ_{m a x}$	The maximum proteomic mass fraction of proteome allocation for fermentation, respiration, and biomass generation, with $ϕ_{m a x} \equiv 1 - ϕ_{Q}$ .	$ϕ_{m a x}$ =48% (Scott et al., 2010).
$m_{R}$	The protein mass of a single ribosome.	$m_{R} = 7336 m_{A A}$ (Neidhardt et al., 1990).
V_cell	The cell volume of the cell population (the ‘big cell’); an extensive variable.	NA
$N_{R}$ , $M_{r p}^{(P)}$	The number or the total protein mass of ribosomes in the big cell; extensive variables.	NA
$ς$	The ratio of the mass of R-class proteome to the protein mass of ribosomes: $ς \equiv M_{R}^{(P)} / M_{r p}^{(P)}$ .	$ς$ =1.67 (Scott et al., 2010).
*[E_i], [S_i]*	The concentration of enzyme E_i or substrate S_i; intensive variables.	NA
*a_i, d_i, b_i, c_i*	a_i and d_i are reaction parameters; bi and c_i are stoichiometric coefficients. See Appendix 2.3.	NA
*K_i*	The Michaelis constant, defined as K_i≡(d_i+ $k_{i}^{c a t}$ )/ a_i.	Obtainable from Bennett et al., 2009, yet unused in practice since [Si]>K_i (see Appendix 2.5).
*v_i*	The reaction rate per volume of a biochemical reaction catalyzed by E_i; an intensive variable.	See Equation S6.
$N_{E_{i}}$ , $M_{E_{i}}$	The copy number or the total weight enzyme E_i in the cell population; extensive variables.	$N_{E_{i}} = V_{c e l l} \cdot [E_{i}]$ ; $M_{E_{i}} = N_{E_{i}} \cdot m_{E_{i}} .$
m_carbon	The mass of a carbon atom.	$m_{c a r b o n} = \frac{12}{N_{A v o g a d r o}} g$ , where g represents gram and $N_{A v o g a d r o}$ is the Avogadro constant.
$Φ_{i}$	The enzyme cost of all E_i molecules in the cell population; an extensive variable.	$Φ_{i} \equiv N_{E_{i}} \cdot n_{E_{i}}$ .
$ξ_{i}$	$ξ_{i}$ is defined such that $ξ_{i} = J_{i} / Φ_{i}$ .	$ξ_{i} \equiv \frac{k_{i}^{c a t}}{n_{E_{i}}} \cdot \frac{[S_{i}]}{[S_{i}] + K_{i}}$ .
$J_{i}^{(N)}$	The normalized flux, i.e., flux per unit of biomass; an intensive variable^§	$J_{i}^{(N)} \equiv J_{i} \cdot m_{0} / M_{c a r b o n}$ see Equations S15-S16.
$J_{r}^{(N)}$ , $J_{f}^{(N)}$	$J_{r}^{(N)}$ and $J_{f}^{(N)}$ are the normalized fluxes of respiration and fermentation, intensive variables.	$J_{r}^{(N)}$ = $J_{4}^{(N)}$ ; $J_{f}^{(N)}$ = $J_{6}^{(N)}$ .
$N_{{E P}_{i}}^{c a r b o n}$	The number of carbon atoms in the entry point metabolite molecule of Precursor Pool i.	$N_{{E P}_{a 1}}^{c a r b o n} = 6$ ; $N_{{E P}_{a 2}}^{c a r b o n} = 3$ ; $N_{{E P}_{b}}^{c a r b o n} = 3$ ; $N_{{E P}_{c}}^{c a r b o n} = 5$ ; $N_{{E P}_{d}}^{c a r b o n} = 4$ (Nelson and Cox, 2008).
k_cat, $k_{i}^{c a t}$	The turnover number of a catalytic enzyme.	See Appendix 1—table 1.
$m_{E_{i}}$ , $n_{E_{i}}$	$m_{E_{i}}$ and $n_{E_{i}}$ are the molecular weight and the enzyme cost of an E_i molecule, respectively.	See Appendix 1—table 1.
$r_{c a r b o n}$ , $r_{p r o t e i n}$	$r_{c a r b o n}$ and $r_{p r o t e i n}$ are the mass fractions of all carbon and protein within a cell, respectively.	$r_{p r o t e i n} = 0.55$ ; $r_{c a r b o n} = 0.48$ (Neidhardt et al., 1990).
$κ_{i}$	Substrate quality of a metabolite in a biochemical reaction; see Equation S12 and S20.	Calculated from the values of $k_{i}^{c a t}$ , $m_{E_{i}}$ , $m_{0}$ , $r_{p r o t e i n}$ , $r_{c a r b o n}$ .
$κ_{A}$	Substrate quality of a Group A carbon source; see Equation S27.	Calculated from the values of $k_{A}^{c a t}$ , $m_{E_{A}}$ , $m_{0}$ , $r_{p r o t e i n}$ , $r_{c a r b o n}$ , $K_{A}$ and the concentration of the Group A carbon source [A].
$ϕ_{i}$	The proteomic mass fraction of enzyme E_i: $ϕ_{i} \equiv M_{E_{i}} / M_{p r o t e i n}$ ; an intensive variable.	See Equation S9.
$η_{i}$	The fraction of stoichiometric flux drawn from a precursor pool; see Equations S13, S14 and S18.	η_a1=15%; η_a2=30%; η_b=35%; η_c=9%; η_d=11% (calculated from the values of r_i and $N_{{E P}_{i}}^{c a r b o n}$ ).
$ϕ_{r}$ , $ϕ_{f}$ , $ϕ_{B M}$	$ϕ_{f}$ , $ϕ_{f}$ , $ϕ_{B M}$ are the proteomic mass fraction of enzymes dedicated to fermentation, respiration, and biomass generation, respectively.	NA
$κ_{t}$	A parameter determined by the translation rate, defined as $κ_{t} \equiv k_{T} \cdot m_{A A} / (ς \cdot m_{R})$ .	$κ_{t} = 1 / 610$ (s^–1) (calculated from the values of $k_{T}$ , $ς$ and $m_{R}$ ).
J_BM	The carbon flux of biomass production; an extensive variable.	See Equation S10.
J_E	The energy demand for cell growth, expressed as the stoichiometric energy flux in ATP; an extensive variable.	See Equation S25.
$J_{E}^{(N)}$	The normalized flux of energy demand in ATP; an intensive variable.	$J_{E}^{(N)} \equiv J_{E} \cdot m_{0} / M_{c a r b o n} .$
r_E, $η_{E}$	r_E and $η_{E}$ are energy coefficients. r_E is the slope of J_E versus J_BM; $η_{E} = r_{E} \cdot [\sum_{i} r_{i} / N_{{E P}_{i}}^{c a r b o n}]$ .	See Appendix 9.2.
$β_{i}$	The stoichiometric coefficient of ATPs in biochemical reactions shown in Figures 1B and 3E (for E. coli) or Appendix 1—figure 5E and F (for yeast and mammalian cells).	$β_{1} = 4$ , $β_{2} = 3$ , $β_{3} = 2$ , $β_{4} = 6$ , $β_{6} = 1$ , $β_{a 1} = 4$ , $β_{7} = 1$ , $β_{8} = 2$ , $β_{9} = 6$ (E. coli); $β_{1} = 5$ , $β_{2} = 1$ , $β_{3} = 5$ , $β_{4} = 7.5$ , $β_{6} = 2.5$ , $β_{a 1} = 5$ (eukaryotic cells) (Neidhardt et al., 1990; Sauer et al., 2004).
$β_{r}^{(A)}$ , $β_{f}^{(A)}$	$β_{r}^{(A)}$ and $β_{f}^{(A)}$ are the stoichiometric coefficients of ATP production per glucose in respiration and fermentation, respectively.	$β_{r}^{(A)} = 26$ , $β_{f}^{(A)} = 12$ (E. coli); $β_{r}^{(A)} = 32$ , $β_{f}^{(A)} = 2$ (eukaryotic cells) (Neidhardt et al., 1990).
$J_{r}^{(E)}$ , $J_{f}^{(E)}$	$J_{r}^{(E)}$ and $J_{f}^{(E)}$ are normalized energy fluxes of respiration and fermentation, intensive variables.	$J_{r}^{(E)} \equiv \frac{β_{r}^{(A)}}{2} \cdot J_{r}^{(N)}$ ; $J_{f}^{(E)} \equiv \frac{β_{f}^{(A)}}{2} \cdot J_{f}^{(N)} .$
$ε_{r}$ , $ε_{f}$ $ε_{r}^{(d t)}$ , $ε_{f}^{(d t)}$	$ε_{r}$ (or $ε_{r}^{(d t)}$ ) and $ε_{f}$ (or $ε_{f}^{(d t)}$ ) are the proteome efficiencies for energy biogenesis in the respiration and fermentation pathways: $ε_{r} \equiv J_{r}^{(E)} / ϕ_{r}$ and $ε_{f} \equiv J_{f}^{(E)} / ϕ_{f}$ .	Calculated from the values of $κ_{A}$ , $κ_{i}$ , $β_{r}^{(A)}$ and $β_{f}^{(A)}$ with Equations S132 and S161.
$φ$	$φ$ is an energy demand coefficient, defined in Equation S33 and mainly determined by $η_{E}$ .	Calculated from the values of $η_{E}$ , $β_{i}$ , $η_{i}$ with Equation S33. See Appendix 9.2.
$ψ$ , $ψ_{d t}$	$ψ^{- 1}$ (or $ψ_{d t}^{- 1}$ ) is the proteome efficiency for biomass generation in the biomass pathway, with $ψ^{- 1} \equiv / λ / ϕ_{B M}$ .	Calculated from the values of $η_{i}$ , $κ_{A}$ , $κ_{i}$ , $Ω$ , $κ_{t}$ with Equations S133 and S162.
$κ_{r}^{(A)}$ , $κ_{f}^{(A)}$	$κ_{r}^{(A)}$ and $κ_{f}^{(A)}$ are parameters defined as $κ_{r}^{(A)} \equiv {[\frac{1}{κ_{1}} + \frac{2}{κ_{2}} + \frac{2}{κ_{3}} + \frac{2}{κ_{4}}]}^{- 1}$ and $κ_{f}^{(A)} \equiv {[\frac{1}{κ_{1}} + \frac{2}{κ_{2}} + \frac{2}{κ_{6}}]}^{- 1} .$	Calculated from the values of $κ_{i}$ .
$Ω$	$Ω$ is a composite parameter defined as $Ω \equiv 1 / κ_{t} + \sum_{i}^{a 1, a 2, b, c, d} η_{i} / κ_{i} .$	See Appendix 9.2.
$κ_{g l u c o s e}^{(S T)}$ , $κ_{l a c t o s e}^{(S T)}$	The substrate quality of glucose or lactose at saturated concentration.	Calculated using Equation S27 and the approximation used in Equation S20.
$Δ$	Δ is a function of $κ_{A}$ defined as $Δ (κ_{A}) \equiv ε_{f} (κ_{A}) / ε_{r} (κ_{A})$ .	$Δ \equiv ε_{f} / ε_{r}$ .
$κ_{A}^{(C)}$	The critical value of $κ_{A}$ which satisfy $Δ (κ_{A}) = 1$ and thus $ε_{f} (κ_{A}) = ε_{r} (κ_{A})$ ; See Equation S42 (for E. coli) and S176 (for yeast and mammalian cells).	Calculated from the values of $β_{i}$ and $κ_{i}$ with Equation S42.
$λ_{C}$	The critical growth rate at the transition point: $λ_{C} \equiv λ (κ_{A}^{(C)})$ ; See Equations S43 and S177.	Calculated from the values of $ϕ_{m a x}$ , $φ$ , $β_{i}$ , $κ_{i}$ , $κ_{A}^{(C)}$ , $Ω$ , $η_{i}$ with Equations S43, S32 and S162.
$θ$	The Heaviside step function.	NA
$J_{a c e t a t e}$ , $J_{{C O}_{2}, r}$	$J_{a c e t a t e}$ and $J_{{C O}_{2}, r}$ are the stoichiometric fluxes of acetate from the fermentation pathway and CO2 from the respiration pathway; extensive variables.	$J_{a c e t a t e} = J_{f}$ ; $J_{{C O}_{2}, r} = 3 \cdot J_{r}$ . See Appendix 9.1 and Equations S158.
$J_{a c e t a t e}^{(M)}$ , $J_{{C O}_{2}, r}^{(M)}$	$J_{a c e t a t e}^{(M)}$ and $J_{{C O}_{2}, r}^{(M)}$ are the fluxes of $J_{a c e t a t e}$ and $J_{C O_{2}, r}$ (per biomass) in the unit of mM/OD600/h, which are measurable in experiment. Intensive variables.	$J_{a c e t a t e}^{(M)} \approx 2 \cdot J_{f}^{(N)}$ ; $J_{{C O}_{2}, r}^{(M)} \approx 6 \cdot J_{r}^{(N)}$ . See Appendix 9.1 and Equation S160.
$κ_{A}^{m a x}$	The maximum value of $κ_{A}$ available across different Group A carbon sources.	Approximated by the max $κ_{A}$ across Group A carbon sources, calculated with Equation S27 and the approximation used in Equation S20.
$λ_{m a x}$	The population cell growth rate for the maximum value of $κ_{A}$ : $λ_{m a x} = λ (κ_{A}^{m a x})$ .	Calculated from the maximum of Equation S36 with the values of $β_{i}$ , $κ_{i}$ , $κ_{A}^{m a x}$ , $φ$ , $Ω$ , $κ_{t}$ , and Equations S32, S132, Equation S161 and S162.
$N (μ, σ^{2})$	A Gaussian distribution with a mean of μ and a standard deviation of $σ$ .	The probability density function is $f (x) = \frac{1}{σ \sqrt{2 π}} e^{- \frac{1}{2} {(\frac{x - μ}{σ})}^{2}}$ .
$μ_{λ_{C}}$ , $σ_{λ_{C}}$	$μ_{λ_{C}}$ and $σ_{λ_{C}}$ are the mean and standard deviation of $λ_{C}$ , respectively.	$μ_{λ_{C}}$ is approximated by the deterministic value of $λ_{C}$ ; see Appendix 3.3 for $σ_{λ_{C}}$ settings. See Appendix 9.2 for the values.
erf	The error function in mathematics.	$e r f (x) = \frac{2}{\sqrt{π}} \int_{0}^{x} e x p (- t^{2}) d t$
$ϕ_{Z}$	The proteomic mass fraction of useless proteins encoded by the LacZ gene.	See Appendix 4.1.
$w$	An energy dissipation coefficient.	See Appendix 4.2.
$w_{0}$	The maintenance energy coefficient.	$w_{0}$ =0 or 2.5 (h^–1) as specified in Figures 3–4, Appendix 1—figures 2 and 3. See Appendices 4.3 and 9.2.
$ι$	$ι$ is the inhibition coefficient such that ${(1 + ι)}^{- 1}$ represents the translation efficiency.	See Appendices 4.3 and 9.2
$ι_{w_{0} = 0}^{(2 μ m C m)}$ , $ι_{w_{0} = 0}^{(4 μ m C m)}$ , $ι_{w_{0} = 0}^{(8 μ m C m)}$ , $ι_{w_{0} = 2.5}^{(2 μ m C m)}$ , $ι_{w_{0} = 2.5}^{(4 μ m C m)}$ , $ι_{w_{0} = 2.5}^{(8 μ m C m)}$	The values for $ι$ in the cases with 2 μm , 4 μm, or 8 μm of chloramphenicol and the maintenance energy coefficient $w_{0}$ chosen as 0 or 2.5 (h^–1).	$ι_{w_{0} = 0}^{(2 μ m C m)} = 1.15$ ; $ι_{w_{0} = 0}^{(4 μ m C m)} = 2.33$ ; $ι_{w_{0} = 0}^{(8 μ m C m)} = 6.25$ ; $ι_{w_{0} = 2.5}^{(2 μ m C m)} = 1.05$ ; $ι_{w_{0} = 2.5}^{(4 μ m C m)} = 2.00$ ; $ι_{w_{0} = 2.5}^{(8 μ m C m)} = 5.40$ . See Appendix 9.2.
$κ_{p y}$	The substrate quality of pyruvate; see Equation S89.	Calculated from the values of $k_{p y}^{c a t}$ , $m_{E_{p y}}$ , $m_{0}$ , $r_{p r o t e i n}$ , $r_{c a r b o n}$ , $K_{p y}$ and the external concentration of pyruvate [py].
$β_{r}^{(p y)}$ , $β_{f}^{(p y)}$	$β_{r}^{(p y)}$ and $β_{f}^{(p y)}$ are the stoichiometric coefficients of ATP production per pyruvate in respiration and fermentation, respectively.	$β_{r}^{(p y)} = 10$ ; $β_{f}^{(p y)} = 3$ . (Neidhardt et al., 1990).
$J_{r}^{(E, p y)}$ , $J_{f}^{(E, p y)}$	$J_{r}^{(E, p y)}$ and $J_{f}^{(E, p y)}$ are the normalized energy fluxes of respiration and fermentation for pyruvate utilization; intensive variables.	The corresponding variables of $J_{r}^{(E)}$ and $J_{f}^{(E)}$ in the case of pyruvate utilization.
$ε_{r}^{(p y)}$ , $ε_{f}^{(p y)}$	$ε_{r}^{(p y)}$ and $ε_{f}^{(p y)}$ are the proteome efficiencies for energy biogenesis using pyruvate in the respiration and fermentation pathways.	The corresponding variables of $ε_{r}$ and $ε_{f}$ in the case of pyruvate utilization.
$Ω_{G g}^{'}$	${Ω^{`}}_{G g}$ is a composite parameter defined as $Ω_{G g}^{'} \equiv (η_{b} + η_{c}) / κ_{8} + η_{a 1} / κ_{9}$ .	See Appendix 9.2.
$ψ_{p y}$ , $φ_{p y}$ , $κ_{p y}^{(S T)}$ $κ_{p y}^{(C)}$ , $λ_{m a x}^{(p y)}$	$ψ_{p y}$ , $φ_{p y}$ , $κ_{p y}^{(S T)}$ , $κ_{p y}^{(C)}$ and $λ_{m a x}^{(p y)}$ are the corresponding variables/parameters of $ψ$ , $φ$ , $κ_{A}^{m a x}$ , $κ_{A}^{(C)}$ and $λ_{m a x}$ in the case of pyruvate utilization.	See Appendices 5.1 and 9.2.
$λ_{C}^{(p y)}$ , $μ_{λ_{C}^{(p y)}}$ , $σ_{λ_{C}^{(p y)}}$	$λ_{C}^{(p y)}$ , $μ_{λ_{C}^{(p y)}}$ and $σ_{λ_{C}^{(p y)}}$ are the corresponding variables/parameters of $λ_{C}$ , $μ_{λ_{C}}$ and $σ_{λ_{C}}$ in the case of pyruvate utilization.	See Appendices 5.1 and 9.2.
$N_{P_{i}}^{c a r b o n}$	The number of carbon atoms in a molecule of Pool i.	The value of $N_{P_{i}}^{c a r b o n}$ is approximated by $N_{{E P}_{i}}^{c a r b o n}$ (Equation S107).
$κ_{i}^{(21 A A)}$	The substrate quality of the external supplied amino acids identical to those in Pool i.	See Appendices 5.2 and 9.2.
$Ω_{21 A A}$	$Ω_{21 A A}$ is a composite parameter defined as $\begin{array}{ll} Ω_{21 A A} \equiv 1 / κ_{t} + η_{a 1} / κ_{a 1} \\ + \sum_{i}^{a 2, b, c, d} η_{i} / κ_{i}^{(21 A A)} \end{array}$ .	See Appendices 5.2 and 9.2.
$ψ_{21 A A}$ , $φ_{21 A A}$ , $λ_{m a x}^{(21 A A)}$ , $λ_{C}^{(21 A A)}$ , $μ_{λ_{C}^{(21 A A)}}$ , $σ_{λ_{C}^{(21 A A)}}$	$ψ_{21 A A}$ , $φ_{21 A A}$ , $λ_{m a x}^{(21 A A)}$ , $λ_{C}^{(21 A A)}$ , $μ_{λ_{C}^{(21 A A)}}$ and $σ_{λ_{C}^{(21 A A)}}$ are the corresponding variables/parameters of $ψ$ , $φ$ , $λ_{m a x}$ , $λ_{C}$ , $μ_{λ_{C}}$ and $σ_{λ_{C}}$ in the case of a Group A carbon source is mixed with 21 types of amino acids at saturated concentrations.	See Appendices 5.2 and 9.2.
$Ω_{7 A A}$ , $φ_{7 A A}$ , $μ_{λ_{C}^{(7 A A)}}$ , $σ_{λ_{C}^{(7 A A)}}$	$Ω_{7 A A}$ , $φ_{7 A A}$ , $μ_{λ_{C}^{(7 A A)}}$ and $σ_{λ_{C}^{(7 A A)}}$ are the corresponding parameters of $Ω$ , $φ$ , $μ_{λ_{C}}$ and $σ_{λ_{C}}$ in the case of a Group A carbon source is mixed with 7 types of amino acids.	See Appendices 5.2 and 9.2.
$J_{i n}^{(N)}$ , $ϑ$	$J_{i n}^{(N)}$ is the normalized stoichiometric influx of a Group A carbon source (Equation S136). $ϑ$ is a parameter defined as $ϑ = η_{a 1} + η_{c} + (η_{a 2} + η_{b} + η_{d}) / 2$ for the model shown in Figure 1B.	See Appendix 7.3
$χ_{e x t}$ , $χ_{i n t}$ , $χ_{t o t}$	$χ_{e x t}$ , $χ_{i n t}$ and $χ_{t o t}$ are the level of extrinsic noise, intrinsic noise and total noise in a system.	See Appendix 8.1
$μ_{k_{i}^{c a t}}$ , $σ_{k_{i}^{c a t}}$ , $μ_{1 / k_{i}^{c a t}}$ , $σ_{1 / k_{i}^{c a t}}$ , $μ_{1 / k_{i}^{c a t}}^{'}$ , $σ_{1 / k_{i}^{c a t}}^{'}$	$μ_{k_{i}^{c a t}}$ and $σ_{k_{i}^{c a t}}$ are the mean and standard deviation of $k_{i}^{c a t}$ . $μ_{1 / k_{i}^{c a t}}$ (or $μ_{1 / k_{i}^{c a t}}^{'}$ ) and $σ_{1 / k_{i}^{c a t}}$ (or $σ_{1 / k_{i}^{c a t}}^{'}$ ) are the mean and standard deviation of $1 / k_{i}^{c a t}$ . See Appendix 8.1.	$μ_{k_{i}^{c a t}}$ is approximated by the deterministic value of $k_{i}^{c a t}$ . The CV of $k_{i}^{c a t}$ is set to 25%. $μ_{1 / k_{i}^{c a t}}$ ≈1/ $μ_{k_{i}^{c a t}}$ ; $σ_{1 / k_{i}^{c a t}}$ / $μ_{1 / k_{i}^{c a t}}$ ≈ $σ_{k_{i}^{c a t}}$ / $μ_{k_{i}^{c a t}}$ .
$I G (x; μ, ζ)$	The inverse Gaussian (IG) distribution: variable x>0 with parameters $μ$ and $ζ$ . See Equation S142.	The probability density function is $\sqrt{\frac{ζ}{2 π x^{3}}} e x p (- \frac{ζ {(x - μ)}^{2}}{2 μ^{2} x})$ .
$I O G (x; μ, ζ)$	The positive inverse of Gaussian (IOG) distribution: variable x>0 with parameters $μ$ and $ζ$ . See Equation S140 and Appendix 8.1.	The probability density function is $\sqrt{\frac{ζ}{2 π x^{4}}} e x p (- \frac{ζ {(x - μ)}^{2}}{2 μ^{2} x^{2}})$ .
$ζ_{1 / k_{i}^{c a t}}$ , $ζ_{1 / k_{i}^{c a t}}^{'}$	Distributional parameters of $1 / k_{i}^{c a t}$ corresponding to $ζ$ in an IG or IOG distribution.	See Appendix 8.1
$G (k)$	The characteristic function of IG distribution. See Equation S147.	$G (k) = \int_{- \infty}^{\infty} e^{i k x} \cdot I G (x; μ, ζ) d x$
$X_{i}$ , $α_{i}$ , $Θ$ , $T_{Θ}$ , $Γ_{i} (t)$	$X_{i}$ , $α_{i}$ , $Θ$ and $Γ_{i} (t)$ are variables and parameters used to calculate the first passage time $T_{Θ}$ of a stochastic process that mimics the duration of an enzyme to finishing a catalytic job.	See Appendix 8.1.
$γ_{i}$ , $Ξ$ , $μ_{Ξ}$ , $σ_{Ξ}$	$γ_{i}$ is a real number; $Ξ$ is a variable defined as $Ξ \equiv \sum_{i = 1}^{n} γ_{i} / k_{i}^{c a t}$ ; $μ_{Ξ}$ and $σ_{Ξ}$ are the mean and standard deviation of $Ξ$ .	See Equation S153 and Appendix 8.1.
$μ_{κ_{i}}$ , $σ_{κ_{i}}$ , $μ_{1 / κ_{i}}$ , $σ_{1 / κ_{i}}$	$μ_{κ_{i}}$ and $σ_{κ_{i}}$ are the mean and standard deviation of $κ_{i}$ ; $μ_{1 / κ_{i}}$ and $σ_{1 / κ_{i}}$ are the mean and standard deviation of ${1 / κ}_{i}$ .	See Equation S154 and Appendices 8.1 and 9.2.
$λ_{r}$ , $λ_{f}$ , $μ_{λ_{r}}$ , $σ_{λ_{r}}$ , $μ_{λ_{f}}$ , $σ_{λ_{f}}$ , $ρ_{r f}$	$λ_{r}$ and $λ_{f}$ are the growth rates when cells choose respiration or fermentation; $μ_{λ_{r}}$ , $μ_{λ_{f}}$ and $σ_{λ_{r}}$ , $σ_{λ_{f}}$ are the means and standard deviations of $λ_{r}$ and $λ_{f}$ ; $ρ_{r f}$ is the correlation of $λ_{r}$ and $λ_{f}$ .	See Equation S36 and Appendices 8.1 and 9.2.
$λ_{s u c c i n a t e}^{(21 A A)}$ , $λ_{a c e t a t e}$ , $μ_{λ_{s u c c i n a t e}^{(21 A A)}}$ , $μ_{λ_{a c e t a t e}}$ , $σ_{λ_{s u c c i n a t e}^{(21 A A)}}$ , $σ_{λ_{a c e t a t e}}$	$λ_{s u c c i n a t e}^{(21 A A)}$ and $λ_{a c e t a t e}$ are the growth rates for succinate mixed with 21AA or acetate as the sole carbon source; $μ_{λ_{s u c c i n a t e}^{(21 A A)}}$ , $μ_{λ_{a c e t a t e}}$ and $σ_{λ_{s u c c i n a t e}^{(21 A A)}}$ , $σ_{λ_{a c e t a t e}}$ are the means and standard deviations of $λ_{s u c c i n a t e}^{(21 A A)}$ and $λ_{a c e t a t e}$ .	See Appendix 9.2.
$ϕ_{M T}$ , $κ_{M T}$	$ϕ_{M T}$ and $κ_{M T}$ are the proteomic mass fraction of the enzymes and the effective substrate quality of related metabolites in the mitochondria for yeast and mammalian cells, respectively.	NA
${P r}_{f}$	The proportion of ATP generated from fermentation: $P r_{f} \equiv \frac{J_{f}^{(E)}}{J_{f}^{(E)} + J_{r}^{(E)}}$ .	See Equations S180, S189 and Appendix 10.
$\bar{Δ}$	The proteome efficiency difference between respiration and fermentation: $\bar{Δ} \equiv 1 / ε_{r} - 1 / ε_{f} .$	See Equations S181, S187 and Appendix 10.
$μ_{ε_{r}}$ , $μ_{ε_{f}}$ , $μ_{1 / ε_{r}}$ , $μ_{1 / ε_{f}}$	$μ_{ε_{r}}$ , $μ_{ε_{f}}$ , $μ_{1 / ε_{r}}$ and $μ_{1 / ε_{f}}$ are the mean values of $ε_{r}$ , $ε_{f}$ , $1 / ε_{r}$ and $1 / ε_{f}$ , respectively.	See Equations S182-S184 and Appendix 10.
$σ_{ε_{r}}$ , $σ_{ε_{f}}$ , $σ_{1 / ε_{r}}$ , $σ_{1 / ε_{f}}$	$σ_{ε_{r}}$ , $σ_{ε_{f}}$ , $σ_{1 / ε_{r}}$ , and $σ_{1 / ε_{f}}$ are the standard deviations of $ε_{r}$ , $ε_{f}$ , $1 / ε_{r}$ and $1 / ε_{f}$ , respectively.	See Equations S182, S185 and Appendix 10.
$χ_{ε_{r}}$ , $χ_{ε_{f}}$ , $χ_{1 / ε_{r}}$ , $χ_{1 / ε_{f}}$	$χ_{ε_{r}}$ , $χ_{ε_{f}}$ , $χ_{1 / ε_{r}}$ , and $χ_{1 / ε_{f}}$ are the coefficients of variation of $ε_{r}$ , $ε_{f}$ , $1 / ε_{r}$ and $1 / ε_{f}$ , respectively.	See Equations S185-S186 and Appendix 10.
$μ_{\bar{Δ}}$ , $σ_{\bar{Δ}}$	$μ_{\bar{Δ}}$ and $σ_{\bar{Δ}}$ are the mean and standard deviation of $\bar{Δ}$ , respectively.	See Equations S187-S188 and Appendix 10.
$〈ε_{r}〉$ , $⟨ ε_{f} ⟩$	$〈ε_{r}〉$ and $〈ε_{f}〉$ are the population-averaged values of $ε_{r}$ and $ε_{f}$ , respectively.	Measurable from experiments. See Equations S183-S184 and Appendix 10.

*

Parameter settings for yeast and mammalian cells are specifically labeled as ‘eukaryotic cells.’
†

‘NA’ represents ‘Not applicable.’
‡

Extensive variables scale with the size of the cell population.
§

Intensive variables are scale-invariant with respect to the cell population.

Appendix 1—figure 1

Download asset Open asset

Central metabolic network and carbon utilization pathways of *E. coli*.

(A) Energy biogenesis details in the central metabolic network. In *E. coli,* NADPH and NADH are interconvertible (Sauer et al., 2004), and all energy carriers can be converted to ATP through ADP phosphorylation. The conversion factors are: NADH = 2 ATP, NADPH = 2 ATP, FADH₂=1 ATP (Neidhardt et al., 1990). (B) Relevant genes encoding enzymes in the central metabolic network of *E. coli*. (**C–E**) Three independent fates of glucose metabolism in *E. coli*. (C) For energy biogenesis through fermentation, a molecule of glucose generates 12 ATPs. (D) For energy biogenesis via respiration, a molecule of glucose generates 26 ATPs. (E) For biomass synthesis, glucose is converted into precursors of biomass. Note that biomass synthesis is accompanied by ATP production (see Appendix 3.1).

Appendix 1—figure 2

Download asset Open asset

Model and results for experimental comparison of *E. coli*.

(**A–C**) Model analysis for carbon utilization in mixtures with amino acids. (A) Coarse-grained model for the case of a Group A carbon source mixed with extracellular amino acids. (B) Model predictions (Equations S157, S164-S165) and single-cell reference experimental results (Wallden et al., 2016) showing growth rate distributions for *E. coli* in three culturing conditions. (C) Comparison of the growth rate-fermentation flux relation for *E. coli* in Group A carbon sources between minimal media and enriched media (those with 7AA). (**D–E**) Influence of translation inhibition on overflow metabolism in *E. coli*. (D) A 3D plot illustrating the relations among fermentation flux, growth rate, and translation efficiency (Equations S79 and S160). (E) Growth rate dependence of acetate excretion rate as $κ_{A}$ varies, with each fixed dose of Cm. Translation efficiency is tuned by the dose of Cm, and the maintenance energy coefficient is set to 0 (i.e. $w_{0} = 0$ ). (F) Coarse-grained model for Group A carbon source utilization, which includes more details to compare with experiments. (G) Comparison of in vivo and in vitro catalytic rates for enzymes of *E. coli* within glycolysis and the TCA cycle (see Appendix 1—table 1 for details). (H) The proteome efficiencies for energy biogenesis in the respiration and fermentation pathways vary with growth rate as functions of the substrate quality of pyruvate (Equations S93 and S96)

Appendix 1—figure 3

Download asset Open asset

Relative protein expression of central metabolic enzymes in *E. coli* under various types of perturbations.

(**A–D**) Relative protein expression under $κ_{A}$ perturbation. (A) Experimental data (Hui et al., 2015) for the catalytic enzymes at each step of glycolysis. (B) Experimental data (Hui et al., 2015) for the catalytic enzymes at each step of the TCA cycle. (C) Model predictions (Equation S118, with $w_{0} = 0$ ) and experimental data (Hui et al., 2015) for representative glycolytic genes. (D) Model predictions (Equation S118, with $w_{0} = 0$ ) and experimental data (Hui et al., 2015) for representative genes from the TCA cycle. (**E–J**) Relative protein expression under $ϕ_{Z}$ perturbation. (**E, F, I**) Model predictions and experimental data (Basan et al., 2015) for representative glycolytic genes. (**G, H, J**) Model predictions and experimental data (Basan et al., 2015) for representative genes from the TCA cycle. (**E–H**) Results of $ϕ_{Z}$ perturbation with $w_{0} = 0$ (Equation S120). (**I–J**) Results of $ϕ_{Z}$ perturbation with $w_{0} = 2.5 (h^{- 1})$ (Equation S121). (**K–N**) Relative protein expression upon energy dissipation. (**K–L**) Model fits (Equations S127 and S123) and experimental data (Basan et al., 2015) for representative glycolytic genes. (**M–N**) Model fits (Equations S127 and S123) and experimental data (Basan et al., 2015) for representative genes from the TCA cycle.

Appendix 1—figure 4

Download asset Open asset

Asymptotic distributions of inverse Gaussian distribution and the inverse of Gaussian distribution.

(A) Comparison between the inverse of Gaussian distribution and the corresponding Gaussian distribution for various values of the coefficient of variation (CV) (Equations S140 and S145). (B) Comparison between the inverse Gaussian distribution and the corresponding Gaussian distribution for various values of CV (Equations S142 and S146). Both the inverse Gaussian distribution and the inverse of Gaussian distribution converge to Gaussian distributions when CV is small.

Appendix 1—figure 5

Download asset Open asset

Carbon utilization in yeast and mammalian cells.

(**A–D**) Three independent fates of glucose metabolism in yeast and mammalian cells. (**A–B**) For energy biogenesis through fermentation, one molecule of glucose generates 2 ATPs. (C) For energy biogenesis through respiration, one molecule of glucose generates 32 ATPs. (D) For biomass synthesis, glucose is converted into biomass precursors, with ATP produced as a byproduct. In yeast and mammalian cells, the energy stored in NADH and FADH₂ converts ADP into ATP in the mitochondria, with higher conversion factors than in *E. coli*: NADH = 2.5 ATP, FADH₂=1.5 ATP (Nelson and Cox, 2008). (E) Coarse-grained model for Group A carbon source utilization in yeast. (F) Coarse-grained model for Group A carbon source utilization in mammalian cells.

Appendix 2

Model framework

2.1 Proteome partition

Here, we adopt the proteome partition framework similar to that introduced by Scott et al., 2010. All proteins in a cell are classified into three classes: the fixed portion Q-class, the active ribosome-affiliated R-class, and the remaining catabolic/anabolic enzymes C-class. Each proteome class has a mass $M_{i}^{(P)}$ ( $i = Q, R, C$ ) and mass fraction $ϕ_{i}$ , where $ϕ_{Q}$ is a constant, and we define $ϕ_{m a x} \equiv 1 - ϕ_{Q}$ . In the exponential growth phase, the ribosome allocation for protein synthesis of each class is $f_{i}$ , with $f_{Q} + f_{R} + f_{C} = 1$ .

To analyze cell growth optimization, we first consider the homogeneous case where all cells share identical biochemical parameters, simplifying the mass accumulation of the cell population into a ‘big cell.’ This simplification does not affect the value of growth rate $λ$ . For bacteria, the protein turnover is negligible, so the mass accumulation of each class follows:

d M_{i}^{(P)} / d t = f_{i} \cdot k_{T} \cdot N_{R} \cdot m_{A A} (i = Q, R, C),

where $m_{AA}$ stands for the average molecular weight of amino acids, $k_{T}$ is the translation rate, $N_{R} = M_{rp}^{(P)} / m_{R}$ is the number of ribosomes, $m_{R}$ is the protein mass of a single ribosome, and $M_{rp}^{(P)}$ is the total protein mass of ribosomes, with $M_{R}^{(P)} / M_{rp}^{(P)} = ς \approx 1.67$ (Neidhardt, 1996; Scott et al., 2010). For a specific stable nutrient environment, $f_{R}$ and $k_{T}$ are temporal invariants. Thus,

M_{i}^{(P)} (t) = M_{i}^{(P)} (0) + f_{i} / f_{R} \cdot M_{R}^{(P)} (0) \cdot [e x p (λ \cdot t) - 1] (i = Q, R, C),

where $λ = f_{R} \cdot k_{T} \cdot m_{AA} / (ς \cdot m_{R})$ , and the total protein mass of the cell population $M_{protein} \equiv \sum_{i}^{Q,R,C} M_{i}^{(P)}$ follows:

M_{protein} (t) = M_{protein} (0) + M_{R} (0) \cdot [e x p (λ \cdot t) - 1] / f_{R}

Over a long period in the exponential growth phase (i.e. $t \to + \infty$ ), we have $ϕ_{i} = f_{i}$ $(i = Q, R, C)$ , and

λ = ϕ_{R} \cdot κ_{t},

where $κ_{t} = k_{T} \cdot m_{AA} / (ς \cdot m_{R}) .$

2.2 Precursor pools

Based on the entry points of the metabolic network, we classify the precursors of biomass components into five pools (Figure 1A and B): a1 (entry point: G6P/F6P), a2 (entry point: GA3P/3PG/PEP), b (entry point: pyruvate/acetyl-CoA), c (entry point: α-ketoglutarate), and d (entry point: oxaloacetate). These five pools draw approximately $r_{a1} = 24 %$ , $r_{a2} = 24 %$ , $r_{b} = 28 %$ , $r_{c} = 12 %$ , and $r_{d} = 12 %$ of the carbon flux (Nelson and Cox, 2008; Wang et al., 2019). There are overlapping components between Pools a1 and a2 due to the joint synthesis of some precursors. Therefore, we use Pool a to represent both Pools a1 and a2 in the descriptions.

2.3 Stoichiometric flux

We consider the following biochemical reaction between substrate $S_{i}$ and enzyme $E_{i}$ :

E_{i} + S_{i} ⇌_{d_{i}}^{a_{i}} E_{i} \cdot S_{i} \overset{k_{i}^{c a t}}{\to} E_{i} + b_{i} \cdot S_{i + 1} + c_{i} \cdot {CO}_{2},

where $a_{i}$ , $d_{i}$ and $k_{i}^{cat}$ are the reaction parameters, $S_{i + 1}$ is the product, $b_{i}$ and $c_{i}$ are the stoichiometric coefficients. For most of the reactions in the central metabolism, $b_{i} = 1$ and $c_{i} = 0$ . The reaction rate follows Michaelis–Menten kinetics (Nelson and Cox, 2008):

v_{i} = k_{i}^{cat} \cdot [E_{i}] \cdot \frac{[S_{i}]}{[S_{i}] + K_{i}},

where $K_{i} \equiv (d_{i} + k_{i}^{cat}) / a_{i}$ , $[E_{i}]$ and $[S_{i}]$ are the Michaelis constant, and the concentrations of enzyme $E_{i}$ and substrate $S_{i}$ , respectively. For this reaction (Equation S5), $d [S_{i + 1}] / d t = b_{i} \cdot v_{i}$ and $d [S_{i}] / d t = - v_{i}$ . In the cell population (the ‘big cell’), suppose that the cell volume is $V_{cell}$ , then the stoichiometric flux of the reaction is:

J_{i} \equiv V_{cell} \cdot v_{i} .

The copy number of enzyme $E_{i}$ is $N_{E_{i}} = V_{cell} \cdot [E_{i}]$ with a total weight of $M_{E_{i}} = N_{E_{i}} \cdot m_{E_{i}}$ , where $m_{E_{i}}$ is the molecular weight of $E_{i}$ . By defining the enzyme cost of an $E_{i}$ molecule as $n_{E_{i}} \equiv m_{E_{i}} / m_{0}$ , where $m_{0}$ is a unit mass, then the cost of all $E_{i}$ molecules is $Φ_{i} \equiv N_{E_{i}} \cdot n_{E_{i}}$ (Wang et al., 2019). By further defining $ξ_{i} \equiv \frac{k_{i}^{cat}}{n_{E_{i}}} \cdot \frac{[S_{i}]}{[S_{i}] + K_{i}}$ , then:

J_{i} = Φ_{i} \cdot ξ_{i} .

The mass fraction of enzyme $E_{i}$ in the proteome is $ϕ_{i} = M_{E_{i}} / M_{protein}$ , and thus:

ϕ_{i} = Φ_{i} \cdot \frac{m_{0}}{M_{protein}} .

2.4 Carbon flux and cell growth rate

To clarify the relation between the stoichiometric flux $J_{i}$ and growth rate $λ$ , we consider the carbon flux in the biomass production. The carbon mass of the cell population (the ‘big cell’) is given by $M_{carbon} = M_{protein} \cdot r_{carbon} / r_{protein}$ , where $r_{carbon}$ and $r_{protein}$ represent the mass fraction of carbon and protein within a cell. In the exponential growth phase, the carbon flux of the biomass production is given by:

J_{BM} = \frac{1}{m_{carbon}} \cdot \frac{d M_{carbon}}{d t} = λ \cdot \frac{M_{carbon}}{m_{carbon}},

where $m_{carbon}$ is the mass of a carbon atom. In fact, the carbon mass flux per stoichiometry varies depending on the entry point of the precursor pool. Taking Pool b as an example, there are three carbon atoms in a molecule of the entry point metabolite (i.e. pyruvate). Assuming that carbon atoms are conserved from pyruvate to Pool b, then the carbon flux of Pool b is given by $J_{b}^{carbon} = J_{b} \cdot N_{py}^{carbon}$ , where $J_{b}$ is the stoichiometric flux from pyruvate to Pool b (Figure 1A and B) and $N_{py}^{carbon}$ stands for the carbon number of a pyruvate molecule. Combining with Equation S10 and noting that $J_{b}^{carbon} = r_{b} \cdot J_{BM}$ , we get $J_{b} \cdot N_{py}^{carbon} \cdot m_{carbon} = r_{b} \cdot λ \cdot M_{carbon}$ . Similarly, for each precursor pool, we have:

J_{i} \cdot N_{{EP}_{i}}^{c a r b o n} \cdot m_{carbon} = r_{i} \cdot λ \cdot M_{carbon} (i = a 1, a 2, b, c, d),

where the subscript ‘ ${E P}_{i}$ ’ represents the entry point of Pool i, and $N_{{EP}_{i}}$ is the number of carbon atoms in a molecule of the entry-point metabolite.

For each substrate in intermediate steps of the metabolic network, we define $κ_{i}$ as the substrate quality:

κ_{i} \equiv ξ_{i} \cdot \frac{r_{protein}}{r_{carbon}} = \frac{r_{protein}}{r_{carbon}} \cdot \frac{k_{i}^{cat}}{n_{E_{i}}} \cdot \frac{[S_{i}]}{[S_{i}] + K_{i}},

and for each precursor pool, we define:

η_{i} \equiv r_{i} \cdot m_{0} / (N_{E P_{i}}^{c a r b o n} \cdot m_{c a r b o n}) (i = a 1, a 2, b, c, d) .

Combining Equations S8, S9 and S11, we have

ϕ \cdot κ_{i} = η_{i} \cdot λ (i = a 1, a 2, b, c, d) .

Then, we define the normalized flux, which can be regarded as the flux per unit of biomass:

J_{i}^{(N)} \equiv ϕ_{i} \cdot κ_{i,}

where the superscript ‘(N)’ stands for normalized. Combined with Equations S8, S9 and S12, we have:

J_{i}^{(N)} \equiv J_{i} \cdot \frac{m_{0}}{M_{carbon}} .

Since $\sum_{i}^{a 1, a 2, b, c, d} r_{i} = 1$ , by setting

m_{0} = {[\sum_{i} r_{i} / N_{{EP}_{i}}^{carbon}]}^{- 1} \cdot m_{carbon},

we then obtain:

η_{i} = \frac{r_{i}}{N_{{EP}_{i}}^{c a r b o n}} \cdot {[\sum_{j}^{a 1, a 2, b, c, d} \frac{r_{j}}{N_{{EP}_{j}}^{c a r b o n}}]}^{- 1} (i = a 1, a 2, b, c, d),

and we have $\sum_{i}^{a 1, a 2, b, c, d} η_{i} = 1$ , and

\sum_{i}^{a 1, a 2, b, c, d} ϕ_{i} \cdot κ_{i} = λ .

2.5 Intermediate nodes

In a metabolic network, the metabolites between the carbon source and precursor pools are referred to as intermediate nodes. As specified by Wang et al., 2019, to optimize cell growth rate, the substrate of each intermediate node is nearly saturated, and thus:

κ_{i} \approx \frac{r_{protein}}{r_{carbon}} \cdot \frac{k_{i}^{cat}}{n_{E_{i}}}

Real cases could be more complicated due to other forms of metabolic regulations. Recent quantitative studies (Bennett et al., 2009; Park et al., 2016) have shown that, at least in E. coli, for most of the substrate-enzyme pairs, $K_{i}$ is lower than the substrate concentration (i.e. $[S_{i}] > K_{i}$ ), which implies $κ_{i} \approx \frac{r_{protein}}{r_{carbon}} \cdot \frac{k_{i}^{cat}}{n_{E_{i}}}$ .

Appendix 3

Model and analysis

3.1 Coarse-grained model

In the coarse-grained model shown in Figure 1B, node A represents an arbitrary carbon source of Group A (Wang et al., 2019), which joins at the upper part of glycolysis. Nodes M1, M2, M3, M4, and M5 stand for G6P, PEP, acetyl-CoA, α-ketoglutarate, and oxaloacetate, respectively. In the analysis of carbon supply into precursor pools, we lump sum G6P/F6P as M1, GA3P/3PG/PEP as M2, and pyruvate/acetyl-CoA as M3 for approximation. For the biochemical reactions, each follows Equation S5 with $b_{i} = 1$ except for M1→2M2 and M3 +M5→M4. Basically, there are three independent possible fates for a Group A carbon source (e.g. glucose; see Appendix 1—figure 1C-E; Chen and Nielsen, 2019): energy biogenesis through fermentation; energy biogenesis via respiration (Appendix 1—figure 1C and D), or conversion into biomass components accompanied by energy biogenesis in the biomass pathway. Each fate involves a distinct fraction of the proteome, with no overlap between them (Appendix 1—figure 1).

By applying flux balance to the stoichiometric fluxes and combining with Equation S8, we have:

{\begin{cases} Φ_{A} \cdot ξ_{A} = Φ_{1} \cdot ξ_{1} + Φ_{a 1} \cdot ξ_{a 1}, \\ 2 Φ_{1} \cdot ξ_{1} = Φ_{2} \cdot ξ_{2} + Φ_{5} \cdot ξ_{5} + Φ_{a 2} \cdot ξ_{a 2}, \\ Φ_{2} \cdot ξ_{2} = Φ_{3} \cdot ξ_{3} + Φ_{6} \cdot ξ_{6} + Φ_{b} \cdot ξ_{b}, \\ Φ_{5} \cdot ξ_{5} + Φ_{4} \cdot ξ_{4} = Φ_{3} \cdot ξ_{3} + Φ_{d} \cdot ξ_{d}, \\ Φ_{3} \cdot ξ_{3} = Φ_{4} \cdot ξ_{4} + Φ_{c} \cdot ξ_{c} . \end{cases}

Obviously, the stoichiometric fluxes of respiration $J_{r}$ and fermentation $J_{f}$ (Appendix 1—figure 1C and D) are:

{\begin{cases} J_{r} \equiv J_{4} = Φ_{4} \cdot ξ_{4}, \\ J_{f} \equiv J_{6} = Φ_{6} \cdot ξ_{6} . \end{cases}

We further assume that the carbon atoms are conserved from each entry point metabolite to the precursor pool, and then,

Φ_{i} \cdot ξ_{i} \cdot N_{{EP}_{i}}^{carbon} = r_{i} \cdot J_{BM} (i = a 1, a 2, b, c, d) .

In terms of energy biogenesis for the relevant reactions, for convenience, we convert all the energy currencies into ATPs, namely, NADH→2ATP (Neidhardt et al., 1990), NADPH→2ATP (Neidhardt et al., 1990; Sauer et al., 2004), FADH₂→1ATP (Neidhardt et al., 1990). Then, we have

β_{1} \cdot Φ_{1} \cdot ξ_{1} + β_{2} \cdot Φ_{2} \cdot ξ_{2} + β_{3} \cdot Φ_{3} \cdot ξ_{3} + β_{4} \cdot Φ_{4} \cdot ξ_{4} + β_{6} \cdot Φ_{6} \cdot ξ_{6} + β_{a 1} \cdot Φ_{a 1} \cdot ξ_{a 1} = J_{E},

where $J_{E}$ represents the energy demand for cell proliferation, expressed as the stoichiometric energy flux in ATP. $β_{i}$ is the stoichiometric coefficient with $β_{1} = 4$ , $β_{2} = 3$ , $β_{3} = 2$ , $β_{4} = 6$ , $β_{6} = 1$ , and $β_{a 1} = 4$ for E. coli (Neidhardt et al., 1990; Sauer et al., 2004). For bacteria, the energy demand is generally proportional to the carbon flux infused into biomass production, as the proportion of maintenance energy is roughly negligible (Locasale and Cantley, 2010). Thus,

J_{E} = r_{E} \cdot J_{BM,}

where $r_{E}$ is the ratio and also a constant.

By applying the substitutions specified in Equations S9, S12, S14-S18, combined with Equations S4, S10, S21-S25, and the constraint of proteome resource allocation $ϕ_{R} + ϕ_{C} = ϕ_{m a x}$ , we have:

{\begin{cases} ϕ_{A} \cdot κ_{A} = ϕ_{1} \cdot κ_{1} + ϕ_{a 1} \cdot κ_{a 1}, \\ 2 ϕ_{1} \cdot κ_{1} = ϕ_{2} \cdot κ_{2} + ϕ_{5} \cdot κ_{5} + ϕ_{a 2} \cdot κ_{a 2}, \\ ϕ_{2} \cdot κ_{2} = ϕ_{3} \cdot κ_{3} + ϕ_{6} \cdot κ_{6} + ϕ_{b} \cdot κ_{b}, \\ ϕ_{5} \cdot κ_{5} + ϕ_{4} \cdot κ_{4} = ϕ_{3} \cdot κ_{3} + ϕ_{d} \cdot κ_{d}, \\ ϕ_{3} \cdot κ_{3} = ϕ_{4} \cdot κ_{4} + ϕ_{c} \cdot κ_{c}, \\ ϕ_{a 1} \cdot κ_{a 1} = η_{a 1} \cdot λ, ϕ_{a 2} \cdot κ_{a 2} = η_{a 2} \cdot λ, ϕ_{b} \cdot κ_{b} = η_{b} \cdot λ, ϕ_{c} \cdot κ_{c} = η_{c} \cdot λ, ϕ_{d} \cdot κ_{d} = η_{d} \cdot λ, \\ β_{1} \cdot ϕ_{1} \cdot κ_{1} + β_{2} \cdot ϕ_{2} \cdot κ_{2} + β_{3} \cdot ϕ_{3} \cdot κ_{3} + β_{4} \cdot ϕ_{4} \cdot κ_{4} + β_{6} \cdot ϕ_{6} \cdot κ_{6} + β_{a 1} \cdot ϕ_{a 1} \cdot κ_{a 1} = J_{E}^{(N)}, \\ J_{E}^{(N)} = η_{E} \cdot λ, λ = ϕ_{R} \cdot κ_{t}, J_{r}^{(N)} = ϕ_{4} \cdot κ_{4}, J_{f}^{(N)} = ϕ_{6} \cdot κ_{6}, \\ ϕ_{R} + ϕ_{A} + ϕ_{1} + ϕ_{2} + ϕ_{3} + ϕ_{4} + ϕ_{5} + ϕ_{6} + ϕ_{a 1} + ϕ_{a 2} + ϕ_{b} + ϕ_{c} + ϕ_{d} = ϕ_{m a x}, \end{cases}

where $J_{E}^{(N)}$ and $η_{E}$ are defined as $J_{E}^{(N)} \equiv J_{E} \cdot \frac{m_{0}}{M_{carbon}}$ and $η_{E} \equiv r_{E} \cdot {[\sum_{i} r_{i} / N_{{EP}_{i}}^{carbon}]}^{- 1}$ , respectively. Here, for each intermediate node, $κ_{i}$ follows Equation S20, which can be approximated as a constant. The substrate quality of the Group A carbon source $κ_{A}$ varies with the identity and concentration of the Group A carbon source:

κ_{A} \equiv \frac{r_{protein}}{r_{carbon}} \cdot \frac{k_{A}^{cat}}{m_{E_{A}}} \cdot \frac{[A]}{[A] + K_{A}} \cdot m_{0},

which is determined externally by the culture condition. From Equation S26, all $ϕ_{i}$ and $ϕ_{R}$ can be expressed in terms of $J_{r}^{(N)}$ , $J_{f}^{(N)}$ , and $λ$ :

{\begin{aligned} ϕ_{A} & = [J_{r}^{(N)} + J_{f}^{(N)} + (2 η_{a 1} + η_{a 2} + η_{b} + 2 η_{c} + η_{d}) \cdot λ] / (2 \cdot κ_{A}), \\ ϕ_{1} & = [J_{r}^{(N)} + J_{f}^{(N)} + (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) \cdot λ] / (2 \cdot κ_{1}), \\ ϕ_{2} & = [J_{r}^{(N)} + J_{f}^{(N)} + (η_{b} + η_{c}) \cdot λ] / κ_{2}, \\ ϕ_{3} & = (J_{r}^{(N)} + η_{c} \cdot λ) / κ_{3}, ϕ_{4} = J_{r}^{(N)} / κ_{4}, \\ ϕ_{5} & = (η_{c} + η_{d}) \cdot λ / κ_{5}, ϕ_{6} = J_{f}^{(N)} / κ_{6}, ϕ_{R} = λ / κ_{t}, \\ ϕ_{i} & = η_{i} \cdot λ / κ_{i} (i = a 1, a 2, b, c, d) . \end{aligned}

In Equation S28, for each $ϕ_{i}$ or $ϕ_{R}$ , the $J_{r}^{(N)}$ - and $J_{f}^{(N)}$ -related proteome fraction terms belong to the fractions of the proteome dedicated to respiration (denoted as $ϕ_{r}$ ) and fermentation (denoted as $ϕ_{f}$ ), respectively. The $λ$ -related proteome fraction terms belong to those involved in the biomass synthesis pathway (denoted as $ϕ_{BM}$ ). Thus, $ϕ_{r} = J_{r}^{(N)} \cdot [1 / (2 \cdot κ_{A})$ $+ 1 / (2 \cdot κ_{1})$ $+ 1 / κ_{2}$ $+ 1 / κ_{3}$ $+ 1 / κ_{4}]$ , $ϕ_{f} = J_{f}^{(N)} \cdot$ $[1 / (2 \cdot κ_{A})$ $+ 1 / (2 \cdot κ_{1})$ $+ 1 / κ_{2}$ $+ 1 / κ_{6}]$ , and $ϕ_{BM} = λ \cdot (\frac{1}{κ_{t}}$ $+ \frac{1 + η_{a 1} + η_{c}}{2 κ_{A}}$ $+ \frac{1 - η_{a 1} + η_{c}}{2 κ_{1}}$ $+ \frac{η_{b} + η_{c}}{κ_{2}}$ $+ \frac{η_{c}}{κ_{3}}$ $+ \frac{η_{c} + η_{d}}{κ_{5}}$ $+ \sum_{i}^{a 1, a 2, b, c, d} \frac{η_{i}}{κ_{i}})$ . By substituting Equation S28 into Equation S26, we have:

{\begin{aligned} J_{r}^{(E)} + J_{f}^{(E)} = φ \cdot λ, \\ \frac{J_{r}^{(E)}}{ε_{r}} + \frac{J_{f}^{(E)}}{ε_{f}} = ϕ_{max} - ψ \cdot λ . \end{aligned}

Here, $J_{r}^{(E)}$ and $J_{f}^{(E)}$ stand for the normalized energy fluxes of respiration and fermentation, with

{\begin{cases} J_{r}^{(E)} = β_{r}^{(A)} \cdot J_{r}^{(N)} / 2, \\ J_{f}^{(E)} = β_{f}^{(A)} \cdot J_{f}^{(N)} / 2, \end{cases}

where $β_{r}^{(A)} = β_{1} + 2 (β_{2} + β_{3} + β_{4})$ and $β_{f}^{(A)} = β_{1} + 2 (β_{2} + β_{6})$ , with $β_{r}^{(A)} = 26$ and $β_{f}^{(A)} = 12$ for E. coli. $ε_{r}$ and $ε_{f}$ represent the proteome efficiencies for energy biogenesis in the respiration and fermentation pathways (Appendix 1—figure 1C-D), defined as $ε_{r} \equiv J_{r}^{(E)} / ϕ_{r}$ and $ε_{f} \equiv J_{f}^{(E)} / ϕ_{f}$ ; that is, the normalized energy fluxes expressed in ATP generated per proteomic mass fraction dedicated to respiration and fermentation, respectively. Hence,

{\begin{cases} ε_{r} & = \frac{β_{r}^{(A)}}{1 / κ_{A} + 1 / κ_{1} + 2 / κ_{2} + 2 / κ_{3} + 2 / κ_{4}}, \\ ε_{f} & = \frac{β_{f}^{(A)}}{1 / κ_{A} + 1 / κ_{1} + 2 / κ_{2} + 2 / κ_{6}} . \end{cases}

$ψ^{- 1}$ is the proteome efficiency for biomass generation in the biomass synthesis pathway (Appendix 1—figure 1E), defined as $ψ^{- 1} \equiv λ / ϕ_{BM} = \sum_{i}^{a 1, a 2, b, c, d} J_{i}^{(N)} / ϕ_{BM}$ (see Equations S15 and S19); that is, the normalized flux (which differs from the normalized energy flux used to define $ε_{r}$ and $ε_{f}$ ) generated per proteomic mass fraction dedicated to biomass synthesis. Hence

ψ = \frac{1}{κ_{t}} + \frac{1 + η_{a 1} + η_{c}}{2 κ_{A}} + \frac{η_{a 2} + η_{b} + 2 η_{c} + η_{d}}{2 κ_{1}} + \frac{η_{b} + η_{c}}{κ_{2}} + \frac{η_{c}}{κ_{3}} + \frac{η_{c} + η_{d}}{κ_{5}} + \sum_{i}^{a 1, a 2, b, c, d} \frac{η_{i}}{κ_{i}} .

$φ$ is an energy demand coefficient (a constant), with

φ \equiv η_{E} - β_{1} \cdot (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) / 2 - β_{2} \cdot (η_{b} + η_{c}) - β_{3} \cdot η_{c} - β_{a 1} \cdot η_{a 1},

and $φ \cdot λ$ stands for the normalized flux of energy demand other than the accompanying energy biogenesis from the biomass synthesis pathway.

3.2 The reason for overflow metabolism

Microbes optimize their growth rate to survive through the evolutionary process (Vander Heiden et al., 2009). The optimal growth principle also roughly holds for tumor cells, which proliferate while ignoring growth restriction signals and evading immune destruction by the host (Vander Heiden et al., 2009). First, we consider the optimal growth strategy for a single cell. The coarse-grained model for bacteria is summarized in Equation S26 and further simplified in Equation S29. Here, $ε_{r}$ , $ε_{f}$ and $ψ$ are functions of $κ_{A}$ (see Equations S31, S32), so we also denote them as $ε_{r} (κ_{A})$ , $ε_{f} (κ_{A})$ , $ψ (κ_{A})$ . Evidently, the fluxes of both respiration and fermentation take non-negative values, i.e., $J_{r}^{(E)}, J_{f}^{(E)} \geq 0$ , and all the coefficients are positive: $ε_{r} (κ_{A}), ε_{f} (κ_{A}), ψ (κ_{A}), φ > 0$ .

Thus, if $ε_{r} > ε_{f}$ , then $(ψ + φ / ε_{r}) \cdot λ = ϕ_{m a x} - J_{f}^{(E)} (1 / ε_{f} - 1 / ε_{r}) \leq ϕ_{m a x}$ . Obviously, the solution for optimal growth is:

{\begin{cases} J_{f}^{(E)} = 0, \\ J_{r}^{(E)} = φ \cdot λ . \end{cases} ε_{r} > ε_{f} .

Similarly, if $ε_{f} > ε_{r}$ , then the optimal growth solution is:

{\begin{cases} J_{f}^{(E)} = φ \cdot λ, \\ J_{r}^{(E)} = 0. \end{cases} ε_{r} < ε_{f} .

In both cases, the growth rate $λ$ takes the maximum value for a given nutrient condition (i.e. given $κ_{A}$ ):

λ = {\begin{aligned} λ_{r} & = \frac{ϕ_{m a x}}{φ / ε_{r} (κ_{A}) + ψ (κ_{A})} ε_{r} (κ_{A}) > ε_{f} (κ_{A}), \\ λ_{f} & = \frac{ϕ_{m a x}}{φ / ε_{f} (κ_{A}) + ψ (κ_{A})} ε_{r} (κ_{A}) < ε_{f} (κ_{A}) . \end{aligned}

So, why do microbes use the seemingly wasteful fermentation pathway when the growth rate is large under aerobic conditions? Prevalent explanations (Basan et al., 2015; Chen and Nielsen, 2019) suggest that it originates from that the proteome efficiency in fermentation is consistently higher than in respiration (i.e. $ε_{f} > ε_{r}$ ). If this is the case, why do microbes still use the normal respiration pathway when the growth rate is small? The answer lies in the fact that both $ε_{r} (κ_{A})$ and $ε_{f} (κ_{A})$ are not constants, but are dependent on nutrient conditions. In Equation S31, when $κ_{A}$ is small, consider the extreme case of $κ_{A} \to 0$ , and then

{\begin{cases} ε_{r} (κ_{A} \to 0) \approx β_{r}^{(A)} \cdot κ_{A}, \\ ε_{f} (κ_{A} \to 0) \approx β_{f}^{(A)} \cdot κ_{A} . \end{cases}

Since $β_{r}^{(A)} ≫ β_{f}^{(A)}$ , clearly,

ε_{r} (κ_{A} \to 0) > ε_{f} (κ_{A} \to 0) .

Combined with Equation S36, thus cells would certainly use the respiration pathway when the growth rate is very small. Meanwhile, suppose that $κ_{A}^{m a x}$ is the maximum value of $κ_{A}$ available across different Group A carbon sources, and if there exists a $κ_{A}$ (with $κ_{A} \leq κ_{A}^{m a x}$ ) satisfying $ε_{r} (κ_{A}) < ε_{f} (κ_{A})$ , specifically,

\frac{β_{r}^{(A)} - β_{f}^{(A)}}{κ_{A}} < β_{f}^{(A)} (\frac{1}{κ_{1}} + \frac{2}{κ_{2}} + \frac{2}{κ_{3}} + \frac{2}{κ_{4}}) - β_{r}^{(A)} \cdot (\frac{1}{κ_{1}} + \frac{2}{κ_{2}} + \frac{2}{κ_{6}}),

then $Δ (κ_{A}) \equiv ε_{f} (κ_{A}) / ε_{r} (κ_{A})$ is a monotonically increasing function of $κ_{A}$ . Thus,

ε_{r} (κ_{A}^{m a x}) < ε_{f} (κ_{A}^{m a x}),

and cells would use the fermentation pathway when the growth rate is large.

In practice, experimental studies using E. coli (Basan et al., 2015) have demonstrated that proteome efficiency in fermentation is higher than in respiration when the Group A carbon source is lactose at a saturated concentration, i.e., $ε_{r} (κ_{lactose}^{(ST)}) < ε_{f} (κ_{lactose}^{(ST)})$ . Here, $κ_{lactose}^{(ST)}$ represents the substrate quality of lactose and the superscript ‘(ST)’ signifies saturated concentration. In fact, E. coli grows much faster in G6p than lactose (Basan et al., 2015), thus, $κ_{A}^{m a x} > κ_{lactose}^{(ST)}$ , and hence, Equation S40 holds for E. coli. From a theoretical perspective, we can verify Equation S39 and consequently Equation S40 using Equation S20, combined with the in vivo/in vitro biochemical parameters obtained from experimental data (see Appendix 1—table 1; Appendix 1—table 2). For example, it is straightforward to confirm that $ε_{r} (κ_{glucose}^{(ST)}) < ε_{f} (κ_{glucose}^{(ST)})$ using this method (see Appendix 9.2), further supporting the validity of Equations S39-S40 (see also Appendix 10).

Now that Equations S38-S40 are all valid, a critical value of $κ_{A}$ , denoted as $κ_{A}^{(C)}$ , exists, satisfying $Δ (κ_{A}^{(C)}) = 1$ . Thus,

{\begin{cases} ε_{f} (κ_{A}) > ε_{r} (κ_{A}), κ_{A} > κ_{A}^{(C)}; \\ ε_{f} (κ_{A}) = ε_{r} (κ_{A}), κ_{A} = κ_{A}^{(C)}; \\ ε_{f} (κ_{A}) < ε_{r} (κ_{A}), κ_{A} < κ_{A}^{(C)} . \end{cases}

Combined with Equation S31, we have:

κ_{A}^{(C)} = \frac{β_{r}^{(A)} - β_{f}^{(A)}}{β_{f}^{(A)} (1 / κ_{1} + 2 / κ_{2} + 2 / κ_{3} + 2 / κ_{4}) - β_{r}^{(A)} (1 / κ_{1} + 2 / κ_{2} + 2 / κ_{6})} .

By substituting Equation S42 into Equations S31, S32 and S36, we obtain the expressions for $ε_{r} (κ_{A}^{(C)})$ , $ε_{f} (κ_{A}^{(C)})$ and the critical growth rate at the transition point (i.e. $λ_{C} \equiv λ (κ_{A}^{(C)})$ ):

{\begin{aligned} ε_{r} (κ_{A}^{(C)}) = ε_{f} (κ_{A}^{(C)}) = \frac{β_{r}^{(A)} - β_{f}^{(A)}}{2 (1 / κ_{3} + 1 / κ_{4} - 1 / κ_{6})} = \frac{β_{3} + β_{4} - β_{6}}{1 / κ_{3} + 1 / κ_{4} - 1 / κ_{6}}, \\ λ_{C} = \frac{ϕ_{m a x}}{φ / ε_{r / f} (κ_{A}^{(C)}) + ψ (κ_{A}^{(C)})}, \end{aligned}

where $ε_{r / f}$ represents either $ε_{r}$ or $ε_{f}$ . In Figure 1E, we show the dependencies of $ε_{r} (κ_{A})$ , $ε_{f} (κ_{A})$ , and $λ (κ_{A})$ on $κ_{A}$ in a three-dimensional form, as $κ_{A}$ changes.

3.3 The relation between respiration/fermentation flux and growth rate

We proceed to study the relation between the respiration/fermentation flux and the cell growth rate. From Equations S16 and S30, we see that the stoichiometric fluxes $J_{r}$ , $J_{f}$ , the normalized fluxes $J_{r}^{(N)}$ , $J_{f}^{(N)}$ , and the normalized energy fluxes $J_{r}^{(E)}$ , $J_{f}^{(E)}$ are all interconvertible. For convenience, we first analyze the relations between $J_{r}^{(E)}$ , $J_{f}^{(E)}$ , and $λ$ under growth rate optimization. In fact, all these terms are merely functions of $κ_{A}$ (see Equations S34-S36), which is determined by the nutrient condition (Equation S27).

In the homogeneous case, where all microbes share identical biochemical parameters, as $λ (κ_{A})$ increases with $κ_{A}$ , $J_{f}^{(E)}$ appear abruptly and $J_{r}^{(E)}$ vanish simultaneously as $κ_{A}$ exceeds $κ_{A}^{(C)}$ (Figure 1E; see also Equations S34-S35, S41). Combining Equations S34-S36 and S43, we obtain:

{\begin{cases} J_{f}^{(E)} = φ \cdot λ \cdot θ (λ - λ_{C}), \\ J_{r}^{(E)} = φ \cdot λ \cdot [1 - θ (λ - λ_{C})] . \end{cases}

where ‘ $θ$ ’ stands for the Heaviside step function. Defining $λ_{m a x} = λ (κ_{A}^{m a x})$ , and then, $[0, λ_{m a x}]$ is the relevant range of the x-axis. In fact, the digital responses in Equation S44 are consistent with the numerical simulation results of Molenaar et al., 2009. However, these results are incompatible with the threshold-analog response in the standard picture of overflow metabolism (Basan et al., 2015; Holms, 1996).

In practice, the values of $k_{i}^{cat}$ can be greatly influenced by the concentrations of potassium and phosphate (García-Contreras et al., 2012), which vary from cell to cell. Consequently, there is a distribution of values for $k_{i}^{cat}$ among cell populations, commonly referred to as extrinsic noise (Elowitz et al., 2002). For convenience, we assume that each $k_{i}^{cat}$ (and thus $κ_{i}$ ) follows a Gaussian distribution with a coefficient of variation (CV) of 25%. Therefore, the distributions of proteome efficiencies that determine the choice between respiration and fermentation, $ε_{r}$ and $ε_{f}$ , and the critical growth rate for the transition, $λ_{C}$ , can be approximated by Gaussian distributions for a cell population (see Appendix 8.1 for details). Specifically, $λ_{C}$ follows:

λ_{C} \sim N (μ_{λ_{C}}, σ_{λ_{C}}^{2}),

where $μ_{λ_{C}}$ and $σ_{λ_{C}}$ represent the mean and standard deviation of $λ_{C}$ , with the CV $σ_{λ_{C}} / μ_{λ_{C}}$ calculated to be 12% (see Appendix 9.2 for details). Note that $λ$ is $κ_{A}$ dependent, while $λ_{C}$ is independent of $κ_{A}$ . Thus, given the growth rate of microbes in a culturing medium (e.g. in a chemostat), the normalized energy fluxes are:

{\begin{aligned} J_{f}^{(E)} (λ) = \frac{1}{2} φ \cdot λ \cdot [erf (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}}) + 1], \\ J_{r}^{(E)} (λ) = \frac{1}{2} φ \cdot λ \cdot [1 - erf (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}})], \end{aligned}

where ‘erf’ represents the error function. In practice, given a culturing medium, there is also a probability distribution for the growth rate (Appendix 1—figure 2B; see also Equation S157). For approximation, in plotting the growth rate-respiration/fermentation flux relations, we use the deterministic (noise-free) value of the growth rate as a proxy. To compare with experiments, we essentially compare the normalized fluxes, $J_{r}^{(N)}$ and $J_{f}^{(N)}$ (see Appendix 9.1 for details). Combining Equations S30 and S46, we obtain:

{\begin{aligned} J_{f}^{(N)} (λ) = \frac{φ}{β_{f}^{(A)}} \cdot λ \cdot [erf (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}}) + 1], \\ J_{r}^{(N)} (λ) = \frac{φ}{β_{r}^{(A)}} \cdot λ \cdot [1 - erf (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}})] . \end{aligned}

In Figure 1C–D, we see that Equation S47 quantitatively illustrates the experimental data (Basan et al., 2015), where the model parameters were obtained using biochemical data for the catalytic enzymes (see Appendix 1—table 1 for details).

3.4 Dependence of the model on optimization principles

In the derivation of the growth rate dependence of respiration/fermentation flux described above (Equation S44 for the single-cell level and Equation S47 for the population-averaged level), we applied the principles of optimal growth, incorporating both efficient protein allocation to enzymes and ribosomes (through ribosomal proteins). However, recent experimental studies show that the inactive portion of ribosomes (i.e. ribosomes not bound to mRNAs) may vary with culturing conditions (Dai et al., 2017; Li et al., 2018) and between individual cells within the same culture (Pavlou et al., 2025), despite an overall trend toward growth optimization. These findings (Dai et al., 2017; Li et al., 2018; Pavlou et al., 2025) suggest that ribosome allocation may be suboptimal under many culturing conditions, likely as a result of cells preparing for potential environmental changes (Li et al., 2018). Nevertheless, since our model’s predictions regarding the binary choice between respiration and fermentation rely solely on comparing proteome efficiency between these two pathways, which involves only efficient protein allocation to enzymes, and because the active portion of ribosomes and the translation elongation rate can be approximated as constants within the growth rate range of interest for cells exhibiting overflow metabolism (Dai et al., 2017), our model remains applicable to suboptimal growth conditions. This can be achieved by incorporating suboptimal ribosome allocation factors, lowering the parameter $κ_{t}$ (which results in a larger $ψ$ through Equation S32), to account for these influences. For convenience, we present results for optimal growth below, while all model results can be extended to cases of suboptimal ribosome allocation.

Regarding the mechanism by which cells sense and choose between respiration and fermentation, although the standard picture of overflow metabolism (Basan et al., 2015; Holms, 1996) presents a growth rate dependence of fermentation flux, it is the proteome efficiency of respiration and fermentation, rather than the growth rate, that a cell should sense directly. Due to stochasticity in gene expression and metabolic reactions, the cell growth rate may fluctuate within a cell cycle (Kiviet et al., 2014; Pavlou et al., 2025), and suboptimal factors related to ribosome allocation (Dai et al., 2017; Li et al., 2018) would further complicate the scheme if cells were sensing via growth rate. Essentially, to expedite cell growth and survive under evolutionary pressure, cells should adopt the optimal strategy by directly sensing and comparing proteome efficiencies between respiration and fermentation, choosing the pathway with higher efficiency. This is analogous to how microbes choose between two types of carbon sources in a mixture for nutrient uptake (Wang et al., 2019). Mechanistically, the cyclic AMP (cAMP)-cAMP receptor protein (CRP) system plays an important role in sensing proteome efficiency and executing the optimal strategy between respiration and fermentation (Basan et al., 2015; Towbin et al., 2017; Valgepea et al., 2010; Wehrens et al., 2023). However, the roles of additional unidentified regulators are required to fully elucidate this mechanism (Basan et al., 2015; Valgepea et al., 2010).

Appendix 4

Model perturbations

4.1 Overexpression of useless proteins

Here, we consider the case of overexpression of the protein encoded by the lacZ gene (i.e. $ϕ_{Z}$ perturbation) in E. coli. Effectively, this limits the proteome by altering $ϕ_{m a x}$ :

ϕ_{m a x} \overset{LacZ overexpression}{\to} ϕ_{m a x} - ϕ_{Z},

where $ϕ_{Z}$ stands for the proteomic mass fraction of useless proteins, which is controllable in experiments. Then, the growth rate changes into a bivariate function of $κ_{A}$ and $ϕ_{Z}$ :

λ (κ_{A}, ϕ_{Z}) = {\begin{aligned} \frac{ϕ_{m a x} - ϕ_{Z}}{φ / ε_{r} (κ_{A}) + ψ (κ_{A})} ε_{r} (κ_{A}) > ε_{f} (κ_{A}), \\ \frac{ϕ_{m a x} - ϕ_{Z}}{φ / ε_{f} (κ_{A}) + ψ (κ_{A})} ε_{r} (κ_{A}) < ε_{f} (κ_{A}), \end{aligned}

and thus,

λ (κ_{A}, ϕ_{Z}) = λ (κ_{A}, 0) (1 - ϕ_{Z} / ϕ_{m a x}) .

Obviously, $κ_{A}^{(C)}$ remains a constant (following Equation S42), while $λ_{C} (ϕ_{Z}) \equiv λ (κ_{A}^{(C)}, ϕ_{Z})$ and $λ_{m a x} (ϕ_{Z}) \equiv λ (κ_{A}^{m a x}, ϕ_{Z})$ become functions of $ϕ_{Z}$ :

{\begin{cases} λ_{C} (ϕ_{Z}) = λ_{C} (0) (1 - ϕ_{Z} / ϕ_{m a x}), \\ λ_{m a x} (ϕ_{Z}) = λ_{m a x} (0) (1 - ϕ_{Z} / ϕ_{m a x}) . \end{cases}

In the homogeneous case, $J_{f}^{(E)}$ and $J_{r}^{(E)}$ follow:

{\begin{cases} J_{f}^{(E)} (κ_{A}, ϕ_{Z}) = φ \cdot λ (κ_{A}, ϕ_{Z}) \cdot θ (λ (κ_{A}, ϕ_{Z}) - λ_{C} (ϕ_{Z})), \\ J_{r}^{(E)} (κ_{A}, ϕ_{Z}) = φ \cdot λ (κ_{A}, ϕ_{Z}) \cdot [1 - θ (λ (κ_{A}, ϕ_{Z}) - λ_{C} (ϕ_{Z}))] . \end{cases}

Combined with Equations S50-S51, we have:

{\begin{cases} J_{f}^{(E)} (κ_{A}, ϕ_{Z}) = φ \cdot λ (κ_{A}, ϕ_{Z}) \cdot θ (λ (κ_{A}, 0) - λ_{C} (0)), \\ J_{r}^{(E)} (κ_{A}, ϕ_{Z}) = φ \cdot λ (κ_{A}, ϕ_{Z}) \cdot [1 - θ (λ (κ_{A}, 0) - λ_{C} (0))] . \end{cases}

To compare with experiments, we assume that each $k_{i}^{cat}$ and $κ_{i}$ follow the extrinsic noise with a CV of 25% specified in Appendix 3.3, and we neglect the noise on $ϕ_{Z}$ and $ϕ_{m a x}$ . Combining Equations S45 and S51, $λ_{C} (ϕ_{Z})$ approximately follows a Gaussian distribution:

λ_{C} (ϕ_{Z}) \sim N (μ_{λ_{C}} (ϕ_{Z}), σ_{λ_{C}} {(ϕ_{Z})}^{2}),

where $μ_{λ_{C}} (ϕ_{Z})$ and $σ_{λ_{C}} (ϕ_{Z})$ represent the mean and standard deviation of $λ_{C} (ϕ_{Z})$ , with

{\begin{cases} μ_{λ_{C}} (ϕ_{Z}) = μ_{λ_{C}} (0) (1 - ϕ_{Z} / ϕ_{m a x}), \\ σ_{λ_{C}} (ϕ_{Z}) = σ_{λ_{C}} (0) (1 - ϕ_{Z} / ϕ_{m a x}) . \end{cases}

Here, $μ_{λ_{C}} (0)$ , $σ_{λ_{C}} (0)$ , $λ_{C} (0)$ , $λ_{m a x} (0)$ , and $λ (κ_{A}, 0)$ represent the parameters or variables free from $ϕ_{Z}$ perturbation, just as those in Appendix 3.3. Since the noise on the multiplier term (i.e. $1 - ϕ_{Z} / ϕ_{m a x}$ ) is negligible, the CV of $λ_{C} (ϕ_{Z})$ (i.e. $σ_{λ_{C}} (ϕ_{Z}) / μ_{λ_{C}} (ϕ_{Z})$ ) is unaffected by $ϕ_{Z}$ . By combining Equations S46 and S48, we obtain the relations between the normalized energy fluxes and growth rate:

{\begin{aligned} J_{f}^{(E)} (λ (κ_{A}, ϕ_{Z}), ϕ_{Z}) = \frac{1}{2} φ \cdot λ (κ_{A}, ϕ_{Z}) \cdot [erf (\frac{λ (κ_{A}, ϕ_{Z}) - μ_{λ_{C}} (ϕ_{Z})}{\sqrt{2} σ_{λ_{C}} (ϕ_{Z})}) + 1], \\ J_{r}^{(E)} (λ (κ_{A}, ϕ_{Z}), ϕ_{Z}) = \frac{1}{2} φ \cdot λ (κ_{A}, ϕ_{Z}) \cdot [1 - erf (\frac{λ (κ_{A}, ϕ_{Z}) - μ_{λ_{C}} (ϕ_{Z})}{\sqrt{2} σ_{λ_{C}} (ϕ_{Z})})], \end{aligned}

where $λ (κ_{A}, ϕ_{Z})$ , $μ_{λ_{C}} (ϕ_{Z})$ , and $σ_{λ_{C}} (ϕ_{Z})$ follow Equations S50 and S55 accordingly. For a given value of $ϕ_{Z}$ , i.e., $ϕ_{Z}$ is fixed, then, $λ (κ_{A}, ϕ_{Z})$ changes monotonically with $κ_{A}$ . Combining Equations S55-S56 and S30, we obtain the relation between the normalized fluxes $J_{r}^{(N)}$ , $J_{f}^{(N)}$ , and the growth rate (where $ϕ_{Z}$ is a parameter):

{\begin{aligned} J_{f}^{(N)} (λ, ϕ_{Z}) = \frac{φ}{β_{f}^{(A)}} \cdot λ \cdot [erf (\frac{λ - μ_{λ_{C}} (0) (1 - ϕ_{Z} / ϕ_{m a x})}{\sqrt{2} σ_{λ_{C}} (0) (1 - ϕ_{Z} / ϕ_{m a x})}) + 1], \\ J_{r}^{(N)} (λ, ϕ_{Z}) = \frac{φ}{β_{r}^{(A)}} \cdot λ \cdot [1 - erf (\frac{λ - μ_{λ_{C}} (0) (1 - ϕ_{Z} / ϕ_{m a x})}{\sqrt{2} σ_{λ_{C}} (0) (1 - ϕ_{Z} / ϕ_{m a x})})] . \end{aligned}

In Figure 2C. we show that the model predictions (Equation S57) quantitatively agree with the experiments (Basan et al., 2015).

Meanwhile, we can also perturb the growth rate by tuning $ϕ_{Z}$ in a stable culturing environment with fixed concentration of a Group A carbon source (i.e. given [A]). In fact, for this case there is a distribution of $κ_{A}$ values due to the extrinsic noise in $k_{A}^{cat}$ , yet this distribution is fixed. For convenience of description, we still referred to it as fixed $κ_{A}$ . Then, combining Equations S30, S50, S55 and S56, we get:

{\begin{aligned} J_{f}^{(N)} (λ, ϕ_{Z}) = \frac{φ}{β_{f}^{(A)}} \cdot [erf (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)}) + 1] \cdot λ, \\ J_{r}^{(N)} (λ, ϕ_{Z}) = \frac{φ}{β_{r}^{(A)}} \cdot [1 - erf (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})] \cdot λ . \end{aligned}

Here, $λ (κ_{A}, 0)$ remains unaltered as $κ_{A}$ is fixed. Therefore, in this case, $J_{f}^{(N)}$ and $J_{r}^{(N)}$ are proportional to $λ$ , where the slopes are both functions of $κ_{A}$ . More specifically, the slope of $J_{f}^{(N)}$ is a monotonically increasing function of $κ_{A}$ , while that of $J_{r}^{(N)}$ is a monotonically decreasing function of $κ_{A}$ . In Figure 2B, we see that the model predictions (Equation S58) agree quantitatively with the experiments (Basan et al., 2015).

In fact, the growth rate can be altered by tuning $ϕ_{Z}$ and $κ_{A}$ simultaneously. Then, the relations among the energy fluxes, growth rate, and $ϕ_{Z}$ still follow Equation S57 (where $ϕ_{Z}$ is a variable). In a 3-D representation, these relations correspond to a surface. In Figure 2A, we show that the model predictions (Equation S57) match well with the experimental data (Basan et al., 2015).

4.2 Energy dissipation

In practice, energy dissipation disrupts the proportional relationship between energy demand and biomass production. Thus, Equation S25 becomes:

J_{E} = r_{E} \cdot J_{BM} + w \cdot \frac{M_{carbon}}{m_{0}},

where $w$ represents the dissipation coefficient. In fact, maintenance energy contributes to energy dissipation, and we define the maintenance energy coefficient as $w_{0}$ . In bacteria, the impact of maintenance energy is roughly negligible, yet in tumor cells, it plays a much more significant role (Locasale and Cantley, 2010).

The introduction of energy dissipation leads to a modification of Equation S26: combining Equation S59 and Equation S16, we have:

J_{E}^{(N)} = η_{E} \cdot λ + w .

Then, Equation S29 changes to:

{\begin{aligned} J_{r}^{(E)} + J_{f}^{(E)} = φ \cdot λ + w, \\ \frac{J_{r}^{(E)}}{ε_{r}} + \frac{J_{f}^{(E)}}{ε_{f}} = ϕ_{m a x} - ψ \cdot λ . \end{aligned}

Consequently, if $ε_{r} > ε_{f}$ , the optimal growth strategy for the cell is:

{\begin{array}{l} J_{f}^{(E)} = 0, \\ J_{r}^{(E)} = φ \cdot λ + w, \end{array} ε_{r} > ε_{f},

and if $ε_{f} > ε_{r}$ , the optimal growth strategy is:

{\begin{array}{l} J_{f}^{(E)} = φ \cdot λ + w, \\ J_{r}^{(E)} = 0. \end{array} ε_{r} < ε_{f} .

Then, the growth rate becomes a bivariate function of both $κ_{A}$ and $w$ :

λ (κ_{A}, w) = {\begin{aligned} \frac{ϕ_{m a x} - w / ε_{r} (κ_{A})}{φ / ε_{r} (κ_{A}) + ψ (κ_{A})} ε_{r} (κ_{A}) > ε_{f} (κ_{A}), \\ \frac{ϕ_{m a x} - w / ε_{f} (κ_{A})}{φ / ε_{f} (κ_{A}) + ψ (κ_{A})} ε_{r} (κ_{A}) < ε_{f} (κ_{A}) . \end{aligned}

Clearly, $κ_{A}^{(C)}$ is still a constant, while $λ_{C} (w) \equiv λ (κ_{A}^{(C)}, w)$ and $λ_{m a x} (w) \equiv λ (κ_{A}^{m a x}, w)$ become functions of $w$ :

{\begin{aligned} λ_{C} (w) = λ_{C} (0) {1 - w / [ε_{r / f} (κ_{A}^{(C)}) ϕ_{m a x}]}, \\ λ_{m a x} (w) = λ_{m a x} (0) {1 - w / [ε_{f} (κ_{A}^{m a x}) ϕ_{m a x}]} . \end{aligned}

For a cell population, in the homogeneous case, $J_{f}^{(E)}$ and $J_{r}^{(E)}$ follow:

{\begin{cases} J_{f}^{(E)} (κ_{A}, w) = [φ \cdot λ (κ_{A}, w) + w] \cdot θ (λ (κ_{A}, w) - λ_{C} (w)), \\ J_{r}^{(E)} (κ_{A}, w) = [φ \cdot λ (κ_{A}, w) + w] \cdot [1 - θ (λ (κ_{A}, w) - λ_{C} (w))] . \end{cases}

To compare with experiments, we assume the same extent of extrinsic noise in $k_{i}^{cat}$ (and thus $κ_{i}$ ) as that specified in Appendix 3.3. Combining Equations S45 and S65, $λ_{C} (w)$ approximately follows a Gaussian distribution:

λ_{C} (w) \sim N (μ_{λ_{C}} (w), σ_{λ_{C}} {(w)}^{2}),

where $μ_{λ_{C}} (w)$ and $σ_{λ_{C}} (w)$ represent the mean and standard deviation of $λ_{C} (w)$ , and

{\begin{aligned} μ_{λ_{C}} (w) = μ_{λ_{C}} (0) {1 - w / [ε_{r / f} (κ_{A}^{(C)}) ϕ_{m a x}]}, \\ σ_{λ_{C}} (w) \approx σ_{λ_{C}} (0) {1 - w / [ε_{r / f} (κ_{A}^{(C)}) ϕ_{m a x}]} . \end{aligned}

Here, $μ_{λ_{C}} (0)$ , $σ_{λ_{C}} (0)$ , $λ_{C} (0)$ , $λ_{m a x} (0)$ , and $λ (κ_{A}, 0)$ represent parameters or variables unaffected by energy dissipation. In fact, there is a distribution of values for $ε_{r / f} (κ_{A}^{(C)})$ . For approximation, we use the deterministic value of $ε_{r / f} (κ_{A}^{(C)})$ in Equation S68, and then the CV of $λ_{C} (w)$ remains largely unperturbed by $w$ . Combining Equations S46, S66 and S67, we have:

{\begin{aligned} J_{f}^{(E)} (λ (κ_{A}, w), w) = \frac{1}{2} (φ \cdot λ (κ_{A}, w) + w) \cdot [erf (\frac{λ (κ_{A}, w) - μ_{λ_{C}} (w)}{\sqrt{2} σ_{λ_{C}} (w)}) + 1], \\ J_{r}^{(E)} (λ (κ_{A}, w), w) = \frac{1}{2} (φ \cdot λ (κ_{A}, w) + w) \cdot [1 - erf (\frac{λ (κ_{A}, w) - μ_{λ_{C}} (w)}{\sqrt{2} σ_{λ_{C}} (w)})] . \end{aligned}

Since the dissipation coefficient $w$ is tunable in experiments, for a given value of $w$ , $λ (κ_{A}, w)$ changes monotonically with $κ_{A}$ . Combining Equations S68-S69 and S30, we have (here $w$ is a parameter):

{\begin{aligned} J_{f}^{(N)} (λ, w) = \frac{φ \cdot λ + w}{β_{f}^{(A)}} \cdot [erf (\frac{λ - μ_{λ_{C}} (0) {1 - w / [ε_{r / f} (κ_{A}^{(C)}) ϕ_{m a x}]}}{\sqrt{2} σ_{λ_{C}} (0) {1 - w / [ε_{r / f} (κ_{A}^{(C)}) ϕ_{m a x}]}}) + 1], \\ J_{r}^{(N)} (λ, w) = \frac{φ \cdot λ + w}{β_{r}^{(A)}} \cdot [1 - erf (\frac{λ - μ_{λ_{C}} (0) {1 - w / [ε_{r / f} (κ_{A}^{(C)}) ϕ_{m a x}]}}{\sqrt{2} σ_{λ_{C}} (0) {1 - w / [ε_{r / f} (κ_{A}^{(C)}) ϕ_{m a x}]}})] . \end{aligned}

The comparison between model predictions (Equation S70) and experimental results (Basan et al., 2015) is shown in Figure 3B, which shows quantitative agreement. Meanwhile, the growth rate can also be perturbed by changing $κ_{A}$ and $w$ simultaneously. The relations among the energy fluxes, growth rate and $w$ follow Equation S70 (here $w$ is a variable). In a 3D representation, these relations form a surface. As shown in Figure 3A, the model predictions (Equation S70) agree quantitatively with the experimental results (Basan et al., 2015).

4.3 Translation inhibition

In E. coli, the translation rate can be modified by adding different concentrations of translation inhibitors, e.g., chloramphenicol (Cm). The net effect of this perturbation is represented as:

κ_{t} \overset{T r a n s l a t i o n i n h i b i t i o n}{\to} κ_{t} / (ι + 1),

where $ι$ stands for the inhibition coefficient with $ι > 0$ , and ${(1 + ι)}^{- 1}$ represents the translation efficiency. Thus, Equation S32 changes to:

ψ (κ_{A}, ι) = \frac{ι + 1}{κ_{t}} + \frac{1 + η_{a 1} + η_{c}}{2 κ_{A}} + \frac{η_{a 2} + η_{b} + 2 η_{c} + η_{d}}{2 κ_{1}} + \frac{η_{b} + η_{c}}{κ_{2}} + \frac{η_{c}}{κ_{3}} + \frac{η_{c} + η_{d}}{κ_{5}} + \sum_{i}^{a 1, a 2, b, c, d} \frac{η_{i}}{κ_{i}}

First, we consider the case where maintenance energy is neglected, i.e., $w_{0} = 0$ . In this case, the growth rate takes the following form:

λ (κ_{A}, ι) = {\begin{aligned} \frac{ϕ_{m a x}}{φ / ε_{r} (κ_{A}) + ψ (κ_{A}, ι)} ε_{r} (κ_{A}) > ε_{f} (κ_{A}), \\ \frac{ϕ_{m a x}}{φ / ε_{f} (κ_{A}) + ψ (κ_{A}, ι)} ε_{r} (κ_{A}) < ε_{f} (κ_{A}), \end{aligned}

where $λ (κ_{A}, 0)$ and $ψ (κ_{A}, 0)$ represent the terms unaffected by translation inhibition. Thus, $λ_{C} (ι) \equiv λ (κ_{A}^{(C)}, ι)$ and $λ_{m a x} (ι) \equiv λ (κ_{A}^{m a x}, ι)$ become functions of $ι$ :

{\begin{aligned} λ_{C} (ι) = λ_{C} (0) \frac{φ / ε_{r / f} (κ_{A}^{(C)}) + ψ (κ_{A}^{(C)}, 0)}{φ / ε_{r / f} (κ_{A}^{(C)}) + ψ (κ_{A}^{(C)}, ι)}, \\ λ_{m a x} (ι) = λ_{m a x} (0) \frac{φ / ε_{f} (κ_{A}^{m a x}) + ψ (κ_{A}^{m a x}, 0)}{φ / ε_{f} (κ_{A}^{m a x}) + ψ (κ_{A}^{m a x}, ι)} . \end{aligned}

In the homogeneous case, $J_{f}^{(E)}$ and $J_{r}^{(E)}$ follow:

{\begin{cases} J_{f}^{(E)} (κ_{A}, ι) = φ \cdot λ (κ_{A}, ι) \cdot θ (λ (κ_{A}, ι) - λ_{C} (ι)), \\ J_{r}^{(E)} (κ_{A}, ι) = φ \cdot λ (κ_{A}, ι) \cdot [1 - θ (λ (κ_{A}, ι) - λ_{C} (ι))] . \end{cases}

To compare with experiments, we assume that extrinsic noise exists in $k_{i}^{cat}$ and $κ_{i}$ as specified in Appendix 3.3. Combining Equations S45 and S74, $λ_{C} (ι)$ can be approximated by a Gaussian distribution:

λ_{C} (ι) \sim N (μ_{λ_{C}} (ι), σ_{λ_{C}} {(ι)}^{2}),

where $μ_{λ_{C}} (ι)$ and $σ_{λ_{C}} (ι)$ represent the mean and standard deviation of $λ_{C} (ι)$ , with

{\begin{aligned} μ_{λ_{C}} (ι) = μ_{λ_{C}} (0) \frac{φ / ε_{r / f} (κ_{A}^{(C)}) + ψ (κ_{A}^{(C)}, 0)}{φ / ε_{r / f} (κ_{A}^{(C)}) + ψ (κ_{A}^{(C)}, ι)}, \\ σ_{λ_{C}} (ι) \approx σ_{λ_{C}} (0) \frac{φ / ε_{r / f} (κ_{A}^{(C)}) + ψ (κ_{A}^{(C)}, 0)}{φ / ε_{r / f} (κ_{A}^{(C)}) + ψ (κ_{A}^{(C)}, ι)} . \end{aligned}

Here, $μ_{λ_{C}} (0)$ , $σ_{λ_{C}} (0)$ , $ψ (κ_{A}^{(C)}, 0)$ , $λ_{C} (0)$ and $λ_{m a x} (0)$ stand for the terms unaffected by translation inhibition. Essentially, there are distributions of values for $ε_{r / f} (κ_{A}^{(C)})$ , $ψ (κ_{A}^{(C)}, 0)$ and $ψ (κ_{A}^{(C)}, ι)$ . For approximation, we use the deterministic values of these terms in Equation S77, and then the CV of $λ_{C} (ι)$ can be approximated by $λ_{C} (0)$ . Combining Equations S46, S75 and S76, we have:

{\begin{aligned} J_{f}^{(E)} (λ (κ_{A}, ι), ι) = \frac{1}{2} φ \cdot λ (κ_{A}, ι) \cdot [erf (\frac{λ (κ_{A}, ι) - μ_{λ_{C}} (ι)}{\sqrt{2} σ_{λ_{C}} (ι)}) + 1], \\ J_{r}^{(E)} (λ (κ_{A}, ι), ι) = \frac{1}{2} φ \cdot λ (κ_{A}, ι) \cdot [1 - erf (\frac{λ (κ_{A}, ι) - μ_{λ_{C}} (ι)}{\sqrt{2} σ_{λ_{C}} (ι)})] . \end{aligned}

In the experiments, the inhibition coefficient $ι$ is controllable by adjusting the concentration of the translation inhibitor. For a given value of $ι$ , $λ (κ_{A}, ι)$ changes monotonically with $κ_{A}$ . Combining Equations S30 and S78, we have (here $ι$ is a parameter):

{\begin{aligned} J_{f}^{(N)} (λ, ι) = \frac{φ \cdot λ}{β_{f}^{(A)}} \cdot [erf (\frac{λ - μ_{λ_{C}} (ι)}{\sqrt{2} σ_{λ_{C}} (ι)}) + 1], \\ J_{r}^{(N)} (λ, ι) = \frac{φ \cdot λ}{β_{r}^{(A)}} \cdot [1 - erf (\frac{λ - μ_{λ_{C}} (ι)}{\sqrt{2} σ_{λ_{C}} (ι)})], \end{aligned}

where $μ_{λ_{C}} (ι)$ and $σ_{λ_{C}} (ι)$ follow Equation S77. The growth rate can also be perturbed by altering both $κ_{A}$ and $ι$ simultaneously. In this case, the relations among the energy fluxes, growth rate and $ι$ still follow Equation S79 (here $ι$ is a variable). The comparison between Equation S79 and experimental data (Basan et al., 2015) is shown in Appendix 1—figure 2D (3-D) and E(2-D). Overall, there is good consistency; however, there remains a noticeable discrepancy when $ι$ is large (i.e. at high concentration of the translation inhibitor). This led us to consider the maintenance energy through the coefficient $w_{0}$ , which is small but may account for this discrepancy. Then, $λ (κ_{A}, ι)$ changes into:

λ (κ_{A}, ι) = {\begin{aligned} \frac{ϕ_{m a x} - w_{0} / ε_{r} (κ_{A})}{φ / ε_{r} (κ_{A}) + ψ (κ_{A}, ι)} ε_{r} (κ_{A}) > ε_{f} (κ_{A}), \\ \frac{ϕ_{m a x} - w_{0} / ε_{f} (κ_{A})}{φ / ε_{f} (κ_{A}) + ψ (κ_{A}, ι)} ε_{r} (κ_{A}) < ε_{f} (κ_{A}), \end{aligned}

while $λ_{C} (ι) \equiv λ (κ_{A}^{(C)}, ι)$ and $λ_{m a x} (ι) \equiv λ (κ_{A}^{m a x}, ι)$ still follow Equation S74, though the forms of $λ_{C} (0)$ and $λ_{m a x} (0)$ change to:

{\begin{aligned} λ_{C} (0) = \frac{ϕ_{m a x} - w_{0} / ε_{r / f} (κ_{A}^{(C)})}{φ / ε_{r / f} (κ_{A}^{(C)}) + ψ (κ_{A}^{(C)}, 0)}, \\ λ_{m a x} (0) = \frac{ϕ_{m a x} - w_{0} / ε_{f} (κ_{A}^{m a x})}{φ / ε_{f} (κ_{A}^{m a x}) + ψ (κ_{A}^{m a x}, 0)} . \end{aligned}

In the homogeneous case, $J_{f}^{(E)}$ and $J_{r}^{(E)}$ follow:

{\begin{cases} J_{f}^{(E)} (κ_{A}, ι) = [φ \cdot λ (κ_{A}, ι) + w_{0}] \cdot θ (λ (κ_{A}, ι) - λ_{C} (ι)), \\ J_{r}^{(E)} (κ_{A}, ι) = [φ \cdot λ (κ_{A}, ι) + w_{0}] \cdot [1 - θ (λ (κ_{A}, ι) - λ_{C} (ι))] . \end{cases}

To compare with experiments, we assume that the extrinsic noise follows the specification in Appendix 3.3. Combining Equations S45, S74 and S81, $λ_{C} (ι)$ approximately follows a Gaussian distribution:

λ_{C} (ι) \sim N (μ_{λ_{C}} (ι), σ_{λ_{C}} {(ι)}^{2})

Here $μ_{λ_{C}} (ι)$ and $σ_{λ_{C}} (ι)$ still follow Equation S77, while $μ_{λ_{C}} (0)$ and $σ_{λ_{C}} (0)$ change accordingly with $λ_{C} (0)$ (see Equation S81). For approximation, we use the deterministic values of the relevant terms in Equation S77, and then the CV of $λ_{C} (ι)$ is roughly the same as $λ_{C} (0)$ . Combining Equations S46, S82 and S83, we have:

{\begin{aligned} J_{f}^{(E)} (λ (κ_{A}, ι), ι) = \frac{1}{2} (φ \cdot λ (κ_{A}, ι) + w_{0}) \cdot [erf (\frac{λ (κ_{A}, ι) - μ_{λ_{C}} (ι)}{\sqrt{2} σ_{λ_{C}} (ι)}) + 1], \\ J_{r}^{(E)} (λ (κ_{A}, ι), ι) = \frac{1}{2} (φ \cdot λ (κ_{A}, ι) + w_{0}) \cdot [1 - erf (\frac{λ (κ_{A}, ι) - μ_{λ_{C}} (ι)}{\sqrt{2} σ_{λ_{C}} (ι)})] . \end{aligned}

Thus, for a given $ι$ , $λ (κ_{A}, ι)$ changes monotonically with $κ_{A}$ . Combining Equations S30 and S84, we have (here $ι$ is a parameter):

{\begin{aligned} J_{f}^{(N)} (λ, ι) = \frac{φ \cdot λ + w_{0}}{β_{f}^{(A)}} \cdot [erf (\frac{λ - μ_{λ_{C}} (ι)}{\sqrt{2} σ_{λ_{C}} (ι)}) + 1] . \\ J_{r}^{(N)} (λ, ι) = \frac{φ \cdot λ + w_{0}}{β_{r}^{(A)}} \cdot [1 - erf (\frac{λ - μ_{λ_{C}} (ι)}{\sqrt{2} σ_{λ_{C}} (ι)})] . \end{aligned}

The growth rate and fluxes can also be perturbed by altering both $κ_{A}$ and $ι$ simultaneously. The relations among the energy fluxes, growth rate, and $ι$ would still follow Equation S85, except that $ι$ is now regarded as a variable. Assuming a small amount of maintenance energy by assigning $w_{0} = 2.5 (h^{- 1})$ , we find that the experimental results (Basan et al., 2015) agree quantitatively well with the model predictions (Figure 3C and D).

Appendix 5

Overflow metabolism in substrates other than Group A carbon sources

Due to the topology of the metabolic network, for cells using Group A carbon sources, the behavior of overflow metabolism follows Equation 5 (or Equation S47) upon $κ_{A}$ perturbation (i.e. varying the type or concentration of a Group A carbon source). This has been demonstrated clearly in the above analysis and agrees quantitatively with experiments. However, further analysis is required for cells using substrates other than Group A sources due to the topological differences in carbon utilization (Wang et al., 2019). In principle, substrates entering from glycolysis or the points before acetyl-CoA are potentially involved in overflow metabolism, while those joining from the TCA cycle are not relevant to this behavior. Still, mixed carbon sources are likely to induce a different profile of overflow metabolism, as long as there is a carbon source derived from glycolysis.

5.1 Pyruvate

The coarse-grained model for pyruvate utilization is shown in Figure 3E. Here, nodes M₁, M₂, M₃, M₄, M₅ follow the descriptions in Appendix 3.1. Each biochemical reaction follows Equation S5 with $b_{i} = 1$ except that 2M₂→M₁ and M₃+M₅→M₄. By applying flux balance to the stoichiometric fluxes, combining with Equation S8, we have:

{\begin{cases} Φ_{p y} \cdot ξ_{p y} = Φ_{7} \cdot ξ_{7} + Φ_{8} \cdot ξ_{8}, \\ Φ_{7} \cdot ξ_{7} = 2 Φ_{9} \cdot ξ_{9} + Φ_{5} \cdot ξ_{5} + Φ_{a 2} \cdot ξ_{a 2}, \\ Φ_{9} \cdot ξ_{9} = Φ_{a 1} \cdot ξ_{a 1}, \\ Φ_{8} \cdot ξ_{8} = Φ_{3} \cdot ξ_{3} + Φ_{6} \cdot ξ_{6} + Φ_{b} \cdot ξ_{b}, \\ Φ_{5} \cdot ξ_{5} + Φ_{4} \cdot ξ_{4} = Φ_{3} \cdot ξ_{3} + Φ_{d} \cdot ξ_{d}, \\ Φ_{3} \cdot ξ_{3} = Φ_{4} \cdot ξ_{4} + Φ_{c} \cdot ξ_{c} . \end{cases}

For energy biogenesis, we convert all the energy currencies into ATPs, and then,

β_{8} \cdot Φ_{8} \cdot ξ_{8} + β_{3} \cdot Φ_{3} \cdot ξ_{3} + β_{4} \cdot Φ_{4} \cdot ξ_{4} + β_{6} \cdot Φ_{6} \cdot ξ_{6} + β_{a 1} \cdot Φ_{a 1} \cdot ξ_{a 1} - β_{7} \cdot Φ_{7} \cdot ξ_{7} - β_{9} \cdot Φ_{9} \cdot ξ_{9} = J_{E}

where $β_{7} = 1$ , $β_{8} = 2$ , $β_{3} = 2$ , $β_{4} = 6$ , $β_{6} = 1$ , $β_{9} = 6$ , $β_{a 1} = 4$ for E. coli (Neidhardt et al., 1990; Sauer et al., 2004), and $J_{E}$ follows Equation S25. By applying the substitutions specified in Equations S9, S12, S14-S18, combined with Equations S4, S10, S22, S23, S25, S86-S87, and the constraint of proteome resource allocation, we have:

{\begin{cases} ϕ_{p y} \cdot κ_{p y} = ϕ_{7} \cdot κ_{7} + ϕ_{8} \cdot κ_{8}, \\ ϕ_{7} \cdot κ_{7} = 2 ϕ_{9} \cdot κ_{9} + ϕ_{5} \cdot κ_{5} + ϕ_{a 2} \cdot κ_{a 2}, \\ ϕ_{9} \cdot κ_{9} = ϕ_{a 1} \cdot κ_{a 1} \\ ϕ_{8} \cdot κ_{8} = ϕ_{3} \cdot κ_{3} + ϕ_{6} \cdot κ_{6} + ϕ_{b} \cdot κ_{b} \\ ϕ_{3} \cdot κ_{3} = ϕ_{4} \cdot κ_{4} + ϕ_{c} \cdot κ_{c} \\ ϕ_{5} \cdot κ_{5} + ϕ_{4} \cdot κ_{4} = ϕ_{3} \cdot κ_{3} + ϕ_{d} \cdot κ_{d} \\ ϕ_{a 1} \cdot κ_{a 1} = η_{a 1} \cdot λ, ϕ_{a 2} \cdot κ_{a 2} = η_{a 2} \cdot λ, ϕ_{b} \cdot κ_{b} = η_{b} \cdot λ, ϕ_{c} \cdot κ_{c} = η_{c} \cdot λ, ϕ_{d} \cdot κ_{d} = η_{d} \cdot λ, \\ β_{8} \cdot ϕ_{8} \cdot κ_{8} + β_{3} \cdot ϕ_{3} \cdot κ_{3} + β_{4} \cdot ϕ_{4} \cdot κ_{4} + β_{6} \cdot ϕ_{6} \cdot κ_{6} + β_{a 1} \cdot ϕ_{a 1} \cdot κ_{a 1} \\ - β_{7} \cdot ϕ_{7} \cdot κ_{7} - β_{9} \cdot ϕ_{9} \cdot κ_{9} = J_{E}^{(N)}, \\ J_{E}^{(N)} = η_{E} \cdot λ, λ = ϕ_{R} \cdot κ_{t}, J_{r}^{(N)} = ϕ_{4} \cdot κ_{4}, J_{f}^{(N)} = ϕ_{6} \cdot κ_{6}, \\ ϕ_{R} + ϕ_{p y} + ϕ_{3} + ϕ_{4} + ϕ_{5} + ϕ_{6} + ϕ_{7} + ϕ_{8} + ϕ_{9} + ϕ_{a 1} + ϕ_{a 2} + ϕ_{b} + ϕ_{c} + ϕ_{d} = ϕ_{max}, \end{cases}

where $η_{E} = r_{E} \cdot {[\sum_{i} r_{i} / N_{{EP}_{i}}^{carbon}]}^{- 1}$ . $κ_{i}$ is approximately a constant which follows Equation S20 for each of the intermediate node. The substrate quality of $κ_{py}$ varies with the external concentration of pyruvate ([py]),

κ_{py} \equiv \frac{r_{protein}}{r_{carbon}} \cdot \frac{k_{py}^{cat}}{m_{E_{py}}} \cdot \frac{[py]}{[py] + K_{py}} \cdot m_{0} .

From Equation S88, all $ϕ_{i}$ can be expressed by $J_{r}^{(N)}$ , $J_{f}^{(N)}$ , and $λ$ :

{\begin{aligned} ϕ_{py} = [(2 η_{a 1} + η_{a 2} + η_{b} + 2 η_{c} + η_{d}) λ + J_{r}^{(N)} + J_{f}^{(N)}] / κ_{py}, \\ ϕ_{7} = (2 η_{a 1} + η_{a 2} + η_{c} + η_{d}) λ / κ_{7}, ϕ_{9} = η_{a 1} \cdot λ / κ_{9} \\ ϕ_{8} = [J_{r}^{(N)} + J_{f}^{(N)} + (η_{b} + η_{c}) λ] / κ_{8} \\ ϕ_{3} = (J_{r}^{(N)} + η_{c} \cdot λ) / κ_{3}, ϕ_{4} = J_{r}^{(N)} / κ_{4}, \\ ϕ_{5} = (η_{c} + η_{d}) λ / κ_{5}, ϕ_{6} = J_{f}^{(N)} / κ_{6}, \\ ϕ_{i} = η_{i} \cdot λ / κ_{i} (i = a 1, a 2, b, c, d) . \end{aligned}

By substituting Equation S90 into Equation S88, we have:

{\begin{aligned} J_{r}^{(E,py)} + J_{f}^{(E,py)} = φ_{py} \cdot λ, \\ \frac{J_{r}^{(E,py)}}{ε_{r}^{(py)}} + \frac{J_{f}^{(E,py)}}{ε_{f}^{(py)}} = ϕ_{m a x} - ψ_{py} \cdot λ . \end{aligned}

Here, $J_{r}^{(E,py)}$ and $J_{f}^{(E,py)}$ stand for the normalized energy fluxes of respiration and fermentation, respectively, with

{\begin{cases} J_{r}^{(E,py)} = β_{r}^{(py)} \cdot J_{r}^{(N)}, \\ J_{f}^{(E,py)} = β_{f}^{(py)} \cdot J_{f}^{(N)} . \end{cases}

where $β_{r}^{(py)} = β_{3} + β_{4} + β_{8}$ and $β_{f}^{(py)} = β_{6} + β_{8}$ , with $β_{r}^{(py)} = 10$ and $β_{f}^{(py)} = 3$ for E. coli. The coefficients $ε_{r}^{(py)}$ and $ε_{f}^{(py)}$ represent the proteome efficiencies for energy biogenesis using pyruvate in respiration and fermentation pathways, respectively, with

{\begin{aligned} ε_{r}^{(py)} & = \frac{β_{r}^{(py)}}{1 / κ_{py} + 1 / κ_{8} + 1 / κ_{3} + 1 / κ_{4}}, \\ ε_{f}^{(py)} & = \frac{β_{f}^{(py)}}{1 / κ_{py} + 1 / κ_{8} + 1 / κ_{6}} . \end{aligned}

$ψ_{py}^{- 1}$ is the proteome efficiency for biomass generation using pyruvate in the biomass synthesis pathway, with

ψ_{py} = \frac{1}{κ_{t}} + \frac{1 + η_{a 1} + η_{c}}{κ_{py}} + \frac{1 - η_{b} + η_{a 1}}{κ_{7}} + \frac{η_{b} + η_{c}}{κ_{8}} + \frac{η_{a 1}}{κ_{9}} + \frac{η_{c}}{κ_{3}} + \frac{η_{c} + η_{d}}{κ_{5}} + \sum_{i}^{a 1, a 2, b, c, d} \frac{η_{i}}{κ_{i}}

$φ_{py}$ is an energy demand coefficient (a constant), with

φ_{py} \equiv η_{E} + β_{7} \cdot (1 - η_{b} + η_{a 1}) + β_{9} \cdot η_{a 1} - β_{8} \cdot (η_{c} + η_{b}) - β_{3} \cdot η_{c} - β_{a 1} \cdot η_{a 1,}

Evidently, Equation S91 is identical in form with Equation S29. The growth rate changes into $κ_{py}$ dependent:

λ (κ_{py}) = {\begin{aligned} \frac{ϕ_{m a x}}{φ_{py} / ε_{r}^{(py)} (κ_{py}) + ψ_{py} (κ_{py})} ε_{r}^{(py)} (κ_{py}) > ε_{f}^{(py)} (κ_{py}), \\ \frac{ϕ_{m a x}}{φ_{py} / ε_{f}^{(py)} (κ_{py}) + ψ_{py} (κ_{py})} ε_{r}^{(py)} (κ_{py}) < ε_{f}^{(py)} (κ_{py}) . \end{aligned}

When $κ_{py}$ is very small, combined with Equation S93, then,

{\begin{cases} ε_{r}^{(py)} (κ_{py} \to 0) \approx β_{r}^{(py)} \cdot κ_{py}, \\ ε_{f}^{(py)} (κ_{py} \to 0) \approx β_{f}^{(py)} \cdot κ_{py} . \end{cases}

Obviously, $β_{r}^{(py)} ≫ β_{f}^{(py)}$ , and hence

ε_{r}^{(py)} (κ_{py} \to 0) > ε_{f}^{(py)} (κ_{py} \to 0) .

As long as

\frac{β_{r}^{(py)} - β_{f}^{(py)}}{κ_{py}^{(ST)}} < β_{f}^{(py)} (\frac{1}{κ_{8}} + \frac{1}{κ_{3}} + \frac{1}{κ_{4}}) - β_{r}^{(py)} \cdot (\frac{1}{κ_{8}} + \frac{1}{κ_{6}}),

where the superscript ‘(ST)’ stands for the saturated concentration, then,

ε_{r}^{(py)} (κ_{py}^{(ST)}) < ε_{f}^{(py)} (κ_{py}^{(ST)}),

and there exists a critical value of $κ_{py}$ , denoted as $κ_{py}^{(C)}$ , with

{\begin{aligned} ε_{r}^{(py)} (κ_{py}^{(C)}) = ε_{f}^{(py)} (κ_{py}^{(C)}) = \frac{β_{r}^{(py)} - β_{f}^{(py)}}{1 / κ_{3} + 1 / κ_{4} - 1 / κ_{6}} = \frac{β_{3} + β_{4} - β_{6}}{1 / κ_{3} + 1 / κ_{4} - 1 / κ_{6}}, \\ λ_{C}^{(py)} \equiv λ (κ_{py}^{(C)}) = \frac{ϕ_{m a x}}{φ_{py} / ε_{r / f}^{(py)} (κ_{py}^{(C)}) + ψ_{py} (κ_{py}^{(C)})} . \end{aligned}

Here, $λ_{C}^{(py)}$ is the growth rate at the transition point, and $ε_{r / f}^{(py)}$ stands for either $ε_{r}^{(py)}$ or $ε_{f}^{(py)}$ . In Appendix 1—figure 2H, we show the dependencies of $ε_{r}^{(py)} (κ_{py})$ , $ε_{f}^{(py)} (κ_{py})$ and $λ (κ_{py})$ on $κ_{py}$ in a 3-dimensional form. In the homogeneous case, $J_{f}^{(E,py)}$ and $J_{r}^{(E,py)}$ follow:

{\begin{aligned} J_{f}^{(E,py)} = φ_{py} \cdot λ \cdot θ (λ - λ_{C}^{(py)}), \\ J_{r}^{(E,py)} = φ_{py} \cdot λ \cdot [1 - θ (λ - λ_{C}^{(py)})] . \end{aligned}

Defining $λ_{m a x}^{(py)} = λ (κ_{py}^{(ST)})$ , and then, $[0, λ_{m a x}^{(py)}]$ is the relevant range of the x axis. To compare with experiments, we assume the same extent of extrinsic noise in $k_{i}^{cat}$ as specified in Appendix 3.3. Then, $λ_{C}^{(py)}$ approximately follows a Gaussian distribution:

λ_{C}^{(py)} \sim N (μ_{λ_{C}^{(py)}}, σ_{λ_{C}^{(py)}}^{2}),

where $μ_{λ_{C}^{(py)}}$ and $σ_{λ_{C}^{(py)}}$ stand for the mean and standard deviation of $λ_{C}^{(py)}$ . Then, the relations between the normalized energy fluxes and growth rate are:

{\begin{aligned} J_{f}^{(E,py)} (λ) = \frac{1}{2} φ_{py} \cdot λ \cdot [erf (\frac{λ - μ_{λ_{C}^{(py)}}}{\sqrt{2} σ_{λ_{C}^{(py)}}}) + 1], \\ J_{r}^{(E,py)} (λ) = \frac{1}{2} φ_{py} \cdot λ \cdot [1 - erf (\frac{λ - μ_{λ_{C}^{(py)}}}{\sqrt{2} σ_{λ_{C}^{(py)}}})] . \end{aligned}

Combined with Equation S92, we have:

{\begin{aligned} J_{f}^{(N)} (λ) = \frac{φ_{py}}{2 β_{f}^{(py)}} \cdot λ \cdot [erf (\frac{λ - μ_{λ_{C}^{(py)}}}{\sqrt{2} σ_{λ_{C}^{(py)}}}) + 1], \\ J_{r}^{(N)} (λ) = \frac{φ_{py}}{2 β_{r}^{(py)}} \cdot λ \cdot [1 - erf (\frac{λ - μ_{λ_{C}^{(py)}}}{\sqrt{2} σ_{λ_{C}^{(py)}}})] . \end{aligned}

In Figure 3F, we show that the model predictions (Equation S105) align quantitatively with the experimental results (Holms, 1996).

5.2 Mixture of a Group A carbon source with extracellular amino acids

In the case of a Group A carbon source mixed with amino acids, the coarse-grained model is shown in Appendix 1—figure 2A. This model can be used to analyze mixtures with one or multiple types of extracellular amino acids. Here, Equations S21, S22, S24 and S25 still apply, but Equation S23 changes to (the case of $i = a 1$ remains the same as Equation S23):

Φ_{i} \cdot ξ_{i} \cdot N_{E P_{i}}^{c a r b o n} + Φ_{i}^{'} \cdot ξ_{i}^{'} \cdot N_{P_{i}}^{c a r b o n} = r_{i} \cdot J_{B M} (i = a 2, b, c, d) .

Here, $N_{P_{i}}^{carbon}$ represents the number of carbon atoms in a molecule of Pool i. For simplicity, we assume:

N_{P_{i}}^{carbon} \approx N_{{EP}_{i}}^{carbon} .

In the case where all 21 types of amino acids are present and each is at saturated concentration (denoted as ‘21AA’), we have:

{\begin{cases} ϕ_{A} \cdot κ_{A} = ϕ_{1} \cdot κ_{1} + ϕ_{a 1} \cdot κ_{a 1}, \\ 2 ϕ_{1} \cdot κ_{1} = ϕ_{2} \cdot κ_{2} + ϕ_{5} \cdot κ_{5} + ϕ_{a 2} \cdot κ_{a 2}, \\ ϕ_{2} \cdot κ_{2} = ϕ_{3} \cdot κ_{3} + ϕ_{6} \cdot κ_{6} + ϕ_{b} \cdot κ_{b}, \\ ϕ_{5} \cdot κ_{5} + ϕ_{4} \cdot κ_{4} = ϕ_{3} \cdot κ_{3} + ϕ_{d} \cdot κ_{d}, \\ ϕ_{3} \cdot κ_{3} = ϕ_{4} \cdot κ_{4} + ϕ_{c} \cdot κ_{c}, \\ ϕ_{a 1} \cdot κ_{a 1} = η_{a 1} \cdot λ, ϕ_{a 2} \cdot κ_{a 2} + ϕ_{a 2}^{(21 A A)} \cdot κ_{a 2}^{(21 A A)} = η_{a 2} \cdot λ, ϕ_{b} \cdot κ_{b} + ϕ_{b}^{(21 A A)} \cdot κ_{b}^{(21 A A)} = η_{b} \cdot λ, \\ ϕ_{c} \cdot κ_{c} + ϕ_{c}^{(21 A A)} \cdot κ_{c}^{(21 A A)} = η_{c} \cdot λ, ϕ_{d} \cdot κ_{d} + ϕ_{d}^{(21 A A)} \cdot κ_{d}^{(21 A A)} = η_{d} \cdot λ, \\ β_{1} \cdot ϕ_{1} \cdot κ_{1} + β_{2} \cdot ϕ_{2} \cdot κ_{2} + β_{3} \cdot ϕ_{3} \cdot κ_{3} + β_{4} \cdot ϕ_{4} \cdot κ_{4} + β_{6} \cdot ϕ_{6} \cdot κ_{6} + β_{a 1} \cdot ϕ_{a 1} \cdot κ_{a 1} = J_{E}^{(N)}, \\ J_{E}^{(N)} = η_{E} \cdot λ, λ = ϕ_{R} \cdot κ_{t}, J_{r}^{(N)} = ϕ_{4} \cdot κ_{4}, J_{f}^{(N)} = ϕ_{6} \cdot κ_{6}, \\ ϕ_{R} + ϕ_{A} + \sum_{i}^{6} ϕ_{i} + \sum_{j}^{a 1, a 2, b, c, d} ϕ_{j} + ϕ_{a 2}^{(21 A A)} + ϕ_{b}^{(21 A A)} + ϕ_{c}^{(21 A A)} + ϕ_{d}^{(21 A A)} = ϕ_{max}, \end{cases}

where $ϕ_{i}$ and $κ_{i}$ are defined following Equations S9 and S12. Since the cell growth rate significantly increases with the mixture of amino acids, we deduce that Pools a2-d are supplied by amino acids in growth optimization, with

ϕ_{i} = 0 (i = a 2, b, c, d)

Amino acids should be more efficient in the supply of biomass synthesis than the Group A carbon source for Pools a2-d, i.e.,

{\begin{cases} 1 / κ_{a 2}^{(21AA)} < 1 / κ_{a 2} + 1 / (2 κ_{1}) + 1 / (2 κ_{A}), \\ 1 / κ_{b}^{(21AA)} < 1 / κ_{b} + 1 / κ_{2} + 1 / (2 κ_{1}) + 1 / (2 κ_{A}), \\ 1 / κ_{c}^{(21AA)} < 1 / κ_{c} + 1 / κ_{5} + 1 / κ_{3} + 1 / κ_{2} + 1 / κ_{1} + 1 / κ_{A}, \\ 1 / κ_{d}^{(21AA)} < 1 / κ_{d} + 1 / κ_{5} + 1 / (2 κ_{1}) + 1 / (2 κ_{A}) . \end{cases}

In practice, the requirement for proteome efficiency when using amino acids is even higher, since the biomass synthesis pathway is accompanied by energy biogenesis for Group A carbon sources, but not for amino acids. Combining Equations S108 and S109, we have:

{\begin{aligned} J_{r}^{(E)} + J_{f}^{(E)} = φ_{21AA} \cdot λ, \\ \frac{J_{r}^{(E)}}{ε_{r}} + \frac{J_{f}^{(E)}}{ε_{f}} = ϕ_{m a x} - ψ_{21AA} \cdot λ, \end{aligned}

where $J_{r}^{(E)}$ , $J_{f}^{(E)}$ follow Equation S30, while $ε_{r}$ and $ε_{f}$ follow Equation S31. $ψ_{21AA}^{- 1}$ is the proteome efficiency for biomass generation in the biomass synthesis pathway under this nutrient condition, with

ψ_{21AA} = \frac{1}{κ_{t}} + \frac{η_{a 1}}{κ_{A}} + \frac{η_{a 1}}{κ_{a 1}} + \frac{η_{a 2}}{κ_{a 2}^{(21AA)}} + \frac{η_{b}}{κ_{b}^{(21AA)}} + \frac{η_{c}}{κ_{c}^{(21AA)}} + \frac{η_{d}}{κ_{d}^{(21AA)}}

$φ_{21AA}$ is an energy demand coefficient, with

φ_{21AA} \equiv η_{E} - β_{a 1} \cdot η_{a 1}

Combining Equations S111 and S31, the formula for the growth rate is:

λ (κ_{A}) = {\begin{aligned} λ_{r}^{(21AA)} = \frac{ϕ_{m a x}}{φ_{21AA} / ε_{r} (κ_{A}) + ψ_{21AA} (κ_{A})} ε_{r} (κ_{A}) > ε_{f} (κ_{A}), \\ λ_{f}^{(21AA)} = \frac{ϕ_{m a x}}{φ_{21AA} / ε_{f} (κ_{A}) + ψ_{21AA} (κ_{A})} ε_{r} (κ_{A}) < ε_{f} (κ_{A}) . \end{aligned}

In fact, Equations S37-S42 still apply. $ε_{r / f} (κ_{A}^{(C)})$ satisfies Equation S43, while $λ_{C}^{(21AA)} \equiv λ (κ_{A}^{(C)})$ and $λ_{m a x}^{(21AA)} \equiv λ (κ_{A}^{m a x})$ are:

{\begin{aligned} λ_{C}^{(21AA)} = \frac{ϕ_{max}}{\frac{φ_{21AA}}{ε_{r / f} (κ_{A}^{(C)})} + ψ_{21AA} (κ_{A}^{(C)})}, \\ λ_{max}^{(21AA)} = \frac{ϕ_{max}}{\frac{φ_{21AA}}{ε_{f} (κ_{A}^{max})} + ψ_{21AA} (κ_{A}^{max})} . \end{aligned}

When extrinsic noise is taken into account, $λ_{C}^{(21AA)}$ approximately follows a Gaussian distribution:

λ_{C}^{(21AA)} \sim N (μ_{λ_{C}^{(21AA)}}, σ_{λ_{C}^{(21AA)}}^{2}),

and the normalized fluxes $J_{r}^{(N)}$ , $J_{f}^{(N)}$ change to:

{\begin{aligned} J_{f}^{(N)} (λ) = \frac{φ_{21AA}}{β_{f}^{(A)}} \cdot λ \cdot [erf (\frac{λ - μ_{λ_{C}^{(21AA)}}}{\sqrt{2} σ_{λ_{C}^{(21AA)}}}) + 1], \\ J_{r}^{(N)} (λ) = \frac{φ_{21AA}}{β_{r}^{(A)}} \cdot λ \cdot [1 - erf (\frac{λ - μ_{λ_{C}^{(21AA)}}}{\sqrt{2} σ_{λ_{C}^{(21AA)}}})] . \end{aligned}

The above analysis can be extended to cases where a Group A carbon source is mixed with arbitrary combinations of amino acids. Equations S111, S114-S117 would remain in a similar form, while Equations S112-S113 would change depending on the combinations of amino acid. In Appendix 1—figure 2B and C, we compare model predictions (see also Appendix 8.2 and Equation S157) with experimental data (Basan et al., 2015; Wallden et al., 2016) from mixtures of 21 or 7 types of amino acids along with a Group A carbon source, demonstrating quantitative agreement. Additionally, the increase in the critical threshold of growth rate for the growth rate-dependent fermentation flux in mixtures with extracellular amino acids (i.e. $λ_{C}^{(21AA)}, λ_{C}^{(7AA)} > λ_{C}$ ; see Appendix 1—figure 2C) has also been observed in other experimental findings (Peebo et al., 2015).

Appendix 6

Enzyme allocation upon perturbations

6.1 Carbon limitation within Group A carbon sources

In Equation S28, we present the model predictions for the dependencies of enzyme proteomic mass fractions on growth rate and energy fluxes. To compare with experiments, we assume the same extent of extrinsic noise in $k_{i}^{cat}$ as specified in Appendix 3.3. Relative protein expression data for enzymes within glycolysis and the TCA cycle are available from existing studies and are comparable to the $ϕ_{1} - ϕ_{4}$ enzymes of our model (Figure 1B). Upon $κ_{A}$ perturbation, $κ_{A}$ is a variable while $w_{0}$ is fixed (see Appendix 2.5). Combining Equations S28 and S47 (with $w_{0} = 0$ ), we obtain:

{\begin{aligned} ϕ_{1} & = \frac{λ}{κ_{1}} {\frac{φ \cdot (β_{r}^{(A)} - β_{f}^{(A)})}{2 β_{r}^{(A)} \cdot β_{f}^{(A)}} \cdot [e r f (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}}) + 1] + \frac{φ}{β_{r}^{(A)}} + \frac{η_{a 2} + η_{b} + 2 η_{c} + η_{d}}{2}}, \\ ϕ_{2} & = \frac{λ}{κ_{2}} {\frac{φ \cdot (β_{r}^{(A)} - β_{f}^{(A)})}{β_{r}^{(A)} \cdot β_{f}^{(A)}} \cdot [e r f (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}}) + 1] + \frac{2 φ}{β_{r}^{(A)}} + η_{b} + η_{c}}, \\ ϕ_{3} & = \frac{λ}{κ_{3}} {\frac{φ}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}})] + η_{c}}, \\ ϕ_{4} & = \frac{λ}{κ_{4}} \cdot \frac{φ}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}})] . \end{aligned}

In Appendix 1—figure 3C and D, we show the comparisons between model predictions (Equation S118, $w_{0} = 0$ ) and experimental data (Hui et al., 2015), which are consistent overall. We then consider the influence of maintenance energy as specified in Appendix 4.2. Here, we continue to choose $w_{0} = 2.5 (h^{- 1})$ as previously adopted in Appendix 4.3. Thus, Equation S28 still holds. Combined with Equation S85 under the condition that $ι = 0$ , we have:

{\begin{cases} ϕ_{1} & = \frac{1}{2 \cdot κ_{1}} {\begin{cases} \frac{φ \cdot λ + w_{0}}{β_{f}^{(A)}} \cdot [e r f (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}}) + 1] + \frac{φ \cdot λ + w_{0}}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}})] \\ + (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) λ \end{cases}}, \\ ϕ_{2} & = \frac{1}{κ_{2}} {\frac{φ \cdot λ + w_{0}}{β_{f}^{(A)}} \cdot [e r f (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}}) + 1] + \frac{φ \cdot λ + w_{0}}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}})] + (η_{b} + η_{c}) λ}, \\ ϕ_{3} & = \frac{1}{κ_{3}} {\frac{φ \cdot λ + w_{0}}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}})] + η_{c} \cdot λ}, \\ ϕ_{4} & = \frac{1}{κ_{4}} \cdot \frac{φ \cdot λ + w_{0}}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}})] . \end{cases}

In Figure 4A–B, we show that the model predictions (Equation S119, $w_{0} = 2.5 (h^{- 1})$ ) generally agree with the experiments (Hui et al., 2015). However, there are different basal expressions of these enzymes, likely due to living demands other than cell proliferation, such as preparation for starvation (Mori et al., 2017) or changes in the type of the nutrient (Basan et al., 2020; Kussell and Leibler, 2005).

6.2 Overexpression of useless proteins

In the case of $ϕ_{Z}$ perturbation under each nutrient condition with fixed $κ_{A}$ (see Appendix 4.1), we consider the same extent of extrinsic noise in $k_{i}^{cat}$ as specified in Appendix 3.3. The relation between enzyme allocation and growth rate can be obtained by combining Equations S28 and S58 (with $w_{0} = 0$ ):

{\begin{aligned} ϕ_{1} & = \frac{λ}{2 \cdot κ_{1}} {\begin{cases} (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) + \frac{φ}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})] \\ + \frac{φ}{β_{f}^{(A)}} \cdot [e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)}) + 1] \end{cases}}, \\ ϕ_{2} & = \frac{λ}{κ_{2}} {\begin{cases} (η_{b} + η_{c}) + \frac{φ}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})] \\ + \frac{φ}{β_{f}^{(A)}} \cdot [e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)}) + 1] \end{cases}}, \\ ϕ_{3} & = \frac{λ}{κ_{3}} {\frac{φ}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})] + η_{c}}, \\ ϕ_{4} & = \frac{λ}{κ_{4}} {\frac{φ}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})]} . \end{aligned}

Here $λ (κ_{A}, 0)$ is the growth rate for $ϕ_{Z} = 0$ , and thus it is a parameter rather than a variable. The growth rate is defined as $λ (κ_{A}, ϕ_{Z})$ , which follows Equation S50. Thus, $ϕ_{i}$ is proportional to the growth rate $λ$ . In Appendix 1—figure 3E and F, we observe that the model predictions (Equation S120) generally agree with the experiments (Basan et al., 2015). Next, we consider the influence of maintenance energy with $w_{0} = 2.5 (h^{- 1})$ . Combining Equations S28, S58 and S85 (with $ι = 0$ ), we get:

{\begin{aligned} ϕ_{1} = \frac{w_{0}}{2 κ_{1}} {\frac{1}{β_{f}^{(A)}} \cdot [e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)}) + 1] + \frac{1}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})]} \\ + \frac{λ}{2 κ_{1}} {\begin{cases} \frac{φ}{β_{f}^{(A)}} \cdot [e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)}) + 1] + \frac{φ}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})] \\ + (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) \end{cases}}, \\ ϕ_{2} = \frac{w_{0}}{κ_{2}} {\frac{1}{β_{f}^{(A)}} \cdot [e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)}) + 1] + \frac{1}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})]} \\ + \frac{λ}{κ_{2}} {\begin{cases} \frac{φ}{β_{f}^{(A)}} \cdot [e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)}) + 1] + \frac{φ}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})] \\ + (η_{b} + η_{c}) \end{cases}}, \\ ϕ_{3} = \frac{λ}{κ_{3}} {\frac{φ}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})] + η_{c}} \\ + \frac{w_{0}}{κ_{3}} \cdot \frac{1}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})], \\ ϕ_{4} = \frac{λ}{κ_{4}} \cdot \frac{φ}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})] + \frac{w_{0}}{κ_{4}} \cdot \frac{1}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})] . \end{aligned}

Here, the growth rate is defined as $λ (κ_{A}, ϕ_{Z})$ , and $λ (κ_{A}, 0)$ is a parameter rather than a variable. Thus, $ϕ_{i}$ is a linear function of the growth rate $λ$ , with a positive slope and a positive y-intercept. In Figure 4C–D and Appendix 1—figure 3I-J, we show that the model predictions (Equation S121) agree quantitively with the experimental data (Basan et al., 2015).

6.3 Energy dissipation

In the case of energy dissipation under each nutrient condition, $w$ is perturbed while $κ_{A}$ is fixed. The relation between protein allocation and growth rate can be obtained by combining Equations S28 and S70. However, since $w$ is explicitly present in Equation S70, it is necessary to reduce this variable to obtain the growth rate dependence of enzyme allocation. From Equation S64, we have:

λ (κ_{A}, w) = λ (κ_{A}, 0) {1 - \frac{w}{ϕ_{max}} \cdot [\frac{1}{ε_{r} (κ_{A})} - θ (ε_{f} (κ_{A}) - ε_{r} (κ_{A})) \cdot (\frac{1}{ε_{r} (κ_{A})} - \frac{1}{ε_{f} (κ_{A})})]} .

Here, $λ (κ_{A}, 0) \equiv λ (κ_{A}, w = 0)$ (satisfying Equation S64) is a parameter rather than a variable. ‘ $θ$ ’ stands for the Heaviside step function. Thus, we have:

w (λ) = \frac{ϕ_{m a x} \cdot [1 - λ / λ (κ_{A}, 0)]}{[1 / ε_{r} (κ_{A}) - θ (ε_{f} (κ_{A}) - ε_{r} (κ_{A})) \cdot (1 / ε_{r} (κ_{A}) - 1 / ε_{f} (κ_{A}))]},

where the energy dissipation coefficient $w$ is regarded as a function of the growth rate.

Combining Equations S28, S70 and S123, we get:

{\begin{aligned} ϕ_{1} & = \frac{1}{2 κ_{1}} {\begin{cases} e r f (\frac{λ - μ_{λ_{C}} (0) [1 - \frac{w (λ)}{ε_{r / f} (κ_{A}^{(C)}) ϕ_{max}}]}{\sqrt{2} σ_{λ_{C}} (0) [1 - \frac{w (λ)}{ε_{r / f} (κ_{A}^{(C)}) ϕ_{max}}]}) \cdot [\frac{φ \cdot λ + w (λ)}{β_{f}^{(A)}} - \frac{φ \cdot λ + w (λ)}{β_{r}^{(A)}}] \\ + [φ \cdot λ + w (λ)] (\frac{1}{β_{f}^{(A)}} - \frac{1}{β_{r}^{(A)}} + \frac{2}{β_{r}^{(A)}}) + (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) \cdot λ \end{cases}}, \\ ϕ_{2} & = \frac{1}{κ_{2}} {\begin{cases} e r f (\frac{λ - μ_{λ_{C}} (0) [1 - \frac{w (λ)}{ε_{r / f} (κ_{A}^{(C)}) ϕ_{max}}]}{\sqrt{2} σ_{λ_{C}} (0) [1 - \frac{w (λ)}{ε_{r / f} (κ_{A}^{(C)}) ϕ_{max}}]}) \cdot [\frac{φ \cdot λ + w (λ)}{β_{f}^{(A)}} - \frac{φ \cdot λ + w (λ)}{β_{r}^{(A)}}] \\ + [φ \cdot λ + w (λ)] (\frac{1}{β_{f}^{(A)}} - \frac{1}{β_{r}^{(A)}} + \frac{2}{β_{r}^{(A)}}) + (η_{b} + η_{c}) \cdot λ \end{cases}}, \\ ϕ_{3} & = \frac{1}{κ_{3}} {\frac{[φ \cdot λ + w (λ)]}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ - μ_{λ_{C}} (0) [1 - \frac{w (λ)}{ε_{r / f} (κ_{A}^{(C)}) ϕ_{max}}]}{\sqrt{2} σ_{λ_{C}} (0) [1 - \frac{w (λ)}{ε_{r / f} (κ_{A}^{(C)}) ϕ_{max}}]})] + η_{c} \cdot λ}, \\ ϕ_{4} & = \frac{1}{κ_{4}} \cdot \frac{[φ \cdot λ + w (λ)]}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ - μ_{λ_{C}} (0) [1 - \frac{w (λ)}{ε_{r / f} (κ_{A}^{(C)}) ϕ_{max}}]}{\sqrt{2} σ_{λ_{C}} (0) [1 - \frac{w (λ)}{ε_{r / f} (κ_{A}^{(C)}) ϕ_{max}}]})], \end{aligned}

where $w (λ)$ follows Equation S123. When $κ_{A}$ lies in the vicinity of $κ_{A}^{(C)}$ or $w$ is small so that

(1 - \frac{w}{ε_{r / f} (κ_{A}) \cdot ϕ_{max}}) / (1 - \frac{w}{ε_{r / f} (κ_{A}^{(C)}) \cdot ϕ_{max}}) \approx 1

then we have:

{\begin{aligned} J_{f}^{(N)} (λ, w) = \frac{φ \cdot λ + w}{β_{f}^{(A)}} \cdot [e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)}) + 1], \\ J_{r}^{(N)} (λ, w) = \frac{φ \cdot λ + w}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})], \end{aligned}

and thus:

{\begin{aligned} ϕ_{1} & = \frac{1}{2 κ_{1}} {\begin{cases} [φ \cdot λ + w (λ)] (\frac{1}{β_{f}^{(A)}} - \frac{1}{β_{r}^{(A)}}) \cdot [e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)}) + 1] \\ + \frac{2}{β_{r}^{(A)}} [φ \cdot λ + w (λ)] + (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) \cdot λ \end{cases}}, \\ ϕ_{2} & = \frac{1}{κ_{2}} {\begin{cases} [φ \cdot λ + w (λ)] (\frac{1}{β_{f}^{(A)}} - \frac{1}{β_{r}^{(A)}}) \cdot [e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)}) + 1] \\ + \frac{2}{β_{r}^{(A)}} [φ \cdot λ + w (λ)] + (η_{b} + η_{c}) \cdot λ \end{cases}}, \\ ϕ_{3} & = \frac{1}{κ_{3}} {\frac{[φ \cdot λ + w (λ)]}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})] + η_{c} \cdot λ}, \\ ϕ_{4} & = \frac{1}{κ_{4}} \cdot \frac{[φ \cdot λ + w (λ)]}{β_{r}^{(A)}} \cdot [1 - e r f (\frac{λ (κ_{A}, 0) - μ_{λ_{C}} (0)}{\sqrt{2} σ_{λ_{C}} (0)})], \end{aligned}

Note that in Equation S123, $w$ is a linear function of $λ$ with a negative slope. Thus $ϕ_{i}$ exhibits a linear relation with $λ$ when Equation S125 is satisfied (see Equation S127). In fact, the slope of $ϕ_{4}$ is certainly negative (combining Equations S64, S123 and S127), while the sign of the slope for other $ϕ_{i}$ depends on parameters. For a given nutrient, the enzymes corresponding to the same $ϕ_{i}$ should exhibit the same slope sign. Another restriction is that if the slope sign of $ϕ_{1}$ is negative, then the slope sign of $ϕ_{2}$ is surely negative. In Appendix 1—figure 3K-N, we show that our model results agree well with the experimental data (Basan et al., 2015; Equation S127).

Appendix 7

Other aspects of the model

7.1 A coarse-grained model with more details

To compare with experiments, we consider a coarse-grained model with more details, as shown in Appendix 1—figure 2F. Here, nodes M₆, M₇ represent GA3P and DHAP, respectively. Other nodes follow the descriptions specified in Appendix 3.1. Each biochemical reaction follows Equation S5 with $b_{i} = 1$ except that M₁→M₆+M₇ and M₃+M₅→M₄. By applying flux balance to the stoichiometric fluxes, combined with Equation S8, we obtain:

{\begin{cases} Φ_{A} \cdot ξ_{A} = Φ_{1} \cdot ξ_{1} + Φ_{a 1} \cdot ξ_{a 1}, \\ Φ_{11} \cdot ξ_{11} = Φ_{10} \cdot ξ_{10} + Φ_{1} \cdot ξ_{1}, Φ_{10} \cdot ξ_{10} = Φ_{1} \cdot ξ_{1}, \\ Φ_{11} \cdot ξ_{11} = Φ_{2} \cdot ξ_{2} + Φ_{5} \cdot ξ_{5} + Φ_{a 2} \cdot ξ_{a 2}, \\ Φ_{2} \cdot ξ_{2} = Φ_{3} \cdot ξ_{3} + Φ_{6} \cdot ξ_{6} + Φ_{b} \cdot ξ_{b}, \\ Φ_{5} \cdot ξ_{5} + Φ_{4} \cdot ξ_{4} = Φ_{3} \cdot ξ_{3} + Φ_{d} \cdot ξ_{d}, \\ Φ_{3} \cdot ξ_{3} = Φ_{4} \cdot ξ_{4} + Φ_{c} \cdot ξ_{c} . \end{cases}

While Equations S22-S25 still hold. By applying the substitutions specified in Equations S9, S12, S14-S18, combined with Equations S4, S10, S22-S25, S128, and the constraint of proteome resource allocation, we get:

{\begin{cases} ϕ_{A} \cdot κ_{A} = ϕ_{1} \cdot κ_{1} + ϕ_{a 1} \cdot κ_{a 1}, \\ ϕ_{11} \cdot κ_{11} = ϕ_{10} \cdot κ_{10} + ϕ_{1} \cdot κ_{1}, ϕ_{10} \cdot κ_{10} = ϕ_{1} \cdot κ_{1}, \\ ϕ_{11} \cdot κ_{11} = ϕ_{2} \cdot κ_{2} + ϕ_{5} \cdot κ_{5} + ϕ_{a 2} \cdot κ_{a 2}, \\ ϕ_{2} \cdot κ_{2} = ϕ_{3} \cdot κ_{3} + ϕ_{6} \cdot κ_{6} + ϕ_{b} \cdot κ_{b}, \\ ϕ_{5} \cdot κ_{5} + ϕ_{4} \cdot κ_{4} = ϕ_{3} \cdot κ_{3} + ϕ_{d} \cdot κ_{d}, \\ ϕ_{3} \cdot κ_{3} = ϕ_{4} \cdot κ_{4} + ϕ_{c} \cdot κ_{c}, \\ ϕ_{a 1} \cdot κ_{a 1} = η_{a 1} \cdot λ, ϕ_{a 2} \cdot κ_{a 2} = η_{a 2} \cdot λ, ϕ_{b} \cdot κ_{b} = η_{b} \cdot λ, ϕ_{c} \cdot κ_{c} = η_{c} \cdot λ, ϕ_{d} \cdot κ_{d} = η_{d} \cdot λ, \\ β_{1} \cdot ϕ_{1} \cdot κ_{1} + β_{2} \cdot ϕ_{2} \cdot κ_{2} + β_{3} \cdot ϕ_{3} \cdot κ_{3} + β_{4} \cdot ϕ_{4} \cdot κ_{4} + β_{6} \cdot ϕ_{6} \cdot κ_{6} + β_{a 1} \cdot ϕ_{a 1} \cdot κ_{a 1} = J_{E}^{(N)}, \\ J_{E}^{(N)} = η_{E} \cdot λ, λ = ϕ_{R} \cdot κ_{t}, J_{r}^{(N)} = ϕ_{4} \cdot κ_{4}, J_{f}^{(N)} = ϕ_{6} \cdot κ_{6}, \\ ϕ_{R} + ϕ_{A} + ϕ_{1} + ϕ_{2} + ϕ_{3} + ϕ_{4} + ϕ_{5} + ϕ_{6} + ϕ_{7} + ϕ_{8} + ϕ_{a 1} + ϕ_{a 2} + ϕ_{b} + ϕ_{c} + ϕ_{d} = ϕ_{max} . \end{cases}

Then, Equation S28 still holds, while $ϕ_{10}$ and $ϕ_{11}$ are:

{\begin{aligned} ϕ_{10} & = [J_{r}^{(N)} + J_{f}^{(N)} + (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) λ] / 2 \cdot κ_{10} (2 \cdot κ_{10}), \\ ϕ_{11} & = [J_{r}^{(N)} + J_{f}^{(N)} + (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) λ] / κ_{11} . \end{aligned}

By substituting Equations S28 and S130 into Equation S129, we get:

{\begin{aligned} J_{r}^{(E)} + J_{f}^{(E)} = φ \cdot λ, \\ \frac{J_{r}^{(E)}}{ε_{r}^{(d t)}} + \frac{J_{f}^{(E)}}{ε_{f}^{(d t)}} = ϕ_{max} - ψ_{d t} \cdot λ, \end{aligned}

where ‘dt’ stands for details. Equations S30 and S33 still hold. $ε_{r}^{(dt)}$ and $ε_{f}^{(dt)}$ represent the proteome efficiencies for energy biogenesis in the respiration and fermentation pathways, respectively, with

{\begin{aligned} ε_{r}^{(dt)} & = \frac{β_{r}^{(A)}}{1 / κ_{A} + 1 / κ_{1} + 1 / κ_{10} + 2 / κ_{11} + 2 / κ_{2} + 2 / κ_{3} + 2 / κ_{4}}, \\ ε_{f}^{(dt)} & = \frac{β_{f}^{(A)}}{1 / κ_{A} + 1 / κ_{1} + 1 / κ_{10} + 2 / κ_{11} + 2 / κ_{2} + 2 / κ_{6}} . \end{aligned}

$ψ_{dt}^{- 1}$ is the proteome efficiency for biomass generation in the biomass synthesis pathway, with

\begin{aligned} ψ_{dt} = \frac{1}{κ_{t}} + \frac{1 + η_{a 1} + η_{c}}{2 κ_{A}} + (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) (\frac{1}{2 κ_{1}} + \frac{1}{2 κ_{10}} + \frac{1}{κ_{11}}) \\ + \frac{η_{b} + η_{c}}{κ_{2}} + \frac{η_{c}}{κ_{3}} + \frac{η_{c} + η_{d}}{κ_{5}} + \sum_{i}^{a 1, a 2, b, c, d} \frac{η_{i}}{κ_{i}} . \end{aligned}

7.2 Estimation of the in vivo enzyme catalytic rates

We use the method introduced by Davidi et al., 2016, combined with proteome experimental data (Basan et al., 2015; Appendix 1—table 2), to estimate the in vivo enzyme catalytic rates. Combining Equations S28 and S130, we have:

{\begin{aligned} κ_{1} & = [J_{r}^{(N)} + J_{f}^{(N)} + (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) λ] / (2 \cdot ϕ_{1}), \\ κ_{2} & = [J_{r}^{(N)} + J_{f}^{(N)} + (η_{b} + η_{c}) λ] / ϕ_{2}, \\ κ_{3} & = (J_{r}^{(N)} + η_{c} \cdot λ) / ϕ_{3}, κ_{4} = J_{r}^{(N)} / ϕ_{4}, \\ κ_{5} & = (η_{c} + η_{d}) λ / ϕ_{5}, κ_{6} = J_{f}^{(N)} / ϕ_{6} \\ κ_{10} & = [J_{r}^{(N)} + J_{f}^{(N)} + (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) λ] / (2 \cdot ϕ_{10}), \\ κ_{11} & = [J_{r}^{(N)} + J_{f}^{(N)} + (η_{a 2} + η_{b} + 2 η_{c} + η_{d}) λ] / ϕ_{11} . \end{aligned}

Here, $J_{r}^{(N)}$ , $J_{f}^{(N)}$ , $λ$ and $ϕ_{i}$ ( $i = 1 -6,10-11$ ) are measurable from experiments (see Appendix 9.1 and Appendix 1—table 2). Thus, we can obtain the in vivo values of $κ_{i}$ from Equation S134. Combined with Equations S17 and S20, we have

k_{i}^{c a t} = \frac{r_{c a r b o n}}{r_{p r o t e i n}} \cdot \frac{m_{E_{i}}}{m_{c a r b o n}} \cdot κ_{i} \cdot [\sum_{i} r_{i} / N_{E P_{i}}^{c a r b o n}]

Equation S135 is the in vivo result for the enzyme catalytic rate. In Appendix 1—figure 2G, we show a comparison between in vivo and in vitro results for $k_{cat}$ values of enzymes within glycolysis and the TCA cycle, which are roughly consistent. In the applications, we prioritized the use of in vivo results for enzyme catalytic rates, and use in vitro data as a substitute when there were gaps.

7.3 Comparison with existing models that illustrate experimental results

For the coarse-grained model described in Appendix 3, the normalized stoichiometric influx of a Group A carbon source is given by:

J_{i n}^{(N)} \equiv J_{A}^{(N)} = ϕ_{A} \cdot κ_{A} .

Combined with the first equation in Equation S28 and Equation S30, we obtain:

J_{i n}^{(N)} - ϑ \cdot λ = \frac{J_{r}^{(E)}}{β_{r}^{(A)}} + \frac{J_{f}^{(E)}}{β_{f}^{(A)}},

where $ϑ = η_{a 1} + η_{c} + (η_{a 2} + η_{b} + η_{d}) / 2$ . Evidently, $β_{r}^{(A)}$ , $β_{f}^{(A)}$ and $ϑ$ are constant parameters. In this subsection, we highlight the major differences between our model presented in Appendix 3 and existing models that illustrate the growth rate dependence of fermentation flux in the standard picture of overflow metabolism (Basan et al., 2015; Holms, 1996; Meyer et al., 1984; Nanchen et al., 2006).

Based on the modeling principles rather than the detailed mechanisms, there are two major classes of existing models that can illustrate experimental results. Both classes of models regard the proteome efficiencies $ε_{r}$ and $ε_{f}$ as constants, with $ε_{f} > ε_{r}$ if used, or follow functionally equivalent propositions. However, in our model, $ε_{r}$ and $ε_{f}$ are both functions of $κ_{A}$ , which vary significantly upon nutrient perturbation, with $ε_{r} (κ_{A} \to 0) > ε_{f} (κ_{A} \to 0)$ and $ε_{r} (κ_{A}^{m a x}) < ε_{f} (κ_{A}^{m a x})$ (see Equations S38, S40-S41). Furthermore, there are significant differences in the modeling and optimization principles, as listed below.

The first class of models (Chen and Nielsen, 2019; Majewski and Domach, 1990; Shlomi et al., 2011; Varma and Palsson, 1994; Vazquez et al., 2010; Vazquez and Oltvai, 2016; Zhuang et al., 2011) optimize the ratio of biomass outflow to carbon influx $λ / J_{i n}^{(N)}$ , either to optimize the growth rate for a given carbon influx or to minimize the carbon influx for a given growth rate. Since respiration is far more efficient than fermentation in terms of energy biogenesis per unit carbon, to optimize the ratio $λ / J_{i n}^{(N)}$ , cells would preferentially use respiration when the carbon influx is small. As carbon influx increases above a certain threshold, factors such as proteome allocation direct cells toward fermentation in a threshold-linear response, since they consider $ε_{f} > ε_{r}$ . Our model is significantly different from this class of models in the optimization principle, as we purely optimize the cell growth rate for a given nutrient condition, without imposing a special constraint on the carbon influx.

The second class of models, represented by Basan et al., 2015, also adopt the optimization of $λ / J_{i n}^{(N)}$ in the interpretation of their model results. However, the growth rate dependence of fermentation flux was derived prior to the application of growth rate optimization (although it can be derived by optimizing $λ / J_{i n}^{(N)}$ ). In fact, Equations S29 and S137 in our model are very similar in form to those in Basan et al., 2015, yet there are critical differences, which we list below. In Equation S29, by regarding $J_{r}^{(E)}$ and $J_{f}^{(E)}$ as the two variables in a system of linear equations, we obtain the following expressions:

{\begin{cases} J_{r}^{(E)} & = \frac{ϕ_{max} - (ψ + \frac{φ}{ε_{f}}) \cdot λ}{\frac{1}{ε_{r}} - \frac{1}{ε_{f}}}, \\ J_{f}^{(E)} & = \frac{(ψ + \frac{φ}{ε_{r}}) \cdot λ - ϕ_{max}}{\frac{1}{ε_{r}} - \frac{1}{ε_{f}}} . \end{cases}

In Basan et al., 2015, Equation S138 is considered to be the relation between $J_{r / f}^{(E)}$ and $λ$ upon nutrient (and thus $J_{i n}^{(N)}$ ) perturbation, while $ε_{r}$ and $ε_{f}$ are regarded as constants throughout the perturbation. By contrast, in our model, Equation S138 serves as a constraint under a given nutrient condition with fixed $κ_{A}$ , and is not relevant to nutrient perturbation. For wild-type strains, if $ε_{r} (κ_{A}) > ε_{f} (κ_{A})$ (or vice versa), then the solution for optimal growth is $J_{r}^{(E)} (κ_{A}) = φ \cdot λ (κ_{A})$ and $J_{f}^{(E)} (κ_{A}) = 0$ , with $λ (κ_{A}) = \frac{ε_{r} (κ_{A}) \cdot ϕ_{m a x}}{φ + ε_{r} (κ_{A}) \cdot ψ (κ_{A})}$ . This solution, which satisfies Equation S138, corresponds to a point rather than a line in the relation between growth rate $λ$ and normalized energy flux $J_{r / f}^{(E)}$ upon $κ_{A}$ perturbation.

Appendix 8

Probability density functions of variables and parameters

8.1 Probability density function of $κ_{i}$

Enzyme catalysis is crucial for the survival of living organisms, as it significantly accelerates biochemical reactions by reducing the energy barrier between the substrate and product (Nelson and Cox, 2008). However, the maximal turnover rate of enzymes, $k_{cat}$ , varies notably between in vivo and in vitro measurements (Davidi et al., 2016). Recent studies suggest that differences in the aquatic medium are the primary cause of this variation (Davidi et al., 2016; García-Contreras et al., 2012). In particular, potassium and phosphate concentrations have a significant influence on $k_{cat}$ (García-Contreras et al., 2012), and these concentrations exhibit some degree of variation among cell populations under intracellular conditions (García-Contreras et al., 2012). For simplicity, we assume that the turnover rate of each enzyme $E_{i}$ , $k_{i}^{cat}$ , follows a Gaussian distribution $N (μ_{k_{i}^{cat}}, σ_{k_{i}^{cat}}^{2})$ with $k_{i}^{cat} > 0$ among cells (representing extrinsic noise [Elowitz et al., 2002], denoted as $χ_{ext}$ ). The probability density function of $k_{i}^{cat}$ is then given by:

k_{i}^{c a t} \sim N^{'} (x; μ_{k_{i}^{c a t}}, σ_{k_{i}^{c a t}}^{2}) = {\begin{aligned} l & \frac{1}{σ_{k_{i}^{c a t}} \sqrt{2 π}} e^{- \frac{1}{2} {(\frac{x - μ_{k_{i}^{c a t}}}{σ_{k_{i}^{c a t}}})}^{2}}, & x \geq 0. \\ 0, & x < 0. \end{aligned}

When the CV of the $k_{i}^{cat}$ distribution (i.e. $σ_{k_{i}^{cat}} / μ_{k_{i}^{cat}}$ ) is less than $1 / 3$ , $N^{'} (x; μ_{k_{i}^{cat}}, σ_{k_{i}^{cat}}^{2})$ is almost identical to $N (μ_{k_{i}^{cat}}, σ_{k_{i}^{cat}}^{2})$ . In this case, $1 / k_{i}^{cat}$ follows the positive inverse of Gaussian (IOG) distribution, and the probability density function is:

IOG (x; μ_{1 / k_{i}^{cat}}, ζ_{1 / k_{i}^{cat}}) = {\begin{aligned} \sqrt{\frac{ζ_{1 / k_{i}^{cat}}}{2 π x^{4}}} e x p (- \frac{1}{2} \frac{ζ_{1 / k_{i}^{cat}} {(x - μ_{1 / k_{i}^{cat}})}^{2}}{x^{2} μ_{1 / k_{i}^{cat}}^{2}}), & x \geq 0, \\ 0, & x < 0, \end{aligned}

where $ζ_{1 / k_{i}^{cat}} = 1 / σ_{k_{i}^{cat}}^{2}$ is the shape parameter, and $μ_{1 / k_{i}^{cat}} = 1 / μ_{k_{i}^{cat}}$ is the mean.

Meanwhile, due to the stochastic nature of biochemical reactions, we apply Gillespie’s chemical Langevin equation (Gillespie, 2000) to account for intrinsic noise (Elowitz et al., 2002) (denoted as $χ_{int}$ ). For cell size regulation of E. coli within a cell cycle, the cell mass at the initiation of DNA replication per chromosome origin remains constant (Donachie, 1968). Thus, the time required for enzyme $E_{i}$ to complete a catalytic job (with a timescale of $1 / k_{i}^{cat}$ ) can be approximated as the first passage time of a stochastic process, with

{\begin{cases} X_{i} (t = 0) = 0, \\ d X_{i} / d t = α_{i} + \sqrt{α_{i}} Γ_{i} (t), \\ T_{Θ} = i n f {t > 0 | X_{i} (t) = Θ} . \end{cases}

Here $α_{i} \equiv k_{i}^{cat} \cdot Θ$ , where $Θ$ is proportional to the cell volume, and $Γ_{i} (t)$ represents independent, temporally uncorrelated Gaussian white noise. Then, for a given value of $k_{i}^{cat}$ , the first passage time $T_{Θ}$ follows an Inverse Gaussian (IG) distribution (Folks and Chhikara, 1978):

IG (x; μ_{1 / k_{i}^{cat}}^{'}, ζ_{1 / k_{i}^{cat}}^{'}) = {\begin{aligned} \sqrt{\frac{ζ_{1 / k_{i}^{cat}}^{'}}{2 π x^{3}}} e x p (- \frac{1}{2} \frac{ζ_{1 / k_{i}^{cat}}^{'} {(x - μ_{1 / k_{i}^{cat}}^{'})}^{2}}{x μ_{1 / k_{i}^{cat}}^{' 2}}), & x \geq 0, \\ 0, & x < 0, \end{aligned}

where $ζ_{1 / k_{i}^{cat}}^{'} = Θ / k_{i}^{cat}$ is the shape parameter, and $μ_{1 / k_{i}^{cat}}^{'} = 1 / k_{i}^{cat}$ represents the mean. The variance of this distribution is $σ_{1 / k_{i}^{cat}}^{' 2} \equiv μ_{1 / k_{i}^{cat}}^{' 3} / ζ_{1 / k_{i}^{cat}}^{'} = 1 / [Θ \cdot {(k_{i}^{cat})}^{2}]$ . Thus, we can obtain the CV:

σ_{1 / k_{i}^{c a t}}^{'} / μ_{1 / k_{i}^{c a t}}^{'} = Θ^{- \frac{1}{2}},

which is inversely proportional to the square root of cell volume. Evidently, the intrinsic and extrinsic noise make orthogonal contributions to the total noise (Elowitz et al., 2002) (denoted as $χ_{tot}$ ):

χ_{tot}^{2} = χ_{int}^{2} + χ_{ext}^{2} .

In fact, when the CV is small (i.e. CV <<1), both the IOG and IG distributions converge into Gaussian distributions (Appendix 1—figure 4). In the back-of-the-envelope calculations, we approximate x in all denominator terms of $IOG (x; μ, ζ)$ and $IG (x; μ, ζ)$ as μ (since CV <<1). Then, both the IOG and IG distributions can be approximated as follows:

IOG (x; μ_{1 / k_{i}^{cat}}, ζ_{1 / k_{i}^{cat}}) \overset{CV ≪ 1}{\to} N (μ_{1 / k_{i}^{cat}}, σ_{1 / k_{i}^{cat}}^{2}),

with a variance of $σ_{1 / k_{i}^{cat}}^{2} = μ_{1 / k_{i}^{cat}}^{4} / ζ_{1 / k_{i}^{cat}}$ , and

IG (x; μ_{1 / k_{i}^{cat}}^{'}, ζ_{1 / k_{i}^{cat}}^{'}) \overset{CV ≪ 1}{\to} N (μ_{1 / k_{i}^{cat}}^{'}, σ_{1 / k_{i}^{cat}}^{'^{2}}),

with a variance of $σ_{1 / k_{i}^{cat}}^{'^{2}} = μ_{1 / k_{i}^{cat}}^{'^{3}} / ζ_{1 / k_{i}^{cat}}^{'}$ . Rigorously, we show below that $IG (x; μ, ζ)$ shrinks to be $N (μ, μ^{3} / ζ)$ when the CV is small. For the IG distribution, the characteristic function of the variable x is given by Folks and Chhikara, 1978; Kampen, 1992:

G (k) = \int_{- \infty}^{\infty} e^{i k x} \cdot IG (x; μ, ζ) d x = e x p {\frac{ζ}{μ} [1 - \sqrt{1 - \frac{2 i μ^{2} k}{ζ}}]},

and therefore,

IG (x; μ, ζ) = \frac{1}{2 π} \int_{- \infty}^{\infty} e^{- i k x} \cdot G (k) d k .

When the variance $σ^{2} \equiv μ^{3} / ζ$ is very small, we essentially require $2 μ^{2} k / ζ = 2 σ^{2} k / μ ≪ 1$ , and then $\sqrt{1 - \frac{2 i μ^{2} k}{ζ}} \approx 1 - \frac{μ^{2}}{ζ} k i + \frac{μ^{4}}{2 ζ^{2}} k^{2}$ . Thus,

{\begin{aligned} G (k) \approx \exp (μ k i - \frac{μ^{3}}{2 ζ} k^{2}), \\ IG (x; μ, ζ) \approx \sqrt{\frac{ζ}{2 π μ^{3}}} \exp (- \frac{ζ (x - μ)^{2}}{2 μ^{3}}) = N (μ, \frac{μ^{3}}{ζ}) . \end{aligned}

This leads to:

\underset{σ \to 0}{l i m} IG (x; μ, ζ) = N (μ, μ^{3} / ζ) .

In fact, intrinsic noise does affect the short-term measurement of enzyme catalytic rate and growth rate at the single-cell level. However, its contribution in the long term is averaged out and thus becomes negligible. For simplicity, we approximate $χ_{tot} \approx χ_{ext}$ . Combined with Equations S145-S146, it is straightforward to verify that $1 / k_{i}^{cat}$ shares roughly the same CV as $k_{i}^{cat}$ :

σ_{1 / k_{i}^{c a t}} / μ_{1 / k_{i}^{c a t}} = σ_{k_{i}^{c a t}} / μ_{k_{i}^{c a t}} .

For convenience, in the model analysis, we approximate both IOG and IG distributions as Gaussian distributions. Then, all $1 / k_{i}^{cat}$ are independent, normally distributed random variables following Gaussian distributions:

1 / k_{i}^{cat} \sim N (μ_{1 / k_{i}^{cat}}, σ_{1 / k_{i}^{cat}}^{2}) .

Using the properties of Gaussian distributions, for a series of constant real numbers $γ_{i}$ , the summation of $γ_{i} / k_{i}^{cat}$ , which we define as $Ξ \equiv \sum_{i = 1}^{n} γ_{i} / k_{i}^{cat}$ , follows a Gaussian distribution (Kampen, 1992):

Ξ \sim N (μ_{Ξ}, σ_{Ξ}^{2}),

with $μ_{Ξ} = \sum_{i = 1}^{n} γ_{i} μ_{1 / k_{i}^{cat}}$ and $σ_{Ξ}^{2} = \sum_{i = 1}^{n} {(γ_{i} σ_{1 / k_{i}^{cat}})}^{2}$ . The relation between $κ_{i}$ and $k_{i}^{cat}$ is shown in Equation S12. To optimize cell growth rate, each $κ_{i}$ of the intermediate nodes satisfies Equation S20, while $κ_{A}$ satisfies Equation S27. Thus, for a given nutrient condition ([A] is fixed), all the ratios $k_{i}^{cat} / κ_{i}$ are constants. Combined with Equations S139, S145-S146, and S152, the distributions of all $κ_{i}$ and $1 / κ_{i}$ can be approximated as Gaussian distributions:

{\begin{aligned} κ_{i} & \sim N (μ_{κ_{i}}, σ_{κ_{i}}^{2}), \\ \frac{1}{κ_{i}} & \sim N (μ_{1 / κ_{i}}, σ_{1 / κ_{i}}^{2}), \end{aligned}

where $μ_{κ_{i}}$ and $μ_{1 / κ_{i}}$ are the means of $κ_{i}$ and $1 / κ_{i}$ , and $σ_{κ_{i}}$ and $σ_{1 / κ_{i}}$ are their standard deviations. Using the properties of Gaussian distributions, combined with Equation S31, S32, S36, S42-S43, S145-S146 and S153, $ε_{r}$ , $ε_{f}$ , $ψ$ , $λ_{r}$ , $λ_{f}$ , $κ_{A}^{(C)}$ and $λ_{C}$ also roughly follow Gaussian distributions.

8.2 Probability density function of the growth rate $λ$

From Appendix 8.1, we note that $λ_{r}$ and $λ_{f}$ (see Equation S36) roughly follow Gaussian distributions, with

{\begin{cases} λ_{r} \sim N (μ_{λ_{r}}, σ_{λ_{r}}^{2}), \\ λ_{f} \sim N (μ_{λ_{f}}, σ_{λ_{f}}^{2}), \end{cases}

where $μ_{λ_{r / f}}$ and $σ_{λ_{r / f}}$ represent the mean and standard deviation, respectively. We further assume that the correlation between $λ_{r}$ and $λ_{f}$ is $ρ_{r f}$ . From Equation S36, we see that the growth rate $λ$ takes the maximum of $λ_{r}$ and $λ_{f}$ , i.e.,

λ = m a x (λ_{r}, λ_{f}) .

Then, the cumulative distribution function of $λ$ is $P (λ \leq x) = \int_{- \infty}^{x} \int_{- \infty}^{x} f (x_{1}, x_{2}) d x_{1} d x_{2}$ , where

\begin{aligned} f (x_{1}, x_{2}) = & \frac{{(1 - ρ_{r f}^{2})}^{- \frac{1}{2}}}{2 π σ_{λ_{r}} σ_{λ_{f}}} \exp (- \frac{1}{2 (1 - ρ_{r f}^{2})} [{(\frac{x_{1} - μ_{λ_{r}}}{σ_{λ_{r}}})}^{2} \\ - 2 ρ_{r f} (\frac{x_{1} - μ_{λ_{r}}}{σ_{λ_{r}}}) (\frac{x_{2} - μ_{λ_{f}}}{σ_{λ_{f}}}) + {(\frac{x_{2} - μ_{λ_{f}}}{σ_{λ_{f}}})}^{2}]) . \end{aligned}

Thus, the probability density function of the growth rate $λ$ is given by:

\begin{aligned} f_{λ} (x) = & \frac{1}{2 \sqrt{2 π} σ_{λ_{r}}} e^{- \frac{1}{2} {(\frac{x - μ_{λ_{r}}}{σ_{λ_{r}}})}^{2}} [erf (\frac{(x - μ_{λ_{f}}) σ_{λ_{r}} - ρ_{r f} σ_{λ_{f}} (x - μ_{λ_{r}})}{σ_{λ_{r}} σ_{λ_{f}} \sqrt{2 (1 - ρ_{r f}^{2})}}) + 1] \\ + \frac{1}{2 \sqrt{2 π} σ_{λ_{f}}} e^{- \frac{1}{2} {(\frac{x - μ_{λ_{f}}}{σ_{λ_{f}}})}^{2}} [erf (\frac{(x - μ_{λ_{r}}) σ_{λ_{f}} - ρ_{r f} σ_{λ_{r}} (x - μ_{λ_{f}})}{σ_{λ_{r}} σ_{λ_{f}} \sqrt{2 (1 - ρ_{r f}^{2})}}) + 1] . \end{aligned}

In Appendix 1—figure 2B, we show that Equation S157 quantitatively matches the experimental data for E. coli under the relevant conditions.

Appendix 9

Model comparison with experiments on E. coli

9.1 Flux comparison with experiments on E. coli

In Appendix 7.2, we see that the values of $J_{f}^{(N)}$ and $J_{r}^{(N)}$ are required to calculate the in vivo enzyme catalytic rates of the intermediate nodes. Here, we use $J_{acetate}$ and $J_{{CO}_{2}, r}$ to represent the stoichiometric fluxes of acetate from the fermentation pathway and CO₂ from the respiration pathway, respectively. Combined with the stoichiometric coefficients of both pathways, we have:

{\begin{cases} J_{acetate} = J_{f}, \\ J_{{CO}_{2}, r} = 3 \cdot J_{r} . \end{cases}

By further combining with Equations S16-S17, we get:

{\begin{aligned} J_{f}^{(N)} & = J_{a c e t a t e} \cdot \frac{m_{c a r b o n}}{M_{c a r b o n}} \cdot {[\sum_{i} r_{i} / N_{E P_{i}}^{c a r b o n}]}^{- 1}, \\ J_{r}^{(N)} & = \frac{1}{3} \cdot J_{C O_{2}, r} \cdot \frac{m_{c a r b o n}}{M_{c a r b o n}} \cdot {[\sum_{i} r_{i} / N_{E P_{i}}^{c a r b o n}]}^{- 1} . \end{aligned}

In fact, the values of $J_{acetate}$ and $J_{{CO}_{2}, r}$ scale with the mass of the ‘big cell,’ which increases over time. In experiments, the measurable fluxes are typically expressed in the unit of mM/OD₆₀₀/h (Basan et al., 2015). Thus, we define $J_{acetate}^{(M)}$ and $J_{{CO}_{2}, r}^{(M)}$ as the fluxes of $J_{acetate}$ and $J_{{CO}_{2}, r}$ (per biomass) in the unit of mM/OD₆₀₀/h, respectively. The superscript ‘(M)’ represents the measurable flux in this unit. For E. coli, we use the following biochemical data collected from published literature: 1 OD₆₀₀ roughly corresponds to 6×10⁸ cells/mL (Stevenson et al., 2016), the average mass of a cell is 1 pg (Milo and Phillips, 2015), the biomass percentage of the cell weight is 30% (Neidhardt et al., 1990), the molar mass of carbon is 12 g (Nelson and Cox, 2008), $r_{carbon} = 0.48$ (Neidhardt et al., 1990) and $r_{protein} = 0.55$ (Neidhardt et al., 1990). Combined with the values of $r_{i}$ (see Appendix 2.2) and $N_{{EP}_{i}}^{carbon}$ , where $N_{{EP}_{a 1}}^{carbon} = 6$ , $N_{{EP}_{a 2}}^{carbon} = 3$ , $N_{{EP}_{b}}^{carbon} = 3$ , $N_{{EP}_{c}}^{carbon} = 5$ , and $N_{{EP}_{d}}^{carbon} = 4$ (Nelson and Cox, 2008), we have:

{\begin{cases} J_{f}^{(N)} \approx J_{a c e t a t e}^{(M)} / 2, \\ J_{r}^{(N)} \approx J_{C O_{2}, r}^{(M)} / 6. \end{cases}

From Equation S18, we obtain the values of $η_{i}$ for each precursor pool: $η_{a 1} = 0.15$ , $η_{a 2} = 0.30$ , $η_{b} = 0.35$ , $η_{c} = 0.09$ , and $η_{d} = 0.11$ . Still, the value of $η_{E}$ is required to compare the growth rate dependence of fermentation/respiration fluxes between model results and experiments, which we will specify in Appendix 9.2.

9.2 Model parameter settings using experimental data of E. coli

We have collected biochemical data for E. coli, as shown in Appendix 1—table 1 and Appendix 1—table 2, to set the model parameters. This includes the molecular weight (MW) and in vitro k_cat values of the catalytic enzymes, as well as the proteome and flux data used to calculate the in vivo turnover numbers. To reduce measurement noise, we take the average rather than the maximum value of in vivo k_cat from calculations using data from four cultures (see Appendix 1—table 2). Here, we prioritize the use of in vivo k_cat wherever applicable unless there is a gap in the in vivo data (see Appendix 1—table 1).

Note that our models are coarse grained. For example, the flux $J_{3}$ shown in Figure 1B actually corresponds to three different reactions in the metabolic network (see Figure 1A and Appendix 1—table 1), which we label as $J_{3}^{(i)}$ (i=1, 2, 3). For each $J_{3}^{(i)}$ , there are corresponding variables/parameters of $Φ_{3}^{(i)}$ , $ξ_{3}^{(i)}$ , $ϕ_{3}^{(i)}$ , $κ_{3}^{(i)}$ satisfying Equations S8, S9 and S12, Evidently, $J_{3}^{(i)} = J_{3}$ (i=1, 2, 3), and it is straightforward to derive the following relation between $κ_{3}^{(i)}$ and $κ_{3}$ :

1 / κ_{3} = \sum_{i = 1}^{3} 1 / κ_{3}^{(i)} .

In fact, Equation S161 can be generalized to determine the values of other $κ_{i}$ in the coarse-grained models combined with the biochemical data. For the coarse-grained model of Group A carbon source utilization shown in Figure 1B, we have the values for parameters $κ_{i}$ (i=1, …, 6), and then $ε_{r / f} (κ_{A}^{(C)}) = 122 (h^{- 1})$ . Evidently, $ε_{r} (κ_{glucose}^{(ST)}) < ε_{f} (κ_{glucose}^{(ST)})$ , $ε_{r} (κ_{lactose}^{(ST)}) < ε_{f} (κ_{lactose}^{(ST)})$ , and thus $ε_{r} (κ_{A}^{m a x}) < ε_{f} (κ_{A}^{m a x})$ . For pyruvate, we have $ε_{r / f}^{(py)} (κ_{py}^{(C)}) = ε_{r / f} (κ_{A}^{(C)}) = 122 (h^{- 1})$ (see Equations S43 and S101), and it is easy to check that $ε_{r} (κ_{py}^{(ST)}) < ε_{f} (κ_{py}^{(ST)})$ .

For the remaining model parameters, note that we have classified the inactive ribosomal-affiliated proteins into the Q-class, and then $ϕ_{m a x} = 48 %$ (Scott et al., 2010). The value of $κ_{t}$ is obtainable from experiments: the translation speed is 20.1aa/s (Scott et al., 2010), with 7336 amino acids per ribosome (Neidhardt, 1996) and $ς \approx 1.67$ (Neidhardt, 1996; Scott et al., 2010) (see Appendix 2.1), hence $κ_{t} = 1 / 610 (s^{- 1})$ . However, there are insufficient data to determine the values of $κ_{i}$ (i=a1, a2, b, c, d) for the metabolites between the entry point metabolites shown in Figure 1A to the precursor pools. These processes involves many steps, so these values are expected to be quite large. Here, we combine the contributions of $κ_{t}$ and $κ_{i}$ (i=a1, a2, b, c, d) by defining a composite parameter:

Ω \equiv 1 / κ_{t} + \sum_{i}^{a 1, a 2, b, c, d} η_{i} / κ_{i} .

We proceed to estimate the values of $Ω$ and $φ$ using experimental data (Basan et al., 2015) for wild-type strains on the $J_{acetate}^{(M)} - λ$ relation (Figure 1C), and then all the remaining model parameters are set accordingly.

For the case of $w_{0} = 0$ , where all k_cat values follow a Gaussian distribution with an extrinsic noise of 25% CV (which is the general setting we use unless otherwise specified), we have $φ = 10.8$ and $Ω = 1345 (s)$ . Accordingly, we obtain $η_{E} = 14.78$ , $μ_{λ_{C}} = 0.92 (h^{- 1})$ , and $σ_{λ_{C}} = 0.12 μ_{λ_{C}}$ , where the CV of the extrinsic noise for $Ω$ is estimated using the averaged CV of other $κ_{i}$ . For the translation inhibition effect of Cm, we estimate the values for $ι$ as $ι_{w_{0} = 0}^{(2 μ m Cm)} = 1.15$ , $ι_{w_{0} = 0}^{(4 μ m Cm)} = 2.33$ , and $ι_{w_{0} = 0}^{(8 μ m Cm)} = 6.25$ , where the superscript stands for the concentration of Cm, and the subscript represents the choice of $w_{0}$ .

For pyruvate, with the value of $η_{E}$ , we get $φ_{py} = 14.82$ . However, there is still a lack of proteome data to determine the value of $κ_{9}$ , which involves many steps in the metabolic network and thus can be considerably large. Here we define another composite parameter, ${Ω^{'}}_{Gg} \equiv (η_{b} + η_{c}) / κ_{8} + η_{a 1} / κ_{9}$ , and estimate its value as ${Ω^{'}}_{Gg} = 690 (s)$ from growth rate data for E. coli measured under the relevant nutrient conditions (Basan et al., 2015), where the subscript ‘Gg’ stands for glucogenesis. Then, $μ_{λ_{C}^{(py)}} = 0.67 (h^{- 1})$ , and $σ_{λ_{C}^{(py)}} = 0.10 μ_{λ_{C}^{(py)}}$ , where the same CV of extrinsic noise for $Ω$ applies to ${Ω^{'}}_{Gg}$ .

For the case of a Group A carbon source mixed with 21 amino acids (21AA, with saturated concentrations), we have $φ_{21AA} = 14.2$ . Comparing Equation S32 with Equation S112, the parameter $Ω$ should change to $Ω_{21AA} \equiv 1 / κ_{t} + η_{a 1} / κ_{a 1} + \sum_{i}^{a 2, b, c, d} η_{i} / κ_{i}^{(21AA)}$ . Obviously, $1 / κ_{t} < Ω_{21AA} < Ω$ , and we estimate $Ω_{21AA} = 1000 (s)$ from the growth rate data for E. coli measured under the relevant nutrient conditions (Wallden et al., 2016). Then, we have $μ_{λ_{C}^{(21 AA)}} = 1.13 (h^{- 1})$ , and $σ_{λ_{C}^{(21 AA)}} = 0.12 μ_{λ_{C}^{(21 AA)}}$ .

For the case of a Group A carbon source mixed with 7 amino acids (7AA: His, Iso, Leu, Lys, Met, Phe, and Val), similar to the roles of $φ_{21AA}$ and $Ω_{21AA}$ , we define $φ_{7AA}$ and $Ω_{7AA}$ . Using the mass fraction of the 7AA combined with Equation S18, we have $φ_{7AA} = 11.6$ . For the value of $Ω_{7AA}$ , evidently, $Ω_{21AA} < Ω_{7AA} < Ω$ , and we estimate $Ω_{7AA} = 1215 (s)$ from growth rate data for E. coli measured under the relevant culture media (Basan et al., 2015). Then, $μ_{λ_{C}^{(7AA)}} = 0.98 (h^{- 1})$ , and $σ_{λ_{C}^{(7 AA)}} = 0.12 μ_{λ_{C}^{(7 AA)}}$ .

For the case of $w_{0} = 2.5 (h^{- 1})$ , we have $φ = 8.3$ , and thus $η_{E} = 12.28$ , while other parameters such as $Ω$ , $μ_{λ_{C}}$ and $σ_{λ_{C}}$ remain the same as for $w_{0} = 0$ . Nevertheless, the values for $ι$ under translation inhibition by Cm are influenced by the choice of $w_{0}$ , where the values of $ι$ change to $ι_{w_{0} = 2.5}^{(2 μ m Cm)} = 1.05$ , $ι_{w_{0} = 2.5}^{(4 μ m Cm)} = 2.00$ , and $ι_{w_{0} = 2.5}^{(8 μ m Cm)} = 5.40$ .

From Appendix 8.1–8.2, combined with Equation S114, the distributions of $λ_{r}^{(21 AA)}$ and $λ_{f}^{(21 AA)}$ can be approximated by Gaussian distributions:

{\begin{aligned} λ_{r}^{(21 AA)} & \sim N (μ_{λ_{r}^{(21 AA)}}, σ_{λ_{r}^{(21 AA)}}^{2}), \\ λ_{f}^{(21 AA)} & \sim N (μ_{λ_{f}^{(21 AA)}}, σ_{λ_{f}^{(21 AA)}}^{2}), \end{aligned}

where $μ_{λ_{r}^{(21 AA)}}$ and $μ_{λ_{f}^{(21 AA)}}$ stand for the mean values, while $σ_{λ_{r}^{(21 AA)}}$ and $σ_{λ_{f}^{(21 AA)}}$ represent the standard deviations. For the case of glucose mixed with 21AA (labeled as ‘Glucose + 21AA’), the distribution of the growth rate $λ_{glucose}^{(21 AA)}$ follows Equation S157. With $Ω_{21AA} = 1000 (s)$ , we have $μ_{λ_{glucose, r}^{(21 AA)}} = 1.34 (h^{- 1})$ , $μ_{λ_{glucose, f}^{(21 AA)}} = 1.46 (h^{- 1})$ (both definitions follow Equation S163), and $ρ_{r f} \approx 1.0$ (obtained from numerical results).

For the case of succinate mixed with 21AA (labeled as ‘Succinate +21AA’), the respiration pathway is always more efficient since succinate lies within the TCA cycle. Thus, the cell growth rate (defined as $λ_{succinate}^{(21 AA)}$ ) would take the value of the respiration one and follows a Gaussian distribution:

λ_{succinate}^{(21 AA)} \sim N (μ_{λ_{succinate}^{(21 AA)}}, σ_{λ_{succinate}^{(21 AA)}}^{2}) .

For the case where acetate is the sole carbon source, the cells exclusively use the respiration pathway, and the growth rate (defined as $λ_{acetate}$ ) follows a Gaussian distribution:

λ_{acetate} \sim N (μ_{λ_{acetate}}, σ_{λ_{acetate}}^{2}) .

Using the measured growth rate data (Wallden et al., 2016), we estimate $μ_{λ_{succinate}^{(21 AA)}} = 0.67 (h^{- 1})$ and $μ_{λ_{acetate}} = 0.253 (h^{- 1})$ . To illustrate the distribution of growth rates $λ_{glucose}^{(21 AA)}$ , $λ_{succinate}^{(21 AA)}$ and $λ_{acetate}$ shown in Appendix 1—figure 2B, if no other source of noise existed, extrinsic noise with a CV of 40% would be required for each k_cat value. Then, $σ_{λ_{glucose, r}^{(21 AA)}} \approx 0.21 μ_{λ_{glucose, r}^{(21 AA)}}$ , $σ_{λ_{glucose, f}^{(21 AA)}} \approx 0.23 μ_{λ_{glucose, f}^{(21 AA)}}$ , $σ_{λ_{succinate}^{(21 AA)}} = 0.22 μ_{λ_{succinate}^{(21 AA)}}$ , and $σ_{λ_{acetate}} = 0.22 μ_{λ_{acetate}}$ . Allowing for the possibility that intrinsic noise may also play a non-negligible role in the observed single-cell growth rate (which is not a long-term average), we still use extrinsic noise with a CV of 25% for the model results of E. coli, except for those shown in Appendix 1—figure 2B.

Appendix 10

Explanation of the Crabtree effect in yeast and the Warburg effect in tumors

Our model, along with the analysis presented in Appendix 3, can be extended with modifications to explain the Crabtree effect in yeast and the Warburg effect in tumors. In both cases, the optimization objective remains maximizing the cell growth rate. Consequently, yeast and tumor cells use the most efficient pathway for ATP production at the single-cell level.

For model applications in yeast or tumor cell metabolism, the fermentation flux shifts from acetate secretion to ethanol and lactate secretion, respectively (see Appendix 1—figure 5A and B). The respiration and biomass generation pathways remain largely similar to those of E. coli, except that the biochemical reactions within the TCA cycle and respiratory chain occur in the mitochondria (see Appendix 1—figure 5C and D). This leads to an increased enzyme cost for the respiration pathway due to energy currency exchanges between NADH or FADH₂ and ATP in the mitochondria. The coarse-grained models for Group A carbon source utilization in yeast and mammalian cells are shown in Appendix 1—figure 5E and F, where M₃ represents pyruvate. In yeast and mammalian cells, the stoichiometric coefficients for ATP production (i.e. $β_{i}$ ) are identical to each other but differ from those of E. coli (see Figure 1B and Appendix 1—figure 5C and D), with $β_{1} = 5$ , $β_{2} = 1$ , $β_{3} = 5$ , $β_{4} = 7.5$ , $β_{6} = - 2.5$ , and $β_{a 1} = 5$ (Nelson and Cox, 2008). Hence, the stoichiometric coefficients of ATP production per glucose in each pathway are $β_{r}^{(A)} = 32$ and $β_{f}^{(A)} = 2$ , respectively, where $β_{r}^{(A)} = β_{1} + 2 (β_{2} + β_{3} + β_{4})$ and $β_{f}^{(A)} = β_{1} + 2 (β_{2} + β_{6})$ .

The impact of maintenance energy in yeast and tumor cells is significantly higher than that in E. coli (Locasale and Cantley, 2010). Therefore, Equation S25 changes to (see Equation S59):

J_{E} = r_{E} \cdot J_{BM} + w_{0} \cdot \frac{M_{carbon}}{m_{0}},

where $w_{0}$ is the aforementioned maintenance energy coefficient. Thus, we have (see Equation S60):

J_{E}^{(N)} = η_{E} \cdot λ + w_{0} .

To account for the protein cost of energy currency exchanges in the mitochondria, we introduce $ϕ_{MT}$ and $κ_{MT}$ to represent the proteomic mass fraction of the enzymes and the effective substrate quality of related metabolites in the mitochondria, respectively. Note that the energy currency exchanges between NADH or FADH₂ and ATP only occur during respiration, as there is no net NADH or FADH₂ generation during fermentation (see Appendix 1—figure 5C and D). Combined with Equation S167, Equation S25 changes to:

{\begin{cases} ϕ_{A} \cdot κ_{A} = ϕ_{1} \cdot κ_{1} + ϕ_{a 1} \cdot κ_{a 1}, \\ 2 ϕ_{1} \cdot κ_{1} = ϕ_{2} \cdot κ_{2} + ϕ_{5} \cdot κ_{5} + ϕ_{a 2} \cdot κ_{a 2}, \\ ϕ_{2} \cdot κ_{2} = ϕ_{3} \cdot κ_{3} + ϕ_{6} \cdot κ_{6} + ϕ_{b} \cdot κ_{b}, \\ ϕ_{5} \cdot κ_{5} + ϕ_{4} \cdot κ_{4} = ϕ_{3} \cdot κ_{3} + ϕ_{d} \cdot κ_{d}, \\ ϕ_{3} \cdot κ_{3} = ϕ_{4} \cdot κ_{4} + ϕ_{c} \cdot κ_{c}, \\ ϕ_{a 1} \cdot κ_{a 1} = η_{a 1} \cdot λ, ϕ_{a 2} \cdot κ_{a 2} = η_{a 2} \cdot λ, ϕ_{b} \cdot κ_{b} = η_{b} \cdot λ, ϕ_{c} \cdot κ_{c} = η_{c} \cdot λ, ϕ_{d} \cdot κ_{d} = η_{d} \cdot λ, \\ β_{1} \cdot ϕ_{1} \cdot κ_{1} + β_{2} \cdot ϕ_{2} \cdot κ_{2} + β_{3} \cdot ϕ_{3} \cdot κ_{3} + β_{4} \cdot ϕ_{4} \cdot κ_{4} + β_{6} \cdot ϕ_{6} \cdot κ_{6} + β_{a 1} \cdot ϕ_{a 1} \cdot κ_{a 1} = J_{E}^{(N)}, \\ J_{E}^{(N)} = η_{E} \cdot λ + w_{0}, λ = ϕ_{R} \cdot κ_{t}, J_{r}^{(N)} = ϕ_{4} \cdot κ_{4} = ϕ_{M T} \cdot κ_{M T}, J_{f}^{(N)} = ϕ_{6} \cdot κ_{6}, \\ ϕ_{R} + ϕ_{A} + ϕ_{1} + ϕ_{2} + ϕ_{3} + ϕ_{4} + ϕ_{5} + ϕ_{6} + ϕ_{M T} + ϕ_{a 1} + ϕ_{a 2} + ϕ_{b} + ϕ_{c} + ϕ_{d} = ϕ_{max} . \end{cases}

Here, Equation S28 still holds, and we have:

{\begin{aligned} J_{r}^{(E)} + J_{f}^{(E)} = φ \cdot λ + w_{0}, \\ \frac{J_{r}^{(E)}}{ε_{r}} + \frac{J_{f}^{(E)}}{ε_{f}} = ϕ_{max} - ψ \cdot λ, \end{aligned}

where $J_{r}^{(E)}$ and $J_{f}^{(E)}$ follow Equation S30, and $ψ$ and $φ$ satisfy Equation S32 and S33, respectively. The expression for $ε_{f}$ follows Equation S31. However, the expression for $ε_{r}$ differs from that in Equation S31. For yeast and mammalian cells, we have:

{\begin{aligned} ε_{r} & = \frac{β_{r}^{(A)}}{1 / κ_{A} + 1 / κ_{1} + 2 / κ_{2} + 2 / κ_{3} + 2 / κ_{4} + 2 / κ_{MT}}, \\ ε_{f} & = \frac{β_{f}^{(A)}}{1 / κ_{A} + 1 / κ_{1} + 2 / κ_{2} + 2 / κ_{6}} . \end{aligned}

At the single-cell level, from Equation S169, and similar to Equation S61-S63, if $ε_{r} > ε_{f}$ , the optimal growth strategy is:

{\begin{cases} J_{f}^{(E)} = 0, \\ J_{r}^{(E)} = φ \cdot λ + w_{0}, \end{cases} ε_{r} > ε_{f},

while if $ε_{f} > ε_{r}$ , the optimal growth strategy is:

{\begin{cases} J_{f}^{(E)} = φ \cdot λ + w_{0}, \\ J_{r}^{(E)} = 0. \end{cases} ε_{r} < ε_{f} .

In both cases, the growth rate $λ$ reaches its maximum value for a given nutrient condition with fixed $κ_{A}$ :

λ (κ_{A}) = {\begin{aligned} \frac{ϕ_{m a x} - w_{0} / ε_{r} (κ_{A})}{φ / ε_{r} (κ_{A}) + ψ (κ_{A})} ε_{r} (κ_{A}) > ε_{f} (κ_{A}), \\ \frac{ϕ_{m a x} - w_{0} / ε_{f} (κ_{A})}{φ / ε_{f} (κ_{A}) + ψ (κ_{A})} ε_{r} (κ_{A}) < ε_{f} (κ_{A}) . \end{aligned}

From Equation S170, when $κ_{A}$ is very small such that $κ_{A} \to 0$ , it is evident that for yeast and mammalian cells, we still have:

{\begin{matrix} ε_{r} (κ_{A} \to 0) \approx β_{r}^{(A)} \cdot κ_{A}, \\ ε_{f} (κ_{A} \to 0) \approx β_{f}^{(A)} \cdot κ_{A} . \end{matrix}

Thus,

ε_{r} (κ_{A} \to 0) > ε_{f} (κ_{A} \to 0),

since $β_{r}^{(A)} ≫ β_{f}^{(A)}$ still holds. Then, as long as $ε_{r} (κ_{A}^{m a x}) < ε_{f} (κ_{A}^{m a x})$ , there exists a critical switching point for $κ_{A}$ (denoted as $κ_{A}^{(C)}$ ; see Equation S41), below which respiration is more efficient, while above $κ_{A}^{(C)}$ , fermentation becomes more efficient in ATP production per proteome. Combined with Equation S170, we have:

κ_{A}^{(C)} = \frac{β_{r}^{(A)} - β_{f}^{(A)}}{β_{f}^{(A)} (1 / κ_{1} + 2 / κ_{2} + 2 / κ_{3} + 2 / κ_{4} + 2 / κ_{MT}) - β_{r}^{(A)} (1 / κ_{1} + 2 / κ_{2} + 2 / κ_{6})} .

Accordingly, we obtain the expressions for $ε_{r} (κ_{A}^{(C)})$ , $ε_{f} (κ_{A}^{(C)})$ and $λ_{C}$ (i.e. $λ (κ_{A}^{(C)})$ ):

{\begin{aligned} ε_{r} (κ_{A}^{(C)}) = ε_{f} (κ_{A}^{(C)}) = \frac{β_{r}^{(A)} - β_{f}^{(A)}}{2 (1 / κ_{3} + 1 / κ_{4} + 1 / κ_{MT} - 1 / κ_{6})}, \\ λ_{C} = \frac{ϕ_{m a x} - w_{0} / ε_{r / f} (κ_{A}^{(C)})}{φ / ε_{r / f} (κ_{A}^{(C)}) + ψ (κ_{A}^{(C)})}, \end{aligned}

Consequently, yeast and tumor cells would preferentially use respiration under starvation conditions (where $ε_{r} > ε_{f}$ ), yet switch to aerobic glycolysis when nutrients are abundant (where $ε_{r} < ε_{f}$ ) for optimal cell growth. This qualitatively illustrates the Crabtree effect in yeast and the Warburg effect in tumors.

At the cell population level, cell heterogeneity resulting from intrinsic and extrinsic noise causes the turnover numbers (i.e. $k_{cat}$ ) of enzymes and the critical growth rates at the transition point ( $λ_{C}$ ) to follow distributions, which we assume to be Gaussian (see Equation S45, Appendices 3.3 and 8.1). Due to the higher level of heterogeneity observed in tumor cells (Duraj et al., 2021; Shibao et al., 2018; Hanahan and Weinberg, 2011; Hensley et al., 2016) and yeast (Bagamery et al., 2020) compared to E. coli, the extent of noise—and thus the CVs of $k_{cat}$ and $λ_{C}$ —in yeast and tumor cells are expected to be larger than those in E. coli. The growth rate dependence of the normalized energy fluxes is as follows:

{\begin{aligned} J_{f}^{(E)} (λ) & = \frac{1}{2} (φ \cdot λ + w_{0}) \cdot [erf (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}}) + 1], \\ J_{r}^{(E)} (λ) & = \frac{1}{2} (φ \cdot λ + w_{0}) \cdot [1 - erf (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}})], \end{aligned}

where $μ_{λ_{C}}$ and $σ_{λ_{C}}$ are the mean and standard deviation of $λ_{C}$ , respectively, similar to the case of E. coli. Therefore, the growth rate dependence of the normalized fluxes is:

{\begin{aligned} J_{f}^{(N)} (λ) & = \frac{φ \cdot λ + w_{0}}{β_{f}^{(A)}} \cdot [erf (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}}) + 1], \\ J_{r}^{(N)} (λ) & = \frac{φ \cdot λ + w_{0}}{β_{r}^{(A)}} \cdot [1 - erf (\frac{λ - μ_{λ_{C}}}{\sqrt{2} σ_{λ_{C}}})] . \end{aligned}

Combined with Equation S160, Equation S179 can be compared to experimental results, although in practice, it is difficult to tune the growth rate of tumor cells in vivo in experiments.

Recently, Shen et al., 2024 reported that in many yeast and tumor cells, the measured proteome efficiencies in respiration at the cell population level are higher than the corresponding proteome efficiencies in fermentation, even though aerobic glycolysis fermentation fluxes still occur. This finding apparently contradicts prevalent explanations (Basan et al., 2015; Chen and Nielsen, 2019), which assert that overflow metabolism originates from the proteome efficiency in fermentation always being higher than in respiration.

Our model can resolve the puzzle above based on two important features: First, our model predicts that as long as ATP generation per glucose in respiration is higher than in fermentation (i.e. $β_{r}^{(A)} > β_{f}^{(A)}$ ), which definitely holds true for all organisms, the proteome efficiency in respiration is higher than that in fermentation when the nutrient quality $κ_{A}$ is low (see Equations S37-S38 and S174-S175). Second, and importantly, due to cell heterogeneity at the population level, a subset of cells exhibiting greater proteome efficiency in fermentation compared to respiration could exist, even if the proteome efficiency at the cell population level in respiration is higher than in fermentation.

To facilitate comparison between our model and the experiments of Shen et al., 2024, we define ${Pr}_{f}$ as the proportion of ATP generated from fermentation, and $\bar{Δ}$ as the proteome efficiency difference between respiration and fermentation, with

{Pr}_{f} \equiv \frac{J_{f}^{(E)}}{J_{f}^{(E)} + J_{r}^{(E)}},

and

\bar{Δ} \equiv 1 / ε_{r} - 1 / ε_{f} .

At the cell population level, $ε_{r}$ , $ε_{f}$ , $1 / ε_{r}$ , and $1 / ε_{f}$ roughly follow Gaussian distributions (see Appendix 8.1 and Equation S170), with

{\begin{cases} ε_{r} \sim N (μ_{ε_{r}}, σ_{ε_{r}}^{2}), ε_{f} \sim N (μ_{ε_{f}}, σ_{ε_{f}}^{2}), \\ 1 / ε_{r} \sim N (μ_{1 / ε_{r}}, σ_{1 / ε_{r}}^{2}), 1 / ε_{f} \sim N (μ_{1 / ε_{f}}, σ_{1 / ε_{f}}^{2}) . \end{cases}

Here, $σ_{ε_{r}}$ , $σ_{ε_{f}}$ , $σ_{1 / ε_{r}}$ , $σ_{1 / ε_{f}}$ , and $μ_{ε_{r}}$ , $μ_{ε_{f}}$ , $μ_{1 / ε_{r}}$ , $μ_{1 / ε_{f}}$ are the standard deviations and mean values of $ε_{r}$ , $ε_{f}$ , $1 / ε_{r}$ and $1 / ε_{f}$ , respectively. Thus,

{\begin{matrix} μ_{ε_{r}} = 〈 ε_{r} 〉, \\ μ_{ε_{f}} = 〈 ε_{f} 〉, \end{matrix}

where the angle bracket ‘ $⟨ ⟩$ ’ represents the average over the cell population, and $〈 ε_{r} 〉$ and $〈 ε_{f} 〉$ are the population-averaged values of $ε_{r}$ and $ε_{f}$ , respectively, which are both measurable in experiments. From the derivations shown in Appendix 8.1, we approximately have

{\begin{matrix} μ_{1 / ε_{r}} = 1 / μ_{ε_{r}} = 1 / ⟨ ε_{r} ⟩, \\ μ_{1 / ε_{f}} = 1 / μ_{ε_{f}} = 1 / ⟨ ε_{f} ⟩ . \end{matrix}

Here, we use $χ_{ε_{r}}$ , $χ_{ε_{f}}$ , $χ_{1 / ε_{r}}$ and $χ_{1 / ε_{f}}$ to represent the CVs of $ε_{r}$ , $ε_{f}$ , $1 / ε_{r}$ and $1 / ε_{f}$ , respectively, with

{\begin{cases} χ_{ε_{r}} = σ_{ε_{r}} / μ_{ε_{r}}, χ_{ε_{f}} = σ_{ε_{f}} / μ_{ε_{f}}, \\ χ_{1 / ε_{r}} = σ_{1 / ε_{r}} / μ_{1 / ε_{f}}, χ_{1 / ε_{f}} = σ_{1 / ε_{f}} / μ_{1 / ε_{f}} \end{cases}

Similar to Equation S151, the CVs of $1 / ε_{r}$ and $1 / ε_{f}$ are roughly equal to those of $ε_{r}$ and $ε_{f}$ , respectively. Thus,

{\begin{cases} χ_{1 / ε_{r}} \approx χ_{ε_{r}}, \\ χ_{1 / ε_{f}} \approx χ_{ε_{f}} . \end{cases}

Combining Equations S181 and S182, and using the properties of Gaussian distributions, $\bar{Δ}$ follows a Gaussian distribution:

\bar{Δ} \sim N (μ_{\bar{Δ}}, σ_{\bar{Δ}}^{2}),

where $μ_{\bar{Δ}}$ and $σ_{\bar{Δ}}$ are the mean and standard deviation of $\bar{Δ}$ , respectively. Evidently, we have

{\begin{matrix} μ_{\bar{Δ}} = μ_{1 / ε_{r}} - μ_{1 / ε_{f}}, \\ σ_{\bar{Δ}}^{2} = σ_{1 / ε_{r}}^{2} + σ_{1 / ε_{f}}^{2} . \end{matrix}

Then, we proceed to calculate the relation between ${Pr}_{f}$ and $\bar{Δ}$ using Equation S187, and hence we obtain:

{Pr}_{f} = \int_{0}^{+ \infty} \frac{1}{σ_{\bar{Δ}} \sqrt{2 π}} e^{- \frac{1}{2} {(\frac{x - μ_{\bar{Δ}}}{σ_{\bar{Δ}}})}^{2}} d x = \frac{1}{2} [erf (\frac{μ_{\bar{Δ}}}{\sqrt{2} σ_{\bar{Δ}}}) + 1] .

Combining Equations S180, S183-S185, and S188-S189, we have:

\frac{J_{f}^{(E)}}{J_{f}^{(E)} + J_{r}^{(E)}} = \frac{1}{2} [erf (\frac{1 - ⟨ ε_{r} ⟩ / ⟨ ε_{f} ⟩}{\sqrt{2} \cdot \sqrt{χ_{ε_{r}}^{2} + χ_{ε_{f}}^{2} \cdot {(⟨ ε_{r} ⟩ / ⟨ ε_{f} ⟩)}^{2}}}) + 1] .

Note that the normalized energy fluxes $J_{r}^{(E)}$ and $J_{f}^{(E)}$ are proportional to the measured ATP fluxes generated in respiration and fermentation, respectively. Hence, Equation S190 can be directly compared to experimental data. For yeast and tumor cells, due to a higher level of heterogeneity, the CVs of $ε_{r}$ and $ε_{f}$ , i.e., $χ_{ε_{r}}$ and $χ_{ε_{f}}$ , could be significantly higher than the corresponding values in E. coli, though their exact values are unknown. Consequently, we plot theoretical results with the values of $χ_{ε_{r}}$ and $χ_{ε_{f}}$ chosen as 0.25, 0.40, and 0.58 to compare with the experimental data for yeast and in vivo mouse tumors (Bartman et al., 2023; Shen et al., 2024). In Figure 5A, B, we observe that the theoretical results using $χ_{ε_{r}} = χ_{ε_{f}} = 0.58$ agree well with the experimental data (Bartman et al., 2023; Shen et al., 2024), both on a log scale and linear scale. This demonstrates that our model has the potential to quantitatively illustrate the Crabtree effect in yeast and the Warburg effect in tumors.

Appendix 11

Notes on the application of reference data

Data calibration

Throughout our manuscript, we use experimental data from the original references, except for two calibrations. The first calibration is noted in the footnote of Appendix 1—table 2. With this calibration, the $J_{acetate}^{(M)} - λ$ data for E. coli (Basan et al., 2015) in Appendix 1—table 2 align with the curve shown in Figure 1C, which includes experimental data for E. coli from other sources. The second calibration applies to the data shown in Figures 3F and 1C (chemostat data for E. coli). The unit in the original reference (Holms, 1996) is mmol/(dry mass)g/h. To convert this to the unit mM/OD₆₀₀/h, used in our text, the conversion factor should be 0.18. Here, we deduce that only 60% of the measured dry biomass in centrifuged material is effective when calibrating with other experimental results. Therefore, there is a calibration factor of 0.6, and the conversion factor changes to 0.3.

Data from the inducible strains

Some of the experimental data in the original references (Basan et al., 2015; Hui et al., 2015) were obtained using E. coli strains with titratable systems (e.g. titratable ptsG, LacY). The $J_{acetate}^{(M)} - λ$ relation of these inducible strains generally aligns with the same curve as that of wild-type E. coli (Figure 1C). Since evolutionary treatment was not applied to the inducible strains, we approximate titration perturbation as a technique that mimics culturing the strains in a less efficient Group A carbon source.

Experimental data sources

The batch culture data for E. coli shown in Figure 1C (labeled as minimum/rich media or inducible strains) and Appendix 1—figure 2C were taken from the source data of the reference’s figure 1 (Basan et al., 2015). The chemostat data for E. coli shown in Figure 1C were taken from the reference’s table 7 (Holms, 1996). The data for E. coli shown in Figure 1D were taken from the reference’s extended data figure 3a (Basan et al., 2015), with the calibration specified in the footnote to Appendix 1—table 2.

The data for E. coli shown in Figure 2A were adopted from the reference’s extended data figure 4a–b (Basan et al., 2015). The data for E. coli shown in Figure 2B were taken from the source data of the reference’s figure 2a (Basan et al., 2015). The data for E. coli shown in Figure 2C were taken from the source data of the reference’s figure 3a (Basan et al., 2015). The data for E. coli shown in Figure 3A–B were taken from the source data of the reference’s figure 3d (Basan et al., 2015).

The data for E. coli shown in Figure 3C–D and Appendix 1—figure 2D-E were taken from the source data of the reference’s figure 3c (Basan et al., 2015). The data for E. coli shown in Figure 3F were taken from the reference’s table 7 (Holms, 1996), with a calibration factor specified in the above paragraph (‘Data calibration’).

The data for E. coli shown in Figure 4A–B and Appendix 1—figure 3A-D were taken from the reference’s table S2 with the label ‘C-lim’ (Hui et al., 2015). We excluded the reference’s data with λ=0.45205 h^–1 as there are other unconsidered factors involved during slow growth (Dai et al., 2017) (for λ<0.5 h^–1), and we suspect that unknown calibration factors may exist. The data for E. coli shown in Figure 4C–D and Appendix 1—figure 3E-N were adopted from the reference’s extended data figure 6-7 (Basan et al., 2015).

The batch culture data for yeast shown in Figure 5 were derived from the source data of the reference’s extended figure 4c-d (Shen et al., 2024). The chemostat data for yeast shown in Figure 5 were derived from the source data of the reference’s figure 3d-e (Shen et al., 2024), where glucose is the limiting nutrient. We excluded the reference’s data for I. orientalis under condition C2, where the ATP flux was abnormally small. The mouse tumor in vivo data shown in Figure 5 were derived from the source data of the reference’s figure 4e-g (Shen et al., 2024), which were originally reported by Bartman et al., 2023, the same research group as Shen et al., 2024. We did not include the cancer cell line data shown in figure 4a-c of Shen et al., 2024 since it appears that the proteomic data and flux data were obtained from two different references with inconsistent culturing conditions.

The gene names of E. coli depicted in Appendix 1—figure 1B were identified using the KEGG database. The data for E. coli shown in Appendix 1—figure 2G were drawn from Appendix 1—table 1, which includes the original references themselves. The flux data for E. coli presented in Appendix 1—table 2 were obtained from the reference’s extended data figure 3a (Basan et al., 2015), with the calibration specified in the footnote. The proteome data for E. coli shown in Appendix 1—table 2 were taken from the reference’s supplementary Table N5 (Basan et al., 2015).

Data availability

All study data are included in the manuscript and supporting files. All model results were generated using analytical formulas, with the relevant formulas and parameters specified in the manuscript and appendices. Source data files have been provided for Figures 1–5 and Appendix 1—figures 2–4.

References

1. Ackermann
(2015) A functional perspective on phenotypic heterogeneity in microorganisms
Nature Reviews Microbiology 13:497–508.

https://doi.org/10.1038/nrmicro3491
- PubMed
- Google Scholar
(2020) A putative bet-hedging strategy buffers budding yeast against environmental instability
Current Biology 30:4563–4578.

https://doi.org/10.1016/j.cub.2020.08.092
- PubMed
- Google Scholar
1. Balaban NQ
2. Merrin J
3. Chait R
4. Kowalik L
5. Leibler S
(2004) Bacterial persistence as a phenotypic switch
Science 305:1622–1625.

https://doi.org/10.1126/science.1099390
- PubMed
- Google Scholar
1. Bartman CR
2. Weilandt DR
3. Shen Y
4. Lee WD
5. Han Y
6. TeSlaa T
7. Jankowski CSR
8. Samarah L
9. Park NR
10. da Silva-Diz V
11. Aleksandrova M
12. Gultekin Y
13. Marishta A
14. Wang L
15. Yang L
16. Roichman A
17. Bhatt V
18. Lan T
19. Hu Z
20. Xing X
21. Lu W
22. Davidson S
23. Wühr M
24. Vander Heiden MG
25. Herranz D
26. Guo JY
27. Kang Y
28. Rabinowitz JD
(2023) Slow TCA flux and ATP production in primary solid tumours but not metastases
Nature 614:349–357.

https://doi.org/10.1038/s41586-022-05661-6
- PubMed
- Google Scholar
1. Basan M
2. Hui S
3. Okano H
4. Zhang Z
5. Shen Y
6. Williamson JR
7. Hwa T
(2015) Overflow metabolism in Escherichia coli results from efficient proteome allocation
Nature 528:99–104.

https://doi.org/10.1038/nature15765
- PubMed
- Google Scholar
1. Basan M
2. Honda T
3. Christodoulou D
4. Hörl M
5. Chang YF
6. Leoncini E
7. Mukherjee A
8. Okano H
9. Taylor BR
10. Silverman JM
11. Sanchez C
12. Williamson JR
13. Paulsson J
14. Hwa T
15. Sauer U
(2020) A universal trade-off between growth and lag in fluctuating environments
Nature 584:470–474.

https://doi.org/10.1038/s41586-020-2505-4
- PubMed
- Google Scholar
(2009) Absolute metabolite concentrations and implied enzyme active site occupancy in Escherichia coli
Nature Chemical Biology 5:593–599.

https://doi.org/10.1038/nchembio.186
- PubMed
- Google Scholar
1. Chen Y
2. Nielsen J
(2019) Energy metabolism controls phenotypes by protein efficiency and allocation
PNAS 116:17592–17597.

https://doi.org/10.1073/pnas.1906569116
- PubMed
- Google Scholar
1. Dai X
2. Zhu M
3. Warren M
4. Balakrishnan R
5. Patsalo V
6. Okano H
7. Williamson JR
8. Fredrick K
9. Wang Y-P
10. Hwa T
(2017) Reduction of translating ribosomes enables Escherichia coli to maintain elongation rates during slow growth
Nature Microbiology 2:16231.

https://doi.org/10.1038/nmicrobiol.2016.231
- PubMed
- Google Scholar
1. Davidi D
2. Noor E
3. Liebermeister W
4. Bar-Even A
5. Flamholz A
6. Tummler K
7. Barenholz U
8. Goldenfeld M
9. Shlomi T
10. Milo R
(2016) Global characterization of in vivo enzyme catalytic rates and their correspondence to in vitro kcat measurements
PNAS 113:3401–3406.

https://doi.org/10.1073/pnas.1514240113
- PubMed
- Google Scholar
1. DeBerardinis RJ
2. Chandel NS
(2020) We need to talk about the Warburg effect
Nature Metabolism 2:127–129.

https://doi.org/10.1038/s42255-020-0172-2
- PubMed
- Google Scholar
1. De Deken RH
(1966) The Crabtree effect: a regulatory system in yeast
Journal of General Microbiology 44:149–156.

https://doi.org/10.1099/00221287-44-2-149
- PubMed
- Google Scholar
1. Dekel E
2. Alon U
(2005) Optimality and evolutionary tuning of the expression level of a protein
Nature 436:588–592.

https://doi.org/10.1038/nature03842
- PubMed
- Google Scholar
1. Donachie WD
(1968) Relationship between cell size and time of initiation of DNA replication
Nature 219:1077–1079.

https://doi.org/10.1038/2191077a0
- PubMed
- Google Scholar
(2021) Beyond the warburg effect: oxidative and glycolytic phenotypes coexist within the metabolic heterogeneity of glioblastoma
Cells 10:202.

https://doi.org/10.3390/cells10020202
- PubMed
- Google Scholar
1. Ebenhöh O
2. Ebeling J
3. Meyer R
4. Pohlkotte F
5. Nies T
(2024) Microbial pathway thermodynamics: stoichiometric models unveil anabolic and catabolic processes
Life 14:247.

https://doi.org/10.3390/life14020247
- PubMed
- Google Scholar
(2001) In silico predictions of Escherichia coli metabolic capabilities are consistent with experimental data
Nature Biotechnology 19:125–130.

https://doi.org/10.1038/84379
- Google Scholar
(2002) Stochastic gene expression in a single cell
Science 297:1183–1186.

https://doi.org/10.1126/science.1070919
- PubMed
- Google Scholar
1. Escalante-Chong R
2. Savir Y
3. Carroll SM
4. Ingraham JB
5. Wang J
6. Marx CJ
7. Springer M
(2015) Galactose metabolic genes in yeast respond to a ratio of galactose and glucose
PNAS 112:1636–1641.

https://doi.org/10.1073/pnas.1418058112
- PubMed
- Google Scholar
1. Folks JL
2. Chhikara RS
(1978) The inverse gaussian distribution and its statistical application—a review
Journal of the Royal Statistical Society Series B 40:263–275.

https://doi.org/10.1111/j.2517-6161.1978.tb01039.x
- Google Scholar
(2012) Why in vivo may not equal in vitro - new effectors revealed by measurement of enzymatic activities under the same in vivo-like assay conditions
The FEBS Journal 279:4145–4159.

https://doi.org/10.1111/febs.12007
- PubMed
- Google Scholar
1. Gillespie DT
(2000) The chemical Langevin equation
The Journal of Chemical Physics 113:297–306.

https://doi.org/10.1063/1.481811
- Google Scholar
1. Hanahan D
2. Weinberg RA
(2011) Hallmarks of cancer: the next generation
Cell 144:646–674.

https://doi.org/10.1016/j.cell.2011.02.013
- PubMed
- Google Scholar
1. Hensley CT
2. Faubert B
3. Yuan Q
4. Lev-Cohain N
5. Jin E
6. Kim J
7. Jiang L
8. Ko B
9. Skelton R
10. Loudat L
11. Wodzak M
12. Klimko C
13. McMillan E
14. Butt Y
15. Ni M
16. Oliver D
17. Torrealba J
18. Malloy CR
19. Kernstine K
20. Lenkinski RE
21. DeBerardinis RJ
(2016) Metabolic heterogeneity in human lung tumors
Cell 164:681–694.

https://doi.org/10.1016/j.cell.2015.12.034
- PubMed
- Google Scholar
1. Holms H
(1996) Flux analysis and control of the central metabolic pathways in Escherichia coli
FEMS Microbiology Reviews 19:85–116.

https://doi.org/10.1111/j.1574-6976.1996.tb00255.x
- PubMed
- Google Scholar
1. Hui S
2. Silverman JM
3. Chen SS
4. Erickson DW
5. Basan M
6. Wang J
7. Hwa T
8. Williamson JR
(2015) Quantitative proteomic analysis reveals a simple strategy of global resource allocation in bacteria
Molecular Systems Biology 11:784.

https://doi.org/10.15252/msb.20145697
- PubMed
- Google Scholar
Book
1. Kampen NG
(1992)
Stochastic Processes in Physics and Chemistry

Elsevier.
- Google Scholar
1. Kiviet DJ
2. Nghe P
3. Walker N
4. Boulineau S
5. Sunderlikova V
6. Tans SJ
(2014) Stochasticity of metabolism and growth at the single-cell level
Nature 514:376–379.

https://doi.org/10.1038/nature13582
- PubMed
- Google Scholar
1. Kostinski S
2. Reuveni S
(2020) Ribosome composition maximizes cellular growth rates in E. coli
Physical Review Letters 125:028103.

https://doi.org/10.1103/PhysRevLett.125.028103
- PubMed
- Google Scholar
1. Kussell E
2. Leibler S
(2005) Phenotypic diversity, population growth, and information in fluctuating environments
Science 309:2075–2078.

https://doi.org/10.1126/science.1114383
- PubMed
- Google Scholar
1. Li SHJ
2. Li Z
3. Park JO
4. King CG
5. Rabinowitz JD
6. Wingreen NS
7. Gitai Z
(2018) Escherichia coli translation strategies differ across carbon, nitrogen and phosphorus limitation conditions
Nature Microbiology 3:939–947.

https://doi.org/10.1038/s41564-018-0199-2
- PubMed
- Google Scholar
1. Liberti MV
2. Locasale JW
(2016) The warburg effect: how does it benefit cancer cells?
Trends in Biochemical Sciences 41:287.

https://doi.org/10.1016/j.tibs.2016.01.004
- PubMed
- Google Scholar
1. Liu X
2. Wang X
3. Yang X
4. Liu S
5. Jiang L
6. Qu Y
7. Hu L
8. Ouyang Q
9. Tang C
(2015) Reliable cell cycle commitment in budding yeast is ensured by signal integration
eLife 4:e03977.

https://doi.org/10.7554/eLife.03977
- PubMed
- Google Scholar
1. Locasale JW
2. Cantley LC
(2010) Altered metabolism in cancer
BMC Biology 8:88.

https://doi.org/10.1186/1741-7007-8-88
- PubMed
- Google Scholar
1. Majewski RA
2. Domach MM
(1990) Simple constrained‐optimization view of acetate overflow in E. coli
Biotechnology and Bioengineering 35:732–738.

https://doi.org/10.1002/bit.260350711
- Google Scholar
(1984) Acetate formation in continuous culture of Escherichia coli K12 D1 on defined and complex media
Journal of Biotechnology 1:355–358.

https://doi.org/10.1016/0168-1656(84)90027-0
- Google Scholar
Book
1. Milo R
2. Phillips R
(2015) Cell Biology by the Numbers
Garland Science.

https://doi.org/10.1201/9780429258770
- Google Scholar
(2009) Shifts in growth strategies reflect tradeoffs in cellular economics
Molecular Systems Biology 5:323.

https://doi.org/10.1038/msb.2009.82
- PubMed
- Google Scholar
1. Mori M
2. Schink S
3. Erickson DW
4. Gerland U
5. Hwa T
(2017) Quantifying the benefit of a proteome reserve in fluctuating environments
Nature Communications 8:1225.

https://doi.org/10.1038/s41467-017-01242-8
- PubMed
- Google Scholar
(2014) Enzyme allocation problems in kinetic metabolic networks: optimal solutions are elementary flux modes
Journal of Theoretical Biology 347:182–190.

https://doi.org/10.1016/j.jtbi.2013.11.015
- PubMed
- Google Scholar
(2006) Nonlinear dependency of intracellular fluxes on growth rate in miniaturized continuous cultures of Escherichia coli
Applied and Environmental Microbiology 72:1164–1172.

https://doi.org/10.1128/AEM.72.2.1164-1172.2006
- Google Scholar
Book
(1990)
Physiology of the Bacterial Cell

Sinauer Associates, Inc.
- Google Scholar
Book
1. Neidhardt FC
(1996)
Escherichia coli and Salmonella: Cellular and Molecular Biology

ASM Press.
- Google Scholar
Book
1. Nelson DL
2. Cox MM
(2008)
Lehninger Principles of Biochemistry

Macmillan.
- Google Scholar
(2019) An upper limit on Gibbs energy dissipation governs cellular metabolism
Nature Metabolism 1:125–132.

https://doi.org/10.1038/s42255-018-0006-7
- Google Scholar
(2013) Analysis of fluorescent reporters indicates heterogeneity in glucose uptake and utilization in clonal bacterial populations
BMC Microbiology 13:258.

https://doi.org/10.1186/1471-2180-13-258
- PubMed
- Google Scholar
1. Park JO
2. Rubin SA
3. Xu Y-F
4. Amador-Noguez D
5. Fan J
6. Shlomi T
7. Rabinowitz JD
(2016) Metabolite concentrations, fluxes and free energies imply efficient enzyme usage
Nature Chemical Biology 12:482–489.

https://doi.org/10.1038/nchembio.2077
- PubMed
- Google Scholar
(2025) Single-cell data reveal heterogeneity of investment in ribosomes across a bacterial population
Nature Communications 16:285.

https://doi.org/10.1038/s41467-024-55394-5
- PubMed
- Google Scholar
1. Peebo K
2. Valgepea K
3. Maser A
4. Nahku R
5. Adamberg K
6. Vilu R
(2015) Proteome reallocation in Escherichia coli with increasing specific growth rate
Molecular BioSystems 11:1184–1193.

https://doi.org/10.1039/C4MB00721B
- Google Scholar
(2001) Cooperation and competition in the evolution of ATP-producing pathways
Science 292:504–507.

https://doi.org/10.1126/science.1058079
- Google Scholar
1. Sauer U
2. Canonaco F
3. Heri S
4. Perrenoud A
5. Fischer E
(2004) The soluble and membrane-bound transhydrogenases UdhA and PntAB have divergent functions in NADPH metabolism of Escherichia coli
The Journal of Biological Chemistry 279:6613–6619.

https://doi.org/10.1074/jbc.M311657200
- PubMed
- Google Scholar
1. Scott M
2. Gunderson CW
3. Mateescu EM
4. Zhang Z
5. Hwa T
(2010) Interdependence of cell growth and gene expression: origins and consequences
Science 330:1099–1102.

https://doi.org/10.1126/science.1192588
- Google Scholar
1. Shen Y
2. Dinh HV
3. Cruz ER
4. Chen Z
5. Bartman CR
6. Xiao T
7. Call CM
8. Ryseck R-P
9. Pratas J
10. Weilandt D
11. Baron H
12. Subramanian A
13. Fatma Z
14. Wu Z-Y
15. Dwaraknath S
16. Hendry JI
17. Tran VG
18. Yang L
19. Yoshikuni Y
20. Zhao H
21. Maranas CD
22. Wühr M
23. Rabinowitz JD
(2024) Mitochondrial ATP generation is more proteome efficient than glycolysis
Nature Chemical Biology 20:1123–1132.

https://doi.org/10.1038/s41589-024-01571-y
- PubMed
- Google Scholar
1. Shibao S
2. Minami N
3. Koike N
4. Fukui N
5. Yoshida K
6. Saya H
7. Sampetrean O
(2018) Metabolic heterogeneity and plasticity of glioma stem cells in a mouse glioblastoma model
Neuro-Oncology 20:343–354.

https://doi.org/10.1093/neuonc/nox170
- PubMed
- Google Scholar
1. Shlomi T
2. Benyamini T
3. Gottlieb E
4. Sharan R
5. Ruppin E
(2011) Genome-scale metabolic modeling elucidates the role of proliferative adaptation in causing the Warburg effect
PLOS Computational Biology 7:e1002018.

https://doi.org/10.1371/journal.pcbi.1002018
- PubMed
- Google Scholar
1. Solopova A
2. van Gestel J
3. Weissing FJ
4. Bachmann H
5. Teusink B
6. Kok J
7. Kuipers OP
(2014) Bet-hedging during bacterial diauxic shift
PNAS 111:7427–7432.

https://doi.org/10.1073/pnas.1320063111
- PubMed
- Google Scholar
(2016) General calibration of microbial growth in microplate readers
Scientific Reports 6:38828.

https://doi.org/10.1038/srep38828
- PubMed
- Google Scholar
1. Towbin BD
2. Korem Y
3. Bren A
4. Doron S
5. Sorek R
6. Alon U
(2017) Optimality and sub-optimality in a bacterial growth law
Nature Communications 8:14123.

https://doi.org/10.1038/ncomms14123
- PubMed
- Google Scholar
1. Valgepea K
2. Adamberg K
3. Nahku R
4. Lahtvee P-J
5. Arike L
6. Vilu R
(2010) Systems biology approach reveals that overflow metabolism of acetate in Escherichia coli is triggered by carbon catabolite repression of acetyl-CoA synthetase
BMC Systems Biology 4:166.

https://doi.org/10.1186/1752-0509-4-166
- PubMed
- Google Scholar
(2009) Understanding the Warburg effect: the metabolic requirements of cell proliferation
Science 324:1029–1033.

https://doi.org/10.1126/science.1160809
- PubMed
- Google Scholar
(1998) Effects of pyruvate decarboxylase overproduction on flux distribution at the pyruvate branch point in Saccharomyces cerevisiae
Applied and Environmental Microbiology 64:2133–2140.

https://doi.org/10.1128/AEM.64.6.2133-2140.1998
- PubMed
- Google Scholar
1. Varma A
2. Palsson BO
(1994) Stoichiometric flux balance models quantitatively predict growth and metabolic by-product secretion in wild-type Escherichia coli W3110
Applied and Environmental Microbiology 60:3724–3731.

https://doi.org/10.1128/aem.60.10.3724-3731.1994
- PubMed
- Google Scholar
1. Vazquez A
2. Liu J
3. Zhou Y
4. Oltvai ZN
(2010) Catabolic efficiency of aerobic glycolysis: the Warburg effect revisited
BMC Systems Biology 4:58.

https://doi.org/10.1186/1752-0509-4-58
- PubMed
- Google Scholar
1. Vazquez A
2. Oltvai ZN
(2016) Macromolecular crowding explains overflow metabolism in cells
Scientific Reports 6:31007.

https://doi.org/10.1038/srep31007
- PubMed
- Google Scholar
1. Wallden M
2. Fange D
3. Lundius EG
4. Baltekin Ö
5. Elf J
(2016) The synchronization of replication and division cycles in individual E. coli Cells
Cell 166:729–739.

https://doi.org/10.1016/j.cell.2016.06.052
- PubMed
- Google Scholar
1. Wang X
2. Xia K
3. Yang X
4. Tang C
(2019) Growth strategy of microbes on mixed carbon sources
Nature Communications 10:1279.

https://doi.org/10.1038/s41467-019-09261-3
- PubMed
- Google Scholar
1. Warburg O
(1924) Über den Stoffwechsel der Carcinomzelle
Die Naturwissenschaften 12:1131–1137.

https://doi.org/10.1007/BF01504608
- Google Scholar
1. Wehrens M
2. Krah LHJ
3. Towbin BD
4. Hermsen R
5. Tans SJ
(2023) The interplay between metabolic stochasticity and cAMP-CRP regulation in single E. coli cells
Cell Reports 42:113284.

https://doi.org/10.1016/j.celrep.2023.113284
- PubMed
- Google Scholar
(2014) Metabolic states with maximal specific rate carry flux through an elementary flux mode
The FEBS Journal 281:1547–1555.

https://doi.org/10.1111/febs.12722
- PubMed
- Google Scholar
1. Yaginuma H
2. Kawai S
3. Tabata KV
4. Tomiyama K
5. Kakizuka A
6. Komatsuzaki T
7. Noji H
8. Imamura H
(2014) Diversity in ATP concentrations in a single bacterial cell population revealed by quantitative single-cell imaging
Scientific Reports 4:6522.

https://doi.org/10.1038/srep06522
- PubMed
- Google Scholar
1. You C
2. Okano H
3. Hui S
4. Zhang Z
5. Kim M
6. Gunderson CW
7. Wang Y-P
8. Lenz P
9. Yan D
10. Hwa T
(2013) Coordination of bacterial proteome with metabolism by cyclic AMP signalling
Nature 500:301–306.

https://doi.org/10.1038/nature12446
- PubMed
- Google Scholar
(2018) Dynamic single-cell NAD(P)H measurement reveals oscillatory metabolism throughout the E. coli cell division cycle
Scientific Reports 8:2162.

https://doi.org/10.1038/s41598-018-20550-7
- PubMed
- Google Scholar
(2011) Economics of membrane occupancy and respiro-fermentation
Molecular Systems Biology 7:500.

https://doi.org/10.1038/msb.2011.34
- PubMed
- Google Scholar

Article and author information

Author details

Xin Wang

School of Physics, Sun Yat-sen University, Guangzhou, China

Contribution
Conceptualization, Resources, Data curation, Software, Formal analysis, Supervision, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing – original draft, Project administration, Writing – review and editing

For correspondence
wangxin36@mail.sysu.edu.cn

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0001-6479-395X

Funding

National Natural Science Foundation of China (12004443)

Xin Wang

National Natural Science Foundation of China (12474207)

Xin Wang

Guangzhou Municipal Science and Technology Bureau (202102020284)

Xin Wang

Sun Yat-sen University (The Hundred Talents Program)

Xin Wang

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

The author thanks Chao Tang, Qi Ouyang, Yang-Yu Liu, and Kang Xia for helpful discussions. This work was supported by the National Natural Science Foundation of China (Grant Nos.12004443 and 12474207), Guangzhou Municipal Innovation Fund (Grant No.202102020284), and the Hundred Talents Program of Sun Yat-sen University.

Version history

Sent for peer review: November 30, 2023
Preprint posted: December 14, 2023
Reviewed Preprint version 1: February 7, 2024
Reviewed Preprint version 2: December 19, 2024
Reviewed Preprint version 3: March 24, 2025
Version of Record published: June 5, 2025
Version of Record updated: June 12, 2025

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.94586. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

2,778

views
197

downloads
7

citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Citations by DOI

1

citation for umbrella DOI https://doi.org/10.7554/eLife.94586

2

citations for Reviewed Preprint v3 https://doi.org/10.7554/eLife.94586.3

4

citations for Version of Record https://doi.org/10.7554/eLife.94586.4

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Article PDF

Open citations (links to open the citations from this article in various online reference manager services)

Mendeley

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Xin Wang

(2025)

Overflow metabolism originates from growth optimization and cell heterogeneity

eLife 13:RP94586.

https://doi.org/10.7554/eLife.94586.4

Share this article

Cite this article

Model and results of overflow metabolism in E. coli.

Influence of protein overexpression on overflow metabolism in E. coli.

Influence of energy dissipation, translation inhibition, and carbon source category alteration on overflow metabolism in E. coli.

Relative protein expression of central metabolic enzymes in E. coli under carbon limitation and proteomic perturbation.

Model comparison with data on the Crabtree effect in yeast and the Warburg effect in tumors.

Molecular weight (MW) and in vivo/in vitro kcat data for E. coli.

Proteome and flux data (Basan et al., 2015) used to calculate the in vivo kcat of E. coli.

Illustrations of symbols in this manuscript.

Central metabolic network and carbon utilization pathways of E. coli.

Model and results for experimental comparison of E. coli.

Relative protein expression of central metabolic enzymes in E. coli under various types of perturbations.

Asymptotic distributions of inverse Gaussian distribution and the inverse of Gaussian distribution.

Carbon utilization in yeast and mammalian cells.

Author details

Xin Wang

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organisms

Molecular weight (MW) and in vivo/in vitro k_cat data for E. coli.

Proteome and flux data (Basan et al., 2015) used to calculate the in vivo k_cat of E. coli.