Abstract
A classic problem in metabolism is that fast-proliferating cells use seemingly wasteful fermentation to generate energy in the presence of sufficient oxygen. This counterintuitive phenomenon, known as overflow metabolism, or the Warburg effect in cancer, is universal across various organisms. Despite extensive research, its origin and function remain unclear. Here, we take Escherichia coli as a typical example and show that overflow metabolism can be understood through growth optimization combined with cell heterogeneity. A model of optimal protein allocation, coupled with heterogeneity in enzyme catalytic rates among cells, quantitatively explains why and how cells make the choice between respiration and fermentation under different nutrient conditions. Our model quantitatively illustrates the growth rate dependence of fermentation flux and enzyme allocation under various perturbations, which is fully validated by experimental results. Our work solves the long-standing puzzle of overflow metabolism and can be broadly used to address heterogeneity-related challenges in metabolism.
Introduction
A prominent feature of cancer metabolism is that tumor cells excrete large quantities of fermentation products in the presence of sufficient oxygen (Hanahan and Weinberg, 2011; Liberti and Locasale, 2016; Vander Heiden et al., 2009). This process, discovered by Otto Warburg in the 1920s (Warburg et al., 1924) and known as the Warburg effect, aerobic glycolysis or overflow metabolism, (Basan et al., 2015; Hanahan and Weinberg, 2011; Liberti and Locasale, 2016; Vander Heiden et al., 2009) is ubiquitous for fast-proliferating cells across a broad spectrum of organisms (Vander Heiden et al., 2009), ranging from bacteria (Basan et al., 2015; Holms, 1996; Meyer et al., 1984; Nanchen et al., 2006; Neidhardt et al., 1990) and fungi (De Deken, 1966) to mammalian cells (Hanahan and Weinberg, 2011; Liberti and Locasale, 2016; Vander Heiden et al., 2009). For microbes, cells use standard respiration when nutrients are scarce, while they use the counterintuitive aerobic glycolysis when nutrients are adequate, just analogous to normal tissues and cancer cells, respectively (Vander Heiden et al., 2009). Over the past century, especially with extensive studies in the last two decades (Liberti and Locasale, 2016), various rationales for overflow metabolism have been proposed (Basan et al., 2015; Chen and Nielsen, 2019; Majewski and Domach, 1990; Molenaar et al., 2009; Niebel et al., 2019; Pfeiffer et al., 2001; Shlomi et al., 2011; Vander Heiden et al., 2009; Varma and Palsson, 1994; Vazquez et al., 2010; Vazquez and Oltvai, 2016). In particular, Basan et al. (Basan et al., 2015) provided a systematic characterization of this process, including various types of perturbations in experiments. However, the origin and function of overflow metabolism still remain unclear (DeBerardinis and Chandel, 2020; Hanahan and Weinberg, 2011; Liberti and Locasale, 2016; Vander Heiden et al., 2009).
Why have microbes and cancer cells evolved to possess the seemingly wasteful aerobic glycolysis strategy? For unicellular organisms, there is evolutionary pressure (Vander Heiden et al., 2009) to optimize cellular resources for rapid growth (Dekel and Alon, 2005; Edwards et al., 2001; Hui et al., 2015; Li et al., 2018; Scott et al., 2010; Towbin et al., 2017; Wang et al., 2019; You et al., 2013). In particular, it has been shown that cells allocate protein resources for optimal growth (Hui et al., 2015; Scott et al., 2010; Wang et al., 2019; You et al., 2013), and the most efficient protein allocation corresponds to elementary flux mode (Müller et al., 2014; Wortel et al., 2014). In this study, we extend these approaches in a heterogeneous framework to address the puzzle of aerobic glycolysis. We use Escherichia coli as a typical example and show that overflow metabolism can be understood from optimal protein allocation combined with the heterogeneity in enzyme catalytic rates. The optimal growth strategy varies between respiration and fermentation depending on the concentration and type of the nutrient, and the combination with cell heterogeneity results in the standard picture (Basan et al., 2015; Holms, 1996; Meyer et al., 1984; Nanchen et al., 2006) of overflow metabolism. Our model quantitatively illustrates the growth rate dependence of fermentation/respiration flux and enzyme allocation under various types of perturbations, and can be used to explain the Warburg effect in tumor cells.
Results
Coarse-grained model
Based on the topology of the metabolic network (Neidhardt et al., 1990; Nelson et al., 2008) (Fig. 1A) , we classify the carbon sources that enter from the upper part of glycolysis into Group A (Wang et al., 2019), and the precursors of biomass components (such as amino acids) into five pools (see Appendix 1.2 for details): a1 (entry point: G6P/F6P), a2 (entry point: GA3P/3PG/PEP), b (entry point: pyruvate/acetyl-CoA), c (entry point: a-ketoglutarate), and d (entry point: oxaloacetate). Pools a1 and a2 are also combined as Pool a due to joint synthesis of precursors. Then, the metabolic network for Group A carbon source utilization (Fig. 1A) is coarse-grained into a model shown in Fig. 1B (see Appendix 2.1 for details) , where node A represents an arbitrary carbon source of Group A. Evidently, Fig. 1B is topologically identical to Fig. 1A. Each coarse-grained arrow in Fig. 1B carries a stoichiometric flux Ji, which delivers carbon flux and may consume or produce energy (e.g., J1, Ja1, see Figs. 1A-B and Appendix-fig. 1A).

Model and results of overflow metabolism.
(A) The central metabolic network of carbon source utilization. The Group A carbon sources (Wang et al., 2019) are labeled with green squares. (B) Coarse-grained model for Group A carbon source utilization. (C) Model predictions (Eqs. S47 and S160) and experimental results (Basan et al., 2015; Holms, 1996) of overflow metabolism, covering the data for all the Group A carbon sources shown in (A). (D) Growth rate dependence of respiration and fermentation fluxes (Eqs. S47 and S160). (E) The energy efficiencies of respiration and fermentation pathways vary with the growth rate as functions of the substrate quality of a Group A carbon source (Eqs. S31 and S36). See Appendix 8 for model parameter settings and experimental data sources (Basan et al., 2015; Holms, 1996; Hui et al., 2015) of Figs. 1–4.
In fact, the stoichiometric flux Ji scales with the cell population. For comparison with experiments, we define the normalized flux
Generally, there are three distinct destinies of a Group A carbon source in the metabolic network (Appendix-fig. 1C-E) : fermentation, respiration, and biomass generation. Each draws a proteome fraction of ϕf, ϕr and ϕBM. The net effect of the first two destinies is energy production, while the last one generates precursors of biomass accompanied by energy production. By applying the proteomic constraint(Scott et al., 2010) that there is a maximum fraction ϕmax for proteome allocation (ϕmax ≈ 0.48 (Scott et al., 2010)), we have:
In fact, Eq. 1 is equivalent to
For balanced cell growth, the energy demand is generally proportional to the biomass production rate. Thus, the normalized energy production rate
where ηE is the energy coefficient. By converting all the energy currencies into ATPs, the normalized energy fluxes of respiration and fermentation are
where φ is a constant coefficient mainly determined by ηE (see Eq. S33), and φ·λ represents the normalized energy demand other than the biomass pathway. The coefficients ψ, εr and εf are all functions of κA·. ψ-1 is the proteome efficiency of the biomass pathway (see Eq. S32), with ψ-1 = λ/ϕBM · εr and εf are the proteome energy efficiencies of the respiration and fermentation pathways, with
where both
Origin of overflow metabolism
The standard picture of overflow metabolism (Basan et al., 2015; Holms, 1996; Meyer et al., 1984; Nanchen et al., 2006) is exemplified by the experimental data (Basan et al., 2015) shown in Fig. 1C: the fermentation flux exhibits a threshold-analog dependence on the growth rate λ. It is well known that respiration is far more efficient than fermentation in terms of energy production per unit carbon (i.e.,
For cell proliferation in a given nutrient with fixed κA , the values of εr, εf and ψ are determined (Eqs. 4 and S32). However, the growth rate λ can be influenced by protein allocation between ϕr and ϕf with the governing equation Eq. 3. If εr < εr, then
In practice, both εr and εf are functions of κA (Eq. 4), and therefore the optimal choice may vary depending on the nutrient conditions. In nutrient-poor conditions where κA ≪
For a quantitative understanding of overflow metabolism, let us first consider the homogeneous case, where all cells share identical biochemical parameters. For optimal protein allocation, the relation between fermentation flux and growth rate is
To address this issue, we take into account cell heterogeneity, which is ubiquitous in both microbes (Ackermann, 2015; Bagamery et al., 2020; Balaban et al., 2004; Nikolic et al., 2013; Solopova et al., 2014; Wallden et al., 2016) and tumor cells (Duraj et al., 2021; Hanahan and Weinberg, 2011; Hensley et al., 2016). For the Warburg effect or overflow metabolism of our concern, experimental studies have reported significant metabolic heterogeneity in the choice between respiration and fermentation within a cell population (Bagamery et al., 2020; Duraj et al., 2021; Hensley et al., 2016; Nikolic et al., 2013). Motivated by the fact that the turnover number (kcat value) of a catalytic enzyme varies considerably between in vitro and in vivo measurements (Davidi et al., 2016; García - Contreras et al., 2012), we note that the concentrations of potassium and phosphate, which vary from cell to cell, have a significant impact on the kcat values of the metabolic enzymes (García - Contreras et al., 2012). Therefore, in a cell population, there is a distribution of values for kcat, which is commonly referred to as extrinsic noise (Elowitz et al., 2002). For simplicity, we assume that each kcat value follows a Gaussian distribution. This gives the distributions of single-cell growth rate in various types of carbon sources (see Eqs. S155–S157, S163–S165), which are fully verified by recent experiments using isogenic Escherichia coli with single-cell resolution (Wallden et al., 2016) (Appendix-fig. 2B). Then, the critical growth rate λC should follow a Gaussian distribution
where “erf” represents the error function. The fermentation flux exhibits a threshold-analog relation with the growth rate (the red curves in Figs. 1C-D, 2B-C and 3B, D, F), while the respiration flux (the blue curve in Fig. 1D) decreases with an increase in fermentation flux. In Fig. 1C-D, we see that the model results (see Eq. 5 and Appendix 8; parameters are set by the experimental data shown in Appendix-table S1) agree quantitatively with the experimental data of Escherichia coli (Basan et al., 2015; Holms, 1996). The fermentation flux was determined by the acetate secretion rate

Influence of protein overexpression on overflow metabolism.
(A) A 3D plot of the relations among fermentation flux, growth rate, and the expression level of useless proteins. In this plot, both the acetate excretion rate and growth rate vary as bivariate functions of the substrate quality of a Group A carbon source (denoted as κA) and the useless protein expression encoded by LacZ (denoted as ϕZ perturbation, see Eqs. S57 and S160). (B) Growth rate dependence of the acetate excretion rate upon ϕZ perturbation for each fixed nutrient condition (Eq. S58 and S160). (C) Growth rate dependence of the acetate excretion rate as κA varies (Eqs. S57 and S160), with each fixed expression level of LacZ.

Influence of energy dissipation, translation inhibition, and carbon source category alteration on overflow metabolism.
(A) A 3D plot of the relations among fermentation flux, growth rate, and the energy dissipation coefficient (Eqs. S70 and S160). (B) Growth rate dependence of the acetate excretion rate as κA varies, with each fixed energy dissipation coefficient determined by/fitted from experimental data. (C) A 3D plot of the relations among fermentation flux, growth rate, and the translation efficiency (Eqs. 85 and S160). Here, the translation efficiency is adjusted by the dose of chloramphenicol (Cm). (D) Growth rate dependence of the acetate excretion rate as κA varies, with each fixed dose of Cm. (E) Coarse-grained model for pyruvate utilization. (F) The growth rate dependence of fermentation flux in pyruvate (Eqs. 105 and S160) significantly differs from that of the Group A carbon sources (Eqs. 47 and S160).
Testing the model through perturbations
To further test our model, we systematically investigate model predictions under various types of perturbations and compare them with the experimental data from existing studies (Basan et al., 2015; Holms, 1996) (see Appendices 3 and 4.1 for details).
First, we consider the proteomic perturbation by overexpression of useless proteins encoded by the Lacz gene (i.e., ϕZ perturbation) in Escherichia coli. The net effect of the ϕZ perturbation is that the maximum fraction of proteome available for resource allocation changes from ϕmax to ϕmax − ϕZ5, where ϕZ is the mass fraction of useless proteins. In a cell population, the critical growth rate λC(ϕZ) still follows a Gaussian distribution N(μλC (ϕZ), σλC (ϕZ)2), where the CV of λC (ϕZ) remains unchanged. Consequently, the growth rate dependence of fermentation flux changes into
Next, we study the influence of energy dissipation, which introduces an energy dissipation coefficient “w” in Eq. 2:
We proceed to analyze the impact of translation inhibition with different sub-lethal doses of chloramphenicol on Escherichia coli. This type of perturbation introduces an inhibition coefficient “l” in the translation rate, thus turning κt into κt/(l+1). Still, the critical growth rate λC(l) follows a Gaussian distribution N(μλC(l), σλC(l)2), and then, the growth rate dependence of fermentation flux is:
Finally, we consider the alteration of nutrient categories by switching to a non-Group A carbon source: pyruvate, which enters the metabolic network from the endpoint of glycolysis (Neidhardt et al., 1990; Nelson et al., 2008). The coarse-grained model for pyruvate utilization is shown in Fig. 3E (see also Fig. 1A), which shares identical precursor pools as those for Group A carbon sources (Fig. 1B), yet with several differences in the coarse-grained reactions. The growth rate dependencies of both the proteome energy efficiencies (Appendix-fig. 2H) and energy fluxes are qualitatively similar to those of the Group A carbon source utilization, while there are quantitative differences in the coarse-grained parameters (see Appendices 4.1 and 8 for details). Most notably, the critical growth rate
Enzyme allocation under perturbations
As mentioned above, our coarse-grained model is topologically identical to the central metabolic network (Fig. 1A), and thus it can predict enzyme allocation for each gene in glycolysis and the TCA cycle (see Appendix-fig. 1B and Appendix-table 1) under various types of perturbations. In Fig. 1B, the intermediate nodes M1, M2, M3, M4, and M5 represent G6P, PEP, acetyl-CoA, α-ketoglutarate, and oxaloacetate, respectively. Then, ϕ1 and ϕ2 correspond to enzymes of glycolysis (or at the junction of glycolysis and the TCA cycle), while ϕ3 and ϕ4 correspond to enzymes in the TCA cycle.
We first consider enzyme allocation under carbon limitation by varying the nutrient type and concentration of a Group A carbon source (i.e., κA perturbation). In fact, this has been extensively studied in more simplified models (Hui et al., 2015; You et al., 2013), where the growth rate dependence of enzyme allocation under κA perturbation was generally considered to be a C-line response (Hui et al., 2015; You et al., 2013), i.e., the genes responsible for digesting carbon compounds show a linear increase in gene expression as the growth rate decreases (Hui et al., 2015; You et al., 2013). However, when it comes to enzymes catalyzing reactions between intermediate nodes, we collected experimental data from existing studies (Hui et al., 2015) and found that the enzymes in glycolysis exhibit a completely different response pattern compared to those in the TCA cycle (Appendix-fig. 3A-B). This discrepancy cannot be explained by the C-line response. To address this issue, we apply the coarse-grained model described above (Fig. 1B) to calculate the growth rate dependence of enzyme allocation for each ϕi (i = 1, 2, 3, 4) using the model settings for wild-type strains, where no fitting parameters are involved in determining the shape (see Eqs. S118–S119 and Appendix 8). In Figs. 4A-B and Appendix-fig. 3C-D, we see that the model predictions overall match with the experimental data (Hui et al., 2015) for representative genes from either glycolysis or the TCA cycle, and maintenance energy (with w0 = 2.5 (h-1)) has a negligible effect on this process. Still, there are minor discrepancies that arise from the basal expression of metabolic genes, which may be attributed to the fact that our model deals with relatively stable growth conditions while microbes need to be prepared for fluctuating environments (Basan et al., 2020; Kussell and Leibler, 2005; Mori et al., 2017).

Relative protein expression of central metabolic enzymes under κA and ϕZ perturbations.
(A, C) Relative protein expression of representative genes from glycolysis. (B, D) Relative protein expression of representative genes from the TCA cycle. (A, B) Results of κA perturbation (Eq. S119). (C, D) Results of ϕZ perturbation (Eq. S121).
We proceed to analyze the influence of ϕZ perturbation and energy dissipation. In both cases, our model predicts a linear response to the growth rate reduction for all genes in either glycolysis or the TCA cycle (see Appendix 5.2–5.3 for details). For ϕZ perturbation, all predicted slopes are positive, and there are no fitting parameters involved (Eqs. S120–S121). In Figs. 4C-D and Appendix-fig. 3E-J, we show that our model quantitatively illustrates the experimental data (Basan et al., 2015) for representative genes in the central metabolic network, and there is a better agreement with experiments (Basan et al., 2015) by incorporating the maintenance energy (with w0 = 2.5 (h-1)). For energy dissipation, however, the predicted slopes of the enzymes corresponding to ϕ4 are surely negative, and there is a constraint that the slope signs of the enzymes corresponding to the same ϕi (i = 1, 2, 3) should be the same. In Appendix-fig. 3K-N, we see that the model results (Eqs. S127 and S123) are consistent with experiments (Basan et al., 2015).
Discussion
The phenomenon of overflow metabolism, or the Warburg effect, has been a long-standing puzzle in cell metabolism. Although many rationales have been proposed (Basan et al., 2015; Chen and Nielsen, 2019; Majewski and Domach, 1990; Molenaar et al., 2009; Niebel et al., 2019; Pfeiffer et al., 2001; Shlomi et al., 2011; Vander Heiden et al., 2009; Varma and Palsson, 1994; Vazquez et al., 2010; Vazquez and Oltvai, 2016) over the past century, the origin and function of this phenomenon remain unclear (DeBerardinis and Chandel, 2020; Hanahan and Weinberg, 2011; Liberti and Locasale, 2016; Vander Heiden et al., 2009). In this study, we use Escherichia coli as a typical example and demonstrate that overflow metabolism can be understood through optimal protein allocation combined with cell heterogeneity. In nutrient-poor conditions, the proteome energy efficiency of respiration is higher than that of fermentation (Fig. 1E), and thus the cell uses respiration to optimize growth. In rich media, however, the proteome energy efficiency of fermentation increases faster and is higher than that of respiration (Fig. 1E), leading the cell to use fermentation to accelerate growth. In further combination with cell heterogeneity in enzyme catalytic rates (Davidi et al., 2016; García - Contreras et al., 2012), our model quantitatively illustrates the thresholdanalog response (Basan et al., 2015; Holms, 1996) in overflow metabolism (Fig. 1C).
Cell heterogeneity is crucial for the threshold-analog response in overflow metabolism. In the homogeneous case, the optimal solution is a digital response (Eq. S44) that corresponds to an elementary flux mode (Müller et al., 2014; Wortel et al., 2014) and agrees with the numerical study of Molenaar et al. (Molenaar et al., 2009). However, this digital response is incompatible with the standard picture of overflow metabolism (Basan et al., 2015; Holms, 1996; Meyer et al., 1984; Nanchen et al., 2006). By incorporating heterogeneity in enzyme catalytic rates (Davidi et al., 2016; García - Contreras et al., 2012), the critical growth rate (i.e., threshold) changes from a single value into a Gaussian distribution (Eq. 45, see Appendix 7 for details; see also Appendix-fig. 4) for a cell population, thus turning a digital response into the threshold-analog response in overflow metabolism (Fig. 1C). Our model results relying on cell heterogeneity are fully validated by the observed distributions of single-cell growth rate (Wallden et al., 2016) (Appendix-fig. 2B) and experiments with various types of perturbations (Basan et al., 2015; Holms, 1996; Hui et al., 2015), both for acetate secretion patterns and gene expression in the central metabolic network (Figs. 2–4 and Appendix-figs. 2D-E and 3).
Finally, our model can be broadly used to address heterogeneity-related challenges in metabolism on a quantitative basis, including the Crabtree effect in yeast (Bagamery et al., 2020; De Deken, 1966), the Warburg effect in cancer (Duraj et al., 2021; Hanahan and Weinberg, 2011; Liberti and Locasale, 2016; Vander Heiden et al., 2009) (see Appendix 6.4 for an explanation of the Warburg effect), and the heterogeneous metabolic strategies of cells in various types of environments (Bagamery et al., 2020; Duraj et al., 2021; Escalante-Chong et al., 2015; Hensley et al., 2016; Liu et al., 2015; Solopova et al., 2014; Wang et al., 2019).
Acknowledgements
The author thanks Chao Tang, Qi Ouyang, Yang-Yu Liu and Kang Xia for helpful discussions. This work was supported by National Natural Science Foundation of China (Grant No.12004443), Guangzhou Municipal Innovation Fund (Grant No.202102020284) and the Hundred Talents Program of Sun Yat-sen University.
Data, Materials, and Software Availability
All study data are included in the article and/or appendices.
Appendix 1 Model framework
Appendix 1.1 Proteome partition
Here we adopt the proteome partition framework similar to that introduced by Scott et al.(Scott et al., 2010). All proteins in a cell are classified into three classes: the fixed portion Q-class, the active ribosome-affiliated R-class, and the remaining catabolic/anabolic enzymes C-class. Each proteome class has a mass
To analyze cell growth optimization, we first consider the homogeneous case where all cells share identical biochemical parameters and simplify the mass accumulation of a cell population into a big cell. Essentially, this approximation would not influence the value of growth rate λ. For bacteria, the protein turnover rate is negligible, and thus the mass accumulation of each class follows:
where mAA stands for the average molecular weight of amino acids, kT is the translation rate,
where
Over a long period in the exponential growth phase (t → +∞), ϕi = fi (i = Q, R, C) and
where
Appendix 1.2 Precursor pools
Based on the entry point of the metabolic network, we classify the precursors of biomass components into five pools (Fig. 1A-B): a1 (entry point: G6P/F6P), a2 (entry point: GA3P/3PG/PEP), b (entry point: pyruvate/acetyl-CoA), c (entry point: α-ketoglutarate) and d (entry point: oxaloacetate). For bacteria, these five pools draw roughly ra1 = 24 %, ra2 = 24 %, rb = 28 %, rc = 12 % and rd = 12 % of the carbon flux(Nelson et al., 2008; Wang et al., 2019). There are overlapping components between Pools a1 and a2 due to the joint synthesis of some precursors, thus we also use Pool a to represent Pools a1-a2 in the descriptions.
Appendix 1.3 Stoichiometric flux
We consider the following biochemical reaction between substrate Si and enzyme Ei:
where ai, di and
where
The copy number of enzyme Ei is
The mass fraction of Ei is
Appendix 1.4 Carbon flux and cell growth rate
To clarify the relation between the stoichiometric flux Ji and growth rate λ, we consider the carbon flux in the biomass production. The carbon mass of the cell population (the “big cell”) is given by Mcarbon = Mprotein · rcarbon/rprotein, where rcarbon and rprotein represent the mass fraction of carbon and protein within a cell. In the exponential growth phase, the carbon flux of the biomass production is given by:
where mcarbon is the mass of a carbon atom. In fact, the carbon mass flux per stoichiometry varies depending on the entry point of the precursor pool. Taking Pool b as an example, there are three carbon atoms in a molecule of the entry point metabolite (i.e., pyruvate). Assuming that carbon atoms are conserved from pyruvate to Pool b, then the carbon flux of Pool b is given by
where the subscript “EPi” represents the entry point of Pool i, and
For each substrate in intermediate steps of the metabolic network, we define κi as the substrate quality:
and for each precursor pool, we define:
Combining Eqs. S8, S9 and S11, we have
Then, we define the normalized flux, which can be regarded as the flux per unit of biomass:
where the superscript “(N)” stands for normalized. Combined with Eqs. S8, S9 and S12, we have:
Since
then,
and we have
Appendix 1.5 Intermediate nodes
In a metabolic network, the metabolites between the carbon source and precursor pools are the intermediate nodes. As specified in Wang et al.(Wang et al., 2019), to optimize cell growth rate, the substrate of each intermediate node is nearly saturated, and thus
The real cases could be more complicated because of other metabolic regulations. Recent quantitative studies(Bennett et al., 2009; Park et al., 2016)have shown that at least in E. coli, for most of the substrate-enzyme pairs [Si] > Ki, which implies
Appendix 2 Model and analysis
Appendix 2.1 Coarse-grained model
In the coarse-grained model shown in Fig. 1B, node A represents an arbitrary carbon source of Group A(Wang et al., 2019), which joins at the upper part of glycolysis. Nodes M1, M2, M3, M4, and M5 stand for G6P, PEP, acetyl-CoA, α-ketoglutarate, and oxaloacetate, respectively. In the analysis of carbon supply into precursor pools, we lump sum G6P/F6P as M1, GA3P/3PG/PEP as M2, and pyruvate/acetyl-CoA as M3 for approximation. For the biochemical reactions, each follows Eq. S5 with bi = 1 except that M1→2M2 and M3+M5→M4. Basically, there are 3 possible destinies of a Group A carbon source (e.g., glucose, see Appendix-fig. 1C-E): energy contributions in the fermentation and respiration pathways (Appendix-fig.1C-D), or biomass components accompanied by energy production in the biomass pathway (Appendix-fig.1E).
By applying flux balance to the stoichiometric fluxes and combining with Eq. S8, we have:
Obviously, the stoichiometric fluxes of respiration Jr and fermentation Jf (Appendix-fig. 1C-D) are:
We further assume that the carbon atoms are conserved from each entry point metabolite to the precursor pool, and then,
In terms of energy production for the relevant reactions, for convenience, we convert all the energy currencies into ATPs, namely, NADH → 2ATP(Neidhardt et al., 1990), NADPH → 2ATP(Neidhardt et al., 1990; Sauer et al., 2004), FADH2 → 1ATP(Neidhardt et al., 1990). Then, we have
where JE represents the stoichiometric flux of ATPs, and βi is the stoichiometric coefficient with β1 = 4, β2 = 3, β3 = 2, β4 = 6, and βa1 = 4 (Neidhardt et al., 1990; Sauer et al., 2004). Generally, the energy demand is proportional to the carbon flux infused into biomass production, thus,
where rE is the ratio and also a constant.
By applying the substitutions specified in Eqs. S9, S12, S14–S18, combined with Eqs. S4, S10, S21–S25, and the constraint of proteome resource allocation ϕR + ϕC = ϕmax, we have:
where
which is determined externally by the culture condition. From Eq. S26, all ϕi can be expressed by
By substituting Eq. S28 into Eq. S26, we have:
Here,
The coefficients εr and εf represent the proteome energy efficiencies of the respiration and fermentation pathways (Appendix-fig.1C-D), respectively, with
ψ-1 is the proteome efficiency of biomass pathway (Appendix-fig. 1E), with
φ is the energy demand coefficient (a constant), with
and φ · λ stands for the normalized energy demand other than the accompanying energy production from the biomass pathway.
Appendix 2.2 The reason for overflow metabolism
Microbes optimizetheir growth rate to survive in the evolutionary process(Vander Heiden et al., 2009).Basically, this also applies to tumor cells, which proliferate rapidly ignoring signals of growth restriction(Vander Heiden et al., 2009). To optimize cell growth, we first consider the best strategy for a single cell. The coarse-grained model is summarized in Eq. S26 and further simplified into Eq. S29. Here, εr, εf and ψ are functions of κA (see Eqs. S31, S32), so we also denote them as εr(κA), εf(κA), ψ(κA). Apparently, the fluxes of both respiration and fermentation take non-negative values, i.e.,
Thus, if εr > εf, then
Similarly, if εf > εf, then the optimal solution is:
In both cases, the growth rate λ takes the maximum value for a given nutrient condition (i.e., given κA):
So, why do microbes use the wasteful fermentation pathway when the growth rate is large under aerobic conditions? An intuitive speculation is that the fermentation pathway is more efficient in terms of the proteome energy efficiency, i.e., εf > εr. If so, then why do microbes still use the normal respiration pathway when the growth rate is small? The answer lies in that both εr(κA) and εf(κA) are not constants, but are dependent on nutrient conditions. In Eq. S31, when κA is small, just consider the extreme case of κA → 0, and then
Since β3 + β4 ≫ β6, clearly,
Combined with Eq. S36, thus cells would certainly use the respiration pathway when the growth rate is very small. Meanwhile, suppose that
then Δ(κA) ≡ εf(κA)/εr(κA) is a monotonously increasing function of κA. Thus,
and cells would use the fermentation pathway when the growth rate is large.
In practice, experimental studies(Basan et al., 2015)in E. coli have reported that the proteome energy efficiency in fermentation is higher than that in respiration when the Group A carbon source is lactose at saturated concentration(Molenaar et al., 2009), i.e.,
Now that Eqs. S38–S40 are all valid, then there exists a critical value of κA (denoted as
Combined with Eq. S31, we have:
By substituting Eq. S42 into Eqs. S31, S32 and S36, we obtain the expressions for
where εr/f represents either εr or εf. In Fig. 1E, we show the dependencies of εr(κA), εf(κA) and λ(κA) on κA in a 3-dimensional form, as κA changes.
Appendix 2.3 The relation betweenrespiration/fermentationfluxes and growth rate
We proceed to study the relation between the respiration/fermentation flux and the cell growth rate. From Eqs. S16 and S30, we see that the stoichiometric fluxes Jr, Jf, the normalized fluxes
In the homogeneous case, i.e., all microbes share identical biochemical parameters, as λ(κA) increases with κA,
where “θ” stands for the Heaviside step function. Defining
In practice, the values of
where
where “erf” represents the error function. In practice, given a culturing medium, there is also a probability distribution for the growth rate (Appendix-fig.2B, see also Eq. S157). For approximation, in plotting the flux-growth rate relations, we use the deterministic (noise-free) value of the growth rate as the proxy. To compare with experiments, basically, we are comparing the normalized fluxes
In Fig. 1C-D, we see that Eq. S47 quantitatively illustrates the experimental data(Basan et al., 2015), where the model parameters were obtained using the biochemical data for the catalytic enzymes(see Appendix-table1 for details).
Appendix 3 Model perturbations
Appendix 3.1 Overexpression of useless proteins
Here we consider the case of overexpression of the protein encoded by Lacz gene in E. coli. Effectively, this limits the proteome by altering ϕmax:
where ϕZ stands for the fraction of useless proteins, which is controllable in experiments. Then, the growth rate changes into a bivariate function of κA and ϕZ:
and thus,
Obviously,
In the homogeneous case,
Combined with Eqs. S50–S51, we have:
To compare with experiments, we assume that each
where
Here,
where λ(κA, ϕZ),
In Fig. 2C. we show that the model predictions (Eq. S57) quantitatively agree with the experiments(Basan et al., 2015).
Meanwhile, we can also perturb the growth rate by tuning ϕZ in a stable culturing environment with fixed concentration of a Group A carbon source (i.e., given [A]). In fact, for this case there is a distribution of κA values due to the extrinsic noise in
Here, λ(κA, 0) remains unaltered as κA is fixed. Therefore, in this case,
In fact, the growth rate can be altered by tuning ϕZ and κA simultaneously. Then, the relations among the energy fluxes, growth rate and ϕZ still follow Eq. S57 (here ϕZ is a variable). In a 3-D representation, these relations correspond to a surface. In Fig. 2A, we show that the model predictions (Eq. S57) match well with the experimental data(Basan et al., 2015).
Appendix 3.2 Energy dissipation
In practice, energy dissipation breaks the proportional relationship between energy demand and biomass production. Thus, Eq. S25 changes to:
where W represents the dissipation coefficient. In fact, maintenance energy contributes to energy dissipation, and we define the maintenance energy coefficient as w0. In bacteria, the impact of w0 is often negligible, particularly for all the analysis in the sections above. While in tumor cell, w0 plays a much more significant role.
The introduction of energy dissipation leads to a modification to Eq. S26: combining Eq. S59 and Eq. S16, we have:
Then, Eq. S29 changes to:
Consequently, if εr > εf, the best strategy for the cell is:
and if εf > εr, the best strategy is:
Then, the growth rate turns into a bivariate function of both κA and w:
Clearly,
For a cell population, in the homogeneous case,
To compare with experiments, we assume the same extent of extrinsic noise in
where
Here,
Since the dissipation coefficient w is tunable in experiments, for a given value of w, λ(κA, w) changes monotonously with κA. Combining Eqs. S68–S69 and S30, we have (here w is a parameter):
The comparison between model predictions (Eq. S70) and experimental results(Basan et al., 2015) is shown in Fig.3B, which agrees quantitatively. Meanwhile, the growth rate can also be perturbed by changing κA and w simultaneously. Then, the relations among the energy fluxes, growth rate and follow Eq. S70 (here w is a variable). In a 3D representation, these relations correspond to a surface. As shown in Fig. 3A, the model predictions (Eq. S70) agree quantitatively with the experimental results(Basan et al., 2015).
Appendix 3.3 Translation inhibition
In E. coli, the translation rate can be modified by adding different concentrations of translation inhibitors, e.g., chloramphenicol (Cm). The net effect of this perturbation is represented as:
where ι stands for the inhibition coefficient with ι > 0, and (1+ ι)-1 represents the translation efficiency. Thus, Eq. S32 changes to:
First, we consider the case of neglecting the maintenance energy, i.e., w0 = 0. Then, the growth rate takes the following form:
where λ(κA, 0) and ψ(κA, 0) represent the terms free from translation inhibition. Thus,
In the homogeneous case,
To compare with experiments, we assume that there exists extrinsic noise in
where
Here,
In the experiments, the inhibition coefficient ι is controllable by tuning the concentration of translation inhibitor. For a given value of ι, λ(κA, ι) changes monotonously with κA. Combining Eqs. S30 and S78, we have (ι is a parameter here):
where
while
In the homogeneous case,
To compare with experiments, we assume that the extrinsic noise follows that specified in Appendix 2.3. Combining Eqs. S45, S74 and S81, then, λC(ι) approximately follows a Gaussian distribution:
Here
Thus, for a given ι, λ(κA, ι) changes monotonously with κA. Combining Eqs. S30 and S84, we have (here ι is a parameter):
The growth rate and fluxes can also be perturbed by altering κA and ι simultaneously. The relations among the energy fluxes, growth rate and ι would still follow Eq. S85 other than that now ι is regarded as a variable. Assuming that there is a tiny amount of maintenance energy. Basically, we assign w0 = 2.5(h-1). Then, we see that the experimental results(Basan et al., 2015) agree quantitatively well with the model predictions (Fig.3C-D).
Appendix 4 Overflow metabolism in substrates other than Group A carbon sources
Due to the topology of metabolic network, Group A carbon sources follow the equation (Eq. S47) of overflow metabolism upon κA perturbation (i.e., varying the type or concentration of a Group A carbon source). This has been demonstrated clearly in the above analysis, which agrees quantitatively with experiments. However, further analysis is required for substrates other than Group A sources due to the topological differences in carbon utilization(Wang et al., 2019). Basically, substrates entering from glycolysis or the points before acetyl-CoA are potentially involved in overflow metabolism, while those join from the TCA cycle are not relevant to this behavior. Still, mixed carbon sources are likely to induce a different profile of overflow metabolism, so long as there is a carbon source coming from glycolysis.
Appendix 4.1 Pyruvate
The coarse-grained model for pyruvate utilization is shown in Fig. 3E. Here, nodes M1, M2, M3, M4, M5 follow everything depicted in Appendix 2.1. Each biochemical reaction follows Eq. S5 with bi = 1 except that 2M2→M1 and M3+M5→M4. By applying flux balance to the stoichiometric fluxes, combining with Eq. S8, we have:
For energy production, we convert all the energy currencies into ATPs, and then,
where β7 = 1, β8 = 2, β3 = 2, β4 = 6, β6 = 1, β9 = 6, βa1 = 4 (Neidhardt et al., 1990; Sauer et al., 2004), and JE follows Eq. S25. By applying the substitutions specified in Eqs. S9, S12, S14–S18, combined with Eqs. S4, S10, S22, S23, S25, S86–S87, and the constraint of proteome resource allocation, we have:
where
From Eq. S88, all ϕi can be expressed by
By substituting Eq. S90 into Eq. S88, we have:
Here,
The coefficients
φpy is the energy demand coefficient (a constant), with
Evidently, Eq. S91 is identical in form with Eq. S29. The growth rate changes into κpy dependent:
When κpy is very small, combined with Eq. S93, then,
Obviously,
As long as
where the superscript “(ST)” stands for the saturated concentration, then,
and there exists a critical value of κpy, denoted as
Here,
Defining
where
Combined with Eq. S92, we have:
In Fig. 3F, we show that the model predictions (Eq. S105) agree quantitatively with the experimental results(Holms, 1996).
Appendix 4.2 Mixture of a Group A carbon source with extracellular amino acids
In the case of a Group A carbon source mixed with amino acids, the coarse-grained model is shown in Appendix-fig. 2A.In fact, this model can be used to analyze mixtures with one or multiple types of extracellular amino acids. Here, Eqs. S21, S22, S24 and S25 still apply, but Eq. S23 changes to (the case of i remains the same as Eq. S23):
Here,
In the case where all 21 types of amino acids are present and each in saturated concentration (denoted as “21AA”), we have:
where ϕi and κi are defined following Eqs. S9 and S12. Since the cell growth rate elevates significantly with the mixture of amino acids, we deduce that Pools a2-d are supplied by amino acids in growth optimization, with
Basically, amino acids should be more efficient in the supply of biomass production than the Group A carbon source for Pools a2-d, i.e.,
In practice, the requirement is even higher for proteome efficiency using amino acids, since the biomass production pathway is accompanied by energy production in the case of Group A carbon sources, yet not for amino acids. Combining Eqs. S108 and S109, we have:
where
φ21AA is the energy demand coefficient, with
Combining Eqs. S111 and S31, it is easy to obtain the formula for the growth rate:
In fact, Eqs. S37–S42 still apply.
When extrinsic noise is taken into account,
and the normalized fluxes
In fact, the above analysis can be extended to cases where a Group A carbon source is mixed with arbitrary combinations of amino acids. Eqs. S111, S114–S117 would remain in a similar form, while Eqs. S112–S113 would change depending on the amino acid combinations.In Appendix-fig.2B-C, we show the comparisons between model predictions (see also Appendix7.2 and Eq. S157) and experimental data(Basan et al., 2015; Wallden et al., 2016) in mixtures of 21 or 7 types of amino acids together with a Group A carbon source, which agree quantitatively.
Appendix 5 Enzyme allocation upon perturbations
Appendix 5.1 Carbon limitationwithin Group A carbon sources
In Eq. S28, we present the model predictions of the dependencies of enzyme protein fractions on growth rate and energy fluxes. To compare with experiments, we assume the same extent of extrinsic noise in
In Appendix-fig.3C-D, we show the comparisons between model predictions (Eq. S118, w0 = 0) and experimental data(Hui et al., 2015), which are consistent overall. We proceed to consider the influence of maintenance energy as specified in Appendix3.2. Here, we still choose w0 = 2.5 (h-1) as previously adopted in Appendix 3.3. Then, Eq. S28 still holds, combined with Eq. S85 in the condition that ι = 0, we have:
In Fig. 4A-B, we show that the model predictions (Eq. S119, w0 = 2.5 (h-1)) generally agree with the experiments(Hui et al., 2015). However, there are different basal expressions of these enzymes, which are probably due to living demands other than cell proliferation, such as preparation for starvation(Mori et al., 2017) or alteration in the type of the nutrient(Basan et al., 2020; Kussell and Leibler, 2005).
Appendix 5.2 Overexpression of useless proteins
In the case of ϕZ perturbation under each nutrient condition with fixed κA (see Appendix 3.1), we consider the same extent of extrinsic noise in
Here λ(κA, 0) is the growth rate for ϕZ = 0 and thus is a parameter rather than a variable. The growth rate is defined as λ(κA, ϕZ), which follows Eq. S50. Thus, ϕi is proportional to the growth rate λ.In Appendix-fig. 3E-F, we see that the model predictions (Eq. S120) agree with the experiments(Basan et al., 2015) overall. Next, we consider the influence of maintenance energy with w0 = 2.5 (h-1). Combining Eqs. S28, S58 and S85 (with ι = 0), we get:
Here, the growth rate is defined as λ(κA, ϕZ), and λ(κA, 0) is a parameter rather than a variable. Thus, ϕi is a linear function of the growth rate λ, with a positive slope and a positive y-intercept.In Fig. 4C-D and Appendix-fig.3I-J, we show that the model predictions (Eq. S121) agree quantitively with the experimental data(Basan et al., 2015).
Appendix 5.3 Energy dissipation
In the case of energy dissipation under each nutrient condition, w is perturbed while κA is fixed. The relation between protein allocation and growth rate can be obtained by combining Eqs. S28 and S70. However, since is explicitly present in Eq. S70, we need to reduce this variable to obtain the growth rate dependence of enzyme allocation. In fact, from Eq. S64, we have:
Here,
where the energy dissipation coefficient w is regarded as a function of the growth rate.
Combining Eqs. S28, S70 and S123, we get:
where w(λ) follows Eq. S123. When κA lies in the vicinity of
then we have:
and thus,
Note that in Eq. S123, w is a linear function of λ with a negative slope. Thus ϕi exhibits a linear relation with λ when Eq. S125 is satisfied (see Eq. S127). In fact, the slope of ϕ4 is surely negative (combining Eqs. S64, S123 and S127), while the slope sign of other ϕi depends on parameters. For a given nutrient, the enzymes corresponding to the same ϕi should exhibit the same slope sign. Another restriction is that if the slope sign of ϕ1 is negative, then the slope sign of ϕ2 is surely negative. In Appendix-fig. 3K-N, we show that our model results agree well with the experimental data(Basan et al., 2015) (Eq. S127).
Appendix 6 Other aspects of the model
Appendix 6.1 A coarse-grained model with more details
To compare with experiments, we consider a coarse-grained model with more details as shown in Appendix-fig. 2F.Here, nodes M6, M7 represent GA3P and DHAP, respectively. Other nodes follow the descriptions specified in Appendix 2.1. Each biochemical reaction follows Eq. S5 with bi = 1 except that M1→M6+M7 and M3+M5→M4. By applying flux balance to the stoichiometric fluxes, combined with Eq. S8, we obtain:
While Eqs. S22–S25 still hold. By applying the substitutions specified in Eqs. S9, S12, S14–S18, combined with Eqs. S4, S10, S22–S25, S128, and the constraint of proteome resource allocation, we get:
Then, Eq. S28 still hold, while ϕ10 and ϕ11 are:
By substituting Eqs. S28 and S130 into Eq. S129, we get:
where “dt” stands for details. Eqs. S30 and S33 still hold.
Appendix 6.2 Estimation of the in vivo enzyme catalytic rates
We use the method introduced by Davidi et al.(Davidi et al., 2016),combined with proteome experimental data(Basan et al., 2015) (Appendix-table2), to estimate the in vivo enzyme catalytic rates. Combining Eqs. S28 and S130, we have:
Here,
Eq. S135 is the in vivo result for the enzyme catalytic rate. In Appendix-fig. 2G, we show a comparison between in vivo and in intro results for kcat values of enzymes within glycolysis and the TCA cycle, which are roughly consistent. In the applications, we prioritized the use of in vivo results for enzyme catalytic rates, and use in intro data as a substitute when there were vacancies.
Appendix 6.3 Comparison with existing models thatillustrate experimental results
For the coarse-grained model described in Appendix 2, the normalized stoichiometric influx of a Group A carbon source is given by:
Combined with the first equation in Eq. S28, we obtain:
where er = β1 + 2(β2 + β3 + β4), ef = β1 + 2(β2 + β6), and
Based on the modeling principles rather than the detailed mechanisms, there are two major classes of existing models that can illustrate experimental results. In fact, both classes of models regard the proteome energy efficiencies εr and εf as constants, with εf > εr if used, or follow functionally equivalent propositions. In our model, however, εr and εf are both functions of κA, which vary significantly upon nutrient perturbation, where
The first class of models(Chen and Nielsen, 2019; Majewski and Domach, 1990; Niebel et al., 2019; Shlomi et al., 2011; Varma and Palsson, 1994; Vazquez et al., 2010; Vazquez and Oltvai, 2016)optimize the ratio of biomass outflow to carbon influx:
The second class of models, represented by Basan et al.(Basan et al., 2015), also adopt the optimization of
In Basan et al.(Basan et al., 2015), Eq. S138 is considered to be the relation between
Appendix 6.4 Explanation of the Warburg effect in tumor cells
Our model and analysis shown in Appendix 2 can be naturally extended to explain the Warburg effect in tumor cell metabolism with the following modifications in the model settings. In the applications for tumor cell metabolism, the fermentation flux changes from acetate secretion rate into the lactate secretion rate, and thus the stoichiometric coefficients (βi) for ATP production change accordingly. Consequently, in the coarse-grained model shown in Fig. 1B, M3 stands for pyruvate, and β2, β6 change into β2 = 1, β6 = -2 (Nelson et al., 2008).
Evidently, Eqs. S37–S38 still hold. As long as
Appendix 7 Probability density functions of variables and parameters
Appendix 7.1 Probability density function of κi
Enzyme catalysis is crucial for the survival of living organisms, as it can significantly accelerate the rate of a biochemical reaction by moderating the energy barrier between the substrate and product(Nelson et al., 2008).However, the maximal turnover rate of enzymes, kcat values, vary notably between the in vivo and in vitro measurements(Davidi et al., 2016). Recent studies suggest thatdifferences in the aquatic medium should be the causes(Davidi et al., 2016; García‐Contreras et al., 2012). In particular, the concentrations of potassium and phosphate have a biginfluence on kcat (García‐Contreras et al., 2012), which possess a certain degree of variation among the cellpopulations under intracellular conditions(García‐Contreras et al., 2012).For simplicity, we assume that the turnover rate of each Ei enzyme
When the CV of the
where
Meanwhile, due to the stochastic nature of biochemical reactions, we apply Gillespie’s chemical Langevin equation(Gillespie, 2000) to account for the intrinsic noise(Elowitz et al., 2002)(denoted as ηint). For cell size regulation of E. coli within a cell cycle, the cell mass at initiation of DNA per chromosome origin remains constant(Donachie, 1968). Thus, the duration of enzyme Ei to finish a catalytic job (with a timescale of
Here
where
which is inversely scaled with the square root of cell volume. Evidently, the intrinsic and extrinsic noise make orthogonal contributions to the total noise(Elowitz et al., 2002)(denoted as θtot):
In fact, when the CV is small (i.e., CV<<1), both IOG and IG distributions converge into Gaussian distributions (Appendix-fig.4). In the back-of-the-envelope calculations, we approximate x in all the denominator terms of IOG(x, μ, ζ) and IG(x, μ, ζ) as μ (since CV<<1). Then, both IOG and IG distributions can be approximated as follows:
with
with
and thus,
When the variance σ2 ≡ μ3/ζ is very small, basically, we require 2μ2k/ζ = 2σ2k/μ ≪ 1, and then
Then, we have:
In fact, although intrinsic noise affects the short-term measurement of enzyme catalytic rate and growth rate at the single-cell level, its contribution in the long-term is negligible. Thus, we approximate ηtot ≈ ηext. Combined with Eqs. S145–S146, it is easy to check that
For convenience, in the model analysis, we approximate both IOG and IG distributions as Gaussian distributions. Then, all
Using the properties of Gaussian distributions, for a series of constant real numbers γi, the summation of
with
where
Appendix 7.2 Probability density function of the growth rate λ
From Appendix 7.1, we note that λr and λf (see Eq. S36) roughly follow Gaussian distributions, with
where
Then, the cumulative distribution function of λ is
In Appendix-fig. 2B, we show that Eq. S157 quantitatively illustrates the experimentaldata of E.coli under the relevant conditions.
Appendix 8 Model comparison withexperiments
Appendix 8.1 Flux comparison with experiments
In Appendix 6.2, we see that the values of
By further combining with Eqs. S16–S17, we get:
In fact, the values of Jacetate and
From Eq. S18, we obtain the values of ηi for each precursor pool : ηa1 = 0.15, ηa2 = 0.30, ηb = 0.35, ηc = 0.09, ηd = 0.11. Still, the value of ηE is required to compare the growth rate dependence of fermentation/respiration fluxes between model results and experiments, which we will specify in Appendix 8.2.
Appendix 8.2 Model parameter settings using experimental data
We have collected biochemical data of E. coli shown in Appendix-tables 1–2 to set the model parameters. This includes the molecular weight (MW) and in vitro kcat values of the catalytic enzymes, as well as the proteome and flux data used to calculate the in vivo turnover numbers. To reduce measurement noise, we take the average rather than the maximum value of in vivo kcat from calculations with data from four cultures (see Appendix-table 2). Here, we prioritize the use of in vivo kcat wherever applicable unless there is a vacancy (see Appendix-table1).
Note that our models are coarse grained. For example, the flux J3 shown in Fig. 1B actually corresponds to three different reactions in the metabolic network (see Fig.1A and Appendix-table1),which we label as
In fact, Eq. S161 can be generalized to determine the values of other κi in the coarse-grained models combined with the biochemical data. For the coarse-grained model of Group A carbon source utilization shown in Fig. 1B, we have the values for parameters κi (i=1, …, 6), and then
For the remaining model parameters, note that we have classified the inactive ribosomal-affiliated proteins into the Q-class, and then ϕmax = 48 % (Scott et al., 2010). The values of κt is obtainable from experiments: the translational speed is 20.1aa/s(Scott et al., 2010), with 7336 amino acids per ribosome(Neidhardt, 1996) and ϛ ≈ 1.67 (Neidhardt, 1996; Scott et al., 2010)(see Appendix 1.1), hence κt = 1/610 (s-1).However, there are insufficient data to determine the values of κi (i=a1, a2, b, c, d) from the entry point metabolites to the precursor pools. Basically, it involves many steps, and thus these values should be considerably large. Here, we lump sum the contributions of κt and κi (i=a1, a2, b, c, d) by defining a composite parameter:
We proceed to estimate the values Ω of φ and using experimental data(Basan et al., 2015)for wild-type strainson the
For the case of w0 = 0, where all kcat values follow a Gaussian distribution with an extrinsic noise of 25% CV, which is the general setting we use unless otherwise specified, then φ = 10.8 and Ω = 1345 (s). Accordingly, we have ηE = 14.78,
For pyruvate, with the value of ηE, we get φpy = 14.82. However, there is still a lack of proteome data to determine the value of κ9, which actually involves many steps in the metabolic network and thus can be considerably large. Here we define another composite parameter
For the case of a Group A carbon source mixed with 21 amino acids (21AA, with saturated concentrations), then φ21AA = 14.2. Comparing Eq. S32 with Eq. S112, the parameter Ω should change to
For the case of a Group A carbon source mixed with 7 amino acids (7AA: His, Iso, Leu, Lys, Met, Phe, and Val), similar to the roles of φ21AA and Ω21AA, we define φ7AA and Ω7AA. Using the mass fraction of the 7AA combined with Eq. S18, we have φ7AA = 11.6. For the value of Ω7AA, evidently, Ω21AA < Ω7AA < Ω, and we estimate φ7AA = 1215 (s) from the growth rate data for E. coli measured under the relevant culture media(Basan et al., 2015). Then,
For the case of w0 = 2.5 (h-1), we have φ = 8.3, and thus ηE = 12.28,, while other parameters such as Ω,
From Appendix 7.1–7.2, combined with Eq. S114, the distributions of
where
For the case of succinate mixed with 21AA (labeled as “Succinate+21AA”), the respiration pathway is always more efficient since succinate lies within the TCA cycle, then the cell growth rate (defined as
For the case that acetate is the sole carbon source, evidently, the cells only use the respiration pathway, and thus the growth rate (defined as λacetate) follows a Gaussian distribution:
With the measured growth rate data(Wallden et al., 2016),
we estimate
Appendix 8.3 Notes on the application of reference data
Data calibration
Throughout our manuscript, we use experimental data from the original references except for two calibrations. The first calibration is shown in the footnote to Appendix-table2. With this calibration, the
Data of the inducible strains
We note that part of the experiment data in the original references(Basan et al., 2015; Hui et al., 2015)were obtained using strains with titratable systems (e.g. titratable ptsG, LacY). Basically, the
Experimental data sources
The batch culture data shown in Fig. 1C(labeled with minimum/rich media or inducible strains) and Appendix-fig.2C were taken from the source data of the reference’s figure 1(Basan et al., 2015). The chemostat data shown in Fig. 1C were taken from the reference’s table 7(Holms, 1996). The data shown in Fig. 1D were taken from the reference’s extended data figure 3a(Basan et al., 2015) with the calibration specified in the footnote to Appendix-table2.
The data shown in Fig. 2A were adopted from the reference’s extended data figure 4a-b(Basan et al., 2015). The data shown in Fig. 2B were taken from the source data of the reference’s figure 2a(Basan et al., 2015). The data shown in Fig. 2C were taken from the source data of the reference’s figure 3a(Basan et al., 2015). The data shown in Fig. 3A-B were taken from the source data of the reference’s figure 3d(Basan et al., 2015).
The data shown in Fig. 3C-D and Appendix-fig.2D-E were taken from the source data of the reference’s figure 3c(Basan et al., 2015). The data shown in Fig. 3F were taken from the reference’s table 7(Holms, 1996), with a calibration factor specified in the above paragraph (“Data calibration”).
The data shown in Fig.4A-B and Appendix-fig.3A-D were taken from the reference’s table S2 with the label “C-lim”(Hui et al., 2015).We excludedthe reference’s data withλ=0.45205 (h-1)as there are other unconsidered factors involved during slow growth(Dai et al., 2016) (for λ<0.5 h-1), and we suspect that there may be unknown calibration factors. The data shown in Fig. 4C-D and Appendix-fig.3E-N were adopted from the reference’s extended data figure 6–7(Basan et al., 2015).
The gene names depicted in Appendix-fig. 1B were identified using KEGG database. The data shown in Appendix-fig. 2G were drawn from Appendix-table1, which includes the original references themselves. The flux data presented in Appendix-table2 were obtained from the reference’s extended data figure 3a(Basan et al., 2015), with the calibration specified in the footnote. The proteome data shown in Appendix-table2 were taken from the reference’s supplementary Table N5(Basan et al., 2015).


Molecular weight (MW) andinvivo/in vitrokcatdata forE. coli

Proteome and flux data(Basan et al., 2015)used to calculate the in vivo kcat

Central metabolic network and carbon utilization pathways
(A) Energy production details of the central metabolic network. In E. coli, NADPH and NADH are interconvertible(Sauer et al., 2004), and all energy carriers can be converted to ATP with ADP. The conversion factors are: NADH=2ATP, NADPH=2ATP, FADH2=1ATP(Neidhardt et al., 1990). (B) Relevant genes for enzymes in the central metabolic network.(C-E) Three destinies of glucose metabolism.(C) Fermentation pathway, where a molecule of glucose generates 12 ATPs in E. coli. (D) Respiration pathway, where a molecule of glucose generates 26 ATPs. (E) Biomass pathway, where glucose turns into precursors of biomass. Note that the process of biomass generation is accompanied by ATPs production (see Appendix 2.1).

Model and results for experimental comparison
(A-C) Model analysis for carbon utilization in mixtures with amino acids.(A) Coarse-grained model for the case of a Group A carbon source mixed with extracellular amino acids. (B) Model predictions (Eqs. S157, S164–S165) and single-cell reference experimental results(Wallden et al., 2016) of the growth rate distributions forE. coli in three culturing conditions. (C) Comparison of thegrowth rate-fermentation flux relationfor E. coli in Group A carbon sources between minimum media and enriched media (those with 7AA).(D-E)Influence of translation inhibition on overflow metabolism.(D) A 3D plot of the relations among the fermentation flux, growth rate, and the translation efficiency (Eqs. 79 and S160). (E) Growth rate dependence of acetate excretion rate as κA varies, with each fixed dose of Cm.The translation efficiency is tuned by the dose of Cm, and the maintenance energy coefficient is set to be 0 (i.e., w0 = 0).(F) The coarse-grained model for Group A carbon source utilization. This model includes more details to compare with experiments. (G) Comparison of the in vivo and in vitro catalytic rates for enzymes within glycolysis and the TCA cycle (see Appendix-table1 for details). (H) The energy efficiencies of respiration and fermentation pathways vary with growth rate as functions of the substrate quality of pyruvate (Eqs. S93 and S96).

Relative protein expression of central metabolic enzymes under various types of perturbations
(A-D) Relative protein expression under κA perturbation.(A) Experimental data(Hui et al., 2015)of the catalytic enzymes for each step of glycolysis.(B) Experimental data(Hui et al., 2015)of the catalytic enzymes for each step of the TCA cycle. (C) Model predictions (Eq. S118, with w0 = 0) and experimental data(Hui et al., 2015) of representative genes from glycolysis. (D) Model predictions (Eq. S118,with w0 = 0) and experimental data(Hui et al., 2015) of representative genes from the TCA cycle.(E-J)Relative protein expression under ϕZ perturbation.(E, F, I) Model predictions and experimental data(Basan et al., 2015) of representative genes from glycolysis. (G, H, J) Model predictions and experimental data(Basan et al., 2015) of representative genes from the TCA cycle. (E-H) Results of ϕZ perturbation with w0 = 0 (Eq. S120). (I-J) Results of ϕZ perturbation with w0 = 2.5 (Eq. S121).(K-N)Relative protein expressionuponenergy dissipation.(K-L) Model fits (Eqs. S127 and S123) and experimental data(Basan et al., 2015) of representative genes from glycolysis. (M-N) Model fits(Eqs. S127 and S123) and experimental data(Basan et al., 2015) of representative genes from the TCA cycle.

Asymptotic distributions ofinverse Gaussian distribution and the inverse of Gaussian distribution
(A) Comparison between the inverse of Gaussian distribution and the corresponding Gaussian distribution for each value of coefficient of variation (CV) (Eqs. S140 and S145). (B)Comparison between the inverse Gaussian distribution and the corresponding Gaussian distribution for each value of CV (Eqs. S142 and S146). Both inverse Gaussian distribution and the inverse of Gaussian distribution converge to Gaussian distributions when CV is small.
References
- A functional perspective on phenotypic heterogeneity in microorganismsNature Reviews Microbiology 13:497–508
- A putative bet-hedging strategy buffers budding yeast against environmental instabilityCurrent Biology 30:4563–4578
- Bacterial persistence as a phenotypic switchScience 305:1622–1625
- A universal trade-off between growth and lag in fluctuating environmentsNature 584:470–474
- Overflow metabolism in Escherichia coli results from efficient proteome allocationNature 528:99–104
- Absolute metabolite concentrations and implied enzyme active site occupancy in Escherichia coliNature Chemical Biology 5:593–599
- Energy metabolism controls phenotypes by protein efficiency and allocationProceedings of the National Academy of Sciences 116:17592–17597
- Reduction of translating ribosomes enables Escherichia coli to maintain elongation rates during slow growthNature Microbiology 2:1–9
- Global characterization of in vivo enzyme catalytic rates and their correspondence to in vitro kcat measurementsProceedings of the National Academy of Sciences 113:3401–3406
- The Crabtree effect: a regulatory system in yeastMicrobiology 44:149–156
- We need to talk about the Warburg effectNature Metabolism 2:127–129
- Optimality and evolutionary tuning of the expression level of a proteinNature 436:588–592
- Beyond the Warburg effect: Oxidative and glycolytic phenotypes coexist within the metabolic heterogeneity of glioblastomaCells 10:202
- In silico predictions of Escherichia coli metabolic capabilities are consistent with experimental dataNature biotechnology 19:125–130
- Stochastic gene expression in a single cellScience 297:1183–1186
- Galactose metabolic genes in yeast respond to a ratio of galactose and glucoseProceedings of the National Academy of Sciences 112:1636–1641
- Why in vivo may not equal in vitro-new effectors revealed by measurement of enzymatic activities under the same in vivo-like assay conditionsThe FEBS Journal 279:4145–4159
- Hallmarks of cancer: the next generationCell 144:646–674
- Metabolic heterogeneity in human lung tumorsCell 164:681–694
- Flux analysis and control of the central metabolic pathways in Escherichia coliFEMS Microbiology Reviews 19:85–116
- Quantitative proteomic analysis reveals a simple strategy of global resource allocation in bacteriaMolecular Systems Biology 11:784
- Ribosome composition maximizes cellular growth rates in E. coliPhysical Review Letters 125:028103
- Phenotypic diversity, population growth, and information in fluctuating environmentsScience 309:2075–2078
- Escherichia coli translation strategies differ across carbon, nitrogen and phosphorus limitation conditionsNature Microbiology 3:939–947
- The Warburg effect: how does it benefit cancer cells?Trends in Biochemical Sciences 41:211–218
- Reliable cell cycle commitment in budding yeast is ensured by signal integrationeLife 4:e03977
- Simple constrained-optimization view of acetate overflow in E. coliBiotechnology Bioengineering 35:732–738
- Acetate formation in continuous culture of Escherichia coli K12 D1 on defined and complex mediaJournal of Biotechnology 1:355–358
- Shifts in growth strategies reflect tradeoffs in cellular economicsMolecular Systems Biology 5:323
- Quantifying the benefit of a proteome reserve in fluctuating environmentsNature Communications 8:1225
- Enzyme allocation problems in kinetic metabolic networks: Optimal solutions are elementary flux modesJournal of Theoretical Biology 347:182–190
- Nonlinear dependency of intracellular fluxes on growth rate in miniaturized continuous cultures of Escherichia coliApplied Environmental Microbiology 72:1164–1172
- Escherichia coli and Salmonella: cellular and molecular biologyWashington, DC: ASM Press
- Physiology of the bacterial cellSinauer associates
- Lehninger principles of biochemistryMacmillan
- An upper limit on Gibbs energy dissipation governs cellular metabolismNature Metabolism 1:125–132
- Analysis of fluorescent reporters indicates heterogeneity in glucose uptake and utilization in clonal bacterial populationsBMC Microbiology 13:1–13
- Metabolite concentrations, fluxes and free energies imply efficient enzyme usageNature Chemical Biology 12:482–489
- Cooperation and competition in the evolution of ATP-producing pathwaysScience 292:504–507
- Interdependence of cell growth and gene expression: origins and consequencesScience 330:1099–1102
- Genome-scale metabolic modeling elucidates the role of proliferative adaptation in causing the Warburg effectPLoS Computational Biology 7:e1002018
- Bet-hedging during bacterial diauxic shiftProceedings of the National Academy of Sciences 111:7427–7432
- Optimality and sub-optimality in a bacterial growth lawNature Communications 8:14123
- Understanding the Warburg effect: the metabolic requirements of cell proliferationScience 324:1029–1033
- Stoichiometric flux balance models quantitatively predict growth and metabolic by-product secretion in wild-type Escherichia coli W3110Applied Environmental Microbiology 60:3724–3731
- Catabolic efficiency of aerobic glycolysis: the Warburg effect revisitedBMC Systems Biology 4:1–9
- Macromolecular crowding explains overflow metabolism in cellsScientific Reports 6:31007
- The synchronization of replication and division cycles in individual E. coli cellsCell 166:729–739
- Growth strategy of microbes on mixed carbon sourcesNature Communications 10:1279
- Über den Stoffwechsel der CarcinomzelleBiochemische Zeitschrift 152:309–344
- Metabolic states with maximal specific rate carry flux through an elementary flux modeThe FEBS Journal 281:1547–1555
- Coordination of bacterial proteome with metabolism by cyclic AMP signallingNature 500:301–306
- Relationship between cell size and time of initiation of DNA replicationNature 219:1077–1079
- The inverse Gaussian distribution and its statistical application—a reviewJournal of the Royal Statistical Society: Series B 40:263–275
- The chemical Langevin equationThe Journal of Chemical Physics 113:297–306
- Cell biology by the numbersGarland Science
- The soluble and membrane-bound transhydrogenases UdhA and PntAB have divergent functions in NADPH metabolism of Escherichia coliJournal of Biological Chemistry 279:6613–6619
- General calibration of microbial growth in microplate readersScientific Reports 6:1–7
- Stochastic processes in physics and chemistryElsevier
Article and author information
Author information
Version history
- Sent for peer review:
- Preprint posted:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
- Reviewed Preprint version 3:
Copyright
© 2024, Xin Wang
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 929
- downloads
- 54
- citations
- 0
Views, downloads and citations are aggregated across all versions of this paper published by eLife.