Research Article

Cell Biology

Genome concentration limits cell growth and modulates proteome composition in Escherichia coli

Howard Hughes Medical Institute, Stanford University, United States
Sarafan Chemistry, Engineering, and Medicine for Human Health Institute, Stanford University, United States
Institute of Biotechnology, Helsinki Institute of Life Science, University of Helsinki, Finland
Department of Biology, Stanford University, United States
Chan Zuckerberg Biohub, United States
Department of Genetics, Stanford University, United States
Department of Microbiology and Immunology, Stanford School of Medicine, United States

Dec 23, 2024

https://doi.org/10.7554/eLife.97465.3

Open access

eLife Assessment

This fundamental work by Mäkelä et al. presents compelling experimental evidence supported by a theoretical model that the amount of chromosomal DNA can become limiting for the total rate of mRNA transcription and consequently protein production in the model bacterium Escherichia coli. The work is based on a mutant that allows inhibition of DNA replication while following growth at the single-cell level due to cell filamentation. The work significantly advances our understanding of growth and of the central dogma, and will be of considerable interest within both systems biology and microbial physiology.

https://doi.org/10.7554/eLife.97465.3.sa0

Significance of the findings:

Fundamental: Findings that substantially advance our understanding of major research questions

Strength of evidence:

Compelling: Evidence that features methods, data and analyses more rigorous than the current state-of-the-art

During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate).

Abstract

Defining the cellular factors that drive growth rate and proteome composition is essential for understanding and manipulating cellular systems. In bacteria, ribosome concentration is known to be a constraining factor of cell growth rate, while gene concentration is usually assumed not to be limiting. Here, using single-molecule tracking, quantitative single-cell microscopy, and modeling, we show that genome dilution in Escherichia coli cells arrested for DNA replication limits total RNA polymerase activity within physiological cell sizes across tested nutrient conditions. This rapid-onset limitation on bulk transcription results in sub-linear scaling of total active ribosomes with cell size and sub-exponential growth. Such downstream effects on bulk translation and cell growth are near-immediately detectable in a nutrient-rich medium, but delayed in nutrient-poor conditions, presumably due to cellular buffering activities. RNA sequencing and tandem-mass-tag mass spectrometry experiments further reveal that genome dilution remodels the relative abundance of mRNAs and proteins with cell size at a global level. Altogether, our findings indicate that chromosome concentration is a limiting factor of transcription and a global modulator of the transcriptome and proteome composition in E. coli. Experiments in Caulobacter crescentus and comparison with eukaryotic cell studies identify broadly conserved DNA concentration-dependent scaling principles of gene expression.

Introduction

Cells regulate the intracellular concentration of various proteins and macromolecules to modulate the rate of essential cellular processes, including growth. In bacteria, cell mass and volume typically double between division cycles. Proportionality between biosynthetic capacity and biomass accumulation results in exponential or near-exponential cell growth during the cell cycle (Campos et al., 2014; Schaechter et al., 1958; Schaechter et al., 1962; Siegal-Gaskins and Crosson, 2008; Taheri-Araghi et al., 2015; Wang et al., 2010). What drives exponential growth has been a longstanding question in the microbiology field (Belliveau et al., 2021; Churchward et al., 1982; Ecker and Schaechter, 1963; Zhurinsky et al., 2010). Quantitative studies on model bacteria such as Escherichia coli place the concentration of ribosomes and their kinetics as the principal rate-limiting factors (Belliveau et al., 2021; Bosdriesz et al., 2015; Koch, 1988; Scott et al., 2014; Scott et al., 2010). Most other cellular components essential for growth are estimated to be at least an order of magnitude above the level required for proper enzymatic reactions (Belliveau et al., 2021), indicating that they are well in excess in terms of metabolic concentrations. Thus, translation is generally seen as the rate-governing process for cellular growth. While the translocation rate of ribosomes poses an inherent limit on the growth rate of the cell, protein concentrations are predominantly set transcriptionally at the promoter level, with tight coordination between transcription and translation (Balakrishnan et al., 2022).

Whereas the importance of ribosome concentration in growth rate determination has been extensively studied, a potential role for genome concentration has received less attention. An early population study on an E. coli thymine auxotroph proposed that global transcription is not limited by the concentration of the genome but is instead constrained by the availability of RNA polymerases (RNAPs) (Churchward et al., 1982). However, the potential impact of DNA concentration on determining the growth rate of E. coli or other bacteria has, to our knowledge, not been formally tested. Interestingly, E. coli and Bacillus subtilis have been shown to display small but reproducible deviations from exponential growth during the division cycle (Kar et al., 2021; Nordholt et al., 2020), with the growth rate increasing after the initiation of DNA replication under some conditions. Furthermore, at the population level, these organisms initiate DNA replication at a fixed cell volume (mass) per chromosomal origin of replication (oriC) across a wide range of nutrient and genetic conditions (Donachie, 1968; Govers et al., 2024; Si et al., 2017; Zheng et al., 2016), suggesting that DNA concentration is an important physiological parameter for these bacteria. In eukaryotes where genome concentration is also tightly controlled (Ginzberg et al., 2015; Turner et al., 2012), a change in DNA-to-cell-volume ratio has recently been demonstrated to remodel the proteome and promote cellular senescence (Crozier et al., 2023; Foy et al., 2023; Lanz et al., 2024; Lanz et al., 2022; Manohar et al., 2023; Neurohr et al., 2019; Wilson et al., 2023).

In this study, we combined single-cell and single-molecule microscopy experiments with tandem-mass-tag (TMT)-mass spectrometry (MS), RNA sequencing (RNA-seq), and modeling to investigate the potential physiological role of genome concentration in cell growth and proteome composition in E. coli.

Results

Growth rate correlates with the genome copy number

To examine the potential effect of DNA content on the growth rate of E. coli, we used two CRISPR interference (CRISPRi) strains with arabinose-inducible control of expression of dCas9 (Li et al., 2016; Si et al., 2017). One strain expressed a single-guide RNA (sgRNA) against oriC. Sequestration of oriC by dCas9 binding prevents the initiation of DNA replication, producing cells with a single copy of the chromosome once already initiated rounds of DNA replication are completed and cells undergo reductive division (Si et al., 2017). These cells, referred to as ‘1N cells’ below, grew into filaments because a block in DNA replication prevents cell division, but not cell growth, from occurring (Figure 1A; Carl, 1970; Si et al., 2017; Withers and Bernander, 1998). The second CRISPRi strain, which served as a comparison, expressed an sgRNA against the cell division protein FtsZ. FtsZ depletion blocks cell division while allowing DNA replication to proceed (Addinall et al., 1996; Li et al., 2016). Ongoing growth resulted in filamenting cells with multiple replicating chromosomes, hereafter referred to as ‘multi-N cells’ (Figure 1A). For both strains, we used time-lapse microscopy to monitor growth at the single-cell level at 37°C in M9 minimal medium supplemented with glycerol, casamino acids, and thiamine (M9glyCAAT). Cell area ($A$) was automatically detected from phase-contrast images using a deep convolutional network (Wiktor et al., 2021), and the absolute growth rate ($\frac{dA}{dt}$) was determined by calculating the difference in cell area between frames. The relative growth rate ($\frac{1}{A}\frac{dA}{dt}$), which is constant for exponential growth, was calculated by dividing the absolute growth rate by the cell area. We used wild-type (WT) cells to verify that the transition from liquid cultures to agarose pads led to stable growth from the start of image acquisition (Figure 1—figure supplement 1).
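As a minimal sketch of the growth rate calculation described above, the following Python snippet computes absolute and relative growth rates from a single-cell area trajectory by finite differences. The function name `growth_rates`, the 3 min frame interval, the example trajectories, and the choice to divide by the mid-interval area are all assumptions made for illustration; the text does not specify these details.

```python
import math

def growth_rates(times, areas):
    """Return (mid_area, absolute_rate, relative_rate) for each frame interval.

    absolute_rate approximates dA/dt as the area difference between frames.
    relative_rate approximates (1/A) dA/dt, using the mid-interval area
    (an assumption; the start-of-interval area would also be reasonable).
    """
    rates = []
    for i in range(len(times) - 1):
        dt = times[i + 1] - times[i]
        absolute = (areas[i + 1] - areas[i]) / dt
        mid_area = 0.5 * (areas[i] + areas[i + 1])
        rates.append((mid_area, absolute, absolute / mid_area))
    return rates

# Hypothetical trajectories sampled every 3 min (areas in µm²).
times = [3.0 * i for i in range(10)]
exp_areas = [2.0 * math.exp(0.01 * t) for t in times]  # exponential growth
lin_areas = [2.0 + 0.02 * t for t in times]            # linear growth

exp_rates = growth_rates(times, exp_areas)
lin_rates = growth_rates(times, lin_areas)
```

For the exponential trajectory the relative growth rate stays close to the constant 0.01 per min, whereas for the linear trajectory the absolute growth rate is constant and the relative growth rate falls as the cell gets larger, which is the signature of sub-exponential growth.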

Growth rate and genome copy number in *E. coli* growing in M9glyCAAT.

(A) Illustration of 1N (CRISPR interference [CRISPRi] *oriC*, CJW7457) and multi-N (CRISPRi *ftsZ*, CJW7576) cells with different numbers of chromosomes along with representative microscopy images at different time points following CRISPRi induction. Scale bars: 1 µm. (B) Plot showing representative single-cell trajectories of cell area as a function of time for the CRISPRi strains following a block in DNA replication and/or cell division. (C) Plot showing the absolute growth rate as a function of cell area for 1N (32,735 datapoints from 1568 cells) and multi-N cells (14,006 datapoints from 916 cells) in M9glyCAAT. Lines and shaded areas denote mean ± SD from three experiments. This also applies to the panels below. (D) Absolute and (E) relative growth rate in 1N (32,735 datapoints from 1568 cells, CJW7457), multi-N (14,006 datapoints from 916 cells, CJW7576), and *dnaC2* 1N (13,933 datapoints from 1043 cells, CJW7374) cells as a function of cell area in M9glyCAAT. (F) Absolute and (G) relative growth rate in 1N (13,933 datapoints from 1043 cells), 2N (6265 datapoints from 295 cells), and >2N (2116 datapoints from 95 cells) *dnaC2* (CJW7374) cells as a function of cell area in M9glyCAAT.

As the induced CRISPRi oriC phenotype is not fully penetrant, we limited our analysis to 1N cells that contained a single DNA object (nucleoid) labeled by an mCherry fusion to the nucleoid-binding protein HupA (referred to as HU below). To confirm this 1N chromosome designation, we used a CRISPRi oriC strain that expresses HU-CFP and carries an oriC-proximal parS site labeled with ParB-mCherry (Figure 1—figure supplement 2), used here to determine the number of nucleoids and chromosomal origins per cell. We found that 96 ± 1% (mean ± standard deviation, SD, three biological replicates) of cells (n=3378) with a single HU-labeled nucleoid contained no more than one ParB-mCherry focus, indicative of a single oriC.

Using this methodology, we observed a significant difference in growth rate between 1N and multi-N cells as shown in representative single-cell growth trajectories (Figure 1B) and in aggregated absolute growth rate measurements (Figure 1C). In multi-N cells, the absolute growth rate rapidly increased with cell area. In 1N cells, the absolute growth rate only moderately increased with cell area, approaching an apparent plateau at large cell sizes (Figure 1C). As an independent validation, we used an orthogonal system to block DNA replication using the temperature-sensitive mutant dnaC2, which encodes a deficient DNA helicase loader at the restrictive temperature of 37°C (Carl, 1970; Withers and Bernander, 1998). We observed that the relationship between absolute growth rate and cell area in dnaC2 cells with a single nucleoid was similar to that of 1N cells produced by the CRISPRi oriC system (Figure 1D). This sub-exponential growth in 1N and dnaC2 cells resulted in a relative growth rate that decreased with cell area (Figure 1E). For multi-N cells, the relative growth rate was not perfectly constant but appeared to increase somewhat with cell area (Figure 1E). It is unclear whether this slight increase is biologically meaningful, as simulations show that a small inaccuracy in cell size from cell segmentation can produce the appearance of super-exponential growth (Figure 1—figure supplement 3). Regardless, and most importantly, the multi-N cells grew identically to WT within the same cell size range while 1N cells grew significantly slower (Figure 1—figure supplement 4).

The striking divergence in growth between 1N and multi-N cells of the same size suggested that DNA concentration can affect growth rate. The difference in growth rate between 1N and multi-N cells was already apparent in the physiological range of cell sizes when compared to WT cells (Figure 1—figure supplement 4), suggesting that growth rate reduction occurs soon after DNA replication fails to initiate. We confirmed that the slower growth of 1N cells did not depend on the time that cells spent on agarose pads (Figure 1—figure supplement 5A). We also ruled out that the growth reduction was due to an induction of the SOS response or to an increased level in the nucleotide alarmone (p)ppGpp, as inactivation of either stress pathway (through deletion of recA or spoT/relA, respectively) in 1N cells made little to no difference to their growth rate (Figure 1—figure supplement 5B).

We noticed that, even at the restrictive temperature, the dnaC2 strain produced a sizeable fraction of cells with more than one HU-mCherry-labeled nucleoid (Figure 1—figure supplement 6A and B), indicating that the temperature-sensitive effect on DNA replication is not fully penetrant. We took advantage of this phenotypic ‘leakiness’ to measure the growth rate of cells with different numbers of nucleoids (and thus chromosomes) within the dnaC2 population. We observed a notable difference in growth rate between cells of 1, 2, and >2 nucleoids in the population, with each additional nucleoid contributing to higher cellular growth at a given cell size (Figure 1F and G). This finding is consistent with DNA-limited growth in which cellular growth rate increases with genome concentration. We obtained similar results when we calculated absolute and relative growth rates based on extracted cell volumes instead of areas (Figure 1—figure supplement 7A–F), as cell width remained largely constant during cell filamentation (Figure 1—figure supplement 7G).

A growth rate dependency on genome concentration is unlikely to be a particularity of E. coli, as we also observed a divergence in absolute and relative growth rates with increasing cell area between 1N and multi-N cells of C. crescentus (Figure 1—figure supplement 8A and B). We generated filamenting 1N and multi-N C. crescentus cells by depleting the DNA replication initiation factor DnaA (Gorbatyuk and Marczynski, 2001) and the cell division protein FtsZ (Wang et al., 2001), respectively. We confirmed the 1N vs. multi-N designation by visualizing the number of chromosomal origins of replication (one vs. multiple) per cell using the parS/ParB-eCFP labeling system (Figure 1—figure supplement 8C).

The concentration of ribosomal proteins remains relatively constant in genome-diluted E. coli cells

Ribosome content is often proposed to explain the exponential growth of biomass in bacteria, with growth rate being directly proportional to ribosome concentration (Bremer and Dennis, 2008; Ecker and Schaechter, 1963; Scott et al., 2014; Scott et al., 2010). Therefore, we first quantified the fluorescence concentration of a monomeric superfolder green fluorescent protein (msfGFP) fusion to the ribosomal protein RpsB (expressed from the native chromosomal locus) in 1N and multi-N cells in M9glyCAAT as a function of cell area. We found it to be almost identical between the two CRISPRi strains and relatively constant across cell areas, regardless of DNA content (Figure 2A).

Lower ribosome activity explains the reduced growth rate of 1N cells growing in M9glyCAAT.

(A) RpsB-msfGFP fluorescence concentration in 1N (6542 cells, CJW7478) and multi-N (10,537 cells, CJW7564) cells as a function of cell area. Lines and shaded areas denote mean ± SD from three experiments. (B) Relative protein concentration of different ribosomal proteins in 1N-rich (SJ_XTL676) and multi-N (SJ_XTL229) cells by tandem-mass-tag (TMT)-mass spectrometry (MS). 1N-rich cells were collected 0, 120, 180, 240, and 300 min after addition of 0.2% arabinose, while multi-N cells were collected after 0, 60, and 120 min of induction. Blue and cyan represent two independent experiments. Only proteins with at least four peptide measurements are plotted. (C) Apparent diffusion coefficients (D_a) of JF549-labeled RpsB-HaloTag in wild-type (WT) (32,410 tracks from 771 cells, CJW7528), 1N (848,367 tracks from 2478 cells, CJW7529), and multi-N cells (107,095 tracks from 1139 cells, CJW7530). Only tracks of length ≥9 displacements are included. 1N cells are color-binned according to their cell area while multi-N cells contain aggregated data for ~2–10 µm² cell areas. (D) D_a in WT cells fitted by a three-state Gaussian mixture model (GMM): 77 ± 1%, 20 ± 1%, and 3.2 ± 0.5% (± standard error of the mean [SEM]) of the ribosome population, from the slowest moving to the fastest moving (32,410 tracks from 771 cells). (E) Example WT and 1N cells where active (red, slow-moving) and inactive (gray, fast-moving) ribosomes are classified according to the GMM. (F) Active (slow-moving) ribosome fraction in individual WT (237 cells) and 1N (2453 cells) cells as a function of cell area. Only cells with ≥50 tracks are included. Lines and shaded areas denote mean and 95% confidence interval (CI) of the mean from bootstrapping. (G) Same as (F) but for WT (237 cells) and multi-N (683 cells) cells. (H) Absolute growth rate of 1N and multi-N cells (Figure 1C) as a function of cell area was overlaid with the total active ribosome amount (calculated from A, F, and G). Lines and shaded areas denote mean and 95% CI of the mean from bootstrapping. All microscopy data are from three biological replicates. msfGFP, monomeric superfolder green fluorescent protein.

To exclude the possibility that the msfGFP tag altered the synthesis of RpsB or that this protein behaved differently from other ribosomal proteins, we adapted a TMT-MS method recently developed to examine cell size-dependent proteome scaling in yeast and human cells (Lanz et al., 2022). Note that, for the CRISPRi oriC strain, a minority (~10–15%) of cells have more than one nucleoid. These cells were excluded from the analysis of our single-cell microscopy experiments. However, this could not be done for the TMT-MS experiments, which provide population-level measurements. Therefore, for this TMT-MS section, we will refer to the CRISPRi oriC cell population as ‘1N-rich’ cells, instead of only ‘1N’ cells. Using the TMT-MS approach, we found that the relative concentration of all (54) high-abundance ribosomal proteins (including untagged RpsB) remained approximately constant across all sizes of 1N-rich cells, and was similar between 1N-rich and multi-N cells (Figure 2B). Only the relative concentration of the ribosomal protein L31B, a stationary phase paralog of the more prevalent exponential phase ribosomal protein L31A (Lilleorg et al., 2019), significantly decreased in 1N-rich cells (Supplementary file 1). Thus, the concentration of ribosomal proteins does not explain the difference in growth rate between cells with different ploidy.

The fraction of active ribosomes is reduced in genome-diluted cells

To more specifically probe the translational activity of ribosomes in 1N cells, we performed single-molecule tracking in live cells growing in M9glyCAAT. Ribosomes are expected to exhibit at least two different dynamic states: slow mobility when active (i.e. engaged in translation on the mRNA, often in polyribosome form), and faster mobility when inactive ribosomes (or ribosomal subunits) are diffusing in the cytoplasm (Mohapatra and Weisshaar, 2018; Sanamrad et al., 2014). To track ribosomes, we introduced a HaloTag fusion to RpsB (through genetic modification at the endogenous chromosomal locus) and labeled the HaloTag using the membrane-permeable Janelia Fluor 549 (JF549) fluorescent dye (Grimm et al., 2015). We quantified the apparent diffusion coefficient (D_a) of single-molecule tracks in WT cells, as well as in 1N and multi-N cells at multiple time points following CRISPRi induction (Figure 2C). We found that the distribution of D_a in multi-N cells of all sizes (~2–10 µm²) was similar to that in WT cells despite the considerable differences in cell sizes. In contrast, 1N cells displayed distributions clearly distinct from WT and multi-N cells, gradually shifting toward faster mobilities (higher D_a) with increasing cell size. This shift suggests that ribosome activity is altered in 1N cells.

Gaussian fitting of the log-transformed D_a data in WT cells revealed two predominant dynamic states of ribosomes: a slow-diffusing and a fast-diffusing state, representing 77 ± 1% (mean ± standard error of the mean [SEM]) and 20 ± 1% of the ribosome population, respectively (Figure 2D). In addition, we observed a small fraction (3.2 ± 0.5%) of faster-moving molecules with D_a expected for freely diffusing proteins (Banaz et al., 2019; Elowitz et al., 1999), likely indicative of a small pool of free RpsB-HaloTag proteins (i.e. not assembled into ribosomes). To confirm that the slow-diffusing fraction corresponded to translationally active ribosomes, we showed that this fraction nearly vanished (down to 1.10 ± 0.02%) when cells were depleted of mRNAs following 30 min treatment with the transcription inhibitor rifampicin (Figure 2—figure supplement 1). The estimated fraction (~77%) of active ribosomes in untreated cells was in good agreement with previous single-molecule and biochemical studies under similar growth conditions (Forchhammer and Lindahl, 1971; Mohapatra and Weisshaar, 2018; Sanamrad et al., 2014).
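To illustrate how a three-state Gaussian mixture can be fitted to logarithmic D_a values, here is a minimal sketch using a plain expectation-maximization (EM) loop in Python. It is not the authors' fitting code. The function names, the synthetic data, the state means and widths, and the initial guesses are all hypothetical; only the three state fractions (77%, 20%, and 3%) are taken from the text, and they are used solely to generate test data.

```python
import math
import random

def normal_pdf(x, mu, sd):
    z = (x - mu) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))

def fit_gmm_1d(values, init_means, init_sd=0.3, n_iter=60, min_sd=1e-3):
    """Fit a 1D Gaussian mixture by expectation-maximization (EM)."""
    k, n = len(init_means), len(values)
    means = list(init_means)
    sds = [init_sd] * k
    weights = [1.0 / k] * k
    for _ in range(n_iter):
        # E-step: posterior probability of each state for each track.
        resp = []
        for v in values:
            p = [w * normal_pdf(v, m, s) for w, m, s in zip(weights, means, sds)]
            total = sum(p) or 1e-300
            resp.append([pj / total for pj in p])
        # M-step: update the weight, mean, and SD of each state.
        for j in range(k):
            nj = max(sum(r[j] for r in resp), 1e-12)
            weights[j] = nj / n
            means[j] = sum(r[j] * v for r, v in zip(resp, values)) / nj
            var = sum(r[j] * (v - means[j]) ** 2 for r, v in zip(resp, values)) / nj
            sds[j] = max(math.sqrt(var), min_sd)
    return weights, means, sds

# Synthetic log10(D_a) values. The means and SDs are hypothetical.
rng = random.Random(0)
true_states = [(0.77, -1.5, 0.2), (0.20, -0.5, 0.2), (0.03, 0.7, 0.2)]
log_da = []
for _ in range(1500):
    _, mu, sd = rng.choices(true_states, weights=[s[0] for s in true_states])[0]
    log_da.append(rng.gauss(mu, sd))

# Initial means are rough guesses, ordered from slowest to fastest state.
weights, means, sds = fit_gmm_1d(log_da, init_means=(-1.6, -0.4, 0.8))
```

The fitted `weights` estimate the fraction of tracks in each state, so `weights[0]` plays the role of the slow-diffusing (active) fraction. EM can settle on a local optimum, which is why the sketch starts from initial means ordered from slowest to fastest.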

Upon fitting the D_a values of ribosomes in WT and 1N cells (Figure 2E), we observed a significant reduction in the slow-diffusing ribosome population in 1N cells of increasing area (Figure 2—figure supplement 2). Quantification of the active (slow-diffusing) ribosome fraction per cell revealed that 1N cells have overall lower ribosome activity than WT cells, and that ribosome activity decreases monotonically with increasing cell area (Figure 2F). In contrast, ribosome activity in multi-N cells remained the same as in WT across different cell sizes (Figure 2G).

To estimate the total number of active ribosomes per cell, we multiplied the total amount of ribosomes by the fraction of active ribosomes and plotted the result as a function of cell area (Figure 2H). We found that the difference in the total number of active ribosomes between 1N and multi-N cells matches the observed difference in growth rate (Figure 2H), indicating that cell growth rate is directly proportional to the increase in total active ribosomes. Altogether, the results are consistent with the hypothesis that DNA limitation decreases total ribosome activity, which, in turn, reduces the growth rate.
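The calculation in this paragraph can be sketched in a few lines of Python. The sketch assumes that the total ribosome amount is approximated by fluorescence concentration multiplied by cell area. The function name `total_active` and all numerical values are hypothetical and serve only to show how a falling active fraction produces sub-linear scaling.

```python
def total_active(concentration, area, active_fraction):
    """Total active amount = (concentration x area) x active fraction.

    concentration x area is used as a proxy for the total amount
    (an assumption made for this sketch).
    """
    return concentration * area * active_fraction

# Hypothetical values: constant concentration (arbitrary units per µm²).
conc = 100.0

# Multi-N-like case: the active fraction stays constant as area doubles.
multi_small = total_active(conc, 4.0, 0.77)
multi_large = total_active(conc, 8.0, 0.77)

# 1N-like case: the active fraction falls as area doubles.
one_n_small = total_active(conc, 4.0, 0.70)
one_n_large = total_active(conc, 8.0, 0.50)

multi_fold = multi_large / multi_small  # doubles with cell area
one_n_fold = one_n_large / one_n_small  # increases less than twofold
```

With a constant concentration, a constant active fraction gives a total active amount proportional to cell area, whereas a decreasing active fraction gives a total that grows more slowly than cell area.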

Genome dilution reduces the activity of RNAPs

We reasoned that the observed changes in ribosome activity in 1N cells may reflect the available pool of transcripts. If true, we would expect the total activity of RNAPs to be reduced in 1N cells. The total activity of RNAPs in cells is determined by the concentration of RNAPs multiplied by the fraction of active RNAPs. Therefore, we first determined whether RNAP concentration was lower in 1N cells relative to multi-N cells by quantifying the fluorescence intensity of a functional fusion of YFP to the RNAP β’ subunit (encoded by rpoC) expressed from its native chromosomal locus. As expected, RNAP concentration remained constant in multi-N cells (Figure 3A). In 1N cells, the RNAP concentration increased with cell size (Figure 3A), the opposite of what would be expected to explain the growth rate defect. We confirmed this increasing trend in concentration for other protein subunits of the core RNAP and the primary sigma factor σ⁷⁰ (encoded by rpoD) using TMT-MS (Figure 3B), clearly demonstrating that the abundance of RNAPs was not the limiting factor.

RNA polymerase (RNAP) activity is reduced in 1N cells growing in M9glyCAAT.

(A) RpoC-YFP fluorescence concentration in 1N (3580 cells, CJW7477) and multi-N (5554 cells, CJW7563) cells as a function of cell area. Lines and shaded areas denote mean ± SD from three experiments. (B) Relative protein concentration of core RNAP subunits and σ⁷⁰ in 1N-rich (SJ_XTL676) and multi-N (SJ_XTL229) cells by tandem-mass-tag (TMT)-mass spectrometry (MS). 1N-rich cells were collected 0, 120, 180, 240, and 300 min after addition of 0.2% L-arabinose, while multi-N cells were collected after 0, 60, and 120 min of induction. (C) Apparent diffusion coefficients of JF549-labeled RpoC-HaloTag in wild-type (WT) (91,280 tracks from 1000 cells, CJW7519), 1N (175,884 tracks from 1219 cells, CJW7520) and multi-N cells (186,951 tracks from 1040 cells, CJW7527). Only tracks of length ≥9 displacements are included. 1N cells are binned according to cell area while multi-N cells contain aggregated data for ~2–15 µm² cell areas. (D) D_a in WT cells fitted by a three-state Gaussian mixture model (GMM): 49 ± 4%, 49 ± 4%, and 2 ± 0.1% (± standard error of the mean [SEM]) of the RNAP population, from the slowest moving to the fastest moving (91,280 tracks from 1000 cells). (E) Example WT and 1N cells where active (red, slow-moving) and inactive (gray, fast-moving) RNAPs are classified according to the GMM. (F) Active RNAP fraction in individual WT (854 cells) and 1N (1024 cells) cells as a function of cell area. Only cells with at least 50 tracks are included. Lines and shaded areas denote mean ± 95% CI of the mean from bootstrapping (three experiments). (G) Same as (F) but for WT (854 cells) and multi-N (924 cells) cells. (H) Total amount of active RNAP in WT, 1N, and multi-N cells as a function of cell area (calculated from A, F, and G). Also shown is a linear fit to multi-N data ($f(x) = 4.16 \cdot 10^{4} \cdot x$, R² = 0.98). Lines and shaded areas denote mean and 95% CI of the mean from bootstrapping. All microscopy data are from three biological replicates.

To quantify RNAP activity in 1N and multi-N cells, we performed single-molecule tracking in live cells using a functional fusion of HaloTag to the β’ protein subunit RpoC labeled with the JF549 dye. As expected, the D_a values of RpoC-HaloTag in multi-N cells were distributed similarly to those in WT cells (Figure 3C). In contrast, the distribution in 1N cells changed gradually toward higher D_a values (faster mobility) with increasing cell size (Figure 3C). As with ribosomes, RNAPs primarily exhibited two major states of diffusivity (Figure 3D): a slower-diffusing fraction (49 ± 4%; mean ± standard error of the mean [SEM]) and a faster-diffusing fraction (49 ± 4%), likely representing transcriptionally active RNAPs and inactive, diffusing RNAPs, respectively. A small fraction of RpoC-HaloTag (2 ± 0.1%) diffused very fast, with D_a values expected for free proteins, suggesting that it reflects the few β’ proteins not assembled into the RNAP core complex. Using rifampicin treatment, we confirmed that the slowest state corresponds to RNAPs actively engaged in transcription (Figure 3—figure supplement 1). In these rifampicin-treated cells, the slow-diffusing fraction was reduced to 13 ± 4%. Rifampicin does not prevent promoter binding or open complex formation and instead blocks transcription elongation following the synthesis of 3-nucleotide-long RNAs (Campbell et al., 2001). Thus, the observation that slow-moving RNAPs did not completely disappear after rifampicin treatment is consistent with the mechanism of action of the drug, leaving a fraction of RNAPs bound at promoter sites.

Unlike in WT and multi-N cells, the fraction of active RNAPs in 1N cells decreased monotonically with increasing cell area (Figure 3F and G). However, because the RNAP concentration simultaneously increased in 1N cells, it remained possible that the total amount of active RNAP, which is the relevant metric of transcription activity, remained equal to that of multi-N cells. By calculating the total amount of active RNAPs, we showed that the decrease in the active fraction in 1N cells was not the mere result of the increase in RNAP concentration. Indeed, the total amount of active RNAPs hardly increased with cell size in 1N cells whereas it increased proportionally with cell size in both multi-N and WT cells (Figure 3H).
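Proportional scaling of total active RNAP with cell size can be checked with a least-squares fit of a line through the origin, f(x) = k·x. The Python sketch below shows one way to do this. The function name, the data points, and the R² convention (residual sum of squares relative to the variance about the mean) are assumptions made for illustration and are not taken from the authors' analysis.

```python
def fit_proportional(xs, ys):
    """Least-squares fit of y = k * x (no intercept); returns (k, r_squared)."""
    k = sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - k * x) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return k, 1.0 - ss_res / ss_tot

# Hypothetical data: cell areas (µm²) and total active amounts (a.u.).
areas = [2.0, 4.0, 6.0, 8.0]
proportional = [6.0, 12.0, 18.0, 24.0]  # scales with area (multi-N-like)
plateauing = [6.0, 9.0, 10.0, 10.5]     # saturates with area (1N-like)

k_prop, r2_prop = fit_proportional(areas, proportional)
k_plat, r2_plat = fit_proportional(areas, plateauing)
```

A proportional fit describes the first data set exactly, while the plateauing data set gives a much lower R², reflecting a total active amount that hardly increases with cell size.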

A recent study has shown that the intracellular concentration of Rsd, the anti-sigma factor of σ⁷⁰, increases in WT cells under slower growth conditions, causing a reduction in global mRNA synthesis (Balakrishnan et al., 2022). Therefore, we verified that the concentration of Rsd remains approximately constant in both 1N-rich and multi-N cells based on our TMT-MS data (Figure 3—figure supplement 2), eliminating Rsd as a possible source of reduced RNAP activity in 1N cells. Instead, our data support the notion that substrate (DNA) limitation leads to a reduced transcription rate, which reduces the pool of transcripts available for ribosomes.

Chromosome dilution reduces the concentration of transcripts

To test the idea that genome dilution affects growth rate through transcript limitation, we performed live-cell staining with SYTO RNASelect, a fluorogenic RNA-specific dye (Wu et al., 2020). This dye has been proposed to preferentially bind mRNAs based on the observed decay of intracellular RNASelect signal in E. coli during rifampicin treatment (Bakshi et al., 2014), which causes mRNA depletion. However, a recent study has shown that the levels of ribosomal RNAs (rRNAs) also decrease in rifampicin-treated cells (Hamouche et al., 2021), though at a slower rate than mRNAs. Therefore, to complement the RNASelect staining experiments and examine the potential effect of genome dilution specifically on rRNAs, we also carried out fluorescence in situ hybridization (FISH) microscopy on fixed cells using EUB338-Cy3, a DNA probe complementary to an exposed region in the 16S rRNA (Amann et al., 1990). For both experiments, we mixed 1N cells with multi-N cells of similar size ranges prior to incubation with RNASelect or EUB338-Cy3 to mitigate variability in staining. We next imaged the mixed populations and distinguished 1N cells from multi-N cells by examining the difference in nucleoid number (one vs. multiple) per cell using HU-mCherry or DAPI as a DNA marker (Figure 4A). Single cells were sampled to ensure that the cell area distributions of the two populations matched (Figure 4—figure supplement 1).
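One simple way to obtain two populations with matching cell area distributions is to bin cells by area and keep the same number of cells from each population in every bin. The Python sketch below illustrates this idea. It is an assumption about how such sampling could be done, not a description of the authors' procedure, and the function name, bin width, and example areas are hypothetical.

```python
import random
from collections import defaultdict

def match_area_distributions(areas_a, areas_b, bin_width=0.5, seed=0):
    """Return indices of cells to keep from each population so that both
    have the same number of cells in every area bin."""
    rng = random.Random(seed)

    def bin_cells(areas):
        bins = defaultdict(list)
        for idx, area in enumerate(areas):
            bins[int(area // bin_width)].append(idx)
        return bins

    bins_a, bins_b = bin_cells(areas_a), bin_cells(areas_b)
    keep_a, keep_b = [], []
    # Only bins populated in both populations can contribute cells.
    for b in sorted(set(bins_a) & set(bins_b)):
        n = min(len(bins_a[b]), len(bins_b[b]))
        keep_a.extend(rng.sample(bins_a[b], n))
        keep_b.extend(rng.sample(bins_b[b], n))
    return keep_a, keep_b

# Hypothetical cell areas (µm²) for two populations.
areas_1n = [2.1, 2.3, 2.4, 4.2, 6.0]
areas_multi = [2.2, 4.1, 4.3, 4.4, 9.0]
keep_1n, keep_multi = match_area_distributions(areas_1n, areas_multi)
```

After sampling, both populations contain the same number of cells in each area bin, so a difference in signal concentration cannot be attributed to a difference in cell size.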

RNASelect and EUB338 concentration measurements in 1N and multi-N cells.

(A) Images of representative cells from a mixed population of 1N (CRISPR interference [CRISPRi] *oriC*) and multi-N (CRISPRi *ftsZ*) cells. Strains CJW7457 and CJW7576 carrying HU-mCherry were used for the SYTO RNASelect staining experiment, whereas DAPI-stained strains SJ_XTL676 and SJ_XTL229 were used for the EUB338 ribosomal RNA (rRNA) fluorescence in situ hybridization (FISH) experiment. (B) Concentration distribution of SYTO RNASelect (3077 cells for each population from five biological replicates) and EUB338 (1254 cells for each population from three biological replicates) in 1N and multi-N cells. (C) The average 1N/multi-N SYTO RNASelect and EUB338 concentration ratio (gray bar) calculated from five and three biological replicates (white circles), respectively. (D) RNASelect and EUB338 concentration ratios as functions of cell area (mean ± SD from five and three biological replicates, respectively). Single exponential decay functions were fitted to the average ratios (R²>97%) for each indicated reporter. All concentration comparisons or ratio calculations were performed for equal numbers of 1N and multi-N cells and overlapping cell area distributions (see Materials and methods and Figure 4—figure supplement 1).

Comparison between the two sampled populations revealed that the concentration of RNASelect signal was ~50% lower in 1N cells than in multi-N cells for a cell size range of 4–10 µm² (Figure 4B and C). For a similar cell area range, the EUB338 signal concentration was reduced by only ~5%. Furthermore, the RNASelect concentration ratio between 1N and multi-N cells displayed a rapid exponential decay with increasing cell area, whereas the decrease in EUB338 concentration ratio was considerably slower (Figure 4D).
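As an illustration of how a single exponential decay could be fitted to a concentration ratio as a function of cell area, the following minimal sketch uses a log-linear least-squares fit. The functional form `ratio = a * exp(-k * area)` (no plateau term), the function name, and the synthetic numbers are assumptions made for this example; they are not the fitting procedure or the measurements behind Figure 4D.

```python
import math


def fit_exponential_decay(areas, ratios):
    """Fit ratio = a * exp(-k * area) by ordinary least squares on ln(ratio).

    Assumes all ratios are positive. Returns (a, k).
    """
    ys = [math.log(r) for r in ratios]
    n = len(areas)
    mean_x = sum(areas) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in areas)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(areas, ys))
    slope = sxy / sxx                      # slope of ln(ratio) vs. area equals -k
    a = math.exp(mean_y - slope * mean_x)  # intercept of the log-linear fit
    return a, -slope


# Synthetic, noise-free example (made-up values, not data from the study).
areas = [4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0]          # cell area in µm²
ratios = [1.2 * math.exp(-0.3 * x) for x in areas]     # 1N/multi-N ratio
a_fit, k_fit = fit_exponential_decay(areas, ratios)
print(f"a = {a_fit:.3f}, k = {k_fit:.3f} per µm²")
```

A larger fitted `k` corresponds to a faster decay of the ratio with cell area, which is the sense in which the RNASelect ratio decays more rapidly than the EUB338 ratio.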

To verify that the decrease in RNASelect signal in 1N cells was not caused by a global change in membrane permeability to small molecules, we performed similar live-cell staining experiments with the HaloTag dye JF549 in CRISPRi strains expressing RpoC-HaloTag (Figure 4—figure supplement 2A). We matched the cell distributions between 1N and multi-N cells for fair comparison (Figure 4—figure supplement 2B). Because RpoC concentration increases with cell size in 1N cells relative to multi-N cells (Figure 3A and B), we expected a similar increase in the ratio of JF549 signal between these two cell types if the membrane permeability to small molecules remained unchanged. This is indeed what we observed (Figure 4—figure supplement 2C–E). In parallel, to examine the ability of our rRNA FISH method to detect a reduction in 16S rRNA concentrations, we compared the EUB338 staining of WT cells (MG1655) growing in M9 glycerol with or without casamino acids and thiamine (M9glyCAAT vs. M9gly). These two media yield a ~40% difference in growth rate (Govers et al., 2024), and cells in the nutrient-poor medium are expected to have a lower concentration of ribosomes and thus of 16S rRNAs. Consistent with this expectation, we found that the EUB338 signal concentration was reduced by ~50% in M9gly relative to M9glyCAAT (Figure 4—figure supplement 3). Given these validations, our results in Figure 4 suggest that the RNASelect signal primarily reflects the bulk of mRNAs, and that the concentration of mRNAs decreases more rapidly than that of rRNAs upon genome dilution.

DNA dilution can result in sub-exponential growth through mRNA limitation

In a previous theoretical study, Lin and Amir (2018) considered distinct scenarios for gene expression. Their model predicted that if DNA and mRNAs are in excess, cells will display exponential growth. On the other hand, cells will adopt linear growth if DNA and mRNAs become limiting. Our experiments showed that 1N cells indeed converge toward linear growth (toward slope 0 in Figure 1C), though the complete transition to linear growth required a large decrease in DNA concentration. To quantitatively examine this transition from exponential to linear growth through genome dilution, we developed two deterministic ordinary differential equation (ODE) models of the flow of genetic information that include parameters for the fractions of active RNAPs and ribosomes. In these models, the dynamics of mRNA ($X$) and protein ($Y$) numbers in the cell are described by

$$\frac{dX}{dt} = r_{1}\, α_{RNAP}(X, Y)\, Y - δ X$$

$$\frac{dY}{dt} = r_{2}\, α_{ribo}(X, Y)\, Y$$

where $r_{1}$ is the bulk transcription rate normalized by the total protein number, $r_{2}$ is the bulk translation rate normalized by the total protein number, and $δ$ is the mRNA degradation rate. The quantities $α_{RNAP}(X, Y)$ and $α_{ribo}(X, Y)$ are the fractions of active RNAPs and ribosomes expressed as a percentage of the total RNAPs and ribosomes, respectively. For simplicity, we assumed that protein degradation is negligible and that the cell volume and the number of rRNAs grow in proportion to the protein number $Y$ (Balakrishnan et al., 2022; Lin and Amir, 2018). As a result, the rate of protein increase $\frac{dY}{dt}$ corresponds to the absolute growth rate and the relative protein increase rate $\frac{1}{Y} \frac{dY}{dt}$ corresponds to the relative growth rate. For detailed description and estimation of the model parameters, see Supplementary file 2 and Appendices 1 and 2.
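The link between the protein equation and the two growth regimes can be made explicit with a short derivation. This is a sketch of two limiting cases only; the symbols $\lambda$, $\bar{\alpha}$, and $k$ are introduced here for illustration and are not defined in the model description above.

```latex
% Dividing dY/dt = r_2 \alpha_{ribo} Y by Y gives the relative growth rate:
\lambda(t) \equiv \frac{1}{Y}\frac{dY}{dt} = r_{2}\,\alpha_{ribo}(X, Y)

% Limiting case 1: the active ribosome fraction stays constant,
% \alpha_{ribo} = \bar{\alpha}
\frac{dY}{dt} = r_{2}\,\bar{\alpha}\,Y
\quad\Longrightarrow\quad
Y(t) = Y(0)\,e^{r_{2}\bar{\alpha}\,t}
\qquad \text{(exponential growth)}

% Limiting case 2: the number of active ribosomes is capped,
% \alpha_{ribo}\,Y = k
\frac{dY}{dt} = r_{2}\,k
\quad\Longrightarrow\quad
Y(t) = Y(0) + r_{2}\,k\,t
\qquad \text{(linear growth)}
```

In the first case the relative growth rate is constant, whereas in the second it falls as $r_{2} k / Y$ while the absolute growth rate stays fixed.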

Based on the functional form of $α_{RNAP}(X, Y)$, we considered two ODE model variants. In model A, we assumed that DNA is a limiting factor while RNAPs are not. In model B, both DNA and RNAPs were considered as growth-limiting factors. In both models, higher DNA concentration increases the probability that an RNAP will encounter and bind to a promoter. In model terms, $α_{RNAP}$ (as well as the downstream transcription rate) increases with DNA concentration. In model A, we examined the effect of DNA limitation with minimal mathematical complexity by assuming that the proteome does not change (see Materials and methods). In model B, we considered RNAP kinetics (with three different RNAP states: free, promoter-bound, and transcribing) based on the law of mass action (see Materials and methods and Appendix 3) and took into consideration the experimentally observed increase in RNAP concentration in 1N cells (Figure 3A and B). For both models, $α_{RNAP}$ depended on DNA concentration.

We used these models to perform simulations and compared the results to our measurements, starting with parameter values extracted or estimated from the E. coli literature (Supplementary file 3). In 1N cells, the DNA amount was fixed to one genome while it scaled with cell volume in multi-N cells. The parameters were then optimized to fit six experimental datasets simultaneously: cell growth rate, the fraction of active RNAPs, and the fraction of active ribosomes in both 1N and multi-N cells (see Materials and methods and Appendix 4).

As shown in Figure 5A–D (model A) and Figure 5—figure supplement 1 (model B), both models performed similarly after parameter optimization. While the model curves (solid lines) did not perfectly match the average behavior of our experimental results (open squares), they displayed similar trends and fell within the variance of the single-cell data (dots). The models showed that multi-N cells (blue) display balanced exponential growth while the 1N cells (yellow) exhibit sub-exponential growth (Figure 5A and Figure 5—figure supplement 1A), consistent with experiments. At the same time, both models recapitulated the observed experimental trends in active fractions of both ribosomes and RNAPs, which remained constant in multi-N cells while decaying gradually with DNA concentration in 1N cells (Figure 5B–D and Figure 5—figure supplement 1B–D).

Figure 5 with 2 supplements.

Mathematical modeling of DNA limitation.

(A–C) Plots comparing simulation results of model A (solid lines) with experimental data points (dots) and averages (open squares) in the M9glyCAAT condition. The multi-N and 1N cells are indicated as blue and yellow, respectively: (A) The relation between the absolute growth rate ($\frac{dA}{dt}$) and cell area ($A$). (B) The relation between the active RNA polymerase (RNAP) fraction and cell area. (C) The relation between the active ribosome fraction and cell area. (D) Diagram showing how the fractions of active RNAPs and ribosomes change with DNA concentration (colored from yellow to blue). Simulated results (filled dots) are based on model A. Experimental data (points with 2D error bars: 95% CI) from multi-N and 1N cells were combined and shown in the same plot. (E) Plot showing the effect of DNA limitation (using the ordinary differential equation [ODE] model A) on the decay of DNA concentration, mRNA concentration, and relative growth rate in 1N cells. Each quantity was normalized to its value at normal cell size (cell area = 2.5 µm²).

The simulation results of 1N cells suggest the following cascade of events when DNA is limiting. Lower DNA concentration results in fewer substrates for RNAPs, which reduces the transcription rate. This results in a decrease in mRNA concentration. As mRNAs become limiting, the fraction of ribosomes engaged in translation decreases. This, in turn, decreases the rate of bulk protein synthesis, which decreases the relative growth rate. The greater the DNA dilution (through cell growth), the more severe the downstream effects become, explaining the decay in relative growth rate in 1N cells (Figure 5E).
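The cascade described above can be reproduced qualitatively with a toy integration of the two ODEs. The sketch below is not model A or model B: the saturating forms chosen for the active fractions, the half-saturation constants `k_dna` and `k_mrna`, and all parameter values are hypothetical and in arbitrary units, picked only to show how a fixed genome copy number leads to decaying relative growth while a genome that scales with size gives balanced growth.

```python
# Illustrative parameters in arbitrary units; NOT the fitted values from the study.
PARAMS = {
    "r1": 2.0,      # bulk transcription rate per protein (1/min)
    "r2": 0.03,     # bulk translation rate per protein (1/min)
    "delta": 0.5,   # mRNA degradation rate (1/min)
    "k_dna": 1.0,   # hypothetical DNA concentration for half-maximal RNAP activity
    "k_mrna": 1.0,  # hypothetical mRNA concentration for half-maximal ribosome activity
}


def derivatives(X, Y, genomes, p):
    """Return (dX/dt, dY/dt) for mRNA number X and protein number Y."""
    dna_conc = genomes / Y   # cell volume is taken to be proportional to Y
    mrna_conc = X / Y
    alpha_rnap = dna_conc / (dna_conc + p["k_dna"])      # assumed saturating form
    alpha_ribo = mrna_conc / (mrna_conc + p["k_mrna"])   # assumed saturating form
    dX = p["r1"] * alpha_rnap * Y - p["delta"] * X
    dY = p["r2"] * alpha_ribo * Y
    return dX, dY


def balanced_mrna_per_protein(p, dt=0.01, t_burn=100.0):
    """Relax X/Y to its balanced-growth value while DNA scales with size."""
    X, Y = 1.0, 1.0
    for _ in range(int(t_burn / dt)):
        dX, dY = derivatives(X, Y, Y, p)   # genomes = Y keeps DNA concentration at 1
        X, Y = X + dt * dX, Y + dt * dY
    return X / Y


def simulate(one_genome, p, t_end=200.0, dt=0.01):
    """Forward-Euler integration; returns protein numbers and relative growth rates."""
    Y = 1.0
    X = balanced_mrna_per_protein(p, dt) * Y
    sizes, rates = [], []
    for _ in range(int(t_end / dt)):
        genomes = 1.0 if one_genome else Y   # 1N: fixed genome; multi-N: scales with size
        dX, dY = derivatives(X, Y, genomes, p)
        sizes.append(Y)
        rates.append(dY / Y)                 # relative growth rate (1/Y) dY/dt
        X, Y = X + dt * dX, Y + dt * dY
    return sizes, rates


multi_sizes, multi_rates = simulate(False, PARAMS)
one_sizes, one_rates = simulate(True, PARAMS)
print(f"multi-N relative growth rate: start {multi_rates[0]:.4f}, end {multi_rates[-1]:.4f}")
print(f"1N relative growth rate:      start {one_rates[0]:.4f}, end {one_rates[-1]:.4f}")
```

Both simulations start from the same balanced state, so any divergence between them comes only from whether the genome number is held fixed or allowed to scale with size.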

Genome dilution rapidly limits RNAP activity under both nutrient-rich and -poor conditions, but the extent of downstream effects on ribosome activity and cell growth can vary with the nutrient condition

In the relatively nutrient-rich M9glyCAAT condition, WT cells at birth are expected to have higher DNA content than 1N cells on average due to overlapping DNA replication (Fossum et al., 2007). To examine whether cells are also subject to DNA-limited transcription when multi-fork DNA replication is rare or nonexistent, we examined the total RNAP activity of 1N cells relative to WT cells in two different nutrient-poor media, M9gly and M9 L-alanine (M9ala). Abundance and diffusivity measurements of RpoC-labeled RNAPs (Figure 6—figure supplement 1) showed that the scaling between the total amount of active RNAPs (i.e. global transcriptional activity) and cell area was strongly reduced in 1N cells, even within the range of WT cell sizes (Figure 6A and B). Thus, genome dilution rapidly limits global transcription in nutrient-poor (slow growth) conditions, as in richer (faster growth) conditions (Figure 3H).

Figure 6 with 2 supplements.

Scaling of the total active RNA polymerases (RNAPs), total active ribosomes, and growth rate with cell area during genome dilution in nutrient-poor media.

(A) Plot showing the total amount of active RNAPs (calculated by multiplying the total amount of RNAPs by the fraction of active RNAPs from Figure 6—figure supplement 1A and G) in wild-type (WT) (CJW7339) and 1N (CJW7457) cells grown in M9gly as a function of cell area. Also shown is a linear fit to WT data ($f(x) = 3.99 \cdot 10^{4} \cdot x$, R²=0.90). Shaded areas denote 95% CI of the mean from bootstrapping. All data are from three biological replicates. (B) Same as (A) but for cells grown in M9ala (calculated from Figure 6—figure supplement 1B and H). The linear fit for WT data is $f(x) = 3.21 \cdot 10^{4} \cdot x$, R²=0.95. (C) Plot showing the total active ribosome amount of 1N and multi-N cells grown in M9gly as a function of cell area. The total amount of active ribosomes was calculated by multiplying the total amount of ribosomes by the fraction of active ribosomes (from Figure 6—figure supplement 2A and G). Also shown is a linear fit to WT data ($f(x) = 2.99 \cdot 10^{4} \cdot x$, R²=0.97). Lines and shaded areas denote mean and 95% CI of the mean from bootstrapping. All data are from three biological replicates. (D) Same as (C) but for cells grown in M9ala (calculated from Figure 6—figure supplement 2B and H). Here, the linear fit to the WT data is $f(x) = 1.90 \cdot 10^{4} \cdot x$, R²=0.99. (E) Absolute growth rate in 1N (50,352 datapoints from 973 cells) and WT (80,269 datapoints from 12,544 cells) cells in M9gly. The linear fit for WT data is $f(x) = 6.50 \cdot 10^{-3} \cdot x$, R²=0.99. (F) Absolute growth rate in 1N (71,736 datapoints from 909 cells) and WT (63,367 datapoints from 6880 cells) cells in M9ala. The linear fit for WT data is $f(x) = 4.05 \cdot 10^{-3} \cdot x$, R²=0.97. Lines and shaded areas denote mean ± SD from three biological replicates.
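The two calculations named in this legend, multiplying a total amount by an active fraction and fitting a proportional line $f(x) = a \cdot x$, can be sketched as follows. The function names and the numbers are hypothetical, and a least-squares fit constrained through the origin is an assumption about how such a line could be obtained, not a statement of the procedure used for the figure.

```python
def total_active(total_amount, active_fraction):
    """Total active molecules = total amount x fraction that is active."""
    return total_amount * active_fraction


def fit_through_origin(xs, ys):
    """Least-squares slope a for the proportional model y = a * x (no intercept)."""
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


# Made-up example values (not measurements from the study).
areas = [1.0, 2.0, 3.0]                 # cell area in µm²
totals = [12000.0, 24000.0, 36000.0]    # total RNAP signal, arbitrary units
fractions = [0.25, 0.25, 0.25]          # active fraction at each area
active = [total_active(t, f) for t, f in zip(totals, fractions)]
slope = fit_through_origin(areas, active)
print(f"f(x) = {slope:.3g} * x")
```

Under proportional scaling the points fall on the fitted line; a sub-proportional population such as 1N cells would fall progressively below it as cell area increases.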

In contrast, abundance and diffusivity measurements of fluorescently labeled ribosomes in cells growing in M9gly and M9ala (Figure 6—figure supplement 2) revealed that the total amount of active ribosomes (i.e. bulk translational activity) and the absolute growth rate of 1N cells started to deviate from proportional scaling with cell area mostly when cells reached large (non-physiological) sizes (Figure 6C–F). As a result, the difference in absolute growth rate between 1N and multi-N cells was not as pronounced as in cells growing in the richer M9glyCAAT medium (Figure 6E and F vs. Figure 1C). This suggests that one or more cellular buffering activities may help mitigate the limitation of DNA concentration on transcription in nutrient-poor media (see Discussion).

Genome dilution changes the composition of the transcriptome and proteome

The fact that the relative concentrations of ribosomal proteins and RNAP subunits scaled differently with cell area in 1N cells (Figures 2A, B, 3A, B) indicated that not all genes are equally impacted by DNA dilution. In yeast and mammalian cells, a decrease in the DNA-per-volume ratio has recently been demonstrated to alter the composition of the proteome, with some proteins increasing in relative concentration while others become comparatively more diluted (Lanz et al., 2024; Lanz et al., 2022). To examine whether this effect may be conserved across domains of life, we used our proteomic TMT-MS data on the CRISPRi strains to quantify the relative concentration of each detected protein across cell areas following DNA replication or cell division arrest in M9glyCAAT. For each protein, we calculated the relative change in concentration against the relative change in cell size through regression fitting, yielding a slope value. A slope of zero indicates that the concentration of a protein remains constant relative to the proteome, whereas a slope of –1 (or 1) means that the relative concentration decreases (or increases) twofold with each cell size doubling (Figure 7A).
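A slope with the interpretation given above can be obtained by regressing the logarithm of relative concentration on the logarithm of cell area. The sketch below assumes base-2 logarithms and ordinary least squares, which is one way to make a slope of –1 correspond to a twofold decrease per size doubling; the function name and the example values are hypothetical, and the exact regression used for the proteomic data is described in the Materials and methods rather than here.

```python
import math


def scaling_slope(cell_areas, relative_concentrations):
    """Least-squares slope of log2(relative concentration) vs. log2(cell area)."""
    xs = [math.log2(a) for a in cell_areas]
    ys = [math.log2(c) for c in relative_concentrations]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Made-up examples: cell area doubles between consecutive samples.
areas = [2.0, 4.0, 8.0, 16.0]
subscaling = scaling_slope(areas, [8.0, 4.0, 2.0, 1.0])    # halves per doubling
constant = scaling_slope(areas, [3.0, 3.0, 3.0, 3.0])      # keeps pace with the proteome
superscaling = scaling_slope(areas, [1.0, 2.0, 4.0, 8.0])  # doubles per doubling
print(subscaling, constant, superscaling)
```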

Figure 7 with 4 supplements.

Proteome and transcriptome remodeling in 1N-rich cells.

(A) Schematic explaining the calculation of the protein slopes, which describes the scaling of the relative protein concentration (concentration of a given protein relative to the proteome) with cell area. (B) Plot showing the protein scaling (average slopes from two reproducible biological replicates, see Figure 7—figure supplement 1A and B) in 1N (x-axis) and multi-N (y-axis) cells across the detected proteome (2360 proteins). The colormap corresponds to a Gaussian kernel density estimation (KDE). (C) Plot showing the first principal component (PC1) used to reduce the dimensionality of the relative protein concentration during cell growth. The PC1, which represents the overall change in relative concentration regardless of the sign of the slope, explains 69% of the total variance considering both 1N-rich and multi-N cells. The x-axis corresponds to the log-transformed cell area, whereas the marker size shows the cell area increase in linear scale. (D) Correlation between average protein and RNA slopes across 2324 genes. The colormap corresponds to a KDE. (E) Relation between mRNA abundance (transcripts per million 60 min after CRISPR interference [CRISPRi] induction) and RNA slopes in 1N-rich cells. The colormap indicates a KDE (3446 genes in total). The binned data are also shown (orange markers: mean ± standard error of the mean [SEM], ~380 genes per bin). The Spearman correlation (ρ=–0.04) is considered not significant (NS, p-value>10^–10). (F) Correlation between RNA slopes and mRNA degradation rate from a published dataset (Balakrishnan et al., 2022) across genes. The colormap indicates a KDE (2570 genes with quantified slopes and positive mRNA degradation rates). The binned data are also shown (orange markers: mean ± SEM, ~280 genes per bin). A significant negative Spearman correlation (p-value<10^–10) is shown for mRNAs with a degradation rate above 0.7 min^–1. (G) RNA slope comparison between essential and non-essential genes in *E. coli*. Three different published sets of essential genes were used (Gerdes et al., 2003; Goodall et al., 2018; Hashimoto et al., 2005). The horizontal white lines indicate the inter-quartile range of each distribution. Mann-Whitney non-parametric tests indicate a significant difference (p-value<10^–10) between the two gene groups (essential vs. non-essential genes).

We found that the slope distribution was highly reproducible between biological replicates (Figure 7—figure supplement 1A and B) but drastically different between 1N-rich cells and multi-N cells (Figure 7B, Supplementary file 1). In the control multi-N cells where the genome concentration does not change with cell growth, the relative concentration of ~94% of the detected proteins (2217/2360) remained roughly constant, with their relative concentrations decreasing or increasing by less than 20% per cell size doubling (i.e. slopes>–0.2 and <0.2; Figure 7B and Supplementary file 1). This suggests that protein amounts largely scale with cell size, as generally assumed. However, in 1N-rich cells where the genome dilutes with cell growth, the proportion of detected proteins with slopes near zero (>–0.2 and <0.2) dropped to ~36% (859/2360) (Figure 7B and Supplementary file 1). A principal component analysis on the relative protein concentration during cell growth confirmed that the relative proteome composition changed proportionally with genome dilution (1N-rich cells), whereas it remained constant when the DNA-to-cell volume ratio was maintained (multi-N cells) (Figure 7C).

To examine whether the proteome scaling behavior stems from differential changes in mRNA levels, we performed transcriptomic (RNA-seq) analysis on two biological replicates of 1N-rich cells at different time points after induction of DNA replication arrest. The two replicates were strongly correlated at the transcript level (Spearman ρ=0.91, p-value<10^–10, Figure 7—figure supplement 1C). We also found a strong correlation (Spearman ρ=0.76, p-value<10^–10) in scaling behavior with cell area between mRNAs and proteins across the genome of 1N-rich cells (Figure 7D and Supplementary file 4), indicating that most of the changes in protein levels observed upon genome dilution take place at the mRNA level.
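For readers less familiar with the statistic reported here, a Spearman correlation is a Pearson correlation computed on ranks, so it measures monotonic rather than strictly linear association. The following self-contained sketch shows the calculation on made-up slope values; the function names are hypothetical and it does not compute a p-value.

```python
import math


def average_ranks(values):
    """Return 1-based ranks, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1   # average rank of the tie group
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Spearman correlation: Pearson correlation of the rank-transformed data."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


# Made-up RNA and protein slopes for five genes (not data from the study).
rna_slopes = [-0.8, -0.3, 0.0, 0.2, 0.6]
protein_slopes = [-0.5, -0.4, 0.1, 0.05, 0.9]
print(f"Spearman rho = {spearman_rho(rna_slopes, protein_slopes):.2f}")
```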

To investigate whether central processes may contribute to the observed transcriptome remodeling during DNA limitation, we examined whether the RNA slopes correlate with gene-specific rates of transcription initiation or mRNA degradation obtained from a published dataset (Balakrishnan et al., 2022). Note that the reference dataset was generated from experiments on E. coli growing in M9 glucose (M9glu) and not M9glyCAAT. However, both media give similar growth rates (Govers et al., 2024) and our transcriptome measurements agree well with the reference data in terms of mRNA abundance (Spearman ρ=0.76, p-value<10^–10, Figure 7—figure supplement 2A). We found no significant correlation between the rates of transcription initiation from the reference dataset and RNA or protein slopes across genes (Figure 7—figure supplement 2B and C). Consistent with this finding, mRNA abundance was not a predictor of RNA slopes (Figure 7E). This was somewhat surprising as one might anticipate highly transcribed genes to saturate with RNAPs faster than other genes. However, we found that the mRNA degradation rate partly explains the variance in RNA and protein slopes. Specifically, for genes producing short-lived transcripts (decay rate>0.7 min^–1), the RNA and protein slopes showed a weak negative correlation with the rate of mRNA decay (Spearman correlation coefficient ρ=–0.18, p-value<10^–10, Figure 7F and Figure 7—figure supplement 2D, Supplementary file 5). These results suggest that genes that generate short-lived mRNAs are more susceptible to DNA limitation, presumably because their mRNAs are more rapidly diluted with cell growth due to their fast decay, though we cannot exclude potential indirect effects.

Next, we examined whether genes reported to be essential for viability in three independent studies (Gerdes et al., 2003; Goodall et al., 2018; Yamazaki et al., 2008) displayed biases in RNA and protein slopes given the importance of their products for cell growth. Remarkably, essential genes, which have mRNA decay rates similar to those of other genes (Figure 7—figure supplement 2E, Mann-Whitney p-value>0.01), tended to exhibit superscaling behavior in 1N cells as shown by their enrichment in positive RNA slopes regardless of the selected dataset (Figure 7G, Mann-Whitney p-value<10^–10). This suggests that cells have evolved regulatory mechanisms to minimize dilution of mRNAs encoded by essential genes.

Discussion

Our data suggest that DNA limitation in E. coli cells affects cell growth rate through modulation of downstream transcription and translation activities (Figures 1–7 and associated figure supplements). The fact that DNA limitation for cellular growth was also observed in C. crescentus (Figure 1—figure supplement 8) is significant not only because this bacterium is distantly related to E. coli, but also because it has a different pattern of cell wall growth and distinct control mechanisms of DNA replication (Aaron et al., 2007; Banerjee et al., 2017; Frandi and Collier, 2019; Lasker et al., 2016; Terrana and Newton, 1975). This suggests that DNA concentration may be a prevalent growth constraint across bacterial species. It also helps explain why the timing of DNA replication in bacteria is so robustly linked to cell volume across environmental and genetic conditions that affect cell size (Donachie, 1968; Govers et al., 2024; Sauls et al., 2019; Si et al., 2017; Zheng et al., 2016).

Comparison with studies on eukaryotic cells suggests conservation of gene expression principles across domains of life. For instance, in yeast, it has been shown that the global transcription rate in G1-arrested cells is higher in diploids than haploids of similar sizes (Swaffer et al., 2023), consistent with DNA concentration being a limiting factor for transcription. Furthermore, in both yeast and mammalian cells, small G1-arrested cells display higher growth rate (or global RNA or protein synthesis rate) per cell volume than large ones that have exceeded a certain volume (Cadart et al., 2018; Liu et al., 2024; Neurohr et al., 2019; Lanz et al., 2022). This is likely due to a change in genome concentration rather than a change in cell volume, as the relative growth rate is unaffected in very large cells as long as they undergo a proportional increase in ploidy (Virtanen et al., 2020).

We found that even a relatively small dilution in DNA concentration, as expected in DNA replication-arrested E. coli cells that are still within or close to physiological sizes, results in a reduction of total RNAP activity in both rich and poor media (Figures 3H and 6A, B). Crude estimations suggest that ≤40% DNA dilution is sufficient to negatively affect transcription (total RNAP activity) in M9glyCAAT, whereas the same effect was observed after less than ~10% dilution in poor media (M9gly or M9ala) (see Materials and methods). Thus, cells appear to live at the cusp of DNA limitation for transcription, especially under slow growth (nutrient-poor) conditions. This suggests that cells make enough, but not too much, DNA, presumably because DNA replication is a costly process that represents a significant fraction (~6% in minimal media) of the cellular energy budget (Neidhardt et al., 1990).
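To give a sense of scale for these dilution percentages, they can be converted into a fold change in cell volume under a simple assumption: the genome copy number $N$ stays fixed while the volume $V$ increases from a reference value $V_{0}$. This is a back-of-the-envelope sketch with symbols introduced here for illustration ($c$, $d$, $V_{0}$); it is not the estimation procedure described in the Materials and methods.

```latex
% DNA concentration with a fixed genome copy number N:
c = \frac{N}{V}, \qquad \frac{c}{c_{0}} = \frac{V_{0}}{V}

% Fractional dilution d relative to the reference concentration c_0:
d = 1 - \frac{c}{c_{0}} = 1 - \frac{V_{0}}{V}
\quad\Longrightarrow\quad
\frac{V}{V_{0}} = \frac{1}{1 - d}

% Worked values:
d = 0.40 \;\Rightarrow\; \frac{V}{V_{0}} = \frac{1}{0.60} \approx 1.67,
\qquad
d = 0.10 \;\Rightarrow\; \frac{V}{V_{0}} = \frac{1}{0.90} \approx 1.11
```

Under this assumption, a 10% dilution corresponds to only about an 11% increase in volume without replication, whereas a 40% dilution corresponds to about a 67% increase.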

What may be the implications of living close to DNA limitation? While E. coli carefully controls its genome concentration across various conditions and growth rates at the population level (Donachie, 1968; Govers et al., 2024; Si et al., 2017; Zheng et al., 2016), there remains variability in DNA concentration at the single-cell level, with some cells initiating DNA replication at smaller or larger cell volumes than others (Si et al., 2019; Witz et al., 2019). In future studies, it will be interesting to explore whether this variability contributes to the known growth rate heterogeneity across isogenic cells (Lin and Jacobs-Wagner, 2022; Wang et al., 2010). It is also tempting to speculate that changes in genome concentration may, at least in part, contribute to the deviations from exponential growth that have been reported during the division cycle of B. subtilis, E. coli, and stalked C. crescentus progeny (Banerjee et al., 2017; Kar et al., 2021; Nordholt et al., 2020; Reshes et al., 2008). More substantial forms of DNA dilution may occur under other circumstances. C. crescentus cells in freshwater lakes often form long filaments during algal blooms in the summer months (Heinrich et al., 2019). These filament cells are thought to be the result of a DNA replication arrest in response to the combination of an alkaline pH, a depletion in phosphate, and an excess of ammonium (Heinrich et al., 2019). Another example is illustrated by the Lyme disease agent Borrelia burgdorferi. This pathogen, which forms long polyploid cells during exponential growth, experiences a progressive decrease in genome concentration (up to eightfold) in stationary phase laboratory cultures through the gradual loss of genome copies (Takacs et al., 2022).

In yeast cells, decreased mRNA turnover combined with increased RNAP II gene occupancy helps mitigate the effect of DNA dilution on global transcriptional activity up to a certain (non-physiological) cell volume, beyond which the compensation breaks down (Swaffer et al., 2023; Zhurinsky et al., 2010). Such buffering activities, which are consistent with model predictions (Figure 5—figure supplement 2; Swaffer et al., 2023), may also be at play in E. coli in a growth medium-dependent manner. While genome dilution rapidly impacted transcription in all tested media based on total RNAP activity measurements (Figures 3H and 6A, B), we found that the negative impact on downstream processes (total ribosome activity and cell growth) occurred later (i.e. mostly beyond physiological cell sizes) in M9gly and M9ala (Figure 6C–F), in contrast to M9glyCAAT (Figures 1B, C, 2H). This suggests the existence of mechanisms that compensate for DNA-limited transcription under slow growth such as a decrease in mRNA decay, an increase in ribosome loading, and/or an increase in translation elongation rate. Perhaps such buffering activities are not as effective under nutrient-rich conditions due to the rapid mRNA dilution during fast growth. Testing these hypotheses will require future experimentation.

Another remarkable similarity between bacteria and eukaryotes is the effect of genome concentration on proteome composition. While protein abundance is typically assumed to scale with cell size in bacteria, we found that this is true at the proteome level only when ploidy also scales (Figure 7B). This requirement was also recently shown in yeast and mammalian cells (Lanz et al., 2024; Lanz et al., 2022). This conservation of scaling principle further highlights the importance of genome concentration in controlling protein expression.

What determines the scaling behavior of proteins in E. coli is not clear. We found that it largely occurs at the mRNA level (Figure 7D), and that short-lived mRNAs are slightly more susceptible to subscaling behavior (Figure 7F and Figure 7—figure supplement 2D). Conversely, the majority of essential genes (Gerdes et al., 2003; Goodall et al., 2018; Yamazaki et al., 2008) tended to display superscaling behavior relative to the rest of the genome (Figure 7G, Supplementary file 5). This suggests the existence of regulatory mechanisms that prioritize the expression of essential genes over less important ones when genome concentration becomes limiting for cell growth.

While the scaling of proteins in 1N cells is largely driven by that of mRNAs (Figure 7D), we found that protein slopes, but not RNA slopes, displayed a slight yet significant positive correlation (Spearman ρ=0.23, p-value<10^–10) with oriC proximity for genes within 1.35 Mb from oriC (Figure 7—figure supplement 3A and B). Why and how this occurs is unclear, but it suggests that mRNA-independent mechanisms (i.e. independent of mRNA synthesis or decay) also contribute to protein scaling behavior. At the GO term level, we did not identify any specific trends in proteome changes (Supplementary file 1). In eukaryotic cells, histones are known to scale in proportion with DNA rather than cell size (Claude et al., 2021; Swaffer et al., 2023; Wiśniewski et al., 2014). As a result, their concentration proportionally decreases (i.e. slope = –1) with growth in G1 phase. In E. coli, the relative abundance of some nucleoid-associated proteins (H-NS, HU, and Dps) decreased with genome dilution, while others (IHF and Fis) displayed superscaling (protein slopes > 0) behavior (Figure 7—figure supplement 4).

Given the prevalent use of E. coli in the biotechnological world, we hope that our findings will be helpful to future bioengineering studies and growth rate optimization efforts. We show that protein content and cellular growth depend on the ploidy-to-cell volume ratio (Figures 1 and 7). As such, models of protein expression that take into consideration the DNA concentration and the active number of RNAPs and ribosomes could provide a starting point to identify the parameter space that leads to growth rate improvement. Experimentally, it will be important to determine which specific genes exert the largest growth rate-limiting effect. In this context, the few essential genes with strong subscaling behavior (large negative values of RNA and protein slopes) in 1N cells (Figure 7G, Supplementary file 5) suggest potential candidates for future studies given the rapid dilution of their mRNAs and proteins relative to other genes.



Author details

Jarno Mäkelä, Alexandros Papagiannakis, Wei-Hsiang Lin, Michael Charles Lanz, Skye Glenn, Matthew Swaffer, Georgi K Marinov, Jan M Skotheim, Christine Jacobs-Wagner
