Speed variations of bacterial replisomes
Abstract
Replisomes are multiprotein complexes that replicate genomes with remarkable speed and accuracy. Despite their importance, their dynamics is poorly characterized, especially in vivo. In this paper, we present an approach to infer the replisome dynamics from the DNA abundance distribution measured in a growing bacterial population. Our method is sensitive enough to detect subtle variations of the replisome speed along the genome. As an application, we experimentally measured the DNA abundance distribution in Escherichia coli populations growing at different temperatures using deep sequencing. We find that the average replisome speed increases nearly fivefold between 17 °C and 37 °C. Further, we observe wavelike variations of the replisome speed along the genome. These variations correlate with previously observed variations of the mutation rate, suggesting a common dynamical origin. Our approach has the potential to elucidate replication dynamics in E. coli mutants and in other bacterial species.
Editor's evaluation
This manuscript combines theory with experiments to characterize the replication speed of bacteria chromosomes through the cell cycle. The authors show oscillatory patterns in the replication speed of E. coli, which they relate to the heterogeneity of mutation rates along the genome, suggesting a tradeoff between the speed and accuracy of replication. This work presents an elegant approach for investigating bacterial growth from a systems biology perspective.
https://doi.org/10.7554/eLife.75884.sa0Introduction
Every cell must copy its genome in order to reproduce. This task is carried out by large protein complexes called replisomes. Each replisome separates the two DNA strands and synthesizes a complementary copy of each of them, thereby forming a Y–shaped DNA junction called a replication fork. The speed and accuracy of replisomes is impressive (Baker and Bell, 1998). They proceed at several hundreds to one thousand base pairs per second (Pham et al., 2013; Elshenawy et al., 2015), with an inaccuracy of about one misincorporated monomer every 10 billion base pairs (Schaaper, 1993). In bacteria, two replisomes initiate replication at a welldefined origin site on the circular genome, progress in opposite directions, and complete replication upon encountering each other in a terminal region.
The initiation and the completion of DNA replication conventionally delimit the three stages of the bacterial cell cycle (Dewachter et al., 2018; Wang and Levin, 2009). The first stage, B, spans the period from cell birth until the initiation of DNA replication. The second stage, C, encompasses the time needed for replication. The last phase, D, begins at the end of DNA replication and concludes with cell division. While it is established that DNA replication and the cell cycle must be coordinated, their precise relation has been a puzzle for decades (Willis and Huang, 2017). A classic study by Cooper and Helmstetter, 1968 finds that, upon modifying the growth rate by changing the nutrient composition in Escherichia coli, the durations of stages C and D remain constant at about 40 min and 20 min, respectively. This means that the replisome speed must be unaffected by the nutrient composition, at least on average. When the cell division time is shorter than one hour, DNA replication is initiated in a previous generation. This implies that, in fast growth conditions, multiple pairs of replisomes simultaneously replicate the same genome (Fossum et al., 2007). Tuning the growth rate by changing the temperature has a radically different effect on bacterial physiology. For example, in vivo (Pierucci, 1972) and in vitro (Yao et al., 2009) studies show that the speed of replisomes is affected in this case.
More precise features of replisome dynamics, such as whether their speed is approximately constant or varies along the genome, are important to determine the location of their encounter point and the distribution of replication errors on the genome (Niccum et al., 2019; Dillon et al., 2018). However, this detailed information is hard to obtain (Pham et al., 2013). One way for inferring it is to measure the DNA abundance distribution, that is the frequency of DNA fragments along the genome in an exponentially growing cell population. In fact, the frequency of these fragments in the population depends on the proportions of synthesizing genomes of different lengths, which in turn depend on the replisome dynamics. Previous studies have used the DNA abundance distribution to understand the functioning of bacterial replication and how different proteins assist completion of DNA replication (Wendel et al., 2014; Wendel et al., 2018; Rudolph et al., 2013; MidgleySmith et al., 2019; MidgleySmith et al., 2018). However, these studies focused on qualitative analysis of the observed changes of the DNA distribution in knockout mutants with respect to the wild type, and did not attempt to predict the shape of the distribution using quantitative theoretical models. The DNA abundance distribution has also been used to identify actively growing species in a microbiome (Korem et al., 2015).
In this paper, we introduce a method to infer the replisome dynamics from the DNA abundance distribution. As an application, we experimentally measured the DNA abundance distribution of E. coli growing at different temperatures between $17{}^{\mathrm{o}}\mathrm{C}$ and $37{}^{\mathrm{o}}\mathrm{C}$ using highthroughput sequencing. Our approach, combined with our experiments, shows that the average speed of replisomes exhibits an Arrhenius dependence on the temperature, with an almost fivefold variation in the range we considered. Moreover, the precision of our experiments reveals that the speed of replisomes varies along the genome in a seemingly periodic and highly repeatable fashion around this average value. We find that this pattern is highly correlated with previously observed wavelike variations of the single base pair mutation rate along the bacterial genome (Niccum et al., 2019; Dillon et al., 2018). We discuss possible common causes for these two patterns.
Results
Distribution of genome types
We consider a population of bacteria that grow exponentially in a steady environment. Each cell in the growing population can encompass three types of genomes, see Figure 1a and Figure 1b: (i) one template genome, that is, the genome that the cell inherited at its birth. (ii) incomplete genomes, that is, genomes which are being synthesized. (iii) postreplication genomes that will be passed to new cells and become their templates.
In nutrientrich conditions, bacteria replicate their genome in parallel, so that the numbers of incomplete genomes and postreplication genomes per cell are variable, see Figure 1b. The classic CooperHelmstetter model (Cooper and Helmstetter, 1968) describes the dynamics of these genomes in a given cell through generations. We adopt a different approach and focus on the abundance dynamics of the three types of genomes in the whole population. We call ${N}_{T}(t)$, ${N}_{S}(t)$, ${N}_{P}(t)$ the total number of template genomes, incomplete (synthesizing) genomes, and postreplication genomes, respectively, that are present in the population at time $t$. Our first aim is to quantify the relative fractions of these three types of genomes.
The total number of genomes is $N(T)={N}_{T}(t)+{N}_{S}(t)+{N}_{P}(t)$. Since each cell contains exactly one template, the total number of cells is equal to ${N}_{T}(t)$. The total number of genomes evolves as effect of: (a) replication initiation, which creates new synthesizing genomes at a rate $k$; (b) completion of replication, which transforms synthesizing genomes into postreplication ones at rate β; and (c) cell division, which turns postreplication genomes into templates at a rate α, see Figure 1c. This dynamics is described by the set of equations:
It follows from Equations 1–3 that, in steady growth, the total number of genomes grows exponentially at a rate equal to the fork firing rate $k$. In this exponential regime, the fractions of the three genome types are constant:
The ratio $N/{N}_{T}$ can be interpreted as the average number of genomes per cell. Since this ratio is constant, the fork firing rate $k$ can also be identified as the exponential growth rate of the number of cells. For this reason, from now on, we refer to $k$ as the ‘fork firing rate’ or the ‘growth rate’ interchangeably.
In principle, the rates α, β, and $k$ should depend on the ‘age’ of each genome, that is the time spent by the genome in each stage. In Appendix 1, we generalize our model to an agedependent model to account for this fact. We find that this agedependent model is equivalent to Equations 1–3 in the exponential growth regime. This result supports the use of our simple model of genometype dynamics.
We now analyze the incomplete genomes in more detail. We call $x}_{1$ and $x}_{2$ the portions of a given incomplete genome copied by the two replisomes at a given time, with $0\le {x}_{1},\phantom{\rule{thinmathspace}{0ex}}{x}_{2}\le L$, see Figure 1d. Replication initiates at ${x}_{1}={x}_{2}=0$ and completes once the replisomes meet each other, that is, ${x}_{1}+{x}_{2}=L$. The replisome dynamics proceeds as follows. Each replisome is characterized by a speed which depends, in principle, on the replisome position (be it $x}_{1$ or $x}_{2$) and by a diffusion coefficient representing random fluctuations of the speed. The coordinates $x}_{1},{x}_{2$ of the two replisomes evolve according to the stochastic differential equations:
where ${\xi}_{1}(t)$ and ${\xi}_{2}(t)$ are white noise variables satisfying $\u27e8{\xi}_{1}(t)\u27e9=\u27e8{\xi}_{2}(t)\u27e9=0$, $\u27e8{\xi}_{1}(t){\xi}_{1}({t}^{\prime})\u27e9=\u27e8{\xi}_{2}(t){\xi}_{2}({t}^{\prime})\u27e9=\delta (t{t}^{\prime})$, and $\u27e8{\xi}_{1}(t){\xi}_{2}({t}^{\prime})\u27e9=0$. Here, $\u27e8\mathrm{\dots}\u27e9$ denotes an average over realizations.
Close to thermodynamic equilibrium, the diffusion coefficient $D$ can be estimated by the StokesEinstein relation (Hynes, 1977). However, since replisomes are driven far from equilibrium by hydrolysis of dNTPs, their diffusion coefficient could deviate from this estimate. In the absence of fluctuations ($D=0$), each of the two replisomes copies exactly half of the genome, whereas for $D>0$ their meeting point is characterized by a certain degree of uncertainty.
In steady exponential growth, we call ${p}^{\mathrm{st}}({x}_{1},{x}_{2})$ the stationary probability distribution of finding an incomplete genome with copied portions x_{1} and x_{2}. This probability distribution satisfies the equation:
where we introduce the vector notation $\overrightarrow{v}=(v({x}_{1}),v({x}_{2}))$ and $\overrightarrow{\nabla}=(\partial /\partial {x}_{1},\partial /\partial {x}_{2})$. The last term in the right hand side of Equation 8 is a dilution term that accounts for the exponential increase in newborn cells. We formally derive Equation 8 and discuss its boundary conditions in Methods.
DNA abundance distribution
The DNA abundance distribution $\mathcal{A}(y)$ is the probability that a small DNA fragment randomly picked from the population originates from genome position $y$, see Figure 1e. We define the genome coordinate $y\in [L/2,L/2]$, where $y=0$ corresponds to the origin of replication and $L$ is the genome length. Templates and postreplication genomes yield a uniform contribution to the distribution $\mathcal{A}(y)$ (red and green in Figure 1e). In contrast, incomplete genomes contribute in a way that depends on the replisome positions along the genomes (blue in Figure 1e). Our experiments permit to measure the distribution $\mathcal{A}(y)$ with high accuracy, see Figure 1f and Methods.
To express the distribution $\mathcal{A}(y)$ in our model, we first introduce the probability $\mathcal{P}(y)$ that a randomly chosen genome (either complete or incomplete) in the population includes the genome location $y$. This probability is expressed by
where the two terms in square brackets reflect the fact that either of the replisomes can in principle have copied position $y$, and we used that a randomly chosen genome is complete with probability $(1{N}_{S}/N)=\beta /(k+\beta )$, see Equation 5. The DNA abundance distribution $\mathcal{A}(y)$ is proportional to $\mathcal{P}(y)$, up to a normalization constant:
For given choices of $v(x)$, $D$, and $k$, our theory permits to compute the distribution of incomplete genomes ${p}^{\mathrm{st}}({x}_{1},{x}_{2})$ via Equation 8. From this solution, we can also calculate β as the steady rate at which replisomes complete replication (see Methods). This information can be used to compute the DNA abundance distribution $\mathcal{A}(y)$ using Equation 10. Therefore, by experimentally measuring the DNA abundance distribution, we can test our hypotheses on the speed function $v(x)$ and the diffusion coefficient $D$.
Constant speed model
We first consider a scenario in which replisomes progress at a constant speed $\overline{v}$ and without fluctuations, $D=0$. We find that, in this case, the DNA abundance distribution is expressed by
see Methods. We fit this distribution to the experimental data using maximum likelihood, see Figure 2a. The speed $\overline{v}$ is the only fitting parameter, because we independently measure the exponential growth rate $k$ from the optical density, see Methods.
We find that the speed increases nearly fivefold with temperature in the range we considered and appears to follow an Arrhenius law, see Figure 2b. This behavior resembles that of the growth rate. The effective activation energy characterizing the cell cycle is larger than that characterizing the replisome speed, see Figure 2c, possibly due to the large number of molecular reactions involved in the cell cycle. The data point at 17°C appears to deviate from the Arrhenius law for both the speed and the growth rate (Roy et al., 2021), see Figure 2c.
As seen in Figure 2c, the replisome speed does not increase as fast as the growth rate at increasing temperature. This observation suggests that the number of replisomes per genome should increase with temperature to compensate for this gap. Indeed, in the temperature range we studied, our model predicts that the average number of replisomes per complete genome increases from two to almost six, see Figure 2d and Methods.
We now focus on the average genome content per cell. Since the model assumes that genomes evolve independently, the average DNA content per cell $\mathcal{C}$ is the product of the average genome length $\mathrm{\ell}$ times the average number of genomes per cell $N/{N}_{T}$. Computing the average genome length in the model (see Methods) and using Equation 4 for the average number of genomes per cell, we obtain that the average DNA content per cell is expressed by
The classic CooperHelmstetter model (Cooper and Helmstetter, 1968) predicts the DNA content per cell assuming constant durations of stages B, C, and D of the cell cycle. Since we assumed constant speed and $D=0$, the duration of the replication cycle $L/(2\overline{v})$ is constant in our case as well. As a consequence, the prediction of Equation 12 is equivalent to that of the CooperHelmstetter model (see Appendix 3).
It is generally believed that the ratio between the average DNA content per cell $\mathcal{C}$ and the protein content per cell should be maintained approximately constant at varying physiological conditions. This implies that $\mathcal{C}$ should be proportional to the cell size. If the growth rate is varied by changing the nutrient composition, $\overline{v}$ remains constant (Cooper and Helmstetter, 1968). Equation 12 then predicts an approximately exponential growth of $\mathcal{C}$ with $k$, which is consistent with observations. In this case, the Schaechter–Maaloe–Kjeldgaard growth law states that the cell size grows exponentially with $k$ (Schaechter et al., 1958), thereby ensuring DNAprotein homeostasis. In the case of varying temperature, we find that $\overline{v}$ and $k$ present a similar dependence on $T$ (see Figure 2c), so that their ratio and thereby $\mathcal{C}$ weakly depends on $k$ (see Appendix 3—figure 1). Our result is consistent with observations showing that, at increasing temperature, the cell size remains approximately constant (Shehata and Marr, 1975) or possibly slightly increases (Trueba et al., 1982).
Oscillating speed model
The assumption of constant speed leads to a rather good fit of our DNA abundance data. However, the precision of our data permits us to appreciate systematic deviations from the model predictions under the constant speed hypothesis, see Figure 3ae. These deviations appear as regular oscillations as a function of the genome coordinate. They are evident at all the temperatures we studied except for $17{}^{\mathrm{o}}\mathrm{C}$, where they are barely visible. They are highly repeatable (see Figure 3—figure supplement 1, Figure 3—figure supplement 2 and Figure 3—figure supplement 3) and approximately symmetric with respect to the origin of replication. We also analyzed previous experimental data from MidgleySmith et al., 2018. We observed clear oscillations for experiments in nutrientrich LB medium, but not for experiments in M9 minimal medium, see Figure 3—figure supplement 4. This analysis further supports that this phenomenon is robust, at least in fast growth conditions.
To account for these observations, we introduce a more refined model in which the replisome speed oscillates along the genome:
where δ represents the relative amplitude of oscillations; ω their angular frequency along the genome; and $\varphi $ their initial phase. We also take into account random speed fluctuations in this case, $D\ge 0$. In this case, we predict the DNA abundance distribution using stochastic simulations, see Methods. By fitting the DNA abundance, we estimate the parameters $\overline{v}$, δ, ω, $\varphi $, and $D$, see Figure 3(f–j) and Table 1.
Our fitted speed oscillations are reminiscent of a previously observed wavelike pattern in the mutation rate along the genome of different bacterial species (Dillon et al., 2018; Niccum et al., 2019). For a quantitative comparison, we analyze this pattern in a mutant E. coli strain lacking DNA mismatch repair (Niccum et al., 2019). We find that the oscillations in mutation rate and speed are highly correlated, see Figure 4a. The mutation rate appears approximately in phase with the speed, meaning that regions where replisomes proceed at higher speed are characterized by a higher mutation rate. This observation leads to the hypothesis that the two phenomena have a common cause.
We consider two possible explanations for these oscillations. The first is that the oscillations originate from a systematic process related with the cell cycle (Niccum et al., 2019). The second explanation is that the oscillations are caused by competition among replisomes for nucleotides or other molecules required for replication. Assuming approximately constant cell division times, we estimate the cell division time as $\tau =(\mathrm{ln}2)/k$. Since $k$ is also equal to the fork firing rate per genome, the time between firing events in a cell is also approximately equal to τ, so that the two hypotheses lead to the same quantitative prediction. If the speed of replisomes was coupled to a factor oscillating with period τ, this would cause spatial oscillation of speed with angular frequency $\omega =2\pi /(\overline{v}\tau )=2\pi k/[(\mathrm{ln}2)\overline{v}]$. This prediction qualitatively agrees with our fitted values of ω, see Figure 4b.
If the wavelike pattern were caused by competition among replisomes, one would expect either a minimum of the speed every time a new fork is fired ($\varphi =\pi $) or the speed to start decreasing when a new fork is fired ($\varphi =\pi /2$). Our fitted values of the phase $\varphi $ are also compatible with this range, see Figure 4c.
Our results show that the diffusion coefficient $D$ is quite small. For about one third of our experimental realizations at each temperature, our fitted value of $D$ is not significant according to the Akaike information criterion (see Figure 4d and Figure 4—figure supplement 2). For comparison, we estimate the equilibrium diffusion constant of replisomes in the cytoplasm from the StokesEinstein relation as $D}_{\mathrm{S}\mathrm{E}}\approx 6\phantom{\rule{thinmathspace}{0ex}}\mathrm{k}\mathrm{b}{\mathrm{p}}^{2}{\mathrm{s}}^{1$ (see Appendix 5), of the same order of magnitude as our fitted values, see Table 1 and Figure 4. These results suggest that, despite their high average speed, the fluctuations of the replisome position are remarkably similar to the equilibrium case.
The diffusion coefficient determines the uncertainty about the genome site where the two replisomes meet. In the absence of diffusion ($D=0$), replisomes would always meet at the midpoint on the circular genome. For $D>0$, we estimate the typical size ${l}_{\mathrm{D}}$ of the region in which the two replisomes meet as follows. Since the fitted values of δ and $D$ are both small, we approximate the replication time as ${\tau}_{C}\approx L/(2\overline{v})$. In this time, the accumulated uncertainty due to diffusion is equal to ${l}_{\mathrm{D}}\approx 2\sqrt{2D{\tau}_{D}}$. From our estimated diffusion coefficients and average velocities, we obtain values of ${l}_{\mathrm{D}}$ on the order of $100200\phantom{\rule{thinmathspace}{0ex}}\mathrm{k}\mathrm{b}\mathrm{p}$, depending on temperature.
We remark that our bacterial cultures are grown in LB medium. Although the growth curves appear exponential before saturating (see Methods), the nutrient composition can be such that the assumption of steady growth made in our model is not valid. It is therefore important to scrutinize whether the oscillations can be a consequence of this issue. To this aim, we analyzed a version of the model in which the fork firing rate is not steady, but gradually declines with time, see Appendix 7. We find that the discrepancy between the DNA abundance distributions predicted by this model and by the steadystate model is very small compared to the oscillations we observe. This observation supports that a potential lack of steadystate in the LB medium is not a likely cause for the oscillations.
Discussion
In this paper, we infer the dynamics of replisomes from the DNA abundance distribution in a growing bacterial population. Our theory can be seen as a generalization of the classic CooperHelmstetter theory (Cooper and Helmstetter, 1968; Bremer and Churchward, 1977), that permits to estimate the duration of the replication period from the abundance of certain genomic locations in a growing population, see e.g. (Zheng et al., 2016; Si et al., 2017). While the CooperHelmstetter theory assumes constant replisome speed, our approach allows for varying speeds. We test our method by measuring the DNA abundance distribution of E. coli populations growing at different temperatures. We thereby accurately estimate the average speed of replisomes in vivo, and their speed variations along the genome.
We find that the dependence of the average replisome speed on the temperature is well described by an Arrhenius law, similar to that governing the population growth rate. This quantitative dependence can be used to deduce other laws governing bacterial physiology at varying temperature. For example, we argue that precise DNAprotein homeostasis requires the cell size to mildly vary with temperature. This prediction is in qualitative agreement with previous observations (Shehata and Marr, 1975; Trueba et al., 1982) and calls for more systematic measurements of cell parameters at varying temperature, similar to what has been done in the case of varying nutrient composition (Si et al., 2017).
Our approach reveals a wavelike oscillation of the replisome speed along the E. coli genome. The relative amplitude of these oscillations ranges from 10% to 20% of the average replisome speed. A quantitatively similar pattern was observed in the bacterial mutation rate along the DNA of an E. coli mutant strain (Niccum et al., 2019) and of other bacterial species (Dillon et al., 2018). This similarity suggests that the two phenomena have a common dynamical origin. In particular, we hypothesize that this correlation could be a manifestation of the tradeoff between accuracy and speed that characterizes DNA polymerases (Sartori and Pigolotti, 2013; Banerjee et al., 2017; Fitzsimmons et al., 2018). Because of this tradeoff, any mechanism increasing the speed of a polymerase is expected to increase its error rate as well.
Our analysis of the frequency of these oscillations supports that this pattern may originate from a process synchronized with the cell cycle (Dillon et al., 2018), whose activity alters the replisome function. An alternative hypothesis is that the oscillations originate from competition among replisomes for shared resources, such as nucleotides. According to this idea, the firing of new forks can hinder the progression of existing replisomes. The frequency of oscillations is compatible with both explanations. The following additional evidence supports the latter hypothesis. We did not observe appreciable oscillations for our lowest temperature of 17°C, which according to our estimates falls outside the multifork replication regime. Further, we found that oscillations disappeared when analyzing previous data from a culture grown in a minimal medium (MidgleySmith et al., 2018), where multifork replication is also not expected. Similarly, oscillations in the mutation rate are also less evident at lower temperature and disappear in minimal medium (Niccum et al., 2019). On one hand, these facts point to competition between multiple replisomes in the same cell as a likely source for the oscillations. On the other hand, given the difficulty of obtaining steady exponential growth in LB medium, further experiments will be important to assess an eventual effect of the growth medium on the DNA abundance shape.
Beside these regular and repeatable variations, our analysis shows that random fluctuations of replisome speed are quite small, leading to an uncertainty of about $100200\phantom{\rule{thinmathspace}{0ex}}\mathrm{k}\mathrm{b}\mathrm{p}$ on the location of the replisome meeting point. In bacteria, the terminal region of replication is flanked by two groups of termination (Ter) sites having opposite orientations. Ter sites are the binding sites for the Tus protein and permit passage of replication forks in one direction only (Elshenawy et al., 2015), so that the two groups effectively trap the two forks in the terminal region (Duggin et al., 2008). Out of the ten Ter sequences in E. coli, only two of them (TerB and TerC) are within $100200\phantom{\rule{thinmathspace}{0ex}}\mathrm{k}\mathrm{b}\mathrm{p}$ of the point diametrically opposite to the origin. These two sequences have the same orientation. Our result therefore implies that most Ter sequences are usually not needed to localize the replisome meeting point. This prediction is consistent with previous observations that the phenotypes of Tus E. coli mutants (Roecklein et al., 1991) or mutants lacking Ter sequences (Duggin et al., 2008) do not appear distinct from that of the wild type.
Quantitative modeling of the DNA abundance distribution has the potential to shed light on aspects of replisome dynamics beyond those explored in this paper. For example, it was observed that the knockout of proteins involved in the completion of DNA replication leads to either overexpression or underexpression of DNA in the terminal region (Wendel et al., 2014; Wendel et al., 2018; Sinha et al., 2018). Incorporating the role of these proteins into our model will permit to validate possible explanations for these patterns. More in general, our approach is simple and general enough to be readily applied to other bacterial species, to unravel common principles and differences in their DNA replication dynamics.
Methods
Cultivation and DNA extraction
E. coli MG1655 was cultured in LB medium supplemented with 50 mM MOPS pH 7.2 and 0.2% glucose. Overnight cultures grown at 37 °C were diluted into fresh medium and grown until reaching an OD600 of about 1.0 at the target temperature. These cultures were used to inoculate 50 ml medium at the desired temperature in 500 ml Erlenmeyer flasks with baffles at a target OD of 0.01. Cultivation was performed with shaking at 250 rpm. OD was determined with a NanoDrop One in cuvette mode.
The growth curves of E. coli were highly repeatable (over three replicate experiments for each temperature), see Figure 5a and Figure 5b. We computed the growth rate at each temperature by fitting a logistic function to individual growth curves, see Figure 5c. When the time was rescaled by the average growth rate, OD of different temperatures collapsed along a single curve. The cultures (1.4 ml) were harvested by centrifugation at 21,000 g for 20 s after reaching an OD of around 0.5 (midexponential phase, dashed lines in Figure 5a). Cells were kept growing for at least 45 doubling times (as measured in exponential phase) to reach stationary phase. Samples of 0.2 ml from the stationary phase cultures grown at 17°C, 27°C, and 37°C were harvested for DNA extraction. The pellets were immediately frozen at –80 °C until DNA extraction. DNA was extracted in parallel using Genomic DNA Purification Kit from Thermo Fisher Scientific.
Sequencing
We sequenced three samples in the exponential phase from different experimental realizations for each temperature. In addition, we sequenced three stationary samples at three different temperatures. DNA samples were sheared by ultrasound using Covaris AFA technology. Libraries were prepared using the Illumina DNA PCRFree Library Prep Kit. Sequencing was performed on a Novaseq6000 using pairedend 150 bp reads.
Alignment and bias elimination
We aligned reads from each sample using Bowtie2 2.3.4.1 (Langmead and Salzberg, 2012), using the MG1655 genome as a reference. We calculated the frequency of reads as a function of the genome coordinate with bin size 10kbp. To attenuate bias, we divided the frequency at each genome coordinate in a sample from the exponential phase by the frequency of the corresponding bin in a stationary sample (Wendel et al., 2014; MidgleySmith et al., 2018). We alternatively used all of our three stationary samples to correct the bias of each sample in the exponential phase. Therefore, after bias elimination, we effectively have $3\times 3=9$ different DNA abundance curves in the exponential phase at each temperature. See Figure 2—figure supplement 1, Figure 2—figure supplement 2 and Figure 2—figure supplement 3 for details.
Stationary distribution of replisome positions
In this section, we discuss how to compute the stationary distribution of incomplete replisome positions ${p}^{\mathrm{st}}({x}_{1},{x}_{2};t)$. We call ${n}_{S}({x}_{1},{x}_{2};t)$ the number density of incomplete genomes at time $t$ with replisome positions at x_{1} and x_{2}. By definition ${\int}_{0}^{L}d{x}_{1}{\int}_{0}^{L{x}_{1}}d{x}_{2}{n}_{S}({x}_{1},{x}_{2};t)={N}_{S}(t)$. It follows from Equation 7 that this number density evolves according to
where $\overrightarrow{\nabla}=(\partial /{\partial}_{{x}_{1}},\partial /{\partial}_{{x}_{2}})$ and $\overrightarrow{v}=(v({x}_{1}),v({x}_{2}))$. We now introduce the normalized probability $p({x}_{1},{x}_{2};t)={n}_{S}({x}_{1},{x}_{2};t)/{N}_{S}(t)$. By substituting this definition into Equation 14, we obtain
The stationary distribution ${p}^{\mathrm{st}}({x}_{1},{x}_{2})$ is a timeindependent solution of Equation 15, see Equation 8.
Because of replication completion, the line ${x}_{1}+{x}_{2}=L$ is an absorbing state for the dynamics described by Equation 15. Equation 15 must be consistent with Equation 2, which describes the dynamics of incomplete genomes regardless of the coordinates of their replisomes. This implies that the rate β at which replication completes (see Equation 2) must equal to the probability flux through the absorbing boundary:
where we introduce the probability current $\overrightarrow{J}({x}_{1},{x}_{2})=\overrightarrow{v}pD\overrightarrow{\nabla}p$, the unit vector $\widehat{n}=(1/\sqrt{2},1/\sqrt{2})$, and the infinitesimal line increment $dl$ along the absorbing boundary. Similarly, the probability flux entering the system at $({x}_{1},{x}_{2})=(0,0)$ must match the rate of replication initiation as given by Equation 2.
Given a hypothesis on the speed function $v(x)$ and the diffusion coefficient $D$, we solve Equation 15 at stationarity using the experimentally measured growth rate $k$. From the stationary solution ${p}^{\mathrm{st}}({x}_{1},{x}_{2})$, we obtain β using Equation 16. Our approach does not permit to determine the cell division rate α appearing in Equation 1Equation 3. However, this rate is not necessary to compute the DNA abundance distribution, which is expressed by Equation 9 and Equation 10 in terms of ${p}^{\mathrm{st}}({x}_{1},{x}_{2})$ and β only.
Average genome length
To compute the average genome length, we first note that the integral of $\mathcal{P}(y)$ is equal to the average genome length $\mathrm{\ell}$ in the population
Combining Equation 17, Equation 10, and the fact that $\mathcal{P}(0)=1$, we obtain a simple relation between the DNA abundance distribution and the average genome length:
Average number of replisomes per complete genome
We estimate the average number of replisomes per complete genome $\mathcal{N}$ in two alternative ways. On the one hand, using Equation 4Equation 6 we find that
On the other hand, it can be seen in Figure 2d that the fraction of complete genome in the population is equal to the ratio $\mathcal{A}(L/2)/\mathcal{A}(0)$ between the DNA abundance at the terminal and at the origin. It follows that
Constant speed
We focus on the scenario with constant speed and $D=0$. In this case, the steady solution of Equation 15 is given by
The rate at which replication completes is equal to
Substituting Equation 21 and Equation 22 into Equation 9, we obtain.
from which Equation 11 follows by normalizing, see Equation 10.
We exactly solved Equation 15 also in the case where the speed depends on the genome coordinate, provided that the diffusion coefficient vanishes, see Appendix 6.
Stochastic simulations
In the case of oscillating speed and $D>0$, we compute the stationary solution of Equation 15 using numerical simulations. To this aim, we interpret Equation 15 as describing a stochastic process subject to stochastic resetting (Evans and Majumdar, 2011). Specifically, we perform stochastic simulations of Equation 7. In addition to the dynamics described by Equation 7, with a stochastic rate equal to the fork firing rate $k$, trajectories are reset to the origin, ${x}_{1}={x}_{2}=0$ (blue trajectory in Figure 6a). Since the boundary ${x}_{1}+{x}_{2}=L$ is an absorbing state, trajectories that reach this boundary are also reset to the origin (green trajectory in Figure 6a). The probability distribution associated with this dynamics evolves according to Equation 15. We simulate this stochastic dynamics to estimate the stationary distribution ${p}^{\mathrm{st}}({x}_{1},{x}_{2})$ in a computationally efficient way, see Figure 6b. We estimate from the same simulations the parameter β as the empirical rate at which the absorbing boundary is reached, see Equation 16.
Appendix 1
Agedependent dynamics of genome types
In this Appendix, we generalize the model embodied in Equations 1–3 to the more realistic case in which genomes transition from one stage to another with an agedependent rate. Specifically, we call τ the time since a genome fired its last fork and define the timedependent fork firing rate $k(\tau )$. The timedependent fork firing rate can be expressed in terms of the forkfiring timedistribution $f(\tau )$ as $k(\tau )=f(\tau )/[1{\int}_{0}^{\tau}f({\tau}^{\mathrm{\prime}})d{\tau}^{\mathrm{\prime}}]$.
We also define the age $a$ of genomes in synthesizing or postreplication stage, i.e. the time they spent in their stage. We define the agedependent rate of completion $\beta (a)$ and the agedependent duration of postreplication stage $\alpha (a)$. These rates can be expressed in terms of the distribution of time to complete the synthesizing stage, $g(a)$ and the distribution of time to complete the postreplication stage, $h(a)$. The relations are: $\beta (a)=g(a)/[1{\int}_{0}^{a}g({a}^{\mathrm{\prime}})d{a}^{\mathrm{\prime}}]$ and $\alpha (a)=h(a)/[1{\int}_{0}^{a}h({a}^{\mathrm{\prime}})d{a}^{\mathrm{\prime}}]$.
We call ${n}_{T}(\tau ;t)$ the number density of templates at time $t$ that fired their last fork at time $t\tau $. We call ${n}_{S}(a,\tau ;t)$, and ${n}_{P}(a,\tau ;t)$ the number density of synthesizing and postreplication genomes (respectively) at time $t$ that fired their last fork at time $t\tau $ and with age $a$. These densities evolve according to the coupled equations:
where $n(\tau ;t)={n}_{T}(\tau ;t)+{\int}_{0}^{\mathrm{\infty}}{n}_{S}(a,\tau ;t)\phantom{\rule{thinmathspace}{0ex}}da+{\int}_{0}^{\mathrm{\infty}}{n}_{P}(a,\tau ;t)\phantom{\rule{thinmathspace}{0ex}}da$ is the total number density of genomes at time $t$ that fired their last fork at time $t\tau $.
The total numbers of templates, synthesizing, and postreplication genomes are respectively given by ${N}_{T}(t)={\int}_{0}^{\mathrm{\infty}}{n}_{T}(\tau ;t)d\tau$, ${N}_{S}(t)={\iint}_{0}^{\mathrm{\infty}}{n}_{S}(a,\tau ;t)dad\tau$, and ${N}_{P}(t)={\iint}_{0}^{\mathrm{\infty}}{n}_{P}(a,\tau ;t)dad\tau$. Integrating Equation 24 with respect to τ and Equations 25; 26 with respect to $a$ and τ we obtain
The total number of genomes, $N(t)={N}_{T}+{N}_{S}+{N}_{P}$ grows with time as
If the fork firing rate is independent of τ, $k(\tau )=k$, then the number of genomes grows as $N=N(0){e}^{kt}$. A computation of the growth exponent for general case requires knowledge of $n(\tau ,t)$. Using the definition of $n(\tau ,t)$ in Equation 24Equation 26 leads to
We assume that the age dependent genome population scales with the total number of genomes, $n(t;\tau )=q(\tau ;t)N(t)$. This assumption for $n(t;\tau )$ yields an exponential growth for the number of genomes, $N=N(0){e}^{\widehat{k}(t)t}$, with a timedependent exponent,
Substituting this along with the relation $n(\tau ;t)=q(\tau ;t)N(t)$ in Equation 31, we find
In the steady state, ${q}_{st}(\tau )\equiv q(\tau ;t\to \mathrm{\infty})$ is independent of time. In this limit, the growth rate in Equation 32 is also timeindependent, $\hat{k}={\int}_{0}^{\mathrm{\infty}}k(\tau ){q}_{\mathrm{s}\mathrm{t}}(\tau )d\tau$. Using these conditions in Equation 33, we find
where we used $e}^{{\int}_{0}^{\tau}k({\tau}^{\mathrm{\prime}})d{\tau}^{\mathrm{\prime}}}=1{\int}_{0}^{\tau}f({\tau}^{\mathrm{\prime}})d{\tau}^{\mathrm{\prime}$. The normalization condition, ${\int}_{0}^{\mathrm{\infty}}{q}_{\mathrm{s}\mathrm{t}}(\tau )d\tau =1$, yields the EulerLotka equation for the relation between $\widehat{k}$ and fork firing time distribution $f(\tau )$,
In the steady state, we assume that ${n}_{T}(\tau ;t)={N}_{T}(t){q}_{T}(\tau )$, ${n}_{S}(a,\tau ;t)={N}_{S}(t){q}_{S}(a,\tau )$ and ${n}_{P}(a,\tau ;t)={N}_{P}(t){q}_{P}(a,\tau )$, see, e.g., (Jafarpour et al., 2018). Because of the definitions of ${N}_{T}$, ${N}_{S}$ and ${N}_{P}$, we have the normalization conditions
These conditions permit to express the dynamics of genome types as
where $\hat{\alpha}={\iint}_{0}^{\mathrm{\infty}}\alpha (a){q}_{P}(a,\tau )dad\tau$ and $\hat{\beta}={\iint}_{0}^{\mathrm{\infty}}\text{}\beta (a){q}_{S}(a,\tau )dad\tau$. From Equation 37, we obtain in the long time limit:
These relations are equivalent to Equation 4Equation 6 for age dependent rates.
To compute $\widehat{\beta}$ and $\widehat{\alpha}$ more explicitly from Equation 25 and Equation 26, we solve for the marginals ${\overline{q}}_{S}(a)={\int}_{0}^{\mathrm{\infty}}{q}_{S}(a,\tau )d\tau$ and ${\overline{q}}_{P}(a)={\int}_{0}^{\mathrm{\infty}}{q}_{P}(a,\tau )d\tau$ respectively. We obtain
Using these marginals in the definition of $\widehat{\beta}$ and $\widehat{\alpha}$, we find
In deriving Equation 40, we also used the relations $e}^{{\int}_{0}^{a}\beta ({a}^{\mathrm{\prime}})d{a}^{\mathrm{\prime}}}=1{\int}_{0}^{a}g({a}^{\mathrm{\prime}})d{a}^{\mathrm{\prime}$ and $e}^{{\int}_{0}^{a}\alpha ({a}^{\mathrm{\prime}})d{a}^{\mathrm{\prime}}}=1{\int}_{0}^{a}h({a}^{\mathrm{\prime}})d{a}^{\mathrm{\prime}$. In the limiting case of ageindependent rates α and β, then $g(a)=\beta {e}^{\beta a}$ and $h(a)=\alpha {e}^{\alpha a}$, and therefore $\widehat{\beta}=\beta $ and $\widehat{\alpha}=\alpha $ as expected.
Appendix 2
Parameter estimation based on maximum likelihood
In this section, we outline the maximum likelihood method to fit the parameters of the model. We make a histogram of our data with bin size $b=10$ kbp. We call $B$ the total number of bins. We empirically estimate the DNA abundance in each bin $i$ as
where ${N}_{i}^{e}$ and ${N}_{i}^{s}$ are the numbers of reads in the $i$ th bin from the exponential and stationary culture, respectively. We assume that the number of reads in each bin follows a Poisson distribution (Aird et al., 2011). In the limit of large ${N}_{i}^{e}$ and ${N}_{i}^{s}$, the distribution of the empirical DNA abundance ${\mathcal{E}}_{i}$ is Gaussian with a standard deviation
The assumption of large number of reads is well satisfied for our sequencing depth: ${N}_{i}^{e}$ ranges from $2.8\times {10}^{4}$ to $19.9\times {10}^{4}$ and ${N}_{i}^{s}$ from $4.6\times {10}^{4}$ to $9.0\times {10}^{4}$. We call ${\mathcal{A}}_{i}\equiv \mathcal{A}(ib)$ the DNA abundance predicted by our model at bin $i$ for a given set of parameters. The joint likelihood of the empirical DNA abundance in all the bins is given by
We fit the model parameters by maximizing the logarithm of the likelihood $\mathrm{ln}\mathcal{L}$.
In the constant speed case, we have a single fitting parameter $k/\overline{v}$. In the oscillatory model without and with diffusion we have four and five fitting parameters, respectively.
Appendix 3
Correspondence with the CooperHelmstetter Theory
In this section, we study our model in the case of constant speed, constant duration of the B and D stages, and slow growth condition, so that all stages B, C, and D of the cell cycle are present. Our goal is to prove that, under these additional assumptions, our model predicts the same total DNA abundance as the CooperHelmstetter model.
The expression for the total DNA abundance in the constant speed case reads
see Equation 12. The analogous expression for the CooperHelmstetter model is
where ${\tau}_{C}$ is the time taken by replisomes to complete replication on the genome and ${\tau}_{D}$ is postreplication period (Cooper and Helmstetter, 1968).
Since we assumed constant replisome speed, we have ${\tau}_{C}=L/2\overline{v}$. Therefore, proving that Equation 44 and Equation 45 are equivalent boils down to showing that ${e}^{k{\tau}_{D}}=(\alpha +k)/\alpha $. We call ${\tau}_{B}$ the time spent by a bacterium in stage B. In the CH model the times ${\tau}_{B}$, ${\tau}_{C}$ are constant. The division time is ${\tau}_{\mathrm{div}}={\tau}_{B}+{\tau}_{C}+{\tau}_{D}$. The population growth rate is $k=\mathrm{ln}2/{\tau}_{\mathrm{div}}$.
Since the division time is constant, the steadystate age distribution is expressed by
see e.g. Powell, 1956
We now express the steadystate fractions ${f}_{B}$, ${f}_{C}$, ${f}_{D}$ of cells in stage B, C, D as
The ratio of the number of post replication genomes to the number of templates is equal to the fraction of cells in stage D. Therefore, equating the fraction ${f}_{D}$ with ${N}_{P}/{N}_{T}=k/\alpha $ obtained from Equation 4 and Equation 6, we obtain ${e}^{k{\tau}_{D}}=(\alpha +k)/\alpha $ as expected.
Appendix 4
Appendix 5
Estimate of the diffusion coefficient from the StokesEinstein relation
The StokesEinstein relation expresses the diffusion coefficient of a spherical particle immersed in a fluid (Hynes, 1977). If $r$ is the radius of the particle, η and $T$ are viscosity and the temperature of the fluid respectively, then according to the StokesEinstein relation, the diffusion coefficient is
where ${k}_{B}$ is the Boltzmann constant. The radius of an E. coli replisome is $r\approx 50nm$(ReyesLamothe et al., 2010) and the viscosity of water is $\eta =0.7\text{mPa s}$ at room temperature $T=310\mathrm{K}$. Using that ${k}_{B}=1.38\times {10}^{23}J{K}^{1}$, we estimate a diffusion coefficient of replisomes in water equal to ${D}_{\mathrm{SE},\mathrm{W}}=6\mu {\mathrm{m}}^{2}{\mathrm{s}}^{1}$. The typical base pair distance is $3.4{A}^{o}$. Therefore in terms of basepair (bp), ${D}_{\mathrm{SE},\mathrm{W}}\approx 56\mathrm{k}\mathrm{b}{\mathrm{p}}^{2}{\mathrm{s}}^{1}$. The diffusion constant of large macromolecules in the cytoplasm is found to be about 10 times smaller than in water (Verkman, 2002). This results in an estimate of the diffusion constant of replisomes in the cytoplasm of ${D}_{\mathrm{SE},\mathrm{C}}\approx 6\mathrm{k}\mathrm{b}{\mathrm{p}}^{2}{\mathrm{s}}^{1}$, as reported in the Results.
Appendix 6
Exact solution of the oscillatory speed case for $D=0$
In this section, we exactly solve the oscillatory speed model in the absence of diffusion ($D=0$). For an arbitrary choice of the function $v(x)$ and $D=0$, the steady state solution of Equation 15 reads
where $A$ is a normalization constant that ensures ${\int}_{0}^{L}d{x}_{1}{\int}_{0}^{{x}_{1}}d{x}_{2}{p}^{\mathrm{s}\mathrm{t}}({x}_{1},{x}_{2})=1$. The rate at which replication completes is $\beta =A{e}^{{\int}_{0}^{L/2}d{x}^{\mathrm{\prime}}k/v({x}^{\mathrm{\prime}})}$. From Equation 9, we obtain
For the specific form of $v(x)$ given in Equation 13, the integral in Equation 52 is equal to
where $\lfloor \cdot \rfloor $ is the floor function. We use the expression of this integral in Equation 52 and substitute the result in Equation 10 to obtain the DNA abundance distribution. We computed the normalization factor of the DNA abundance distribution (i.e., the denominator of Equation 10) by numerical integration.
Appendix 7
Replisome dynamics with timedependent fork firing rate
To verify the robustness of our results, we generalize our model to the case in which the fork firing rate $k$ depends on time. For simplicity, we assume that replisomes move with constant speed and without diffusion ($D=0$).
We assume that the total number of genomes grows with time as
where ${N}_{0}=a/(1+b)$ is the initial number of genomes and we introduced the timevarying fork firing rate
The step function $\theta (t)$ in Equation 54 serves to impose that replication occurs for $t\ge 0$ only.
The corresponding dynamics of synthesizing genomes reads
where the delta function ensures that the replication initiates at ${x}_{1}=0$. As we have set $D=0$, the coordinate x_{2} of the second replisome obeys the same dynamics expressed in Equation 56 and the DNA abundance is symmetric around the replication origin.
The general solution to Equation 56 is of the form ${n}_{s}({x}_{1};t)=f({x}_{1}vt)$. Because of the boundary term at ${x}_{1}=0$, i.e. ${n}_{s}(0;t)=\widehat{k}(t)N(t)/v$, the general solution is
The probability that a randomly chosen genome (either complete or incomplete) includes the genome location $y$ at time $t$ is given by
where $\int}_{y}^{L/2}{n}_{S}({x}_{1},t)d{x}_{1$ is the number of incomplete genomes which include the genome location $y$ and ${N}_{P}(t)+{N}_{T}(t)$ is the total number of complete genomes. From Equations 54; 55, and (Equation 56) , we obtain
Therefore, the total number of incomplete genomes is ${N}_{s}(t)={\int}_{0}^{L/2}{n}_{S}({x}_{1};t)d{x}_{1}=N(t)N\left(t\frac{L}{2v}\right){N}_{0}\theta (t)\theta \left(\frac{L}{2v}t\right)$. Further, because of the conservation of the total number of genomes, the number of complete genomes at time $t$ is
Substituting Equation 54, Equation 59 and Equation 60 in Equation 58, and using Equation 10, we compute the DNA abundance distribution for $t>L/(2v)$ as
We now use this result to test robustness of the steadystate assumption. To this aim, we assume that the number of genomes follows a similar dynamics as the optical density. We accordingly consider, for each sample at each temperature, the parameters a_{i}, b_{i}, k_{i} obtained from the OD fits (see Figure 5—figure supplement 1), and the average speed $\overline{v}$ estimated from the constant speed model (see Table 1). We substitute these in Equation 61 and compute the DNA abundance at time $t=4/k$. This value is chosen as, in our experiments, cells were extracted after approximately four generations, see Figure 5.
We then attempt to fit these DNA abundance distribution with the one predicted the constant speed model, Equation 11, by leastsquare minimization and thus estimating the speed again. The newly estimated speeds at different temperatures are comparable to those inputted in the timevarying fork firing rate model, see Appendix 7—table 1. Finally, we plotted the discrepancy ratio in each of the cases, see Appendix 7—figure 1. At all temperatures, the discrepancy ratio is much smaller than the oscillations we observed in the data. This result supports that the potential nonstationary nature of the fork firing rate is not likely to be the cause of the oscillations.
We remark that, in the estimate presented in this section, we assumed that cells were not growing before the initial time $t=0$. In our culture, we expect the effect of saturation to be even milder since cells were inoculated from an already exponentially growing culture.
Data availability
Sequence reads were deposited in the NCBI Sequence Read Archive with links to BioProject accession number PRJNA772106. Corresponding read frequencies along the genome were deposited in Zenodo (DOI:https://doi.org/10.5281/zenodo.5577986).

NCBI BioProjectID PRJNA772106. Escherichia coli DNA replication study.

ZenodoEscherichia coli DNA replication study: processed alignment data.https://doi.org/10.5281/zenodo.5577986
References

An examination of the CooperHelmstetter theory of DNA replication in bacteria and its underlying assumptionsJournal of Theoretical Biology 69:645–654.https://doi.org/10.1016/00225193(77)903733

Chromosome replication and the division cycle of Escherichia coli B/rJournal of Molecular Biology 31:519–540.https://doi.org/10.1016/00222836(68)904257

An integrative view of cell cycle control in Escherichia coliFEMS Microbiology Reviews 42:116–136.https://doi.org/10.1093/femsre/fuy005

The replication fork trap and termination of chromosome replicationMolecular Microbiology 70:1323–1333.https://doi.org/10.1111/j.13652958.2008.06500.x

Diffusion with stochastic resettingPhysical Review Letters 106:160601.https://doi.org/10.1103/PhysRevLett.106.160601

Statistical mechanics of molecular motion in dense fluidsAnnual Review of Physical Chemistry 28:301–321.https://doi.org/10.1146/annurev.pc.28.100177.001505

Bridging the timescales of singlecell and population dynamicsPhysical Review X 8:021007.https://doi.org/10.1103/PhysRevX.8.021007

Fast gappedread alignment with Bowtie 2Nature Methods 9:357–359.https://doi.org/10.1038/nmeth.1923

A role for 3’ exonucleases at the final stages of chromosome duplication in Escherichia coliNucleic Acids Research 47:1847–1860.https://doi.org/10.1093/nar/gky1253

Chromosome replication and cell division in Escherichia coli at various temperatures of growthJournal of Bacteriology 109:848–854.https://doi.org/10.1128/jb.109.2.848854.1972

Growth rate and generation time of bacteria, with special reference to continuous cultureJournal of General Microbiology 15:492–511.https://doi.org/10.1099/00221287153492

Kinetic versus energetic discrimination in biological copyingPhysical Review Letters 110:188101.https://doi.org/10.1103/PhysRevLett.110.188101

Base selection, proofreading, and mismatch repair during DNA replication in Escherichia coliThe Journal of Biological Chemistry 268:23762–23765.https://doi.org/10.1016/S00219258(20)804463

Dependency on medium and temperature of cell size and chemical composition during balanced grown of Salmonella typhimuriumJournal of General Microbiology 19:592–606.https://doi.org/10.1099/00221287193592

Effect of temperature on the size of Escherichia coli cellsJournal of Bacteriology 124:857–862.https://doi.org/10.1128/jb.124.2.857862.1975

Effects of temperature on the size and shape of Escherichia coli cellsArchives of Microbiology 131:235–240.https://doi.org/10.1007/BF00405885

Solute and macromolecule diffusion in cellular aqueous compartmentsTrends in Biochemical Sciences 27:27–33.https://doi.org/10.1016/s09680004(01)020035

Metabolism, cell growth and the bacterial cell cycleNature Reviews. Microbiology 7:822–827.https://doi.org/10.1038/nrmicro2202

Sizing up the bacterial cell cycleNature Reviews. Microbiology 15:606–620.https://doi.org/10.1038/nrmicro.2017.79

Modeling of the bacterial growth curveApplied and Environmental Microbiology 56:1875–1881.https://doi.org/10.1128/aem.56.6.18751881.1990
Decision letter

Armita NourmohammadReviewing Editor; University of Washington, United States

Jessica K TylerSenior Editor; Weill Cornell Medicine, United States

Ariel AmirReviewer; Harvard University, United States
Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.
Decision letter after peer review:
Thank you for submitting your article "Speed fluctuations of bacterial replisomes" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Jessica Tyler as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: Ariel Amir (Reviewer #3).
The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.
Essential revisions:
1. The interpretation in terms of a speederror tradeoff is rather speculative and perhaps less emphasis should be placed on it (e.g. in the abstract and the beginning of p.9).
2. There is a possible problem with LB used as growth media. To infer the replication speed, the firing rate needs to be determined, which in the steady state is equal to the growth rate. However, it is close to impossible to achieve well defined steady growth rate when cells grow in LB since nutrients become depleted and growth rate decreases quite substantially. Which growth rate was used to infer the speed (It was unclear) An experiment in a minimal medium (e.g. with glucose as carbon source) with a welldefined growth rate might provide much more reliable results.
3. This study was done in the regime of fast growth. It is known that for E. coli there are many changes in the cell cycle properties when the doubling time (at 37 Celcius) exceeds 60 minutes (i.e. the regime where there are no overlapping replication forks). How do the results change in slow growth conditions?
4. Oscillation of replication speeds along the chromosome is reported for different temperatures. This is an interesting observation. While an experimental investigation of these oscillations is beyond the scope of this manuscript, a discussion about the physiological implications of these oscillations would be insightful.
5. The idea of using the frequency inferred from sequencing was also used in: Growth dynamics of gut microbiota in health and disease inferred from single metagenomic samples, Korem et al. Science (2015). Are the oscillations also observed in those measurements? If so, is there information which could be gleaned from them?
6. Figure 2D: Why do replisome numbers increase with increasing temperature and how does this observation reconcile with the increased speeds as well? Do both replication fork numbers and speed of individual replisomes contribute to the observed growth rates across temperatures?
7. There is a systematic difference in the dependence of speed and growth rate on temperature, which the authors discuss. What is the expected change in cell size if the CooperHelmstetter model is correct? Should it be observable experimentally? Is it?
8. Please clarify the arguments in Appendix 2 and L140150.
9. Please modify the Discussion section so it is more focused on the conclusions of this work.
10. Provide a more detailed comparison of the considered approach with previously published work. How is the introduced method different and to what extent it goes beyond the previous work? Is this model more predictive of speed?
11. Lines 131133: why is the average DNA per cell the product of the two other averages? Is this an approximation or are the two other variables uncorrelated?
Reviewer #1 (Recommendations for the authors):
I missed a more detail comparison of the considered approach with previously published approaches and results. How is the introduced method different and to which extent is it going beyond the published results? Does it allow a more precis derivation of speed? What was is with the previous approaches which was limiting the quantification? As presented the introduction of the method is the major contribution of this paper and I thus think that these questions should be clearly answered.
There is a possible problem with LB used as growth media. To infer the replication speed, the firring rate needs to be determined, which in steady state growth is equal to the growth rate. However, it is close to impossible to achieve well defined steady growth rate when cells grow in LB since nutrients become depleted and growth rates decreases quite substantially. Which growthrate was thus used to infer the speed (It was unclear) An experiment in a minimal medium (e.g. with glucose as carbon source) with a well define growth rate might provide much more reliable results. An observation confirming that (average) replication speeds are not varying much with media conditions (at the same temperature) would also make a further important physiological point.
The authors show that the interfered replication speed is changing strongly with temperature. This has been reported before. With their measurements the authors state that the change is comparable to the change of growth when temperature is varied, but the authors do not comment much about the physiological relevance of this relation. An Arhenius like change of enzymatic reactions is expected for simple enzymatic processes, but that a similar relation holds for the entire growth rate is remarkable and still poorly understood, as emphasized by many studies. Given this context, what do we learn about the temperature dependence of growth by the observation that the replication speed is changing in an Arrhenius like form? The data the authors present show that the change in replication speed differs from the change in growth rate with a steeper change of growth rates. What does this mean? Does this support for example the idea that replication speeds are not so much a bottleneck of cellgrowth but replication is efficiently regulated to prevent DNA replication from impeding growth?
From a physiological point of view, I think the oscillation of replication speeds with position along the chromosome are more interesting than the variation of speed with temperature. However, it remains unclear how well such oscillations and their frequency are maintained across conditions and maybe even more robustly a function of growth rate. I think the manuscript would thus largely benefit from the analysis of oscillations when cells grow with different rates but at the same temperature. To keep the number of new experiments small, this could for example just be the quantification for growth in a minimal media condition.
Reviewer #3 (Recommendations for the authors):
Feedback for the authors:
1. Figure 2D: Why do replisome numbers increase with increasing temperature and how does this observation reconcile with the increased speeds as well? Do both replication fork numbers and speed of individual replisomes contribute to the observed growth rates across temperatures?
2. Arguments in Appendix 2 and L140150 are unclear and this reviewer is unable to understand the context of the same.
3. Did the authors consider sequencing a dnaQ proof reading mutant (PMID: 8610131) to test the speed vs accuracy tradeoff hypothesis?
4. The Discussion section needs to be more focused on what the main conclusions of this work are. For example, the discussion on replication termination is not very informative. Instead, it would have been helpful if the authors could have explained how they envision the impact of temperature on replisome speed and cell cycle progression, a relationship that seems distinct from the impact of media/ nutrient on these dynamics. Secondly, do mutation rates also increase with increasing temperature, and do the authors envision mutation hotspots given their periodic positional data on the replisome? If so, what determines this periodicity? In the current manuscript, these ideas are not synthesized in a coherent fashion, that makes it harder to appreciate the unique findings.
[Editors' note: further revisions were suggested prior to acceptance, as described below.]
Thank you for resubmitting your work entitled "Speed variations of bacterial replisomes" for further consideration by eLife. Your revised article has been evaluated by Jessica Tyler (Senior Editor) and a Reviewing Editor.
The manuscript has been improved but there are some remaining issues that need to be addressed, as outlined below:
As noted by reviewer #1, the statement that the growth rate is in the steady state in LB is not very solid. Please address the concerns regarding this issue. Unless the authors feel that it is necessary, we do not require additional experiments. However, the authors should:
1. Clarify throughout the manuscript the nuances regarding the lack of stationarity in LB.
2. Improve the growth figures to better show the dependence of growth rate on temperature, and whether the populations are in the exponential growth phase.
3. Further elaborate if the oscillations could be due to the fact that the population may have not reached the steady state in LB.
Reviewer 1 (Recommendations for the authors):
I feel the requests and suggestions which were presented to the authors to revise the presented manuscript were very reasonable However, after reading the revised manuscript and the responses provided by the authors I now feel that the authors did not even do the bare minimum to address the raised concerns. This particularly concerns a major concern about steady state growth which I raised, and which have been discussed away very carelessly by the authors. I do not support publication until this issue is honestly addressed and fixed in the manuscript as it relates to the major finding promoted by the authors: oscillating replication speeds at fast growth.
In their reply the authors commented on my concern about the steady growth assumption in LB and state that their ‘medium leads to welldefined steady growth rates’. I think this statement is simply false. The fact that a growthrate can be reproducibly obtained is not sufficient to prove that cells are in a steady state. For that one needs to show that the same growthrate is obtained over several generations. Why is being picky on this issue so important? Because the authors assume a steady state in their method to infer replication speeds (with e.g. the firing rate being equal the growth rate etc). But it really remains unclear how well the inference method works in cultures which adjust to changing conditions and which are thus not in a steady state. This is a real concern for growth in LB which the authors chose as major growth condition. It is impossible to obtain a good steadily growing culture in LB as the availability of amino acids and other nutrient sources is drastically changing. This can for example be seen in growth curves where the density clearly derivates from an exponential fit when analyzing growth over several doublings. So with this the question remains: are the oscillations the authors report real or are they rather an artifact of falsely assuming steady state growth when doing their analysis. This concerns is highlighted even further by the newly presented findings stating that oscillations in speed are not observed for slower growth in minimal medium. So are these oscillations only observed at fast growth in LB because oscillations in speed are a biological reality at fast growth, or are they observed because a steady state assumption has been falsely assumed? I really want to be constructive here but feel the authors must address this concern very rigorously before this work should be published. So let me try to phrase my concerns in even more detail and in the context of the author's reply
To address my concern the authors simply showed the growthcurves in Figure 5 and stated that this supports cultures are in exponential steady growth. However, the presented analysis is not sufficient to prove steady state. For example, there are not exponential fits shown which prove that growthrates are maintained for reasonable ranges of density and time. In fact, when looking at the presented panel 5a it appears that the growthcurves at different temperatures are pretty much comparable, while a more detailed fit presumably used to infer growthrate values (instantaneous rates at a specific OD/timepoint) suggest that growthrates are different and this difference is what the authors use in the second panel of that figure and to discuss the relation between growth rate and replication speed. Given what is known about growth on LB a zoomed in version of these growth curves will reveal that growth is not really exponential for a reasonable long amount of time (say dozens of generations until protein levels reach a steady state) and as such no steady state have been obtained. In this context I strongly believe that a more careful experimentation with a welldefined steady states in a welldefined media (e.g. rich defined media with exponential growth and the some growth rates observed over > 10 doublings) would have been way better to establish that the inferred oscillations in speed are a real biological phenomenon. I understand that the authors are reluctant to add such additional experiments and I am not insisting on them. However, the concern needs to be taken seriously. I think the authors should:
1) Clearly state in the manuscript that the presented measurements have not been taken during a welldefined steady state
2) Plot and analyze better the growthcurves such that the growth behavior and the differences in growth for different temperatures become visible). In such a plot exponential fits to the growth curves should also be shown which directly allow the reader to compare how welldefined and distinct the growthrates really are.
3) Raise in the conclusion of the paper that there is a possibility that oscillations might occur because the cultures grew in LB and have thus not reached steady state.
4) Include some sort of error propagation analysis which shows how strong the errors in replication speed can be when assuming a culture not yet in steady growth but running the analysis to infer steady state with the assumption of a steady state. For example, assuming the number of cells with different replication forks is still slowly adjusting with a certain timescale and assuming the replication speed is constant, use a fully dynamical model to derive a DNA abundance distribution. Then inferring the speed by analysis the distribution (using the presented approach assuming steady state).
https://doi.org/10.7554/eLife.75884.sa1Author response
Essential revisions:
1. The interpretation in terms of a speederror tradeoff is rather speculative and perhaps less emphasis should be placed on it (e.g. in the abstract and the beginning of p.9).
We agree with the Reviewer that, strictly speaking, this interpretation is speculative, although the degree of correlation between mutation experiments and the speed oscillations makes a rather compelling case. In the revised version, we place less emphasis on this interpretation as requested.
2. There is a possible problem with LB used as growth media. To infer the replication speed, the firing rate needs to be determined, which in the steady state is equal to the growth rate. However, it is close to impossible to achieve well defined steady growth rate when cells grow in LB since nutrients become depleted and growth rate decreases quite substantially. Which growth rate was used to infer the speed (It was unclear) An experiment in a minimal medium (e.g. with glucose as carbon source) with a welldefined growth rate might provide much more reliable results.
The LB medium that we used was treated to guarantee an extended exponential growth regime. The growth curves that we observed at different temperature are highly repeatable and present a clear exponential regime. The resulting estimates of the growth rate k are very consistent among replicates, with variations on the order of 5%. Moreover, when rescaled by the estimated doubling time, all growth curves at different temperatures collapse on each other. Taken together, these observations show that our medium leads to welldefined steady growth rates.
We realized that some of these aspects were not clearly presented in the manuscript, partially because we presented the growth curves in a supplementary figure. In the revised version, we present the growth curves in the Methods (Figure 5) and discuss the growth rate estimation in more detail.
Additionally, we present an analysis of E. coli growing in a minimal medium using data from a previous reference. We discuss these results in the context of the next comment.
3. This study was done in the regime of fast growth. It is known that for E. coli there are many changes in the cell cycle properties when the doubling time (at 37 Celcius) exceeds 60 minutes (i.e. the regime where there are no overlapping replication forks). How do the results change in slow growth conditions?
We thank the Reviewer for this comment. In the revised version, we present an additional analysis of sequencing data from E. coli growing in a minimal medium (data from MidgleySmith et al., 2018), see Figure 3—figure supplement 4. We did not observe appreciable speed oscillations in this case. This result suggests that oscillations are linked with the multiple forks regime and disappear when the cell cycle is slowed down by either reducing temperature or nutrient composition. As discussed in the revised manuscript, this result supports the hypothesis that the cause of the oscillations might be competition among replisomes.
4. Oscillation of replication speeds along the chromosome is reported for different temperatures. This is an interesting observation. While an experimental investigation of these oscillations is beyond the scope of this manuscript, a discussion about the physiological implications of these oscillations would be insightful.
We thank the Reviewer for this comment. In the revised Discussion, we expand on the physiological implications of our results, in particular on the hypothesis that these oscillations are linked with the multiple fork regime of replication. The revised Discussion also expands on other implications of our work for bacterial physiology, in particular on the laws governing DNAprotein homeostasis at varying temperature.
5. The idea of using the frequency inferred from sequencing was also used in: Growth dynamics of gut microbiota in health and disease inferred from single metagenomic samples, Korem et al. Science (2015). Are the oscillations also observed in those measurements? If so, is there information which could be gleaned from them?
We thank the Reviewer for pointing out this interesting reference. Following this suggestion, we reanalyzed the DNA abundance from the E. coli sequencing data by Korem et al. We found that this dataset is characterized by a much smaller coverage than our experiment. As a result, the DNA abundance distribution is too noisy to infer replisome speed variations, see Figure 1 in this document. In any case, in the Introduction of the revised version we cite this reference as another important application of the DNA abundance distribution.
6. Figure 2D: Why do replisome numbers increase with increasing temperature and how does this observation reconcile with the increased speeds as well? Do both replication fork numbers and speed of individual replisomes contribute to the observed growth rates across temperatures?
We thank the Reviewer for this comment. The key point is that, at increasing temperature, the average replisome speed does not increase as fast as the growth rate, as it can be seen from Figure 2c in the manuscript.
Therefore, multiple fork replication is necessary to fill this gap. In conclusion, both replication fork numbers and speed of individual replisomes provide a significant contribution, as the Reviewer suggests. We clarify this point in the revised draft.
7. There is a systematic difference in the dependence of speed and growth rate on temperature, which the authors discuss. What is the expected change in cell size if the CooperHelmstetter model is correct? Should it be observable experimentally? Is it?
Assuming perfect DNA–protein homeostasis, the expected change in cell size should be proportional to the DNA content as shown in Appendix 2, Figure 1b. We are not aware of recent systematic studies of the dependence of cell size on temperatures. Trueba et al. (1982) suggest a moderate increase of the cell size with the growth rate, which seems compatible with our theory. However, this dependence strongly depends on the choice of the medium and the paper only reports a few data points. A systematic study of this interesting issue would require additional experiments, which are beyond the scope of our work.
In the revised version, we clarify our prediction on the cell size behavior on temperature, and comment more extensively on its implications in the Discussion section.
8. Please clarify the arguments in Appendix 2 and L140150.
We rephrased the arguments to make them clearer.
9. Please modify the Discussion section so it is more focused on the conclusions of this work.
We thank the reviewers for this suggestion. We substantially revised the Discussion section, focusing on the conclusions of our work.
10. Provide a more detailed comparison of the considered approach with previously published work. How is the introduced method different and to what extent it goes beyond the previous work? Is this model more predictive of speed?
The main novelty of our method is that it permits to characterize speed variation of the replisomes. In this sense, it can be seen as a generalization of the classic CooperHelmstetter theory, that instead assumes constant replisome speed. We clarify this point in the first paragraph of the revised Discussion.
11. Lines 131133: why is the average DNA per cell the product of the two other averages? Is this an approximation or are the two other variables uncorrelated?
We thank the Reviewer for this observation. Our model of genome dynamics embodied in Equations (3, 4, 5) assumes, for simplicity, that genomes evolve independently. Because of this assumption, the two averages factorize. We clarify this point in the revised manuscript.
Reviewer #1 (Recommendations for the authors):
I missed a more detail comparison of the considered approach with previously published approaches and results. How is the introduced method different and to which extent is it going beyond the published results? Does it allow a more precis derivation of speed? What was is with the previous approaches which was limiting the quantification? As presented the introduction of the method is the major contribution of this paper and I thus think that these questions should be clearly answered.
The main novelty of our method is that it permits to characterize speed variation of the replisomes. In this sense, it can be seen as a generalization of the classic CooperHelmstetter theory, that instead assumes constant replisome speed. We clarify this point in the first paragraph of the revised Discussion.
There is a possible problem with LB used as growth media. To infer the replication speed, the firring rate needs to be determined, which in steady state growth is equal to the growth rate. However, it is close to impossible to achieve well defined steady growth rate when cells grow in LB since nutrients become depleted and growth rates decreases quite substantially. Which growthrate was thus used to infer the speed (It was unclear) An experiment in a minimal medium (e.g. with glucose as carbon source) with a well define growth rate might provide much more reliable results. An observation confirming that (average) replication speeds are not varying much with media conditions (at the same temperature) would also make a further important physiological point.
The LB medium that we used was treated to guarantee an extended exponential growth regime. The growth curves that we observed at different temperature are highly repeatable and present a clear exponential regime. The resulting estimates of the growth rate k are very consistent among replicates, with variations on the order of 5%. Moreover, when rescaled by the estimated doubling time, all growth curves at different temperatures collapse on each other. Taken together, these observations show that our medium leads to welldefined steady growth rates.
We realized that some of these aspects were not clearly presented in the manuscript, partially because we presented the growth curves in a supplementary figure. In the revised version, we present the growth curves in the Methods (Figure 5) and discuss the growth rate estimation in more detail.
Additionally, we present an analysis of E. coli growing in a minimal medium using data from a previous reference. We discuss these results in the context of the last Reviewer’s comment.
The authors show that the interfered replication speed is changing strongly with temperature. This has been reported before. With their measurements the authors state that the change is comparable to the change of growth when temperature is varied, but the authors do not comment much about the physiological relevance of this relation. An Arhenius like change of enzymatic reactions is expected for simple enzymatic processes, but that a similar relation holds for the entire growth rate is remarkable and still poorly understood, as emphasized by many studies. Given this context, what do we learn about the temperature dependence of growth by the observation that the replication speed is changing in an Arrhenius like form? The data the authors present show that the change in replication speed differs from the change in growth rate with a steeper change of growth rates. What does this mean? Does this support for example the idea that replication speeds are not so much a bottleneck of cellgrowth but replication is efficiently regulated to prevent DNA replication from impeding growth?
We thank the Reviewer for this comment. In the revised version, we expand on the physiological relevance of the Arrhenius dependence of the replisome speed on temperature. In particular, we argue that the different slopes of the growth rate and replisome speed as a function of temperature imply that the average number of forks per cell must vary as a function of temperature as well. Moreover, in the revised Discussion, we comment on how the Arrhenius relation leads to a prediction about the behavior of the cell size as a function of temperature.
From a physiological point of view, I think the oscillation of replication speeds with position along the chromosome are more interesting than the variation of speed with temperature. However, it remains unclear how well such oscillations and their frequency are maintained across conditions and maybe even more robustly a function of growth rate. I think the manuscript would thus largely benefit from the analysis of oscillations when cells grow with different rates but at the same temperature. To keep the number of new experiments small, this could for example just be the quantification for growth in a minimal media condition.
We thank the Reviewer for this comment. In the revised version, we present an additional analysis of sequencing data from E. coli growing in a minimal medium (data from MidgleySmith et al., 2018), see Figure 3—figure supplement 4. We did not observe appreciable speed oscillations in this case. This result suggests that oscillations are linked with the multiple forks regime and disappear when the cell cycle is slowed down by either reducing temperature or nutrient composition. As discussed in the revised manuscript, this result supports the hypothesis that the cause of the oscillations might be competition among replisomes.
Reviewer #3 (Recommendations for the authors):
Feedback for the authors:
1. Figure 2D: Why do replisome numbers increase with increasing temperature and how does this observation reconcile with the increased speeds as well? Do both replication fork numbers and speed of individual replisomes contribute to the observed growth rates across temperatures?
We thank the Reviewer for this comment. The key point is that, at increasing temperature, the average replisome speed does not increase as fast as the growth rate, as it can be seen from Figure 2c in the manuscript. Therefore, multiple fork replication is necessary to fill this gap. In conclusion, both replication fork numbers and speed of individual replisomes provide a significant contribution, as the Reviewer suggests. We clarify this point in the revised draft.
2. Arguments in Appendix 2 and L140150 are unclear and this reviewer is unable to understand the context of the same.
We rephrased the argument to make it clearer.
3. Did the authors consider sequencing a dnaQ proof reading mutant (PMID: 8610131) to test the speed vs accuracy tradeoff hypothesis?
We thank the Reviewer for suggesting this test. We agree that this would be an interesting experiment. However, we feel that this beyond the scope of our manuscript and more appropriate for a future study.
4. The Discussion section needs to be more focused on what the main conclusions of this work are. For example, the discussion on replication termination is not very informative. Instead, it would have been helpful if the authors could have explained how they envision the impact of temperature on replisome speed and cell cycle progression, a relationship that seems distinct from the impact of media/ nutrient on these dynamics. Secondly, do mutation rates also increase with increasing temperature, and do the authors envision mutation hotspots given their periodic positional data on the replisome? If so, what determines this periodicity? In the current manuscript, these ideas are not synthesized in a coherent fashion, that makes it harder to appreciate the unique findings.
We thank the Reviewer for this comment. We revised the Discussion section, that is now more focused on the conclusion of this work as requested. In particular, we expand on the impact of temperature of replisome speed and the implication for bacterial physiology. Other aspects of our results are also discussed more extensively, for example the hypotheses on the causes of the oscillations. We did not expand on the behavior of the mutation rate, as we felt that this falls beyond the direct implications of our work.
[Editors' note: further revisions were suggested prior to acceptance, as described below.]
The manuscript has been improved but there are some remaining issues that need to be addressed, as outlined below:
As noted by reviewer #1, the statement that the growth rate is in the steady state in LB is not very solid. Please address the concerns regarding this issue. Unless the authors feel that it is necessary, we do not require additional experiments. However, the authors should
1. Clarify throughout the manuscript the nuances regarding the lack of stationarity in LB.
We thank the Editor for highlighting this point in the reviewer’s comment. In the revised version, we clarify that in LB medium we cannot be sure that our cultures are at steady growth.
Importantly, although our model assumes steady growth for simplicity, this assumption is not crucial for our results. We prove this point by solving our model in the case of a fork firing rate that declines with time, thus violating the steadystate assumption.
The solution of this model shows that relaxing the steadystate approximation only leads to minor changes in the DNA abundance distribution, as shown in the new Appendix 7 in the revised manuscript.
2. Improve the growth figures to better show the dependence of growth rate on temperature, and whether the populations are in the exponential growth phase.
We agree that providing individual growth curves can help to appreciate whether the populations are in the exponential growth phase. Following this suggestion, we now include average growth curves for each temperature (without rescaling with the growth rate) in Figure 5a of the revised manuscript. We show the exponential fits corresponding to each temperature in the same figure. In addition, we provided growth curves for each different samples at each temperature in Figure 5—Figure supplement 1.
3. Further elaborate if the oscillations could be due to the fact that the population may have not reached the steady state in LB.
As mentioned in point 1., the new Appendix 7. presents a solution of a model that does not assume a steady state. We instead assume that the fork firing rate k declines with time as a consequence of nutrient depletion. We find that the DNA abundance predicted by this model presents only minor differences from that predicted by the model assuming steady growth.
In particular, we find that the discrepancy between these two models is not compatible with the shape as the oscillations of DNA abundance that we observe in the data. Most importantly, this discrepancy is much smaller than the magnitude of the observed oscillations. This result supports that the oscillations are not an artefact of the possible lack of a welldefined steadystate.
We do, however, agree that it will be important to verify our result in different kinds of media, and mention this point in the revised Discussion.
Reviewer 1 (Recommendations for the authors):
I feel the requests and suggestions which were presented to the authors to revise the presented manuscript were very reasonable However, after reading the revised manuscript and the responses provided by the authors, I now feel that the authors did not even do the bare minimum to address the raised concerns. This particularly concerns a major concern about steady state growth which I raised, and which have been discussed away very carelessly by the authors. I do not support publication until this issue is honestly addressed and fixed in the manuscript as it relates to the major finding promoted by the authors: oscillating replication speeds at fast growth.
In their reply the authors commented on my concern about the steady growth assumption in LB and state that their ‘medium leads to welldefined steady growth rates’. I think this statement is simply false. The fact that a growthrate can be reproducibly obtained is not sufficient to prove that cells are in a steady state. For that one needs to show that the same growthrate is obtained over several generations. Why is being picky on this issue so important? Because the authors assume a steady state in their method to infer replication speeds (with e.g. the firing rate being equal the growth rate etc). But it really remains unclear how well the inference method works in cultures which adjust to changing conditions and which are thus not in a steady state. This is a real concern for growth in LB which the authors chose as major growth condition. It is impossible to obtain a good steadily growing culture in LB as the availability of amino acids and other nutrient sources is drastically changing. This can for example be seen in growth curves where the density clearly derivates from an exponential fit when analyzing growth over several doublings. So with this the question remains: are the oscillations the authors report real or are they rather an artifact of falsely assuming steady state growth when doing their analysis. This concerns is highlighted even further by the newly presented findings stating that oscillations in speed are not observed for slower growth in minimal medium. So are these oscillations only observed at fast growth in LB because oscillations in speed are a biological reality at fast growth, or are they observed because a steady state assumption has been falsely assumed? I really want to be constructive here but feel the authors must address this concern very rigorously before this work should be published. So let me try to phrase my concerns in even more detail and in the context of the author's reply
We thank the Reviewer for his/her comments. We apologize for having misinterpreted his/her major concern, which has lead to a revision that did not fully address this concern.
Let us provide a general answer to the Reviewer’s comments and then enter the more specific points in the following. First of all, our individual growth curves are very well fitted by a logistic function. This function serves to describe both the exponential regime and the consequent saturation to the stationary phase, as customarily done and as discussed in (Zwietering et al. 1990). In the new revision, we provide additional figures showing fits of individual curves and summary curves at different temperatures, making it possible to appreciate the quality of the data. In all cases, we harvested cells during the exponential growth phase. We also remark that the cultures used to inoculate medium were previously grown in fresh medium until reaching an OD600 of about 1.0 at the target temperature.
In summary, our results show that our growth curves are very reproducible and very well characterized by a logistic growth, i.e. an exponential phase followed by a stationary phase. The exponential phase lasts for about 4 generations at all temperatures. On the other hand, although this exponential phase was preceded by another exponential growth, we agree with the Reviewer that we cannot be sure that cells are in a steady growth regime.
To address this issue, we generalized our model to a situation in which the fork firing rate is timedependent and, in particular, slows down with time following nutrient depletion. These new results show that relaxing the assumption of steady growth has very limited impact on our results. In particular, the deviations we see from the steady model are far too small to explain the oscillations we see in our data.
In conclusion, we appreciate the issue raised by the Reviewer and agree that future experiments with simpler media will be important. However, our new theoretical results show that assuming nearexponential growth for a few generations is sufficient for our aims, and support the idea that the oscillations we observe are a real phenomenon.
To address my concern the authors simply showed the growthcurves in Figure 5 and stated that this supports cultures are in exponential steady growth. However, the presented analysis is not sufficient to prove steady state. For example, there are not exponential fits shown which prove that growthrates are maintained for reasonable ranges of density and time. In fact, when looking at the presented panel 5a it appears that the growthcurves at different temperatures are pretty much comparable, while a more detailed fit presumably used to infer growthrate values (instantaneous rates at a specific OD/timepoint) suggest that growthrates are different and this difference is what the authors use in the second panel of that figure and to discuss the relation between growth rate and replication speed. Given what is known about growth on LB a zoomed in version of these growth curves will reveal that growth is not really exponential for a reasonable long amount of time (say dozens of generations until protein levels reach a steady state) and as such no steady state have been obtained. In this context I strongly believe that a more careful experimentation with a welldefined steady states in a welldefined media (e.g. rich defined media with exponential growth and the some growth rates observed over > 10 doublings) would have been way better to establish that the inferred oscillations in speed are a real biological phenomenon. I understand that the authors are reluctant to add such additional experiments and I am not insisting on them. However, the concern needs to be taken seriously. I think the authors should:
1) Clearly state in the manuscript that the presented measurements have not been taken during a welldefined steady state
In the revised version, we clearly state at the end of the Results section that, although the growth curves appear as exponential, we cannot be sure that the cultures have reached a regime of steady growth. Then, to address this shortcoming, we relax the assumption of steady growth in the model. Our results, presented in the new Appendix 7, show that assuming a nearconstant fork firing rate for about 4 generations (as we observe) is sufficient to obtain DNA abundance curves that are very similar to those of the steadystate model. This result indicates that the possible lack of steady growth in LB medium should not significantly affect our conclusions.
2) Plot and analyze better the growthcurves such that the growth behavior and the differences in growth for different temperatures become visible). In such a plot exponential fits to the growth curves should also be shown which directly allow the reader to compare how welldefined and distinct the growthrates really are.
We agree with the Reviewer that providing individual growth curves can help to appreciate whether the populations are in the exponential growth phase. Following this suggestion, we now include average growth curves for each temperature (without rescaling with the growth rate) in Figure 5a of the revised manuscript. We also show the exponential fits corresponding to each temperature in the same figure. In addition, we provided growth curves for each different sample at each temperature in Figure 5—figure supplement 1. We thank the Reviewer for suggesting this addition that should provide the reader with all the necessary information.
3) Raise in the conclusion of the paper that there is a possibility that oscillations might occur because the cultures grew in LB and have thus not reached steady state.
Our theoretical results do not support the idea that oscillations might be caused by the fact that in LB medium cultures have not reached steady state. In the revised conclusion, we do however mention this issue and suggest that future studies in different media will be important.
4) Include some sort of error propagation analysis which shows how strong the errors in replication speed can be when assuming a culture not yet in steady growth but running the analysis to infer steady state with the assumption of a steady state. For example, assuming the number of cells with different replication forks is still slowly adjusting with a certain timescale and assuming the replication speed is constant, use a fully dynamical model to derive a DNA abundance distribution. Then inferring the speed by analysis the distribution (using the presented approach assuming steady state).
As stated in the answer to point (1), we studied a variant of the model which does not assume steady state to address this point. This analysis is presented in the new Appendix 7.
We find that this model leads to an inference of the replisome speed which is very close to that of the original steadystate model. Further, the discrepancy between these two models is not consistent with the shape of the oscillations of DNA abundance that we observe. Most importantly, this discrepancy is much smaller than the magnitude of the oscillations of our data. This result indicates that the oscillations are not caused by a possible lack of a welldefined steadystate.
https://doi.org/10.7554/eLife.75884.sa2Article and author information
Author details
Funding
Japan Society for the Promotion of Science (JP18K03473)
 Simone Pigolotti
Deutsche Forschungsgemeinschaft (452628014)
 Samuel Hauf
Deutsche Forschungsgemeinschaft (HA9374/11)
 Samuel Hauf
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We are grateful for the help and support provided by the Scientific Computing and Data Analysis section of the Research Support Division at OIST. We thank the DNA Sequencing Section of OIST, in particular N Arasaki, for support with sequencing. This work was supported by JSPS KAKENHI Grant No. JP18K03473 (to SP) and a grant from Deutsche Forschungsgemeinschaft (DFG, grant number: 452628014 to SH). We thank M Cencini and P Sartori for feedback on a preliminary version of our manuscript.
Senior Editor
 Jessica K Tyler, Weill Cornell Medicine, United States
Reviewing Editor
 Armita Nourmohammad, University of Washington, United States
Reviewer
 Ariel Amir, Harvard University, United States
Publication history
 Preprint posted: October 15, 2021 (view preprint)
 Received: November 26, 2021
 Accepted: July 13, 2022
 Accepted Manuscript published: July 25, 2022 (version 1)
 Version of Record published: August 17, 2022 (version 2)
Copyright
© 2022, Bhat et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics

 485
 Page views

 214
 Downloads

 0
 Citations
Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading

 Computational and Systems Biology
 Physics of Living Systems
Cellular behaviors emerge from layers of molecular interactions: proteins interact to form complexes, pathways, and phenotypes. We show that hierarchical networks of protein interactions can be defined from the statistical pattern of proteome variation measured across thousands of diverse bacteria and that these networks reflect the emergence of complex bacterial phenotypes. Our results are validated through geneset enrichment analysis and comparison to existing experimentallyderived databases. We demonstrate the biological utility of our approach by creating a model of motility in Pseudomonas aeruginosa and using it to identify a protein that affects pilusmediated motility. Our method, SCALES (Spectral Correlation Analysis of Layered Evolutionary Signals), may be useful for interrogating genotypephenotype relationships in bacteria.

 Physics of Living Systems
Animal behavior and neural recordings show that the brain is able to measure both the intensity and the timing of odor encounters. However, whether intensity or timing of odor detections is more informative for olfactorydriven behavior is not understood. To tackle this question, we consider the problem of locating a target using the odor it releases. We ask whether the position of a target is best predicted by measures of timing vs intensity of its odor, sampled for a short period of time. To answer this question, we feed data from accurate numerical simulations of odor transport to machine learning algorithms that learn how to connect odor to target location. We find that both intensity and timing can separately predict target location even from a distance of several meters; however, their efficacy varies with the dilution of the odor in space. Thus, organisms that use olfaction from different ranges may have to switch among different modalities. This has implications on how the brain should represent odors as the target is approached. We demonstrate simple strategies to improve accuracy and robustness of the prediction by modifying odor sampling and appropriately combining distinct measures together. To test the predictions, animal behavior and odor representation should be monitored as the animal moves relative to the target, or in virtual conditions that mimic concentrated vs dilute environments.