Abstract
Key enzymatic processes use the nonequilibrium error correction mechanism called kinetic proofreading to enhance their specificity. The applicability of traditional proofreading schemes, however, is limited because they typically require dedicated structural features in the enzyme, such as a nucleotide hydrolysis site or multiple intermediate conformations. Here, we explore an alternative conceptual mechanism that achieves error correction by having substrate binding and subsequent product formation occur at distinct physical locations. The time taken by the enzyme–substrate complex to diffuse from one location to another is leveraged to discard wrong substrates. This mechanism does not have the typical structural requirements, making it easier to overlook in experiments. We discuss how the length scales of molecular gradients dictate proofreading performance, and quantify the limitations imposed by realistic diffusion and reaction rates. Our work broadens the applicability of kinetic proofreading and sets the stage for studying spatial gradients as a possible route to specificity.
Introduction
The nonequilibrium mechanism called kinetic proofreading (Hopfield, 1974; Ninio, 1975) is used for reducing the error rates of many biochemical processes important for cell function (e.g. DNA replication [Kunkel, 2004], transcription [Sydow and Cramer, 2009], translation [Rodnina and Wintermeyer, 2001; Ieong et al., 2016], signal transduction [Swain and Siggia, 2002], or pathogen recognition [McKeithan, 1995; Goldstein et al., 2004; Cui and Mehta, 2018]). Proofreading mechanisms operate by inducing a delay between substrate binding and product formation via intermediate states for the enzyme–substrate complex. Such a delay gives the enzyme multiple chances to release the wrong substrate after initial binding, allowing far lower error rates than what one would expect solely from the binding energy difference between right and wrong substrates.
Traditional proofreading schemes require dedicated molecular features such as an exonuclease pocket in DNA polymerases (Kunkel, 2004) or multiple phosphorylation sites on T-cell receptors (McKeithan, 1995; Goldstein et al., 2004); such features create intermediate states that delay product formation (Figure 1a) and thus allow proofreading. Additionally, since proofreading is an active nonequilibrium process often involving near-irreversible reactions, the enzyme typically needs to have an ATP or GTP hydrolysis site to enable the use of energy supplies of the cell (Yamane and Hopfield, 1977; Rodnina and Wintermeyer, 2001). Due to such stringent structural requirements, the number of confirmed proofreading enzymes is relatively small. Furthermore, generic enzymes without such dedicated features are assumed to not have active error correction available to them.
In this work, we propose an alternative scheme where the delay between initial substrate binding and product formation steps is achieved by separating these events in space. If substrates are spatially localized and product formation is favorable only in a region of low substrate concentration where an activating effector is present then the time taken by the enzyme–substrate complex to travel from one location to the other can be used to discard the wrong substrates, which are assumed to unbind from the enzyme more readily than the right substrates (Figure 1b). When this delay is longer than substrate unbinding time scales, very low error rates of product formation can be achieved, allowing this spatial proofreading scheme to outperform biochemical mechanisms with a finite number of proofreading steps.
In contrast to traditional proofreading, the nonequilibrium mechanism here does not require any direct energy consumption by the enzyme or substrate itself (e.g. through ATP hydrolysis). This liberates the enzyme from any proofreading-specific molecular features; indeed, any ‘equilibrium’ enzyme with a localized effector can proofread using our scheme if appropriate concentration gradients of the substrates or enzymes are set up. In this way, the energetic and structural requirements of proofreading can be outsourced from the enzyme and substrate to the gradient-maintaining mechanism. It also means that spatial proofreading is easy to overlook in experiments, and that the fidelity of reconstituted reactions in vitro could be lower than the fidelity in vivo.
The lack of reliance on structure makes spatial proofreading more adaptable. We study how tuning the length scale of concentration gradients can trade off error rate against speed and energy consumption on the fly. In contrast, traditional proofreading schemes rely on nucleotide chemical potentials, for example, the out of equilibrium [ATP]/[ADP] ratio in the cell, and cannot modulate their operation without broader physiological disruptions.
Our proposed scheme can be leveraged for specificity if appropriate concentration gradients are set. Such gradients arise in multiple cellular contexts (e.g. near the nucleus, the plasma membrane, the Golgi apparatus, the endoplasmic reticulum [ER], kinetochores, microtubules [Bivona et al., 2003; Caudron et al., 2005; Kholodenko, 2006]) and several gradient-forming mechanisms have been discussed in the literature (Wu et al., 2018; Kholodenko, 2006; Kholodenko, 2003). We conclude our analysis of spatial proofreading by quantifying its limitations as set by realistic reaction rates and gradient formation mechanisms, and discuss examples from the literature, including the localization of mRNAs in polarized cells, and the nonvesicular transport of lipids in eukaryotic cells, in which this mechanism might be in play. Our work motivates a detailed investigation of spatial structures and compartmentalization in living cells as possible delay mechanisms for proofreading enzymatic reactions.
Results
Slow transport of enzymatic complex enables proofreading
Our proposed scheme is based on spatially separating substrate binding and product formation events for the enzyme (Figure 1b). Such a setting arises naturally if substrates are spatially localized by having concentration gradients in a cellular compartment. Similarly, an effector needed for product formation (e.g. through allosteric activation) may have a spatial concentration gradient localized elsewhere in that compartment. To keep our model simple, we assume that the right (R) and wrong (W) substrates have identical concentration gradients of length scale ${\lambda}_{{}_{\text{S}}}$ but that the effector is entirely localized to one end of the compartment, for example via membrane tethering. In Appendix 4, we extend our study of model performance to the scenario where the two substrates have different localization length scales.
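We model our system using coupled reaction–diffusion equations for the substrate-bound (‘ES’ with $\text{S}=\text{R},\text{W}$) and free (‘E’) enzyme densities, namely,
$$\frac{\partial {\rho}_{{}_{\text{ER}}}}{\partial t}=D\frac{{\partial}^{2}{\rho}_{{}_{\text{ER}}}}{\partial {x}^{2}}-{k}_{\text{off}}^{\text{R}}{\rho}_{{}_{\text{ER}}}+{k}_{\text{on}}{\rho}_{{}_{\text{R}}}(x){\rho}_{{}_{\text{E}}}, \tag{1}$$
$$\frac{\partial {\rho}_{{}_{\text{EW}}}}{\partial t}=D\frac{{\partial}^{2}{\rho}_{{}_{\text{EW}}}}{\partial {x}^{2}}-{k}_{\text{off}}^{\text{W}}{\rho}_{{}_{\text{EW}}}+{k}_{\text{on}}{\rho}_{{}_{\text{W}}}(x){\rho}_{{}_{\text{E}}}, \tag{2}$$
$$\frac{\partial {\rho}_{{}_{\text{E}}}}{\partial t}=D\frac{{\partial}^{2}{\rho}_{{}_{\text{E}}}}{\partial {x}^{2}}+\sum _{\text{S}=\text{R},\text{W}}\left[{k}_{\text{off}}^{\text{S}}{\rho}_{{}_{\text{ES}}}-{k}_{\text{on}}{\rho}_{{}_{\text{S}}}(x){\rho}_{{}_{\text{E}}}\right]. \tag{3}$$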
Here, $D$ is the enzyme diffusion constant, ${k}_{\text{on}}$ and ${k}_{\text{off}}^{\text{S}}$ (with ${k}_{\text{off}}^{\text{W}}>{k}_{\text{off}}^{\text{R}}$) are the substrate binding and unbinding rates, respectively, and ${\rho}_{{}_{\text{S}}}(x)\sim {e}^{-x/{\lambda}_{{}_{\text{S}}}}$ is the spatially localized substrate concentration profile. We take this profile to be exponentially decaying, as is often the case for profiles created by cellular gradient formation mechanisms (Driever and Nüsslein-Volhard, 1988; Brown and Kholodenko, 1999). We limit our discussion to this one-dimensional setting of the system, though our treatment can be generalized to two and three dimensions in a straightforward way.
The above model does not explicitly account for several effects relevant to living cells, such as depletion of substrates or distinct diffusion rates for the free and substrate-bound enzymes. More importantly, it does not account for the mechanism of substrate gradient formation. We analyze a biochemically detailed model with this latter feature and experimentally constrained parameters later in the paper. Here, we proceed with the minimal model above for explanatory purposes. To identify the key determinants of the model’s performance, we assume throughout our analysis that the amount of substrates is sufficiently low that the enzymes are mostly free with a roughly uniform profile (i.e. ${\rho}_{{}_{\text{E}}}\approx \text{constant}$). This assumption makes Equations 1–3 linear and allows us to solve them analytically at steady state. We demonstrate in Appendix 5 that proofreading is, in fact, most effective under this assumption and discuss the consequences of having high substrate amounts on the performance of the scheme.
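The linearized steady-state problem can also be checked numerically. The sketch below is not the authors' code: it assumes a uniform free-enzyme density, no-flux walls at both ends (i.e. slow catalysis), and an exponential source term standing in for ${k}_{\text{on}}{\rho}_{{}_{\text{S}}}(x){\rho}_{{}_{\text{E}}}$. The function names, the cell-centred grid and the parameter values are illustrative choices.

```python
import math

def solve_tridiagonal(lower, diag, upper, rhs):
    """Thomas algorithm for a tridiagonal system (lower[0], upper[-1] are unused)."""
    n = len(rhs)
    c_prime = [0.0] * n
    d_prime = [0.0] * n
    c_prime[0] = upper[0] / diag[0]
    d_prime[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c_prime[i - 1]
        c_prime[i] = upper[i] / denom
        d_prime[i] = (rhs[i] - lower[i] * d_prime[i - 1]) / denom
    x = [0.0] * n
    x[-1] = d_prime[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d_prime[i] - c_prime[i] * x[i + 1]
    return x

def steady_state_complex_profile(k_off, D=1.0, L=1.0, lambda_s=0.02,
                                 binding_prefactor=1.0, n_cells=1000):
    """Solve D*rho'' - k_off*rho + source(x) = 0 with no-flux walls at x=0 and x=L.

    source(x) = binding_prefactor * exp(-x / lambda_s) is an assumed stand-in for
    k_on * rho_S(x) * rho_E with rho_E constant.
    Returns (cell centres, rho_ES values, source values).
    """
    h = L / n_cells
    coupling = D / h**2
    xs = [(i + 0.5) * h for i in range(n_cells)]
    source = [binding_prefactor * math.exp(-x / lambda_s) for x in xs]
    lower = [-coupling] * n_cells
    upper = [-coupling] * n_cells
    diag = [k_off + 2.0 * coupling] * n_cells
    # Boundary cells exchange with a single neighbour only (no flux through walls).
    diag[0] = k_off + coupling
    diag[-1] = k_off + coupling
    rho = solve_tridiagonal(lower, diag, upper, source)
    return xs, rho, source

# Illustrative values in units where D = L = 1, so tau_D = 1 and eta_eq = 10.
k_off_R, k_off_W = 4.0, 40.0
xs, rho_R, src = steady_state_complex_profile(k_off_R)
_, rho_W, _ = steady_state_complex_profile(k_off_W)

# Fidelity proxy: ratio of right to wrong complex densities next to the effector.
fidelity = rho_R[-1] / rho_W[-1]
print(f"fidelity at the right boundary: {fidelity:.1f} (eta_eq = 10)")
```

At steady state the total unbinding rate must equal the total binding rate, which gives a quick sanity check on the discretization.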
In our simplified picture, enzyme activation and catalysis take place upon reaching the right boundary at a rate $r$ that is identical for both substrates. Therefore, the density of substrate-bound enzymes at the right boundary can be taken as a proxy for the rate of product formation ${v}_{\text{S}}$, since
$${v}_{\text{S}}=r\,{\rho}_{{}_{\text{ES}}}(L), \tag{4}$$
where L is the size of the compartment. In order to keep the analytical results concise and intuitive, we perform our main analyses under the assumption that catalysis is slow, mirroring the study of traditional proofreading schemes (Hopfield, 1974). In Appendix 3, we derive the precise conditions under which this treatment is valid, and generalize our analysis to arbitrary catalysis rates.
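To demonstrate the proofreading capacity of the model, we first analyze the limiting case where substrates are localized to the left end of the compartment (${\lambda}_{{}_{\text{S}}}\to 0$). In this limit, the fidelity $\eta $, defined as the number of right products formed per single wrong product, becomes
$$\eta =\sqrt{{\eta}_{\text{eq}}}\,\frac{\mathrm{sinh}\left(\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{W}}}\right)}{\mathrm{sinh}\left(\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{R}}}\right)}, \tag{5}$$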
where ${\eta}_{\text{eq}}={k}_{\text{off}}^{\text{W}}/{k}_{\text{off}}^{\text{R}}$ is the equilibrium fidelity, and ${\tau}_{D}={L}^{2}/D$ is the characteristic time scale of diffusion across the compartment (see Appendix 1 for the derivation).
Equation 5 is plotted in Figure 2 for a family of different parameter values. As can be seen, when diffusion is fast (small ${\tau}_{D}$), fidelity converges to its equilibrium value and proofreading is lost ($\eta \approx \sqrt{{\eta}_{\text{eq}}}\times \sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{W}}/{\tau}_{D}{k}_{\text{off}}^{\text{R}}}={\eta}_{\text{eq}}$). Conversely, when diffusion is slow (large ${\tau}_{D}$), the enzyme undergoes multiple rounds of binding a substrate at the left end and unbinding midway until it manages to diffuse across the whole compartment as a complex and form a product. These rounds serve as ‘futile cycles’ that endow the system with proofreading. In this regime, fidelity scales as
$$\eta \approx \sqrt{{\eta}_{\text{eq}}}\,{e}^{\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{W}}}-\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{R}}}}. \tag{6}$$
To get further insights, we introduce an effective number of extra biochemical intermediates (n) that a traditional proofreading scheme would need to have in order to yield the same fidelity, that is $\eta /{\eta}_{\text{eq}}={\eta}_{\text{eq}}^{n}$. We calculate this number as (see Appendix 1)
Notably, since ${\tau}_{D}\sim {L}^{2}$, the result above suggests a linear relationship between the effective number of proofreading realizations and the compartment size ($n\sim L$). In addition, because the right-hand side of Equation 7 is an increasing function of ${k}_{\text{off}}^{\text{W}}$, the proofreading efficiency of the scheme rises with larger differences in substrate off-rates (Figure 2) – a feature that ‘hard-wired’ traditional proofreading schemes with a fixed number of proofreading steps lack.
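The two limits and the $n\sim L$ scaling can be explored with a short numerical sketch. It assumes that the ideal-localization fidelity takes the form $\eta =\sqrt{{\eta}_{\text{eq}}}\,\mathrm{sinh}(\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{W}}})/\mathrm{sinh}(\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{R}}})$, which reproduces both the fast-diffusion limit $\eta \to {\eta}_{\text{eq}}$ and the slow-diffusion behavior described above, and it computes $n$ directly from its definition $\eta /{\eta}_{\text{eq}}={\eta}_{\text{eq}}^{n}$. The function names and parameter values are illustrative, not taken from the authors' scripts.

```python
import math

def log_sinh(x):
    """Numerically stable log(sinh(x)) for x > 0 (avoids overflow at large x)."""
    return x + math.log(-math.expm1(-2.0 * x)) - math.log(2.0)

def log_fidelity_ideal(tau_d, k_off_r, k_off_w):
    """log(eta) for perfectly localized substrates (assumed form of Equation 5)."""
    a_r = math.sqrt(tau_d * k_off_r)
    a_w = math.sqrt(tau_d * k_off_w)
    return 0.5 * math.log(k_off_w / k_off_r) + log_sinh(a_w) - log_sinh(a_r)

def effective_steps(tau_d, k_off_r, k_off_w):
    """Effective number n of extra proofreading steps: eta / eta_eq = eta_eq ** n."""
    log_eta_eq = math.log(k_off_w / k_off_r)
    return (log_fidelity_ideal(tau_d, k_off_r, k_off_w) - log_eta_eq) / log_eta_eq

k_off_r, k_off_w = 0.1, 1.0  # illustrative off-rates in 1/s, eta_eq = 10

# Fast diffusion: fidelity falls back to the equilibrium value.
eta_fast = math.exp(log_fidelity_ideal(1e-4, k_off_r, k_off_w))

# Slow diffusion: doubling L quadruples tau_D and roughly doubles n.
n_small = effective_steps(1000.0, k_off_r, k_off_w)
n_large = effective_steps(4000.0, k_off_r, k_off_w)

print(f"eta (fast diffusion) = {eta_fast:.4f}")
print(f"n at tau_D = 1000 s: {n_small:.2f}; at tau_D = 4000 s: {n_large:.2f}")
```

Working with $\mathrm{log}\,\eta $ rather than $\eta $ keeps the calculation finite even when the hyperbolic sines would overflow a floating-point number.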
Navigating the speed–fidelity tradeoff
As is inherent to all proofreading schemes, the fidelity enhancement described earlier comes at a cost of reduced product formation speed. This reduction, in our case, happens because of increased delays in diffusive transport. Here, we explore the resulting speed–fidelity tradeoff and its different regimes by varying two of the model parameters: diffusion time scale ${\tau}_{D}$ and the substrate localization length scale ${\lambda}_{{}_{\text{S}}}$.
Speed and fidelity for different sampled values of ${\tau}_{D}$ and ${\lambda}_{{}_{\text{S}}}$ are depicted in Figure 3a. As can be seen, for a fixed ${\tau}_{D}$, the reduction of ${\lambda}_{{}_{\text{S}}}$ can trade off fidelity against speed. This tradeoff is intuitive; with tighter substrate localization, the complexes are formed closer to the left boundary. Hence, a smaller fraction of complexes reach the activation region, reducing reaction speed. The Pareto-optimal front of the tradeoff over the whole parameter space, shown as a red curve on the plot, is reached in the limit of ideal substrate localization (${\lambda}_{{}_{\text{S}}}\to 0$). Varying the diffusion time scale allows one to navigate this optimal tradeoff curve and access different performance regimes.
Specifically, if the diffusion time scale is fast compared with the time scales of substrate unbinding (i.e. ${\tau}_{D}\ll 1/{k}_{\text{off}}^{\text{R}},1/{k}_{\text{off}}^{\text{W}}$), then both right and wrong complexes that form near the left boundary arrive at the activation region with high probability, resulting in high speeds, although at the expense of error-prone product formation (Figure 3b, top). In the opposite limit of slow diffusion, both types of complexes have exponentially low densities at the activation region, but due to the difference in substrate off-rates, production is highly accurate (Figure 3b, bottom). There also exists an intermediate regime where a significant fraction of right complexes reach the activation region while the vast majority of wrong complexes do not (Figure 3b, middle). As a result, an advantageous tradeoff is achieved where a moderate decrease in the production rate yields high fidelity enhancement – a feature that was also identified in multistep traditional proofreading models (Murugan et al., 2012).
In Appendix 3, we also study this tradeoff caused by varying the catalysis rate $r$. Briefly, we find that when all other parameters are fixed, increasing $r$ trades off fidelity against speed in a linear fashion, with the ratio of highest and lowest fidelity values falling in the $[\sqrt{{\eta}_{\text{eq}}},{\eta}_{\text{eq}}]$ range. The Pareto-optimal front of the tradeoff, however, monotonically shifts toward the higher speed region, suggesting that faster catalysis is, in fact, more favorable if the diffusion time scale ${\tau}_{D}$ can be adjusted accordingly (see Appendix 3 for details).
We saw in Figure 3a that in the case of ideal substrate localization, the slowdown of diffusive transport necessarily reduced the production rate and increased the fidelity. The latter part of this statement, however, breaks down when substrate gradients are weak. Indeed, fidelity exhibits a nonmonotonic response to tuning ${\tau}_{D}$ when the substrate gradient length scale ${\lambda}_{{}_{\text{S}}}$ is nonzero (Figure 3c). The reason for the eventual decay in fidelity is that with slower diffusion (larger ${\tau}_{D}$), substrate binding and unbinding events take place more locally, and therefore the right and wrong complex profiles start to resemble the substrate profile itself, which does not discriminate between the two substrate kinds. We show in Appendix 1 that the optimal diffusion time scale can be roughly approximated as ${\tau}_{D}^{*}/{\tau}_{\text{off}}^{\text{R}}\approx {\eta}_{\text{eq}}^{-1}{(L/{\lambda}_{{}_{\text{S}}})}^{2}$ (with ${\tau}_{\text{off}}^{\text{R}}=1/{k}_{\text{off}}^{\text{R}}$), which increases monotonically with $L/{\lambda}_{{}_{\text{S}}}$, consistent with the shifting peaks in Figure 3c.
Not surprisingly, the error-correcting capacity of the scheme improves with better substrate localization (lower ${\lambda}_{{}_{\text{S}}}$). For a fixed ${\tau}_{D}$, the bulk of this improvement takes place when $L/{\lambda}_{{}_{\text{S}}}$ is tuned in a range set by the two key dimensionless numbers of the model, namely, $\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{R}}}$ and $\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{W}}}$ (Figure 3c, inset). In Appendix 1, we provide an analytical justification for this result. Taken together, these parametric studies uncover the operational principles of the spatial proofreading scheme and demonstrate how the speed–fidelity tradeoff could be dynamically navigated as needed by tuning the key time and length scales of the model.
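The nonmonotonic dependence of fidelity on ${\tau}_{D}$ at finite ${\lambda}_{{}_{\text{S}}}$ can be illustrated with a closed-form sketch. It is our own construction rather than the expression derived in Appendix 1: it assumes a uniform free-enzyme profile, no-flux boundaries (slow catalysis) and an exponential substrate profile, and obtains the complex density at $x=L$ by integrating the source against the Green's function of the steady-state diffusion–decay operator. The symbols `a` $=\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}}$ and `b` $=L/{\lambda}_{{}_{\text{S}}}$, and all function names, are illustrative.

```python
import math

def one_minus_exp_over(z):
    """(1 - exp(-z)) / z, with the z = 0 limit handled explicitly."""
    return 1.0 if z == 0.0 else -math.expm1(-z) / z

def complex_density_at_effector(a, b):
    """rho_ES(L) up to a prefactor shared by right and wrong substrates.

    a = sqrt(tau_D * k_off), b = L / lambda_S. Valid for a below roughly 700,
    beyond which sinh overflows in double precision.
    """
    overlap = 0.5 * (one_minus_exp_over(b - a) + one_minus_exp_over(b + a))
    return overlap / (a * math.sinh(a))

def fidelity(tau_d_k_off_r, eta_eq, b):
    """Ratio of right to wrong complex densities at the effector location."""
    a_r = math.sqrt(tau_d_k_off_r)
    a_w = math.sqrt(tau_d_k_off_r * eta_eq)
    return complex_density_at_effector(a_r, b) / complex_density_at_effector(a_w, b)

eta_eq, b = 10.0, 10.0  # illustrative: tenfold off-rate ratio, L / lambda_S = 10

eta_fast = fidelity(0.01, eta_eq, b)   # fast diffusion: close to eta_eq
eta_mid = fidelity(9.0, eta_eq, b)     # intermediate: strong enhancement
eta_slow = fidelity(900.0, eta_eq, b)  # slow diffusion: decays back toward eta_eq

for label, value in [("fast", eta_fast), ("intermediate", eta_mid), ("slow", eta_slow)]:
    print(f"{label:>12} diffusion: fidelity = {value:.1f}")
```

Scanning `tau_d_k_off_r` on a logarithmic grid for several values of `b` gives a qualitative picture of how the fidelity peak shifts to larger ${\tau}_{D}$ as the gradient sharpens.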
Energy dissipation and limits of proofreading performance
A hallmark signature of proofreading is that it is a nonequilibrium mechanism with an associated free energy cost. In our scheme, the enzyme itself is not directly involved in any energy-consuming reactions, such as hydrolysis. Instead, the free energy cost comes from maintaining the spatial gradient of substrates, which the enzymatic reaction tends to homogenize by releasing bound substrates in regions of low substrate concentration. As the activating effectors are assumed to be tethered at $x=L$, they do not require dissipation to remain localized.
While mechanisms of substrate gradient maintenance may differ in their energetic efficiency, there exists a thermodynamically dictated minimum energy that any such mechanism must dissipate per unit time. We calculate this minimum power $P$ as
$$P=\sum _{\text{S}=\text{R},\text{W}}{\int}_{0}^{L}{j}_{{}_{\text{S}}}(x)\,\mu (x)\,\text{d}x. \tag{8}$$
Here ${j}_{{}_{\text{S}}}(x)={k}_{\text{on}}{\rho}_{{}_{\text{S}}}(x){\rho}_{{}_{\text{E}}}-{k}_{\text{off}}^{\text{S}}{\rho}_{{}_{\text{ES}}}(x)$ is the net local binding flux of substrate ‘S’, and $\mu (x)$ is the local chemical potential (see Appendix 2.1 for details). For substrates with an exponentially decaying profile considered here, the chemical potential is given by
$$\mu (x)=\mu (0)+{k}_{\text{B}}T\,\mathrm{ln}\frac{{\rho}_{{}_{\text{S}}}(x)}{{\rho}_{{}_{\text{S}}}(0)}=\mu (0)-{k}_{\text{B}}T\,\frac{x}{{\lambda}_{{}_{\text{S}}}}, \tag{9}$$
where ${k}_{\text{B}}T$ is the thermal energy scale. Notably, the chemical potential difference across the compartment, which serves as an effective driving force for the scheme, is set by the inverse of the nondimensionalized substrate localization length scale, namely,
$$\beta \mathrm{\Delta}\mu =\beta \left[\mu (0)-\mu (L)\right]=\frac{L}{{\lambda}_{{}_{\text{S}}}}, \tag{10}$$
where ${\beta}^{-1}={k}_{\text{B}}T$. This driving force is zero for a uniform substrate profile (${\lambda}_{{}_{\text{S}}}\to \mathrm{\infty}$) and increases with tighter localization (lower ${\lambda}_{{}_{\text{S}}}$), as intuitively expected.
We used Equation 8 to study the relationship between dissipation and fidelity enhancement as we tuned $\mathrm{\Delta}\mu $ for different choices of the diffusion time scale ${\tau}_{D}$. As can be seen in Figure 4, power rises with increasing fidelity, diverging when fidelity reaches its asymptotic maximum given by Equation 5 in the large $\mathrm{\Delta}\mu $ limit. For the bulk of each curve, power scales as the logarithm of fidelity, suggesting that a linear increase in dissipation can yield an exponential reduction in error. Notably, such a scaling relationship has also been calculated in the context of E. coli chemoreceptor adaptation (Lan et al., 2012). In particular, it was shown that the adaptation error decreases exponentially with energy dissipated through multiple methylation–demethylation cycles which are used to stabilize the activity state of the receptor. Analogies in the cost-performance tradeoff across these functionally distinct mechanisms contribute to the search for overarching thermodynamic themes underlying cellular information processing (Lan et al., 2012; Lan and Tu, 2013; Horowitz et al., 2017; Sartori and Pigolotti, 2015).
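The logarithmic scaling is achieved in our model when the driving force is in a range where most of the fidelity enhancement takes place, namely,
$$\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{R}}}\lesssim \beta \mathrm{\Delta}\mu \lesssim \sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{W}}}. \tag{11}$$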
At the end of this range, the cost per substrate binding event approaches $\sqrt{{\eta}_{\text{eq}}}$ in ${k}_{\text{B}}T$ units (see Appendix 2.1 for details). And beyond the range, additional error correction is attained at an increasingly higher cost.
Note that the power computed here does not include the baseline cost of creating the substrate gradient, which, for instance, would depend on the substrate diffusion constant. We only account for the additional cost to be paid due to the operation of the proofreading scheme which works to homogenize this substrate gradient. The baseline cost in our case is analogous to the work that ATP synthase needs to perform to maintain a nonequilibrium [ATP]/[ADP] ratio in the cell, whereas our calculated power is analogous to the rate of ATP hydrolysis by a traditional proofreading enzyme. We discuss these two classes of dissipation in greater detail in Appendix 2.3.
Just as the cellular chemical potential of ATP or GTP imposes a thermodynamic upper bound on the fidelity enhancement by any proofreading mechanism (Qian, 2006), the effective driving force $\mathrm{\Delta}\mu $ imposes a similar constraint for the spatial proofreading model. This thermodynamic limit depends only on the available chemical potential and is equal to ${e}^{\beta \mathrm{\Delta}\mu}$. This limit can be approached very closely by our model, which for $\mathrm{\Delta}\mu \gtrsim 1$ achieves the exponential enhancement with an additional linear prefactor, namely, ${(\eta /{\eta}_{\text{eq}})}^{\text{max}}\approx {e}^{\beta \mathrm{\Delta}\mu}/\beta \mathrm{\Delta}\mu $ (see Appendix 2.2). Such scaling behavior was theoretically accessible only to infinite-state traditional proofreading schemes (Qian, 2006; Ehrenberg and Blomberg, 1980). This offers a view of spatial proofreading as a procession of the enzyme through an infinite series of spatial filters and suggests that, from the perspective of peak error reduction capacity, our model outperforms the finite-state schemes.
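The approach to the thermodynamic limit can be probed numerically. The sketch below assumes $\beta \mathrm{\Delta}\mu =L/{\lambda}_{{}_{\text{S}}}$ and reuses a closed-form steady-state complex density that we derive ourselves under the minimal-model assumptions (uniform free enzyme, no-flux boundaries, exponential substrate profile); it is not the calculation of Appendix 2.2. The parameter values, chosen with a very large ${\eta}_{\text{eq}}$ and fast diffusion relative to right-substrate unbinding, are illustrative.

```python
import math

def one_minus_exp_over(z):
    """(1 - exp(-z)) / z, with the z = 0 limit handled explicitly."""
    return 1.0 if z == 0.0 else -math.expm1(-z) / z

def complex_density_at_effector(a, b):
    """Relative rho_ES(L) for a = sqrt(tau_D * k_off) and b = L / lambda_S."""
    overlap = 0.5 * (one_minus_exp_over(b - a) + one_minus_exp_over(b + a))
    return overlap / (a * math.sinh(a))

def enhancement(tau_d_k_off_r, eta_eq, beta_delta_mu):
    """Fidelity enhancement eta / eta_eq at driving force beta * Delta mu = L / lambda_S."""
    a_r = math.sqrt(tau_d_k_off_r)
    a_w = math.sqrt(tau_d_k_off_r * eta_eq)
    eta = (complex_density_at_effector(a_r, beta_delta_mu)
           / complex_density_at_effector(a_w, beta_delta_mu))
    return eta / eta_eq

beta_delta_mu = 5.0
bound = math.exp(beta_delta_mu)        # thermodynamic limit e^{beta Delta mu}
with_prefactor = bound / beta_delta_mu # scaling quoted in the text

# Illustrative regime: tau_D * k_off_R = 1e-4 and eta_eq = 1e8.
achieved = enhancement(1e-4, 1e8, beta_delta_mu)

print(f"thermodynamic bound:       {bound:.1f}")
print(f"e^(bdm) / bdm:             {with_prefactor:.1f}")
print(f"model enhancement (sketch): {achieved:.1f}")
```

In this regime the enhancement stays below ${e}^{\beta \mathrm{\Delta}\mu}$ and lies within a few percent of ${e}^{\beta \mathrm{\Delta}\mu}/\beta \mathrm{\Delta}\mu $, in line with the scaling stated above.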
Proofreading by biochemically plausible intracellular gradients
Our discussion of the minimal model thus far was not aimed at a particular biochemical system and thus did not involve the use of realistic reaction rates and diffusion constants typically seen in living cells. Furthermore, we did not account for the possibility of substrate diffusion, as well as for the homogenization of substrate concentration gradients due to enzymatic reactions, and have thereby abstracted away the gradient maintaining mechanism. The quantitative inspection of such mechanisms is important for understanding the constraints on spatial proofreading in realistic settings.
Here, we investigate proofreading based on a widely applicable mechanism for creating gradients by the spatial separation of two opposing enzymes (Stelling and Kholodenko, 2009; Bivona et al., 2003; Brown and Kholodenko, 1999). Consider a protein $S$ that in its free state is phosphorylated by a membrane-bound kinase and dephosphorylated by a delocalized cytoplasmic phosphatase, as shown in Figure 5a. This setup will naturally create a gradient of the active form of protein (${S}^{*}$), with the gradient length scale controlled by the rate of phosphatase activity ${k}_{\text{p}}$ (${S}^{*}\stackrel{{k}_{\text{p}}}{\to}S$). Such mechanisms are known to create gradients of the active forms of MEK and ERK (Kholodenko, 2006), of GTPases such as Ran (with GEF and GAP [Kalab et al., 2002] playing the role of kinase and phosphatase, respectively), of cAMP (Kholodenko, 2006) and of stathmin oncoprotein 18 (Op18) (Bastiaens et al., 2006; Niethammer et al., 2004) near the plasma membrane, the Golgi apparatus, the ER, kinetochores and other places.
We test the proofreading power of such gradients, assuming experimentally constrained biophysical parameters for the gradient-forming mechanism. Specifically, we consider an enzyme $E$ that acts on the active forms of cognate (${R}^{*}$) and noncognate (${W}^{*}$) substrates which have off-rates 0.1 s^{-1} and 1 s^{-1}, respectively (hence, ${\eta}_{\text{eq}}=10$). These off-rates are consistent with typical values for substrates proofread by cellular signaling systems (Cui and Mehta, 2018; Gascoigne et al., 2001). The kinases and phosphatases in our setup act identically on right and wrong substrates. We consider a dephosphorylation rate constant ${k}_{\text{p}}=5$ s^{-1} that falls in the range 0.1–100 s^{-1} reported for different phosphatases (Brown and Kholodenko, 1999; Kholodenko et al., 2000; Todd et al., 1999), and a cytosolic diffusion constant $D=1\phantom{\rule{thinmathspace}{0ex}}$ μm^{2}/s for all proteins in this model. With this setup, exponential gradients of length scale ∼0.5 μm are formed for ${R}^{*}$ and ${W}^{*}$. We evaluate the proofreading and energetic performance of the model in a compartment of size $L=10$ μm – a typical length scale in eukaryotic cells (see Appendix 6 for details).
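The quoted length and time scales follow from the stated parameters, as the short sketch below illustrates. It assumes the standard result that a membrane-localized source combined with uniform first-order deactivation produces an exponential gradient of length $\sqrt{D/{k}_{\text{p}}}$, and it evaluates the ideal-localization fidelity of the minimal model only as a reference value; the detailed model of Appendix 6 is not reproduced here. Variable names are illustrative.

```python
import math

# Parameters quoted in the text.
D = 1.0                      # diffusion constant, um^2/s
k_p = 5.0                    # dephosphorylation rate constant, 1/s
L = 10.0                     # compartment size, um
k_off_R, k_off_W = 0.1, 1.0  # off-rates of right and wrong substrates, 1/s

# Assumed standard result for a localized source with uniform first-order decay.
gradient_length = math.sqrt(D / k_p)   # substrate gradient length scale, um
beta_delta_mu = L / gradient_length    # effective driving force in k_B T units

tau_D = L**2 / D                       # diffusion time across the compartment, s
eta_eq = k_off_W / k_off_R             # equilibrium fidelity
lambda_ER = math.sqrt(D / k_off_R)     # mean travel distance of the right complex, um
lambda_EW = math.sqrt(D / k_off_W)     # mean travel distance of the wrong complex, um

# Reference value only: minimal-model fidelity for perfectly localized substrates.
ideal_fidelity = (math.sqrt(eta_eq)
                  * math.sinh(math.sqrt(tau_D * k_off_W))
                  / math.sinh(math.sqrt(tau_D * k_off_R)))

print(f"gradient length scale: {gradient_length:.2f} um")
print(f"driving force: {beta_delta_mu:.1f} k_B T")
print(f"tau_D = {tau_D:.0f} s, lambda_ER = {lambda_ER:.2f} um, lambda_EW = {lambda_EW:.2f} um")
print(f"ideal-localization fidelity: {ideal_fidelity:.0f} (eta_eq = {eta_eq:.0f})")
```

The computed gradient length of about 0.45 μm agrees with the ∼0.5 μm scale stated above. The ideal-localization fidelity is an upper reference for the minimal model and should not be read as the performance of the kinase/phosphatase system itself.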
Although not cost-efficient, this setup achieves proofreading in a wide range of regimes. Specifically, it is most effective when the enzyme–substrate binding is slow, in which case the exponential substrate profile is maintained and the system attains the fidelity predicted by our earlier explanatory model (Figure 5b). The system’s proofreading capacity is retained if the first-order on-rate is raised up to ${k}_{\text{on}}{\rho}_{{}_{\text{E}}}\sim 10$ s^{-1}, where around 10-fold increase in fidelity is still possible. If the binding rate constant (${k}_{\text{on}}$) or the enzyme’s expression level (${\rho}_{{}_{\text{E}}}$) is any higher, then enzymatic reactions overwhelm the ability of the kinase/phosphatase system to keep the active forms of substrates sufficiently localized (Figure 5c) and proofreading is lost. Overall, this model suggests that enzymes can work at reasonable binding rates and still proofread, when accounting for an experimentally characterized gradient-maintaining mechanism.
Discussion
We have outlined a way for enzymatic reactions to proofread and improve specificity by exploiting spatial concentration gradients of substrates. Like the classic model, our proposed spatial proofreading scheme is based on a time delay; but unlike the classic model, here the delay is due to spatial transport rather than transitions through biochemical intermediates. Consequently, the enzyme is liberated from the stringent structural requirements imposed by traditional proofreading, such as multiple intermediate conformations and hydrolysis sites for energy coupling. Instead, our scheme exploits the free energy supplied by active mechanisms that maintain spatial structures.
The decoupling of the two crucial features of proofreading – time delay and free energy dissipation – allows the cell to tune proofreading on the fly. For instance, all proofreading schemes offer fidelity at the expense of reaction speed and energy. For traditional schemes, navigating this tradeoff is not always feasible, as it needs to involve structural changes via mutations or modulation of the [ATP]/[ADP] ratio which can cause collateral effects on the rest of the cell. In contrast, the spatial proofreading scheme is more adaptable to the changing conditions and needs of the cell. The scheme can prioritize speed in one context, and fidelity in another, simply by tuning the length scale of intracellular gradients (e.g. through the regulation of the phosphatase or free enzyme concentration in the scheme discussed earlier).
On the other hand, this modular decoupling can complicate the experimental identification of proofreading enzymes and the interpretation of their fidelity. Here, the enzymes need not be endowed with the structural and biochemical properties typically sought in a proofreading enzyme. At the same time, any attempt to reconstitute enzymatic activity in a well-mixed, in vitro assay will show poor fidelity compared to in vivo measurements, even when all necessary molecular players are present in vitro. Therefore, more care is required in studies of cellular information processing mechanisms that hijack a distant source of free energy compared to the case where the relevant energy consumption is local and easier to link causally to function.
While we focused on spatially localized substrates and delocalized enzymes, our framework would apply equally well to other scenarios, like one with a spatially localized enzyme (or its active form [Kalab et al., 2002; Nalbant et al., 2004]) and effector with delocalized substrates, an example of which would be an alternative version of the scheme in Figure 5a where the target of the kinase/phosphatase activity is changed from substrates to enzymes. Our framework can also be extended to signaling cascades, where slightly different phosphatase activities can result in magnified concentration ratios of two competing signaling molecules at the spatial location of the next cascade step (Roy and Cyert, 2009; Bauman and Scott, 2002; Kholodenko, 2006).
The spatial gradients needed for the operation of our model can be created and maintained through multiple mechanisms in the cell, ranging from the kinase/phosphatase system modeled here, to the passive diffusion of substrates/ligands combined with active degradation (e.g. Bicoid and other developmental morphogens), to active transport processes combined with diffusion. A particularly simple implementation of our scheme is via compartmentalization – substrates and effectors are localized in two spatially separated compartments with the enzyme–substrate complex having to travel from one to another to complete the reaction.
Many molecular localization pathways involving the naturally compartmentalized parts of the cell require high substrate selectivity and are therefore potential candidates for the implementation of spatial proofreading. For example, in polarized, asymmetric cells (e.g. budding yeast or neuronal cells) gene expression often needs to be spatially regulated (Parton et al., 2014; Martin and Ephrussi, 2009). Such regulation is achieved with designated ribonucleoproteins that bind specific mRNAs near the cell nucleus, perform a biased random walk to the mRNA localization site and deliver them for translation. During transport, mRNAs are protected from ribosome binding and when they unbind, they are subject to degradation which would prevent rebinding events at intermediate locations. Another example process is the nonvesicular transport of lipids between the membrane-bound domains of the cells (e.g. the ER, mitochondria, the Golgi apparatus, or the plasma membrane). This transport mechanism is mediated by lipid-transfer proteins that bind lipids on the donor membrane, diffuse to the acceptor membrane and upon interacting with it, undergo a conformational change, delivering the ‘cargo’ (Lev, 2010). Although the higher proximity of the two membranes is thought to enhance the transport efficiency, it would be interesting to study the optimality of the intermembrane distance in the context of fidelity–transport efficiency tradeoff, given the fact that some of the lipid-transfer proteins are known to exhibit specificity for their cognate substrates.
Our scheme may also be applicable as a quality control mechanism in protein secretion pathways (Ellgaard and Helenius, 2003; Arvan et al., 2002), in high-fidelity targeting of membrane proteins mediated by signal recognition particles (Rao et al., 2016; Chio et al., 2017), as well as in selective glycosylation reactions in the Golgi apparatus (Jaiman and Thattai, 2020). Lastly, considering the recent advances in generating synthetic morphogen patterns in multicellular organisms (Toda et al., 2020; Stapornwongkul et al., 2020), spatial proofreading could also be employed in pathways acting on engineered protein gradients. Experimental investigations of these processes in light of our work will reveal the extent to which spatial transport promotes specificity.
In conclusion, we have analyzed the role played by spatial structures in endowing enzymatic reactions with kinetic proofreading. Simply by spatially segregating substrate binding from catalysis, enzymes can enhance their specificity. This suggests that enzymatic reactions may acquire de novo proofreading capabilities by coupling to preexisting spatial gradients in the cell.
Materials and methods
Detailed derivations of the analytical results presented in the main text along with additional studies on our model are included in the Appendices. In addition, Python scripts and Jupyter notebooks used to generate all the plots in the main text and Appendices are included as Supplementary files.
Appendix 1
Analytical calculations of the complex density profile and fidelity
We begin this section by deriving an analytical expression for the density profile of substrate-bound enzymes (${\rho}_{{}_{\text{ES}}}(x)$) in the case where the ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$ assumption holds. Based on this result, we then obtain expressions for fidelity in low, high, and intermediate substrate localization regimes. We reserve the studies of speed and fidelity in the general case of a nonuniform free enzyme profile to Appendix 5.
1. Derivation of the complex density profile ${\rho}_{{}_{\text{ES}}}(x)$
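The ordinary differential equation (ODE) that defines the steady state profile of substrate-bound enzymes is
$$D\frac{{d}^{2}{\rho}_{{}_{\text{ES}}}}{d{x}^{2}}-{k}_{\text{off}}^{\text{S}}{\rho}_{{}_{\text{ES}}}(x)+{k}_{\text{on}}{\rho}_{{}_{\text{S}}}(0){e}^{-x/{\lambda}_{{}_{\text{S}}}}{\rho}_{{}_{\text{E}}}(x)=0.\tag{S1}$$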
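Here, ${\rho}_{{}_{\text{S}}}(0)$ is the substrate density at the leftmost boundary, whose value can be calculated from the condition that the total number of free substrates is ${S}_{\text{total}}$, namely,
$${\int}_{0}^{L}{\rho}_{{}_{\text{S}}}(0){e}^{-x/{\lambda}_{{}_{\text{S}}}}dx={S}_{\text{total}},\tag{S2}$$
$${\rho}_{{}_{\text{S}}}(0)=\frac{{S}_{\text{total}}}{{\lambda}_{{}_{\text{S}}}\left(1-{e}^{-L/{\lambda}_{{}_{\text{S}}}}\right)}.\tag{S3}$$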
In the limit of low substrate amounts where the approximation ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$ is valid, Equation S1 represents a linear nonhomogeneous ODE. Hence, its solution can be written as
where ${\rho}_{{}_{\text{ES}}}^{\text{(h)}}(x)$ is the general solution to the corresponding homogeneous equation, while ${\rho}_{{}_{\text{ES}}}^{\text{(p)}}(x)$ is a particular solution.
Looking for solutions of the form $C{e}^{x/\lambda}$ for the homogeneous part, we find
The two possible roots for $\lambda $ are $\pm \sqrt{D/{k}_{\text{off}}^{\text{S}}}$. Calling the positive root ${\lambda}_{{}_{\text{ES}}}$, which represents the mean distance traveled by the substrate–bound enzyme before releasing the substrate, we can write the general solution to the homogeneous part of Equation S1 as
where ${C}_{1}$ and ${C}_{2}$ are constants which will be determined from the boundary conditions.
Since the nonhomogeneous part of Equation S1 is a scaled exponential, we look for a particular solution of the same functional form, namely, ${\rho}_{{}_{\text{ES}}}^{\text{(p)}}(x)={C}_{\text{p}}{e}^{-x/{\lambda}_{{}_{\text{S}}}}$. Substituting this form into the ODE, we obtain
The constant coefficient ${C}_{\text{p}}$ can then be found as
where we have used the equality ${\lambda}_{{}_{\text{ES}}}=\sqrt{D/{k}_{\text{off}}^{\text{S}}}$.
Now, to find the unknown coefficients ${C}_{1}$ and ${C}_{2}$, we impose the no-flux boundary conditions for the density ${\rho}_{{}_{\text{ES}}}(x)$ at the left and right boundaries of the compartment, namely,
Note that, to simplify our calculations, we did not take into account the product formation flux at the rightmost boundary when writing Equation S10. This is justified in the limit of slow catalysis – an assumption that we make in our treatment. The above system of two equations can then be solved for ${C}_{1}$ and ${C}_{2}$, yielding
With the constant coefficients known, we obtain the general solution for the complex profile as
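As an illustrative check of this derivation, the following minimal Python sketch assembles the profile from the homogeneous and particular parts and solves the two no-flux conditions for the constants. The function name `complex_profile`, its argument names, and the parameter values are assumptions made for illustration and are not taken from the Supplementary scripts. The sketch also assumes ${\lambda}_{{}_{\text{S}}}\ne {\lambda}_{{}_{\text{ES}}}$, so that the particular solution keeps its exponential form.

```python
import math

def complex_profile(x, D, k_off, k_on, rho_E, S_total, lam_S, L):
    """Steady-state complex density rho_ES(x), assuming constant rho_E.

    Illustrative assumptions: rho_S(x) = rho_S(0) * exp(-x / lam_S),
    no-flux boundaries at x = 0 and x = L, and lam_S != lam_ES.
    """
    lam_ES = math.sqrt(D / k_off)  # decay length of the complex
    rho_S0 = S_total / (lam_S * (1.0 - math.exp(-L / lam_S)))
    # Particular solution C_p * exp(-x / lam_S)
    C_p = k_on * rho_E * rho_S0 / (k_off - D / lam_S**2)
    # No-flux conditions reduce to: C1 - C2 = b0 and C1*e^a - C2*e^-a = bL
    a = L / lam_ES
    b0 = C_p * lam_ES / lam_S
    bL = b0 * math.exp(-L / lam_S)
    C2 = (bL - b0 * math.exp(a)) / (2.0 * math.sinh(a))
    C1 = C2 + b0
    return (C1 * math.exp(x / lam_ES)
            + C2 * math.exp(-x / lam_ES)
            + C_p * math.exp(-x / lam_S))

# Hypothetical parameter values in arbitrary units
params = dict(D=1.0, k_off=4.0, k_on=1.0, rho_E=1.0, S_total=1.0, lam_S=0.2, L=1.0)
print(complex_profile(0.0, **params), complex_profile(1.0, **params))
```

Integrating the ODE over the compartment with no-flux boundaries shows that the total amount of complex should equal ${k}_{\text{on}}{\rho}_{{}_{\text{E}}}{S}_{\text{total}}/{k}_{\text{off}}^{\text{S}}$, which gives a simple sanity check on any implementation.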
2. Density profile in low and high substrate localization regimes
If substrate localization is very poor (${\lambda}_{{}_{\text{S}}}\gg L$), the substrate distribution will be uniform (${\rho}_{{}_{\text{S}}}(x)={\overline{\rho}}_{{}_{\text{S}}}={S}_{\text{total}}/L$), resulting in a similarly flat profile of enzyme–substrate complexes with their density ${\rho}_{{}_{\text{ES}}}^{\mathrm{\infty}}$ given by
This is the expected equilibrium result where the complex concentration is inversely proportional to the dissociation constant (${k}_{\text{off}}^{\text{S}}/{k}_{\text{on}}$).
In the opposite limit where the substrates are highly localized (${\lambda}_{{}_{\text{S}}}\ll {\lambda}_{{}_{\text{ES}}},L$ and ${\rho}_{{}_{\text{S}}}(0)\approx {S}_{\text{total}}/{\lambda}_{{}_{\text{S}}}$ from Equation S3), the complex density profile simplifies into
The $x$-dependence through the $\mathrm{cosh}(\cdot )$ function suggests that the complex density is the highest at the leftmost boundary and lowest at the rightmost boundary, with the degree of complex localization dictated by the length scale parameter ${\lambda}_{{}_{\text{ES}}}$. Notably, this localization of complexes does not alter their total number, since the average complex density is conserved, that is,
Equation S15 for the complex profile can be alternatively written in terms of the diffusion time scale ${\tau}_{D}={L}^{2}/D$ and the substrate off-rate ${k}_{\text{off}}^{\text{S}}$. Noting that $L/{\lambda}_{{}_{\text{ES}}}=\sqrt{{L}^{2}{k}_{\text{off}}^{\text{S}}/D}=\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}}$ and introducing a dimensionless coordinate $\stackrel{~}{x}=x/L$, we find
The above equation is what was used for generating the plots in Figure 3b of the main text for different choices of the diffusion time scale.
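For readers who want to reproduce such profiles, here is a minimal Python sketch of the ideal localization limit. The closed form used in it is a reconstruction: it is obtained by solving the homogeneous equation with all binding concentrated at $x=0$ and a no-flux condition at $x=L$, and it is assumed (not confirmed) to match the expression referred to above. The name `ideal_profile` and the default `rho_inf=1.0` are hypothetical.

```python
import math

def ideal_profile(x_tilde, tau_D, k_off, rho_inf=1.0):
    """Complex density at x_tilde = x / L for ideally localized substrates.

    Reconstructed form (an assumption of this sketch):
    rho_inf * a * cosh(a * (1 - x_tilde)) / sinh(a), with a = sqrt(tau_D * k_off).
    """
    a = math.sqrt(tau_D * k_off)
    return rho_inf * a * math.cosh(a * (1.0 - x_tilde)) / math.sinh(a)

# Slower diffusion (larger tau_D) gives a steeper profile
for tau_D in (0.1, 1.0, 10.0):
    print(tau_D, ideal_profile(0.0, tau_D, 1.0), ideal_profile(1.0, tau_D, 1.0))
```

The spatial average of this profile equals `rho_inf` for any `tau_D`, consistent with the statement that localization does not change the total number of complexes.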
3. Fidelity in low and high substrate localization regimes
Let us now evaluate the fidelity of the model in the two limiting regimes discussed earlier. In the poor substrate localization case, which corresponds to an equilibrium setting, the fidelity can be found from Equation S14 as
where we have employed the assumption about the right and wrong substrates having identical density profiles. This is the expected result for equilibrium discrimination where no advantage is taken of the system’s spatial structure.
In the regime with high substrate localization, the enzyme–substrate complexes have a nonuniform spatial distribution. What matters for product formation is the complex density at the rightmost boundary ($\stackrel{~}{x}=1$), which we obtain from Equation S17 as
Substituting the above expression written for right and wrong complexes into the definition of fidelity, we find
This is the result reported in Equation 5 of the main text. To gain more intuition about it and draw parallels with traditional kinetic proofreading, let us consider the limit of long diffusion time scales where proofreading is the most effective. In this limit, the hyperbolic sine functions above can be approximated as $\mathrm{sinh}(\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}})\approx 0.5\phantom{\rule{thinmathspace}{0ex}}{e}^{\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}}}$, simplifying the fidelity expression into
where we have used the definition of equilibrium fidelity (Equation S18). In traditional proofreading, a scheme with n proofreading realizations can yield a maximum fidelity of $\eta /{\eta}_{\text{eq}}={\eta}_{\text{eq}}^{n}$. The value of n for the original Hopfield model, for instance, is 1. It would be informative to also know the effective parameter n for the spatial proofreading model. Dividing Equation S21 by ${\eta}_{\text{eq}}$, we find
This exact result can be simplified into an approximate form when diffusion is slow and ${\eta}_{\text{eq}}\gg 1$, yielding the expression reported in Equation 7 of the main text, namely,
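The fidelity and the effective number of proofreading steps in the ideal localization regime can be explored numerically with the short Python sketch below. It assumes the right-boundary density is proportional to $\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}}/({k}_{\text{off}}^{\text{S}}\mathrm{sinh}\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}})$, which is a reconstruction from the limits described in this section rather than a quoted formula. The function names are hypothetical, and `n_asymptotic` simply applies the $\mathrm{sinh}(z)\approx 0.5\,{e}^{z}$ approximation mentioned above.

```python
import math

def fidelity_ideal(tau_D, k_off_R, k_off_W):
    """Fidelity for ideally localized substrates (reconstructed form)."""
    eta_eq = k_off_W / k_off_R
    a_R = math.sqrt(tau_D * k_off_R)
    a_W = math.sqrt(tau_D * k_off_W)
    return math.sqrt(eta_eq) * math.sinh(a_W) / math.sinh(a_R)

def n_effective(tau_D, k_off_R, k_off_W):
    """Solve eta / eta_eq = eta_eq ** n for n."""
    eta_eq = k_off_W / k_off_R
    return math.log(fidelity_ideal(tau_D, k_off_R, k_off_W) / eta_eq) / math.log(eta_eq)

def n_asymptotic(tau_D, k_off_R, k_off_W):
    """Long-tau_D limit using sinh(z) ~ exp(z) / 2."""
    eta_eq = k_off_W / k_off_R
    a_R = math.sqrt(tau_D * k_off_R)
    return (math.sqrt(eta_eq) - 1.0) * a_R / math.log(eta_eq) - 0.5

# Hypothetical off-rates with eta_eq = 100
for tau_D in (0.01, 1.0, 100.0):
    print(tau_D, n_effective(tau_D, 1.0, 100.0), n_asymptotic(tau_D, 1.0, 100.0))
```

In the fast diffusion limit the sketch returns the equilibrium fidelity (so $n\approx 0$), and the two estimates of $n$ agree once diffusion is slow.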
4. Fidelity in an intermediate substrate localization regime
The generic expression for complex density at the rightmost boundary ($x=L$) can be written using Equation S13 as
For the system to proofread, substrates need to be sufficiently localized (${\lambda}_{{}_{\text{S}}}<L$) and diffusion needs to be sufficiently slow (${\tau}_{D}{k}_{\text{off}}^{\text{S}}>1$ or, ${\lambda}_{{}_{\text{ES}}}<L$). Under these conditions, the substrate profile can be approximated using Equation S3 as ${\rho}_{{}_{\text{S}}}(x)\approx {\lambda}_{{}_{\text{S}}}^{-1}{S}_{\text{total}}{e}^{-x/{\lambda}_{{}_{\text{S}}}}$, while the hyperbolic sine and cosine functions used above can be approximated as $\mathrm{sinh}(L/{\lambda}_{{}_{\text{ES}}})\approx \mathrm{cosh}(L/{\lambda}_{{}_{\text{ES}}})\approx 0.5{e}^{L/{\lambda}_{{}_{\text{ES}}}}$. With these approximations, the complex density expression simplifies into
Now, depending on how ${\lambda}_{{}_{\text{S}}}$ compares with ${\lambda}_{{}_{\text{ES}}}$, there can be two qualitatively different regimes for the complex density, namely,
where we used the equilibrium complex density ${\rho}_{{}_{\text{ES}}}^{\mathrm{\infty}}$ defined in Equation S14.
Notably, the first regime effectively corresponds to the case of ideal substrate localization where complex density is independent of the precise value of ${\lambda}_{{}_{\text{S}}}$. The dimensionless number $\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}}$ sets the scale for the minimum $L/{\lambda}_{{}_{\text{S}}}$ value beyond which ideal localization can be assumed. Conversely, the second regime corresponds to the case where the distance traveled by a complex before dissociating is so short that the complex profile is dictated by the substrate profile itself. Because of that, the complex density reduction from its equilibrium limit is independent of the precise values of ${\tau}_{D}$ and ${k}_{\text{off}}^{\text{S}}$, as long as the condition ${\lambda}_{{}_{\text{ES}}}\ll {\lambda}_{{}_{\text{S}}}$ is met.
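The two regimes can be checked numerically with the minimal Python sketch below. The closed form in `boundary_density_ratio` is derived here from the ODE with an exponential substrate profile and no-flux boundaries, and the two limiting expressions are the leading-order forms that follow from it; all function names are hypothetical, and the exact prefactors are assumptions of this sketch rather than quotations of the equations above.

```python
import math

def boundary_density_ratio(s, a):
    """rho_ES(L) / rho_ES^inf, with s = L / lam_S and a = L / lam_ES.

    Requires s != a. Note that math.sinh overflows for a above roughly 700.
    """
    pref = s / (1.0 - math.exp(-s)) * a**2 / (a**2 - s**2)
    bracket = (s / a) * (math.exp(-s) / math.tanh(a) - 1.0 / math.sinh(a)) + math.exp(-s)
    return pref * bracket

def regime_ideal(a):
    """lam_S << lam_ES: density set by complex diffusion only."""
    return 2.0 * a * math.exp(-a)

def regime_substrate(s):
    """lam_ES << lam_S: density follows the substrate profile."""
    return s * math.exp(-s)

print(boundary_density_ratio(100.0, 5.0), regime_ideal(5.0))
print(boundary_density_ratio(10.0, 500.0), regime_substrate(10.0))
```

In the first case the result does not depend on `s`, and in the second it does not depend on `a`, mirroring the two statements in the paragraph above.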
The scheme yields its highest fidelity when both right and wrong complex densities are in the first regime (ideal localization). When both densities are in the second regime, fidelity is reduced down to its equilibrium value ${\eta}_{\text{eq}}$ (Appendix 1—table 1). The transition between these two extremes happens when the density profiles of right and wrong complexes fall under different regimes. Fidelity can be navigated in the transition zone by tuning the substrate gradient length scale ${\lambda}_{{}_{\text{S}}}$. This is demonstrated in Appendix 1—figure 1 for three different choices of ${\eta}_{\text{eq}}$. In all three cases, the dimensionless numbers $\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{R}}}$ and $\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{W}}}$ set the approximate range in which the bulk of fidelity enhancement occurs, as stated in the main text.
5. Optimal diffusion time scale for maximum fidelity
Figure 3c of the main text illustrated the nonmonotonic dependence of fidelity on the diffusion time scale ${\tau}_{D}$ for different fixed values of ${\lambda}_{{}_{\text{S}}}$. Here, we further explore this feature by asking what sets the optimal ${\tau}_{D}$. To gain analytical insights, we focus on the case where the system can proofread, which, as we argued in the previous section, happens when ${\lambda}_{{}_{\text{S}}},{\lambda}_{{}_{\text{ES}}}<L$. Under this condition, we identified two qualitatively different regimes of complex density reduction (Equation S26). Namely, we found that for sufficiently fast diffusion the system acted as if the substrates were localized ideally, whereas for sufficiently slow diffusion the complex density reduction was dictated solely by ${\lambda}_{{}_{\text{S}}}$ and did not discriminate between the two substrate kinds. These two limiting behaviors are indeed reflected in Figure 3c where in the low ${\tau}_{D}$ limit (fast diffusion) the family of curves matches the dotted ideal localization curve, while in the high ${\tau}_{D}$ limit (slow diffusion) all curves decay to 1, corresponding to the loss of error correction.
An intuitive approach for identifying the optimal ${\tau}_{D}$ is to slow down diffusion up to the point where the density of wrong complexes at $x=L$ approaches a plateau and effectively stops decreasing. Going past this threshold would only reduce the density of right complexes at $x=L$ and thereby, reduce the fidelity. We know from Equation S26 that plateauing for wrong complexes happens when ${\lambda}_{{}_{\text{EW}}}\ll {\lambda}_{{}_{\text{S}}}$ (equivalently, $\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{W}}}\gg L/{\lambda}_{{}_{\text{S}}}$). Hence, our first guess for the optimal diffusion time scale ${\tau}_{D}^{*}$ is
To test the soundness of this expression, we compared its predictions to the optimal ${\tau}_{D}$ values in Figure 3 that were identified numerically for different choices of ${\lambda}_{{}_{\text{S}}}$. The results of the comparison are shown in Appendix 1—figure 2. As can be seen, for sufficiently high degrees of substrate localization ($L/{\lambda}_{{}_{\text{S}}}$), the prediction of Equation S29 provides a good approximation of the true optimum. However, it is apparent that the prediction consistently underestimates the true ${\tau}_{D}^{*}$, which was expected since plateauing of ${\rho}_{{}_{\text{EW}}}(L)$ happens not under equality but a strict inequality condition (i.e. $\sqrt{{\tau}_{D}^{*}{k}_{\text{off}}^{\text{W}}}\gg L/{\lambda}_{{}_{\text{S}}}$). Because an exact analytical expression for ${\tau}_{D}^{*}$ is not available, we performed different approximations to the fidelity formula and found an empirical correction term for our earlier estimate given by $2(L/{\lambda}_{{}_{\text{S}}})/\sqrt{{\eta}_{\text{eq}}}$. The prediction for ${\tau}_{D}^{*}$ with the correction term is now accurate starting from a much lower value of $L/{\lambda}_{{}_{\text{S}}}$, corresponding to a regime where the system proofreads once (${n}_{\text{eff}}\approx 1$). Overall, these analytical results provide good initial guesses for ${\tau}_{D}^{*}$, which should be refined using a numerical approach for higher accuracy.
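A numerical refinement of this kind can be sketched in a few lines of Python. The sketch below is not the authors' script: it scans ${\tau}_{D}$ on a logarithmic grid, uses a closed form for ${\rho}_{{}_{\text{ES}}}(L)$ derived here under the constant free-enzyme assumption, and compares the numerical optimum with the first guess obtained by turning the plateau condition into an equality, $\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{W}}}=L/{\lambda}_{{}_{\text{S}}}$. All names and parameter values are hypothetical, and the empirical correction term is deliberately left out.

```python
import math

def boundary_density_ratio(s, a):
    """rho_ES(L) / rho_ES^inf, with s = L / lam_S and a = L / lam_ES."""
    if abs(a - s) < 1e-9 * s:
        a = s * (1.0 + 1e-6)  # step off the removable singularity at a == s
    pref = s / (1.0 - math.exp(-s)) * a**2 / (a**2 - s**2)
    bracket = (s / a) * (math.exp(-s) / math.tanh(a) - 1.0 / math.sinh(a)) + math.exp(-s)
    return pref * bracket

def fidelity(tau_D, k_off_R, k_off_W, s):
    """Ratio of right to wrong complex densities at x = L."""
    a_R = math.sqrt(tau_D * k_off_R)
    a_W = math.sqrt(tau_D * k_off_W)
    eta_eq = k_off_W / k_off_R
    return eta_eq * boundary_density_ratio(s, a_R) / boundary_density_ratio(s, a_W)

# Hypothetical parameters: eta_eq = 100 and L / lam_S = 10
k_off_R, k_off_W, s = 1.0, 100.0, 10.0
taus = [10 ** (-2 + 4 * i / 400) for i in range(401)]  # 0.01 to 100
tau_best = max(taus, key=lambda t: fidelity(t, k_off_R, k_off_W, s))
tau_guess = s**2 / k_off_W
print("first guess:", tau_guess, "numerical optimum:", tau_best)
```

For these parameters the scan places the optimum above the first guess, in line with the underestimate discussed above; a finer grid or a bounded scalar optimizer could replace the scan.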
Appendix 2
Energetics of the scheme
We start this section by deriving an analytical expression for the minimum dissipated power, which was used in making Figure 4 of the main text. Then, we calculate the upper limit on fidelity enhancement available to our model for a finite substrate gradient length scale and compare this limit with the fundamental thermodynamic bound. We end the section by providing an estimate for the baseline cost of setting up gradients and compare this cost with the maintenance cost reported in the main text. Similar to our treatment of Appendix 1, here too our calculations are based on the ${\rho}_{{}_{\text{E}}}\approx \text{constant}$ assumption to allow for intuitive analytical results.
1. Derivation of the minimum dissipated power
As stated in the main text, we calculate the minimum rate of energy dissipation necessary for maintaining the substrate profiles as
where ${j}_{{}_{\text{S}}}(x)={k}_{\text{on}}{\rho}_{{}_{\text{S}}}(x){\rho}_{{}_{\text{E}}}-{k}_{\text{off}}^{\text{S}}{\rho}_{{}_{\text{ES}}}(x)$ is the net local substrate binding flux and $\mu (x)=\mu (0)+{k}_{\text{B}}T\mathrm{ln}\left({\rho}_{{}_{\text{S}}}(x)/{\rho}_{{}_{\text{S}}}(0)\right)=\mu (0)-{k}_{\text{B}}T\cdot x/{\lambda}_{{}_{\text{S}}}$ is the local chemical potential.
Our choice for the expression of power at steady state is motivated by the fact that the enzyme transport is passive and, therefore, energy needs to be spent only on counteracting the local binding/unbinding events that tend to homogenize the substrate profile. To demonstrate the validity of our proposed expression more formally, we invoke the standard approaches for calculating power (Hill, 1977; Zhang et al., 2012). In particular, for a system that is described through discrete states with transition rates ${k}_{i\to j}$ between them, the rate of energy dissipation at steady state is given by
where ${J}_{i\to j}$ is the flux from state i into state j. We note here that a similar expression for the rate of total entropy production involves a $\mathrm{ln}({J}_{i\to j}/{J}_{j\to i})$ term (statistical forces) instead of the $\mathrm{ln}({k}_{i\to j}/{k}_{j\to i})$ term (deterministic driving forces). At steady state, however, these two expressions are mathematically equivalent (Zhang et al., 2012). Our choice for Equation S31 stems from the better physical intuition that it provides in our context.
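The steady-state equivalence of the two expressions can be illustrated on a toy example. The Python sketch below uses a generic, fully connected three-state system with made-up rates; it is not the spatial lattice considered in this appendix, and the function names are hypothetical. The steady state is obtained from the standard spanning-tree construction for three states, and the power is reported in units of ${k}_{\text{B}}T$ per unit time.

```python
import math

def steady_state_3(k):
    """Steady-state probabilities; k[i][j] is the rate of the i -> j transition."""
    w0 = k[1][0] * k[2][0] + k[1][2] * k[2][0] + k[2][1] * k[1][0]
    w1 = k[0][1] * k[2][1] + k[0][2] * k[2][1] + k[2][0] * k[0][1]
    w2 = k[0][2] * k[1][2] + k[0][1] * k[1][2] + k[1][0] * k[0][2]
    Z = w0 + w1 + w2
    return [w0 / Z, w1 / Z, w2 / Z]

def dissipation(k, p, use_flux_ratio=False):
    """Sum over state pairs of (net flux) * (log-ratio driving term)."""
    total = 0.0
    for i in range(3):
        for j in range(i + 1, 3):
            J_ij = p[i] * k[i][j]
            J_ji = p[j] * k[j][i]
            if use_flux_ratio:
                force = math.log(J_ij / J_ji)        # ln(J_ij / J_ji) form
            else:
                force = math.log(k[i][j] / k[j][i])  # ln(k_ij / k_ji) form
            total += (J_ij - J_ji) * force
    return total

# Hypothetical rates that break detailed balance (cycle products 24 vs 1)
rates = [[0.0, 2.0, 1.0],
         [1.0, 0.0, 3.0],
         [4.0, 1.0, 0.0]]
p_ss = steady_state_3(rates)
print(dissipation(rates, p_ss), dissipation(rates, p_ss, use_flux_ratio=True))
```

The two printed values coincide because their difference is a sum of net fluxes weighted by state functions, which vanishes when the net flux into every state is zero.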
So far, the description of our system has been in terms of continuous density functions. To apply Equation S31 for calculating power, we consider the discrete-state representation of enzyme dynamics shown in Appendix 2—figure 1. There, space is discretized into intervals of size $\delta x$ and diffusion is represented through jumps between neighboring sites with a rate $D/\delta {x}^{2}$. What keeps the system out of equilibrium is the spatially varying substrate profile ${\rho}_{{}_{\text{S}}}(x)$.
Because forward and backward diffusive transitions have identical rates, according to Equation S31 they will not contribute to energy dissipation (since $\mathrm{ln}(1)=0$). The contribution from the remaining substrate binding/unbinding events can then be written as
where $\delta {n}_{i}^{\text{E}}={\rho}_{{}_{\text{E}}}\delta x$ and $\delta {n}_{i}^{\text{ES}}={\rho}_{{}_{\text{ES}}}({x}_{i})\delta x$ are the numbers of free and substrate–bound enzymes, respectively, in the $[{x}_{i},{x}_{i}+\delta x]$ interval. In the limit of a large number of discrete spatial intervals, the sum over i in Equation S32 can be rewritten as an integral over the coordinate x, namely,
Comparing the form of Equation S33 to that of Equation S30 (with $\mu (x)$ substituted), one can notice a difference in the terms that multiply ${j}_{{}_{\text{S}}}(x)$. Specifically, in Equation S30 we have $\mu (x)=\mu (0)-{k}_{\text{B}}T\mathrm{ln}{\rho}_{{}_{\text{S}}}(0)+{k}_{\text{B}}T\mathrm{ln}{\rho}_{{}_{\text{S}}}(x)$ while the corresponding term in Equation S33 is ${k}_{\text{B}}T\mathrm{ln}({k}_{\text{on}}/{k}_{\text{off}}^{\text{S}})+{k}_{\text{B}}T\mathrm{ln}{\rho}_{{}_{\text{S}}}(x)$. The difference between them, however, is in the parts that do not depend on $x$, while the spatially varying parts (namely, the ${k}_{\text{B}}T\mathrm{ln}{\rho}_{{}_{\text{S}}}(x)$ contributions) are identical. Now, since the number of bound complexes is constant at steady state, we have ${\int}_{0}^{L}{j}_{{}_{\text{S}}}(x)dx=0$. This means that the $x$-independent parts discussed earlier all integrate to zero, making the power estimates by Equation S30 and Equation S33 identical, thereby justifying our proposed expression.
To estimate power, we substitute the analytical expression for ${\rho}_{{}_{\text{ES}}}(x)$ found earlier (Equation S13) into ${j}_{{}_{\text{S}}}(x)$ and, after performing a somewhat cumbersome integral, obtain
where ${\beta}^{-1}={k}_{\text{B}}T$, and ${J}_{\text{bind}}={k}_{\text{on}}{S}_{\text{total}}{\rho}_{{}_{\text{E}}}$ is the net binding rate of each substrate. Figure 4 in the main text was made using this expression for power.
To get additional insights about this result, let us consider the case where substrates are highly localized (${\lambda}_{{}_{\text{S}}}\ll L$) and diffusion is slow (${\lambda}_{{}_{\text{ES}}}\ll L$) – conditions needed for effective proofreading. Under these conditions, the hyperbolic tangent terms become 1 and the expression for the power expenditure simplifies into
The monotonic increase of power with ${\lambda}_{{}_{\text{ES}}}$ suggests that energy is primarily spent on maintaining the concentration gradient of right substrates. This is not surprising, since typically right complexes travel a much greater distance into the low concentration region of the compartment before releasing the bound substrate (i.e. ${\lambda}_{{}_{\text{ER}}}\gg {\lambda}_{{}_{\text{EW}}}$). Therefore, neglecting the contribution from wrong substrates and considering the range of ${\lambda}_{{}_{\text{S}}}$ values where the bulk of power–fidelity tradeoff takes place (${\lambda}_{{}_{\text{ER}}}>{\lambda}_{{}_{\text{S}}}>{\lambda}_{{}_{\text{EW}}}$), we further simplify the power expression into
where we used the identities $\beta \mathrm{\Delta}\mu =L/{\lambda}_{{}_{\text{S}}}$ and ${\lambda}_{{}_{\text{ER}}}=L/\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{R}}}$. This simple linear relation suggests that in order to maintain the exponential substrate profile, the minimum energy spent per substrate binding event should be at least $P/{J}_{\text{bind}}\approx {k}_{\text{B}}T\cdot {\lambda}_{{}_{\text{ER}}}/{\lambda}_{{}_{\text{S}}}>1{k}_{\text{B}}T$ (since ${\lambda}_{{}_{\text{ER}}}>{\lambda}_{{}_{\text{S}}}$).
We can also use Equation S36 to estimate the minimum dissipation per substrate binding event at ${\lambda}_{{}_{\text{S}}}\approx {\lambda}_{{}_{\text{EW}}}$ where the logarithmic power–fidelity scaling regime ends (see Figure 4 of the main text). Substituting the value of ${\lambda}_{{}_{\text{S}}}$, we obtain $\beta P/{J}_{\text{bind}}\approx ({\lambda}_{{}_{\text{ER}}}/{\lambda}_{{}_{\text{EW}}})=\sqrt{{\eta}_{\text{eq}}}$, which is the result illustrated in Figure 4.
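The last equality follows directly from the definitions of the two decay lengths. As a short worked step, assuming that right and wrong complexes share the same diffusion constant $D$ and that the equilibrium fidelity is the ratio of off-rates, ${\eta}_{\text{eq}}={k}_{\text{off}}^{\text{W}}/{k}_{\text{off}}^{\text{R}}$:

$$\frac{{\lambda}_{{}_{\text{ER}}}}{{\lambda}_{{}_{\text{EW}}}}=\frac{\sqrt{D/{k}_{\text{off}}^{\text{R}}}}{\sqrt{D/{k}_{\text{off}}^{\text{W}}}}=\sqrt{\frac{{k}_{\text{off}}^{\text{W}}}{{k}_{\text{off}}^{\text{R}}}}=\sqrt{{\eta}_{\text{eq}}}.$$

For instance, with an illustrative value of ${\eta}_{\text{eq}}=100$, this estimate corresponds to roughly $10\,{k}_{\text{B}}T$ dissipated per substrate binding event at the end of the logarithmic scaling regime.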
2. Limits on fidelity enhancement
The error reduction capacity of the spatial proofreading scheme improves with a greater difference in substrate off-rates, as was demonstrated in Figure 2 of the main text. At the same time, Figure 3c showed that the finite length scale of substrate localization (or, finite driving force) sets an upper limit on fidelity enhancement for substrates with fixed off-rates. It is therefore of interest to consider these two features together to find the absolute limit on fidelity enhancement available to our model and then compare it with the fundamental bound set by thermodynamics.
Intuitively, fidelity will be enhanced the most if the density of right complexes does not decay across the compartment, while that of wrong complexes decays maximally. The first condition can be met if diffusion is fast or if the unbinding rate of right substrates is low, in which case we have
where ${\rho}_{{}_{\text{ER}}}^{\mathrm{\infty}}$ is the equilibrium density of right complexes. Conversely, when the unbinding rate of wrong substrates is very large, the density of wrong complexes is maximally reduced at the rightmost boundary and can be obtained from Equation S24 by taking the ${\lambda}_{{}_{\text{ES}}}\to 0$ limit, namely,
Here, ${\rho}_{{}_{\text{EW}}}^{\mathrm{\infty}}$ is the equilibrium density of wrong complexes, and $\beta \mathrm{\Delta}\mu =L/{\lambda}_{{}_{\text{S}}}$ is the effective driving force of the scheme. Taking the ratio of Equations S37 and S38, we obtain the largest fidelity enhancement of the scheme for the given driving force, namely,
When $\beta \mathrm{\Delta}\mu \gtrsim 1$ (or, ${\lambda}_{{}_{\text{S}}}\lesssim L$), the limit above simplifies further into
Now, thermodynamics imposes an upper bound on fidelity enhancement by any proofreading scheme operating with a finite chemical potential $\mathrm{\Delta}\mu $. This bound is equal to ${e}^{\beta \mathrm{\Delta}\mu}$ and is reached when the entire chemical potential is used to increase the free energy difference between right and wrong substrates (Qian, 2006). Comparing it with the result in Equation S41, we can see that fidelity enhancement in the spatial proofreading model has the same exponential scaling term, but with an additional linear factor. Since the dominant contribution comes from the exponential term (as captured also in Appendix 2—figure 2), we can claim that our proposed model can operate very close to the fundamental thermodynamic limit.
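The comparison with the thermodynamic bound can be made concrete with the small Python sketch below. The expression in `max_enhancement` is a reconstruction: it is what one obtains here by dividing a flat right-complex profile by the ${\lambda}_{{}_{\text{ES}}}\to 0$ limit of the wrong-complex density for an exponential substrate profile, and it is assumed to correspond to the limit discussed above. The function names are hypothetical.

```python
import math

def max_enhancement(beta_dmu):
    """Reconstructed upper limit on eta / eta_eq for driving force beta_dmu = L / lam_S."""
    return math.expm1(beta_dmu) / beta_dmu  # (e^x - 1) / x, close to e^x / x for x >~ 1

def thermodynamic_bound(beta_dmu):
    """Bound exp(beta * delta_mu) on fidelity enhancement (Qian, 2006)."""
    return math.exp(beta_dmu)

for beta_dmu in (0.1, 1.0, 5.0, 10.0, 20.0):
    ratio = max_enhancement(beta_dmu) / thermodynamic_bound(beta_dmu)
    print(beta_dmu, max_enhancement(beta_dmu), thermodynamic_bound(beta_dmu), ratio)
```

The gap between the two grows only linearly with the driving force, while both quantities grow exponentially, which is the sense in which the scheme operates close to the bound on a logarithmic scale.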
3. Energetic cost to set up a concentration gradient
Earlier in the section, we calculated the rate at which energy needs to be dissipated to counteract the homogenizing effect that enzyme activity has on the substrate gradient. In addition to this cost, however, there is also a baseline cost for setting up a gradient in the absence of any enzyme. Here, we calculate this cost in the case where the gradient formation mechanism needs to work against diffusion that tends to flatten the substrate profile.
As before, we consider an exponentially decaying substrate gradient with a decay length scale ${\lambda}_{{}_{\text{S}}}$ and a total number of substrates ${S}_{\text{total}}$. We write the minimum power ${P}_{D}$ required for counteracting the diffusion of substrates as
where ${J}_{D}=-{D}_{{}_{\text{S}}}\nabla {\rho}_{{}_{\text{S}}}(x)$ is the diffusive flux, with ${D}_{{}_{\text{S}}}$ being the substrate diffusion constant. The rationale for writing this form is that diffusion moves substrates from a higher chemical potential region into a neighboring lower chemical potential region. The gradient maintaining mechanism would need to spend at least this chemical potential difference ($\delta \mu ={\mu}^{\prime}(x)\delta x$) per each substrate diffusing a distance $\delta x$ down the chemical potential gradient. Adding up the contribution from all local neighborhoods with a local diffusive flux ${J}_{D}(x)$ results in Equation S42.
Now, substituting ${\rho}_{{}_{\text{S}}}(x)\sim {e}^{-x/{\lambda}_{{}_{\text{S}}}}$ for the substrate profile and $\mu (x)=\mu (0)+{k}_{\text{B}}T\mathrm{ln}\left({\rho}_{{}_{\text{S}}}(x)/{\rho}_{{}_{\text{S}}}(0)\right)$ for the chemical potential, we obtain
where in the third step we used the relation ${\rho}_{{}_{\text{S}}}^{\prime}(x)=-{\rho}_{{}_{\text{S}}}(x)/{\lambda}_{{}_{\text{S}}}$. This suggests that the minimum dissipated power required for setting up an exponential gradient increases quadratically with decreasing localization length scale ${\lambda}_{{}_{\text{S}}}$.
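The quadratic scaling can be seen in a short worked calculation. The chain below is a reconstruction under the assumption that the power is written as the integral of the diffusive flux against the chemical potential drop, ${P}_{D}=-{\int}_{0}^{L}{J}_{D}(x){\mu}^{\prime}(x)dx$; the sign convention and the integration limits are assumptions of this sketch.

$${P}_{D}=-{\int}_{0}^{L}{J}_{D}(x){\mu}^{\prime}(x)dx={D}_{{}_{\text{S}}}{k}_{\text{B}}T{\int}_{0}^{L}\frac{{\left({\rho}_{{}_{\text{S}}}^{\prime}(x)\right)}^{2}}{{\rho}_{{}_{\text{S}}}(x)}dx=\frac{{D}_{{}_{\text{S}}}{k}_{\text{B}}T}{{\lambda}_{{}_{\text{S}}}^{2}}{\int}_{0}^{L}{\rho}_{{}_{\text{S}}}(x)dx=\frac{{D}_{{}_{\text{S}}}{k}_{\text{B}}T{S}_{\text{total}}}{{\lambda}_{{}_{\text{S}}}^{2}}.$$

Here the second step uses ${J}_{D}=-{D}_{{}_{\text{S}}}{\rho}_{{}_{\text{S}}}^{\prime}(x)$ and ${\mu}^{\prime}(x)={k}_{\text{B}}T{\rho}_{{}_{\text{S}}}^{\prime}(x)/{\rho}_{{}_{\text{S}}}(x)$, and the last step uses the normalization of the substrate profile to ${S}_{\text{total}}$. Halving ${\lambda}_{{}_{\text{S}}}$ at fixed ${S}_{\text{total}}$ therefore quadruples this baseline cost.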
It is informative to also make a comparison between this result and the earlier calculated minimum dissipation needed to counteract the enzyme’s homogenizing activity. Recall that when substrates were sufficiently localized and when diffusion was sufficiently slow, proofreading power could be approximated as (Equation S35)
where ${J}_{\text{bind}}={k}_{\text{on}}{S}_{\text{total}}{\rho}_{{}_{\text{E}}}$ is the total substrate binding flux. Using the identities ${\lambda}_{{}_{\text{ES}}}=\sqrt{D/{k}_{\text{off}}^{\text{S}}}$ and ${K}_{\text{d}}^{\text{S}}={k}_{\text{off}}^{\text{S}}/{k}_{\text{on}}$, we can calculate the ratio of the proofreading power to baseline power as
Presuming for simplicity that the enzyme and substrate diffusion constants are the same, we see that two factors determine the power ratio: (1) the amount of free enzyme in the system (${\rho}_{{}_{\text{E}}}/{K}_{\text{d}}^{\text{S}}$) and (2) the substrate localization length scale relative to the characteristic length scale of complex diffusion (${\lambda}_{{}_{\text{S}}}/{\lambda}_{{}_{\text{ES}}}$). Now, recall that the proofreading cost is spent largely on counteracting the homogenizing activity of the enzyme on right substrates (Appendix 2.1) and that the bulk of fidelity enhancement takes place when ${\lambda}_{{}_{\text{S}}}\lesssim {\lambda}_{{}_{\text{ER}}}$ (Appendix 1.4). Therefore, when tuning ${\lambda}_{{}_{\text{S}}}$ down, initially the power ratio would only depend on the amount of free enzyme in the system (${\rho}_{{}_{\text{E}}}/{K}_{\text{d}}^{\text{S}}$) and then, with tighter substrate localization, the relative contribution of the proofreading power would start to decrease.
In the end, we would like to note that spatial gradients can also be set up using an external potential without a continuous dissipation of energy. In an in vivo setting, gravity can give rise to spatial structures in oocytes (Feric and Brangwynne, 2013), while in an in vitro setting, electric fields can create gradients and power the transport of the complex (Hansen et al., 2017). We leave the investigations of such alternative strategies to future work.
Appendix 3
Studies on the effect of catalysis on the model performance
In Appendix 1, we considered the rate of catalysis at the right boundary to be very small for the analytical simplicity of our derivations. This resulted in expressions for fidelity that were independent of the rate of catalysis r and allowed us to use the complex density at the right boundary as a proxy for speed. In this section, we relax this assumption and explore the consequences of having nonnegligible catalysis rates on the model’s fidelity and on the speed–fidelity tradeoff.
1. Derivation of the complex density profile ${\rho}_{{}_{\text{ES}}}(x)$
Accounting for catalysis in our model should be done through a boundary condition for the complex density equation (Equation S1). Earlier, we imposed a no-flux boundary condition at $x=L$ under the slow catalysis assumption. With nonnegligible catalysis, this assumption is no longer valid, and the boundary condition is modified into
Recall from Equations S4, S6 and S8 that the general solution for the complex profile had the form
Imposing the no-flux boundary condition at $x=0$ allows us to eliminate one of the integration constants, namely,
Next, we impose the new boundary condition at $x=L$ (Equation S46), which yields
Note that we have introduced the dimensionless variable $\epsilon $, which, as we will see later, defines the extent to which the presence of catalysis affects the fidelity. For convenience, here we write different equivalent forms for $\epsilon $ as
Solving for the remaining unknown coefficient ${C}_{1}$ in Equation S52, we find
Lastly, we substitute this result for ${C}_{1}$ into Equation S51 and obtain a general expression for the complex density profile as
One can show in a straightforward way that this result reduces to Equation S13 in the $\epsilon \to 0$ limit.
2. Effects on fidelity in low and high substrate localization regimes
Accounting for the catalysis flux has made the general expression for the complex density profile even less transparent. In order to gain insights about the qualitative as well as quantitative changes introduced by catalysis, we will focus on two characteristic limits of substrate localization – uniform substrate profile (${\lambda}_{{}_{\text{S}}}\to \mathrm{\infty}$) and ideal substrate localization (${\lambda}_{{}_{\text{S}}}\to 0$).
2.1. Uniform substrate profile
In this case, no mechanism for localizing substrates is in play. Let us start off by evaluating the coefficient ${C}_{p}$ (Equation S48) in the ${\lambda}_{{}_{\text{S}}}\to \mathrm{\infty}$ limit. Recalling from Equation S3 that ${\rho}_{{}_{\text{S}}}(0)={S}_{\text{total}}/\left({\lambda}_{{}_{\text{S}}}(1-{e}^{-L/{\lambda}_{{}_{\text{S}}}})\right)$, we find
where ${J}_{\text{bind}}={k}_{\text{on}}{S}_{\text{total}}{\rho}_{{}_{\text{E}}}$ is the total substrate binding flux.
Substituting the expression for ${C}_{p}$ into Equation S55 and eliminating all the terms that vanish upon taking the ${\lambda}_{{}_{\text{S}}}\to \mathrm{\infty}$ limit, we obtain
Ultimately, we are interested in knowing the rate of product formation defined via ${v}_{\text{S}}=r{\rho}_{{}_{\text{ES}}}(L)$. We therefore evaluate the complex density at $x=L$ and multiply it by r, which yields
where in the last step we wrote an equivalent expression using the $L/{\lambda}_{{}_{\text{ES}}}=\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}}$ identity. To analyze this result further, we will consider two limiting cases.
Case 1: Fast diffusion ($\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}}\ll 1$). If diffusion is fast, we can approximate the hyperbolic tangent functions as the arguments themselves (i.e. $\mathrm{tanh}(z)\approx z$ for $z\ll 1$). Then, using the last form of $\epsilon $ in Equation S53, we simplify the expression for speed as
This is an intuitive result, suggesting that an enzyme that diffuses fast acts like a standard Michaelis–Menten enzyme with an effective catalysis rate $\stackrel{~}{r}$. For such an enzyme, the probability of catalysis for a bound substrate is $\stackrel{~}{r}/({k}_{\text{off}}^{\text{S}}+\stackrel{~}{r})$. Multiplying this probability by the net substrate binding flux yields the expression for speed in Equation S61.
Fidelity of the model in this fast diffusion setting can be written as
In the limit where catalysis is very slow ($\stackrel{~}{r}\ll {k}_{\text{off}}^{\text{R}}$), the equilibrium fidelity given by the ratio of off-rates is recovered. And in the opposite limit of very fast catalysis ($\stackrel{~}{r}\gg {k}_{\text{off}}^{\text{W}}$), the discriminatory capacity of the enzyme disappears altogether (Appendix 3—figure 1a).
Case 2: Slow diffusion ($\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}}\gtrsim 1$). A more interesting case is when diffusion is slow. Now, the hyperbolic tangent functions in Equation S59 are approximately 1, allowing us to simplify the expression for speed into
Drawing an analogy between the above result and Equation S61, one can notice the presence of an extra $\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}}$ factor for $\stackrel{~}{r}$ in the denominator.
Evaluating the speeds of right and wrong product formation, we can write fidelity in this slow diffusion setting as
Like the fast diffusion case, when catalysis is very slow ($\stackrel{~}{r}\ll \sqrt{{k}_{\text{off}}^{\text{R}}/{\tau}_{D}}$ or, equivalently, $r\ll \sqrt{D{k}_{\text{off}}^{\text{R}}}$), the equilibrium fidelity is recovered. Unlike the fast diffusion case, however, if catalysis is very fast ($r\gg \sqrt{D{k}_{\text{off}}^{\text{W}}}$), the enzyme partly preserves its discriminatory capacity (Appendix 3—figure 1b). In this limit, a fidelity equal to the square root of the equilibrium fidelity is still attainable, namely,
This unexpected result suggests a potential advantage of localizing fast catalytic reactions instead of having them occur in a well–mixed solution.
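The limiting behaviors discussed in this subsection can be checked numerically. The following Python sketch is illustrative and is not taken from the original analysis. It assumes the closed form $v_{\text{S}}={J}_{\text{bind}}\stackrel{~}{r}/[{k}_{\text{off}}^{\text{S}}(1+\epsilon /\mathrm{tanh}(L/{\lambda}_{{}_{\text{ES}}}))]$ with $\epsilon =r{\lambda}_{{}_{\text{ES}}}/D$, obtained here by solving the steady-state diffusion problem with a uniform binding source, a no-flux wall at $x=0$ and catalysis at $x=L$. This form reduces to the fast and slow diffusion expressions described above. The function names and parameter values are hypothetical.

```python
import math

def speed_uniform(r, k_off, D, L, j_bind=1.0):
    """Speed v_S for a uniform substrate profile (assumed closed form, see lead-in)."""
    lam = math.sqrt(D / k_off)      # lambda_ES, decay length of the complex
    eps = r * lam / D               # epsilon = r * lambda_ES / D
    r_eff = r / L                   # effective catalysis rate, r~ = r / L
    return j_bind * r_eff / (k_off * (1.0 + eps / math.tanh(L / lam)))

def fidelity_uniform(r, k_off_right, k_off_wrong, D, L):
    """Ratio of right to wrong product formation speeds (J_bind cancels)."""
    return speed_uniform(r, k_off_right, D, L) / speed_uniform(r, k_off_wrong, D, L)

# Illustrative off-rates giving eta_eq = 100 (assumed values, arbitrary units)
K_R, K_W = 1.0, 100.0
fast = dict(D=1e6, L=1.0)    # sqrt(tau_D * k_off) << 1 for both substrates
slow = dict(D=1.0, L=10.0)   # sqrt(tau_D * k_off) >> 1 for both substrates
for label, geom in (("fast diffusion", fast), ("slow diffusion", slow)):
    for r in (1e-6, 1e6):    # very slow and very fast catalysis
        print(label, r, fidelity_uniform(r, K_R, K_W, **geom))
```

With these assumed values, slow catalysis gives a fidelity close to ${\eta}_{\text{eq}}=100$ in both regimes, whereas fast catalysis gives a value close to 1 for fast diffusion and close to $\sqrt{{\eta}_{\text{eq}}}=10$ for slow diffusion.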
2.2. Ideal substrate localization
We next consider the effect of catalysis on model fidelity in the ideal substrate localization limit (${\lambda}_{{}_{\text{S}}}\to 0$). We begin by evaluating the ${C}_{p}/{\lambda}_{{}_{\text{S}}}$ ratio that appears in the density profile expression (Equation S55). Using Equations S48 and S3, we find
where in the last step we invoked the identities ${\lambda}_{{}_{\text{ES}}}^{2}=D/{k}_{\text{off}}^{\text{S}}$ and ${J}_{\text{bind}}={k}_{\text{on}}{S}_{\text{total}}{\rho}_{{}_{\text{E}}}$. We then substitute our result for ${C}_{p}/{\lambda}_{{}_{\text{S}}}$ into Equation S55 and simplify the complex density expression into
To obtain the speed, we evaluate ${\rho}_{{}_{\text{ES}}}(x)$ at the right boundary ($x=L$) and multiply it by r, namely,
To evaluate the effect of catalysis further, we again consider two special limits – those of fast and slow diffusion.
Case 1: Fast diffusion ($\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}}\ll 1$). In this limit, the hyperbolic sine function can be approximated by its argument (i.e. $\mathrm{sinh}(z)\approx z$ for $z\ll 1$), while the hyperbolic cosine function is approximately 1. Making these approximations and substituting the expression for $\epsilon $, we obtain
This result is identical to what we found in the fast diffusion limit for the ${\lambda}_{{}_{\text{S}}}\to \mathrm{\infty}$ setting (Equation S61), which is reasonable, since the location of substrate binding is irrelevant if diffusion is very fast (Appendix 3—figure 2a).
Case 2: Slow diffusion ($\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}}\gg 1$). In this limit, the hyperbolic sine and cosine functions can be approximated as exponentials with a $1/2$ prefactor, simplifying the expression of speed into
Recalling the identity $\epsilon =r/\sqrt{D{k}_{\text{off}}^{\text{S}}}$ (note that $\epsilon $ depends on the substrate kind), we evaluate the speed for right and wrong product formation and, dividing them, obtain the fidelity as
In the case where catalysis is slow ($r\ll \sqrt{D{k}_{\text{off}}^{\text{R}}}$), the first term in the fidelity expression becomes approximately 1, and our earlier result, obtained without accounting for catalysis, is recovered (Equation S21). In the opposite limit of fast catalysis ($r\gg \sqrt{D{k}_{\text{off}}^{\text{W}}}$), the first term is no longer 1, and we find
As we can see, fast catalysis in the slow diffusion regime reduces the fidelity by a factor of $\sqrt{{\eta}_{\text{eq}}}$ or, equivalently, reduces the effective number of proofreading realizations by one half, without affecting the exponential amplification term (Appendix 3—figure 2b).
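The same kind of numerical check can be made for the ideal localization limit. The Python sketch below is an illustration under stated assumptions, not the original calculation. It assumes the closed form $v_{\text{S}}={J}_{\text{bind}}\,\epsilon /[\mathrm{sinh}(z)+\epsilon \,\mathrm{cosh}(z)]$ with $z=L/{\lambda}_{{}_{\text{ES}}}$ and $\epsilon =r{\lambda}_{{}_{\text{ES}}}/D$, obtained here by injecting the binding flux at $x=0$ and imposing catalysis at $x=L$. The function names and parameter values are hypothetical.

```python
import math

def speed_localized(r, k_off, D, L, j_bind=1.0):
    """Speed v_S when all substrate binding occurs at x = 0 (assumed closed form)."""
    lam = math.sqrt(D / k_off)      # lambda_ES
    z = L / lam                     # equals sqrt(tau_D * k_off)
    eps = r * lam / D               # epsilon = r / sqrt(D * k_off)
    return j_bind * eps / (math.sinh(z) + eps * math.cosh(z))

def fidelity_localized(r, k_off_right, k_off_wrong, D, L):
    """Ratio of right to wrong product formation speeds (J_bind cancels)."""
    return speed_localized(r, k_off_right, D, L) / speed_localized(r, k_off_wrong, D, L)

# Slow diffusion example with eta_eq = 4 (assumed values, arbitrary units)
K_R, K_W, D, L = 1.0, 4.0, 1.0, 5.0
eta_slow_catalysis = fidelity_localized(1e-6, K_R, K_W, D, L)
eta_fast_catalysis = fidelity_localized(1e6, K_R, K_W, D, L)
# The ratio of the two should be close to sqrt(eta_eq) = 2
print(eta_slow_catalysis / eta_fast_catalysis)
```

In the fast diffusion regime the same function approaches the Michaelis–Menten form ${J}_{\text{bind}}\stackrel{~}{r}/({k}_{\text{off}}^{\text{S}}+\stackrel{~}{r})$, in line with the statement that the binding location becomes irrelevant when diffusion is very fast.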
To conclude, our study demonstrated the expected reduction of fidelity with increasing catalysis rate. In the case of fast diffusion, up to a factor of ${\eta}_{\text{eq}}$ reduction is possible, as is the case for the original proofreading scheme (Hopfield, 1974; Wong et al., 2018). In the case of slow diffusion, however, the cap on the amount of reduction is decreased down to $\sqrt{{\eta}_{\text{eq}}}$. The advantage of this feature is most notable in the limit of a nonlocalized (i.e. uniform) substrate profile and fast catalysis where a diffusing enzyme is still capable of discriminating between substrates. This behavior would not be possible for a Michaelis–Menten enzyme in a well-mixed solution.
3. Effects on the speed–fidelity tradeoff
In Figure 3a of the main text we explored the speed–fidelity tradeoff in the slow catalysis limit. This tradeoff arose in response to tuning the substrate localization length scale (${\lambda}_{{}_{\text{S}}}$) and the diffusion time scale (${\tau}_{D}$). Here, we explore the changes to this tradeoff behavior in the case where the effects of catalysis are not negligible. For concreteness, we focus on alterations to the Pareto front of the tradeoff achieved in the ${\lambda}_{{}_{\text{S}}}\to 0$ limit.
Appendix 3—figure 3a compares the Pareto fronts in the cases of slow and fast catalysis limits. In each case, speed is normalized by the corresponding effective Michaelis–Menten speed that is reached in the fast diffusion limit and is given by ${v}_{{}_{\text{MM}}}={J}_{\text{bind}}\times \stackrel{~}{r}/({k}_{\text{off}}^{\text{R}}+\stackrel{~}{r})$, where $\stackrel{~}{r}=r/L$. One can notice a shift of the fast catalysis front toward the low-fidelity region, which was expected since earlier we observed the complete loss of substrate discrimination when diffusion and catalysis were both fast (Appendix 3—figure 2a).
Appendix 3—figure 3a may leave an impression that faster catalysis leads to a less favorable speed–fidelity tradeoff. Note, however, that the speed ${v}_{{}_{\text{MM}}}(\stackrel{~}{r})$ used to normalize the y-axis is itself a function of the catalysis rate and penalizes the fast catalysis case more than its slow counterpart. To eliminate this ambiguity, we plotted a family of Pareto fronts for increasing values of the catalysis rate but this time normalizing the y-axis by the $r$-independent quantity ${J}_{\text{bind}}$ (Appendix 3—figure 3b). As can be seen, faster catalysis in fact improves the speed–fidelity tradeoff, meaning that in order to maximize fidelity at a given speed level, the best strategy would be to increase the catalysis rate and correspondingly slow down the diffusion.
A tradeoff between speed and fidelity also arises in response to the sole alteration of the catalysis rate, while keeping the rest of the model parameters fixed. To explore this tradeoff for an arbitrary fixed choice of ${\lambda}_{{}_{\text{S}}}$ and ${\tau}_{D}$, we begin by evaluating speed from Equation S55, namely,
In the last step, we introduced coefficients ${a}_{{}_{\text{S}}}$ and ${b}_{{}_{\text{S}}}$ that are independent of $r$, and used the fact that $\epsilon \sim r$.
Now, using the definition of fidelity and the result obtained above, we can write
Notice that the ratio ${a}_{{}_{\text{R}}}/{a}_{{}_{\text{W}}}\equiv {\eta}_{0}$ is the fidelity in the limit of very slow catalysis ($r\to 0$). Substituting it, we write
where $\mathrm{\Delta}b={b}_{{}_{\text{R}}}-{b}_{{}_{\text{W}}}$. Recalling that $\epsilon ={\lambda}_{{}_{\text{ES}}}r/D$ and noting the functional form of the denominator in Equation S75, one can show that ${b}_{{}_{\text{S}}}={D}^{-1}{\lambda}_{{}_{\text{ES}}}/\mathrm{tanh}(L/{\lambda}_{{}_{\text{ES}}})$. This is an increasing function of ${\lambda}_{{}_{\text{ES}}}$ and hence a decreasing function of ${k}_{\text{off}}^{\text{S}}$, implying that $\mathrm{\Delta}b>0$.
With this condition in mind, we can see from Equation S78 that speed and fidelity are anticorrelated with a linear slope when tuning the catalysis rate, unlike the more sophisticated tradeoff relations when tuning the other model parameters. The peak fidelity ${\eta}_{0}$ is attained in the limit of vanishing speed. And conversely, speed is the highest when fidelity is the lowest for the given fixed values of ${\lambda}_{{}_{\text{S}}}$ and ${\tau}_{D}$ (Appendix 3—figure 4).
Overall, our result illustrates the simple speed–fidelity tradeoff that can be navigated by tuning the catalysis rate. This, for instance, can be achieved by changing the concentration of effectors that activate the enzyme for catalysis.
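The linear relation between speed and fidelity under tuning of the catalysis rate can be illustrated with a short Python sketch. It assumes only what the text states: a speed of the form ${v}_{\text{S}}={a}_{{}_{\text{S}}}r/(1+{b}_{{}_{\text{S}}}r)$ with ${b}_{{}_{\text{S}}}={D}^{-1}{\lambda}_{{}_{\text{ES}}}/\mathrm{tanh}(L/{\lambda}_{{}_{\text{ES}}})$. The coefficients `a_R` and `a_W` are placeholders, since their actual values depend on ${\lambda}_{{}_{\text{S}}}$ and ${\tau}_{D}$, and all names and numbers are hypothetical.

```python
import math

def b_coeff(k_off, D, L):
    """b_S = lambda_ES / (D * tanh(L / lambda_ES)), as quoted in the text."""
    lam = math.sqrt(D / k_off)
    return lam / (D * math.tanh(L / lam))

def speed(r, a, b):
    """Speed of the assumed form v = a * r / (1 + b * r)."""
    return a * r / (1.0 + b * r)

def tradeoff_point(r, a_R, a_W, k_R, k_W, D=1.0, L=10.0):
    """Return (v_R, fidelity, fidelity predicted by the linear relation)."""
    b_R, b_W = b_coeff(k_R, D, L), b_coeff(k_W, D, L)
    v_R = speed(r, a_R, b_R)
    eta = v_R / speed(r, a_W, b_W)
    eta_0 = a_R / a_W                       # fidelity in the r -> 0 limit
    eta_linear = eta_0 * (1.0 - (b_R - b_W) * v_R / a_R)
    return v_R, eta, eta_linear

# Placeholder coefficients and off-rates (assumed values)
for r in (0.01, 1.0, 100.0):
    print(tradeoff_point(r, a_R=2.0, a_W=0.5, k_R=0.1, k_W=1.0))
```

For every value of $r$, the directly computed fidelity and the linear prediction ${\eta}_{0}(1-\mathrm{\Delta}b\,{v}_{\text{R}}/{a}_{{}_{\text{R}}})$ coincide, which is the anticorrelation with a constant slope described above.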
Appendix 4
Proofreading for substrates with different localization conditions
Following the original treatment by Hopfield, 1974, we have performed the studies of our model under the assumption that discrimination between right and wrong substrates is solely based on their off–rates (${k}_{\text{off}}^{\text{W}}>{k}_{\text{off}}^{\text{R}}$). Although this is often the signature difference between substrates, in a cellular setting substrate discrimination may occur through other factors also. For example, substrates may be present at different amounts or they may have nonidentical on–rates. These differences, however, have a multiplicative effect on the fidelity (i.e. $\eta \sim ({k}_{\text{on}}^{\text{R}}[\text{R}])/({k}_{\text{on}}^{\text{W}}[\text{W}])$) and do not highlight the proofreading capacity of a particular model.
Unlike these two features, differences in the degree to which right and wrong substrates are localized can have a nontrivial effect on the proofreading performance. In this Appendix, we generalize our study of the model fidelity to cases where right and wrong substrates have unequal localization length scales ${\lambda}_{{}_{\text{R}}}$ and ${\lambda}_{{}_{\text{W}}}$, respectively.
1. Limiting cases
We start off by exploring the limiting cases first. From the earlier derived Equation S14 and Equation S15, we know that the complex density at $x=L$ in very low (${\lambda}_{{}_{\text{S}}}\gg L$) and very high (${\lambda}_{{}_{\text{S}}}\ll L$) substrate localization regimes is given by
respectively. Note that the complex density in the ideal localization case is necessarily lower than that in the case of a uniform profile, since the inequality $\mathrm{sinh}(L/{\lambda}_{{}_{\text{ES}}})>L/{\lambda}_{{}_{\text{ES}}}$ holds for all choices of ${\lambda}_{{}_{\text{ES}}}$. If ${\lambda}_{{}_{\text{R}}}$ and ${\lambda}_{{}_{\text{W}}}$ are not constrained to be equal, then the highest fidelity for a given ${\tau}_{D}$ will be attained when the right substrates are distributed uniformly while the wrong substrates are highly localized (${\lambda}_{{}_{\text{R}}}\gg L$ and ${\lambda}_{{}_{\text{W}}}\ll L$, respectively). We obtain the fidelity in this case as
Notably, this result for maximum fidelity enhancement is independent of ${k}_{\text{off}}^{\text{R}}$. Furthermore, it exceeds the ideal localization fidelity reported in the main text (Equation 5, derived in the ${\lambda}_{{}_{\text{S}}}\to 0$ limit), which was expected since now the right complexes on average travel a shorter distance to reach the activation site than the wrong complexes.
In the opposite scenario where the wrong substrates are uniformly distributed and the right ones are highly localized (${\lambda}_{{}_{\text{R}}}\ll L$ and ${\lambda}_{{}_{\text{W}}}\gg L$, respectively), the system attains its lowest fidelity for a given ${\tau}_{D}$, namely,
Since $L/{\lambda}_{{}_{\text{ER}}}<\mathrm{sinh}(L/{\lambda}_{{}_{\text{ER}}})$, the lowest fidelity is less than the equilibrium fidelity itself (${\eta}^{\text{min}}<{\eta}_{\text{eq}}$), suggesting that the enzyme may in fact do antiproofreading (Murugan et al., 2014) if the wrong substrates are generally closer to the catalytic site.
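To make the two limiting cases concrete, the Python sketch below evaluates them for given off-rates and diffusion time scale. It is a reconstruction under assumptions, not the original expressions. The complex density at $x=L$ is taken to be proportional to $1/{k}_{\text{off}}^{\text{S}}$ for a uniform profile, with the ideal localization case carrying an additional factor $(L/{\lambda}_{{}_{\text{ES}}})/\mathrm{sinh}(L/{\lambda}_{{}_{\text{ES}}})$; this matches the inequality invoked above. The function name and parameter values are hypothetical.

```python
import math

def fidelity_limits(k_off_right, k_off_wrong, tau_D):
    """Return (eta_min, eta_equal, eta_max) for unequal substrate localization."""
    eta_eq = k_off_wrong / k_off_right
    z_R = math.sqrt(tau_D * k_off_right)    # L / lambda_ER
    z_W = math.sqrt(tau_D * k_off_wrong)    # L / lambda_EW
    # Right substrates uniform, wrong substrates ideally localized
    eta_max = eta_eq * math.sinh(z_W) / z_W
    # Right substrates ideally localized, wrong substrates uniform
    eta_min = eta_eq * z_R / math.sinh(z_R)
    # Both substrates ideally localized (equal localization reference)
    eta_equal = math.sqrt(eta_eq) * math.sinh(z_W) / math.sinh(z_R)
    return eta_min, eta_equal, eta_max

# Assumed example: k_off^R = 0.1 1/s, k_off^W = 1 1/s, tau_D = 100 s
print(fidelity_limits(0.1, 1.0, 100.0))
```

In this sketch the enhancement ${\eta}^{\text{max}}/{\eta}_{\text{eq}}$ depends only on ${k}_{\text{off}}^{\text{W}}$ and ${\tau}_{D}$, and ${\eta}^{\text{min}}$ falls below ${\eta}_{\text{eq}}$, as described in the text.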
2. Intermediate levels of substrate localization
In Figure 3 inset as well as in Appendix 1.4, we explored the dependence of fidelity on the substrate localization length scale ${\lambda}_{{}_{\text{S}}}$ when it was the same for the two substrate kinds. Here, we expand this study to the case where this constraint is relaxed.
In particular, using Equation S24, we calculate complex densities and corresponding fidelity values as a function of ${\lambda}_{{}_{\text{R}}}$ for different fixed choices of the length scale ratio ${\lambda}_{{}_{\text{R}}}/{\lambda}_{{}_{\text{W}}}$. The results of the study are captured in Appendix 4—figure 1. In the special case where the two length scales are equal (${\lambda}_{{}_{\text{R}}}$ = ${\lambda}_{{}_{\text{W}}}$, solid black line), fidelity depends monotonically on $L/{\lambda}_{{}_{\text{R}}}$, and in the limit of ideal localization (very large $L/{\lambda}_{{}_{\text{R}}}$) the result in Equation 5 of the main text is recovered.
When ${\lambda}_{{}_{\text{R}}}\ne {\lambda}_{{}_{\text{W}}}$, the dependence of fidelity on $L/{\lambda}_{{}_{\text{R}}}$ is no longer monotonic. If right substrates are more localized than the wrong ones (red curves), then the fidelity curves have a minimum where the enzyme does antiproofreading (i.e. $\eta <{\eta}_{\text{eq}}$). The proofreading portion of the curves (when $\eta >{\eta}_{\text{eq}}$) is shifted to the right, suggesting that much higher substrate localization is needed for the enzyme to proofread.
The opposite case is when the right substrates have a shallower gradient than the wrong ones (blue curves). The fidelity curves are now shifted to the left and have a peak that is greater than the large $L/{\lambda}_{{}_{\text{R}}}$ limit of fidelity. This means that there is an optimal degree of substrate localization, going beyond which makes the model performance worse in terms of both error correction and energy consumption.
Over the course of its diffusive transport, a bound enzyme is more likely to deposit a right substrate in a substrate-depleted region than a wrong one, because right substrates stay attached to the enzyme for a longer time. Therefore, if the gradient-maintaining mechanism does not discriminate between substrates (which we assume is the case for the kinase/phosphatase-based one), then it will be easier for it to keep the wrong ones localized since they tend to get deposited closer to the localization site (see Appendix 6—figure 1c as an example). This means that in a realistic setting the spatial organization of substrates is more likely to be in the advantageous blue region of Appendix 4—figure 1 where ${\lambda}_{{}_{\text{R}}}>{\lambda}_{{}_{\text{W}}}$, facilitating the realization of spatial proofreading.
Appendix 5
Studies on the validity of the uniform free enzyme profile assumption
In our treatment of the model so far, we have assumed for mathematical convenience that free enzymes are in excess, which suggested the approximation ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$. Example enzyme density profiles shown in Appendix 5—figure 1, however, demonstrate that this assumption does not hold in general. Specifically, there is a depletion of free enzymes near the substrate localization site and abundance near the catalysis site. Because of this depletion at the leftmost edge, we expect a reduction in speed in comparison with our earlier treatment where a flat profile was assumed. In addition, if substrates have a weak gradient, we expect the fidelity to also be reduced, since more enzymes will bind substrates at intermediate positions, reducing the average travel distance to the catalytic site. In what follows, we discuss in greater detail the consequences of having a nonuniform free enzyme distribution on the model performance.
1. Effects that relaxing the ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$ assumption has on the Pareto front
We begin by studying the effects of relaxing the uniform free enzyme profile assumption on the Pareto front of the speed–fidelity tradeoff (Figure 3a of the main text). This front is reached in the ideal substrate localization limit (${\lambda}_{{}_{\text{S}}}\to 0$). Though in general enzyme profiles need to be obtained using numerical methods due to the nonlinearity of reaction–diffusion equations, in this particular limit (${\lambda}_{{}_{\text{S}}}\to 0$) an analytical solution is available. To obtain it, we write the reaction–diffusion equations in the bulk region of space as
Substrate binding reactions did not enter the above equations, as they occur at the leftmost boundary only. They are instead accounted for via boundary conditions, which read
where ${S}_{\text{total}}$ is the total amount of free substrate of each kind concentrated at $x=0$.
Relating local enzyme concentrations
Considering the system at steady state, we add Equations S85–S87 and obtain
where we replaced the partial derivatives with total derivatives since the profiles are time-independent. Dividing Equation S91 by D and integrating once, we find
The above relation must hold for arbitrary position x. Choosing $x=0$ and noting that from Equations S88–S90 the sum of fluxes should be zero, we can claim that ${C}_{1}=0$. Integrating for the second time, we obtain
where ${C}_{2}$ is now a different constant. To find it, we integrate once more, this time across the entire compartment, namely,
Here, we introduced the parameter ${E}_{\text{total}}$ as the total number of enzymes in the system (in free or bound forms). The constant ${C}_{2}$, which we will rename to ${\rho}_{0}$, is then the average enzyme density, that is,
Substituting this result into Equation S93, we find an insightful relation between free and bound enzyme densities at an arbitrary position, namely,
This relation suggests that whenever the local concentration of bound enzymes is high, the local concentration of free enzymes should be correspondingly low, as we see reflected in the profiles of Appendix 5—figure 1.
Deriving the fidelity expression
Next, we consider Equations S85 and S86 separately at steady state, written in the form
The general solution to this ODE reads
where ${\lambda}_{{}_{\text{ES}}}=\sqrt{D/{k}_{\text{off}}^{\text{S}}}$, and ${C}_{1}^{\text{S}}$ and ${C}_{2}^{\text{S}}$ (S = R, W) are constants which are different for right and wrong complexes. The no-flux boundary condition at $x=L$ can be used to relate these constants and simplify the complex profile expression, namely,
where ${\stackrel{~}{C}}_{1}^{\text{S}}=2{C}_{1}^{\text{S}}{e}^{L/{\lambda}_{{}_{\text{ES}}}}$ is a new constant coefficient introduced for convenience.
Now, the fidelity of the scheme is the ratio of right and wrong complex densities at $x=L$. Using the result above, the fidelity can be written as
The ratio of these constant coefficients can be obtained by noting that the diffusive fluxes of right and wrong complexes at $x=0$ are identical (from Equations S88 and S89), that is,
Substituting this result into Equation S102, and recalling the equality $L/{\lambda}_{{}_{\text{ES}}}=\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{S}}}$, we obtain
This expression is identical to that in Equation S20 which was derived under the ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$ assumption, suggesting that when substrates are highly localized, the shape of the free enzyme profile does not dictate the fidelity.
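For reference, the chain of steps in this subsection can be summarized in a worked form. The block below is a sketch assembled from the definitions given above (the no-flux condition at $x=L$, the definition of ${\stackrel{~}{C}}_{1}^{\text{S}}$, and the equality of the diffusive fluxes at $x=0$). It is not a reproduction of the original numbered equations.

$$
\begin{aligned}
{\rho}_{{}_{\text{ES}}}(x) &= {\stackrel{~}{C}}_{1}^{\text{S}}\,\mathrm{cosh}\!\left(\frac{L-x}{{\lambda}_{{}_{\text{ES}}}}\right), \\
\eta &= \frac{{\rho}_{{}_{\text{ER}}}(L)}{{\rho}_{{}_{\text{EW}}}(L)} = \frac{{\stackrel{~}{C}}_{1}^{\text{R}}}{{\stackrel{~}{C}}_{1}^{\text{W}}}, \\
-D\,\frac{\mathrm{d}{\rho}_{{}_{\text{ES}}}}{\mathrm{d}x}\bigg|_{x=0} &= \frac{D}{{\lambda}_{{}_{\text{ES}}}}\,{\stackrel{~}{C}}_{1}^{\text{S}}\,\mathrm{sinh}\!\left(\frac{L}{{\lambda}_{{}_{\text{ES}}}}\right)
\;\Rightarrow\;
\frac{{\stackrel{~}{C}}_{1}^{\text{R}}}{{\stackrel{~}{C}}_{1}^{\text{W}}} = \frac{{\lambda}_{{}_{\text{ER}}}}{{\lambda}_{{}_{\text{EW}}}}\,\frac{\mathrm{sinh}(L/{\lambda}_{{}_{\text{EW}}})}{\mathrm{sinh}(L/{\lambda}_{{}_{\text{ER}}})}, \\
\eta &= \sqrt{{\eta}_{\text{eq}}}\;\frac{\mathrm{sinh}\!\left(\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{W}}}\right)}{\mathrm{sinh}\!\left(\sqrt{{\tau}_{D}{k}_{\text{off}}^{\text{R}}}\right)}.
\end{aligned}
$$

The last line uses ${\lambda}_{{}_{\text{ER}}}/{\lambda}_{{}_{\text{EW}}}=\sqrt{{k}_{\text{off}}^{\text{W}}/{k}_{\text{off}}^{\text{R}}}=\sqrt{{\eta}_{\text{eq}}}$. The free enzyme density at $x=0$ enters both fluxes identically and therefore cancels, which is why the shape of the free enzyme profile does not affect the fidelity in this limit.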
Deriving the speed expression
To keep the expression of speed compact while still illustrating the key consequences of relaxing the ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$ assumption, we will assume moving forward that the density of wrong complexes is much lower than that of the right complexes, that is, ${\rho}_{{}_{\text{EW}}}(x)\ll {\rho}_{{}_{\text{ER}}}(x)$. This assumption holds as long as the right and wrong complexes have sufficiently different off-rates. To see why this is the case, note that the ratio ${\rho}_{{}_{\text{EW}}}(x)/{\rho}_{{}_{\text{ER}}}(x)$ is the highest at $x=0$. We therefore calculate an upper bound for the ratio using Equation S101 and Equation S105 as
As long as ${\eta}_{\text{eq}}\gtrsim 10$, it is fair to assume that the right complexes greatly outnumber the wrong ones, which allows us to approximate the free enzyme density from Equation S96 as ${\rho}_{{}_{\text{E}}}(x)\approx {\rho}_{0}-{\rho}_{{}_{\text{ER}}}(x)$.
The specification of the right complex density profile requires the knowledge of the unknown coefficient ${\stackrel{~}{C}}_{1}^{\text{R}}$. To find this coefficient, we use the boundary condition in Equation S88 and the approximation ${\rho}_{{}_{\text{E}}}(x)\approx {\rho}_{0}-{\rho}_{{}_{\text{ER}}}(x)$ to write
With the constant coefficient known, the right complex density then becomes
where we used the definitions of the mean substrate density ${\overline{\rho}}_{{}_{\text{S}}}={S}_{\text{total}}/L$ and the dissociation constant ${K}_{\text{d}}^{\text{R}}={k}_{\text{off}}^{\text{R}}/{k}_{\text{on}}$.
To enable a direct parallel between this general treatment and the earlier one with the ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$ approximation, let us introduce ${\rho}_{{}_{\text{ER}}}^{\mathrm{\infty}}$ as the uniform right complex density when diffusion is very fast (${\lambda}_{{}_{\text{ER}}}\gg L$) and calculate it from Equation S110 as
Now, using the ${\rho}_{{}_{\text{ER}}}^{\mathrm{\infty}}$ expression, we rewrite Equation S110 as
where ${\rho}_{{}_{\text{ER}}}^{\text{const}}(x)$ is the complex density obtained under the ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$ assumption (Equation S15). The extra factor that appears in front does not exceed 1 since $\gamma \ge 1$, indicating a reduction in speed, as we anticipated in our more qualitative discussion at the beginning of the section. The presence of the extra factor suggests two possibilities for the approximation to hold true: first, $\gamma \approx 1$, which happens when ${\lambda}_{{}_{\text{ER}}}\gtrsim L$, that is, when the right complex does not decay noticeably across the compartment; and second, $\gamma >1$ and ${\overline{\rho}}_{{}_{\text{S}}}\ll {\gamma}^{-1}{K}_{\text{d}}^{\text{R}}$, which is when right complexes do decay but their fraction is low compared with free enzymes because of low substrate concentration.
Let us demonstrate the last statement more explicitly. Specifically, let us show that the validity of the approximation ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$ is indeed linked directly to the fraction of bound enzymes. To that end, we evaluate ${\rho}_{{}_{\text{E}}}(0)/{\rho}_{{}_{\text{E}}}(L)$ as a metric that quantifies the degree to which ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$ holds. If there is a large depletion of free enzymes near the substrate-binding site, then the metric will be significantly less than 1; conversely, if the free enzyme profile is practically flat, then the metric will be close to 1. Invoking the relation ${\rho}_{{}_{\text{E}}}(x)\approx {\rho}_{0}-{\rho}_{{}_{\text{ER}}}(x)$ and using our result for the complex density (Equation S110) as well as the definition of $\gamma $ in Equation S112, we evaluate this metric as
Next, we calculate the fraction of bound enzymes ${p}_{\text{bound}}$ from Equation S110 as
Note that ${\gamma}^{-1}$ emerges as the highest fraction of bound enzymes (${p}_{\text{bound}}^{\text{max}}$) reached in the large substrate concentration limit.
To link the metric ${\rho}_{{}_{\text{E}}}(0)/{\rho}_{{}_{\text{E}}}(L)$ to the fraction of bound enzymes, we express ${\overline{\rho}}_{{}_{\text{S}}}/{K}_{\text{d}}^{\text{R}}$ in terms of ${p}_{\text{bound}}$ and substitute it into Equation S113, namely,
Now, when the complexes do not decay appreciably across the compartment (${\lambda}_{{}_{\text{ER}}}\gtrsim L$ and thus, $\mathrm{cosh}(L/{\lambda}_{{}_{\text{ER}}})\approx 1$), the metric becomes roughly equal to 1, suggesting that the free enzyme profile is practically flat. A more interesting case is when the complexes do decay (${\lambda}_{{}_{\text{ER}}}<L$), as in Appendix 5—figure 1. In this case, applying the condition $\mathrm{cosh}(L/{\lambda}_{{}_{\text{ER}}})\gg 1$, we find
The anticorrelation between ${\rho}_{{}_{\text{E}}}(0)/{\rho}_{{}_{\text{E}}}(L)$ and ${p}_{\text{bound}}$ in the above result demonstrates that the degree to which the approximation ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$ is violated is indeed dictated by the fraction of bound enzymes.
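The link between the flatness metric and the bound fraction can also be illustrated numerically. The Python sketch below is a reconstruction under assumptions rather than the original calculation. It takes the right complex profile to be proportional to $\mathrm{cosh}((L-x)/{\lambda}_{{}_{\text{ER}}})$, fixes its amplitude from the flux balance at $x=0$ with ${\rho}_{{}_{\text{E}}}\approx {\rho}_{0}-{\rho}_{{}_{\text{ER}}}$, and assumes $\gamma =(L/{\lambda}_{{}_{\text{ER}}})/\mathrm{tanh}(L/{\lambda}_{{}_{\text{ER}}})$, which is the form consistent with the properties of $\gamma $ stated above. The function name and inputs are hypothetical.

```python
import math

def bound_enzyme_stats(s_over_Kd, z):
    """s_over_Kd: mean substrate density divided by K_d^R; z: L / lambda_ER.

    Returns (p_bound, rho_E(0)/rho_E(L), gamma) under the assumptions in the lead-in.
    """
    # Amplitude c of rho_ER(x)/rho_0 = c * cosh((L - x)/lambda_ER), from the x = 0 flux balance
    c = s_over_Kd * z / (math.sinh(z) + s_over_Kd * z * math.cosh(z))
    p_bound = c * math.sinh(z) / z                    # spatial average of rho_ER / rho_0
    flatness = (1.0 - c * math.cosh(z)) / (1.0 - c)   # rho_E(0) / rho_E(L)
    gamma = z / math.tanh(z)
    return p_bound, flatness, gamma

# Assumed example: complexes decay appreciably (z = 3) at moderate substrate level
p, flat, g = bound_enzyme_stats(0.5, 3.0)
print(p, flat, g, 1.0 - g * p)   # for cosh(z) >> 1, flat is roughly 1 - gamma * p_bound
```

In this sketch the bound fraction equals $({\overline{\rho}}_{{}_{\text{S}}}/{K}_{\text{d}}^{\text{R}})/(1+\gamma {\overline{\rho}}_{{}_{\text{S}}}/{K}_{\text{d}}^{\text{R}})$ and saturates at ${\gamma}^{-1}$, while the metric decreases as ${p}_{\text{bound}}$ grows.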
Pareto front shift
The previous calculations showed that, in the ideal substrate localization limit, relaxing the ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$ assumption keeps the fidelity the same while reducing the speed, and that this reduction is greater for higher substrate concentrations. We therefore expect a shift in the Pareto front when going to the high substrate concentration limit, as is illustrated in Appendix 5—figure 2a. To get more intuition about the effect of this shift caused by tuning the amount of substrates, we consider the effective number of proofreading realizations at half-maximum speed (${n}_{50}$) and study how this number changes as a function of the fraction of enzymes bound (${p}_{\text{bound}}$), which increases monotonically with ${S}_{\text{total}}$ as suggested by Equation S114. Appendix 5—figure 2b shows this dependence. As can be seen, ${n}_{50}$ decreases roughly linearly with ${p}_{\text{bound}}$; for example, if 10% of the enzymes are bound, then a 10% reduction in ${n}_{50}$ is expected. This suggests that as long as the fraction of bound enzymes is low, our findings related to the Pareto front made under the ${\rho}_{{}_{\text{E}}}\approx \text{constant}$ assumption will generally hold true.
2. Effects that relaxing the ${\rho}_{{}_{\text{E}}}(x)\approx \text{constant}$ assumption has on fidelity in a weak substrate gradient setting
In this section, we study how accounting for the spatial distribution of free enzymes affects our results on the model’s fidelity in the setting where substrates have a finite localization length scale ${\lambda}_{{}_{\text{S}}}$. In this setting, Equations (1–3) (in the main text) describing the system’s dynamics become a system of nonlinear equations, which we solve at steady state using numerical methods.
An example curve of how fidelity changes with tuning diffusion time scale in a finite ${\lambda}_{{}_{\text{S}}}$ setting is shown in Appendix 5—figure 3. As expected, the nonuniform free enzyme profile leads to a reduction in fidelity. This reduction is not significant when diffusion is relatively fast as in that case the free enzyme profile manages to flatten out rapidly. The reduction is not significant also in the very slow diffusion limit where binding events that lead to production primarily take place in the proximity of the activation region and hence, the nonuniform profile of free enzymes across the compartment has little impact on fidelity. The greatest reduction happens at intermediate diffusion time scales; in particular, when the system achieves its peak fidelity.
To quantify the extent of this highest reduction, we calculated the peak value of the effective number of proofreading realizations (${n}_{\text{max}}$) for different free substrate amounts which regulate the fraction of bound enzymes (${p}_{\text{bound}}$). The results obtained for different choices of ${\lambda}_{{}_{\text{S}}}$ are summarized in Appendix 5—figure 4. As can be seen, for the high substrate localization case (${\lambda}_{{}_{\text{S}}}/L=0.04$), there is a roughly linear dependence between ${n}_{\text{max}}$ and ${p}_{\text{bound}}$. The initial decrease in ${n}_{\text{max}}$ with growing ${p}_{\text{bound}}$ is even slower when substrates are less tightly localized (${\lambda}_{{}_{\text{S}}}/L=0.10,0.30$).
Taken together, these results suggest that if the substrate concentration is low enough to leave most of the enzymes unbound, then our proposed scheme will proofread efficiently. And this requirement on substrate amount will be further relaxed if diffusion is fast, or if substrates are not very tightly localized.
Appendix 6
Proofreading on a kinase/phosphatase-induced gradient
In this section, we introduce the mathematical modeling setup for the kinase/phosphatase-based gradient formation scheme and describe how its fidelity is calculated numerically. In the end, we discuss the energetics of setting up the substrate concentration gradient and link our calculations to the lower bounds on energy cost obtained earlier in Appendix 2.
1. Setup and estimation of fidelity
In the analysis thus far, we have imposed a gradient of free substrates and analyzed the proofreading capability of an enzyme acting on this gradient. In a living cell, gradients themselves are maintained by active cellular processes. However, the action of the enzyme – that is, binding a substrate in one spatial location, diffusing away, and releasing the substrate elsewhere – can destroy the gradient, and thereby lead to a loss of proofreading. Here, we analyze the consequences of free substrate depletion and gradient flattening caused by the enzyme.
We model the formation of a substrate gradient by a combination of localized activation and delocalized deactivation. We suppose that substrates can exist in phosphorylated or dephosphorylated forms, and that only the phosphorylated form is capable of binding to the enzyme. The substrates are phosphorylated by a kinase with rate ${k}_{\text{kin}}=0.2\ {\text{s}}^{-1}$, and dephosphorylated by a phosphatase with rate ${k}_{\text{p}}=5\ {\text{s}}^{-1}$. Crucially, we assume that phosphatases are found everywhere in the domain of size $L\sim 10$ μm (a typical length scale in a eukaryotic cell), while kinases are localized to one end of the domain (at $x=0$), as may occur naturally if kinases are bound to one of the membranes enclosing the domain.
The minimal dynamics of phosphorylated substrates and enzyme–substrate complexes is then given by
augmented by the boundary conditions
Here, we have supposed that the densities of free enzymes, dephosphorylated substrates, and phosphatases are fixed and uniform, and have absorbed them into the relevant rate constants (${k}_{\text{b}}={k}_{\text{on}}{\rho}_{{}_{\text{E}}}$, ${k}_{\text{kin}}$, and ${k}_{\text{p}}$, respectively). For simplicity, we have also assumed that the free substrates and enzyme–substrate complexes have the same diffusion coefficient $D=1\ \mu {\text{m}}^{2}/\text{s}$. We note that accounting for distinct diffusivities of phosphorylated and unphosphorylated substrate forms (Kholodenko, 2009) would affect the speed, while accounting for the slower diffusion of the enzyme–substrate complex would alter the estimates of both speed and fidelity of the model. One or several of these effects can be considered when studying a specific biological system where these microscopic details are known.
We numerically solve Equations S118 and S119 at steady state to obtain the concentration profiles. First, the equations of dynamics are made dimensionless by setting the units of length and time to L ($\overline{x}=x/L$) and ${\tau}_{D}\equiv {L}^{2}/D$ ($\overline{t}=t/{\tau}_{D}$), respectively. At steady state, the dimensionless equations read
with boundary conditions
where concentrations have been rescaled as $\overline{\rho}=\rho L$, and kinetic rates as $\overline{k}=k{\tau}_{D}$.
We discretize the steady state equations on a grid with spacing $\mathrm{\Delta}\overline{x}=0.01$, approximating the second derivative as
This is ill-defined at the boundaries $\overline{x}=0$ and $\overline{x}=1$, which is addressed by incorporating the boundary conditions. For illustration, consider the left boundary, $\overline{x}=0$, and suppose that our domain also included a point at $\overline{x}=-\mathrm{\Delta}\overline{x}$. Then, we could approximate the boundary condition ${\overline{\nabla}{\overline{\rho}}_{{}_{\text{S}}}|}_{\overline{x}=0}=-{\overline{k}}_{\text{kin}}$ by a centred difference scheme, and solve for the fictional point at $\overline{x}=-\mathrm{\Delta}\overline{x}$, namely,
which, when inserted into Equation S122, specifies ${\overline{\nabla}}^{2}{\overline{\rho}}_{{}_{\text{S}}}$ at $\overline{x}=0$, that is,
For the boundary at the right ($\overline{x}=1$) as well as for the boundary conditions for ${\overline{\rho}}_{{}_{\text{ES}}}$, we similarly implement no-flux boundary conditions. After discretizing, Equation S120 can then be written in a matrix form as
where ${\overrightarrow{\rho}}_{\text{S}}$, ${\overrightarrow{\rho}}_{\text{ES}}$ are column vectors of the nondimensionalized concentration profiles evaluated at the spatial grid points, that is, ${[\overline{\rho}(0),\overline{\rho}(\mathrm{\Delta}\overline{x}),\mathrm{\cdots}]}^{T}$. Solving these matrix equations yields
We compute Equation S125 numerically for two substrates: a cognate (‘R’) and a noncognate (‘W’), which differ in their off-rates (${k}_{\text{off}}^{\text{R}}=0.1\ {\text{s}}^{-1}$ and ${k}_{\text{off}}^{\text{W}}=1\ {\text{s}}^{-1}$, respectively). Having the density profiles, the fidelity of the model becomes $\eta \approx {\overline{\rho}}_{{}_{\text{ER}}}(\overline{x}=1)/{\overline{\rho}}_{{}_{\text{EW}}}(\overline{x}=1)$. We calculate the fidelity for different choices of the first–order rate of enzyme–substrate binding (${k}_{\text{b}}={k}_{\text{on}}{\rho}_{{}_{\text{E}}}$); this may be thought of as varying the concentration of free enzyme in the cell. The results are shown in Figure 5 of the main text.
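The numerical procedure described in this subsection can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the scripts used for the paper. We assume the dimensionless steady-state system $0={\overline{\nabla}}^{2}{\overline{\rho}}_{{}_{\text{S}}}-({\overline{k}}_{\text{b}}+{\overline{k}}_{\text{p}}){\overline{\rho}}_{{}_{\text{S}}}+{\overline{k}}_{\text{off}}{\overline{\rho}}_{{}_{\text{ES}}}$ and $0={\overline{\nabla}}^{2}{\overline{\rho}}_{{}_{\text{ES}}}+{\overline{k}}_{\text{b}}{\overline{\rho}}_{{}_{\text{S}}}-{\overline{k}}_{\text{off}}{\overline{\rho}}_{{}_{\text{ES}}}$, with a kinase influx ${\overline{k}}_{\text{kin}}$ of free substrate at $\overline{x}=0$ and no flux elsewhere. The function names, the system length $L=10$ μm, the phosphatase rate, and the binding rates are hypothetical choices made only for illustration; the off-rates and $D$ are the values quoted in the text.

```python
import numpy as np

def steady_state_profiles(k_off, k_b, k_p, k_kin=1.0, dx=0.01):
    """Solve the assumed dimensionless steady-state system on [0, 1].

    Returns the grid, the free-substrate profile and the complex profile.
    """
    n = int(round(1.0 / dx)) + 1
    x = np.linspace(0.0, 1.0, n)
    h = x[1] - x[0]

    # Central-difference Laplacian; the ghost-point construction doubles the
    # off-diagonal entry in the first and last rows (zero-gradient part).
    lap = np.zeros((n, n))
    i = np.arange(1, n - 1)
    lap[i, i - 1] = 1.0
    lap[i, i] = -2.0
    lap[i, i + 1] = 1.0
    lap[0, 0], lap[0, 1] = -2.0, 2.0
    lap[-1, -1], lap[-1, -2] = -2.0, 2.0
    lap /= h**2

    eye = np.eye(n)
    # Unknown vector is [rho_S, rho_ES]; block rows are the two assumed equations.
    a = np.block([
        [lap - (k_b + k_p) * eye, k_off * eye],
        [k_b * eye, lap - k_off * eye],
    ])
    rhs = np.zeros(2 * n)
    rhs[0] = -2.0 * k_kin / h  # kinase influx at x = 0, moved to the right-hand side
    sol = np.linalg.solve(a, rhs)
    return x, sol[:n], sol[n:]


def fidelity(k_b, k_p, k_off_right, k_off_wrong):
    """Ratio of right to wrong complex densities at the production end x = 1."""
    es_right = steady_state_profiles(k_off_right, k_b, k_p)[2]
    es_wrong = steady_state_profiles(k_off_wrong, k_b, k_p)[2]
    return es_right[-1] / es_wrong[-1]


# Hypothetical parameters: L = 10 um is an assumption; D = 1 um^2/s is from the text.
TAU_D = 10.0**2 / 1.0   # diffusion time L^2 / D, in seconds
K_OFF_R = 0.1 * TAU_D   # cognate off-rate from the text, rescaled by tau_D
K_OFF_W = 1.0 * TAU_D   # noncognate off-rate from the text, rescaled by tau_D
K_P = 1.0 * TAU_D       # assumed phosphatase rate of 1 per second
K_B_VALUES = [0.001 * TAU_D, 0.01 * TAU_D, 0.1 * TAU_D]  # assumed binding rates

etas = [fidelity(k_b, K_P, K_OFF_R, K_OFF_W) for k_b in K_B_VALUES]
for k_b, eta in zip(K_B_VALUES, etas):
    print(f"k_b = {k_b:g} (dimensionless): fidelity = {eta:.3g}")
```

Because both substrates are solved with the same kinase influx, the fidelity in this sketch does not depend on the value of `k_kin`. A dense solve is used only because the grid is small; a sparse or banded solver would be the natural choice on finer grids.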
2. Energy dissipation
In Appendices 2.1 and 2.3, we estimated lower bounds on the minimum power that needs to be dissipated in order to counter the homogenizing effect that enzyme activity and substrate diffusion respectively have on localized substrate profiles. Here, we calculate the energy dissipation required to run the kinase/phosphatase-based mechanism and compare it with these lower bounds estimated earlier.
Let us assume that phosphorylation and dephosphorylation reactions by kinases and phosphatases are nearly irreversible with associated free energy costs of $\mathrm{\Delta}{\epsilon}_{\text{kin}}$ and $\mathrm{\Delta}{\epsilon}_{\text{phosph}}$ per reaction, respectively. The net rate at which active substrates get dephosphorylated is ${k}_{\text{p}}{S}_{\text{phosphorylated}}$ and it needs to be identical to the net phosphorylation rate of inactive substrates in order for ${S}_{\text{phosphorylated}}$ to remain constant. With the costs of each reaction known, we can write the rate of energy dissipation ${P}_{\text{k/p}}$ as ${P}_{\text{k/p}}={k}_{\text{p}}{S}_{\text{phosphorylated}}(\mathrm{\Delta}{\epsilon}_{\text{kin}}+\mathrm{\Delta}{\epsilon}_{\text{phosph}})$.
To gain analytical intuition, we first consider the case where the enzyme activity is very low, so that the kinase/phosphatase–based mechanism maintains an exponential profile of active substrates with a decay length scale ${\lambda}_{{}_{\text{S}}}=\sqrt{{D}_{{}_{\text{S}}}/{k}_{\text{p}}}$. Expressing the rate of dephosphorylation in terms of ${\lambda}_{{}_{\text{S}}}$ and ${D}_{{}_{\text{S}}}$ (i.e., ${k}_{\text{p}}={D}_{{}_{\text{S}}}/{\lambda}_{{}_{\text{S}}}^{2}$), and substituting it into Equation S126, we obtain ${P}_{\text{k/p}}=({D}_{{}_{\text{S}}}/{\lambda}_{{}_{\text{S}}}^{2}){S}_{\text{phosphorylated}}(\mathrm{\Delta}{\epsilon}_{\text{kin}}+\mathrm{\Delta}{\epsilon}_{\text{phosph}})$.
Comparing this result with the lower dissipation bound found earlier (Equation S43), we can note the presence of an extra factor $\beta (\mathrm{\Delta}{\epsilon}_{\text{kin}}+\mathrm{\Delta}{\epsilon}_{\text{phosph}})$. Since the free energy consumption during ATP hydrolysis is $\sim 10{k}_{\text{B}}T$, we can say that the power dissipated by the kinase/phosphatase system for setting up an exponential gradient surpasses the lower limit necessary for counteracting diffusion roughly by an order of magnitude.
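As a rough numerical illustration of this comparison, the sketch below evaluates the dissipated power for an exponential profile, taking ${P}_{\text{k/p}}={k}_{\text{p}}{S}_{\text{phosphorylated}}(\mathrm{\Delta}{\epsilon}_{\text{kin}}+\mathrm{\Delta}{\epsilon}_{\text{phosph}})$ with ${k}_{\text{p}}={D}_{{}_{\text{S}}}/{\lambda}_{{}_{\text{S}}}^{2}$, as the text describes. Energies are measured in units of ${k}_{\text{B}}T$, so the extra factor $\beta (\mathrm{\Delta}{\epsilon}_{\text{kin}}+\mathrm{\Delta}{\epsilon}_{\text{phosph}})$ is simply the sum of the two costs. The lower bound is not taken from Equation S43 directly; it is inferred here by dividing out that factor. The function name, the number of phosphorylated substrates, and the decay length are hypothetical.

```python
def kinase_phosphatase_power(n_phos, diff_coeff, decay_length, d_eps_kin, d_eps_phosph):
    """Dissipated power in k_B T per second for an exponential substrate profile."""
    k_p = diff_coeff / decay_length**2  # dephosphorylation rate in 1/s
    return k_p * n_phos * (d_eps_kin + d_eps_phosph)


# Hypothetical inputs; the ~10 k_B T cost per reaction is the rough figure in the text.
N_PHOS = 1000.0      # assumed number of phosphorylated substrates
D_S = 1.0            # substrate diffusion coefficient, um^2/s
LAMBDA_S = 1.0       # assumed decay length of the substrate profile, um
D_EPS_KIN = 10.0     # cost per phosphorylation, in k_B T
D_EPS_PHOSPH = 10.0  # cost per dephosphorylation, in k_B T

power = kinase_phosphatase_power(N_PHOS, D_S, LAMBDA_S, D_EPS_KIN, D_EPS_PHOSPH)
excess_factor = D_EPS_KIN + D_EPS_PHOSPH  # beta * (sum of costs), energies in k_B T
lower_bound = power / excess_factor       # inferred bound for counteracting diffusion

print(f"power = {power:g} k_B T/s, inferred lower bound = {lower_bound:g} k_B T/s")
print(f"excess factor = {excess_factor:g}")
```

With these inputs the mechanism dissipates about 20 times the inferred bound, which is consistent with the order-of-magnitude gap described above.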
Next, we explore the energetics of the kinase/phosphatase-based mechanism in the context of the power–fidelity tradeoff. Our study of the tradeoff in Figure 4 of the main text was performed under the assumption that substrate profiles were exponentially decaying in the entire spatial domain. In Appendix 6—figure 1a, we show the tradeoff curves obtained under this assumption and compare them with the tradeoff curve for the kinase/phosphatase-based mechanism that arises in response to changing the substrate localization by tuning ${k}_{\text{p}}$. As can be seen, the predicted lower bound (sum of the minimum powers needed to counteract the enzyme action and substrate diffusion) is roughly an order of magnitude lower than the total dissipation of the mechanism, and this difference increases with higher fidelity.
Note, however, that the assumption about an exponential substrate localization is not generally valid for the kinase/phosphatase-based mechanism because substrates can be deposited in low–concentration regions and not get immediately dephosphorylated (Appendix 6—figure 1c). We therefore refine our lower bounds on the dissipated power by estimating them numerically using their generic definitions, namely, Equation S30 for counteracting enzymatic action, and Equation S42 for counteracting substrate diffusion. These refined estimates suggest a factor of ∼10 difference between the total cost and its lower bound consistently across a wide region of the tradeoff curve. This means that maintaining substrate gradients through practically irreversible phosphorylation and dephosphorylation reactions is an energetically inefficient way of doing spatial proofreading, although it may be sustainable depending on the energy budget of the cell.
Data availability
All scripts used to generate the data for making the plots are provided in supporting files.
References

Gradients in the self-organization of the mitotic spindle. Trends in Cell Biology 16:125–134. https://doi.org/10.1016/j.tcb.2006.01.005

Kinase- and phosphatase-anchoring proteins: harnessing the dynamic duo. Nature Cell Biology 4:E203–E206. https://doi.org/10.1038/ncb0802-e203

Spatial gradients of cellular phospho-proteins. FEBS Letters 457:452–454. https://doi.org/10.1016/S0014-5793(99)01058-3

Mechanisms of tail-anchored membrane protein targeting and insertion. Annual Review of Cell and Developmental Biology 33:417–438. https://doi.org/10.1146/annurev-cellbio-100616-060839

Thermodynamic constraints on kinetic proofreading in biosynthetic pathways. Biophysical Journal 31:333–358. https://doi.org/10.1016/S0006-3495(80)85063-6

Quality control in the endoplasmic reticulum. Nature Reviews Molecular Cell Biology 4:181–191. https://doi.org/10.1038/nrm1052

A nuclear F-actin scaffold stabilizes ribonucleoprotein droplets against gravity in large cells. Nature Cell Biology 15:1253–1259. https://doi.org/10.1038/ncb2830

T-cell receptor binding kinetics in T-cell development and activation. Expert Reviews in Molecular Medicine 3:1–17. https://doi.org/10.1017/S1462399401002502

Mathematical and computational models of immune-receptor signalling. Nature Reviews Immunology 4:445–456. https://doi.org/10.1038/nri1374

Minimum energetic cost to maintain a target nonequilibrium state. Physical Review E 95:042102. https://doi.org/10.1103/PhysRevE.95.042102

Diffusion control of protein phosphorylation in signal transduction pathways. Biochemical Journal 350:901–907. https://doi.org/10.1042/bj3500901

Four-dimensional organization of protein kinase signaling cascades: the roles of diffusion, endocytosis and molecular motors. Journal of Experimental Biology 206:2073–2082. https://doi.org/10.1242/jeb.00298

Cell-signalling dynamics in time and space. Nature Reviews Molecular Cell Biology 7:165–176. https://doi.org/10.1038/nrm1838

Spatially distributed cell signalling. FEBS Letters 583:4006–4012. https://doi.org/10.1016/j.febslet.2009.09.045

DNA replication fidelity. Journal of Biological Chemistry 279:16895–16898. https://doi.org/10.1074/jbc.R400006200

The energy-speed-accuracy tradeoff in sensory adaptation. Nature Physics 8:422–428. https://doi.org/10.1038/nphys2276

The cost of sensitive response and accurate adaptation in networks with an incoherent type-1 feed-forward loop. Journal of the Royal Society Interface 10:20130489. https://doi.org/10.1098/rsif.2013.0489

Non-vesicular lipid transport by lipid-transfer proteins and beyond. Nature Reviews Molecular Cell Biology 11:739–750. https://doi.org/10.1038/nrm2971

Discriminatory proofreading regimes in nonequilibrium systems. Physical Review X 4:021016. https://doi.org/10.1103/PhysRevX.4.021016

Subcellular mRNA localisation at a glance. Journal of Cell Science 127:2127–2133. https://doi.org/10.1242/jcs.114272

Reducing intrinsic biochemical noise in cells and its thermodynamic limit. Journal of Molecular Biology 362:387–392. https://doi.org/10.1016/j.jmb.2006.07.068

Fidelity of aminoacyl-tRNA selection on the ribosome: kinetic and structural mechanisms. Annual Review of Biochemistry 70:415–435. https://doi.org/10.1146/annurev.biochem.70.1.415

Thermodynamics of error correction. Physical Review X 5:041039. https://doi.org/10.1103/PhysRevX.5.041039

Signaling cascades as cellular devices for spatial computations. Journal of Mathematical Biology 58:35–55. https://doi.org/10.1007/s00285-008-0162-6

The role of proofreading in signal transduction specificity. Biophysical Journal 82:2928–2933. https://doi.org/10.1016/S0006-3495(02)75633-6

RNA polymerase fidelity and transcriptional proofreading. Current Opinion in Structural Biology 19:732–739. https://doi.org/10.1016/j.sbi.2009.10.009
Decision letter

Ahmet Yildiz, Reviewing Editor; University of California, Berkeley, United States

Aleksandra M Walczak, Senior Editor; École Normale Supérieure, France
In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.
Thank you for submitting your article "Proofreading through spatial gradients" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Aleksandra Walczak as the Senior Editor. The reviewers have opted to remain anonymous.
The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.
As the editors have judged that your manuscript is of interest, but as described below that substantial revisions are required before it is published, we would like to draw your attention to changes in our revision policy that we have made in response to COVID-19 (https://elifesciences.org/articles/57162). First, because many researchers have temporarily lost access to the labs, we will give authors as much time as they need to submit revised manuscripts. We are also offering, if you choose, to post the manuscript to bioRxiv (if it is not already there) along with this decision letter and a formal designation that the manuscript is "in revision at eLife". Please let us know if you would like to pursue this option. (If your work is more suitable for medRxiv, you will need to post the preprint yourself, as the mechanisms for us to do so are still in development.)
Summary:
In the manuscript by Galstyan et al. on "Proofreading through spatial gradients", the authors proposed and studied a new kinetic proofreading (KP) model/scheme based on having a spatial gradient of the substrate (both "correct" and "wrong" ones) and the diffusive transport of the substrate-bound enzyme molecules to a spatially localized production site. The authors did an excellent job in explaining their new model and its connection and difference with regards to the classical Hopfield-Ninio KP mechanism. The key insight is that with spatial inhomogeneity, e.g., in the presence of a persistent spatial gradient for the enzyme or the substrate, one can consider spatial location as a state variable. By having the substrate and product (or production site) at different spatial locations, these spatial degrees of freedom of the enzyme, i.e., enzymes at different physical locations, can be considered as the intermediate states that are necessary for kinetic proofreading – each intermediate state contributes a certain probability for error correction. In the original Hopfield-Ninio KP scheme, the intermediate state is provided by additional enzyme(s), whereas in this new KP scheme, it depends on having a spatial gradient, which the authors argue is more tunable. The reviewers were enthusiastic about the theoretical model presented in this study because of its simplicity and elegance. However, the reviewers have also raised serious concerns (see Essential Revisions for detail) that need to be addressed in order to consider the manuscript further for publication in eLife. In summary, the panel feels that discussion of possible biological example(s) where this novel type of proofreading may be occurring would significantly improve the manuscript's appeal to a broad audience. In addition, the reviewers ask for a more explicit explanation of the effect of enzymatic catalysis rates, and discussion of the full dissipation cost in the revised manuscript.
Essential revisions:
1) The major concern of the reviewer panel is how relevant this mechanism is for realistic biological systems. The original Hopfield-Ninio KP mechanism was motivated by specific and important biological problems (puzzles), namely the unusually high fidelity in biochemical synthesis processes (in comparison with its equilibrium value). In this manuscript, the theory is developed without a specific biological system or specific biological question in mind. It is true that spatial gradients exist across biological systems and the authors also showed that typical kinetic rates may fall in the functional range of this new gradient-dependent kinetic proofreading mechanism. But, what is the function of the original system that such a kinetic proofreading process can help improve? Is it biochemical synthesis? Do the authors envision "correct" and "wrong" biomolecules being produced at the production site (x=L) like in the original setting of Hopfield-Ninio? Or is it signaling like in the T-cell signaling case? If so, do the authors envision that both the correct signaling molecule and the incorrect signaling molecule have a spatial gradient and they can both be carried by the same enzyme to their functional sites? The panel is not asking for a detailed comparison with a specific system, but a known biological phenomenon that may be explained by this new mechanism would help motivate the mostly biologist audience of eLife. Furthermore, a connection to a specific biological system could also lead to testable predictions that would ultimately verify (or falsify) the existence of this mechanism.
2) The entire manuscript assumes that catalysis is negligible and thus need not be explicitly modeled in solving for the steady-state distributions. How would incorporating a boundary condition at the right that involves non-negligible catalysis change (even qualitatively) your findings? To be more specific, there is a production rate r for the enzymatic reaction at x=L where the enzyme is active. However, the effect of this reaction, which changes ES → E+P, is not considered in the model equations (Equations 1-3). Is it because r is considered to be small? If so, smaller than what? Since speed is directly related to r, how does the value of r affect the speed and the speed-accuracy tradeoff?
3) When quantifying the energetic costs, the main text solely focuses on the cost of counteracting the enzyme binding substrate, diffusing, and releasing. The appendix explores some theory for the other cost of maintaining the substrate gradients, but without reporting any absolute numbers. For the biologically plausible kinase/phosphatase substrate-maintenance mechanism explored in the main text, how does its cost compare to the cost that you study quantitatively in the main text? Specifically, where does Equation 8 come from? What's the physical meaning of P? The standard way to compute energy dissipation is by computing the entropy production rate S', which is well defined. Then by assuming the internal energy does not change with time in steady state, we equate energy dissipation with kT*S'. The form of the entropy production rate is known and can be found in textbooks (such as those from T. Hill) and papers (e.g., those from H. Qian and collaborators; and from U. Seifert and collaborators), and the formula given in Equation 8 does not seem to be consistent with the known form of entropy production. In particular, for a given reaction with forward flux J+ and backward flux J−, the entropy production rate is (J+ − J−) ln(J+/J−), which can be easily shown to be non-negative and equal to 0 only when detailed balance J+ = J− is satisfied.
4) The same concentration profiles are assumed for the right substrate R and the wrong substrate W. This is a strong assumption. Could the authors consider the case where the concentration gradient length of the wrong substrate profile is larger than this length for the right substrate but still smaller than the distance L? They may calculate a series of fidelity curves with increasing λ_{W} and the same λ_{R}. How will proofreading change?
https://doi.org/10.7554/eLife.60415.sa1
Author response
We are deeply grateful to the reviewers for the variety of very interesting suggestions and critiques that they have made. These remarks led the author team to several months of lively exchanges and precipitated a number of new and interesting calculations which are now in the paper or the supporting information. We have addressed all of the comments in detail, in many cases adding new calculations to the manuscript and we believe that the paper is much improved. We hope that the revised manuscript will now be viewed as suitable for publication.
The one comment that we wanted to address in a more circumspect fashion was the first comment concerning biological examples of our new proofreading hypothesis. We have several points to make here. First, in developing this new proofreading concept we were inspired by the ubiquitous phenomenon of allostery, the fact that many proteins change their state of activity upon binding to a relevant ligand. Further, many allosteric proteins are membrane bound. As a result, there is a plethora of examples where protein localization in conjunction with allostery provides a plausible basis for the kind of proofreading we envisage. Although the reviewers ask for biological examples, which we describe below, in our view the broad reach of allostery is already a strong plausibility argument for the kind of proofreading mechanisms we suggest here. A second, more philosophical remark is simply to hope that the reviewers are open to the idea that there is something very powerful about theoretical ideas being ahead of experiments. For example, the idea of depletion forces was hypothesized by Asakura and Oosawa long before there was any data. With that in mind, we hope that the reviewers are open to the argument that this kind of interplay between theory and experiment, in which a theoretical idea is ahead of the experiments, is a potent tool for engendering new experiments. Indeed, the whole notion of positional information in the setting of morphogenesis is quite related to our work and we are inspired by the possibility of using synthetic biology approaches to explicitly construct the kind of mechanism we have hypothesized. In this era of synthetic biology, even if as yet there were no definitive natural examples of the mechanism we propose here, we are confident that this mechanism could be built using the tools of synthetic biology along the lines of the two papers that appeared in Science several weeks ago (Toda et al. and Stapornwongkul et al.) in which GFP was artificially used as a morphogen.
In summary, again, we are deeply grateful to the reviewers for many thoughtful and helpful comments. We have addressed all of them, though the question of biological examples is slightly nuanced.
Essential revisions:
1) The major concern of the reviewer panel is how relevant this mechanism is for realistic biological systems. The original Hopfield-Ninio KP mechanism was motivated by specific and important biological problems (puzzles), namely the unusually high fidelity in biochemical synthesis processes (in comparison with its equilibrium value). In this manuscript, the theory is developed without a specific biological system or specific biological question in mind. It is true that spatial gradients exist across biological systems and the authors also showed that typical kinetic rates may fall in the functional range of this new gradient-dependent kinetic proofreading mechanism. But, what is the function of the original system that such a kinetic proofreading process can help improve? Is it biochemical synthesis? Do the authors envision "correct" and "wrong" biomolecules being produced at the production site (x=L) like in the original setting of Hopfield-Ninio? Or is it signaling like in the T-cell signaling case? If so, do the authors envision that both the correct signaling molecule and the incorrect signaling molecule have a spatial gradient and they can both be carried by the same enzyme to their functional sites? The panel is not asking for a detailed comparison with a specific system, but a known biological phenomenon that may be explained by this new mechanism would help motivate the mostly biologist audience of eLife. Furthermore, a connection to a specific biological system could also lead to testable predictions that would ultimately verify (or falsify) the existence of this mechanism.
We thank the panel for urging us to propose more concrete biological examples where spatial proofreading could potentially be in play and for the questions about the implementation of our scheme. We have added several such examples in the Discussion section. As for the specific questions, in the processes discussed, it is indeed the case that the same enzyme/mediator protein transports both right and wrong substrates which either have a spatial gradient or are ideally localized at a membrane–bound compartment. The “product” of the reaction is the delivery of the substrate at the target site. This can either be the ultimate purpose of the pathway or be followed by biological synthesis. Specifically, the first example we discuss is related to spatially localized protein synthesis often seen in polarized, asymmetric cells. Designated ribonucleoproteins bind specific mRNAs near the cell nucleus and transport them to the localization site (e.g., the bud tip of a dividing cell, the lamellipodia or axonal growth cones) where synthesis occurs. mRNAs that are released during transport are subjected to degradation which prevents protein synthesis in the cytosol that, if it happened, could be toxic or deleterious to the cell (Parton, et al., 2014, Martin and Ephrussi, 2009). Another example is the nonvesicular transport of phospholipids between different membrane–bound compartments of the cell. This is achieved through lipid–transfer proteins that cycle between the donor and acceptor compartments and transfer specific lipids (Lev, 2010). Transport efficiency was mentioned in the review paper by Lev as an important performance metric dictated by the diffusion distance and we think it would be interesting to address the question of optimal architecture from the perspective of fidelity–transport efficiency (or, speed) tradeoff. 
At the end of the Discussion section, we also mentioned a few other processes involving compartmentalized parts of the cell where our proposed scheme may be applicable. Experimental studies of these processes in in vivo and in vitro reconstituted settings in light of the signature features of the spatial proofreading mechanism will reveal if and to what extent it is used in cells. Lastly, we are very enthusiastic about the use of tools from synthetic biology to explicitly design and construct an in vivo example of our concept. Recent work on synthetic morphogen gradients (Toda et al., 2020; Stapornwongkul et al., 2020) foreshadows these possibilities.
2) The entire manuscript assumes that catalysis is negligible and thus need not be explicitly modeled in solving for the steady-state distributions. How would incorporating a boundary condition at the right that involves non-negligible catalysis change (even qualitatively) your findings? To be more specific, there is a production rate r for the enzymatic reaction at x=L where the enzyme is active. However, the effect of this reaction, which changes ES → E+P, is not considered in the model equations (Equations 1-3). Is it because r is considered to be small? If so, smaller than what? Since speed is directly related to r, how does the value of r affect the speed and the speed-accuracy tradeoff?
We thank the reviewer for raising this important point. We have now rerun our analysis with modified boundary conditions to account for finite catalysis rates (see Appendix 3). We showed that the performance of the spatial proofreading model depends on catalysis in the same qualitative way as with classical proofreading, namely, faster catalysis reduces the effectiveness of the final step of the proofreading cascade and the lowest error is achieved in the limit of slow catalysis. We derived exact analytical conditions for each regime and showed that fidelity reduction due to fast catalysis is bounded by a factor of η_{eq}. We also showed that despite this reduction, higher catalysis rates in fact improve the Pareto–optimal front of the speed–fidelity tradeoff. Specifically, to maximize speed for a given fidelity value, catalysis needs to be fast with a corresponding slowdown of diffusion. We comment on these new findings in the Results subsections “Slow Transport of Enzymatic Complex Enables Proofreading” and “Navigating the Speed–Fidelity Trade-Off”.
3) When quantifying the energetic costs, the main text solely focuses on the cost of counteracting the enzyme binding substrate, diffusing, and releasing. The appendix explores some theory for the other cost of maintaining the substrate gradients, but without reporting any absolute numbers. For the biologically plausible kinase/phosphatase substrate-maintenance mechanism explored in the main text, how does its cost compare to the cost that you study quantitatively in the main text? Specifically, where does Equation 8 come from? What's the physical meaning of P? The standard way to compute energy dissipation is by computing the entropy production rate S', which is well defined. Then by assuming the internal energy does not change with time in steady state, we equate energy dissipation with kT*S'. The form of the entropy production rate is known and can be found in textbooks (such as those from T. Hill) and papers (e.g., those from H. Qian and collaborators; and from U. Seifert and collaborators), and the formula given in Equation 8 does not seem to be consistent with the known form of entropy production. In particular, for a given reaction with forward flux J+ and backward flux J−, the entropy production rate is (J+ − J−) ln(J+/J−), which can be easily shown to be non-negative and equal to 0 only when detailed balance J+ = J− is satisfied.
In Appendix 6, subsection “Energy dissipation”, we calculated the total power dissipated in the kinase/phosphatase–based mechanism using a rough estimate for the dissipated energy per phosphorylation and dephosphorylation event (∼10 k_{B}T each). We demonstrated that this cost exceeds our estimated lower bound on the proofreading cost (Figure 4 of the main text) as well as the minimum cost required for localizing substrates by roughly an order of magnitude for a wide region of the dissipation–fidelity tradeoff curve (Appendix 6—figure 1), suggesting that for the purposes of spatial proofreading the energetic efficiency of the kinase/phosphatase–based mechanism is low. We mention this feature in the subsection “Proofreading by Biochemically Plausible Intracellular Gradients” of the main text.
In addition, we elaborated our discussion of the proofreading cost in Appendix 2, subsection “Derivation of the minimum dissipated power” and showed that our definition of power in Equation 8 of the main text in fact matches identically with the classical nonequilibrium thermodynamic definition expressed in terms of fluxes and thermodynamic forces (in the aforementioned subsection). The reason for their identity is the fact that driving forces are nonzero only for substrate binding/unbinding events and not for enzyme diffusion. Adding contributions from binding/unbinding events across the entire compartment leads to the proposed expression for power (Equation 8).
4) The same concentration profiles are assumed for the right substrate R and the wrong substrate W. This is a strong assumption. Could the authors consider the case where the concentration gradient length of the wrong substrate profile is larger than this length for the right substrate but still smaller than the distance L? They may calculate a series of fidelity curves with increasing λ_{W} and the same λ_{R}. How will proofreading change?
In our main analysis, the assumption of equal concentration profiles for both substrates allows us to focus on discrimination due to the proofreading mechanism itself. We did not want to implicitly assume any discrimination of substrates other than the difference in their off-rates. By analogy, in classical proofreading models, one assumes that ATP hydrolysis (or any other energy consumption mechanism) itself does not discriminate between the substrates and that both substrates are present in equal amounts, even if neither is true in reality.
To understand how effects raised by the reviewers layer on top of the proofreading discrimination described in the main text, Appendix 4 now explores fidelity for unequal values of λ_{W} and λ_{R}. As anticipated, fidelity goes down with shallower gradients of wrong substrates. We note that, at least for the kinase/phosphatase-based gradient formation mechanism, it is in fact the right substrates that have a shallower gradient and not the wrong ones (e.g., see Appendix 6—figure 1C), which makes our assumption of equal concentration profiles a conservative one. This happens because wrong substrates unbind earlier in transport, and hence, are easier to localize than right substrates which are more likely to unbind closer to the production end. The curves plotted in Appendix 4—figure 1 demonstrate this in the λ_{W} < λ_{R} region of the parameter space.
https://doi.org/10.7554/eLife.60415.sa2
Article and author information
Author details
Funding
James S. McDonnell Foundation
 Kabir Husain
Simons Foundation
 Arvind Murugan
John Templeton Foundation
 Rob Phillips
National Institute of General Medical Sciences
 Rob Phillips
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank Anatoly Kolomeisky, Shu-ou Shan and Erik Winfree for insightful discussions, and Soichi Hirokawa and Avi Flamholz for providing useful feedback on the manuscript. We also thank Alexander Grosberg whose idea of a compartmentalized ‘rotary demon’ motivated the development of our model. This work was supported by the NIH Grant 1R35 GM118043-01, the John Templeton Foundation Grants 51250 and 60973 (to RP), a James S. McDonnell Foundation postdoctoral fellowship (to KH), and the Simons Foundation (AM).
Senior Editor
 Aleksandra M Walczak, École Normale Supérieure, France
Reviewing Editor
 Ahmet Yildiz, University of California, Berkeley, United States
Publication history
 Received: June 26, 2020
 Accepted: December 24, 2020
 Accepted Manuscript published: December 24, 2020 (version 1)
 Version of Record published: January 18, 2021 (version 2)
Copyright
© 2020, Galstyan et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.