Proofreading through spatial gradients
Abstract
Key enzymatic processes use the nonequilibrium error correction mechanism called kinetic proofreading to enhance their specificity. The applicability of traditional proofreading schemes, however, is limited because they typically require dedicated structural features in the enzyme, such as a nucleotide hydrolysis site or multiple intermediate conformations. Here, we explore an alternative conceptual mechanism that achieves error correction by having substrate binding and subsequent product formation occur at distinct physical locations. The time taken by the enzyme–substrate complex to diffuse from one location to another is leveraged to discard wrong substrates. This mechanism does not have the typical structural requirements, making it easier to overlook in experiments. We discuss how the length scales of molecular gradients dictate proofreading performance, and quantify the limitations imposed by realistic diffusion and reaction rates. Our work broadens the applicability of kinetic proofreading and sets the stage for studying spatial gradients as a possible route to specificity.
Introduction
The nonequilibrium mechanism called kinetic proofreading (Hopfield, 1974; Ninio, 1975) is used for reducing the error rates of many biochemical processes important for cell function (e.g. DNA replication [Kunkel, 2004], transcription [Sydow and Cramer, 2009], translation [Rodnina and Wintermeyer, 2001; Ieong et al., 2016], signal transduction [Swain and Siggia, 2002], or pathogen recognition [McKeithan, 1995; Goldstein et al., 2004; Cui and Mehta, 2018]). Proofreading mechanisms operate by inducing a delay between substrate binding and product formation via intermediate states for the enzyme–substrate complex. Such a delay gives the enzyme multiple chances to release the wrong substrate after initial binding, allowing far lower error rates than what one would expect solely from the binding energy difference between right and wrong substrates.
Traditional proofreading schemes require dedicated molecular features such as an exonuclease pocket in DNA polymerases (Kunkel, 2004) or multiple phosphorylation sites on T-cell receptors (McKeithan, 1995; Goldstein et al., 2004); such features create intermediate states that delay product formation (Figure 1a) and thus allow proofreading. Additionally, since proofreading is an active nonequilibrium process often involving near–irreversible reactions, the enzyme typically needs to have an ATP or GTP hydrolysis site to enable the use of energy supplies of the cell (Yamane and Hopfield, 1977; Rodnina and Wintermeyer, 2001). Due to such stringent structural requirements, the number of confirmed proofreading enzymes is relatively small. Furthermore, generic enzymes without such dedicated features are assumed to not have active error correction available to them.
In this work, we propose an alternative scheme where the delay between initial substrate binding and product formation steps is achieved by separating these events in space. If substrates are spatially localized and product formation is favorable only in a region of low substrate concentration where an activating effector is present then the time taken by the enzyme–substrate complex to travel from one location to the other can be used to discard the wrong substrates, which are assumed to unbind from the enzyme more readily than the right substrates (Figure 1b). When this delay is longer than substrate unbinding time scales, very low error rates of product formation can be achieved, allowing this spatial proofreading scheme to outperform biochemical mechanisms with a finite number of proofreading steps.
In contrast to traditional proofreading, the nonequilibrium mechanism here does not require any direct energy consumption by the enzyme or substrate itself (e.g. through ATP hydrolysis). This liberates the enzyme from any proofreading-specific molecular features; indeed, any ‘equilibrium’ enzyme with a localized effector can proofread using our scheme if appropriate concentration gradients of the substrates or enzymes are set up. In this way, the energetic and structural requirements of proofreading can be outsourced from the enzyme and substrate to the gradient maintaining mechanism. It also means that spatial proofreading is easy to overlook in experiments, and that the fidelity of reconstituted reactions in vitro could be lower than the fidelity in vivo.
The lack of reliance on structure makes spatial proofreading more adaptable. We study how tuning the length scale of concentration gradients can trade off error rate against speed and energy consumption on the fly. In contrast, traditional proofreading schemes rely on nucleotide chemical potentials, for example, the out of equilibrium [ATP]/[ADP] ratio in the cell, and cannot modulate their operation without broader physiological disruptions.
Our proposed scheme can be leveraged for specificity if appropriate concentration gradients are set. Such gradients arise in multiple cellular contexts (e.g. near the nucleus, the plasma membrane, the Golgi apparatus, the endoplasmic reticulum [ER], kinetochores, microtubules [Bivona et al., 2003; Caudron et al., 2005; Kholodenko, 2006]) and several gradient-forming mechanisms have been discussed in the literature (Wu et al., 2018; Kholodenko, 2006; Kholodenko, 2003). We conclude our analysis of spatial proofreading by quantifying its limitations as set by realistic reaction rates and gradient formation mechanisms, and discuss examples from the literature, including the localization of mRNAs in polarised cells, and the non-vesicular transport of lipids in eukaryotic cells, in which this mechanism might be in play. Our work motivates a detailed investigation of spatial structures and compartmentalization in living cells as possible delay mechanisms for proofreading enzymatic reactions.
Results
Slow transport of enzymatic complex enables proofreading
Our proposed scheme is based on spatially separating substrate binding and product formation events for the enzyme (Figure 1b). Such a setting arises naturally if substrates are spatially localized by having concentration gradients in a cellular compartment. Similarly, an effector needed for product formation (e.g. through allosteric activation) may have a spatial concentration gradient localized elsewhere in that compartment. To keep our model simple, we assume that the right (R) and wrong (W) substrates have identical concentration gradients of length scale but that the effector is entirely localized to one end of the compartment, for example via membrane tethering. In Appendix 4, we extend our study of model performance to the scenario where the two substrates have different localization length scales.
We model our system using coupled reaction–diffusion equations for the substrate-bound (‘ES’ with ) and free (‘E’) enzyme densities, namely,
Here, D is the enzyme diffusion constant, and (with ) are the substrate binding and unbinding rates, respectively, and is the spatially localized substrate concentration profile which we take to be exponentially decaying, which is often the case for profiles created by cellular gradient formation mechanisms (Driever and Nüsslein-Volhard, 1988; Brown and Kholodenko, 1999). We limit our discussion to this one-dimensional setting of the system, though our treatment can be generalized to two and three dimensions in a straightforward way.
The above model does not explicitly account for several effects relevant to living cells, such as depletion of substrates or distinct diffusion rates for the free and substrate-bound enzymes. More importantly, it does not account for the mechanism of substrate gradient formation. We analyze a biochemically detailed model with this latter feature and experimentally constrained parameters later in the paper. Here, we proceed with the minimal model above for explanatory purposes. To identify the key determinants of the model’s performance, we assume throughout our analysis that the amount of substrates is sufficiently low that the enzymes are mostly free with a roughly uniform profile (i.e. ). This assumption makes Equations (1-3) linear and allows us to solve them analytically at steady state. We demonstrate in Appendix 5 that proofreading is, in fact, most effective under this assumption and discuss the consequences of having high substrate amounts on the performance of the scheme.
In our simplified picture, enzyme activation and catalysis take place upon reaching the right boundary at a rate r that is identical for both substrates. Therefore, the density of substrate–bound enzymes at the right boundary can be taken as a proxy for the rate of product formation , since
where L is the size of the compartment. In order to keep the analytical results concise and intuitive, we perform our main analyses under the assumption that catalysis is slow, mirroring the study of traditional proofreading schemes (Hopfield, 1974). In Appendix 3, we derive the precise conditions under which this treatment is valid, and generalize our analysis to arbitrary catalysis rates.
To demonstrate the proofreading capacity of the model, we first analyze the limiting case where substrates are localized to the left end of the compartment (). In this limit, the fidelity , defined as the number of right products formed per single wrong product, becomes
where is the equilibrium fidelity, and is the characteristic time scale of diffusion across the compartment (see Appendix 1 for the derivation).
Equation 5 is plotted in Figure 2 for a family of different parameter values. As can be seen, when diffusion is fast (small ), fidelity converges to its equilibrium value and proofreading is lost (). Conversely, when diffusion is slow (large ), the enzyme undergoes multiple rounds of binding a substrate at the left end and unbinding midway until it manages to diffuse across the whole compartment as a complex and form a product. These rounds serve as ‘futile cycles’ that endow the system with proofreading. In this regime, fidelity scales as
To get further insights, we introduce an effective number of extra biochemical intermediates (n) that a traditional proofreading scheme would need to have in order to yield the same fidelity, that is . We calculate this number as (see Appendix 1)
Notably, since , the result above suggests a linear relationship between the effective number of proofreading realizations and the compartment size (). In addition, because the right-hand side of Equation 7 is an increasing function of , the proofreading efficiency of the scheme rises with larger differences in substrate off-rates (Figure 2) – a feature that ‘hard–wired’ traditional proofreading schemes with a fixed number of proofreading steps lack.
Navigating the speed–fidelity trade-off
As is inherent to all proofreading schemes, the fidelity enhancement described earlier comes at a cost of reduced product formation speed. This reduction, in our case, happens because of increased delays in diffusive transport. Here, we explore the resulting speed–fidelity trade-off and its different regimes by varying two of the model parameters: diffusion time scale and the substrate localization length scale .
Speed and fidelity for different sampled values of and are depicted in Figure 3a. As can be seen, for a fixed , the reduction of can trade off fidelity against speed. This trade-off is intuitive; with tighter substrate localization, the complexes are formed closer to the left boundary. Hence, a smaller fraction of complexes reach the activation region, reducing reaction speed. The Pareto-optimal front of the trade-off over the whole parameter space, shown as a red curve on the plot, is reached in the limit of ideal substrate localization (). Varying the diffusion time scale allows one to navigate this optimal trade-off curve and access different performance regimes.
Specifically, if the diffusion time scale is fast compared with the time scales of substrate unbinding (i.e. ), then both right and wrong complexes that form near the left boundary arrive at the activation region with high probability, resulting in high speeds, although at the expense of error–prone product formation (Figure 3b, top). In the opposite limit of slow diffusion, both types of complexes have exponentially low densities at the activation region, but due to the difference in substrate off-rates, production is highly accurate (Figure 3b, bottom). There also exists an intermediate regime where a significant fraction of right complexes reach the activation region while the vast majority of wrong complexes do not (Figure 3b, middle). As a result, an advantageous trade-off is achieved where a moderate decrease in the production rate yields high fidelity enhancement – a feature that was also identified in multi-step traditional proofreading models (Murugan et al., 2012).
In Appendix 3, we also study this trade-off caused by varying the catalysis rate . Briefly, we find that when all other parameters are fixed, increasing trades off fidelity against speed in a linear fashion, with the ratio of highest and lowest fidelity values falling in the range. The Pareto–optimal front of the trade-off, however, monotonically shifts toward the higher speed region, suggesting that faster catalysis is, in fact, more favorable if the diffusion time scale can be adjusted accordingly (see Appendix 3 for details).
We saw in Figure 3a that in the case of ideal substrate localization, the slowdown of diffusive transport necessarily reduced the production rate and increased the fidelity. The latter part of this statement, however, breaks down when substrate gradients are weak. Indeed, fidelity exhibits a non-monotonic response to tuning when the substrate gradient length scale is non-zero (Figure 3c). The reason for the eventual decay in fidelity is the fact that with slower diffusion (larger ), substrate binding and unbinding events take place more locally and therefore, the right and wrong complex profiles start to resemble the substrate profile itself, which does not discriminate between the two substrate kinds. We show in Appendix 1 that the optimal diffusion time scale can be roughly approximated as , which increases monotonically with , consistent with the shifting peaks in Figure 3c.
Not surprisingly, the error–correcting capacity of the scheme improves with better substrate localization (lower ). For a fixed , the bulk of this improvement takes place when is tuned in a range set by the two key dimensionless numbers of the model, namely, and (Figure 3c, inset). In Appendix 1, we provide an analytical justification for this result. Taken together, these parametric studies uncover the operational principles of the spatial proofreading scheme and demonstrate how the speed–fidelity trade-off could be dynamically navigated as needed by tuning the key time and length scales of the model.
Energy dissipation and limits of proofreading performance
A hallmark signature of proofreading is that it is a nonequilibrium mechanism with an associated free energy cost. In our scheme, the enzyme itself is not directly involved in any energy-consuming reactions, such as hydrolysis. Instead, the free energy cost comes from maintaining the spatial gradient of substrates, which the enzymatic reaction tends to homogenize by releasing bound substrates in regions of low substrate concentration. As the activating effectors are assumed to be tethered at , they do not require dissipation to remain localized.
While mechanisms of substrate gradient maintenance may differ in their energetic efficiency, there exists a thermodynamically dictated minimum energy that any such mechanism must dissipate per unit time. We calculate this minimum power P as
Here is the net local binding flux of substrate ‘S’, and is the local chemical potential (see Appendix 2.1 for details). For substrates with an exponentially decaying profile considered here, the chemical potential is given by
where is the thermal energy scale. Notably, the chemical potential difference across the compartment, which serves as an effective driving force for the scheme, is set by the inverse of the nondimensionalized substrate localization length scale, namely,
where . This driving force is zero for a uniform substrate profile () and increases with tighter localization (lower ), as intuitively expected.
We used Equation 8 to study the relationship between dissipation and fidelity enhancement as we tuned for different choices of the diffusion time scale . As can be seen in Figure 4, power rises with increasing fidelity, diverging when fidelity reaches its asymptotic maximum given by Equation 5 in the large limit. For the bulk of each curve, power scales as the logarithm of fidelity, suggesting that a linear increase in dissipation can yield an exponential reduction in error. Notably, such a scaling relationship has also been calculated in the context of E. coli chemoreceptor adaptation (Lan et al., 2012). In particular, it was shown that the adaptation error decreases exponentially with energy dissipated through multiple methylation–demethylation cycles which are used to stabilize the activity state of the receptor. Analogies in the cost-performance trade-off across these functionally distinct mechanisms contribute to the search for overarching thermodynamic themes underlying cellular information processing (Lan et al., 2012; Lan and Tu, 2013; Horowitz et al., 2017; Sartori and Pigolotti, 2015).
The logarithmic scaling is achieved in our model when the driving force is in a range where most of the fidelity enhancement takes place, namely,
At the end of this range, the cost per substrate binding event approaches in units (see Appendix 2.1 for details). And beyond the range, additional error correction is attained at an increasingly higher cost.
Note that the power computed here does not include the baseline cost of creating the substrate gradient, which, for instance, would depend on the substrate diffusion constant. We only account for the additional cost to be paid due to the operation of the proofreading scheme which works to homogenize this substrate gradient. The baseline cost in our case is analogous to the work that ATP synthase needs to perform to maintain a nonequilibrium [ATP]/[ADP] ratio in the cell, whereas our calculated power is analogous to the rate of ATP hydrolysis by a traditional proofreading enzyme. We discuss these two classes of dissipation in greater detail in Appendix 2.3.
Just as the cellular chemical potential of ATP or GTP imposes a thermodynamic upper bound on the fidelity enhancement by any proofreading mechanism (Qian, 2006), the effective driving force imposes a similar constraint for the spatial proofreading model. This thermodynamic limit depends only on the available chemical potential and is equal to . This limit can be approached very closely by our model, which for achieves the exponential enhancement with an additional linear prefactor, namely, (see Appendix 2.2). Such scaling behavior was theoretically accessible only to infinite-state traditional proofreading schemes (Qian, 2006; Ehrenberg and Blomberg, 1980). This offers a view of spatial proofreading as a procession of the enzyme through an infinite series of spatial filters and suggests that, from the perspective of peak error reduction capacity, our model outperforms the finite-state schemes.
Proofreading by biochemically plausible intracellular gradients
Our discussion of the minimal model thus far was not aimed at a particular biochemical system and thus did not involve the use of realistic reaction rates and diffusion constants typically seen in living cells. Furthermore, we did not account for the possibility of substrate diffusion, as well as for the homogenization of substrate concentration gradients due to enzymatic reactions, and have thereby abstracted away the gradient maintaining mechanism. The quantitative inspection of such mechanisms is important for understanding the constraints on spatial proofreading in realistic settings.
Here, we investigate proofreading based on a widely applicable mechanism for creating gradients by the spatial separation of two opposing enzymes (Stelling and Kholodenko, 2009; Bivona et al., 2003; Brown and Kholodenko, 1999). Consider a protein that in its free state is phosphorylated by a membrane-bound kinase and dephosphorylated by a delocalized cytoplasmic phosphatase, as shown in Figure 5a. This setup will naturally create a gradient of the active form of protein (), with the gradient length scale controlled by the rate of phosphatase activity (). Such mechanisms are known to create gradients of the active forms of MEK and ERK (Kholodenko, 2006), of GTPases such as Ran (with GEF and GAP [Kalab et al., 2002] playing the role of kinase and phosphatase, respectively), of cAMP (Kholodenko, 2006) and of stathmin oncoprotein 18 (Op18) (Bastiaens et al., 2006; Niethammer et al., 2004) near the plasma membrane, the Golgi apparatus, the ER, kinetochores and other places.
We test the proofreading power of such gradients, assuming experimentally constrained biophysical parameters for the gradient forming mechanism. Specifically, we consider an enzyme that acts on the active forms of cognate () and non-cognate () substrates which have off-rates 0.1 s-1 and 1 s-1, respectively (hence, ). These off-rates are consistent with typical values for substrates proofread by cellular signaling systems (Cui and Mehta, 2018; Gascoigne et al., 2001). The kinases and phosphatases in our setup act identically on right and wrong substrates. We consider a dephosphorylation rate constant s-1 that falls in the range 0.1−100 s-1 reported for different phosphatases (Brown and Kholodenko, 1999; Kholodenko et al., 2000; Todd et al., 1999), and a cytosolic diffusion constant μm2/s for all proteins in this model. With this setup, exponential gradients of length scale ∼0.5 μm are formed for and . We evaluate the proofreading and energetic performance of the model in a compartment of size μm – a typical length scale in eukaryotic cells (see Appendix 6 for details).
Although not cost-efficient, this setup achieves proofreading in a wide range of regimes. Specifically, it is most effective when the enzyme–substrate binding is slow, in which case the exponential substrate profile is maintained and the system attains the fidelity predicted by our earlier explanatory model (Figure 5b). The system’s proofreading capacity is retained if the first–order on-rate is raised up to s-1, where around 10-fold increase in fidelity is still possible. If the binding rate constant () or the enzyme’s expression level () is any higher, then enzymatic reactions overwhelm the ability of the kinase/phosphatase system to keep the active forms of substrates sufficiently localized (Figure 5c) and proofreading is lost. Overall, this model suggests that enzymes can work at reasonable binding rates and still proofread, when accounting for an experimentally characterized gradient maintaining mechanism.
Discussion
We have outlined a way for enzymatic reactions to proofread and improve specificity by exploiting spatial concentration gradients of substrates. Like the classic model, our proposed spatial proofreading scheme is based on a time delay; but unlike the classic model, here the delay is due to spatial transport rather than transitions through biochemical intermediates. Consequently, the enzyme is liberated from the stringent structural requirements imposed by traditional proofreading, such as multiple intermediate conformations and hydrolysis sites for energy coupling. Instead, our scheme exploits the free energy supplied by active mechanisms that maintain spatial structures.
The decoupling of the two crucial features of proofreading – time delay and free energy dissipation – allows the cell to tune proofreading on the fly. For instance, all proofreading schemes offer fidelity at the expense of reaction speed and energy. For traditional schemes, navigating this trade-off is not always feasible, as it needs to involve structural changes via mutations or modulation of the [ATP]/[ADP] ratio which can cause collateral effects on the rest of the cell. In contrast, the spatial proofreading scheme is more adaptable to the changing conditions and needs of the cell. The scheme can prioritize speed in one context, and fidelity in another, simply by tuning the length scale of intracellular gradients (e.g. through the regulation of the phosphotase or free enzyme concentration in the scheme discussed earlier).
On the other hand, this modular decoupling can complicate the experimental identification of proofreading enzymes and the interpretation of their fidelity. Here, the enzymes need not be endowed with the structural and biochemical properties typically sought for in a proofreading enzyme. At the same time, any attempt to reconstitute enzymatic activity in a well-mixed, in vitro assay, will show poor fidelity compared to in vivo measurements, even when all necessary molecular players are present in vitro. Therefore, more care is required in studies of cellular information processing mechanisms that hijack a distant source of free energy compared to the case where the relevant energy consumption is local and easier to link causally to function.
While we focused on spatially localized substrates and delocalized enzymes, our framework would apply equally well to other scenarios, like one with a spatially localized enzyme (or its active form [Kalab et al., 2002; Nalbant et al., 2004]) and effector with delocalized substrates, an example of which would be an alternative version of the scheme in Figure 5a where the target of the kinase/phosphatase activity is changed from substrates to enzymes. Our framework can also be extended to signaling cascades, where slightly different phosphatase activities can result in magnified concentration ratios of two competing signaling molecules at the spatial location of the next cascade step (Roy and Cyert, 2009; Bauman and Scott, 2002; Kholodenko, 2006).
The spatial gradients needed for the operation of our model can be created and maintained through multiple mechanisms in the cell, ranging from the kinase/phosphatase system modeled here, to the passive diffusion of substrates/ligands combined with active degradation (e.g. Bicoid and other developmental morphogens), to active transport processes combined with diffusion. A particularly simple implementation of our scheme is via compartmentalization – substrates and effectors are localized in two spatially separated compartments with the enzyme–substrate complex having to travel from one to another to complete the reaction.
Many molecular localization pathways involving the naturally compartmentalized parts of the cell require high substrate selectivity and are therefore potential candidates for the implementation of spatial proofreading. For example, in polarized, asymmetric cells (e.g. budding yeast or neuronal cells) gene expression often needs to be spatially regulated (Parton et al., 2014; Martin and Ephrussi, 2009). Such regulation is achieved with designated ribonucleoproteins that bind specific mRNAs near the cell nucleus, perform a biased random walk to the mRNA localization site and deliver them for translation. During transport, mRNAs are protected from ribosome binding and when they unbind, they are subject to degradation which would prevent rebinding events at intermediate locations. Another example process is the non-vesicular transport of lipids between the membrane–bound domains of the cells (e.g. the ER, mitochondria, the Golgi apparatus, or the plasma membrane). This transport mechanism is mediated by lipid-transfer proteins that bind lipids on the donor membrane, diffuse to the acceptor membrane and upon interacting with it, undergo a conformational change, delivering the ‘cargo’ (Lev, 2010). Although the higher proximity of the two membranes is thought to enhance the transport efficiency, it would be interesting to study the optimality of the inter-membrane distance in the context of fidelity–transport efficiency trade-off, given the fact that some of the lipid-transfer proteins are known to exhibit specificity for their cognate substrates.
Our scheme may also be applicable as a quality control mechanism in protein secretion pathways (Ellgaard and Helenius, 2003; Arvan et al., 2002), in high-fidelity targeting of membrane proteins mediated by signal recognition particles (Rao et al., 2016; Chio et al., 2017), as well as in selective glycosylation reactions in the Golgi apparatus (Jaiman and Thattai, 2020). Lastly, considering the recent advances in generating synthetic morphogen patterns in multicellular organisms (Toda et al., 2020; Stapornwongkul et al., 2020), spatial proofreading could also be employed in pathways acting on engineered protein gradients. Experimental investigations of these processes in light of our work will reveal the extent to which spatial transport promotes specificity.
In conclusion, we have analyzed the role played by spatial structures in endowing enzymatic reactions with kinetic proofreading. Simply by spatially segregating substrate binding from catalysis, enzymes can enhance their specificity. This suggests that enzymatic reactions may acquire de novo proofreading capabilities by coupling to pre-existing spatial gradients in the cell.
Materials and methods
Detailed derivations of the analytical results presented in the main text along with additional studies on our model are included in the Appendices. In addition, Python scripts and Jupyter notebooks used to generate all the plots in the main text and Appendices are included as Supplementary files.
Appendix 1
Analytical calculations of the complex density profile and fidelity
We begin this section by deriving an analytical expression for the density profile of substrate-bound enzymes () in the case where the assumption holds. Based on this result, we then obtain expressions for fidelity in low, high, and intermediate substrate localization regimes. We reserve the studies of speed and fidelity in the general case of a nonuniform free enzyme profile to Appendix 5.
1. Derivation of the complex density profile
The ordinary differential equation (ODE) that defines the steady state profile of substrate-bound enzymes is
Here, is the substrate density at the leftmost boundary, whose value can be calculated from the condition that the total number of free substrates is , namely,
In the limit of low substrate amounts where the approximation is valid, Equation S1 represents a linear nonhomogeneous ODE. Hence, its solution can be written as
where is the general solution to the corresponding homogeneous equation, while is a particular solution.
Looking for solutions of the form for the homogeneous part, we find
The two possible roots for are . Calling the positive root , which represents the mean distance traveled by the substrate–bound enzyme before releasing the substrate, we can write the general solution to the homogeneous part of Equation S1 as
where C1 and C2 are constants which will be determined from the boundary conditions.
Since the nonhomogeneous part of Equation S1 is a scaled exponential, we look for a particular solution of the same functional form, namely, . Substituting this form into the ODE, we obtain
The constant coefficient can then be found as
where we have used the equality .
Now, to find the unknown coefficients C1 and C2, we impose the no-flux boundary conditions for the density at the left and right boundaries of the compartment, namely,
Note that we did not take into account the product formation flux at the rightmost boundary when writing Equation S10 in order to simplify our calculations. This is justified in the limit of slow catalysis – an assumption that we make in our treatment. The above system of two equations can then be solved for C1 and C2, yielding
With the constant coefficients known, we obtain the general solution for the complex profile as
2. Density profile in low and high substrate localization regimes
If substrate localization is very poor (), the substrate distribution will be uniform (), resulting in a similarly flat profile of enzyme–substrate complexes with their density given by
This is the expected equilibrium result where the complex concentration is inversely proportional to the dissociation constant ().
In the opposite limit where the substrates are highly localized ( and from Equation S3), the complex density profile simplifies into
The x-dependence through the function suggests that the complex density is the highest at the leftmost boundary and lowest at the rightmost boundary, with the degree of complex localization dictated by the length scale parameter . Notably, this localization of complexes does not alter their total number, since the average complex density is conserved, that is,
Equation S15 for the complex profile can be alternatively written in terms of the diffusion time scale and the substrate off-rate . Noting that and introducing a dimensionless coordinate , we find
The above equation is what was used for generating the plots in Figure 3b of the main text for different choices of the diffusion time scale.
3. Fidelity in low and high substrate localization regimes
Let us now evaluate the fidelity of the model in the two limiting regimes discussed earlier. In the poor substrate localization case, which corresponds to an equilibrium setting, the fidelity can be found from Equation S14 as
where we have employed the assumption about the right and wrong substrates having identical density profiles. This is the expected result for equilibrium discrimination where no advantage is taken of the system’s spatial structure.
In the regime with high substrate localization, the enzyme–substrate complexes have a nonuniform spatial distribution. What matters for product formation is the complex density at the rightmost boundary (), which we obtain from Equation S17 as
Substituting the above expression written for right and wrong complexes into the definition of fidelity, we find
This is the result reported in Equation 5 of the main text. To gain more intuition about it and draw parallels with traditional kinetic proofreading, let us consider the limit of long diffusion time scales where proofreading is the most effective. In this limit, the hyperbolic sine functions above can be approximated as , simplifying the fidelity expression into
where we have used the definition of equilibrium fidelity (Equation S18). In traditional proofreading, a scheme with n proofreading realizations can yield a maximum fidelity of . The value of n for the original Hopfield model, for instance, is 1. It would be informative to also know the effective parameter n for the spatial proofreading model. Dividing Equation S21 by , we find
This exact result can be simplified into an approximate form when diffusion is slow and , yielding the expression reported in Equation 7 of the main text, namely,
4. Fidelity in an intermediate substrate localization regime
The generic expression for complex density at the rightmost boundary () can be written using Equation S13 as
For the system to proofread, substrates need to be sufficiently localized () and diffusion needs to be sufficiently slow ( or, ). Under these conditions, the substrate profile can be approximated using Equation S3 as , while the hyperbolic sine and cosine functions used above can be approximated as . With these approximations, the complex density expression simplifies into
Now, depending on how compares with , there can be two qualitatively different regimes for the complex density, namely,
where we used the equilibrium complex density defined in Equation S14.
Notably, the first regime effectively corresponds to the case of ideal substrate localization where complex density is independent of the precise value of . The dimensionless number sets the scale for the minimum value beyond which ideal localization can be assumed. Conversely, the second regime corresponds to the case where the distance traveled by a complex before dissociating is so short that the complex profile is dictated by the substrate profile itself. Because of that, the complex density reduction from its equilibrium limit is independent of the precise values of and , as long as the condition is met.
The scheme yields its highest fidelity when both right and wrong complex densities are in the first regime (ideal localization). When both densities are in the second regime, fidelity is reduced down to its equilibrium value (Appendix 1—table 1). The transition between these two extremes happens when the density profiles of right and wrong complexes fall under different regimes. Fidelity can be navigated in the transition zone by tuning the substrate gradient length scale . This is demonstrated in Appendix 1—figure 1 for three different choices of . In all three cases, the dimensionless numbers and set the approximate range in which the bulk of fidelity enhancement occurs, as stated in the main text.
5. Optimal diffusion time scale for maximum fidelity
Figure 3c of the main text illustrated the non-monotonic dependence of fidelity on the diffusion time scale for different fixed values of . Here, we further explore this feature by asking what sets the optimal . To gain analytical insights, we focus on the case where the system can proofread, which, as we argued in the previous section, happens when . Under this condition, we identified two qualitatively different regimes of complex density reduction (Equation S26). Namely, we found that for sufficiently fast diffusion the system acted as if the substrates were localized ideally, whereas for sufficiently slow diffusion the complex density reduction was dictated solely by and did not discriminate between the two substrate kinds. These two limiting behaviors are indeed reflected in Figure 3c where in the low limit (fast diffusion) the family of curves matches the dotted ideal localization curve, while in the high limit (slow diffusion) all curves decay to 1, corresponding to the loss of error correction.
An intuitive approach for identifying the optimal is to slow down diffusion up to the point where the density of wrong complexes at approaches a plateau and effectively stops decreasing. Going past this threshold would only reduce the density of right complexes at and thereby, reduce the fidelity. We know from Equation S26 that plateauing for wrong complexes happens when (equivalently, ). Hence, our first guess for the optimal diffusion time scale is
To test the soundness of this expression, we compared its predictions to the optimal values in Figure 3 that were identified numerically for different choices of . The results of the comparison are shown in Appendix 1—figure 2. As can be seen, for sufficiently high degrees of substrate localization (), the prediction of Equation S29 provides a good approximation of the true optimum. However, it is apparent that the prediction consistently underestimates the true , which was expected since plateauing of happens not under equality but a strict inequality condition (i.e. ). Because an exact analytical expression for is not available, we performed different approximations to the fidelity formula and found an empirical correction term for our earlier estimate given by . The prediction for with the correction term is now accurate starting a much lower value of , corresponding to a regime where the system proofreads once (). Overall, these analytical results provide good initial guesses for which should be refined using a numerical approach for a higher accuracy.
Appendix 2
Energetics of the scheme
We start this section by deriving an analytical expression for the minimum dissipated power, which was used in making Figure 4 of the main text. Then, we calculate the upper limit on fidelity enhancement available to our model for a finite substrate gradient length scale and compare this limit with the fundamental thermodynamic bound. We end the section by providing an estimate for the baseline cost of setting up gradients and compare this cost with the maintenance cost reported in the main text. Similar to our treatment of Appendix 1, here too our calculations are based on the assumption to allow for intuitive analytical results.
1. Derivation of the minimum dissipated power
As stated in the main text, we calculate the minimum rate of energy dissipation necessary for maintaining the substrate profiles as
where is the net local substrate binding flux and is the local chemical potential.
Our choice for the expression of power at steady state is motivated by that fact that the enzyme transport is passive and therefore, energy needs to be spent only on counteracting the local binding/unbinding events that tend to homogenize the substrate profile. To demonstrate the validity of our proposed expression more formally, we invoke the standard approaches for calculating power (Hill, 1977; Zhang et al., 2012). In particular, for a system that is described through discrete states with transition rates between them, the rate of energy dissipation at steady state is given by
where is the flux from state i into state j. We note here that a similar expression for the rate of total entropy production involves a term (statistical forces) instead of the term (deterministic driving forces). At steady state, however, these two expressions are mathematically equivalent (Zhang et al., 2012). Our choice for Equation S31 stems from the better physical intuition that it provides in our context.
So far, the description of our system has been in terms of continuous density functions. To apply Equation S31 for calculating power, we consider the discrete-state representation of enzyme dynamics shown in Appendix 2—figure 1. There, space is discretized into intervals of size and diffusion is represented through jumps between neighboring sites with a rate . What keeps the system out of equilibrium is the spatially varying substrate profile .
Because forward and backward diffusive transitions have identical rates, according to Equation S31 they will not contribute to energy dissipation (since ). The contribution from the remaining substrate binding/unbinding events can then be written as
where and are the numbers of free and substrate–bound enzymes, respectively, in the interval. In the limit of a large number of discrete spatial intervals, the sum over i in Equation S32 can be rewritten as an integral over the coordinate x, namely,
Comparing the form of Equation S33 to that of Equation S30 (with substituted), one can notice a difference in the terms that multiply . Specifically, in Equation S30 we have while the corresponding term in Equation S33 is . The difference between them, however, is in the parts that do not depend on x, while the spatially varying parts (namely, the contributions) are identical. Now, since the number of bound complexes is constant at steady state, we have . This means that the x-independent parts discussed earlier all integrate to zero, making the power estimates by Equation S30 and Equation S33 identical, thereby justifying our proposed expression.
To estimate power, we substitute the analytical expression for found earlier (Equation S13) into and performing a somewhat cumbersome integral, obtain
where , and is the net binding rate of each substrate. Figure 4 in the main text was made using this expression for power.
To get additional insights about this result, let us consider the case where substrates are highly localized () and diffusion is slow () – conditions needed for effective proofreading. Under these conditions, the hyperbolic tangent terms become 1 and the expression for the power expenditure simplifies into
The monotonic increase of power with suggests that energy is primarily spent on maintaining the concentration gradient of right substrates. This is not surprising, since typically right complexes travel a much greater distance into the low concentration region of the compartment before releasing the bound substrate (i.e. ). Therefore, neglecting the contribution from wrong substrates and considering the range of values where the bulk of power–fidelity trade-off takes place (), we further simplify the power expression into
where we used the identities and . This simple linear relation suggests that in order to maintain the exponential substrate profile, the minimum energy spent per substrate binding event should be at least (since ).
We can also use Equation S36 to estimate the minimum dissipation per substrate binding event at where the logarithmic power–fidelity scaling regime ends (see Figure 4 of the main text). Substituting the value of , we obtain , which is the result illustrated in Figure 4.
2. Limits on fidelity enhancement
The error reduction capacity of the spatial proofreading scheme improves with a greater difference in substrate off-rates, as was demonstrated in Figure 2 of the main text. At the same time, Figure 3c showed that the finite length scale of substrate localization (or, finite driving force) sets an upper limit on fidelity enhancement for substrates with fixed off-rates. It is therefore of interest to consider these two features together to find the absolute limit on fidelity enhancement available to our model and then compare it with the fundamental bound set by thermodynamics.
Intuitively, fidelity will be enhanced the most if the density of right complexes does not decay across the compartment, while that of wrong complexes decays maximally. The first condition can be met if diffusion is fast or if the unbinding rate of right substrates is low, in which case we have
where is the equilibrium density of right complexes. Conversely, when the unbinding rate of wrong substrates is very large, the density of wrong complexes is maximally reduced at the rightmost boundary and can be obtained from Equation S24 by taking the limit, namely,
Here, is the equilibrium density of wrong complexes, and is the effective driving force of the scheme. Taking the ratio of Equations S37 and S38. Limits on fidelity enhancement, we obtain the largest fidelity enhancement of the scheme for the given driving force, namely,
When (or, , the limit above gets further simplified into
Now, thermodynamics imposes an upper bound on fidelity enhancement by any proofreading scheme operating with a finite chemical potential . This bound is equal to and is reached when the entire chemical potential is used to increase the free energy difference between right and wrong substrates (Qian, 2006). Comparing it with the result in Equation S41, we can see that fidelity enhancement in the spatial proofreading model has the same exponential scaling term, but with an additional linear factor. Since the dominant contribution comes from the exponential term (as captured also in Appendix 2—figure 2), we can claim that our proposed model can operate very close to the fundamental thermodynamic limit.
3. Energetic cost to setup a concentration gradient
Earlier in the section, we calculated the rate at which energy needs to be dissipated to counteract the homogenizing effect that enzyme activity has on the substrate gradient. In addition to this cost, however, there is also a baseline cost for setting up a gradient in the absence of any enzyme. Here, we calculate this cost in the case where the gradient formation mechanism needs to work against diffusion that tends to flatten the substrate profile.
As before, we consider an exponentially decaying substrate gradient with a decay length scale and a total number of substrates . We write the minimum power required for counteracting the diffusion of substrates as
where is the diffusive flux, with being the substrate diffusion constant. The rationale for writing this form is that diffusion moves substrates from a higher chemical potential region into a neighboring lower chemical potential region. The gradient maintaining mechanism would need to spend at least this chemical potential difference () per each substrate diffusing a distance down the chemical potential gradient. Adding up the contribution from all local neighborhoods with a local diffusive flux results in Equation S42.
Now, substituting for the substrate profile and for the chemical potential, we obtain
where in the third step we used the relation . This suggests that the minimum dissipated power required for setting up an exponential gradient increases quadratically with decreasing localization length scale .
It is informative to also make a comparison between this result and the earlier calculated minimum dissipation needed to counteract the enzyme’s homogenizing activity. Recall that when substrates were sufficiently localized and when diffusion was sufficiently slow, proofreading power could be approximated as (Equation S35)
where is the total substrate binding flux. Using the identities and , we can calculate the ratio of the proofreading power to baseline power as
Presuming for simplicity that the enzyme and substrate diffusion constants are the same, we see that two factors determine the power ratio: (1) the amount of free enzyme in the system () and (2) the substrate localization length scale relative to the characteristic length scale of complex diffusion (). Now, recall that the proofreading cost is spent largely on counteracting the homogenizing activity of the enzyme on right substrates (Appendix 2.1) and that the bulk of fidelity enhancement takes place when (Appendix 1.4). Therefore, when tuning down, initially the power ratio would only depend on the amount of free enzyme in the system () and then, with tighter substrate localization, the relative contribution of the proofreading power would start to decrease.
In the end, we would like to note that spatial gradients can also be set up using an external potential without a continuous dissipation of energy. In an in vivo setting, gravity can give rise to spatial structures in oocytes (Feric and Brangwynne, 2013), while in an in vitro setting, electric fields can create gradients and power the transport of the complex (Hansen et al., 2017). We leave the investigations of such alternative strategies to future work.
Appendix 3
Studies on the effect of catalysis on the model performance
In Appendix 1, we considered the rate of catalysis at the right boundary to be very small for the analytical simplicity of our derivations. This resulted in expressions for fidelity that were independent of the rate of catalysis r and allowed us to use the complex density at the right boundary as a proxy for speed. In this section, we relax this assumption and explore the consequences of having non-negligible catalysis rates on the model’s fidelity and on the speed–fidelity trade-off.
1. Derivation of the complex density profile
Accounting for catalysis in our model should be done through a boundary condition for the complex density equation (Equation S1). Earlier, we imposed a no-flux boundary condition at under the slow catalysis assumption. With non-negligible catalysis, this assumption is no longer valid, and the boundary condition is modified into
Recall from Equations S4, S6 and S8 that the general solution for the complex profile had the form
Imposing the no-flux boundary condition at allows us to eliminate one of the integration constants, namely,
Next, we impose the new boundary condition at (Equation S46), which yields
Note that we have introduced the dimensionless variable , which, as will see later, will define the extent to which the presence of catalysis affects the fidelity. For convenience, here we write different equivalent forms for as
Solving for the remaining unknown coefficient C1 in Equation S52, we find
Lastly, we substitute this result for C1 into Equation S51 and obtain a general expression for the complex density profile as
One can show in a straightforward way that this result reduces to Equation S13 in the limit.
2. Effects on fidelity in low and high substrate localization regimes
Accounting for the catalysis flux has made the general expression for the complex density profile even more incomprehensible. In order to gain insights about the qualitative as well as quantitative changes introduced by catalysis, we will focus on two characteristic limits of substrate localization – uniform substrate profile () and ideal substrate localization ().
2.1. Uniform substrate profile
In this case, no mechanism for localizing substrates is in play. Let us start off by evaluating the coefficient (Equation S48) in the limit. Recalling from Equation S3 that ), we find
where is the total substrate binding flux.
Substituting the expression for into Equation S55 and eliminating all the terms that vanish upon taking the limit, we obtain
Ultimately, we are interested in knowing the rate of product formation defined via . We therefore evaluate the complex density at and multiply it by r, which yields
where in the last step we wrote an equivalent expression using the identity. To analyze this result further, we will consider two limiting cases.
Case 1: Fast diffusion (). If diffusion is fast, we can approximate the hyperbolic tangent functions as the arguments themselves (i.e. for ). Then, using the last form of in Equation S53, we simplify the expression for speed as
This is an intuitive result, suggesting that an enzyme that diffuses fast acts like a standard Michaelis–Menten enzyme with an effective catalysis rate . For such an enzyme, the probability of catalysis for a bound substrate is . Multiplying this probability by the net substrate binding flux yields the expression for speed in Equation S61.
Fidelity of the model in this fast diffusion setting can be written as
In the limit where catalysis is very slow (), the equilibrium fidelity given by the ratio of off-rates is recovered. And in the opposite limit of very fast catalysis (), the discriminatory capacity of the enzyme disappears altogether (Appendix 3—figure 1a).
Case 2: Slow diffusion (). A more interesting case is when diffusion is slow. Now, the hyperbolic tangent functions in Equation S59 are approximately 1, allowing us to simplify the expression for speed into
Drawing an analogy between the above result and Equation S61, one can notice the presence of an extra factor for in the denominator.
Evaluating the speeds of right and wrong product formation, we can write fidelity in this slow diffusion setting as
Like the fast diffusion case, when catalysis is very slow ( or, equivalently, ), the equilibrium fidelity is recovered. Unlike the fast diffusion case, however, if catalysis is very fast (), the enzyme partly preserves its discriminatory capacity (Appendix 3—figure 1b). In this limit, a fidelity equal to the square root of the equilibrium fidelity is still attainable, namely,
This unexpected result suggests a potential advantage of localizing fast catalytic reactions instead of having them occur in a well–mixed solution.
2.2. Ideal substrate localization
We next consider the effect of catalysis on model fidelity in the ideal substrate localization limit (). We begin by evaluating the ratio that appears in the density profile expression (Equation S55). Using Equations S48 and Equations S3, we find
where in the last step we invoked the identities and . We then substitute our result for into Equation S55 and simplify the complex density expression into
To obtain the speed, we evaluate at the right boundary () and multiply it by r, namely,
To evaluate the effect of catalysis further, we again consider two special limits – those of fast and slow diffusion.
Case 1: Fast diffusion (). In this limit, the hyperbolic sine function can be approximated by its argument (i.e. for ), while the hyperbolic cosine function is approximately 1. Making these approximations and substituting the expression for , we obtain
This result is identical to what we found in the fast diffusion limit for the setting (Equation S61), which is reasonable, since the location of substrate binding is irrelevant if diffusion is very fast (Appendix 3—figure 2a).
Case 2: Slow diffusion (). In this limit, the hyperbolic sine and cosine functions can be approximated as exponentials with a prefactor, simplifying the expression of speed into
Recalling the identity (note that depends on the substrate kind), we evaluate the speed for right and wrong product formation and, dividing them, obtain the fidelity as
In the case where catalysis is slow (), the first term in the fidelity expression becomes approximately 1, and the our earlier result obtained with no account of catalysis is recovered (Equation S21). In the opposite limit of fast catalysis (), the first term is no longer 1, and we find
As we can see, fast catalysis in the slow diffusion regime reduces the fidelity by or, equivalently, reduces the effective number of proofreading realizations by one half, without affecting the exponential amplification term (Appendix 3—figure 2b).
To conclude, our study demonstrated the expected reduction of fidelity with increasing catalysis rate. In the case of fast diffusion, up to a factor of reduction is possible, as is the case for the original (Hopfield, 1974; Wong et al., 2018). In the case of slow diffusion, however, the cap on the amount of reduction is decreased down to . The advantage of this feature is most notable in the limit of a non-localized (i.e. uniform) substrate profile and fast catalysis where a diffusing enzyme is still capable of discriminating between substrates. This behavior would not be possible for a Michaelis–Menten enzyme in a well-mixed solution.
3. Effects on the speed–fidelity trade-off
In Figure 3a of the main text we explored the speed–fidelity trade-off in the slow catalysis limit. This trade-off arose in response to tuning the substrate localization length scale () and the diffusion time scale (). Here, we explore the changes to this trade-off behavior in the case where the effects of catalysis are not negligible. For concreteness, we focus on alterations to the Pareto front of the trade-off achieved in the limit.
Appendix 3—figure 3a compares the Pareto fronts in the cases of slow and fast catalysis limits. In each case, speed is normalized by the corresponding effective Michaelis–Menten speed that is reached in the fast diffusion limit and is given by , where . One can notice a shift of the fast catalysis front toward the low-fidelity region, which was expected since earlier we observed the complete loss of substrate discrimination when diffusion and catalysis were both fast (Appendix 3—figure 2a).
Appendix 3—figure 3a may leave an impression that faster catalysis leads to a less favorable speed–fidelity trade-off. Note, however, that the speed used to normalize the y-axis is itself a function of the catalysis rate and penalizes the fast catalysis case more than its slow counterpart. To eliminate this ambiguity, we plotted a family of Pareto fronts for increasing values of the catalysis rate but this time normalizing the y-axis by the r-independent quantity (Appendix 3—figure 3b). As can be seen, faster catalysis in fact improves the speed–fidelity trade-off, meaning that in order to maximize fidelity at a given speed level, the best strategy would be to increase the catalysis rate and correspondingly slow down the diffusion.
A trade-off between speed and fidelity also arises in response to the sole alteration of the catalysis rate, while keeping the rest of the model parameters fixed. To explore this trade-off for an arbitrary fixed choice of and , we begin by evaluating speed from Equation S55, namely,
In the last step, we introduced coefficients and that are independent from r, and used the fact that .
Now, using the definition of fidelity and the result obtained above, we can write
Notice that the ratio is the fidelity in the limit of very slow catalysis (). Substituting it, we write
where . Recalling that and noting the function form of the denominator in Equation S75, one can show that . This is an increasing function of and hence, a decreasing function of , implying that .
With this condition in mind, we can see from Equation S78 that speed and fidelity are anticorrelated with a linear slope when tuning the catalysis rate, unlike the more sophisticated trade-off relations when tuning the other model parameters. The peak fidelity is attained in the limit of vanishing speed. And conversely, speed is the highest when fidelity is the lowest for the given fixed values of and (Appendix 3—figure 4).
Overall, our result illustrates the simple speed–fidelity trade-off that can be navigated by tuning the catalysis rate. This, for instance, can be achieved by changing the concentration of effectors that activate the enzyme for catalysis.
Appendix 4
Proofreading for substrates with different localization conditions
Following the original treatment by Hopfield, 1974, we have performed the studies of our model under the assumption that discrimination between right and wrong substrates is solely based on their off–rates (). Although this is often the signature difference between substrates, in a cellular setting substrate discrimination may occur through other factors also. For example, substrates may be present at different amounts or they may have non-identical on–rates. These differences, however, have a multiplicative effect on the fidelity (i.e. ) and do not highlight the proofreading capacity of a particular model.
Unlike these two features, differences in the degree to which right and wrong substrates are localized can have a non-trivial effect on the proofreading performance. In this Appendix, we generalize our study of the model fidelity to cases where right and wrong substrates have unequal localization length scales and , respectively.
1. Limiting cases
We start off by exploring the limiting cases first. From the earlier derived Equation S14 and Equation S15, we know that the complex density at in very low () and very high () substrate localization regimes is given by
respectively. Note that the complex density in the ideal localization case is necessarily lower than that in the case of a uniform profile, since the inequality holds for all choices of . If and are not constrained to be equal, then the highest fidelity for a given will be attained when the right substrates are distributed uniformly while the wrong substrates are highly localized ( and , respectively). We obtain the fidelity in this case as
Notably, this result for maximum fidelity enhancement is independent of . Furthermore, it exceeds the ideal localization fidelity reported in the main text (Equation 5, derived in the limit), which was expected since now the right complexes on average travel a shorter distance to reach the activation site than the wrong complexes.
In the opposite scenario where the wrong substrates are uniformly distributed and the right ones are highly localized ( and , respectively), the system attains its lowest fidelity for a given , namely,
Since , the lowest fidelity is less than the equilibrium fidelity itself (), suggesting that the enzyme may in fact do anti-proofreading (Murugan et al., 2014) if the wrong substrates are generally closer to the catalytic site.
2. Intermediate levels of substrate localization
In Figure 3 inset as well as in Appendix 1.4, we explored the dependence of fidelity on the substrate localization length scale when it was the same for the two substrate kinds. Here, we expand this study to the case where this constraint is relaxed.
In particular, using Equation S24, we calculate complex densities and corresponding fidelity values as a function of for different fixed choices of the length scale ratio . The results of the study are captured in Appendix 4—figure 1. In the special case where the two length scales are equal ( = , solid black line), fidelity exhibits a monotonic depends on , and in the limit of ideal localization (very large ) the result in Equation 5 of the main text is recovered.
When , the dependence of fidelity on is no longer monotonic. If right substrates are more localized than the wrong ones (red curves), then the fidelity curves have a minimum where the enzyme does anti-proofreading (i.e. ). The proofreading portion of the curves (when ) is shifted to the right, suggesting that much higher substrate localization is needed for the enzyme to proofread.
The opposite case is when the right substrates have a shallower gradient than the wrong ones (blue curves). The fidelity curves are now shifted to the left and have a peak that is greater than the large limit of fidelity. This means that there is an optimal degree of substrate localization, going beyond which makes the model performance worse in terms of both error correction and energy consumption.
Over the course of its diffusive transport, a bound enzyme is more likely to deposit a right substrate in a substrate-depleted region than a wrong one, because right substrates stay attached to the enzyme for a longer time. Therefore, if the gradient-maintaining mechanism does not discriminate between substrates (which we assume is the case for the kinase/phosphatase-based one), then it will be easier for it to maintain the wrong ones localized since they tend to get deposited closer to the localization site (see Appendix 6—figure 1c as an example). This means that in a realistic setting the spatial organization of substrates is more likely to be in the advantageous blue region of Appendix 4—figure 1 where , facilitating the realization of spatial proofreading.
Appendix 5
Studies on the validity of the uniform free enzyme profile assumption
In our treatment of the model so far, we have assumed for mathematical convenience that free enzymes are in excess, which suggested the approximation . Example enzyme density profiles shown in Appendix 5—figure 1, however, demonstrate that this assumption does not hold in general. Specifically, there is a depletion of free enzymes near the substrate localization site and abundance near the catalysis site. Because of this depletion at the leftmost edge, we expect a reduction in speed in comparison with our earlier treatment where a flat profile was assumed. In addition, if substrates have a weak gradient, we expect the fidelity to also be reduced, since more enzymes will bind substrates at intermediate positions, reducing the average travel distance to the catalytic site. In what follows, we discuss in greater detail the consequences of having a nonuniform free enzyme distribution on the model performance.
1. Effects that relaxing the assumption has on the Pareto front
We begin by studying the effects of relaxing the uniform free enzyme profile assumption on the Pareto front of the speed–fidelity trade-off (Figure 3a of the main text). This front is reached in the ideal substrate localization limit (). Though in general enzyme profiles need to be obtained using numerical methods due to the nonlinearity of reaction–diffusion equations, in this particular limit () an analytical solution is available. To obtain it, we write the reaction–diffusion equations in the bulk region of space as
Substrate binding reactions did not enter the above equations, as they occur at the leftmost boundary only. They are instead accounted for via boundary conditions, which read
where is the total amount of free substrate of each kind concentrated at .
Relating local enzyme concentrations
Considering the system at steady state, we add Equations S85-S87 and obtain
where we replaced the partial derivatives with total derivative since the profiles are time-independent. Dividing Equation S91 by D and integrating once, we find
The above relation must hold for arbitrary position x. Choosing and noting that from Equations S88-S90 the sum of fluxes should be zero, we can claim that . Integrating for the second time, we obtain
where C2 is now a different constant. To find it, we perform an integral for the last time across the entire compartment, namely,
Here, we introduced the parameter as the total number of enzymes in the system (in free or bound forms). The constant C2, which we will rename into , is then the average enzyme density, that is,
Substituting this result into Equation S93, we find an insightful relation between free and bound enzyme densities at an arbitrary position, namely,
This relation suggests that whenever the local concentration of bound enzymes is high, the local concentration of free enzymes should be correspondingly low, as we see reflected in the profiles of Appendix 5—figure 1.
Deriving the fidelity expression
Next, we consider Equations S85 and S86 separately at steady state, written in the form
The general solution to this ODE reads
where , and and (S = R,W) are constants which are different for right and wrong complexes. The no-flux boundary condition at can be used to relate these constants and simplify the complex profile expression, namely,
where is a new constant coefficient introduced for convenience.
Now, the fidelity of the scheme is the ratio of right and wrong complex densities at . Using the result above, the fidelity can be written as
The ratio of these constant coefficients can be obtained by noting that the diffusive fluxes of right and wrong complexes at are identical (from Equations S38 and S38), that is,
Substituting this result into Equation S102, and recalling the equality , we obtain
This expression is identical to that in Equation S20 which was derived under the assumption, suggesting that when substrates are highly localized, the shape of the free enzyme profile does not dictate the fidelity.
Deriving the speed expression
To keep the expression of speed compact while still illustrating the key consequences of relaxing the assumption, we will assume moving forward that the density of wrong complexes is much lower than that of the right complexes, that is, . This assumption holds as long as the right and wrong complexes have sufficiently different off-rates. To see why it is the case, note that the ratio is the highest at . We therefore calculate an upper bound for the ratio using Equation S101 and Equation S105 as
As long as , it is fair to assume that the right complexes greatly outnumber the wrong ones, which allows us to approximate the free enzyme density from Equation S96 as .
The specification of the right complex density profile requires the knowledge of the unknown coefficient . To find this coefficient, we use the boundary condition in Equation S88 and the approximation to write
With the constant coefficient known, the right complex density then becomes
where we used the definitions of the mean substrate density and the dissociation constant .
To enable a direct parallel between this general treatment and the earlier one with the approximation, let us introduce as the uniform right complex density when diffusion is very fast () and calculate it from Equation S110 as
Now, using the expression, we rewrite Equation S110 as
where is the complex density obtained under the assumption (Equation S15). The extra factor that appears on front does not exceed 1 since , indicating a reduction in speed, as we anticipated in our more qualitative discussion at the beginning of the section. The presence of the extra factor suggests two possibilities for the approximation to hold true; first, which happens when or when the right complex does not decay noticeably across the compartment, and second, when and , which is when right complexes do decay but their fraction is low compared with free enzymes because of low substrate concentration.
Let us demonstrate the last statement more explicitly. Specifically, let us show that the validity of the approximation is indeed linked directly to the fraction of bound enzymes. To that end, we evaluate as a metric that quantifies the degree to which holds. If there is a large depletion of free enzymes near the substrate-binding site, then the metric will be significantly less than 1; conversely, if the free enzyme profile is practically flat, then the metric will be close to 1. Invoking the relation and using our result for the complex density (Equation S110) as well as the definition of in Equation S112, we evaluate this metric as
Next, we calculate the fraction of bound enzymes from Equation S110 as
Note that emerges as the highest fraction of bound enzymes () reached in the large substrate concentration limit.
To link the metric to the fraction of bound enzymes, we express in terms of and substitute it into Equation S113, namely,
Now, when the complexes do not decay appreciably across the compartment ( and thus, ), the metric becomes roughly equal to 1, suggesting that the free enzyme profile is practically flat. A more interesting case is when the complexes do decay (), as in Appendix 5—figure 1. In this case, applying the condition , we find
The anti-correlation between the and in the above result demonstrates that the degree to which the approximation is violated is indeed dictated by the fraction of bound enzymes.
Pareto front shift
The previous calculations showed that in the ideal substrate localization limit relaxing the assumption keeps the fidelity the same while the speed gets reduced. And this reduction is greater for higher substrate concentrations. We therefore expect a shift in the Pareto front when going to the high substrate concentration limit, as is illustrated in Appendix 5—figure 2a. To get more intuition about the effect of this shift caused by tuning the amount of substrates, we consider the effective number of proofreading realizations at half-maximum speed (n50) and study how this number changes as a function of the fraction of enzymes bound (), which increases monotonically with as suggested by Equation S114. Appendix 5—figure 2b shows this dependence. As can be seen, n50 reduces roughly linearly with ; for example, if 10% of the enzymes are bound, then a 10% reduction in n50 is expected. This suggests that as long as the fraction of bound enzymes is low, our findings related to the Pareto front made under the assumption will generally hold true.
2. Effects that relaxing the assumption has on fidelity in a weak substrate gradient setting
In this section, we study how accounting for the spatial distribution of free enzymes affects our results on the model’s fidelity in the setting where substrates have a finite localization length scale . In this setting, Equations (1–3) (in the main text) describing the system’s dynamics become a system of nonlinear equations, which we solve at steady state using numerical methods.
An example curve of how fidelity changes with tuning diffusion time scale in a finite setting is shown in Appendix 5—figure 3. As expected, the nonuniform free enzyme profile leads to a reduction in fidelity. This reduction is not significant when diffusion is relatively fast as in that case the free enzyme profile manages to flatten out rapidly. The reduction is not significant also in the very slow diffusion limit where binding events that lead to production primarily take place in the proximity of the activation region and hence, the nonuniform profile of free enzymes across the compartment has little impact on fidelity. The greatest reduction happens at intermediate diffusion time scales; in particular, when the system achieves its peak fidelity.
To quantify the extent of this highest reduction, we calculated the peak value of the effective number of proofreading realizations () for different free substrate amounts which regulate the fraction of bound enzymes (). The results obtained for different choices of are summarized in Appendix 5—figure 4. As can be seen, for the high substrate localization case (), there is a roughly linear dependence between and . The initial decrease in with growing is even slower when substrates are less tightly localized ().
Taken together, these results suggest that if the substrate concentration is low enough to leave most of the enzymes unbound, then our proposed scheme will proofread efficiently. And this requirement on substrate amount will be further relaxed if diffusion is fast, or if substrates are not very tightly localized.
Appendix 6
Proofreading on a kinase/phosphatase-induced gradient
In this section, we introduce the mathematical modeling setup for the kinase/phosphatase-based gradient formation scheme and describe how its fidelity is calculated numerically. In the end, we discuss the energetics of setting up the substrate concentration gradient and link our calculations to the lower bounds on energy cost obtained earlier in Appendix 2.
1. Setup and estimation of fidelity
In the analysis thus far, we have imposed a gradient of free substrates and analyzed the proofreading capability of an enzyme acting on this gradient. In a living cell, gradients themselves are maintained by active cellular processes. However, the action of the enzyme – that is, binding a substrate in one spatial location, diffusing away, and releasing the substrate elsewhere – can destroy the gradient, and thereby lead to a loss of proofreading. Here, we analyze the consequences of free substrate depletion and gradient flattening caused by the enzyme.
We model the formation of a substrate gradient by a combination of localized activation and delocalized deactivation. We suppose that substrates can exist in phosphorylated or dephosphorylated forms, and that only the phosphorylated form is capable of binding to the enzyme. The substrates are phosphorylated by a kinase with rate s−1, and dephosphorylated by a phosphatase with rate s−1. Crucially, we assume that phosphatases are found everywhere in the domain of size μm (a typical length scale in a eukaryotic cell), while kinases are localized to one end of the domain (at ), as may occur naturally if kinases are bound to one of the membranes enclosing the domain.
The minimal dynamics of phosphorylated substrates and enzyme–substrate complexes is then given by
augmented by the boundary conditions
Here, we have supposed that the densities of free enzymes, dephosphorylated substrates, and phosphatases are fixed and uniform, and have absorbed them into the relevant rate constants (, , and , respectively). For simplicity, we have also assumed that the free substrates and enzyme–substrate complexes have the same diffusion coefficient μm2/s. We note that accounting for distinct diffusivities of phosphorylated and unphosphorylated substrate forms (Kholodenko, 2009) would affect the speed, while accounting for the slower diffusion of the enzyme–substrate complex would alter the estimates of both speed and fidelity of the model. One or several of these effects can be considered when studying a specific biological system where these microscopic details are known.
We numerically solve Equations S118 and S119 at steady state to obtain the concentration profiles. First, the equations of dynamics are made dimensionless by settings units of length and time by L () and (), respectively. At steady state, the dimensionless equations read
with boundary conditions
where concentrations have been rescaled as , and kinetic rates as .
We discretize the steady state equations on a grid with spacing , approximating the second derivative as
This is ill-defined at the boundaries and , which is addressed by incorporating the boundary conditions. For illustration, consider the left boundary, , and suppose that our domain included also a point at . Then, we could approximate the boundary condition by a centred difference scheme, and solve out for the fictional point at , namely,
which, when inserted into Equation S122, specifies at , that is,
For the boundary at the right () as well as for the boundary conditions for , we similarly implement no-flux boundary conditions. After discretizing, Equation S120 can then be written in a matrix form as
where , are column vectors of the nondimensionalized concentration profiles evaluated at the spatial grid points, that is, . Solving these matrix equations yields
We compute Equation S125 numerically for two substrates: a cognate (‘R’) and a non-cognate (‘W’), which differ in their off-rates ( and , respectively). Having the density profiles, the fidelity of the model becomes . We calculate the fidelity for different choices of the first–order rate of enzyme–substrate binding (); this may be thought of as varying the concentration of free enzyme in the cell. The results are shown in Figure 5 of the main text.
2. Energy dissipation
In Appendices 2.1 and 2.3, we estimated lower bounds on the minimum power that needs to be dissipated in order to counter the homogenizing effect that enzyme activity and substrate diffusion respectively have on localized substrate profiles. Here, we calculate the energy dissipation required to run the kinase/phosphatase-based mechanism and compare it with these lower bounds estimated earlier.
Let us assume that phosphorylation and dephosphorylation reactions by kinases and phosphatases are nearly irreversible with associated free energy costs of and per reaction, respectively. The net rate at which active substrates get dephosphorylated is and it needs to be identical to the net phosphorylation rate of inactive substrates in order for to remain constant. With the costs of each reaction known, we can write the rate of energy dissipation as
To gain analytical intuition, we first consider the case where the enzyme activity is very low, so that the kinase/phosphatase–based mechanism maintains an exponential profile of active substrates with a decay length scale . Expressing the rate of phosphorylation in terms of and (i.e., ), and substituting it into Equation S126, we obtain
Comparing this result with the lower dissipation bound found earlier (Equation S43), we can note the presence of an extra factor . Since the free energy consumption during ATP hydrolysis is , we can say that the power dissipated by the kinase/phosphatase system for setting up an exponential gradient surpasses the lower limit necessary for counteracting diffusion roughly by an order of magnitude.
Next, we explore the energetics of the kinase/phosphatase-based mechanism in the context of the power–fidelity trade-off. Our study of the trade-off in Figure 4 of the main text was performed under the assumption that substrate profiles were exponentially decaying in the entire spatial domain. In Appendix 6—figure 1a, we show the trade-off curves obtained under this assumption and compare them with the trade-off curve for the kinase/phosphatase-based mechanism that arises in response to changing the substrate localization by tuning . As can be seen, the predicted lower bound (sum of the minimum powers needed to counteract the enzyme action and substrate diffusion) is roughly an order of magnitude lower than the total dissipation of the mechanism, and this difference increases with higher fidelity.
Note, however, that the assumption about an exponential substrate localization is not generally valid for the kinase/phosphatase-based mechanism because substrates can be deposited in low–concentration regions and not get immediately dephosphorylated (Appendix 6—figure 1c). We therefore refine our lower bounds on the dissipated power by estimating them numerically using their generic definitions, namely, Equation S30 for counteracting enzymatic action, and Equation S42 for counteracting substrate diffusion. These refined estimates suggest a factor of ∼10 difference between the total cost and its lower bound consistently across a wide region of the trade-off curve. This means that substrate gradient maintenance through practically irreversible phosphorylation and dephosphorylation reactions has low energetic efficiency for doing spatial proofreading, which, however, may be sustainable depending on the energy budget of the cell.
Data availability
All scripts used to generate the data for making the plots are provided in supporting files.
References
-
Gradients in the self-organization of the mitotic spindleTrends in Cell Biology 16:125–134.https://doi.org/10.1016/j.tcb.2006.01.005
-
Kinase- and phosphatase-anchoring proteins: harnessing the dynamic duoNature Cell Biology 4:E203–E206.https://doi.org/10.1038/ncb0802-e203
-
Spatial gradients of cellular phospho-proteinsFEBS Letters 457:452–454.https://doi.org/10.1016/S0014-5793(99)01058-3
-
Mechanisms of tail-anchored membrane protein targeting and insertionAnnual Review of Cell and Developmental Biology 33:417–438.https://doi.org/10.1146/annurev-cellbio-100616-060839
-
Thermodynamic constraints on kinetic proofreading in biosynthetic pathwaysBiophysical Journal 31:333–358.https://doi.org/10.1016/S0006-3495(80)85063-6
-
Quality control in the endoplasmic reticulumNature Reviews Molecular Cell Biology 4:181–191.https://doi.org/10.1038/nrm1052
-
A nuclear F-actin scaffold stabilizes ribonucleoprotein droplets against gravity in large cellsNature Cell Biology 15:1253–1259.https://doi.org/10.1038/ncb2830
-
T-cell receptor binding kinetics in T-cell development and activationExpert Reviews in Molecular Medicine 3:1–17.https://doi.org/10.1017/S1462399401002502
-
Mathematical and computational models of immune-receptor signallingNature Reviews Immunology 4:445–456.https://doi.org/10.1038/nri1374
-
Minimum energetic cost to maintain a target nonequilibrium statePhysical Review E 95:042102.https://doi.org/10.1103/PhysRevE.95.042102
-
Diffusion control of protein phosphorylation in signal transduction pathwaysBiochemical Journal 350:901–907.https://doi.org/10.1042/bj3500901
-
Four-dimensional organization of protein kinase signaling cascades: the roles of diffusion, endocytosis and molecular motorsJournal of Experimental Biology 206:2073–2082.https://doi.org/10.1242/jeb.00298
-
Cell-signalling dynamics in time and spaceNature Reviews Molecular Cell Biology 7:165–176.https://doi.org/10.1038/nrm1838
-
Spatially distributed cell signallingFEBS Letters 583:4006–4012.https://doi.org/10.1016/j.febslet.2009.09.045
-
DNA replication fidelityJournal of Biological Chemistry 279:16895–16898.https://doi.org/10.1074/jbc.R400006200
-
The energy-speed-accuracy tradeoff in sensory adaptationNature Physics 8:422–428.https://doi.org/10.1038/nphys2276
-
The cost of sensitive response and accurate adaptation in networks with an incoherent type-1 feed-forward loopJournal of the Royal Society Interface 10:20130489.https://doi.org/10.1098/rsif.2013.0489
-
Non-vesicular lipid transport by lipid-transfer proteins and beyondNature Reviews Molecular Cell Biology 11:739–750.https://doi.org/10.1038/nrm2971
-
Discriminatory proofreading regimes in nonequilibrium systemsPhysical Review X 4:021016.https://doi.org/10.1103/PhysRevX.4.021016
-
Subcellular mRNA localisation at a glanceJournal of Cell Science 127:2127–2133.https://doi.org/10.1242/jcs.114272
-
Reducing intrinsic biochemical noise in cells and its thermodynamic limitJournal of Molecular Biology 362:387–392.https://doi.org/10.1016/j.jmb.2006.07.068
-
Fidelity of aminoacyl-tRNA selection on the ribosome: kinetic and structural mechanismsAnnual Review of Biochemistry 70:415–435.https://doi.org/10.1146/annurev.biochem.70.1.415
-
Thermodynamics of error correctionPhysical Review X 5:041039.https://doi.org/10.1103/PhysRevX.5.041039
-
Signaling cascades as cellular devices for spatial computationsJournal of Mathematical Biology 58:35–55.https://doi.org/10.1007/s00285-008-0162-6
-
The role of proofreading in signal transduction specificityBiophysical Journal 82:2928–2933.https://doi.org/10.1016/S0006-3495(02)75633-6
-
RNA polymerase fidelity and transcriptional proofreadingCurrent Opinion in Structural Biology 19:732–739.https://doi.org/10.1016/j.sbi.2009.10.009
Article and author information
Author details
Funding
James S. McDonnell Foundation
- Kabir Husain
Simons Foundation
- Arvind Murugan
John Templeton Foundation
- Rob Phillips
National Institute of General Medical Sciences
- Rob Phillips
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank Anatoly Kolomeisky, Shu-ou Shan and Erik Winfree for insightful discussions, Soichi Hirokawa and Avi Flamholz for providing useful feedback on the manuscript. We also thank Alexander Grosberg whose idea of a compartmentalized ‘rotary demon’ motivated the development of our model. This work was supported by the NIH Grant 1R35 GM118043-01, the John Templeton Foundation Grants 51250 and 60973 (to RP), a James S. McDonnell Foundation postdoctoral fellowship (to KH), and the Simons Foundation (AM).
Copyright
© 2020, Galstyan et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 5,826
- views
-
- 516
- downloads
-
- 17
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Computational and Systems Biology
- Physics of Living Systems
Planar cell polarity (PCP) – tissue-scale alignment of the direction of asymmetric localization of proteins at the cell-cell interface – is essential for embryonic development and physiological functions. Abnormalities in PCP can result in developmental imperfections, including neural tube closure defects and misaligned hair follicles. Decoding the mechanisms responsible for PCP establishment and maintenance remains a fundamental open question. While the roles of various molecules – broadly classified into ‘global’ and ‘local’ modules – have been well-studied, their necessity and sufficiency in explaining PCP and connecting their perturbations to experimentally observed patterns have not been examined. Here, we develop a minimal model that captures the proposed features of PCP establishment – a global tissue-level gradient and local asymmetric distribution of protein complexes. The proposed model suggests that while polarity can emerge without a gradient, the gradient not only acts as a global cue but also increases the robustness of PCP against stochastic perturbations. We also recapitulated and quantified the experimentally observed features of swirling patterns and domineering non-autonomy, using only three free model parameters - rate of protein binding to membrane, the concentration of PCP proteins, and the gradient steepness. We explain how self-stabilizing asymmetric protein localizations in the presence of tissue-level gradient can lead to robust PCP patterns and reveal minimal design principles for a polarized system.
-
- Physics of Living Systems
Many proteins have been recently shown to undergo a process of phase separation that leads to the formation of biomolecular condensates. Intriguingly, it has been observed that some of these proteins form dense droplets of sizeable dimensions already below the critical concentration, which is the concentration at which phase separation occurs. To understand this phenomenon, which is not readily compatible with classical nucleation theory, we investigated the properties of the droplet size distributions as a function of protein concentration. We found that these distributions can be described by a scale-invariant log-normal function with an average that increases progressively as the concentration approaches the critical concentration from below. The results of this scaling analysis suggest the existence of a universal behaviour independent of the sequences and structures of the proteins undergoing phase separation. While we refrain from proposing a theoretical model here, we suggest that any model of protein phase separation should predict the scaling exponents that we reported here from the fitting of experimental measurements of droplet size distributions. Furthermore, based on these observations, we show that it is possible to use the scale invariance to estimate the critical concentration for protein phase separation.