Cellular compartmentalisation and receptor promiscuity as a strategy for accurate and robust inference of position during morphogenesis
Abstract
Precise spatial patterning of cell fate during morphogenesis requires accurate inference of cellular position. In making such inferences from morphogen profiles, cells must contend with inherent stochasticity in morphogen production, transport, sensing and signalling. Motivated by the multitude of signalling mechanisms in various developmental contexts, we show how cells may utilise multiple tiers of processing (compartmentalisation) and parallel branches (multiple receptor types), together with feedback control, to bring about fidelity in morphogenetic decoding of their positions within a developing tissue. By simultaneously deploying specific and nonspecific receptors, cells achieve a more accurate and robust inference. We explore these ideas in the patterning of Drosophila melanogaster wing imaginal disc by Wingless morphogen signalling, where multiple endocytic pathways participate in decoding the morphogen gradient. The geometry of the inference landscape in the high dimensional space of parameters provides a measure for robustness and delineates stiff and sloppy directions. This distributed information processing at the scale of the cell highlights how local cell autonomous control facilitates global tissue scale design.
Editor's evaluation
The manuscript introduces a compelling theoretical framework to investigate architectures of signal processing. The predictions of the computational model have been convincingly validated with data from fly wing precursor tissues. The work is important and will be highly valuable to biological physicists and developmental biologists interested in morphogenesis and pattern formation.
https://doi.org/10.7554/eLife.79257.sa0Introduction
Precise positioning of cell fates and cell fate boundaries in a developing tissue is of vital importance in ensuring a correct developmental path (reviewed in Tkačik and Gregor, 2021; Wolpert, 2016). The required positional information is often conveyed by concentration gradients of secreted signalling molecules, or morphogens (reviewed in Tabata and Takei, 2004; Briscoe and Small, 2015). Typically, a spatially varying input morphogen profile is translated into developmentally meaningful transcriptional outputs. Morphogen profile measurements, across several signalling contexts, show that the gradients are inherently noisy Houchmandzadeh et al., 2002; Gregor et al., 2007a; Kicheva et al., 2007; Bollenbach et al., 2008; Zagorski et al., 2017. However, precision of the signalling output should be robust to inherent genetic or environmental fluctuations in the concentrations of the ligands and receptors engaged in translating the positional information. For example, the noisy profile of the morphogen Bicoid (Bcd) that activates hunchback (hb) in the early Drosophila embryo Gregor et al., 2007a; Gregor et al., 2007b , and the expression of gap genes that activate pair-rule genes Dubuis et al., 2013; Petkova et al., 2019 result in cell fate boundaries that are positioned to a remarkable accuracy of about one cell’s width. This points to a local, cell autonomous morphogenetic decoding that is precise and robust to various sources of noise Kerszberg and Wolpert, 2007; Kerszberg, 2004; Jaeger et al., 2004.
Cell autonomous decoding of noisy morphogen profiles includes reading of morphogen concentration, followed by cellular processing, finally leading to inference in the form of transcriptional readout. Several strategies have been proposed to ensure precision in output (reviewed in Barkai and Shilo, 2009; Lander et al., 2009): feedbacks such as self-enhanced morphogen degradation Eldar et al., 2002; Eldar et al., 2003, spatial and temporal averaging Gregor et al., 2007a, use of two opposing gradients McHale et al., 2006, pre-steady state patterning Bergmann et al., 2007 and serial transcytosis Bollenbach et al., 2007.
Most cell signalling systems have regulatory mechanisms that fine-tune signalling by controlling ligand-specific receptor interactions Rogers and Schier, 2011. Ligands such as TGF /BMP Mueller and Nickel, 2012, Jiang and Cong, 2016, D’Souza et al., 2008, show promiscuous interactions with different receptors. Chen and Schier, 2002; Sick et al., 2006 or sequestering components within the extracellular matrix Marjoram and Wright, 2011 or interactions with binding receptors such as heparin sulphate proteoglycans (HSPGs) Baeg et al., 2001; Baeg et al., 2004; Yan and Lin, 2009 can control availability of the ligand. Additionally, the multiple endocytic pathways that operate at the plasma membrane can control the extent of signalling Bökel and Brand, 2014; Di Fiore and von Zastrow, 2014. These examples argue for distributed information processing within the cell.
In this paper, we show how cellular compartmentalisation, a defining feature of multicellularity, provides a compelling realisation of such distributed cellular inference. We show that compartmentalisation together with multiple receptors, receptor promiscuity and feedback control, ensure precision and robustness in positional inference from noisy morphogen profiles during development. Compartments associated with specific chemical (e.g. lipids, proteins/enzymes) and physical (e.g. pH) environments, have been invoked as regulators of biochemical reactions during cellular signalling and development Ellisdon and Halls, 2016; Omerovic et al., 2007; Omerovic and Prior, 2009; Shilo and Schejter, 2011; Bökel and Brand, 2014. Deploying promiscuous receptors against a morphogen, in addition to its specific receptor, is a strategy to buffer variations in morphogen levels. These observations provide the motivation for a general conceptual framework for morphogenetic decoding based on a multi-tiered, multi-branched information channel. While our framework has broader applicability, we will, for clarity, use the terminology of Wingless signalling in Drosophila wing imaginal disc Hemalatha et al., 2016.
Conceptual framework and quantitative models
We pose the task of morphogenetic decoding as a problem in local, cell autonomous inference of position from a morphogen input (Figure 1), where each cell acts as an information/inference channel with the following information flow:
‘reading’ of the morphogen input by receptors on the cell surface,
‘processing’ by various cellular mechanisms such as receptor trafficking, secondary messengers, feedback control, and
‘inference’ of the cell’s position in the form of a transcriptional readout.
At a phenomenological level, reading of the morphogen input is associated with the binding of the morphogen ligand to various receptors with varying degree of specificity, leading to the notion that the information channel describing positional inference must possess multiple branches. Furthermore, the multiple processing steps associated with compartmentalisation of cellular biochemistry and/or signal transduction modules, for example phosphorylation states, provide the motivation for invoking multiple tiers in the channel architecture. At an abstract level, one may think of the branch-tier architecture of the cellular processing as a bipartite Markovian network/graph Hartich et al., 2014, with a fast direction (involving multiple branches) consisting of ligand-bound and unbound states along with chemical state changes, and a slower direction (involving multiple tiers) consisting of intracellular transport, fission and fusion, characterised by energy-utilising processes or a flux imbalance. A general developmental context with multiple morphogens may involve several such bipartite Markov networks/graphs with different receptors (or branches) in parallel. Some of these receptors could be shared between different morphogens. We refer to signalling receptors as those which transduce a signal upon binding to their specific morphogen ligand and non-signalling receptors as those that participate in the signalling pathway without directly eliciting a signalling response. At the end of processing, each individual cell may pool information from the various branches for the final inference of position, i.e. a transcriptional readout (Figure 2).
The task of achieving a precise inference is complicated by the noise in morphogen input arising from both production and transport processes, and by the stochasticity of the reading and processing steps; thus the inference must be robust to the extrinsic and intrinsic sources of noise. The use of feedback control mechanisms is a common strategy to bring about robustness in the context of morphogen gradient formation and sensing Averbukh et al., 2017. Motivated by this, in Section ‘Quantitative models for cellular reading and processing’ we consider different feedback controls in conjunction with the tiers and branches. With these three elements to the channel architecture, the task of morphogenetic decoding can be summarised in the following objective.
Objective: |
Given a noisy ligand input distribution at position , i.e. , what are the requirements on the reading (number of receptor types and receptor concentrations) and processing steps (number of tiers and feedback type) such that the positional inference is precise and robust to extrinsic and intrinsic noise? |
Mathematical framework
Figure 1 describes information processing during development across a two dimensional tissue of nx, ny cells in and directions, respectively. The direction of morphogen gradient is taken to be along , with the morphogen source to the left of . Each cell is endowed with a chemical reaction network (CRN) with the same multi-tiered, multi-branched architecture with feedbacks described previously, that reads a noisy input (morphogen concentration) and produces an ‘output’ (biochemical ‘signal’) that is also noisy. Here, we choose to construct the noisy morphogen profile in the following manner: for a given position , cells along the -direction see different amounts of ligand coming from the same input distribution,
characterised parametrically by a mean and standard deviation . Experimental data can be fit to this distribution Equation 1 (or another distribution suitable for the specific experimental system) to obtain the parameters. Here, we consider an exponentially decaying mean and standard deviation .
Alternatively, one could choose a different parametrisation consistent with experimental observations for a morphogen profile with a monotonically decaying mean. The values of chosen for our analysis are listed in Table 1. The corresponding output distribution can be used to infer the cell’s position. Since we do not know the precise functional relationship between the output and inferred position, we invoke Bayes rule MacKay and Mac Kay, 2003, as in previous work Tkačik et al., 2015, to infer the cell’s position,
where and is the prior distribution which we take to be uniform over a tissue of unit length, . We quantify precision in the inference by the local inference error, . For each position , the inferred position of cells along the -direction is taken to be the maximum a posteriori estimate,
where we use to differentiate from the true position . From this, the local and average inference error can be computed.
where the average in Equation 6 is over cells in the -direction. The logic behind this definition of the inference error is that development of the tissue relies on the precision in the inference of cells’ positions throughout the tissue. However, there may be tissue developmental contexts, where only the positions of certain regions or cell fate boundaries need to be specified with any precision (as in the case of short-range morphogen gradients like Nodal Liu et al., 2022). The definition of inference error may be readily extended to incorporate such specifications (see Section ‘Choice of objective function’).
We have been motivated to use the maximum a posteriori (MAP) estimate in Equation 5 by its successful use in previous studies in Drosophila embryo Dubuis et al., 2013; Petkova et al., 2019; Tkačik et al., 2015 and, more importantly, that it is a local estimate not requiring the computation of (which is independent of ). We have checked that a different definition of the inference error, which does not use the MAP estimate and takes into account the entire distribution ,
leads to the same qualitative results.
Quantitative models for cellular reading and processing
In order to calculate the probability of the inferred position given the output and hence the inference error , one needs to know the prior and the input-output relation giving rise to the output distribution in Equation 4. While a uniform prior may be justified by a homogeneous distribution of cells in the developing tissue at the stage considered, the input-output relation needs to be developed using a specific model based on the general channel design principles described previously. Thus, we will take each cell to be equipped with a chemical reaction network (CRN) that has up to two receptor types both of which bind the ligand on the cell surface but only one is signalling competent Hemalatha et al., 2016; Tabata and Takei, 2004. This latter aspect breaks the symmetry between the receptor types and hence the branches, a point that we will revisit in Section ‘Asymmetry in branched architecture: promiscuity of non-signalling receptors’. In multi-tier architectures, the bound states of both the receptors are internalised and shuttled through several compartments. The last compartment allows for a conjugation reaction between the two receptors (as in the case of Wingless and Dpp Hemalatha et al., 2016; Zhu et al., 2020). The signalling states, defined by all the bound states of the signalling receptor, contribute to the output. Within this schema, we consider control mechanisms on the surface receptor concentrations and in the chemical reactions downstream to binding on the surface (i.e. on internalisation, shuttling, conjugation, etc). We formulate the control on processing steps as a feedback/feedforward regulation from one of the signalling species in the CRN. On the other hand, the control of surface receptors is considered in the form of an open-loop control by allowing receptor profiles to vary within certain bounds, as described below. The key parameters are chemical rate parameters describing the rates of various reactions in the CRN, receptor parameters describing the receptor concentration profiles, feedback topology in the CRN that is a combination of actuator and rate under regulation, control parameters describing the strength and sensitivity of the feedback/feedforward. With these parameters specified, an input-output relation, calculated as a tier-wise weighted sum of all signalling states, can then be used to infer the cell’s position by Equation 4.
Cellular Reading via surface receptors
In the framework described previously, we consider the morphogen ligand as an external input to the receiving cells, outside the cellular information processing channel. The signal and noise of this external input are captured by the distribution Equation 1. This implicitly assumes that there is no feedback control from the output to the ligand input, that is no ‘sculpting’ of the morphogen ligand profile. We revisit this point in the Discussion. Given a distribution of the morphogen input, we address the local, cell autonomous morphogenetic decoding that allows the cells to tune their reading dynamically.
We subject the local, cellular reading to an open-loop control on total (ligand bound plus unbound) surface availability of the signalling and non-signalling receptors. This implies that for each evaluation of inference error within the optimisation routine (see Section ‘Performance of the Channel Architectures’), the local surface receptor levels are held constant in time through a chemostat (see Appendix 1). In our analysis, we consider a family of monotonic (increasing or decreasing in and independent of ) receptor profiles, which for convenience we take to be of the Hill form (Figure 3), that is either
The range of values for these parameters considered in the numerical analysis are listed in Table 1. Therefore, when considering to be monotonically increasing in , we parametrise it with . It follows that in a one-branch channel, there are two possibilities: while in a two-branch channel, there are a total of four possibilities: . This allows us to simulate the ‘reading’ step performed by the cells (see Figure 1b).
Note that we are not fixing a receptor profile but taking it from a class of monotonic profiles (including a uniform profile), over which we vary to determine the optimal inference (see Section ‘Performance of the Channel Architectures’ below). Further, in the optimisation scheme (Section ‘Performance of the Channel Architectures’), we allow the receptor concentrations to vary over the space of all monotonically increasing, decreasing or flat profiles, and do not encode the positional information in the receptor profiles. Monotonicity implicitly assumes a spatial correlation in the receptor concentrations across cells – we return to this point in the Discussion.
Dynamics of processing in a single-tier channel
In a single tier channel, all processing is restricted to the cell surface. We represent the bound state of the signalling receptor as and that of the non-signalling receptor as . The conjugated state is represented by . The CRN for such a system with one and two branches is shown in Figure 4a. Rates associated with these reactions are listed in Table 1. The differential equations that describe the binding, unbinding, conjugation, splitting and degradation reactions of the receptors are given by
The steady-state output , defined as the sum of all the ligand-bound signalling states, is given by . Note that to describe the 1-branch system, we simply set all rates to zero.
Dynamics of processing in a multi-tier channel
In a multi-tiered channel, the receptors go through additional steps of processing before generating an output. We represent the bound state of a receptor in -th tier of the first branch as , that of the second branch as , and the conjugate species that forms in the last -th tier as . The CRN for such a system with tiers is shown in Figure 4b. Rates associated with these reactions are listed in Table 1. The differential equations that describe the binding, unbinding, trafficking, recycling, conjugation, splitting and degradation reactions of the receptors are given by
The output, realised from all the ligand-bound signalling states, now becomes at steady state with wk, such that , representing the weight allotted to the tier (according to the mean residence time in the tier, for instance). For details regarding the setup of Equations 10–17 refer to Appendix 1. These differential equations for single-tiered and multi-tiered systems are to be augmented by stochastic contributions from both extrinsic and intrinsic sources. Extrinsic noise is a consequence of stochasticity of the ligand concentration presented to the cell, , and enters the equations as a source term. On the other hand, intrinsic noise is a consequence of copy-number fluctuations in the CRNs that characterise the channel, and are treated using chemical master equations (CMEs) Sengupta, 2008.
Feedback Control
We consider all rates in the CRN, except the ligand binding and unbinding rates, as potentially under feedback regulation. Any chemical rate that is under feedback control actuated by the node is modelled as.
with r0 as the reference value of the chemical rate in the absence of feedback. The range of values for amplification , feedback sensitivity and feedback strength are listed in Table 1. Figure 5 shows the different categories of possible feedback controls. We discuss the heuristics underlying the feedback controls in Appendix 2.
Performance of the channel architectures
With the model in place, we address the Objective discussed previously, by studying the performance of different channel architectures, i.e. number of tiers and branches, and feedback topology. We define a vector belonging to a parameter space of the channel parameters related to chemical rates, receptor profiles and feedback (see Table 1). While the chemical rates and feedback parameters are the same in all cells, the receptor profile parameters help define the receptor concentrations at each cell position . For a given morphogen input distribution and a channel architecture under consideration, the optimisation can be stated as
and implemented by the following algorithm, the details of which are presented in Appendix 3.
Optimisation scheme |
---|
|
Results
As discussed previously, cells of a developing tissue face both extrinsic as well as intrinsic sources of noise. We first look at the issue of extrinsic noise in the morphogen input (described by Equation 1). The output then is a deterministic function of the morphogen input and parameters of the channel i.e. receptor concentrations, feedback topology, chemical rates and feedback parameters. The range of values considered for these parameters is listed in Table 1, consistent with the timescale separation between the rates of chemical reactions and transport as discussed in Section ‘Conceptual framework and quantitative models’. We apply the numerical analysis and the optimisation algorithm outlined in Section ‘Performance of the Channel Architectures’ to determine the design characteristics of ‘reading’ (receptor profiles) and ‘processing’ (tiers and feedback control) steps. Later, we check how channels, optimised in the reading and processing steps to deal with extrinsic noise, respond to intrinsic noise and what roles the elements of channel architecture play there. All the essential results are presented in this section and the reader may look up the appendices for further details.
Branched architecture with multiple receptors provides accuracy and robustness to extrinsic noise
We begin with architectures comprising single-tiered channels with one and two branches. Such architectures are similar in design to the classic picture of ligand-receptor kinetics Lauffenburger and Linderman, 1996; Alberts et al., 2017, but also to the self-enhanced degradation models for robustness of morphogen gradients Eldar et al., 2003. Before we proceed, it helps to recall a simple heuristic regarding signal discrimination. Appendix 4—figure 1 illustrates that precision in positional inference requires both that the output variance at a given position be small and that the mean output at two neighbouring positions be sufficiently different.
Let us first consider a minimal architecture of a one-tier one-branch channel without feedback control on any of the reaction rates. The output of this channel, here , is a monotonic, saturating function of the input, with the surface receptor concentration setting the asymptote. As in Appendix 5—figure 1a, if the receptor concentrations decrease with mean ligand input, i.e. increases with distance from source ( in Figure 3a; ), the outputs for different input ranges overlap significantly. On the other hand, if the receptor concentrations increase with mean input ( in Figure 3b), the outputs overlap to a lesser degree (see Appendix 5—figure 1b). Thus within this minimal architecture, the inference error is optimised when the receptor concentrations increase with the mean input.
Introducing a feedback in this one-tier one-branch architecture, either on receptor levels or degradation rate, only partially reduces the inference errors (Figure 6a, c). As seen in Figure 6d, this is because the surface receptor concentration sets both the asymptote and the steepness of the input-output functions, resulting in significant overlaps between outputs at neighbouring positions. The receptor control introduces a competition between robustness of the output to input noise and sensitivity to systematic changes in the mean input (see Appendix 4).
Including a non-signalling receptor via an additional branch in the channel architecture opens up several new possibilities of feedback controls, in addition to providing an extra tuning variable. Now, as opposed to the one-tier one-branch case, an inter-branch feedback control (Figure 7a) results in an input-output relation with a sharp rise followed by a saturation (Figure 7d). By appropriately placing the receptors at spatial locations that receive different input, as shown by black arrow in Figure 7d, one can cleanly separate out the cellular outputs in neighbouring positions. For a detailed description see Appendix 6. This mitigates the above-mentioned tension between robustness to input noise and sensitivity to systematic changes in the mean input to a considerable extent (see Appendix 4—figure 2).
As seen in Figure 7c, the two-branch architecture with inter-branch feedback leads to a dramatic reduction in the inference errors, to reach one cell’s width precision at most spatial locations in the tissue.
We would like to highlight two unexpected features of the optimised two-branch architecture. (i) The signalling and non-signalling receptors present opposing optimal profiles – a consequence of the negative inter-branch feedback. (ii) The optimal non-signalling receptor decreases away from the source, indicating that the non-signalling receptor ‘reads’ the ligand input, while the signalling receptor increases away from the source, buffering the noise in the output (Figure 7). A heuristic understanding of the opposing optimal receptor profiles is provided in Appendix 7. In contrast, in the one-branch architectures, it is the signalling receptor that does the reading and buffering.
Tiered architecture with compartmentalisation adds robustness to intrinsic noise
We next investigate the effects of addition of tiers (compartments) on the inference errors. Our optimisation shows there are two distinct optimised two-tier two-branch architectures, one with inter-branch feedback on the internalisation rate of the non-signalling receptors and the other on the conjugation rate , that have comparable inference errors (Figure 8b, c). Both the receptor profiles and the input-output relations of these two optimised two-tier two-branch channels are qualitatively similar (Appendix 8—figure 1).
It would seem that addition of further tiers, that is more than two, would lead to further improvement in the inference. However, in both these optimised architectures, addition of tiers leads only to a marginal reduction of inference errors (Figure 8a) while invoking a cellular cost. Of course, extensions of our model that involve modification of the desired output could favour the addition of more tiers. For instance, additional tiers could facilitate signal amplification or improvement in robustness to input noise through an increase in signal-to-noise ratio (SNR) Stoeger et al., 2016. Further, by making the output a multi-variate function of the tier index (compartment identity) one can multitask the various cellular outcomes (as in Ras/MAPK signalling Fehrenbacher et al., 2009 or with GPCR compartmentalisation Ellisdon and Halls, 2016).
So far, we have only considered noise due to fluctuations in the morphogen profile, that is extrinsic noise. Given that we are considering a distributed channel, intrinsic noise due to low copy numbers of the reacting species in the CRN will have a significant influence on the inference. As discussed in Section ‘Conceptual framework and quantitative models’ and Appendix 3, we solve the stochastic chemical master equations (CMEs) to compute the output distributions and the positional inference. It is here that we find that the addition of tiers contribute significantly to reducing inference errors. A comparison of the one-tier two-branch and two-tier two-branch channel architectures (Figure 9a and b) optimised for extrinsic noise, shows that in the presence of intrinsic noise, additional tiers lead to significantly lower inference errors (Figure 9c). The large inference errors seen in the one-tier one-branch channel in the presence of intrinsic noise, can be traced to the instabilities of steady-state trajectories of the two signalling species and driven by the non-linear feedback (Figure 9d–f). This effect is more prominent for larger values of ligand concentrations, that is closer to the source at . On the other hand, we find that in the two-tier two-branch architecture (Figure 9g–i), the fluctuations in the signalling species are more tempered, the inter-branch feedback leads to a mutual damping of the fluctuations of the signalling species from the two branches. Details of this heuristic argument appear in Appendix 9.
In summary, we find that the nature of the channel architectures play a significant role in robustness of morphogenetic decoding to both extrinsic and intrinsic sources of noise. Of the three elements to the channel architecture - branches, tiers, and feedback control, we find that a branched architecture can significantly reduce inference errors by employing an inter-branch feedback and a control on its local receptor concentrations. For this, the receptor concentration profiles required to minimise inference errors are such that the concentration of signalling (non-signalling) receptor should decrease (increase) with mean morphogen input. Crucially, in the absence of feedback, performance of the channel diminishes and the optimised receptor profiles both decrease away from the source (Appendix 10—figure 1). Further, we show in Appendix 11—figure 1 that having uniform profiles for the signalling and non-signalling receptors, with or without uncorrelated noise, fares poorly in terms of inference capability. This provides a posteriori justification for the monotonicity in receptor profiles. Addition of tiers can help in further bringing down inference errors due to extrinsic noise, but with diminishing returns. An additional tier, however, does provide a buffering role for feedback when dealing with intrinsic noise. We note that these qualitative conclusions remain unaltered for different morphogen input characteristics, that is input noise and morphogen decay lengths (see Appendix 12).
Asymmetry in branched architecture: promiscuity of non-signalling receptors
Before comparing the theoretical results with experiments, we comment on the implications for the cellular control of the signalling and non-signalling receptors. In the two-branch architecture, the symmetry between the signalling and non-signalling receptors is broken by the inter-branch feedback and the definition of output , the latter taken to be a function only of the signalling states and (Section ‘Conceptual framework and quantitative models’, purple nodes in Figure 7a and Figure 8b and c). What are the phenotypic implications of this asymmetry? In Appendix 13—figure 1, we plot the contours of average inference errors in the plane around the optimal point. We compute the eigenvalues of the local curvature of around the optimal point (). The difference in the magnitudes of these eigenvalues, as discussed in Appendix 13, immediately describes stiff and sloppy directions Transtrum et al., 2015 along the and axes, respectively. This implies that while the signalling receptor is under tight cellular control, the control on the non-signalling receptor is allowed to be sloppy. A similar feature is observed in the contour plots for the robustness measure (defined as the ratio of coefficients of variation in the output to that in the input). Appendix 4—figure 3 shows that for any given input distribution, reduction in output variance requires a stricter control on , while the control on can be lax.
This sloppiness in the levels of non-signalling receptor would manifest at a phenotypic level in the context of multiple morphogen inputs as in the case of Drosophila imaginal disc Tabata and Takei, 2004. Participation of the same non-signalling receptor in the different signalling networks would imply its promiscuous interactions with all ligands. The signalling receptors, therefore, are specific for the various ligands while the non-signalling receptor, being promiscuous, is non-specific. This, as we see below, is the case with the Heparan sulfate proteoglycans (HSPGs) such as Dally and Dally-like protein (Dlp) that participate in the Wingless (Wg) and Decapentaplegic (Dpp) signalling networks Lin and Perrimon, 2000; Romanova-Michaelides et al., 2021.
Geometry of fidelity landscape
The above section and Appendix 13 motivate us to study the changes in the inference error upon perturbations of all the channel parameters. We therefore discuss the nature of optima in terms of the local geometry of the fidelity landscape around the optimum, and the geometry of the low inference error states. We work with the case of the optimised one-tier two-branch channel (shown in Figure 7a with optimum channel parameters listed in Table 2, Table 3) in presence of extrinsic noise.
To address the geometry of the local fidelity landscape around the optimum, we compute (i) percent changes in inference error due to perturbations in channel parameters (Figure 10a), and (ii) the eigenspectrum of the Fisher information metric (FIM, Figure 10b). The FIM is evaluated in the log-parameter space as Transtrum et al., 2015.
where, is the channel parameter vector, and are the indices of cells that run along the - and -directions. As shown in Figure 10a, we see that the inference error does not change significantly (up to 20% change with most parameters), that is it remains within . Varying the feedback strength , however, drives a much stronger deviation from the minimum. Similarly, as seen from the heat map (Figure 10b), eigenvectors with the larger eigenvalues (index 1–6) have an appreciable component of the feedback parameters . This implies that variation of the feedback parameters from the optimum would result in significant changes in the inferred positions. Perturbing conjugation and splitting rates simultaneously (see eigenvector 16) does not produce any notable change to the inferred positions (eigenvalue ). Further, perturbations to channel parameters other than the feedback parameters (eigenvectors 7–16) produce marginal changes in inferred positions.
Moving now from a local to global analysis of the fidelity landscape, we run the optimisation algorithm (Section ‘Performance of the Channel Architectures’) on the one-tier two-branch channel architecture with 216 space-filling initial points in the 16-dimensional parameter space of this architecture. We then define the low inference error states as those channel parameters that yield . This cutoff, which equals , corresponds to declaring as equivalent all the inference errors that lie between one and two cells’ widths. Consistent with the local analyses, we find that the frequency distribution of optimal feedback parameters is narrowly distributed about the global optimum (Figure 11a). As shown in Figure 11a, the parameters corresponding to forward and backward rates are skewed towards the upper and lower bounds of the allowed parameter range, respectively. We see that the optimal binding rates in the non-signalling branch (Figure 11a) are more broadly distributed across the permissible range than the optimal binding rates in the signalling branch, which are concentrated towards the upper bound of the permissible range. This again reflects the promiscuity of the non-signalling receptors as described in Section ‘Asymmetry in branched architecture: promiscuity of non-signalling receptors’. All other optimal parameters corresponding to degradation rates, minimum and maximum receptor values and steepness of the receptor profiles, show a very broad spread over this range (Appendix 14—figure 1). To explore the topography of the low inference error landscape, we evaluate the components of the ‘position vectors’ of these minima in the parameter space along the eigenvectors of the Hessian of , defined as
where stands for the entire morphogen profile and we have assumed a Euclidean metric. As shown in Figure 11b and c, components of the ‘position vector’ of the minima lie predominantly along the sloppy directions of the Hessian that is along the eigenvectors with small eigenvalues. This suggests that geometry of the low inference error landscape resembles a deep valley, which is shallow along the several sloppy directions and steep along the few stiff directions.
Choice of objective function
The objective function as defined in Equation 7 gave equal weight to inference errors at all positions along the tissue, driving the inference error to reduce at all positions simultaneously. In certain developmental contexts, the objective could be to partition the tissue into cell identity segments (reviewed in Briscoe and Small, 2015). In such a case, the partition boundaries would need to be sharp Gregor et al., 2007a that is only the errors at the segment boundaries would need to be minimised. We show that even with this choice of objective function, the qualitative results for the optimal channel architectures remain unaltered. We define the inference error for a tissue with segmented cell identities as.
where and denote the Kronecker-delta and Heaviside-theta functions respectively, is the position of the boundary between the i-th and (i+1)-th segments, and is a function that maps position (actual or inferred) to a segment, that is with as the total number of segments.
We optimise one-tier one-branch, one-tier two-branch and two-tier two-branch channel architectures for the inference error as defined in Equation 23 with and equally spaced boundaries located at positions along the -axis. As before, this optimisation suggests that an additional branch aids in reducing the inference errors due to extrinsic noise (compare Figure 12b and d), with similar opposing receptor profiles as in Section ‘Branched architecture with multiple receptors provides accuracy and robustness to extrinsic noise’. Tiers play only a moderate role in reducing the inference errors further in a two-branch channel (compare Figure 12d and f). However, just as with the previous objective function, an additional tier provides substantial robustness to intrinsic noise as shown in Figure 13c.
Experimental verification in the Drosophila Wg signalling system
The phenomenology of the morphogen reading and processing of Wg in the wing imaginal disc of Drosophila melanogaster Hemalatha et al., 2016 suggests a one-to-one mapping to the two-tier two-branch channel defined above, thus providing an ideal experimental system for a realisation of the ideas presented here (Figure 14a, b).
Wingless (Wg) is secreted by a line of cells (1–3 cells) at the dorso-ventral boundary and forms a concentration gradient across the receiving cells Neumann and Cohen, 1997. Receiving cells closer to the production domain show higher Wg signalling while those farther away have lower Wg signalling Neumann and Cohen, 1997. Several cell autonomous factors influence reading and processing of the morphogen Wg in the receiving cells. Binding of Wg to its signalling receptor, Frizzled-2 (DFz2), initiates signal transduction pathway and nuclear translocation of -catenin which further results in activation of Wg target genes (reviewed in Clevers and Nusse, 2012). In addition to the signalling receptor, binding receptors such as Heparin Sulphate Proteoglycans (HSPGs) – Dally and Dlp also contribute to Wg signalling Baeg et al., 2001; Franch-Marro et al., 2005. Further, the two receptors follow distinct endocytic pathways Hemalatha et al., 2016: while, DFz2 enters cells via the Clathrin Mediated Endocytic pathway (CME), Wg also enters cells independent of DFz2, possibly by binding to HSPGs, through CLIC/GEEC (CG) endocytic pathway. The two types of vesicles, containing Wg bound to different receptors, merge in common early endosomes Hemalatha et al., 2016. However, only DFz2 receptors in their Wg-bound state, both at the cell surface and early endosomes, are capable of generating a downstream signal leading to positional inference through a transcriptional readout Tsuda et al., 1999. This phenomenology is faithfully recapitulated in our two-tier two-branch channel architecture (Figure 14b) in which DFz2 and HSPG receptors play the role of the two branches. The conjugated state ‘Q’ represents a combination of the readings from the two branches, possibly realised by the co-receptors HSPGs that bind Kirkpatrick et al., 2006; Capurro et al., 2008 and present Hemalatha et al., 2016 diffusible ligands to signalling receptors (either on the cell surface or within endosomes).
Since an experimental measurement of positional inference error poses difficulties, we measure the cell-to-cell variation in the signalling output for a given position as a proxy for inference error (Appendix 4—figure 3). Larger the variation, higher is the inference error. This is calculated as coefficient of variation (CV, Appendix 15) in the output across cells in the -direction (Figure 14c).
Let us first discuss the results from the theoretical analysis. The optimised two-tier two-branch channel (Figure 14b) shows that the magnitude and the fluctuations in the coefficients of variation are small, with a slight increase with position (blue, Figure 14f). This is consistent with the low inference error associated with the optimised channel (Figure 8b). Upon perturbing this channel via removal of the non-signalling branch, the magnitude and fluctuations in the signalling output variation increases significantly (orange, Figure 14e). This qualitative feature of the coefficient of variation in the optimised two -tier two-branch channel is replicated in the Wg measurements of wild type cells.
In the experiments, we first established the method by determining the CV of a uniformly distributed signal, CAAX-GFP (expressed using ubiquitin promoter), and observed that the CV of CAAX-GFP is relatively uniform in , the distance from Wg producing cells (Figure 14d). In order to study the steady state distribution of Wg within a cell and within the endosomes, we performed a long endocytic pulse (1 hr) with fluorescently labelled antibody against Wg Hemalatha et al., 2016; Prabhakara et al., 2022. Following this, we estimated the CV of the Wg endocytic profile as a function of (Figure 14f, and Figure 14—figure supplement 1).
We assessed the CV of endocytosed Wg under two conditions: one, where the endocytic pulse of Wg is captured by the two branches and two tiers (control condition), and another, where we disengage one of the tiers by inhibiting the second endocytic pathway using a genetically expressed dominant negative mutant of Garz, a key player in the CG endocytic pathway Gupta et al., 2009. This perturbation has little or no effect on the functioning of the CME or the levels of the surface receptors that are responsible for Wg endocytosis (Hemalatha et al., 2016; Prabhakara et al., 2022). As predicted by the theory (Figure 14e), CV in the control shows a slight increase with position (Figure 14f) with fluctuations about the mean profile being small. In the perturbed condition, with the CG endocytic pathway disengaged, we find the CV shows a steeper increase with and has larger fluctuations about the mean profile.
In principle, the coefficient of variation of the output is affected by all the microscopic stochastic processes that intersect with Wg signalling network in the wing imaginal disc and in the ligand input. Therefore, one has to be careful about interpreting the changes in the coefficient of variation of the output, based on the such perturbation experiments. Notwithstanding, this qualitative agreement between theory and experiment is encouraging.
Discussion
In this paper, we have posed the problem of spatial patterning of cell fates in a developing tissue as a local, cell autonomous morphogenetic decoding that ensures precise inference of position, that is robust to extrinsic and intrinsic noise. We treat the cells as inference channels capable of reading and processing the morphogen input. We describe the architecture of the inference channels in terms of three elements: branches (number of receptor types), tiers (number of compartments) and feedbacks. We ask for properties of the inference channel architectures that allow for precision and robustness in the task of morphogenetic decoding of cellular position.
Key results
Taking an information theoretic and systems biology approach, we have addressed the issue of accurate and robust morphogenetic decoding of position. For convenience, we summarise our key results in a point-wise manner:
The main result is that given a noisy morphogen input, cells in a developing tissue can achieve low inference error of their positions by deploying a more elaborate multi-branch multi-tier channel architecture with feedback control. This ensures a separation between the reading of the morphogen input and buffering against noise.
Having a combination of signalling and non-signalling receptors in the channel can significantly improve the performance of cells in their positional decoding.
For a monotonically decaying morphogen input, the signalling and non-signalling receptors exhibit spatially varying profiles with the signalling receptor increasing away from the source and the non-signalling receptor decreasing away from the source. This implies that the non-signalling receptor ‘reads’ the morphogen input, while the signalling receptor buffers against noise.
The performance of the multi-tier multi-branch channels is enhanced by having a feedback from the signalling branch to the non-signalling branch. Along with control on the levels of signalling receptor, this inter-branch feedback provides buffering against extrinsic noise.
Having a multi-tier architecture (cellular compartmentalisation) tempers the effects of intrinsic noise in the channel by stabilising fluctuations of the output at steady-state.
The optimisation shows that the characteristics of the signalling receptor are tightly controlled, whereas those of the non-signalling receptor are flexible. This implies that the signalling receptor is specific whereas the non-signalling receptor is promiscuous.
Analysis of the geometry of the fidelity landscape reveals that the channel parameters corresponding to feedback, binding rates and profile of the signalling receptor are stiff, while the rest of the channel parameters are sloppy (elaborated in Section ‘Geometry of the inference error landscape: implications for control’).
The efficacy of inter-branch feedback control is enabled by having a conjugated state corresponding to a confluence of the signalling and non-signalling branches in a common compartment.
Our analysis demonstrates how local, cell autonomous control can facilitate the optimisation of a tissue-level task, here morphogenetic decoding of cellular position.
Our theoretical predictions are compared with experimental observations from Wg morphogen system of Drosophila wing imaginal disc. We first show that Wg signalling in the experimental system is equivalent to a two-tier two-branch channel. In the experiments, we use signal-to-noise ratio (SNR) of the output as a proxy for robustness of inference. Perturbation of the architecture, i.e. removal of the non-signalling branch, results in reduction of SNR. In a forthcoming manuscript, we will provide a detailed verification of the predicted opposing receptor profiles.
Geometry of the inference error landscape: implications for control
We have explored the local geometry of the fidelity landscape around the optimum, and the global geometry of the low inference error states, by perturbing channel parameters and concentration profiles of the receptors.
The local geometry of the fidelity landscape is studied using the Fisher information metric. This shows that steepest variation in the inference error comes from moving along the feedback parameters while perturbations to other channel parameters produces only marginal changes. Further, we explore the global geometry using the spectrum of the Hessian of the inference error. We find that the topography of the low inference error landscape resembles a ravine or a deep valley, which is shallow along the several sloppy directions and steep along the few stiff directions, the latter being predominantly along the feedback parameters. This dimensional reduction appears to be a recurring feature of such high-dimensional optimisation Transtrum et al., 2015; Yadav et al., 2022.
Such a geometrical approach also provides insight on the differences between the signalling and the non-signalling receptors, which shows up in the extent to which they influence inference errors in the neighbourhood of the optimum. Slight changes in the signalling receptor away from the optimum lead to a sharp increase in inference error while similar changes in the non-signalling receptor do not affect the inference errors significantly. This gives rise to the notion of stiff and sloppy directions of control - with non-signalling receptor placed under sloppy control. In a context with multiple morphogen ligands setting up the different coordinate axes (e.g. Wg, Dpp and Hh in imaginal discs Lin, 2004; Lin and Perrimon, 2000), the non-specific receptor can potentially facilitate cross-talks between them. A sloppy control on non-specific receptor would allow for accommodation of robustness in the outcomes of the different morphogens. This could potentially be tested in experiments.
Future directions
We end our discussion with a list of tasks that we would like to take up in the future. First, the information processing framework established here is very general. Obvious extensions of our models, such as adding more branches, tiers and chemical states, will not lead to qualitatively new features. However, one may alter the objective function – for instance, in the case of short range morphogens like Nodal Liu et al., 2022, only the positions of certain regions (closer to the morphogen source) or cell fate boundaries need to be specified with any precision. To this end, we have analysed another objective function which partitions the tissue into cell identity segments. The qualitative features of the optimised channel architectures remain unaltered. Depending on the developmental context, one might explore other objective functions. This would be a task for a future investigation.
Next, our optimisation study ignores cellular costs due to compartmentalisation, additional receptors and implementation of feedback controls, and thus possible trade-offs between cellular economy and precision in inference. Nevertheless, the observation that addition of extra tiers beyond two provides only marginal improvements to inference, already suggests a balance between precision and cellular costs.
Third, our theoretical result that the optimised surface receptor profiles are either monotonically increasing or decreasing from the morphogen source, suggests that the surface receptor concentrations are spatially correlated across cells. Such correlations could have a mechanochemical basis, either via cell surface tension that could in turn affect internalisation rates Thottacherry et al., 2018 or inter-cellular communication through cell junction proteins Garcia et al., 2018 or from adaptive feedback mechanisms between the output and receptor concentrations Barkai and Leibler, 1997. We emphasize that in the current optimisation scheme, we have allowed the receptor concentrations to vary over the space of all monotonically increasing, decreasing or flat profiles, and have not encoded the positional information in the receptor profiles.
Finally, we have considered the morphogen ligand as an external input to the receiving cells, outside the cellular information processing channel. There is no feedback from the output to the receptors and thus no ‘sculpting’ of the morphogen ligand profile. Morphogen ligand profiles (e.g. Dpp Romanova-Michaelides et al., 2021) are set by the dynamics of morphogen production at the source, diffusion via transcytosis and luminal transport, and degradation via internalisation. These cellular processes are common to both the reading and processing modules in our channel architecture. This would suggest a dynamical coupling and feedback between reading and ligand internalisation, which naturally introduces closed-loop controls on the surface receptors and a concomitant sculpting of the morphogen profile.
Appendix 1
Setting up the dynamical equations
The cellular processes involved in a general two-branch channel architecture, described in Section "Quantitative models for cellular reading and processing" and Figure 4, with any number of tiers are: production of receptors, binding-unbinding of ligand with the receptors, intracellular transport between cellular compartments, conjugation and splitting in the final tier, and degradation of the chemical species involved. The mass action kinetics for these cellular processes are written as the following set of first-order ordinary differential equations (ODEs):
denotes species in the first branch, associated with the signalling receptor, in tier and in bound () or unbound () states. Likewise, denotes the same for the non-signalling receptor. denotes the conjugated species in the final tier. Chemical rates related to the first branch are denoted by while those in the second branch are denoted by . The processes of production, binding, unbinding, internalisation, recycling, conjugation, splitting and degradation are represented by the subscripts , respectively (Table 1). The superscript on degradation rate indicates the index of the tier in which the degradation takes place. The superscript over internalisation rate stands for the tier-index of the species internalised. Superscript over the recycling rate follows that of the corresponding internalisation rate.
As described in Section ‘Quantitative models for cellular reading and processing’, we consider local cell-autonomous control on the total cell surface receptor concentrations, in a manner akin to the open-loop control Stengel, 1994, such that the sum of free and bound signalling receptors is and that of the non-signalling receptor is . This implies that the open-loop control actuated on the production (or secretion) of the cell surface receptors balances the net flux of receptors away from the surface. Consider the dynamics of and in Eq. A, with now an open-loop control realised through on .
At any position , if the total receptor concentration are to stay constant in time (through a chemostat), must satisfy
We emphasize that here we do not consider a closed-loop version of the receptor control. However, it can be realised through an integral control Goodwin et al., 2001, also known as perfect adaptation Barkai and Leibler, 1997. With open-loop-like control on total surface receptor concentrations (Section ‘Quantitative models for cellular reading and processing’), and are replaced by and , respectively. We then drop the subscripts from and to simplify notation.
To keep the notation simple, we avoid explicitly indicating superscripts over chemical rate symbols when presenting the dynamical equations in the main text (Equations 10–17). In the numerical analysis, however, the degradation, internalisation and recycling rates of different tiers are treated separately. The steady-state solutions of the above equations, without feedback controls on chemical rates, are given by,
One-tier channel:
Two-tier channel:
For channels with , the solutions are written in the following recursive form Channel with tiers:
where are in turn evaluated recursively as
The general form of Equations 26–28 holds upon introduction of inter-branch feedbacks (refer Figure 5) and therefore the set of ODEs (Equation 25) can be solved analytically by appropriately replacing the rates under feedback with the Hill-form functions discussed in Section ‘Quantitative models for cellular reading and processing’. Cases with other types of feedbacks need to be solved numerically (Appendix 3).
In writing these dynamical equations, we have made two assumptions, simply as a matter of convenience. One may note that only the ligand bound states of receptors are transported between tiers. This is done with the consideration that residence times of unbound receptors within a compartment, other than the cell surface (first tier), is very small that is receptors in enclosed compartments re-bind the ligand quickly after unbinding due to small volume of the compartment. Further, the open loop control on receptors need not be strictly on surface receptors (tier 1). In principle, the control could be on the production rate of receptors. This does not change the solution in any substantial manner. For instance, in the case of two tiers, the solution would be
and the rest of the species would follow hierarchically as in the case with surface receptor control (see Equation 28). As long as the deviations in the denominator, and remain small compared to ru, and , the steady solutions for the two cases (control on surface receptors versus total receptors) will not differ significantly.
Appendix 2
Heuristics and choice of feedback controls
The figure below illustrates the effect of feedback parameters in Equations 18; 19 discussed under Section ‘Quantitative models for cellular reading and processing’.
Considering the choices of chemical rate under control and the actuating nodes , there are a large number of possible feedbacks for any given channel architecture. However, categorising the different feedbacks according to their effect can reduce the redundancies and aid numerical analysis. The steady-state output of a channel without the feedback controls on chemical rates is instructive in this categorisation. For instance, consider the output for a two-tier two-branch channel with .
This form of the output suggests that feedbacks from (term outside the bracket) on any of the rates in and could cross-correlate with the other signalling species and . Thus, a negative feedback on () would be qualitatively equivalent to a positive feedback on (). Beside helping to reduce the number of CRNs that need to be considered, this observation also allows one to forego the numerical analysis of a CRN with feedback on from that may have non-unique solutions. Similarly, feedbacks from different but related actuators could be interchanged. For example, the feedback on from , i.e. , could be written in terms of the feedback on from , i.e. , by a simple transformation on feedback sensitivity . Note that Equation 31 also indicates the preferred direction of feedbacks, that is with the signalling receptors as actuators.
Appendix 3
A note on numerical methods
Optimisation over the channel parameter vector , belonging to a parameter space , is highly non-convex (see Section ‘Performance of the Channel Architectures’). Moreover, the derivative of with respect to is ill-defined due to finite sampling of input distributions. With these considerations, we chose a derivative-free (gradient independent) optimisation algorithm viz. Pattern Search algorithm Hooke and Jeeves, 1961. The implementation of this algorithm in MATLAB can be found as patternsearch function under the Global Optimization Toolbox.
With this pattern search algorithm, we perform the optimisation routine as described in Section ‘Performance of the Channel Architectures’ in two rounds. First, with a certain number of initial points in the parameter space , we determine the advantageous feedback topologies (CRNs) that is, that give lower inference errors. In the next round, we go through the same optimisation routine for the advantageous feedback topologies determined through the the first round, but now with to converge to the true global minimum. Using parallel computing clusters, the cost of this computation in terms of real running time can be brought down to 12–48 hr.
Ideally, this optimisation ought to be done with extrinsic and intrinsic noises considered together. However, we restrict optimisation to the case of extrinsic noise as the computational cost associated with steady-state solutions of chemical master equations (CMEs), in the case of intrinsic noise, is rather high. Therefore, we evaluate the inference error due to intrinsic noise only of those CRNs that were optimised in the context of extrinsic noise previously. For this, we solve the CMEs using adaptive explicit-implicit tau leaping algorithm Sandmann, 2009 to determine the steady-state outputs and thus inference error. Implementation of this algorithm requires the definition of some numerical parameters that (i) help switch between Gillespie and implicit/explicit tau-leaping algorithms (), (ii) decide the number of reactions when Gillespie algorithm is selected (nb), and (iii) various threshold (). The values chosen for these are , and if implicit tau-leaping algorithm was used in the previous step, , , and .
Appendix 4
Robustness, sensitivity, and trade-offs
Precision in positional inference requires both that the output variance at a given position be small (i.e. the output is robust to the noise in input) and that the mean output at two neighbouring positions be sufficiently different (the output is sensitive to systematic changes in mean input).
With this heuristic understanding of an ideal output, we define two local measures: (i) measures the noise in the output of a cell cohort as compared to the noise in the received input. We define it as the ratio of coefficient of variation in the output to the coefficient of variation in the input for the same cohort of cells.
Thus, robustness of the output increases with decreasing values of ; (ii) measures the sensitivity of the output to a systematic change in the input. We define it as the ratio of relative change in the mean output to the relative change in the mean input for the same cell cohort.
The angular brackets in the above equation denote averaging over cells belonging to the same cohort. Thus, higher sensitivity implies higher values of . Precision in positional inference, in terms of these two local measures, implies simultaneous minimisation of and .
We calculate and at all positions in the various optimised channels. We find that addition of a branch allows for better robustness and alleviates the tension between sensitivity to systematic change in mean input and robustness to input noise (Appendix 4—figure 2).
Furthermore, a one-tier two-branch channel with an inter-branch feedback control (Figure 7a of the main text) shows a preference towards certain concentrations of the two receptors, which provide the cellular output robustness to input noise, i.e. minimise . To maintain lower values of (higher robustness), the signalling receptors must decrease with increasing mean of the input (Appendix 4—figure 3). Increasing the non-signalling receptors as a function of mean input helps separate the mean outputs at neighbouring positions and thus aids in increasing the sensitivity (Appendix 4—figure 4). This is consistent with the receptor profiles in the optimised one-tier two-branch channel (Figure 7b of the main text).
Appendix 5
Input-output relations in a minimal channel
Here, we try to understand why having an appropriate spatial gradient of receptors helps in separating the outputs in nearby cells, thus facilitating accurate positional inference. This is best done in the simplest case of a minimal channel that is with one tier and one branch. This channel ‘reads’ the ligand input through binding of the ligand on only one, signalling receptor.
Appendix 5—figure 1a shows that setting receptor conentrations to decrease with mean ligand input creates an unfavourable scenario. Low receptor availability for higher ligand concentrations ensures a saturation of the output at lower values (blue). On the other hand, higher receptor availability for ligand concentrations is impractical as the output remains low despite increase in potential saturation point (purple). This causes large overlaps of the outputs at neighbouring positions. On the other hand, Appendix 5—figure 1b shows that the output are better separated with receptor profiles that increase with the mean ligand input and thus would enable a better positional inference (Appendix 4).
Appendix 6
Robustness due to inter-branch feedback
Here we describe the logic behind the working of an inter-branch feedback control. Consider the case of a one-tier two-branch channel with the feedback on conjugation rate (Figure 7a of the main text) actuated by the ligand bound state of the signalling receptor in the first tier . The steady-state solution to Equations 10–12 corresponding to this channel with is given by.
Therefore, the output can be expressed in terms of and as follows.
Appendix 6—figure 1 describes the behaviour of the signalling species and as given by Equations 34; 35 where the parameters are taken from the optimised one-tier two-branch channel. increases monotonically with the ligand input (blue curve, Appendix 6—figure 1) and saturates at a value set by the signalling receptor . Meanwhile, the conjugate species has a non-monotonic behaviour (yellow curve, Appendix 6—figure 1): for very low values of input, rises sharply due to absence of the feedback effect from the small values of , but then decreases with further increase in input as the value of the feedback actuator rises. This anti-correlated behaviour of and due to the feedback results in the output being a more stable function of the input for an intermediate range of the input (region around the cusp in the green curve, Appendix 6—figure 1). Modulating the signalling and non-signalling receptors allows for placement of the stability region (cusp) in accordance with the range of ligand input received at any position , thus tempering the noise in the output (see input-output relations in Figure 7d of the main text).
Appendix 7
Heuristic for the influence of feedback action on receptor profiles
The optimised one-tier two-branch channel showed a monotonically increasing profile of the signalling receptor , and a monotonically decreasing profile of the non-signalling receptor (Figure 7b). Here, we provide an understanding for this counter-intuitive result using the forms of the channel output and variation in the output .
Consider the output for a one-tier two-branch channel with inter-branch feedback on the conjugation rate . Equation 37 gives the explicit form for the output of this channel in terms of input ligand concentration , signalling and non-signalling receptor concentrations, feedback strength , feedback sensitivity and binding rates . The last three are small parameters in the optimised channel i.e. . Note that the conjugation and splitting rates are absorbed into , i.e. .
We now look at the output in the limit of high ligand input , and low ligand input . The output forms in these limits are given by Equation 38 and Equation 39.
To analyse the variation of channel output with ligand input , we compute at fixed receptor concentrations .
In the limits of high and low ligand input, the form simplifies to
We provide numerical estimates for the output and the variation in the output, in the high and low ligand input limits, with all possible receptor profile combinations (Figure 3). As shown in the table below (highlighted in bold), it is optimal to have lower levels of the signalling receptor and higher levels of the non-signalling receptor in the high region (close to source). On the other hand, in the low region (far from source), it is optimal to have higher levels of the signalling receptor and lower levels of the non-signalling receptor . These receptor combinations in the two limits on the input (highlighted rows in the table below) help maximally separate the output at the two ends of the tissue while keeping the variation in the output low at both ends. Taking the receptor profiles as monotonic functions of position, this would imply that for a one-tier two-branch channel with an inter-branch feedback, the optimum signalling receptor has a monotonically increasing spatial profile while the optimum non-signalling receptor has a monotonically decreasing spatial profile. This qualitatively explains the result obtained for the optimised one-tier two-branch channel in Section ‘Branched architecture with multiple receptors provides accuracy and robustness to extrinsic noise’, Figure 7.
high | low | |||
---|---|---|---|---|
output θ | variation | output θ | variation | |
low ψ, low Φ | 293.95 | 0.59 | 23.75 | 165.93 |
high ψ, low Φ | 156.53 | 1.67 | 22.52 | 1.12 |
low ψ, high Φ | 902.07 | 0.69 | 67.5 | 543.46 |
high ψ, high Φ | 288.43 | 1.69 | 40.06 | –25.19 |
Appendix 8
Results of optimisation of two-tier two-branch channels
As discussed in Section ‘Branched architecture with multiple receptors provides accuracy and robustness to extrinsic noise’, additional tiers in a two-branch channel only marginally reduced the inference errors due to extrinsic noise when compared to the optimised one-tier two-branch channel (Figure 8). Here, we show that both the receptor profiles and the input-output relations of these two optimised two-tier two-branch channels are qualitatively similar (Appendix 8—figure 1).
Appendix 9
Additional tiers dampen the effects of feedback to provide stability
Stochastic simulations of the chemical reaction network (CRN) corresponding to the optimised one-tier two-branch channel (Figure 9a of the main text) show large fluctuations in the time trajectories of signalling species about its steady-state mean (Figure 9d–f of the main text). increases due to absence of feedback from small values of . However, beyond some amount of increase in , the trajectories veer back towards their mean values. The state of small and increasing can be maintained by replenishment of from binding reaction followed by immediate conjugation with large pools of due to high availability of the non-signalling receptor . Such amplified fluctuations are absent in the two-tier two-branch channel (Figure 9g–i of the main text). Here, we provide heuristics of the differing behaviours in the optimised one-tier and two-tier channels by analysing a simpler set of CRNs (Appendix 9—figure 1) with the essential elements of these channels.
The dynamics for the first, two-species CRN is given by.
and that of the second, 3-species CRN by
with constant playing the role of , thus providing an upper bound to species 1. In both these sets of equations, we consider a feedback k3 such that where k0 represents the reference value of k3 in absence of feedback. Appendix 9—figure 2 shows the phase portrait for the dynamics of the 2-species CRN when with a moderately strong feedback . This is representative of in the optimised one-tier two-branch channel (Figure 9a of the main text). The nullcline (dashed, green curve) acts as a separatrix for the behaviour of this system: if due to fluctuations in species s1 the system () crosses the nullcline from its steady-state (pink point), it sets out on a large trajectory (black line) such that s1 remains close to zero while s2 grows fast until the system turns back towards the steady state. This is similar to the trajectories of and discussed earlier. The non-linearity of the separatrix is due to the feedback from s1 on the rate k3 that couples to s1 in Equation 43. Higher production rates k1 (akin to higher values of ligand in the optmised channel) bring the steady-state closer to the separatrix, making crossing of the separatrix due to fluctuations more likely. Having an additional node in between the actuator species and the controlled rate in the three-species CRN removes the non-linearity in and provides buffering against this effect (Equation 44).
Appendix 10
Two-tier two-branch channel with no feedback
Here we show that the two-tier two-branch channel without any inter-branch feedback control has fundamentally different optimisation characteristics and a poorer positional inference. Crucially, the optimised profiles of both signalling and non-signalling receptors are monotonically decreasing away from the source (Appendix 10—figure 1b, inset).
Appendix 11
Uniform receptor profiles with uncorrelated noise
Note that in arriving at the optimised channel characteristics for a given morphogen profile, we go through all possible monotonic receptor profiles, including flat profiles. The optimised receptor profiles show a spatial gradient. Here, we ask why can’t a flat receptor profile (possibly modified by noise) infer positions accurately from a noisy morphogen gradient? Following the arguments in Appendix 4, we reason that if the morphogen gradient was not corrupted by noise, then flat receptor profiles would have sufficed to infer positions accurately. It is because one wants to discriminate between morphogen concentrations in neighbouring cells in a noisy background that there is a need for a spatial variation in the receptor profiles.
To demonstrate this, we consider uniform spatial profiles, with or without uncorrelated noise, for both the signalling and non-signalling receptors (Appendix 11—figure 1b, c), and optimise the rates and feedback parameters anew (Table 3) to show that this leads to a higher inference error compared to the optimal (Appendix 11—figure 1a). In fact, the inference error in these cases, even with an inter-branch feedback, is only marginally smaller than a channel with no processing of the ligand (black dots in Appendix 11—figure 1a). The inference from flat receptor profiles reflects the noise in morphogen gradient itself. This provides the motivation for choosing monotonically increasing or decreasing profiles for both the signalling and non-signalling receptors (Equations 8; 9). Note that this implicitly assumes spatial correlations in the surface concentrations of receptors across the cells.
Appendix 12
Dependence of inference errors on input characteristics
The general qualitative features of the optimised channels remain invariant to changes in input characteristics. We find the same feedback topology and qualitative results when the one-tier two-branch channel architecture is optimised for input distributions with different decay lengths of mean ligand input (Appendix 12—figure 1). Additionally, lowering noise in the ligand input reduces the inference error of the optimised channel and extends the region of robustness in the tissue (Appendix 12—figure 2).
Appendix 13
Response in inference error due to perturbations in receptor concentrations
The definition of cellular output in a branched architecture involves making a distinction between the signalling and non-signalling receptors. In conjunction, the direction of the inter-branch feedback is from the signalling to the non-signalling branch (Figure 8b of the main text). This gives rise to the possibility of an asymmetric response of the optimised channel to perturbations in the two receptors around the optimal point. Appendix 13—figure 1a shows the average inference error due to the receptor profiles in Appendix 13—figure 1b, c resulting from perturbations in the receptor control parameters A2 and B2 (Table 1, Equations 8; 9) around their optimal values. Each perturbed receptor profile (black curves in Appendix 13—figure 1b, c) leads to a net deviation from the optimal receptor profile (blue curves in Appendix 13—figure 1b, c), which is computed as follows.
where and are the optimum values, and are the perturbations in the receptor control parameters. This is simply the signed area between the optimised and perturbed receptor profiles. Note that the perturbed receptor profiles are such that they maintain the nature of monotnonicity. The local curvature of around the optimal point (, red point in Appendix 13—figure 1a) in the plane has eigenvalues corresponding to the eigenvectors that are nearly parallel to and axes, respectively (white arrows in Appendix 13—figure 1a). This indicates that the inference error is much more sensitive to changes in the signalling receptor than to changes in the non-signalling receptor , implying a stiff direction of control along the former and a sloppy direction of control along the latter.
Appendix 14
Distribution of optimum channel parameters
In Section "Geometry of fidelity landscape", we commented on the nature of low inference error landscape as defined by optimum channel parameters that yield an average inference error . Of the 16 channel parameters, we showed six parameters corresponding to ligand binding rates to the signalling and non-signalling receptors, conjugation and splitting rates, and feedback sensitivity and feedback strength. These parameters were stiff, that is small changes in these parameters led to strong variantions in the inference error. For completeness, here we present the frequency distributions of all the optimum channel parameters that yield an inference error of . While some of these parameters are narrowly distributed about the upper or lower bounds of their permissible ranges, others are more broadly distributed across the range.
Appendix 15
Experimental methods
Fly stocks, endocytic assays, and imaging
Fly stocks used in this study are wildtype w1118, c5-GAL4, CAAX-GFP (Kyoto DGRC - 109823), Port et al., 2014 and UAS-myr-Garz-E740K. Flies were reared on corn flour medium containing sugar, yeast and agar along with antibacterial and antifungal agents. Flies were grown in 25°C incubators with 12 hr light/dark cycles for experiments and otherwise maintained at 18°C or 22°C. Third instar larval wing discs were dissected in Grace’s live imaging media Dye et al., 2017. Dissected discs were incubated with labelled Wg antibodies (Wg-AF-568) on ice for 45 min. Discs were transferred to room temperature for indicated time of pulse, washed with ice cold 1XPBS buffer and fixed using 4% PFA (5 min on ice +15 min at room temperature). Discs were then mounted in imaging chambers and imaged on a FV3000 laser-scanning confocal microscope using a 60 X/1.42 NA oil objective (with acquisition XY pixel dimensions of and Z stacks of size ).
Analysis
The slice-by-slice images of the dome shaped wing disc from the confocal microscope were transformed into images from the outer most surface of the wing disc. This allowed us to compare the intensities of Wg from similar apico-basal height of cells across the dome-shaped epithelial. Wg production plane (Plane Q) and a perpendicular plane (Plane R) were defined. Intensity of different probes in curved tissues is affected by sample geometry and imaging depth. A data-based correction matrix was constructed using a uniform marker – CAAX GFP expressed uniformly under a ubiquitin promoter. Intensity for each disc, for each probe, was corrected using this data-based correction matrix. Detailed experimental and analysis methodology for extracting gradients from a curved tissue is described in Prabhakara et al., 2022. For computing the coefficient of variation, 18 bins parallel to Plane R were defined and intensities at different distances from the production plane was computed (schematic in Figure 14c of the main text). Mean and standard deviation (SD) for each disc across multiple parallel bins at different distances from the producing cells were used to estimate the coefficient of variation.
Computation of CV was done using intensity collected from the apical region of cells (20% of the entire length of cells). Normalized distance from the production plane is represented in all plots. Here, we have considered dorsal and ventral gradients to be equivalent.
Data availability
Figure 14 and Figure 14—figure supplement 1 contain source data in xlsx format. Modelling code and numerical data is available on Gitlab (https://gitlab.com/eoskrish/morphogenetic-decoding; copy archived at swh:1:rev:59c184aaa46c5e769a95ea39b48e911d0fc4fd5b).
References
-
Dealing with noise: the challenge of buffering biological variabilityCurrent Opinion in Systems Biology 1:69–74.https://doi.org/10.1016/j.coisb.2016.12.011
-
Robust generation and decoding of morphogen gradientsCold Spring Harbor Perspectives in Biology 1:a001990.https://doi.org/10.1101/cshperspect.a001990
-
Endocytosis and signaling during developmentCold Spring Harbor Perspectives in Biology 6:a017020.https://doi.org/10.1101/cshperspect.a017020
-
Morphogen transport in epitheliaPhysical Review. E, Statistical, Nonlinear, and Soft Matter Physics 75:011901.https://doi.org/10.1103/PhysRevE.75.011901
-
Endocytosis, signaling, and beyondCold Spring Harbor Perspectives in Biology 6:a016865.https://doi.org/10.1101/cshperspect.a016865
-
Compartmentalization of GPCR signalling controls unique cellular responsesBiochemical Society Transactions 44:562–567.https://doi.org/10.1042/BST20150236
-
Ras/Mapk signaling from endomembranesMolecular Oncology 3:297–307.https://doi.org/10.1016/j.molonc.2009.06.004
-
Cell-Cell junctions organize structural and signaling networksCold Spring Harbor Perspectives in Biology 10:a029181.https://doi.org/10.1101/cshperspect.a029181
-
Stochastic thermodynamics of bipartite systems: transfer entropy inequalities and a maxwell’s demon interpretationJournal of Statistical Mechanics 1:E2016.https://doi.org/10.1088/1742-5468/2014/02/P02016
-
`` direct search’’ solution of numerical and statistical problemsJournal of the ACM 8:212–229.https://doi.org/10.1145/321062.321069
-
Novel regulation of Wnt signaling at the proximal membrane levelTrends in Biochemical Sciences 41:773–783.https://doi.org/10.1016/j.tibs.2016.06.003
-
Noise, delays, robustness, canalization and all thatCurrent Opinion in Genetics & Development 14:440–445.https://doi.org/10.1016/j.gde.2004.06.001
-
The measure of success: constraints, objectives, and tradeoffs in morphogen-mediated patterningCold Spring Harbor Perspectives in Biology 1:a002022.https://doi.org/10.1101/cshperspect.a002022
-
BookReceptors: Models for Binding, Trafficking, and SignalingOxford University Press on Demand.
-
BookInformation Theory, Inference and Learning AlgorithmsCambridge university press.
-
Promiscuity and specificity in BMP receptor activationFEBS Letters 586:1846–1859.https://doi.org/10.1016/j.febslet.2012.02.043
-
Ras proteins: paradigms for compartmentalised and isoform-specific signallingCellular and Molecular Life Sciences 64:2575–2589.https://doi.org/10.1007/s00018-007-7133-8
-
Compartmentalized signalling: Ras proteins and signalling nanoclustersThe FEBS Journal 276:1817–1825.https://doi.org/10.1111/j.1742-4658.2009.06928.x
-
Morphogen gradients: from generation to interpretationAnnual Review of Cell and Developmental Biology 27:377–407.https://doi.org/10.1146/annurev-cellbio-092910-154148
-
BookModeling Biomolecular Networks: An Introduction to Systems BiologyOxford University Press.
-
Morphogens, their identification and regulationDevelopment 131:703–712.https://doi.org/10.1242/dev.01043
-
The many bits of positional informationDevelopment 148:dev176065.https://doi.org/10.1242/dev.176065
-
Positional information and pattern formationCurrent Topics in Developmental Biology 117:597–608.https://doi.org/10.1016/bs.ctdb.2015.11.008
-
Shaping morphogen gradients by proteoglycansCold Spring Harbor Perspectives in Biology 1:a002493.https://doi.org/10.1101/cshperspect.a002493
Article and author information
Author details
Funding
Department of Atomic Energy, Government of India (RTI4006)
- Krishnan S Iyer
- Chaitra Prabhakara
- Satyajit Mayor
- Madan Rao
Simons Foundation (287975)
- Madan Rao
DBT-Wellcome Trust India Alliance (IA/M/15/1/502018)
- Satyajit Mayor
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank Thomas Lecuit for insights during the course of investigation. We thank past and present members of the Simons Centre, especially Alkesh Yadav, Amit Kumar, Archishman Raju, Kabir Husain, Mukund Thattai and Sandeep Krishna, for critical inputs. In particular, we thank Archishman Raju for useful discussions on the geometric analysis of the fidelity landscape. We acknowledge support from the Department of Atomic Energy (India), under project no. RTI4006, and the Simons Foundation (Grant No. 287975). MR and SM acknowledge DST (India) for JC Bose Fellowships. SM acknowledges a Margadarshi Fellowship of DBT-Wellcome Trust India alliance (IA/M/15/1/502018).
Copyright
© 2023, Iyer et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,129
- views
-
- 143
- downloads
-
- 4
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Developmental Biology
- Stem Cells and Regenerative Medicine
Deficient Anterior pituitary with common Variable Immune Deficiency (DAVID) syndrome results from NFKB2 heterozygous mutations, causing adrenocorticotropic hormone deficiency (ACTHD) and primary hypogammaglobulinemia. While NFKB signaling plays a crucial role in the immune system, its connection to endocrine symptoms is unclear. We established a human disease model to investigate the role of NFKB2 in pituitary development by creating pituitary organoids from CRISPR/Cas9-edited human induced pluripotent stem cells (hiPSCs). Introducing homozygous TBX19K146R/K146R missense pathogenic variant in hiPSC, an allele found in congenital isolated ACTHD, led to a strong reduction of corticotrophs number in pituitary organoids. Then, we characterized the development of organoids harboring NFKB2D865G/D865G mutations found in DAVID patients. NFKB2D865G/D865G mutation acted at different levels of development with mutant organoids displaying changes in the expression of genes involved on pituitary progenitor generation (HESX1, PITX1, LHX3), hypothalamic secreted factors (BMP4, FGF8, FGF10), epithelial-to-mesenchymal transition, lineage precursors development (TBX19, POU1F1) and corticotrophs terminal differentiation (PCSK1, POMC), and showed drastic reduction in the number of corticotrophs. Our results provide strong evidence for the direct role of NFKB2 mutations in the endocrine phenotype observed in patients leading to a new classification of a NFKB2 variant of previously unknown clinical significance as pathogenic in pituitary development.
-
- Developmental Biology
- Genetics and Genomics
We present evidence implicating the BAF (BRG1/BRM Associated Factor) chromatin remodeler in meiotic sex chromosome inactivation (MSCI). By immunofluorescence (IF), the putative BAF DNA binding subunit, ARID1A (AT-rich Interaction Domain 1 a), appeared enriched on the male sex chromosomes during diplonema of meiosis I. Germ cells showing a Cre-induced loss of ARID1A arrested in pachynema and failed to repress sex-linked genes, indicating a defective MSCI. Mutant sex chromosomes displayed an abnormal presence of elongating RNA polymerase II coupled with an overall increase in chromatin accessibility detectable by ATAC-seq. We identified a role for ARID1A in promoting the preferential enrichment of the histone variant, H3.3, on the sex chromosomes, a known hallmark of MSCI. Without ARID1A, the sex chromosomes appeared depleted of H3.3 at levels resembling autosomes. Higher resolution analyses by CUT&RUN revealed shifts in sex-linked H3.3 associations from discrete intergenic sites and broader gene-body domains to promoters in response to the loss of ARID1A. Several sex-linked sites displayed ectopic H3.3 occupancy that did not co-localize with DMC1 (DNA meiotic recombinase 1). This observation suggests a requirement for ARID1A in DMC1 localization to the asynapsed sex chromatids. We conclude that ARID1A-directed H3.3 localization influences meiotic sex chromosome gene regulation and DNA repair.