# Abstract

The successful integration of engineered gene circuits into host cells remains a significant challenge in synthetic biology due to circuit-host interactions, such as growth feedback, where the circuit influences cell growth and vice versa. Understanding the dynamics of circuit failures and identifying topologies resilient to growth feedback are crucial for both fundamental and applied research. Utilizing transcriptional regulation circuits with adaptation as a paradigm, we systematically study 435 distinct topological structures and uncover six categories of failures. Three dynamical mechanisms of circuit failures are identified: continuous deformation of the response curve, strengthened or induced oscillations, and sudden switching to coexisting attractors. Our extensive computations also uncover a scaling law between a circuit robustness measure and the strength of growth feedback. Despite the negative effects of growth feedback on the majority of circuit topologies, we identify a few circuits that maintain optimal performance as designed, a feature important for applications.

**eLife assessment**

This **valuable** study focuses on the impact of growth feedback on the performance of artificial gene circuits capable of achieving adaptive responses, an **important** problem in synthetic biology. Through **solid** computational analysis, the authors identify specific failure mechanisms, as well as core topologies associated with robust performance based on systematic analysis of over four hundred circuit topologies. The results will be of interest to those working on engineering gene circuits for diverse applications.

# Introduction

In biomedical science and engineering, artificially designed gene circuits are anticipated to play an ever-increasing role in disease diagnosis and therapy (** Riglar and Silver, 2018**;

**;**

*Sedighi et al., 2019***). Gene circuits also show great potential in various applications such as microbiome modulation (**

*Xia et al., 2019***;**

*Foo et al., 2017***) and biological containment (**

*Lee et al., 2018***;**

*Gomaa et al., 2014***). While most gene circuits are designed to function after they are inserted or embedded into host cells, the interactions between the circuit and the host environment are generally extremely complex and can lead to undesired effects that were not present in the original, isolated circuit (**

*Caliando and Voigt, 2015***;**

*Tan et al., 2009***;**

*Ceroni et al., 2015***;**

*Borkowski et al., 2016***;**

*Ceroni et al., 2018***a,b;**

*Darlington et al., 2018***;**

*Gouda et al., 2019***,**

*Zhang et al., 2021***;**

*2020***). Understanding the interactions and identifying the circuit topological structures that can withstand the interactions and thrive in the host are thus of fundamental importance, requiring interdisciplinary efforts among systems and synthetic biology, metabolic engineering, nonlinear dynamics and complex systems.**

*Melendez-Alvarez et al., 2021*Typical circuit-host interactions include metabolic burden, cell growth, and resource relocation or competition, among which growth feedback is the most common type of circuit-host interaction between the circuit gene expressions and cell growth. More specifically, a synthetic gene circuit embedded in a host cell possesses an intrinsic coupling mechanism: the circuit affects cell growth and the growth in turn modifies the gene expressions in the circuit (** Klumpp et al., 2009**;

**;**

*Klumpp and Hwa, 2014***;**

*Boo et al., 2019***) - the so-called growth feedback. Studies have shown that the growth-mediated feedback can endow a synthetic gene circuit with various emergent properties, such as innate growth bistability (**

*Scott et al., 2010***). For example, a non-cooperative positive autoregulation system, when coupled with growth feedback, gains increased effective cooperativity, thereby resulting in bistability (**

*Deris et al., 2013***;**

*Tan et al., 2009***). In another example, toxin cooperativity can be induced in multiple toxin-antitoxin systems by growth-mediated feedback (**

*Nevozhay et al., 2012***). The number of steady states in one gene circuit also depends on growth feedback and resource availability (**

*Feng et al., 2014***;**

*McBride and Del Vecchio, 2020***). In general, growth feedback acts to hamper the forward engineering of the circuit functions by introducing modes of nonmodularity and reducing the predictability of the circuit components in an in-vivo context. While various phenomena caused by growth feedback were studied with desirable or undesirable effects on the functions of the gene circuits, a systemic picture is lacking on what effects growth feedback can have on the gene circuits, including failures.**

*Melendez-Alvarez and Tian, 2022*A recent study has revealed that growth feedback can have drastically different effects on congruent circuits with distinct topologies (** Zhang et al., 2020**). In particular, the dynamical behaviors of two bistable synthetic memory circuits were studied: a self-activation switch incorporating positive autoregulation and a toggle switch incorporating double-negative regulatory motifs. It was found that growth feedback impacts both circuits but with quite different manifestations. For the toggle switch, memory can be retained and the circuit tends to be refractory towards growth feedback. However, for the self-activation switch, growth feedback leads to memory loss. While these results indicate that the circuit topology can play a significant role in the circuit functions when growth feedback is present, they were obtained through two specific circuit topologies. Since a particular function of the gene circuit can often be achieved by a finite set of core topologies, it is of fundamental interest to identify the most robust topologies in response to growth feedback. The so-identified optimal topologies can then be used to construct synthetic gene circuits capable of maintaining the essential functions to meet the design goals under the fluctuating growth conditions of the host cell. A systematic study of the interplay between the gene circuit topology and growth feedback is needed.

Adaptation is an important and widely studied functionality of gene circuits, which is defined as the ability of the system to respond to environmental changes and to return to the basal or near-basal state after some time (** Knox et al., 1986**;

**;**

*Tyson et al., 2003***;**

*Friedlander and Brenner, 2009***). Previously, it was found that certain circuits possess biochemical adaptation (**

*Ferrell Jr, 2016***) if they contain at least one of the two architectural classes: an incoherent feed-forward loop with a proportion node and a negative feedback loop with a buffering node. A number of synthetic gene circuits were proposed or constructed to achieve adaptation (**

*Ma et al., 2009***;**

*Kim et al., 2014***;**

*Briat et al., 2016***). Quite recently, a design principle for circuits with four genes was uncovered for simultaneously achieving noise attenuation and adaptation: the circuit must have a sequential assembly structure (**

*Aoki et al., 2019***). However, these existing adaptation studies did not include any growth feedback mechanism.**

*Qiao et al., 2019*In this paper, we conduct a comprehensive computational study to uncover and understand the effects of growth feedback on the gene circuits. Specifically, we focus on a type of transcriptional regulation circuit designed for adaptation. There are 425 possible circuit structures (identified by previous research (** Shi et al., 2017**)), and we study all of them to simulate and test their response under different levels of growth feedback. Altogether, 2 × 10

^{5}sets of circuit parameters are randomly sampled for each structure. Our results reveal a vast number of cases where growth feedback has a detrimental effect on circuit function (1.3 × 10

^{5}cases in total) with varying response curves and dynamical behaviors. To gain a more intuitive overall picture, we classify these cases into several distinct categories based on the circuits’ dynamic behavior. We then systemically summarize the dynamical mechanism behind these growth-induced circuit malfunctions. To quantify circuit adaptation in the presence of growth feedback, we propose a robustness measure that enables us to identify an optimal group of circuits that exhibit a high level of robustness against growth feedback, making them particularly promising for real-world implementation. The motifs associated with this optimal group are found through machine learning. We also obtain a scaling law governing the dependence of this measure on the level of growth feedback and provide a mathematical analysis to gain insights into the underpinnings of the scaling law. The take-home message is that, in spite of the negative effects of growth feedback in the majority of the circuits, there exists a small set of circuits that are still able to deliver optimal performance as designed, which is promising for real-world implementation.

# Results

## A systemic search of functional failures due to growth feedback

Adaptation is referred to as the ability of a gene circuit to respond to changes in input and then to return to the pre-stimulus output level, even when the input change persists (** Ma et al., 2009**). More precisely, with an input signal switched from a lower value

*I*

_{1}to a higher value

*I*

_{2}, as demonstrated in Fig. 1(b), a circuit with functional adaptation should have the following response-curve criteria: (i) precision - the final state

*O*

_{2}should be close to the initial state

*O*

_{1}, (ii) sensitivity - there should be a relatively high

*O*

_{peak}in response to the change in the input, and (iii) the system should reach equilibrium within a reasonable relaxation time. A three-node gene circuit can achieve adaptation (

**), with one node receiving the input (node A), another node realizing various regulatory roles (node B), and a third node outputting the response (node C). A representative circuit topology is shown inside the red dashed box in Fig. 1(a). We restrict our study of the class of transcriptional regulatory networks (TRNs) with the AND logic.**

*Ma et al., 2009*In our work, we use a parameter *k*_{g} to control the strength of growth feedback, which is a parameter determining the maximal growth rate of the host cells, as mathematically explained in Model description. With all the other parameters fixed, a larger *k*_{g} implies a faster cell growth rate and a stronger impact of growth feedback. Previous research identified 425 different three-node TRN network topologies that can achieve adaptation in the absence of growth feedback (** Shi et al., 2017**), providing the base of our computational study. These topologies can be classified into two families based on the mechanism they rely on to achieve adaptation: networks with a negative feedback loop (NFBL) and networks with an incoherent feed-forward loop (IFFL) (

**). To investigate the effect of growth feedback on these circuits, we systematically simulate the response of the 425 network topologies under a switch in the input signal. A three-node gene circuit subject to growth feedback has a large number of parameters, which determine the properties of the regulation links within the circuit and the circuit dynamics. For each topology, we randomly sample 2 × 10**

*Shi et al., 2017*^{5}trials of circuit parameters. Altogether, our study involves analyzing approximately 8.5 × 10

^{7}different circuits. We find that among these trials, only about 1.5 × 10

^{5}meet the adaptation criterion in the absence of growth feedback. For these functional trials, we vary the growth feedback parameter

*k*

_{g}with a series of values, and find that the majority of trials (1.3 × 10

^{5}trials, about 87%) lose their adaptation in the interval of

*k*

_{g}∈ (0, 1), while only 13% of trials remain functional at

*k*

_{g}= 1.0.

## A systemic classification of functional failures due to growth feedback

An essential step towards understanding the detrimental or even destructive effects of growth feedback on circuit functioning is to identify the distinct failure scenarios. Our extensive simulations have yielded a comprehensive picture of these scenarios, as shown in Fig. 2. Overall, we have identified six failure scenarios that encompass more than 99.6% of the 1.3 × 10^{5} failing cases we collect. The first level of classification distinguishes between failures that occur continuously or abruptly as the growth-feedback strength *k*_{g} increases. In a continuous failure, the response curve deforms continuously as *k*_{g} increases, as exemplified in Figs. 2(a-c). In an abrupt failure, the response curve exhibits a sudden change as *k*_{g} increases through a critical value, as illustrated in Figs. 2(d-f). At the next classification level, we further divide the failures into three types of continuous failures and three types of abrupt failures.

The three types of continuous failures, denoted as types I-III as illustrated in Figs. 2(a-c), are determined according to the specific quantitative criteria in the response curve that the circuits violate. Type-I continuous failures, as shown in Fig.2(a), are associated with the violation of the precision criterion. A circuit is deemed as precise if a change in the input signal (e.g., from *I*_{1} to *I*_{2}) generates two opposite dynamical effects in the circuit that cancel each other out after a transient and return the final output to the original state, i.e. *O*_{2} ≈ *O*_{1}. For example, in some networks [e.g., the network in Fig. 1(a)], an increase in the input signal *I* will result in an increase in the concentration of gene *A* and a reduction in the concentration of gene *B*. As both genes regulate the output gene *C* with the respective activation links, for proper system parameter values, the two effects will cancel each other out, resulting in *O*_{2} ≈ *O*_{1}. Type-I continuous failures constitute the largest failure category among all possible circuit topologies, suggesting that the exact cancellation is fragile and the loss of precision is the most common dynamical mechanism behind growth-feedback-induced failures.

Our simulations reveal that an exact cancellation between the two opposite sources at *k*_{g} = 0 prevents an exact cancellation at any other values of *k*_{g}. That is, the set of circuit parameter values leading to perfect precision, in general, depends on the value of *k*_{g} (see Appendix 1 for more details). The implication is that, for fixed circuit parameter values, achieving high precision under growth feedback (*k*_{g} > 0) is difficult if the circuit is precise in the absence of growth feedback (*k*_{g} = 0).

Type-II continuous failures are characterized by a continuous change in the peak of the response curve, denoted as *O*_{peak}, as *k*_{g} increases, eventually falling below a threshold, as shown in Fig. 2(b). This type of failure can make it challenging for downstream circuits to detect the peak signal, hindering information transmission in the larger system. Type-II failures are the second most common type of failure observed in our simulations. The occurrence of a high peak in the response curve requires a significant transient deviation from the final equilibrium point. In the presence of growth feedback, the transient behavior changes, which can further alter the peak height *O*_{peak}.

Type-III and type-IV failures arise due to growth-feedback-induced oscillations, while type-V and type-VI failures are caused by bistability or bifurcations, with fold bifurcations being the most common type. To provide a more detailed understanding of these different failure scenarios, we discuss the two mechanisms, respectively, in the sections of Growth-feedback induced oscillations and Bistability and bifurcations below.

### Box 1.

Three classes of growth-induced failures

All the failures we observed can be categorized into the following three general classes, applicable to both the three-gene and four-gene circuits we tested:

Continuous Deformation of the Response Curve Typically, we require a specific range of response curve shapes for a gene circuit, such as a peak in the output with a minimum height or duration. In a failure caused by continuous deformation, the growth feedback prompts a gradual change that crosses the boundary of these criteria for response curve shapes.

Growth-Induced or Growth-Strengthened Oscillations Growth feedback can induce oscillations in a circuit through various types of bifurcations or amplify existing oscillatory behavior with longer relaxation times or larger amplitudes. A circuit experiencing growthinduced or growth-strengthened oscillations cannot reach a relatively steady state (an equilibrium or relatively small oscillations) within a finite time or reasonable relaxation period.

Growth-Induced Switching Among Coexisting Attractors When coexisting attractors are present in the circuit dynamics, such as bistability or multistability, the circuit typically only functions with one of the attractors. In other words, the circuit is functional locally in its phase space rather than globally. Strengthened growth can push the system across the boundary of different attracting basins in the circuit phase space, causing the circuit to lose its desired functionality by switching from a functional basin to a malfunctioning basin.

## Growth-feedback induced oscillations

As demonstrated by the light green and yellow slices of the pie chart in Fig. 2, a considerable portion (17%) of the circuit failures are caused by growth-feedback-induced oscillations. Growth feedback perturbations can easily change the system from the adaptive domain to the oscillation domain in these cases. Our program classifies oscillation-mediated failures into two categories: continuous (type-III) and discontinuous failures (type-IV). Type-III failures are the results of either (i) a gradual increase in the oscillation amplitude, or (ii) a gradual increase in the transient lifetime of damped oscillations. In the first case, an isolated circuit has already exhibited oscillations with small amplitudes in its gene concentrations with relatively weak growth feedback. As the feedback is strengthened with a larger value of *k*_{g}, the oscillations are intensified with a larger amplitude, leading to a circuit failure. In the second case, there is damped oscillation for small *k*_{g} with a relatively short transient time before approaching an equilibrium. After strengthening the growth feedback, the damping weakens and the oscillation’s amplitude cannot be reduced to the threshold within the time limit, as exemplified in Fig. 2(c).

The second category of growth-feedback-induced oscillation is type-IV, where oscillations emerge suddenly as the growth-feedback strength increases through a critical point. The sudden emergence of oscillations can be caused by a bifurcation or a transition into a basin of a limit-cycle attractor. A random sampling of the type-IV failure cases reveals that most of them are caused by either a saddle-node bifurcation of cycles (** Strogatz, 2018**) or an infinite-period bifurcation (

**gatz, 2018). In the former case, a pair of stable and unstable limit cycles suddenly emerge together. In the latter case, when observed from the opposite direction (i.e., with a decreasing**

*Stro-**k*

_{g}crossing the threshold), the oscillation in the system spends a longer and longer time around a node on the limit cycle. This node finally becomes a stable fixed point at the bifurcation point, and the oscillation period approaches infinity. One example of type-IV oscillation-mediated failures caused by an infinite-period bifurcation is shown in Fig. 2(d). In our simulations, most of the cases where there are saddle-node bifurcations of cycles are categorized as type-III failures because, prior to the bifurcation point, the system can be oscillating near the ghost (

**) cycle for a long time exceeding the criterion for relaxation time, though that ghost cycle is not an attractor but only a transient in the system.**

*Strogatz, 2018*Our results indicate that for various circuit topologies, the dynamic mechanisms leading to failures can differ, resulting in significantly different distributions of failure types among different networks. For instance, the fractions of failures caused by growth-induced oscillations can vary dramatically among all the topologies, as demonstrated in Fig. 3(a), where each data point represents a specific network topology. The fraction of failures caused by growth-induced oscillations can range from approximately zero to about 80%! A particular example of two different networks is presented in Figs. 3(b1) and 3(b2), both of which share the same minimal topology required for adaptation (** Shi et al**.,

**) - the circuit’s core function. Despite differing by only one link, the proportions of failures with unique mechanisms are quite distinct, as illustrated in Figs. 3(c1) and 3(c2). Notably, for the network in Fig. 3(b1), almost half of the failures result from oscillations, while hardly any oscillation-mediated failures occur for the network in Fig. 3(b2). The explanation is that, although the difference lies in only a single link, this link determines whether an oscillation-correlated motif exists within the network. Previously, three classes of motifs capable of supporting persistent oscillations were discussed (**

*2017***), including the “delayed negative-feedback loop” featured in Fig. 3(b1).**

*Novák and Tyson, 2008*Generally, the circuit dynamics depend sensitively on the structure, but oscillations specifically require a negative feedback loop with time delay (** Novák and Tyson, 2008**). Since there are no explicit time-delayed terms in the dynamical equations in our model, one of the two types of motifs - an intermediate node in the path of the negative feedback loop or an additional positive feedback loop - is necessary to induce time delay (

**). For network topologies with a high ratio of functional failures caused by oscillations, both motifs are observed, especially the former type. For the network in Fig. 3(b1), the three links: A → C, C ⊣ B, and B → A, together constitute a negative feedback loop, making the circuit more susceptible to oscillatory behaviors. For the circuit in Fig. 3(b2), no such negative feedback loop exists. Figure 3(a) summarizes the total number of failed trials and the ratio of oscillation-induced failures for each network topology. The network topologies that contain one of the motifs for oscillation as discussed in**

*Novák and Tyson, 2008***(**

*Novák and Tyson***) are marked in red, while the networks that do not consist of any of them are marked in blue. Note that all the networks with relatively high ratios of oscillation-induced failures (e.g., ratio > 0.2) consist of oscillation-correlated motifs. Details about these oscillation-correlated motifs are discussed in Appendix 2.**

*2008*We conclude that, for a network with an oscillation-correlated motif, even if it is functional at some parameter values, the potential of oscillatory behaviors can be triggered by growth feedback. As a result, networks without these motifs can be safer choices to avoid too many failures cases due to oscillations. Note that this relationship is not deterministic. As shown in Fig. 3, even the networks represented by blue dots that have no oscillation-correlated motifs can still have oscillation-induced failures (with small ratios). The complexity of the scenario makes it challenging to find general and relatively simple rules that connect circuit topology to the circuit’s robustness.

## Bistability and bifurcations

In this section, we describe the dynamical mechanisms behind type-V and type-VI failures, which in total take up about 14% of all the circuit failures. These failures are abrupt, meaning that the response curve undergoes an abrupt change at a certain critical value of *k*_{g} from a desirable curve of adaptation. Type-V and type-VI failures correspond to an abrupt change in *O*_{1} and *O*_{2}, respectively. Both types of failures are closely related to bistability and bifurcations.

Bistability and multistability are common phenomena in nonlinear systems. Bistability refers to the situations where two stable attractors coexist in the phase space simultaneously. Multistability describes a similar coexisting phenomenon of attractors, but with more than two attractors. With bistability or multistability in the target dynamical system, the system trajectory may end up in any one of these stable attractors, depending on the initial state of the system evolution. The entire phase space can thus be separated into two or more basins of attraction. Each basin of attraction corresponds to an attractor and consists of all the initial states that eventually lead the system to the attractor. The boundary boundaries separate two different basins of attraction. A close pair of initial states but at different sides of a basin boundary lead the system to different attractors.

In our simulations, we observe bistability in many circuits. While multistability has also been observed, it is relatively rare. We thus focus on bistability. It is highly unusual for both attracting basins to exhibit the desired functional behavior simultaneously. This is because they are located in different regions of the system phase space, and accommodating both would impose overly stringent constraints on the circuit. Having one basin functional is already difficult enough with a random sampling of circuit parameters. As a result, functional adaptation is typically found in only one of the basins, with adaptation being lost in the other, and the circuit is functional only locally in its phase space, rather than on a global scale. A drifting system parameter, such as *k*_{g}, can alter the dynamics of the gene circuit. In a situation with bistability, such a change in the system dynamics can modify the shape and position of the basin of attraction and the basin boundary. Consider an initial state close to a basin boundary. With the deformation caused by a drifting parameter, the boundary may shift across the initial state, leading to a sudden switching of the system’s final attractor. If the basin before the parameter change is functional and the basin after is not, this leads to a growth-feedback-induced failure. The crossing of the basin boundary dictates that the system’s final state will abruptly change from one attractor to another. This type of failure can be classified as a switching type of failure.

An example of bistability-related failures is shown in Fig. 2(e), where in the upper panel, the circuit enters into the functional region after an initial transient. In the lower panel, the circuit enters into another region that does not have adaptability, and the circuit does not respond to the switching of the input signal. Figures 4(a) and 4(b) illustrate how the basin structure of the circuit changes significantly with different values of *k*_{g}. The functional basin is in yellow and it shrinks greatly with an increasing *k*_{g}. Note that the phase space is four-dimensional, so only a two-dimensional slice is shown. For a bistability/multistability-induced type-V failure where *O*_{1} is switched, the boundary of the functional basin crosses the initial state. For a type-VI failure, the simultaneous movement of both *O*_{1} and the basin boundary under input *I*_{2} results in *O*_{1} crossing the boundary.

Bifurcations also play an important role in many type-V and type-VI failures. An example of a failure caused by a fold bifurcation is shown in Fig. 2(f) and the corresponding bifurcation diagram is shown in Fig. 4(c), where a non-functional fixed point appears through a fold bifurcation as *k*_{g} crosses a critical value. Bifurcation-induced abrupt failures differ from those caused by bistability/multistability, but they can be related since significant changes in the basin structures often occur near a bifurcation point.

## Circuit robustness and optimal topology

To quantify a circuit’s robustness against growth feedback, we introduce two metrics: *Q*-value and *R*-value. We track the number of remaining functional trials for each network for various *k*_{g} values (starting from *k*_{g} = 0), denoted as *Q*(*k*_{g}). This measure extends the concept of *Q*-values in ** Ma et al. (2009**) by accommodating non-zero values of

*k*

_{g}. To characterize the circuit robustness, we define the survival ratio

*R*(

*k*

_{g}) as

*R*(

*k*

_{g}=

*k*) =

*Q*(

*k*

_{g}=

*k*)/

*Q*(

*k*

_{g}= 0). This ratio represents the fraction of random circuit realizations that maintain functionality under growth feedback with a strength of

*k*

_{g}.

Note that each *Q*(*k*_{g}) or *R*(*k*_{g}) is defined for a specific network topology in a suitable parameter space. A high value of *R*(*k*_{g}) indicates that a large fraction of the randomly sampled circuit parameters is functional despite cell growth with any strength no larger than *k*_{g}, indicating that the topology is more robust against growth feedback. Because of the detrimental effects of growth feedback, *R*(*k*_{g}) decreases monotonically with respect to *k*_{g}.

To justify the utility of *R*(*k*_{g}), we test the circuit topologies employed in a previous work (** Zhang et al., 2020**), where two relatively simple network topologies were used for a comparison study in terms of their ability to resist growth feedback and remain functional. Our evaluation of

*R*(

*k*

_{g}) for the two topologies has yielded results that are consistent with those in

**. (**

*Zhang et al***), as discussed in Appendix 3. To illustrate our results in a concrete way, we set**

*2020**k*

_{g}= 0.6 and calculate the ratio

*R*(

*k*

_{g}= 0.6) for different network topologies.

Our computations have revealed a set of eight circuit topologies with optimal performance as characterized by high values of both *Q*(*k*_{g} = 0) and *R*(*k*_{g}), as indicated by the set of orange points in Fig. 5(a). The optimal circuits form a family as their topologies exhibit a high level of similarity with one other. In particular, all eight circuits in this family share a common set of links (motif), as shown in Fig. 5(b). The combination of these common links is one of the minimal topologies with perfect adaptation in a three regulatory logic (** Shi et al., 2017**) and is critical for the circuit to be functionally adaptable. The only difference among the circuits in this family is the links from node C. While an inhibition link from node C can be important to achieving a value of

*R*(

*k*

_{g}), as discussed below, the eight optimal circuit topologies do not contain any such inhibition link from node C. (The role of this particular link will be further studied in our analysis of the results in Fig. 6.) This also explains why the family has eight members, as follows. Each link from C has two options: either the link does not appear, or it appears as an activation link. As there are three possible links from C (C to A, C to B, and C to C), there are altogether eight (2

^{3}) topologies within this optimal family, according to the simulation results in Fig. 5.

How can we quickly determine if a three-gene regulatory network with a given topology can be robust against growth feedback? Is there any structural feature of the circuit that can be used to estimate if a high value of *R*(*k*_{g}) can be achieved? To gain insights, we observe that the histogram in Fig. 6(b) has three peaks about low, moderate, and relatively high values of *R*, respectively. Computations reveal certain “shared topological similarity” (or motif) within each peak. Thus, each peak corresponds to a group of similar network topologies that simultaneously have a similar level of *R*. This observation suggests a correlation between the network topology and robustness against growth feedback. For convenience, we refer to these three groups of networks by the colors presented in Fig. 6. For instance, the group with the highest *R* (the green triangles in Fig. 6(a)(d)) is called the green group, and the group with the lowest *R* (the red diamonds in Fig. 6(a)(d)) is the red group.

To better distinguish the three groups, we introduce two binary variables, *B*_{1} and *B*_{2}. For each network, *B*_{1} = 1 if the network contains the motif in Fig. 6(c), and *B*_{1} = 0 otherwise. Then, for each network, an additional binary variable is set to be *B*_{2} = 1 if there is an inhibition link from the output node C, and *B*_{2} = 0 otherwise. We find that a linear combination of the two binary variables, *B*_{S} = *B*_{1} − *B*_{2}, can characterize the circuit topology and robustness against growth feedback. In particular, the three possible cases *B*_{S} = 0, 1, or -1 correspond to the three peaks in Fig. 6(b). This result suggests that the motif shown in Fig. 6(c) is beneficial for robustness, while an inhibition link from the output node C is detrimental. It is the balancing act of these two factors that determines the overall circuit robustness.

The discovery of this three-peak structure and the corresponding topological similarity within each peak is facilitated with the use of machine learning. In particular, we consider a simple type of artificial neural network called multilayer perceptron (MLP), where we train it to predict the *R* value from the input of the network topology through a small hidden layer with only two nodes, as demonstrated in Fig. 6(e). This bottleneck structure in the hidden layer plus the *l*-1 regularization imposed on the input matrix *W*_{in} forces the MLP to extract low-dimensional features from the input topology to estimate *R*. In our tests, the MLP designed this way automatically assigns different levels of weights to the input information of different links. Over an ensemble of 50 MLPs trained with different random initial values, the ranking of average importance is shown in Fig. 6 (f). The top four links are the four links used to categorize the three peaks.

The results in Fig. 6 is for *k*_{g} = 0.6. However, we find that different values of *k*_{g} lead to essentially the same ranking of *R*(*k*_{g}) among the circuit topologies, as illustrated in Fig. 7.

Three remarks on our categorizing rules based on the two extracted featured motifs are in order.

First, the shared motif for the green group is strikingly similar to the optimal minimal network in Fig.5(b) (the orange group). The sole distinction lies in the self-activation link of node B. This specific link plays a crucial role. Every network in the NFBL family depends on this link to achieve adaptation (** Shi et al**.,

**). However, for circuits within the IFFL family, this link is not a necessity for adaptivity. Missing this link makes the motif in Fig. 6(c) no longer a minimal network for adaptation, and a circuit containing this motif may either belong to the NFBL or the IFFL family. We have thus identified two optimal groups: the green group with optimal robustness**

*2017**R*and the orange group with both the optimal robustness

*R*and the largest functional volume

*Q*(

*k*

_{g}= 0) in the absence of growth feedback. The orange group is a subset of the green group, with an additional requirement for

*Q*(

*k*

_{g}= 0).

Second, the shared motif for the red group is also exactly the group of all circuits containing an inhibition link from node C to node B, denoted as C ⊣ B. These two different definitions are in fact equivalent: all networks with *Q*(*k*_{g} = 0) > 300 that contain C ⊣ A or C ⊣ C also contain the motif in Fig. 6(c), yielding *B*_{S} = 0, and belong to the blue group.

Third, the three circuit groups in Fig. 6 are not correlated with the categories used in previous research on circuit functionalities without growth feedback (** Ma et al., 2009**;

**). These studies classified adaptive networks into NFBL and IFFL families. Each family contains a few minimal topologies with or without some additional other motifs, and the two families have distinct minimal functional topologies. The minimal topology acts as the backbone for supporting circuit functionality. We find that, when growth feedback is present, the prior classification scheme and the underlying minimal topologies become less relevant. Circuits belonging to the NFBL family are spread across all three levels of**

*Shi et al., 2017**R*(

*k*

_{g}) in Fig. 6(b), as are the circuits from the IFFL family. A robust circuit can be part of either family, just as a fragile circuit can belong to both. We give that: (i) the topological motifs determining circuit functionality robustness and (ii) the motifs deciding whether a circuit belongs to the NFBL or IFFL family are independent. To quantify this irrelevance, we calculate the point biserial correlation between

*R*(

*k*

_{g}= 0.6) and a binary variable determining the family to which the circuit belongs. The resulting correlation is merely 0.1, suggesting hardly any correlations. A further illustration and quantification of this irrelevance can be found in Appendix 5.

What are the reason and mechanism behind the phenomenological set of circuit categories? Especially, it is desired to understand why the shared motif for the green group is beneficial for circuit robustness, and why the shared motif for the red group is harmful for robustness. It is challenging to find straightforward explanations given the complexity of the problem (see Discussion section). Certain insights are as follows. We find that the average node concentrations at the equilibrium for the network topologies in the red group are consistently smaller than those in the blue group. This difference is reflected in the value of burdens *b*. In particular, according to Eq. (12), the cell growth rate is proportional to the term 1/(1 +⟨*b*⟩) under the same level of growth feedback. Figure 6(d) shows the average burden *b* for each network topology, demonstrating that the values of the term 1/(1 + ⟨*b*⟩) for the circuits in the red group are larger than the values in the blue group. As a result, for the same value of *k*_{g}, the growth feedback effectively received by the circuits in the red group is stronger than that of the blue group circuits. Further support is provided by the results from the limit *J* → ∞ (Appendix 6). In this limit, the burden *b* does not affect the strength of the growth feedback. As a result, the *R* values of the red group significantly overlap with those of the blue group, suggesting that the distinctively low values of *R* for the red group be a result of the burden with finite *J*. We also find that the existence of the shared motif for the red group has a stronger correlation to the motif necessary for growth-feedback induced oscillations. All circuits with oscillation type of failures taking up more than 20% of failures belong to the red group. This correlation can result in further fragility of the red group circuits.

## Scaling law quantifying the effect of growth feedback on gene circuits

A comprehensive way to understand the effects of growth feedback on gene circuits is through scaling laws, an approach commonly employed in statistical and nonlinear physics. Does a scaling law exist that characterizes quantitatively how growth feedback affects the circuit functioning? Through a systematic computational analysis of the circuit robustness, we have uncovered a scaling law that governs how the robustness measure *R*(*k*_{g}) deteriorates as growth feedback is strengthened, as shown in Fig. 8, where the blue curve is the result averaging over all the 425 network topologies. The three other curves represent circuits that have a relatively high, moderate, and low value *R* among the 425 topologies tested. As growth feedback is strengthened, the number of circuit topologies that can maintain functioning decreases (or, equivalently, the number of failed circuits increases). The decreasing behavior of *R*(*k*_{g}) with *k*_{g} tends to be slower than exponential [e.g., exp(−*βk*_{g}) with *β* > 0 being a constant].

A general theoretical argument for the scaling law is unavailable. However, if we simplify the system by setting the parameter *J* in Eq. (12) to be large so the burden *b*(*t*) is much smaller than one, we are able to argue that the scaling law is approximately given by

where *β* > 0 and 0 < *λ* < 1 are two specific constants that depend on the network topology, and the typical value of *λ* is about 0.6. The exponential scaling is assumed, given its memorylessness. That is, there is no special zero point of *k*_{g}, for the reason that a certain level of *k*_{g} is mathematically equivalent to a larger *d*_{x}, as discussed below.

The quantity *R*(*k*_{g}) is a simple and straight-forward measure characterizing the detrimental effect of growth feedback on gene circuits. We carry out a semi-quantitative analysis of this quantity and the effect of *k*_{g} on it. The circuit dynamical equations Eqs. (9-12) can be simplified by substituting Eq. (14) into them to cancel the *dN*/*dt* terms, leading to

Compared with the equations without growth feedback Eqs. (6-8), we see that introducing growth feedback is equivalent to adding a variable *k*_{g}/(1 + *b*) to the degradation terms for each node. Intuitively, the value *Q* of a network topology measures the volume of the functional region ℳ in the parameter space, which is also a function of *k*_{g}. We thus have that *R*(*k*_{g} = *k*) is the volume of the intersection between ℳ(*k*_{g} = *k*) and ℳ(*k*_{g} = 0) divided by the volume of ℳ(*k*_{g} = 0):

where *V* (ℳ) is the volume of ℳ.

The picture can be further simplified if we assume the burden *b* is approximately a constant within a range of *k*_{g}. Since growth feedback contributes to an additional term in degradation *d*_{x}, strengthening the feedback is equivalent to increasing all three quantities *d*_{x} together. Consequently, as *k*_{g} increases, the high dimensional region *M* does not deform, but simply translates in the negative direction in all dimensions of degradation *d*_{x} in the parameter space. That is, as growth feedback becomes stronger, it becomes more difficult for the circuit to maintain it functioning.

# Discussion

When a synthetic gene circuit is introduced into a host cell, an inherent coupling arises wherein the gene circuit affects cell growth and cell growth in turn alters the circuit gene expression (** Klumpp et al., 2009**;

**). Due to the fundamental nonlinearity in the gene network and in the cell growth dynamics, the interaction is generally quite complicated. To understand this interaction so as to identify the circuit topologies that can withstand the interaction and maintaining the intended circuit functions is one of the most challenging problems in synthetic biology.**

*Klumpp and Hwa, 2014*Previous studies showed that growth-mediated feedback can endow synthetic gene circuits with various emergent properties. In general, growth feedback tends to negatively impact the intended function the circuit is designed for. There was preliminary evidence that the effects of growth feedback depend strongly on the circuit topology (** Zhang et al., 2020**). For a particular circuit function, while the vast majority of the topologies would fall under growth feedback, a handful still exists that is adaptable to maintain its designed functions. Identifying the “optimal” topologies that are most robust against growth feedback is fundamental to constructing synthetic gene circuits that can survive, adapt, and function as designed in the fluctuating growth environment of the host cells.

The main contribution of this paper is a systematic computational study of three-gene circuits with adaptation to uncover and understand the detrimental effects of growth feedback on gene circuits and to identify optimal groups of topologies. Without growth feedback, there are 425 possible topologies with functional adaptation. A vast majority of these circuit topologies fail in their functions under growth feedback, and our computations have revealed, for the first time, six distinct main failure categories covering more than 99% of the cases. From a dynamical point of view, there are three mechanisms by which growth feedback can deprive the circuit of its ability to adapt: (i) continuous deformation of the response curve, (ii) strengthened or induced oscillations, and (iii) sudden switching to coexisting attractors (also summarized in ** Box 1**). By introducing a robustness measure to quantify circuit adaptation in the presence of growth feedback, we uncover a general scaling law characterizing the detrimental effect of growth feedback on the circuit functioning in a quantitative manner. We identify an optimal group of circuits with high robustness and key subsets of links associated with this group that play a critical role in sustaining circuit function in host cells. Taken together, to design a functional gene circuit, growth feedback must be taken into account, as the same circuit designed with perfect functions without the feedback can behave quite differently when the feedback is present. Our study has provided unprecedentedly quantitative insights into the interplay between gene circuit topology and growth feedback, unlocking the dynamical mechanism of growth-induced failures and providing guidance to better design practically applicable synthetic gene circuits.

A unique finding is that growth feedback can induce or strengthen oscillations in gene circuits designed for adaptation. Such oscillations can often destroy the circuit functionality. In a recent experimental study, a similar phenomenon was observed in gene circuits designed for self-activation (** Melendez-Alvarez et al., 2021**). These results suggest that growth-feedback induced oscillation may be a general dynamical mechanism that can negatively affect the robustness of gene circuits. In addition, our study has shown that growth feedback has a highly sensitive dependence on the circuit topology: even a small structural differences between two circuits designed for the same function can result in drastically different outcomes under growth feedback. For example, Fig. 3 demonstrates that a link critical for an oscillation-supporting motif can significantly affect the robustness of the circuit against growth feedback. It can thus be quite useful to identify failure-related motifs so that they can be avoided when designing a gene circuit.

From a broad point of view, our study has yielded basic insights into the fundamental topology-function relationships in gene circuits. Examples include how circuit topology affects circuit robust-ness against growth feedback and whether a circuit topology contains motifs supporting a specific type of growth-induced failure, such as oscillation-related malfunctions. However, searching for and understanding the interplay between circuit topology and dynamical behaviors remain to be a challenge, for the following five reasons.

First, the two relevant questions are whether a circuit topology supports adaptation and whether the circuit is robust against growth feedback or is susceptible to a specific type of growth-induced failures. While our study focused on the latter, the former is important. Addressing both questions to identify and analyze all possible scenarios is infeasible at the present, due to the complex parameter space of the circuits. To make our study feasible, we focused on the cases where the circuit satisfies all the requirements for adaptation in the absence of growth feedback. These cases may occupy a small region in the entire parameter space of the circuit. For each circuit topology, the uncovered function failures due to growth feedback are thus limited to relatively small parameter regions. Second, most network topologies studied have dense connections among the three nodes (only about 20% of the networks have fewer than six connections). As a result, different motifs can overlap with each other, blocking or enhancing the function of each other. The dense connections thus pose a difficulty in identifying the motifs accurately. Third, for a particular class of failures, competition among different failure types may arise. For instance, a circuit with oscillation-supporting motifs may not have a high fraction of oscillation-induced failures because it also contains the motif for bistability, leading to a large fraction of failures due to the bistability-induced malfunctions. Fourth, due to the necessity to set a threshold in the relaxation time, transient behaviors can arise. In many failure cases caused by oscillations, the oscillatory behavior is not stable and the circuit will eventually approach a fixed point. However, time scales should be taken into account. The transient behaviors can make the network topologies without the necessary motif for sustained oscillations exhibit oscillation-induced failures. Fifth, growth feedback acts as additional feedback loops within the circuit, potentially complicating the circuit dynamics and adding more links to the circuit topology. These extra links in the integrated topology might give rise to an oscillation-related motif. However, our simulations have shown that the impact of this additional oscillation motif, introduced by growth feedback, tends to be weak (Appendix 6).

In Appendix 7, we extend our analysis to four-gene circuits with over two thousand functional failure trials. A remarkable finding is that the failure scenarios for these four-gene circuits are the same as the categories for three-gene circuits (summarized in ** Box 1**), indicating that the growth-feedback induced failure mechanisms identified in our work are general.

The primary goal of this paper is to explore the ways in which growth feedback can undermine a gene circuit’s functionality, and we have uncovered three main dynamical failure mechanisms. The reason that we focus on small gene circuits (i.e., those with three or four genes) is that, in current synthetic biology, only small gene circuits are of interest. The main reason is that, even for a modest number of genes, when the circuit is introduced to a host, the competition and interactions in the form of growth feedback are likely to lead to unintended and uncontrollable consequences. Another reason is resource competition: the genes in the circuit could compete for the limited resources in the host cell, negatively impacting the circuit dynamics. Because of the two reasons, at present large gene circuits are not favored in synthetic biology. In fact, the state-of-the-art synthetic gene circuitry usually involves three or four genes, where the consequences of growth feedback had been poorly understood. Our work fills in this knowledge gap.

It is possible that, in the future, synthetic biology may use larger and more complex circuits. To uncover and understand the failure mechanisms as well as to identify circuits that are resilient to growth feedback, machine learning can be used. For example, recurrent neural networks have recently been used to identify circuit topologies appropriate for a specified desired function (** Shen et al., 2021**), and reinforcement learning tackle the combinatorial optimization problem (

**;**

*Bello et al., 2016***) of pinpointing the optimal circuit topologies. Furthermore, automated differentiation (**

*Mazyavkina et al., 2021***;**

*Hiscock, 2019***) can be exploited to locate optimal network parameters, which can be efficient for larger circuits with a high-dimensional parameter space. In spite of these works, to study the effects of growth feedback and resource competition among numerous genes in larger circuits remains to be a formidable challenge. Our work providing a comprehensive picture of the failure mechanisms induced by growth feedback represents a step forward in this field.**

*Kong, 2022*# Models and Methods

## Model description

We restrict our study of the class of transcriptional regulatory networks (TRNs) with the AND logic. For an isolated circuit (in the absence of any growth feedback) with the topology specified inside the red dashed box in Fig. 1(a), the dynamical equations are

where the dynamical variables *A, B*, and *C* are the concentrations of each protein (node). The notations are as follows. Let *x* and *y* be two arbitrary nodes. The quantity *ν*_{x} is the maximal production rate of gene *x, d*_{x} is the degradation rate of gene *x, dx*/*dt* is its time derivative of the concentration, *n*_{xy} and *K*_{xy} are the coefficients in the Hill function for a transcriptional regulation from gene *x* to gene *y*.

When growth-mediated feedback is present, the dynamical equations of the three-node circuits are modified to

where the additional dynamical variable *N* denotes the density of the host cells, *k*_{g} is a parameter controlling the maximal growth rate of the host cells, *J* is a parameter reflecting how this three-node gene circuit contributes to the burden.

The growth of *N* is under the regulatory action of two sources: by itself following the logistic equation with the environmental capacity *N*_{0} and by the burden *b* that represents the competence from the metabolism of the gene circuit. To make the computations feasible, we focus our analysis on the exponential growth phase so that *N*_{0} ≫ *N*. The equation governing the growth of the cell numbers, Eq. (12), can then be rewritten as

where the dilution rate *dN*/*dt* is regulated only by the burden *b*(*t*) of the gene circuit. While cell growth is inhibited by the metabolism of the gene circuit, the circuit is also regulated by the growth of *N* that dilutes the concentration of circuit nodes with increasing cell volume. This dilution is reflected by the additional terms −(*x*/*N*)(*dN*/*dt*) in Eqs. (9-11).

It is useful to clarify the meaning of the degradation parameter *d*_{x} and its relationship to growth feedback. While degradation and growth feedback terms have the same sign in the regulatory equations, *d*_{x} may include a constant dilution. We assume that *d*_{x} represents the sum of all the degradation effects in cells that are distinct from growth feedback. For instance, degradation tags, especially in the *ssrA* tagging systems (** Gottesman et al., 1998**), are often used in synthetic gene circuits to increase the degradation rate and thus increase the time scale of the whole system (

**and Leibler, 2000;**

*Elowitz***;**

*Fung et al., 2005***;**

*Stricker et al., 2008***).**

*O’Brien et al., 2012*## Numerical criteria for functional adaptation

We introduce four criteria to determine if a circuit has functional adaptation.

### Precision

The basic requirement of adaptation is that the output remains the same when is input is switched from one state to another, i.e., *O*_{2} should be close to *O*_{1} in Fig. 19b). Specifically, we set the precision criterion to be |(*O*_{2} − *O*_{1})/*O*_{1} |< 0.1.

### Sensitivity

The circuit is also required to respond to the switch of the input signal with a high peak. This ability of the circuit is named sensitivity. We introduce two types of sensitivity: relative and absolute, with the respective criteria *O*_{peak}/*O*_{1} > 0.5 and *O*_{peak} > 0.1. Only the circuits meeting both criteria are regarded as having achieved the required sensitivity.

The need to use the two different criteria simultaneously can be justified, as follows. Given the variety of network topologies and a large number of system parameters, there is a vast diversity in the circuit dynamics and the values of *O*_{1}. When *O*_{1} is small, it is difficult to observe a peak that has even satisfied the relative sensitivity criterion. If the absolute criterion is used alone for a circuit with a large *O*_{1} value, the peak may be negligible in comparison with *O*_{1}, making its observation practically difficult. It is thus necessary to combine the two criteria so that the cases of small and large values of *O*_{1} can be dealt with on the same footing.

### Oscillations

To achieve the desired adaptation, the circuit’s output should reach a steady state before and after the input signal is switched. The values of *O*_{1} and *O*_{2} can be determined as the output signal associated with the steady states. However, realistically, it is not necessary to require that the circuit reach an exact equilibrium. Relatively small oscillations in the circuit are acceptable. We define a “relative steady state” where, within a time block of *t*_{block} = 200, the standard deviation of the time series of each node *x*(*t*) satisfies: std(*x*) < 1 × 10^{−4} and std(*x*)/mean(*x*) < 0.05. To further guarantee that the circuit is actually in the “relatively steady state,” two successive time blocks satisfying the standard deviation requirements are needed. The quantities *O*_{1} or *O*_{2} are then defined as the respective mean values of the output signal in that last time block *t*_{block}.

### Relaxation time

An ideal gene circuit should be able to respond and adapt within a reasonable time scale. We set an upper bound of evolution time *t*_{max} = 4, 000. If the system cannot reach the “relative steady state” within this time, it is regarded as non-functional.

## Details of parameter space sampling and response simulation

A three-node gene circuit subject to growth feedback has a large number of parameters. Let *L* be the number of links among the three nodes (excluding the input link). The total number of parameters is 2 · 3 + 2(*L*+ 1) = (2*L*+ 8). The values of these parameters determine the properties of the regulation links within the circuit and, as a result, the circuit dynamics. The circuit parameters are randomly generated by the Latin hypercube sampling method (** Iman et al., 1980**) using the function “lhsdesign” in Matlab. The parameters are sampled uniformly either on a logarithmic or a linear scale. The sampling ranges of the parameters are:

*ν*

_{x}∈ [10

^{−1}, 10

^{1}] (sampled in logarithmic scale),

*d*

_{x}∈ [10

^{−2}, 1] (sampled in logarithmic scale),

*n*

_{xy}∈ [1, 4] (sampled in linear scale), and

*K*

_{xy}∈ [10

^{−3}, 1] (sampled in logarithmic scale).

The dynamical equations of the circuits are numerically integrated by the 4th order Runge–Kutta method with a time step Δ*t* = 0.05. All the initial states of *A, B*, and *C* are taken to be 0.1. The input signal is initially *I*_{0} = 0.06 and then switched to *I*_{1} = 0.6.

We also observe that, when an isolated circuit fails, a certain amount of growth feedback can restore the circuit’s functions. This phenomenon was previously discovered experimentally (** Tan et al., 2009**). However, such cases are rare. We thus focus on circuits that are functional in isolation and examine how growth feedback affects their adaptation.

# Acknowledgements

This work was supported by ONR under Grant No. N00014-21-1-2323, by NIH grant (R35GM142896 to X.-J.T.), and by the Young Talent Fund of the University Association for Science and Technology in Shaanxi, China, grant No. 20210506.

# Additional information

# Data availability

All relevant data are available from the authors upon request.

# Code availability

All computer codes can be found at: github.com/lw-kong/Growth_Feedback_Adaptation

Author contributions

LWK, XJT, and YCL conceived the idea. LWK did all the simulations. All analyzed the results and data. LWK and YCL wrote the manuscript. LWK, XJT, and YCL edited the manuscript.

# Appendix 1

## An Analysis on the Mathematical Criterion for Robustness Against Growth Feedback

The quantitative measure *R*(*k*_{g}) we have introduced to characterize the effects of growth feedback on gene circuit functioning is generally not amenable to analytic treatment. However, for weak feedback, certain analytic insights can still be gained. Here we consider a three-node gene circuit designed to have adaptation and analyze how growth feedback destroys adaptation. We focus on type-I failure, where the growth feedback makes *O*_{2}(*C*) deviate from *O*_{1}(*C*), because (1) this type of failures is arguably the most important type as it alone takes nearly half of all the failures, and (2) it can be analyzed. Here we provide a semi-quantitative analysis to elucidate how a small *k*_{g} > 0 can make *O*_{2}(*C*) deviate from *O*_{1}(*C*).

## Circuit robustness in the absence of growth feedbac

The dynamical equations of the circuit in the absence of growth feedback are:

Where

and each *H* term represents the regulation of a single link in the circuit. The steady-state solutions (*A*_{0}, *B*_{0}, *C*_{0}) are given by

For notation convenience, we use *x* to denote an arbitrary node (A, B, or C). The steady-state solutions can thus be written as

With a small input signal change Δ*I* applied to the circuit, the steady states becomes (*A*_{0} + Δ*A*_{0}, *B*_{0} + Δ*B*_{0}, *C*_{0} + Δ*C*_{0}). Under Δ*I*, the dynamical equations at the steady point can be written as

For Δ*I* = 0, the equation becomes

Subtracting Eq. (24) from Eq. (23), we get

Where

is the Jacobian matrix of the original dynamical equations evaluated at (*A*_{0}, *B*_{0}, *C*_{0}).

Solving Eq. (25), we have

For the steady state to remain stable under Δ*I*, the requirement is that ratio Δ*C*_{0}/Δ*I* be small. Assuming that the Jacobian matrix satisfies the conditions to make points (*A*_{0}, *B*_{0}, *C*_{0}) and (*A*_{0} +Δ*A*_{0}, *B*_{0} +Δ*B*_{0}, *C*_{0} +Δ*C*_{0}) stable in their corresponding dynamical systems, we have

where (·)_{3} denotes the third component of the vector inside. The limiting case of a perfectly precise circuit is defined to be Δ*C*_{0}/Δ*I* = 0, yielding a precision criterion of by ()_{31} ≈ 0 or

leading to

which is the central criterion analyzed in ** Shi et al**. (

**). The two families, NFBL and IFFL, satisfy this same criterion through different mechanisms.**

*2017*## Precision criteria in the presence of weak growth feedback and *J* → ∞

We now incorporate growth feedback into the analysis in the limit *J* → ∞. In this case, the burden *b* is small so that the dilution strength can be approximated as *dN*/*dt*/*N* ≈ *k*_{g}. Suppose weak growth feedback is present before and after the small input signal Δ*I* is applied. Let the steady state under growth feedback before application of Δ*I* be denoted as (). The steady state with input Δ*I* can be written as ().

The basic equations before and after application of Δ*I* are

Subtracting Eq. (31) from Eq. (32), we get

where ℑ is the identity matrix. The solution is

Compared with Eq. (27), the differences are that the matrix 𝒥≤ is replaced by (), and *x*, Δ*x* are replaced by *x*^{′}, Δ*x*^{′}, respectively.

The precision criterion again requires to be small. we have

which is equivalent to

leading to

Comparing this equation for precision criterion Eq. (37) with the criterion Eq. (30) in the absence of growth feedback, we find an extra term of *k*_{g}. This explicit term of *k*_{g} makes the criterion more difficult to satisfy with a range of different *k*_{g} values. It requires either *∂f*_{C} /*∂A* is zero or the four partial derivative terms change accordingly with a varying *k*_{g} to have exact cancellations.

For neither the NFBL nor the IFFL family, *∂f*_{C} /*∂A* = 0 can be satisfied. In none of the 425 network topologies, the link from node A to node C is absent (*∂f*_{C} /*∂A* = 0). Thus with a random sampling of the parameters for the circuits that have adaptation at *k*_{g} = 0, the probability that *∂f*_{C} /*∂A* = 0 can occur is negligibly small.

## Precision criterion with exact cancellations for the optimal family

As the criterion *∂f*_{C} /*∂A* = 0 cannot be satisfied in three-node gene circuits, we discuss the possibility of exact cancellations with varying *k*_{g}. For the optimal circuit family demonstrated in Fig. 5(b), we have *∂f*_{C} /*∂B* = 0 as there is no direct link from node B to node C. The precision criterion becomes

Since *∂f*_{C} /*∂A* ≠ 0, this can be rewritten as

For this family, the precision criterion in the absence of growth feedback is

Combining Eqs. (39-40), we get

Using the approximation employed in ** Shi et al**. (

**) for the NFB family that**

*2017**f*

_{B}is a linear function of

*B*, we have

We thus have

This equation can be solved analytically only in the regime of *k*_{g} ∼ 0 where () and () are approximately linear functions of *k*_{g}. But it should be difficult for the circuit to meet this criterion with a random sampling of the circuits that have adaptation at *k*_{g} = 0.

# Appendix 2

## Network motifs supporting oscillations

As summarized in ** Novák and Tyson** (

**), three classes of motifs can support oscillations in a three-node circuit.**

*2008*### Class 1 (the dominant class)

Delayed negative-feedback loop with an intermediate node in the path of the negative feedback loop. A majority of the networks with an oscillation-supporting motif belong to this class (237 out of 245 networks). All the circuits that have more than 20% failures as oscillation-induced failures belong to this class.

### Class 2

Amplified negative-feedback loop, with a node regulated by both a negative-feedback loop through another node and a positive-feedback loop through the third node. There are only 8 network topologies that fall into this class. They result in 3% to 20% oscillation-induced failures.

### Class 3

Incoherently amplified negative-feedback loops, as demonstrated in Fig. 5(c) of ** Novák and Tyson** (

**). Among all the 425 networks capable of adaptation studied in our work, no network belongs to this class.**

*2008*# Appendix 3

## Self-activation and toggle switch circuits

The key quantitative results about the survival ratio *R*(*k*_{g}) presented in the main text are obtained from various circuit topologies with three genes. To demonstrate the general applicability of *R*(*k*_{g}), we study two simpler gene circuits: a self-activation circuit with a single gene and a toggle switch circuit with two genes. A comparative study of these two classes of circuits has been carried out recently (** Zhang et al., 2020**), whose topological structures are shown in Figs. 1(a1) and 1(a2), respectively. In the absence of growth feedback, both networks exhibit bistability and a hysteresis loop. Under dilution, the self-activation circuit quickly loses the memory while the toggle switch circuit can remain functional, as was observed numerically and experimentally (

**).**

*Zhang et al., 2020*Our simulation settings are mostly identical to that of 3-node circuits in the main text, including the sampling regions of the random circuit parameters, the specifics of the ODE solver, and the criterion for locating equilibrium. We set *J* = 1. Other than the network topology, the only difference is the functionality criteria. Here, the desired function is a hysteresis. We test the response of the circuit output when (i) the input is a switch from an off-state (with input signal *I*_{off} = 10^{−8}) to an on-state (with input signal *I*_{on} = 2) and (ii) the input is switched from an on-state to an off-state. In the former trial, the steady-state output is switched from *O*_{1,off} to *O*_{1,on}, while in the latter it is switched from *O*_{2,on} to *O*_{2,off}. The criteria are: (i) the two steady states are distinguishable: Δ*O* = *O*_{2,on} − *O*_{1,off} > 0.1; and (ii) the system exhibits a hysteresis: (*O*_{1,on} − *O*_{1,off})/Δ*O* > 0.5 > (*O*_{2,on} − *O*_{2,off})/Δ*O*.

Figures 1(b1) and 1(b2) show the scaling law of *R*(*k*_{g}) with *k*_{g} for the self-activation and toggle switch circuits, respectively. It can be seen that, for the self-activation circuit, as the growth feedback strength increases, *R*(*k*_{g}) approaches zero quickly, indicating that the circuit function cannot sustain even weak feedback with near zero strength. For the toggle switch, *R*(*k*_{g}) approaches zero eventually but at a much slower rate, a result that is consistent with the finding in ** Zhang et al**. (

**). Remarkably, the scaling of**

*2020**R*(

*k*

_{g}) with

*k*

_{g}exhibits qualitatively similar behavior as the scaling laws reported in the main text for various three-gene circuits, lending further credence for the general applicability of the quantitative measure

*R*(

*k*

_{g}) to characterize the effects of growth feedback on gene networks.

# Appendix 4

## Regularized feed-forward neural networks for identifying critical links

We employed ensembles of regularized feed-forward neural networks to detect, in an automated fashion, the links that are crucial in determining the level of robustness *R*. The neural-network structure is illustrated in Fig. 6(e), which has three layers: an input layer, a hidden layer, and an output layer. The input layer receives a nine-dimensional circuit topology vector where each entry represents a potential link in the three-node circuit, such as *A* → *A* and *B* → *C*. For an activation (inhibition) link, the entry value is set to +1 (−1). In the absence of such a link, the value is zero. In the hidden layer, there are only two neurons that use a hyperbolic tangent activation function, creating a bottleneck that limits the complexity of the extracted features. The output layer has one neuron that uses a hyperbolic tangent activation function trained to output the estimated robustness . The input and hidden layers are connected by the matrix *W*_{in}, and the hidden and output layers are connected by the matrix *W*_{out}. Given the input vector *u*, the estimated can be expressed as

We use all the 303 circuit topologies that have *Q*(*k*_{g} = 0) > 100 for training to minimize the relative random fluctuations in the training data. The loss function for optimization is

where *β* = 0.05 is the *l*-1 regularization coefficient, *L*_{n} = 9 is the number of possible links within a three-gene circuit, and *w*_{h} = 2 is the width of the hidden layer. We train the network using a stochastic gradient descent algorithm and repeat it 50 times with different initial weights in the neural net matrices. The “importance” of a link is determined by the logarithm of the absolute value of the weights in *W*_{in} corresponding to the gain of that link. This importance measure is then averaged over all 50 neural networks.

# Appendix 5

## Lack of correlation between the circuit robustness and topological families

As shown in Fig. 1, the network topologies belonging to the two different families (marked in different colors) are mingled together and spread all over the range of *R*(*k*_{g}), suggesting no significant correlation between the circuit robustness and circuits family. To quantify this irrelevance, we calculate the point biserial correlation between (a) the *R*(*k*_{g}) values of all the network topologies with *Q*(*k*_{g} = 0) ≤ 200 (to lower the fluctuations) and (b) a binary variable *b*_{f} which is *b*_{f} = 0 for the NFBL family and *b*_{f} = 0 for the IFFL family. The calculation involves 108 NFBL network topologies and 93 IFFL topologies. The resulting point biserial correlation is as small as 0.1. The 95% confidence interval for the true difference with respect to the two families of *R*(*k*_{g}) is (−0.01,0.06), which is narrow around zero.

# Appendix 6

## Results from low burden level

For the simulation results reported in the main text, the burden parameter is fixed at *J* = 1. What are the possible behaviors of the gene circuit for different values of *J*? Suppose *J* is much larger than one. In this case, the burden term *b* that has *J* in the denominator is negligible, thereby reducing the complexity of the system and providing a parameter regime in which the contributing factors to the survival ratio *R*(*k*_{g}) other than the burden can be identified.

In the regime of large *J*, the burden in Eq. (8) in the main text is much smaller than one, so Eq. (7) in the main text about growth rate can be simplified as

indicating that cell growth is determined entirely by the growth-feedback strength *k*_{g}. It can be seen from Eqs. (4-6) in the main text that, in this case, the effect of growth feedback is equivalent to a linear change of the amount *k*_{g} in the degradation terms *d*_{x}. Further, the interaction between cell growth and the gene circuit is no longer of the type of mutual inhibition: the regulation is a one-way interaction from cell growth to the gene circuit. A semi-quantitative analysis of this scenario can be found in Appendix 1.

We carry out the simulations as in the main text in the regime of large *J* and perform a comparative analysis of the results.

The first issue concerns the relative fractions of different failure scenarios. Figure 1 compares the distributions of distinct types of circuit failures for *J* = 1 and *J* → ∞. The possible failure scenarios are identical in both cases, in spite of the quantitative differences in the relative fractions of the failure mechanisms. Some of the differences are sizable, but none is significant in the sense that none is beyond an order of magnitude. For example, for *J* = 1, type-I failures are the most common (49%) where the precision criterion is broken in a continuous fashion. For *J* → ∞, the fraction is about 31%, but the reduction is still within a factor of two. The plausible reason for the reduction is that the additional regulation of the burden *b* for *J* = 1 is more difficult to be maintained (Appendix 1).

The second issue is the scaling law between the survival ratio *R*(*k*_{g}) and the growth-feedback strength *k*_{g}. Figure 2 compares the scaling laws of *R*(*k*_{g}) for three circuit topologies for *J* = 1 and *J* → ∞, where the results in Figs. 2(a1) and 2(a2) are represented on a linear scale, while those in Figs. 2(b1) and 2(b2) are on a double-logarithmic versus logarithmic scale. The approximately linear relation in Fig. 2(b2) suggests that, for *J* → ∞, the scaling laws is given by (1) in the main text.

For *J* = 1, the scaling law (1) is less accurate, as shown in Fig. 2(b1), which can be heuristically explained, as follows. Suppose we use Eq. (46) and reduce *J* from a large value to one, which is equivalent to adding back the negative feedback from the burden *b* = *A* + *B* + *C* to cell growth. Since cell growth effectively inhibits the gene regulation in the circuit, the burden will be larger for smaller values of *k*_{g}, suppressing the cell growth. Thus, for weak growth feedback (corresponding to small values of *k*_{g}), for small *J, R*(*k*_{g}) decreases more slowly than for larger values of *J*. The difference becomes smaller for larger values of *k*_{g}, causing the curves on the left side in Fig. 2(b1) to be lower than those in Fig. 2(b2), but the curves on the right side are similar in both cases.

The third issue is the effect of the network topology on the survival ratio *R*(*k*_{g}). Figure 3 presents a comparison of the dependency of *R* on the circuit topology for *J* = 1 and *J* → ∞. As discussed above, the difference in the burden *b* can be a major reason for the data points in the red group to have lower *R* values compared with those in the blue and green groups. When the term *b* is effectively removed by setting *J* → ∞, the difference diminishes. It can be seen from Figs. 3(a2), 3(b2) and 3(c2) that, in this case, the range of *R* for the red group, in spite of the low *R* values, overlaps with that of the blue group. However, the *R* values associated with the red group are still distinctly smaller than those with the green group, suggesting some characteristic differences in the network motifs that define these two groups.

Figures 3(b1) and 3(b2) indicate a persistent feature of the distribution of the survival ratio *R*(*k*_{g} = 0.6) for the ensemble of networks: there are three peaks regardless of whether *J* has a small or a large value. Further, the three peaks are approximately located at the same positions for *J* = 1 and *J* → ∞. This feature provides a criterion to determine the likelihood of a given network topology being stable or unstable under growth feedback without the need to calculate the *R*(*k*_{g}) value for many values of the feedback strength. In particular, if the network is such that its *R*(*k*_{g} = 0.6) value is associated with the red peak, then it is highly likely to be unstable and fail to function under growth feedback. On the contrary, if a network “belongs” to the green peak, then the chance for it to sustain its function in a growth environment will be improved significantly.

There can be two different mechanisms for growth-induced oscillations: (i) by altering the system parameter, and (ii) by altering the circuit topology with the additional dynamical variable *N* and regulations attached to it. Our results suggest the first mechanism is the major one, while the second one does not appear to play a significant role. The second mechanism only exists with a finite *J*. Thus, we compare the cases of *J* = 1 and the limit of a large *J*. As shown in Fig. 3, the ratio of functional failures caused by growth-induced oscillations does not change much between the two cases. However, the oscillatory behavior is sensitive to the value of the dilution parameter. In order to have oscillations, it is necessary that the parameter be in some specific interval (** Novák and Tyson, 2008**).

# Appendix 7

## Four-gene circuits

To demonstrate the general applicability of our nonlinear dynamical analysis of the failure mechanism, we study four-gene circuits. Figure 1(a) shows ten representative circuits, where eight are from ** Qiao et al**. (

**) and two being the four-node modifications of three-gene circuits with oscillation-related motifs. For each circuit, we test 10**

*2019*^{5}random sets of parameters. To generate acceptable statistics, we ease the precision and sensitivity criteria to: (i) |(

*O*

_{2}−

*O*

_{1})/

*O*

_{1}|< 0.4, (ii)

*O*

_{peak}> 0.1, and (iii)

*O*

_{peak}/

*O*

_{1}> 0.5 or

*O*

_{peak}/

*O*

_{1}− | (

*O*

_{2}−

*O*

_{1})/

*O*

_{1}| > 0.1. All other simulation settings are the same as those in the three-gene circuit simulations as detailed in the main text. We collect a total of 3,275 trials exhibiting functional adaptation in the absence of growth feedback (

*k*

_{g}= 0). As the growth feedback is turned on so that

*k*

_{g}= 0 increases

*k*

_{g}= 0.5, 2,373 trials encountered functional failures across all ten circuits.

We then investigate the causes of the functional failures. We find that all 2,373 trials fall into the same three categories identified for three-gene circuits: growth-induced oscillations, growth-induced switching in bistability, and continuous deformation of the system trajectory leading the system to cross the criteria thresholds, as shown in Figs. 1(b-d), respectively. For the cases studied, continuous deformation is the dominant failure mechanism, accounting for about 88% of the failures. The fractions of oscillation-related and bistability-related failures are approximately 10% and 3%, respectively. These results indicate that four-gene and three-gene circuits share the common mechanisms of growth-feedback induced failures, implying the generality of these failure mechanisms.

# References

- A universal biomolecular integral feedback controller for robust perfect adaptation
*Nature***570**:533–537 - Neural combinatorial optimization with reinforcement learning
*arXiv* - Host-aware synthetic biology
*Cur Opin Sys Biol***14**:66–72 - Overloaded and stressed: whole-cell considerations for bacterial syn-thetic biology
*Current Opinion in Microbiology***33**:123–130https://doi.org/10.1016/j.mib.2016.07.009 - Antithetic integral feedback ensures robust perfect adaptation in noisy biomolecular networks
*Cell Sys***2**:15–26 - Targeted DNA degradation using a CRISPR device stably carried in the host genome
*Nature communications***6**:1–10 - Quantifying cellular capacity identifies gene expression designs with reduced burden
*Nature Methods***12**https://doi.org/10.1038/nmeth.3339 - Burden-driven feedback control of gene expression
*Nature Methods***15**:387–393https://doi.org/10.1038/nmeth.4635 - Engineering Translational Resource Allocation Controllers: Mech-anistic Models, Design Guidelines, and Potential Biological Implementations
*ACS Synth Biol***7**:2485–2496https://doi.org/10.1021/acssynbio.8b00029 - Dynamic allocation of orthogonal ribosomes facilitates uncoupling of co-expressed genes
*Nature Communications***9**https://doi.org/10.1038/s41467-018-02898-6 - The innate growth bistability and fitness landscapes of antibiotic-resistant bacteria
*Science***342** - A synthetic oscillatory network of transcriptional regulators
*Nature***403**:335–338 - Growth feedback as a basis for persister bistability
*Proc Nat Aca Sci***111**:544–549 - Perfect and near-perfect adaptation in cell signaling
*Cell systems***2**:62–67 - Microbiome engineering: Current applications and its future
*Biotechnology journal***12** - Adaptive response by state-dependent inactivation
*Proceedings of the National Academy of Sciences***106**:22558–22563 - A synthetic gene–metabolic oscillator
*Nature***435**:118–122 - Programmable removal of bacterial strains by use of genome-targeting CRISPR-Cas systems
*MBio***5**:e00928–13 - The ClpXP and ClpAP proteases degrade proteins with carboxy-terminal peptide tails added by the SsrA-tagging system
*Genes Develop***12**:1338–1347 - Evolutionary regain of lost gene circuit function
*Proceedings of the National Academy of Sciences of the United States of America***116**:25162–25171https://doi.org/10.1073/pnas.1912257116 - Adapting machine-learning algorithms to design gene circuits
*BMC bioinformatics***20**:1–13 - Latin hypercube sampling (program user’s guide)
*Department of Energy, Sandia Laboratories* - Synthetic circuit for exact adaptation and fold-change detection
*Nuc Aci Res***42**:6078–6089 - Bacterial growth: global effects on gene expression, growth feedback and proteome partition
*Curr Opin Biotech***28**:96–102 - Growth rate-dependent global effects on gene expression in bacteria
*Cell***139**:1366–1375 - A molecular mechanism for sensory adaptation based on ligand-induced receptor modification
*Proceedings of the National Academy of Sciences***83**:2345–2349 - A repository for a program using automatic differentation to find the optimal circuit parameters to achieve adaptation
- Targeted approaches for in situ gut microbiome manipulation
*Genes***9** - Defining network topologies that can achieve biochemical adaptation
*Cell***138**:760–773 - Reinforcement learning for combinatorial optimization: A survey
*Computers & Operations Research***134** - The number of equilibrium points of perturbed nonlinear positive dynamical sys-tems
*Automatica***112** - Emergent Damped Oscillation Induced by Nutrient-Modulating Growth Feedback
*ACS Syn Biol***10**:1227–1236 - Emergence of qualitative states in synthetic circuits driven by ultrasen-sitive growth feedback
*PLOS Computational Biology***18**https://doi.org/10.1371/jour-nal.pcbi.1010518 - Mapping the environmental fitness landscape of a synthetic gene circuit
*PLoS Comp Biol***8** - Design principles of biochemical oscillators
*Nature reviews Molecular cell biology***9**:981–991 - Modeling synthetic gene oscillators
*Math Biosci***236**:1–15 - Network topologies that can achieve dual function of adaptation and noise attenuation
*Cell Sys***9**:271–285 - Engineering bacteria for diagnostic and therapeutic applications
*Nature Reviews Micro-biology***16**:214–225 - Interdependence of cell growth and gene expression: origins and consequences
*Science***330**:1099–1102 - Therapeutic bacteria to combat cancer; current advances, challenges, and op-portunities
*Cancer medicine***8**:3167–3181 - Finding gene network topologies for given biological function with recurrent neural network
*Nature communications***12**:1–10 - Adaptation with transcriptional regulation
*Sci Rep***7**:1–11 - A fast, robust and tunable synthetic gene oscillator
*Nature***456**:516–519 - Nonlinear dynamics and chaos: with applications to physics, biology, chemistry, and engineering
- Emergent bistability by a growth-modulating positive feedback circuit
*Nat Chem Biol***5**:842–848 - Sniffers, buzzers, toggles and blinkers: dynamics of regulatory and signaling pathways in the cell
*Current opinion in cell biology***15**:221–231 - Synthetic genetic circuits for programmable biological functionalities
*Biotech-nology Advances***37** - Winner-takes-all resource competition redirects cascading cell fate transitions
*Nature Communications***12**https://doi.org/10.1038/s41467-021-21125-3 - Topology-dependent interference of synthetic gene circuit function by growth feedback
*Nat Chem Biol***16**:695–701

# Article and author information

## Version history

- Sent for peer review:
- Preprint posted:
- Reviewed Preprint version 1:

## Copyright

© 2023, Kong et al.

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

# Metrics

- views
- 257
- downloads
- 10
- citations
- 0

Views, downloads and citations are aggregated across all versions of this paper published by eLife.