Design principles of transcription factors with intrinsically disordered regions
Figures

Model illustration of a TF with the IDR and its search process.
(a) Illustration of a TF locating its target site (highlighted in orange) on the DNA strand (depicted as a green curve). Surrounding the target is a region of length , where the TF’s IDR interacts with the DNA. The TF performs 3D diffusion until it encounters and binds to this ‘antenna’ region. (b) Illustration of a TF that includes an IDR composed of AAs. On the right: a model of the TF, a polymer chain, comprising a single binding site on the DNA Binding Domain (DBD), and multiple binding sites on the IDR (also known as short linear motifs Jonas et al., 2025). (c) Once bound to the antenna, the TF performs effective 1D diffusion until it reaches its target. The 1D diffusion is via the binding and unbinding of sites along the IDR, a process we coin ‘octopusing’. is the targets’ volume, the separation, and the binding energy per binding site, where each site corresponds to a short linear motif.

Design principles across various parameters: , , , and .
(a) Plots of varying with are shown for several combinations of and for typical values of the relevant quantities: the nucleus volume is , , , , and . (b) A heatmap of vs. and at and . (c) Contour lines at different at . Blue hexagrams represent the design principle , with the black dashed line as a visual guide. The inset shows the dependence of on energy. (d) for varying at for several . The vertical line represents as determined by Equation 4, the solid curves illustrate the approximation of obtained from Equation 3, and the horizontal line indicates where .

Mean search time varying with the antenna length , and vs. at and .
The solid curves correspond to , and in Equation 5. Inset: probability density function (pdf) exponentially decays at large (black solid line). The unit represents the time for one AA to diffuse . , , and .

Comparisons with experimental results.
(a) Relative binding probability varies with the truncation length of the IDR, which quantitatively agrees with the experimental data in Brodsky et al., 2020, where we only vary (binding energy of DBD) to ensure that the maximal value reaches 1 at zero truncation length. Antenna length , and . Other parameters are set the same as in Figure 2. (b) The search time estimated by our model (from Equation 5 and divided by a TF copy number of 100) is quantitatively comparable with the experimental results (Larson et al., 2011), as guided by the shaded area. c, The on-rate and off-rate varying with the IDR length, indicating that ranges from to 10 nM, with its variation primarily dominated by the off-rate.
© 2020, Elsevier. Panel A was reprinted with permission from Brodsky et al., 2020 with permission from Elsevier. It is not covered by the CC-BY 4.0 license and further reproduction of this panel would need permission from the copyright holder.

aligns well with Equation 10 (solid curves) for moderate at various.
Inset: vs. from Equation 12. (b) Equation 10 serves as a good approximation (solid lines) even at small .

Binding probability varying with and .
(a) varying with at . initially increases rapidly, until eventually saturating. (b) varies non-monotonically with . slightly decreases with increasing at . Three examples at are displayed. is denoted by pentagrams at .

Contour lines at and on the plane defined by the number of binding sites, , and the number of binding targets, .
The black dashed line is added to guide the relationship .

Plots of the affinity enhancement, quantified by the ratio , for the case where IDR sites as well as their targets are randomly positioned on the IDR and DNA, respectively.
See the analogous Figure 2 in the main text for the case of homogeneously distributed IDR sites and targets. (a) is maximal when (shaded region) for different values of . The average distance between IDR sites is , and the average distance between IDR targets is . (b) A heatmap of at and . (c) Contour lines at different at exhibit an approximate symmetrical L-shape, which indicates , as shown by the red triangles. decreases with increasing . (d) varies with for several different constants of . The characteristic energy , as predicted by Equation 4 in the main text, corresponds to the region where a notable increase is observed.

Demonstration of a single round of 3D-1D search in simulations.
(a) A trajectory (red) of TF in the cell nucleus (light blue). is the average distance of the TF to the antenna, and is the position of the center of mass along the antenna direction. The trajectory is displayed with a time interval of . (b) The average number of rounds the TF hits the antenna, , before reaching the DBD target. The optimal mean total search time is obtained at .

Search times as a function of and .
(a) , , and vs. at fixed and . The red solid line is the prediction of from Equation 5, and the blue line (also in the inset) is the exponential fit for . The black curve in the inset is the number of rounds the TF hits the antenna, , vs. . (b) , and vs. at fixed and . The red and blue solid lines represent the predictions of and from Equation 5, respectively. varying with is shown in the inset.

Left: an example of the target configuration (the antenna) generated by the worm-like chain model, where , , and the persistence length also equals .
Right: Mean total search times weakly vary with persistence length when targets are generated using the worm-like chain model. The values are represented by the black dots, with parameters set to , bp, and . For each persistence length, we obtained ten mean total search times by considering ten different target configurations. The mean total search times do not exhibit significant differences compared to that for targets arranged linearly (indicated by the horizontal orange line).

Search in a complex DNA configuration.
(a) Configuration of DNA (in red) at with a complex structure and a trajectory of TF (in black). A TF acts as a point searcher that diffuses in 3D space and becomes trapped once it hits the DNA, which is wide. It detaches at a rate of . (b) MSD at different .