Cluster size determines internal structure of transcription factories in human cells

Massimiliano Semeraro; Giuseppe Negro; Giada Forte; Antonio Suma; Giuseppe Gonnella; Peter R Cook; Davide Marenduzzo

doi:10.7554/eLife.103955.2

eLife Assessment

This is a valuable polymer model that provides insight into the origin of macromolecular mixed and demixed states within transcription clusters. The simulations are well performed and clearly presented in the context of existing experimental datasets. This compelling study will be of interest to those studying gene expression in the context of chromatin.

https://doi.org/10.7554/eLife.103955.2.sa4

Significance of findings

valuable: Findings that have theoretical or practical implications for a subfield

landmark
fundamental
important
valuable
useful

Strength of evidence

compelling: Evidence that features methods, data and analyses more rigorous than the current state-of-the-art

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

Transcription is a fundamental cellular process, and the first step of gene expression. In human cells, it depends on the binding to chromatin of various proteins, including RNA polymerases and numerous transcription factors (TFs). Observations indicate that these proteins tend to form macromolecular clusters, known as transcription factories, whose morphology and composition is still debated. While some microscopy experiments have revealed the presence of specialised factories, composed of similar TFs transcribing families of related genes, sequencing experiments suggest instead that mixed clusters may be prevalent, as a panoply of different TFs binds promiscuously the same chromatin region. The mechanisms underlying the formation of specialised or mixed factories remain elusive. With the aim of finding such mechanisms, here we develop a chromatin polymer model mimicking the chromatin binding-unbinding dynamics of different types of complexes of TFs. Surprisingly, both specialised (i.e., demixed) and mixed clusters spontaneously emerge, and which of the two types forms depends mainly on cluster size. The mechanism promoting mixing is the presence of non-specific interactions between chromatin and proteins, which become increasingly important as clusters become larger. This result, that we observe both in simple polymer models and more realistic ones for human chromosomes, reconciles the apparently contrasting experimental results obtained. Additionally, we show how the introduction of different types of TFs strongly affects the emergence of transcriptional networks, providing a pathway to investigate transcriptional changes following gene editing or naturally occurring mutations.

Introduction

The 3D organization of chromatin, the filament composed of DNA wrapped around histone proteins which constitutes the building block of mammalian chromosomes, is a dynamic and intricate blueprint that is thought to be important for cellular function and gene expression [1]. Recent advances in microscopy and highthroughput sequencing [2] have revealed a rich hierarchy of 3D chromatin structures within the cell nucleus. These range from relatively small DNA loops of tens to hundreds of base pairs (bps), to large organised domains spanning over hundreds of thousands of base pairs (or kilo-base pairs, kbp), which are referred to as topologically-associating domains (TADs), whose segments interact more frequently among each other than with other parts of the genome. At even larger scales, the genomic material divides into A (active) and B (inactive) compartments, which have different gene activity and 3D compaction, whereas different chromosomes occupy distinct territories inside the nucleus [3–5].

A central question in cellular biology is the extent to which this rich and multi-scale organization is influenced, or even driven, by transcription [6], the fundamental biological process during which the information encoded in a segment of DNA is converted into RNA, to be then translated into proteins. On the one hand, it is widely believed that TADs remain largely invariant in cells with very different transcriptional programs (e.g., belonging to different organs) – which points to little role for transcription in determining structure [7] (for an opposing view, see [8]). On the other hand, enzymes engaged in the process of transcription, known as RNA polymerases, tend to form aggregates inside the nucleus, often referred to as phase-separated condensates, hubs, or transcription factories [6, 9–12]. Being attached to a factory strongly enhances the transcriptional activity of a gene [6, 9], therefore factories are a primary example of a structural unit with a clear transcriptional role.

A recent effective way to investigate this intricate connection between transcription and 3D chromatin structure has been provided by polymer models together with Brownian dynamics simulations [13–25]. This in silico approach has pointed to a simple and generic mechanism – the bridging-induced attraction or bridging induced phase separation – that spontaneously drives formation of transcription factories [26, 27]. Such microphase separation is due to the fact that, in this type of modelling, TF and polymerase complexes (TF:pol) are usually depicted as multivalent elements, so that each of them can simultaneously bind to several chromatin sites: this is reasonable as a complex of proteins can easily have more than one chromatin-binding domain. Multivalent binding triggers a positive feedback: a TF:pol binding to the chromatin filament provokes a local increase of chromatin density as it attracts several chromatin sites. The higher chromatin density, in turn, recruits further TF:pol resulting in a cluster, or transcription factory, formation. This feedback is eventually arrested by entropic costs associated with crowding and looping more and more DNA [27, 28].

The existence of clusters prompts the question: does a typical cluster mainly contain just one kind of TF, or many different ones? On the one hand, microscopy experiments suggest that different factories specialize in transcribing different sets of genes, so that any one factory typically contains mainly one type of TF. For example, active forms of RNA polymerases II and III are each housed in distinct nucleoplasmic factories that make genic and snRNA transcripts respectively [9, 29, 30]. Similarly, distinct ERα, KLF1, and NFκB factories specialize in transcribing genes involved in the estrogen response, globin production, and inflammation [31–33]. One important consequence of the formation of such specialized factories is the creation of 3D networks [34], in which genes sharing the same TFs are co-transcribed in the same clusters.

On the other hand, and in contrast to the evidence for specialized factories, chromatin immuno-precipitation (ChIP) techniques have revealed the existence of particular chromatin fragments, called “highly-occupied targets” (or HOT), which are promiscuously bound by several different TFs [35–37]. Additionally, single-cell transcriptional profiling points to expression levels varying continuously as cells differentiate into other cell types, which points to a complex interplay between many factors, rather than a few acting as binary switches [38, 39]. Interestingly, simulations of the types described above which involve 2 different kinds of TFs, each one binding specifically to two different TU types, show the resulting clusters often contain bound TFs of just one type, rather than mixtures, although this aspect has not been investigated in depth [11, 13, 21, 23, 26, 27, 40–48].

Here, we develop a polymer model with the aim of investigating the mechanisms leading to the formation of specialised or mixed factories: i.e., clusters formed by a single type or by multiple types of TFs respectively. Within our framework, chromatin is depicted as a polymer composed of a multicolour sequence of beads, corresponding to transcription units (TUs) of different types (or colours), each one binding to the corresponding type of transcription factors, represented as additional spheres diffusing in the system. With respect to previous works on multicolour models [22, 27, 49–52], there are two important differences. First, here we model chromatin transcription by making the assumption that a chromatin bead is transcribed when it binds to a TF. In this way we are able to investigate the link between 3D structure of active chromatin and transcription (transcriptional patterns and emerging transcriptional correlation networks), rather than solely structure as done in previous models. Second, we study the morphology of the ensuing transcriptional clusters, studying their composition and the way in which it can be affected by the 1D binding landscape, and the balance between non-specific and specific chromatin-TF interactions.

Our main result is that specialised (demixed) and mixed clusters are not mutually exclusive. More specifically, we unexpectedly find a transition, or crossover, between a specialised and a mixed clusters regime, influenced by the size of emerging clusters. Smaller clusters are typically specialised, whereas larger clusters are more likely to be formed by different TF types. This result enables us to reconcile the apparently contrasting experimental observations cited above: it is no longer surprising that specialised and mixed clusters can coexist within the same cell, as cluster size depends, for instance, on the local 1D pattern of TU binding sites along chromatin.It is also important to specify that, even if an investigation of a transition between small and larger transcription factories have been carried out in mouse embryonic stem cells [53], in serum-stimulated cells [54] and in zebrafish embryos [55], our study aims to provide a broader mechanistic overview. We further integrated our multi-color model with experimental data, specifically DNase hypersensitive site (DHS) locations, to study human chromosomes. Here, two colours are considered as the simplest extension of the previous DHS model with one color [11], by distinguishing between active TF:pol complex that bind respectively to cell-type-invariant and cell-type-specific TUs in strings mimicking whole human chromosomes. Finally, the existence of specialized and mixed factories is further validated by incorporating different types of proteins into more complex and realistic chromatin models, such as the “highly predictive heteromorphic polymer model” (HiP-HoP model), which accounts for loop extrusion by cohesin-like complexes, the presence of inactive or silenced chromatin, and chromatin heteromorphism [41].

Results

Toy model with different transcription factors

Toy model, with TUs coloured randomly (the random string).
**(A)** Overview. (i) Yellow, red, and green TFs (25 of each colour) bind strongly (when in an on state) to 100 TUs beads of the same colour in a string of 3000 beads (representing 3 Mb), and weakly to blue beads. TU beads are positioned regularly and coloured randomly, as indicated in one region of the string. TFs switch between off and on states at rates and α_on = α_off/4 (τ_B Brownian time, which one can map to 0.6 6 10⁻3 s, see SI). (ii) The sequence of bars reflects the random sequence of yellow, red, and green TUs (blue beads not shown). **(B)** Snapshot of a typical conformation obtained after a simulation (TFs not shown). Inset: enlargement of boxed area. TU beads of the same colour tend to cluster and organize blue beads into loops. **(C)** Bridging-induced phase separation drives clustering and looping. Local concentrations of red, yellow, and green TUs and TFs might appear early during the simulation (blue beads not shown). Red TF 1 – which is multivalent – has bound to two red TUs and so forms a molecular bridge that stabilizes a loop; when it dissociates it is likely to re-bind to one of the nearby red TUs. As red TU 2 diffuses through the local concentration, it is also likely to be caught. Consequently, positive feedback drives growth of the red cluster (until limited by molecular crowding). Similarly, the yellow and green clusters grow as yellow TF 3 and green TF 4 are captured. **(D)** Bar heights give transcriptional activities of each TU in the string (average of 100 runs each lasting 8 10⁵τ_B). A TU bead is considered to be active whilst within 2.24σ ∼6.7 × 10⁻⁸m of a TF:pol complex of similar colour. Dashed boxes: regions giving the 3 clusters in the inset in **(B)**. **(E)** Pearson correlation matrix for the activity of all TUs in the string. TU bead number (from low to high) is reported on axes, with pixel colour giving the Pearson value for each bead pair (bar on right). Bottom: reproduction of pattern shown in (A,**ii)**. Boxes: regions giving the 3 clusters in the inset in **(B)**. shows a typical 3D conformation found in the steady state. Remarkably, clusters of TUs and TFs with distinct colours appear and disappear spontaneously. Such clustering is driven by the positive feedback illustrated in Fig. 1C; it depends critically on TFs being able to form molecular bridges that anchor loops [11, 26, 27].

We now assume that the spheres represent TF:pol complexes, and make the reasonable assumption that a TU bead is transcribed if it lies within 2.25 diameters (2.25σ) of a complex of the same colour (small changes in this threshold do not affect qualitatively the measured activity, as shown in Figure S11); then, the transcriptional activity of each TU is given by the fraction of time that the TU and a TF:pol lie close together. Fig. 1D reports the mean activity profile down the string obtained combining transcriptional data from 100 independent simulation runs each lasting 8 10⁵τ_B; TUs with the lowest activities are flanked by differently-coloured TUs, while those with the highest activities are flanked by similarlycoloured TUs (dashed rectangles in Fig. 1D). As expected, a single-colour model with the same TU placement leads to a flat activity profile (Figure S1A). Clearly, close proximity in 1D genomic space favours formation of similarly-coloured clusters.

We next examine how closely transcriptional activities of different TUs correlate [56]; the Pearson correlation matrix for all TUs is shown in Fig. 1E. This is built in the following way: for every couple i-j of TUs, we evaluate the Pearson correlation coefficient between the transcription of i and j, and then we colour the ij (and ji) pixel of the matrix accordingly. Correlations between neighbouring TUs of similar colour are often positive and strong, resulting in square red blocks along the diagonal (coloured boxes in Fig. 1E highlight the 3 clusters shown in the zoom in Fig. 1B). This effect is again due to the self-assembly of clusters containing neighbouring TUs of the same colour. In contrast, neighbours with different colours tend to compete with each other for TF:pols, and so down-regulate each other to yield smaller correlations. Correlations are more trivial in the single-color counterpart of Fig. 1, where the matrix yields only a positive-correlation band along the diagonal (Figure S1B). These results provide simple explanations of two mysterious effects – the first being why adjacent TUs throughout large domains tend to be cotranscribed so frequently [57]. The second concerns how expression quantitative trait loci (eQTLs)—genomic regions that are statistically associated with variation in gene expression levels—function. While current models often attribute their effects to post-transcriptional regulation through complex mechanisms [11, 58], we have previously argued that any transcriptional unit (TU) can act as an eQTL by directly influencing gene expression at the transcriptional level [11]. Here, we observe individual TUs up-regulating or down-regulating the activity of others TUs – hallmark behaviors of eQTLs that can scriptional clusters, studying their composition and the way in which it can be affected by the 1D binding landscape, and the balance between non-specific and specific chromatin-TF interactions.

To try to solve the apparently contrasting views of segregated and mixed factories we start by introducing a new simple polymer model, where a 3 Mbp chromatin fragment is represented by a chain of 1000 beads (each 30 nm in diameter, and corresponding to 3 kbp). Different kinds of TU beads are positioned regularly in this string, but are coloured randomly yellow, red, or green (we refer to this case as the random string). Different kinds of TFs are modelled as diffusing spheres, at first approximation of the same size of chromatin beads (see later for simulations changing the TFs size), which bind reversibly and strongly to beads of the same colour, and weakly to all others (see Fig. 1A, and Methods for further details). After running a Brownian-dynamics simulation, Fig. 1B give rise to genetic effects such as “transgressive segregation” [59]. This phenomenon refers to cases in which alleles exhibit significantly higher or lower expression of a target gene, and can be, for instance, caused by the creation of a non-parental allele with a specific combination of QTLs with opposing effects on the target gene.

Local mutations

To explore the impact of introducing different colors, we characterize the effects of local mutations. We choose the most active region in the random string – one containing a succession of yellow Tus – and “mutate” 1 4 of these TUs by recolouring them red (Fig. 2A). These simulations are inspired by editing experiments performed using CRISPR/Cas9 [60]. Typical snapshots show red mutants are often ejected from yellow clusters (Fig. 2Bi), or cluster together to leave their wild-type neighbours in isolation (Fig. 2Bii). These changes are reflected in activity profiles (Fig. 2C; arrows indicate mutations). As the number of mutations in the cluster increase, activities of yellow beads in that cluster decrease (Fig. 2D), and new red clusters often emerge (Fig. 2B,ii; Fig. 2Ciii).

Simulating effects of mutations.
Yellow TU beads 1920, 1950, 1980, 2010, 2040 and 2070 in the random string have the highest transcriptional activity. 1-4 of these beads are now mutated by recolouring them red. **(A)** The sequence of bars reflects the sequence of yellow, red, and green TUs in random strings with 1, 2 and 4 mutations (blue beads not shown). Black boxes highlight mutant locations. **(B)** Typical snapshots of conformations with **(i)** one, and **(ii)** 4 mutations. **(C)** Transcriptional-activity profiles of mutants (averages over 100 runs, each lasting 8 10⁵τ_B). Bars are coloured according to TU colour. Black boxes: activities of mutated TUs. **(D)** Activities (+/-SDs) of wild-type (yellow) and different mutants. 3 mutations: TUs 1950, 1980 and 2010 mutated from yellow to red. **(E)** Typical kymographs for **(i)** wild-type, corresponding to the same original string presented in Figure 1, and **(ii)** 4-mutant cases, in which 4 yellow TUs have been mutated to red. Each row reports the transcriptional state of a TU during one simulation. Black pixels denote inactivity, and others activity; pixels colour reflects TU colour. Blue boxes: region containing mutations. **(F)** Pearson correlation matrices for wild-type and 4-mutant cases. Black boxes: regions containing mutations (mutations also change patterns far from boxes).

To confirm that 4 mutations in a yellow cluster often lead to the development of a red cluster, we monitor cluster dynamics over time. Fig. 2Ei illustrates a typical kymograph illustrating changes in activity of all TUs in the wild-type, corresponding to the same original string presented in Fig. 1; yellow, red, and green pixels mark activity of respective TUs, and black ones inactivity. In this particular simulation, a yellow cluster in the region that will be mutated (marked by the blue rectangle) is present during the first quarter of the time window; it then disappears to reappear half-way through the window and then persists until the end. From the activity profiles in Fig. 2C, we can observe that as the number of mutations increases, the yellow cluster is replaced by a red cluster, with the remaining yellow TUs in the region being expelled (Fig. 2Bii). This behavior is reflected in the dynamics, as seen by comparing panels Ei and Eii: in the string with four mutations, transcription of the yellow TUs is inhibited in the affected region, while prominent red stripes—corresponding to active, transcribing clusters—emerge (Fig. 2Eii). Pearson correlation matrices provide complementary information: the yellow cluster in the wild-type yields a solid red block indicating strong positive correlations (Fig. 2Fi), while most of the pixels of this block become light-red or white in the string with 4 mutations (Fig. 2Fii). These results confirm that local arrangements of TUs on the genetic map determine the extent to which any particular TU will cluster and so become active.

Variations in TF concentration

The concentration of TFs is expected to influence the global activity patterns observed and can be adjusted in our model accordingly. These simulations are motivated by experiments that reduce global TF levels using auxin-induced degrons [61]. Specifically, we reduce the concentration of yellow TFs binding to the random string by 30% (Fig. 3A). As expected, transcriptional activity falls both globally and locally (see yellow dotted rectangles in Fig. 3B and C). Surprisingly, activity of a nearby cluster of red TUs (numbers 1080, 1110, 1170, 1200, and 1530 to 1650) increases by 50% (red dotted rectangles in Fig. 3B and C). This effect is specific, in the sense that there is little effect on green clusters (e.g., compare Fig. 1D with Fig. 3B). We attribute this to a now-reduced steric competition for 3D space by yellow neighbours – fewer yellow clusters are present to stunt growth of nearby red ones. Fig. 3D shows the difference in correlation between the case with reduced yellow TFs and the case displayed in Fig. 1E. We can notice a change in correlation between the yellow cluster (boxed) and its neighbour clusters, even if distant. For instance, yellow clusters are more probable to be found both turned off due to the lack of yellow TFs, and thus their activation becomes more correlated. At the same time, when yellow clusters are turned off the activation of other clusters can be affected, with a increase or decrease of correlation. Overall, these results reveal numerous statistically significant correlations in gene activity both in proximal and distal regions of the genetic map. This observation aligns with the omnigenic model, which proposes that the manifestation of a genetic trait is influenced not only by the expression of core genes directly associated with the trait, but also by peripheral genes, which can exert indirect effects through gene regulatory networks. [58].

Reducing the concentration of yellow TFs reduces the transcriptional activity of most yellow TUs while enhancing the activities of some red TUs.
**(A)** Overview. Simulations are run using the random string with the concentration of yellow TFs reduced by 30%, and activities determined (means from 100 runs each lasting 8 10⁵τ_B). **(B)** Activity profile. Dashed boxes: activities fall in the region containing the biggest cluster of yellow TUs seen with 100% TFs, as those of an adjacent red cluster increase. **(C)** Differences in activity induced by reducing the concentration of yellow TFs. This plot is obtained by subtracting the transcriptional activity of the wild-type, Figure 1D, from that of the current system in panel B. **(D)** Pearson correlation difference matrix. This plot is obtained by subtracting the Pearson correlation matrix of the wild-type, Figure 1E, from that of the current system. Boxes: regions giving the 3 clusters from Figure 1B, inset.

Effects of 1D TU patterns on transcriptional activity

To gain a deeper understanding of the local effects revealed by the random string, we now analyze and compare various toy strings that feature regular and repeating patterns of colored TUs (Fig. 4 and methods for further details). Two results are apparent. First, activities (Fig. 4Bii) in the 6-pattern case are higher overall (compare horizontal dotted lines), and more variable (compare activities of the two central TUs within each repeat with peripheral ones) relative to the 1-pattern case (Fig. 4Bi). This is consistent with positive additive effects acting centrally within each 6-pattern repeat, coupled to competitive negative effects of flanking and differently-coloured repeats at the edges. Second, the 6-pattern also has a Pearson correlation matrix (Fig. 4Cii) that is highly-structured, with a checkerboard pattern; red blocks on the diagonal indicate high positive correlations (so the 1D 6-pattern clearly favours 3D clustering). [Such a checkerboard pattern is not seen with a singlecolor model that has a correlation matrix with one red continuous diagonal when TUs are regularly spaced (Figure S1).] Additionally, blue off-diagonal blocks indicate repeating negative correlations that reflect the period of the 6-pattern. These results show how strongly TU position in 1D genomic space affect 3D clustering and activity, and that these effects depend on inclusion of more than one colour.

Clustering similar TUs in 1D genomic space increases transcriptional activity.
**(A)** Simulations involve toy strings with patterns (dashed boxes) repeated 1 or 6 times. Activity profiles plus Pearson correlation matrices are determined (100 runs, each lasting 8 10⁵τ_B). **(B)** The 6-pattern yields a higher mean transcriptional activity (arrow highlights difference between the two means). **(C)** The 6-pattern yields higher positive correlations between TUs within each pattern, and higher negative correlations between each repeat.

Emergent transcriptional correlation networks

We have seen many positive and negative correlations between activities of TUs in the random string (Fig. 1).

We now select significant correlations from Pearson correlation matrices (those which are > 0.2, Fig. 5A) to highlight emergent interaction networks [11]. In such networks, nodes represent each TU from first to last (other beads are not shown), and edges indicate positive (black) or negative (grey) correlations in activities of node pairs. Even for the toy random string, these networks prove to be very complex (Figure S2A). They are also “smallworld” (i.e., most nodes can be reached from other ones by a few steps [11, 62]). Given this complexity, we now consider simplified versions. Thus, in Fig. 5Ai, only interactions between red TUs are shown (the first red TU is at position 60, the last at position 2910, and interactions between different colours are not depicted). As expected, activities of most red TUs are positively correlated with those of nearby TUs. Conversely, negative correlations connect distant TUs, as found in the singlecolor model [11]; as we have seen, binding of red TFs to any red cluster reduces the number available to bind elsewhere.

TU transcriptional networks and demixing.
Simulations are run using the toy models indicated, and complete correlation networks (qualitatively reminiscent of gene regulatory networks) constructed from Pearson correlation matrices. respectively (above a threshold of 0.2, co(rres)ponding to a p-value ∼ 5 10⁻²). The complete network consists of n = 100 **(A)** Simplified network given by the random string. TUs from first (bead 30) to last (bead 3000) are shown as peripheral nodes (coloured according to TU); black and dashed grey edges denote statistically-significant positive and negative correlations, individual TUs, so that there are pairs of TUs couples; we find 990 black and 742 gray edges. Since p-value n_c = 223, most interactions (edges) are statistically significant. Networks shown here only correlations (i) between red TUs, and (ii) between red and green TUs. (ii) **(B)** Average Pearson correlation (shading shows +/-SD, and is usually less than line/spot thickness) as a function of genomic separation for the (i) random, (ii) 6-, and (iii) 1-pattern cases. Correlation values at fixed genomic distance are taken from super-/sub-diagonals of Pearson matrices. Red dots give mean correlation between TUs of the same color (3 possible combinations), and blue dots those between TUs of different colors (4 possible combinations). Cartoons depict contents of typical clusters to give a pictorial representation of mixing degree (as this determines correlation patterns); see SI for exact values of θ_dem.

In Fig. 5Aii, we consider just interactions between red TUs and green TUs. Remarkably, close-range positive correlations (black edges) are still seen between TU pairs that no longer bind TUs of the same colour. We suggest this is due to the presence of weakly-binding beads. Specifically, a red cluster organises a surrounding cloud of weakly-binding beads, and these will bind some green TFs that – in turn – bind green TUs. In contrast to the same-colour network in Figure 5Ai, there are now more long-range positive correlations, showing that the presence of multiple colors enriches the emerging network.

To obtain further quantitative insight into these subtle yet remarkable correlations, we compute the average of those between same- and different-colour TUs as a function of genomic separation (Fig. 5B). For the random string, same-colour correlations switch from clearly positive to slightly negative at about 300 beads (Fig. 5Bi, red curve). Differently-coloured correlations yield a broadlysimilar switch, although positive and negative values are weaker (Fig. 5Bi, blue curve). The 6-pattern gives qualitatively similar trends, with the magnitude of differentlycoloured correlations dampened further (Fig. 5Bii). In contrast, the 1-pattern string yields largely overlapping curves (Fig. 5Biii). These results illustrate how the sequence of TUs on a string can strikingly affect formation of mixed clusters; they also provide an explanation of why activities of human TUs within genomic regions of hundreds of kbp are positively correlated [63].

To quantify the extent to which TFs of different colours share clusters, we introduce a demixing coefficient, θ_dem (see Methods for definition), which can vary between 0 and 1. If θ_dem = 1, a cluster contains only TFs of one colour (and so is fully demixed); if θ_dem = 0, it contains both red and green TFs in equal numbers (and so is fully mixed). Intuitively, one might expect θ_dem to fall as the number of adjacent TUs of similar colour in a string fall; this is what is seen with the 6- and 1-patterns – strings with the most and least numbers of adjacent TUs of similar colour, respectively (Figure S2B; shown schematically by the cluster cartoons in Fig. 5B).

Our simulations then show that in cases where same- and different-colour Pearson correlations trends overlap (as in the 1-pattern string, see Fig. 5Biii), clusters are more mixed (have a larger value of θ_dem). Instead, in cases where same- and different-color correlations trends do not overlap, or are more different (as in the 6-pattern string, see Fig. 5Bii), then clusters are typically unmixed, and so have a larger value of θ_dem (Figure S2B). These results on activity correlation and TF cluster composition suggest that, if eQTLs act transcriptionally as expected [11], down-regulating eQTLs are likely to be located further from their target genes than up-regulating ones. In addition, it is important to note that mixing is promoted by the presence of weakly binding beads; replacing these with non-interacting ones leads to increased demixing and a reduction in long-range negative correlations (Figure S3). More generally, our findings indicate that the presence of multiple TF colors offers an effective mechanism to enrich and fine-tune transcriptional regulation.

Transcriptional activity and comparison with real human chromosomes

We now simulate human chromosome 14 (HSA 14) in HUVECs, with individual beads in the string coloured appropriately (Fig. 6A). Thus, TUs transcribed uniquely in HUVECs are coloured red, housekeeping TUs (i.e., ones also expressed in a stem cell, namely H1-hESCs) are green, euchromatic regions blue, and heterochromatic ones grey. Fig. 6B shows a typical snapshot; red and green clusters again form spontaneously. We next determine transcriptional activities, rank them in order from high to low, and compare binned rank orders with those obtained experimentally by GRO-seq (Fig. 6C); most counts lie along the diagonal, meaning there is a good agreement between the two data sets. More quantitatively, Spearman’s rank correlation coefficient is 3.66 10⁻¹, which compares with 3.24 10⁻¹ obtained previously using a single-colour model [11]. In both cases the estimated uncertainty is of order 10⁻³ (mean and SD obtained using the bootstrap technique over 100 trials) and the p-value is < 10⁻⁶ (2-sided t-test), showing statistical significance.

Comparison of transcriptional activities of TUs on different human chromosomes determined from simulations and GRO-seq.
**(A)** Overview of panels (A-C). The 35784 beads on a string representing HSA14 in HUVECs are of 4 types: TUs active only in HUVECs (red), “house-keeping” TUs – ones active in both HUVECs and ESCs (green), “euchromatic” ones (blue), and “heterochromatic” ones (grey). Red and green TFs bind to TUs of the same colour with strong (specific) interactions, while they experience a weak (non-specific) attraction to euchromatin. Interactions between both red and green TFs and heterochromatin are purely repulsive. **(B)** Snapshot of a typical conformation, showing both specialized and mixed clusters. **(C)** TU activities seen in simulations and GRO-seq are ranked from high to low, binned into quintiles, and activities compared. **(D)** Spearman’s rank correlation coefficients for the comparison between activity data obtained from analogous simulations and GRO-seq for the chromosomes and cell types indicated.

Activity predictions are also improved compared to the one-colour model with HSA 18 and HSA 19 in HU-VECs, plus HSA 14 in GM12878 (Figure 6D and Figure S4). However, Spearman’s rank coefficient for gene-poor HSA 18 is about twice that for gene-rich HSA 19; this may be due to additional regulatory layers in regions with high promoter density. These results show that our multicolour polymer model generates strings that can mimic structures and functions found in whole chromosomes. Additionally, simulated contact maps show a fair agreement with Hi-C data (Figure S5), with a Pearson correlation r ∼0.7 (p-value < 10⁻⁶, 2-sided t-test). However, because this 2-color model does not include heterochromatin-binding proteins and cohesin-mediated active loop-extrusion, as in the Hip-Hop model later discussed, we should not expect a very accurate reproduction of Hi-C maps.

Specialized and mixed clusters

Inspection of simulation snapshots shows 1-colour clusters tend to be smaller than mixed (2-colour) ones (Fig. 7A). To quantify this, we count numbers and types of TFs in individual clusters (Figures 7B and S7). Clusters with just two bound TFs never contain both colours; conversely, those with > 20 bound TFs never contain just one colour (Fig. 7B). We also measure the average value of the demixing coefficient, θ_dem (Materials and Methods). If θ_dem = 1 (θ_dem = 0), this means that a cluster contains only TFs of one colour (a mixture of TFs) and so is fully demixed (maximally mixed). The result is shown in Fig. 7C, and shows the emergence of a crossover between a demixed regime, corresponding to single-colour clusters, and a mixed regime, corresponding to multiplecolour clusters, which is triggered by an increase in cluster size. [Note that we speak of a crossover between regimes, rather than a phase transition, as clusters are finite-size and we do not consider any thermodynamic limit.] The cross-over point between fully mixed and demixed (where the average value of θ_dem = 0.5) occurs when there are ∼10 TFs per cluster (Fig. 7C): notably, this is similar to the average number of productivelytranscribing pols seen experimentally in a transcription factory [6]. Similar results are obtained for different cell types, or chromosomes (see Figs. S6 and S7 for the case of HSA 18, 19 in HUVEC, and HSA 14 in GM12878), and chromosomes under confinement (Fig. S10), with realistic chromatin densities. The latter situation suggests that, as far as the formation of transcription factories and the crossover between mixed and demixed clusters are concerned, chromatin density does not play a crucial role. Other phenomena can indeed depend on density, especially with respect to global chromatin structure (e.g., entanglements and rare long-range contacts). Additionally, simulations of HSA 14 in HUVEC cells with different size of TF:pols (0.5σ and 0.16σ) also lead to similar results (see Fig. S9). Importantly, the critical number of TFs per cluster separating demixed and mixed cluster is around ∼10 in all these different cases. These results suggest that neither the sequence of TUs and its ratio to TFs (which varies among chromosomes, as for instance HSA 18 and HSA 19 are gene poor and gene rich respectively), nor the chromatin density affect the nature of the crossover between the regimes of demixed and mixed clusters.

Small clusters tend to be unmixed, large ones mixed.
After running one simulation for HSA 14 in HUVECs, clusters are identified. **(A)** Snapshot of a typical final conformation (TUs, non-binding beads, and TFs in off state not shown). Insets: a large mixed cluster and a small demixed one. **(B)** Example clusters with different numbers of TFs/cluster (2, 10, 20, 30, 40) chosen to represent the range seen from all-red to all-green (with 3 intervening bins). Black numbers: observed number of clusters of that type seen in the simulation. **(C)** Average of the demixing coefficient θ_dem (error bars: SD), showing a crossover between demixed and mixed clusters with increasing cluster size. Values of 1 and 0 are completely demixed and completely mixed respectively. Grey area: demixed regime where θ_dem is > 0.5.

The existence of a crossover between specialized (demixed) and mixed factories with increasing size is therefore a generic feature of our model, and it can be explained by the following physical argument depending on non-specific binding. Two red TFs in an unmixed cluster might stabilise 3 loops, and so bring into close proximity only a few non-specific binding sites that could bind a green TF. In contrast, 10 red TFs in a cluster will stabilise many loops that inevitably bring into close proximity many non-specific binding sites – and this makes it highly likely that some green TFs will also bind nearby to create a mixed cluster. The mixing crossover provides a way to reconcile observations that some clusters are unmixed (like factories rich in polymerases II and III), and others highly mixed (like HOTs). This is because clusters in a single cell are generally polydisperse, or differ in size (e.g., due to the local chromatin environment, or the patterning of TUs along the genome), hence mixed and specialised factories can coexist in the same nucleus. Note that cluster size is a key parameter because it strongly affects the balance between non-specific and specific chromatin-protein interaction.

Finally, as for the toy model, the balance between mixing and demixing determines correlation patterns. For example, activity patterns of same- and differently-colored TUs in the whole chromosome (Figure S8) are much like those in the 1-pattern model (Fig. 5Biii). We attribute this to ∼78% TFs being in mixed clusters (θ_dem < 0.5), and so inevitably the resulting interactions will dominate the pattern seen.

Our model is already inherently out of thermodynamic equilibrium, as it includes non-equilibrium switching between binding and non-binding states for chromatin-binding proteins, resembling ATP-dependent post-translational modification of such proteins[64]. There are though important principles of chromatin organisation which the presented model does not consider. First, an important remaining question is whether other active (ATP-consuming) processes naturally present in the nucleus, such as loop extrusion [65], affect the results we found. Second, following the same logic behind the multicolor polymer model presented here, it is interesting to ask whether the presence of additional types of inactive, as well as active, TFs and chromatin beads changes the picture.

To answer these questions, we have turned to a more complex framework, and to the HiP-HoP model, which includes loop extrusion by cohesin-like complexes and chromatin heteromorphism [41, 66], as well as accounting for inactive, as well as active, chromatin and TFs. Specifically, we performed simulations for HSA14 in HUVEC using a multicolor version of the HiP-HoP model (see SI for more details). A typical configuration is shown in Figure 8A, where grey regions represent locally compact regions (which are poor in H3K27ac), while cyan regions represent disrupted regions (which are enriched in decompacted chromatin and in H3K27ac). In addition, H3K27me3 and H3K9me3 data were used to determine the chromatin binding sites for polycomb-like and heterochromatin-associated proteins (such as HP1): these are represented in yellow and blue respectively. As in the previous DHS model, TUs only present in HU-VEC are represented in red, while the house-keeping ones in green. Inspection of simulation snapshots shows the presence of small clusters that are demixed (Fig 8C) and large cluster that are mixed (Fig. 8B). We also measure the average value of the demixing coefficient, θ_dem (Fig. 8 D). As in the simpler DHS model, the crossover point between fully mixed and demixed (where the average value of θ_dem = 0.5) occurs when there are ∼10 TFs per cluster. These simulations further confirm the robustness and generality of our results regarding the mixing-demixing crossover between specilized and mixed transcription factories.

HiP-HoP model simulations: small clusters tend to be unmixed, large ones mixed.
(A) Snapshot of a configuration adopted by HSA14 in HUVECs, within the HiP-HoP model. Grey regions represent less accessible chromatin regions poor in H3K27ac, while cyan regions represent those enriched in H3K27ac. In addition, H3K27me3 and H3K9me3 peaks determine the chromatin binding sites for polycomb-like and heterochromatin proteins, and are represented in yellow and blue respectively. As in the previous DHS multicolour model, TUs only present in HUVEC are represented in red, while the house-keeping ones in green. (B-C) Example of clusters of proteins: large mixed cluster (B) and a small demixed one (C). (D)Average of the demixing coefficient θ_dem (error bars: SD). Values of 1 and 0 correspond to completely demixed and completely mixed clusters respectively. Grey area: demixed regime where θ_dem is > 0.5.

Discussion and conclusions

In summary, in this paper we have used coarse-grained simulations to study the 3D structure of human chromatin, its transcriptional dynamics, and their mutual relationship. Unlike previous works [11, 27], here we adopt a multicolour model, viz a polymer model in which chromatin interacts with different types (colours) of complexes between polymerases and chromatin-binding transcription factors (TF:pols). This accounts for the important biological fact that most eukaryotic cells show different kinds of RNA polymerases and a variety of chromatin-binding proteins, with different biological scopes. Our model yields a number of experimentally relevant results.

First, we characterise the morphology of transcription factories (or clusters), arising in our model through the bridging-induced attraction [26]. When these clusters are small, they typically contain TFs of just one colour; these are reminiscent of the specialized transcription factories found in the nucleoplasm that contain active forms of just pol II or pol III – but not both [67]. Instead, when factories are large, they are typically mixed (Fig. 7C); this provides a mechanistic basis for the formation of HOTs, where many different TFs bind promiscuously and weakly to segments of open chromatin that are often devoid of high-affinity specific binding sites [35–37]. The existence of a transition (more precisely, a crossover) between demixed and mixed clusters dependent on cluster size is robust to changes in TF:pol size (Fig. S9), chromatin density (Fig. S10) and the inclusion of active process such as loop extrusion, incorporated through the HiP-HoP model (Fig. 8). The latter simulations also show that the existence of a demixing transition (and the critical size threshold) is not affected by other structurally important ingredients in the model such as the presence of silence or inactive chromatin, and chromatin heteromorphism [41]. This confirms that the mechanism behind the transition is the shift in balance between non-specific and specific chromatin-protein interactions: the former becomes more dominant as cluster size increases. Interestingly, the mechanisms that determine whether a gene belongs to a specialised or mixed factory remain unclear [68]. However, our results suggest that the TF cluster size, along with the 1D TU patterning along the chromatin filament, plays a crucial role, as it links 3D chromatin structure, transcription factory morphology, and gene expression. Specialised and mixed factories thus emerge naturally from TUs arrangement, without the need for additional ingredients, such as posttranscriptional biochemical regulation.

We propose that the predicted demixing–mixing crossover may be experimentally testable in the future through techniques capable of detecting multi-way chromatin interactions, such as SPRITE and GAM [2]. At present, however, these methods have primarily enabled the detection of three-way chromatin contacts [69–72], and statistical data on higher-order interactions remain limited or unavailable [73]. Nevertheless, it is reasonable to expect that ongoing advancements in these technologies will yield increasingly comprehensive datasets, potentially allowing for direct experimental validation of our predictions.

Second, we see remarkable positive and negative correlations in the transcriptional activities of different TUs. For example, activities of same-colour and nearby TUs tend to be strongly positively correlated, as such TUs tend to co-cluster (Fig. 5). Conversely, activities of similar TUs lying far from each other on the genetic map are often weakly negatively correlated, as the formation of one cluster sequesters some TFs to reduce the number available to bind elsewhere (Figure S12).

Taken together, these results provide simple explanations of why adjacent TUs throughout large domains tend to be co-transcribed so frequently [63], as they are likely to gather together in the same cluster. Results also show how one eQTL might up-regulate some TUs and down-regulate others, that can lead to genetic effects like “transgressive segregation” [59].

Third, we can predict effects of local mutations and genome edits that often induce distant omnigenic effects uncovered by genome-wide association studies [11, 58]. For example, mutations that switch a binding site of one TF to another can convert a cluster of one colour into another (Fig. 2). Similarly, global effects of knocking down TF levels are easily assessed (Fig. 3).

Fourth, we also predict transcriptional activities of all TUs (both genic and non-genic) on whole human chromosomes by including cell-type-invariant and cell-type-specific TFs (Fig. 6). We find this yields a better correlation with GRO-seq experimental data than a singlecolour model (where just one TF binds to all TUs similarly). This result underscores the importance of including different TFs in polymer models.

Finally, all our results point to the importance of the 1D pattern of TUs and TF-binding sites on chromosomes in determining activity. In other words, 1D location is a key feature determining transcriptional patterns, and so cell identity. We speculate this is why relative locations of active regulatory elements are so highly conserved. For instance, despite human enhancers evolving much more rapidly than their target protein-coding genes, the synteny between the two (over distances up to 2 Mbp) is highly conserved [74, 75].

In the future, it would be interesting to incorporate the effects of transcription elongation into our model. Although the current implementations of both the simplified toy model and the more detailed HiP-HoP model show good agreement with GRO-seq data, further investigation is needed to assess how molecular motors —such as RNA polymerases—and interactions between RNA molecules and nuclear proteins, including SAF-A [76, 77], contribute to the three-dimensional organization of chromatin. Additionally, it would be valuable to consider the effects of hydrodynamic interactions [78–80]. In our model, as in most chromatin polymer models in the literature [66], hydrodynamic interactions and the resulting spatiotemporal correlations are neglected. Whilst this choice provides significant computational advantages, it also represents a limitation. Although passive hydrodynamic flow associated with polymer motion is likely screened inside the nucleus, the dipolar forces exerted by molecular motors may be strong enough to induce ordering in the intranuclear polymer melt [81]. In fact, recent experiments using Displacement Correlation Spectroscopy, used to map chromatin movements throughout the nucleus in live cells, revealed that chromatin exhibits rapid, uncorrelated motions at short timescales and slower, correlated motions over ∼µm domains at longer timescales [81]. While the typical sizes of the emerging clusters we observe is about an order of magnitude smaller than that of these domains, incorporating hydrodynamics would help elucidate the effect of coherent chromatin dynamics on cluster formation and coarsening.

In addition, it would be of interest to extend the results presented here to incorporate many more types of TFs and TUs within the framework of HiP-HoP [41], and to include dynamic epigenetic modifications [24, 82, 83]. From a theoretical point of view, we hope our results will stimulate the development of theories to understand the mixing-demixing crossover more fundamentally from a polymer physics view-point, as well as more work on the interrelations between 3D structure and function in chromosomes.

Methods

Chromatin fibres are modelled as bead-and-spring polymers [11, 13, 21, 23, 26, 27, 40–47], where each monomer (diameter σ) represents 3 kbp packed into a 30 nm sphere [11, 26, 45]. Different TFs (or TF:pol complexes) are modelled as differently-coloured spheres (also with diameter σ) able to bind (when in an “on” state) to cognate sites of the same colour that are scattered along the polymer. TF:pols complexes and TUs have the same size, since the former represents both transcription factors and polymerases, which in human cells are about 5 nm and 25 nm respectively. We also simulated the case in which TF:pol size is smaller (0.5σ and 0.16σ, Fig. S9) to explore the potential effect of protein size.

Each TF and TF:pol switches between “off” (nonbinding) and “on” (binding) states to reflect the posttranslational modifications that occur in many TFs. Polymer beads are either non-binding (“heterochromatic”), weakly-binding (“euchromatic”), or stronglybinding (containing cognate sites). TFs bind nonspecifically to all weakly-binding beads, and strongly only to TUs of the same colour. TUs in our model represents regulatory elements such as promoters and enhancers, and as discussed below in practice can be identified with DNase hypersensitive regions, which are very sticky for a wide range of TFs, or active protein complexes [84].

The system evolves in a cubic simulation domain with periodic boundary conditions through constanttemperature Langevin dynamics that are integrated numerically by the LAMMPS simulation package [85].

In our model, as in most chromatin polymer models in the literature [1], hydrodynamic interactions are neglected. While this choice offers significant computational advantages, it also presents a limitation. Although passive hydrodynamic flow associated with polymer motion is likely screened inside the nucleus, dipolar forces exerted by molecular motors may still be strong enough to induce ordering in the intranuclear polymer melt, as discussed in [81](see also the Discussion section for additional comments on the possible effects of hydrodynamics). Averages are evaluated over 100 independent runs for each case. The TF volume fraction of each colour is set to ∼3 10⁻⁵, and the polymer volume fraction to ∼2 10⁻³. We note though that the key control parameter is the ratio between the number of TFs and that of TUs, for each colour. More information about the model can be found in the Supplemental Information (SI). Several quantities are monitored to describe the system’s behavior. Mean transcriptional activity is measured as the fraction of time that a TU is “transcriptionally active”, i.e. within a fixed distance of a TF, in 100 simulations, and so represents a population average (each simulation run may be thought of as a different cell). In order to be quite conservative about the evaluation of the transcriptional activity for TUs in clusters, we adopt 2.25σ as transcriptional threshold. Our point of view is in fact to consider transcriptionally active all TUs in clusters, even when TFs are a bit further. However, we also checked that lower distance thresholds do not affect evaluation of transcriptional profiles in any significant way (see Fig.S11 in SI). Transcriptional activity is then compared with experimental data on transcriptional activity, obtained via GRO-seq – a method providing a genome-wide average readout of ongoing transcription of both genic and non-genic TUs in cell populations [86, 87]. The mean transcriptional Pearson correlation between all pairs of TUs is also evaluated, and a graphical overview of this feature is provided via the Pearson correlation matrix. We also analyse clusters/factories of bound and spatially-proximate TFs, count the number of TFs of similar colour in each cluster, and introduce a demixing coefficient

where n is the number of colors, i is an index denoting each of the colors present in the model and x_i,max the largest fraction of TFs of the same i-th color in a single TF cluster. If θ_dem = 1, this means that a cluster contains only TFs of one colour and so is fully demixed; if θ_dem = 0, the cluster contains a mixture of TFs of all colors in equal number, and so is maximally demixed. More details can be found in the SI.

We consider two different types of string, one with M = 3000 beads (or 9 Mbp) which is referred to as a “toy” string, and a second representing a whole human chromosome. Chromosomes are initialised in both cases as random walks. An alternative possibility would be to start from mitotic configuration as in [88], which would remove entanglement in the initial condition. Experience with similar models (e.g., see [50]) suggests that a different initial condition will be important for the very large-scale structure but not for the scale at which transcriptional clusters form, which is the one we are most interested in here.

Toy model

The toy model is built by placing one yellow, red, or green TU every 30 weakly-binding beads, giving a total of 100 TUs of all types in a string of 3000 beads [26]. Various different sequences of TU colour down the string are considered. In one – the “random” string – TU colours are chosen randomly (see Fig. 1a and SI for the specific sequence generated). In a second and third – the “1-pattern” and “6-pattern” strings – TU colors follow a repeating pattern (red, then yellow, then green) 1 or 6 times (see Fig. 4). We made these choices for the sequences of TUs as they are useful to show how 1D patterns affect resulting cluster morphology. In this respect, these patterns in the toy model are only representative. At the same time, the ratio between TFs and TUs are close to those used below for human chromosome simulations.

For the random string, we monitor how the system responds to different perturbations. Local “mutations” are inspired by editing experiments performed using CRISPR/Cas9 [60]. One to four mutations are mimicked by switching selected yellow beads inside a cluster of consecutive yellow TUs (between TUs 1920 to 2070) to red ones (Fig. 2). Thus, conversion of TU bead 1980 gives a string with 1 mutation, of 1950 and 1980 gives 2 mutations, of 1950 to 2010 gives 3 mutations, and 1950 to 2040 gives 4 mutations. Global perturbations are inspired by experiments reducing global levels of TFs using auxin-induced degrons [61]. Here, we study the effects of reducing the concentration of yellow TFs by 30%.

Human chromosomes

Our reference case for whole human chromosome simulations in the main text is the mid-sized human chromosome HSA 14 (107 Mbp), coarse-grained into M = 35784 beads. For Fig. 6, weakly- and strongly-binding beads are identified (using ENCODE data [84] for human umbilical vein endothelial cells, HUVECs) by the presence of H3K27ac modifications and DNase-hypersensitivity sites (DHSs) in the 3 kbp region corresponding to that bead – as these are good markers of open chromatin and active TUs (both genic and non-genic), respectively. For Fig. 6, TUs are split into ones only active in HUVECs and others (“house-keeping” ones) that are also active in H1-hESC cells (again using DHS sites and ENCODE data). Then, if a TU appears in both HUVECs and H1-hESCs, it is marked as housekeeping and coloured red; if it appears only in HUVECs it is marked as HUVEC-specific and coloured green. This allows an intuitive and simple multicolour model of HUVECs to be constructed. All remaining beads (which are not either weakly-binding or TUs) are non-binding. This approach represents a generalisation of the DHS model described in [11], so we call it the multicolour DHS model. For the simulations shown in the main text TF:pols complexes and TU size is the same (σ, corresponding to 30 nm at our resolution). This is justified by the fact that our TF:pol represents both transcription factors and polymerases. A polymerase is about 25 nm in human cells [30], while transcription factors are typically at least 5nm in size. We also considered the case in which TF:pol size is smaller (0.5σ and 0.16σ, Fig. S9) to explore the potential effect of protein size: as we shall see, this does not qualitatively affect our conclusions and results.

We also consider HSA 18 (80 Mbp, 26026 beads) and 19 (58 Mbp, 19710 beads) in HUVECs, chosen as they represent gene-poor and gene-rich chromosomes, respectively. Additionally, we consider HSA 14 in the B-lymphocyte line GM12878 (again, colours are chosen by combining DHS data for GM12878 and H1-hESCs). H3K27ac and DHS data is again from ENCODE.

The multicolor DHS model was also applied within a more realistic chromatin framework, the “highly predictive heteromorphic polymer model”, or HiP-HoP model [17]. This is a much more sophisticated model which takes into account: (i) loop extrusion; (ii) inactive (as well as active) chromatin folding; (iii) chromatin heteromorphicity (different local compaction of chromatin according to acetylation). More details on the HiP-HoP model are given in the SI.

For human chromosomes, transcriptional-activity data obtained from simulations and GRO-seq are compared in two ways [11]. First, we rank activities of each TU, and build a two-dimensional histogram providing an overview of the agreement between the two sets of ranks. Second, we quantify Spearman’s rank correlation coefficient between numerical and experimental data (SI for more details).

Data availability

All experimental data used in this paper are available in the ENCODE database [84]. All custom scripts used for the simulations presented here are available in the Zenodo database 10.5281/zenodo.17408464.

Acknowledgements

M.S. G.N. and G.F. contributed equally to this work. The work has been performed within the HPC-EUROPA3 Project (INFRAIA-2016-1-730897), with the support of the EC Research Innovation Action under the H2020 Programme. We acknowledge funding from MIUR Project No. PRIN 2020/PFCXPE, and from the Well-come Trust (223097/Z/21/Z). G.F. acknowledges support from the Leverhulme Trust (Early Career Fellowship ECF-2024-221).

Additional files

Supplemental Material

References

[1]
1. Chiang Michael
2. Brackley Chris A
3. Marenduzzo Davide
4. Gilbert Nick
2022Predicting genome organisation and function with mechanistic modellingTrends in Genetics 38:364–378Google Scholar
[2]
1. Kempfer Rieke
2. Pombo Ana
2020Methods for mapping 3d chromosome architectureNat. Rev. Genet 21:207–226Google Scholar
[3]
1. Pombo Ana
2. Dillon Niall
2015Three-dimensional genome architecture: players and mechanismsNat. Rev. Mol. Cell Biol 16:245–257Google Scholar
[4]
1. Lieberman-Aiden Erez
2. van Berkum Nynke L.
3. Williams Louise
4. Imakaev Maxim
5. Ragoczy Tobias
6. Telling Agnes
7. Amit Ido
8. Lajoie Bryan R.
9. Sabo Peter J.
10. Dorschner Michael O.
11. Sandstrom Richard
12. Bernstein Bradley
13. Bender M. A.
14. Groudine Mark
15. Gnirke Andreas
16. Stamatoyannopoulos John
17. Mirny Leonid A.
18. Lander Eric S.
19. Dekker Job
2009Comprehensive mapping of long-range interactions reveals folding principles of the human genomeScience 326:289–293Google Scholar
[5]
1. Dixon Jesse R.
2. Selvaraj Siddarth
3. Yue Feng
4. Kim Audrey
5. Li Yan
6. Shen Yin
7. Hu Ming
8. Liu Jun S.
9. Ren Bing
2012Topological domains in mammalian genomes identified by analysis of chromatin interactionsNature 485:376–380Google Scholar
[6]
1. Cook P. R.
2. Marenduzzo D.
2018Transcription-driven genome organization: a model for chromosome structure and the regulation of gene expression tested through simulationsNucleic Acids Res 46:9895–9906Google Scholar
[7]
1. Dixon Jesse R
2. Gorkin David U
3. Ren Bing
2016Chromatin domains: the unit of chromosome organizationMol. Cell 62:668–680Google Scholar
[8]
1. Rowley M Jordan
2. Nichols Michael H
3. Lyu Xiaowen
4. Ando-Kuri Masami
5. Sarahi I
6. Rivera M
7. Hermetz Karen
8. Wang Ping
9. Ruan Yijun
10. Corces Victor G
2017Evolutionarily conserved principles predict 3d chromatin organizationMol. Cell 67:837–852Google Scholar
[9]
1. Papantonis Argyris
2. Cook Peter R
2013Transcription factories: genome organization and gene regulationChemical Reviews 113:8683–8705Google Scholar
[10]
1. Cramer Patrick
2019Organization and regulation of gene transcriptionNature 573:45–54Google Scholar
[11]
1. Brackley CA
2. Gilbert Nick
3. Michieletto Davide
4. Papantonis Argyris
5. Pereira MCF
6. Cook PR
7. Marenduzzo Davide
2021Complex small-world regulatory networks emerge from the 3d organisation of the human genomeNat. Commun 12:1–14Google Scholar
[12]
1. Chiang Michael
2. Brackley Chris A
3. Naughton Catherine
4. Nozawa Ryu-Suke
5. Battaglia Cleis
6. Marenduzzo Davide
7. Gilbert Nick
2022Gene structure heterogeneity drives transcription noise within human chromosomesbioRxiv Google Scholar
[13]
1. Bianco Simona
2. Chiariello Andrea M
3. Annunziatella Carlo
4. Esposito Andrea
5. Nicodemi Mario
2017Predicting chromatin architecture from models of polymer physicsChromosome Res 25:25–34Google Scholar
[14]
1. Laghmach R.
2. Pierro M.
3. Potoyan D.
2022A liquid state perspective on dynamics of chromatin compartmentsFrontiers in Molecular Biosciences 8https://doi.org/10.3389/fmolb.2021.781981 Google Scholar
[15]
1. Di Pierro M.
2. Potoyan D. A.
3. Wolynes P. G.
4. Onuchic J. N.
2018Anomalous diffusion, spatial coherence, and viscoelasticity from the energy landscape of human chromosomesProceedings of the National Academy of Sciences 115:7753–7758Google Scholar
[16]
1. Barbieri Mariano
2. Chotalia Mita
3. Fraser James
4. Lavitas Liron-Mark
5. Dostie Josée
6. Pombo Ana
7. Nicodemi Mario
2012Complexity of chromatin folding is captured by the strings and binders switch modelProceedings of the National Academy of Sciences 109:16173–16178Google Scholar
[17]
1. Buckle Adam
2. Brackley Chris A.
3. Boyle Shelagh
4. Marenduzzo Davide
5. Gilbert Nick
2018Polymer simulations of heteromorphic chromatin predict the 3d folding of complex genomic lociMolecular Cell 72:786–797Google Scholar
[18]
1. Jost Daniel
2. Vaillant Cédric
2018Epigenomics in 3D: importance of long-range spreading and specific interactions in epigenomic maintenanceNucleic Acids Research 46:2252–2264Google Scholar
[19]
1. Ghosh Surya K
2. Jost Daniel
2019Genome organization via loop extrusion, insights from polymer physics modelsBriefings in Functional Genomics 19:119–127Google Scholar
[20]
1. Lin Xingcheng
2. Qi Yifeng
3. Latham Andrew P.
4. Zhang Bin
2021Multiscale modeling of genome organization with maximum entropy optimizationThe Journal of Chemical Physics 155https://doi.org/10.1063/5.0044150 Google Scholar
[21]
1. Natesan Ramakrishnan
2. Gowrishankar Kripa
3. Kuttippurathu Lakshmi
4. Kumar PB Sunil
5. Rao Madan
2021Active remodeling of chromatin and implications for in vivo foldingJ. Phys. Chem. B 126:100–109Google Scholar
[22]
1. Chiariello Andrea M
2. Annunziatella Carlo
3. Bianco Simona
4. Esposito Andrea
5. Nicodemi Mario
2016Polymer physics of chromosome large-scale 3d organisationSci. Rep 6:29775Google Scholar
[23]
1. Giorgetti Luca
2. Galupa Rafael
3. Nora Elphége P
4. Piolot Tristan
5. Lam France
6. Dekker Job
7. Tiana Guido
8. Heard Edith
2014Predictive polymer modeling reveals coupled fluctuations in chromosome conformation and transcriptionCell 157:950–963Google Scholar
[24]
1. Michieletto Davide
2. Orlandini Enzo
3. Marenduzzo Davide
2016Polymer model with epigenetic recoloring reveals a pathway for the de novo establishment and 3d organization of chromatin domainsPhys. Rev. X 6:041047Google Scholar
[25]
1. Di Pierro Michele
2. Cheng Ryan R.
3. Aiden Erez Lieberman
4. Wolynes Peter G.
5. Onuchic N.
2017De novo prediction of human chromosome structures: Epigenetic marking patterns encode genome architectureProceedings of the National Academy of Sciences 114:12126–12131Google Scholar
[26]
1. Brackley Chris A.
2. Taylor Stephen
3. Papantonis Argyris
4. Cook Peter R.
5. Marenduzzo Davide
2013Nonspecific bridging-induced attraction drives clustering of dnabinding proteins and genome organizationProc. Natl. Acad. Sci. USA 110:E3605–E3611Google Scholar
[27]
1. Brackley Chris A.
2. Johnson James
3. Kelly Steven
4. Cook Peter R.
5. Marenduzzo Davide
2016Simulated binding of transcription factors to active and inactive regions folds human chromosomes into loops, rosettes and topological domainsNucleic Acids Res 44:3503–3512Google Scholar
[28]
1. Marenduzzo D
2. Orlandini E
2009Topological and entropic repulsion in biopolymersJstat 2009:L09002Google Scholar
[29]
1. Albert Frank W
2. Kruglyak Leonid
2015The role of regulatory variation in complex traits and diseaseNat. Rev. Genet 16:197–212Google Scholar
[30]
1. Cook P. R.
2001Principles of nuclear structure and functionNew York: Wiley Google Scholar
[31]
1. Fullwood Melissa J
2. Liu Mei Hui
3. Pan You Fu
4. Liu Jun
5. Xu Han
6. Bin Mohamed Yusoff
7. Orlov Yuriy L
8. Velkov Stoyan
9. Ho Andrea
10. Mei Poh Huay
11. et al.
2009An oestrogenreceptor-α-bound human chromatin interactomeNature 462:58–64Google Scholar
[32]
1. Schoenfelder Stefan
2. Sexton Tom
3. Chakalova Lyubomira
4. Cope Nathan F
5. Horton Alice
6. Andrews Simon
7. Kurukuti Sreenivasulu
8. Mitchell Jennifer A
9. Umlauf David
10. Dimitrova Daniela S
11. et al.
2010Preferential associations between co-regulated genes reveal a transcriptional interactome in erythroid cellsNat. Genet 42:53–61Google Scholar
[33]
1. Papantonis Argyris
2. Kohro Takahide
3. Baboo Sabyasachi
4. Larkin Joshua D
5. Deng Binwei
6. Short Patrick
7. Tsutsumi Shuichi
8. Taylor Stephen
9. Kanki Yasuharu
10. Kobayashi Mika
11. et al.
2012Tnfα signals through specialized factories where responsive coding and mirna genes are transcribedEMBO J 31:4404–4414Google Scholar
[34]
1. Pancaldi Vera
2. Carrillo-de Santa-Pau Enrique
3. Javierre Biola Maria
4. Juan David
5. Fraser Peter
6. Spivakov Mikhail
7. Valencia Alfonso
8. Rico Daniel
2016Integrating epigenomic data and 3d genomic structure with a new measure of chromatin assortativityGenome Biology 17:152Google Scholar
[35]
1. Moorman Celine
2. Sun Ling V
3. Wang Junbai
4. de Wit Elzo
5. Talhout Wendy
6. Ward Lucas D
7. Greil Frauke
8. Lu Xiang-Jun
9. White Kevin P
10. Bussemaker Harmen J
11. et al.
2006Hotspots of transcription factor colocalization in the genome of drosophila melanogasterProc. Natl. Acad. Sci. USA 103:12027–12032Google Scholar
[36]
1. Foley Joseph W
2. Sidow Arend
2013Transcription-factor occupancy at hot regions quantitatively predicts rna polymerase recruitment in five human cell linesBMC Genom 14:1–17Google Scholar
[37]
1. Cortini Ruggero
2. Filion Guillaume J
2018Theoretical principles of transcription factor traffic on folded chromatinNat. Commun 9:1–10Google Scholar
[38]
1. Ding Jun
2. Sharon Nadav
3. Bar-Joseph Ziv
2022Temporal modelling using single-cell transcriptomicsNat. Rev. Genet 23:355–368Google Scholar
[39]
1. Elmentaite Rasa
2. Conde Domínguez
3. Yang Lu
4. Teichmann Sarah A
2022Single-cell atlases: shared and tissue-specific cell types across human organsNat. Rev. Genet :1–16Google Scholar
[40]
1. Brackey Chris A.
2. Marenduzzo Davide
3. Gilbert Nick
2020Mechanistic modeling of chromatin folding to understand functionNature Methods 17https://doi.org/10.1038/s41592-020-0852-6 Google Scholar
[41]
1. Buckle Adam
2. Brackley Chris A
3. Boyle Shelagh
4. Marenduzzo Davide
5. Gilbert Nick
2018Polymer simulations of heteromorphic chromatin predict the 3d folding of complex genomic lociMol. Cell 72:786–797Google Scholar
[42]
1. Brackley C. A.
2. Johnson J.
3. Michieletto D.
4. Morozov A. N.
5. Nicodemi M.
6. Cook P. R.
7. Marenduzzo D.
2017Non-equilibrium chromosome looping via molecular slip-linksPhys. Rev. Lett 119:138101Google Scholar
[43]
1. Nicodemi Mario
2. Pombo Ana
2014Models of chromosome structureCurrent Opinion in Cell Biology 28:90–95Google Scholar
[44]
1. Conte Mattia
2. Chiariello Andrea M.
3. Abraham Alex
4. Bianco Simona
5. Esposito Andrea
6. Nicodemi Mario
7. Matteuzzi Tommaso
8. Vercellone Francesca
2022Polymer models of chromatin imaging data in single cellsAlgorithms 15https://doi.org/10.3390/a15090330 Google Scholar
[45]
1. Semeraro Massimiliano
2. Negro Giuseppe
3. Suma Antonio
4. Gonnella Giuseppe
5. Marenduzzo Davide
20233d polymer simulations of genome organisation and transcription across different chromosomes and cell typesPhysica A: Statistical Mechanics and its Applications 625:129013Google Scholar
[46]
1. Tiana Guido
2. Amitai Assaf
3. Pollex Tim
4. Piolot Tristan
5. Holcman David
6. Heard Edith
7. Giorgetti Luca
2016Structural fluctuations of the chromatin fiber within topologically associating domainsBiophys. J 110:1234–1245Google Scholar
[47]
1. Crippa Martina
2. Zhan Yinxiu
3. Tiana Guido
2020Effective model of loop extrusion predicts chromosomal domainsPhys. Rev. E 102:032414Google Scholar
[48]
1. Negro G.
2. Semeraro M.
3. Cook P. R.
4. Marenduzzo D.
2023A unified-field theory of genome organization and gene regulationarxiv Google Scholar
[49]
1. Bianco Simona
2. Lupiáñez Darío G
3. Chiariello Andrea M
4. Annunziatella Carlo
5. Kraft Katerina
6. Schöpflin Robert
7. Wittler Lars
8. Andrey Guillaume
9. Vingron Martin
10. Pombo Ana
11. et al.
2018Polymer physics predicts the effects of structural variants on chromatin architectureNat. Genet 50:662–667Google Scholar
[50]
1. Jost Daniel
2. Carrivain Pascal
3. Cavalli Giacomo
4. Vaillant Cedric
2014Modeling epigenome folding: formation and dynamics of topologically associated chromatin domainsNucleic Acids Res 42:9553–9561Google Scholar
[51]
1. Falk Martin
2. Feodorova Yana
3. Naumova Natalia
4. Imakaev Maxim
5. Lajoie Bryan R.
6. Leonhardt Heinrich
7. Joffe Boris
8. Dekker Job
9. Fudenberg Geoffrey
10. Solovei Irina
11. Mirny Leonid A.
2019Heterochromatin drives compartmentalization of inverted and conventional nucleiNature 570:395–399Google Scholar
[52]
1. Johnstone Sarah E.
2. Reyes Alejandro
3. Qi Yifeng
4. Adriaens Carmen
5. Hegazi Esmat
6. Pelka Karin
7. Chen Jonathan H.
8. Zou Luli S.
9. Drier Yotam
10. Hecht Vivian
11. Shoresh Noam
12. Selig Martin K.
13. Lareau Caleb A.
14. Iyer Sowmya
15. Nguyen Son C.
16. Joyce Eric F.
17. Hacohen Nir
18. Irizarry Rafael A.
19. Zhang Bin
20. Aryee Martin J.
21. Bernstein Bradley E.
2020Large-scale topological changes restrain malignant progression in colorectal cancerCell 182:1474–1489Google Scholar
[53]
1. Cho Won-Ki
2. Spille Jan-Hendrik
3. Hecht Micca
4. Lee Choongman
5. Li Charles
6. Grube Valentin
7. Cisse Ibrahim I
2018Mediator and rna polymerase ii clusters associate in transcription-dependent condensatesScience 361:412–415Google Scholar
[54]
1. Wei Mian
2. Fan Xiaoying
3. Ding Miao
4. Li Ruifeng
5. Shao Shipeng
6. Hou Yingping
7. Meng Shaoshuai
8. Tang Fuchou
9. Li Cheng
10. Sun Yujie
2020Nuclear actin regulates inducible transcription by enhancing rna polymerase ii clusteringScience Advances 6:eaay6515Google Scholar
[55]
1. Pancholi Agnieszka
2. Klingberg Tim
3. Zhang Weichun
4. Prizak Roshan
5. Mamontova Irina
6. Noa Amra
7. Sobucki Marcel
8. Kobitski Andrei Yu
9. Nienhaus Gerd Ulrich
10. Zaburdaev Vasily
11. et al.
2021Rna polymerase ii clusters form in line with surface condensation on regulatory chromatinMolecular systems biology 17:e10272Google Scholar
[56]
1. Cohen Barak A
2. Mitra Robi D
3. Hughes Jason D
4. Church George M
2000A computational analysis of wholegenome expression data reveals chromosomal domains of gene expressionNat. Genet 26:183–186Google Scholar
[57]
1. Gilbert Nick
2. Boyle Shelagh
3. Fiegler Heike
4. Woodfine Kathryn
5. Carter Nigel P
6. Bickmore Wendy A
2004Chromatin architecture of the human genome: generich domains are enriched in open chromatin fibersCell 118:555–566Google Scholar
[58]
1. Boyle Evan A
2. Li Yang I
3. Pritchard Jonathan K
2017An expanded view of complex traits: from polygenic to omnigenicCell 169:1177–1186Google Scholar
[59]
1. Brem Rachel B
2. Kruglyak Leonid
2005The landscape of genetic complexity across 5,700 gene expression traits in yeastProceedings of the National Academy of Sciences 102:1572–1577Google Scholar
[60]
1. Morgan Stefanie L.
2. Mariano Natasha C.
3. Bermudez Abel
4. Arruda Nicole L.
5. Wu Fangting
6. Luo Yunhai
7. Shankar Gautam
8. Jia Lin
9. Chen Huiling
10. Hu Ji-Fan
11. Hoffman Andrew R.
12. Huang Chiao-Chain
13. Pitteri Sharon J.
14. Wang Kevin C.
2017Manipulation of nuclear architecture through crispr-mediated chromosomal loopingNat. Comm 8:15993Google Scholar
[61]
1. Luan Jing
2. Xiang Guanjue
3. Aurelio Gómez-García Pablo
4. Tome Jacob M
5. Zhang Zhe
6. Vermunt Marit W
7. Zhang Haoyue
8. Huang Anran
9. Keller Cheryl A
10. Giardine Belinda M
11. et al.
2021Distinct properties and functions of ctcf revealed by a rapidly inducible degron systemCell reports 34:108783Google Scholar
[62]
1. Watts Duncan J
2. Strogatz Steven H
1998Collective dynamics of ‘small-world’networksNature 393:440–442Google Scholar
[63]
1. Hurst Laurence D
2. Pál Csaba
3. Lercher Martin J
2004The evolutionary dynamics of eukaryotic gene orderNat. Rev. Genet 5:299–310Google Scholar
[64]
1. Brackley Chris A
2. Liebchen Benno
3. Michieletto Davide
4. Mouvet Francois
5. Cook Peter R
6. Marenduzzo Davide
2017Ephemeral protein binding to dna shapes stable nuclear bodies and chromatin domainsBiophys J 112:1085–1093Google Scholar
[65]
1. Fudenberg Geoffrey
2. Imakaev Maxim
3. Lu Carolyn
4. Goloborodko Anton
5. Abdennur Nezar
6. Mirny Leonid A.
2016Formation of chromosomal domains by loop extrusionCell Rep 15:2038–2049Google Scholar
[66]
1. Chiang Michael
2. Forte Giada
3. Gilbert Nick
4. Marenduzzo Davide
5. Brackley Chris A
2022Predictive polymer models for 3d chromosome organizationHi-C Data Analysis: Methods and Protocols :267–291Google Scholar
[67]
1. Xu Meng
2. Cook Peter R
2008Similar active genes cluster in specialized transcription factoriesJ. Cell Biol 181:615–623Google Scholar
[68]
1. Razin SV
2. Gavrilov AA
3. Pichugin M Lipinski
4. Iarovaia OV
5. Vassetzky Yegor S
2011Transcription factories in the context of the nuclear and genome organizationNucleic acids research 39:9085–9092Google Scholar
[69]
1. Quinodoz Sofia A
2. Ollikainen Noah
3. Tabak Barbara
4. Palla Ali
5. Schmidt Jan Marten
6. Detmar Elizabeth
7. Lai Mason M
8. Shishkin Alexander A
9. Bhat Prashant
10. Takei Yodai
11. et al.
2018Higher-order inter-chromosomal hubs shape 3d genome organization in the nucleusCell 174:744–757Google Scholar
[70]
1. Quinodoz Sofia A
2. Jachowicz Joanna W
3. Bhat Prashant
4. Ollikainen Noah
5. Banerjee Abhik K
6. Goronzy Isabel N
7. Blanco Mario R
8. Chovanec Peter
9. Chow Amy
10. Markaki Yolanda
11. et al.
2021Rna promotes the formation of spatial compartments in the nucleusCell 184:5775–5790Google Scholar
[71]
1. Beagrie Robert A
2. Scialdone Antonio
3. Schueler Markus
4. Kraemer Dorothee CA
5. Chotalia Mita
6. Xie Sheila Q
7. Barbieri Mariano
8. de Santiago Inês
9. Lavitas Liron-Mark
10. Branco Miguel R
11. et al.
2017Complex multi-enhancer contacts captured by genome architecture mappingNature 543:519–524Google Scholar
[72]
1. Beagrie Robert A
2. Thieme Christoph J
3. Annunziatella Carlo
4. Baugher Catherine
5. Zhang Yingnan
6. Schueler Markus
7. Kukalev Alexander
8. Kempfer Rieke
9. Chiariello Andrea M
10. Bianco Simona
11. et al.
2023Multiplexgam: genome-wide identification of chromatin contacts yields insights overlooked by hi-cNature Methods 20:1037–1047Google Scholar
[73]
1. Liu Lei
2. Zhang Bokai
3. Hyeon Changbong
2021Extracting multi-way chromatin contacts from hi-c dataPLOS Computational Biology 17:e1009669Google Scholar
[74]
1. Berthelot Camille
2. Villar Diego
3. Horvath Julie E
4. Odom Duncan T
5. Flicek Paul
2018Complexity and conservation of regulatory landscapes underlie evolutionary resilience of mammalian gene expressionNat. Ecol. Evol 2:152–163Google Scholar
[75]
1. Laverré Alexandre
2. Tannier Eric
3. Necsulea Anamaria
2022Long-range promoter–enhancer contacts are conserved during evolution and contribute to gene expression robustnessGenome Res 32:280–296Google Scholar
[76]
1. Nozawa Ryu-Suke
2. Boteva Lora
3. Soares Dinesh C
4. Naughton Catherine
5. Dun Alison R
6. Buckle Adam
7. Ramsahoye Bernard
8. Bruton Peter C
9. Saleeb Rebecca S
10. Arnedo Maria
11. et al.
2017Saf-a regulates interphase chromosome structure through oligomerization with chromatinassociated rnasCell 169:1214–1227Google Scholar
[77]
1. Marenda Mattia
2. Michieletto Davide
3. Czapiewski Rafal
4. Stocks Jon
5. Winterbourne Sophie M
6. Miles Jamilla
7. Flemming Olivia CA
8. Lazarova Elena
9. Chiang Michael
10. Aitken Stuart
11. et al.
2024Nuclear rna forms an interconnected network of transcription-dependent and tunable microgelsBioRxiv Google Scholar
[78]
1. Eshghi Iraj
2. Zidovska Alexandra
3. Grosberg Alexander Y.
2023Model chromatin flows: numerical analysis of linear and nonlinear hydrodynamics inside a sphereThe European Physical Journal E 46:69Google Scholar
[79]
1. Eshghi Iraj
2. Zidovska Alexandra
3. Grosberg Alexander Y.
2023Activity-driven phase transition causes coherent flows of chromatinPhys. Rev. Lett 131:048401Google Scholar
[80]
1. Mahajan Achal
2. Yan Wen
3. Zidovska Alexandra
4. Saintillan David
5. Shelley Michael J.
2022Euchromatin activity enhances segregation and compaction of heterochromatin in the cell nucleusPhys. Rev. X 12:041033Google Scholar
[81]
1. Zidovska Alexandra
2. Weitz David A.
3. Mitchison Timothy J.
2013Micron-scale coherence in interphase chromatin dynamicsProceedings of the National Academy of Sciences 110:15555–15560Google Scholar
[82]
1. Brackley C. A.
2. Liebchen B.
3. Michieletto D.
4. Mouvet F. L.
5. Cook P. R
6. Marenduzzo D.
2017Ephemeral protein binding to dna shapes stable nuclear bodies and chromatin domainsBiophys. J 28:1085–1093Google Scholar
[83]
1. Olarte-Plata Juan D.
2. Haddad Noelle
3. Vaillant Cedric
4. Jost Daniel
2016The folding landscape of the epigenomePhys. Biol 13Google Scholar
[84]
1. Consortium ENCODE Project
2012An integrated encyclopedia of dna elements in the human genomeNature 489:57–74Google Scholar
[85]
1. Thompson A. P.
2. Aktulga H. M.
3. Berger R.
4. Bolintineanu D. S.
5. Brown W. M.
6. Crozier P. S.
7. in ‘t Veld P. J.
8. Kohlmeyer A.
9. Moore S. G.
10. Nguyen T. D.
11. Shan R.
12. Stevens M. J.
13. Tranchida J.
14. Trott C.
15. Plimpton S. J.
2022LAMMPS a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scalesComp. Phys. Comm 271:108171Google Scholar
[86]
1. Core Leighton J
2. Martins André L
3. Danko Charles G
4. Waters Colin T
5. Siepel Adam
6. Lis John T
2014Analysis of nascent rna identifies a unified architecture of initiation regions at mammalian promoters and enhancersNat. Genet 46:1311–1320Google Scholar
[87]
1. Jordán-Pla Antonio
2. Pérez-Martínez Maria E
3. Pérez-Ortín José E
2019Measuring rna polymerase activity genome-wide with high-resolution run-on-based methodsMethods 159:177–182Google Scholar
[88]
1. Rosa Angelo
2. Everaers Ralf
2008Structure and dynamics of interphase chromosomesPLoS Comp. Biol 4:e1000153Google Scholar

Article and author information

Author information

Massimiliano Semeraro
Dipartimento Interateneo di Fisica, Università degli Studi di Bari and INFN, Bari, Italy
ORCID iD: 0000-0001-8273-4232
Giuseppe Negro
SUPA School of Physics and Astronomy, University of Edinburgh, Edinburgh, United Kingdom, Dipartimento Interateneo di Fisica, Università degli Studi di Bari and INFN, Bari, Italy
ORCID iD: 0000-0003-1755-7051
Giada Forte
SUPA School of Physics and Astronomy, University of Edinburgh, Edinburgh, United Kingdom
ORCID iD: 0000-0001-9939-4465
Antonio Suma
Dipartimento Interateneo di Fisica, Università degli Studi di Bari and INFN, Bari, Italy
ORCID iD: 0000-0002-5049-9255
Giuseppe Gonnella
Dipartimento Interateneo di Fisica, Università degli Studi di Bari and INFN, Bari, Italy
ORCID iD: 0000-0002-1829-4743
- For correspondence: giuseppe.settimio.negro@gmail.com
Peter R Cook
Sir William Dunn School of Pathology, University of Oxford, Oxford, United Kingdom
ORCID iD: 0000-0002-6639-188X
Davide Marenduzzo
SUPA School of Physics and Astronomy, University of Edinburgh, Edinburgh, United Kingdom
ORCID iD: 0000-0003-3974-4915

Author Notes

Competing interests: No competing interests declared

Version history

Preprint posted: October 2, 2024
Sent for peer review: October 20, 2024
Reviewed Preprint version 1: January 2, 2025
Reviewed Preprint version 2: January 22, 2026

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.103955. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

views: 920
downloads: 63
citations: 0

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Significance of findings

Strength of evidence

Abstract

Introduction

Results

Toy model with different transcription factors

Toy model, with TUs coloured randomly (the random string).

Local mutations

Simulating effects of mutations.

Variations in TF concentration

Reducing the concentration of yellow TFs reduces the transcriptional activity of most yellow TUs while enhancing the activities of some red TUs.

Effects of 1D TU patterns on transcriptional activity

Clustering similar TUs in 1D genomic space increases transcriptional activity.

Emergent transcriptional correlation networks

TU transcriptional networks and demixing.

Transcriptional activity and comparison with real human chromosomes

Comparison of transcriptional activities of TUs on different human chromosomes determined from simulations and GRO-seq.

Specialized and mixed clusters

Small clusters tend to be unmixed, large ones mixed.

HiP-HoP model simulations: small clusters tend to be unmixed, large ones mixed.

Discussion and conclusions

Methods

Toy model

Human chromosomes

Data availability

Acknowledgements

Additional files

References

Article and author information

Author information

Massimiliano Semeraro

Giuseppe Negro

Giada Forte

Antonio Suma

Giuseppe Gonnella

Peter R Cook

Davide Marenduzzo

Author Notes

Version history

Cite all versions

Copyright

Metrics