OpenNucleome for high-resolution nuclear structural and dynamical modeling
Figures

Computer model of the human nucleus for structural and dynamical characterizations.
(A) 3D rendering of the nucleus model with particle-based representations for the 46 chromosomes shown as ribbons, the nuclear lamina (gray), nucleoli (cyan), and speckles (yellow). As shown on the right, chromosomes are modeled as beads-on-a-string polymers at a 100 KB resolution, with the beads further categorized into compartment A (red), compartment B (light blue), or centromeric regions (green). (B) Speckle particles undergo chemical modifications concurrent to their spatial dynamics, and the de-phosphorylated (dP) particles contribute to droplet formation. (C) Illustration of the ideal and compartment potential that promotes chromosome compaction and microphase separation. Specific interactions between chromosomes and nuclear landmarks are shown on the right.

Overview of the iterative algorithm for parameterizing the nucleus model with experimental data.
Starting from an initial set of parameters, we perform molecular dynamics (MD) simulations to produce an ensemble of nuclear structures. These structures can be transformed into contacts between chromosomes or between chromosomes and nuclear landmarks for direct comparison with experimental data. Differences between simulated and experiment contacts are used to update parameters for additional rounds of optimization if needed.

The number of speckle clusters formed along a typical simulation trajectory.
This plot shows that the kinetic scheme of speckle particle exchange produces a total of ~30 speckle droplets, reproducing experimental observations. See Appendix 1, section ‘Speckles as phase-separated droplets undergoing chemical modifications’ for more details of the kinetic scheme.

OpenNucleome facilitates GPU-accelerated simulations of the human nucleus.
(A) Illustration of workflow for setting up, performing, and analyzing molecular dynamics (MD) simulations. (B) Python scripts setting up whole nucleus simulations. (C) Performance of MD simulations on different number of CPU cores and a single GPU.

Simulated structures reproduce contact frequencies between chromosomes and between chromosomes and nuclear landmarks.
(A) Comparison between simulated (top right) and experimental (bottom left) whole-genome contact probability maps with Pearson correlation coefficient r = 0.89. Zoom-ins of various regions are provided in Figure 4—figure supplement 1. (B) Comparison between simulated and experimental average contact frequencies, including average contacts between genomic loci from the same chromosomes at a given separation (top), average contacts between genomic loci classified into different compartment types (middle), and average contacts between various chromosome pairs (bottom). (C) Comparison between simulated and experimental Lamin-B DamID (top) and SON TSA-Seq signals (bottom), with Pearson correlation coefficients of haploid chromosomes shown on the right.

Zoom-in of various regions in the contact map presented in Figure 4 further supports the agreement between simulation and experiment.
The simulated and experimental contacts are shown in the upper and lower triangles, respectively, and the Pearson correlation coefficients r between two the sets of contacts were calculated with Equation 41.

Structural and dynamical predictions of the nucleus model match results from microscopy imaging.
(A) Comparison between the simulated and experimental radius of gyration, , for haploid chromosomes. The Pearson correlation coefficient between the two, r, is shown in the legend. (B) Comparison between the simulated and experimental normalized radial positions for haploid chromosomes, with their Pearson correlation coefficient shown in the legend. Detailed definition of the normalized radial positions is provided in Appendix 1, section ‘Computing simulated normalized chromosome radial positions’. (C) Mean-squared displacements (MSDs) as a function of time are shown for selected telomeres. (D) The probability distribution of the anomalous exponent, α, obtained from fitting the MSDs curves for all telomeres with the expression, .

Heterogeneity and conserved features of nuclear organizations.
(A) Normalized chromosome radial positions as a function of simulation time. (B) Contacts between chromosomes 1 and 2 from two independent simulation trajectories show significant variations. (C) Genome-wide in silico Lamin B DamID (top) and SON TSA-Seq (bottom) profiles computed from two independent trajectories. Pearson correlation coefficients, r, are provided on each plot. (D) Pairwise Person correlation coefficients between interchromosomal contact matrices (left), genome-wide Lamin B DamID profiles (middle), and genome-wide SON TSA-Seq profiles (right) determined from independent trajectories. The averages excluding the diagonals of the three datasets are 0.06, 0.53, and 0.72.

Arrested kinetics of chromosome positions over the timescale of cell cycles.
(A) Mean-squared displacement (MSD) of chromosome center of masses as a function of time. The MSDs are much smaller than the average size of chromosomes (see Rg values in Figure 5A), supporting arrested dynamics. (B) Autocorrelation function of normalized chromosome radial positions as a function of time. The autocorrelation function was computed as where t indexes over the trajectory frames and is the mean position.

Configurations used to initialize simulations capture the heterogeneity in interchromosomal contacts seen in DNA-MERFISH data.
(A) Comparison between the simulated and experimental distribution of Pearson correlation coefficients of interchromosomal contacts. Both distributions were computed using the pairwise correlations between genome structures from the respective configurational ensemble. The pairwise interchromosomal contacts include all unique haploid pairs and were computed by averaging the contact probabilities between all genomic segments from the four respective chromosomes. The simulated configuration ensemble includes 1000 unique structures prepared following the protocol outlined in Appendix 1, section ‘Initial configurations for simulations’. The experimental ensemble includes 5455 structures reported in Su et al., 2020 using DNA-MERFISH. (B) Pearson correlation coefficient between average experimental and simulated pairwise interchromosomal contacts as a function of the number of independent configurations used to initialize simulations. Interchromosomal contacts are similarly defined as in (A) and were computed by averaging over all configurations in the respective ensembles. To compute simulated interchromosomal contacts, we used a selection procedure as detailed in Appendix 1, section ‘Initial configurations for simulations’ to determine an ensemble of a given sample size. As the size of the ensemble increases, the resulting average interchromosomal contacts agree better with experimental results. The error bars represent standard deviation computed from 20 independent trials. (C, D) The protocol for selecting initial configurations for simulations robustly capture the heterogeneity seen in experimental configurational ensemble. To examine the experimental structural heterogeneity, we applied Uniform Manifold Approximation and Projection (UMAP) to reduce the interchromosomal contacts into two variables following the procedure detailed in Appendix 1, section ‘Interchromosomal contacts from DNA MERFISH data’. The plot in (C) shows the Pearson correlation coefficients of 50 initial configurations represented in the UMAP variables between independent trials. The average correlation coefficient is 0.98. In (D), we show three independent initial configurations (orange dots) over the distribution of UMAP variables estimated using the experimental configurational ensemble.

Nuclear deformations influence genome organization while preserving chromatin-speckle contacts.
(A) Illustration of force-induced nuclear envelope deformation. The nuclear lamina is modeled as a particle mesh where neighboring lamina particles are covalently bonded together. (B) Example nucleus conformations at different strengths of applied force. (C) Pearson correlation coefficients between results from simulations of deformed nuclei and those from a spherical nucleus for interchromosomal contacts (left), DamID profiles (middle), and TSA-Seq (right). The values at zero force were computed from two independent simulations starting from the same initial configurations.

Impact of nuclear deformation on normalized chromosome radial positions.
(A) The position of chromosome 21 before and after the nuclear deformation. (B) Normalized radial positions of individual diploid chromosomes at various strengths of nuclear deformation forces. (C) Pearson correlation coefficients between normalized chromosome radial positions from simulations of deformed nuclei and those from a spherical nucleus. See Appendix 1, section ‘Nuclear envelope deformation simulations’ for simulation details.

The function defined in Equation 13 smoothly switches from high to low contact probabilities.
The left and right panels plot the function and its derivative as a function of the distance, r.

Correlation between average DamID profiles of individual chromosomes with their normalized radial positions.
The normalized radial positions were determined using the average value of all cells reported from DNA MERFISH data (Su et al., 2020). The correlation coefficient between the two datasets is 0.8.

Parameters of the ideal potential, , as defined in Equation 12.
Numerical values for are included in the software’s GitHub repository.

Parameters of the inter potential, , as defined in Equation 15.
Numerical values for are included in the software’s GitHub repository.

Parameters of the chromosome-lamina potential, , as defined in Equation 24.
Numerical values for are included in the software’s GitHub repository.

Parameters of the chromosome-lamina potential, , as defined in Equation 25.
Numerical values for are included in the software’s GitHub repository.

Pearson correlation coefficients between experimental and simulated contact probabilities at various sequence separations within specific chromosomes.
For each chromosome, we first gathered a set of experimental contacts alongside a matching set of simulated ones for genomic pairs within a particular separation range. The Pearson correlation coefficient at the corresponding sequence separation was then determined using Equation 41. We limited the calculations to half of the chromosome length to ensure the availability of sufficient data.

Comparison between the Class2 potential (defined in Eq. 17) and the Harmonic potential (K(r − r0)2, with K = 20 and r0 = 0.5).

Pearson correlation coefficient between experimental and simulated contact probabilities as a function of the sequence separation within specific chromosomes.
For each chromosome, we first gathered a set of experimental contacts alongside a matching set of simulated ones for genomic pairs within a particular separation range. The Pearson correlation coefficient at the corresponding sequence separation was then determined using Equation R4. We limited the calculations to half of the chromosome length to ensure the availability of sufficient data.
Tables
Summary of the various terms of the chromosome energy function and the algorithms used for parameter optimization.
See also Appendix 1, section ‘Hi-C inspired interactions for the diploid human genome’ for detailed expression of the energy function and ‘Adam optimizer for chromosome interaction parameters’ for details on the optimization algorithm.
Potentials | Functional forms | Parameter values |
---|---|---|
Bonding potential | in Equation 9 | Standard values in coarse-grained polymer models |
Angular Potential | in Equation 9 | Standard values in coarse-grained polymer models |
Soft-core potential | in Equation 11 | Standard values in coarse-grained polymer models |
Ideal potential | in Equation 12 | Values for were obtained from optimizations against Hi-C data (see Appendix 1—figure 3). |
Compartment potential | in Equation 14 | Values for were obtained from optimizations against Hi-C data (see Appendix 1—table 2) |
Inter potential | in Equation 15 | Values for were obtained from optimizations against Hi-C data (see Appendix 1—figure 4) |
Summary of interaction parameters between various compartment types, that is, defined in Equation 14.
αAA | –0.074185 |
αAB | 0.112285 |
αAC | 0.009947 |
αBB | 0.059981 |
αBC | 0.072481 |
αCC | 0.088825 |
Summary of the interaction potentials among particles that make up the nuclear landmarks and their corresponding parameter values.
See also Appendix 1, section ‘Nuclear landmark–nuclear landmark interactions’ for further discussion and ‘Unit conversion’ for choosing the size of various particles, from which the were derived with a linear combination rule.
Potentials | Function forms | Parameter values |
---|---|---|
Nucleolus/nucleolus | in Equation 18 | and were chosen to mimic short-range attractions that produce an average of two nucleoli per cell. . |
Sp-dP/Sp-dP (speckles) | in Equation 18 | and were chosen to mimic short-range attractions that produce around 30 speckle droplets. . |
Sp-dP/Sp-P (speckles) | in Equation 18 | and were chosen as standard values to provide the excluded volume effect. . |
Sp-P/Sp-P (speckles) | in Equation 18 | and were chosen as standard values to provide the excluded volume effect. . |
Nucleolus/speckle | in Equation 18 | and were chosen as standard values to provide the excluded volume effect. . |
Nucleolus/lamina | in Equation 18 | and were chosen as standard values to provide the excluded volume effect. . |
Speckle/lamina | in Equation 18 | and were chosen as standard values to provide the excluded volume effect. . |
Lamina/lamina | in Equation 18 | and were chosen as standard values to provide the excluded volume effect. . |
-
* As mentioned in Appendix 1, section ‘Mapping lamina bead size to real unit’, a larger value for was used here to provide a stronger excluded volume effect that prevents these particles from crossing the nucleus boundary or getting stuck in the space of the lamina particle mesh grid.
Summary of the interaction potentials between chromatin particles and nuclear landmarks and their corresponding parameter values.
See also Appendix 1, section ‘Chromosome–nuclear landmark interactions’ for further discussion and ‘Adam optimizer for chromosome–nuclear body interaction parameters’ for details on the optimization algorithm.
Potentials | Functional forms | Parameter values |
---|---|---|
Chromatin-nucleolus | in Equation 26 | provides a smooth transition in the tanh function for contacts. reflects the minimal distances between chromatin and nucleolus beads as reflected in the excluded volume potential defined in Appendix 1—table 3. The interaction strength of the ith chromatin bead , where is the probability for the chromatin bead i to contact nucleoli as quantified by the software SPIN (Wang et al., 2021). |
Chromatin-speckle | in Equation 25 | and were similarly determined as in . Value for the interaction strength of the ith chromatin bead was obtained from optimizations against SON TSA-Seq data. |
Chromatin-lamina | in Equation 24 | and were similarly determined as in . Value for the interaction strength of the ith chromatin bead was obtained from optimizations against Lamin B DamID data. The extra Lennard Jones potential was included to provide the excluded volume effect, with and as standard values. . |
-
* As mentioned in Appendix 1, section ‘Mapping lamina bead size to real unit’, a larger value for was used here to provide a stronger excluded volume effect that prevents these particles from crossing the nucleus boundary or getting stuck in the space of the lamina particle mesh grid.