DNA sequence encodes the position of DNA supercoils

  1. Sung Hyun Kim
  2. Mahipal Ganji
  3. Eugene Kim
  4. Jaco van der Torre
  5. Elio Abbondanzieri  Is a corresponding author
  6. Cees Dekker  Is a corresponding author
  1. Kavli Institute of Nanoscience, Delft University of Technology, The Netherlands

Abstract

The three-dimensional organization of DNA is increasingly understood to play a decisive role in vital cellular processes. Many studies focus on the role of DNA-packaging proteins, crowding, and confinement in arranging chromatin, but structural information might also be directly encoded in bare DNA itself. Here, we visualize plectonemes (extended intertwined DNA structures formed upon supercoiling) on individual DNA molecules. Remarkably, our experiments show that the DNA sequence directly encodes the structure of supercoiled DNA by pinning plectonemes at specific sequences. We develop a physical model that predicts that sequence-dependent intrinsic curvature is the key determinant of pinning strength and demonstrate this simple model provides very good agreement with the data. Analysis of several prokaryotic genomes indicates that plectonemes localize directly upstream of promoters, which we experimentally confirm for selected promotor sequences. Our findings reveal a hidden code in the genome that helps to spatially organize the chromosomal DNA.

https://doi.org/10.7554/eLife.36557.001

Introduction

Control of DNA supercoiling is of vital importance to cells. Torsional strain imposed by DNA-processing enzymes induces supercoiling of DNA, which triggers large structural rearrangements through the formation of plectonemes (Vinograd et al., 1965). Recent biochemical studies suggest that supercoiling plays an important role in the regulation of gene expression in both prokaryotes (Le et al., 2013) and eukaryotes (Naughton et al., 2013; Pasi and Lavery, 2016). In order to tailor the degree of supercoiling around specific genes, chromatin is organized into independent topological domains with varying degrees of torsional strain (Naughton et al., 2013; Sinden and Pettijohn, 1981). Domains that contain highly transcribed genes are generally underwound whereas inactive genes are overwound (Kouzine et al., 2013). Furthermore, transcription of a gene transiently alters the local supercoiling (Kouzine et al., 2013; Naughton et al., 2013; Peter et al., 2004), while, in turn, torsional strain influences the rate of transcription (Chong et al., 2014; Liu and Wang, 1987; Ma et al., 2013).

For many years, the effect of DNA supercoiling on various cellular processes has mainly been understood as a torsional stress that enzymes should overcome or exploit for their function. More recently, supercoiling has been acknowledged as a key component of the spatial architecture of the genome (de Wit and de Laat, 2012; Dekker et al., 2013; Ding et al., 2014; Neuman, 2010). Here, bound proteins are typically viewed as the primary determinant of sequence-specific tertiary structures while intrinsic mechanical features of the DNA are often ignored. However, the DNA sequence influences its local mechanical properties such as bending stiffness, curvature, and duplex stability, which in turn alter the energetics of plectoneme formation at specific sequences (Dittmore et al., 2017; Irobalieva et al., 2015; Matek et al., 2015). Unfortunately, the relative importance of these factors that influence the precise tertiary structure of supercoiled DNA have remained unclear (Dekker and Heard, 2015). Various indications that the plectonemic structure of DNA can be influenced by the sequence were obtained from biochemical and structural studies (Kremer et al., 1993; Laundon and Griffith, 1988; Pfannschmidt and Langowski, 1998; Tsen and Levene, 1997) as well as from work performed in silico (Eslami-Mossallam et al., 2016; Pasi and Lavery, 2016; Wang et al., 2017). These studies suggested that plectonemes may get localized to highly curved or flexible segments of DNA. However, this examined only a handful of specific sequences such as phased poly(A)-tracts and a particular high–curvature sequence rich in poly(A)-tracts, making it difficult to determine if curvature, long poly(A)-tracts, or some other DNA feature drives the sequence–structure relationship.

Here, we study how DNA sequence governs the structure of supercoiled DNA by use of a recently developed single-molecule technique termed ISD (intercalation-induced supercoiling of DNA) (Ganji et al., 2016b), which uses intercalating dyes to induce supercoiling as well as to observe the resultant tertiary structures in many DNA molecules in parallel. Plectonemes are directly observable as intensity maxima along the DNA, from which their position along DNA can be extracted (see Figure 1a and Figure 1—figure supplement 1). We find a strong relationship between sequence and plectoneme localization. By examining many different sequences, we systematically rule out several possible mechanisms of the observed sequence dependence. Using a model built on basic physics, we show that the local intrinsic curvature determines the relative plectoneme stability at different sequences. Application of this model to sequenced genomes reveals a clear biological relevance, as we identify a class of plectonemic hot spots that localize upstream of prokaryotic promoters. Subsequently, we confirm that these sequences pin plectonemes in our single-molecule assay, testifying to the predictive power of our model. We also discuss several eukaryotic genomes where plectonemes are localized near promoters with a spacing consistent with nucleosome positioning. Taken together, our experimental results and our physical model show a clear sequence-supercoiling relationship and indicate that genomic DNA encodes information for positioning of plectonemes, likely to regulate gene expression and contribute to the three-dimensional spatial ordering of the genome.

Figure 1 with 2 supplements see all
Direct visualization of individual plectonemes on supercoiled DNA.

(a) Schematic of the ISD assay. (top) A flow-stretched DNA is doubly-tethered on a PEG-coated surface via streptavidin-biotin linkage. One-end of the DNA is labeled with Cy5-fluorophores (red stars) for identifying the direction of each DNA molecule. (bottom) Binding of SxO fluorophores induces supercoiling to the torsionally constrained DNA molecule. (b) Representative fluorescence images of a supercoiled DNA molecule. Left: Snap-shot image of a supercoiled DNA with 100 ms exposure. Yellow arrows highlight higher DNA density, that is individual plectonemes. Right: Time-averaged DNA image by stacking 1000 images (of 100 ms exposure each). Arrows indicate peaks in the inhomogeneous average density of plectonemes. (c) AT-contents of two DNA samples: template1 and template2 binned to 300 bp. (d) Plectoneme densities obtained from individual DNA molecules. (top) Plectoneme density on template1 (grey thin lines, n = 70) and their ensemble average (red line). Arrow indicates a strong plectoneme pinning site. (bottom) Plectoneme densities obtained from individual DNA molecules of template2 (grey thin lines, n = 120) and their ensemble average (black line).

https://doi.org/10.7554/eLife.36557.002

Results

Single-molecule visualization of individual plectonemes along supercoiled DNA

To study the behavior of individual plectonemes on various DNA sequences, we prepared 20 kb-long DNA molecules of which the end regions (~500 bp) were labelled with multiple biotins for surface immobilization (Figure 1—figure supplement 1a–b). The DNA molecule were flowed into streptavidin-coated sample chamber at a constant flow rate to obtain stretched double-tethered DNA molecules (Figure 1a and Figure 1—figure supplement 1a). We then induced supercoiling by adding an intercalating dye, Sytox Orange (SxO), into the chamber and imaged individual plectonemes formed on the supercoiled DNA molecules. Notably, SxO does not have any considerable effect on the mechanical properties of DNA under our experimental conditions (Ganji et al., 2016b).

Consistent with previous studies (Ganji et al., 2016b; van Loenhout et al., 2012), we observed dynamic spots along the supercoiled DNA molecule (highlighted with arrows in Figure 1b-top left and Video 1). These spots disappeared when DNA torsionally relaxed upon photo-induced nicking (Figure 1b-bottom left) (Ganji et al., 2016b), confirming that the spots were plectonemes induced by the supercoiling. Interestingly, the time-averaged fluorescence intensities of the supercoiled DNA were not homogeneously distributed along the molecule (Figure 1b-top right), establishing that plectoneme occurrence is position dependent. In contrast, torsionally relaxed (nicked) DNA displayed a featureless homogenous time-averaged fluorescence intensity (Figure 1b-bottom right).

Video 1
A representative real-time fluorescence image of a supercoiled DNA molecule that shows dynamic bright spots upon plectoneme formation.

At 20 s after acquisition, the DNA was torsionally relaxed due to photo-induced nicking.

https://doi.org/10.7554/eLife.36557.005

DNA sequence favors plectoneme localization at certain spots along supercoiled DNA

Upon observing the inhomogeneous fluorescence distribution along the supercoiled DNA, we sought to understand if the average plectoneme position is dependent on the underlying DNA sequence. We prepared two DNA samples; the first contained a uniform distribution of AT-bases while the second contained a strongly heterogeneous distribution of AT-bases (Figure 1c, template1 and template2, respectively). In order to quantitatively analyze the plectoneme distribution, we counted the average number of plectonemes over time at each position on the DNA molecules and built a position-dependent probability density function of the plectoneme occurrence (from now onwards called plectoneme density; see Materials and methods for details). The plectoneme density is normalized to its average value across the DNA such that a density value above one indicates that the region is a favorable position for plectonemes relative to other regions within the DNA molecule. For both DNA samples, we observed a strongly position-dependent plectoneme density (Figure 1d). Strikingly, the plectoneme densities (Figure 1d) were very different for the two DNA samples. This difference demonstrates that plectoneme positioning is directed by the underlying DNA sequence. Note that we did not observe any position dependence in the intensity profiles when the DNA was torsionally relaxed, indicating that the interaction of dye is not responsible for the dependence (Figure 1—figure supplement 2a).

The plectoneme kinetics showed a similar sequence dependence, as the number of events for nucleation and termination of plectonemes was also found to be position dependent with very different profiles for each DNA samples (Figure 1—figure supplement 2b). Importantly, at each position of the DNA, the number of nucleation and termination events were the same, showing that the system was at equilibrium. Because the aim of our study is to examine the sequence–structure relationship in supercoiled DNA, which is an equilibrium property, we focus on analyzing the plectoneme density profiles for a variety of sequences.

Systematic examination of plectoneme pinning at various putative DNA sequences

We first considered a number of potential links between DNA sequence and plectoneme density. Note that in particular the sharply bent apical tips of plectonemes (Figure 1A) create an energy barrier to plectoneme formation. This barrier could be reduced if the DNA was able to locally melt or kink, if a specific region of DNA was more flexible than others, or if the DNA sequence was intrinsically curved already before the plectoneme formed. Because all of these properties (duplex stability, flexibility, and curvature) are influenced by the AT-content, we first examined the relationship between AT-content and the measured plectoneme densities in Figure 1c–d. Indeed, the plectoneme density showed a weak correlation with the local AT-percentage (R = 0.33, Figure 2—figure supplement 1a).

In order to unambiguously link changes in plectoneme density to specific sequences of arbitrary size, we developed an assay where we inserted various short DNA segments carrying particular sequences of interest in the middle of the homogeneous template1 (Figure 2a and Figure 2—figure supplement 1b). This allowed us to easily determine the influence of the inserted sequence on plectoneme formation by measuring changes in the plectoneme density at the insert relative to the rest of the DNA strand. We examined three different AT-rich inserts: seqA, seqB, and seqC with ~60%,~65%, and ~60% AT, respectively (Figure 2a). Interestingly, all three samples showed a peak in the plectoneme density at the position of insertion, further supporting the idea that AT-rich sequences are preferred positions for plectonemes (Figure 2b). Furthermore, when we shortened or lengthened one AT-rich sequence (seqA), we found that the probability of plectoneme pinning (i.e. the area under the peak) scaled with the length of the AT-rich fragment (Figure 2—figure supplement 1b–e). Overall, these results suggest that plectoneme preferentially form at AT-rich regions.

Figure 2 with 1 supplement see all
Sequence-dependent pinning of DNA plectonemes.

(a) Top: Schematics showing DNA constructs with AT-rich fragments inserted in template1. Three different AT-rich segments, SeqA (400 bp), SeqB (500 bp), and SeqC (1 kb), are inserted at 8.8 kb from Cy5-end in template1. Bottom: AT-contents of these DNA constructs zoomed in at the position of insertion. (b) Averaged plectoneme densities measured for the AT-rich fragments denoted in (A). The insertion region is highlighted with a gray box. (n = 43, 31, and 42 for SeqA, SeqB, and SeqC, respectively) (c) Schematics of DNA constructs with a copy of the 1 kb region near the right end of template1 where strong plectoneme pinning is observed (seqCopy). Poly(A)-tracts within the copied region are then mutated either by replacing A bases with G or C (A-G mutation), or with T (A-T mutation). (d) Plectoneme densities measured for the sequences denoted in (c). Plectoneme density of template1 is shown in black, seqCopy in green, A-G mutation in blue, and A-T mutation in red. (n = 45, 34, and 42 for seqCopy, A-G mutation, and A-T mutation, respectively) (e) Schematics of DNA constructs with mixed A/T stretches modified from seqB. The insert is modified either by shuffling nucleotides within the insert to destroy all the poly(A) and poly(A/T)-tracts (Base shuffle), or by re-positioning the poly(A) or poly(A/T)-tracts (AT-tracts shuffle) – both while maintaining the exact same AT content across the insert. (f) Plectoneme densities measured for the sequences denoted in (e). seqB from panel (b) is plotted in green; base shuffle data are denoted in purple; AT-tracts shuffle in orange. (n = 24, and 26 for Base shuffle, and AT-tracts shuffle, respectively).

https://doi.org/10.7554/eLife.36557.006

However, it is clear that AT-content alone cannot be the only factor that sets the plectoneme pinning. For example, the right-end of template1 exhibits a region that pins plectonemes strongly (Figure 1d-top, arrow), even though this region is not particularly AT-rich (Figure 1c). When we inserted a 1 kb copy of this pinning region into the middle of template1 (Figure 2c, ‘seqCopy’), we observed an additional peak in plectoneme density (Figure 2d, green). Given that this region had the same total AT-content as the surrounding DNA, we hypothesized that the particular distribution of A and T bases may be more important than the total AT-content alone. In particular, poly(A)-tracts influence the local mechanical properties of DNA and might be responsible for the plectoneme pinning, as suggested by early studies (Kremer et al., 1993; Pfannschmidt and Langowski, 1998; Tsen and Levene, 1997). To test this, we removed all poly(A) tracts of length four or higher by replacing alternative A-bases with G or C-bases in seqCopy (Figure 2c, ‘A-G mutation’). Upon this change, the peak in the plectoneme density indeed disappeared (Figure 2d, blue). However, when we instead disrupted the poly(A)≥4 tracts by replacing them with alternating AT-stretches (Figure 2c, ‘A-T mutation’), we, surprisingly, did observe strong pinning (Figure 2d, red), establishing that plectoneme pinning does not strictly require poly(A)-tracts either. Hence, instead of poly(A)-tracts, it could be possible that stretches consisting of either A and T (‘poly(A/T)-tracts’) induce the plectoneme pinning. To test this hypothesis, we re-examined the seqB construct to test if long stretches of ‘weak’ bases (i.e. A or T) were the source of pinning. Here, we broke up all poly(A/T)≥4 tracts (i.e. all linear stretches with a random mixture of A or T bases but no G or C bases) by shuffling bases within the seqB insert while keeping the overall AT-content the same. This eliminated plectoneme-pinning, consistent with the idea that poly(A/T) tracts were the cause (Figure 2e–f, purple). However, if we instead kept all poly(A/T)≥4 tracts intact, but merely rearranged their positions within the seqB insert (again keeping AT-content the same), this rearrangement abolished the pinning pattern (Figure 2f, orange), indicating that plectoneme pinning is not solely dependent on the presence of poly(A/T) stretches, but instead is dependent on the relative positions of these stretches.

Taken together, this systematic exploration of various sequences showed that although pinning correlates with AT-content, we cannot attribute this correlation to AT-content alone, to poly(A)-tracts, or to poly(A/T)-tracts. Our data instead suggest that plectoneme pinning depends on a local mechanical property arising from the combined effect of the entire base sequences in a local region, and our shuffled poly(A/T) constructs suggest this property must be measured over distances greater than tens of nucleotides. Among the three mechanical properties we first considered, duplex stability, flexibility, and curvature, the duplex stability is unlikely to be a determinant factor for the plectoneme pinning because duplex stability is mostly determined by the overall AT/GC percentage rather than the specific distribution of bases in the local region.

Intrinsic local DNA curvature determines the pinning of supercoiled plectonemes

To obtain a more fundamental understanding of the sequence specificity underlying the plectoneme pinning, we developed a novel physical model based on intrinsic curvature and flexibility for estimating the plectoneme energetics (see Materials and methods for details). Notably, the major energy cost for making a plectoneme is spent in inducing a strong bend within the DNA in the plectoneme tip region. Our model estimates the energy cost associated with bending the DNA into the highly curved (~240° arc) plectoneme tip (Marko and Neukirch, 2012). For example, at 3pN of tension (characteristic for our stretched DNA molecules), the estimated size of the bent tip is 73 bp, and the energy required to bend it by 240° is very sizeable,~18 kBT (Figure 3a–b). However, if a sequence has a high local intrinsic curvature or flexibility, this energy cost decreases significantly. For example, an intrinsic curvature of 60˚ between the two ends of a 73 bp segment would lower the bending energy by a sizable amount,~8 kBT. Hence, we expect that this energy difference drives plectoneme tips to pin at specific sequences. We calculated local intrinsic curvatures at each segment along a relaxed DNA molecule using published dinucleotide parameters for tilt/roll/twist (Figure 3a and supplementary file 1) (Balasubramanian et al., 2009). The local flexibility of the DNA was estimated by summing the dinucleotide covariance matrices for tilt and roll (Lankas et al., 2003) over the length of the loop. Using this approach, we estimate the bending energy of a plectoneme tip centered at each nucleotide along a given sequence (Figure 3b). The predicted energy landscape is found to be rough with a standard deviation of about ~1 kBT, in agreement with a previous experimental estimate based on plectoneme diffusion rates (van Loenhout et al., 2012). We then used these bending energies to assign Boltzmann-weighted probabilities, PB=exp(EloopkBT), for plectoneme tips centered at each base on a DNA sequence. This provided theoretically estimated plectoneme densities as a function of DNA sequence. Note that we obtained these profiles without any adjustable fitting parameters as the tilt/roll/twist and flexibility values were determined by dinucleotide parameters adopted from published literature. Although both intrinsic curvature and flexibility were included, the model predicts that the flexibility is unimportant and that intrinsic curvature clearly is the dominant factor in positioning plectonemes (Figure 3c).

Figure 3 with 2 supplements see all
DNA plectonemes pin to sequences that exhibit local curvature.

(a) Ingredients for an intrinsic-curvature model that is strictly based on dinucleotide stacking. (Left) Cartoons showing the relative alignment between the stacked bases which are characterized by three modes: roll, tilt, and twist. (Middle) In the absence of variations in the roll, tilt, and twist, a DNA molecule adopts a strictly linear conformation in 3D space. (Right) Example of a curved free path of DNA that is determined by the slightly different values for intrinsic roll, tilt, and twist angles for every dinucleotide. (b) Schematics showing the energy required to bend a rigid elastic rod as a simple model for the tip of a DNA plectoneme. (c) Plectoneme density prediction based on intrinsic curvature and/or flexibility for seqCopy. Predicted plectoneme densities calculated based on either DNA flexibility (blue), only curvature (red), or both (black). Combining flexibility and curvature did not significantly improve the prediction comparing to that solely based on DNA curvature. (d) Predicted plectoneme densities for the DNA constructs carrying a copy of the end peak and its mutations, as in Figure 2b. Note the excellent correspondence to the experimental data in Figure 2b. (e–f) Predicted (e) and measured (f) plectoneme density of a synthetic sequence (250 bp) that is designed to strongly pin a plectoneme. Raw data from the model are shown in black and its Gaussian-smoothed (FWHM = 1600 bp) is shown in blue in the left panel. Plectoneme densities measured from individual DNA molecules carrying the synthetic sequence (thin grey lines) and their averages (thick blue line) are shown in the right panel. (n = 37, and 21 for curved250, and flat500, respectively) (g–h) Model-predicted (upper panels) and experimentally measured (bottom panels) plectoneme densities of 75 bp-long highly curved (g) and flat (h) inserts. (i) Model-predicted (upper panels) and experimentally measured (bottom panels) plectoneme densities of curved GC-rich sequences. (n = 36, 26, 21, 20, 52, and 29 for curve75-1, curve75-2, flat75-1, flat75-2, GCcurve1, and GCcurve2, respectively).

https://doi.org/10.7554/eLife.36557.008

The predicted plectoneme densities (Figure 3d and Figure 3—figure supplement 1) are generally found to be in very good agreement with the measured plectoneme densities. For example, the non-intuitive mutant sequences tested above (A-G and A-T mutations) are faithfully predicted by the model (Figure 2d and Figure 3d). More generally, we find that the model qualitatively represented the experimental data for the large majority of the sequences that were tested (Figure 3—figure supplement 1). The simplicity of the model and the lack of fitting parameters make this agreement all the more striking. Only occasionally, we find that the model is too conservative, that is while it performs well in avoiding false positives, it suffers from some false negatives (Figure 3—figure supplement 1, SeqA, SeqB, and SeqC), possibly because of an insufficient accuracy in the dinucleotide parameters that we adopted from the literature. For example, different dinucleotide parameter sets from the currently available literature produce variations in the model predictions (Figure 3—figure supplement 2). Alternative explanations for the false negatives are also possible, for example that the local curvature is influenced by interactions spanning beyond nearest-neighbor nucleotides, or some unknown DNA sequences that stabilize twist rather than strand writhing or that are prone to base-flipping even in the positive supercoiling regime.

As a test of the predictive power of our model, we designed a 250 bp-long sequence (‘curved250’) for which our model a priori predicted a high local curvature and strong plectoneme pinning (Figure 3e). When we subsequently synthesized and measured this construct, we indeed observed a pronounced peak in the plectoneme density (Figure 3f, blue). By contrast, when we constructed a 500 bp-long flat sequence without strongly curved regions (‘flat500’), the model predicted no such peak, which again was verified experimentally (Figure 3f, black). These data demonstrate that the model can be used to identify potential plectoneme pinning sites in silico. Perhaps most strikingly, we found that a single highly curved DNA sequence of only 75 bp length was able to pin plectonemes (Figure 3g), consistent with the approximated tip loop size in our physical model (~73 bp). As a negative control, we did not observe any such pinning when we inserted a 75 bp-long flat DNA sequence (Figure 3h).

Finally, we wanted to verify that the intrinsic curvature, and not the GC/AT content, is the major determinant for plectoneme formation. Given that the earlier examples in Figure 2f clearly showed that some but not all AT-rich sequences can pin plectonemes, we designed some specifically GC-rich (i.e., AT-poor) sequences that should pin plectonemes. Because of the distribution of wedge angles available, GC-rich sequences tend to produce less intrinsic curvature over >10 bp sequences. To generate plectoneme pinning at a GC-rich sequence, we therefore inserted 8 repeats of a 75 bp-long GC-rich (~60%) insert in the middle of the flat500 sequence. As predicted by the model, the experimental data for this GC-rich curved sequence showed plectoneme pinning (Figure 3i), once more confirming that intrinsic curvature and not AT/GC content is the major determinant for plectoneme pinning.

Transcription start sites localize plectonemes in prokaryotic genomes

Given the success of our physical model for predicting plectoneme localization, it is of interest to examine if the model identifies areas of high plectoneme density in genomic DNA that might directly relate to biological functions. Given that our model associates plectoneme pinning with high curvature, we were particularly interested to see what patterns might associate with specific genomic regions. For example, in prokaryotes, curved DNA has been observed to localize upstream of transcription start sites (TSS) (Kanhere and Bansal, 2005; Olivares-Zavaleta et al., 2006; Pérez-Martín et al., 1994). In eukaryotes, curvature is associated with the nucleosome positioning sequences found near promoters (Tompitak et al., 2017). However, given that our model requires highly curved DNA over long lengths of ~73 bp to induce plectoneme pinning, it was a priori unclear if the local curvature identified at promoter sites is sufficient to strongly influence the plectoneme density.

We first used the model to calculate the plectoneme density profile for the entire E. coli genome, revealing plectonemic hot spots spread throughout the genomic DNA (Figure 4a). Interestingly, we find that a substantial fraction of these hot spots are localized ~100 nucleotides upstream of all the transcription start sites (TSS) associated with confirmed genes in the RegulonDB database (Figure 4b, red) (Gama-Castro et al., 2016). We then performed a similar analysis of several other prokaryotic genomes (Figure 4b) (Cortes et al., 2013; Irla et al., 2015; Papenfort et al., 2015; Zhou et al., 2015). We consistently observe a peak upstream of the TSS, but the size of the peak varied substantially between species, indicating that different organisms rely on sequence-dependent plectoneme positioning to different extents. In one organism (C. crescentus), the signal was too weak to detect at all. To experimentally confirm that these sequences represent plectonemic hot spots, we inserted two of these putative plectoneme-pinning sites from E. coli into template1. Gratifyingly, we indeed observed a strong pinning effect for these sequences in our single-molecule assay (Figure 4c–d).

Plectonemes are enriched at prokaryotic transcription start sites.

(a) The strength of plectoneme pinning calculated for the entire E. coli genome (4,639,221 bp; NC_000913). (b) Mean predicted plectoneme densities around transcription start sites (TSS) in prokaryotic genomes. The density profiles were smoothed over a 51 bp window. (c) Model-predicted and (d) experimentally measured plectoneme densities obtained for two selected TSS sites, TSS-rrsB and TSS-polA, which are E. coli transcription start sites encoding for 16S ribosomal RNA and DNA polymerase I, respectively. For comparison to experimental data, we smoothed the predicted plectoneme densities using a Gaussian filter (FWHM = 1600 bp) that approximates our spatial resolution. (n = 26, and 17 for TSS-rrsB, and TSS-polA, respectively) (e) Strength of plectoneme pinning calculated for the entire 12.1 Mb genome (i.e. all 16 chromosomes placed in sequential order) of S. cerevisiae (NC_001134). For quantitative comparison, we kept the radius of the outer circle the same as in (a). (f) Mean predicted plectoneme densities around the most representative TSS for each gene in several eukaryotic genomes. The density profiles are smoothed over a 51 bp window.

https://doi.org/10.7554/eLife.36557.011

Finally, we extended our analysis to eukaryotic organisms. Again we found plectonemic hotspots that were spread throughout the genome (Figure 4e). When averaging near the TSS (Dreos et al., 2017), we found a diverse range of plectoneme positioning signals (Figure 4f). While one organism (S. cervisiae) showed no detectable plectoneme positioning, most organisms showed both peaks and valleys indicating plectonemes were enriched but also depleted at different regions around the promoter. The features showed a weak periodicity consistent with the reported nucleosome repeat lengths (~150–260 bp) (Jiang and Pugh, 2009).

Discussion

In this study, we reported direct experimental observations as well as a novel basic physical model for the sequence-structure relationship of supercoiled DNA. Our single-molecule ISD technique allowed a systematic analysis of sequences that strongly affect plectoneme formation. To explain the underlying mechanism, we developed a physical model that predicts the probability of plectoneme pinning, based solely on the intrinsic curvature and the flexibility of a local region of the DNA. In the positive supercoiling regime (where no partial duplex melting is expected for the physiological range of tensions and torques), we identified the intrinsic curvature over a ~ 70 bp range as the primary factor that determines plectoneme pinning, while the flexibility alters the mechanics only minimally. Examining full genomes, we found that plectonemes are enriched at promoter sequences in E. coli and other prokaryotes, which suggests a role of genetically encoded supercoils in cellular function. Our findings reveal how a previously unrecognized ‘hidden code’ of intrinsic curvature governs the localization of local DNA supercoils, and hence the organization of the three-dimensional structure of the genome.

For a long time, researchers wondered whether DNA sequence may influence the plectonemic structure of supercoiled DNA. Structural and biochemical approaches identified special sequence patterns such as poly(A)-tracts that indicated plectoneme pinning (Kremer et al., 1993; Laundon and Griffith, 1988; Pfannschmidt and Langowski, 1998; Tsen and Levene, 1997). These early studies suggested that highly curved DNA can pin plectonemes, but the evidence was anecdotal and restricted to a handful of example sequences and it was not possible to establish a general rule for sequence-dependent plectoneme formation. Our high-throughput ISD assay, however, generated ample experimental data that enabled a comprehensive understanding of the underlying mechanism of the sequence-dependent plectoneme pinning.

Our physical modeling reveals that intrinsic curvature is the key structuring factor for determining the three-dimensional structure of supercoiled DNA. In contrast, although perhaps counter-intuitive, we found that the local flexibility is hardly relevant for plectoneme localization. Although highly flexible mismatched single-stranded regions have been shown to be able to act as a preferential position for plectoneme formation (Dittmore et al., 2017; Ganji et al., 2016b), the variations in the flexibility of duplex DNA due to sequence differences seem to produce very minor changes in the pinning probability.

Remarkably, although only the energy required to form the limited tip-loop region of ~73 bp is considered in our modeling, the model is capable of strikingly good qualitative predictions. In occasional cases, the model failed to reproduce the experimental results, giving some false negative predictions. A full statistical mechanical modeling of the plectonemic structures distributed across the DNA molecule should further improve the predictive power and accuracy, but will require significant computational resources and time.

Significant intrinsic curvatures are encoded in genomic DNA, as evident in our scans of both prokaryotic and eukaryotic genomes, which indicates its biological relevance. In support of this idea, an in silico study indeed suggested that curved prokaryotic promoters may control gene expression (Gabrielian et al., 1999). Moreover, early in vivo studies showed that curved DNA upstream to the promoter site affects gene expression levels (Collis et al., 1989; McAllister and Achberger, 1989). These in vivo studies suggested that curved DNA facilitates binding of RNA polymerase, an idea that is further supported by sharply bent DNA structures found around bound RNAP (Rees et al., 1993; Tahirov et al., 2002; ten Heggeler and Wahli, 1985; Yin and Steitz, 2002). In addition to this direct interaction of RNA polymerase and curved DNA, our results suggest an indirect effect, as the same curved DNA can easily pin a plectoneme that can further regulate the transcription initiation and elongation by structural re-arrangement of the promotor and coding regions.

Our analysis of prokaryotic genomes indicates that promoter sequences have evolved local regions with highly curved DNA that promote the localization of DNA plectonemes at these sites. There may be multiple reasons for this. For one, it may help to expose these DNA regions to the outer edge of the dense nucleoid, making them accessible to RNAP, transcription factors, and topoisomerases. Plectonemes may also play a role in the bursting dynamics of gene expression, since each RNAP alters the supercoiling density within a topological domain as it transcribes (Chong et al., 2014; Kouzine et al., 2013), adding or removing nearby plectonemes (Liu and Wang, 1987). In addition, by bringing distant regions of DNA close together, plectonemes may influence specific promoter-enhancer interactions to regulate gene expression (Benedetti et al., 2014). Finally, plectoneme tips may help RNA polymerase to initiate transcription, since the formation of an open complex also requires bending of the DNA (ten Heggeler-Bordier et al., 1992), a mechanism that was proposed as a universal method of regulating gene expression across all organisms (Travers and Muskhelishvili, 2007). The ability of our model to predict how mutations in the promoter sequence alter the plectoneme density opens up a new way to test these hypotheses.

Our analysis of eukaryotic genomes showed a greater diversity of behavior. The spacing of the peaks suggests that plectonemes may play a role in positioning nucleosomes, consistent with proposals that nucleosome positioning may rely on sequence-dependent signals near promoters (Travers et al., 2010). It is also broadly consistent with the universal topological model of plectoneme-RNAP interaction at promoters (Travers and Muskhelishvili, 2007), which proposes that the plectoneme tip forming upstream of the TSS in eukaryotes is positioned by nearby nucleosomes. The plectoneme signal encoded by intrinsic curvature could therefore indirectly position the promoter plectoneme tip by helping to organize these nearby nucleosomes.

In our study, we investigated the sequence-dependent behavior of plectonemes in a positively supercoiled state, although the technique can be extended to study negative supercoiling as well. For negative supercoils, plectoneme pinning can be influenced by both sequence-induced local curvature and local melting, which are hard to disentangle. Furthermore, although theoretical methods have been developed for the sequence dependence of the duplex stability of negatively supercoiled DNA (Benham, 1990; Benham, 1992), torsion-induced melting has been shown to exhibit complicated properties (Vlijm et al., 2015). The model that we have developed for positive supercoils should not be very sensitive to the handedness of supercoiling, since the dinucleotide curvature parameters are not strongly perturbed at these torques. We therefore expect the model to also capture curvature-dependent effects on pinning of negative plectonemes too.

The above findings demonstrate that DNA contains a previously hidden ‘code’ that determines the local intrinsic curvature and consequently governs the locations of plectonemes. These plectonemes can organize DNA within topological domains, providing fine-scale control of the three-dimensional structure of the genome (Le et al., 2013). The model and assay described here make it possible both to predict how changes to the DNA sequence will alter the distribution of plectonemes and to investigate the DNA supercoiling behavior at specific sequences empirically. Using these tools, it will be interesting to explore how changes in this plectoneme code affect levels of gene expression and other vital cellular processes.

Materials and methods

Preparation of DNA molecules of different sequences

Request a detailed protocol

Full sequences of all DNA molecules are given in Supplementary file 2. All DNA molecules except ‘template 2’ in Figure 1 were prepared by ligating four or five DNA fragments, respectively: 1) ‘Cy5-biotin handle’, 2) ‘8.4 kb fragment’, [3) ‘Sequence of Interest’,] 4) ‘11.2 kb fragment’, and 5) ‘biotin handle’ (Figure 1—figure supplement 1b). The ‘Cy5-biotin handle’ and ‘biotin handle’ were prepared by PCR methods in the presence of Cy5-modified and/or biotinylated dUTP (aminoallyl-dUTP-Cy5 and biotin-16-dUTP, Jena Bioscience). The ‘8kb-fragment’ and ‘11 kb fragment’ were prepared by PCR on Unmethylated Lambda DNA (Promega). These fragments were cloned into pCR-XL using the TOPO XL PCR cloning kit (Invitrogen) generating pCR-XL-11.2 and pCR-XL-8.4 (Ganji et al., 2016b). The fragments were PCR amplified and then digested with BsaI restriction enzyme, respectively (Supplementary file 3). The ‘Sequence of Interest’ was made by PCR on different templates. Template two in Figure 1C-black and 1e was made from a digested fragment of an engineered plasmid pSuperCos-λ1,2 with XhoI and NotI-HF (van Loenhout et al., 2012). The digested fragment was further ligated with biotinylated PCR fragments on XhoI side and a biotinylated-Cy5 PCR fragment on the NotI-HF (Supplementary file 4). All the DNA samples were gel-purified before use.

Dual-color epifluorescence microscopy

Request a detailed protocol

Details of our experimental setup are described previously (Ganji et al., 2016a; Ganji et al., 2016b). Briefly, a custom-made epifluorescence microscopy equipped with two lasers (532 nm, Samba, Cobolt and 640 nm, MLD, Cobolt) and an EMCCD camera (Ixon 897, Andor) is used to image fluorescently labeled DNA molecules. For the wide-field, epifluorescence-mode illumination on the sample surface, the two laser beams were collimated and focused at the back-focal plane of an objective lens (60x UPLSAPO, NA 1.2, water immersion, Olympus). Back scattered laser light was filtered by using a dichroic mirror (Di01-R405/488/543/635, Semrock) and the fluorescence signal was spectrally separated by a dichroic mirror (FF635-Di02, Semrock) for the SxO channel and Cy5 channel. Two band-pass filters (FF01-731/137, Semrock, for SxO) and FF01-571/72, Semrock, for Cy5) were employed at each fluorescence channel for further spectral filtering. Finally, the fluorescence was imaged on the CCD camera by using a tube lens (f = 200 mm). All the measurements were performed at room temperature.

Intercalation-induced supercoiling of DNA (ISD)

Request a detailed protocol

A quartz slide and a coverslip were coated with polyethlyleneglycol (PEG) to suppress nonspecific binding of DNA and SxO. 2% of the PEG molecules were biotinylated for the DNA immobilization. The quartz slide and coverslip were sandwiched with a double-sided tape such that a 100 µm gap between the slide and coverslip forms a shallow sample chamber with flow control. Two holes serving as the inlet and outlet of the flow were placed on the slide glass. Typically, a sample chamber holds 10 µl of solution.

Before DNA immobilization, we incubated the biotinylated PEG surface with 0.1 mg/ml streptavidin for 1 min. After washing unbound streptavidin by flowing 100 µl of buffer A (40 mM TrisHCl pH 8.0, 20 mM NaCl, and 0.2 mM EDTA), we flowed the end-biotinylated DNA diluted in buffer A into the sample chamber at a flow rate of 50 µl/min. The concentration of the DNA (typically ~10 pM) was empirically chosen to have an optimal surface density for single DNA observation. Immediately after the flow, we further flowed 200 µl of buffer A at the same flow rate, resulting in stretched, doubly tethered DNA molecules (Figure 1a and Figure 1—figure supplement 1a) of which end-to-end extension can be adjusted by the flow rate. We obtained the DNA lengths of around 60–70% of its contour length (Figure 1—figure supplement 2a), which corresponds to a force range of 2–4 pN (Ganji et al., 2016b). We noted that SxO does not exhibit any sequence preference when binding to relaxed DNA, allowing us to back out the amount of DNA localized within a diffraction-limited spot from the total fluorescence intensity.

After immobilization of DNA, we flowed in 30 nM SxO (S11368, Thermo Fisher) in an imaging buffer consisting of 40 mM Tris-HCl, pH 8.0, 20 mM NaCl, 0.4 mM EDTA, 2 mM trolox, 40 µg/ml glucose oxidase, 17 µg/ml catalase, and 5% (w/v) D-dextrose. Fluorescence images were taken at 100 ms exposure time for each frame. The 640 nm laser was used for illuminated for the first 10 frames (for Cy5 localization), followed by continuous 532 nm laser illumination afterwards. From our previous study, we noted that SxO locally unwinds DNA and extends the contour length (Figure 1—figure supplement 1a), but does not otherwise affect the mechanical properties of the DNA (Ganji et al., 2016b). Based on the same previous work and assuming that each intercalating dye reduces the twist at the local dinucleotide to zero, we estimate that roughly 1 SxO is bound on every 26 base-pairs of DNA. We note that the numbers of plectoneme nucleation and termination events along supercoiled DNA were equal (Figure 1—figure supplement 2b), which is characteristic of a system at equilibrium. Furthermore, we verified that increasing the NaCl concentration from 20 mM to 150 mM NaCl did not result in any significant difference in the observed plectoneme density results, indicating that the plectoneme density is not dependent on the ionic strength (Figure 2—figure supplement 1f).

Data analysis

Request a detailed protocol

Analysis of the data was carried out using custom-written Matlab routines, as explained in our previous report (Ganji et al., 2016b). Briefly, we averaged the first ten fluorescence images to determine the end positions of individual DNA molecules. We identify the direction of the DNA molecules by 640 nm illumination at the same field of view, which identifies the Cy5-labelled DNA end. Then, the fluorescence intensity of the DNA at each position along the length was determined by summing up 11 neighboring pixels perpendicular to the DNA at that position. The median value of the pixels surrounding the molecule was used to correct the background of the image. The resultant DNA intensity was normalized to the total intensity sum of the DNA for each frame to compensate for photo-bleaching of SxO. We recorded more than 300 frames, each taken with a 100 msec exposure time, and built an intensity kymograph by aligning the normalized intensity profiles in time. Supercoiled DNA intensity profiles, that is single lines in the intensity kymograph, were converted to DNA-density profiles by comparing the intensity profile of supercoiled DNA to that of the corresponding relaxed DNA. Specifically, the ratio between the cumulative intensities of all the pixels in the right and the left-hand sides of each position of the DNA was first determined. To find the genomic position (i.e. base pair position) of the peak, we compared this ratio with that obtained after torsional relaxation of the molecule of which the pixel position is the same with the genomic position under the given constant tension (Ganji et al., 2016b). The torsionally relaxed intensity profile was obtained after the plectoneme measurements by increasing the excitation laser power that yielded a photo-induced nick of the DNA.

The position of a plectoneme is identified by applying a threshold algorithm to the DNA density profile. A median of the entire DNA density kymograph was used as the background DNA density. The threshold for plectoneme detection was set at 25% above the background DNA density. Peaks that sustain at least three consecutive time frames (i.e.,≥300 ms) were selected as plectonemes. After identifying all the plectonemes, the probability of finding a plectoneme at each position (250 bp-long segment) along the DNA in base-pair space was calculated by counting the total number of plectonemes at each position (segment) divided by the total observation time. The probability density is then further normalized to its mean value across the DNA molecule to build a plectoneme density. Note that the plectoneme density represents the relative propensity of plectoneme formation at different regions within a DNA molecule, which is insensitive to the length of the DNA as well as the linking number. Typically, more than 20 DNA molecules were measured for each DNA sample and the averaged plectoneme densities were calculated with a weight given by the observation time of each molecule. The analysis code written in Matlab (The MathWorks, Inc.) is freely available from GitHub (Kim, 2018; copy archived at https://github.com/elifesciences-publications/Plectoneme_analysis).

Plectoneme tip-loop size estimation and bending energetics

Request a detailed protocol

An important component of our model is to determine the energy involved in bending the DNA at the plectoneme tip. We first estimate the mean size of a plectoneme tip-loop from the energy stored in an elastic polymer with the same bulk features of DNA. For the simplest case, we first consider a circular loop (360˚) formed in DNA under tension. The work associated with shortening the end-to-end length of DNA to accommodate the loop is

W=rFN

where F is the tension across the polymer, r is the base pair rise (0.334 nm for dsDNA), and N is the number of base pairs. The bending energy is

Ebend=2π2kBTArN

where kB is the Boltzmann constant, A is the bulk persistence length (50 nm for dsDNA). Hence, we obtain an expression for the total energy:

Etotal=rFN+2π2kBTArN=kBT(CN+B360/N)

Taking the derivative of Etotal with respect to N and setting it to zero gives the formula:

N=B360C

Here, the values of the constants are:

C=F12.16pN
B360=2955

So, at 3 pN we get:

N=B360C=109

If the loop at the end of the plectoneme is held at the same length but only bent to form a partial circle, the work needed to accommodate the loop will remain the same but the bending energy will be lower, scaling quadratically with the overall bend angle. For a plectoneme tip, a 240˚ loop is sufficient to match the angle of the DNA in the stem of the plectoneme. The preferred length of a 240˚ loop is therefore:

N=B240C=73

where:

B240=B360(240°360°)2

Physical model predicting the plectoneme density

Request a detailed protocol

A full model must explicitly account for the fact that DNA is not a homogeneous polymer. Instead, each DNA sequence has (1) intrinsic curvature and (2) a variable flexibility. Both 1 and 2 depend on the dinucleotide sequences at each location. Note also that we can bend the DNA along any vector normal to the path of the DNA, which describes a circle spanning the full 360˚ surrounding the DNA strand. We must therefore specify the direction of bending ϕ when calculating the bend energy, and we define ϕ = ϕB to be the bend direction that aligns with the intrinsic curvature.

The intrinsic curvature can be estimated from the dinucleotide content of the DNA (Figure 3a). Several studies have attempted to measure the optimal set of dinucleotide parameters (i.e. tilt, roll, and twist) that most closely predict actual DNA conformations (Balasubramanian et al., 2009; Bolshoy et al., 1991; Morozov et al., 2009; Olson et al., 1998). We find that the parameter set by Balasubramanian et al., produces the closest match to our experimental data when plugged into our model (Balasubramanian et al., 2009). Using these parameters (see Supplementary file 1), we first calculate the winding ground state path traced out by the entire DNA strand. We then determine the intrinsic curvature, θ(N,i), across a given stretch of N nucleotides centered at position i on the DNA by comparing tangent vectors at the start and end of that stretch. Tangent vectors are calculated over an 11 bp window (one helical turn,~3.7 nm). Note that the intrinsic curvature, defined by θ(N,i), also determines the preferred bend direction ϕB.

The flexibility of the DNA also varies with position. The flexibility of the tilt and roll angles between neighboring dinucleotide has been estimated by MD simulations (Lankas et al., 2003). Using these numbers, we can add the roll-tilt covariance matrices for a series of nucleotides (each rotated by the twist angle) to calculate the local flexibility of a given stretch of DNA. The flexibility also depends on the direction of bending. The summed covariance matrix allows us to estimate a local persistence length A(N,I,ϕ).

By combining the local bend angle θ(N,i) and the local persistence length A(N,I,ϕ), we are now able to calculate the energy needed to bend a given stretch of DNA to 240˚. When the DNA is bent in the preferred curvature direction, this bending energy becomes:

Ebend(N,i,ϕB)KBT=2322π2A(N,i,ϕB)0.334nmN1θ(N,i)2402

More generally, we can bend the DNA in any direction, in which case the bending energy can be calculated using the law of cosines:

Ebend(N,i,ϕ)KBT=2322π2A(N,i,ϕ)0.334nmN1+θ(N,i)24022θ(N,i)240cos(ϕϕB)

The first formula is the special case when ϕ = ϕB.

Because both A(N,i, ϕ) and θ(N,i) are sequence dependent, the loop size and bend direction that minimizes the free energy will also be sequence dependent. Rather than trying to find the parameters that give a maximum likelihood at each position along the template, we find that it is more efficient to calculate the relative probabilities of loops spanning a range of sizes and bend directions. We first calculate the energy associated with each loop using:

Etotal(N,i,ϕ)kBT=rFkBTN+Ebend(N,i,ϕ)kBT

We then assign each of these bending conformations a Boltzmann weight:

W(N,i,ϕ)=expEtotal(N,i,ϕ)kBT

Finally, we sum over all the different bending conformations to get the total weight assigned to the formation of a plectoneme at a specific location i on the template:

Wtot(i)=N,ϕW(N,i,ϕ)

Because the direction ϕ is a continuous variable and the length of the loop can range strongly, there are a very large number of bending conformations to sum over. However, because of the exponential dependence on energy, only conformations near the maximum likelihood value in phase space will contribute significantly to the sum. For an isotropic DNA molecule, the maximum likelihood should occur at N = 73 and ϕ = ϕB. We therefore sum over parameter values that span this point in phase space. Our final model sums over eight bending directions (i.e. at every 45°, starting from ϕ = ϕB) and calculates loop sizes over a range from 40 bp to 120 bp at 8 bp increments. We verified that the predictions of the model were stable if we increased the range of the loop sizes considered or increased the density of points sampled in phase space, implying that the range of values used was sufficient.

For a fair comparison to experimental data, all predicted plectoneme densities that are presented were smoothened using a Gaussian filter (FWHM = 1600 bp) that approximates our spatial resolution. The code for the model prediction is freely available from GitHub (Abbondanzieri, 2018; copy archived at https://github.com/elifesciences-publications/Plectoneme_prediction). 

Data availability

All data generated or analysed during this study are included in the manuscript and supporting files. The previously published genome data for E. coli used in Figure 4B can be accessed here http://regulondb.ccg.unam.mx/menu/download/datasets/files/PromoterSet.txt; V. cholerae here http://www.pnas.org/highwire/filestream/618514/field_highwire_adjunct_files/2/pnas.1500203112.sd02.xlsx; B. methanolicus here https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4342826/bin/12864_2015_1239_MOESM2_ESM.xlsx; M. tuberculosis here https://ars.els-cdn.com/content/image/1-s2.0-S2211124713006153-mmc2.xlsx; and C. crescentus here https://doi.org/10.1371/journal.pgen.1004831.s012. The previously published genome data for D. melanogaster, C. elegans, A. thaliana, S. cerevisiae, and S. pombe used in Figure 4E can be accessed using the Eukaryotic Promotor Database (https://epd.vital-it.ch).

The following previously published data sets were used
    1. Foury F
    2. Roganti T
    3. Lecrenier N
    4. Purnelle B
    (1998) NCBI Nucleotide
    ID NC_001224.1. Whole genome sequence data: S. cerevisiase.

References

    1. Gabrielian AE
    2. Landsman D
    3. Bolshoy A
    (1999)
    Curved DNA in promoter sequences
    In Silico Biology 1:183–196.
    1. McAllister CF
    2. Achberger EC
    (1989)
    Rotational orientation of upstream curved DNA affects promoter function in Bacillus subtilis
    The Journal of Biological Chemistry 264:10451–10456.
    1. Pérez-Martín J
    2. Rojo F
    3. de Lorenzo V
    (1994)
    Promoters responsive to DNA bending: a common theme in prokaryotic gene expression
    Microbiological Reviews 58:268–290.

Decision letter

  1. Michael T Laub
    Reviewing Editor; Massachusetts Institute of Technology, United States
  2. Naama Barkai
    Senior Editor; Weizmann Institute of Science, Israel

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your article "DNA sequence encodes the position of DNA supercoils" for consideration by eLife. Your article has been reviewed by three peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by Naama Barkai as the Senior Editor. The reviewers have opted to remain anonymous.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

The work here by Kim et al. describes a technically impressive approach to identify the relationship between DNA sequence and the positioning of DNA supercoils. In agreement with many earlier studies, the authors suggest that plectonemes are positioned by and formed at intrinsically curved DNA sequences. The authors additionally suggest that AT content and sequence flexibility contribute minimally, if at all, to supercoil positioning. Lastly, the authors develop a model and calculate plectoneme density across several genomes. Using this approach, they identify a correlation between curved sequences and transcriptional start site (TSS) location in several organisms and show that two of these sequences can indeed position supercoils in vitro. The assay and many of the underlying concepts have been previously reported, but the systematic study of diverse sequences and direct comparisons to a detailed model based on intrinsic curvature are, to our knowledge, novel. The work complements other recent studies that have explored pinning due to mismatches, general theory, and measurements based on overall DNA extension rather than direct imaging of plectonemes.

Although the reviewers were generally enthusiastic, there were some issues that arose during review and discussion that should be addressed in a revision. The first of these will require additional experiments. The others could, but in some cases may also be dealt with through changes in the text and presentation of the data:

1) While this work shows that AT content is not required for plectoneme formation, the authors' conclusion that the curvature is the predominant driver of supercoil formation is based on a limited number of sequence manipulations, with all insertions tested in a single longer context (template 1) and only one synthetic designed sequence tested after developing the model. A deeper analysis (changing the AT content, length, amount of curvature or eliminating curvature entirely) of their template is important to understand the sequence properties that are necessary for plectoneme formation. Since short 60-90 bp segments are sufficient for supercoil pinning (Kremer et al., 1993) and the authors' predict that a 73 bp region should be sufficient to induce pinning, shorter segments than the ones presented (250-450 bp Figure 3E, 4C) should be directly tested. Working with a GC-rich template, in particular, would improve the generality of the results.

2) Related to the point above, the authors state that "the model qualitatively represented the experimental data for all sequences that were tested", but it appears this comparison is only shown for a limited subset of the sequences. Given that the paper is centrally focused on proposing a general model, it would be appropriate to show comparisons with all sequences, to facilitate evaluation of the model's successes and limitations. In particular, it does not seem that model predictions are shown for about a dozen of the sequence variants for which experiments are reported:

– template 2

– template 1 + seqA, seqB, seqC;

– seqB with base shuffling and with AT-tract shuffling (the latter in particular is a central experiment for the paper)

– template 1 + variable length insertions (.25, 1, 2, 3, 3.9 kb)

3) The Introduction of the manuscript is somewhat misleading. An existing model in the field is that curved sequences induce supercoil formation (many of these papers are cited in the manuscript, see Kremer et al., 1993, Laundon and Griffith, 1988, Pfannschmidt and Langowski, 1998, Tsen and Levene, 1997, and Pavlicek et al., 2004, among others), however the authors do not acknowledge this. The authors are correct that a major shortcoming of these earlier works is their use of phased AT-tracts in generating a curved sequence, however, the introduction should be written to clarify this for the reader. Some of the references are also misattributed. In the second paragraph of the Introduction, Eslami-Mossallam et al., 2016, Pasi and Lavery, 2016, and Wang et al., 2017, are computational studies not biochemical and structural studies, while Kremer et al., 1993, Laundon and Griffith, 1988, Pfannschmidt and Langowski, 1998, and Tsen and Levene, 1997, used biochemical and structural approaches in addition to in silico modeling.

4) How the plectoneme density (e.g., Figure 1D) is calculated was buried in the Materials and methods and the prior paper from this group, making it a little difficult to understand what was done. Given the centrality of plectoneme density plots, it would be useful for the definition, calculation procedure, and motivation for choosing this metric to be explained clearly up front. The Materials and methods mention the calculation of the size of each plectoneme (how much writhe is constrained in the molecule at each position) but it is unclear whether the plectoneme size is ever used in any of the reported analysis. Relatedly, how do AT-content, curvature, and other sequence features affect plectoneme formation/termination kinetics and the amount of writhe per plectoneme? Finally, can the authors quantify sequence effects on plectoneme "slithering" (diffusion)? They mention that the model predicts the expected degree of roughness to explain plectoneme diffusion rates, but do not comment on whether sequence differences cause predictable changes to these rates.

5) The experiments here are performed in the presence of positive supercoiling. There could be important differences under negative supercoiling, relevant to biological contexts. For example, the authors present evidence for intrinsic curvature as a dominant contributor to sequence-dependent plectonemic pinning. An alternative proposal (Matek et al.) is that duplex stability plays an important role, via the formation of plectoneme tip bubbles. If these occur, they are expected to occur more readily in negatively supercoiled DNA, which favors strand separation. Ideally, the authors would provide any evidence addressing whether plectoneme pinning occurs at the same locations/with the same sequence determinants for positively and negatively supercoiled DNA. Clearly the high-throughput assay used here can only interrogate positively supercoiled DNA, so this would require a lower-throughput assay such as side-stretching magnetic tweezers + fluorescence. More generally, it would be interesting to see the predictions of a fully developed local denaturation model compared with experiments – for the conditions reported here, the authors dismiss this kind of model on the basis of the AT-tract shuffling experiment, and do not consider it further. Although additional data would be ideal, and perhaps the authors already have such data. At a minimum, the authors should revise the text to be more cautious about generalizing to negative supercoils, and they should discuss possible differences between negative and positive supercoiling.

6) Can the DNA structures that form plectonemes transition to a change in twist? This question arises in part from a related point or question, which is what the authors envision in terms of plectoneme formation in the context of eukaryotic chromatin? It is difficult to imagine there generally being sufficient "free" DNA to form plectonemes on a chromatinised template in eukaryotic cells. It would be interesting to consider and discuss how much space a plectoneme needs to form.

7) The authors suggest that plectonemes are enriched at TSS. This seems overly speculative unless the authors can include additional information/experiments. For example, the correlation between curved sequences and TSS could be due to promoters evolving curved sequences to increase RNA polymerase binding or activity, rather than plectoneme formation per se. Insertion of a curved sequence upstream of the -35 element has been shown to increase affinity for RNA polymerase in vitro on a linear template in the absence of supercoiling (Nickerson et al., 1995). The authors' argument would be strengthened if they could show plectoneme formation on these sequences in vivo or a relationship between plectoneme formation and RNA polymerase binding/activity in vitro. Unless such data are included (and we recognize it may be beyond the scope of this paper), the authors should at least tone down some statements in this vein, e.g. in the Abstract where it says "…and experimentally verify that plectonemes localize directly upstream…".

8) The experiments here are necessarily performed on DNA with intercalated dye. While the authors argue on the basis of prior work that overall mechanical properties of DNA are minimally affected by the intercalator, can they rule out that sequence-specific observations are affected by the dye? MT tweezers assays in the absence of dye cannot directly report on plectoneme locations, but can detect signatures of strong pinning sequences – could targeted key predictions be tested in this kind of assay? On this point the authors are encouraged to provide new data, if easily obtained, or at least discuss the caveats in a revised manuscript.

9) The authors should discuss the assumptions and approximations involved in predicting plectoneme probabilities solely on the basis of tip-loop energetics. Their model falls short of a complete statistical mechanical treatment that would model partitioning of linking number among plectonemes under imposed constraints across the full molecule, and will for example fail to capture position-dependent effects in which the growth of a plectoneme pinned near the end of the molecule is limited by the available DNA, which can favor the formation of a plectoneme elsewhere in the DNA to absorb more linking number (see Bramarchi, Dittmore, et al., 2018).

10) The authors see no effect of changing the ionic strength on the plectonemic density profile. Can they comment on whether this was expected? Have they analyzed whether ionic strength affects the frequency of observing one vs. multiple plectonemes?

11) In the Abstract, the statement "We… verify that plectonemes localize directly upstream of transcriptional start sites" may easily be misread to imply that this measurement has been made in the context of the chromosome. It would of course increase the impact of the paper if plectonemic pinning and/or functional effects of changing the identified sequences could be detected in cells, but that reasonably lies outside the scope of this paper.

12) For a title, is "DNA sequence encodes the position of plectonemes" more accurate as this is what the authors are measuring?

[Editors' note: further revisions were requested prior to acceptance, as described below.]

Thank you for resubmitting your work entitled "DNA sequence encodes the position of DNA supercoils" for further consideration at eLife. Your revised article has been favorably evaluated by Naama Barkai as the Senior Editor, and three reviewers, one of whom is a member of our Board of Reviewing Editors.

The manuscript has been improved but there are some remaining issues that need to be addressed before acceptance, as outlined below in the individual reviews. As you'll see, two of the reviewers both emphasize the need for greater attention to detail in discussing some of the results and providing appropriate qualifications and relevant caveats. Please carefully and fully address all of the issues raised by the reviewers in a revised manuscript.

Reviewer #1:

The authors have addressed my earlier comments and have significantly improved the manuscript with the addition of new experimental data and further discussion of their results. As noted below, I feel that there are several points in the manuscript that could potentially lead to misunderstandings for the reader and should be clarified.

1) The authors suggest that intrinsic curvature is the major determinant of plectoneme pinning. However, since formation of negative plectonemes is also dependent on the duplex stability, intrinsic curvature may not be the primary mechanism that determines where negative plectonemes are localized. Unless the authors provide experimental data on negatively supercoiled plectonemes, the authors should be careful with their statements such as "intrinsic curvature over a ~70bp range as the primary factor that determines plectoneme pinning…"

2) The authors suggest that their model fails to predict pinning in SeqA, SeqB, and SeqC because of "an insufficient accuracy in the dinucleotide parameters that we adopted from the literature or because the curvature is influenced by interactions spanning beyond nearest-neighbor nucleotides." It is not clear to me that there are not alternative explanations, such as these pinnings being influenced by sequences that are prone to base-flipping or that are able to stabilize twist, or potentially a combination of all the above effects. I think the authors should be more agnostic about the possible mechanisms and acknowledge in the discussion that there are likely other sequence determinants that regulate plectoneme pinning.

3) Given that the authors' model identifies many false-negative pinning sequences, it seems a bit premature to suggest that organisms where their model does not detect strong pinning sequences in the TSS "rely on sequence-dependent plectoneme positioning to different extents". It seems just as likely that these organisms utilize the same as SeqA, SeqB, and SeqC, which is not captured in the authors' model. The authors' argument would be strengthened if they showed in their ISD experiments that C. crescentus or S. cerevisiae promoters indeed do not contain pinning sequences. Additionally, are SeqA, SeqB, and SeqC sequences derived from genomic DNA? If so, this would further support the idea that there are additional mechanisms that regulate pinning in vivo.

4) The sentence "Our data instead suggests that plectoneme pinning depends on the specific distribution of bases, and our shuffled poly(A/T) constructs suggest this distribution must be measured over distances greater than tens of nucleotides" should be modified to reflect the authors' subsequent results. As shown later in this manuscript, plectoneme pinning (1) does not depend on a "specific" distribution of bases, instead being dependent on the curvature of DNA, and (2) occurs on a ~70 bp length scale.

5) Subsection “Systematic examination of plectoneme pinning at various putative DNA sequences”, last two paragraphs. In this section, several hypotheses are presented and discarded and it is not clear to the reader what is the "correct" model (not solely dependent on AT-character, poly(A/T), etc.). It would be easier for the reader if the statements such as "seemingly confirming our hypothesis" are eliminated since these statements are immediately contradicted in the text.

Reviewer #2:

I am satisfied with the revisions.

Reviewer #3:

Abbondanzieri and Dekker and coworkers have improved their manuscript and made their claims easier to evaluate by including experiments on new sequences, presenting additional calculation results, and providing additional context in the text. I favor publication, but I think the authors could do even more to qualify their claims and point out the limitations of the current model – this is an important advance but not a definitive determination of a "plectoneme code".

I appreciate the inclusion of additional model predictions in Figure 3—figure supplement 1. As far as I can tell they still haven't included the complete list we asked for – the variable length AT-rich insertions (series of constructs from Figure 2—figure supplement 1) don't seem to be there, and would be a valuable addition to gauge (albeit anecdotally) the extent of the "false negative" effect seen in SeqA/SeqB/SeqC. The latter false negatives could be emphasized more as a caveat in the text, noting that SeqB was used as the basis of shuffling experiments that the authors relied on to motivate the construction of the model, but in fact the difference between SeqB and its shuffled variants is not captured by the model since the SeqB peak is not predicted.

Clearly, both experimental and theoretical investigation of the negative supercoiling regime is an important future direction for this work. When discussing the limitations of the current model as applied to negative supercoiling, it is important to remind the reader of the biological importance of negative supercoiling, e.g. mesophilic bacterial genomes have strongly net negative superhelical density. When arguing that additional theory is needed to describe sequence-dependent strand separation under tension, it would be appropriate to mention existing theoretical work e.g. from Benham on predicting sites of supercoiling-induced denaturation.

In the text, "a full statistical mechanical modeling of the entire plectonemic structure" should probably be something like "…of the entire DNA molecule" or "…of plectonemic structures distributed across the DNA molecule".

I am surprised that the authors did not expect ionic strength effects – e.g. as noted in other studies low ionic strength can change the distribution of plectonemes, favoring multiple plectonemic domains – but I don't think it is critical to include additional experiments to explore that regime.

https://doi.org/10.7554/eLife.36557.054

Author response

Although the reviewers were generally enthusiastic, there were some issues that arose during review and discussion that should be addressed in a revision. The first of these will require additional experiments. The others could, but in some cases may also be dealt with through changes in the text and presentation of the data:

1) While this work shows that AT content is not required for plectoneme formation, the authors' conclusion that the curvature is the predominant driver of supercoil formation is based on a limited number of sequence manipulations, with all insertions tested in a single longer context (template 1) and only one synthetic designed sequence tested after developing the model. A deeper analysis (changing the AT content, length, amount of curvature or eliminating curvature entirely) of their template is important to understand the sequence properties that are necessary for plectoneme formation. Since short 60-90 bp segments are sufficient for supercoil pinning (Kremer et al., 1993) and the authors' predict that a 73 bp region should be sufficient to induce pinning, shorter segments than the ones presented (250-450 bp Figure 3E, 4C) should be directly tested. Working with a GC-rich template, in particular, would improve the generality of the results.

To verify our model further, we have acquired an extensive set of additional data. Accordingly, we now added these supporting data sets with various sequence-curvature combinations, specifically:

1) 75bp-long AT-rich (41%) highly-curved

2) 73bp-long neutral (50-55%) flat sequence

3) 500bp non-curved GC-rich (60-70%) insert

4) 500bp non-curved GC-rich (60-70%) + multiple 73bp-long GC-rich (60-70%) curved inserts

In these extended sets of model predictions and measurements, plectoneme pinning was observed only when the inserted sequence was curved, regardless of their AT/GC contents, confirming that it is the intrinsic curvature, not merely AT or GC percentage, which is the major determinant of plectoneme pinning. Furthermore, our new observation of strong pinning from a single short (~73 bp) highly curved sequence is consistent with our plectoneme tip-loop size estimation.

We included these results in Figure 3F-I and Figure 3—figure supplement 1.

We note that the acquisition of the additional data was done by Dr. Eugene Kim, who accordingly has now been added as a co-author.

2) Related to the point above, the authors state that "the model qualitatively represented the experimental data for all sequences that were tested", but it appears this comparison is only shown for a limited subset of the sequences. Given that the paper is centrally focused on proposing a general model, it would be appropriate to show comparisons with all sequences, to facilitate evaluation of the model's successes and limitations. In particular, it does not seem that model predictions are shown for about a dozen of the sequence variants for which experiments are reported:

– template 2

– template 1 + seqA, seqB, seqC;

– seqB with base shuffling and with AT-tract shuffling (the latter in particular is a central experiment for the paper)

– template 1 + variable length insertions (.25, 1, 2, 3, 3.9 kb)

We now provide all the model predictions (including all those sequences mentioned in the above list) in a figure supplement (Figure 3—figure supplement 1). This extensive set largely confirms our statement that the model predictions agree with the experimental data. The full in depth analysis revealed that our model can reproduce most of the measured plectoneme densities, indicating that curvature is the major determinant. However, occasionally, we find that the model is too conservative, i.e., while it does a good job avoiding false positives, it suffers from some false negatives. For completeness, we have mentioned this now in the revised manuscript and provide Figure 3—figure supplement 1 which shows examples where an inserted sequence showed a peak in the experiments, whereas the model failed to predict it, which we attribute to the base-base parameters being insufficiently accurate.

3) The Introduction of the manuscript is somewhat misleading. An existing model in the field is that curved sequences induce supercoil formation (many of these papers are cited in the manuscript, see Kremer et al., 1993, Laundon and Griffith, 1988, Pfannschmidt and Langowski, 1998, Tsen and Levene, 1997, and Pavlicek et al., 2004, among others), however the authors do not acknowledge this. The authors are correct that a major shortcoming of these earlier works is their use of phased AT-tracts in generating a curved sequence, however, the introduction should be written to clarify this for the reader. Some of the references are also misattributed. In the second paragraph of the Introduction, Eslami-Mossallam et al., 2016, Pasi and Lavery, 2016, and Wang et al., 2017, are computational studies not biochemical and structural studies, while Kremer et al., 1993, Laundon and Griffith, 1988, Pfannschmidt and Langowski, 1998, and Tsen and Levene, 1997, used biochemical and structural approaches in addition to in silico modeling.

We appreciate the careful examination of our manuscript and the suggestions. We now rephrased and added a statement to properly credit the earlier works and we corrected the misattributed references (Introduction, second paragraph).

4) How the plectoneme density (e.g., Figure 1D) is calculated was buried in the Materials and methods and the prior paper from this group, making it a little difficult to understand what was done. Given the centrality of plectoneme density plots, it would be useful for the definition, calculation procedure, and motivation for choosing this metric to be explained clearly up front. The Materials and methods mention the calculation of the size of each plectoneme (how much writhe is constrained in the molecule at each position) but it is unclear whether the plectoneme size is ever used in any of the reported analysis. Relatedly, how do AT-content, curvature, and other sequence features affect plectoneme formation/termination kinetics and the amount of writhe per plectoneme? Finally, can the authors quantify sequence effects on plectoneme "slithering" (diffusion)? They mention that the model predicts the expected degree of roughness to explain plectoneme diffusion rates, but do not comment on whether sequence differences cause predictable changes to these rates.

We have revised Results and Materials and methods to provide a clearer description on how the plectoneme density is defined (see for example, subsection “DNA sequence favors plectoneme localization at certain spots along supercoiled DNA”, first paragraph, subsection “Data analysis”).

Furthermore, we have removed the statement on the plectoneme size as it is not essential for the data analysis and results in this manuscript.

Finally, we note that our physical model is based on the intrinsic curvature and bending energy which essentially are equilibrium properties. The plectoneme dynamics is an interesting subject in itself (see e.g. our previous paper Van Loenhout et al., 2012), but it is beyond the interest of this manuscript. Henceforth, to keep the paper concise and focused, we did not include the plectoneme nucleation, termination, and diffusion kinetics, as the kinetic data did not give any additional insight into how plectonemes pin to specific sequences on average. This is now noted more clearly at the end of the subsection “DNA sequence favors plectoneme localization at certain spots along supercoiled DNA”.

5) The experiments here are performed in the presence of positive supercoiling. There could be important differences under negative supercoiling, relevant to biological contexts. For example, the authors present evidence for intrinsic curvature as a dominant contributor to sequence-dependent plectonemic pinning. An alternative proposal (Matek et al.) is that duplex stability plays an important role, via the formation of plectoneme tip bubbles. If these occur, they are expected to occur more readily in negatively supercoiled DNA, which favors strand separation. Ideally, the authors would provide any evidence addressing whether plectoneme pinning occurs at the same locations/with the same sequence determinants for positively and negatively supercoiled DNA. Clearly the high-throughput assay used here can only interrogate positively supercoiled DNA, so this would require a lower-throughput assay such as side-stretching magnetic tweezers + fluorescence. More generally, it would be interesting to see the predictions of a fully developed local denaturation model compared with experiments – for the conditions reported here, the authors dismiss this kind of model on the basis of the AT-tract shuffling experiment, and do not consider it further. Although additional data would be ideal, and perhaps the authors already have such data. At a minimum, the authors should revise the text to be more cautious about generalizing to negative supercoils, and they should discuss possible differences between negative and positive supercoiling.

We have revised our text to clarify the limits of our data and model regarding the negative supercoiling (Discussion, eighth paragraph). We can clarify our focus on positive supercoiling as follows:

1) We attempted measurements with the side-stretching magnetic tweezers/fluorescence setup in the past three years but unfortunately encountered technical issues with that which prevented experiments such as those suggested above, and this made us switch to the high-throughput assay employed in this paper.

2) Technically, addressing negative supercoiling is more challenging than positive supercoiling, which is connected to the nature of our assay where the addition of SxO induces positive supercoiling. In the technical paper on the method that we published earlier (Ganji et al., 2016), we reported some data for the plectoneme density for negatively supercoiled DNA (upon removing SxO from pre-stained DNA molecules). While these results were roughly consistent with the data for positive supercoiling, they were noisier and less reproducible from molecule to molecule.

3) In the case of negative supercoils, plectoneme pinning can, as correctly noted by the reviewers, be the convoluted result of curvature and local melting, and separating these two phenomena is not straightforward. As an additional complication, local DNA melting strongly depends on the tension across the DNA (i.e. the end-to-end length in our experiment), which would lead to slightly different plectoneme density profiles from molecule to molecule. Therefore, we limited this study to the pinning of pure plectonemes as set by the flexibility and curvature, which can be best demonstrated in the regime of positive supercoiling.

4) Finally, we note that torque-induced melting is quite different to thermal melting (see for example Vlijm et al., 2015). To our knowledge, it is not yet possible to predict torque-induced melting of DNA directly from sequence. Hence, we did not include the effect of melting in our physical model.

We have revised the text to indicate some of these points and discuss the possible differences between negative and positive supercoiling.

6) Can the DNA structures that form plectonemes transition to a change in twist? This question arises in part from a related point or question, which is what the authors envision in terms of plectoneme formation in the context of eukaryotic chromatin? It is difficult to imagine there generally being sufficient "free" DNA to form plectonemes on a chromatinised template in eukaryotic cells. It would be interesting to consider and discuss how much space a plectoneme needs to form.

Our data are taken in the regime above the buckling transition where the internal twist reaches a maximum and further torsion is all absorbed in plectonemes. Accordingly, the plectonemes cannot transition into twist.

As mentioned in the manuscript, the plectoneme formation in eukaryotic chromatin may be more complicated because the DNA is highly covered by histones. However, we like to point out that histones and other DNA binding proteins do not prevent plectoneme formation. In fact, they can even induce plectoneme formation and pinning by making a sharp bend around their binding site. That may be why the eukaryotic genomes do show less of a propensity of recruiting plectonemes in the promotor region (Figure 4F). As far as the size is considered, only a few hundred base pairs are needed to initiate a plectoneme loop (or ‘curl’ to be more precise). Considering that a chromatin region of a highly transcribing gene is relatively ‘open’ for polymerase binding, it is likely that there are long enough protein-free regions.

7) The authors suggest that plectonemes are enriched at TSS. This seems overly speculative unless the authors can include additional information/experiments. For example, the correlation between curved sequences and TSS could be due to promoters evolving curved sequences to increase RNA polymerase binding or activity, rather than plectoneme formation per se. Insertion of a curved sequence upstream of the -35 element has been shown to increase affinity for RNA polymerase in vitro on a linear template in the absence of supercoiling (Nickerson et al., 1995). The authors' argument would be strengthened if they could show plectoneme formation on these sequences in vivo or a relationship between plectoneme formation and RNA polymerase binding/activity in vitro. Unless such data are included (and we recognize it may be beyond the scope of this paper), the authors should at least tone down some statements in this vein, e.g. in the Abstract where it says "…and experimentally verify that plectonemes localize directly upstream…".

We note that (i) our experiments clearly show a good correspondence between local curvature and plectoneme pinning, and (ii) our bioinformatics data show a good correlation between local curvature and TSS sequence. We do not claim that the promoters evolved curved sequences solely for the purpose of plectoneme pinning, but we find the correspondence striking and noteworthy. We also tested TSS sequences and experimentally found the expected pinning at these sites. We feel that, in vivo experiments to disentangle the likely complex relationship between plectoneme formation and RNA polymerase binding and transcription are beyond the current scope of the manuscript.

We have toned down some statements and rephrased part of the Abstract and Discussion (fifth paragraph) to avoid overclaiming.

8) The experiments here are necessarily performed on DNA with intercalated dye. While the authors argue on the basis of prior work that overall mechanical properties of DNA are minimally affected by the intercalator, can they rule out that sequence-specific observations are affected by the dye? MT tweezers assays in the absence of dye cannot directly report on plectoneme locations, but can detect signatures of strong pinning sequences – could targeted key predictions be tested in this kind of assay? On this point the authors are encouraged to provide new data, if easily obtained, or at least discuss the caveats in a revised manuscript.

We have been keenly aware of possible effects of the intercalating dyes on the mechanical properties of DNA, and accordingly we tested this thoroughly. Indeed, as confirmed experimentally in our magnetic tweezer experiments, Sytox Orange does not in any way perturb the formation of plectonemes on DNA. Furthermore, we observe a homogeneous fluorescence intensity on relaxed DNA, irrespective of the GC content and regardless of how strongly plectonemes are pinned at local positions along the DNA (see Figure 1—figure supplement 1A), indicating that Sytox orange does not show any specificity for high plectoneme pinning sequences or any other specific sequences. Finally, data from the assay reported in this paper were identical to those from an orthogonal approach using side-pulling magnetic tweezers (cf. our previous work, Ganji et al., 2016). All these experiments show that the mechanical properties of DNA are minimally affected, if at all, by the SxO dyes.

9) The authors should discuss the assumptions and approximations involved in predicting plectoneme probabilities solely on the basis of tip-loop energetics. Their model falls short of a complete statistical mechanical treatment that would model partitioning of linking number among plectonemes under imposed constraints across the full molecule, and will for example fail to capture position-dependent effects in which the growth of a plectoneme pinned near the end of the molecule is limited by the available DNA, which can favor the formation of a plectoneme elsewhere in the DNA to absorb more linking number (see Bramarchi, Dittmore, et al., 2018).

Our simple physical model is intended to estimate if a given sequence is capable to pin a plectoneme relatively to other sequences, and it does a good job at that. The model is capable of good qualitative predictions and it requires low computational power. Indeed, better predictions may result from a more extensive full statistical mechanical model, but this will require very significant computational power and time. We have added a discussion on the advantages and limits of our model (Discussion, third and fourth paragraphs).

10) The authors see no effect of changing the ionic strength on the plectonemic density profile. Can they comment on whether this was expected? Have they analyzed whether ionic strength affects the frequency of observing one vs. multiple plectonemes?

Our earlier study (van Loenhout et al., 2012) addressed the question how the ionic strength affects plectoneme dynamics. As pointed out above, however, the plectoneme dynamics is not the focus of this study where we instead investigate the average pinning properties. We therefore did not extensively measure the salt dependence, which we did not expect to have large effects on plectoneme pinning.

11) In the Abstract, the statement "We… verify that plectonemes localize directly upstream of transcriptional start sites" may easily be misread to imply that this measurement has been made in the context of the chromosome. It would of course increase the impact of the paper if plectonemic pinning and/or functional effects of changing the identified sequences could be detected in cells, but that reasonably lies outside the scope of this paper.

We have revised the Abstract to avoid such a misreading.

12) For a title, is "DNA sequence encodes the position of plectonemes" more accurate as this is what the authors are measuring?

We appreciate the suggestion. But we prefer to use the original title “DNA sequence encodes the position of DNA supercoils” because the term “supercoil” is better known to the broad audience than “plectonemes”.

[Editors' note: further revisions were requested prior to acceptance, as described below.]

Reviewer #1:

The authors have addressed my earlier comments and have significantly improved the manuscript with the addition of new experimental data and further discussion of their results. As noted below, I feel that there are several points in the manuscript that could potentially lead to misunderstandings for the reader and should be clarified.

1) The authors suggest that intrinsic curvature is the major determinant of plectoneme pinning. However, since formation of negative plectonemes is also dependent on the duplex stability, intrinsic curvature may not be the primary mechanism that determines where negative plectonemes are localized. Unless the authors provide experimental data on negatively supercoiled plectonemes, the authors should be careful with their statements such as "intrinsic curvature over a ~70bp range as the primary factor that determines plectoneme pinning…"

We agree that we should be careful on this point. Our experiments clearly show that intrinsic curvature is a major determinant of plectoneme pinning for positive supercoiling regime, in which the duplex form of DNA remains intact. This is likely the case as well for negative supercoiling, but the reviewer is correct that things may be more complicated here as local melting may occur. Hence we have rephrased the text to phrase our statement to be more precise (see Discussion, first paragraph).

2) The authors suggest that their model fails to predict pinning in SeqA, SeqB, and SeqC because of "an insufficient accuracy in the dinucleotide parameters that we adopted from the literature or because the curvature is influenced by interactions spanning beyond nearest-neighbor nucleotides." It is not clear to me that there are not alternative explanations, such as these pinnings being influenced by sequences that are prone to base-flipping or that are able to stabilize twist, or potentially a combination of all the above effects. I think the authors should be more agnostic about the possible mechanisms and acknowledge in the discussion that there are likely other sequence determinants that regulate plectoneme pinning.

We thank the reviewer for suggesting the additional mechanisms as potential causes for plectoneme pinning. We now revised the text to not exclude such other possible explanations (subsection “Intrinsic local DNA curvature determines the pinning of supercoiled plectonemes”, second paragraph).

Furthermore, in order to provide the reader with a more quantitative feel for the relative importance of the choice of the dinucleotide parameters, we now added Figure 3—figure supplement 2. This depicts the model predictions for the various sets of dinucleotide parameters that have been reported in the literature. Interestingly, quite pronounced variations can be seen among the results, providing support for our statement that an increased accuracy in the dinucleotide parameters or inclusion of interactions spanning beyond nearest-neighbor nucleotides may in the future improve the model further. Notably we used the Balasubramian data set for our modeling which is most recent and most accurate.

3) Given that the authors' model identifies many false-negative pinning sequences, it seems a bit premature to suggest that organisms where their model does not detect strong pinning sequences in the TSS "rely on sequence-dependent plectoneme positioning to different extents". It seems just as likely that these organisms utilize the same as SeqA, SeqB, and SeqC, which is not captured in the authors' model. The authors' argument would be strengthened if they showed in their ISD experiments that C. crescentus or S. cerevisiae promoters indeed do not contain pinning sequences. Additionally, are SeqA, SeqB, and SeqC sequences derived from genomic DNA? If so, this would further support the idea that there are additional mechanisms that regulate pinning in vivo.

First, we like to point out that our model is remarkably successful for predicting plectoneme pinning on the majority of the sequences examined in the manuscript (15 different sequence inserts plus two ~20kb-long templates) and its deficiency for these few examples does in our view not merit the reviewer’s statement that our model “identifies many false-negative pinning sequences”.

Furthermore, since the genome scan data in Figure 4 are the result from averaging thousands of TSS sites, it is reasonable to conclude that the near-zero pinning probabilities for the C. crescentus and S. cerevisiae are hardly affected by false negative detection of our model.

4) The sentence "Our data instead suggests that plectoneme pinning depends on the specific distribution of bases, and our shuffled poly(A/T) constructs suggest this distribution must be measured over distances greater than tens of nucleotides" should be modified to reflect the authors' subsequent results. As shown later in this manuscript, plectoneme pinning (1) does not depend on a "specific" distribution of bases, instead being dependent on the curvature of DNA, and (2) occurs on a ~70 bp length scale.

We revised the text for better coherency (subsection “Systematic examination of plectoneme pinning at various putative DNA sequences”, last paragraph).

5) Subsection “Systematic examination of plectoneme pinning at various putative DNA sequences”, last two paragraphs. In this section, several hypotheses are presented and discarded and it is not clear to the reader what is the "correct" model (not solely dependent on AT-character, poly(A/T), etc.). It would be easier for the reader if the statements such as "seemingly confirming our hypothesis" are eliminated since these statements are immediately contradicted in the text.

We admit that the logical flow in the section may come across as a bit confusing, as we explore a number of hypotheses that fail. While we were in fact quite careful in writing this section already, we have now further rephrased the text to avoid such confusion (for example, see subsection “Systematic examination of plectoneme pinning at various putative DNA sequences”, second and third paragraphs).

Reviewer #3:

Abbondanzieri and Dekker and coworkers have improved their manuscript and made their claims easier to evaluate by including experiments on new sequences, presenting additional calculation results, and providing additional context in the text. I favor publication, but I think the authors could do even more to qualify their claims and point out the limitations of the current model – this is an important advance but not a definitive determination of a "plectoneme code".

I appreciate the inclusion of additional model predictions in Figure 3—figure supplement 1. As far as I can tell they still haven't included the complete list we asked for – the variable length AT-rich insertions (series of constructs from Figure 2—figure supplement 1) don't seem to be there, and would be a valuable addition to gauge (albeit anecdotally) the extent of the "false negative" effect seen in SeqA/SeqB/SeqC. The latter false negatives could be emphasized more as a caveat in the text, noting that SeqB was used as the basis of shuffling experiments that the authors relied on to motivate the construction of the model, but in fact the difference between SeqB and its shuffled variants is not captured by the model since the SeqB peak is not predicted.

Upon this comment of the reviewer, we now added the predictions on the variable length AT-rich insertions in Figure 3—figure supplement 1, and we also added a statement to more clearly address the caveat of false negative predictions (see Discussion, fourth paragraph).

Furthermore, we note that the sequences used in Figure 2—figure supplement 1 are merely lengthened or shortened versions of SeqA, yet the model predicts an increase of the pinning effect by the lengthened AT-region from 0.25kb to 1kb (see Author response image 1). Also please note that the peak is broadened from 3.0 kb to 3.9kb as the AT-rich insert becomes even larger than the smoothing window (1.6kb). Thus, our model does not entirely miss the pinning mechanism, but somewhat underestimates the effect.

Author response image 1

Clearly, both experimental and theoretical investigation of the negative supercoiling regime is an important future direction for this work. When discussing the limitations of the current model as applied to negative supercoiling, it is important to remind the reader of the biological importance of negative supercoiling, e.g. mesophilic bacterial genomes have strongly net negative superhelical density. When arguing that additional theory is needed to describe sequence-dependent strand separation under tension, it would be appropriate to mention existing theoretical work e.g. from Benham on predicting sites of supercoiling-induced denaturation.

We agree with the reviewer. We now added a statement regarding this interesting early theoretical work on supercoiling-induced denaturation, which will be of relevance for future studies on negative supercoiling (Discussion, eighth paragraph).

In the text, "a full statistical mechanical modeling of the entire plectonemic structure" should probably be something like "…of the entire DNA molecule" or "…of plectonemic structures distributed across the DNA molecule".

We have revised the text accordingly.

I am surprised that the authors did not expect ionic strength effects – e.g. as noted in other studies low ionic strength can change the distribution of plectonemes, favoring multiple plectonemic domains – but I don't think it is critical to include additional experiments to explore that regime.

As we mentioned in the previous rebuttal, the ionic strength mainly affects the dynamics of supercoils, which will merely induce a broadening in the steady-state plectoneme density plots. Hence, the density profiles will not change essentially.

https://doi.org/10.7554/eLife.36557.055

Article and author information

Author details

  1. Sung Hyun Kim

    Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Delft, The Netherlands
    Present address
    Institute of molecular biology and genetics, School of Biological Science, Seoul National University, Seoul, South Korea
    Contribution
    Conceptualization, Data curation, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing—original draft, Writing—review and editing
    Contributed equally with
    Mahipal Ganji
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-9272-7036
  2. Mahipal Ganji

    Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Delft, The Netherlands
    Present address
    1. Department of Physics, Center for Nanoscience, Ludwig Maximilian University, Munich, Germany
    2. Max Planck Institute of Biochemistry, Martinsried, Germany
    Contribution
    Conceptualization, Data curation, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing—original draft, Writing—review and editing
    Contributed equally with
    Sung Hyun Kim
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-8176-3322
  3. Eugene Kim

    Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Delft, The Netherlands
    Contribution
    Data curation, Formal analysis
    Competing interests
    No competing interests declared
  4. Jaco van der Torre

    Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Delft, The Netherlands
    Contribution
    Resources, Investigation, Writing—original draft
    Competing interests
    No competing interests declared
  5. Elio Abbondanzieri

    Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Delft, The Netherlands
    Present address
    Department of Biology, University of Rochester, New York, United States
    Contribution
    Conceptualization, Data curation, Software, Formal analysis, Supervision, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing—review and editing
    For correspondence
    elio.abbondanzieri@rochester.edu
    Competing interests
    No competing interests declared
  6. Cees Dekker

    Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Delft, The Netherlands
    Contribution
    Conceptualization, Supervision, Funding acquisition, Validation, Investigation, Project administration, Writing—review and editing
    For correspondence
    C.Dekker@tudelft.nl
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-6273-071X

Funding

H2020 European Research Council (669598)

  • Cees Dekker

The Netherlands Organization for Scientific Research (The Frontiers of Nanoscience program)

  • Elio Abbondanzieri

H2020 European Research Council (304284)

  • Elio Abbondanzieri

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

The data reported in the paper are available from the corresponding authors upon request. We acknowledge valuable discussions with Helmut Schiessel and Ard Louis. We thank Jacob Kerssemakers for helpful discussion and data analysis codes. This work was supported by the ERC Advanced Grant SynDiv [grant number 669598 to CD]; the Netherlands Organization for Scientific Research (NWO/OCW) [as part of the Frontiers of Nanoscience program], and the ERC Marie Curie Career Integration Grant [grant number 304284 to EA].

Senior Editor

  1. Naama Barkai, Weizmann Institute of Science, Israel

Reviewing Editor

  1. Michael T Laub, Massachusetts Institute of Technology, United States

Version history

  1. Received: March 10, 2018
  2. Accepted: December 6, 2018
  3. Accepted Manuscript published: December 7, 2018 (version 1)
  4. Version of Record published: December 20, 2018 (version 2)

Copyright

© 2018, Kim et al.

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 5,911
    Page views
  • 871
    Downloads
  • 38
    Citations

Article citation count generated by polling the highest count across the following sources: Crossref, Scopus, PubMed Central.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Sung Hyun Kim
  2. Mahipal Ganji
  3. Eugene Kim
  4. Jaco van der Torre
  5. Elio Abbondanzieri
  6. Cees Dekker
(2018)
DNA sequence encodes the position of DNA supercoils
eLife 7:e36557.
https://doi.org/10.7554/eLife.36557

Further reading

    1. Chromosomes and Gene Expression
    2. Genetics and Genomics
    James T Anderson, Steven Henikoff, Kami Ahmad
    Research Article

    Spermatogenesis in the Drosophila male germline proceeds through a unique transcriptional program controlled both by germline-specific transcription factors and by testis-specific versions of core transcriptional machinery. This program includes the activation of genes on the heterochromatic Y chromosome, and reduced transcription from the X chromosome, but how expression from these sex chromosomes is regulated has not been defined. To resolve this, we profiled active chromatin features in the testes from wildtype and meiotic arrest mutants and integrate this with single-cell gene expression data from the Fly Cell Atlas. These data assign the timing of promoter activation for genes with germline-enriched expression throughout spermatogenesis, and general alterations of promoter regulation in germline cells. By profiling both active RNA polymerase II and histone modifications in isolated spermatocytes, we detail widespread patterns associated with regulation of the sex chromosomes. Our results demonstrate that the X chromosome is not enriched for silencing histone modifications, implying that sex chromosome inactivation does not occur in the Drosophila male germline. Instead, a lack of dosage compensation in spermatocytes accounts for the reduced expression from this chromosome. Finally, profiling uncovers dramatic ubiquitinylation of histone H2A and lysine-16 acetylation of histone H4 across the Y chromosome in spermatocytes that may contribute to the activation of this heterochromatic chromosome.

    1. Chromosomes and Gene Expression
    2. Developmental Biology
    Virginia L Pimmett, Mounia Lagha
    Insight

    Imaging experiments reveal the complex and dynamic nature of the transcriptional hubs associated with Notch signaling.