Artificially inserted G-quadruplex DNA secondary structures induce long-distance chromatin activation

Shuvra Shekhar Roy; Sulochana Bagri; Soujanya Vinayagamurthy; Avik Sengupta; Claudia Regina Then; Rahul Kumar; Sriram Sridharan; Shantanu Chowdhury

doi:10.7554/eLife.96216.2

eLife assessment

This valuable study demonstrates that genomic insertion of a G4-containing sequence can be sufficient to induce chromosome loops and alter gene expression. The evidence supporting the conclusions is convincing. Effects were shown by Hi-C as well as qPCR for chromatin modifications and expression, and the specificity of the effects was controlled by mutating the G4-containing sequence or treating with LNA probes to abolish G4 structure formation. The work will be of interest to researchers working on chromatin organization and gene regulation.

https://doi.org/10.7554/eLife.96216.2.sa3

Significance of findings

valuable: Findings that have theoretical or practical implications for a subfield

Strength of evidence

convincing: Appropriate and validated methodology in line with current state-of-the-art

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate).

Abstract

Although G-quadruplex (G4) DNA structures have been implicated in chromosomal looping, this has not been tested directly. Here, to test causal function, an array of G4s, or a control sequence that does not form G4s, was inserted within chromatin in cells. In vivo G4 formation by the inserted G4 array, and not by the control sequence, was confirmed using a G4-selective antibody. Compared to the control insert, we observed a marked increase in the number of 3D chromatin looping interactions from the inserted G4 array. This was evident within the immediate topologically associated domain (TAD) and throughout the genome. Locally, recruitment of enhancer histone marks and the transcriptional coactivator p300/acetylated p300 increased at the G4 array, but not at the control insertion. Resulting promoter-enhancer interactions and gene activation were clear up to 5 Mb away from the insertion site. Together, these results show a causal role of G4s in enhancer function and long-range chromatin interactions. Mechanisms of 3D topology are primarily based on DNA-bound architectural proteins that induce or stabilize long-range interactions. The involvement of the underlying intrinsic DNA sequence/structure in 3D looping shown here therefore sheds new light on how long-range chromosomal interactions might be induced or maintained.

Introduction

G-quadruplexes (G4s), non-canonical DNA secondary structures with quartets of Guanines bonded by Hoogsteen base pairing, are instrumental in regulating gene expression (Sengupta et al., 2020; Varshney et al., 2020). G4s were primarily observed to be able to regulate gene expression when present around transcription start sites (TSSs) (Huppert and Balasubramanian, 2007; Rawal et al., 2006; Verma et al., 2008). G4s can regulate gene expression by directly regulating recruitment of transcription factors and RNA polymerase or via alteration of DNA accessibility by modulating the epigenetic state of the gene promoters (Hussain et al., 2017; Kumar et al., 2011; Lago et al., 2021; Mukherjee et al., 2019; Saha et al., 2017; Sharma et al., 2021; Varshney et al., 2020). Recent studies have implicated the role of G4s in long-distance gene regulation (Robinson et al., 2021).

High-throughput chromosome conformation capture techniques reveal that specific regions of the human genome interact in three dimensions (3D) via chromatin looping and formation of topologically associated domains (TADs) (Bonev and Cavalli, 2016; Denker and De Laat, 2016; Roy et al., 2018). Interestingly, recent computational studies observed enrichment of G4s in TAD boundaries along with higher enrichment of architectural proteins like CTCF and cohesin (Hou et al., 2019). Further, multiple studies noted that the presence of G4s correlated with enhancer histone marks like H3K27Ac and H3K4Me1, and with predominantly open chromatin regions (Calo and Wysocka, 2013; Hou et al., 2021; Shlyueva et al., 2014).

Although these studies implicate G4s in long-range interactions and/or enhancer function, this was not directly tested. Here we asked if G4 structures might directly alter 3D chromatin and affect long-range interactions, including the epigenetic state of chromatin. To address this, we inserted an array of G4s into an isolated locus devoid of G4-forming sequences using CRISPR-Cas9 genome editing. To evaluate the specific function of G4s, a similar sequence of identical length but devoid of G4-forming capability was introduced. Using this pair of cell lines, we observed that insertion of G4s specifically led to the recruitment of enhancer histone marks and increased expression of genes in a 10 Mb window. 3C and Hi-C results showed induced long-range interactions throughout the genome affecting topologically associated domains (TADs) that were specifically due to the incorporated G4s, and not found in the case of the control insertion.

Results

Insertion of an array of G4s in an isolated locus

First, we sought to insert an array of G4s in a relatively isolated locus. We looked into Hi-C data from Rao et al., 2014 and identified a region that was markedly isolated with little or no interaction with its surrounding regions (as shown by snapshots of Hi-C interaction matrices obtained using the 3D genome browser (Wang et al., 2018) in Figure S1). In addition, this region was devoid of any G4s in the vicinity (no G4 forming motifs in a ±2.5 kb window). Thereafter we artificially inserted an array of G4 forming sequences (275 bp long) at this region near the 79 millionth position of chromosome 12 (79M in following text, chr12:79872423-79872424, hg19 genome assembly) using CRISPR-Cas9 genome editing (Figure 1A, S2). To study specific effects due to G4s, if any, a control sequence of identical length was inserted in HEK293T cells at the same locus where specific G/Cs necessary for G4 formation were substituted so that G4s are not formed by this sequence (G4-mutated control, Figure 1A, S2); we also ensured that the GC content was minimally affected by the substitutions (72.4% from 76.73%). Homozygous insertion was confirmed by PCR using primers adjacent to the insertion site followed by Sanger sequencing (Figure 1B, S3). The array of G4-forming sequences used for insertion was previously reported to form stable G4s in human cells (Lim et al., 2010; Monsen et al., 2020; Palumbo et al., 2009; Sharma et al., 2021).
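
The motif screen and GC-content check described above can be illustrated with a minimal Python sketch. It assumes a simplified canonical motif (four runs of three or more G, separated by loops of 1 to 7 nucleotides), which is a common approximation and not necessarily the criterion the authors applied. The function names are hypothetical, and the sketch is not a substitute for a dedicated tool such as pqsfinder.

```python
import re

# Simplified canonical G4 motif (assumption for illustration only):
# four runs of >=3 G separated by loops of 1-7 nt.
G4_PATTERN = re.compile(r"G{3,}(?:[ACGT]{1,7}G{3,}){3}")
# Same motif on the complementary strand appears as C-runs.
C4_PATTERN = re.compile(r"C{3,}(?:[ACGT]{1,7}C{3,}){3}")


def find_pg4(seq):
    """Return sorted (start, end, strand) tuples for putative G4 motifs."""
    seq = seq.upper()
    hits = [(m.start(), m.end(), "+") for m in G4_PATTERN.finditer(seq)]
    hits += [(m.start(), m.end(), "-") for m in C4_PATTERN.finditer(seq)]
    return sorted(hits)


def is_g4_free(seq):
    """True if no putative G4 motif is found on either strand."""
    return not find_pg4(seq)


def gc_percent(seq):
    """GC content of a sequence, as a percentage."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)
```

In this sketch, a candidate insertion window would be screened with `is_g4_free`, and `gc_percent` would be compared between the G4 array and its mutated control to confirm that the substitutions left GC content largely unchanged.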

An isolated locus was chosen for insertion.
(A) 3D Genome Browser (Wang et al., 2018) snapshots showing the Hi-C interaction matrices from 4 cell lines (NHEK, IMR90, HUVEC and HMEC) of a section of chromosome 12 with chr12:79,870,000-79,875,000 (hg19) locus in the middle (marked by arrows); the chr12:79,870,000-79,875,000 locus has very low interaction with its surrounding loci, indicated by the lack of red dots or very faint red dots.

Insert sequences.
Sequences of the G4 array and the G4-mutated array that were inserted; three or more runs of G/Cs (stem of a potential G4) are shown in bold and the G/Cs (marked in blue) that were changed to T/As (marked in red) in the G4-mutated array are marked.

Insertion confirmed by Sanger sequencing.
Representative snapshots of the Sanger sequencing chromatogram of the reverse strand of the insertion locus PCR product showing some of the specific G/C to T/A substitutions in the insert.

Chromatin epigenetic landscape upon insertion of G4s

To understand how the formation of G4s altered the local chromatin, chromatin immunoprecipitation (ChIP) for different histone marks was done, followed by qRT-PCR using primers spanning the inserted locus. PCR primers were designed with the following in mind: no primer binds to any site of G/C alteration in the mutated control insert; either the forward or the reverse primer lies in the adjacent region, for specificity; the amplicons cover adjacent regions, for studying any effects on chromatin; and the PCRs were optimized keeping in mind the repeats within the inserted sequence. Given these, primer pairs R1-R4 were chosen for further work following optimizations (Figure 2, top panel). To test G4 formation within cells by the G4-array insert sequence, we used the reported G4 antibody BG4 (Hänsel-Hertsch et al., 2016). Using primer pair R2, covering >100 bases of the inserted G4-array or the G4-mutated control, BG4 ChIP followed by qPCR was performed. Significant BG4 binding was clear in the G4-array insert, and not in the G4-mutated insert, demonstrating formation of G4s by the inserted G4-array (Figure S4).

Changes in chromatin upon G4-array insertion.
The top panel shows the positions of the PCR amplicons used in the Histone ChIP experiments. Changes in chromatin-modifying histone modifications in the insert region represented by calculating the ratio of occupancy of different histone marks in the G4-array insert cells over the G4-mutated insert (control) cells: enhancer mark, H3K4Me1 (A); active enhancer/promoter mark, H3K27Ac (B); facultative repressor mark, H3K27Me3 (C); constitutive repressor mark, H3K9Me3 (D) and active promoter mark, H3K4Me3 (E). Mean ± SD (n=3); unpaired, two-tailed t-test (*p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001).

G4 formation analyzed by BG4 ChIP.
BG4 antibody enrichment at the 79M insertion locus in the G4-array and the G4-mutated (control) insert cells. Mean ± SD (n=2); unpaired, two-tailed t-test (*p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001). Primer pair R2 covers >100 bp of the respective inserted regions, as shown in Figure 2 top panel.

We observed a significant increase in H3K4Me1 and H3K27Ac enhancer marks in the G4-array when compared to the G4-mutated control (Figures 2A, B). However, there was no G4-specific change in the presence of chromatin compaction marks, H3K27Me3 and H3K9Me3, or the promoter activation mark H3K4Me3 (Figures 2C, D, E). The G4-dependent recruitment of H3K4Me1 (associated with enhancers (Heintzman et al., 2009, 2007)) and H3K27Ac (associated with active enhancers and promoters (Creyghton, 2010; Heintzman et al., 2009, 2007)) indicated enhancer-like characteristics of the inserted G4s.

Enhancer-like features emerged upon insertion of G4s

We next asked how the insertion of the G4-array influenced the expression of surrounding genes. To understand the distance-dependent gene regulatory impacts of the inserted G4-array, we quantified the mRNA expression of the nearest three genes and of some arbitrarily chosen genes further away, up to 5 megabases (Mb) both upstream and downstream of the insertion site. Notably, the expression of four of the tested genes (PAWR, PPP1R12A, NAV3, and SLC6A15) increased in the G4-array insert compared to the mutated insert control cells (Figure 3A). Based on this enhanced expression, we further tested and observed a somewhat concomitant increase in the recruitment of Ser5 phosphorylated RNA Pol II at the surrounding gene promoters (Figure 3B). Next, we tested if chromosomal looping interactions between the insertion site and the gene promoters were involved in these long-distance effects by using chromosome conformation capture (3C). The 3C assay between the insertion locus and the gene promoters could only be performed up to the NAV3 promoter, 1.6 Mb away. Beyond this distance, there was no significantly detectable PCR amplification of 3C interaction products. The 3C assays revealed that there was a G4-dependent increase in chromosomal looping interactions of the insertion locus with the gene promoters (Figure 3C). These results suggested that the inserted G4-array sequence was acting like an enhancer element.

Insertion of the G4-array led to enhancer function.
(A) Long-range G4-dependent changes in mRNA expression are represented by calculating the ratio of expression of surrounding genes in the G4-array insert cells over the G4-mutated insert (control) cells. Top panel shows the positions of the gene promoters with respect to the insertion site. (B) Ratio of Pol2 Phospho-Ser5 Occupancy at the promoters of the surrounding genes in the G4-array insert cells over the G4-mutated insert (control) cells. (C) A UCSC genome browser snapshot showing the 3C looping interactions between the insertion and the surrounding gene promoters. Fold change in 3C looping interactions between the insertion and the surrounding gene promoters in the G4-array insert cells over the G4-mutated insert (control) cells (*p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001). Top panel shows the positions of the PCR amplicons used in the ChIP experiments. The ratio of occupancy of p300 (D) and Ac p300/CBP (E) in the G4-array insert cells over the G4-mutated insert (control) cells. Mean ± SD (n=3); unpaired, two-tailed t-test (*p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001).

To understand the mechanism behind the enhancer-like property of the inserted G4-array we analyzed the recruitment of transcriptional coactivator p300 (Kalkhoven, 2004). There was a relatively modest increase in the recruitment of p300, and a more substantial increase in the recruitment of the more functionally active acetylated p300/CBP, within the G4-array when compared against the mutated control (Figures 3D, E). Together, these results supported the enhancer-like function of the inserted G4-array.

LNA-mediated disruption of the inserted G4s reverses enhancer phenotype

To further establish that the enhancer effects upon the G4 array insertion are due to the formation of G4s, we wanted to see if some of the effects observed could be reversed upon disrupting the inserted structures. Specific Locked Nucleic Acid (LNA) probes were designed to target and disrupt the G4 using a similar approach as shown by others (Cadoni et al., 2021; Chowdhury et al., 2022; Kumar et al., 2008). Three probes were designed with stretches of mostly cytosines (Cs) as LNAs which would hybridize with stretches of guanines (Gs) in the G4-array insert important for the structure formation (Figure 4A; see Methods). We observed that there was a significant decrease in the expression of PPP1R12A and NAV3, two of the genes initially observed to have G4-dependent enhanced expression (Figure 3A), when the G4 array inserted cells were treated with the G4 targeting LNAs (Figure 4B). As expected, a decrease, although modest, in the H3K4Me1 and H3K27Ac enhancer histone modifications was evident within the insert upon LNA treatment (Figures 4C, D). As a control experiment, we next tested whether the LNA probes affected surrounding gene expression in the G4-mutated insert cells. Changes in the expression of the genes were not significant across replicates in the case of G4-mutated insert cells (Figure S5). Together these confirmed that the decrease in the expression of PPP1R12A and NAV3 in the G4-array insert upon LNA treatment was likely specific to G4 disruption. These results indicate that the disruption of the inserted G4s can reverse the enhancer functions observed upon G4 insertion, further supporting the role of the G4 structure in enhancer functions.

Effects of LNA treatment in the G4-mutated insert (control) cells on the expression of surrounding genes, represented by the ratio of expression in the LNA-treated over the vehicle-treated (control) cells. Mean ± SD (n=3); an unpaired, two-tailed t-test showed that the differences were not significant.

Domain-wide increase in looping interactions by G4s

For in-depth analysis of the long-range changes in chromatin architecture upon G4 insertion, we performed genome-wide interaction analysis by Hi-C. First, we compared all the Hi-C contacts originating within a ±10 kb window comprising the G4-array insert, or the G4-mutated control insert. Compared to the mutated control, the G4-inserted locus had more than twice as many genome-wide Hi-C interactions (6390 vs 3133) (Figures 5A, B, Supplementary Table 1). To rule out the possibility of artifacts due to the insertion we independently analyzed Hi-C data in HEK293T cells reported earlier (taken from GSE44267, Zuin et al., 2014). After normalizing for sequencing depth, the number of Hi-C contacts from the same window in HEK293T was relatively similar to the G4-mutated insert control (3968 and 3133 respectively, Figure 5C, Supplementary Table 1). Together, these showed that a substantial number of new long-range interactions were induced throughout the genome due to the inserted G4s, but not from the inserted control sequence.
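
As a rough illustration of the sequencing-depth normalization mentioned above, the following Python sketch rescales a window's contact count by library size. It assumes the simplest scheme, in which window contacts are divided by the total raw contacts of the library and expressed relative to a reference library; the function names and the numbers in the test are hypothetical, and the authors' exact procedure may differ.

```python
def contacts_per_million(window_contacts, total_contacts):
    """Contacts from the window per million total contacts in the library."""
    if total_contacts <= 0:
        raise ValueError("total_contacts must be positive")
    return window_contacts * 1_000_000 / total_contacts


def rescale_to_reference(window_contacts, total_contacts, reference_total):
    """Express a window's contact count at the depth of a reference library.

    Lets a count from one sample (e.g. a public dataset) be compared
    directly with a raw count from the reference sample.
    """
    if total_contacts <= 0:
        raise ValueError("total_contacts must be positive")
    return window_contacts * reference_total / total_contacts
```

With such a rescaling, a window count from a deeper or shallower library can be compared on the same footing as the raw count from the reference sample.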

Insertion of the G4-array increased Hi-C interactions.
Circos plots showing raw Hi-C contacts across the genome originating from a ±10 kb window with the insertion site at the middle across 3 samples: (A) G4-array insert cells, (B) G4-mutated insert (control) cells and (C) HEK293T control cells (taken from GSE44267). (D) Table showing the number of genome-wide raw Hi-C contacts and normalized contacts (normalized against the total raw Hi-C contacts to normalize for the sequencing depth) originating from the ±10 kb window with the insertion site at the middle across the 3 samples.

Comparative analysis of the G4-dependent increase in Hi-C interactions upon insertion.
Table showing, for each of the 3 samples, the number of genome-wide raw Hi-C contacts, the actual number of contacts originating from the ±10 kb window with the insertion site at the middle, and the comparative analysis (mean, standard deviation and z-score) of this number against the contacts from 10,000 random 20 kb windows across the genome.
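
The comparison against random windows can be sketched as follows. This is a minimal Python illustration under stated assumptions: contact anchors are given as sorted positions on a single linear coordinate, random windows are drawn uniformly, and the sample standard deviation is used. The function names and the coordinate model are hypothetical simplifications, not the authors' pipeline.

```python
import bisect
import random
import statistics


def count_in_window(sorted_positions, start, width):
    """Number of contact anchors falling in [start, start + width)."""
    lo = bisect.bisect_left(sorted_positions, start)
    hi = bisect.bisect_left(sorted_positions, start + width)
    return hi - lo


def window_zscore(sorted_positions, genome_length, focal_start,
                  width=20_000, n_random=10_000, seed=0):
    """Compare the focal window's contact count with random windows.

    Returns (observed, background_mean, background_sd, z_score).
    """
    rng = random.Random(seed)  # fixed seed for reproducibility
    background = [
        count_in_window(sorted_positions,
                        rng.randrange(0, genome_length - width + 1), width)
        for _ in range(n_random)
    ]
    mean = statistics.fmean(background)
    sd = statistics.stdev(background)
    observed = count_in_window(sorted_positions, focal_start, width)
    z = (observed - mean) / sd if sd > 0 else float("nan")
    return observed, mean, sd, z
```

A large positive z-score for the window around the insertion site would indicate that it makes more contacts than a typical window of the same size in the same sample.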

For closer analysis, we focused on intrachromosomal Hi-C interaction matrices of the G4-array insert, or the mutated control insert. This was centered on the insertion locus on chromosome 12 (chr12:78,072,423-81,672,423; insertion site marked with arrows in Figures 6A, B). The number of Hi-C interactions in the G4-array insert was clearly enriched compared to the G4-mutated insert control, as expected from the global Hi-C contacts noted above. We noted that while the interactions from the G4-array insert were significantly more numerous, the insertion per se did not affect the overall domain architecture, which was largely similar between the G4 and G4-mutated inserts, as is clear from Figures 6A and B. Further, we asked if the domain architecture was retained from that seen in HEK293T cells (with no insertion). Comparison using reported Hi-C data for the same region from HEK293T cells showed this to be the case, confirming that the chromatin domain architecture remained relatively unchanged on introducing the G4-array or G4-mutated regions (Figure S6).

G4-dependent changes in local chromatin architecture.
Juicebox Hi-C matrices showing Hi-C contacts in the (A) G4-array insert cells, (B) G4-mutated insert (control) cells in a 3.6 Mb region of chromosome 12 with the insertion site at the middle of the matrices. The arrows at the top of the Hi-C matrices indicate the site of insertion. (C) Juicebox Hi-C matrix showing normalized Hi-C contacts in the G4-array insert cells over the G4-mutated insert (control) cells as a heatmap. The region of interest (i.e., interactions associated with the immediate vicinity of the insert) is marked with a box. The arrow at the top of the Hi-C matrix indicates the site of insertion. (D) A line histogram displaying the differences in interaction frequency across G4-array insert cells and G4-mutated insert (control) cells in regions up to 100 kb away from the insertion site. As seen, interactions downstream of the insertion site are more enriched than those upstream in the G4-array insert cells as compared to the G4-mutated control. (E) Circos plot showing differential interactions (fold enrichment >=2) originating from a ±100 kb window with the insertion site at the middle, in the G4-array insert cells over the G4-mutated insert (control) cells. (F) UCSC genome browser snapshot showing the more significant differential interactions (fold enrichment >=2, interaction reads >20) originating from a ±50 kb window with the insertion site at the middle, in the G4-array insert cells over the G4-mutated insert (control) cells. The color intensity of the arcs indicating the interacting bins is proportional to the fold enrichment. Density of potential G4 motifs (per 10 kb) shown in lower panel; G4-forming sequences identified using pqsfinder (Hon et al., 2017); interaction regions marked in red at the bottom of lower panel.

The chromosomal architecture of the insertion locus in the G4-array insert cells is broadly similar to uninserted cells except for the increase in looping interactions.
Comparison of Hi-C contact matrices around the insertion site in the G4-array insert cells (A) and the HEK293T control cells (taken from GSE44267) (B) shows that the broad chromatin organization is conserved. The TADs appear to be otherwise unaltered upon insertion.

To evaluate the effect of G4s in more detail, we plotted a Hi-C heatmap to show the enhanced or reduced (differential) contacts in the G4-array insert compared to the G4-mutated insert control cells (Figure 6C; relatively enriched/reduced contacts in the G4-array insert w.r.t. the G4-mutated insert plotted in red or blue, respectively; using Juicebox for analysis). This clearly showed that the G4-array induced significantly more Hi-C interactions; interestingly this was particularly evident in the downstream regions. For a closer analysis, we mapped the interaction frequency in a ±100 kb window centered on the insertion site. This clearly showed the difference in the number of interactions between the upstream regions vis-a-vis the region downstream of the insertion (Figure 6D).

To further confirm this, we used an independent Hi-C analysis method, HOMER (Hypergeometric Optimization of Motif EnRichment, Heinz et al., 2018), to compute the enhanced/reduced long-range interactions in the G4-array insert compared to the control G4-mutated insert. Differential analysis using HOMER showed that the inserted locus induced a significantly higher number of interactions in the case of the G4-array insert relative to the control G4-mutated case (Figure 6E). When we plotted the significantly different chromosomal interactions with a minimum of 20 interaction reads, it was again clear that the number of interactions with the G4-array insertion region was significantly enhanced in the downstream region relative to the upstream region (Figure 6F).
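
The thresholds used for the differential interactions (fold enrichment of at least 2, more than 20 interaction reads) and the upstream/downstream comparison can be illustrated with a short Python sketch. It is not HOMER's statistical procedure: the pseudocount, the choice to apply the read threshold to the G4-array sample, and all names here are assumptions made only for illustration.

```python
from dataclasses import dataclass


@dataclass
class BinContact:
    bin_start: int        # genomic start of the interacting bin
    g4_reads: float       # depth-normalized reads, G4-array insert cells
    control_reads: float  # depth-normalized reads, G4-mutated insert cells


def enriched_bins(contacts, min_fold=2.0, min_reads=20, pseudocount=1.0):
    """Keep bins with fold enrichment >= min_fold and reads > min_reads.

    The pseudocount (an assumption) avoids division by zero for bins
    with no control reads.
    """
    kept = []
    for c in contacts:
        fold = (c.g4_reads + pseudocount) / (c.control_reads + pseudocount)
        if fold >= min_fold and c.g4_reads > min_reads:
            kept.append((c.bin_start, fold))
    return kept


def split_by_side(kept, insertion_site):
    """Count enriched bins upstream and downstream of the insertion site."""
    upstream = sum(1 for start, _ in kept if start < insertion_site)
    downstream = sum(1 for start, _ in kept if start >= insertion_site)
    return upstream, downstream
```

Tallying the retained bins on each side of the insertion site gives a simple numerical counterpart to the upstream versus downstream asymmetry described in the text.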

Together these show a clear role of G4s in inducing long-range interactions. A similar sequence devoid of G4-forming capability did not induce such interactions. Furthermore, the overall nature of the TAD was not disturbed, and largely consistent with what is noted in cells with no insertion. Overall, these support that the insertion of G4s induced long-range interactions with minimal organizational changes in the 3D chromatin domain, underlining the molecular role of G4s in the arrangement of 3D chromatin.

A second significant feature was notable at the insertion locus. The number of induced long-range interactions was more significant downstream of the insertion site, compared to the upstream region (Figures 6C, D, F). A close look at the Hi-C contact matrices indicated that the site of insertion was very close and downstream to the TAD boundary (Figures 6A-C). We reasoned that the G4-dependent long-range interactions were largely within the TAD, and limited in the upstream region due to the TAD boundary. This is clearly seen in Figure 6C, akin to an ‘architectural stripe’ displaying that the inserted G4 array had enhanced Hi-C interactions across the domain, thus prominently featured in the downstream regions.

G4-array insertion at a second locus gives enhancer-like functions

Finally, we checked if enhancer-like effects were observed upon insertion of G4 array at another locus. Like the first site of insertion, we first identified an isolated locus devoid of G4s in the vicinity and with low interactions with surrounding regions near the 10 millionth position of chromosome 12 (10M hereafter, chr12:10588429-10588430, hg19; Figure S7). The G4-array, or its G4-mutated (control), sequences were inserted at the 10M locus (Figures 7A, B, S8).

Insertion of the G4-array in another isolated locus and subsequent changes in chromatin and surrounding gene expression.
(A) Schematic showing the insertion of the G4-array and the G4-mutated control at chr12:10,588,429-10,588,430 (hg19). (B) PCR of the insertion locus showing the successful insertion of the 275 bp long insert sequence. The top panel shows the positions of the PCR amplicons used in the Histone ChIP experiments. Changes in chromatin-modifying histone modifications in the insert region represented by calculating the ratio of occupancy of different histone marks in the G4-array insert cells over the G4-mutated insert (control) cells: enhancer mark, H3K4Me1 (C); active enhancer/promoter mark, H3K27Ac (D); facultative repressor mark, H3K27Me3 (E); constitutive repressor mark, H3K9Me3 (F) and active promoter mark, H3K4Me3 (G). (H) Long-range G4-dependent changes in mRNA expression are represented by calculating the ratio of expression of surrounding genes in the G4-array insert cells over the G4-mutated insert (control) cells. The panel above shows the positions of the gene promoters with respect to the insertion site. Mean ± SD (n=3); unpaired, two-tailed t-test (*p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001).

Another isolated locus was chosen for insertion.
(A) 3D Genome Browser (Wang et al., 2018) snapshots showing the Hi-C interaction matrices from 4 cell lines (NHEK, IMR90, HUVEC and HMEC) of a section of chromosome 12 with chr12:10585000-10590000 (hg19) locus in the middle (marked by arrows); the chr12:10585000-10590000 locus has low interaction with its surrounding loci, indicated by the lack of red dots or very faint red dots.

G4 formation analyzed by BG4 ChIP.
BG4 antibody enrichment at the 10M insertion locus in the G4-array and the G4-mutated (control) insert cells. Mean ± SD (n=2); unpaired, two-tailed t-test (*p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001). Primer pair R2 covers >100 bp of the respective inserted regions, as shown in Figure 7 top panel.

As for the 79M locus, to validate intracellular G4 formation and study the chromatin state at the inserted locus, PCR primers were designed keeping multiple points in mind (as described above). Here, for testing formation of G4 at the 10M insertion, we used primer pair R2 (scheme for 10M shown in Figure 7 top panel), covering >100 bases of the inserted G4-array, or the G4-mutated control. BG4 ChIP-qPCR validated formation of intracellular G4s within the G4 array, and not the G4-mutated control sequence (Figure S8). Next, we checked for changes in chromatin and the surrounding gene expression due to G4 formation. A relative increase in the H3K4Me1 and H3K27Ac enhancer marks in the G4-array was evident compared to the G4-mutated control (Figures 7C, D), consistent with earlier observations following G4 insertion at the 79M locus (Figures 2A, B). We noticed, however, that the enhanced levels of H3K27Ac were not as marked as at the 79M locus. On the other hand, interestingly, a relative increase in the H3K27Me3 repressor mark compared to the control mutated-G4 insert was seen, particularly at the downstream end of the insertion locus (Figure 7E). There was no G4-specific change in the presence of the chromatin compaction mark H3K9Me3, or the promoter activation mark H3K4Me3 (Figures 7F, G). As expected from earlier observations and the enhancer histone marks, there was a G4-dependent increase in the expression of surrounding genes KLRC2, KLRC1 and NTF3; the exception was PTPRO, which had reduced expression (Figure 7H). Taken together, G4-specific chromatin changes were evident at the 10M locus, consistent with the 79M locus. Notable variations, however, must be pointed out: the presence of the H3K27Me3 repressor histone mark along with the H3K27Ac/H3K4Me1 enhancer histone marks indicates a poised enhancer-like state, as described earlier (Calo and Wysocka, 2013). These observations suggest that the impact of G4 formation on chromatin is likely context-specific, that is, dependent on the chromatin state of the adjacent regions.

Discussion

To directly test if G4s affect long-range chromatin organization we artificially inserted an array of G4s in the chromatin. Hi-C experiments clearly showed an enhanced number of cis- and trans-chromosomal long-range interactions emanating from the introduced G4s. This was G4-specific because a similar sequence devoid of G4-forming capability introduced at the same site did not result in enhanced interactions. Furthermore, interestingly, most new long-range interactions following G4 incorporation were downstream from the site of insertion. This is likely because the G4 insertion locus was proximal to the upstream TAD boundary thereby restricting most new interactions to the downstream regions within the TAD (Figures 5, 6).

The insertion of the G4 array led to enhanced expression of genes up to 5 Mb away compared to cells with the G4 mutated control insertion. Furthermore, there was enrichment in the H3K4Me1 and H3K27Ac enhancer histone marks, along with recruitment of transcriptional coactivator p300 and more prominently the functionally active acetylated p300/CBP. This was clearly due to the introduction of G4s and not found upon the introduction of the G4-mutated control sequence. The enhancer marks were relatively reduced, although not markedly, when the inserted G4s were specifically disrupted (Figure 4).

The enhancer-like effects observed upon G4 insertion at the 79M locus were largely replicated when the same G4-array was inserted in another locus (10M). Interestingly, the G4-dependent increase in H3K27Me3 at the 10M locus, in addition to the enhancer mark H3K4Me1, supports formation of a poised enhancer-like state, as described earlier (Calo and Wysocka, 2013). Although both the inserted G4 regions induced enhancer-like chromatin, notable context-specific influences, likely due to the chromatin state at the regions adjacent to the insertion locus, were evident. Taken together, the findings here directly support the function of G4s as enhancer-like elements and as factors that enhance long-range chromatin interactions. It is possible that such interactions are also contextually dependent on the type of G4 structure, in addition to the adjacent sequence context, and further studies will be necessary to elucidate these.

To ensure that the observed effects were from intracellular G4 formation we accounted for the following while designing the experiments. First, we introduced an array of G4s and confirmed in vivo G4 formation; the inserted sequence was from the hTERT promoter region with multiple arrayed G4s (Lim et al., 2010; Monsen et al., 2020; Palumbo et al., 2009). Second, we selected an insertion locus that was otherwise devoid of intrinsic G4s in a ±2.5 kb window. Third, the selected insertion locus was relatively sparse in long-range interactions. Fourth, we independently inserted a sequence of identical length (and similar GC%) which does not form G4s at the same locus (G4-mutated control). All results were compared to the G4-mutated insertion. Although the introduced mutations for the control sequences were minimal, the impacts of such mutations on the binding of specific transcription factors associated with the sequence, particularly SP1, reported to bind G4s (Raiber et al., 2012), cannot be ruled out.

Existing literature shows promoter G4s are involved in regulating gene expression (Huppert and Balasubramanian, 2007; Rawal et al., 2006; Verma et al., 2008). Additionally, G4s have been reported to regulate chromatin epigenetics through both cytosine methylation and histone modifications (Halder et al., 2010; Mao et al., 2018; Sengupta et al., 2020). Previous studies by us further show that promoter G4s regulate gene expression by recruiting histone-modifying regulatory complexes (Hussain et al., 2017; Mukherjee et al., 2019; Saha et al., 2017; Sharma et al., 2021). Here we aimed to study how G4s affect the expression of genes far from their location, and if this was through G4-induced modifications in long-range 3D chromatin interactions.

Multiple studies have correlated the presence of G4s with long-range associations. CTCF, an architectural protein primarily involved in TAD boundary formation, was observed to bind to G4s, and G4 stabilization was noted to enhance CTCF occupancy (Tikhonova et al., 2021). In addition, G4s were noted to be enriched in TAD boundaries and associated with the formation of chromatin loops (Hou et al., 2019). G4s were also found to coincide with open chromatin regions and with H3K27Ac and H3K4Me1 ChIP-Seq peaks, which are markers for transcriptional enhancers (Calo and Wysocka, 2013; Hou et al., 2021; Lyu et al., 2022; Shlyueva et al., 2014). Most of these regions were observed to overlap with annotated enhancers, and promoters regulated by such enhancers were enriched in G4s (Williams et al., 2020). A recent G4 CUT&Tag study further noted G4 formation at active promoters and at both active and poised enhancers (Lyu et al., 2022).

Further, it was proposed that inter-molecular G4 formation between distant stretches of Gs may lead to DNA looping (Hegyi, 2015; Liano et al., 2022). Consistent with this, we noted with interest that the promoters of the four genes (PAWR, PPP1R12A, NAV3 and SLC6A15; Figure 3A) that were activated on long-range interaction with the inserted loci harbour potential G4-forming sequences (pG4) (Figure S9). We further analysed the long-range contact regions shown in Figure 6F, along with the whole locus, for pG4s (Hon et al., 2017). Relative enrichment in pG4s was evident, particularly within the significantly enhanced contact points, at times spreading beyond the interacting region (Figure 6F, lower panel). Together, these observations support G4-induced long-range interactions.

pG4s in the activated gene promoters.
-400 to +100 (w.r.t. TSS) promoter region of the genes whose expression increased upon insertion of the G4-array with the pG4 motifs highlighted in yellow and the TSS highlighted in green. C-rich motifs indicate G4 formation on the complementary strand.
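A pG4 scan of a promoter window can be illustrated with a short sketch. The analysis in the text used pqsfinder (Hon et al., 2017), an R package that tolerates imperfections; the code below is not that tool. It assumes the simpler canonical motif of four runs of at least three Gs separated by loops of 1 to 7 nt, and reports C-rich matches as motifs on the complementary strand. The function name `find_pg4` and the loop-length limits are illustrative assumptions.

```python
import re

# Assumed canonical motif: four G-runs (>= 3 nt) joined by loops of 1-7 nt.
# This is a simplification and not the pqsfinder scoring model.
G4_PATTERN = re.compile(r"G{3,}(?:[ACGT]{1,7}G{3,}){3}")
# The same motif written for C-runs marks a G4 on the complementary strand.
C4_PATTERN = re.compile(r"C{3,}(?:[ACGT]{1,7}C{3,}){3}")


def find_pg4(seq):
    """Return sorted (start, end, strand) tuples for non-overlapping pG4 motifs."""
    seq = seq.upper()
    hits = []
    for pattern, strand in ((G4_PATTERN, "+"), (C4_PATTERN, "-")):
        for match in pattern.finditer(seq):
            hits.append((match.start(), match.end(), strand))
    return sorted(hits)
```

Coordinates are 0-based and half-open within the supplied sequence, so for a -400 to +100 window they would need to be shifted by 400 to express them relative to the TSS.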

The YY1 transcription factor was found to bind to G4s, and dimerization of G4-bound YY1 led to chromatin looping interactions and consequent regulation of target gene expression (Li et al., 2020). A recent study also showed that R-loops, and possibly R-loop-associated G4 formation, are enriched at CTCF binding sites, and that stronger CTCF binding facilitated by G4s promotes chromatin looping (Wulfridge et al., 2023). In addition, it was shown that G4s assist in RNA polymerase II-associated chromatin looping (Yuan et al., 2023). In this context, further work will be required to understand whether and how formation of R-loops or RNA-DNA hybrid G4s (Fay et al., 2017), and/or association of factors like cohesin and CTCF, at the G4-array insertion sites impacts chromatin looping.

In summary, our findings here demonstrate a causal role of G4s in inducing both long-range associations and enhancer function. Findings from a G4-forming stretch inserted at two independent loci illustrate the function of G4s in 3D gene regulation. Together these shed new mechanistic light on how DNA secondary structure motifs directly control the state of 3D chromatin and thereby biological function.

Materials and Methods

Cell lines and Cell Culture Conditions

HEK 293T cells were cultured in Dulbecco’s Modified Eagle’s Medium-High Glucose (DMEM-HG) supplemented with 10% FBS and 1X Anti-Anti (Gibco).

Primary Antibodies

Histone H3 rabbit polyclonal (Abcam ab1791), H3K4Me1 rabbit polyclonal (Abcam ab8895), H3K27Ac rabbit polyclonal (Abcam ab4729), H3K4Me3 mouse monoclonal (Abcam ab1012), H3K27Me3 mouse monoclonal (Abcam ab6002), H3K9Me3 rabbit polyclonal (Abcam ab8848), p300 rabbit monoclonal (CST 54062), Ac-p300/CBP rabbit polyclonal (CST 4771), BG4 antibody (MABE917).

Genomic Insertions using CRISPR-Cas9 genome editing

The CRISPR-Cas9 genome editing technique was used for the genomic insertions (Ran et al., 2013). For the G4 array insertion, a 275 bp long hTERT promoter region was PCR amplified from HEK 293T genomic DNA. For the insertion of the mutated G4s, a DNA template in which 12 Gs were substituted with Ts was synthesized and cloned into the pUC57 vector by Genscript Biotech Corp (see supplementary methods for detailed sequences). Both the G4 array and the G4-mutated insertion templates were PCR amplified using longer primers in which short homology arms were introduced as primer overhangs for accurate insertion at the 79M locus via homologous recombination (see supplementary methods for primer sequences) (Paix et al., 2017). For cleavage at the 79M locus (chr12:79,872,423-79,872,424 (hg19)), the gRNA sequence, 5’-ACTATGTATGTACATCCAGG-3’, was cloned into pX459 v2.0, a gift from Feng Zhang, which co-expresses the Cas9 protein and the gRNA. For cleavage at the 10M locus (chr12:10,588,429-10,588,430 (hg19)), the gRNA sequence, 5’-ATCCTTCCCTGAATCATCAA-3’, was used. Guide RNAs (gRNAs) were designed using the CRISPOR tool (Haeussler et al., 2016). The gRNA-cloned vector and the insertion donor templates were transfected into HEK 293T cells, and the transfected cells were selected using puromycin, whose resistance gene is present in the pX459 vector. The selected cells were then serially diluted to isolate clones originating from single cells. Many such clones were screened by locus-specific PCR to detect cells with homozygous/heterozygous insertion of the G4 array or the mutated G4 insert. Either primers adjacent to the insertion site or cross primers, i.e., one primer within the insert and another from the adjacent region, were used to screen and identify insertions. With adjacent primers, a shift in the PCR product corresponding to an increase in amplicon size of 275 bp (the size of the insert) indicated successful insertion (see supplementary methods for primer sequences).
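The adjacent-primer screening logic can be sketched as a simple band-size classifier. Only the 275 bp insert size comes from the text; the wild-type amplicon size passed in, the sizing tolerance and the function name `call_genotype` are hypothetical and would depend on the actual primers and gel resolution.

```python
INSERT_BP = 275  # insert length stated in the text


def call_genotype(band_sizes_bp, wt_amplicon_bp, tolerance_bp=20):
    """Classify a clone from band sizes obtained with primers flanking the site.

    wt_amplicon_bp and tolerance_bp are illustrative assumptions.
    """
    def near(size, target):
        return abs(size - target) <= tolerance_bp

    has_wt = any(near(b, wt_amplicon_bp) for b in band_sizes_bp)
    has_insert = any(near(b, wt_amplicon_bp + INSERT_BP) for b in band_sizes_bp)

    if has_wt and has_insert:
        return "heterozygous"
    if has_insert:
        return "homozygous"
    if has_wt:
        return "no insertion"
    return "undetermined"
```

For example, with a hypothetical 500 bp wild-type amplicon, a single band near 775 bp would be called homozygous and bands near both 500 bp and 775 bp heterozygous. Sequencing, as done in the study, remains necessary to confirm the inserted sequence.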

ChIP (Chromatin Immunoprecipitation)

ChIP assays were performed as per the previously reported protocol (Mukherjee et al., 2018). Immunoprecipitation was done using the relevant primary antibodies. IgG was used as the isotype control. Total histone H3 was used as a control for the histone modification ChIPs. Three million cells were harvested, crosslinked with ∼1% formaldehyde for 10 min and lysed. Chromatin was sheared to an average size of ∼250-500 bp using a Bioruptor (Diagenode). 10% of the sonicated fraction was processed as input using phenol–chloroform extraction and ethanol precipitation. ChIP was performed using 3 μg of the respective antibody incubated overnight at 4 °C. Immune complexes were collected using salmon sperm DNA-saturated magnetic protein G Dynabeads (Anti-FLAG M2 magnetic beads for BG4 ChIP) and washed extensively using a series of low-salt, high-salt and LiCl buffers. The Dynabeads were then resuspended in TE (Tris-EDTA, pH 8.1) buffer and treated with proteinase K at 65 °C for ∼5 hrs. DNA was then extracted using phenol-chloroform-isoamyl alcohol. The extracted DNA was incubated overnight at −20 °C with isopropanol, 0.3 M sodium acetate and glycogen, and precipitated by centrifugation. The precipitated pellet was washed with freshly prepared 70% ethanol and resuspended in TE buffer. ChIP DNA was analyzed by the qRT-PCR method (see supplementary methods for primer sequences).
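One common way to quantify ChIP-qPCR data of this kind is the percent-input method, sketched below. The text does not state the exact calculation used, so this is an assumption; only the 10% input fraction and the use of total H3 as a control for histone marks come from the protocol. The sketch also assumes ideal amplification efficiency (a doubling per cycle), and the function names are illustrative.

```python
import math


def percent_input(ct_ip, ct_input, input_fraction=0.10):
    """Percent of input recovered in the IP, assuming a doubling per cycle.

    The input Ct is first adjusted to represent 100% of the chromatin:
    a 10% input sample is log2(10) cycles later than the full amount.
    """
    adjusted_input_ct = ct_input - math.log2(1.0 / input_fraction)
    return 100.0 * 2.0 ** (adjusted_input_ct - ct_ip)


def normalise_to_h3(mark_percent_input, h3_percent_input):
    """Express a histone-mark signal relative to total histone H3 occupancy."""
    return mark_percent_input / h3_percent_input
```

Under these assumptions, an IP with the same Ct as the 10% input sample corresponds to 10% recovery.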

Real-time PCR for Gene (mRNA) expression

Total RNA was isolated using TRIzol® Reagent (Invitrogen, Life Technologies) according to the manufacturer’s instructions. RNA was quantified and cDNA was synthesized using the iScript cDNA Synthesis Kit. Relative transcript expression levels of genes were measured by quantitative real-time PCR using a SYBR Green-based method (see supplementary methods for primer sequences). Average fold change was calculated from the difference in threshold cycles (Ct) between test and control samples. The GAPDH gene was used as the internal control for normalizing the cDNA concentration of each sample.
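The fold-change calculation from Ct differences is commonly done with the 2^(-ΔΔCt) approach, sketched here as an illustration. The text specifies only that Ct differences between test and control samples were used with GAPDH as the internal control; the exact formula, the assumption of ideal amplification efficiency and the function name are assumptions.

```python
def fold_change(ct_gene_test, ct_gapdh_test, ct_gene_control, ct_gapdh_control):
    """Relative expression (test vs control) by the 2^(-ddCt) approach.

    Assumes a doubling of product per cycle for both gene and GAPDH.
    """
    delta_test = ct_gene_test - ct_gapdh_test            # normalise test sample
    delta_control = ct_gene_control - ct_gapdh_control   # normalise control sample
    return 2.0 ** -(delta_test - delta_control)
```

Here the test sample would be the G4-array insert cells and the control the G4-mutated insert cells; a gene detected two cycles earlier in the test sample, after GAPDH normalization, gives a four-fold increase.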

Chromosome Conformation Capture (3C)

The Chromosome Conformation Capture (3C) assay was done as per the reported protocol (Cope and Fraser, 2009) with certain modifications. Briefly, about 5-6 million cells were crosslinked using 1% formaldehyde for 10 minutes and then lysed to isolate the nuclei. Nuclei were digested overnight with HindIII and then ligated in a diluted reaction so that intramolecular ligation was favored. After ligation, the reaction mixture was treated with proteinase K at 65 °C to de-crosslink the DNA, followed by RNase A treatment. DNA was then extracted using phenol-chloroform-isoamyl alcohol. The extracted DNA was incubated overnight at −80 °C with 70% ethanol, 0.1 M sodium acetate and glycogen, and precipitated by centrifugation. The precipitated pellet was washed with freshly prepared 70% ethanol and resuspended in TE buffer. 3C looping interactions were analyzed by the TaqMan qRT-PCR method. For comparison, each interaction frequency was normalized to the interaction between exons 2 and 8 of human α-actin (ACTA2) (Hadjur et al., 2009). See supplementary methods for primer sequences.

G4 disruption using LNA probes

Probes were designed to specifically bind to the regions of genomic DNA containing the G repeats that form the G stems of the G4 structure. Probes containing LNA nucleotides should hybridize with the target with higher stability than that of the G4 structure, thus destabilizing the G4. The probes used to target the G4 array insert were: 5’-C*CCGACCCCTCC*C-3’, 5’-C*CAGCCCCCTCC*G-3’, 5’-C*CCCTCCCCTTC*C-3’. Stretches of three or more Cs are shown in bold, LNA nucleotides within the probes are underlined, and the ends of the probes were protected using phosphorothioate bonds, shown as *. Approximately 0.8 μg of LNA probes (all three mixed in equimolar amounts) were transfected per million cells. Cells were treated with the LNA probes for 108 hours by transfecting thrice with a gap of 36 hours in between. The schematic below shows the LNA probes designed to disrupt the inserted G4 structures along with the inserted G4 array sequence to show the specific sites of hybridization by the LNA probes.

Hi-C

Hi-C was performed using the Arima-HiC Kit as per the manufacturer’s protocol. After the proximally-ligated Hi-C templates were generated, sequencing libraries were prepared using NEBNext Ultra II DNA Library Prep Kit as per the Arima-HiC Kit’s protocol. The quality of the sequencing libraries was cross-checked using TapeStation (Agilent Technologies) and the KAPA Library Quantification Kit (Roche) before proceeding with sequencing using NovaSeq 6000 (Illumina).

Hi-C data analysis

Hi-C reads were mapped to the hg19 human genome and processed with default parameters using Juicer (https://github.com/aidenlab/juicer). Hi-C count matrices were generated at 5 kb, 10 kb, 25 kb, 50 kb, 100 kb and 250 kb resolution using Juicer. Hi-C heatmap figures were rendered using Juicebox (https://github.com/aidenlab/Juicebox/wiki/Download). Hi-C contacts originating in the loci flanking the G4 insertion site were extracted using bedtools (https://bedtools.readthedocs.io/en/latest/). The circos plots were rendered using Circos (http://circos.ca). To identify significant interactions, the data were processed using the analyzeHiC function of HOMER (http://homer.ucsd.edu/homer/). Bins showing 2-fold enrichment in G4 WT over G4 Mut, and vice versa, were retained when filtering contacts for representation on the circos plots.
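The 2-fold filtering step can be illustrated with a minimal sketch. It is not the pipeline used in the study: the input format (a dictionary of contact counts per bin), the pseudocount used to avoid division by zero, and the names `classify_bins`, `G4_WT_enriched` and `G4_Mut_enriched` are all assumptions, and the counts are assumed to be already normalized for sequencing depth.

```python
def classify_bins(wt_counts, mut_counts, fold=2.0, pseudocount=1.0):
    """Keep bins whose contact counts differ by at least `fold` between samples.

    wt_counts / mut_counts: dicts mapping a bin identifier to contact counts
    with the insertion-site locus (hypothetical input format).
    """
    retained = {}
    for bin_id in set(wt_counts) | set(mut_counts):
        wt = wt_counts.get(bin_id, 0) + pseudocount
        mut = mut_counts.get(bin_id, 0) + pseudocount
        ratio = wt / mut
        if ratio >= fold:
            retained[bin_id] = "G4_WT_enriched"
        elif ratio <= 1.0 / fold:
            retained[bin_id] = "G4_Mut_enriched"
        # Bins below the fold threshold in both directions are dropped.
    return retained
```

The retained bins, labeled by direction of enrichment, are the ones that would be passed on for plotting.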

Supporting information

Supplementary Materials and Methods

Data Availability

The sequencing data underlying this article are available in the NCBI Sequence Read Archive, scheduled for release upon publication, and can be accessed using the following link: https://www.ncbi.nlm.nih.gov/sra/?term=PRJNA1048044. The rest of the data are available in the article and its online supplementary material. Further inquiries can be directed to the corresponding author.

Author Contributions

SSR- Methodology, Resources, Investigation, Data curation, Formal analysis, Visualization, Writing – original draft preparation; SB- Resources, Investigation; SV- Investigation; AS- Visualization; CRT- Investigation; RK- Formal analysis, Visualization; SS- Methodology, Formal analysis, Visualization; SC- Conceptualization, Methodology, Writing – review & editing, Supervision, Project administration, Funding acquisition.

Funding

This work was supported by The Wellcome Trust DBT India Alliance (IA/S/18/2/504021).

References

Bonev B, Cavalli G. 2016. Organization and function of the 3D genome. Nature Reviews Genetics 17:661–678. https://doi.org/10.1038/nrg.2016.112
Cadoni E, De Paepe L, Manicardi A, Madder A. 2021. Beyond small molecules: targeting G-quadruplex structures with oligonucleotides and their analogues. Nucleic Acids Res 49:6638–6659. https://doi.org/10.1093/NAR/GKAB334
Calo E, Wysocka J. 2013. Modification of Enhancer Chromatin: What, How, and Why? Mol Cell 49:825–837. https://doi.org/10.1016/j.molcel.2013.01.038
Chowdhury S, Wang J, Nuccio SP, Mao H, Di Antonio M. 2022. Short LNA-modified oligonucleotide probes as efficient disruptors of DNA G-quadruplexes. Nucleic Acids Res 50:7247–7259. https://doi.org/10.1093/NAR/GKAC569
Cope NF, Fraser P. 2009. Chromosome Conformation Capture. Cold Spring Harb Protoc 2009. https://doi.org/10.1101/PDB.PROT5137
Creyghton MP. 2010. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc Natl Acad Sci USA 107:21931–21936.
Denker A, De Laat W. 2016. The second decade of 3C technologies: Detailed insights into nuclear organization. Genes Dev 30:1357–1382. https://doi.org/10.1101/gad.281964.116
Fay MM, Lyons SM, Ivanov P. 2017. RNA G-Quadruplexes in Biology: Principles and Molecular Mechanisms. J Mol Biol 429:2127–2147. https://doi.org/10.1016/J.JMB.2017.05.017
Hadjur S, Williams LM, Ryan NK, Cobb BS, Sexton T, Fraser P, Fisher AG, Merkenschlager M. 2009. Cohesins form chromosomal cis-interactions at the developmentally regulated IFNG locus. Nature 460:410–413. https://doi.org/10.1038/nature08079
Haeussler M, Schönig K, Eckert H, Eschstruth A, Mianné J, Renaud JB, Schneider-Maunoury S, Shkumatava A, Teboul L, Kent J, Joly JS, Concordet JP. 2016. Evaluation of off-target and on-target scoring algorithms and integration into the guide RNA selection tool CRISPOR. Genome Biol 17:1–12. https://doi.org/10.1186/s13059-016-1012-2
Halder R, Halder K, Sharma P, Garg G, Sengupta S, Chowdhury S. 2010. Guanine quadruplex DNA structure restricts methylation of CpG dinucleotides genome-wide. Mol Biosyst 6:2439–2447. https://doi.org/10.1039/c0mb00009d
Hänsel-Hertsch R, Beraldi D, Lensing SV, Marsico G, Zyner K, Parry A, Di Antonio M, Pike J, Kimura H, Narita M, Tannahill D, Balasubramanian S. 2016. G-quadruplex structures mark human regulatory chromatin. Nat Genet 48:1267–1272. https://doi.org/10.1038/ng.3662
Hegyi H. 2015. Enhancer-promoter interaction facilitated by transiently forming G-quadruplexes. Sci Rep 5:1–6. https://doi.org/10.1038/srep09165
Heintzman ND, Hon GC, Hawkins RD, Kheradpour P, Stark A, Harp LF, Ye Z, Lee LK, Stuart RK, Ching CW, Ching KA, Antosiewicz-Bourget JE, Liu H, Zhang X, Green RD, Lobanenkov VV, Stewart R, Thomson JA, Crawford GE, Kellis M, Ren B. 2009. Histone modifications at human enhancers reflect global cell-type-specific gene expression. Nature 459:108–112. https://doi.org/10.1038/nature07829
Heintzman ND, Stuart RK, Hon G, Fu Y, Ching CW, Hawkins RD, Barrera LO, Van Calcar S, Qu C, Ching KA, Wang W, Weng Z, Green RD, Crawford GE, Ren B. 2007. Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome. Nature Genetics 39:311–318. https://doi.org/10.1038/ng1966
Heinz S, Texari L, Hayes MGB, Urbanowski M, Chang MW, Givarkes N, Rialdi A, White KM, Albrecht RA, Pache L, Marazzi I, García-Sastre A, Shaw ML, Benner C. 2018. Transcription Elongation Can Affect Genome 3D Structure. Cell 174:1522–1536. https://doi.org/10.1016/J.CELL.2018.07.047
Hon J, Martínek T, Zendulka J, Lexa M. 2017. pqsfinder: an exhaustive and imperfection-tolerant search tool for potential quadruplex-forming sequences in R. Bioinformatics 33:3373–3379. https://doi.org/10.1093/bioinformatics/btx413
Hou Y, Guo Y, Dong S, Yang T. 2021. Novel Roles of G-quadruplexes on Enhancers in human chromatin. bioRxiv. https://doi.org/10.1101/2021.07.12.451993
Hou Y, Li F, Zhang R, Li S, Liu H, Qin ZS, Sun X. 2019. Integrative characterization of G-Quadruplexes in the three-dimensional chromatin structure. Epigenetics. https://doi.org/10.1080/15592294.2019.1621140
Huppert JL, Balasubramanian S. 2007. G-quadruplexes in promoters throughout the human genome. Nucleic Acids Res 35:406–413. https://doi.org/10.1093/nar/gkl1057
Hussain T, Saha D, Purohit G, Kar A, Kishore Mukherjee A, Sharma S, Sengupta S, Dhapola P, Maji B, Vedagopuram S, Horikoshi NT, Horikoshi N, Pandita RK, Bhattacharya S, Bajaj A, Riou JF, Pandita TK, Chowdhury S. 2017. Transcription regulation of CDKN1A (p21/CIP1/WAF1) by TRF2 is epigenetically controlled through the REST repressor complex. Sci Rep. https://doi.org/10.1038/s41598-017-11177-1
Kalkhoven E. 2004. CBP and p300: HATs for different occasions. Biochem Pharmacol 68:1145–1155. https://doi.org/10.1016/j.bcp.2004.03.045
Kumar N, Patowary A, Sivasubbu S, Petersen M, Maiti S. 2008. Silencing c-MYC expression by targeting quadruplex in P1 promoter using locked nucleic acid trap. Biochemistry 47:13179–13188. https://doi.org/10.1021/BI801064J/SUPPL_FILE/BI801064J_SI_001.PDF
Kumar Pankaj, Yadav VK, Baral A, Kumar Parveen, Saha D, Chowdhury S. 2011. Zinc-finger transcription factors are associated with guanine quadruplex motifs in human, chimpanzee, mouse and rat promoters genome-wide. Nucleic Acids Res. https://doi.org/10.1093/nar/gkr536
Lago S, Nadai M, Cernilogar FM, Kazerani M, Domíniguez Moreno H, Schotta G, Richter SN. 2021. Promoter G-quadruplexes and transcription factors cooperate to shape the cell type-specific transcriptome. Nature Communications 12:1–13. https://doi.org/10.1038/s41467-021-24198-2
Li L, Williams P, Ren W, Wang MY, Gao Z, Miao W, Huang M, Song J, Wang Y. 2020. YY1 interacts with guanine quadruplexes to regulate DNA looping and gene expression. Nat Chem Biol. https://doi.org/10.1038/s41589-020-00695-1
Liano D, Monti L, Chowdhury S, Raguseo F, Di Antonio M. 2022. Long-range DNA interactions: inter-molecular G-quadruplexes and their potential biological relevance. Chemical Communications 58:12753–12762. https://doi.org/10.1039/D2CC04872H
Lim KW, Lacroix L, Yue DJE, Lim JKC, Lim JMW, Phan AT. 2010. Coexistence of two distinct G-quadruplex conformations in the hTERT promoter. J Am Chem Soc 132:12331–12342. https://doi.org/10.1021/ja101252n
Lyu J, Shao R, Kwong Yung PY, Elsässer SJ. 2022. Genome-wide mapping of G-quadruplex structures with CUT&Tag. Nucleic Acids Res 50:e13–e13. https://doi.org/10.1093/NAR/GKAB1073
Mao SQ, Ghanbarian AT, Spiegel J, Martínez Cuesta S, Beraldi D, Di Antonio M, Marsico G, Hänsel-Hertsch R, Tannahill D, Balasubramanian S. 2018. DNA G-quadruplex structures mold the DNA methylome. Nat Struct Mol Biol. https://doi.org/10.1038/s41594-018-0131-8
Monsen RC, DeLeeuw L, Dean WL, Gray RD, Sabo TM, Chakravarthy S, Chaires JB, Trent JO. 2020. The hTERT core promoter forms three parallel G-quadruplexes. Nucleic Acids Res 48:5720–5734. https://doi.org/10.1093/NAR/GKAA107
Mukherjee AK, Sharma S, Chowdhury S. 2019. Non-duplex G-Quadruplex Structures Emerge as Mediators of Epigenetic Modifications. Trends Genet 35:129–144. https://doi.org/10.1016/j.tig.2018.11.001
Mukherjee AK, Sharma S, Sengupta S, Saha D, Kumar P, Hussain T, Srivastava V, Roy SD, Shay JW, Chowdhury S. 2018. Telomere length-dependent transcription and epigenetic modifications in promoters remote from telomere ends. PLoS Genet 14. https://doi.org/10.1371/journal.pgen.1007782
Paix A, Folkmann A, Goldman DH, Kulaga H, Grzelak MJ, Rasoloson D, Paidemarry S, Green R, Reed RR, Seydoux G. 2017. Precision genome editing using synthesis-dependent repair of Cas9-induced DNA breaks. Proceedings of the National Academy of Sciences 201711979. https://doi.org/10.1073/pnas.1711979114
Palumbo SML, Ebbinghaus SW, Hurley LH. 2009. Formation of a unique end-to-end stacked pair of G-quadruplexes in the hTERT core promoter with implications for inhibition of telomerase by G-quadruplex-interactive ligands. J Am Chem Soc. https://doi.org/10.1021/ja902281d
Raiber E-A, Kranaster R, Lam E, Nikan M, Balasubramanian S. 2012. A non-canonical DNA structure is a binding motif for the transcription factor SP1 in vitro. Nucleic Acids Res 40:1499–1508. https://doi.org/10.1093/nar/gkr882
Ran FA, Hsu PD, Wright J, Agarwala V, Scott DA, Zhang F. 2013. Genome engineering using the CRISPR-Cas9 system. Nat Protoc 8:2281–2308. https://doi.org/10.1038/nprot.2013.143
Rao SSP, Huntley MH, Durand NC, Stamenova EK, Bochkov ID, Robinson JT, Sanborn AL, Machol I, Omer AD, Lander ES, Aiden EL. 2014. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159:1665–1680. https://doi.org/10.1016/j.cell.2014.11.021
Rawal P, Kummarasetti VBR, Ravindran J, Kumar N, Halder K, Sharma R, Mukerji M, Das SK, Chowdhury S. 2006. Genome-wide prediction of G4 DNA as regulatory motifs: Role in Escherichia coli global regulation. Genome Res 16:644–655. https://doi.org/10.1101/gr.4508806
Robinson J, Raguseo F, Nuccio SP, Liano D, Di Antonio M. 2021. DNA G-quadruplex structures: more than simple roadblocks to transcription? Nucleic Acids Res 49:8419–8431. https://doi.org/10.1093/NAR/GKAB609
Roy SS, Mukherjee AK, Chowdhury S. 2018. Insights about genome function from spatial organization of the genome. Hum Genomics 12:8. https://doi.org/10.1186/s40246-018-0140-z
Saha D, Singh A, Hussain T, Srivastava V, Sengupta S, Kar A, Dhapola P, Dhople V, Ummanni R, Chowdhury S. 2017. Epigenetic suppression of human telomerase (hTERT) is mediated by the metastasis suppressor NME2 in a G-quadruplex–dependent fashion. Journal of Biological Chemistry 292:15205–15215. https://doi.org/10.1074/jbc.M117.792077
Sengupta A, Roy SS, Chowdhury S. 2020. Non-duplex G-Quadruplex DNA Structure: A Developing Story from Predicted Sequences to DNA Structure-Dependent Epigenetics and Beyond. Acc Chem Res 54:46–56. https://doi.org/10.1021/ACS.ACCOUNTS.0C00431
Sharma S, Mukherjee AK, Roy SS, Bagri S, Lier S, Verma M, Sengupta A, Kumar M, Nesse G, Pandey DP, Chowdhury S. 2021. Human telomerase is directly regulated by non-telomeric TRF2-G-quadruplex interaction. Cell Rep 35:109154. https://doi.org/10.1016/J.CELREP.2021.109154
Shlyueva D, Stampfel G, Stark A. 2014. Transcriptional enhancers: From properties to genome-wide predictions. Nat Rev Genet 15:272–286. https://doi.org/10.1038/nrg3682
Tikhonova P, Pavlova I, Isaakova E, Tsvetkov V, Bogomazova A, Vedekhina T, Luzhin AV, Sultanov R, Severov V, Klimina K, Kantidze OL, Pozmogova G, Lagarkova M, Varizhuk A. 2021. DNA G-quadruplexes contribute to CTCF recruitment. Int J Mol Sci 22:7090. https://doi.org/10.3390/IJMS22137090/S1
Varshney D, Spiegel J, Zyner K, Tannahill D, Balasubramanian S. 2020. The regulation and functions of DNA and RNA G-quadruplexes. Nat Rev Mol Cell Biol 21:459–474. https://doi.org/10.1038/s41580-020-0236-x
Verma A, Halder K, Halder R, Yadav VK, Rawal P, Thakur RK, Mohd F, Sharma A, Chowdhury S. 2008. Genome-wide computational and expression analyses reveal G-quadruplex DNA motifs as conserved cis-regulatory elements in human and related species. J Med Chem 51:5641–5649. https://doi.org/10.1021/jm800448a
Wang Y, Song F, Zhang B, Zhang L, Xu J, Kuang D, Li D, Choudhary MNK, Li Y, Hu M, Hardison R, Wang T, Yue F. 2018. The 3D Genome Browser: A web-based browser for visualizing 3D genome organization and long-range chromatin interactions. Genome Biol 19:1–12. https://doi.org/10.1186/S13059-018-1519-9/FIGURES/5
Williams JD, Houserova D, Johnson BR, Dyniewski B, Berroyer A, French H, Barchie AA, Bilbrey DD, Demeis JD, Ghee KR, Hughes AG, Kreitz NW, McInnis CH, Pudner SC, Reeves MN, Stahly AN, Turcu A, Watters BC, Daly GT, Langley RJ, Gillespie MN, Prakash A, Larson ED, Kasukurthi MV, Huang J, Jinks-Robertson S, Borchert GM. 2020. Characterization of long G4-rich enhancer-associated genomic regions engaging in a novel loop:loop “G4 Kissing” interaction. Nucleic Acids Res 48:5907–5925. https://doi.org/10.1093/nar/gkaa357
Wulfridge P, Yan Q, Rell N, Doherty J, Jacobson S, Offley S, Deliard S, Feng K, Phillips-Cremins JE, Gardini A, Sarma K. 2023. G-quadruplexes associated with R-loops promote CTCF binding. Mol Cell 83:3064–3079. https://doi.org/10.1016/J.MOLCEL.2023.07.009
Yuan J, He X, Wang Y. 2023. G-quadruplex DNA contributes to RNA polymerase II-mediated 3D chromatin architecture. Nucleic Acids Res 51:8434–8446. https://doi.org/10.1093/NAR/GKAD588
Zuin J, Dixon JR, Van Der Reijden MIJA, Ye Z, Kolovos P, Brouwer RWW, Van De Corput MPC, Van De Werken HJG, Knoch TA, Van Ijcken WFJ, Grosveld FG, Ren B, Wendt KS. 2014. Cohesin and CTCF differentially affect chromatin architecture and gene expression in human cells. Proc Natl Acad Sci U S A 111:996–1001. https://doi.org/10.1073/PNAS.1317788111/-/DCSUPPLEMENTAL/SAPP.PDF

Article and author information

Author information

Shuvra Shekhar Roy
CSIR-Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India, Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, 201002, India
ORCID iD: 0000-0001-6005-2767
Sulochana Bagri
CSIR-Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India, Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, 201002, India
Soujanya Vinayagamurthy
CSIR-Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India, Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, 201002, India
Avik Sengupta
Department of Biotechnology, Indian Institute of Technology Hyderabad, Kandi, Telangana, 502284, India
Claudia Regina Then
Cancer Science Institute of Singapore, National University of Singapore, 117599, Singapore
Rahul Kumar
Department of Biotechnology, Indian Institute of Technology Hyderabad, Kandi, Telangana, 502284, India
Sriram Sridharan
Cancer Science Institute of Singapore, National University of Singapore, 117599, Singapore
Shantanu Chowdhury
CSIR-Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India, Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, 201002, India
- To whom correspondence should be addressed. Email: shantanuc@igib.in

Version history

Sent for peer review: February 4, 2024
Preprint posted: February 5, 2024
Reviewed Preprint version 1: April 3, 2024
Reviewed Preprint version 2: June 28, 2024
Version of Record published: August 19, 2024

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.96216. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

