Genetic interactions of G-quadruplexes in humans

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

G-quadruplexes (G4) are alternative nucleic acid structures involved in transcription, translation and replication. Aberrant G4 formation and stabilisation is linked to genome instability and cancer. G4 ligand treatment disrupts key biological processes leading to cell death. To discover genes and pathways involved with G4s and gain mechanistic insights into G4 biology, we present the first unbiased genome-wide study to systematically identify human genes that promote cell death when silenced by shRNA in the presence of G4-stabilising small molecules. Many novel genetic vulnerabilities were revealed opening up new therapeutic possibilities in cancer, which we exemplified by an orthogonal pharmacological inhibition approach that phenocopies gene silencing. We find that targeting the WEE1 cell cycle kinase or USP1 deubiquitinase in combination with G4 ligand treatment enhances cell killing. We also identify new genes and pathways regulating or interacting with G4s and demonstrate that the DDX42 DEAD-box helicase is a newly discovered G4-binding protein.

https://doi.org/10.7554/eLife.46793.001

Introduction

G-quadruplex secondary structures (G4s) form in nucleic acids through the self-association of guanines (G) in G-rich sequences to form stacked tetrad structures (reviewed in Bochman et al., 2012; Rhodes and Lipps, 2015). In the human genome, over 700,000 G4s have been detected in vitro (Chambers et al., 2015). Sequences encoding G4s are enriched in regulatory regions consistent with roles in transcription and RNA regulation (Huppert and Balasubramanian, 2007; Huppert, 2008), and their over-representation in oncogene promoters, such as MYC, KRAS and KIT, suggests that they are important in cancer and are potential therapeutic targets (reviewed in Balasubramanian et al., 2011). Computationally predicted G4s have also been linked to replication origins (Besnard et al., 2012) and telomere homeostasis (reviewed in Neidle, 2010). In the transcriptome, more than 3000 mRNAs have been shown to contain G4 structures in vitro, particularly at 5’ and 3’ UTRs, suggestive of roles in posttranscriptional regulation (Bugaut and Balasubramanian, 2012; Kwok et al., 2016).

G4-specific antibodies have been used to visualise G4s in protozoa (Schaffitzel et al., 2001) and mammalian cells (Biffi et al., 2013; Henderson et al., 2014; Liu et al., 2016). More G4s are detected in transformed versus primary cells, and in human stomach and liver cancers compared to non-neoplastic tissues, supporting an association between G4 structures and cancer (Biffi et al., 2014; Hänsel-Hertsch et al., 2016). More recently, ChIP-seq was used to map endogenous G4 structure formation in chromatin revealing a link between G4s, promoters and transcription (Hänsel-Hertsch et al., 2016). G4s are found predominately in nucleosome-depleted chromatin within promoters and 5’ UTRs of highly transcribed genes, including cancer-related genes and regions of somatic copy number alteration. G4s may therefore be part of a regulatory mechanism to switch between different transcriptional states. At telomeres, tandem G4-repeat structures also may help protect chromosome ends by providing binding sites for shelterin complex components (reviewed in Brázda et al., 2014). As G4 structures can pause or stall polymerases, they must be resolved by helicases to allow replication and transcription to proceed. Several helicases, including WRN, BLM, PIF1, DHX36 and RTEL1, have been shown to unwind G4-structures in vitro (Brosh, 2013; Mendoza et al., 2016), and it is notable that fibroblasts from Werner (WRN) and Bloom (BLM) syndrome patients, who are predisposed to cancer, show altered gene expression that correlates with sites with potential to form G4s (Damerla et al., 2012).

Small molecules that selectively bind and stabilise G4 formation in vitro have been used to probe G4 biological function. G4 ligands, such as pyridostatin (PDS), PhenDC3 and TMPyP4, can reduce transcription of many genes harbouring a promoter G4, including oncogenes such as MYC, in multiple cancer cell lines (Halder et al., 2012; McLuckie et al., 2013; Neidle, 2017). G4-stabilising ligands also interfere with telomere homeostasis by inducing telomere uncapping/DNA damage through the inhibition of telomere extension by telomerase leading to senescence or apoptosis (reviewed in Neidle, 2010). 5’ UTR RNA G4 structures may also be involved in eIF4A-dependent oncogene translation (Wolfe et al., 2014) and their stabilisation by G4-ligands can inhibit translation in vitro (Bugaut and Balasubramanian, 2012). Identification of several RNA G4-interacting proteins (reviewed in Cammas and Millevoi, 2016), including DEAD/DEAH helicases such as DDX3X, and DHX36 (Chen et al., 2018; Herdy et al., 2018) additionally suggests specific roles for G4 structures in RNA.

Some G4-stabilising ligands cause a DNA damage-response (DDR); for example, DNA damage sites induced by PDS in human lung fibroblasts mapped to genomic regions at G4s within several oncogenes including SRC (Rodriguez et al., 2012). Subsequent studies demonstrated that homologous recombination (HR) repair deficiencies can be exploited to selectively kill BRCA1/2-deficient cancer cells with G4 ligands (McLuckie et al., 2013; Zimmer et al., 2016). Recently, this concept has been applied to BRCA1/2-deficient breast cancers using CX-5461, a G4 ligand currently in clinical trials (Xu et al., 2017) (NCT02719977 ClinicalTrials.gov). Overall, these initial studies demonstrate that specific genotypes can be selectively vulnerable to G4-stabilisation and raises the question as to what other genotypes might provide further such opportunities.

We set out to address two main questions (Figure 1): 1) which human genes and cellular pathways interact with G4s and 2) what genetic backgrounds selectively lead to enhanced cell killing in the presence of G4 stabilising ligands? We employed PDS and PhenDC3 as representative G4 ligands as these are chemically and structurally dissimilar, but each shows a broad specificity for different G4 structural variants. Both ligands have been widely used as G4-targeting probes in biophysical (De Cian et al., 2007b; Rodriguez et al., 2008) and biological studies in which they have been shown to impart transcriptional inhibition, telomere dysfunction and replication stalling (De Cian et al., 2007a; Halder et al., 2012; Mendoza et al., 2016).

Figure 1

Download asset Open asset

Strategy identifying genetic vulnerabilities involved with G4 biology.

Genome-wide shRNA silencing combined with G4 structure stabilisation by small molecules identifies genes that when depleted compromise cell viability. Cells are infected with a genome-wide pool of shRNA lentiviruses targeting the protein coding genome followed by G4 ligand treatment to stabilise genomic and/or RNA G4 structures. Two general outcomes are possible: a gene is not required in a G4-dependent process so there is no effect on cell viability (left); or gene silencing results in cell death either due to loss of a direct G4 interaction (e.g. binding/unwinding) or indirectly through gene loss in a G4-dependent pathway (right). In absence of ligand, cells are viable in presence of the shRNA. Dotted boxes highlight genotypes of disease significance for possible G4-based therapies (blue) and genes and biological pathways that involve and/or interact with G4 structures (orange).

https://doi.org/10.7554/eLife.46793.002

Results

Identification of genetic vulnerabilities to G4-ligands via genome-wide screening

An unbiased genome-wide shRNA screen was performed in A375 human melanoma cells to globally evaluate genetic vulnerabilities to G4-ligands and to identify genes and pathways involved with G4-structures (Figure 2A). For this, the pyridine-2,6-bis-quinolino-dicarboxamide derivative, PDS (Rodriguez et al., 2012), and bisquinolinium compound, PhenDC3 (De Cian et al., 2007b) were chosen (Figure 2B). We used the latest generation shERWOOD-Ultramir shRNA pLMN retroviral library, comprising 132,000 shRNAs across 12 randomised pools targeting the protein coding genome, with an average of five optimised hairpins per gene (Figure 2C) (Knott et al., 2014). A375 melanoma cells were used due to their rapid doubling, stable ploidy and success in other shRNA-dropout screens (Sims et al., 2011); they are TP53 wild-type and driven by oncogenic BRAF (V600E) and CDKN2A loss (Forbes et al., 2015). Figure 2D outlines our shRNA screening strategy. To identify shRNAs that are lost between the initial (t0) and final (fF) timepoints, unique 3’-antisense sequences were recovered by PCR and quantified by sequencing. If a gene knockdown compromises cell viability then the associated shRNA will be depleted compared to those targeting non-essential genes: the tF sequence count will be less than t0 thus log₂ fold change (FC, tF/t0) is negative. A pilot using one shRNA pool established that a tF of 15 population doublings can be used to reveal significant G4-ligand-mediated changes [false discovery rate (FDR) ≤ 0.05] in shRNA levels using a ligand concentration resulting in 20% cell death (GI₂₀, see Materials and methods and Figure 2—figure supplement 1 for details).

Figure 2 with 1 supplement see all

Download asset Open asset

shRNA screening pipeline to uncover genetic vulnerabilities to G4 stabilisation.

(A) A G-tetrad with four interacting guanines (left), which stack to form G4 structures (right). (B) Structures of the G4-stabilising small molecule ligands PDS and PhenDC3. (C) Distribution of the numbers of shRNAs targeting each gene, with the average indicated by a red dotted line. (D) Overall screening approach illustrated for one library pool. Plasmids are retrovirally packaged and A375 cells are infected at multiplicity of infection (MOI) of 0.3 (30%). Following antibiotic selection, an initial time point (t0) is harvested and then cells are cultured for ‘n’ population doublings in DMSO, PDS or PhenDC3 before the final time point was harvested (tF).

https://doi.org/10.7554/eLife.46793.003

To understand the complete spectrum of G4 vulnerabilities, we first considered the combined set of sensitivities to PDS and PhenDC3 together. For the whole library, when individual shRNAs are considered 9509 (~7%) G4-ligand-specific hairpins (i.e. those not in DMSO) were found to be depleted (FDR ≤ 0.05; log₂ FC <0, Figure 3A, Supplementary file 1). We then reasoned, for a gene knockdown to have compromised cell growth, that a minimum of either 50% or three shRNA hairpins should be significantly depleted for that gene (median log₂ FC <0). This resulted in the identification of 843 G4 ligand-specific gene knockdowns not present in DMSO (Figure 3B). We then denoted a more stringent preliminary list of 758 G4 sensitisers as those having a median log₂ FC ≤ −1 (Figure 3C). It is reassuring that in this list we independently validated the known G4 sensitisers BRCA1/2, ATRX and HERC2 (McLuckie et al., 2013; Wang et al., 2019; Watson et al., 2013; Wu et al., 2018; Xu et al., 2017; Zimmer et al., 2016; Figure 3D).

Figure 3

Download asset Open asset

Genome-wide screening in A375 cells reveals deficiencies in known G4-associated genes as sensitive to G4-stabilising small molecules.

(**A–C**) Venn diagrams for: (A) significantly differentially expressed individual shRNAs (FDR ≤ 0.05); (B) significantly depleted genes (50% or three hairpins, FDR ≤ 0.05, median log₂FC < 0) following DMSO, PDS and PhenDC3 treatment and (C) Significant PDS and PhenDC3 sensitiser genes not in DMSO and after applying a median log₂FC ≤ −1 cut off. (**D–F**) Tables showing the number of depleted hairpins and median log₂FC values for: (D) known G4 ligand sensitisers, *ATRX, HERC2*, *BRCA1* and *BRCA2,* that are independently validated in our screen; (E) sensitisers annotated with a G4-associated term in GO, UniprotKB or G4IPBD databases and (F) sensitisers identified as G4-related by text-mining showing the associated PolySearch2 algorithm score and summary of the G4 association. Sensitisers are defined as a gene where 50% or three hairpins were significantly differentially expressed (FDR ≤ 0.05) with median log₂FC ≤ −1. See also Supplementary file 1.

https://doi.org/10.7554/eLife.46793.005

We next explored further genes already implicated in G4 biology, but whose deficiency has not yet been linked with any enhanced sensitivity to G4 ligands. For genes annotated with G4-related terms in the UniprotKB, Gene Ontology (GO) and G4IPDB databases (Mishra et al., 2016), an additional eight sensitisers (ADAR, DHX36, DNA2, FUS, MCRS1, RECQL4, SF3B3 and XRN1) were uncovered (Figure 3E). Text-mining with G4 search terms using PolySearch2 on PubMed abstracts and open access full texts (see Materials and methods; Liu et al., 2015b) revealed a further 12 sensitisers arising from our screen including helicases (RTEL1), DDR components (CHEK1, RAD17), transcriptional proteins (POLR1A, CNBP) and replication factors (ORC1, RPA3, TOP1) (Figure 3F).

Within the total 758 G4-sensitiser gene list, we uncovered five significant enriched KEGG pathway clusters (p<0.05): ‘cell cycle’, ‘ribosome’, ‘spliceosome’, ‘ubiquitin-mediated proteolysis’ and ‘DNA replication’ (Figure 4A, Supplementary file 1). Within each cluster are gene targets common to both G4 ligands, as well as genes unique to each ligand. To gain functional insights, enriched GO ‘Biological Process’ and ‘Molecular Function’ terms were determined (Figure 4B; Supplementary file 1) which showed 20 out of 45 of the former and all the latter terms into DNA or RNA classifications, consistent with PDS/PhenDC3 directly binding nucleic acid G4 targets. Furthermore, when protein domains were considered using GENE3D and PFAM databases (Figure 4C), we discovered enrichments in helicase C-terminal domains, RNA recognition motifs including RRM, RBD and RNP domains, and DNA-binding domains including zinc fingers, bZIP motifs and HMG boxes. Consistent with the ubiquitin-mediated proteolysis KEGG cluster, enrichments in multifunctional ATPase domains and in ubiquitin hydrolase domains, were also found. These latter findings suggest important areas of biology not previously known to be affected by G4 intervention in mammalian cells.

Figure 4

Download asset Open asset

Pathways and processes showing sensitivity to G4-stabilising ligands.

(A) Enriched KEGG pathways and (B) Gene Ontology terms, GO Biological Processes (BP) and Molecular Functions (MF), for the 758 genome-wide G4-sensitiser genes. Blue- genes common to both ligands; black- genes unique to either PDS or PhenDC3. A right-sided enrichment test with Bonferroni correction used (see Materials and methods). (C) Enriched protein domains (p≤0.05) within GENE3D (black) and PFAM databases (grey) ordered by -Log10 (EASE p-value). See also Supplementary file 1.

https://doi.org/10.7554/eLife.46793.006

Cancer-associated gene depletion enhances sensitivity to G4-ligands

We next used the complete list of 758 genes, identified as stringent G4 ligand sensitisers above, to discover new cancer-associated gene vulnerabilities to G4-stabilising ligands. For this, we searched this list for any significant enrichment in the COSMIC database (v83) of genes causally implicated in cancer (Forbes et al., 2015). Of the 758 sensitisers, there was a two-fold enrichment (p=9.1×10⁻⁶) for 50 cancer-associated genes, which increases to three-fold (p=2.5×10⁻³) when considering only sensitisers common to both G4 ligands (Figure 5A,B, Supplementary file 1). Notably, when STRING network analysis (Szklarczyk et al., 2017) was used to investigate functional interactions, this revealed a DDR cluster that included BRCA1 and BRCA2, as well as their interacting tumour suppressor partners PALB2 and BAP1, two cancer-associated DDR genes not previously indicated as G4 ligand sensitisers. (Figure 5C). This analysis also identified as sensitisers a cluster consisting of several chromatin modifiers including SMARCA4, SMARCB1 and SMARCE1.

Figure 5

Download asset Open asset

Identification of cancer-associated genes whose loss promotes sensitivity to G4 ligands.

(**A, B**) Median log₂FC and number of significantly depleted hairpins for G4 sensitisers overlapping the COSMIC database for PDS (A) and PhenDC3 (B). Genes common to both are indicated in blue. See also Supplementary file 1. (C) Functional interaction network analysis using STRING for the 50 COSMIC proteins indicated in A and B. Clusters are shown using confidence interactions > 0.4 from co-expression and experimental data. Box indicates the DDR cluster.

https://doi.org/10.7554/eLife.46793.007

Focused G4-sensitiser shRNA screening reveals robust G4-ligand genetic vulnerabilities and potential therapeutic targets

To enable more rigorous and further comparative analyses that focus solely on G4 sensitisers, we developed a custom shRNA screening panel encompassing the gene sensitisers identified above plus additional G4-associated genes noted from the literature (Figure 6A, Figure 6—figure supplement 1, see Materials and methods). This panel consisted of a single retroviral shRNA pool to allow all shRNAs to be screened simultaneously under standardised conditions and to minimise technical fluctuations. We first used this panel to recapitulate the findings of the genome-wide screen above and compare responses with different G4 ligands. Using A375 melanoma cells with PDS and PhenDC3, the custom panel recovered a total of 342 G4 sensitisers corresponding to 40.6% overlap (308 genes) with the complete genome-wide screen (Figure 6B,C). From this, we identified 290 G4 sensitisers with 89 and 161 unique for PDS and PhenDC3, respectively, and 40 genes common for both ligands (Figure 6—figure supplement 1E). Comparing PDS and PhenDC3 sensitisers by KEGG analysis shows that each ligand mostly interacts with different but related pathways (Figure 6D,E). Consistent with direct G4-targeting, nucleic-acid-related GO terms were enriched (Figure 6—figure supplement 1F & G, Supplementary file 2). We next considered that the 40 sensitiser genes common between PDS and PhenDC3 reflected the most robust sensitisers for G4 ligands in general and it is notable that 27 out of 40 associated with DNA or RNA binding processes, such as chromatin modification, replication transcription, and translation (Figure 6F). Again, the ubiquitin processes, which previously were not linked with G4 biology, were also uncovered as a significant sensitiser pathway. Overall, these results clearly show the spectrum of biological vulnerabilities that underpin the observed enhanced sensitivities for each G4-targeting ligand.

Figure 6 with 1 supplement see all

Download asset Open asset

A custom G4 sensitiser shRNA panel reveals unique and common G4 ligand sensitivities.

(A) A shRNAs custom retroviral pool (~8000 hairpins) was used to infect A375 cells. Following antibiotic selection, the reference time point (t0) was taken and then cells were cultured for 15 population doublings in DMSO, PDS or PhenDC3 before (tF). Three biological replicates were performed. (B) Significant sensitiser genes for the A375 focused screen (50% or three significantly depleted with median log₂ FC≤ −1). (C) Overlap of the genome-wide (GW) with A375 focused screen for PDS and PhenDC3 G4-sensitisers combined (see also Figure 6—figure supplement 1). (**D–E**) Enriched KEGG pathways for (D) PhenDC3 and (E) PDS sensitiser genes common to the genome-wide and A375 focused screens. A right-sided enrichment test with Bonferroni correction used (see Materials and methods). (F) DAVID, STRING (experimental data, co-expression, medium confidence ≥0.4) interaction and UniprotKB data were used to categorise biochemical roles for the 40 high-confidence G4 sensitisers common to both ligands. Genes in red indicate those found in the (DGIdb 2.0). *=genes in multiple categories. (**G, H**) Overlap of the all 290 robust G4 sensitisers (G) and the 40 G4 sensitisers common to both ligands (H) with the Drug Genome Interaction database. The druggable genome denotes genes with known or predicted drug interactions. Clinically actionable denotes genes used in targeted cancer clinical sequencing panels. See also Figure 6—figure supplement 1, Supplementary file 1.

https://doi.org/10.7554/eLife.46793.008

We next reasoned that the robust set of 290 G4 ligand sensitiser genes above provides a suitable test bed for exploring the arising therapeutic potential for combinatorial pharmacological inhibition and G4-ligands. We therefore looked for the presence of these sensitisers genes within the druggable genome interaction database (DGIdb) (Griffith et al., 2013). A total of 74 G4-sensitisers were found in the classifications ‘Druggable Genome’ (genes with known or predicted drug interactions) and ‘Clinically Actionable’ (genes used in targeted clinical cancer sequencing for precision medicine) with 13 being common to both classifications (Figure 6G, Supplementary file 1). Notably, this included KEAP1, an E3 ubiquitin ligase adapter protein and highlights a new therapeutic domain for the application of G4-based drugs. Performing a similar analysis on the 40 most robust sensitisers common to both G4 ligands gave 12 genes within DGIdb (Figure 6H, Supplementary file 1), including 5 (BRCA1, CHEK1, CDK12, TOP1, PDKP1) common to both druggable and clinically actionable classifications. These results therefore open up new possibilities for cancer therapies based on vulnerabilities to G4 ligands.

G4 sensitisers common to two independent cell lines

We next sought to extend the use of the custom shRNA lentiviral library to gain initial insights into possible commonalities and differences in the response to G4 ligands in cells from different lineages. We therefore applied the custom library to mesenchymal-derived HT1080 fibrosarcoma cells (wild-type TP53, driven by activated NRAS (Q61K) and IDH1 mutation (R132C)) and compared the results to those from ectodermal A375 melanoma cells above (Figure 7, Figure 7—figure supplement 1F & G, Supplementary files 1 & 2). The custom HT1080 screen recovered a total of 121 G4 ligand sensitisers, with the majority (73 genes, 58%) shared with those seen for each ligand in the A375 genome-wide screen. Cytoscape network analysis (Figure 7A) revealed a core set of G4-associated genes/pathways for these genes in spliceosome, HR and ubiquitin-mediated proteolysis processes (p<0.0005). Overall, 29 PDS and 22 PhenDC3 gene sensitivities were found to be shared across all three screens (Figure 7B,C), and it is noteworthy that both G4 ligands targeted similar processes including transcription, splicing and ubiquitin-mediated proteolysis (Figure 7D,E).

Figure 7 with 1 supplement see all

Download asset Open asset

G4 sensitivities in two different cell lines.

(A) Enriched KEGG and GO pathways for all G4 ligand-specific sensitisers (73 genes) shared between the genome-wide A375 and HT1080 screens. A right-sided enrichment test with Bonferroni correction used (see Materials and methods). (**B–C**) Comparison of G4 sensitisers across A375 focused, A375 genome-wide and HT1080 focused screens for (B) PhenDC3 and (C) PDS. (**D–E**) DAVID, STRING (experimental data, co-expression, medium confidence (≥0.4) interaction) and UniProtKB data analysis showing biochemical functions for common PhenDC3 (D) and PDS (E) sensitisers across all three screens. *=genes in multiple categories. Blue, four sensitisers common to both ligands. (F) Left, common sensitiser genes across all three screens. Right, number of depleted hairpins and median log₂FC values for four key genes found as both PDS and PhenDC3 sensitisers across all the three screens. See also Figure 7—figure supplement 1, Supplementary file 1.

https://doi.org/10.7554/eLife.46793.010

BRCA1, TOP1, DDX42 and GAR1 are key G4 ligand sensitiser genes

When we evaluated the data collectively from all screens, it was apparent that four genes were repeatedly found as G4 ligand sensitisers- BRCA1, TOP1, DDX42 and GAR1, as they consistently appeared in both cell types and with both G4-ligands in all screens (Figure 7F, Figure 7—figure supplement 1F). To corroborate these genes as genuine G4 sensitisers, we developed an independent siRNA knockdown approach using a shorter timeframe (~6 days) to recapitulate ligand-induced growth inhibition (Figure 8). Both A375 and HT1080 cells were transfected with siRNAs targeting BRCA1, TOP1, DDX42 or GAR1 alongside non-targeting siRNA and non-transfected controls. Following 24 hr, cells were treated with two concentrations of PDS and PhenDC3 or vehicle control DMSO for 144 hr. Growth curves for non-transfected and non-targeting siRNA controls were similar across ligand treatments in both cell lines (Figure 8—figure supplements 1 and 2). For both HT1080 (Figure 8A & B) and A375 cells (Figure 8—figure supplement 3A & B), protein depletion following siRNA transfection was confirmed after 48 hr by immunoblotting cell lysates with the appropriate antibodies (average 76–92% knockdown for HT1080; 41–69% knockdown for A375 after 48 hr). The percentage difference in confluency compared to non-targeting siRNA control cells was plotted (Figure 8—figure supplement 1B–E and Figure 8—figure supplement 3C–F) and compared to DMSO treatment at 72, 96 and 120 hr (Figure 8C–F, Figure 8—figure supplement 3G–J).

Figure 8 with 3 supplements see all

Download asset Open asset

siRNA knockdowns validate *BRCA1*, *TOP1*, *DDX42* and *GAR1 as* key G4 ligand sensitiser genes.

(A) HT1080 cells were treated with non-targeting (NT) or targeting (T) siRNAs for *BRCA1*, *TOP1, DDX42* and *GAR1*. 48 hr and 144 hr after transfection, cell lysates and a non-transfected cell lysate (U) were probed with appropriate antibodies and actin control by western blotting. (B) Protein levels for targeting (T) and non-targeting (NT) 48 hr lysates were normalised to the internal actin control and then normalised to NT levels for three biological replicates (mean ± standard deviation). (**C–F**) HT1080 cells were transfected with targeting siRNAs for 24 hr before PDS, PhenDC3 or DMSO treatment. Comparative box plots of confluency differences and significance (unpaired parametric t-test) at selected timepoints for (C) *BRCA1*, (D) *TOP1*, (E) *DDX42*, (F) *GAR* (ns = not significant) for three separate siRNA transfections. See also Figure 8—figure supplements 1, 2 and 3.

https://doi.org/10.7554/eLife.46793.012

Figure 8—source data 1 Source files for western blots. (A) Full length western blots and (B) capillary traces obtained from Compass Software (Simple Western) for results shown in Figure 8A.: https://doi.org/10.7554/eLife.46793.017
Download elife-46793-fig8-data1-v1.pdf

Mirroring the shRNA screen findings, siRNA knockdown of all four genes in HT1080 cells imparted significant increases in sensitivity with PDS or PhenDC3 compared to DMSO. Some differences between the ligands and individual gene knockdowns were noted. For BRCA1 and TOP1 the lowest concentration of PDS resulted in the most sensitisation and this was evident early at 72 hr, whereas both PhenDC3 concentrations resulted in similar growth inhibition and was apparent later (Figure 8C & D, Figure 8—figure supplement 1). For DDX42 and GAR1, growth inhibition was mostly manifest from 96 hr, with both ligands and concentrations being broadly similar (Figure 8E & F, Figure 8—figure supplement 1). Results with the A375 cells also lend support to our observations, although there were some differences compared to HT1080 cells (Figure 8—figure supplements 2 and 3). While GAR1 knockdown showed a similar sensitivity profile, BRCA1 and TOP1 deficiencies were sensitive to PDS but not PhenDC3. DDX42 knockdown in A375 cells did not reflect the screens ligand sensitivities and this may in part be due to lower knockdown efficiency compared (~40%). Nonetheless, these independent siRNA short-term assays substantiate that BRCA1, TOP1, DDX42 and GAR1 are genetic vulnerabilities to G4 ligands and these may open up future possibilities for therapeutic development.

G4-targeting ligands plus pharmacological inhibitors of G4 sensitiser genes demonstrate synergistic cell killing

One of our aims was to identify potential cancer genotypes where G4-ligands could be therapeutically exploited. Cancers deficient in our newly discovered G4 sensitisers may be preferentially sensitive to G4-ligands as single agents. Alternatively, rather than exploiting a genetic deficiency per se, it may be possible to use pharmacological inhibition of a critical cancer gene product that phenocopies the deficiency in combination with G4 ligands as an orthogonal approach (Figure 9A). As proof-of-principle, we systematically evaluated cell death potentiation with the G4 ligand PDS in combination with pharmacological inhibitors for two new G4 sensitisers gene products, the WEE1 kinase or the deubiquitinase USP1 (Figure 9B). WEE1 is a crucial G2/M regulator overexpressed in several cancers (Matheson et al., 2016), and USP1 is involved in DDR regulation and is overexpressed in non-small cell lung and other cancers (reviewed in García-Santisteban et al., 2013). For our studies, we used MK1775 (AZD1775), a WEE1 kinase inhibitor that is being clinically evaluated in several cancers (Richer et al., 2017), and pimozide a potent USP1-targeting drug (Chen et al., 2011a). HT1080 and A375 cells were cultured in matrix combinations of PDS with MK1775 or pimozide at concentrations surrounding the GI₅₀ values and cell viability measured after 96 hr using an end-point ATP luminescence-based assay (CellTiter-Glo, Promega). Combenefit software (Di Veroli et al., 2016) was then used to calculate synergy for different treatment combinations in which the percentage growth inhibition compared to single agent controls is used to plot a 3D-dose-response surface of synergy distribution in concentration space (Figure 9C–F). In HT1080 cells, synergy was found for both PDS and MK1775 or pimozide combinations (Figure 9C,D, Figure 9—figure supplement 1) with peak synergies of 21% and 24% at 156 nM PDS with 21 nM MK1775 or 6.25 μM pimozide, respectively (GI₅₀ for PDS, MK1775 and pimozide alone = 322 nM, 59 nM and 8.4 μM, respectively). A375 cells showed lower synergy with PDS and MK1775 combination (Figure 9E, Figure 9—figure supplement 1), with peak synergy of 15% at 8 μM PDS, 444 nM MK1775 (GI₅₀ for PDS, MK1775 and pimozide alone = 8.5 μM, 625 nM and 12.2 μM, respectively). The greatest synergy was seen in combinations of PDS and pimozide in A375 cells (Figure 9F, Figure 9—figure supplement 1) with a peak synergy of 61% at 5.33 μM PDS, 6.25 μM pimozide. Furthermore, long-term clonogenic survival assays revealed a similar potentiation of growth inhibition, albeit at lower compound concentrations, for PDS/MK1775 and PDS/pimozide drug combinations for both cell lines tested (Figure 9—figure supplement 2). Altogether, these results validate that appropriate drug combinations can synergistical act as a surrogate for gene deficiencies in the presence of G4 ligands and thus complements the findings uncovered by our genetic screening approach.

Figure 9 with 2 supplements see all

Download asset Open asset

Cell death potentiation mediated by pharmacological inhibition of WEE1 or USP1 with the G4-stabilising ligand PDS.

(A) Cell death potentiation with G4-stabilising ligands in combination with either gene deficiencies, such as shRNA-mediated knockdown (top), or pharmacological inhibition of a protein (bottom). (B) Numbers of depleted shRNA hairpins and median log₂FC values for WEE1 and USP1 in the genome-wide and focused screens. (**C–F**) Synergy plots for HT1080 (**C, D**) and A375 (**E, F**) cells treated with PDS in combination with MK1775 (**C, E**) or pimozide (**D, F**). To determine any synergy in cell killing, 3D response surface plots were calculated using Combenefit software with the BLISS model for an average of three biological replicas. Heat bar- blue shading indicates synergy combinations, red indicates antagonism (see also Figure 9—figure supplements 1 and 2).

https://doi.org/10.7554/eLife.46793.018

Identification of DDX42 as a new G4-binding protein

Another of our aims was to use the findings of our shRNA screen to identify proteins that may bind and/or regulate G4 structures in cells, such as G4 helicases. Indeed, DHX36 and DHX9, known G4 helicases (Giri et al., 2011; Chen et al., 2018; Chakraborty and Grosse, 2011; Creacy et al., 2008 ; Vaughn et al., 2005) and the DEAD box protein DDX3X, that was recently shown to bind RNA G4s (Herdy et al., 2018), were identified as G4 sensitisers in our screen. Further members of the DDX/DHX helicase family also appeared as G4 sensitisers (Figure 10A), raising the question of whether these represent previously uncharacterized G4-binding proteins. To address this directly, we chose to investigate DDX42 as this was one of the four key G4 sensitisers identified above. DDX42 is a non-processive RNA helicase (Uhlmann-Schiffler et al., 2006) and has been associated with splicing (Will et al., 2002); however, this protein remains largely uncharacterised. By immunoblotting of nuclear and cytoplasmic sub-cellular fractions (Figure 10B–E), we first established that DDX42 predominantly localises to the nucleus (~4 to 9-fold greater than cytoplasmic levels) in three independent cell lines, (HT1080, HEK293 and HeLa). As controls for fractionation, LaminB1 and GAPDH were found to partition as expected into nuclear and cytoplasmic fractions, respectively (Figure 10C,D).

Figure 10 with 1 supplement see all

Download asset Open asset

DDX42 is a predominantly nuclear G4-binding protein.

(A) Number of depleted hairpins and median log₂FC values for DEAH/DEAD-box helicase genes within the 758 genes identified in the genome-wide screen. Those highlighted in blue caused sensitivity to both PDS and PhenDC3. (B) Representative immunoblots showing cytoplasmic (C) and nuclear (N) lysates for HT1080, human embryonic kidney (HEK) and HeLa cells probed for DDX42, laminB1 and GAPDH2. (**C, D**) GAPDH and laminB1 protein levels for (C) cytoplasmic and (D) nuclear lysates (mean for two biological replicates ± standard deviation). (E) DDX42 nuclear protein levels (normalised to cytoplasmic levels, mean for two biological replicates ± standard deviation). (**F, G**) DDX42 binding curves G4s by ELISA. (F) NRAS 5’ UTR RNA G4 (rG4), mutated G4 sequence (rG4 mut) and RNA hairpin. (G) MYC DNA G4 (dG4) and mutated control (dG4 mut). Apparent K_d is calculated from five replicates (values are indicative as the model assumes saturation kinetics).

https://doi.org/10.7554/eLife.46793.021

Figure 10—source data 1 Source files for western blots. (A) Full-length western blots and (B) capillary traces obtained from Compass Software (Simple Western) for results shown in Figure 10A.: https://doi.org/10.7554/eLife.46793.023
Download elife-46793-fig10-data1-v1.pdf

As DDX42 is known to bind RNA, we next set out to demonstrate DDX42 affinity for a RNA-G4 structure as this has not previously been documented. For this, a G4 RNA oligonucleotide from the NRAS 5’UTR sequence, which forms a stable parallel G4 (Kumari et al., 2007), was used together with a mutated oligonucleotide unable to form a G4 structure and also a RNA hairpin as negative controls (Herdy et al., 2018). Oligonucleotides were folded in 100 mM KCl to promote G4 structure formation and the resultant structures confirmed by circular dichroism (CD) spectroscopy (Figure 10—figure supplement 1). The affinity of recombinant DDX42 was then investigated by Enzyme Linked Immunosorbent Assay (ELISA, Figure 10F) and binding parameters calculated using a non-linear regression model, assuming one-site-specific binding and saturation kinetics using Prism software. DDX42 bound the NRAS G4 folded in KCl with an apparent K_d of 71.1 ± 3.5 nM and did not bind detectably to the mutant oligonucleotide or RNA hairpin controls.

Given the nuclear localisation of DDX42 and as some DDX proteins also have DNA helicase activity (Kikuma et al., 2004), the DDX42 affinity for a DNA G4 structure was investigated. For this, an oligonucleotide corresponding to the stable parallel G4 structure in the promoter of MYC (González and Hurley, 2010; Yang and Hurley, 2006), and a non-G4 forming control, were used. The oligonucleotides were folded in 100 mM KCl and structures verified by CD spectroscopic analysis (Figure 10—figure supplement 1B). DDX42 affinity by ELISA (Figure 10G) showed that DDX42 binds to the MYC DNA G4 with an apparent K_d of 232.9 ± 23.5 nM with little binding to the mutant control. Thus, the G4 sensitiser screen has enabled us to identify and classify DDX42 as a G4-interacting protein as a new finding.

Discussion

G4 structures are emerging as promising clinical targets in cancer (Xu et al., 2017) but the range of disease-associated genetic backgrounds that potentiate G4 ligand effects has yet to be defined. Here, we have discovered many genes that when depleted enhance cell killing with the G4 ligands PDS and/or PhenDC3. The majority of these have no documented link to G4 biology and the use of low ligand concentrations is likely to favour discovery of gene losses that are the most sensitive in imparting selective cell killing. Validating the success of our approach, we independently identified G4-associated protein coding genes known to be genetic vulnerabilities to G4 ligands including BRCA1/2, HERC2 and ATRX (McLuckie et al., 2013; Wang et al., 2019; Watson et al., 2013; Wu et al., 2018; Xu et al., 2017; Zimmer et al., 2016). We now report for the first time genetic vulnerabilities in 20 other known G4-associated genes that promote sensitivity to G4-stabilising ligands. These include direct nucleic acid binders and/or unwinders, such as ADAR, DHX36, DNA2, FUS, MCRS1, RECQL4, SF3B3 and XRN1.

The clinical PARP inhibitor, olaparib has exemplified the concept of synthetic lethality in BRCA-deficient cells (Bryant et al., 2005; Farmer et al., 2005), and it is notable that BRCA deficiencies were isolated as one of the top genetic vulnerabilities for both G4 ligands in both A375 and HT1080 cells. While PDS and PhenDC3 have not been optimised by medicinal chemistry, the findings of Zimmer et al showing similar efficacy of PDS and olaparib in several BRCA-deficient models (Zimmer et al., 2016) lends further support that our screen detects robust, biologically relevant effects.

In dropout screens, dissociating minor from robust growth effects is important and is highly dependent on parameters such as compound dose, genotype and cell line selected. Our screen was designed with stringent parameters to detect genes deficiencies worthy of further exploration. Indeed, we demonstrate potent growth inhibition of up to 80% of the four top G4 sensitisers genes in a parallel siRNA approach.

The gene sensitivities uncovered here have potential to be exploited chemotherapeutically in cancer by deploying a G4-stabilising drug as a single-agent therapy. Alternatively, in the absence of a particular gene deficiency, pharmacological inhibition of a critical oncogene could phenocopy the genetic sensitivities described here and be used in combinatorial treatments with G4-stabilising drugs. This may be attractive as cells are less likely to simultaneously develop resistance against two drugs (reviewed in Chan and Giaccia, 2011). Furthermore, as lower drug doses are used, this increases the therapeutic window and has less adverse side effects. As proof-as-principle for this, we selected the WEE1 cell cycle kinase and the deubiquitinase USP1, and demonstrated that their pharmacological inhibition, with MK1775 and pimozide, respectively, leads to the potentiation of cell death in conjunction with the G4 ligand PDS. For example, 5.3 μM PDS or 6.25 μM pimozide alone impart little growth inhibition (14% and 6% respectively), but together they lead to strong growth inhibition (79%). Table 1 highlights further potential combinatorial opportunities for cancer-associated genes with clinical and/or experimental drugs. Additional therapeutic possibilities for other gene sensitivities that are largely still to be explored from a pharmacological perspective are illustrated in Table 2.

Table 1

Possible chemotherapeutic combinations for G4-stabilising ligands with clinically relevant pharmacological drugs

https://doi.org/10.7554/eLife.46793.024

Gene	Oncogene/tumour suppressor	Combinatorial/single agent	Available drug treatments	Cancer association summary	Reference
BRCA1/2	Tumour suppressor	Single agent	Olaparib CX-5461	Deficient in ovarian,breast and colorectal cancer.	Lee et al., 2014; Xu et al., 2017; McLuckie et al., 2013; Zimmer et al., 2016
CCDC6	Tumour suppressor	Single agent	Olaparib	Inactivated in thyroid and lung cancers. CCDC6-deficient tumours are cisplatin-resistant but olaparib sensitive.	Puxeddu et al., 2005; Morra et al., 2015
CDK12	Oncogene	Combinatorial	Dinaclib (SCH77965)	High-grade serous ovariancancer, often exhibits gain-of-function CDK12.	Parry et al., 2010; Bajrami et al., 2014
KEAP1	Oncogene/Tumour suppressor	Combinatorial/single agent	CDDO-Me CPUY192018	KEAP1 inactivated in multiple cancers including thoracic and endometrial; also hasoncogenic role, CDDO-Me used forleukaemia and sold tumours.	Sanchez-Vega et al., 2018; Abed et al., 2015; Lu et al., 2016; Wang et al., 2014
PSMC2	Oncogene	Combinatorial	Proteosome inhibitors: Bortezomib CEP187710 Carfizomib	Ubiquitin is emerging aschemotherapeutic target, and general proteasome inhibitors clinically are used against multiple myeloma.	Chen et al., 2011a; Mattern et al., 2012; Edelmann et al., 2011
SMAD4	Tumour suppressor	Single agent	GSKi: NCT01632306 NCT01214603 NCT01287520	Inactivated in 50% of pancreatic adenocarcinomas. Negativelyregulated by GSK, GSKis in clinical trials for metastatic pancreatic cancer and acute leukaemia.	Schutte et al., 1996; Hahn et al., 1996; Demagny and De Robertis, 2016; McCubrey et al., 2014
SRSF10	Oncogene	Combinatorial	E7107 1C8	Over-expressed in colon cancer. 1C8 inhibits SRSF10 and impairs HIV replication. FUS interactingprotein. E7107 is a splicinginhibitor preventing spliceosome assembly.	Zhou et al., 2014; Shkreta et al., 2017; Cheung et al., 2016; Kotake et al., 2007
UBA3	Oncogene	Combinatorial	MLN4924	Upregulated in AML and multiple solid cancers. MLN4924 is in Phase Iclinical trials.	Soucy et al., 2009
USP1	Oncogene/Tumour suppressor	Combinatorial/single agent	Pimozide	Over-expressed inmelanoma, gastric, cervical and NSCLC; under-expressed in leukaemia and lymphoma. Pimozide is a potent USP1-targeting drug.	García-Santisteban et al., 2013; Chen et al., 2011b
WEE1	Oncogene/Tumour suppressor	Combinatorial/single agent	AZDMK1775	Over-expressed in several cancers, some NSCLC are deficient.	Matheson et al., 2016; Richer et al., 2017; Backert et al., 1999; Yoshida et al., 2004
WHSC1	Oncogene	Combinatorial	DA3003-1 PF-03882845 Chaetocin TC-LPA5-4 ABT-199	Over-expressed in prostate cancer, multiple myeloma and mantle cell lymphoma. five potent candidate inhibitors.	Coussens et al., 2017; Bennett et al., 2017

Table 2

Examples of cancer-associated genetic vulnerabilities to G4 ligands.

https://doi.org/10.7554/eLife.46793.025

Gene category	Gene name	Function/pathway Summary	Cancer association summary	References
DNA damage repair	PALB2	Homologous recombination; binds BRCA2	Inactivating mutations predispose patients to myeloid leukaemia, Wilm’s tumour and Fanconi anaemia.	Harrigan et al., 2018; Nepomuceno et al., 2017
	BAP1	Homologous recombination; binds BRCA1, deubiquitinase for Histone 2A and tumour suppressor HCFC-1	Inactivating mutations foundin uveal melanoma and mesotheliomas.	Harrigan et al., 2018; Carbone et al., 2013
	USP1	Fanconi anaemia and translesion synthesis DDR; deubiquitinase required for FANCD2, FANCI and PCNA localisation to sites of DNA damage	USP1 mRNA over-expressed in melanoma, gastric, cervical and NSCLC; under-expressed in leukaemia and lymphoma.	Harrigan et al., 2018; Nijman et al., 2005; Huang et al., 2006; García-Santisteban et al., 2013
	TOP1	Relieves torsional stress during DNA replication; suppresses genomic instability at actively transcribed exogenous G4-forming sequences	Common cancer target, to induces DNA damage following pharmacological inhibition, lethal to cells.	Yadav et al., 2014; Wang, 2002
Helicase activity	RECQL4 RTEL1	Previously identified G4-helicases	RECQL4 (Rothmund-Thomsun syndrome) and RTEL1 (Hoyeraal-Hreidarsson Syndrome), deficiencies impart increased risks of cancer cancer, autoimmunity and premature ageing.	Brosh, 2013
Chromatin remodellers	ANKRD11	Transcription factor; Recruits histone deacetylases	Tumour suppressor epigenetically silenced in breast cancers.	Lim et al., 2012 Neilsen et al., 2008 Noll et al., 2012
	MLL4	H3K4 lysine methyl transferase	Frequently inactivated in several cancers.	Froimchuk et al., 2017 Kadoch et al., 2013; Rao and Dou, 2015
	SMARCA4 SMARCB1 SMARCE1	SWI/SNF ATP-dependent chromatin remodellers	Mutated in 20% of human cancers; doxorubicin resistant triple-negative breast cancer is associated with loss of SMARCB1, SMARCA4, or KEAP1 (a BRCA1 interactor).	Kadoch et al., 2013; Shain and Pollack, 2013
Ubiquitin	USP37	Deubiquitinating enzyme which stabilises MYC	Upregulated in lung cancer.	Pan et al., 2015
	NEDD4L	E3 ubiquitin ligase	Expression correlates with poor patient outcome in hepatocellular and gastric carcinomas.	Zhao et al., 2018; Gao et al., 2012
	RNF20	E3 ubiquitin ligase; chromatin remodelling and DDR	Tumour supressor down-regulated in several cancers. Deletion is main contributor to chromosomal instability in colorectal cancer.	Moyal et al., 2011; Shema et al., 2008; Barber et al., 2008
Splicing	FUS	Splicing component and known G4-interactor	Over-expressed in colon, breast and liposarcoma cancers, respectively.	Crozat et al., 1993; Dvinge et al., 2016; Takahama et al., 2013

While the custom HT1080 screen recovered 58% of sensitisers seen for each ligand in the A375 genome-wide screen, it is striking that this increases to 93% (i.e. 112 out of 121) when considering all screens irrespective of G4 ligand, suggesting remarkable consistency when comparing G4 ligand effects globally. Differences in individual ligand sensitives may arise from variances in cellular uptake and dose, for example, the GI20 dose of PhenDC3 is ten-fold higher for A375 compared to HT1080; G4 ligand-dependent molecular preference for G-tetrad end binding (Le et al., 2015) and/or the accessibility of G4s in the chromatin of individual cell lines (Hänsel-Hertsch et al., 2016). These points plus differences in protein knockdown efficiency, especially in A375 cells, may contribute to the differences in G4 ligand growth inhibition in our siRNA experiments. In the siRNA experiments, the G4 ligand-induced growth inhibition of both A375 and HT1080 appear not to follow a ‘typical’ dose response where higher concentrations lead to greater effects. This may in part be due to there being an optimum G4 ligand dose for a particular gene loss leading to enhanced cell death. Indeed, it is thought that lower drug concentrations better fall within a ‘synthetic lethality window’ (Nijman, 2011). Higher doses may mask these effects, by targeting more G4s that are not dependent on the particular gene lost and/or be due to other off-target effects. This is also supported by the experiments in Figure 9 that show synergy is only apparent at defined concentrations.

Our data additionally provides insights into the possible functions of the identified G4 sensitisers and indicates roles in DNA damage response (DDR), transcription/chromatin remodelling, nucleic acid unwinding, splicing and ubiquitin-mediated proteolysis. Our findings substantially advance our knowledge of G4 interactions with DDR beyond BRCA1/2 as several key HR genes were identified as novel G4-sensitisers including PALB2, BAP1 and the deubiquitinase USP1. Importantly, this highlights that such HR repair mechanisms are an integral and important cellular response in preventing cell death induced through the increased persistence of G4s. Persistent G4 structures are also inhibitory to DNA replication/cell cycle progression (reviewed in Valton and Prioleau, 2016 ), and it is of note that we also uncovered many cell cycle/DNA replication sensitivities such as PCNA, CHEK1, CCND1, CDC7, RFC2 and RFC4. Taken together these suggest that G4 stabilisation with small molecules could be an attractive therapeutic strategy to inhibit cell growth.

Deficits in G4-unwinding helicases are predicted to increase the persistence of G4 structures resulting in heightened sensitivity to G4 ligands. Several known G4-associated helicase deficiencies were recovered, including RECQL4, RTEL1 and DHX36, alongside many others with no known G4 link (see Figure 10A). Here, we demonstrate for the first time that the DDX42 DEAD/DEAH helicase is in fact a previously unidentified structure-specific G4-binding protein. On a wider level, this acts as proof-of-principle that other specific G4 interacting proteins exist within the sensitiser list of over 700 proteins. Other known G4-helicases such as BLM, WRN, PIF1 and FANCJ (reviewed in Wu and Brosh, 2010) were not identified as sensitisers, which may reflect functional redundancy (Spillare et al., 2006), or a low ligand concentrations and/or cell type effects.

Our findings highlight the ubiquitin-protesome pathway and modifications such, as neddylation as unexplored areas with respect to G4s. The only documented ubiquitin-G4 relationship in human cells is with HERC2, an E3 ubiquitin ligase that is implicated in G4 resolution whose loss sensitises cells to G4 ligands (Wu et al., 2018). We also independently validate HERC2 as a G4 sensitiser in our screen and extend our observations to cover the full breadth of the proteosomal degradation pathway, including members of E1 ligase (UBA3, UBA2, SAE1), E2 ligase (UBE2H), E3 ligase (NEDD4L, RBX1, CUL1, RNF20), deubiquitinating enzyme (USP1 and USP37) and proteosome (PSMC2) families (see Table 2) (Senft et al., 2018; Wei and Lin, 2012) Given the involvement of ubiquitin-proteasomal regulation in pathways, such as DDR and cell cycle, that are generally deregulated in cancer (Harrigan et al., 2018), this opens up an interesting intersection between ubiquitin regulation and G4s. As ubiquitin components are being targeted for anticancer therapies (Huang and Dixit, 2016), their efficacy might be enhanced through simultaneous G4 targeting and here we have provided strong proof-of-principle of this using synergistic combinations of pimozide (targeting UPS1) and the G4 ligand PDS.

In contrast to other genetic screens identifying sensitiser genes that enhance the efficacy of anticancer agents (Azorsa et al., 2009; Martens-de Kemp et al., 2017), our work suggests that persistent G4s are problematic for splicing. We identified several cancer-associated splicing factors as G4 sensitisers, including SRSF10, HNRNPM and the known G4-interactor FUS, which is overexpressed in several cancers (Crozat et al., 1993; Dvinge et al., 2016; Takahama et al., 2013). For the latter, a drug inhibiting general spliceosome assembly (Table 1) has been pharmacologically explored (Kotake et al., 2007) raising the possibility of potentiation by G4-stabilising ligand combinatorial treatment.

We designated four of the genetic vulnerabilities as ‘key’ genes (BRCA1, TOP1, DDX42, and GAR1) whose deficiencies stood out with respect to consistent sensitivity to PDS and PhenDC3 in both cell lines tested. Given this, we postulate that deficiencies in any of these four genes will impart significant G4 ligand sensitivity for a range of cell types and/or with other G4 ligands. As GAR1-deficiencies are implicated in chronic lymphocytic leukaemia and contribute to telomere dysfunction (Dos Santos et al., 2017), we suggest that this cancer may be acutely sensitive to G4-stabilisation by small molecules.

In conclusion, we have revealed genes and pathways that interact with stabilised G4 structures. This information provides new insights into G4-related biology, especially into the functional pathways and roles as G4-interacting proteins. Furthermore, this work reveals novel disease-related genetic vulnerabilities for G4-ligands. Overall, these data provide a unique and comprehensive resource that can be further explored to understand biology that may involve G4s and also inspire new therapeutic possibilities.

Materials and methods

Key resources table

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information
Cell line (H. Sapiens)	A375	ATCC	Cat# CRL-1619, RRID:CVCL_0132
Cell line (H. Sapiens)	HT1080	ATCC	Cat# CCL-121, RRID:CVCL_0317
Cell line (H. Sapiens)	Plat-A	Cell Biolabs Inc	Cat# RV-102, RRID:CVCL_B489
Antibody	Mouse monoclonal anti-Beta Actin	Merck	Cat# A5441, RRID:AB_476744	WB (1:250)
Antibody	Mouse polyclonal anti-DDX42	Abcam	Cat# ab80975, RRID:AB_2041042	WB (1:250)
Antibody	Rabbit monoclonal anti-Beta Actin	Cell Signalling Technology	Cat# 4970, RRID:AB_2223172	WB (1:500)
Antibody	Rabbit polyclonal anti-BRCA1	Cell Signalling Technology	Cat# 9010, RRID:AB_2228244	WB (1:50)
Antibody	Rabbit monoclonal anti-GAPDH	Cell Signalling Technology	Cat# 5174, RRID:AB_10622025	WB (1:50)
Antibody	Rabbit polyclonal anti-GAR1	NovusBio	Cat# NBP2-31742, RRID:AB_2801566	WB (1:100)
Antibody	Rabbit polyclonal anti-GST, HRP-conjugated	Abcam	Cat# ab3416, RRID:AB_30378	ELISA (1:10,000)
Antibody	Rabbit monoclonal anti-LaminB1	Cell Signalling Technology	Cat# 12586, RRID:AB_2650517	WB (1:250)
Antibody	Rabbit monoclonal anti-TOP1	Abcam	Cat# ab109374, RRID:AB_10861978	WB (1:250)
Recombinant DNA reagent	pCMV-VSV-G plasmid	Addgene	Cat # 8454, RRID:Addgene_8454	plasmid
Recombinant DNA reagent	G-quadruplex focused shRNA plasmid library	transOMIC technologies, this paper		supplied as a glycerol stock, Materials and methods subsection: ‘Composition and recombinant DNA reproduction of shRNA libraries’
Recombinant DNA reagent	transOMIC LMN genome-wide shRNA plasmid library	transOMIC technologies		supplied as multiple glycerol stocks
Sequence-based reagent	Biotinylated oligonucleotides	Biffi et al. (2013), Herdy et al. (2018), this paper		Materials andmethods subsection ‘Oligonucleotide annealing’
Sequence-based reagent	Genomic qPCR primers	this paper		Materials and methods subsection ‘Barcode recovery, adapter ligation and sequencing’
Sequence-based reagent	Pasha/DGCR8 siRNA	Qiagen	Cat# 1027423
Sequence-based reagent	siRNAs	this paper		Materials and methods subsection ‘siRNA validation experiments – transfection, experimental outline, immunoblotting’
Peptide, recombinant protein	Recombinant human DDX42	NovusBio	Cat# H0001325-P01
Commercial assay or kit	BluePippin 2% Internal Standard Marker Kit	Sage Science	Cat# BDF2010
Commercial assay or kit	CellTitre-Glo One Solution Assay Reagent	Promega	Cat# G8461
Commercial assay or kit	KAPA library quantification kit for Illumina platforms	Kapa Biosystems	Cat# 07960140001
Commercial assay or kit	KOD Hot Start DNA polymerase	Merck	Cat# 710864
Commercial assay or kit	Lipofectamine RNAiMAX	ThermoFisher Scientific	Cat# 13778150
Commercial assay or kit	Muse Count and Viability kit	Merck	Cat# MCH600103
Commercial assay or kit	QIAmp DNA Blood Maxi Kit	Qiagen	Cat# 51194
Commercial assay or kit	QIAquick PCR purification kit	Qiagen	Cat# 28104
Commercial assay or kit	Qubit dsDNA HS assay kit	ThermoFisher Scientific	Cat# Q32851
Commercial assay or kit	RIPA lysis buffer	ThermoFisher Scientific	Cat# 8990
Commercial assay or kit	ZR GigaPrep Kit	Zymo Research	Cat# D4057
Chemical compound, drug	Ampicillin	Merck	Cat# A5354
Chemical compound, drug	Chloroquine diphosphate	Acros organics	Cat# 455240250
Chemical compound, drug	cOmplete mini protease inhibitor	Roche	Cat# 11836153001
Chemical compound, drug	DMSO	ThermoFisher Scientific	Cat# 20688
Chemical compound, drug	Geneticin	Gibco	Cat# 10131035
Chemical compound, drug	MK1775	Cambridge Bioscience	Cat# CAY21266
Chemical compound, drug	PenStrep	ThermoFisher Scientific	Cat# 1507063
Chemical compound, drug	PhenDC3	In-house synthesis	De Cian et al., 2007a
Chemical compound, drug	Pimozide	Merck	Cat# P1793-500MG
Chemical compound, drug	Pyridostatin (PDS)	In-house synthesis	Rodriguez et al. (2008)
Chemical compound, drug	Sodium Butyrate	Merck	Cat# 303410
Chemical compound, drug	TMB substrate	Merck	Cat# T4444
Software, algorithm	Bowtie 2 v2.2.6	Langmead and Salzberg, 2012	http://bowtie-bio.sourceforge.net/bowtie2/index.shtml
Software, algorithm	ClueGO v3.5.1	Bindea et al., 2009; Bindea et al., 2013	http://www.ici.upmc.fr/cluego/cluegoDownload.shtml
Software, algorithm	ColonyArea	Guzmán et al., 2014	Image J plugin
Software, algorithm	Code used for shRNA screen data analysis	This paper	All scripts are available at: https://github.com/sblab-bioinformatics/GWscreen_G4sensitivity
Software, algorithm	Combenefit	Di Veroli et al., 2016	https://sourceforge.net/projects/combenefit/
Software, algorithm	Cytoscape v3.6.0	Shannon et al., 2003	http://www.cytoscape.org/
Software, algorithm	edgeR v3.6	Robinson et al., 2010	http://bioconductor.org/packages/release/bioc/html/edgeR.html
Software, algorithm	DAVID	Huang et al., 2009a, Huang et al., 2009b	https://david.ncifcrf.gov
Software, algorithm	FastQC v0.11.3	Andrews, 2010	http://www.bioinformatics.babraham.ac.uk/projects/fastqc
Software, algorithm	FASTX-Toolkit v0.0.14	Gordon and Hannon, 2010	http://hannonlab.cshl.edu/fastx_toolkit.html
Software, algorithm	Graphpad Prism	GraphPad Prism (https://graphpad.com)	RRID:SCR_015807	Version 6
Software, algorithm	PolySearch2	Liu et al., 2015a	http://polysearch.cs.ualberta.ca/
Software, algorithm	Python programming language v2.7.10	https://www.python.org
Software, algorithm	R programming language v3.2.1	https://cran.r-project.org/
Software, algorithm	Unix tools (cat, cut, awk, sort and uniq)	https://opengroup.org/unix

Oligo name	Description	Sequence 5’−3’
Mir5-F	Primary PCR Forward Primer	5’-CAGAATCGTTGCCTGCACATCTTGGAAAC- 3’
PGKpro-R	Primary PCR Reverse Primer	5’ -CTGCTAAA GCGCATGCTCCAGACTGC- 3’
P5-Seq-P-Mir-Loop	Secondary PCR forward Primer	5’-AATGATACGGCGACCACCGAGATCTACACT AGCCTGCGCACGTAGTGAAGCCACAGATGTA-3’
P7-Index-n-Truseq-PGKpro-R	Secondary PCR barcoded reverse primer	5’-CAAGCAGAAGACGGCATACGAGAT nnnnnnGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTCTGCTAAAGCGCATGCTCCAGACTGC – 3’
SeqPrimer MirLoop	Custom sequencing primer	5’-TAGCCTGCGCACGTAGTGAAGCCACAGATGTA-3

GO id	Name	Type	Link
GO:0051880	G-quadruplex DNA binding	Molecular function	https://www.ebi.ac.uk/QuickGO/term/GO:0051880
GO:0002151	G-quadruplex RNA binding	Molecular function	https://www.ebi.ac.uk/QuickGO/term/GO:0002151
GO:0061849	telomeric G-quadruplex DNA binding	Molecular function	https://www.ebi.ac.uk/QuickGO/term/GO:0061849
GO:0071919	G-quadruplex DNA formation	Biological process	https://www.ebi.ac.uk/QuickGO/term/GO:0071919
GO:0044806	G-quadruplex DNA unwinding	Biological process	https://www.ebi.ac.uk/QuickGO/term/GO:0044806
GO:1905493	regulation of G-quadruplex DNA binding	Biological process	https://www.ebi.ac.uk/QuickGO/term/GO:1905493
GO:1905494	negative regulation of G-quadruplex DNA binding	Biological process	https://www.ebi.ac.uk/QuickGO/term/GO:1905494
GO:1905495	positive regulation of G-quadruplex DNA binding	Biological process	https://www.ebi.ac.uk/QuickGO/term/GO:1905495
GO:1905465	regulation of G-quadruplex DNA unwinding	Biological process	https://www.ebi.ac.uk/QuickGO/term/GO:1905465
GO:1905466	negative regulation of G-quadruplex DNA unwinding	Biological process	https://www.ebi.ac.uk/QuickGO/term/GO:1905466
GO:1905467	positive regulation of G-quadruplex DNA unwinding	Biological process	https://www.ebi.ac.uk/QuickGO/term/GO:1905467

siRNA	Catalogue number	Sequence 5’−3’
Non-targeting 2	D-001810-02-05	UGGUUUACAUGUUGUGUGA
BRCA1 (A375)	J-003461–09	CAACAUGCCCACAGAUCAA
BRCA1 (HT1080)	J-003461–12	GAAGGAGCUUUCAUCAUUUC
TOP1 (both cell lines)	J-005278–08	CGAAGAAGGUAGUAGAGUC
DDX42 (both cell lines)	J-012393–11	GGAGAUCGACUAACGGCAA
GAR1 (both cell lines)	J-013386–06	UCCAGAACGUGUAGUCUUA

Oligo	Rna/DNA	Sequence
NRAS G4	RNA	5’ [Btn] UGU GGG AGG GGC GGG UCU GGG UGC 3’
NRAS mut	RNA	5’ [Btn] UGU AGA AAG AGC AGA UCU AGA UG 3’
Stem loop	RNA	5’ [Btn] ACA GGG CUC CGC GAU GGC GGA GCC CAA 3’
Myc G4	DNA	5’ [Btn] TGA GGG T GGG TA GGG T GGG TAA 3’
Myc mut	DNA	5’ [Btn] TGA GAG T GAG TA GAG T GAG TAA 3’

Share this article

Cite this article

Strategy identifying genetic vulnerabilities involved with G4 biology.

shRNA screening pipeline to uncover genetic vulnerabilities to G4 stabilisation.

Genome-wide screening in A375 cells reveals deficiencies in known G4-associated genes as sensitive to G4-stabilising small molecules.

Pathways and processes showing sensitivity to G4-stabilising ligands.

Identification of cancer-associated genes whose loss promotes sensitivity to G4 ligands.

A custom G4 sensitiser shRNA panel reveals unique and common G4 ligand sensitivities.

G4 sensitivities in two different cell lines.

siRNA knockdowns validate BRCA1, TOP1, DDX42 and GAR1 as key G4 ligand sensitiser genes.

Figure 8—source data 1

Cell death potentiation mediated by pharmacological inhibition of WEE1 or USP1 with the G4-stabilising ligand PDS.

DDX42 is a predominantly nuclear G4-binding protein.

Figure 10—source data 1

Possible chemotherapeutic combinations for G4-stabilising ligands with clinically relevant pharmacological drugs

Examples of cancer-associated genetic vulnerabilities to G4 ligands.

Author details

Katherine G Zyner

Contribution

Contributed equally with

Competing interests

Darcie S Mulhearn

Contribution

Contributed equally with

Competing interests

Santosh Adhikari

Contribution

Competing interests

Sergio Martínez Cuesta

Contribution

Competing interests

Marco Di Antonio

Contribution

Competing interests

Nicolas Erard

Contribution

Competing interests

Gregory J Hannon

Contribution

Competing interests

David Tannahill

Contribution

Competing interests

Shankar Balasubramanian

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism