Aicardi–Goutières syndrome (AGS) is a severe childhood inflammatory disorder that shows clinical and genetic overlap with systemic lupus erythematosus (SLE). AGS is thought to arise from the accumulation of incompletely metabolized endogenous nucleic acid species owing to mutations in nucleic acid-degrading enzymes TREX1 (AGS1), RNase H2 (AGS2, 3 and 4), and SAMHD1 (AGS5). However, the identity and source of such immunogenic nucleic acid species remain undefined. Using genome-wide approaches, we show that fibroblasts from AGS patients with AGS1-5 mutations are burdened by excessive loads of RNA:DNA hybrids. Using MethylC-seq, we show that AGS fibroblasts display pronounced and global loss of DNA methylation and demonstrate that AGS-specific RNA:DNA hybrids often occur within DNA hypomethylated regions. Altogether, our data suggest that RNA:DNA hybrids may represent a common immunogenic form of nucleic acids in AGS and provide the first evidence of epigenetic perturbations in AGS, furthering the links between AGS and SLE.https://doi.org/10.7554/eLife.08007.001
The immune system protects the body from attack by bacteria, viruses, and other microbes. A key feature of this system is the ability to discriminate between the body's own cells and potential foreign invaders. Occasionally, this process can go wrong and the immune system starts attacking its own tissues, which can lead to arthritis, diabetes, lupus, and other ‘autoimmune’ diseases.
Aicardi–Goutières syndrome (AGS) is an autoimmune disease that leads to severe mental and physical symptoms. Recent research has revealed that the disease is caused by mutations in genes that make enzymes called nucleases. In healthy people, these enzymes destroy DNA molecules and other nucleic acids. In AGS patients, the failure of the nucleases to act is thought to lead to the accumulation of unwanted DNA and RNA molecules. These molecules, in turn, are thought to be mistakenly identified by the immune system as ‘foreign’ and to cause an autoimmune response. However, it is not clear how this works.
Here, Lim et al. studied skin cells called fibroblasts from patients with Aicardi–Goutières syndrome. The experiments found that the patients' cells had excessive numbers of RNA molecules binding to sections of matching DNA. These unusual DNA–RNA ‘hybrids’ accumulated in regions of the genome that do not contain many genes, perhaps as a result of breaks in the DNA. It is possible that they may mimic nucleic acids from viruses and could trigger an autoimmune response.
In healthy individuals, small ‘methyl’ groups are often attached to DNA in a process known as DNA methylation. This serves to maintain the stability of the genome and controls the activity of genes. Unexpectedly, Lim et al. found that the DNA in AGS patients had far fewer methyl groups, especially in areas where the DNA–RNA hybrids had accumulated. This may lead to genome destabilization, alterations in gene activity, and may mean that the DNA in these regions may be mistaken for foreign DNA by the immune system.
Altogether, Lim et al.'s findings suggest that Aicardi–Goutières syndrome may be caused by immune responses triggered by the accumulation of RNA–DNA hybrids and lower levels of DNA methylation. These findings may aid the development of new therapies to treat Aicardi–Goutières syndrome, lupus, and other similar diseases.https://doi.org/10.7554/eLife.08007.002
Aicardi–Goutières syndrome (AGS) is a severe inflammatory encephalopathy characterized by neurological dysfunction, psychomotor retardation, seizures, unexplained fevers, joint stiffness, basal ganglia calcification, and chilblain skin lesions (Rice et al., 2007b). Despite its rarity, AGS has captured the attention of the scientific community due to its clinical, molecular, and genetic overlap with another, common, systemic autoimmune disorder, systemic lupus erythematosus (SLE) (Rice et al., 2007a, 2007b; Ravenscroft et al., 2011) (Gunther et al., 2015). Both diseases are characterized, and to a large extent driven, by excessive expression of interferon alpha, a potent anti-viral cytokine associated with heightened innate immune response and inflammation (Crow, 2011). AGS is a monogenic disorder caused by mutations in any one of the six AGS genes. The AGS genes encode for the 3′ to 5′ single-stranded DNA exonuclease TREX1 (AGS1) (Crow et al., 2006a), the RNA:DNA hybrid-specific ribonuclease H2 subunits (RNase H2A, B and C, corresponding to AGS4, 2 and 3, respectively) (Crow et al., 2006b), the 3′ to 5′ exonuclease and dNTP hydrolase SAMHD1 (AGS5) (Rice et al., 2009; Goldstone et al., 2011), and the RNA adenosine deaminase ADAR1 (AGS6) (Rice et al., 2012). Recently, gain-of-function mutations in the cytosolic double-stranded RNA receptor gene IFIH1 have also been shown to be associated with AGS (Rice et al., 2014). Interestingly, mutations in TREX1 and the RNASEH2 genes have also been implicated in SLE, further illustrating the genetic link between AGS and SLE (Lee-Kirsch et al., 2007; Gunther et al., 2015). Given the role of AGS enzymes in DNA and RNA metabolism, the innate immune response in AGS is thought to be triggered by the accumulation of incompletely metabolized endogenous nucleic acid elements (Crow and Rehwinkel, 2009). However, the identity and source of such immunogenic nucleic acid elements remain undefined.
While genetically well defined, the potential contribution of epigenetic deregulation to AGS has not been addressed. Characterizing putative epigenetic deregulation in AGS is all the more justified given that DNA hypomethylation, either spontaneous or caused by drug exposure, triggers increased B cell autoreactivity and the development of drug-induced and idiopathic human lupus (Richardson, 1986; Quddus et al., 1993; Yung et al., 1995; Jeffries et al., 2011; Absher et al., 2013; Zhang et al., 2013). Furthermore, loss of DNA methylation, albeit modest and restricted to specific autoimmunity-related genes, was reported in SLE patients (Richardson et al., 1990; Ballestar et al., 2006; Javierre et al., 2010; Jeffries et al., 2011; Absher et al., 2013), raising the question of whether the same epigenetic deregulation is shared in AGS.
To uncover the mechanism driving inflammatory responses in AGS, we sought to identify transcriptional, genetic, and epigenetic perturbations shared among AGS subtypes. For this, we extensively profiled a series of primary fibroblast cells from AGS patients with mutations in AGS1, AGS2, and AGS4 and AGS5 using high-throughput sequencing technologies. Transcriptional profiling via RNA-seq showed that AGS fibroblasts are characterized by an activated antiviral, immune response with a characteristic interferon signature. Genome-wide profiling of RNA:DNA hybrid formation revealed that AGS cells accumulate excessive loads of RNA:DNA hybrids. Whole-genome DNA methylation profiling further indicated that AGS genomes experience global loss of DNA methylation. Our data therefore provide the first evidence that TREX1, RNASEH2, and SAMHD1 mutations share common molecular abnormalities including genome-wide DNA hypomethylation and accumulation of RNA:DNA hybrid species.
We first performed RNA-seq to ascertain transcriptional signatures of AGS using primary fibroblasts from four patients with mutations in AGS1, AGS2, AGS4, or AGS5, and an age-matched healthy control (see Supplementary file for detailed genotype information). We identified a total of 98 and 209 genes that were significantly up- and down-regulated, respectively, in at least one AGS sample (Figure 1A; Figure 1—figure supplement 1A; Supplementary file 2). AGS down-regulated genes significantly associated with ontologies linked to cell adhesion and the extracellular matrix (Supplementary file 2). These ontologies may be relevant to the skin lesions often observed in AGS and SLE patients (Rice et al., 2007b). Genes involved in inflammation, immune responses, chemokine signaling pathways, and sensing of viral nucleic acids were up-regulated in AGS patient cells. Genes for the major pro-inflammatory cytokine IL1β (Zhao et al., 2013b) and the CXCL5 and CXCL6 chemokines were up-regulated in several AGS patients (Figure 1A,B). Additional immune signaling genes (IL-33, CXCL3, CXCL1, IL-8, CCL11) were significantly up-regulated in at least one AGS sample (Supplementary file 2) and gene ontology analysis of AGS deregulated genes identified cytokine and chemokine signaling pathways as one of weakly enriched terms (Figure 1—figure supplement 1B). These observations are indicative of a heightened immunomodulatory and chemotactic response previously identified in multiple autoimmune conditions, including SLE (Tuller et al., 2013). Genes involved in anti-viral responses (RSAD2, OASL, IGF2BP1, BST2), in particular interferon-inducible genes, were also up-regulated in multiple AGS samples (Figure 1A,B). Additional interferon-inducible genes (IFI6, IF44L, ISG15) described previously as representing an interferon signature in SLE (Baechler et al., 2003) and AGS (Rice et al., 2013) showed up-regulation in at least one AGS sample (Supplementary file 2). AGS fibroblasts thus exhibit an activated immune, inflammatory, and anti-viral state, underscoring the systemic nature of the disease and indicating that fibroblasts are an appropriate cell type to study AGS.
While it is likely that dysfunction in AGS enzymes results in an accumulation of incompletely metabolized immunogenic nucleic acid species (Crow and Rehwinkel, 2009), the identity and source of these molecules remain unclear. RNase H2 generally degrades RNA:DNA hybrids (Cerritelli and Crouch, 2009) and is specifically responsible for removing single ribonucleotides that are misincorporated during DNA replication (Reijns et al., 2012). We therefore determined whether ribonucleotide accumulation is a common feature of AGS. For this, we treated genomic DNA from AGS and control primary fibroblasts with purified human RNase H2 and measured the presence of resulting nicks by DNA polymerase I-dependent nick translation in the presence of [α-32P] dCTP. Genomic DNA from wild-type and RNase H2A-deficient (rnh201Δ) yeast cells that readily incorporate ribonucleotides (Nick McElhinny et al., 2010) was used as a control. Relative to wild-type, rnh201Δ yeast cells showed a fourfold to fivefold increase in ribonucleotide accumulation (Figure 2A,B), consistent with prior results (Nick McElhinny et al., 2010). Cells from two patients mutated in RNASEH2B (AGS2) showed a twofold to threefold increase in labeling (Figure 2A,C). Cells from two patients mutated in the catalytic RNase H2A subunit (AGS4) showed a pronounced 10–25-fold increase in ribonucleotides (Figure 2A,C). Thus, as observed in yeast (Nick McElhinny et al., 2010) and mouse (Reijns et al., 2012) models, a reduction of human RNase H2 activity leads to elevated levels of ribonucleotides in genomic DNA. By contrast, no significant increase in ribonucleotide loads could be detected in patients mutated in TREX1 (AGS1) or SAMHD1 (AGS5) (Figure 2A,B). Thus, while aberrant accumulation of ribonucleotides may contribute to the severity of the disease in RNase H2-defective AGS patients, it is unlikely to represent a common form of disease-causing nucleic acids in AGS.
To identify other species of RNA:DNA hybrids that may be accumulating in AGS, we performed DRIP-seq, a technique originally developed to profile R-loop formation genome-wide (Ginno et al., 2012). R-loops are long RNA:DNA hybrid structures that form co-transcriptionally upon re-annealing of the RNA transcript to the template DNA strand, forcing the non-template strand into a single-stranded state. R-loops are enriched at the 5′- and 3′-ends of human genes where they play roles in epigenetic control and transcription regulation (Ginno et al., 2012, 2013). DRIP-seq revealed that AGS and control fibroblasts share a total of 15,897 DRIP peaks representing approximately 141 megabases of genomic space (Figure 3A,C, Figure 3—figure supplement 1A,B). Consistent with prior studies, 32% of these peaks mapped to promoters or gene ends, which represents a strong enrichment over expected genomic distribution and further indicates that gene ends correspond to R-loop formation hotspots (Ginno et al., 2013). Our high-coverage DRIP-seq data also revealed that 56% of the R-loop signal mapped onto gene bodies, suggesting that RNA–DNA entanglements during transcription are prevalent (Figure 3—figure supplement 1C). By contrast, only 15.6% of common R-loop peaks mapped to intergenic regions, consistent with a predominant genic origin for R-loop formation (Figure 3—figure supplement 1C). Overall, common DRIP peaks showed strong overlap with GC skew (Figure 3B), a key sequence determinant that favors co-transcriptional R-loop formation (Ginno et al., 2013). The observation that AGS samples display R-loop formation over expected R-loop forming, GC-skewed, regions indicates that canonical R-loop formation is not significantly altered in AGS patients (Figure 3—figure supplement 1A,B).
Despite the relative agreement between AGS and control R-loop profiles over a broad set of common peaks, a large excess of DRIP peaks (33,781 in total) was observed specifically in AGS cells (Figure 3A). These novel peaks of RNA:DNA hybrids were in many cases unique to each patient sub-type. While a small number of DRIP peaks were also unique to the control sample, AGS-specific DRIP peaks in each AGS subtype were both threefold to fourfold greater in numbers, and when combined, occupied a three to four times larger genomic space than control-specific peaks (Figure 3C). On average, AGS-specific DRIP peaks occupied an additional 23.8 megabases of DNA sequence. This indicates that all AGS subtypes are burdened by higher loads of RNA:DNA hybrids in their genomes.
Unlike common DRIP peaks, AGS-specific DRIP peaks were threefold to sevenfold less likely to overlap with GC skew (Figure 3B), suggesting that they may originate according to a non-canonical mechanism. Additionally, AGS-specific DRIP peaks were depleted at transcription start sites (TSSs) and transcription termination sites (TTSs) (Figure 3D), where R-loops are typically observed (Ginno et al., 2013). In contrast, AGS2- and AGS4-specific peaks were significantly enriched over intergenic portions of the human genome (Figure 3D,E). AGS1- and AGS5-specific DRIP peaks were instead enriched over gene body regions (Figure 3D, Figure 3—figure supplement 1D). Interestingly, all AGS-specific DRIP peaks were significantly enriched in repeat classes corresponding to long interspersed nuclear elements (LINE) and long terminal repeats (LTR) retrotransposons (Figure 3D,F,G, Figure 3—figure supplement 1D). Analysis of the overlap of AGS-specific DRIP peaks with human-specific, retrotransposition-competent, LINE-1 elements failed to reveal any significant trend (data not shown). Therefore, whether the reported enrichment of AGS-specific DRIP peaks over LINE and LTR repeats carries biological significance or is simply a reflection of the increased repeat content of intergenic and intronic space remains to be determined. Altogether, our results indicate that AGS mutations in TREX1, RNASEH2A, RNASEH2B, and SAMHD1 are associated with the accumulation of RNA:DNA hybrids over repeat-rich intergenic and gene body regions.
The observation that RNA:DNA hybrids accumulate over intergenic regions is surprising given that they are normally maintained in a transcriptionally quiescent state owing to the deposition of silencing epigenetic marks such as DNA methylation (Yoder et al., 1997). To determine if DNA methylation patterns were altered in AGS, we performed MethylC-seq (Lister et al., 2009) to profile DNA methylation at low coverage (3.21–7.82 X coverage) across the same set of primary fibroblasts studied above (Supplementary file 1). All AGS cells displayed significant, global, DNA hypomethylation (Figure 4A). AGS2 and 4 cells showed profound DNA hypomethylation, with a ∼20% reduction in genomic methylation levels overall. AGS1 and 5 showed a more moderate, but still highly significant, 5–10% reduction in DNA methylation. This reduction affected TSSs and TTSs (Figure 4B,C; Figure 4—figure supplement 1A) and spread along the lengths of entire chromosomes (Figure 4D), significantly impacting nearly all genomic compartments, including genic, intergenic, and repeat regions (Figure 4E). Analysis of repeat classes further revealed that LINE and LTR elements were significantly hypomethylated in all AGS subtypes (Figure 4F). Pyrosequencing assays targeting human-specific LINE-1 elements also revealed a small but significant decrease in DNA methylation over 4 CpG sites carried at the LINE-1 5′-UTR promoter (Figure 4—figure supplement 1B). SINE elements, satellite repeats, and other repeat types were significantly hypomethylated in AGS2 and 4 but not always in AGS1 and 5 (Figure 4F). This profound reduction in DNA methylation was not caused by any detectable deregulation in the expression of genes encoding for the DNA methylation machinery or of components of a putative DNA de-methylation system (Figure 4—figure supplement 1C).
To further characterize the DNA hypomethylation observed in AGS cells, we employed a Hidden Markov Model (HMM) to annotate highly and partially methylated domains (HMDs and PMDs, respectively) (Schroeder et al., 2011). PMDs in control fibroblasts were generally shared by AGS fibroblasts but not by embryonic stem (ES) cells (Figure 5A; Figure 5—figure supplement 1A), consistent with the notion that they represent fibroblast-specific PMDs. AGS2 and AGS4 fibroblasts displayed broader PMDs than the control cells, while the size of PMD regions in AGS1 and AGS5 cells was relatively unchanged (Figure 5A; Figure 5—figure supplement 1A). Analysis of the chromatin features associated with PMDs showed that common PMDs resided within silenced genomic regions enriched for Lamin-B1, a component of the nuclear lamina, and the silencing mark histone H3 Lysine 9 trimethylation (H3K9me3), as described previously (Berman et al., 2012). Common PMDs were also strongly depleted for the H3K27me3 mark, a modification associated with facultative heterochromatin and silencing over developmental genes (Figure 5B, Figure 5—figure supplement 1B). Interestingly, AGS-specific PMDs, while still associated with Lamin-B1 and to some extent with H3K9me3, were enriched for H3K27me3, particularly over regions corresponding normally to large H3K27me3 blocks (Figure 5B,C). This suggests that the DNA hypomethylation observed in AGS is particularly associated with regions of silenced, condensed chromatin.
We next determined if the regions that became hypomethylated also corresponded to regions that accumulated RNA:DNA hybrids in DRIP-seq analysis. In AGS1 and AGS5 fibroblasts, which showed only moderate DNA hypomethylation and limited additional PMDs, DRIP peaks, whether common or AGS-unique, mostly landed in HMD regions (Figure 5D). This is consistent with AGS1- and AGS5-specific DRIP peaks being enriched over gene body regions (Figure 3D) since gene body regions are typically highly methylated (Lister et al., 2009). In sharp contrast, nearly half of the AGS-unique DRIP peaks in AGS2 and AGS4 fibroblasts landed on PMD regions while only 10–15% of common DRIP peaks matched to PMDs in these two samples. Furthermore, two-thirds of these PMD-contained AGS-unique DRIP peaks matched to AGS-specific PMDs (Figure 5D,E). Consistent with this, the DNA methylation levels measured over AGS1-, AGS2-, and AGS4- (but not AGS5) unique DRIP peaks were significantly lower in their own respective samples compared to the DNA methylation levels of that same regions measured in control (Figure 5F). Thus, our data show a remarkable agreement between regions that accumulate RNA:DNA hybrids in the AGS2, AGS4, and to a lesser extent AGS1 subtypes and regions that undergo DNA hypomethylation.
To determine if the loss of DNA methylation observed in AGS patient cells could be directly due to AGS mutations, we focused on RNASEH2A since AGS4 fibroblasts displayed profound DNA hypomethylation (Figure 4). We employed a lentivirus-mediated CRISPR/Cas method to knockout exon 6 in RNASEH2A (RNASEH2A KO) in a HeLa cell line harboring a silent green fluorescent protein (GFP) reporter gene (HeLa-GFP) (Poleshko et al., 2010). As a control, we treated the same cell line with a CRISPR vector containing a scramble guide sequence (scramble). Effective gene knockout was validated by PCR genotyping (not shown) and by Western blots (Figure 6A). To further validate the knockout, we measured the DNA damage response in RNASEH2A KO and scramble cells using an antibody against gamma H2AX. As expected from previous reports (Reijns et al., 2012; Gunther et al., 2015), we observed higher levels of gamma H2AX in RNASEH2A KO cells (Figure 6B), indicative of an activated DNA damage signaling.
Compared to scramble, RNASEH2A KO cells showed a clear increase in the number and brightness of GFP-positive cells (Figure 6C). GFP expression in RNASEH2A KO cells was 14-fold higher than in scramble cells, as measured by Western blot (Figure 6D), demonstrating reactivation of the GFP reporter gene. To survey DNA methylation levels in RNASEH2A KO and scramble cells, we focused on two 5′-UTR portions common to LINE-1 elements and performed DNA methylation sequencing after sodium bisulfite treatment. RNASEH2A KO cells showed significant DNA hypomethylation relative to scramble (Figure 6E,F). This suggests a direct effect of RNASEH2A deficiency in triggering DNA hypomethylation and gene reactivation at least for the loci studied here.
One of the key observations from this study is that primary fibroblasts from all four AGS subtypes studied here experienced global loss of DNA methylation. While additional samples encompassing other AGS mutations subtypes will need to be studied to confirm this trend, this observation nonetheless reinforces the similarities between AGS and SLE, although the DNA hypomethylation reported here is stronger than that previously reported in SLE. In AGS, DNA hypomethylation was found to affect every genomic compartment, including genic and intergenic regions, unique and repeated sequences (Figure 4). In AGS2 and AGS4, where the effect was most marked, the DNA hypomethylation led to the formation of AGS-specific PMDs in addition to those normally observed in fibroblasts (Lister et al., 2009). Similar to previously characterized PMDs in somatic tissues and cancer samples, common fibroblasts PMDs described here overlapped with inactive, late-replicating, heterochromatic regions associated with H3K9me3 and Lamin-B1 (Hawkins et al., 2010; Berman et al., 2012). Interestingly, AGS-specific PMDs, but not common PMDs, were also enriched over regions marked by H3K27me3, a mark of facultative heterochromatin associated with developmental silencing (Figure 5). This suggests that silent, condensed genomic regions, may in fact be intrinsically prone to becoming PMDs, and that this tendency is made worse by AGS2 and AGS4 mutations. Our data further indicate that RNase H2 defects directly drive these epigenetic perturbations given that we could recapitulate DNA hypomethylation at a subset of LINE-1 sequences and trigger the reactivation of a silent GFP reporter by knocking out RNASEH2A (Figure 6). While the precise mechanism responsible for DNA hypomethylation in AGS remains to be elucidated, it is worth noting that defects in TREX1, RNase H2, and SAMHD1 all trigger the DNA damage response and are associated with increased genomic instability (Yang et al., 2007; Reijns et al., 2012; Clifford et al., 2014; Kretschmer et al., 2015). RNase H2 in fact associates with the DNA replication fork through the PCNA (proliferating cell nuclear antigen) clamp (Bubeck et al., 2011) and TREX1 was found to translocate to the nucleus upon replication stress where it might be involved in processing aberrant DNA replication intermediates (Yang et al., 2007). It is therefore possible that the replication stress caused by AGS mutations indirectly results in defects in replication-coupled, DNMT1-mediated DNA methylation maintenance (Figure 7) (Law and Jacobsen, 2010). In support of this notion, a recent report shows that the PCNA clamp unloads from the lagging strand of stalled DNA replication forks (Yu et al., 2014). Given that the DNMT1 maintenance DNA methyltransferase binds to PCNA (Leonhardt et al., 1992) and requires PCNA binding for efficient activity (Schermelleh et al., 2007), it is plausible that replication stress may lead to inefficient DNA methylation maintenance, particularly in silent, compact regions of the genome such as heterochromatic regions. An analogous mechanism was proposed to account for how defective DNA replication may trigger epigenetic instability by uncoupling DNA synthesis from histone recycling (Sarkies et al., 2010). Interestingly, 5-aza-2′-deoxycytidine, a drug best known as a DNA-demethylating agent, in fact causes genome-wide DNA damage (Karpf et al., 1999; Palii et al., 2008; Orta et al., 2013). Finally, pronounced DNA hypomethylation is also known to cause genomic instability (Jackson-Grusby et al., 2001; Chen et al., 2007; Liao et al., 2015) suggesting a possible positive feedback loop that may link and amplify defects in either pathways (Figure 7). It will be interesting to further investigate the connections between DNA replication stress, DNA hypomethylation, and epigenetic instability in the context of AGS or of other diseases associated with epigenetic aberrations such as cancers.
Understanding if and how DNA hypomethylation contributes to autoimmunity in AGS and SLE is more than ever a priority. One possibility is that DNA hypomethylation results in altered gene expression profiles. However, our RNA-seq characterization only identified a modest number of up- and down-regulated genes that were enriched for ontologies reflecting the biology of the disease itself, such as immunomodulatory genes (Figure 1). There is therefore no evidence for a global alteration of transcriptional profiles as a result of the DNA hypomethylation in AGS. This is consistent with the general absence of direct connections between DNA hypomethylation and gene reactivation (Bestor et al., 2015), including in extremely hypomethylated mouse ES cells (Li et al., 1992; Tsumura et al., 2006). It remains possible, however, that DNA hypomethylation may enable the transcription of normally silent and methylated retrotransposable elements. DNA methylation is indeed the primary silencing force ensuring retrotransposon control in somatic cells (Walsh et al., 1998; Yoder and Bestor, 1998). If correct, this could lead to the reactivation of endogenous retroviral-like particles, which if detected by innate immune sensors, could in turn induce the anti-viral IFNα cytokine, a hallmark of AGS and SLE (Volkman and Stetson, 2014). Here, we showed that the DNA hypomethylation observed in AGS samples affected LTR, LINE, and to a lower extent, SINE repeated DNA elements (Figure 4). However, methylation pyrosequencing focused on human-specific LINE-1s showed that the extent of DNA hypomethylation at these elements, while significant, was minimal and that they in fact remained mostly methylated (Figure 4—figure supplement 1B). Likewise mining of our RNA-seq data sets or targeted real-time reverse-transcription PCR (RT-qPCR) assays aimed at the LINE-1 mRNA transcript failed to provide convincing evidence for LINE-1 transcriptional reactivation (data not shown). Finally, Western blots aimed at detecting the LINE-1 Orf1p protein were negative for AGS fibroblasts even though the protein was easily detected in Ntera2 control cells (data not shown). Altogether, while we can't rule out that LINE-1, or other retroelements, undergo reactivation in AGS cells, particularly in other cell types or at earlier time points during development, the evidence suggests that the DNA hypomethylation observed in AGS primary fibroblasts is not accompanied by LINE-1 reactivation.
The second major observation from this study is that all AGS subtypes tested here exhibit large accumulation of RNA:DNA hybrid species (Figure 3). RNA:DNA hybrids therefore represent candidate immunogenic nucleic acids in AGS. In contrast, ribonucleotide incorporation in genomic DNA was only observed in RNase H2-mutated samples (Figure 2). Our DRIP-seq profiling reveals that AGS2- and AGS4-specific RNA:DNA hybrids map to the intergenic space, away from traditional R-loop hotspots including genes, promoters, and terminators. AGS1- and AGS5-specific RNA:DNA hybrids were by contrast enriched over gene body regions (Figure 3D). All AGS-unique RNA:DNA hybrid peaks showed a lower overlap with GC skew, an important sequence determinant of R-loop formation (Figure 3B), suggesting that these hybrids may occur by a mechanism distinct from the well-accepted ‘thread back’ R-loop mechanism (Aguilera and Garcia-Muse, 2012). AGS-specific DRIP peaks were enriched in LINE-1 and LTR-containing sequences. This enrichment is intriguing in light of the proposed roles of SAMHD1, RNase H2, and TREX1 in the control of retroelement transposition (Stetson et al., 2008; Zhao et al., 2013a; Volkman and Stetson, 2014), although, as discussed above, we did not find convincing evidence for transposon reactivation here. Alternatively, it is possible that enrichment of AGS-specific RNA:DNA hybrids over LINE-1-rich regions simply reflects the fact that they map preferentially to intergenic and intronic regions. One possible clue as to the origin of AGS-specific DRIP peaks stems from the fact that a large fraction of such peaks in AGS2 and AGS4 samples overlap with AGS-specific PMDs in these two subtypes (Figure 5). Direct measurements in fact show that AGS-specific DRIP peaks are significantly hypomethylated in AGS1, AGS2, and AGS4 (Figure 5). Thus, the two main features observed in AGS samples, excessive RNA:DNA hybrid formation and DNA hypomethylation, often overlap, strongly supporting a link between both changes in cis. One possible explanation for this overlap is that regions experiencing DNA hypomethylation as a result of AGS mutations may undergo spurious transcription that lead to co-transcriptional R-loop formation (Figure 7). Another non-exclusive possibility is based on the proposition that PMD regions may result from DNA damage and inefficient maintenance DNA methylation (see above). It was recently described that RNA transcripts may serve as a template for the repair of double-strand DNA breaks upon hybridization to DNA via Rad52, a key member of the homologous recombination pathway. Interestingly, RNase H2 activity was shown to block the formation of these RNA:DNA hybrids (Keskin et al., 2014). It is therefore possible that some of the DNA breaks caused by AGS mutations and replication stress are repaired according to a pathway prone to RNA:DNA hybrid formation (Figure 7). While speculative, this model may also account for the lack of overlap with GC skew since those hybrids are thought to be mediated by Rad52 and not to arise co-transcriptionally. Experiments aimed at testing this proposal will be important.
One key unanswered question concerns the manner in which RNA:DNA hybrids trigger an immune response. The cGAS-STING pathway activates the interferon response upon sensing of not only cytosolic DNA (Sun et al., 2013), but also RNA:DNA hybrids (Mankan et al., 2014). Interestingly, several studies reported that multiple dsDNA break repair proteins can sense damaged DNA and induce a STING-mediated type I IFN response (Zhang et al., 2011; Kondo et al., 2013; Hartlova et al., 2015), thereby highlighting previously underappreciated links between the DNA damage response and innate immune signaling. Besides STING, it is worth noting that the endosomal receptor TLR9 has also been identified as an RNA:DNA hybrid sensor (Rigby et al., 2014) in addition to its canonical role in sensing unmethylated CpG-rich DNA (Hemmi et al., 2000). It is therefore possible that AGS mutations-induced DNA hypomethylation also contributes to the increased interferon response in AGS (Figure 7). Interestingly, treatment of colon adenocarcinoma with 5-aza-2′-deoxycytidine, which causes DNA damage and DNA demethylation, triggers a prominent interferon response (Karpf et al., 1999). Future work will be necessary to dissect the pathways by which unmethylated DNA and RNA:DNA hybrids may be sensed to trigger the innate immune response characteristic of AGS, and by extension, SLE patients.
Detection of incorporated ribonucleotides was performed as previously described (Hiller et al., 2012). Briefly, 200 ng of human primary fibroblasts or yeast genomic DNA (wild-type and RNASEH2-deficient) was treated with 20 nM of purified recombinant human RNase H2 (Loomis et al., 2014) or water in RNase H2 reaction buffer (50 mM Tris-HCl pH 8, 60 mM KCl, 10 mM MgCl2, 0.01% BSA (Bovine Serum Albumin), 0.01% Triton) at 37°C for 1 hr. 20 μM of unlabeled dATP, dGTP and dTTP, plus 3.7 × 105 Bq [α-32P]-dCTP (PerkinElmer, Santa Clara, CA) and 5 U of Escherichia coli DNA polymerase I (New England Biolabs, Inc., Ipswich, MA) were added and the reaction was incubated at 16°C for 30 min and was run on a 1% TAE (Tris-acetate-EDTA) agarose gel. Visualization was performed using a Storm PhosphorImager, and bands were quantified using ImageQuant (GE Healthcare, United Kingdom). Relative ribonucleotide loads were calculated by dividing the radiolabel incorporation in the RNase H2-treated sample over the untreated sample. Experiments were performed at least in triplicate.
DRIP-seq was performed on primary fibroblasts as previously described (Ginno et al., 2012). Sequencing reads were trimmed with FastqMcf (Aronesty, 2011) before mapping to the hg19 reference genome using BWA 0.6.1 (Li and Durbin, 2009). Peak calling was first performed by MACS 1.4.2 (Zhang et al., 2008) using input library as control. DRIP peaks were further assigned onto restriction fragments using BEDtools (Quinlan and Hall, 2010). For control and each AGS subtype, DRIP peaks from two independent samples were merged into one sample for downstream analysis. All overlap analysis was performed using BEDTools (Quinlan and Hall, 2010). GC skew annotation was according to low stringency SkewR peaks as previously described (Ginno et al., 2013). The enrichment or depletion of AGS-unique DRIP peaks over different genomic regions was measured as fold-change of percent base-pair overlap of AGS-unique DRIP peaks relative to common DRIP peaks. The statistical significance of enrichment or depletion of AGS-unique DRIP peaks over a specific genomic feature was measured as follows: for each AGS-unique peak, 500 shuffled peaks of equal lengths were extracted from the common peak set and their overlap with various genomic features was recorded as % length overlap. The significance of the difference in overlap between observed (AGS unique) and expected (shuffled) was calculated using empirical p-values according to the Monte Carlo method.
2 μg of genomic DNA was sheared by sonication down to 100–500 bp in size. Sequencing libraries were constructed as described before (Schroeder et al., 2011). We then performed bisulfite treatment using EZ DNA methylation-direct kit (Zymo Research, Irvine, CA) following manufacturer's instructions. 50 ng of bisulfite-treated DNA was amplified for 15 cycles using Pfu Cx Turbo Hotstart DNA polymerase (Agilent Technologies, Santa Clara, CA), and the quality of the library was checked on a 2100 Agilent Bioanalyzer prior to sequencing on an Illumina HiSeq 2000. Sequencing reads were trimmed as described in DRIP-seq. Mapping was performed using Bismark 0.7.7 (Krueger and Andrews, 2011), and percent methylation was called using custom Perl script, combind_strand_meth.pl. C to T conversion rate was determined as the ratio of converted C in CHG and CHH context relative to the total number of CHG and CHH. CpG coverage was calculated as the average number of reads over CpG sites in the genome. Both C to T conversion rate and CpG coverage are reported in Supplementary file 1. To ensure good coverage and eliminate PCR bias, CpG sites with coverage below 4× or more than 99.9th percentile were discarded. Circos plot was generated using Circos 0.62–1 (Krzywinski et al., 2009). TSS and TTS metaplots were generated using custom Perl script, wig_to_metaplot_lowmem.pl. Methylation levels at different genomic regions were calculated BEDtools (Quinlan and Hall, 2010). Wilcoxon paired test was performed using R with the alternative hypothesis that the AGS samples are less methylated than the control. PMDs and HMDs were called using StochHMM, a HMM-based software (Lott and Korf, 2014) as previously described (Schroeder et al., 2013) with modifications: PMDs were trained using random 25-kb regions with 25–55% methylation and HMDs were trained using random 25-kb regions with 60–100% methylation. H3K27me3 and H3K9me3 data sets were obtained from ENCODE normal human dermal fibroblasts (NHDF-Ad). The lamin B1 data set was obtained from Gene Expression Omnibus (GSE49341).
Reduced representation bisulfite sequencing (RRBS) library was prepared according to the Myers lab protocol (Varley et al., 2013) prior to sequencing on an Illumina HiSeq 2000. Sequencing reads were trimmed with Trim Galore 0.3.3 (Krueger, 2014) and mapped to hg19 genome using Bismark 0.7.7 (Krueger and Andrews, 2011). TSS metaplot was generated using wig_to_metaplot_lowmem.pl.
RNA-seq libraries were constructed using the Illumina TruSeq kit prior to sequencing on an Illumina HiSeq 2000. Raw reads were trimmed as before and mapped to the hg19 genome using TopHat 2.0.5 (Trapnell et al., 2009). Read counts for each gene were calculated using HTSeq. Pearson's correlation between each pair of biological replicates was calculated by comparing every gene's log2 normalized read count (Supplementary file 4). Differential gene expression was identified using DESeq (Anders and Huber, 2010) using fold change > 2 and FDR (false discovery rate) < 0.1. Pathway analysis was performed using DAVID (Huang da et al., 2009).
DRIP-seq, MethylC-seq, RRBS, and RNA-seq data are available at the Gene Expression Omnibus (GEO) database, under the accession number GSE57353. Custom Perl scripts are available at https://github.com/ywlim/Perl.
Total RNA was extracted from 80% confluent primary fibroblasts using TRI reagent (Life Technologies, Grand Island, NY) and Direct-zol RNA miniprep kit (Zymo Research). RNA was reverse transcribed to first strand cDNA using iScript Reverse Transcription Supermix for RT-qPCR (Bio-Rad, Hercules, CA). cDNA was cleaned up using DNA clean and concentrator (Zymo Research) and resuspended in 10 μl water. For RT-qPCR, 5 μl of 1:50 dilution of the cDNA was used per well in a 20 μl reaction, along with 2 μl of 10 μM primer sets and 10 μl of SsoAdvanced Universal SYBR Green Supermix (Bio-Rad). Reactions were run in duplicate at least on a CFX96 Touch Real-Time PCR Detection System (Bio-Rad) with the following protocol: 95°C (2 min), 40 cycles of 95°C (10 s) and 60°C (15 s). Quantification was calculated using the CFX Manager software (Bio-Rad). Primer sequences are listed on Supplementary file 3.
RNASEH2A guide RNA sequence was designed using E-CRISP (http://www.e-crisp.org/) and the corresponding oligonucleotides (top: CACCGTTGGATACTGATTATGGCTC; bottom: AAACGAGCCATAATCAGTATCCAAC) and scramble oligonucleotides (top: CACCGGCACTACCAGAGCTAACTCA; bottom: AAACTGAGTTAGCTCTGGTAGTGCC) were purchased from Life Technologies. The oligonucleotides were annealed and ligated into lentiCRISPR v2 (Addgene plasmid 52961) at the BsmBI restriction site. The resulting lentiCRISPR-RNASEH2A and lentiCRISPR-scramble plasmids were transformed into DH10B cells and purified using Qiagen Plasmid Midi Kit. To produce lentivirus carrying the CRISPR plasmids, 5 μg lentiCRISPR-RNASEH2A or lentiCRISPR-scramble, 4 μg psPAX2, and 1.5 μg of pMD2.G were transfected into HEK293T cells on a 10-cm plate using Turbofect (Life Technologies) following manufacturer's protocol. Viral supernatant was collected 48 hr later, filter sterilized with a 0.45-μm filter (Millipore, Billerica MA), and 1 ml of it was used to infect HeLa-GFP cells on a 6-well plate. 1 μg/ml puromycin was added 24 hr later. 22 days later, HeLa-GFP-RNASEH2A KO and HeLa-GFP-scramble cells were imaged using GFP microscopy. Whole-cell protein was extracted and Western blot was performed using antibodies against RNase H2A (1:1000, ProSci) or GFP (1:500, UC Davis/NIH NeuroMab Facility, Davis, CA), and tubulin (1:1000, Sigma–Aldrich, St. Louis, MO). Immunocytochemistry was performed on the same cells using antibody against γH2AX (1:150, Abcam, United Kingdom).
600 ng of genomic DNA was bisulfite converted using EZ DNA methylation-direct kit (Zymo Research) and eluted in 30 μl elution buffer. 1 μl of it was then PCR amplified using two sets of LINE-1 primers (Supplementary file 3) (95°C for 4 min, 20 cycles of 95°C, 55°C and 72°C for 30 s each, 72°C for 4 min). PCR products were gel purified and cloned into pDRIVE cloning vector using Qiagen PCR cloning kit (Qiagen, Valencia, CA) and the resulting vectors were transformed into DH10B cells. Miniprep was performed using QIAprep Spin Miniprep Kit (Qiagen) and 16 clones were sequenced for each LINE-1 locus. Percent methylation at each CG site was the ratio of unconverted (CG) vs the sum of unconverted and converted (TG) molecules.
Command-line tools for processing biological sequencing dataCommand-line tools for processing biological sequencing data.
Interferon-inducible gene expression signature in peripheral blood cells of patients with severe lupusProc Natl Acad Sci U S A 100:2610–2615.https://doi.org/10.1073/pnas.0337679100
The epigenetic face of systemic lupus erythematosusThe Journal of Immunology 176:7143–7147.https://doi.org/10.4049/jimmunol.176.12.7143
Notes on the role of dynamic DNA methylation in mammalian developmentProceedings of the National Academy of Sciences of USA 112:6796–6799.https://doi.org/10.1073/pnas.1415301111
PCNA directs type 2 RNase H activity on DNA replication and repair substratesNucleic Acids Research 39:3652–3666.https://doi.org/10.1093/nar/gkq980
Type I interferonopathies: a novel set of inborn errors of immunityAnnals of the New York Academy of Sciences 1238:91–98.https://doi.org/10.1111/j.1749-6632.2011.06220.x
Aicardi-Goutieres syndrome and related phenotypes: linking nucleic acid metabolism with autoimmunityHuman Molecular Genetics 18:R130–R136.https://doi.org/10.1093/hmg/ddp293
Defective removal of ribonucleotides from DNA promotes systemic autoimmunityThe Journal of Clinical Investigation 125:413–424.https://doi.org/10.1172/JCI78001
Mammalian RNase H2 removes ribonucleotides from DNA to maintain genome integrityThe Journal of Experimental Medicine 209:1419–1426.https://doi.org/10.1084/jem.20120876
Inhibition of DNA methyltransferase stimulates the expression of signal transducer and activator of transcription 1, 2, and 3 genes in colon tumor cellsProceedings of the National Academy of Sciences of USA 96:14007–14012.https://doi.org/10.1073/pnas.96.24.14007
DNA damage sensor MRE11 recognizes cytosolic double-stranded DNA and induces type I interferon by regulating STING traffickingProceedings of the National Academy of Sciences of USA 110:2969–2974.https://doi.org/10.1073/pnas.1222694110
SAMHD1 prevents autoimmunity by maintaining genome stabilityAnnals of the Rheumatic Diseases 74:e17.https://doi.org/10.1136/annrheumdis-2013-204845
Circos: an information aesthetic for comparative genomicsGenome Research 19:1639–1645.https://doi.org/10.1101/gr.092759.109
Establishing, maintaining and modifying DNA methylation patterns in plants and animalsNature Reviews. Genetics 11:204–220.https://doi.org/10.1038/nrg2719
StochHMM: a flexible hidden Markov model tool and C++ libraryBioinformatics 30:1625–1626.https://doi.org/10.1093/bioinformatics/btu057
Cytosolic RNA: DNA hybrids activate the cGAS-STING axisThe EMBO Journal 33:2937–2946.https://doi.org/10.15252/embj.201488726
Genome instability due to ribonucleotide incorporation into DNANature Chemical Biology 6:774–781.https://doi.org/10.1038/nchembio.424
Identification of a functional network of human epigenetic silencing factorsThe Journal of Biological Chemistry 285:422–433.https://doi.org/10.1074/jbc.M109.064667
Autosomal dominant inheritance of a heterozygous mutation in SAMHD1 causing familial chilblain lupusAmerican Journal of Medical Genetics. Part A 155A:235–237.https://doi.org/10.1002/ajmg.a.33778
Heterozygous mutations in TREX1 cause familial chilblain lupus and dominant Aicardi-Goutieres syndromeAmerican Journal of Human Genetics 80:811–815.https://doi.org/10.1086/513443
Clinical and molecular phenotype of Aicardi-Goutieres syndromeAmerican Journal of Human Genetics 81:713–725.https://doi.org/10.1086/521373
RNA: DNA hybrids are a novel molecular pattern sensed by TLR9The EMBO Journal 33:542–558.https://doi.org/10.1002/embj.201386117
Dynamic DNA methylation across diverse human cell lines and tissuesGenome Research 23:555–567.https://doi.org/10.1101/gr.147942.112
The enemy within: endogenous retroelements and autoimmune diseaseNature Immunology 15:415–422.https://doi.org/10.1038/ni.2872
A candidate mammalian DNA methyltransferase related to pmt1p of fission yeastHuman Molecular Genetics 7:279–284.https://doi.org/10.1093/hmg/7.2.279
Cytosine methylation and the ecology of intragenomic parasitesTrends in Genetics 13:335–340.https://doi.org/10.1016/S0168-9525(97)01181-5
Mechanism of drug-induced lupus. I. Cloned Th2 cells modified with DNA methylation inhibitors in vitro cause autoimmunity in vivoThe Journal of Immunology 154:3025–3035.
Cutting edge: Ku70 is a novel cytosolic DNA sensor that induces type III rather than type I IFNThe Journal of Immunology 186:4541–4545.https://doi.org/10.4049/jimmunol.1003389
A critical role for interleukin-1beta in the progression of autoimmune diseasesInternational Immunopharmacology 17:658–669.https://doi.org/10.1016/j.intimp.2013.08.012
Bing RenReviewing Editor; University of California, San Diego School of Medicine, United States
eLife posts the editorial decision letter and author response on a selection of the published articles (subject to the approval of the authors). An edited version of the letter sent to the authors after peer review is shown, indicating the substantive concerns or comments; minor concerns are not usually shown. Reviewers have the opportunity to discuss the decision before the letter is sent (see review process). Similarly, the author response typically shows only responses to the major concerns raised by the reviewers.
Thank you for submitting your work entitled “Genome-wide DNA hypomethylation and RNA:DNA hybrid accumulation in Aicardi-Goutières syndrome” for peer review at eLife. Your submission has been favorably evaluated by James Manley (Senior editor), and three reviewers, one of whom is a member of our Board of Reviewing Editors. The following individuals responsible for the peer review of your submission have agreed to reveal their identity: Bing Ren (Reviewing editor); Paula Vertino (peer reviewer #3). A further reviewer remains anonymous.
The Reviewing editor has drafted this decision to help you prepare a revised submission.
The work reports a number of interesting findings linking mutations of AGS1-5 to the symptoms displayed by a class of rare genetic disorders. Overall, the manuscript is well written, with most conclusions well supported by evidence presented. However, a number of over-interpretations or over-statements were made, where causal relationships were drawn based on insufficient data. The authors are encouraged to thoroughly revise the manuscript by toning down the statements or provide more objective statements.
1) Thoroughly revise the conclusions, where unwarranted causal relationship was drawn, as noted by both reviewers #2 and #3 (below). For example, “This study identifies global epigenetic perturbations and accumulation of RNA:DNA hybrids as two novel hallmarks that may drive the powerful immune response responsible for...”
2) Provide clearer description of the datasets. For example, provide information upfront regarding the WGBS sequence depth and resolution.
3) The RNAseq data only weakly support the idea of immune activation as a phenptype that is enriched in AGS fibroblasts. There were nearly as many downregulated genes...any patterns there? The data should be fully discussed.
4) The data in Figure 3E, F seem to suggest that the accumulation of RNA:DNA hybrids is confined to retroelements, and further, to specific classes (LINE,LTR) implying some specificity to the effect. Please discuss this trend and biological implications.
5) The Drip-seq data need to be better presented so that a reader can fully appreciate how the patterns change, and the spatial relationship to other genomic features (see Reviewer #3 comment below).
6) The DNA methylation data and RNA-DNA hybrid datasets also need to be presented more clearly, so that the spatial relationship between DNA methylation changes and accumulation of RNA:DNA hybrid can be better appreciated.
7) Experiments from the RNASEH2A k/o fibroblasts have revealed only a modest impact of RNASEH2A KO on DNA methylation at certain retroelements. Deeper analysis of this cell system would allow the investigator to determine the hierarchical nature of the genomic phenomena observed, and strength the causality statement.
8) Proper statistics needs to be used and documented when the enrichment/depletion characteristics and overlapping features are discussed.
Aicardi-Goutières Syndrome (AGS) is caused by mutations in several genes involved in ribonucleotide metabolism and is characterized by an overt native immuneresponse, but the molecular causes of the symptoms remain unidentified. In this study, the authors profiled transcriptome, DNA methylome and RNA-DNA hybrids in fibroblasts from patients with AGS, and identified global DNA hypo-methylation and increased RNA:DNA hybrids as the most likely drivers of the inflammatory response in AGS patients. Importantly they highlight a potential epigenetic contribution to this syndrome (hypomethylation) specifically the reactivation of retrotransposon elements in the human genome, due to genome hypomethylation in the cells. The results are pretty compelling and the conclusions are well supported by the data provided.
1) Figure 1. It appears that for some genes, the normalized RNA-seq data for some of the samples (AGS4-P2, AGS5-P2, AGS2-P1) varies a lot between replicates in the same patient. Could you provide information showing the correlation between replicates?
2) Figure 3A) This figure is confusing. It would help to explain how the right hand vertical axis corresponds to the left hand vertical axis. Figure 3C is much more easily interpretable and a cleaner presentation of the data.
3) The authors show that in AGS2 and AGS4 patients, regions containing RNA:DNA hybrids largely correspond with hypomethylated regions, but this is not true in AGS1 and AGS5 patients. Additionally, the authors mention that AGS1 and AGS5 patients have decrease hypomethylation. Patients AGS2 and AGS4 have RNASEH2a/2b mutations, and the authors show how RNASEH2a mutations trigger DNA hypomethylation. Is it possible to show the same for the RNASEH2b mutation corresponding with the AGS2 patient?
4) Figure 6E/F) By focusing on only two LINE elements it seems too small of a set to “suggest a direct role of RNASEH2a in triggering DNA hypomethylation.” Perhaps you can suggest that it plays a role at these LINE elements. It is likely that if you focused on other areas you could see hypermethylation. Additionally in Figure 6E/F on cpg13, there are four reads supporting 50 % methylation level in the scramble, and two reads supporting a 0% methylation level in the KO. There are too few reads covering this cpg to make any claims regarding changes in methylation at cpg13.
The study focuses on a potential mechanism connecting genetic mutations in TREX1, RNase H2 and SAMHD1 to inflammation-related problems in Aicardi-Goutières syndrome (AGS) in children. AGS involves the accumulation of incompletely metabolized nucleic acids resulting from the mutations. In this study, a variety of epigenome and transcriptome profiling, and a cutting-edge genetic engineering method are used to identify the source of immunogenic nucleic acids. The main conclusion is global epigenetic alterations and accumulation of RNA-DNA hybrids are involved in AGS, both shown here for the first time, and these alterations may drive immune responses that cause the inflammation.
Overall this is a novel study that provides a new evidence of two types of molecular changes that are variably associated with AGS, as shown in fibroblasts from AGS patients and controls.
The main issue I see with this manuscript is the overstatement of causality where only modest associations are shown. For example, I would agree with “This study identifies global epigenetic perturbations and accumulation of RNA:DNA hybrids as two novel hallmarks” but do not see adequate support for the remainder of the conclusion “that may drive the powerful immune response responsible for...” I also suggest adding additional caveat/caution about the first phrase as this study analyzed a very limited number of AGS samples, and they are heterogeneous in terms of the underlying mutation. Another example of overstatement is “our results indicate that AGS mutations in TREX1, RNASEH2A, RNASEH2B, and SAMHD1 lead to the accumulation of RNA:DNA hybrids over retrotransposon-rich intergenic regions.” The data do not support “lead to”, but perhaps better described as “associated with”.
The WGBS data is quite interesting and the Circos display allows a reasonable visual comparison and highlights global differences. I have minor concerns about the data. The coverage is quite low, 3x to 7x, which defeats the main benefit of this method, base resolution. While the overall conclusion is probably correct, it would be better for the reader to know this is very low pass sequencing upfront, rather than putting it in a supplemental table only. It is also not clear how the C to T conversion was calculated. I could not find an explanation in the table or methods.
The manuscript by Lim et al. makes some very interesting observations regarding the molecular basis of Aicardi-Goutieres syndrome, an autoimmune disorder of uncertain molecular origin despite well described loss of function mutations in several nucleic acid processing enzymes. The authors find that loss of function of AGS genes is associated, to varying degrees, with widespread hypomethylation of DNA as well as the selective accumulation of RNA:DNA hybrids in intergenic regions and over repetitive elements, and modest changes in gene expression. How these three observations are molecularly linked is less clear. The manuscript is well written and the experiments appropriately controlled. For the most part (exceptions noted below, the conclusions drawn are supported by the data provided.
1) The authors appear to favor the idea that RNA:DNA hybrid accumulation is the culprit behind activation of the immune response in AGS cells, though the data at hand are such that a direct causal link cannot be established. How this might be achieved is unclear, but the Discussion proposes a number of ‘sensing’ mechanisms, some of which would require a mechanism in cis (e.g. detection of hybrids as they are formed) whereas others (TLR, GAS-STING etc. ) require cytoplasmic accumulation of duplexes. Is there any evidence in the AGS fibroblasts or in the RNASeH2 K/O fibroblasts of non-genomic accumulation of DNA:RNA duplexes in AGS cells?
2) The RNAseq data only weakly support the idea of immune activation as a phenptype that is enriched in AGS fibroblasts. There were nearly as many downregulated genes...any patterns there? The data should be fully discussed.
3) The DNA damage known to accompany defects in the AGS genes is proposed as intermediates that could mediate and/or cause the RNA:DNA hybrids or DNA hypomethylation. Do RNA:DNA hybrids accumulate at the sites of ectopic DNA damage ? This is something that could be addressed in the RNAseH2 k/o cells which exhibit by co-localization studies with gamma H2Ax.
4) It is suggested that the relationship of RNA:DNA hybrid accumulation to retroelements might be circumstantial, due to the enrichment of these structures in intragenic regions... however, the data in Figure 3E, F, if representative, seem to suggest the opposite; that the accumulation of RNA:DNA hybrids is confined to retroelements, and further, to specific classes (LINE,LTR) implying some specificity to the effect. Are the unique drip seq signals by and large confined to retroelements as shown in Figure 3E, F? Is it possible that you are detecting some intermediate in piRNA-mediated transposon silencing?
5) The Drip-seq data are presented in a very general way (as total # of peaks, distance covered etc.) making it difficult to fully appreciate how the patterns change, and the spatial relationship to other genomic features. Both and increase in burden and size is implied. Statements like “...Drip-seq peaks were 3-4 fold larger in size” are unclear. Do you mean that existing Drip-seq peaks in control cells spread / expand laterally in AGS cells, or is the trend for new peaks to form in patient cells? Figure 3D – the ‘raw’ data on common and unique sites would be more informative rather than “fold change” e.g. what fraction of common and unique peaks overlap each genomic feature? This would provide not only a comparison between unique and common sites, but also an indication of the relative distribution of Dripseq peaks across genomic features in each setting.
6) Likewise, it is hard to appreciate the spatial relationship between DNA methylation changes and accumulation of RNA:DNA hybrids, given that the former are very broad and might encompass multiple retroelements some of which change and others that don't. If one looks specifically at those AGS specific DRIP-seq peaks, what fraction are overlapping retroelements, and what is the average DNA methylation level of these in control and AGS cells?
7) The lack of overlap between AGS unique DRIP seq peaks and regions of GC skew/ annotated TSSs is taken as an indication that the aberrant RNA:DNA hybrids detected do not arise co-transcriptionally. Barring a direct test of the role of transcription on RNA: DNA hybrids in AGS cells, I'm not sure one can rule out a transcription-dependent mechanism at this juncture.
8) The authors interpret the data from the RNASEH2A k/o fibroblasts as evidence that “...RNAse H2 defects directly drive these epigenetic perturbations”. Thus far they have only observed a modest impact on DNA methylation at certain retroelements. Whether there is a measurable global (bulk) effect on DNA methylation as in AGS cells is not clear, nor has the impact on RNA:DNA hybrid formation been studied. This could be a powerful system as it might allow the investigator to determine the hierarchical nature of the genomic phenomena observed.
9) Why/what are the missing bubbles in the bisulfite sequencing data in Figure 6E? Depending on the nature of these CpGs (unmethylated/methylated) would change the magnitude of the methylation change significantly.https://doi.org/10.7554/eLife.08007.018
- Frédéric Chédin
- Frédéric Chédin
- Yoong Wearn Lim
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
We thank Dr Yanick Crow and Dr Gillian Rice for providing AGS patient samples, Dr Jodi Nunnari for the RNASEH2-deficient yeast cells, Dr Giovanni Capranico for the psPAX2 and pMD2.G plasmids, Dr David Segal for the HeLa cell line harboring silent GFP reporter gene, Dr John Moran for helpful discussions, and members of the Chedin lab for useful comments. This work was funded by grants from the Hartwell Foundation and the National Institutes of Health (NIH R01 GM094299) to FC. YWL was supported in part by a pre-doctoral training grant (5T32GM007377) and a Dissertation Year fellowship from UC Davis. This work used the Vincent J Coates Genomics Sequencing Laboratory at UC Berkeley, supported by NIH S10 Instrumentation Grants S10RR029668 and S10RR027303.
- Bing Ren, Reviewing Editor, University of California, San Diego School of Medicine, United States
© 2015, Lim et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.