H3K9me3 is required for inheritance of small RNAs that target a unique subset of newly evolved genes
Abstract
In Caenorhabditis elegans, RNA interference (RNAi) responses can transmit across generations via small RNAs. RNAi inheritance is associated with Histone-3-Lysine-9 tri-methylation (H3K9me3) of the targeted genes. In other organisms, maintenance of silencing requires a feed-forward loop between H3K9me3 and small RNAs. Here, we show that in C. elegans not only is H3K9me3 unnecessary for inheritance, the modification’s function depends on the identity of the RNAi-targeted gene. We found an asymmetry in the requirement for H3K9me3 and the main worm H3K9me3 methyltransferases, SET-25 and SET-32. Both methyltransferases promote heritable silencing of the foreign gene gfp, but are dispensable for silencing of the endogenous gene oma-1. Genome-wide examination of heritable endogenous small interfering RNAs (endo-siRNAs) revealed that endo-siRNAs that depend on SET-25 and SET-32 target newly acquired and highly H3K9me3 marked genes. Thus, ‘repressive’ chromatin marks could be important specifically for heritable silencing of genes which are flagged as ‘foreign’, such as gfp.
Editorial note: This article has been through an editorial process in which the authors decide how to respond to the issues raised during peer review. The Reviewing Editor's assessment is that all the issues have been addressed (see decision letter).
https://doi.org/10.7554/eLife.40448.001Introduction
RNA interference (RNAi) responses are inherited in Caenorhabditis elegans nematodes across generations via heritable small RNAs (Alcazar et al., 2008; Buckley et al., 2012; Vastenhouw et al., 2006). In worms, exposure to a number of environmental challenges, such as viral infection (Gammon et al., 2017; Rechavi et al., 2011), starvation (Rechavi et al., 2014), heat (Klosin et al., 2017), and growth in liquid (Lev et al., 2018) induces heritable physiological responses that persist for multiple generations. Inheritance of such transmitted information was linked to inheritance of small RNAs and chromatin modifications, and hypothesized to protect and prepare the progeny for the environmental challenges that the ancestors met.
By base-pairing with complementary mRNA sequences, small RNAs in C. elegans control the expression of thousands of genes, and protect the genome from foreign elements (Luteijn and Ketting, 2013; Malone and Hannon, 2009). Via recruitment of RNA-binding proteins, small interfering RNAs (siRNAs) can induce gene silencing also by inhibiting transcription (Castel and Martienssen, 2013).
Small RNA-mediated transcription inhibition involves modification of histones, however the exact role that histone marks play in inheritance of RNAi and small RNA synthesis is still not entirely clear (Rechavi and Lev, 2017). In C. elegans small RNAs that enter the nucleus were shown to inhibit the elongation phase of Pol II (Guang et al., 2010); In addition, nuclear small RNAs are thought to recruit histone modifiers to the target’s chromatin, resulting in deposition of histone marks such as histone H3K9-tri methylation (H3K9me3) and H3K27me3 (Gu et al., 2012; Lev et al., 2017; Mao et al., 2015).
The interactions between small RNAs and repressive chromatin marks are reciprocal: in Arabidopsis thaliana (Holoch and Moazed, 2015; Molnar et al., 2010) and Schizosaccharomyces pombe (Moazed et al., 2006; Verdel et al., 2004; Hall et al., 2002) small RNAs and repressive histone marks form a self-reinforcing feed-forward loop, where nuclear small RNAs induce deposition of repressive histone marks, and in turn the repressive chromatin marks recruit the small RNA machinery to synthesize additional small RNAs. Whether a similar feedback operates in worms and other organisms, is still under investigation. In Neurospora crassa, transgene-induced small RNAs work independently of H3K9me3 (Chicas et al., 2005). In C. elegans, it was previously suggested that H3K9me is required for RNAi inheritance (Shirayama et al., 2012). However, studies from different groups have shown that the situation is more complex, and that H3K9me could be dispensable, and can even suppress heritable silencing of some targets (Kalinava et al., 2017; Lev et al., 2017; Minkina and Hunter, 2017).
In C. elegans H3K9me is considered to depend mainly on the methyltransferases MET-2, SET-25, and SET-32 (Kalinava et al., 2017; Spracklin et al., 2017; Towbin et al., 2012). H3K9 methylation by MET-2 and SET-25 occurs in a step-wise fashion – after MET-2 deposits the first two methyl groups (H3K9me1/2), SET-25 can add the third methyl group (me3) (Towbin et al., 2012). In the germline, however, SET-25 is capable of tri-methylating H3K9 in a MET-2-independent manner (Bessler et al., 2010; Towbin et al., 2012). SET-32-dependent H3K9me3 is at least in part independent of the activity of SET-25 or MET-2 (Kalinava et al., 2017).
To study the roles of H3K9me3 in the maintenance of heritable small RNAs, we examined the inheritance of small RNAs in mutants defective in these histone methyltransferases. Although H3K9me3 was thought to be required for heritable RNAi (Ashe et al., 2012; Gu et al., 2012), we found the heritable RNAi-responses are greatly potentiated in met-2 mutant background (Lev et al., 2017). Our data indicated that the enhanced strength of the RNAi responses in met-2 mutants stems from a genome-wide massive loss of different endogenous small RNA (endo-siRNAs) species. In normal circumstances, these endo-siRNAs compete with exogenously derived siRNAs over shared biosynthesis components required for small RNA production or inheritance (Lev et al., 2017). In addition, we found that the accumulated sterility (or ‘Mortal Germline’, Mrt phenotype) of met-2 mutants results from dysfunctional small RNA inheritance (Lev et al., 2017).
However, our previous results regarding the role of H3K9me1/2 (deposited by MET-2) did not rule out the possibility that H3K9me3 is yet required for efficient heritable silencing of gfp transgenes: We found that RNAi responses in met-2 mutants nevertheless lead to marking of the target gene’s histones with a heritable H3K9me3 modification. Further, a comparison of the H3K9me3 signal on the gfp locus in different mutants has shown that anti-gfp RNAi responses were strongly inherited only in genetic backgrounds where some H3K9me3 trace could be detected (i.e, in wild type, met-2, and met-2;set-25 mutants). In set-25 single mutants, where no statistically significant H3K9me3 footprint could be detected, anti-gfp RNAi was only weakly inherited. Previously, set-25 mutants were reported to be deficient in heritable RNAi responses targeting different fluorescent transgenes (Ashe et al., 2012; Lev et al., 2017).
RNAi silencing of the endogenous oma-1 gene is also inherited transgenerationally. In contrast to anti-gfp heritable RNAi responses, for which H3K9me3 is important, we detected an enhancement in the inheritance potency of anti-oma-1 RNAi in set-25 mutants (Lev et al., 2017). However, in that study we did not examine whether an H3K9me3 footprint was deposited on the endogenous gene oma-1 in the set-25 background (Lev et al., 2017). The publication of a recent paper (Kalinava et al., 2017) which described strong anti-oma-1 RNAi inheritance in met-2;set-25;set-32 triple mutants, despite the absence of a detectable H3K9me3 footprint, prompted us to re-examine the inheritance of anti-gfp RNAi in this triple mutants. We hypothesized that gene-specific characteristics lead to contrasting requirements for H3K9me3 and specific methyltransferases. In this manuscript, we describe an asymmetry in the requirement for H3K9me3 and specific methyltransferases for heritable RNAi responses aimed against the endogenous gene oma-1 and the foreign gene gfp. These differences led us to perform a genome-wide analysis of H3K9 methyltransferase-dependent small RNAs, which revealed that the endo-siRNAs, which depend on H3K9me3 target newly acquired C. elegans genes that might be considered ‘foreign’, similarly to gfp.
Results
Recently Kalinava et al. examined the heritable RNAi responses against oma-1 also in a triple mutant, lacking the three main C. elegans H3K9 methyltransferases, SET-25, SET-32 and MET-2 (Kalinava et al., 2017). The authors reported that silencing of oma-1 was independent of H3K9me3, as in these mutants RNAi responses raised against the oma-1 gene were heritable despite the lack of an H3K9me3 trace (Kalinava et al., 2017).
We successfully replicated the results of Kalinava et al., and came to the same conclusion, that the met-2;set-25;set-32 triple mutant worms inherit RNAi responses against the oma-1 gene, also when we used a different assay for inheritance (Figure 1A and Figure 1B, upper panel). Unlike Kalinava et al., which used qPCR to score for downregulation of oma-1 expression, we targeted a redundant, temperature-sensitive and dominant oma-1 allele, that in the restrictive temperatures does not allow the development of embryos unless silenced (as previously described [Alcazar et al., 2008]). Upon shifting to 20 degrees, only worms that silence the oma-1 gene in a heritable manner survive.
In parallel we discovered, surprisingly, that in contrast to anti-oma-1 inheritance, heritable silencing of a gfp transgene was defective in the same triple mutants (Figure 1C, upper panel, p=0.0014, 2-way ANOVA). In addition, we also confirmed (Spracklin et al., 2017) that while set-32 single mutants are deficient in inheriting RNAi responses raised against the gfp transgene (Figure 1C, lower panel, p=0.0026, 2-way ANOVA), they are capable (Kalinava et al., 2017) of inheriting responses raised against oma-1 (Figure 1B, lower panel, p=0.8487, 2-way ANOVA). Previously we have shown that while set-25 mutants are defective in inheritance of anti-gfp RNAi, weak inheritance responses can still be observed (Lev et al., 2017). Similarly, we were able to detect weak inheritance responses that last at least until the F3 generation also in met-2;set-25;set-32 and set-32 mutants (Figure 1—figure supplement 1, p-value < 0.0001 for met-2;set-25;set-32 and set-32 in the F3 generation, Two-way ANOVA). Together with our previous data, which showed that set-25 is required for inheriting anti-gfp RNAi, but not anti-oma-1 RNAi (Lev et al., 2017), these results suggested that heritable RNAi requires H3K9 methyltransferases in a gene-specific manner.
The levels of RNAi-induced H3K9me3 do not explain the gene-specific requirements of methyltransferases for heritable RNAi
Histone methyltransferase mutants may affect RNAi-induced H3K9me3 levels in a gene-specific manner, thus leading to different inheritance dynamics for each gene. To test this possibility, we performed anti-H3K9me3 Chromatin Immunoprecipitation (ChIP) on F1 met-2;set-25;set-32 triple mutant progeny, that were derived from parents exposed to anti-oma-1 RNAi, anti-gfp RNAi, or untreated controls. Using qPCR we found, as was discovered before (Kalinava et al., 2017) that in met-2;set-25;set-32 triple mutants the RNAi-induced H3K9me3 signal was significantly reduced (p-value=0.0007 and 0.0009, Two-way ANOVA, for gfp and oma-1, respectively). Importantly, this was true for both the oma-1 and gfp loci (Figure 2A). Interestingly, in naive wild-type animals, that were not treated with RNAi, the levels of H3K9me3 on gfp were significantly higher than on oma-1 (Figure 2B, p-value = 0.0039, Two-Way ANOVA) and an additional germline-expressed gene dpy-28 (Figure 2B, p-value = 0.0176, student's t-test). We discuss the possible contribution of this RNAi-independent H3K9me3 signal below. Regardless, as no differences can be found in the RNAi-induced fold changes in H3K9me3 levels between gfp and oma-1 (Figure 2A), the levels of RNAi-induced H3K9me3 cannot explain the gene-specific requirements of methyltransferases for heritable RNAi.
SET-32 acts upstream to MET-2 and SET-25 to support RNAi inheritance
We previously found that in contrast to set-25 single mutants, which are deficient in RNAi-induced heritable H3K9me3 methylation (Lev et al., 2017; Mao et al., 2015), met-2;set-25 double mutants display a modest but robust H3K9me3 footprint following RNAi (Kalinava et al., 2017; Lev et al., 2017). We therefore hypothesized that in the met-2 background, an additional, perhaps otherwise inactive H3K9 methyltransferase, is expressed or activated, compensating for the absence of SET-25, to allow efficient heritable RNAi responses (see Figure 1—figure supplement 2 for summary). To test this hypothesis, we first examined whether met-2;set-32 double mutants can inherit RNAi responses raised against gfp. If SET-32 and SET-25 compensate for each other and are redundant, then met-2;set-32 double mutants are expected to strongly inherit RNAi responses, similar to met-2;set-25 double mutants (Lev et al., 2017). Our results show, that in contrast to met-2;set-25 double mutants, met-2;set-32 double mutants are defective in RNAi inheritance raised against gfp, since only a very weak response can be detected (Figure 1—figure supplement 2A). The potency of RNAi inheritance in met-2;set-32 double mutants is comparable to that of set-25 (Lev et al., 2017) and set-32 single mutants, or met-2;set-25;set-32 triple mutants (Figure 1C). These results suggest that SET-32 has a distinct role, and that it probably acts upstream to MET-2 and SET-25, in promoting RNAi inheritance. This conclusion is also consistent with the recent observation that SET-32, in contrast to MET-2 and SET-25, has an essential role in the establishment of RNAi-mediated nuclear silencing (Kalinava et al., 2018).
Unlike RNAi silencing of oma-1, silencing of sup-35 and fog-2 genes is not inherited transgenerationally
Currently, the only gene that serves to study heritable transgenerational (more than two generations) RNAi of endogenous genes is oma-1. Transgenerational RNAi inheritance requires the target gene to be expressed in the germline, and many germline genes are essential or do not have a phenotype that can be scored over many generations. The oma-1 gene can serve as a tool for studying RNAi inheritance owing to the availability of a temperature-sensitive, dominant-lethal and redundant allele that can be rescued by RNAi (Alcazar et al., 2008). In search of other endogenous target genes whose heritable silencing could be studied, we examined the inheritance of RNAi against the non-essential germline genes sup-35 and fog-2. SUP-35 is a maternally deposited toxin, expressed in the mother’s germline, suppressed by PHA-1, a zygotically expressed anti-toxin (Ben-David et al., 2017). Consequently, temperature-sensitive pha-1(e2123) mutants develop when grown at 15 degrees but arrest their development when grown in restrictive temperatures, unless exposed to anti-sup-35 RNAi. As previously described (Ben-David et al., 2017), RNAi silencing of sup-35 allowes pha-1 mutants to develop. However, we found this response was not inherited beyond the F1 generation (Figure 1—figure supplement 3A). Expression of the germline gene fog-2 is required for hermaphrodite worms to produce sperm, but is dispensable for sperm production in males (Schedl and Kimble, 1988). Silencing of fog-2 by RNAi lead to depletion of sperm (as evident by stacked oocytes), and the worms were unable to reproduce unless crossed with a male. While we found that this response was inherited to the F1 progeny, it was not inherited transgenerationally (Figure 1—figure supplement 3B). In conclusion, we could not find additional endogenous gene targets that can be transgenerationally silenced upon RNAi. Conveniently, many endo-siRNAs that target various endogenous genes are inherited transgenerationally, and such inheritance can be studied using RNA sequencing.
H3K9me3 methyltransferases are required for the biogenesis of a specific class of endo-siRNAs
Certain germline small RNAs have evolved to confer immunity against foreign genetic elements, while sparing endogenous genes (Malone and Hannon, 2009). The different requirements for particular methyltransferases and H3K9me3 for heritable silencing of gfp and oma-1 may be connected to the fact that gfp is a ‘foreign’ gene, while oma-1 is an endogenous gene. We previously found that exogenous siRNAs that target gfp are lost in set-25 mutants, and hypothesized that endo-siRNAs that target other ‘foreign’ genes would be likewise affected. Therefore, we re-analyzed our previously published small RNA sequencing data, obtained from set-25 mutants (Lev et al., 2017). However, among the targets of these differentially expressed endo-siRNAs, we could not detect striking changes (fold change >1.2) in endo-siRNAs that target transposons and repetitive elements in set-25 mutants (Figure 3A, left panel). In contrast, a subset of endo-siRNAs that target 279 different protein-coding genes was found to exhibit significant changes in set-25 mutants (adj.p <0.1, DESeq2 Figure 3A, right panel). To understand why these small RNAs are uniquely affected by SET-25, we characterized this group and the endo-siRNAs that target them. To compare the endo-siRNA pools that depend on these two H3K9 tri-methyltrasferases, we re-analyzed the recently published small RNA-seq data obtained from set-32 mutants (Kalinava et al., 2018).
Since in set-25 the loss of exogenous siRNAs coincided with the loss of heritable RNAi-induced H3K9me3 (Lev et al., 2017), we first tested whether genes that were differentially targeted by endo-siRNAs in set-25 mutants were also marked by H3K9me3. By examining publicly available H3K9me3 data (McMurchy et al., 2017), we found that the 151 genes that lost the endo-siRNAs that target them in set-25 mutants were robustly marked by H3K9me3 in wild type animals (Figure 3B). We also found that in contrast, the 128 genes that had increased endo-siRNA levels that target them in set-25 and mutants were not significantly marked by H3K9me3 (Figure 3—figure supplement 1A). By analyzing an available mRNA-seq dataset (Klosin et al., 2017), we also found a significant enrichment for genes that were upregulated (at the mRNA level) in set-25 mutants amongst the list of SET-25-dependent endo-siRNA targets (1.93-fold enrichment, 18/151 genes, p-value=0.006). This suggests that endo-siRNAs that depend on SET-25 silence targeted gene. Recently, Kalinava et al. sequenced endo-siRNAs from set-32 mutants (Kalinava et al., 2018). Our analysis show that the 337 genes that had reduced levels of endo-siRNAs (fold change >2) in set-32 mutants, were also significantly marked by H3K9me3 (Figure 3B). As expected, these genes showed lower levels of H3K9me3 in set-32 mutants (Figure 3—figure supplement 1B), and genes having increased levels of endo-siRNAs were not significantly marked by H3K9me3 (Figure 3—figure supplement 1A). Together, these results support the hypothesis that H3K9me3 methyltransferases directly support the biogenesis of silencing endo-siRNAs by tri-methylating the H3K9 histones of the endo-siRNAs targeted genes.
Next we examined whether genes that display altered endo-siRNAs levels in set-25 and set-32 mutants are expressed in specific tissues. Genes that had significantly reduced levels of endo-siRNAs targeting them in set-25 or in set-32 mutants exhibited significant, but modest, enrichment for expression in the germline (Figure 3C and Figure 3—figure supplement 1C). No significant enrichment was found for other tissues (Figure 3C and Figure 3—figure supplement 1C).
To identify the small RNA pathways which are affected by set-25 and set-32, we tested whether the differentially expressed endo-siRNAs depend on particular argonautes, or associate with specific biosynthesis or functional pathways (Figure 3D). It was previously suggested that the CSR-1 argonaute carries heritable endo-siRNAs that mark endogenous genes (Claycomb et al., 2009), while the HRDE-1 argonaute carries heritable endo-siRNAs that silence foreign or aberrant elements, whose expression could be deleterious, such as transposons (Luteijn et al., 2012; Rechavi, 2014; Shirayama et al., 2012). A strong and significant enrichment (Figure 3D and Figure 3—figure supplement 1C) was found for endo-siRNAs which are carried in the germline by the argonautes WAGO-1 (Gu et al., 2009) and HRDE-1, which is required for inheritance of exogenous siRNAs (Buckley et al., 2012). Both argonautes were found to be involved in gene silencing (Buckley et al., 2012; Gu et al., 2009). Nevertheless, some of the targets of HRDE-1-bound endo-siRNAs are expressed in the germline (Figure 3—figure supplement 2A). This may explain the concurrent enrichment for both germline-expressed genes and targets of HRDE-1-bound endo-siRNAs amongst the gene targets of endo-siRNAs that depend on SET-25 or SET-32. A significant enrichment was also found for Mutator pathway small RNAs (Zhang et al., 2011), ERGO-1-dependent small RNAs, and putative piRNA targeted genes (Bagijn et al., 2012). On the contrary, a significant depletion was found for genes known to be targeted by CSR-1-carried small RNAs, a pathway that was suggested to support the expression of targeted genes (Claycomb et al., 2009; Shen et al., 2018). The helicase EMB-4 (Akay et al., 2017; Tyc et al., 2017) was shown to preferably bind introns of genes targeted by CSR-1; We could not detect a significant enrichment for genes whose introns are bound by EMB-4 (fold change = 1.07 and 0.79, p-value=0.26 and 0.002, for endo-siRNAs dependent on SET-25 or SET-32, respectively). All together, these results suggest that H3K9 methyltransferases are required for the maintenance of a specific sub-class of HRDE-1 and WAGO-1 small RNAs, that are associated with the Mutator and piRNA pathways, and that target protein-coding genes (Figure 3—figure supplement 2B).
Endo-siRNAs that depend on H3K9me3 methyltransferases target a distinctive subset of newly evolved genes
What distinguishes the target genes of endo-siRNAs that depend on SET-25 and SET-32 methyltransferases? It was recently found that periodic A/T (PATC) sequences can shield germline genes from piRNA-induced silencing and allow germline expression of genes in H3K9me3-rich genomic regions (Frøkjær-Jensen et al., 2016; Zhang et al., 2018). Fittingly, we found that genes targeted by SET-25-dependent and SET-32-dependent endo-siRNAs exhibit a moderate (~9–13% in median values) but significant reduction in PATC density compared to all protein coding genes (Figure 4A, p-value = 0.0026 and 0.0011 for SET-25 and SET-32, respectively). This feature is not general for genes targeted by WAGOs (Worm-specific Argonautes, HRDE-1, WAGO-1 and ERGO-1) associated endo-siRNAs, since these targeted genes have a higher PATC density (Figure 4A, 10% increase in average values, p-value=0.034). In addition, genes targeted by endo-siRNAs that are increased in set-25 or set-32 mutants exhibit significantly increased PATC density (Figure 4—figure supplement 1A). However, we posit that this feature is unlikely to be sufficient for distinguishing between oma-1 and gfp, since the oma-1 gene has a very low PATC density (Figure 4—figure supplement 1B).
The lists of genes which are targeted by SET-25- and SET-32-dependent endo-siRNAs were enriched for genes targeted by ERGO-1-dependent endo-siRNAs (Figure 3D). Many of the genes that are targeted by ERGO-1-bound endo-siRNAs are duplicated genes (Fischer et al., 2011; Vasale et al., 2010). Accordingly, we found an enrichment for duplicated genes amongst the genes that had reduced endo-siRNA levels targeting them in set-25 and set-32 mutants (Figure 4B). An additional characteristic of the set of genes targeted by ERGO-1 endo-siRNAs is an enrichment for poorly conserved genes, that have fewer introns, and possess splicing site sequences that diverge from the consensus sequence (Fischer et al., 2011; Newman et al., 2018). It was recently suggested that these poorly conserved genes are targeted for silencing because their aberrant or”non-self-like’ splicing signals are detected by the splicing machinery (Newman et al., 2018).
Therefore, we examined whether the targets of the endo-siRNAs that depend on SET-25 or SET-32 can be distinguished by their splicing signals. The changes in the endo-siRNA pool in mutants of small nuclear ribonucleoprotein-associated protein RNP-2/U1A (rnp-2) mirrored the endo-siRNA changes found in set-25 mutants (Figure 4C), but not that of set-32 mutants (Figure 4—figure supplement 2A). We also found that genes targeted by SET-25-dependent- but not SET-32-dependent endo-siRNAs bear fewer introns (Figure 4D, median of 3 and 4 compared to 4 of all protein coding genes, p-values=0.0047, and 0.42 for SET-25 and SET-32, respectively). No significant differences in the length of the coding sequences were found, hence, the difference in intron number does not simply derive from differences in gene lengths (Figure 4—figure supplement 2B, p-value = 0.8673). The lists of genes targeted by SET-25-dependent or SET-32-dependent endo-siRNAs were enriched with genes shown to be targeted by intron-targeting small RNAs (Figure 4—figure supplement 2C and D). We could not find, however, small RNAs aligning to the introns of the gfp transgene that we studied (Figure 4—figure supplement 2E, in most cases endo-siRNAs target only exons). We also did not find significant differences in the splicing motif divergence score (obtained from Newman et al., 2018). Since splicing also directly affects the RNAi machinery untangling its role in endogenous RNAi is challenging (Newman et al., 2018). In summary, splicing may be one of the factors that contribute to distinguishing genes targeted by SET-25-dependent endo-siRNAs, but not by SET-32-dependent endo-siRNAs.
In contrast, in the sets of genes targeted by either SET-25-dependent or SET-32-dependent small RNA we found a significant enrichment for newly evolved genes (Figure 4B, fold-change = 2.57 and p-value<0.0001 for both SET-25- and SET-32-dependent endo-siRNAs targets, respectively). We define newly evolved genes here as genes which had no orthologs outside C. elegans (35/151 and 78/337 of genes targeted by SET-25- or SET-32-dependent endo-siRNAs, respectively). Concordantly, in the same gene sets we also found a significant depletion for nematode-conserved genes (Figure 4B). Importantly, the sets of genes targeted by SET-25-dependent endo-siRNAs and SET-32-dependent endo-siRNAs show very small overlap (25 out of 465 genes). Thus, while SET-25 and SET-32 are required for the maintenance of endo-siRNAs that target different genes, the characteristics of these genes are very similar, that is they are distinctively newly evolved genes that have slightly lower levels of PATC sequences. Although the changes in PATC density and intron numbers that distinguish these target genes are moderate, it is possible that the cumulative effect of these small differences may result in the exposure of foreign genes that need to be silenced.
In general, we find that certain sub-classes of endo-siRNA, such as ERGO-1 and HRDE-1 bound small RNAs, target gene sets enriched for newly evolved genes (Figure 4—figure supplement 3). The significant enrichment for newly evolved genes among SET-25- and SET-32-dependent endo-siRNAs is maintained, however, even after excluding genes that are also targeted by HRDE-1, ERGO-1, WAGO-1 or Mutator endo-siRNAs (SET-25: 59/151 genes are not shared, fold enrichment = 2.97, p-value=0.0001, SET-32: 153/337 genes are not shared, fold-enrichment = 1.89, p-value=0.0012). Thus, the enrichment of newly evolved genes amongst the targets of SET-25- and SET-32-dependent endo-siRNAs is not simply due to a general preference for newly evolved genes by endo-siRNA pathways. Further, we find that newly evolved genes are marked by higher levels of H3K9me3 in comparison to the average level of H3K9me3 on protein coding genes (Figure 4—figure supplement 3). Likewise, in the absence of RNAi, in wild-type animals, gfp, the example for a foreign (non-nematode) gene that we investigated, has higher levels of H3K9me3, in comparison to the well-conserved oma-1 gene (Figure 2B). The fact that across the genome SET-25-dependent- and SET-32-dependent endo-siRNAs target newly evolved and H3K9me3 methylated genes (Figure 3B and Figure 4B), may explain why inheritance of RNAi responses raised against gfp, but not oma-1, depends on SET-25 and SET-32 (Figure 1).
In summary, our experiments reveal a specific role for histone modifications in small RNA inheritance. While in S. pombe and A. thaliana a feedback between H3K9me3 and small RNAs was suggested to be required for silencing, the worm’s RNAi inheritance machinery may use H3K9me3 as a mark that distinguishes genes identified as ‘new’. Since newly evolved genes can be disruptive, small RNAs survey these H3K9me3-flagged elements transgenerationally.
Discussion
Our study began from an investigation of a perplexing asymmetry in the requirement of specific H3K9 methyltransferases for heritable silencing of the endogenous gene oma-1 and the ‘foreign’ gene gfp. Single mutants of set-25 and set-32 and the met-2;set-25;set-32 triple mutant displayed different heritable dynamics when either the gfp or the oma-1 gene were targeted by RNAi. These results are not unique to the specific gfp transgene that was tested, since similar observations have been made with other transgenes (Klosin et al., 2017; Lev et al., 2017; Shirayama et al., 2012; Spracklin et al., 2017).
Unlike mutations in these histone methyltransferases, which negatively affect heritable silencing of gfp, but not oma-1, mutations in genes required for small RNA inheritance negatively affect heritable silencing of both oma-1 and gfp. For example, the argonaute HRDE-1 is required for inheritance of RNAi responses against both genes (Ashe et al., 2012; Buckley et al., 2012; Kalinava et al., 2017; Shirayama et al., 2012; Weiser et al., 2017). The fact that heritable RNAi responses aimed at different genes are affected by different proteins should be taken into account when studying transgenerational inheritance. Specifically, when screening for genes that affect such inheritance, one must acknowledge that heritable silencing of different targets requires different chromatin modifiers.
Future studies will hopefully reveal why some recently evolved genes, but not others, display high levels of H3K9me3 (in the absence of RNAi), and are targeted by endo-siRNAs. Recent studies examined why transgenes are sensitive to silencing by synthetic piRNAs, while endogenous germline expressed genes, including oma-1, are not. This protection was suggested to be conferred at least in part by PATC sequences, and to be independent of the genomic location of the gene (Zhang et al., 2018). PATC sequences were previously shown to allow expression of transgenes in the germline in heterochromatic areas (Frøkjær-Jensen et al., 2016). Similarly, our analysis revealed that the gene targets of SET-25-dependent and SET-32-dependent endo-siRNAs have lower levels of PATC density (Figure 4A). However, the oma-1 gene does not possess many PATC sequences (Figure 4—figure supplement 1B). An additional theory suggested that an intrinsic unknown coding-sequence feature confers resistance to silencing by piRNAs. Seth et al. have studied why a fusion between oma-1 and gfp can trans-activate silenced gfp transgenes (an effect known as ‘RNAa’,(Seth et al., 2013)). While unique ‘protective’ sequence features were not described in that work, the authors showed that an unknown coding-sequence feature, not related to the codon usage or the translation of the protein, grants the oma-1 gene with its ability to activate silenced transgenes (Seth et al., 2018). It is possible that the gene targets of SET-25- dependent and SET-32-dependent small RNAs that we describe here have unique intrinsic sequences that distinguish them as well. The different requirement of methyltransferases for heritable silencing of some genes but not others may be related to such intrinsic sequence features. Alternatively, it is possible, as was suggested in the past, that new genes are silenced because they are not licensed transgenerationally by heritable small RNAs for expression (Claycomb et al., 2009; Shen et al., 2018). If this is the case, future studies will hopefully reveal how such license is granted (See Figure 5 for Scheme).
Materials and methods
Cultivation of the worms
Request a detailed protocolStandard culture techniques were used to maintain the nematodes on nematode growth medium (NGM) plates seeded with OP50 bacteria. Extreme care was taken to avoid contamination or starvation, and contaminated plates were discarded from the analysis.
RNAi bacteria
Request a detailed protocolHT115 Escherichia coli strains expressing dsRNAs were used: anti-oma-1 RNAi bacteria were obtained from the Ahringer RNAi library (Kamath and Ahringer, 2003). Anti-fog-2 and anti-sup-35 RNAi were obtained from the Vidal RNAi library (Rual et al., 2004). For the sequence of the anti-gfp RNAi see supplemental data.
RNAi experiments
Request a detailed protocolRNAi HT115 E.coli bacteria were incubated in lysogeny broth (LB) containing Carbenicillin (25 μg/mL) at 37°C overnight with shaking. Bacterial cultures were seeded onto NGM plates containing isopropyl β-D-1-thiogalactopyranoside (IPTG; 1 mM) and Carbenicillin (25 μg/mL) and grown overnight in the dark at room temperature. Five L4 animals were placed on RNAi bacteria plates and control empty-vector bearing HT115 bacteria plates and maintained at 20°C for 2 days and then removed. The progeny hatching on these plates was termed the P0 generation. In the next generations the worms were grown on E.coli OP50 bacteria. For anti-gfp RNAi experiments, four L4 animals were placed on plates for two days to lay the next generation. In every generation approximately 60 one day adult worms were collected and photographed per condition (see below). For anti-oma-1 experiments, in each generation twelve individual L4 staged worms were placed in individual wells of a twelve well plate. Four days later the number of fertile worms was assessed (at least one progeny) and 12 individual L4 progeny worms were chosen from the most fertile well to continue to the next generation. For and anti-sup-35 RNAi experiments, in each generation 12 individual L4 staged worms were placed in individual wells of a 12-well plate. Two days later the adult worms were removed. Two days later the number of developing worms was counted and twelve individual L4 progeny worms were chosen from the well with the highest number of developing progeny to continue to the next generation. For and anti-fog-2 RNAi experiments, five L4 worms were crossed on RNAi bacteria for 24 hr. The crossed worms were transferred to fresh RNAi bacteria plates. In each generation, five resulting L4 progeny from the cross were crossed on control bacteria plates and ~40 L4 worms were picked to control bacteria plates and photographed a day later. The number of sterile worms with stacked oocytes was assayed.
Germline GFP expression analysis
Request a detailed protocolPercentage silencing analysis: for each condition, around 60 animals were mounted on 2% agarose slides and paralyzed in a drop of M9 with 0.01% levamisole/0.1% tricaine. The worms were photographed with 10x objective using a BX63 Olympus microscope (Exposure time of 200 ms, and gain of 2). The images were analyzed with ImageJ2 software, and the percentage of worms lacking any observable germline GFP signal was calculated.
GFP expression level analysis: for each condition, the GFP fluorescence level of the background and of oocyte nuclei of at least 30 worms was calculated using ImageJ2.
CTCF value was calculated as follows: CTCF = Integrated density of selected object X – (area of selected object X * mean fluorescence of background readings). The obtained CTCF value was normalized to the average CTCF value obtained from photographs of control animals of the same genotype, generation and age which were fed on control plates.
Chromatin immunoprecipitation
Request a detailed protocolChromatin immunoprecipitation experiments were conducted as described in Lev et al. (2017). For anti-H3K9me3 ChIP experiments the abcam, ab8898 antibodies were used.
qPCR reactions
Request a detailed protocolAll Real time PCR reactions were performed using the KAPA SYBR Fast qPCR and run in the Applied Biosystems 7300 Real Time PCR System.
The primer sequences used in qRT-PCR:
gfp set #1 FOR: ACACAACATTGAAGATGGAAGC
gfp set #1 REV: GACAGGTAATGGTTGTCTGG
gfp set #2 FOR: GTGAGAGTAGTGACAAGTGTTG
gfp set #2 REV: CTGGAAAACTACCTGTTCCATG
oma-1 set#1 FOR: AACTTTGCCCGTTTCACC
oma-1 set#1 REV: TCAAGTTAGCAGTTTGAGTAACC
oma-1 set#2 FOR: TTGTTAAGCATTCCCTGCAC
oma-1 set#2 REV: TCGATCTTCTCGTTGTTTTCA
(The above primer set was adapted from Spracklin et al., 2017)
dpy-28 FOR: CTGATGGATCCAGAGTTGG
dpy-28 REV: CTGCTATACGCATCCTGTTC
eft-3 FOR: CCAACATGATTAGTCAGATGACC
eft-3 REV: CTAGGAGTTAGATGTGCAGG.
Information on the sequencing libraries analyzed in this paper
Request a detailed protocolAll the studied publicly available sequencing libraries were prepared from synchronized young adult worms grown at 20 degrees (Kalinava et al., 2018; Klosin et al., 2017; Lev et al., 2017; McMurchy et al., 2017). For more information see the original publications and GEO information: A. set-25 small RNAs (Rechavi and Lev, 2017; GEO accession: GSE94798), B. set-25 mRNA (Klosin et al., 2017; GEO accession: GSE83528), C. set-32 small RNAs (Kalinava et al., 2018; GEO accession: GSE117662, the set-32 (red11) allele data were used)., D. set-32 and wild type H3K9me3 ChIP-seq (Kalinava et al., 2018; GEO accession: GSE117662). E. wild type H3K9me3 ChIP-seq (McMurchy et al., 2017; GEO accession: GSE87524).
Bioinformatic genome-wide endo-siRNAs analysis
Request a detailed protocolSmall RNA analysis was conducted as previously described (Lev et al., 2017). Briefly, adapters were cut from the reads using Cutadapt (Martin, 2011). Reads that were not cut or were less than 19 bp long, were removed. The quality of the libraries was assessed by FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Reads were mapped to the C. elegans genome (WS235) using Bowtie2 (Langmead and Salzberg, 2012). In total 31,053,062, 21,913,420 and 18,372,739 reads were mapped in the three wild type biological repeats and 21,258,241, 19,925,004 and 21,391,091 reads were mapped in the three set-25 biological repeats. The mapped reads were then counted using the python script HTseq_count (Langmead and Salzberg, 2012) using. gff feature file from wormbase.org (version WBcel235). Differential expression was analyzed using DESeq2 (Love et al., 2014). p-adjusted value <0.1 was regarded as statistically significant. The set-32 data from Kalinava et al (Kalinava et al., 2018) GEO was analyzed in a similar fashion. The reads the 5' barcode and 3' linker were trimmed using Cutadapt (Martin, 2011), in accordance the information supplied by Kalinava et al in GEO (accession number: GSE117662). Next, reads were filtered to lengths of 20–23 bp and aligned (not allowing mismatches) to the C. elegans genome (ce10) by Shortstack (Axtell, 2013). In total 1,342,884 and 1,006,568 reads were mapped in the wild type and set-32(red11) small RNA samples, respectively. The reads mapping to each genomic were counted by HTseq_count (Langmead and Salzberg, 2012). Since one biological sample was available, significantly altered small RNAs were defined as genes having fold change of larger than 2 (up-regulated) or smaller than 0.5 (downregulated).
Bioinformatic genome-wide analysis of H3K9me3 signal
Request a detailed protocolFor analysis of H3K9me3 signal on different genes in wild type worms, the processed H3K9me3 data (aligned and normalized) from the McMurchy et al. study was used (McMurchy et al., 2017; GEO accession GSE87524). The shown H3K9me3 signal represents the averaged H3K9me3 signal in two replicates of young adults. For analysis of the H3K9me3 levels in wild type and set-32 mutants the raw data from the Kalinava et al. study was used (Kalinava et al., 2018; GEO accession: GSE117662). The raw data were analyzed in a similar fashion to the analysis conducted by McMurchy et al. Briefly, adaptors were trimmed using Cutadapt (Martin, 2011) and aligned using Bowtie2 (Langmead and Salzberg, 2012). H3K9me3-enriched regions were identified using MACS2 (Lupien et al., 2008) and the H3K9me3 signal was corrected for biases using BEADS (Cheung et al., 2011).
Bioinformatic mRNA expression analysis
Request a detailed protocolProcessed files with raw counts of reads mapping to each gene were downloaded from GEO (Klosin et al., 2017; GEO accession: GSE83528). Differential expressed genes were detected using DESeq2 (adjusted p-value<0.1).
Bioinformatic gene enrichment analysis
Request a detailed protocolThe enrichment values denote the ratio between (A) the observed representations of a specific gene set within a defined differentially expressed genes group, to (B) the expected one, that is the representation of the examined gene set among all protein-coding genes in C. elegans. The analysis was done for 15 gene sets: (1) 7727 genes enriched in oocytes gonads (Ortiz et al., 2014) and 9012 genes enriched in spermatogenic gonads (Ortiz et al., 2014); we excluded genes with expression lower than 1 RPKM(2) 11427 genes expressed in isolated neurons (Kaletsky et al., 2016). (3) 7176 genes expressed in intestine (Gerstein et al., 2010) (4) 2957 genes expressed in pharynx (Gerstein et al., 2010) (5) 2526 genes expressed in body muscle (Gerstein et al., 2010) (6) 4146 targets of CSR-1 (Claycomb et al., 2009) (7) 1478 targets of HRDE-1 (Buckley et al., 2012) (8) 87 targets of WAGO-1 (Gu et al., 2009) (9) 399 targets of ALG-3/4 class small RNAs (Conine et al., 2010) (10) 1823 targets of mutator class small RNAs (11) 721 EGO-1 dependent small RNA gene targets (Maniar and Fire, 2011), (12) 23 gene targets of small RNAs up-regulated in ego-1 mutants (Maniar and Fire, 2011), (13) 49 genes targeted by 26G-RNAs enriched in ERGO-IP (Vasale et al., 2010) (14) 77 genes depleted of 22G-RNAs in ergo-1 mutants (Vasale et al., 2010), and (15) 348 putative piRNA gene targets (Bagijn et al., 2012). The putative piRNA gene targets were defined as genes for which, in at least one transcript, the ratio of the # 22G-RNA reads at piRNA target sites between wild type to prg-1 is at least 2 (linear scale). Note that the indicated number above achieved after intersection between the various published data sources and the records appears in the *.gff file used by us.
The enrichment value of a given gene set i in differentially expressed gene targeting small RNAs was calculated using the following formula:
Obtaining the observed-to-expected ratios, we then calculated the corresponding p-values using 10,000 random gene groups identical in size to that of the examined group of differentially expressed genes. Next, the enrichment values of the random sets are ranked and the p-value is determined by the ranking of the examined gene set amongst the ranking of all enrichment values of the random sets.
Gene sets by conservation
Request a detailed protocolThe classification of gene sets by conservation was done by mining the ‘Homology’ field of all the C. elegans protein-coding genes in WormBase (www.wormbase.com). We defined the following three gene sets (Figure 4B):
Unique to C. elegans – C. elegans genes which have no orthologues gene in any of the following species: B. malayi, C. brenneri, C. briggsae, C. japonica, C. remanei, O. volvulus, P. pacificus and S. ratti.
Caenorhabditis only - C. elegans genes which have at least one orthologues gene in one of the C. brenneri, C. briggsae, C. remanei and C. japonica species, and have no orthologues gene in any of the B. malayi, O. volvulus, P. pacificus and S. ratti species.
Conserved among nematodes - C. elegans genes which have at least one orthologues gene in one of the C. brenneri, C. briggsae, C. remanei and C. japonica species, and in addition have at least one orthologues gene in one of the B. malayi, O. volvulus, P. pacificus and S. ratti species.
Statistical analysis
Request a detailed protocolFor RNAi experiments, Two-way ANOVA tests were used to compare the percentages of the RNAi-affected worms (GFP silencing or fertility for the oma-1 assay) between the tested genotypes. In cases of multiple comparisons between genotypes and across generations, Sidak multiple comparison tests were applied. For GFP fluorescence experiments, Two-way ANOVA tests were used to compare the normalized GFP expression levels between the genotypes and across the biological repeats. For H3K9me3 qPCR-ChIP experiments Two-way ANOVA tests were used to compare the delta-delta-Ct (or delta-Ct) values between the gfp and the oma-1 loci obtained using two different primer sets. In cases of comparisons between genotypes and loci the Sidak multiple comparison tests were applied. Biological replicates were performed using separate populations of animals. Statistical tests were performed using GraphPad Prism software (Graphpad Prism) version 6. The statistical analysis used for each of the bioinformatics analyses is listed under the corresponding bioinformatics methods.
Data availability
All data generated or analyzed during this study are included in the manuscript and supporting files.
-
NCBI Gene Expression OmnibusID GSE94798. MET-2-Dependent H3K9 Methylation Suppresses Transgenerational Small RNA Inheritance.
-
NCBI Gene Expression OmnibusID GSE87524. A team of heterochromatin factors collaborates with small RNA pathways to combat repetitive elements and germline stress.
-
NCBI Gene Expression OmnibusID GSE117662. C. elegans Heterochromatin Factor SET-32 Plays an Essential Role in Transgenerational Establishment of Nuclear RNAi-Mediated Epigenetic Silencing.
-
NCBI Gene Expression OmnibusID GSE83528. Transgenerational transmission of environmental information in C. elegans.
References
-
RNA interference in the nucleus: roles for small RNAs in transcription, epigenetics and beyondNature Reviews Genetics 14:100–112.https://doi.org/10.1038/nrg3355
-
Systematic bias in high-throughput sequencing data and its correction by BEADSNucleic Acids Research 39:e103.https://doi.org/10.1093/nar/gkr425
-
RNA-mediated epigenetic regulation of gene expressionNature Reviews Genetics 16:71–84.https://doi.org/10.1038/nrg3863
-
Fast gapped-read alignment with bowtie 2Nature Methods 9:357–359.https://doi.org/10.1038/nmeth.1923
-
Extremely stable Piwi-induced gene silencing in Caenorhabditis ElegansThe EMBO Journal 31:3422–3430.https://doi.org/10.1038/emboj.2012.213
-
PIWI-interacting RNAs: from generation to transgenerational epigeneticsNature Reviews Genetics 14:523–534.https://doi.org/10.1038/nrg3495
-
Studies on the mechanism of RNAi-dependent heterochromatin assemblyCold Spring Harbor Symposia on Quantitative Biology 71:461–471.https://doi.org/10.1101/sqb.2006.71.044
-
A new dataset of spermatogenic vs. oogenic transcriptomes in the nematode caenorhabditis elegansG3: Genes|Genomes|Genetics 4:1765–1772.https://doi.org/10.1534/g3.114.012351
-
Guest list or black list: heritable small RNAs as immunogenic memoriesTrends in Cell Biology 24:212–220.https://doi.org/10.1016/j.tcb.2013.10.003
-
fog-2, a germ-line-specific sex determination gene required for hermaphrodite spermatogenesis in Caenorhabditis ElegansGenetics 119:43–61.
Article and author information
Author details
Funding
Israel Science Foundation (1339/17)
- Itamar Lev
- Hila Gingold
- Oded Rechavi
European Research Council (335624)
- Itamar Lev
- Hila Gingold
- Oded Rechavi
Adelis Foundation (01430001000)
- Oded Rechavi
Paul G. Allen Family Foundation
- Oded Rechavi
The funders had no role in study design, data collection, and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank all the Rechavi lab members for the helpful comments and fruitful discussions. We thank Yael Mor for the fruitful discussions and asistance with formulating the newly evolved genes hypothesis. Some strains were provided by the CGC, which is funded by NIH Office of Research Infrastructure Programs (P40 OD010440). We thank Yosef Shiloh, Yael Ziv, for their assistance and advice. Special thanks to Dror Cohen for the illustrations that he contributed. This work was supported by the ERC (grant #335624) and the Israel Science Foundation (grant #1339/17) and OR gratefully acknowledges the support of the Allen Discovery Center of the Paul G Allen Frontiers Group and the support of the Adelis foundation (no. 01430001000).
Copyright
© 2019, Lev et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 3,311
- views
-
- 485
- downloads
-
- 35
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Chromosomes and Gene Expression
- Developmental Biology
About 70% of human cleavage stage embryos show chromosomal mosaicism, falling to 20% in blastocysts. Chromosomally mosaic human blastocysts can implant and lead to healthy new-borns with normal karyotypes. Studies in mouse embryos and human gastruloids showed that aneuploid cells are eliminated from the epiblast by p53-mediated apoptosis while being tolerated in the trophectoderm. These observations suggest a selective loss of aneuploid cells from human embryos, but the underlying mechanisms are not yet fully understood. Here, we investigated the cellular consequences of aneuploidy in a total of 125 human blastocysts. RNA-sequencing of trophectoderm cells showed activated p53 pathway and apoptosis proportionate to the level of chromosomal imbalance. Immunostaining corroborated that aneuploidy triggers proteotoxic stress, autophagy, p53-signaling, and apoptosis independent from DNA damage. Total cell numbers were lower in aneuploid embryos, due to a decline both in trophectoderm and in epiblast/primitive endoderm cell numbers. While lower cell numbers in trophectoderm may be attributed to apoptosis, aneuploidy impaired the second lineage segregation, particularly primitive endoderm formation. This might be reinforced by retention of NANOG. Our findings might explain why fully aneuploid embryos fail to further develop and we hypothesize that the same mechanisms lead to the removal of aneuploid cells from mosaic embryos.
-
- Chromosomes and Gene Expression
- Developmental Biology
Transcription often occurs in bursts as gene promoters switch stochastically between active and inactive states. Enhancers can dictate transcriptional activity in animal development through the modulation of burst frequency, duration, or amplitude. Previous studies observed that different enhancers can achieve a wide range of transcriptional outputs through the same strategies of bursting control. For example, in Berrocal et al., 2020, we showed that despite responding to different transcription factors, all even-skipped enhancers increase transcription by upregulating burst frequency and amplitude while burst duration remains largely constant. These shared bursting strategies suggest that a unified molecular mechanism constraints how enhancers modulate transcriptional output. Alternatively, different enhancers could have converged on the same bursting control strategy because of natural selection favoring one of these particular strategies. To distinguish between these two scenarios, we compared transcriptional bursting between endogenous and ectopic gene expression patterns. Because enhancers act under different regulatory inputs in ectopic patterns, dissimilar bursting control strategies between endogenous and ectopic patterns would suggest that enhancers adapted their bursting strategies to their trans-regulatory environment. Here, we generated ectopic even-skipped transcription patterns in fruit fly embryos and discovered that bursting strategies remain consistent in endogenous and ectopic even-skipped expression. These results provide evidence for a unified molecular mechanism shaping even-skipped bursting strategies and serve as a starting point to uncover the realm of strategies employed by other enhancers.