Endogenous RNA interference is driven by copy number

Version of Record

Accepted for publication after peer review and revision.


Version of Record published: February 11, 2014
Accepted: December 29, 2013
Received: September 23, 2013



Abstract

A plethora of non-protein coding RNAs are produced throughout eukaryotic genomes, many of which are transcribed antisense to protein-coding genes and could potentially instigate RNA interference (RNAi) responses. Here we have used a synthetic RNAi system to show that gene copy number is a key factor controlling RNAi for transcripts from endogenous loci, since transcripts from multi-copy loci form double stranded RNA more efficiently than transcripts from equivalently expressed single-copy loci. Selectivity towards transcripts from high-copy DNA is therefore an emergent property of a minimal RNAi system. The ability of RNAi to selectively degrade transcripts from high-copy loci would allow suppression of newly emerging transposable elements, but such a surveillance system requires transcription. We show that low-level genome-wide pervasive transcription is sufficient to instigate RNAi, and propose that pervasive transcription is part of a defense mechanism capable of directing a sequence-independent RNAi response against transposable elements amplifying within the genome.

https://doi.org/10.7554/eLife.01581.001

eLife digest

Genes contain the code needed to make the proteins used by cells. This code is transcribed to make a messenger RNA molecule that is then translated to make a protein. However, other types of RNA called non-coding RNA molecules can disrupt this process by binding to messenger RNA molecules with matching sequences before translation begins. This phenomenon, which is known as RNA interference, involves enzymes called Dicer and Argonaute.

Many cells contain large numbers of non-coding RNA molecules—so called because they are not translated to produce proteins—and many of these are capable of starting the process of RNA interference. However, most do not, and the reasons for this are not understood. Now, work by Cruz and Houseley has provided new insight into this phenomenon by showing that it is related to the number of copies of the gene encoding such RNAs in the genome.

Yeast cells normally do not have the genes for RNA interference, but Cruz and Houseley used genetically engineered yeast cells containing Dicer and Argonaute. Although most of the messenger RNA molecules in these cells showed no change, the expression of some genes with high ‘copy numbers’ was reduced. Further experiments that involved adding more and more copies of other genes showed that RNA interference could selectively target messenger RNA molecules produced from genes with an increased copy number—particularly if the copies of the genes were clustered in one location in the genome.

RNA interference is also used to defend against DNA sequences that invade and multiply within a genome, such as viruses and other ‘genetic parasites’. As such, the effect observed by Cruz and Houseley could explain why entire genomes are often continuously copied to RNA at low levels. This activity would allow the monitoring of the genome for the invasion of any genetic parasites that had multiplied to high numbers. Following on from this work, the next challenge will be to understand how gene copy number and location are balanced to achieve a selective RNA interference system.

https://doi.org/10.7554/eLife.01581.002

Introduction

Over the past decade, our understanding of the complexity of the eukaryotic transcriptome has been revolutionized. Genome-wide sequencing studies in many organisms have revealed that protein-coding mRNAs are augmented by a multitude of non-protein coding RNAs (ncRNAs), many produced from regions of the genome traditionally considered to be transcriptionally silent (The ENCODE Project Consortium, 2012; Bertone et al., 2004; Cheng et al., 2005; David et al., 2006; Birney et al., 2007). Functional data for the vast majority of ncRNAs are currently lacking, with only a few examples characterized in any detail; however, the diversity of mechanisms by which these act suggests that ncRNAs have a rich and varied biology that is largely still to be sampled.

Long ncRNAs which overlap protein-coding genes have the potential to modulate the expression of their cognate coding RNA. Early characterized examples in yeast were thought to work by directly disrupting transcription factor or polymerase binding to the promoter of the coding RNA (Martens et al., 2004; Hongay et al., 2006); however, more recent data implicate specific chromatin structure changes in repression (Gelfand et al., 2011; Hainer et al., 2011), and many other cases of ncRNAs that alter chromatin modifications have been described (Camblong et al., 2007; Berretta et al., 2008; Houseley et al., 2008; Pinskaya et al., 2009; van Werven et al., 2012). Chromatin modifications are not necessarily repressive, and ncRNAs that enhance expression of their overlapping coding gene have also been described (Uhler et al., 2007; Hirota et al., 2008). In these examples, chromatin modifications are deposited during transcription, and therefore the act of transcription rather than the ncRNA itself is important. This is not always the case, and in higher eukaryotes multiple cis-acting ncRNAs have also been characterized, particularly as functional agents in imprinting. For example, Air and Kcnq1ot1 act in cis to deposit repressive chromatin marks and DNA methylation, but these ncRNAs interact with chromatin modifiers and allele specificity is achieved by restriction of the ncRNA to the vicinity of the transcription site, although the importance of the transcription itself remains controversial (Nagano et al., 2008; Pandey et al., 2008; Redrup et al., 2009; Latos et al., 2012).

Genomes are also replete with low abundance and unstable RNA. The vast majority of ncRNAs in budding yeast are unstable (Neil et al., 2009; van Dijk et al., 2011), limiting the potential action of the RNAs themselves, although the transcription of such RNAs can still alter gene expression (reviewed in Houseley, 2012). Such unstable RNAs are also widespread in higher eukaryotes, probably with similar functional roles (Chekanova et al., 2007; Preker et al., 2008). More mysterious is the pervasive transcription that permeates eukaryotic genomes; the ENCODE project found that almost all the human genome is transcribed at some point, but the products of this transcription are vanishingly rare (Cheng et al., 2005; Birney et al., 2007; Kapranov et al., 2007; Goodman et al., 2012). It appears that regions of the genome which are not actively transcribed for other reasons undergo pervasive transcription; however, it is not known whether this pervasive transcription simply represents transcriptional noise or whether the transcription or RNAs themselves have important but as yet undiscovered functions.

Systems in which a ncRNA is transcribed antisense to a sense protein-coding RNA are common and have strong regulatory potential (Figure 1A) (Derrien et al., 2012; Carninci et al., 2005; Xu et al., 2009). It has been suggested that, since antisense ncRNAs are perfectly complementary to their cognate mRNA, the two species could form double stranded RNA (dsRNA) that would be a substrate for the RNA interference system (RNAi). During a basic RNAi response, dsRNA is cleaved by the endonuclease Dicer into short interfering RNA (siRNA), of which one strand is then loaded onto an Argonaute protein. The Argonaute–siRNA complex can anneal to complementary sequences in target RNAs, which are then cleaved by the endonuclease activity of Argonaute. RNAi was originally discovered in Caenorhabditis elegans and rapidly linked to the phenomenon of post-transcriptional gene silencing in plants; however, almost all eukaryotes contain Dicer and Argonaute orthologues and therefore have some form of RNAi system (Hamilton and Baulcombe, 1999; Fire, 1998; Hannon, 2002). RNAi probably evolved to protect cells against dsRNA viruses, a role which is maintained in plants, insects, and lower eukaryotes (Ding, 2010) and has recently been described in mammalian cells (Li et al., 2013; Maillard et al., 2013). RNAi also forms a potent defense against transposons, and high-copy transposon-derived sequences are excellent targets for RNAi, giving rise to copious siRNAs in most eukaryotes including mammals (Yang and Kazazian, 2006; Slotkin and Martienssen, 2007; Babiarz et al., 2008). In addition to degrading transposon-derived and viral RNA, siRNAs can mediate transcriptional repression of target RNAs through chromatin modifications and DNA methylation, although this activity is seemingly much stronger in lower eukaryotes and plants than in mammals (Martienssen et al., 2005; Lejeune et al., 2010; Zhang and Zhu, 2011). 
However, the source of the dsRNA that is processed into siRNA is not always obvious, nor is the mechanism by which cells differentiate host and transposon-derived sequences. siRNA-mediated repression is complemented by PIWI-interacting RNAs (piRNAs), which bind to the Argonaute-related PIWI-domain proteins and enforce transposon repression in the germline of eukaryotes from worms to mammals (Siomi et al., 2011). piRNAs are derived from specific genomic clusters, but it is unclear how the transcripts from these clusters are selected for processing into primary piRNAs and many of the processing enzymes remain to be identified (reviewed in Ishizu et al., 2012).
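
The basic RNAi response outlined above (Dicer cleaves dsRNA into siRNAs, and an Argonaute-bound siRNA strand anneals to complementary target RNA) can be illustrated with a toy model. The sketch below is purely illustrative: the 23-nt fragment length, the function names and the idea of cutting at fixed intervals are assumptions made for the example, not properties reported in this study, and real Dicer and Argonaute activities are considerably more complex.

```python
# Toy model of a minimal RNAi response (illustrative assumptions throughout).
SIRNA_LEN = 23  # assumed siRNA length; not a value taken from the text

COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence."""
    return rna.translate(COMPLEMENT)[::-1]


def dice(sense: str, antisense: str, size: int = SIRNA_LEN) -> list:
    """'Dicer' step: if the strands are perfectly complementary (dsRNA),
    cut the antisense strand into consecutive guide fragments."""
    if reverse_complement(sense) != antisense:
        return []  # strands cannot pair, so there is no dsRNA to cleave
    return [antisense[i:i + size]
            for i in range(0, len(antisense) - size + 1, size)]


def argonaute_targets(guides: list, transcript: str) -> list:
    """'Argonaute' step: positions in the transcript where a guide can anneal."""
    hits = []
    for guide in guides:
        site = reverse_complement(guide)  # sequence the guide pairs with
        pos = transcript.find(site)
        if pos != -1:
            hits.append(pos)
    return sorted(hits)
```

In this model a perfectly complementary sense-antisense pair yields guides that each map back to the sense transcript, whereas a non-complementary pair yields no guides at all, which mirrors the requirement for dsRNA formation that the rest of the article examines.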

Figure 1

Frequency of annotated antisense non-protein coding RNAs (ncRNAs) and effects on mRNA abundance.

(A) Schematic of an example sense mRNA-antisense (ncRNA) system. (B) Number of annotated open reading frames (ORFs) with antisense transcripts. Positions of CUTs, SUTs, and XUTs were collated with expressed ORFs (Xu et al., 2009; van Dijk et al., 2011); SUTs that were later re-classified as XUTs were removed. Overlaps between ORFs expressed in glucose media (total 5171, Xu et al., 2009) and other RNAs were calculated and summed for increasing minimum overlaps of 50–500 bp. ORF–ORF overlaps and ORF–ncRNA overlaps were analyzed separately as ORF–ORF overlaps are consistently smaller. Detailed figures are given in Table 1. (C) Abundance of short interfering RNAs (siRNAs) in RNA interference (RNAi)+ strain produced from expressed ORFs with and without an annotated overlapping antisense ncRNA, based on read counts from published high-throughput sequencing data (Drinnenberg et al., 2011). Minimum antisense overlap with ORF was set at 250 bp; only ORFs with >100 reads in the wild-type poly(A)+ library were assessed to remove noise. Stated p value calculated by Student’s t test. (D) Abundance of mRNA in RNAi+ cells relative to wild-type; data source and categories as in C, differences were not significant.

https://doi.org/10.7554/eLife.01581.003

Various studies have looked for endogenous sense-antisense RNA pairs that instigate RNAi responses. Efficient generation of siRNAs from endogenous sense-antisense systems (endo-siRNA) has been observed in plants under stress (Borsani et al., 2005; Katiyar-Agarwal et al., 2006), and mammalian oocytes generate endo-siRNAs that can mediate mRNA knockdown (Tam et al., 2008; Watanabe et al., 2008). However, although endo-siRNAs have been detected outside the germline in mammals, they are surprisingly under-represented where sense and antisense RNAs are co-expressed (Faghihi and Wahlestedt, 2006; Okamura et al., 2008; Carlile et al., 2009), and overall there is a positive correlation between antisense and sense RNA expression in mammalian genomes, which is inconsistent with RNAi (Derrien et al., 2012; Katayama et al., 2005). This raises questions about whether endogenous sense-antisense systems do in fact form dsRNA in vivo and, if so, whether all dsRNA is equivalently accessible to Dicer.

The tight integration of the RNAi system into the physiology of most eukaryotic cells makes it very difficult to disentangle direct and indirect effects of mutating RNAi components (reviewed in Ketting, 2011). To elucidate factors important for the induction of RNAi by endogenous sense-antisense systems, we therefore used a recently described synthetic system in which RNAi is reconstituted in Saccharomyces cerevisiae by the introduction of Argonaute and Dicer from the related yeast S. castellii (Drinnenberg et al., 2009). S. cerevisiae is highly unusual in lacking an endogenous RNAi system, allowing maintenance of the symbiotic dsRNA Killer virus (Drinnenberg et al., 2011). The reconstituted system is functional, since RNAi+ S. cerevisiae efficiently degrades exogenous hairpin RNAs and endogenous Ty retrotransposon transcripts; however, no clear mRNA expression changes are detectable in these cells (Drinnenberg et al., 2009, 2011).

Results

Antisense ncRNAs exist for many S. cerevisiae genes; combining published datasets, we found that 15–30% of yeast open reading frames (ORFs) have an annotated antisense ncRNA, depending on the minimum size of overlap considered (Xu et al., 2009; van Dijk et al., 2011), not including overlapping convergent ORFs (Figure 1B and Table 1). In RNAi+ cells, these ORFs produce more siRNA than ORFs lacking an antisense (Figure 1C), showing that their sense and antisense transcripts form dsRNA that is targeted by the RNAi machinery. However, consistent with published data, these ORFs do not show reduced mRNA levels in RNAi+ cells, suggesting that insufficient siRNAs are produced to elicit a detectable mRNA knockdown (Derrien et al., 2012; Katayama et al., 2005; Drinnenberg et al., 2011) (Figure 1D).

Table 1

Stability of antisense ncRNAs

https://doi.org/10.7554/eLife.01581.004

Overlap size (bp)	ORF–ORF	ORF–XUT	ORF–CUT	ORF–SUT	Total ORF–ncRNA	Total ORF–unstable ncRNA (% of total)
50	700	1066	575	522	1596	1448 (91%)
100	449	1008	543	475	1507	1367 (91%)
150	216	967	508	425	1423	1306 (92%)
200	136	931	478	403	1358	1249 (92%)
250	96	893	434	380	1287	1181 (92%)
300	69	812	391	363	1189	1085 (91%)
350	62	759	335	351	1086	990 (91%)
400	54	694	292	334	992	899 (91%)
450	51	637	244	322	904	811 (90%)
500	48	591	204	302	828	741 (89%)

Number of ORFs overlapping with ORFs and various classes of ncRNAs, with various minimum size cut-offs for the overlapping region.
Totals are given for ORFs overlapping with ncRNAs and with unstable ncRNAs, including a percentage of overlapping ncRNAs that are unstable.
ncRNA: non-protein coding RNA; ORF: open reading frame; XUT: Xrn1-sensitive unstable transcript (degraded in cytoplasm); CUT: cryptic unstable transcript (degraded by nuclear exosome); SUT: stable unannotated transcript (not known to be degraded).
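
The counts in Table 1 come from comparing ORF coordinates with the coordinates of transcripts on the opposite strand at a series of minimum overlap sizes. A minimal sketch of that kind of calculation is given below; the `Feature` layout, the function names and the inclusive coordinate convention are assumptions for illustration and are not the pipeline actually used to build the table.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    chrom: str
    start: int   # inclusive coordinate (assumed convention)
    end: int     # inclusive coordinate
    strand: str  # "+" or "-"


def antisense_overlap(orf: Feature, rna: Feature) -> int:
    """Overlap in bp between an ORF and a transcript on the opposite strand."""
    if orf.chrom != rna.chrom or orf.strand == rna.strand:
        return 0
    return max(0, min(orf.end, rna.end) - max(orf.start, rna.start) + 1)


def count_orfs_with_antisense(orfs, rnas, min_overlap):
    """Number of ORFs with at least one antisense overlap of >= min_overlap bp."""
    return sum(
        any(antisense_overlap(orf, rna) >= min_overlap for rna in rnas)
        for orf in orfs
    )


def percent(part, total):
    """Percentage rounded to the nearest integer, as in the last table column."""
    return round(100 * part / total)
```

Applied to the first and last rows of the table, `percent(1448, 1596)` and `percent(741, 828)` give 91 and 89, matching the stated proportions of overlapping ncRNAs that are unstable.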

We first asked whether any endo-siRNA pairs are degraded by RNAi in this reconstituted system. RNAi+ cells produce abundant siRNAs from sub-telomeric Y′ elements and from the ribosomal DNA (rDNA) intergenic spacers (Drinnenberg et al., 2011 and Figure 2—figure supplement 1) and, despite transcriptional repression by the histone deacetylase Sir2, both regions transcribe sense and antisense ncRNAs that could hybridize to form dsRNA (Aparicio et al., 1991; Yamada et al., 1998; Kobayashi and Ganley, 2005; Houseley et al., 2007) (Figure 2A,D). Northern blots revealed full-length ncRNAs from both strands of the Y′ elements in wild-type cells (Figure 2B lanes 1,5 marked with arrows); these were largely absent in the RNAi+ strain, being replaced by heterogeneous degradation products and readily detectable siRNAs (Figure 2B,C). Despite weak transcriptional repression in this genetic background, ncRNAs and siRNAs were more abundant in sir2Δ mutants reinforcing the precursor–product relationship (Figure 2B,C). Equivalent results were seen for the rDNA intergenic spacer region (Figure 2D–F). These data show that endo-siRNA pairs can form RNAi substrates and undergo efficient degradation by a minimal RNAi system.

Figure 2 with 1 supplement

High-copy endogenous sense-antisense pairs instigate efficient RNA interference (RNAi).

(A) Schematic diagram of sub-telomeric Y′ elements. (B) Northern analysis of Y′ element non-protein coding RNAs (ncRNAs) comparing wild-type and RNAi+ strains in *SIR2* and *sir2*Δ backgrounds. 18S ribosomal RNA is shown as a loading control. Arrows indicate full-length RNA species. (C) Northern analysis of Y′ element-derived short interfering RNAs (siRNAs) from cells in B, tRNAs are shown as a loading control. (D–F) Equivalent analysis of rDNA intergenic spacer ncRNAs.

https://doi.org/10.7554/eLife.01581.005

One distinguishing feature of these regions is high copy number; to determine whether copy number amplification can drive RNAi, we examined MAL32 which has a clearly defined antisense RNA that is co-expressed with the sense mRNA (Figure 3—figure supplement 1). MAL32 is effectively present at two copies in the haploid genome as the paralogous gene MAL12 shares 99.5% nucleotide identity, reducing potential transcriptional repression. As for approximately 90% of yeast antisense RNAs (Table 1), MAL32 antisense RNA is highly unstable and is only detectable in strains lacking the nuclear exosome co-factor Trf4 (reviewed in Houseley and Tollervey, 2009) (Figure 3—figure supplement 1), but endogenous MAL32 mRNA was not down-regulated in the RNAi+ strain even in trf4Δ cells (Figure 3B and Figure 3—figure supplement 2A). When expressed from a high-copy plasmid, however, MAL32 mRNA was significantly down-regulated by RNAi (Figure 3C) with concurrent production of siRNA (Figure 3D). The MAL32 antisense RNA was only detected in these experiments as a smear of degradation products and was not noticeably altered by RNAi, probably because nuclear degradation acts faster than RNAi on this substrate. To confirm that the knockdown of MAL32 mRNA was not an indirect effect of the strain background or an undirected Argonaute cleavage, we reconstituted the RNAi system in the BY4741 background using separate plasmids expressing Dicer and Argonaute. dsRNA from the MAL32 locus was detectable in these cells and was removed by Dicer; however, a significant knockdown of the mRNA was only observed in cells expressing both Dicer and Argonaute, confirming that the knockdown represents a genuine RNAi response (Figure 3—figure supplement 3).

Figure 3 with 6 supplements

Copy number amplification of coding genes can instigate RNA interference (RNAi).

(A) Schematic of *MAL32* locus. (B) *MAL32* mRNA and antisense non-protein coding RNA (ncRNA) comparing wild-type and RNAi+ strains in wild-type and *trf4*Δ backgrounds; cells grown on YP raffinose (extended image and quantification shown in Figure 3—figure supplement 2A). (C) mRNA and antisense ncRNA from *MAL32* cloned onto a high-copy plasmid in wild-type and RNAi+ strains. Lanes 3,4 show empty vector control. Antisense panel shows degradation products, no full-length antisense is detectable due to Trf4 activity. (D) Short interfering RNA (siRNA) analysis of cells from C. (E) Schematic of *GAL4* locus. (F) *GAL4* mRNA and antisense ncRNA in wild-type and RNAi+ strains; cells grown in YP galactose (extended image and quantification shown in Figure 3—figure supplement 2B). (G) mRNA and antisense ncRNA from *GAL4* locus cloned onto a high-copy plasmid in wild-type and RNAi+ strains. Lanes 5,6 show empty vector, signal is from genomic *GAL4*; note that cells used here are diploids to mitigate defects in galactose response (see ‘Materials and methods’). Lanes 3,4 show a previously described *GAL4* antisense mutant (Geisler et al., 2012); this removes detectable antisense RNA for genomic *GAL4*, but the mutant sequence still expresses an antisense ncRNA when cloned on the high-copy plasmid (see Figure 3—figure supplement 4). (H) siRNA analysis of cells in G. For quantification, n = 4 biological replicates, error bars represent ± 1se, *p<0.05, ***p<0.01 by Student’s t test, y axes in arbitrary units.

https://doi.org/10.7554/eLife.01581.007

We then tested GAL4, which is a single-copy gene with a co-expressed antisense that is degraded in the cytoplasm by Xrn1 (Geisler et al., 2012) (Figure 3E). Cells lacking Xrn1 show increased levels of antisense and, unexpectedly, sense RNA, but, as for MAL32, we did not detect a significant decrease in full-length RNA in RNAi+ xrn1Δ cells (Figure 3F and Figure 3—figure supplement 2B). However, amplification of the locus by cloning on a high-copy plasmid leads to significant degradation of the sense RNA along with the antisense RNA by RNAi (Figure 3G,H). Surprisingly, this occurred even in a known mutant that lacks antisense expression (Geisler et al., 2012), but 5′ RACE (Rapid Amplification of 5′ Complementary DNA Ends) experiments revealed that antisense RNA is still produced by this mutant when expressed from a high-copy plasmid, even if it is too heterogeneous for detection by northern blot (Figure 3—figure supplement 4). Taken together, these experiments on MAL32 and GAL4 demonstrate that increasing gene copy number can make the products of a normal gene susceptible to RNAi.

One potential confounding factor in these experiments is the copy number of the high-copy plasmids; if the copy number drops in RNAi+ cells, this would provide a trivial explanation for the observed knockdowns. However, Southern blotting revealed that RNAi+ cells contained approximately twofold more plasmid than the controls, which would tend to decrease rather than increase the apparent effect of RNAi (Figure 3—figure supplement 5A,B). It is likely that RNAi degrades the mRNA for the plasmid-encoded selectable marker and 2µ maintenance genes (2µ genes produce copious siRNA, Figure 3—figure supplement 5C), and the plasmid copy number rises to compensate for this. We also wanted to use a different method to confirm the northern blot results. We therefore lysed wild-type and RNAi+ cells containing the MAL32 and GAL4 plasmids, precipitated dsRNA using a specific monoclonal antibody (Schonborn et al., 1991; Gullerova and Proudfoot, 2012), and assayed total and dsRNA levels by quantitative RT-PCR. MAL32 mRNA was knocked down approximately 75%, as observed by northern blot analysis, while dsRNA was reduced 11-fold, consistent with specific removal of dsRNA by Dicer. GAL4 mRNA knockdown was measured at 80% in this assay, again with an 11-fold reduction in dsRNA in the RNAi+ strain (Figure 3—figure supplement 6).
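
The knockdown percentages and fold reductions quoted above are two ways of expressing the same ratio of RNAi+ signal to wild-type signal. The helper names below are illustrative assumptions; the sketch simply converts between the two conventions so the mRNA and dsRNA figures can be compared on one scale.

```python
def knockdown_to_fold(knockdown_percent: float) -> float:
    """Convert a percentage knockdown into a fold reduction (75% -> 4-fold)."""
    remaining = 1.0 - knockdown_percent / 100.0
    if remaining <= 0:
        raise ValueError("knockdown must be below 100%")
    return 1.0 / remaining


def fold_to_knockdown(fold_reduction: float) -> float:
    """Convert a fold reduction into a percentage knockdown (11-fold -> ~90.9%)."""
    if fold_reduction < 1:
        raise ValueError("fold reduction must be >= 1")
    return 100.0 * (1.0 - 1.0 / fold_reduction)
```

On these definitions, the approximately 75% and 80% mRNA knockdowns correspond to roughly 4-fold and 5-fold reductions, whereas an 11-fold reduction in dsRNA corresponds to removal of about 91% of the dsRNA signal.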

Increasing gene copy number also increases RNA production. To separate the contributions of RNA abundance and copy number, we analyzed existing genome-wide data (Hobson et al., 2012; Drinnenberg et al., 2011). If siRNA formation depends only on precursor RNA abundance, a positive correlation should be observed between total RNA abundance and siRNA abundance, and there should be no difference between distributions of single-copy loci and multi-copy loci. We observed little evidence for such a positive correlation (Figure 4—figure supplement 1); however, plots of siRNA versus total RNA abundance for single-copy and multi-copy loci showed strikingly different distributions, with multi-copy loci clearly biased towards higher siRNA production (Figure 4A). To quantify this difference, loci were segregated into eight groups of increasing total RNA abundance and siRNA abundance was assessed for single-copy and multi-copy loci in each group (Figure 4B). siRNA production was significantly higher from multi-copy loci than from single-copy loci in all except the lowest category of RNA abundance. This result was robust to changes in the threshold between low and high copy, and was still observed in a comparison of low to medium copy number, showing that high-copy Ty1 retrotransposons were not distorting the analysis (Figure 4—figure supplement 2). A normalization step is required in these analyses to deal with mapping of sequence reads to multi-copy loci (discussed in detail in ‘Materials and methods’); however, the same differences were observed with no normalization or a different normalization scheme (Figure 4—figure supplement 3). These surprising results show that multi-copy loci produce more siRNA than single-copy loci with equivalent RNA abundance. If this observation is real, the siRNA:total RNA ratio should be predictive of copy number, an important test since this comparison requires no copy number normalization. 
As predicted, the 1% of the genome with the highest siRNA:total RNA ratio is massively enriched for multi-copy loci (Figure 4C), and when this ratio was plotted across a chromosome, an obvious correlation was observed between regions of high-copy number and regions with high siRNA:total RNA abundance (Figure 4D). These analyses clearly demonstrate that selectivity towards the products of multi-copy loci is an emergent property of a minimal RNAi system.
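
The logic of this genome-wide comparison can be sketched in a few lines. The following is a minimal illustration under stated assumptions: the `Locus` fields, the log2 binning scheme and the bin width are hypothetical, and the Wilcoxon Rank Sum test used in the study is replaced here by a simple comparison of medians so that the example has no external dependencies. Only the copy number threshold (<2 for single-copy, ≥2 for multi-copy) is taken from the analysis described here.

```python
import math
from collections import defaultdict
from statistics import median
from typing import NamedTuple


class Locus(NamedTuple):
    name: str
    copy_number: float
    total_rna: float  # normalized total RNA read count (assumed > 0)
    sirna: float      # normalized siRNA read count


def bin_by_abundance(loci, bin_width=2.0):
    """Group siRNA values by log2(total RNA) bin and by copy-number class."""
    bins = defaultdict(lambda: {"single": [], "multi": []})
    for locus in loci:
        index = int(math.log2(locus.total_rna) // bin_width)
        key = "multi" if locus.copy_number >= 2 else "single"
        bins[index][key].append(locus.sirna)
    return bins


def median_sirna_per_bin(bins):
    """Median siRNA (single, multi) for each bin that contains both classes."""
    return {
        index: (median(groups["single"]), median(groups["multi"]))
        for index, groups in sorted(bins.items())
        if groups["single"] and groups["multi"]
    }


def top_ratio_loci(loci, fraction=0.01):
    """Loci with the highest siRNA:total RNA ratio (top fraction, at least one)."""
    ranked = sorted(loci, key=lambda l: l.sirna / l.total_rna, reverse=True)
    n_top = max(1, int(len(ranked) * fraction))
    return ranked[:n_top]
```

Because the ratio in `top_ratio_loci` is computed per locus without reference to copy number, checking the copy number of the returned loci gives an independent test of the binned comparison, which is the reasoning applied to Figure 4C.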

Figure 4 with 3 supplements

Multi-copy loci are preferentially targeted by RNA interference (RNAi).

(A) Short interfering RNA (siRNA) (Drinnenberg et al., 2011) and total RNA (Hobson et al., 2012) abundance for loci with copy number <2 (left, single-copy) or ≥2 (right, multi-copy). (B) Quantification of data from A binned into categories of increasing total RNA level, with p values for pairwise comparisons of siRNA abundance in single-copy and multi-copy datasets using the Wilcoxon Rank Sum test. (C) Copy number distribution of the 1% of loci with the highest siRNA:total RNA ratio compared with other loci; difference is significant by Wilcoxon Rank Sum test, p<2.2 × 10⁻¹⁶, loci scoring below noise threshold (0–2 category in B) were removed. n values for tests in B and C are given in Table 2. (D) Comparison of copy number with siRNA:total RNA ratio across chromosome I.

https://doi.org/10.7554/eLife.01581.014

We then directly tested the effect of copy number at the MAL32 locus. We constructed strains in which MAL32 sense and antisense RNAs were expressed at similar levels from multi-copy or single-copy loci by over-expressing single-copy sense and antisense (Figure 5A). In this system, both sense and antisense RNAs were produced at higher levels from the single-copy system (Figure 5B compare lanes 1 and 3) but more siRNAs were produced from the multi-copy system (Figure 5C compare lanes 2 and 4). The over-expression of both RNAs from the single-copy MAL32 locus led to the production of easily detectable siRNA, as would be expected; however, this result directly demonstrates that gene copy number influences the formation of siRNA above and beyond the effect of total RNA abundance. The increased siRNA production in these cells is most likely due to enhanced dsRNA formation in the multi-copy system. To confirm this, we quantified MAL32 RNA in wild-type cells that is resistant to the single-strand specific nuclease RNase A, and observed significantly more RNase-resistant material in cells expressing MAL32 from the multi-copy system than the single-copy system (Figure 5D). This experiment shows that a multi-copy locus produces more dsRNA than an equivalently expressed single-copy locus in wild-type cells without the RNAi system, explaining the increased siRNA formation in RNAi+ cells.

Figure 5

Single gene analysis of copy number effect on RNA interference (RNAi).

(A) Schematic of single-copy and multi-copy *MAL32* system. (B) Northern analysis of *MAL32* RNA from single-copy and multi-copy systems. All visible species are included in antisense quantification. (C) *MAL32* short interfering RNA (siRNA) abundance from cells in B. (D) RNase A sensitivity of *MAL32* in wild-type cells expressing multi-copy and single-copy *MAL32*. Cells were lysed on ice, treated with RNase A as indicated and analyzed by northern blot. 25S and *L-A* (a double stranded RNA) are shown as controls for loading and RNase specificity. n = 3 biological replicates, error bars ±1se, *p<0.05, ***p<0.01 by Student’s t test, y axes in arbitrary units.

https://doi.org/10.7554/eLife.01581.018

The influence of copy number suggested that dsRNA formation and potentially siRNA production may occur in the nucleus. We initially tested this by immunofluorescence against Dicer and dsRNA in mixed populations of wild-type and RNAi+ cells (Figure 6—figure supplement 1). Dicer was present in cytoplasmic foci, but also showed a diffuse cytoplasmic staining (compare the indicated wild-type and RNAi+ cells) and, under wide-field imaging, appeared to be present in the nucleus. However, super-resolution images of the same cells showed nuclear exclusion of Dicer; therefore, if Dicer is present in the nucleus, it is at low levels. dsRNA staining in these cells revealed many cytoplasmic foci, presumably Killer virus dsRNAs that are known to be incompletely cleared by RNAi (Drinnenberg et al., 2011), but did not show unambiguous nuclear staining.

As an alternative, we asked whether the spatial configuration of gene copies within the nucleus could affect siRNA formation; such an effect would provide strong evidence for the formation of dsRNA in the nucleus. In systems which undergo efficient RNAi such as the rDNA and 2µ plasmids, all gene copies are clustered together in a small sub-nuclear volume. To test the importance of this clustering, we used the sense-antisense system at the TRP1 locus which produces detectable siRNA even when present in the genome in only two tandem copies (data not shown). We generated strains with three tandem copies of TRP1 on a single plasmid (Clustered, Cls) or three unlinked copies (Dispersed, Dsp) (Figure 6A), and expressed Dicer without Argonaute to allow siRNA formation but minimize the effect of RNAi on total RNA levels. Quantification of total RNA showed that both systems produced similar amounts of sense and antisense RNA molecules (Figure 6B), although this experiment was complicated by read-through transcripts of antisense TRP1 from the clustered system (Figure 6B lanes 1,2), a behavior that was not prevented by insertion of transcriptional stop cassettes between the repeats. Nonetheless, the clustered system produced fourfold more TRP1 siRNA than the dispersed system (Figure 6C), showing that close nuclear juxtaposition of transcriptional loci enhances dsRNA formation.

Figure 6 with 1 supplement

Clustered loci show higher efficiency of short interfering RNA (siRNA) formation.

(A) Comparison of the systems used. Three copies of *TRP1* were placed either in tandem on a single-copy plasmid (Clustered, Cls) or a single copy was left in the genome at the *TRP1* locus and two further copies were placed on different single-copy plasmids (Dispersed, Dsp). Dicer was expressed from a single-copy plasmid. (B) Sense and antisense RNA expression in clustered and dispersed systems with and without Dicer. Quantification shows that Dicer alone has little effect on total RNA levels. Read-through species visible in lanes 1,2 are included in the quantification; values have been normalized for the different number of probe binding sites in the read-through RNAs. In the absence of this normalization (i.e., counting the number of binding sites rather than the number of molecules), the clustered antisense is approximately twofold more abundant than the dispersed antisense, which is insufficient to explain the difference in siRNA formation. (C) siRNA produced from *TRP1* in clustered and dispersed systems. n = 3 biological replicates, error bars ±1se, ***p<0.01 by Student’s t test, y axes in arbitrary units.

https://doi.org/10.7554/eLife.01581.019

While testing the effects of copy number amplification on siRNA production, we noticed that even low abundance sense-antisense ncRNA pairs (selected from a published dataset, Xu et al., 2009) underwent efficient RNAi when amplified to high copy number. For both the SUT176 and SUT430 systems (Figure 7A), the sense and antisense RNAs are barely detectable by northern blot and are clearly not targeted for degradation by RNAi (Figure 7B). However, after cloning on high-copy plasmids, the full-length RNAs became highly susceptible to RNAi and produced copious siRNAs (Figure 7C,D). This raised the interesting prospect that low abundance pervasive transcription would be sufficient to trigger efficient RNAi responses from sequences that undergo copy number amplification.

Figure 7

RNA interference (RNAi) against transcripts from amplified low-expression systems.

(A) Schematic diagrams of SUT176 and SUT430 loci. (B) Northern analysis of SUT176 and SUT430 transcripts from single-copy genomic loci in wild-type and RNAi+ cells. Ty1 RNA is a positive control for RNAi, *ACT1* is a loading control. (C) Analysis of SUT176 and SUT430 non-protein coding RNAs (ncRNAs) expressed from high-copy plasmids in wild-type and RNAi+ cells. Amplified regions are indicated in A. (D) Short interfering RNA analysis of cells in C.

https://doi.org/10.7554/eLife.01581.021

Clear examples of pervasive transcription are not well defined in any organism because, by definition, the products of pervasive transcription are almost undetectable. We therefore chose to examine the GAL gene cluster (Figure 8A), which is tightly transcriptionally repressed in cells grown in glucose. Under these conditions, antisense ncRNAs are produced from the GAL10 ORF with a known steady-state abundance of one RNA molecule per 14 cells (Houseley et al., 2008; Pinskaya et al., 2009) (arrows in Figure 8B lane 1). Transcription of these ncRNAs is abrogated in a previously described Reb1 binding site mutant (RBSΔ), leaving almost no detectable RNAs from this locus (Figure 8B lane 3). For reasons that remain unclear, the GAL cluster is slightly de-repressed in the RNAi+ strain (Figure 8B compare lanes 1,2); nonetheless, the RBSΔ RNAi+ strain (Figure 8B lane 4) only produces very low level heterogeneous transcripts from GAL10, suggesting that it forms a good model of pervasive transcription. Cloning either wild-type or RBSΔ GAL clusters onto high-copy plasmids substantially increased the levels of detectable ncRNA as expected (Figure 8B lanes 5,7), and these ncRNAs were processed into easily detectable siRNAs (Figure 8C lanes 6,8). Therefore, ncRNAs produced at the level of pervasive transcription are sufficient to mediate extensive siRNA production when the copy number of the transcribing locus is increased.

Figure 8 with 1 supplement

RNA interference (RNAi) against pervasive transcripts from the repressed GAL cluster.

(A) Schematic representation of the GAL cluster. (B) Non-strand-specific northern blot of non-protein coding RNAs (ncRNAs) produced from the GAL locus present at single-copy (lanes 1–4) or high-copy (lanes 5–8), showing wild-type and RBSΔ mutant. Arrow indicates *GAL10* antisense RNA. Strand-specific northern blots for the same RNA are shown in Figure 8—figure supplement 1. (C) *GAL10* short interfering RNA (siRNA) from the same cells as in B. (D) Expression of *GAL10* mRNA from a single-copy genomic locus under the control of a Cu²⁺-responsive promoter in wild-type and RNAi+ strains carrying an empty vector (lanes 1,2), high-copy wild-type GAL cluster (lanes 3,4), or high-copy RBSΔ GAL cluster (lanes 5,6). n = 3 biological replicates, error bars ±1se, ***p<0.01 by Student’s t test, y axes in arbitrary units.

https://doi.org/10.7554/eLife.01581.022

The siRNAs produced from the high-copy GAL10 locus are clearly sufficient to degrade the GAL10 ncRNAs in the RNAi+ background (Figure 8B compare lanes 5,7 with lanes 6,8); however, a classical RNAi response should be able to degrade RNA expressed from a separate locus. To test this we introduced the high-copy GAL cluster plasmids into a strain in which the single-copy genomic GAL10 ORF is expressed at high levels from a Cu²⁺-dependent promoter, allowing expression of the GAL10 mRNA from the single-copy locus while the GAL clusters present on the high-copy plasmids remain fully repressed. As observed for the GAL10 ncRNAs, the GAL10 mRNA was expressed at higher levels in the RNAi+ strain than in the wild-type (Figure 8D compare lanes 1,2) but, nonetheless, both wild-type and RBSΔ high-copy GAL cluster plasmids caused highly significant >50% knockdowns of the GAL10 mRNA compared with the empty vector control (Figure 8D lanes 2,4,6). This was not an indirect effect of the high-copy GAL clusters alone as, in the wild-type background, GAL10 mRNA levels were slightly increased by the presence of the GAL plasmids (Figure 8D lanes 1,3,5). These data demonstrate that pervasive transcription of a high-copy locus is sufficient to instigate an effective RNAi response that can mediate the degradation of a target mRNA in trans.

Discussion

The ability of the RNAi system to selectively target the products of high-copy sequences such as transposons provides a remarkably efficient genome defense, as well as an effective way to differentiate heterochromatic regions, which are often repetitive, from gene-rich euchromatin. Here we have demonstrated that RNAi has an innate preference for the products of high-copy sequences, probably because RNA from high-copy sequences forms dsRNA more efficiently. It has long been known that cells can recognize and silence high-copy DNA, which would form a basic defense against uncontrolled amplification of transposable elements (reviewed in Hsieh and Fire, 2000). This of course requires a mechanism to count copy number, or at least differentiate high- and low-copy regions, which has remained mysterious. Our data show that RNAi provides such a mechanism by selectively targeting the products of high-copy loci.

The production of siRNA from high-copy DNA, which would be the basis of such a counting mechanism, absolutely requires that all DNA is transcribed; if this does not occur, transposable elements that remain transcriptionally silent would be invisible to the system. Pervasive transcription, the general background of very low level RNA produced across the genome, ensures that the vast majority of the genome is transcribed, and therefore that no region remains completely silent (Cheng et al., 2005; Birney et al., 2007; Kapranov et al., 2007; Goodman et al., 2012). The extent to which mammalian genomes are pervasively transcribed has been controversial; however, many of the questions revolve around whether the pervasive transcripts represent defined functional products or whether much of the detected RNA represents random transcriptional noise (van Bakel et al., 2010; Clark et al., 2011). For a general surveillance function, it does not really matter whether pervasive transcription is formed of many discrete transcripts or occurs at random since either process should be sufficient to generate dsRNA. If a large proportion of pervasive transcription does represent random noise, this would be actively advantageous; random transcription would be sequence independent, and therefore transposable elements could not become fully silent by mutating individual promoter sequences. We suggest that the primary function of pervasive transcription lies in ensuring the whole genome is transcribed to allow identification and suppression of transposable elements; this does not conflict with the idea that some proportion of these transcripts may have additional functions.

We propose that hybridization kinetics explains the dependence of RNAi on copy number (shown in Figure 9). The rate of formation of dsRNA from single stranded sense and antisense RNA is proportional to the concentration of each strand of RNA, and so is inversely proportional to the square of the reaction volume. Technically, the reaction volume is the non-excluded volume of the cell; however, this assumes a uniform distribution of RNA throughout the cell. In reality the RNA is far from evenly distributed, so some small volumes may have very high concentrations of RNA and, within these volumes, the rate of dsRNA formation will be dramatically higher than in the bulk of the cell. Single-copy loci cannot simultaneously transcribe sense and antisense RNA (Hobson et al., 2012) so, although such a locus can give rise to a mixed population of sense and antisense RNA in the cytoplasm over time, in the vicinity of the transcription site only one sense of RNA should ever be present, assuming efficient RNA export. Annealing of sense and antisense RNA must therefore occur in the relatively large non-excluded cytoplasmic volume of the cell, which will be inefficient except for very highly expressed RNA. In contrast, at multi-copy loci the simultaneous production of sense and antisense RNA from many closely spaced sites can lead to high concentrations of sense and antisense RNA around the transcription site, causing efficient duplex formation and hence efficient RNAi.
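The volume dependence described above can be illustrated with a minimal sketch of second-order annealing kinetics. The rate constant `k`, the molecule numbers, and the volumes are arbitrary illustrative values rather than measurements, and the function names are hypothetical.

```python
def annealing_rate(n_sense, n_antisense, volume, k=1.0):
    """Second-order rate of duplex formation (concentration per unit time).

    rate = k * [sense] * [antisense], where [x] = n_x / volume.
    k is an arbitrary illustrative constant, not a measured value.
    """
    if volume <= 0:
        raise ValueError("volume must be positive")
    return k * (n_sense / volume) * (n_antisense / volume)


def fold_change(volume_large, volume_small):
    """Fold increase in rate when the same molecules occupy a smaller volume."""
    return (volume_large / volume_small) ** 2


# The same 10 sense and 10 antisense molecules in two hypothetical volumes.
bulk = annealing_rate(10, 10, volume=10.0)
local = annealing_rate(10, 10, volume=1.0)
print(local / bulk, fold_change(10.0, 1.0))
```

Under these assumptions, confining the same number of molecules to one tenth of the volume raises the rate of duplex formation 100-fold, which is the sense in which local concentration around clustered transcription sites could dominate over bulk abundance.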

Figure 9

Proposed mechanism for RNA interference (RNAi) on high-copy loci.

The rate of double stranded RNA (dsRNA) formation, the required first step in RNAi, is highly dependent on local RNA concentration. Single-copy loci cannot simultaneously transcribe sense and antisense RNA, allowing RNA export to occur before the strands meet and requiring hybridization to occur in the relatively large cytoplasmic volume. In contrast, sense and antisense RNA can be simultaneously transcribed from different parts of a multi-copy locus and, therefore, if the copies are closely juxtaposed in the genome or in 3D space, the local concentration of sense and antisense RNA around the transcription sites should be high, favoring dsRNA formation.

https://doi.org/10.7554/eLife.01581.024

This mechanism predicts that dsRNA formation should occur in the nucleus, but we were not able to detect Dicer in the nucleus by immunofluorescence. This reflects the situation in higher eukaryotes where Dicer is largely cytoplasmic, but recent experiments in Drosophila and mammalian cells have detected small quantities of nuclear Dicer, particularly associated with chromatin (Sinkkonen et al., 2010; Cernilogar et al., 2011; Gullerova et al., 2011; Doyle et al., 2013). Low but functional levels of Dicer may therefore be present in the nucleus of RNAi+ cells and able to generate siRNA. Alternatively, the dsRNA may be exported and processed in the cytoplasm. To our knowledge, there is no clear evidence for or against export of dsRNA by normal pathways; certainly these would not be too large or too structured to pass through nuclear pores compared, for example, with pre-ribosomes.

One notable prediction of this mechanism is that clustering of multi-copy transcription sites would be a particularly efficient way to increase the local density of sense and antisense RNA. All the systems we have described in this paper are clustered: rDNA repeats are arranged in tandem, telomeres are known to cluster together at various points in the cell cycle (Gotta et al., 1996), and high-copy 2µ plasmids exist in a discrete focus that is vital for copy number maintenance (Velmurugan et al., 2000; Wong et al., 2002). The comparison of siRNA formation from clustered and dispersed TRP1 loci provides experimental evidence for this effect since, for a given quantity of sense and antisense RNA, the clustered system produces more siRNA. Although the clustered system also produces read-through transcripts, these would not have a higher hybridization rate than the non-read-through RNAs as the hybridization rate depends on the frequency of collisions between molecules. Intriguingly, Tf2 retrotransposons in Schizosaccharomyces pombe are clustered by the action of centromere protein B (CENP-B), which also silences these elements through histone deacetylation (Cam et al., 2008). This clustering would allow cells to produce siRNA against the Tf2 elements through pervasive transcription, although multiple mechanisms silence Tf2 retrotransposons (Yamanaka et al., 2012), providing an extra defense against retrotransposon activation. Similarly, gypsy retrotransposons in Drosophila are known to cluster (Gerasimova et al., 2000), which may again facilitate siRNA production. Hence, the clustering of transposable elements by factors such as CENP-B would facilitate their recognition by RNAi and allow for selective RNA degradation.

Mammalian germline cells are replete with small RNAs including endogenous siRNA (Watanabe et al., 2006, 2008; Tam et al., 2008; Song et al., 2011), and siRNAs in sperm and oocytes show a pronounced bias towards high-copy sequences that would be effectively explained by the selectivity of the RNAi system towards the products of high-copy loci (Watanabe et al., 2006; Song et al., 2011). However, it remains unclear how some dsRNA precursors of siRNAs are generated, particularly for retrotransposons that are primarily expressed only on the sense strand. We suggest that pervasive transcription would provide sufficient antisense RNA for this role, just as we observed for high-copy GAL cluster sequences in yeast.

In comparison to the germline, the response of mammalian somatic cells to dsRNA is distinctly muted. dsRNA could be processed into siRNA, be altered by RNA editing (Hogg et al., 2011), or could activate the interferon response leading to apoptosis (Gantier and Williams, 2007). However, transgenic mice expressing a hairpin dsRNA construct produce minimal siRNAs, little edited RNA, and show no phenotype that might indicate cell death (Nejepinska et al., 2012a). Nonetheless, siRNAs produced from LINE-1 retrotransposons have been detected in cell culture (Yang and Kazazian, 2006), and high-copy transfected plasmids expressing a sense-antisense RNA pair do produce detectable siRNA in HEK293 cells (Nejepinska et al., 2012b). This shows that a basic siRNA response with an apparent bias towards high-copy sequences is functional in mammalian somatic cells.

Materials and methods

Yeast strains, plasmids and culture conditions

Yeast deletion strains (Supplementary file 1) were created by standard methods using the oligonucleotides in Supplementary file 1. Plasmids are described in Supplementary file 1 with construction details. Cells were grown on rich media (2% peptone, 1% yeast extract, 2% sugar) or synthetic media (0.69% yeast nitrogen base with ammonium sulfate, amino acids, 2% sugar) for plasmid assays. GAL10 mRNA was induced with 20 µM CuSO₄ in Figure 8D. Media components were purchased from Formedium. Cells were grown to mid-log (OD 0.4–0.6) at 30°C for most experiments or at 25°C for experiments involving trf4Δ mutants. The W303 background strain used here has defects in galactose induction in synthetic media, so strains in Figure 3G,H were diploids of W303xBY4741 that show a normal galactose response.

RNA extraction and northern analysis

RNA was extracted by the three procedures described below. High molecular weight RNA was prepared using the hot phenol method for all experiments except Figures 7B, 3B, 3F and Figure 3—figure supplements 1,2, where guanidine thiocyanate (GTC)-phenol preparations were used. 5–10 µg glyoxylated RNA was resolved on 1.2% gels as described (Sambrook and Russell, 2001), transferred to Hybond N+ membrane (GE) and hybridized with probes listed in Supplementary file 1 using either Church Hyb (Sambrook and Russell, 2001) or UltraHyb (Life Technologies). RNA probes were hybridized at 65°C and washed at 65°C using 0.1× SSC, 0.1% SDS; DNA probes in Church Hyb were hybridized at 65°C and washed at 65°C with 0.5× SSC, 0.1% SDS; DNA probes in UltraHyb were hybridized at 42°C and washed at 55°C using 0.2× SSC, 0.1% SDS. Small RNA enriched fractions were isolated using the mirVANA kit (Ambion). 4–10 µg small RNA was separated on 15% polyacrylamide/8 M urea gels containing 20 mM MOPS or 1× TBE, transferred in 20 mM MOPS or 0.5× TBE to Hybond N membrane (GE) and chemically cross-linked as described (Pall and Hamilton, 2008). We observed no difference in cross-linking efficiency between MOPS and TBE gels, but resolution of TBE gels was superior in our hands. siRNAs were detected using random primed probes (Supplementary file 1) hybridized in UltraHyb Oligo (Life Technologies) at 42°C and washed with 2× SSC, 0.5% SDS at 42°C. The U6 control oligonucleotide was labeled using T4 polynucleotide kinase and hybridized in Church Hyb under the same conditions.

Hot phenol RNA preparation

10 × 10⁷ cells in 15 ml tubes were re-suspended in 600 µl TES (10 mM Tris pH 7.5, 10 mM EDTA, 0.5% SDS) and 600 µl phenol pH 7. The mixture was incubated at 65°C for 20 min with 30 s vortexing every 5 min, before briefly chilling on ice. Samples were centrifuged for 5 min and the upper phase extracted. This phase (5–600 µl) was extracted twice with phenol:chloroform (5:1) and once with chloroform before precipitation with 50 µl 3 M sodium acetate (NaOAc) pH 5.2 and 1.1 ml ethanol. The pellet was washed with 70% ethanol and re-suspended in 30 µl water.

GTC-phenol RNA preparation

2 × 10⁷ cells were lysed by 5 min vortexing at 4°C with 50 µl glass beads and 40 µl GTC-phenol (2.1 M GTC, 26.5 mM Na citrate pH7, 5.3 mM EDTA, 76 mM β-mercaptoethanol, 1.06% N-lauryl sarcosine, 50% phenol pH7). 600 µl GTC-phenol was added, mixed, and samples were heated at 65°C for 10 min, then placed on ice for 10 min. 160 µl 100 mM NaOAc pH 5.2 and 300 µl chloroform:isoamyl alcohol (24:1) were added, samples were vortexed and centrifuged at top speed for 5 min. The upper phase was extracted, precipitated with 1 ml ethanol, washed with 70% ethanol and the pellet re-suspended in 6 µl water. 3 µl RNA was analyzed per lane.

Small RNA purification

Small RNAs were isolated using a mirVANA kit (Life Technologies) with minor modifications. 35 × 10⁷ cells were thoroughly re-suspended in 100 µl lysis/binding buffer, 200 µl glass beads were added, and the samples were vortexed for 5 min at 4°C. 500 µl lysis/binding buffer were added and the samples were mixed before proceeding with the isolation as per the manufacturer’s instructions, starting with addition of the miRNA homogenate additive. After isolation the samples were generally re-precipitated and re-suspended in 20 µl water.

RNase A treatment

20 × 10⁷ cells were harvested and split into two aliquots, then re-suspended in 600 µl 10 mM Tris pH 7.5, 10 mM EDTA on ice. Cells were lysed with glass beads (10 cycles of 30 s vortex, 60 s on ice), and 5 µg RNase A was added to one aliquot followed by 30 min incubation on ice. After centrifugation for 10 min at 4500×g, 600 µl lysate was extracted, SDS added to 0.5%, and RNA extracted by the hot phenol method as above. RNase A treated samples were re-suspended in 12 µl water.

DNA isolation and Southern blotting

Cells from 2 ml saturated culture were washed with 50 mM EDTA, then spheroplasted with 250 µl 0.34 U/ml lyticase (Sigma L4025) in 1.2 M sorbitol, 50 mM EDTA, 10 mM DTT. After centrifuging at 1000×g, the cells were gently resuspended in 400 µl of 0.3% SDS, 50 mM EDTA, 100 µg/ml RNase A and incubated at 37°C for 30 min. 4 µl of 20 mg/ml proteinase K was added and the samples were mixed by inversion and heated to 65°C for 30 min. 160 µl 5 M potassium acetate was added after cooling to room temperature, the samples were mixed by inversion and then chilled on ice for 30 min. After 10 min centrifuging at 10,000×g, the supernatant was poured into a new tube containing 500 µl phenol:chloroform pH 8 and the samples were mixed on a wheel for 15 min. The samples were centrifuged for 10 min at 10,000×g and the upper phase was extracted using cut tips and precipitated with 400 µl isopropanol. Pellets were washed with 70% ethanol, air-dried and left overnight at 4°C to dissolve in 30 µl TE. After gentle mixing, 10 µl of each sample was digested with 40 U of EcoRV, ethanol precipitated and separated on a 25 cm 1% TBE gel at 90 V overnight. The gel was washed in 0.25 N HCl for 15 min, 0.5 N NaOH for 45 min, and twice in 1.5 M NaCl 0.5 M Tris pH 7.5 for 20 min before transfer to HyBond N+ membrane in 6× SSC. The membrane was probed for URA3 and GAL7 in Church Hyb at 65°C and washed with 0.5× SSC, 0.1% SDS at 65°C.

Immunofluorescence

Cells were grown to OD 0.5 in YPD and the cultures mixed as required, using 4 ml per coverslip. The cells were fixed with 440 µl 37% formaldehyde (Merck, microscopy grade) for 15 min at room temperature, then centrifuged for 2 min at 4600 rpm. The cells were washed three times with 1 ml of buffer B (0.1 M sodium phosphate pH 7.5, 1.2 M sorbitol), then re-suspended in 100 µl buffer B containing 3 µl 17 U/µl lyticase (Sigma L2524) and 10 mM DTT for 15 min. The cells were centrifuged for 2 min at 1000×g, then washed with 1 ml buffer B. The cells were re-suspended in 40 µl buffer B, applied to a poly-L-lysine coated coverslip (Zeiss 18 × 18 × 0.170 ± 0.005 mm) and left for 20 min before washing twice with buffer B. Coverslips were treated with −20°C methanol for 6 min, then dipped in −20°C acetone for 10 s, followed by two washes with PBS. Coverslips were blocked for 30 min with 5% milk 0.3% Triton-X100 in PBS, washed with PBS, then incubated overnight at 4°C with primary antibodies in 50 µl 1% BSA 0.3% Triton-X100 in PBS. Coverslips were washed three times with PBS and incubated for 30 min at room temperature with secondary antibodies at 1:1000 in the same buffer as the primary antibodies. After washing three times with PBS, the samples were dehydrated with 70%, 90%, and 100% ethanol and mounted in Pro-long Gold with DAPI (Life Technologies). Antibodies were rabbit anti-GFP (Life Technologies A11122) at 1:500 and mouse anti-dsRNA J2 (ESC 10010200) at 1:1000. Images were acquired using a Nikon N-SIM microscope comprising a Nikon Ti-E microscope, Nikon 100× 1.49 NA lens, Nikon SIM illumination module, and Andor iXon 897 EM-CCD camera. SIM data were acquired in ‘3D-SIM’ mode using five phases and three rotations. DAPI, Alexa Fluor 488, and Alexa Fluor 594 dyes were excited using 405, 488, and 561 nm laser light, respectively. Super-resolution images were reconstructed using Nikon Elements software. Equivalent wide-field images were reconstructed in FIJI (ImageJ, NIH) by summing the phase shifts from one grid rotation.

Imaging and analysis

Gels and phosphor screens were imaged using FLA 3000 (Fuji) or FLA 7000 (GE) systems. Quantification was performed using AIDA (Fuji) or ImageQuant (GE), depending on the scanner used. Images were prepared for publication with ImageJ by smoothing and minimal contrast enhancement. Images from the FLA3000 had a Gamma Shift of 3 applied.

5′ RACE

5′ RACE was performed with an ExactSTART Eukaryotic mRNA 5′- and 3′-RACE Kit (Epicentre) as per manufacturer’s instructions, except that reverse transcription was primed from random hexamers.

RNA immunoprecipitation

50 ml of cells at 0.7 × 10⁷ cells/ml were harvested, washed, and frozen on nitrogen. Cells were thawed, washed twice in 1 ml lysis buffer (50 mM HEPES pH7.5, 50 mM KCl, 5 mM MgCl₂, 1 mM DTT, 1× complete protease inhibitors (Roche)) and transferred to 2 ml tubes, then re-suspended in 300 µl lysis buffer. 300 µl zirconium beads were added and cells were lysed by vortexing 10 × 30 s with 30 s on ice between cycles. The lysate was clarified by centrifuging twice for 5 min at 14,000 rpm, a 12 µl aliquot was removed for total RNA and the remaining lysate was split in half and 2.5 µl mouse anti-dsRNA J2 (ESC 10010200) added to one aliquot. Antibody was bound for 2 hr at 4°C, then 20 µl GammaBind beads (GE) pre-blocked overnight with 1% BSA were added and incubation continued on a wheel for 2 hr at 4°C. The beads were washed 5× for 10 min with 1 ml wash buffer (10 mM Tris pH 7.5, 120 mM NaCl, 5 mM MgCl₂, 0.1% NP-40, 1 mM DTT). RNA was extracted from the beads and total lysate samples using Tri-reagent (Sigma) according to the manufacturer’s instructions. 1 µg total lysate and whole immunoprecipitation samples were treated with RQ1 DNase (Promega), purified by phenol:chloroform and ethanol precipitation, then reverse transcribed from random hexamers using Superscript III (Life Technologies). Quantitative PCR was performed using Maxima SYBR qPCR mix (Fermentas).

Bioinformatics

Antisense analysis

Locations of XUTs (van Dijk et al., 2011) were merged with CUT and SUT locations (Xu et al., 2009), along with expression validated ORFs (Xu et al., 2009), and overlap between ORFs and other features calculated using an R script.
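The overlap calculation was performed with an R script that is not reproduced here. As a rough illustration only, the following Python sketch shows one way such an overlap could be computed. The `Feature` type, the function names, the opposite-strand criterion, and the toy coordinates are all assumptions made for this example and are not taken from the original script.

```python
from typing import NamedTuple, Sequence


class Feature(NamedTuple):
    chrom: str
    start: int   # 1-based, inclusive
    end: int     # inclusive
    strand: str  # "+" or "-"


def overlaps_antisense(orf: Feature, other: Feature) -> bool:
    """True if `other` is on the opposite strand and shares at least 1 bp."""
    return (
        orf.chrom == other.chrom
        and orf.strand != other.strand
        and other.start <= orf.end
        and orf.start <= other.end
    )


def fraction_with_antisense(orfs: Sequence[Feature],
                            ncrnas: Sequence[Feature]) -> float:
    """Fraction of ORFs overlapped by at least one antisense feature."""
    if not orfs:
        return 0.0
    hits = sum(any(overlaps_antisense(o, n) for n in ncrnas) for o in orfs)
    return hits / len(orfs)


# Toy coordinates, invented purely for illustration.
orfs = [
    Feature("chrI", 100, 500, "+"),
    Feature("chrI", 800, 1200, "-"),
    Feature("chrII", 100, 500, "+"),
]
ncrnas = [
    Feature("chrI", 450, 700, "-"),   # antisense to the first ORF
    Feature("chrI", 900, 1000, "-"),  # same strand as the second ORF
]
print(fraction_with_antisense(orfs, ncrnas))
```

In this toy example only the first ORF has an overlapping feature on the opposite strand, so one of the three ORFs is counted.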

Table 2

n values for statistical tests

https://doi.org/10.7554/eLife.01581.025

Figure	Category	n	Category	n
4B	0–2 single-copy	19,892	0–2 multi-copy	1128
	2–4 single-copy	36,474	2–4 multi-copy	1111
	4–6 single-copy	34,921	4–6 multi-copy	1177
	6–8 single-copy	16,208	6–8 multi-copy	1356
	8–10 single-copy	4060	8–10 multi-copy	1424
	10–12 single-copy	922	10–12 multi-copy	1505
	12–14 single-copy	270	12–14 multi-copy	887
	14–16 single-copy	56	14–16 multi-copy	199

4C	Bulk 1–2 copies	85,495	Top 1% 1–2 copies	252
	Bulk 2–4 copies	1419	Top 1% 2–4 copies	95
	Bulk 4–8 copies	544	Top 1% 4–8 copies	98
	Bulk 8–16 copies	981	Top 1% 8–16 copies	238
	Bulk 16–32 copies	1978	Top 1% 16–32 copies	212
	Bulk >32 copies	695	Top 1% >32 copies	25

4—Supplement 1A	0–2 single-copy	19,892	0–2 multi-copy	836
	2–4 single-copy	36,474	2–4 multi-copy	906
	4–6 single-copy	34,921	4–6 multi-copy	724
	6–8 single-copy	16,208	6–8 multi-copy	387
	8–10 single-copy	4060	8–10 multi-copy	189
	10–12 single-copy	922	10–12 multi-copy	167
	12–14 single-copy	270	12–14 multi-copy	128
	14–16 single-copy	56	14–16 multi-copy	29

4—Supplement 1B	0–2 low-copy	20,728	0–2 high-copy	292
	2–4 low-copy	37,380	2–4 high-copy	205
	4–6 low-copy	35,645	4–6 high-copy	453
	6–8 low-copy	16,595	6–8 high-copy	969
	8–10 low-copy	4249	8–10 high-copy	1235
	10–12 low-copy	1089	10–12 high-copy	1338
	12–14 low-copy	398	12–14 high-copy	759
	14–16 low-copy	85	14–16 high-copy	170

4-Supplement 2A	0–2 single-copy	20,030	0–2 multi-copy	1999
	2–4 single-copy	36,494	2–4 multi-copy	2104
	4–6 single-copy	34,883	4–6 multi-copy	2098
	6–8 single-copy	16,178	6–8 multi-copy	1668
	8–10 single-copy	4078	8–10 multi-copy	644
	10–12 single-copy	873	10–12 multi-copy	204
	12–14 single-copy	215	12–14 multi-copy	66
	14–16 single-copy	52	14–16 multi-copy	10

4-Supplement 2B	0–2 single-copy	19,892	0–2 multi-copy	1128
	2–4 single-copy	36,474	2–4 multi-copy	1111
	4–6 single-copy	34,921	4–6 multi-copy	1177
	6–8 single-copy	16,208	6–8 multi-copy	1356
	8–10 single-copy	4060	8–10 multi-copy	1424
	10–12 single-copy	922	10–12 multi-copy	1505
	12–14 single-copy	270	12–14 multi-copy	887
	14–16 single-copy	56	14–16 multi-copy	199

siRNA analysis

Sequencing data for mRNA and siRNA fractions in RNAi strains (GEO accession GSE31300) (Drinnenberg et al., 2011) and W303 total RNA (GEO accession GSE38383) (Hobson et al., 2012) were quality and adapter trimmed using Trim Galore (v0.2.3; default options; http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/) and aligned to the yeast genome (build SGD1.01) using Bowtie (Langmead et al., 2009) (v0.12.7; default settings plus ‘--best’) allowing non-unique sequences to be assigned at random. For expressed gene analysis (Figure 1), reads overlapping each ORF were binned and only ORFs with >100 reads were used. For siRNA profiles (Figure 2—figure supplement 1), reads were binned over 50 bp intervals using SeqMonk (http://www.bioinformatics.babraham.ac.uk/projects/seqmonk/). For siRNA versus total RNA expression (Figure 4), read counts were quantified in consecutive 100 bp bins across the genome using SeqMonk, bins with >10,0000 total RNA reads were excluded as were bins derived from 2µ sequence which is single copy in the genome sequence but high copy in reality, and a pseudocount of one read was added to total and siRNA read counts for each bin. Total RNA levels were multiplied by copy number to correct for the division of reads amongst copies that occurs during mapping, or alternate normalization was applied in Figure 4—figure supplement 2 (see below for the reasoning underlying this copy number normalization methodology). The copy number for each bin was determined by splitting the complete genomic sequence into overlapping 20 bp segments at 1 bp intervals and re-mapping to the genome with reads allowed to align to all perfectly matching sequences, producing a measure of the number of genomic sequences matching each 100 bp bin. Little difference was seen if one mismatch was allowed (data not shown).
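The copy number estimate can be illustrated with a simplified sketch. Here exact k-mer counting on one strand of a toy sequence stands in for re-mapping 20 bp segments with an aligner, and each bin is summarized by the mean count of the k-mers starting within it. Both choices, along with the function name and the short toy parameters, are assumptions made for illustration; the summary statistic used in the actual analysis is not specified above.

```python
from collections import Counter


def kmer_copy_number_per_bin(genome: str, k: int = 20, bin_size: int = 100):
    """Mean number of exact genomic matches for the k-mers starting in each bin.

    Single-strand exact matching only; a stand-in for aligner-based re-mapping.
    """
    kmers = [genome[i:i + k] for i in range(len(genome) - k + 1)]
    counts = Counter(kmers)
    per_bin = {}
    for position, kmer in enumerate(kmers):
        per_bin.setdefault(position // bin_size, []).append(counts[kmer])
    return [sum(v) / len(v) for _, v in sorted(per_bin.items())]


# Toy sequences with short k and bins so the result can be checked by hand.
unique = "ACGTTGCAAGCT"
duplicated = "ACGTTGCA" * 2
print(kmer_copy_number_per_bin(unique, k=4, bin_size=4))
print(kmer_copy_number_per_bin(duplicated, k=4, bin_size=4))
```

In the duplicated toy sequence, bins lying wholly within a repeat unit score 2, whereas the bin containing k-mers that span the junction between units scores lower.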

Read count normalization for multi-copy sequences

Multi-copy loci are problematic for standard high-throughput sequencing mapping pipelines and are commonly discarded. Reads mapping to a non-unique genome sequence are usually assigned at random to one copy in the genome, therefore the total reads are divided evenly amongst the copies and the apparent abundance of RNA matching each copy is effectively divided by the copy number. In order to assess total RNA abundance (as in Figure 4A,B), we multiplied all total RNA read counts by the copy number to obtain the real total RNA abundance. However, we decided that the siRNA abundance should be analyzed per producing locus because each copy in the genome was analyzed separately for comparison with single-copy loci (i.e., we quantified how many siRNA reads an individual copy of a multi-copy locus produced, not how many the combined copies produced). We therefore did not multiply the siRNA read counts by the copy number. Such normalizations clearly have the potential to introduce systematic biases, and we therefore repeated the analysis in Figure 4 either with no copy number normalization or with both total and siRNA read counts multiplied by copy number (Figure 4—figure supplement 2). We found that although the distributions changed somewhat, the majority of total RNA abundance categories had higher siRNA levels for multi-copy loci irrespective of the copy number normalization applied. We note that the alternative copy number analysis in Figure 4C,D did not require any normalization; indeed, any copy number normalization would simply cancel out in the calculation of the siRNA:total RNA ratio, therefore any systematic bias that might be introduced by copy number normalization would have no effect.
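The normalization logic above can be sketched as follows. The function names and read counts are hypothetical, and the sketch assumes that the pseudocount is added before any copy number multiplication.

```python
def corrected_total(total_reads: int, copy_number: int) -> int:
    """Undo the division of reads amongst copies that random assignment causes."""
    return total_reads * copy_number


def sirna_total_ratio(sirna_reads: int, total_reads: int,
                      copy_number: int = 1, pseudocount: int = 1) -> float:
    """siRNA:total RNA ratio with the same copy number factor applied to both.

    Assumes the pseudocount is added before multiplication, so the
    factor appears in numerator and denominator and cancels.
    """
    sirna = (sirna_reads + pseudocount) * copy_number
    total = (total_reads + pseudocount) * copy_number
    return sirna / total


# Hypothetical bin from a 5-copy locus: 40 total RNA reads, 9 siRNA reads mapped.
print(corrected_total(40, 5))
print(sirna_total_ratio(9, 39, copy_number=1))
print(sirna_total_ratio(9, 39, copy_number=5))
```

The last two calls return the same value, illustrating why an analysis based on the siRNA:total RNA ratio is insensitive to the choice of copy number normalization.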

Data availability

The following previously published data sets were used

(2011) Compatibility with Killer explains the Rise of RNAi-deficient Fungi
ID GSE31300. Publicly available at GEO (http://www.ncbi.nlm.nih.gov/geo/).
http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE31300

(2012) RNA polymerase II collision interrupts convergent transcription (RNA-seq)
ID GSE38383. Publicly available at GEO (http://www.ncbi.nlm.nih.gov/geo/).
http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE38383

References

(1991) Modifiers of position effect are shared between telomeric and silent mating-type loci in S. cerevisiae
Cell 66:1279–1287.
https://doi.org/10.1016/0092-8674(91)90049-5
1. Babiarz JE
2. Ruby JG
3. Wang Y
4. Bartel DP
5. Blelloch R
(2008) Mouse ES cells express endogenous shRNAs, siRNAs, and other microprocessor-independent, Dicer-dependent small RNAs
Genes & Development 22:2773–2785.
https://doi.org/10.1101/gad.1705308
(2008) A cryptic unstable transcript mediates transcriptional trans-silencing of the Ty1 retrotransposon in S. cerevisiae
Genes Development 22:615–626.
https://doi.org/10.1101/gad.458008
- Google Scholar
1. Bertone P
2. Stolc V
3. Royce TE
4. Rozowsky JS
5. Urban AE
6. Zhu X
7. Rinn JL
8. Tongprasit W
9. Samanta M
10. Weissman S
11. Gerstein M
12. Snyder M
(2004) Global identification of human transcribed sequences with genome tiling arrays
Science 306:2242–2246.
https://doi.org/10.1126/science.1103388
- Google Scholar
1. Birney E
2. Stamatoyannopoulos JA
3. Dutta A
4. Guigo R
5. Gingeras TR
6. Margulies EH
7. Weng Z
8. Snyder M
9. Dermitzakis ET
10. Thurman RE
11. Kuehn MS
12. Taylor CM
13. Neph S
14. Koch CM
15. Asthana S
16. Malhotra A
17. Adzhubei I
18. Greenbaum JA
19. Andrews RM
20. Flicek P
21. Boyle PJ
22. Cao H
23. Carter NP
24. Clelland GK
25. Davis S
26. Day N
27. Dhami P
28. Dillon SC
29. Dorschner MO
30. Fiegler H
31. Giresi PG
32. Goldy J
33. Hawrylycz M
34. Haydock A
35. Humbert R
36. James KD
37. Johnson BE
38. Johnson EM
39. Frum TT
40. Rosenzweig ER
41. Karnani N
42. Lee K
43. Lefebvre GC
44. Navas PA
45. Neri F
46. Parker SC
47. Sabo PJ
48. Sandstrom R
49. Shafer A
50. Vetrie D
51. Weaver M
52. Wilcox S
53. Yu M
54. Collins FS
55. Dekker J
56. Lieb JD
57. Tullius TD
58. Crawford GE
59. Sunyaev S
60. Noble WS
61. Dunham I
62. Denoeud F
63. Reymond A
64. Kapranov P
65. Rozowsky J
66. Zheng D
67. Castelo R
68. Frankish A
69. Harrow J
70. Ghosh S
71. Sandelin A
72. Hofacker IL
73. Baertsch R
74. Keefe D
75. Dike S
76. Cheng J
77. Hirsch HA
78. Sekinger EA
79. Lagarde J
80. Abril JF
81. Shahab A
82. Flamm C
83. Fried C
84. Hackermüller J
85. Hertel J
86. Lindemeyer M
87. Missal K
88. Tanzer A
89. Washietl S
90. Korbel J
91. Emanuelsson O
92. Pedersen JS
93. Holroyd N
94. Taylor R
95. Swarbreck D
96. Matthews N
97. Dickson MC
98. Thomas DJ
99. Weirauch MT
100. Gilbert J
101. Drenkow J
102. Bell I
103. Zhao X
104. Srinivasan KG
105. Sung WK
106. Ooi HS
107. Chiu KP
108. Foissac S
109. Alioto T
110. Brent M
111. Pachter L
112. Tress ML
113. Valencia A
114. Choo SW
115. Choo CY
116. Ucla C
117. Manzano C
118. Wyss C
119. Cheung E
120. Clark TG
121. Brown JB
122. Ganesh M
123. Patel S
124. Tammana H
125. Chrast J
126. Henrichsen CN
127. Kai C
128. Kawai J
129. Nagalakshmi U
130. Wu J
131. Lian Z
132. Lian J
133. Newburger P
134. Zhang X
135. Bickel P
136. Mattick JS
137. Carninci P
138. Hayashizaki Y
139. Weissman S
140. Hubbard T
141. Myers RM
142. Rogers J
143. Stadler PF
144. Lowe TM
145. Wei CL
146. Ruan Y
147. Struhl K
148. Gerstein M
149. Antonarakis SE
150. Fu Y
151. Green ED
152. Karaöz U
153. Siepel A
154. Taylor J
155. Liefer LA
156. Wetterstrand KA
157. Good PJ
158. Feingold EA
159. Guyer MS
160. Cooper GM
161. Asimenos G
162. Dewey CN
163. Hou M
164. Nikolaev S
165. Montoya-Burgos JI
166. Löytynoja A
167. Whelan S
168. Pardi F
169. Massingham T
170. Huang H
171. Zhang NR
172. Holmes I
173. Mullikin JC
174. Ureta-Vidal A
175. Paten B
176. Seringhaus M
177. Church D
178. Rosenbloom K
179. Kent WJ
180. Stone EA
181. Batzoglou S
182. Goldman N
183. Hardison RC
184. Haussler D
185. Miller W
186. Sidow A
187. Trinklein ND
188. Zhang ZD
189. Barrera L
190. Stuart R
191. King DC
192. Ameur A
193. Enroth S
194. Bieda MC
195. Kim J
196. Bhinge AA
197. Jiang N
198. Liu J
199. Yao F
200. Vega VB
201. Lee CW
202. Ng P
203. Shahab A
204. Yang A
205. Moqtaderi Z
206. Zhu Z
207. Xu X
208. Squazzo S
209. Oberley MJ
210. Inman D
211. Singer MA
212. Richmond TA
213. Munn KJ
214. Rada-Iglesias A
215. Wallerman O
216. Komorowski J
217. Fowler JC
218. Couttet P
219. Bruce AW
220. Dovey OM
221. Ellis PD
222. Langford CF
223. Nix DA
224. Euskirchen G
225. Hartman S
226. Urban AE
227. Kraus P
228. Van Calcar S
229. Heintzman N
230. Kim TH
231. Wang K
232. Qu C
233. Hon G
234. Luna R
235. Glass CK
236. Rosenfeld MG
237. Aldred SF
238. Cooper SJ
239. Halees A
240. Lin JM
241. Shulha HP
242. Zhang X
243. Xu M
244. Haidar JN
245. Yu Y
246. Ruan Y
247. Iyer VR
248. Green RD
249. Wadelius C
250. Farnham PJ
251. Ren B
252. Harte RA
253. Hinrichs AS
254. Trumbower H
255. Clawson H
256. Hillman-Jackson J
257. Zweig AS
258. Smith K
259. Thakkapallayil A
260. Barber G
261. Kuhn RM
262. Karolchik D
263. Armengol L
264. Bird CP
265. de Bakker PI
266. Kern AD
267. Lopez-Bigas N
268. Martin JD
269. Stranger BE
270. Woodroffe A
271. Davydov E
272. Dimas A
273. Eyras E
274. Hallgrímsdóttir IB
275. Huppert J
276. Zody MC
277. Abecasis GR
278. Estivill X
279. Bouffard GG
280. Guan X
281. Hansen NF
282. Idol JR
283. Maduro VV
284. Maskeri B
285. McDowell JC
286. Park M
287. Thomas PJ
288. Young AC
289. Blakesley RW
290. Muzny DM
291. Sodergren E
292. Wheeler DA
293. Worley KC
294. Jiang H
295. Weinstock GM
296. Gibbs RA
297. Graves T
298. Fulton R
299. Mardis ER
300. Wilson RK
301. Clamp M
302. Cuff J
303. Gnerre S
304. Jaffe DB
305. Chang JL
306. Lindblad-Toh K
307. Lander ES
308. Koriabine M
309. Nefedov M
310. Osoegawa K
311. Yoshinaga Y
312. Zhu B
313. de Jong PJ
(2007) Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project
Nature 447:799–816.
https://doi.org/10.1038/nature05874
- Google Scholar
1. Borsani O
2. Zhu J
3. Verslues PE
4. Sunkar R
5. Zhu JK
(2005) Endogenous siRNAs derived from a pair of natural cis-antisense transcripts regulate salt tolerance in Arabidopsis
Cell 123:1279–1291.
https://doi.org/10.1016/j.cell.2005.11.035
- Google Scholar
1. Buhler M
2. Spies N
3. Bartel DP
4. Moazed D
(2008) TRAMP-mediated RNA surveillance prevents spurious entry of RNAs into the Schizosaccharomyces pombe siRNA pathway
Nature Structure & Molecular Biology 15:1015–1023.
https://doi.org/10.1038/nsmb.1481
- Google Scholar
1. Cam HP
2. Noma K
3. Ebina H
4. Levin HL
5. Grewal SI
(2008) Host genome surveillance for retrotransposons by transposon-derived proteins
Nature 451:431–436.
https://doi.org/10.1038/nature06499
- Google Scholar
(2007) Antisense RNA stabilization induces transcriptional gene silencing via histone deacetylation in S. cerevisiae
Cell 131:706–717.
https://doi.org/10.1016/j.cell.2007.09.014
- Google Scholar
1. Carlile M
2. Swan D
3. Jackson K
4. Preston-Fayers K
5. Ballester B
6. Flicek P
7. Werner A
(2009) Strand selective generation of endo-siRNAs from the Na/phosphate transporter gene Slc34a1 in murine tissues
Nucleic Acids Research 37:2274–2282.
https://doi.org/10.1093/nar/gkp088
- Google Scholar
1. Carninci P
2. Kasukawa T
3. Katayama S
4. Gough J
5. Frith MC
6. Maeda N
7. Oyama R
8. Ravasi T
9. Lenhard B
10. Wells C
11. Kodzius R
12. Shimokawa K
13. Bajic VB
14. Brenner SE
15. Batalov S
16. Forrest AR
17. Zavolan M
18. Davis MJ
19. Wilming LG
20. Aidinis V
21. Allen JE
22. Ambesi-Impiombato A
23. Apweiler R
24. Aturaliya RN
25. Bailey TL
26. Bansal M
27. Baxter L
28. Beisel KW
29. Bersano T
30. Bono H
31. Chalk AM
32. Chiu KP
33. Choudhary V
34. Christoffels A
35. Clutterbuck DR
36. Crowe ML
37. Dalla E
38. Dalrymple BP
39. de Bono B
40. Della Gatta G
41. di Bernardo D
42. Down T
43. Engstrom P
44. Fagiolini M
45. Faulkner G
46. Fletcher CF
47. Fukushima T
48. Furuno M
49. Futaki S
50. Gariboldi M
51. Georgii-Hemming P
52. Gingeras TR
53. Gojobori T
54. Green RE
55. Gustincich S
56. Harbers M
57. Hayashi Y
58. Hensch TK
59. Hirokawa N
60. Hill D
61. Huminiecki L
62. Iacono M
63. Ikeo K
64. Iwama A
65. Ishikawa T
66. Jakt M
67. Kanapin A
68. Katoh M
69. Kawasawa Y
70. Kelso J
71. Kitamura H
72. Kitano H
73. Kollias G
74. Krishnan SP
75. Kruger A
76. Kummerfeld SK
77. Kurochkin IV
78. Lareau LF
79. Lazarevic D
80. Lipovich L
81. Liu J
82. Liuni S
83. McWilliam S
84. Madan Babu M
85. Madera M
86. Marchionni L
87. Matsuda H
88. Matsuzawa S
89. Miki H
90. Mignone F
91. Miyake S
92. Morris K
93. Mottagui-Tabar S
94. Mulder N
95. Nakano N
96. Nakauchi H
97. Ng P
98. Nilsson R
99. Nishiguchi S
100. Nishikawa S
101. Nori F
102. Ohara O
103. Okazaki Y
104. Orlando V
105. Pang KC
106. Pavan WJ
107. Pavesi G
108. Pesole G
109. Petrovsky N
110. Piazza S
111. Reed J
112. Reid JF
113. Ring BZ
114. Ringwald M
115. Rost B
116. Ruan Y
117. Salzberg SL
118. Sandelin A
119. Schneider C
120. Schönbach C
121. Sekiguchi K
122. Semple CA
123. Seno S
124. Sessa L
125. Sheng Y
126. Shibata Y
127. Shimada H
128. Shimada K
129. Silva D
130. Sinclair B
131. Sperling S
132. Stupka E
133. Sugiura K
134. Sultana R
135. Takenaka Y
136. Taki K
137. Tammoja K
138. Tan SL
139. Tang S
140. Taylor MS
141. Tegner J
142. Teichmann SA
143. Ueda HR
144. van Nimwegen E
145. Verardo R
146. Wei CL
147. Yagi K
148. Yamanishi H
149. Zabarovsky E
150. Zhu S
151. Zimmer A
152. Hide W
153. Bult C
154. Grimmond SM
155. Teasdale RD
156. Liu ET
157. Brusic V
158. Quackenbush J
159. Wahlestedt C
160. Mattick JS
161. Hume DA
162. Kai C
163. Sasaki D
164. Tomaru Y
165. Fukuda S
166. Kanamori-Katayama M
167. Suzuki M
168. Aoki J
169. Arakawa T
170. Iida J
171. Imamura K
172. Itoh M
173. Kato T
174. Kawaji H
175. Kawagashira N
176. Kawashima T
177. Kojima M
178. Kondo S
179. Konno H
180. Nakano K
181. Ninomiya N
182. Nishio T
183. Okada M
184. Plessy C
185. Shibata K
186. Shiraki T
187. Suzuki S
188. Tagami M
189. Waki K
190. Watahiki A
191. Okamura-Oho Y
192. Suzuki H
193. Kawai J
194. Hayashizaki Y
195. FANTOM Consortium
196. Genome Network Project Core Group
(2005) The transcriptional landscape of the mammalian genome
Science 309:1559–1563.
https://doi.org/10.1126/science.1112014
- Google Scholar
1. Cernilogar FM
2. Onorati MC
3. Kothe GO
4. Burroughs AM
5. Parsi KM
6. Breiling A
7. Lo Sardo F
8. Saxena A
9. Miyoshi K
10. Siomi H
11. Carninci P
12. Gilmour DS
13. Corona DF
14. Orlando V
(2011) Chromatin-associated RNA interference components contribute to transcriptional regulation in Drosophila
Nature 480:391–395.
https://doi.org/10.1038/nature10492
- Google Scholar
1. Chekanova JA
2. Gregory BD
3. Reverdatto SV
4. Chen H
5. Kumar R
6. Hooker T
7. Yazaki J
8. Li P
9. Skiba N
10. Peng Q
11. Alonso J
12. Brukhin V
13. Grossniklaus U
14. Ecker JR
15. Belostotsky DA
(2007) Genome-wide high-resolution mapping of exosome substrates reveals hidden features in the Arabidopsis transcriptome
Cell 131:1340–1353.
https://doi.org/10.1016/j.cell.2007.10.056
- Google Scholar
1. Cheng J
2. Kapranov P
3. Drenkow J
4. Dike S
5. Brubaker S
6. Patel S
7. Long J
8. Stern D
9. Tammana H
10. Helt G
11. Sementchenko V
12. Piccolboni A
13. Bekiranov S
14. Bailey DK
15. Ganesh M
16. Ghosh S
17. Bell I
18. Gerhard DS
19. Gingeras TR
(2005) Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution
Science 308:1149–1154.
https://doi.org/10.1126/science.1108625
- Google Scholar
1. Clark MB
2. Amaral PP
3. Schlesinger FJ
4. Dinger ME
5. Taft RJ
6. Rinn JL
7. Ponting CP
8. Stadler PF
9. Morris KV
10. Morillon A
11. Rozowsky JS
12. Gerstein MB
13. Wahlestedt C
14. Hayashizaki Y
15. Carninci P
16. Gingeras TR
17. Mattick JS
(2011) The reality of pervasive transcription
PLOS Biology 9:e1000625.
https://doi.org/10.1371/journal.pbio.1000625
- Google Scholar
1. David L
2. Huber W
3. Granovskaia M
4. Toedling J
5. Palm CJ
6. Bofkin L
7. Jones T
8. Davis RW
9. Steinmetz LM
(2006) A high-resolution map of transcription in the yeast genome
Proceedings of the National Academy of Sciences of the United States of America 103:5320–5325.
https://doi.org/10.1073/pnas.0601091103
- Google Scholar
1. Derrien T
2. Johnson R
3. Bussotti G
4. Tanzer A
5. Djebali S
6. Tilgner H
7. Guernec G
8. Martin D
9. Merkel A
10. Knowles DG
11. Lagarde J
12. Veeravalli L
13. Ruan X
14. Ruan Y
15. Lassmann T
16. Carninci P
17. Brown JB
18. Lipovich L
19. Gonzalez JM
20. Thomas M
21. Davis CA
22. Shiekhattar R
23. Gingeras TR
24. Hubbard TJ
25. Notredame C
26. Harrow J
27. Guigò R
(2012) The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression
Genome Research 22:1775–1789.
https://doi.org/10.1101/gr.132159.111
- Google Scholar
1. Ding SW
(2010) RNA-based antiviral immunity
Nature Reviews Immunology 10:632–644.
https://doi.org/10.1038/nri2824
- Google Scholar
(2013) The double-stranded RNA binding domain of human Dicer functions as a nuclear localization signal
RNA 19:1238–1252.
https://doi.org/10.1261/rna.039255.113
- Google Scholar
(2011) Compatibility with killer explains the rise of RNAi-deficient fungi
Science 333:1592.
https://doi.org/10.1126/science.1209575
- Google Scholar
1. Drinnenberg IA
2. Weinberg DE
3. Xie KT
4. Mower JP
5. Wolfe KH
6. Fink GR
7. Bartel DP
(2009) RNAi in budding yeast
Science 326:544–550.
https://doi.org/10.1126/science.1176945
- Google Scholar
1. Faghihi MA
2. Wahlestedt C
(2006) RNA interference is not involved in natural antisense mediated regulation of gene expression in mammals
Genome Biology 7:R38.
https://doi.org/10.1186/gb-2006-7-5-r38
- Google Scholar
1. Fire A
2. Xu S
3. Montgomery MK
4. Kostas SA
5. Driver SE
6. Mello CC
(1998) Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans
Nature 391:806–811.
https://doi.org/10.1038/35888
- Google Scholar
1. Gantier MP
2. Williams BR
(2007) The response of mammalian cells to double-stranded RNA
Cytokine & Growth Factor Reviews 18:363–371.
https://doi.org/10.1016/j.cytogfr.2007.06.016
- Google Scholar
1. Geisler S
2. Lojek L
3. Khalil AM
4. Baker KE
5. Coller J
(2012) Decapping of long noncoding RNAs regulates inducible genes
Molecular Cell 45:279–291.
https://doi.org/10.1016/j.molcel.2011.11.025
- Google Scholar
(2011) Regulated antisense transcription controls expression of cell-type specific genes in yeast
Molecular Cell Biology 31:1701–1709.
https://doi.org/10.1128/MCB.01071-10
- Google Scholar
(2000) A chromatin insulator determines the nuclear localization of DNA
Molecular Cell 6:1025–1035.
https://doi.org/10.1016/S1097-2765(00)00101-5
- Google Scholar
(2012) Pervasive antisense transcription is evolutionarily conserved in budding yeast
Molecular Biology and Evolution 30:409–421.
https://doi.org/10.1093/molbev/mss240
- Google Scholar
1. Gotta M
2. Laroche T
3. Formenton A
4. Maillet SA
5. Driver L
6. Scherthan H
7. Gasser SM
(1996) The clustering of telomeres and colocalization with Rap1, Sir3, and Sir4 proteins in wild-type Saccharomyces cerevisiae
Journal of Cell Biology 134:1349–1363.
https://doi.org/10.1083/jcb.134.6.1349
- Google Scholar
(2011) Autoregulation of convergent RNAi genes in fission yeast
Genes Development 25:556–568.
https://doi.org/10.1101/gad.618611
- Google Scholar
1. Gullerova M
2. Proudfoot NJ
(2012) Convergent transcription induces transcriptional gene silencing in fission yeast and mammalian cells
Nature Structure & Molecular Biology 19:1193–1201.
https://doi.org/10.1038/nsmb.2392
- Google Scholar
(2011) Intergenic transcription causes repression by directing nucleosome assembly
Genes Development 25:29–40.
https://doi.org/10.1101/gad.1975011
- Google Scholar
1. Hamilton AJ
2. Baulcombe DC
(1999) A species of small antisense RNA in posttranscriptional gene silencing in plants
Science 286:950–952.
https://doi.org/10.1126/science.286.5441.950
- Google Scholar
1. Hannon GJ
(2002) RNA interference
Nature 418:244–251.
https://doi.org/10.1038/418244a
- Google Scholar
1. Hirota K
2. Miyoshi T
3. Kugou K
4. Hoffman CS
5. Shibata T
6. Ohta K
(2008) Stepwise chromatin remodelling by a cascade of transcription initiation of non-coding RNAs
Nature 456:130–134.
https://doi.org/10.1038/nature07348
- Google Scholar
(2012) RNA polymerase II collision interrupts convergent transcription
Molecular Cell 48:365–374.
https://doi.org/10.1016/j.molcel.2012.08.027
- Google Scholar
(2011) RNA editing by mammalian ADARs
Advances in Genetics 73:87–120.
https://doi.org/10.1016/B978-0-12-380860-8.00003-3
- Google Scholar
(2006) Antisense transcription controls cell fate in Saccharomyces cerevisiae
Cell 127:735–745.
https://doi.org/10.1016/j.cell.2006.09.038
- Google Scholar
1. Houseley J
(2012) Form and function of eukaryotic unstable non-coding RNAs
Biochemical Society Transactions 40:836–841.
https://doi.org/10.1042/BST20120040
- Google Scholar
(2007) Trf4 targets ncRNAs from telomeric and rDNA spacer regions and functions in rDNA copy number control
The EMBO Journal 26:4996–5006.
https://doi.org/10.1038/sj.emboj.7601921
- Google Scholar
(2008) A ncRNA modulates histone modification and mRNA induction in the yeast GAL gene cluster
Molecular Cell 32:685–695.
https://doi.org/10.1016/j.molcel.2008.09.027
- Google Scholar
1. Houseley J
2. Tollervey D
(2009) The many pathways of RNA degradation
Cell 136:763–776.
https://doi.org/10.1016/j.cell.2009.01.019
- Google Scholar
1. Hsieh J
2. Fire A
(2000) Recognition and silencing of repeated DNA
Annual Review of Genetics 34:187–204.
https://doi.org/10.1146/annurev.genet.34.1.187
- Google Scholar
(2012) Biology of PIWI-interacting RNAs: new insights into biogenesis and function inside and outside of germlines
Genes Development 26:2361–2373.
https://doi.org/10.1101/gad.203786.112
- Google Scholar
1. Kapranov P
2. Cheng J
3. Dike S
4. Nix DA
5. Duttagupta R
6. Willingham AT
7. Stadler PF
8. Hertel J
9. Hackermuller J
10. Hofacker IL
11. Bell I
12. Cheung E
13. Drenkow J
14. Dumais E
15. Patel S
16. Helt G
17. Ganesh M
18. Ghosh S
19. Piccolboni A
20. Sementchenko V
21. Tammana H
22. Gingeras TR
(2007) RNA maps reveal new RNA classes and a possible function for pervasive transcription
Science 316:1484–1488.
https://doi.org/10.1126/science.1138341
- Google Scholar
(2005) Antisense transcription in the mammalian transcriptome
Science 309:1564–1566.
https://doi.org/10.1126/science.1112009
- Google Scholar
(2006) A pathogen-inducible endogenous siRNA in plant immunity
Proceedings of the National Academy of Sciences of the United States of America 103:18002–18007.
https://doi.org/10.1073/pnas.0608258103
- Google Scholar
1. Ketting RF
(2011) The many faces of RNAi
Developmental Cell 20:148–161.
https://doi.org/10.1016/j.devcel.2011.01.012
- Google Scholar
1. Kirmizis A
2. Santos-Rosa H
3. Penkett CJ
4. Singer MA
5. Vermeulen M
6. Mann M
7. Bahler J
8. Green RD
9. Kouzarides T
(2007) Arginine methylation at histone H3R2 controls deposition of H3K4 trimethylation
Nature 449:928–932.
https://doi.org/10.1038/nature06160
- Google Scholar
1. Kobayashi T
2. Ganley AR
(2005) Recombination regulation by transcription-induced cohesin dissociation in rDNA repeats
Science 309:1581–1584.
https://doi.org/10.1126/science.1116102
- Google Scholar
(2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome
Genome Biology 10:R25.
https://doi.org/10.1186/gb-2009-10-3-r25
- Google Scholar
1. Latos PA
2. Pauler FM
3. Koerner MV
4. Senergin HB
5. Hudson QJ
6. Stocsits RR
7. Allhoff W
8. Stricker SH
9. Klement RM
10. Warczok KE
11. Aumayr K
12. Pasierbek P
13. Barlow DP
(2012) Airn transcriptional overlap, but not its lncRNA products, induces imprinted Igf2r silencing
Science 338:1469–1472.
https://doi.org/10.1126/science.1228110
- Google Scholar
(2010) On the connection between RNAi and heterochromatin at centromeres
Cold Spring Harbor Symposia on Quantitative Biology 75:275–283.
https://doi.org/10.1101/sqb.2010.75.024
- Google Scholar
1. Li Y
2. Lu J
3. Han Y
4. Fan X
5. Ding SW
(2013) RNA interference functions as an antiviral immunity mechanism in mammals
Science 342:231–234.
https://doi.org/10.1126/science.1241911
- Google Scholar
1. Maillard PV
2. Ciaudo C
3. Marchais A
4. Li Y
5. Jay F
6. Ding SW
7. Voinnet O
(2013) Antiviral RNA interference in mammalian cells
Science 342:235–238.
https://doi.org/10.1126/science.1241930
- Google Scholar
(2004) Intergenic transcription is required to repress the Saccharomyces cerevisiae SER3 gene
Nature 429:571–574.
https://doi.org/10.1038/nature02538
- Google Scholar
(2005) RNA interference and heterochromatin in the fission yeast Schizosaccharomyces pombe
Trends in Genetics 21:450–456.
https://doi.org/10.1016/j.tig.2005.06.005
- Google Scholar
1. Nagano T
2. Mitchell JA
3. Sanz LA
4. Pauler FM
5. Ferguson-Smith AC
6. Feil R
7. Fraser P
(2008) The Air noncoding RNA epigenetically silences transcription by targeting G9a to chromatin
Science 322:1717–1720.
https://doi.org/10.1126/science.1163802
- Google Scholar
(2009) Widespread bidirectional promoters are the major source of cryptic transcripts in yeast
Nature 457:1038–1042.
https://doi.org/10.1038/nature07747
- Google Scholar
(2012a) dsRNA expression in the mouse elicits RNAi in oocytes and low adenosine deamination in somatic cells
Nucleic Acids Research 40:399–413.
https://doi.org/10.1093/nar/gkr702
- Google Scholar
(2012b) Deep sequencing reveals complex spurious transcription from transiently transfected plasmids
PLOS ONE 7:e43283.
https://doi.org/10.1371/journal.pone.0043283
- Google Scholar
1. Okamura K
2. Balla S
3. Martin R
4. Liu N
5. Lai EC
(2008) Two distinct mechanisms generate endogenous siRNAs from bidirectional transcription in Drosophila melanogaster
Nature Structure & Molecular Biology 15:581–590.
https://doi.org/10.1038/nsmb.1438
- Google Scholar
1. Pall GS
2. Hamilton AJ
(2008) Improved northern blot method for enhanced detection of small RNA
Nature Protocols 3:1077–1084.
https://doi.org/10.1038/nprot.2008.67
- Google Scholar
1. Pandey RR
2. Mondal T
3. Mohammad F
4. Enroth S
5. Redrup L
6. Komorowski J
7. Nagano T
8. Mancini-Dinardo D
9. Kanduri C
(2008) Kcnq1ot1 antisense noncoding RNA mediates lineage-specific transcriptional silencing through chromatin-level regulation
Molecular Cell 32:232–246.
https://doi.org/10.1016/j.molcel.2008.08.022
- Google Scholar
(2009) H3 lysine 4 di- and tri-methylation deposited by cryptic transcription attenuates promoter activation
The EMBO Journal 28:1697–1707.
https://doi.org/10.1038/emboj.2009.108
- Google Scholar
(2008) RNA exosome depletion reveals transcription upstream of active human promoters
Science 322:1851–1854.
https://doi.org/10.1126/science.1164096
- Google Scholar
1. Pryde FE
2. Louis EJ
(1999) Limitations of silencing at native yeast telomeres
The EMBO Journal 18:2538–2550.
https://doi.org/10.1093/emboj/18.9.2538
- Google Scholar
1. Redrup L
2. Branco MR
3. Perdeaux ER
4. Krueger C
5. Lewis A
6. Santos F
7. Nagano T
8. Cobb BS
9. Fraser P
10. Reik W
(2009) The long noncoding RNA Kcnq1ot1 organises a lineage-specific nuclear domain for epigenetic gene silencing
Development 136:525–530.
https://doi.org/10.1242/dev.031328
- Google Scholar
Book
1. Sambrook J
2. Russell DW
(2001)
Molecular cloning: a laboratory manual (3rd edition)

Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press.
- Google Scholar
(1991) Monoclonal antibodies to double-stranded RNA as probes of RNA structure in crude nucleic acid extracts
Nucleic Acids Research 19:2993–3000.
https://doi.org/10.1093/nar/19.11.2993
- Google Scholar
(2010) Dicer is associated with ribosomal DNA chromatin in mammalian cells
PLOS ONE 5:e12175.
https://doi.org/10.1371/journal.pone.0012175
- Google Scholar
1. Siomi MC
2. Sato K
3. Pezic D
4. Aravin AA
(2011) PIWI-interacting small RNAs: the vanguard of genome defence
Nature Reviews Molecular Cell Biology 12:246–258.
https://doi.org/10.1038/nrm3089
- Google Scholar
1. Slotkin RK
2. Martienssen R
(2007) Transposable elements and the epigenetic regulation of the genome
Nature Reviews Genetics 8:272–285.
https://doi.org/10.1038/nrg2072
- Google Scholar
1. Song R
2. Hennig GW
3. Wu Q
4. Jose C
5. Zheng H
6. Yan W
(2011) Male germ cells express abundant endogenous siRNAs
Proceedings of the National Academy of Sciences of the United States of America 108:13159–13164.
https://doi.org/10.1073/pnas.1108567108
- Google Scholar
1. Tam OH
2. Aravin AA
3. Stein P
4. Girard A
5. Murchison EP
6. Cheloufi S
7. Hodges E
8. Anger M
9. Sachidanandam R
10. Schultz RM
et al. (2008) Pseudogene-derived small interfering RNAs regulate gene expression in mouse oocytes
Nature 453:534–538.
https://doi.org/10.1038/nature06904
- Google Scholar
1. The ENCODE Project Constorium
(2012) An integrated encyclopedia of DNA elements in the human genome
Nature 489:57–74.
https://doi.org/10.1038/nature11247
- Google Scholar
(2007) A role for noncoding transcription in activation of the yeast PHO5 gene
Proceedings of the National Academy of Sciences of the United States of America 104:8011–8016.
https://doi.org/10.1073/pnas.0702431104
- Google Scholar
(2010) Most “dark matter” transcripts are associated with known genes
PLOS Biology 8:e1000371.
https://doi.org/10.1371/journal.pbio.1000371
- Google Scholar
1. van Dijk EL
2. Chen CL
3. d’Aubenton-Carafa Y
4. Gourvennec S
5. Kwapisz M
6. Roche V
7. Bertrand C
8. Silvain M
9. Legoix-Ne P
10. Loeillet S
11. Nicolas A
12. Thermes C
13. Morillon A
(2011) XUTs are a class of Xrn1-sensitive antisense regulatory non-coding RNA in yeast
Nature 475:114–117.
https://doi.org/10.1038/nature10118
- Google Scholar
(2012) Transcription of two long noncoding RNAs mediates mating-type control of gametogenesis in budding yeast
Cell 150:1170–1181.
https://doi.org/10.1016/j.cell.2012.06.049
- Google Scholar
1. Velmurugan S
2. Yang XM
3. Chan CSM
4. Dobson M
5. Jayaram M
(2000) Partitioning of the 2-microm circle plasmid of Saccharomyces cerevisiae. Functional coordination with chromosome segregation and plasmid-encoded rep protein distribution
Journal of Cell Biology 149:553–566.
https://doi.org/10.1083/jcb.149.3.553
- Google Scholar
1. Watanabe T
2. Takeda A
3. Tsukiyama T
4. Mise K
5. Okuno T
6. Sasaki H
7. Minami N
8. Imai H
(2006) Identification and characterization of two novel classes of small RNAs in the mouse germline: retrotransposon-derived siRNAs in oocytes and germline small RNAs in testes
Genes Development 20:1732–1743.
https://doi.org/10.1101/gad.1425706
- Google Scholar
1. Watanabe T
2. Totoki Y
3. Toyoda A
4. Kaneda M
5. Kuramochi-Miyagawa S
6. Obata Y
7. Chiba H
8. Kohara Y
9. Kono T
10. Nakano T
11. Surani MA
12. Sakaki Y
13. Sasaki H
(2008) Endogenous siRNAs from naturally formed dsRNAs regulate transcripts in mouse oocytes
Nature 453:539–543.
https://doi.org/10.1038/nature06908
- Google Scholar
(2002) RSC2, encoding a component of the RSC nucleosome remodeling complex, is essential for 2 microm plasmid maintenance in Saccharomyces cerevisiae
Molecular and Cellular Biology 22:4218–4229.
https://doi.org/10.1128/MCB.22.12.4218-4229.2002
- Google Scholar
1. Xu Z
2. Wei W
3. Gagneur J
4. Perocchi F
5. Clauder-Munster S
6. Camblong J
7. Guffanti E
8. Stutz F
9. Huber W
10. Steinmetz LM
(2009) Bidirectional promoters generate pervasive transcription in yeast
Nature 457:1033–1037.
https://doi.org/10.1038/nature07728
- Google Scholar
(1998) Y’-Help1, a DNA helicase encoded by the yeast subtelomeric Y’ element, is induced in survivors defective for telomerase
The Journal of Biological Chemistry 273:33360–33366.
https://doi.org/10.1074/jbc.273.50.33360
- Google Scholar
1. Yamanaka S
2. Mehta S
3. Reyes-Turcu FE
4. Zhuang F
5. Fuchs RT
6. Rong Y
7. Robb GB
8. Grewal SI
(2012) RNAi triggered by specialized machinery silences developmental genes and retrotransposons
Nature 493:557–560.
https://doi.org/10.1038/nature11716
- Google Scholar
1. Yang N
2. Kazazian HH Jnr
(2006) L1 retrotransposition is suppressed by endogenously encoded small interfering RNAs in human cultured cells
Nature Structure & Molecular Biology 13:763–771.
https://doi.org/10.1038/nsmb1141
- Google Scholar
1. Zhang H
2. Zhu JK
(2011) RNA-directed DNA methylation
Current Opinion in Plant Biology 14:142–147.
https://doi.org/10.1016/j.pbi.2011.02.003
- Google Scholar

Article and author information

Author details

Cristina Cruz

Epigenetics Programme, The Babraham Institute, Cambridge, United Kingdom

Contribution
CC, Conception and design, Acquisition of data, Analysis and interpretation of data

Competing interests
The authors declare that no competing interests exist.

Jonathan Houseley

Epigenetics Programme, The Babraham Institute, Cambridge, United Kingdom

Contribution
JH, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article

For correspondence
jon.houseley@babraham.ac.uk

Competing interests
The authors declare that no competing interests exist.

Funding

Wellcome Trust (088335)

Cristina Cruz
Jonathan Houseley

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We thank Simon Andrews, Felix Krueger, Laura Biggins and Anne Segonds-Pichon for bioinformatics support, Simon Walker for microscopy support, David Bartel for strains, Robin Allshire, Wolf Reik and Sarah Elderkin for comments, and Alex Murray and Tim Hore for critical reading. This work was supported by the Wellcome Trust [grant number 088335].

Version history

Received: September 23, 2013
Accepted: December 29, 2013
Version of Record published: February 11, 2014 (version 1)

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

2,005 views
206 downloads
24 citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Cite this article

Cristina Cruz, Jonathan Houseley
(2014) Endogenous RNA interference is driven by copy number
eLife 3:e01581.
https://doi.org/10.7554/eLife.01581

Categories and tags

Research organism

S. cerevisiae
