RNA-dependent chromatin association of transcription elongation factors and Pol II CTD kinases

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

For transcription through chromatin, RNA polymerase (Pol) II associates with elongation factors (EFs). Here we show that many EFs crosslink to RNA emerging from transcribing Pol II in the yeast Saccharomyces cerevisiae. Most EFs crosslink preferentially to mRNAs, rather than unstable non-coding RNAs. RNA contributes to chromatin association of many EFs, including the Pol II serine 2 kinases Ctk1 and Bur1 and the histone H3 methyltransferases Set1 and Set2. The Ctk1 kinase complex binds RNA in vitro, consistent with direct EF-RNA interaction. Set1 recruitment to genes in vivo depends on its RNA recognition motifs (RRMs). These results strongly suggest that nascent RNA contributes to EF recruitment to transcribing Pol II. We propose that EF-RNA interactions facilitate assembly of the elongation complex on transcribed genes when RNA emerges from Pol II, and that loss of EF-RNA interactions upon RNA cleavage at the polyadenylation site triggers disassembly of the elongation complex.

https://doi.org/10.7554/eLife.25637.001

Introduction

For productive transcription through chromatin, RNA polymerase (Pol) II associates with general elongation factors (EFs) (Perales and Bentley, 2009; Shilatifard, 2004; Shilatifard et al., 2003; Sims et al., 2004) that are recruited to the body of transcribed genes in yeast (Mayer et al., 2010). EFs in yeast include Spt5 (a subunit of human DSIF), the histone chaperone Spt6, and the Paf1 complex (Paf1C). The Pol II C-terminal domain (CTD) kinases Bur1 (human CDK9) and Ctk1 (human CDK12), and their cyclin partners Bur2 and Ctk2, respectively, can also be classified as EFs. In addition, the histone methyltransferases Set1 (a subunit of the COMPASS complex), Set2, and Dot1, are recruited to elongating Pol II to set the ‘active’ histone marks H3K4me3, H3K36me3, and H3K79me3, respectively.

Despite extensive research, it remains unclear for several EFs how they are recruited to active genes. EFs may be recruited by interactions with the body of transcribing Pol II, or by contacts with the tail-like C-terminal repeat domain (CTD) of Pol II, or they may bind via other Pol II-associated EFs. Spt5 binds the body of the Pol II elongation complex (Grohmann et al., 2011; Klein et al., 2011; Martinez-Rucobo et al., 2011), whereas Bur1, Spt6 and Set2 bind the CTD (Dengl et al., 2009; Kizer et al., 2005; Li et al., 2003; Phatnani et al., 2004; Sun et al., 2010; Yoh et al., 2007; Qiu et al., 2009; Li et al., 2002). Interaction of Paf1C with Pol II involves Spt5 (Liu et al., 2009; Mayekar et al., 2013; Wier et al., 2013; Zhou et al., 2009; Qiu et al., 2012, 2009) and the CTD (Qiu et al., 2012), whereas interaction of Set1 with Pol II involves Paf1C (Krogan et al., 2003a; Ng et al., 2003).

However, other recruitment mechanisms exist because mutations in EFs that prevent their polymerase interactions do not abolish gene occupancy of such factors, including Bur1, Paf1C subunits, Spt6, and Set2 (Ng et al., 2003; Qiu et al., 2012, 2009; Mayer et al., 2010; Zhou et al., 2009; Krogan et al., 2003b). Further, it remains unknown how the yeast CTD serine 2 (Ser2) kinase Ctk1 is recruited, which is apparently a prerequisite for recruitment of Spt6 and Set2, because these factors bind the Ser2-phosphorylated CTD (Dengl et al., 2009; Kizer et al., 2005; Li et al., 2003; Phatnani et al., 2004; Sun et al., 2010; Yoh et al., 2007). More generally, it is unknown whether and how EFs can distinguish transcribing Pol II from free or initiating polymerase based on polymerase interactions alone, in particular at an early stage of elongation when Ser2 phosphorylation is absent.

An alternative mechanism of EF recruitment would involve interactions with the nascent pre-mRNA. Such RNA interactions are well established for RNA processing factors that are recruited during Pol II elongation (Perales and Bentley, 2009; Bentley, 2005; Baejen et al., 2014; Tuck and Tollervey, 2013) for co-transcriptional capping (Martinez-Rucobo et al., 2015), splicing (Bentley, 2005; Saldi et al., 2016), and 3′-processing (Proudfoot, 2011; Shi and Manley, 2015) of the pre-mRNA. Some observations indeed suggest that nascent RNA contributes to the recruitment of EFs to Pol II. Spt5 and Set1 bind RNA in vitro (Meyer et al., 2015; Missra and Gilmour, 2010; Trésaugues et al., 2006; Halbach et al., 2009), Ctk1 and Bur1 in vivo occupancy at active genes depends on the cap-binding complex, which binds 5′-capped RNA (Hossain et al., 2013; Lidschreiber et al., 2013), and Paf1C binds RNA, which is required for full gene occupancy (Dermody and Buratowski, 2010).

Here we report that most EFs in yeast, including, most notably, CTD Ser2 kinases and histone H3 methyltransferases, directly crosslink to nascent pre-mRNA in vivo. We find that crosslinking preferences can differ for coding RNAs and non-coding (nc) RNAs. We further show that chromatin association of many EFs depends on RNA. We also directly tested one prominent EF for RNA binding in vitro, and found that recombinant, purified Ctk1-containing kinase complex CTDK-I strongly binds RNA in the absence of other components. Moreover, we show that the N-terminal region of Set1 that contains two RNA recognition motifs (RRMs) is required for full Set1 recruitment to genes in vivo. Based on these results we suggest a model where nascent RNA contributes to the assembly and stability of the Pol II elongation complex. RNA-EF interactions provide a missing link for understanding the coordination of the transcription cycle.

Results

Elongation factors directly crosslink to RNA in vivo

To investigate whether EFs interact with RNA in vivo, we used photoactivatable ribonucleoside-enhanced crosslinking and immunoprecipitation (PAR-CLIP) (Hafner et al., 2010), a method that detects and maps direct protein-RNA interactions without chemical crosslinkers. We applied our recently optimized PAR-CLIP protocol (Baejen et al., 2014) to 14 EFs of the yeast Saccharomyces cerevisiae (Table 1, Materials and methods). These EFs included Spt5, Spt6, the five Paf1C subunits Cdc73, Ctr9, Leo1, Rtf1, and Paf1, the kinases Bur1 and Ctk1, the cyclins Bur2 and Ctk2, and the histone methyltransferases Set1, Set2, and Dot1.

Table 1

PAR-CLIP analysis of elongation factors (EFs).

https://doi.org/10.7554/eLife.25637.002

EF	Complex*	Number of crosslink sites^†
Bur1	BUR kinase complex	77931
Bur2	BUR kinase complex	46293
Ctk1	CTDK-I	129352
Ctk2	CTDK-I	98993
Cdc73	Paf1C	57603
Ctr9		55807
Leo1		27665
Paf1		20742
Rtf1		60068
Set1	COMPASS	189723
Set2		68875
Dot1		42848
Spt5^‡	DSIF	517568
Spt6		93902
TFIIB^§		16686

*DSIF, DRB sensitivity inducing factor; CTDK, C-terminal domain kinase; Paf1C, Paf1 complex; COMPASS, Complex Proteins Associated with Set1.
^†Average number of crosslink sites with p-values<0.005.
^‡(Baejen et al., 2017).
^§Initiation factor, used to determine the level of RNA background crosslinking

For 12 of these 14 EFs we obtained PAR-CLIP signals that were more than two-fold above background, showing that these EFs interact with RNA in vivo (Figure 1, Figure 1—figure supplement 1A). We obtained between 42,000 and 520,000 high-confidence protein-RNA crosslinking sites per factor with p-values below 0.005 (Table 1). The obtained data sets were highly reproducible (Figure 1—figure supplement 1B). To estimate background RNA binding, we collected PAR-CLIP data for the transcription initiation factor TFIIB that is recruited to promoter DNA before nascent RNA is made (Sainsbury et al., 2015). Only very low levels of background binding were observed, further emphasizing the significance of EF-RNA interactions detected by UV crosslinking.

Figure 1 with 1 supplement see all

Download asset Open asset

Many elongation factors (EFs) bind RNA in vivo.

PAR-CLIP signal strength for EFs varies. The barplots show log2 fold-enrichments of transcript-averaged PAR-CLIP signals over the averaged PAR-CLIP signal for initiation factor TFIIB, which shows background RNA binding. Averaged PAR-CLIP signals were calculated by taking mean transcript PAR-CLIP signals averaged over all mRNAs, which were filtered to be 800–5000 nt long and at least 150 nt away from neighboring transcripts (2532 mRNAs). Heat plots averaged over mRNA transcripts of the corresponding PAR-CLIP signals are shown in Figure 1—figure supplement 1A.

https://doi.org/10.7554/eLife.25637.003

We then classified EFs into factors with moderate and high PAR-CLIP signals, based on their fold enrichments (>2 and >4 fold, respectively) over background TFIIB signals (Figure 1). Spt5, Set1, Ctk1, Spt6, Ctk2 and Bur1 showed high PAR-CLIP signals (Figure 1, Figure 1—figure supplement 1A, Table 1). EFs with moderate signals included Rtf1, Ctr9, Cdc73, Bur2, Set2 and Dot1. PAR-CLIP signals were clearly specific for individual subunits of known complexes. For instance, only the Paf1C subunits Rtf1, Cdc73 and Ctr9 bound RNA according to the PAR-CLIP results, and the same subunits bound radioactively labeled RNA after immunoprecipitation (Figure 1—figure supplement 1C). A very low background signal was observed for other subunits, whereas the enriched bands were due to the protein of interest. These data revealed that many EFs directly bind RNA in vivo, including Pol II Ser2 kinases and histone H3 methyltransferases.

Comparisons of PAR-CLIP data require normalization

We have previously noted the importance of normalizing the raw PAR-CLIP signal, as measured by the number of U-to-C transitions per U site, to account for differences in RNA abundance (Baejen et al., 2014). Briefly, the raw PAR-CLIP signal is proportional to the occupancy of the factor on RNA and to the concentration of RNAs covering the U site. Therefore, normalization is crucial to enable comparison of PAR-CLIP signals between individual transcripts and transcript classes. Relative occupancies can be estimated by dividing the observed PAR-CLIP signal by RNA-Seq reads that have been obtained under the same experimental conditions (Baejen et al., 2014). An alternative approach is to divide the observed PAR-CLIP signal by a PAR-CLIP signal obtained for Pol II (Baejen et al., 2017), although this is only suitable for proteins that associate with nascent RNA during transcription, which is the case for the EFs studied here.

In Figure 2 we investigate how the two different normalization methods affect EF occupancy profiles on mRNA transcripts. For two representative EFs, Ctk2 and Spt5, the raw data (Figure 2A) was either normalized with RNA-Seq reads (Figure 2B) or with reads from Pol II (Rpb1 subunit) PAR-CLIP data (Figure 2C). Meta-transcript profiles are shown in Figure 2D. In the case of Ctk2, the raw data profile and the Pol II normalized profile look very similar, whereas the RNA-normalized profile shows slightly less occupancy of Ctk2 in the 3′ part of the transcripts, due to the slightly higher RNA-Seq signal in this region (Figure 2B, bottom). The PAR-CLIP signal for Spt5 is enriched around the 5′-end of mRNAs, decreases towards the 3′-end, and this was independent of the normalization approach (Figure 2D, bottom). However, Spt5 signals peak just downstream of the pA site, and the size of this peak varies dependent on the normalization approach. This is due to the intrinsic instability of transcripts downstream of the pA site, which reduces the number of RNA-Seq reads, and artificially increases the PAR-CLIP peak after RNA-Seq-based normalization.

Figure 2 with 1 supplement see all

Download asset Open asset

Normalization of PAR-CLIP data shown for two representative EFs, Ctk2 (top) and Spt5 (bottom).

(A) Smoothed, raw RNA-binding strength as measured by the number of PAR-CLIP U-to-C transitions per U site for all mRNAs sorted by length and aligned at their RNA 5′-end (transcription start site, TSS). (B) Relative occupancy estimated by dividing the number of U-to-C transitions for each U site by the RNA-Seq signal at the corresponding genomic position for all mRNAs. A heat map showing the transcript-averaged RNA-Seq reads for all mRNAs scaled to the same length is shown below. (C) Relative occupancy estimated by dividing the number of U-to-C transitions for each U site by the Rpb1 PAR-CLIP signal at the corresponding genomic position for all mRNAs. A heat map showing the transcript-averaged Rpb1 PAR-CLIP reads for all mRNAs scaled to the same length is shown below. (D) Smoothed, raw and normalized PAR-CLIP signals as shown in A-C but averaged over all mRNAs. Before averaging RNA occupancy profiles were aligned at the RNA 5′-end and length-scaled such that the 5′-ends and pA sites coincided.

https://doi.org/10.7554/eLife.25637.005

Taken together, the PAR-CLIP metagene profiles over stable transcripts were largely independent of the type of normalization used, whereas normalization becomes very important when crosslinking to unstable RNAs is investigated. Indeed, when we compare meta-profiles over cryptic unstable transcripts (CUTs) versus stable mRNAs using the different normalization methods (Figure 2—figure supplement 1), we observe that for proteins that bind CUTs (e.g. Spt5) the relative signal over CUTs increases when total RNA-Seq reads are used for normalization, similarly as for unstable transcripts downstream of the pA site (Figure 2D, bottom). Since we were interested in comparing EF occupancies between transcript classes, including unstable RNAs, we used Pol II PAR-CLIP normalization to calculate normalized EF PAR-CLIP occupancies, and used these for further analysis.

EF localization along mRNA transcripts

To localize EFs on transcripts, we mapped the Pol II normalized PAR-CLIP occupancies onto transcripts in different classes (Materials and methods). We then calculated factor occupancies for 2532 mRNA transcripts that were filtered to reduce ambiguous signals from overlapping transcripts. We calculated heat maps with occupancies averaged around the transcript 5′-end, which corresponds to the transcription start site (TSS), and around the polyadenylation (pA) site (Figure 3A). The obtained profiles were also visible on individual transcripts (Figure 3—figure supplement 1A).

Figure 3 with 1 supplement see all

Download asset Open asset

mRNA-binding profiles of EFs.

(A) Smoothed, transcript-averaged Pol II normalized PAR-CLIP occupancy profiles of EFs centered around the transcript 5′-end (transcription start site, TSS) [−150 nt to +400 nt] and pA site [−400 nt to +150 nt] of a set of 2532 filtered mRNAs (compare Figure 1). Only factors with average RNA-binding occupancies > 2 fold above background are shown. The Spt5 PAR-CLIP profile reveals a peak downstream of the pA site that is discussed in detail elsewhere (Baejen et al., 2017). The color code shows the occupancy relative to the maximum occupancy per profile (dark blue). (B) EFs bind to pre-mRNA. Processing indices (PIs) measure preferential binding of factors to uncleaved pre-mRNA with respect to cleaved RNA, computed as log2 odds ratios uncleaved versus cleaved RNA bound by the factor (Materials and methods). The PIs for Pab1 and Pub1, as typical factors binding mature mRNA (Baejen et al., 2014), are shown for comparison. (C) Colocalization of factor crosslinking sites on transcripts. Euclidean distances between pairwise colocalization measures were subjected to average-linkage hierarchical clustering (Materials and methods). The cluster dendrogram shows similarities in crosslinking locations on transcripts between EFs and published RNA processing factors (Baejen et al., 2014; Schulz et al., 2013).

https://doi.org/10.7554/eLife.25637.007

Generally, PAR-CLIP occupancies were high at the 5′-end of mRNAs and decreased shortly before the pA site, with few exceptions (Figure 3A). First, the histone methyltransferases Set2 and Dot1, for which the corresponding methylation marks accumulate in gene bodies (Bannister et al., 2005; Pokholok et al., 2005), showed more RNA-binding sites over transcript bodies. Second, Set1 crosslinked to mRNAs mainly near the beginning of transcripts, which was expected since Set1 and its methylation mark, H3K4me3, are observed in promoter-proximal regions of genes (Ng et al., 2003). Third, the kinases Ctk1 and Bur1 and their cyclin partners Ctk2 and Bur2 were enriched near the 5′-end but also in the transcript body. The 5′-peak for Bur1-Bur2 slightly preceded that of Ctk1-Ctk2. The three Paf1C subunits Cdc73, Ctr9 and Rtf1 showed similar occupancy profiles as the kinases but with a focused peak at the 5′-end. Fourth, Spt5 and Spt6 showed high PAR-CLIP occupancy at the 5′-end of mRNAs and decreased occupancy towards the pA site. This analysis revealed specific differences in EF localization on mRNAs, and additionally suggested that EFs bind nascent RNA during transcription.

EFs bind nascent pre-mRNA

To test whether EFs interact with nascent pre-mRNA or with spliced, mature mRNA, we measured factor occupancies at introns, which are co-transcriptionally spliced out and subsequently degraded (Carrillo Oesterreich et al., 2016). All EFs cross-linked to introns (Figure 3—figure supplement 1B), indicating that they bind pre-mRNA. Most EFs bound to introns with a frequency that was comparable to that at exons, although Spt5 and Set1 showed slightly higher occupancy within introns, whereas Bur2, Set2 and Dot1 showed lower occupancy (Figure 3—figure supplement 1B). Taking into account that splicing generally occurs co-transcriptionally (Kornblihtt et al., 2004; Tennyson et al., 1995; Listerman et al., 2006), our data show that EFs interact with nascent pre-mRNA. However, only ~4% of yeast genes contain introns (Qin et al., 2016), preventing general statements related to all pre-mRNAs. We therefore calculated a processing index (PI) that measures preferential binding of factors to uncleaved pre-mRNA with respect to cleaved RNA (Materials and methods) (Baejen et al., 2014). All EFs showed positive PIs, indicating binding to pre-mRNA, in contrast to the negative PIs that we previously obtained for typical RNA binders of processed, mature mRNA, such as Pab1 and Pub1 (Figure 3B) (Baejen et al., 2014). We conclude that EFs preferentially interact with nascent pre-mRNA.

We next investigated where EFs localize on RNAs in relation to previously mapped mRNA biogenesis factors (Baejen et al., 2014). We determined the extent of factor colocalization by computing the average occupancy of factor A within ±20 nucleotides (nt) around RNA-binding sites of factor B and subjected the pairwise colocalization measures to hierarchical clustering (Figure 3C, Materials and methods). We found that Spt5 colocalizes with the Cbc2 subunit of the cap-binding complex, consistent with its recruitment during early elongation. Both Ctk1 and Bur1 colocalized with binding sites of Set1 and splicing factors. Paf1C subunits colocalized with Set2, whereas RNA 3′-processing and surveillance factors formed separate groups (Figure 3C). Together these data show a distinct distribution of EFs over RNAs, and suggested that EFs cooperate with other mRNA biogenesis factors during pre-mRNA binding.

Most EFs preferentially interact with coding transcripts

We next analyzed our PAR-CLIP data for EF binding to non-coding Pol II transcripts including short-lived cryptic unstable transcripts (CUTs), which often arise from upstream antisense transcription of bidirectional promoters (Wyers et al., 2005; Xu et al., 2009). We selected CUTs with a minimum length of 350 nt and compared transcript-averaged RNA-binding occupancies between CUTs and mRNAs (see Figure 4A). This revealed that EFs bind to these transcript classes with distinct preferences. Spt5 was equally distributed between CUTs and mRNAs whereas Set1 preferentially bound mRNAs. All other EFs were depleted at CUTs relative to their mRNA occupancies (Figure 4A). This was essentially independent of RNA length (Figure 4—figure supplement 1A). Thus, most EFs preferentially crosslink to coding RNAs.

Figure 4 with 1 supplement see all

Download asset Open asset

Asymmetric distribution of EFs at coding and non-coding transcripts.

(A) PAR-CLIP occupancies over mRNAs (left) and non-coding CUTs (right). Smoothed, averaged Pol II normalized RNA occupancy profiles were aligned at the RNA 5′-end (transcription start site, TSS) and scaled to a common length. The color code shows the occupancy relative to the maximum occupancy per factor over both transcript classes (dark blue) (see also Figure 4—figure supplement 1A). (B) and (C) PAR-CLIP occupancies at selected bidirectional promoters. Smoothed, averaged Pol II normalized RNA occupancy profiles for sense mRNA (right) and divergent CUT (left) were centered around their 5′-end (TSS) [−75 nt to +400 nt]. We considered only bidirectional promoters producing mRNAs and CUTs that did not overlap with any other transcripts in the depicted region. After normalization, average mRNA and CUT profiles were rescaled, setting the maximum occupancy to one and the minimum occupancy to 0 (see also Figure 4—figure supplement 1B and C).

https://doi.org/10.7554/eLife.25637.009

We then analyzed PAR-CLIP signals at bidirectional promoters, which produce mRNA in one direction and a CUT in the divergent direction (Figure 4B). We observed clear differences in PAR-CLIP signals for divergent directions. As in Figure 4A, Set1 and Spt5 showed high signals on CUTs and mRNAs (Figure 4B, top) whereas all other EFs bound exclusively to mRNAs (Figure 4B, bottom). These differences were also observed when the analysis was restricted to bidirectional promoters producing CUTs and mRNAs of similar lengths (Figure 4—figure supplement 1B).

How can some EFs distinguish between CUTs and mRNAs? We carried out motif analysis around the strongest PAR-CLIP sites for each EF using XXmotif (Luehr et al., 2012) and could not find any significantly enriched motifs, indicating that EFs bind RNA in a non-specific manner. We hypothesize that another RNA-binding factor blocks binding of EFs to CUTs. CUTs are rapidly degraded by a surveillance system, which includes Nrd1 (Schulz et al., 2013; Vasiljeva et al., 2008; Steinmetz and Brow, 1996). Nrd1 selectively binds to CUTs (Figure 4C) via motifs that are enriched in CUTs compared to mRNAs (Schulz et al., 2013). Binding of Nrd1 to CUTs might hinder RNA binding of some EFs, especially those which possess lower RNA binding affinity. This may explain how stable elongation complexes preferentially assemble on mRNAs.

Chromatin association of EFs depends on RNA

We next investigated whether RNA binding of EFs contributes to their association with chromatin. Yeast cells were lysed and incubated with buffer containing RNases or with buffer only. Chromatin was isolated and associated protein factors were detected by Western blotting (Materials and methods). We found that RNase treatment strongly decreased the levels of chromatin-associated enzymes Set1, Set2, Dot1, Bur1, Ctk1, and the cyclins Bur2 and Ctk2 (Figure 5). Thus, RNA stabilizes chromatin association of these factors. Two non-enzymatic EFs also depended on RNA for chromatin association, although less strongly. With respect to Paf1C, Rtf1 was partially lost upon RNase treatment, whereas Leo1 and Paf1 were not significantly affected (Figure 5). Spt5 binding to chromatin also depended on RNA, whereas Spt6 was not significantly affected by RNase treatment (Figure 5). These data are generally consistent with our PAR-CLIP results. The discrepancies between RNA-dependent chromatin association and PAR-CLIP results for Spt6 (high PAR-CLIP signal versus RNA-independent chromatin binding) and Dot1 (low PAR-CLIP signal versus strong RNA-dependent chromatin binding) can be explained by additional protein-protein interactions, and by the dependence of PAR-CLIP on the concentration of the RNA-interacting protein in the cell (Chong et al., 2015; Kulak et al., 2014).

Figure 5

Download asset Open asset

Chromatin association of EFs depends on RNA.

Western blot analysis (top) and quantitative densitometry (bottom) of exemplary EFs bound to chromatin before and after treatment with RNase A/T1 mix. H3 was used as loading control. Densitometry data are expressed as mean ± SD from two to three independent experiments. *p<0.05; **p<0.01; ***p<0.001; n.s. = not significant (one-way ANOVA Dunnett post-hoc test).

https://doi.org/10.7554/eLife.25637.011

As a negative control, we subjected TFIIB to the RNase assay. We observed no differences in chromatin binding after RNase treatment (Figure 5), consistent with recruitment of TFIIB to DNA during transcription initiation (Sainsbury et al., 2015). Also as expected, RNase treatment did not affect association of Pol II with chromatin, showing that the observed losses of EFs from chromatin upon RNase treatment were not due to a loss of Pol II (Figure 5). These controls and the above results show that the association of many EFs with chromatin depends on RNA.

Ctk1 kinase complex binds RNA in vitro

The observed RNA-EF crosslinking in vivo and the RNA-dependent chromatin association data strongly suggested that EFs can directly bind RNA. To investigate this in vitro, we prepared one EF complex in recombinant form. We chose the prominent Ser2 kinase complex CTDK-I that comprises Ctk1, Ctk2, and the small subunit Ctk3 (Mühlbacher et al., 2015; Sterner et al., 1995). CTDK-I is the main yeast kinase responsible for phosphorylating the Pol II CTD at Ser2 (Patturajan et al., 1999; Cho et al., 2001), and this is a decisive event in establishing a mature Pol II elongation complex. Further, RNA-dependent chromatin association of Ctk1 and Ctk2 were most unexpected, as for several other EFs RNA interactions were already reported (compare introduction).

We co-expressed recombinant Ctk1, Ctk2, and Ctk3 in insect cells and purified a complete, intact CTDK-I complex (Materials and methods, Figure 6A). We then tested the purified CTDK-I complex for its kinase activity using a purified GST-CTD construct and dephosphorylated full-length S. cerevisiae Pol II (Materials and methods). Both the GST-CTD and the Rpb1 subunit of Pol II were readily phosphorylated by CTDK-I at the Ser2 position in vitro (Figure 6B,C), showing that our purified CTDK-I complex was active.

Figure 6 with 1 supplement see all

Download asset Open asset

Recombinant CTDK-I complex is active and binds RNA in vitro.

(A) The three-subunit CTDK-I complex from *S. cerevisiae* was recombinantly expressed in insect cells and purified to homogeneity. The purified complex was run on a 4–12% gradient sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) and stained with Coomassie blue. (B) Purified human GST-CTD (10 µM) was incubated with 0.4 µM CTDK-I and 3 mM ATP. Time points were taken at 0 (no ATP), 5, 10, 20, 30, 60 and 120 min and CTDK-I activity was determined by western blot analysis using an antibody that recognizes the Ser2 phosphorylated form of the CTD of Pol II. Molecular mass of GST-CTD is ~70 kDa. (C) Purified and dephosporylated Pol II (2 µM) from *S. cerevisiae* was incubated with 0.4 µM CTDK-I and 3 mM ATP. Time points were taken at 0 (no ATP), 2, 4, 6, 10, 20, 30, 60 and 90 min and CTDK-I activity was determined as in (B). Molecular mass of the CTD containing subunit of Pol II, Rpb1, is ~200 kDa. (D) Increasing concentrations (0–5.8 µM) of the complete CTDK-I kinase complex were incubated with 8 nM of a 24% GC (green line; K_d,app(nM) = 210 ± 18) and with a 45% GC (purple line; K_d,app(nM) = 277 ± 21) ssRNA sequences. Binding was determined by relative change in fluorescence anisotropy. Data was fit with a single site binding equation. Error bars reflect the standard deviation from three experimental replicates. (E) Increasing concentrations (0–5.8 µM) of the complete CTDK-I kinase complex were incubated with 8 nM of a U-rich ssRNA (orange line; K_d,app(nM) = 123 ± 10), an A-rich ssRNA (purple line; K_d,app(nM) = 277 ± 21) and a dsDNA (grey line; K_d,app(nM) = 1007 ± 67) sequences. Binding strength, data fitting and standard deviation was determined as in (D).

https://doi.org/10.7554/eLife.25637.012

We then tested the purified CTDK-I complex for RNA binding in vitro. We performed fluorescence anisotropy titration experiments using single-stranded (ss) RNA oligonucleotides with 45% or 24% GC content and bearing a 5′ FAM label (Figure 6D,E). CTDK-I bound both ssRNAs with similar affinities (Figure 6D). We also tested U- or A-rich sequences for association with CTDK-I and found some preference for U-rich RNA (Figure 6E, Figure 6—figure supplement 1). Fitting the data with binding curves by linear regression resulted in apparent K_d’s in the nanomolar range (Figure 6D,E). All experiments were done in the presence of tRNA as competitor, indicating that flexible, single-stranded nucleic acids are preferentially bound. Consistent with this, CTDK-I bound to duplex DNA much more weakly (dsDNA, Figure 6E). These experiments show that the EF complex CTDK-I binds to single-stranded RNA in vitro, consistent with direct EF-RNA interactions in vivo.

Evidence that RNA contributes to EF recruitment

We also measured gene occupancies of EFs using ChIP-Seq and compared them with our PAR-CLIP occupancies (Figure 7). The obtained ChIP-Seq data sets were highly reproducible (Figure 7—figure supplement 1). For comparability with PAR-CLIP data, we collected ChIP-Seq data, although ChIP data are available for single genes or genome-wide using various other techniques or set-ups (Keogh et al., 2003; Kim et al., 2004; Kizer et al., 2005; Krogan et al., 2003b; Liu et al., 2005; Mayer et al., 2010; Ng et al., 2003; Pokholok et al., 2005; Weiner et al., 2015). Metagene analysis of our ChIP-Seq data revealed that EF occupancy increased within 100–600 bp downstream of the TSS, and was generally high in gene bodies (Figure 7, red lines). In contrast, PAR-CLIP results showed that EFs interacted with RNA already from around 20 nt downstream of the capped 5′-end of mRNAs (Figure 7, blue lines). This difference was most pronounced for Set2, which occupies transcripts at the 5′-end but showed peak levels of genome association only in the downstream region, with peak levels 450–300 bp upstream of the pA site. These results are consistent with the idea that RNA contributes to EF recruitment to transcribed genes, and that the contribution of RNA-based recruitment differs for different EFs.

Figure 7 with 1 supplement see all

Download asset Open asset

Comparison of PAR-CLIP and ChIP-Seq occupancy profiles.

Averaged ChIP-Seq (red) and PAR-CLIP (blue) occupancy profiles of EFs and ChIP-Seq of the histone marks H3K4me3, H3K79me3 and H3K36me3 (yellow) centered around TSSs [−150 bp to +600 bp] and pA sites [−600 bp to +150 bp] individually normalized to range between 0 and 1.

https://doi.org/10.7554/eLife.25637.014

Comparison of our histone methyltransferase PAR-CLIP data sets with ChIP-Seq data of the corresponding methylation marks (Figure 7, left, orange lines) provides further support of the model that RNA binding can contribute to EF recruitment to transcribed regions. In the direction of transcription, the PAR-CLIP signals for methyltransferases increased first, followed by an onset of ChIP-Seq signals for the respective histone methylation marks, which in turn preceded the increase in ChIP-Seq signals for the enzymes (Figure 7, left). This sequence of signal onsets is consistent with the model that these EFs are recruited by nascent RNA and then modify histones as Pol II moves downstream.

To test the model of RNA-based recruitment for a particular factor, we investigated whether Set1 gene occupancy depends on the N-terminal region of the protein that contains two RNA recognition motifs (RRMs, residues 247–375 and 376–579) that bind RNA in vitro (Trésaugues et al., 2006). We performed ChIP-qPCR analysis for Set1 in a strain lacking its N-terminal residues 1–579 (ΔRRM-Set1-TAP, Materials and methods) (Figure 8). Additionally, we carried out Set1 ChIP-qPCR in a mutant lacking Paf1 (ΔPaf1 Set1-TAP) because the Paf1 complex was shown to contribute to Set1 recruitment (Krogan et al., 2003a). We compared Set1 gene occupancy levels of both mutant strains (ΔRRM-Set1-TAP and ΔPaf1 Set1-TAP) with the full-length protein occupancy in a Set1-TAP strain. All strains expressed similar levels of Set1 and the Pol II subunit Rpb3 (Figure 8A). We analyzed protein occupancy at different genomic regions of four housekeeping genes (Figure 8B) and at a non-transcribed region of chromosome V. Gene regions within the first 1000 bp downstream of the TSS showed a severe decrease in ΔRRM-Set1 occupancy (Figure 8C; genomic regions 1, 2, 4 and 6). Similarly, we also detected a decrease in Set1 occupancy in the absence of Paf1, confirming the role of Paf1C in Set1 recruitment (Figure 8C). These results indicate that Set1 recruitment not only depends on the Paf1 complex, but also on binding to nascent RNA. Taken together, several lines of evidence presented here strongly suggest that interactions of EFs with nascent RNA contribute to EF recruitment to actively transcribed genes in vivo.

Figure 8

Download asset Open asset

Deletion of Set1 RRMs impairs its recruitment to genes.

(A) Western blot analysis of Set1-TAP (top) and Rpb3 (bottom) in a Set1-TAP strain (left), a strain lacking the first 579 amino acids of Set1 (ΔRRM-Set1-TAP; middle) and a ΔPaf1 Set1-TAP strain (right); bands are shown for biological duplicates of yeast cell cultures before formaldehyde crosslinking. Set1 was detected using an antibody directed against its C-terminal TAP tag. As a control, Pol II was detected using an antibody against the Rpb3 subunit in all three strains. (B) Schematic localization of gene regions analyzed via ChIP-qPCR. Set1 recruitment was monitored at one gene region of *ADH1* (1) and two different gene regions of *ILV5* (2 and 3), *PDC1* (4 and 5) and *PMA1* (6 and 7). (C) ChIP analysis reveals that Set1 occupancy is reduced in ΔPaf1 cells (ΔPaf1 Set1-TAP) as well as in a truncated version of Set1 that lacks its RRM domains (ΔRRM-Set1-TAP). ChIP data are expressed as mean ± SD from two independent experiments. *p<0.05; **p<0.01 (two sample t-test).

https://doi.org/10.7554/eLife.25637.016

Discussion

Here we present a large set of system-wide occupancy data for yeast transcription elongation factors on RNA (PAR-CLIP) and DNA (ChIP-Seq), and complementary biochemical data. The remarkable finding from our work is that many EFs interact with nascent RNA in vivo. Additional in vitro results support these findings and indicate that RNA can contribute to EF recruitment and the stability of the transcription elongation complex. For Set1 we further demonstrate that the two RNA recognition motifs are required for Set1 recruitment to genes in vivo. These results extend our understanding of how the transcription elongation complex is assembled and maintained on active genes. The emerging view from our data is that nascent RNA contributes to EF recruitment and elongation complex stability to different extents for different EFs. We note that our results do not reveal whether all EFs studied here are initially recruited by RNA, and which EFs establish RNA interactions only after they have been recruited by alternative interactions, although EF binding in the very 5′-region of transcripts argues for a RNA-based recruitment model.

Our results also elucidate the long-standing question how the yeast CTD Ser2 kinases Ctk1 and Bur1, which are essential for transcription elongation, are recruited to transcribing Pol II. The Pol II Ser2 kinases give rise to strong PAR-CLIP signals and their chromatin association is strongly dependent on RNA. In addition, we show that purified CTDK-I complex strongly binds to RNA in vitro. This all indicates that nascent RNA plays an important role in recruiting Ser2 kinases to transcribing Pol II. Binding of the Ser2 kinases near the RNA 5′-end is consistent with stabilization of these kinases on the elongation complex by the cap-binding complex (Hossain et al., 2013; Lidschreiber et al., 2013). A model of kinase recruitment by capped RNA predicts that these enzymes are lost from the transcribing enzyme upon RNA cleavage at the pA site, and this is indeed observed by ChIP-Seq. In conclusion, RNA-based recruitment of Ser2 kinases explains why Ser2 phosphorylation of the CTD is restricted to transcribing polymerases, whereas free or initiating polymerases are not phosphorylated at Ser2 residues.

How can some EFs bind both RNA and Pol II? EFs are generally modular and contain multiple domains that can be involved in RNA or protein interactions. However, the same domain can mediate both RNA and protein interactions, as documented for the RNA export factor Yra1, which contains a RNA recognition motif (RRM) domain that binds both RNA and the phosphorylated CTD (MacKellar and Greenleaf, 2011). Set1 contains two adjacent RRM domains (Trésaugues et al., 2006), and Set2 contains a SRI domain that binds the phosphorylated CTD (Dengl et al., 2009; Sun et al., 2010; Yoh et al., 2007; MacKellar and Greenleaf, 2011), but may also bind RNA. The three Paf1C subunits that bind RNA in vivo, namely Cdc73, Ctr9 and Rtf1, also interact with the phosphorylated CTD and the phosphorylated C-terminal region (CTR) of Spt5 in vitro (Qiu et al., 2012). Rtf1 contains a positively charged Plus-3 domain (Finn et al., 2014) that binds the phosphorylated CTR (Wier et al., 2013) and single-stranded nucleic acids (de Jong et al., 2008). We predict that many EFs contain domains that can interact with RNA or with the phosphorylated CTD or CTR, which resemble RNA in its flexible nature and negative charge. Whereas for some EFs binding to RNA or the CTD may be mutually exclusive, others can bind both Pol II and RNA at the same time, for example Spt5. Due to a lack of solubility of individually expressed EF subunits, and the difficulty of preparing EF complexes in recombinant and pure form in large quantities, we had to limit our in vitro RNA-binding analysis to CTDK-I.

Finally, we predict that RNA-based recruitment of EFs provides a missing link in our understanding of how the transcription cycle is coordinated. When the initiation complex assembles at the promoter, TFIIH phosphorylates Ser5 residues in the CTD and this enables recruitment of the capping enzyme (Cho et al., 1997; Fabrega et al., 2003; Rodriguez et al., 2000; Schroeder et al., 2000; Schwer and Shuman, 2011). The nascent RNA then receives a 5′-cap (Martinez-Rucobo et al., 2015), and capped RNA could then help to recruit EFs. The requirement for a cap on RNA befits the observation that Ser5 phosphorylation is needed for high gene occupancy with some EFs (Qiu et al., 2012, 2009, 2006; Ng et al., 2003). RNA-based recruitment of the major Ser2 kinase, Ctk1, would then lead to CTD phosphorylation on Ser2 residues and stable binding of other EFs. Eventually, transcription of a pA site triggers RNA cleavage, and this would facilitate loss of many RNA-bound EFs and render the polymerase prone to transcription termination. Thus, the transcribing Pol II complex may be viewed as a self-organizing system that is encoded in the DNA, but only realized on the level of RNA, which plays crucial roles in complex assembly and disassembly.

Share this article

Cite this article

Many elongation factors (EFs) bind RNA in vivo.

Normalization of PAR-CLIP data shown for two representative EFs, Ctk2 (top) and Spt5 (bottom).

mRNA-binding profiles of EFs.

Asymmetric distribution of EFs at coding and non-coding transcripts.

Chromatin association of EFs depends on RNA.

Recombinant CTDK-I complex is active and binds RNA in vitro.

Comparison of PAR-CLIP and ChIP-Seq occupancy profiles.

Deletion of Set1 RRMs impairs its recruitment to genes.

Author details

Sofia Battaglia

Contribution

Contributed equally with

Competing interests

Michael Lidschreiber

Contribution

Contributed equally with

Competing interests

Carlo Baejen

Contribution

Competing interests

Phillipp Torkler

Contribution

Competing interests

Seychelle M Vos

Contribution

Competing interests

Patrick Cramer

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism