Methylation of RNA polymerase II non-consensus Lysine residues marks early transcription in mammalian cells

  1. João D Dias
  2. Tiago Rito
  3. Elena Torlai Triglia
  4. Alexander Kukalev
  5. Carmelo Ferrai
  6. Mita Chotalia
  7. Emily Brookes
  8. Hiroshi Kimura  Is a corresponding author
  9. Ana Pombo  Is a corresponding author
  1. Max-Delbrück Centre for Molecular Medicine, Germany
  2. Imperial College London, United Kingdom
  3. University of Porto, Portugal
  4. Tokyo Institute of Technology, Japan
10 figures and 4 tables

Figures

Structure and evolutionary conservation of the C-terminal domain of RPB1.

(a) Mouse RPB1 CTD is composed of 52 heptapeptide repeats with consensus amino-acid sequence YSPTSPS, which is represented 21 times at the most proximal CTD region. Non-consensus amino acids are …

https://doi.org/10.7554/eLife.11215.003
Figure 2 with 1 supplement
Mutation of CTD-K7 to -S7 residues does not interfere with RPB1 stability, phosphorylation or subcellular localization.

(a) Outline of strategy used to generate mouse cell lines bearing K7-to-S7 mutations and study CTD-K7 methylation. Red bars represent CTD repeats with K7 residues. The nomenclature of the cell lines …

https://doi.org/10.7554/eLife.11215.004
Figure 2—figure supplement 1
Complete western blots used in Figure 2a.

Sections shown in Figure 2a are highlighted with dashed boxes. High and low exposure blots are shown for YFP, S5p, and S2p. For total RPB1, YFP, S5p and S2p, two different cell lines (a and b) …

https://doi.org/10.7554/eLife.11215.005
Figure 3 with 1 supplement
RPB1 is mono- and di-methylated at CTD-K7 residues.

(a) Amino-acid sequence of CTD-K7-methyl peptides used for immunization, designed based on the sequence of mouse CTD repeats 35 and 36. (b) Schematic representation of strategy used for production …

https://doi.org/10.7554/eLife.11215.006
Figure 3—figure supplement 1
Complete western blots used in Figure 3a, 3e and 3f, and detection of CTD methylation in mouse ES cells.

(a) Complete western blot used in Figure 3d. Sections shown in Figure 3d are highlighted with dashed boxes. In the case of 8WG16, two different cell lines (a and b) that stably express each of the …

https://doi.org/10.7554/eLife.11215.007
Figure 4 with 1 supplement
Interplay between K7me1 and K7me2 with RPB1 phosphorylation.

(a) CTD K7me1 and K7me2 mark hypophosphorylated and intermediately phosphorylated Rpb1 forms, but not the hyperphosphorylated (II0) form. Western blotting using the indicated antibodies was …

https://doi.org/10.7554/eLife.11215.008
Figure 4—figure supplement 1
Complete western blots used in Figure 4a and 4b.

(a) Complete western blot used in Figure 4a. Used sections are highlighted with dashed boxes and low and higher exposure blot versions are displayed. Sections of the membrane treated with alkaline …

https://doi.org/10.7554/eLife.11215.009
Figure 5 with 2 supplements
K7me1 and K7me2 mark promoters of expressed genes.

(a) K7me1 and K7me2 are enriched at promoters of active genes. ChIP-seq profiles for K7me1, K7me2, K7ac, 8WG16, S5p, S7p and S2p, and mRNA-seq profiles are represented for the inactive gene Myf5, …

https://doi.org/10.7554/eLife.11215.010
Figure 5—figure supplement 1
CTD-K7 mono- and di-methylation are enriched at promoters of active genes.

(a) Abundance and distribution of K7me1, K7me2, and S5p was assessed by ChIP followed by qPCR at promoters and coding regions of active (β-Actin, Oct4, Nanog and Polr2a) and inactive (Gata1 and Myf5)…

https://doi.org/10.7554/eLife.11215.011
Figure 5—figure supplement 2
RNAPII CTD is mono- and di-methylated exclusively at active genes and not at Polycomb repressed genes.

(a) Percentage of genes positive for K7me1, K7me2 and K7ac in three different classes of genes: active, polycomb repressed (PRCr) and inactive. Active genes are positive for S5p, S7p, S2p and 8WG16, …

https://doi.org/10.7554/eLife.11215.012
Figure 6 with 2 supplements
Exploring the relationship between different CTD modifications using correlation and linear regression analyses.

(a) Matrix of Spearman’s correlation coefficients between the levels of K7me1, K7me2, K7ac, 8WG16, S5p, S7p, S2p (2 kb window after TES), mRNA and mock ChIP control ordered according to increasing …

https://doi.org/10.7554/eLife.11215.013
Figure 6—figure supplement 1
Correlations between different RNAPII CTD modifications and histone marks for all active genes.

(a) Matrix of Spearman’s correlation coefficients between the levels of several histone and CTD modifications, mRNA, CpG and mock control. The correlations present in Figure 6a were expanded here to …

https://doi.org/10.7554/eLife.11215.014
Figure 6—figure supplement 2
CTD K7me1 and K7me2 negatively contribute for the prediction of S2p levels.

Typical LASSO regression models to predict S2p levels select K7me1 or K7me2 as negative predictor variables at the optimum penalty (blue line, top graph). LASSO regression employs the L1 norm to …

https://doi.org/10.7554/eLife.11215.015
Figure 7 with 4 supplements
CTD-K7 methylation and acetylation have different distributions at the promoters of active genes and their levels are associated with gene expression.

(a) K7me2/K7ac ratio correlates negatively with mRNA expression. (b) Distribution of K7me2/K7ac ratios, and their division into three quantiles: low (green), medium (yellow) and high (blue) ratios. …

https://doi.org/10.7554/eLife.11215.016
Figure 7—figure supplement 1
CTD methylation / acetylation ratio negatively correlates with S2p and RNA production.

(a) CTD K7 methylation / acetylation ratio negatively correlates with S2p (TES) and mRNA. Matrix of Spearman’s correlation coefficients between ratios K7me1/K7ac and K7me2/K7ac and K7me1, K7me2, …

https://doi.org/10.7554/eLife.11215.017
Figure 7—figure supplement 2
Distribution of different RPB1 modifications around the promoter of active genes.

(a) Heatmaps of ChIP-seq density using a ± 500 bp window centered around the TSS (unless stated otherwise) of active genes defined as positive for S5p, S7p and S2p, as negative for H3K27me3 and …

https://doi.org/10.7554/eLife.11215.018
Figure 7—figure supplement 3
Genes with higher K7me2/K7ac ratio are up-regulated after TSA treatment.

(a) Range of K7me2/K7ac ratios for genes used in expression analysis after Trichostatin A (TSA) treatment. (b) Total RNA levels were analysed by quantitative RT-PCR after treatment of ES cells with …

https://doi.org/10.7554/eLife.11215.019
Figure 7—figure supplement 4
Complete western blots used in Figure 7g.

Sections shown in Figure 7g are highlighted with dashed boxes. High and low exposure blots are shown for K7me2. In the H3K9ac western blot, the sections shown with increasing exposure time …

https://doi.org/10.7554/eLife.11215.020
Author response image 1
P300 inhibition promotes a small increase in global levels of CTD-K7me2.

Mouse ES cells were treated with P300 inhibitor C646 (30 μM, 3h), before western blotting with antibodies specific for total RPB1, K7me1, K7me2 and S7p. Hypo- (IIa) and hyperphosphorylated (II0) …

https://doi.org/10.7554/eLife.11215.025
Author response image 2
Down-regulation of gene expression after P300 inhibition.

(A) Range of K7me2/K7ac ratios for genes used in expression analysis after C646 treatment. (B) Total RNA levels were measured by quantitative RT-PCR after treatment of ES cells with C646 (30 μM, 3h) …

https://doi.org/10.7554/eLife.11215.026
Author response image 3
Screen of CTD-K7 methylase activity in conditional KO mouse ES cells for H3K4 methyltransferases.

(A) Western blotting was performed using protein extracts from mouse ES cells with conditional knockout for H3K4 methylase Setd1a or Setd1b for total RPB1 and K7me1 in 2 biological replicates. Hypo- …

https://doi.org/10.7554/eLife.11215.027

Tables

Table 1

RPB1 CTD peptides. CTD peptides with unmodified, mono-, di-, tri-methyl and acetyl K7 residues were used in ELISA assays to characterize the specificity of the CTD methyl antibodies produced in this …

https://doi.org/10.7554/eLife.11215.021
Peptide sequenceModification
SYSPTSPKYTPTSPSCUnmodified
SYSPTSPKme1YTPTSPSCK7 monomethyl
SYSPTSPKme2YTPTSPSCK7 dimethyl
SYSPTSPKme3YTPTSPSCK7 trimethyl
SYSPTSPKacYTPTSPSCK7 acetyl
  1. CTD, C-terminal domain; ELISA, enzyme-linked immunosorbent assay.

Table 2

List of Antibodies used in this study. Full description of the antibodies and the amounts or concentrations used in this study for WB, ChIP or IF.

https://doi.org/10.7554/eLife.11215.022
Amount/dilution
AntibodyRaised in
(isotype)
CloneStockWBChIPIFSource
S5pMouse (IgG)CTD4H8 (MMS-128P)1 mg/ml1/200,00010 µl (10 µg)1/3000Covance
S7pRat (IgG)4E12-1/10--Kind gift from
Dirk Eick
S2pMouse (IgM)H5
(MMS-129R)
1–3 mg/ml1/500--Covance
Unphospho-S2Mouse (IgG)8WG16
(MMS-126R)
1–3 mg/ml1/200--Covance
N-terminus
(Total RPB1)
Rabbit (IgG)H224
(sc-9001x)
200 µg/ml1/200--Santa Cruz Biotechnology
K7me1Mouse (IgG)CMA61110 mg/ml1/10005 µl
(50 µg)
1/200This study
K7me2Mouse (IgG)CMA61210 mg/ml1/10005 µl
(50 µg)
1/200This study
H3K9acRabbit serum39585-1/1000--Active Motif
Lamin BGoat (IgG)C-20 Sc-6216200 µg/ml1/500-Santa Cruz
Biotechnology
α-tubulinMouse (IgG)T60742 mg/ml1/10,000--Sigma
GFPRabbit (IgG)A111222 mg/ml1/1000--Life Technologies
DigoxigeninMouse (IgG)200–002-1561.2 mg/ml-10 µl (12 µg)-Jackson ImmunoResearch
  1. ChIP, chromatin immunoprecipitation; IF, immunofluorescence; WB, western blotting.

Table 3

List of PCR primers used in this study. Primer sequences (F, forward; R, reverse) are represented (5´to 3´orientation) for promoter and coding regions of active and inactive genes. Primers designed …

https://doi.org/10.7554/eLife.11215.023
GenePrimer sequenceGenePrimer sequence
ChIP primers
β-actin (promoter) F

GCAGGCCTAGTAACCGAGACA
Gene expression primers
β-actin 5´F

CCACCCGCGAGCACA
β-actin (promoter) RAGTTTTGGCGATGGGTGCTβ-actin 5´RCCGGCGTCCCTGCTTAC
β-actin (coding) FTCCTGGCCTCACTGTCCACβ-actin Exon1 FTCTTTGCAGCTCCTTCGTTG
β-actin (coding) RGTCCGCCTAGAAGCACTTGCβ-actin Exon2 RACGATGGAGGGGAATACAGC
Oct4 (promoter) FGGCTCTCCAGAGGATGGCTGAGOct4 5´FTGAGCCGTCTTTCCACCA
Oct4 (promoter) RTCGGATGCCCCATCGCAOct4 5´RTGAGCCTGGTCCGATTCC
Oct4 (coding) FCCTGCAGAAGGAGCTAGAACASox2 5´FAGGGCTGGGAGAAAGAAGAG
Oct4 (coding) RTGTGGAGAAGCAGCTCCTAAGSox2 5´RATCTGGCGGAGAATAGTTGG
Myf5 (promoter) FGGAGATCCGTGCGTTAAGAATCCSerpine1 5´FCCCCGAGAGCTTTGTGAAG
Myf5 (promoter) RCGGTAGCAAGACATTAAAGTTCCGTASerpine1 5´RAAGGTGCCTTGTGATTGGCT
Myf5 (coding) FGATTGCTTGTCCAGCATTGTDusp1 5´FCGGTGAAGCCAGATTAGGAG
Myf5 (coding) RAGTGATCATCGGGAGAGAGTTDusp1 5´RAGGAGCGACAATCCAACAAC
Gata1 (promoter) FAGAGGAGGGAGAAGGTGAGTGCtgf 5´FGACTCAGCCAGATCCACTCC
Gata1 (promoter) RAGCCACCTTAGTGGTATGACGCtgf 5´RGTGCAGAGGCGACGAGAG
Gata1 (coding) FTGGATTTTCCTGGTCTAGGG
Gata1 (coding) RGTAGGCCTCAGCTTCTCTGTAGTA
Polr2a (-1 kb) FCCGTAAAGCTATTAGAGCACAGG
Polr2a (-1 kb) RATGCATAAGGCAGGCAAGAT
Polr2a (-0.5 kb) FGTAACCTCTGCCGTTCAGGA
Polr2a (-0.5 kb) RTTTCTCCCTTTCCGGAGATT
Polr2a (E1) FCAGGCTTTTTGTAGCGAGGT
Polr2a (E1) RGACTCAGGACTCCGAACTGC
Polr2a (I1) FCAGAGGGCTCTTTGAATTGG
Polr2a (I1) RGCATCAGATCCCCTTCATGT
Polr2a (I2/E3) FGCCCTCTTCTGGAGTGTCTG
Polr2a (I2/E3) RAGGAAGCCCACATGAAACAC
Polr2a (E19/I19) FCCAAGTTCAACCAAGCCATT
Polr2a (E19/I19) RTCTTAACCGCTGAGCCATCT
Polr2a (E28) FTCTCCCACTTCTCCTGGCTA
Polr2a (E28) RCCGAGGTTGTCTGACCCTAA
Polr2a (+2 kb TES) FGAGGGGCAGACACTACCAAA
Polr2a (+2 kb TES) RAAAAGGCCAAAGGCAAAGAT
Table 4

Description of ChIP-seq and messenger RNA datasets used in this study. Full description of the ChIP-seq datasets produced or re-analysed in this study. NCBI Gene Expression Omnibus (GEO) Sample …

https://doi.org/10.7554/eLife.11215.024
ChIP-seq datasetDataset
origin
Antibody
clone
Mapped reads
(millions)
ES cell
line
RPB1-K7me1 (GSM1874007)This studyCMA611
(this study)
64ESC OS25
RPB1-K7me2 (GSM1874008)This studyCMA612
(this study)
69ESC OS25
RPB1-K7ac (SRR1028808)Schröder et al. (2013) AcRPB1 (Schröder et al., 2013)85ESC
Input (SRR1028807)Schröder et al. (2013) -21ESC
RPB1-S5p (GSM850467)Brookes et al. (2012) CTD4H8 (MMS-128P, Covance)22ESC OS25
RPB1-S7p (GSM850468)Brookes et al. (2012) 4E12
(Chapman et al., 2007)
11ESC OS25
RPB1-S2p (GSM850470)Brookes et al. (2012) H5 (MMS-129R, Covance)33ESC OS25
Unphospho-S2 (8WG16) (GSM850469)Brookes et al. (2012) 8WG16
(MMS-126R, Covance)
24ESC OS25
Mock IP (GSM850473)Brookes et al. (2012) -12ESC OS25
H3K4me3 (GSM307618)Mikkelsen et al. (2007) ab8580
(Abcam)
9ESC V6.5
H3K36me3 (GSM850472)Brookes et al. (2012) 13C9 (Rechtsteiner et al., 2010)23ESC OS25
H2Aub1 (GSM850471)Brookes et al. (2012) E6C5
(Upstate)
18ESC OS25
H3K27me3 (GSM307619)Mikkelsen et al. (2007) 07–449
(Upstate)
8ESC V6.5
RNA datasets Dataset
origin

Mapped reads (millions)

Cell
line
mRNA-seq
(GSM850476)
Brookes et al. (2012) 74ESC OS25
GRO-seq
(GSE48895)
Jonkers et al. (2014) 25ESC V6.5
(“untreated”)
  1. ChIP-seq, chromatin immunoprecipitation with sequencing; ES, embryonic stem; GRO-seq, global run-on sequencing.

Download links