Exploration of CTCF post-translation modifications uncovers Serine-224 phosphorylation by PLK1 at pericentric regions during the G2/M transition

Version of Record: February 5, 2019
Version of Record: February 4, 2019
Accepted Manuscript: January 28, 2019
Accepted Manuscript: January 24, 2019

Download
Cite
Share
CommentOpen annotations (there are currently 0 annotations on this page).

Altmetric provides a collated score for online attention across various platforms and media.
See more details

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

The zinc finger CCCTC-binding protein (CTCF) carries out many functions in the cell. Although previous studies sought to explain CTCF multivalency based on sequence composition of binding sites, few examined how CTCF post-translational modification (PTM) could contribute to function. Here, we performed CTCF mass spectrometry, identified a novel phosphorylation site at Serine 224 (Ser²²⁴-P), and demonstrate that phosphorylation is carried out by Polo-like kinase 1 (PLK1). CTCF Ser²²⁴-P is chromatin-associated, mapping to at least a subset of known CTCF sites. CTCF Ser²²⁴-P accumulates during the G2/M transition of the cell cycle and is enriched at pericentric regions. The phospho-obviation mutant, S224A, appeared normal. However, the phospho-mimic mutant, S224E, is detrimental to mouse embryonic stem cell colonies. While ploidy and chromatin architecture appear unaffected, S224E mutants differentially express hundreds of genes, including p53 and p21. We have thus identified a new CTCF PTM and provided evidence of biological function.

https://doi.org/10.7554/eLife.42341.001

Introduction

CCCTC-binding protein (CTCF) has been studied in many capacities since its discovery over twenty years ago. Originally identified as a candidate transcription regulator of c-myc, this multi zinc finger protein is highly conserved amongst Nephrozoa (Heger et al., 2012; Lobanenkov et al., 1990). Subsequent studies further revealed that CTCF has insulatory activity, specifically in blocking enhancer-promoter interactions, and in separating transcriptionally active genomic regions from heterochromatic domains (Merkenschlager and Odom, 2013; Ong and Corces, 2014). Later genomic studies revealed tens of thousands of CTCF binding sites in mammalian genomes (Kim et al., 2007; Rhee and Pugh, 2011; Nakahashi et al., 2013; Schmidt et al., 2012; Xie et al., 2007), with >5000 of these being conserved (Schmidt et al., 2012). In addition to CTCF’s role in blocking enhancer-promoter loops, recent chromatin conformation capture (3C) based assays revealed that CTCF paradoxically also plays an architectural role in shaping the genome as well, helping to mediate three-dimensional chromatin loops in some cases. In addition, 3C-based assays also revealed that CTCF binds to the borders of many Topologically Associating Domains (TADs), megabase-sized regions which function to insulate chromatin interactions such as promoter-enhancer loops (Ong and Corces, 2014; Dixon et al., 2012; Nora et al., 2012). Disruption of TADs, for example through deletion of CTCF sites, can lead to genetic disease through ectopic enhancer-mediated upregulation of genes (Lupiáñez et al., 2015; Rao et al., 2014; Sanborn et al., 2015; Nora et al., 2017).

How can a single protein carry out so many different functions in the cell? In addition, how does the cell specify which function CTCF carries out at a given binding site, especially when some of these functions, for example mediating versus blocking chromatin loops, directly contradict each other? Numerous studies have attempted to explain the multivalency of CTCF on the basis of factors such as motif composition of binding site, chromatin state, DNA methylation, and CTCF’s role in mediating three-dimensional chromatin interactions (Lu et al., 2016). For example, it has been proposed that CTCF’s insulatory activity can be explained in context of its mediation of three-dimensional chromatin loops.

In addition to these models, which predict the function of CTCF based on the chromatin state around a given binding site, other studies have sought to define CTCF function based on post-translation modifications (PTMs) of the protein itself. Poly(ADP-ribosyl)ation of CTCF has been found to play roles in imprinting and nucleolar transcription (Torrano et al., 2006; Yu et al., 2004) while SUMOylation of CTCF appears to enhance its repressive function (MacPherson et al., 2009). Finally, phosphorylation of CTCF has been proposed to turn CTCF into a transcription activator (El-Kady and Klenova, 2005) or reduce its DNA-binding activity (Sekiya et al., 2017), depending on site. These prior studies demonstrate that CTCF PTMs, possibly in combination, could influence the myriad of CTCF functions. The extent to which CTCF is post-translationally modified is currently no known. Many prior PTMs were identified indirectly by candidate-based approaches. Several proteomic screens have been carried out using mass spectrometry (Rigbolt et al., 2011; Olsen et al., 2010; Kettenbach et al., 2011; Dephoure et al., 2008), though phosphorylation sites were called by confidence scores made by probabilistic scoring algorithms without further validation or characterization. Here, we endeavored to screen for new CTCF PTMs by taking an unbiased approach. We identify a novel phosphorylation site at Serine 224 (Ser²²⁴-P), perform an extensive characterization of its cellular functions, and uncover an effect on growth of embryonic stem cell colonies and gene regulation.

Results

Murine CTCF is phosphorylated at a highly conserved position, Ser²²⁴

To explore the possibility that PTMs regulate CTCF, we took an unbiased approach to both confirm known and identify novel PTMs, specifically S/T/Y phosphorylation using immunoprecipitation and mass spectrometry (Sekiya et al., 2017; Rigbolt et al., 2011; Olsen et al., 2010; Kettenbach et al., 2011; Dephoure et al., 2008; Klenova et al., 2001). We utilized a doxycycline-inducible system to express murine CTCF-3xFLAG (Sun et al., 2013). This allowed us to purify CTCF from cells using a FLAG epitope rather than a CTCF antibody, as the latter could select against PTMs present within the antigenic sequence. The CTCF-3xFLAG transgene was stably introduced into female immortalized rtTA MEFs and doxycycline induction was verified by GFP microscopy (Figure 1A) (Jeon and Lee, 2011). Anti-FLAG immunofluorescence confirmed that CTCF-3xFLAG co-stained the nucleus in the same pattern as total CTCF (Figure 1B). Exogenous expression of CTCF-3xFLAG in this system only occurs at a modest level, which we confirmed by western blot (Figure 1C) (Sun et al., 2013). Previously, we also found that even slight perturbation of CTCF levels affects differentiation of female ES cells (Sun et al., 2013). However, induction of the transgene was well-tolerated in SV40T immortalized MEFs as they were viable and stably expressing CTCF-3xFLAG even after 9 days (Figure 1C). We thus concluded that exogenously expressed CTCF-3xFLAG in immortalized MEFs could be a tractable system for analyzing CTCF PTMs.

Figure 1 with 1 supplement see all

Download asset Open asset

Murine CTCF is phosphorylated at Ser²²⁴.

(A) GFP microscopy of co-inducible CTCF-3xFLAG, EGFP MEFs ± 1 μg/mL doxycycline for 48 hr. Bar, 25 μm. (B) FLAG and CTCF immunofluorescence accompanying (A). Bar, 10 μm. (C) FLAG, CTCF, and β-actin western blot of whole cell extracts from inducible CTCF-3XFLAG MEFs treated up to 9 days with 1 μg/mL doxycycline. (D) Coomassie stained gel of anti-FLAG immunoprecipitate from CTCF-3xFLAG MEFs treated 72hrs ± 1μg/mL doxycycline. Red arrow, band analyzed by mass spectrometry. (E) Manually validated mass spectra of CTCF peptide Tyr²¹⁴-Lys²⁴⁴ with y and b ions identified. Phosphorylation event at Ser²²⁴ indicated by red s. (F) ClustalX alignment of CTCF sequences from the indicated species. Shown is a 25 amino acid window centered on mouse Ser²²⁴ (red arrow).

https://doi.org/10.7554/eLife.42341.002

We expanded the CTCF-3xFLAG MEFs with or without induction for 3 days and nuclei were isolated. CTCF is tightly associated with chromatin through the central zinc fingers binding its core DNA motif (Nakahashi et al., 2013; Yusufzai and Felsenfeld, 2004). However, homogenizing nuclei under mild conditions with nucleases was not preferable as endogenous phosphatase activity would have been present. Thus we generated nuclear extracts under stringent conditions. Following dialysis of the nuclear extracts, anti-FLAG immunoprecipitation was performed and eluted material was resolved by SDS-PAGE and Coomassie Blue stained. A band corresponding to CTCF-3xFLAG was excised for mass spectrometry analysis (arrow, Figure 1D). Following trypsin digestion and extraction, phosphorylated peptides were enriched using immobilized metal affinity chromatography (IMAC) with Fe-NTA resin prior to LC MS/MS. A unique murine site of serine phosphorylation was identified at Ser²²⁴, which is ~50 aa N-terminal of the CTCF zinc finger domain (Figure 1E). Notably, in a phosphoproteome screen of HeLa cells, a single peptide containing human CTCF Ser²²⁴-P was identified (Kettenbach et al., 2011). However, the significance of this PTM in human cells was neither validated nor explored. While it is also reported that CTCF is phosphorylated at Ser⁶⁰⁴, Ser⁶⁰⁹, Ser⁶¹⁰ and Ser⁶¹² (El-Kady and Klenova, 2005; Rigbolt et al., 2011; Olsen et al., 2010; Dephoure et al., 2008; Klenova et al., 2001); we were unable to identify peptides containing these residues. As these phospho-sites lie in close proximity in a region of CTCF poorly cut by trypsin and chymotrypsin, it is possible that resolution of these phospho-sites was infeasible with our approach. Previous mass spectrometry studies which identified these residues were also global studies that did not provide spectra or additional validation of sites, making direct comparison with our study difficult. Likewise, we were also unable to confirm if the linker sequences between the zinc fingers were phosphorylated as previously reported (Sekiya et al., 2017).

Interestingly, CTCF orthologs have only been found in Nephrozoa (Heger et al., 2012). To explore the potential evolutionary significance of CTCF Ser²²⁴-P, we compared CTCF amino acid sequences from several Nephrozoa species with Clustal (Figure 1F). The CTCF amino acid sequences from mouse, rat, human, chimpanzee, opossum, chicken, frog, zebrafish, sea urchin, fruit fly, and water flea were aligned. A 25 amino acid window centered on mouse Ser²²⁴ remarkably revealed striking conservation proximal to this position amongst vertebrata, but not echinodermata or arthropoda (arrow, Figure 1F). This conservation suggests that Ser²²⁴ phosphorylation is potentially catalyzed by a conserved vertebrate kinase. Ergo, this PTM may contribute distinctly to the regulatory complexity of vertebrate genomes relative to other deuterostomes or protostomes.

CTCF Ser²²⁴ can be phosphorylated by CK2

Discovery of a novel phosphorylation site at Ser²²⁴ raised the question as to which kinase could modify this position. It was previously reported that casein kinase 2 (CK2) could phosphorylate CTCF in vitro at Ser⁶⁰⁴, Ser⁶⁰⁹, Ser⁶¹⁰ and Ser⁶¹² (Klenova et al., 2001). Interestingly, the highly conserved amino acid sequence surrounding Ser²²⁴ fits the known CK2 recognition site (S-x-x-D/E) (Figure 1F) (Meggio and Pinna, 2003). Thus, we decided to test if CK2 also modifies Ser²²⁴. To do this, we first generated recombinant CTCF and in vitro phosphorylated it with CK2 and [γ-³²P]-ATP (Sun et al., 2013). Following SDS-PAGE, autoradiography revealed that recombinant CTCF can be phosphorylated by CK2 (Figure 1—figure supplement 1A). To determine if Ser²²⁴ specifically is phosphorylated by CK2, we performed that assay with non-radioactive ATP and excised a Coomassie stained band corresponding to full length CTCF (red box, Figure 1—figure supplement 1B). Mass spectrometry analysis of the band, revealed that CKII phosphorylated CTCF Ser²²⁴ (Figure 1—figure supplement 1C). As with our analysis of immunoprecipitated CTCF from MEFs, we did not observe phosphorylation at Ser⁶⁰⁴, Ser⁶⁰⁹, Ser⁶¹⁰ or Ser⁶¹². However, again, technical hurdles (i.e., the close proximity of these sites in a region poorly cut by trypsin and chymotrypsin) may preclude their identification by mass spectrometry. Furthermore, while we discovered that CK2 could phosphorylate Ser²²⁴in vitro, it remained to be seen if this occurs in vivo. Finally, it is also a distinct possibility that CTCF Ser²²⁴ could be recognized by additional kinases pertinent to specific functions.

Generation of a CTCF Ser²²⁴-P antibody

To explore the significance of this PTM in cells, we generated an antibody to CTCF Ser²²⁴-P in collaboration with Cell Signaling Technology. Following screening of prospective affinity purified antibodies, we further tested if the candidate CTCF Ser²²⁴-P antibody recognized a phosphorylated protein by western blot. We generated cell extract from immortalized rtTA MEFs and treated the extract with Lambda Protein Phosphatase. western blot revealed that the CTCF Ser²²⁴-P antibody had reduced affinity for the phosphatase treated samples (Figure 2A). The CTCF Ser²²⁴-P antibody also recognizes a smaller (~125 kDa) band that is not detected by a CTCF monoclonal antibody (Figure 2A). Notably CTCF migrates at a higher molecular weight (150 kDa) than predicted (84 kDa) in SDS-PAGE gels. One possibility is that CTCF Ser²²⁴-P occurs on two different CTCF isoforms in vivo, one of which is not recognized by the monoclonal antibody.

Figure 2 with 1 supplement see all

Download asset Open asset

CTCF Ser224-P is conserved and accumulates at G2/M.

(A) CTCF and CTCF Ser224-P western blots of MEF lysates treated with Lambda Protein Phosphatase for the indicated times. (B) CTCF and CTCF Ser224-P western blots of recombinant CTCF, CTCF S224A, and CTCF S224E ± Casein Kinase II in vitro phosphorylation. (C) Immunofluorescence performed on asynchronous MEFs with the indicated antibodies. Nuclei counterstained with DAPI. Bar, 50 μm. White arrows, prominent CTCF Ser224-P cells. (D) Immunofluorescence performed on asynchronous HEK293 with the indicated antibodies. Nuclei counterstained with DAPI. Bar, 20 μm. White arrows, prominent CTCF Ser224-P cells. (E) Immunofluorescence performed on asynchronous MEFs with the indicated antibodies. Nuclei counterstained with DAPI. Bar, 50 μm. % of n cells labeled with CTCF or CTCF Ser224-P antibodies indicated. White arrows, early G2. Yellow arrows, late G2.

https://doi.org/10.7554/eLife.42341.004

However, to further confirm that the affinity purified antibody preferred CTCF Ser²²⁴-P, we phosphorylated bacterially expressed recombinant CTCF and tested the antibody by western blot. We phosphorylated wild type CTCF, S224A, and S224E with CK2. While the western blot revealed some affinity of the CTCF Ser²²⁴-P antibody for unphosphorylated CTCF; the antibody preferred the CK2 phosphorylated protein (2^nd column, Figure 2B). Moreover, the antibody is likely reactive to Ser²²⁴-P rather than other potential CK2 phosphorylation sites as there appeared to be no bias to CK2 phosphorylated S224A or S224E (Figure 2B).

Cytological distribution of CTCF Ser²²⁴-P is comparable to total CTCF

Using immunofluorescence, we next asked if the CTCF Ser²²⁴-P antibody recognized its epitope in situ and if CTCF Ser²²⁴-P localization was comparable to total CTCF. Using immortalized tetraploid female MEFs, we found that the CTCF Ser²²⁴-P antibody labeled a nuclear antigen in a pattern reminiscent of CTCF (Figure 2C, Figure 2—figure supplement 1A). Furthermore, CTCF was visually excluded during interphase from nucleoli and constitutive heterochromatic regions (Burke et al., 2005). Therefore we also scrutinized CTCF Ser²²⁴-P relative to the nucleolus, constitutive heterochromatin, and facultative heterochromatin using antibodies against B23, H3K9me3 and H3K27me3 respectively (Spector et al., 1984; Bannister and Kouzarides, 2011). In wildtype female cells, H3K27me3 is known to be prominently enriched on the inactive X chromosome (Xi). In tetraploid MEFs, it usually appears as two nuclear bodies. CTCF Ser²²⁴-P was not enriched on the Xi, and we found no evident visual distinction between CTCF Ser²²⁴-P and total CTCF nuclear distribution regardless of co-staining nucleoli and heterochromatin (Figure 2—figure supplement 1B–D). Thus, at a cytological level, CTCF Ser²²⁴-P appears to distribute similarly as total CTCF in the nucleus.

CTCF Ser²²⁴-P accumulates during the G2/M transition of the cell cycle

However, a remarkable observation from our immunofluorescence assays was that CTCF Ser²²⁴-P labeling was not homogeneous in an asynchronous population of MEFs. Some of the nuclei were more intensely labeled by the CTCF Ser²²⁴-P antibody whereas nuclei stained with total CTCF antibody were uniformly labeled (arrows, Figure 2C). While it also appears that there were less intensely CTCF Ser²²⁴-P stained nuclei that were observable particularly when compared to normal IgG staining, this may be attributable to antibody cross reactivity to unmodified CTCF Ser²²⁴ (Figure 2B,C). Since CTCF Ser²²⁴-P potentially occurs in human cells (Kettenbach et al., 2011) and vertebrates share a high degree of sequence identity surrounding Ser²²⁴ (Figure 1E), we next asked if the mottled CTCF Ser²²⁴-P pattern is discernable in asynchronous human cells. As predicted, CTCF Ser²²⁴-P labeling of asynchronous HEK293 revealed that a subset of cells was intensely labeled by the antibody (arrows, Figure 2D). This intimates that this PTM may be associated with a vertebrate-conserved CTCF regulatory mechanism.

Given the irregular CTCF Ser²²⁴-P staining of asynchronous cells, we posited that perhaps this PTM is regulated by the cell cycle. To explore this possibility, we performed CTCF Ser²²⁴-P immunofluorescence on asynchronous MEFs with additional antibodies to cell cycle markers. To identify cells in S phase and G2/M, we labeled cells with antibodies against PCNA and phosphorylated histone H3 Ser¹⁰ (H3S10ph) respectively (Bravo and Macdonald-Bravo, 1985; Hendzel et al., 1997). While a total CTCF antibody labeled 100% of the cells (n = 104), again only a fraction of the cells (23.4%, n = 248) were CTCF Ser²²⁴-P positive (Figure 2E). Comparing this fraction to the cell cycle markers revealed that a striking majority (94.8%) of the CTCF Ser²²⁴-P positive cells were co-labeled with H3S10ph antibody (Figure 2E, Figure 2—figure supplement 1E). The CTCF Ser²²⁴-P labeling was most prominent in cells in both early G2 and late G2/prophase, as evidenced from the H3S10ph staining pattern (white arrows, early G2; yellow arrows, late G2/prophase. Figure 2E, Figure 2—figure supplement 1E). There were also CTCF Ser²²⁴-P positive cells co-stained with PCNA antibody (27.6%) (Orange and magenta arrows, Figure 2—figure supplement 1E). However, the majority of these CTCF Ser²²⁴-P-PCNA double positive cells (81.3%) were also faintly positive for H3S10ph (magenta arrows, Figure 2—figure supplement 1E). Taken together, these findings indicate that phosphorylation of CTCF Ser²²⁴ likely initiates at the end of S phase and peaks at G2/M. To further evaluate if CTCF Ser²²⁴-P is enriched at G2/M, we arrested TST-1 mESCs 20 hours with the CDK1 inhibitor RO-3306 (Vassilev et al., 2006). RO-3306 treatment resulted in an accumulation of CTCF Ser²²⁴-P positive cells blocked at G2/M as evident from the concomitant increase in H3S10ph co-staining (Figure 2—figure supplement 1F). These data pointed to an association of CTCF Ser²²⁴-P with the G2/M transition of the cell cycle.

PLK1 phosphorylates CTCF Ser²²⁴

To explore how CTCF is specifically phosphorylated at the G2/M transition, we first sought to identify which kinase(s) could be phosphorylating CTCF Ser²²⁴ at the end of S and in G2/M. Naturally, our first search criterion was that a prospective kinase also had to be expressed at these stages of the cell cycle. We also presumed that the kinase has a bona fide role limited to G2/M. Using these guidelines, Polo-like kinase I (PLK1) was the most attractive candidate. PLK1 is a regulator of the G2/M transition and phosphorylates mitosis-associated substrates (Barr et al., 2004). Importantly, PLK1 expression increases from S phase and peaks during G2 (Golsteyn et al., 1994; Lake and Jelinek, 1993). Comparing the consensus PLK1 substrate site to the amino acid sequence flanking CTCF Ser²²⁴ revealed that Ser²²⁴ is a potential target of PLK1 (Figure 3A) (Nakajima et al., 2003). Of note, CTCF was not identified as a PLK1 target in HeLa cells (Kettenbach et al., 2011). However, the designation of PLK1 targets in that study was not resultant from direct assay. Similarly, we did not find PLK1 in our CTCF-3xFLAG IP-MS/MS (Figure 1D) likely due to stringent nuclear extraction conditions used to extract CTCF from the chromatin fraction that disrupted native protein-protein interactions (see Materials and methods). We also examined PLK1 conservation across Nephrozoa by clustering the amino acid sequence identity of orthologs using Clustal. For reference, orthologs of the kinases ATM and GSK3B were also compared. We found that PLK1 conservation mirrors that of its potential CTCF Ser²²⁴ substrate, where the kinase is most conserved amongst vertebrates (Figures 1F and 3B). This finding may further hint at a homologous CTCF Ser²²⁴-P function particular to vertebrates.

Figure 3 with 1 supplement see all

Download asset Open asset

CTCF Ser²²⁴ is phosphorylated by PLK1 and prominently labels pericentric chromatin.

(A) Graphic comparison of PLK1 consensus substrate sequence with CTCF D220-F228. Green S/T with yellow encircled P, phosphorylation site at position 0. Red D/E, aspartic or glutamic acid. Blue Φ, hydrophobic amino acid. (B) Amino acid sequence identity heat map for the conserved kinases PLK1, ATM, and GSK3B. Nephrozoa species aligned in Figure 1F shown. (C) CTCF, CTCF Ser²²⁴-P, PCNA, and H3S10ph immunofluorescence performed on MEFs treated with DMSO or BI 6727 at the indicated concentrations for 12 hr. Nuclei counterstained with DAPI. Bar, 20 μm. % of n cells labeled with CTCF, CTCF Ser²²⁴-P, PCNA, or H3S10ph antibodies indicated. (D) PLK1 in vitro kinase assay with CTCF or dephosphorylated Casein substrates. Red *, phosphorylated CTCF. Red **, autophosphorylated PLK1. Red ***, phosphorylated casein. (E) In vitro kinase assay performed in parallel to (D without radioactive isotope. SDS-PAGE gel Coomassie stained. Red box, CTCF band excised for mass spectrometry analysis. (F) Manually validated mass spectra of CTCF peptide Tyr²¹⁴-Lys²⁴⁴ with y and b ions identified. Phosphorylation event at Ser²²⁴ indicated by red s. (G) Immunofluorescence performed on TST-1 mESC metaphase chromosomes with the indicated antibodies. DNA stained with DAPI. (H) CTCF Ser²²⁴-P and H3K27me3 co-stain from (G) deconvolved.

https://doi.org/10.7554/eLife.42341.006

To test if CTCF can be phosphorylated by PLK1, we first treated asynchronous immortalized MEFs with the PLK1-specific inhibitor BI 6727 (Rudolph et al., 2009). We performed CTCF and CTCF Ser²²⁴-P immunofluorescence after 12 hr of treatment with 100 nM or 1000 nM BI 6727 (EC₅₀ ~10–150 nM) (Rudolph et al., 2009; Rudolph et al., 2015; Gorlick et al., 2014). Consistent with inhibition of PLK1, we observed a likely G2/M defect evident in a higher percent of H3S10ph positive cells in BI 6727 treated cells than control (DMSO, 5.6%; 100 nM BI 6727, 11.4%; 1000 nM BI 6727, 29.2%; t = 12 hr; n > 100 cells). While 30.8% (n = 214) of untreated cells were CTCF Ser²²⁴-P positive, only 18.3% (n = 175) were positive following treatment with 100 nM BI 6727 (Figure 3C). The range of EC₅₀ values for BI 6727 (10–150 nM) suggests that remaining CTCF Ser²²⁴-P was likely due to incomplete inhibition of PLK1 at this concentration. Supporting this, 1000 nM BI 6727 completely ablated CTCF Ser²²⁴-P phosphorylation (n = 101) (Figure 3C). Total CTCF expression was not affected by 12 hr treatment with BI 6726 at either concentration (Figure 3C). While this indicates that PLK1 is essential for CTCF Ser²²⁴-P, treatment with BI 6727 does not exclude the possibility that our observations were an indirect outcome. Therefore, we also performed an in vitro kinase assay with purified recombinant CTCF and PLK1, using both Casein and PLK1 autophosphorylation as positive controls (*** and ** respectively, Figure 3D) (Golsteyn et al., 1995). In support of our hypothesis, we found that PLK1 directly phosphorylated CTCF (*, Figure 3D). However, this result did not indicate if CTCF Ser²²⁴-P had occurred. To test this, we performed the PLK1 in vitro kinase assay without radioactive isotope and we excised the band corresponding to full-length CTCF from the Coomassie stained gel (red box, Figure 3E). Analysis by mass spectrometry revealed phosphorylation at Ser²²⁴, arguing that indeed PLK1 is the kinase for CTCF Ser²²⁴ (Figure 3F). However, as we did not capture in vivo interactions of CTCF with PLK1 (or CK2), we could not confirm either as the CTCF kinase for certain. Therefore, both PLK and CK2 remain potential CTCF kinases.

CTCF Ser²²⁴-P localizes to pericentric regions during mitosis

We posited that this modification could be observed on condensed chromatids during metaphase. It was previously reported that CTCF remains bound to mitotic chromosomes – both along the dyad arms and at centromeres (Burke et al., 2005; Rubio et al., 2008). However it was also reported that CTCF is phosphorylated in its zinc finger domain during mitosis which results in its dissociation from chromatids (Sekiya et al., 2017). Notably, the latter study did not directly identify phosphorylation sites or visualize the mitotic chromatids. Accordingly, to resolve this discord and test our hypothesis, we performed immunofluorescence on mESC metaphase spreads alongside antibody to the facultative heterochromatin mark H3K27me3 that is found on chromatid arms but is excluded from the centromere (Terrenoire et al., 2010). Much like CTCF, CTCF Ser²²⁴-P could be observed on chromatid arms, albeit not uniformly like CTCF (Figure 3G, Figure 3—figure supplement 1A–B). Remarkably, CTCF Ser²²⁴-P appeared distinctly enriched in between the centromeres of the murine acrocentric chromosomes and the H3K27me3 positive dyad arms. We therefore concluded that the metaphase localization of CTCF Ser²²⁴-P is pericentric (Figure 3G–H, Figure 3—figure supplement 1A–B). As PLK1 is associated with kinetochores, it is possible that PLK1 phosphorylates CTCF in the vicinity of the centromere for a mitosis function (Barr et al., 2004).

CTCF Ser224-P binds to a subset of CTCF binding sites during interphase

As the phospho-specific antibody labeled metaphase chromatin and thus demonstrated in vivo DNA association of CTCF Ser²²⁴-P, we next determined its precise genome-wide distribution. To this end, we performed both CTCF Ser²²⁴-P and CTCF chromatin immunoprecipitation and sequencing (ChIP-seq) on asynchronous TST-1 mESCs. From our CTCF ChIP-seq, we detected ~50,000 CTCF peaks genome-wide (z = 6). MEME-ChIP analysis tellingly revealed significant enrichment of CTCF motifs (JASPAR MA0139.1) centered in these peaks (54% of peaks, p=3.0e-10948) (Figure 4—figure supplement 1A) (Bailey et al., 2009). In contrast, only ~900 CTCF Ser²²⁴-P peaks were detected in our CTCF Ser²²⁴-P ChIP-seq (z = 6). Using MEME-ChIP, we de novo identified a significantly enriched motif in the CTCF Ser²²⁴-P peaks (48% of peaks, p=2.7e-88) that closely matched the CTCF motif (p=1.8e-86) and was likewise centered in the peaks (Figure 4A, Figure 4—figure supplement 1B).

Figure 4 with 1 supplement see all

Download asset Open asset

CTCF Ser²²⁴-P occupies a fraction of CTCF sites outside of pericentric chromatin in interphase.

(A) De novo CTCF Ser²²⁴-P motif logo determined with MEME-ChIP (top). JASPAR indexed CTCF motif logo (bottom). (B) % distribution of CTCF (blue) and CTCF Ser²²⁴-P (orange) ChIP-seq peaks across genomic features. % distribution of these features in the genome is shown for comparison (green). (C) Four example screenshots showing CTCF (blue) and CTCF Ser²²⁴-P (red) ChIP-seq coverage tracks and called peaks. Intersected CTCF and CTCF Ser²²⁴-P peaks (purple) are also shown. ENCODE CTCF ChIP-seq coverage (green), Refseq Genes (black) and ENCODE RNA-seq (gray) are shown for reference. Chromosome number and window scale are indicated. (D) CTCF versus CTCF Ser²²⁴-P ChIP-seq coverage on CTCF (black) and shared CTCF and CTCF Ser²²⁴-P ChIP-seq peaks (red). R-squared values for both sets of peaks are shown. (E) CTCF ChIP-seq coverage on CTCF versus shared CTCF and CTCF Ser²²⁴-P ChIP-seq peaks (p<2.2×10⁻¹⁶, Wilcoxon rank sum test). (F) Number of CTCF motifs found in CTCF versus shared CTCF and CTCF Ser²²⁴-P ChIP-seq peaks (p<2.2×10⁻¹⁶, Wilcoxon rank sum test).

https://doi.org/10.7554/eLife.42341.008

Further CEAS analysis of the CTCF Ser²²⁴-P peaks revealed a genomic feature distribution that was also broadly similar to that of CTCF peaks (Figure 4B) (Shin et al., 2009). The distributions of CTCF and CTCF Ser²²⁴-P peaks were also spread across all chromosomes (Figure 4—figure supplement 1C,D). Notably, the vast majority (95.9%) of the CTCF Ser²²⁴-P peaks intersected with our CTCF peaks (Figure 4C). And comparison to mESC RNA-seq data (GSM723776) revealed that CTCF Ser²²⁴-P peaks were also proximal to both transcribed and non-transcribed regions (Figure 4C) (Shen et al., 2012). Binding to pericentric sequences was difficult to detect in deep-sequencing based assays because of their highly repetitive nature. Knowing that our CTCF Ser²²⁴-P antibody showed a low level of cross-reactivity to unmodified CTCF (Figure 2A,B), we next wondered whether our CTCF Ser²²⁴-P binding could be explained by this cross-reactivity. Accordingly, we examined the correlation between CTCF Ser²²⁴-P and CTCF binding as detected by our ChIP-seq (Figure 4D). While there was a detectable linear relationship between CTCF Ser²²⁴-P and CTCF binding signal, this was only sufficient to explain about half of all CTCF Ser²²⁴-P binding (R², fraction of variation in CTCF Ser²²⁴-P binding explained by CTCF binding, for all peaks = 0.48, R² for shared peaks = 0.54). Therefore, at least a subset of CTCF Ser²²⁴-P ChIP-seq peaks likely represented real binding sites outside of pericentric regions. In addition, as our ChIP-seq was performed in unsynchronized ES cells, which are mostly in interphase (S phase), this suggests that CTCF Ser²²⁴-P is bound at these regions outside of G2/M as well.

We next sought to examine features which differentiated regions bound by CTCF Ser²²⁴-P from regions bound by unmodified CTCF. Shared CTCF and CTCF Ser²²⁴-P ChIP peaks (representing 95.9% of all CTCF Ser²²⁴-P peaks) had significantly more CTCF binding signal than CTCF peaks in general (Figure 4E, p<2.2e-16, Wilcoxon rank sum test). While CTCF motifs found in CTCF Ser²²⁴-P peaks did not tend to be more conserved than motifs found in CTCF ChIP peaks (Figure 4—figure supplement 1E, p=0.1623, Wilcoxon rank sum test), CTCFSer²²⁴-P ChIP peaks did tend to contain more motifs than CTCF ChIP peaks (Figure 4F, p<2.2e-16, Wilcoxon rank sum test). In other words, CTCFSer²²⁴-P ChIP peaks tend to be found at higher affinity sites with greater numbers of CTCF binding sites than CTCF ChIP peaks in general. We also found that shared CTCF and CTCF Ser²²⁴-P ChIP peaks tended to be larger than CTCF peaks in general (Figure 4—figure supplement 1F, p<2.2e-16, Wilcoxon rank sum test), which may in part explain these trends.

Finally, we compared our CTCF and CTCFSer²²⁴-P ChIP-seq with a previously published cohesin ChIP-seq done in mESCs (Kagey et al., 2010). While 22.8% and 33.3% of our CTCF ChIP-seq peaks overlapped with SMC1 and SMC3 peaks, respectively, 51.6% and 85.1% of our CTCFSer²²⁴-P ChIP peaks overlapped with SMC1 and SMC3 peaks, respectively. This suggests that CTCFSer²²⁴-P ChIP tends to co-bind more frequently with cohesin as well.

Mutational analysis of CTCF Ser²²⁴-P reveals an effect on cell growth

We next decided to examine the effect of expressing either Ser²²⁴ mutants, S224A or S224E, on cells. The former mutation obviates phosphorylation and the latter glutamic acid substitution is a so-called phosphomimetic that mimics negatively charged phosphorylation by presenting an acidic side chain at that position. We generated doxycycline-inducible S224A- and S224E-3xFLAG mESCs and overexpressed wild type CTCF, S224A or S224E over six days of growth. Ectopic expression of S224E but not wild type or S224A affected growth of mESCs (Figure 5A). Namely, mESC colonies overexpressing S224E for 6 days had significantly smaller diameters than uninduced cells, while overexpressing wild type or S224A did not result in significantly smaller colonies (Figure 5B). Expression levels of the induced FLAG-tagged proteins were comparable in all clones by western blot (Figure 5C). Taken together, our data so far suggest that the phosphorylated form of CTCF may have a specific function during the cell cycle, as forced constitutive expression of the phosphomimetic form results in a cell growth defect. However, we also note that the continued presence of endogenous wild-type CTCF in our overexpression system may be obscuring detection of further phenotypes of S224A and S224E CTCF.

Figure 5

Download asset Open asset

CTCF S224E phosphomimic mutation is poorly tolerated by dividing cells.

(A) Representative brightfield and EGFP images of F1-2.1 mESCs carrying a dox-inducible wild type CTCF, S224A or S224E transgene grown for six days with (bottom) or without (top) doxycycline. Two independent S224A and S224E clones are shown. (B) Quantification of colony diameters in microns of cell lines shown in (A), grown for six days with (blue) or without (red) doxycycline. Student’s t-test was used to calculate p-values between indicated samples, with not significant (N.S.) p-values being >0.05. (C) Western blot measuring FLAG and CTCF protein levels of cell lines shown in (A). GAPDH is shown as a loading control.

https://doi.org/10.7554/eLife.42341.010

CTCF Ser²²⁴ mutations have no effect on nuclear import, DNA binding, cell cycle, or ploidy

Given the phenotype, we further explored S224 mutants to understand the normal function of CTCF Ser²²⁴-P. We initially posited that CTCF Ser²²⁴-P may be critical to regulating some chromatin function of CTCF. As CTCF Ser²²⁴-P was observable in the nucleus in only a fraction of asynchronous cells, it was a remote possibility that Ser²²⁴ is also critical for CTCF nuclear import. Thus, we examined localization of inducible wild type CTCF, S224A, and S224E in MEFs by immunofluorescence. Regardless of either amino acid substitution at Ser²²⁴, nuclear localization of CTCF was not affected. (Figure 6A). As we observed CTCF Ser²²⁴-P bound to only a fraction of CTCF sites genome-wide, it was also a possibility that this amino acid position is critical for DNA binding. However, since Ser²²⁴ is not located in the zinc finger domain, we predicted that Ser²²⁴ would not directly influence sequence-specific DNA binding. To test this, we performed EMSA using a dsDNA probe with a known CTCF motif as well as a probe with mutations in the motif (Spencer et al., 2011). We made recombinant FLAG-tagged S224A and S224E as well as wild type CTCF and GFP as positive and negative controls respectively (Figure 6B). As expected, regardless of amino acid polarity or charge, both mutants bound the known CTCF motif and not the mutated motif (Figure 6C). However, it is still a distinct possibility that, in vivo, CTCF Ser²²⁴-P may signify a regulatory event that determines which of the thousands of genomic CTCF sites are bound.

Figure 6

Download asset Open asset

CTCF Ser²²⁴ is nonessential to nuclear import and DNA binding.

(A) FLAG and CTCF immunofluorescence performed on rtTA MEFs with inducible CTCF-3xFLAG transgenes (wild type, S224A, or S224E). Nuclei counterstained with DAPI. Bar, 10 μm. (B) FLAG western blot of recombinant FLAG-tagged GFP and CTCF (wild type, S224A, or S224E). (C) DNA EMSA using RS14C (left) and RS14C mutant (right) probes. Red lowercase letters, mutated positions. 2 pmole GFP and 0.5, 1, or 2 pmole CTCF (wild type, S224A, or S224E) were used. *, CTCF shifted probe. #, free probe.

https://doi.org/10.7554/eLife.42341.011

To test whether CTCF Ser²²⁴-P may play a role on mitotic chromosomes, we asked if could overexpression of S224A or S224E leads to defects in either cell cycle progression or segregation of chromatids. We generated DNA content profiles of mESCs with inducible wild-type CTCF, S224A, or S224E transgenes after six days of doxycycline induction and did not observe any blockages in the cell cycle (Figure 7A). We also investigated changes in ploidy by kayotyping these cells, specifically examining metaphase spreads for polyploidy and translocations. However, we did not find significantly higher numbers of polyploid spreads in mESCs overexpressing wild-type or mutant CTCF versus uninduced mESCs (Figure 7B,C).

Figure 7

Download asset Open asset

Overexpression of CTCF, S224A or S224E does not impact cell cycle progression or ploidy.

(A) Cell cycle profiles of F1-2.1 mESCs carrying a dox inducible CTCF, S224A or S224E transgene grown for six days with (bottom) or without (top) dox induction. Quantification of percent of cells in each stage of the cell cycle indicated in bar graph to the right. (B) Representative metaphase spreads of cells profiled in A grown for six days with (bottom) or without (top) dox induction. (C) Quantification of chromosome counts for cells profiled in A grown for six days with (bottom) or without (top) dox induction. Wilcoxon rank sum test was used to calculate p-values between indicated samples, with not significant (N.S.) p-values being >0.05.

https://doi.org/10.7554/eLife.42341.012

CTCF Ser²²⁴-P and the borders of topologically associating domains (TADs)

Finding that overexpression of wild type, S224A or S224E had little obvious impact on mitotic chromosomes, we investigated whether it could interfere with CTCF function in interphase. We first noted that many of our CTCF Ser²²⁴-P ChIP-seq peaks detected in interphase overlapped CTCF peaks at the borders of Topologically Associating Domains (TADs) (Figure 8A), megabase-scale organizational structures on chromosomes within which genetic elements show high frequency of interaction (Dixon et al., 2012; Nora et al., 2012). Mammalian chromosomes are generally organized into hundreds of such TADs, with each TAD separated by genetically defined ‘borders’. As previous studies had shown that CTCF binding is important for formation of TAD borders (Sanborn et al., 2015; Nora et al., 2017), we decided to examine the impact of overexpressing wild-type CTCF, S224A or S224E on nuclear architecture using HYbrid Capture Hi-C (Hi-C²), a cost-effective alternative to genome-wide Hi-C (Sanborn et al., 2015). The TAD containing the gene Mecp2 was chosen as the capture region as it contained a sub-TAD domain bound by CTCF at both the left and right borders and CTCF Ser²²⁴-P at the left border (Figure 8A). To minimize secondary impacts on TAD structure, Hi-C was performed after 2 days of wild type CTCF, S224A or S224E overexpression in F1-2.1 mESCs, a time point at which no cell colony defects were observed in any of the three cell lines. By eye, interaction matrices of the Mecp2 region appeared similar with or without overexpression of wild type CTCF, S224A and S224E (Figure 8A). To analyze impact on the interactions within the Mecp2 TAD quantitatively, we additionally calculated insulation scores across the Hi-C² region and used this to calculate a TAD score for the Mecp2 TAD in each condition (Crane et al., 2015). The Mecp2 TAD score was similar with or without overexpression of wild type CTCF, S224A and S224E. Thus, overexpression of CTCF, including CTCF S224E, does not detectably impact three-dimensional chromatin structure (Figure 8B).

Figure 8

Download asset Open asset

Impact of overexpression of CTCF, S224A and S224E on three-dimensional chromatin structure and gene expression.

(A) Hi-C² interaction maps at 25 kb resolution of the Mecp2 TAD in F1-2.1 mESCs carrying dox-inducible wild type, S224A or S224E CTCF-3xFLAG transgenes grown for 2 days with (bottom) and without (top) doxycycline. CTCF and CTCF Ser²²⁴-P ChIP-seq tracks are shown for comparison. Black arrows indicate the left border of a sub-TAD domain bound at both borders by CTCF and at one border by CTCF Ser²²⁴-P. In addition, dotted lines and text in the WT -dox Hi-C² interaction map indicate locations of ChIP-qPCR primers used in (C), with the Irak1 and Ikbkg lines also indicating the borders of the Mecp2 TAD scored in (B). (B) TAD scores for the Mecp2 TAD for the Hi-C² interaction maps in A, with higher TAD score indicating a stronger TAD. (C) CTCF and FLAG ChIP-qPCR (% Pulldown) in F1-2.1 mESCs carrying dox-inducible S224E CTCF-3xFLAG grown for 2 days with or without doxycycline. Cirbp and Ccnd indicate positive control regions for CTCF binding, while Oct4 indicates a negative control region. Flna, Irak1, and Ikbkg regions are as indicated in (A). (D) MA plot of RNA-seq expression changes in F1-2.1 mESCs carrying dox-inducible CTCF S224E transgene after 6 days of dox induction. Points in red are DE genes (adjusted p-value<0.01). (E) Coverage of RNA-seq reads over codon 224 of CTCF with (left) and without (right) 6 days of dox induction. The number of reads with A (green), C (blue), G (gold), T (red) or N (grey) at positions 1, 2, and 3 of codon 224 are shown. (F) Metagene coverage of CTCF ChIP-seq reads (top) and CTCF Ser²²⁴-P ChIP-seq reads (bottom) over upregulated (red), downregulated (green), non differentially expressed (purple) and all (black) genes. (G) RNA-seq, CTCF and CTCF Ser²²⁴-P ChIP-seq coverage over two representative upregulated (left) and downregulated (right) genes.

https://doi.org/10.7554/eLife.42341.013

However, as RNA-seq suggested that levels of S224E CTCF may be modest as compared to wild-type CTCF (see below), we assayed to what extent the mutant S224E CTCF was bound to the Mecp2 TAD. We did CTCF and FLAG ChIP-qPCR in doxycycline inducible S224E-3xFLAG mESCs after 48 hr of dox induction, the same conditions under which the Hi-C² experiment was performed (Figure 8C). While FLAG binding was enriched at positive control region Cirbp over IgG, the –dox sample and a negative control region Oct4 (Figure 8C), the level of binding was much lower than that of CTCF. Similarly, FLAG binding was modestly enriched over IgG, the –dox sample, and Oct4 at at least the two CTCF sites bordering the Mecp2 TAD (Figure 8C, Irak1, Ikbkg), as well as at the site bound by phospho-CTCF in the Mecp2 TAD in one replicate (Figure 8C, Flna), although to an extent much less than that of CTCF. This modest binding of S224E CTCF to the Mecp2 TAD, likely due to endogenous wild-type CTCF remaining in the system, may possibly explain why we were unable to detect changes in chromatin architecture in this region.

Overexpression of CTCF S224E upregulates the p53 signaling pathway and globally plasma membrane proteins at the RNA level

As we could not attribute the mESC growth phenotype to changes in cell cycle progression, CTCF binding or TAD structure, we performed RNA sequencing (RNA-seq) to identify genes that were differentially expressed (DE) when CTCF S224E was induced in mESC’s. We found 375 genes that were DE between two replicates (adjusted p-value<0.01), with 118 (31%) being upregulated and 257 (69%) being downregulated upon overexpression of CTCF S224E (Figure 8C). Surprisingly, CTCF was not a DE gene. To confirm that CTCF S224E was expressed, we examined RNA-seq reads overlapping CTCF codon 224 and found that 12–16% of reads had the glutamate codon at position 224 (Figure 8D). This indicates that while CTCF S224E was present, it was not grossly overexpressed at the RNA level; therefore, our observations from modest exogenous expression may reflect the importance of this site. We next looked at coverage of CTCF and CTCF Ser²²⁴-P ChIP-seq over DE genes in an attempt to determine whether DE genes were directly regulated by CTCF. On average, CTCF binding was enriched along both downregulated and upregulated DE genes, albeit at different locations. In particular, downregulated genes were more enriched in CTCF binding upstream of the transcription start site, while upregulated genes were more enriched in CTCF binding along the gene body. CTCF Ser²²⁴-P enrichment followed a similar pattern (Figure 8E). Next, in order to determine if specific pathways were enriched among our DE genes, we performed functional annotation analysis of upregulated and downregulated DE genes using the PANTHER Overrepresentation Test (Mi et al., 2017; Nikolsky and Bryant, 2009), with the list of all expressed genes as the background. Among upregulated genes, the only significantly overrepresented PANTHER pathway was the p53 pathway (FDR = 0.0495), most notably including Trp53 (upregulated 1.3-fold, adjusted p-value<0.00075) and Cdkn1a (upregulated 1.3-fold, adjusted p-value<0.00043), genes which were also bound by CTCF and CTCF Ser²²⁴-P (Figure 8F). Meanwhile, among downregulated genes, the most significant overrepresented cellular component categories were ‘cell periphery’ (FDR = 2.28×10⁻⁶) and ‘plasma membrane’ (FDR = 3.45×10⁻⁶), although, many downregulated genes did not fall into these categories (Figure 8F). These results suggest that modest CTCF S224E overexpression could lead to the slight activation of p53 signaling, and also global downregulation of proteins on the plasma membrane. The effect may be direct, as CTCF binding was found to be enriched at DE genes or indirect, as RNA-seq was assayed on cells after six days of S224E overexpression in mESCs, when a colony grown phenotype was already apparent. We conclude that the serine 224 of CTCF plays a critical role in regulating gene expression and suggest that the effect on the p53 signaling pathway may partly explain the growth phenotype in mESCs.

Discussion

In the quarter century since its discovery, CTCF has been ascribed a multitude of chromatin functions. While DNA sequence composition and CpG methylation are determinants of CTCF binding, a regulatory system independent of DNA binding has remained elusive. Here we discovered and confirmed a CTCF PTM conserved among vertebrates. We observe for the first time that CTCF is differentially regulated during the cell cycle, with CTCF Ser²²⁴-P being enriched at G2/M and specifically on pericentric regions during metaphase. Notably, CTCF Ser²²⁴ phosphorylation is regulated by the kinase PLK1 whose expression profile also peaks in G2/M. As exogenously expressing S224E leads to colony formation defects and gene expression changes, our data hint at the importance of CTCF Ser²²⁴-P for proper cell function. We were further able exclude several potential functions of CTCF Ser²²⁴-P. First, Ser²²⁴ does not directly affect CTCF binding to DNA. However, while CTCF Ser²²⁴-P occupies a subset of all CTCF sites genome wide, how this PTM is limited to a fraction of sites on the chromatids and pericentric satellites is still unclear. Second, while overexpression of a S224E leads to colony formation defects, we did not observe any changes in a TAD’s structure, suggesting that CTCF Ser²²⁴-P may not impact the architectural role of CTCF, at least around the gene-rich Mecp2 TAD. Third, cell cycle progression and ploidy of cells overexpressing the phospho-mimic were not obviously affected. We conclude that regulation of CTCF Ser²²⁴-P is needed to prevent dysregulation of hundreds of genes, directly or indirectly through mechanisms that likely do not involve CTCF interphase binding, architectural function, cell cycle progression or ploidy. However, an intriguing possibility which we did not investigate is that the negative charge introduced by phosphorylation at Ser224 could alter the conformation of the CTCF protein, altering the manner in which CTCF regulates genes or interacts with other proteins. We hope these observations will provide starting points for further elucidating the natural functions of this CTCF PTM and valuable resources for future study.

With our observation of CTCF Ser²²⁴-P enrichment proximal to the centromere, several avenues of investigation become conspicuous. First, what could be the pericentric function of CTCF Ser²²⁴-P? For example, CTCF interacts with cohesins and the latter remain at the centromere to facilitate cohesion during metaphase (Rubio et al., 2008; Xiao et al., 2011; Stedman et al., 2008; Morales and Losada, 2018). CTCF has also been found to impede ‘loop extrusion’ activity of the cohesin complex during interphase (Vian et al., 2018). Could a similar capacity be expected for pericentric CTCF Ser²²⁴-P and the cohesin complex? Likewise, the presence of pericentric CTCF Ser²²⁴-P may demarcate a topological boundary between the chromatid arms and satellite DNA. Notably, PLK1 also phosphorylates cohesins, promoting their prophase dissociation from the chromatid arms (Losada et al., 2002; Hauf et al., 2005; Giménez-Abián et al., 2004; Sumara et al., 2002). PLK1 phosphorylation of CTCF may also be linked to cohesin dissociation from the chromatids. Another prospective undertaking would be to determine if CTCF Ser²²⁴-P is present at all pericentromeres. Are all pericentromeres identical? Thus, the identification of this PTM and development of a specific antibody may prove to be an invaluable resource to the cell cycle and mitosis communities.

PLK1 mediated phosphorylation of CTCF Ser²²⁴ may also not be a finite event. Coordinated signaling could occur. The Polo Box Domain (PBD) of PLK1 targets the kinase activity by tethering PLK1 to a phosphopeptide (Lowery et al., 2004). While CTCF appears to lack an ideal PBD binding site, perhaps PLK1 is recruited to CTCF Ser²²⁴ by a proximal phosphoprotein (Elia et al., 2003). Conversely, what triggers dephosphorylation of CTCF Ser²²⁴-P? Thus exploring the vicinal events surrounding Ser²²⁴ phosphorylation could elucidate the G2/M function of this PTM. Furthermore, as CTCF contains 100 S/T/Y amino acids, there exists the possibility that other bona fide phosphorylation events exist. Similarly, with 65 lysines, CTCF acetylation and/or methylation may also occur and orchestrate a much more complex symphony of insulator regulation. Hence by confirming Ser²²⁴ phosphorylation, we are likely only previewing what could be an expansive CTCF signaling network. While our assays did not elucidate a conspicuous mechanism or functional consequence of CTCF Ser²²⁴-P, nonetheless its identification and generation of a specific antibody to it can be a useful resource. From a disease perspective, this PTM may also serve as a useful biomarker of dividing cells or prove to be a therapeutic target for competitive or allosteric inhibition, if targeting PLK1 directly proves adverse. Lastly, at least 5000 CTCF binding locations are conserved amongst mammalian genomes and these are often associated with syntenic chromatin domain boundaries (Schmidt et al., 2012; Dixon et al., 2012; Vietri Rudan et al., 2015). As CTCF Ser²²⁴-P also appears conserved amongst vertebrates, it is an attractive possibility that Ser²²⁴ phosphorylation along with yet confirmed CTCF PTMs may constitute a conserved code that underlies chromatin organization and gene regulation akin to the histone code (Jenuwein and Allis, 2001). Elucidation of such a CTCF code may also reveal insights into how interphase contacts are remembered and restored before and after mitosis. This prospective code may also be bifurcated species-specifically by the degree of conservation of particular CTCF PTMs, potentially exposing mechanisms of speciation. Thus, further unmasking CTCF Ser²²⁴-P and thoroughly validating the entirety of CTCF PTMs would be invaluable to understanding how the ubiquitous insulator’s functions are specified.

Share this article

Cite this article

Murine CTCF is phosphorylated at Ser224.

CTCF Ser224-P is conserved and accumulates at G2/M.

CTCF Ser224 is phosphorylated by PLK1 and prominently labels pericentric chromatin.

CTCF Ser224-P occupies a fraction of CTCF sites outside of pericentric chromatin in interphase.

CTCF S224E phosphomimic mutation is poorly tolerated by dividing cells.

CTCF Ser224 is nonessential to nuclear import and DNA binding.

Overexpression of CTCF, S224A or S224E does not impact cell cycle progression or ploidy.

Impact of overexpression of CTCF, S224A and S224E on three-dimensional chromatin structure and gene expression.

Author details

Brian C Del Rosario

Contribution

Contributed equally with

Competing interests

Andrea J Kriz

Contribution

Contributed equally with

Competing interests

Amanda M Del Rosario

Contribution

Competing interests

Anthony Anselmo

Contribution

Competing interests

Christopher J Fry

Contribution

Competing interests

Forest M White

Contribution

Competing interests

Ruslan I Sadreyev

Contribution

Competing interests

Jeannie T Lee

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism

Murine CTCF is phosphorylated at Ser²²⁴.

CTCF Ser²²⁴ is phosphorylated by PLK1 and prominently labels pericentric chromatin.

CTCF Ser²²⁴-P occupies a fraction of CTCF sites outside of pericentric chromatin in interphase.

CTCF Ser²²⁴ is nonessential to nuclear import and DNA binding.