Decoding the centromeric nucleosome through CENP-N

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

Centromere protein (CENP) A, a histone H3 variant, is a key epigenetic determinant of chromosome domains known as centromeres. Centromeres nucleate kinetochores, multi-subunit complexes that capture spindle microtubules to promote chromosome segregation during mitosis. Two kinetochore proteins, CENP-C and CENP-N, recognize CENP-A in the context of a rare CENP-A nucleosome. Here, we reveal the structural basis for the exquisite selectivity of CENP-N for centromeres. CENP-N uses charge and space complementarity to decode the L1 loop that is unique to CENP-A. It also engages in extensive interactions with a 15-base pair segment of the distorted nucleosomal DNA double helix, in a position predicted to exclude chromatin remodelling enzymes. Besides CENP-A, stable centromere recruitment of CENP-N requires a coincident interaction with a newly identified binding motif on nucleosome-bound CENP-C. Collectively, our studies clarify how CENP-N and CENP-C decode and stabilize the non-canonical CENP-A nucleosome to enforce epigenetic centromere specification and kinetochore assembly.

https://doi.org/10.7554/eLife.33442.001

Introduction

Accurate segregation of chromosomes from a mother cell to its two daughters during cell division is a prerequisite for healthy cell physiology and for the transmission of the genetic information across generations (Santaguida and Amon, 2015). Specialized, conserved molecular machinery dedicated to this crucial function has been identified in the majority of eukaryotic organisms studied to date (Drinnenberg et al., 2016; van Hooff et al., 2017). The purpose of this machinery is to generate stable linkages between chromosomes, the carriers of genetic information, and the mitotic spindle, the microtubule-based structure devoted to the segregation of chromosomes into the daughter cells.

In the last two decades, substantial progress in our understanding of the molecular features of the chromosome segregation apparatus has been made. A crucial role in this process is played by centromeres, specialized chromatin domains whose defining mark in almost all known eukaryotes is the enrichment of centromeric protein A (CENP-A, also known as CenH3), which replaces histone H3 in nucleosomes (Fukagawa and Earnshaw, 2014; Musacchio and Desai, 2017). The primary function of centromeres is to provide a platform for the assembly of macromolecular complexes known as kinetochores, whose task in turn is the physical capture of microtubules of the mitotic spindle. Kinetochores contain approximately 30 core subunits, normally subdivided in centromere-proximal and microtubule-proximal groups. The microtubule-proximal subunits (outer kinetochore), which are directly implicated in microtubule binding, are usually denoted as the KMN assembly, from the name of three sub-complexes, the Knl1, Mis12, and Ndc80 complexes (Musacchio and Desai, 2017). The centromere-proximal subunits (inner kinetochore), which are also organized in sub-complexes, are collectively identified as the constitutive centromere associated network (CCAN) because they appear to reside at centromeres for the entire cell cycle (Cheeseman and Desai, 2008; Foltz et al., 2006; Izuta et al., 2006; Obuse et al., 2004; Okada et al., 2006) (Figure 1A).

Figure 1 with 1 supplement see all

Download asset Open asset

The interaction of CENP-N with nucleosomes.

(A) Schematic of crucial CCAN and KMN subunits discussed in the text. The Knl1-Mis12-Ndc80 (KMN) complex is the main microtubule receptor at the kinetochore. Other interactions are discussed in the main text. The question mark indicates that the precise determinants for the recruitment of CENP-LN to CENP-C and for the interaction of CENP-N with the CENP-A nucleosome have not been identified. (B) Schematic depicting constructs described in the manuscript. (**C–E**) Solid phase binding assays where the indicated GST fusion proteins were immobilized on glutathione-sepharose beads (at a final concentration of 1 µM) and incubated with 3 µM of the indicated nucleosome core particles. After incubation (see Materials and methods), beads were centrifuged, washed, and bound proteins visualized by SDS-PAGE and Coomassie staining.

https://doi.org/10.7554/eLife.33442.002

The ability of CENP-A to nucleate kinetochores depends on its incorporation into nucleosomes (CENP-A nucleosomes) with histones H2A, H2B, and H4. In vitro, these interact specifically and selectively with two CCAN components, CENP-C and CENP-N (Carroll et al., 2010, 2009; Guo et al., 2017; Guse et al., 2011; Hoffmann et al., 2016; Klare et al., 2015; Nagpal et al., 2015; Samejima et al., 2015; Weir et al., 2016). Binding of these proteins to CENP-A nucleosomes has been shown to require two regions where the CENP-A sequence diverges significantly from that of histone H3, the L1 loop and the C-terminal tail (Carroll et al., 2009; Fachinetti et al., 2013; Guo et al., 2017; Kato et al., 2013; Logsdon et al., 2015). An evolutionary conserved motif of CENP-C, present in one or two copies in different organisms, is sufficient for recognition of CENP-A in vitro. This motif interacts primarily with a solvent-exposed acidic patch on the H2A and H2B subunits of the CENP-A nucleosome and also decodes the divergent C-terminal tail of CENP-A (Guo et al., 2017; Kato et al., 2013). The two copies of this motif in human CENP-C are referred to as the central motif (or domain) and the CENP-C motif (Figure 1A). While at least the central motif has been shown to be required for efficient centromere retention of newly incorporated CENP-A (Guo et al., 2017), neither motif appears to be strictly necessary for centromere localization of CENP-C in human cells (Guo et al., 2017), likely because CENP-C contains binding sites for additional CCAN subunits that can stabilize its centromere localization even in the absence of a direct interaction with CENP-A (Guo et al., 2017; Hinshaw and Harrison, 2013; Klare et al., 2015; McKinley et al., 2015; Nagpal et al., 2015; Weir et al., 2016). The specific succession of binding sites within CENP-C, a protein that secondary structure prediction algorithms identify as being largely intrinsically disordered, has led to suggest that it acts as a blueprint in the establishment of the inner to outer kinetochore axis, with an N-terminal motif involved in stabilizing the outer kinetochore, a middle region involved in stabilizing the inner kinetochore CCAN complex, and a C-terminal region involved in interactions with the centromeric chromatin (Figure 1A) (Gascoigne et al., 2011; Kato et al., 2013; Klare et al., 2015; McKinley et al., 2015; Przewloka et al., 2011; Screpanti et al., 2011).

CENP-N forms a constitutive complex with CENP-L (designated CENP-LN complex), which in turn interacts with the CENP-HIKM complex and with CENP-C (Guo et al., 2017; Hinshaw and Harrison, 2013; Klare et al., 2015; McKinley et al., 2015; Weir et al., 2016). Binding of CENP-N requires the exposed L1 loop of CENP-A and may also reach into the neighboring DNA (Carroll et al., 2010; Carroll et al., 2009; Fang et al., 2015; Guo et al., 2017). The structural basis of the interaction of CENP-N with the CENP-A nucleosome, however, has remained elusive. Furthermore, it is unclear whether this interaction is sufficient for the recruitment of CENP-N to the kinetochore, or whether additional interactions with CCAN subunits are also required. Here, we addressed both issues. First, we combined X-ray crystallography and cryo electron microscopy (EM) to gain a high-resolution view of the CENP-N:CENP-A nucleosome complex, and identified and validated the main determinants of this interaction. Second, we defined the determinants of a physical interaction of CENP-LN with CENP-C and demonstrated that kinetochore recruitment of CENP-N requires the coincident presence of CENP-A and CENP-C at kinetochores. Our studies have important implications for kinetochore assembly and epigenetic specification of centromeres.

Results

Crystal structure of CENP-N^1-235

Human CENP-N, a 339-residue protein (Figure 1B), interacts directly with CENP-L (Hinshaw and Harrison, 2013; Weir et al., 2016). When immobilized on solid phase and challenged with CENP-A or H3 nucleosome core particles (NCPs), CENP-LN interacted specifically with CENP-A^NCP (Figure 1C). As shown previously (Carroll et al., 2009), the CENP-A-binding region of the CENP-LN complex lies within the N-terminal region of CENP-N, because a stable fragment encompassing residues 1–212 of human CENP-N (CENP-N^1-212) also bound selectively to CENP-A^NCPs but not H3^NCPs (Figure 1D).

To address the structural features of CENP-N and the basis of its interaction with the CENP-A^NCP, we therefore focused our structural analysis on N-terminal constructs of CENP-N (Figure 1B). We obtained well diffracting crystals of the CENP-N^1-235 construct and determined its crystal structure at 2.8 Å resolution (Table 1). CENP-N^1-235 consists of two closely juxtaposed domains that interact through an extended interface to form a single structural unit (Figure 2A–B). The first domain (residues 1–77) consists of a five-helix bundle, whereas the second domain (residues 78–212, cyan in Figure 1A) consists of a six-stranded anti-parallel β-sheet sandwiched between α-helices (Figure 2C–D). There is no clear density beyond residue ~210, indicating that the structure is disordered after this point. Fold-recognition by DALI (Holm and Rosenström, 2010) identified similarity of the first domain to PYRIN domains (PYDs; a superposition is shown in Figure 2—figure supplement 1A–B). PYDs are ‘death fold’ family domains implicated in protein-protein interactions relevant to inflammation and apoptosis (Ratsimandresy et al., 2013). They have not been previously implicated in interactions with DNA or chromatin.

Figure 2 with 3 supplements see all

Download asset Open asset

Crystal structure of the CENP-A-binding region of CENP-N.

(A) Cartoon model of CENP-N^1-235 with secondary structure and domain organization. (B) Close-up of the boxed region in A. (C) Topology diagram of CENP-N. The topology of the Pyrin and CLN-HD domains was directly derived from the crystal structure of CENP-N^1-235 reported here. The topology of the CENP-L-binding domain was derived from the crystal structure of the Chl4 fragment in the complex of the Chl4^CENP-N:Iml3^CENP-L yeast homolog (Hinshaw and Harrison, 2013). (D) Multiple sequence alignment of CENP-N from the indicated species with secondary structure. Green, blue, and orange dots indicate solvent-exposed, semi-buried, and buried side chains, respectively. Positions with conserved residues are displayed red; positions with conserved side chain charge are boxed. (E) Schematic summarizing domain organization of CENP-L, CENP-N, and their dimerization.

https://doi.org/10.7554/eLife.33442.004

Table 1

X-ray data collection and refinement statistics

https://doi.org/10.7554/eLife.33442.008

Data collection and processing
	Native	SeMet 1	SeMet 2	SeMet 1 + 2
Space group	P4₁	P4₁	P4₁	P4₁
Wavelength	0.97793	0.9793	0.9793	0.9793
No. xtals	1	1	1	2
Source	SLS	PETRA	PETRA	PETRA
Detector	Pilatus 6M	Pilatus6M	Pilatus 6M	Pilatus 6M
Mol/AU	2	2	2	2
a,b,c (Å)	87.3 87.3 81.1	88.99 88.99 76.96	89.14 89.14 77.22	88.99 88.99 76.96
α, β, γ (°)	90 90 90	90 90 90	90 90 90	90 90 90
Resolution (Å)	87.3–2.74 (2.81–2.74)*	48.7–3.3 (3.9–3.3)	48.8–3.2 (3.3–3.2)	48.7–3.3 (3.4–3.3)
R_meas	8.2 (155.1)	17.2 (153.4)	18.8 (173.4)	18.7 (167.8)
I/σI	17.3 (1.4)	7.5 (1.1)	7.2 (1.0)	10.4 (1.4)
Completeness (%)	99.8 (98.5)	100.0 (100.0)	99.9 (98.8)	100.0 (100.0)
Redundancy	9.4 (8.7)	7.1 (7.2)	7.0 (6.3)	14.1 (14.1)
Refinement				Phasing
Resolution (Å)	87.3–2.7			FOM 0.39
No. reflections	17103			BAYES-CC 38.1
R_work/R_free(%)	21.6/26.1			12 Selenium-sites
No. atoms:
Protein/ Ligands	3432/6
Water	10
aver. B (Å²)	90.4
R.m.s. deviations
Bond lengths (Å)	0.0076	Ramachandran plot: 98.0% favourable, 0% outliers
Bond angles (°)	1.27

* Values in parentheses are for highest resolution shell

Iml3 and Chl4 are fungal orthologs of CENP-L and CENP-N, respectively. We referred to a previously reported crystal structure of the full-length Iml3 protein bound to the C-terminal region of Chl4 (Iml3:Chl4^C, PDB ID 4JE3) (Hinshaw and Harrison, 2013) to deduce the structural organization of the human CENP-LN complex. Iml3 consists of an N-terminal domain (shown in green in Figure 2—figure supplement 2A) and a C-terminal domain (the ‘insert’ domain shown in yellow; the topology of Iml3 is shown in Figure 2—figure supplement 2B). Iml3 hetero-dimerizes with Chl4 through a subdomain within the insert domain (Figure 2—figure supplement 2B) (Hinshaw and Harrison, 2013). Due to strong sequence similarity of Iml3 and CENP-L throughout their length (not shown), the structure of Iml3 provides an excellent model for the structure of CENP-L. Importantly, although our crystal structure does not encompass the C-terminal region of CENP-N, the sequence of the latter is strongly related to that of Chl4^C (Figure 2—figure supplement 3A), which was captured in complex with Iml3 in the Iml3:Chl4^C structure, indicating that they are also structurally related (Figure 2C). Indeed, as already observed (Guo et al., 2017; Hinshaw and Harrison, 2013), the C-terminal region of CENP-N (CENP-N^230-C) was sufficient to interact with CENP-L (Figure 1—figure supplement 1). Thus, the structure of CENP-N^1-235 reported here and that of the Iml3:Chl4^C complex are complementary, and together provide an almost comprehensive view of the CENP-L^Iml3:CENP-N^Chl4 complex (Figure 2—figure supplement 3A and Figure 2—figure supplement 3B).

Besides identifying the N-terminal domain of CENP-N^1-235 as a PYRIN domain, DALI also identified an unanticipated structural homology of the second domain of CENP-N^1-235 with the N-terminal domain of Iml3^CENP-L (Figure 2—figure supplement 3C). We therefore refer to these domains of CENP-N and CENP-L as CLN-HD (for CENP-L and CENP-N homology domain). Structural similarities of the CLN-HD suggest that CENP-N and CENP-L are evolutionary related. However, sequence identity of the two domains, even after structural superposition, is minimal, likely explaining why structural similarity had not been predicted (Figure 2—figure supplement 3D). CENP-L, or its complex with CENP-N^230-C, did not interact with CENP-A^NCPs or H3^NCPs (Figure 1E and Figure 1—figure supplement 1). Thus, CENP-L and CENP-N, even if partly structurally related, have clearly distinct functions. In conclusion, the structure of CENP-N contains an N-terminal Pyrin domain, a central CLN-HD, and a C-terminal CENP-L dimerization domain, while CENP-L contains an N-terminal CLN-HD, interrupted immediately before the C-terminal helix by an insertion that contains a region required for CENP-N dimerization.

Cryo-EM analysis of the CENP-N:CENP-A nucleosome complex

Using cryo electron microscopy (cryo-EM), we obtained a three-dimensional reconstruction of CENP-N^1-289 bound to CENP-A^NCPs at ~4.0 Å (Figure 3A–B, Figure 3—figure supplement 1, and Table 2). We built an atomic model of the CENP-N:CENP-A^NCP complex by fitting into the EM density high-resolution models of the CENP-A histone core (PDB ID 3AN2) (Tachiwana et al., 2011), combined with DNA derived from a nucleosome reconstituted with the 145 bp 601 DNA sequence (PDB ID 3LZ0; Vasudevan et al., 2010), and the newly determined crystal structure of CENP-N^1-235. Both manual and automatic fitting strategies produced unequivocal fits, allowing the first visualization of the interaction of CENP-N with the CENP-A nucleosome (Figure 3—figure supplement 2).

Figure 3 with 6 supplements see all

Download asset Open asset

The CENP-N:CENP-A^NCP complex.

(A) Cartoon model of the CENP-A^NCP with bound CENP-N^1-235, determined by cryo-EM. (B) Surface representation of the complex. In A and B, the L1 loop of CENP-A is displayed in red. (C) Comparison of the DNA ends in the crystal structure of the CENP-A nucleosome (Tachiwana et al., 2011) and in the structure of the CENP-A:CENP-N complex. (D) Electrostatic potential at the CENP-N DNA binding interface with contour levels ± 4 k_BT/e (k_B, Boltzmann constant; T, absolute temperature; e, the magnitude of electron charge, calculated with the APBS Pymol plugin). (E) Interaction of CENP-N with backbone, minor groove, and major groove of DNA with close-up views of selected interactions. (F) Interactions at the CENP-A L1 loop and comparison with superimposed H3.

https://doi.org/10.7554/eLife.33442.009

Table 2

EM data collection, processing, and refinement statistics

https://doi.org/10.7554/eLife.33442.016

Data collection and processing
Voltage (kv)	300
Magnification	290,000x
Defocus (μm, nominal)	−1.0 to −2.5
Pixel size (Å)	1.02
electron dose rate (counts/pixel/s)	10
Total electron dose (e^- /Å²)	80
Exposure time (s)	8
Number of images (collected/processed)	3900/3024
Number of frames per image	40
Initial particle number	1,843,269
Particle number for 3D classification	1,267,674
Final particle for refinement	937,118
Resolution (masked/unmasked) (Å)	4.0/4.2
Map sharpened b-factor (Å²)	−233
Model refinement
r.m.s. deviation (bonds)	0.005
r.m.s. deviation (angles)	0.97
All-atom clashscore	2.30
Ramachandran plot
Outliers (%)	0.00
Allowed (%)	4.59
Favored (%)	95.81
CaBLAM analysis:
Outliers (%)	1.92
Disfavored (%)	6.65
Ca outliers (%)	0.11
Rotamer outliers (%)	0.00

The CENP-A nucleosome appears to be stabilized by its interaction with CENP-N (Guo et al., 2017). There is clear density for 139 of the 147 bp of DNA and for the N-terminal helix of CENP-A (Figure 3C and Figure 3—figure supplement 2A), two features reported to be largely disordered and thus invisible in the crystal structure of the CENP-A^NCP (PDB ID 3AN2) (Tachiwana et al., 2011). CENP-N, whose structure changes very little upon binding to the CENP-A nucleosome, is positioned on top of the L1 loop of CENP-A (also called RG loop for the presence of a conserved arginine-glycine motif at the loop’s apex) and contacts approximately 15 base pairs of the adjacent DNA gyre (Figure 3A). There is clear density only until CENP-N^1-289 residue ~210, indicating that the following approximately 80 C-terminal residues (at the opposite end of the nucleosome interaction interface) may be flexible. Of the ~2400 Å² of CENP-A^NCP and CENP-N surface area that become buried in the complex,~1400 Å² are at the CENP-N:DNA interface, where both CENP-N domains form extensive interactions with DNA from bp −21 to −35 relative to the twofold axis, or superhelical location [SHL] −2 to −3. There is a marked accumulation of positively charged residues on this DNA binding interface (Figure 3D). Four loops in the CLN-HD straddle the DNA double helix over ~8 bp, and the consecutive 7 bp are bound by the PYRIN domain, which is positioned to insert an arginine (R44) into the compressed minor groove in an arrangement that is reminiscent of the minor groove arginines inserted by the histones (Figure 3E). The highly conserved P17 in the PYRIN domain positions the main chain of CENP-N to latch on to the phosphate backbone of the DNA, with interactions made through the side chains of K15, R42, K45, K81, and R194. There are also likely insertions of CENP-N side chains into two minor grooves (besides R44, also K148, M167, R170) and the intervening major groove at SHL −3 [R196, see also (Carroll et al., 2009) (Figure 3E)]. In agreement with the presence of a large interaction interface with nucleosomal DNA, CENP-N bound more tightly to CENP-A^NCPs but retained substantial binding affinity for H3^NCPs in electrophoretic mobility shift assays (EMSAs) (Figure 3—figure supplement 3A–B). Likely, this residual binding to H3^NCPs in the EMSAs, which emerged less clearly in solid phase binding assays (Figure 1C), reflects emphasis on electrostatic interactions under the low-salt conditions of the EMSA assays (150 mM NaCl for complex formation, followed by further dilution upon loading onto the gel), compared to the GST-binding assays (300 mM NaCl), as also discussed in the context of Figure 4—figure supplement 2.

The CENP-N:CENP-A interface

The substantial interface with DNA explains why CENP-N does not bind CENP-A:H4 tetramers lacking DNA (Carroll et al., 2009). However, while DNA binding clearly contributes to the binding affinity of this interaction, it is unlikely to contribute to the discrimination of CENP-A^NCPs from H3^NCPs, because CENP-N bound selectively to CENP-A^NCPs even when the CENP-A^NCPs and H3^NCPs contained the same DNA sequence (Figure 1B–C). Conversely, the structure clearly suggests why recognition of the L1 loop is crucial for discrimination (Black et al., 2004; Carroll et al., 2010, 2009). CENP-N binds the L1 loop through a continuous interface comprising the α1 helix in CENP-N^PD and the β3-β4 loop in CENP-N^CLN-HD. Several of the infrequent conserved solvent-exposed residues of CENP-N (identified by a green dot in Figure 2D), including E3, E7, R11, K143, P145, N146, and K148 reside in this interface. Y147, which is less conserved, contributes to the stabilization of the relative arrangements of the CENP-N^PD and CENP-N^CLN-HD, which is largely unchanged in the nucleosome-bound and free structures of CENP-N. Insertion of the side chain of M1 into the hydrophobic core contributes to the stabilization of the α1 helix. The interaction with the CENP-A L1 loop engages a triad of residues, E3^CENP-N, E7^CENP-N, and R11^CENP-N, whose side chains emerge from the same face of the α1 helix looking toward the L1 loop (Figure 3F).

The CENP-A residues R80^CENP-A and G81^CENP-A form a two-residue insertion that is the most conspicuous difference between the L1 loops in CENP-A and H3 (Figure 3F, Figure 3—figure supplement 4A–B). The insertion is crucial, because it allows R80^CENP-A to form hydrogen bonds with both E3^CENP-N and E7^CENP-N, while absence of a side chain at G81^CENP-A allows the CENP-A loop to insert deeply into a cleft formed between the two CENP-N domains, where the side chain of Y147^CENP-N packs tightly against V82^CENP-A. In EMSAs, mutation of R80 and G81 to alanine partly ablated the preference of CENP-N for CENP-A^NCPs (Figure 3—figure supplement 3A–B). The side chain of R11^CENP-N, a residue previously shown to be important for the CENP-N:CENP-A^NCP interaction (Carroll et al., 2009), on the other hand, is squeezed between the loop 1 region of CENP-A and the loop 2 region of H4, where it may be involved in a double salt bridge with E74^H4 and E7^CENP-N (Figure 3F and Figure 3—figure supplement 5).

Mutational validation of the CENP-N:CENP-A^NCP structure

We generated a collection of single and double alanine point mutants to probe the role of individual CENP-N residues in the interaction with the CENP-A^NCP. In pull-down assays in vitro, we found essentially complete loss of binding with an alanine (A) mutant of R11 (Figure 4A), and substantial reductions of binding with alanine mutants of E7 or Y147, at the CENP-A L1 interface, or of K15 or K45, at the interface with DNA (Figure 4—figure supplement 1A). Combining mutations of Y147 with either K15 or K45 almost completely disrupted CENP-A^NCP binding (Figure 4A), in line with the idea that recognition of the L1 loop and of the DNA jointly contribute to the binding affinity of CENP-N for the CENP-A nucleosome. CENP-N targeting to centromeres in U2OS cells reflected the observations made in vitro, with R11A single mutant and the K15A-Y147A and K45-Y147A double mutants appearing severely impaired in the ability to target centromeres (Figure 4B–C), and other single mutants suffering intermediate effects on binding to centromeres (Figure 4—figure supplement 1B).

Figure 4 with 2 supplements see all

Download asset Open asset

Validation of the CENP-N:CENP-A^NCP complex.

(A) In vitro binding assay probing the interaction of GST-CENP-N^1-212 immobilized on solid phase with CENP-A^NCP. (B) Fluorescence microscopy analysis comparing localization at human kinetochores (U2OS osteosarcoma cells) of a wild-type CENP-N-mCherry fluorescent reporter and of its mutant variants. (C) Quantification of localization of the mCherry constructs in B normalized to CREST. The same concentrations of transiently transfected plasmids were compared. Error bars represent SD.

https://doi.org/10.7554/eLife.33442.017

In competition gel shift assays, CENP-N mutants at the CENP-A L1 loop interface (L1 mutants), including R11A and two double mutants (E3A-E7A and K143A-Y147A), lost the ability of CENP-N^wt to discriminate between CENP-A and H3-nucleosomes (Figure 4—figure supplement 2A-B). We have shown in Figure 4A that CENP-N^R11A does not bind CENP-A^NCPs in solid phase-binding assays. The residual interaction of this mutant with CENP-A^NCPs or H3^NCPs in EMSAs likely reflects the extensive binding interface of CENP-N for nucleosomal DNA (whose effects are emphasized under low-salt conditions, as already indicated for CENP-N^wt in the context of Figure 3—figure supplement 3). In line with this interpretation, we find that in the EMSAs the L1 mutants of CENP-N bind H3^NCPs indistinguishably from CENP-N^wt, whereas the same mutants bind to CENP-A^NCPs considerably worse than CENP-N^wt (Figure 4—figure supplement 2). Collectively, these results further emphasize the importance of the L1 loop of CENP-A in selective recognition by CENP-N.

Identification of a CENP-C region involved in CENP-LN binding

As discussed in the Introduction, CENP-C, an intrinsically disordered protein, provides a defined succession of binding sites for different kinetochore components (Klare et al., 2015) (Figure 1A). CENP-C, CENP-LN, and CENP-HIKM, another kinetochore sub-complex located in the vicinity of CENP-A, form a 7-subunit complex (designated CHIKMLN) that binds CENP-A^NCPs cooperatively, that is with increased binding affinity in comparison to any of the individual subunits or sub-complexes (McKinley et al., 2015; Weir et al., 2016). Within this assembly, CENP-C^2-545 binds the CENP-LN complex in vitro [Figure 5—figure supplement 1A and (Hinshaw and Harrison, 2013; McKinley et al., 2015; Nagpal et al., 2015; Weir et al., 2016).

We set out to exploit biochemical reconstitution and our improved structural understanding of the CENP-LN complex to query the importance of this interaction for kinetochore assembly in humans. Trimming of CENP-C^2-545 identified CENP-C^225-364 as a minimal CENP-LN interaction domain (Figure 5A), in line with a recent study (Guo et al., 2017). Neither CENP-L nor CENP-N^1-235 bound to CENP-C^2-545 (Figure 5—figure supplement 1B–C). However, the CENP-LN^230-C dimer bound CENP-C in the absence of nucleosomes (Figure 5B). Thus, CENP-C^225-364 binds at or near the CENP-LN dimer interface, possibly also exploiting structural ordering of these regions upon dimerization. CENP-C^225-364 contains a handful of conserved residues, some of which were previously shown to mediate an interaction with the CENP-HIKM complex (Klare et al., 2015) (Figure 5C). We probed an additional conserved linear motif in CENP-C^225-364 (residues 302–306) for its potential role in CENP-N binding. A 5-alanine mutant of residues 302–306 (identified as CENP-C^5A) failed to interact with CENP-NL, identifying this region of CENP-C as the CENP-LN binding motif (Figure 5D and Figure 5—figure supplement 2). Importantly, CENP-C^2-545-5A did not interact with CENP-LN, but retained binding to CENP-A^NCPs and CENP-HIKM (Figure 5E–F). In isothermal titration calorimetry (ITC) experiments, CENP-LN^230-C bound CENP-C^225-364 with a dissociation constant (K_D) of 1 µM, but it showed no binding to CENP-C^225-364-5A (Figure 5G–H).

Figure 5 with 2 supplements see all

Download asset Open asset

Identification of a CENP-N binding site on CENP-C.

(A) Size exclusion chromatography (SEC) runs of CENP-C^225-364, CENP-LN complex, and their combination at the indicated loaded concentrations, identified a binding site for CENP-LN in CENP-C^225-364. Elution fractions were separated by SDS-PAGE and visualized by Coomassie staining. (B) CENP-LN^230-C binds CENP-C^2-545, indicating that the CENP-N N-terminal region is not required for CENP-C binding. (C) Sequence of a segment of the PEST-rich domain CENP-C that contains a binding site for the CENP-HIKM complex (Klare et al., 2015) (residues indicated in salmon). The CENP-N-binding motif is shown in grey. (D) CENP-LN does not bind CENP-C^2-545-5A. (E) CENP-C^2-545-5A retains the ability to bind to CENP-A^NCP. (F) CENP-C^2-545-5A retains the ability to bind to the CENP-HIKM complex. (G) Isothermal titration calorimetry (ITC) experiment quantifying the physical interaction of the CENP-L:CENP-N^230-C complex with CENP-C^225-364. (H) In agreement with the SEC data, CENP-C^225-364-5A fails to interact with the CENP-L:CENP-N^230-C complex in an ITC experiment.

https://doi.org/10.7554/eLife.33442.020

CENP-LN binding motif of CENP-C is required for kinetochore recruitment of CENP-N

The availability of a CENP-LN binding mutant of CENP-C gave us an opportunity to ask if the interaction of CENP-LN with CENP-A, besides being necessary, is also sufficient for kinetochore recruitment of CENP-N. For this, we depleted CENP-C by RNAi and replaced it with exogenous wild type (wt) or mutant (5A) copies. Depletion of CENP-C prevented kinetochore localization of CENP-N, showing that nucleosome binding is not sufficient for CENP-N to reach kinetochores at the low cellular concentration of these proteins. Exogenously expressed wild-type CENP-C promoted CENP-N recruitment, while CENP-C^5A failed to promote it (Figure 6A–B). Thus, the CENP-LN binding site of CENP-C, while not crucial for CENP-C recruitment to kinetochores, is instead crucial for CENP-N recruitment. Overall, these observations indicate that the CENP-LN complex reads the presence of two features of kinetochores, the presence of CENP-A and the presence of CENP-C, both of which are necessary for its efficient recruitment.

Figure 6

Download asset Open asset

Effective CENP-N localization requires CENP-C.

(A) Fluorescence microscopy analysis comparing kinetochore localization of a wild-type CENP-N-mCherry fluorescent reporter in human HeLa FlpIn TRex cells depleted of CENP-C and, were indicated, further expressing wild-type GFP-CENP-C or the 5A mutant. (B) Quantification of CENP-C (left) and mCherry-CENP-N (right) levels at kinetochores in mitotic cells following the rescue of CENP-C depletion by either GFP-CENP-C^WT or the GFP-CENP-C^5A mutant. Graphs show kinetochore fluorescence intensity of the indicated protein (antibodies against CENP-C or mCherry) normalized to CENP-C or mCherry-CENP-N kinetochore levels in the absence of RNAi treatment, respectively. Each graph is representative of two independent experiments. (C) Surface representation of a composite model built by combining the coordinates of the CENP-C motif (residues 712 to 733 from PDB ID 4 × 23, describing its interaction with nucleosome) with those of the CENP-N:CENP-A^NCP complex. (D) Schematic of crucial kinetochore interactions, already shown in Figure 1A, but with question marks removed at interactions investigated in the present work. (E) The grey box, an enlargement of the box in D, summarizes the details of the interactions reported in this work, as well as previous information on the interaction of CENP-C with the CENP-A^NCP.

https://doi.org/10.7554/eLife.33442.023

Discussion

The histone H3 variant CENP-A is an essential feature of centromeres and has two main functions. First, it is required for kinetochore assembly through its direct interactions with inner kinetochore subunits that can then seed the assembly of this large macromolecular assembly. Second, it is a landmark that determines the stability of centromere chromatin identity through cell division. Interactions of CENP-C and CENP-N with CENP-A^NCPs are the only known direct and specific points of contact of the kinetochore with the centromere and are therefore the crucial effectors through which CENP-A implements its role (Carroll et al., 2010, 2009; Kato et al., 2013).

While the structural basis of CENP-C binding to CENP-A had been described (Kato et al., 2013), how CENP-N binds CENP-A had remained elusive. Here, we have filled this important gap, as shown schematically in Figure 6D–E. CENP-N nucleosome binding differs from that observed with RCC1 and other nucleosome binders that engage primarily an exposed acidic patch on histones H2A and H2B (Makde et al., 2010). However, in its outline it resembles the interaction of the ATPase domain of SWI2/SNF2 chromatin remodeler with H3^NCPs (Farnung et al., 2017; Liu et al., 2017; Narlikar et al., 2013), with the important difference that SWI2/SNF2 closely approaches H3 without making significant direct contacts with it, whereas CENP-N interacts directly with CENP-A (Figure 3—figure supplement 6A–D). There are also similarities with the nucleosome-binding mechanism of the bromo-adjacent homology (BAH) domain of Sir3 (PDB ID 3TU4) (Armache et al., 2011), but the latter interacts predominantly with the H4 N-terminal tail, through recognition of K16^H4, and with the acidic patch on H2A-H2B, and much less extensively with DNA (Figure 3—figure supplement 6E–F). In the CENP-N:CENP-A^NCP complex, the normally disordered N-terminal tail of H4 is ordered until R23 and interacts weakly with the CENP-N loop connecting β3 with β4 (Figure 3—figure supplement 2D). The reported mono-methylation of K20 of H4 in the CENP-A nucleosome (Hori et al., 2014) may further modulate this interaction. In summary, the SWI2/SNF2 and BAH modes of nucleosome binding are predominantly based on interactions with DNA or with the histones, respectively, while CENP-N shows a balance of both. The considerable interaction of CENP-N with DNA is a remarkable and unexpected feature of the complex structure.

CENP-C and CENP-N can interact concomitantly with the same CENP-A nucleosome (Carroll et al., 2010), as also confirmed in recent studies (Guo et al., 2017; Weir et al., 2016). The central motif and the CENP-C motif of CENP-C, which confer CENP-A recognition ability in vitro, interact through a ‘arginine anchor’ with the acidic patch of H2A and H2B, and also decode the divergent C-terminal tail of CENP-A (Kato et al., 2013). These determinants of CENP-C binding on CENP-A are located adjacent to, but not overlapping with, the CENP-N-binding footprint. Indeed, when modeled on the CENP-N:CENP-A^NCP structure according to the position it adopts in its structure with the nucleosome (PDB ID 4 × 23) (Kato et al., 2013), the CENP-C motif can be accommodated without steric clashes (Figure 6C,E). Thus, CENP-C and CENP-N interact with CENP-A through complementary interfaces. In the context of a larger CCAN complex, these CENP-A-binding motifs cooperate to increase the overall binding affinity for CENP-A (Guo et al., 2017; Weir et al., 2016).

It has been proposed that CENP-N is significantly stabilized upon binding to the CENP-A nucleosome (Guo et al., 2017). Our study did not identify a clear structural basis for this phenomenon, as we failed to identify significant conformational changes in CENP-N in isolation (crystal structure) compared to its complex with CENP-A^NCP. It has also been proposed that CENP-C reshapes and rigidifies the CENP-A nucleosome and that it modulates the DNA termini to make them match the loose wrap observed at centromeres (Falk et al., 2015). Importantly, these effects of CENP-C binding on the CENP-A nucleosome do not appear to be required for the selective (over H3) interaction of CENP-N, because selectivity for CENP-A was retained in the absence of CENP-C [this study and (Carroll et al., 2009; Weir et al., 2016)]. We also note that the DNA termini appear to be well defined in our structure of the CENP-N:CENP-A^NCP complex, contrarily to what was observed in the structure of the isolated CENP-A^NCP (PDB ID 3AN2). At present, we cannot definitively conclude whether the stabilization of the termini is due to CENP-N binding to the CENP-A nucleosome, as we have not yet been able to obtain a high-resolution EM structure of the CENP-A nucleosome in isolation for comparison. It is possible that the cryogenic conditions used for our structural work stabilize a specific conformation of the complex.

In most organisms, centromere identity is not specified by the centromere’s DNA sequence, but rather by the enrichment of CENP-A at a defined chromatin domain. De novo formation of stably inherited centromeres at previously non-centromeric sites (neo-centromeres) provides clear evidence in favor of this idea. Thus, rather than being genetically (i.e. DNA-sequence) specified, centromeres are epigenetically specified, with the pre-existing enrichment of CENP-A being a necessary condition for continued deposition of new CENP-A at the same site through the generations. There is therefore considerable interest in the molecular mechanisms that promote new CENP-A deposition at centromeres during the cell cycle, and in the mechanisms that promote the stabilization and persistence of CENP-A after its incorporation at centromeres.

Conserved machinery for new CENP-A deposition, including the specialized CENP-A chaperone HJURP (Scm3 in S. cerevisiae) and an adaptor complex consisting of the Mis18 and M18BP1 subunits, has been described in recent years (Dunleavy et al., 2009; Foltz et al., 2009; Fujita et al., 2007; Hayashi et al., 2004; Pidoux et al., 2009; Sanchez-Pulido et al., 2009; Williams et al., 2009). Additional machinery, in particular chromatin remodelling enzymes harnessing ATP hydrolysis to evict H3, is likely involved in the reaction but has not been univocally identified. This machinery is recruited to centromeres early during the cell-cycle and is believed to promote the replacement of histone H3 with new CENP-A (Dunleavy et al., 2011; Jansen et al., 2007; Schuh et al., 2007). Likely, the existing CENP-A nucleosome acts as a template in this reaction, as the abundance of CENP-A nucleosomes at a given centromere is, at least in first approximation, constant through subsequent cell divisions (French et al., 2017; Hori et al., 2017; Jansen et al., 2007). This implies that the same number of new CENP-A nucleosomes is incorporated after each cell division as that of originally present CENP-A nucleosomes, suggesting that the deposition machinery targets, for H3 eviction and replacement with CENP-A, an H3 nucleosome that is likely in close proximity of the CENP-A nucleosome (Musacchio and Desai, 2017). While the mechanistic details of CENP-A deposition remain partly unclear, there is now substantial evidence that recruitment of the CENP-A deposition machinery requires CENP-C and possibly other CCAN factors (Dambacher et al., 2012; Moree et al., 2011; Shono et al., 2015). While CENP-N has not been directly implicated in CENP-A deposition, our observation that CENP-N occupies a region of the nucleosome required for binding by chromatin remodelling enzymes of the SWI2/SNF2 family suggests that CENP-N may protect centromeric nucleosomes from remodelling and eviction, thereby contributing to its stability. Indeed, both CENP-C and CENP-N contribute to the stabilization of newly incorporated CENP-A in centromeric chromatin (Guo et al., 2017). New CENP-N deposition at centromeres occurs in late S phase (Fang et al., 2015; Hellwig et al., 2011), and may trigger a stabilization of centromere organization required for successful kinetochore assembly.

In summary, our analysis of the mechanisms of the interactions of the CENP-NL complex with CENP-A and CENP-C represents a step forward in the molecular dissection of the almost universally conserved functions of CENP-A in eukaryotes, which are required for accurate chromosome segregation and, more generally, for the success of cell division and the propagation of life.

Share this article

Cite this article

The interaction of CENP-N with nucleosomes.

Crystal structure of the CENP-A-binding region of CENP-N.

X-ray data collection and refinement statistics

The CENP-N:CENP-ANCP complex.

EM data collection, processing, and refinement statistics

Validation of the CENP-N:CENP-ANCP complex.

Identification of a CENP-N binding site on CENP-C.

Effective CENP-N localization requires CENP-C.

Author details

Satyakrishna Pentakota

Contribution

Contributed equally with

Competing interests

Keda Zhou

Contribution

Contributed equally with

For correspondence

Competing interests

Charlotte Smith

Contribution

For correspondence

Competing interests

Stefano Maffini

Contribution

For correspondence

Competing interests

Arsen Petrovic

Contribution

For correspondence

Competing interests

Garry P Morgan

Contribution

For correspondence

Competing interests

John R Weir

Present address

Contribution

Competing interests

Ingrid R Vetter

Contribution

Competing interests

Andrea Musacchio

Contribution

For correspondence

Competing interests

Karolin Luger

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism

The CENP-N:CENP-A^NCP complex.

Validation of the CENP-N:CENP-A^NCP complex.