Holocentromeres are dispersed point centromeres localized at transcription factor hotspots

Abstract

Centromeres vary greatly in size and sequence composition, ranging from ‘point’ centromeres with a single cenH3-containing nucleosome to ‘regional’ centromeres embedded in tandemly repeated sequences to holocentromeres that extend along the length of entire chromosomes. Point centromeres are defined by sequence, whereas regional and holocentromeres are epigenetically defined by the location of cenH3-containing nucleosomes. In this study, we show that Caenorhabditis elegans holocentromeres are organized as dispersed but discretely localized point centromeres, each forming a single cenH3-containing nucleosome. These centromeric sites co-localize with kinetochore components, and their occupancy is dependent on the cenH3 loading machinery. These sites coincide with non-specific binding sites for multiple transcription factors (‘HOT’ sites), which become occupied when cenH3 is lost. Our results show that the point centromere is the basic unit of holocentric organization in support of the classical polycentric model for holocentromeres, and provide a mechanistic basis for understanding how centromeric chromatin might be maintained.

https://doi.org/10.7554/eLife.02025.001

eLife digest

During cell division, the chromosomes in the original cell must be replicated and these ‘sister chromosomes’ must then be divided equally between the two new daughter cells. At first, the sister chromosomes are held together near a region called the centromere, which is important because the microtubules that pull the sister chromosomes apart attach themselves to the centromere. In many cases, the centromere is a small region near the middle of the chromosomes, which produces a classic X shape. However, in some organisms centromeres span the entire length of the chromosomes. There are at least 13 plant and animal lineages with such holocentromeres.

Inside the nucleus of cells, DNA is wrapped around molecules called histones. There are five major families of histones, and histones belonging to one of these families—the H3 histones—are replaced by cenH3 variant histones at both conventional centromeres and holocentromeres. There are many unanswered questions about holocentromeres. In particular, do holocentromeres truly extend along the full length of the chromosomes, or are they found at a large number of specific sites?

Now Steiner and Henikoff have studied the distribution of cenH3 in the genome of the worm C. elegans to investigate holocentromeres in greater detail. These experiments showed that the holocentromere in C. elegans is actually made of about 700 individual centromeric sites distributed along the length of the chromosomes. Each of these sites contains just one nucleosome that contains cenH3, and these sites are likely to be the sites that microtubules attach to during cell division. Surprisingly, the same sites can also act as so-called ‘HOT–sites’: these sites are bound by many proteins that are involved in regulating the process by which genes are expressed as proteins, which suggests a link between centromeres and these regulatory proteins.

The work of Steiner and Henikoff describes how centromeric nucleosomes are distributed across the genome, but why and how cenH3 ends up at these particular 700 sites remains an open question.

https://doi.org/10.7554/eLife.02025.002

Introduction

The centromere is a defining feature of eukaryotic chromosomes and is essential for the segregation of chromosomes during cell division, as it organizes the proteinaceous kinetochore for attachment to the spindle apparatus at mitosis. Centromeres are universally marked by the variant histone cenH3 (also called CENP-A in many organisms) that replaces canonical histone H3 in centromeric nucleosomes, and most commonly localize to a single position along the chromosome (Malik and Henikoff, 2009). However, the DNA on which centromeric nucleosomes assemble is not conserved and varies greatly in size and composition. It ranges from genetically defined point centromeres that assemble a single cenH3-containing nucleosome to epigenetically defined regional centromeres of several kb or Mb of tandemly repeated DNA to holocentromeres that extend along the length of entire chromosomes. With the exception of budding yeast point centromeres, where there is a 1:1 relationship between a single cenH3 nucleosome and the functional centromere, the precise organization of centromeric chromatin has remained elusive. One of the main obstacles to uncovering the distribution of centromeric nucleosomes is that most regional centromeres are localized to homogeneous tandemly repetitive regions of the genome, making it difficult to map individual nucleosomes. C. elegans is well suited to addressing this question because its genome is repeat-poor, making it possible to precisely map centromeric regions.

Classical cytogenetic observations have demonstrated that C. elegans chromosomes are holocentric, whereby mitotic spindle fibers attach along the length of chromosomes and pull them to the poles as straight bars rather than from a single position that defines the more familiar monocentric chromosomes (Schrader, 1935; Albertson and Thomson, 1982). Two models have been put forward for how holocentric chromosomes might be organized (Schrader, 1947). The ‘diffuse centromere’ model predicts that the centromere is truly distributed along the length of the chromosomes, and that spindle fiber attachments form randomly. The ‘polycentromere’ model predicts that there are a number of discrete sites dispersed along the chromosomes, creating a holocentric appearance when observed at cytological resolution (Figure 1A).

Figure 1 (with 3 supplements)

Genome-wide distribution of cenH3.

(A) Classic holocentromere models proposed by Schrader (Schrader, 1947): diffuse and polycentric holocentromeres. The diffuse model predicts full centromere coverage of the chromosomes. The polycentric model predicts discrete centromeric sites that together give the appearance of holocentricity. (B) Genome browser view of 525 kb on Chr I for cenH3 X-ChIP-chip (Gassmann et al., 2012), cenH3 N-ChIP-seq (this study), H3.3 N-ChIP-chip (Ooi et al., 2010) and H3K9me3 X-ChIP-chip (Liu et al., 2011), showing the close correspondence of cenH3 N-ChIP and X-ChIP signals for domains, but not peaks, and the positive correlation of cenH3 signal with H3K9me3 signal and the negative correlation of cenH3 signal with H3.3 signal. Log₂ ratios of IP and input are shown to enable comparison between microarray and sequencing data. CenH3 peaks are marked by asterisks. (C) Genome browser view of cenH3 N-ChIP-seq (this study) and cenH3 X-ChIP-chip (Gassmann et al., 2012) at two representative cenH3 peaks marked in (B) with red boxes. (D) Number and genomic distribution of cenH3 peaks called per chromosome. See also Figure 1—source data 1. (E) Heatmaps and average plots of input and cenH3 N-ChIP signal within a 2-kb window around all 707 cenH3 peaks. Each line of the heatmaps represents an individual cenH3 site. The heatmaps are sorted from high to low cenH3 signal.

https://doi.org/10.7554/eLife.02025.003

Figure 1—source data 1. Genomic coordinates of cenH3 peaks. https://doi.org/10.7554/eLife.02025.004

Consistent with either model, C. elegans cenH3 (also called HCP-3) localizes to a characteristic band along the length of the chromosome at mitosis (Buchwitz et al., 1999). A previous study mapped C. elegans cenH3 using a microarray-based approach and found that it occupies ∼2900 broad domains that account for about half of the genome, but that there was only enough cenH3 to cover 4% of the genome, suggesting that cenH3 nucleosomes assemble at random positions within the domains (Gassmann et al., 2012). These findings thus seemingly supported the diffuse centromere model. However, mitotic microtubules must attach to discrete sites for chromosome segregation, and the number of microtubules attached during C. elegans mitosis has been estimated at about 100 for all six chromosomes combined (O’Toole et al., 2003). This left open the question of the relationship between these diffuse domains and discrete microtubule attachment sites.

To identify potential kinetochore attachment sites in C. elegans, we profiled cenH3 nucleosomes with single base-pair resolution. While we observed domains of low occupancy similar to those described in the earlier study, we also discovered discrete sites of much higher cenH3 occupancy that are distributed independently of the domains. Depletion of the machinery needed for incorporation of cenH3 nucleosomes resulted in reduced occupancy of cenH3 at these sites. As an independent indicator of centromeric localization, we also profiled the inner kinetochore protein CENP-C (also called HCP-4 in C. elegans). We found that cenH3 sites coincide with high CENP-C signal, indicating that they serve as attachment sites for the kinetochore, consistent with a polycentric organization of the chromosome. Individual sites resemble budding yeast point centromeres and coincide with transcription factor hotspots, which become occupied by transcription factors when cenH3 is lost, providing a clue as to how kinetochore sites might be selected and maintained.

Results

CenH3 levels are high in domains of low nucleosome turnover

To precisely localize cenH3-containing nucleosomes and identify potential kinetochore attachment sites, we digested chromatin from mixed-stage embryos with micrococcal nuclease (MNase) and solubilized the majority of the chromatin by cavitation, a method adapted from Jin and Felsenfeld, 2007 (Figure 1—figure supplement 1). We subsequently performed native chromatin immunoprecipitation (ChIP) of cenH3 from the soluble chromatin, followed by paired-end sequencing (N-ChIP-seq), resulting in single base-pair resolution maps of cenH3-associated DNA fragments.

As expected from a previous ChIP-microarray map of C. elegans cenH3 using formaldehyde crosslinking (X-ChIP-chip) (Gassmann et al., 2012), we found that cenH3 is broadly distributed throughout the genome. CenH3 is enriched towards the arms relative to the centers of the autosomes, while the distribution on the X chromosome is relatively even (Figure 1—figure supplement 2A). In C. elegans, chromosome arms tend to be enriched for repeats and are associated with marks of heterochromatin (The C. elegans Sequencing Consortium, 1998; Liu et al., 2011). Indeed, the distribution of cenH3 is positively correlated with the distribution of trimethylation of lysine 9 on histone H3 (H3K9me3), a mark of transcriptionally silent regions, in both our ChIP-seq and the previously published ChIP–chip data (Figure 1—figure supplement 2B, left panels; r = 0.44, p<2.2 × 10⁻¹⁶ for correlation with cenH3 X-ChIP and r = 0.31, p<2.2 × 10⁻¹⁶ for correlation with N-ChIP) (Gu and Fire, 2010; Liu et al., 2011). This is consistent with previous findings that have associated H3K9 methylation with the acquisition of cenH3 in fission yeast (Folco et al., 2008; Kagansky et al., 2009). We therefore wondered if the enrichment of cenH3 is associated with lower nucleosome turnover. The replication-independent variant histone H3.3 is incorporated into chromatin when nucleosomes are replaced and serves as a measure of replication-independent nucleosome turnover (Ahmad and Henikoff, 2002; Mito et al., 2005; Goldberg et al., 2010; Ooi et al., 2010). Consistent with the hypothesis that cenH3 is associated with lower nucleosome turnover, we found that the distributions of H3.3 and cenH3 are negatively correlated in both our ChIP-seq and the previously published ChIP–chip data (Figure 1—figure supplement 2B, center panels; r = −0.65, p<2.2 × 10⁻¹⁶ for correlation with cenH3 X-ChIP and r = −0.21, p<2.2 × 10⁻¹⁶ for correlation with N-ChIP). Replication-independent nucleosome turnover is mainly driven by transcription (Deal et al., 2010; Teves and Henikoff, 2011), consistent with the previously described anti-correlation between cenH3 and RNA polymerase II (Gassmann et al., 2012).
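
The correlations reported above compare signal tracks across the genome. As a minimal sketch of that kind of comparison, assuming each track has already been summarized as per-bin counts, log₂ ratios of IP over input can be computed and compared with Pearson's r. The function names, the pseudocount and the toy counts are illustrative assumptions and are not taken from the study's methods.

```python
import math


def log2_ratio(ip, control, pseudocount=1.0):
    """Per-bin log2(IP / input).

    The pseudocount is an assumption made here to avoid log(0);
    the study does not state how zero-count bins were handled.
    """
    return [math.log2((a + pseudocount) / (b + pseudocount))
            for a, b in zip(ip, control)]


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant track")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical per-bin counts, for illustration only.
cenh3 = log2_ratio([30, 12, 4, 25], [10, 10, 10, 10])
h33 = log2_ratio([3, 9, 20, 5], [10, 10, 10, 10])
print(round(pearson_r(cenh3, h33), 2))
```

Working on log₂ ratios rather than raw counts mirrors the way the browser tracks in Figure 1B are displayed, which makes microarray and sequencing data comparable.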

CenH3 has been found to be localized to 10–12 kb wide domains that occupy about half of the genome (Figure 1B, first track) (Gassmann et al., 2012). Despite the use of a different methodology (low-salt native chromatin preparation and MNase digestion instead of formaldehyde fixation and sonication, a different antibody and ChIP-seq instead of ChIP–chip), our data showed a very similar domain pattern (Figure 1B, second track, Figure 1—figure supplement 2B, right panel; correlation with cenH3 X-ChIP r = 0.67, p<2.2 × 10⁻¹⁶). The previously published data (Gassmann et al., 2012) pointed to an anti-correlation of cenH3 occupancy with transcription in the germline and in the early embryo. Because there is little RNA Polymerase II activity in the early embryo, and transcriptional profiles are not directly transmitted during the maternal–zygotic transition, the authors proposed that it is the memory of germline transcription transmitted to the embryo that excludes cenH3 incorporation. We found that the domains were both negatively correlated with previously published H3.3 data (Figure 1B, third track) and positively correlated with previously published H3K9me3 patterns (Figure 1B, fourth track) (Ooi et al., 2010; Liu et al., 2011). H3.3 is abundantly incorporated into chromatin in the germline and throughout early embryonic development (Ooi et al., 2010), whereas H3K9me3 is associated with transcriptionally silent chromatin where nucleosome turnover and H3.3 incorporation are low (Ahmad and Henikoff, 2002; Mito et al., 2005; Gu and Fire, 2010; Ooi et al., 2010; Liu et al., 2011). This suggests that it is nucleosome turnover that excludes the deposition of cenH3 and shapes the domain-like distribution of cenH3 across the genome, which can happen even in the absence of transcription in the early embryo.

High-resolution mapping of cenH3 reveals discrete high-occupancy sites

The levels of cenH3 within a cell only allow for the occupancy of 4% of the genome, and each cenH3 domain can therefore only contain a limited number of cenH3 nucleosomes per cell (Gassmann et al., 2012). The large-scale correspondence of our native ChIP-seq data to the previously published crosslinked ChIP–chip data provides independent confirmation of this interpretation. However, we wondered whether there might also be preferred sites of centromeric nucleosome positioning within the cenH3 domains of C. elegans, as predicted by the polycentromere model. These sites would appear as sites of high cenH3 occupancy in a population average. We indeed found that cenH3 was highly enriched at discrete, dispersed loci (Figure 1B,C). As these loci appeared as very well-defined peaks, we removed background by subtracting the input signal and considered sites with 30 or more normalized counts (equivalent to the mean plus 7 times the standard deviation of the genome-wide signal) in at least one of two biological replicates as positive. We identified about 100 cenH3 peaks on each chromosome (707 peaks total; Figure 1D), with the distance between peaks ranging from 290 bp to 1.9 Mb (median 83 kb; Figure 1—figure supplement 3A, Figure 1—source data 1). Averaging the signal around all 707 cenH3 peaks yielded a single centered peak in the cenH3 ChIP data that is highly enriched compared to input (Figure 1E). The peaks were enriched in gene-poor regions of the genome (Figure 1—figure supplement 3B). To our surprise, 607 out of 707 of these peaks resided outside of the domains described previously (Gassmann et al., 2012) and corresponded to sites of only slight local cenH3 enrichment in the X-ChIP data (Figure 1—figure supplement 3C). In the N-ChIP data, cenH3 occupancy was much higher at these peaks compared to the domains (Figure 1—figure supplement 3D). We hypothesized that these sites are preferred for the deposition of centromeric nucleosomes and serve as potential kinetochore attachment sites.
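
The peak-calling rule described above (subtract input, then keep sites with at least 30 normalized counts in at least one of two replicates) can be sketched as follows. This is an illustration under stated assumptions rather than the pipeline used in the study: the per-position list representation, the merging of adjacent positive positions into intervals, the use of the population standard deviation and all function names are hypothetical.

```python
import statistics


def threshold_from_background(signal, n_sd=7):
    """Mean plus n_sd standard deviations of a genome-wide signal.

    Uses the population standard deviation, which is an assumption here.
    """
    return statistics.mean(signal) + n_sd * statistics.pstdev(signal)


def call_peaks(chip_replicates, input_replicates, threshold=30.0):
    """Return (start, end, summit) tuples, with end exclusive.

    A position is positive when ChIP minus input reaches the threshold
    in at least one replicate; adjacent positive positions are merged.
    """
    # Background-subtracted signal for each replicate.
    diffs = [[c - i for c, i in zip(chip, inp)]
             for chip, inp in zip(chip_replicates, input_replicates)]
    n = len(diffs[0])
    # Taking the maximum over replicates implements "at least one replicate".
    best = [max(d[pos] for d in diffs) for pos in range(n)]

    peaks = []
    start = None
    for pos in range(n + 1):
        positive = pos < n and best[pos] >= threshold
        if positive and start is None:
            start = pos
        elif not positive and start is not None:
            summit = max(range(start, pos), key=best.__getitem__)
            peaks.append((start, pos, summit))
            start = None
    return peaks


# Hypothetical normalized counts for two replicates, for illustration only.
chip = [[0, 5, 40, 50, 10, 0], [0, 0, 10, 20, 0, 35]]
inputs = [[0, 0, 5, 5, 0, 0], [0, 0, 0, 0, 0, 0]]
print(call_peaks(chip, inputs))
```

A fixed count threshold is simple to apply because the peaks are so well defined; `threshold_from_background` shows how a cutoff of the form "mean plus 7 standard deviations" could be derived from a background-subtracted track.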

cenH3 peaks are hyper-sensitive to MNase digestion

Previous studies in other organisms observed that centromeric regions were sensitive to MNase or resulted in patterns inconsistent with the presence of canonical nucleosomes (Polizzi and Clarke, 1991; Takahashi et al., 1992; Dalal et al., 2007; Krassovsky et al., 2012). We found that the cenH3 peaks were sensitive to MNase and disappeared with progressive MNase digestion (Figure 2A,B), even at MNase conditions where nucleosome arrays remain intact and that would be considered underdigested by most standards (Figure 2—figure supplement 1, left panel). In contrast, the chromatin features around the cenH3 peaks were remarkably unaffected (Figure 2A, Figure 2—figure supplement 2). As a control, we compared the cenH3 peaks to the +1 nucleosomes at transcription start sites (Chen et al., 2013). Occupancy of these well-positioned nucleosomes also decreased with progressive MNase digestion, but to a lesser extent (Figure 2C). We quantified MNase sensitivity at cenH3 peaks, at the nucleosomes immediately flanking the cenH3 peaks, and at the +1 nucleosomes by dividing the occupancy of these features at each time point by the occupancy at the first time point. This analysis revealed that cenH3 nucleosomes were more sensitive to MNase than both flanking and +1 nucleosomes (Figure 2D). These findings suggest that the sites of high cenH3 occupancy contain nucleosomes with properties similar to those of centromeric nucleosomes in other organisms.
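
The sensitivity measure described above is a simple normalization to the first time point. A minimal sketch, assuming occupancy has been summarized as one number per feature and digestion time; the function name and the occupancy values are hypothetical:

```python
def relative_occupancy(occupancy_by_time):
    """Divide the occupancy at each time point by the occupancy at the first.

    occupancy_by_time maps MNase digestion time (min) to mean occupancy.
    """
    times = sorted(occupancy_by_time)
    baseline = occupancy_by_time[times[0]]
    if baseline == 0:
        raise ValueError("occupancy at the first time point must be non-zero")
    return {t: occupancy_by_time[t] / baseline for t in times}


# Hypothetical occupancies after 1, 2, 5 and 10 min of digestion.
cenh3_nuc = relative_occupancy({1: 100.0, 2: 50.0, 5: 25.0, 10: 10.0})
plus1_nuc = relative_occupancy({1: 80.0, 2: 72.0, 5: 60.0, 10: 48.0})

# A feature that loses relative occupancy faster is more MNase sensitive.
for t in cenh3_nuc:
    print(t, cenh3_nuc[t], plus1_nuc[t])
```

Normalizing each feature to its own starting occupancy allows features with very different absolute signal, such as cenH3 peaks and +1 nucleosomes, to be compared on one plot.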

Figure 2 (with 2 supplements)

CenH3 peaks are especially MNase sensitive.

(A) Genome browser view of input chromatin and cenH3 ChIP signal within a 25-kb window surrounding two representative cenH3 peaks. Tracks for occupancy after 1 min, 2 min, 5 min and 10 min of MNase digestion are shown. (B) Average input signal (left) and cenH3 ChIP signal (right) within a 1-kb window around all 707 cenH3 sites after the indicated MNase digestion intervals. The dashed red line indicates the midpoint of the cenH3 nucleosome and the dashed black lines indicate the midpoints of the flanking nucleosomes. (C) Average input signal within a 1-kb window around 7043 transcriptional start sites (TSS) after the indicated MNase digestion intervals. The dashed green line indicates the midpoint of the +1 nucleosome. TSS were defined by Chen et al. (Chen et al., 2013). (D) MNase sensitivity plot for the cenH3 nucleosome and the flanking nucleosomes shown in (B) and the +1 nucleosome shown in (C). The occupancy of these nucleosomes at each MNase digestion time point was divided by the occupancy at the first time point. N = 707 (cenH3 nuc), 1414 (flanking nucs), 7043 (+1 nuc).

https://doi.org/10.7554/eLife.02025.008

CenH3 site occupancy depends on cenH3 loading

Incorporation of cenH3 into chromatin depends on the kinetochore protein KNL-2 (Maddox et al., 2007). To test if the signal at the cenH3 peaks results from KNL-2-dependent incorporation of cenH3, we analyzed the chromatin upon KNL-2 knockdown. KNL-2 depletion by RNAi led to an embryonic lethal phenotype with 99 ± 1% penetrance (n = 8). We confirmed by microscopy that this was caused by chromosome segregation defects and found by immunofluorescence that the cenH3 signal became undetectable in embryos. These results were consistent with published findings (Maddox et al., 2007) and suggested that our depletion of KNL-2 successfully reduced the presence of cenH3 and thus the functionality of centromeres. ChIP experiments revealed that cenH3 occupancy at the cenH3 sites was much reduced in knl-2(RNAi) embryos compared to wildtype (Figure 3) for similar levels of MNase digestion (Figure 2—figure supplement 1, center panel). This effect extended genome wide (Figure 3—figure supplement 1A) and was not caused by changes in the input chromatin, as the overall occupancy and positioning of most canonical nucleosomes and other DNA binding factors remained unchanged in knl-2(RNAi) embryos (Figure 3—figure supplement 1B), and the input chromatin showed the same correlation with wildtype chromatin as between wildtype replicates (R = 0.971 and 0.969 for wildtype vs knl-2(RNAi) and R = 0.973 for wildtype vs wildtype; comparison of normalized fragment counts in 10-bp bins, N = 7633808).

Figure 3 (with 1 supplement)

CenH3 peaks depend on cenH3 loading.

(A) Genome browser view of cenH3 ChIP in wildtype and knl-2(RNAi) embryos, with two enlarged cenH3 peaks. KNL-2 is required for cenH3 loading onto chromatin. Differences between ChIP and input are shown. CenH3 peaks are marked by asterisks. (B) Average cenH3 ChIP signal within a 2-kb window around all 707 cenH3 sites in wildtype and knl-2(RNAi) embryos. Differences between ChIP and input are plotted. (C) Heatmap of difference in cenH3 ChIP signal between wildtype and knl-2(RNAi). Each line of the heatmap represents an individual cenH3 site. The heatmap is sorted by decreasing difference between wildtype and knl-2(RNAi).

https://doi.org/10.7554/eLife.02025.011

It is conceivable that with a partial depletion of the factor required for cenH3-assembly into chromatin, relatively high cenH3 occupancy is maintained at the sites that are functional due to perdurance of the protein. Indeed, cenH3 is still locally enriched at cenH3 sites in the knockdown, albeit at much reduced levels, while the broad domains of weak enrichment are lost (Figure 3A, Figure 3—figure supplement 1). These results indicate that the signal at the identified cenH3 sites indeed depends on the incorporation of cenH3 into centromeric nucleosomes by the cenH3-specific assembly machinery.

CenH3 peaks correspond to kinetochore sites

CenH3 can be incorporated at low levels into nucleosomes away from the centromeres (Camahort et al., 2009; Lefrancois et al., 2009; Lopes da Rosa et al., 2011; Krassovsky et al., 2012; Lefrancois et al., 2013; Lacoste et al., 2014), so the presence of cenH3 is not by itself a sufficient measure for the presence of a centromere. To test if the cenH3 peaks indeed correspond to centromeric sites, we compared them with markers of the kinetochore. Previous findings from our lab have suggested that in budding yeast the chromatin fraction that remains insoluble under native conditions after MNase digestion is strongly enriched for kinetochore complexes (Krassovsky et al., 2012). As there is a one-to-one relationship between the centromere and the kinetochore in budding yeast, and the kinetochore components are conserved between eukaryotes, it can be inferred that kinetochore-bound chromatin remains mostly insoluble under these conditions. We therefore analyzed the distribution of the MNase fragments associated with the chromatin fraction that remained insoluble after MNase digestion and needle extraction and found peaks corresponding to each cenH3 peak (Figure 4A,B). This suggested the presence of insoluble complexes, potentially kinetochores, at every cenH3 site in at least part of the cell population analyzed. The insoluble chromatin signal is reduced in knl-2(RNAi) embryos (Figure 4—figure supplement 1A,B), supporting the interpretation that these peaks correspond to kinetochores. Interestingly, the peaks in the insoluble chromatin were more resistant to MNase than cenH3 nucleosomes in the soluble fraction, and the insoluble chromatin signal persisted even after 10 min of MNase digestion (Figure 4A,B), suggesting that the proteins that render cenH3 chromatin insoluble during extraction help to protect it from nuclease digestion.

Figure 4 (with 2 supplements)

CenH3 sites are bound by the kinetochore.

(A) Genome browser view of cenH3 ChIP (native, 2 min MNase), insoluble chromatin (native, 2 min and 10 min MNase), and CENP-C ChIP (formaldehyde-crosslinked, 2 min MNase) signal at two representative cenH3 sites. (B and C) Heatmaps and average plots of insoluble chromatin signal after 2-min and 10-min MNase digestion (B) and CENP-C ChIP signal (C) within a 2-kb window around all 707 cenH3 sites. Each line of the heatmaps represents an individual cenH3 site. Heatmaps are sorted by decreasing signal.

https://doi.org/10.7554/eLife.02025.013

To test directly if the cenH3 peaks coincide with kinetochore attachment sites, we performed ChIP on the inner kinetochore protein CENP-C. This protein binds cenH3 and is required for the assembly of the kinetochore complex that links centromeric chromatin to the microtubule (Moore and Roth, 2001; Oegema et al., 2001; Cheeseman et al., 2004; Carroll et al., 2010; Kato et al., 2013). In fact, CENP-C can organize the entire functional kinetochore in the absence of cenH3 (Gascoigne et al., 2011; Przewloka et al., 2011; Hori et al., 2013). Although CENP-C remains with centromeric DNA throughout the cell cycle in other organisms, in C. elegans CENP-C localizes to chromatin only during mitosis, but not interphase, when it is in the cytoplasm (Moore and Roth, 2001). As a consequence, only a fraction of the cells analyzed contains CENP-C on chromatin, which limits the dynamic range of ChIP signal that is achievable. Because the kinetochore complex is highly insoluble, no DNA was recovered in native ChIP of CENP-C (data not shown). We therefore profiled CENP-C using MNase followed by formaldehyde crosslinking and solubilization with SDS. We found that CENP-C is enriched at cenH3 sites (Figure 4A,C). Neither the signal in the insoluble fraction nor the signal for CENP-C was enriched over the previously identified cenH3 domains compared to the rest of the genome (Figure 4—figure supplement 2A,B). Despite the presence of non-centromeric enrichment in the insoluble chromatin fraction and the lower dynamic range of the CENP-C ChIP data, we called peaks in these two data sets and compared them to the cenH3 peak calls. 460 of the 2060 insoluble chromatin peaks and 163 of the 347 CENP-C peaks coincided with cenH3 peaks. In contrast, only 174 insoluble chromatin peaks and 26 CENP-C peaks fell within the domains, in both cases fewer sites than expected by chance (p<0.001). Normalized to the genome coverage of domains and peaks, this amounts to an 800-fold enrichment of insoluble chromatin peaks and an almost 2000-fold enrichment of CENP-C peaks at cenH3 peaks compared to cenH3 domains (Figure 4—figure supplement 2C,D). These results indicate that the cenH3 peaks identified in this study act as the preferred sites of kinetochore formation.
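
One way to read "normalized to the genome coverage" is as a ratio of hit densities: the number of overlapping peaks in each feature class divided by the fraction of the genome that class covers. The sketch below illustrates that reading only. The function name is hypothetical, the exact normalization used in the study is not given in this section, and the numbers in the example are toy values rather than the study's coverage figures.

```python
def fold_enrichment(hits_a, coverage_a, hits_b, coverage_b):
    """Ratio of hit densities, (hits_a / coverage_a) / (hits_b / coverage_b).

    Coverage values are fractions of the genome (or base pairs, as long as
    both use the same unit) occupied by feature classes A and B.
    """
    if coverage_a <= 0 or coverage_b <= 0:
        raise ValueError("coverage must be positive")
    if hits_b <= 0:
        raise ValueError("hits_b must be positive to form a ratio")
    return (hits_a / coverage_a) / (hits_b / coverage_b)


# Toy values: 10 hits in a class covering 0.1% of the genome versus
# 5 hits in a class covering 50% of the genome.
print(fold_enrichment(10, 0.001, 5, 0.5))
```

Because the cenH3 peaks cover a far smaller part of the genome than the domains, even similar raw overlap counts translate into a very large difference in density.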

The precise co-localization of cenH3, CENP-C and insoluble chromatin peaks confirms that these sites correspond to centromeres. Moreover, the number of sites is of the same order of magnitude as the number of microtubules observed during mitosis, thus providing a mechanistically reasonable resolution of the conundrum of how domains that cover half of the genome can organize a relatively small number of microtubule attachment sites.

The cenH3 peaks resemble point centromeres

Centromeric nucleosomes in other organisms protect only 80–120 bp of wrapped DNA, compared to 147 bp for canonical nucleosomes (Dalal et al., 2007; Krassovsky et al., 2012; Hasson et al., 2013; Zhang et al., 2013), probably due to the reduced wrapping of DNA around them (Henikoff and Furuyama, 2012). To examine the size and positioning of the nucleosomes at the 707 centromeric sites, we divided the fragments in the input and cenH3 ChIP samples into size classes of fragments >140 bp representing nucleosomes, and ≤140 bp representing sub-nucleosome-sized particles. In the input sample, we found that two well-positioned nucleosomes flank the cenH3 peaks (Figure 5A). These nucleosomes are also visible in modENCODE data for mononucleosomes prepared under native conditions, but not upon formaldehyde-crosslinking, presumably because they become crosslinked to the centromere (Figure 5—figure supplement 1). The native input sample also revealed the presence of sub-nucleosome-size fragments over the center of the cenH3 sites, while few of these fragments were found in the flanking regions (Figure 5A). In the cenH3 ChIP sample, only relatively few nucleosome-size fragments were recovered, while the majority of the signal came from fragments ≤140 bp (Figure 5B). The insoluble chromatin showed a similar pattern, indicating that the particles bound to DNA at cenH3 sites are similar in the insoluble and in the soluble chromatin fractions (Figure 5C). This analysis showed that centromeric sites consist of two well-positioned nucleosomes flanking a single cenH3 nucleosome that wraps less DNA than is wrapped by a canonical nucleosome.
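
The size-class split described above can be sketched as follows, using the class boundaries given in the Figure 5 legend (21–140 bp and 141–500 bp). Everything else is an assumption made for illustration: fragments are represented as half-open (start, end) pairs on one chromosome, signal is tallied at fragment midpoints, and the function names are hypothetical.

```python
def size_class(length):
    """Assign a paired-end fragment length to a size class."""
    if 21 <= length <= 140:
        return "small"
    if 141 <= length <= 500:
        return "nucleosomal"
    return None  # outside both classes; ignored


def midpoint_profile(fragments, site, half_window=500):
    """Count fragment midpoints per size class at each offset from a site.

    Index 0 of each list corresponds to offset -half_window and the last
    index to offset +half_window.
    """
    width = 2 * half_window + 1
    profile = {"small": [0] * width, "nucleosomal": [0] * width}
    for start, end in fragments:
        cls = size_class(end - start)
        if cls is None:
            continue
        offset = (start + end) // 2 - site
        if -half_window <= offset <= half_window:
            profile[cls][offset + half_window] += 1
    return profile


# Hypothetical fragments around a site at position 500.
frags = [(460, 540), (300, 466), (0, 10)]
prof = midpoint_profile(frags, site=500, half_window=200)
print(sum(prof["small"]), sum(prof["nucleosomal"]))
```

Averaging such profiles over all sites gives one curve per size class, which is the kind of plot needed to see a small central particle flanked by two nucleosome-size peaks.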

Figure 5 (with 3 supplements)

CenH3 nucleosomes protect small DNA fragments.

(A, B, C) Normalized fragment counts in input (A), cenH3 ChIP (B) and insoluble chromatin (C) samples at centromeric sites. MNase fragments were divided into nucleosomal (141–500 bp) and small (21–140 bp) size classes. Average signals within a 1-kb window around all 707 cenH3 sites are plotted. Dashed lines mark the centers of the flanking nucleosomes in (A) (black lines) or the centromeric nucleosome in (B) (blue line) and in (C) (magenta line). (D) Cartoon illustrating how MNase fragment size distributions shown in (E and F) were determined. Fragments that cross the center of the cenH3 nucleosome or of the flanking nucleosomes were counted. (E and F) MNase fragment size distribution after 2 min (E) and 10 min (F) of MNase digestion. Input fragments at flanking nucleosomes (black) and cenH3 ChIP fragments (blue) and insoluble chromatin fragments (magenta) at centromeric nucleosomes are shown. Cartoons of the protected particles are shown below each panel. (G and H) Comparison of worm holocentromere and budding yeast point centromere. (G) C. elegans holocentromere. Centromere model and cenH3 ChIP over input ratio (all size classes; left y-axis) and nucleosomal signal from input (141–500 bp; right y-axis). Average signals within a 1-kb window around all 707 cenH3 sites are shown. (H) Budding yeast point centromere. Centromere model and data from Krassovsky et al. (Krassovsky et al., 2012), cenH3 ChIP over input ratio (left y-axis) and input signal (right y-axis) from all 16 centromeres.

https://doi.org/10.7554/eLife.02025.016

To estimate the size of the DNA associated with these nucleosomes, we plotted the size-distribution of fragments that cross the center of each particle (Figure 5D). In the input sample after 2 min of MNase digestion, the fragments that cross the dyad of the flanking nucleosomes show a distribution that peaks at 166 bp, consistent with canonical nucleosomes (Figure 5E, black line). The peak lies at 166 bp rather than the 147 bp minimal protected fragment size for mononucleosomes because of the relatively light MNase digestion required to prevent the loss of cenH3 peaks (Figure 2B). Progressive MNase digestion reduced the size of the DNA fragments protected by these particles in a manner expected for nucleosomes, analogous to the size pattern observed for bulk chromatin (Figure 5—figure supplement 2A). A second peak representing dinucleosomes is also visible at about 330 bp (Figure 5E, black line). The fragments that cross the center of the cenH3 nucleosomes in the cenH3 ChIP sample after 2 min of MNase digestion show a strikingly different distribution that is left-shifted and indicate that the nucleosomes that occupy these sites protect only about 60–120 bp (Figure 5E, blue line). The fragments in the insoluble chromatin fraction show a very similar distribution, supporting this size estimate (Figure 5E, magenta line). The dinucleosome peak in the cenH3 ChIP and insoluble chromatin, representing the centromeric nucleosome and one flanking canonical nucleosome, is equally left-shifted to about 260 bp (Figure 5E, blue and magenta lines). These shorter dinucleosome fragments, protected by two neighboring nucleosomes on the same molecule of DNA, can only be explained by the presence of a smaller centromeric nucleosome, because the inferred size estimates are not affected by possible MNase encroachment on the centromeric nucleosome. 
After 10 min of MNase digestion, the peak of the distribution of MNase fragments that cross the center of the cenH3 sites in the insoluble fractions lies at 66 bp (Figure 5F, magenta line), indicating that the minimal protected size of these particles lies in the 60–100 bp range. Moreover, the width at half-height of the average 10 min-digested insoluble chromatin peak that aligns with the cenH3 peaks is 82 bp (Figure 5—figure supplement 3). The inferred fragment size protected by the centromeric nucleosome is thus 60–100 bp. These results confirm the findings in other organisms that cenH3 nucleosomes at centromeric sites in C. elegans wrap less DNA than that wrapped by canonical nucleosomes.
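The size-distribution analysis described above can be illustrated with a minimal sketch. It assumes that fragments are given as inclusive (start, end) coordinates on a single chromosome and that particle centers are given as single positions; the function names `crossing_size_distribution` and `modal_size` are hypothetical and are not taken from the analysis pipeline used in this study.

```python
from bisect import bisect_left
from collections import Counter


def crossing_size_distribution(fragments, centers):
    """Tally lengths of fragments that span at least one particle center.

    fragments: iterable of (start, end) pairs, inclusive coordinates (assumed).
    centers:   iterable of center positions on the same chromosome (assumed).
    """
    centers = sorted(centers)
    sizes = Counter()
    for start, end in fragments:
        # Index of the first center at or to the right of the fragment start.
        i = bisect_left(centers, start)
        # The fragment crosses a center if that center is not past its end.
        if i < len(centers) and centers[i] <= end:
            sizes[end - start + 1] += 1
    return sizes


def modal_size(sizes):
    """Return the fragment length with the highest count (the peak)."""
    return max(sizes, key=sizes.get)


# Toy data: two 166-bp fragments over one center, one 81-bp fragment
# over another center, and one fragment that crosses no center.
toy_fragments = [(100, 265), (100, 265), (300, 380), (500, 600)]
toy_centers = [180, 340]
toy_sizes = crossing_size_distribution(toy_fragments, toy_centers)
```

Running the same tally separately for flanking-nucleosome centers in input and for centromeric-site centers in ChIP or insoluble chromatin would give distributions of the kind compared in Figure 5E and F.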

CenH3 ChIP also revealed mild enrichment over broad domains. Fragment size analysis revealed that the majority of cenH3 nucleosomes in these regions of the genome protect about 135–155 bp of DNA, and that this size distribution is similar in regions between domains (Figure 5—figure supplement 2B). This level of protection is consistent with the findings in other organisms that cenH3 can incorporate into canonical-type nucleosomes away from centromeres, in some cases as cenH3-H3.3 heterotypic nucleosomes (Camahort et al., 2009; Krassovsky et al., 2012; Lacoste et al., 2014). This further suggests that the cenH3 domains may be distributed independently of the centromere.

Taken together, fragment size analysis thus revealed that each centromeric site consists of a single cenH3-containing nucleosome that is flanked by two well-positioned canonical nucleosomes (Figure 5G). This chromatin landscape is reminiscent of the budding yeast centromere, where a single cenH3 nucleosome assembles on a genetically defined sequence (Furuyama and Biggins, 2007; Henikoff and Henikoff, 2012). This sequence is flanked by binding sites for centromere-specific protein complexes (Cbf1 and Cbf3) that in turn position two flanking canonical nucleosomes (Figure 5H) (Densmore et al., 1991; Krassovsky et al., 2012). The stable binding of Cbf1 and Cbf3 prevents the centromeric DNA from being occupied by canonical nucleosomes (Kent et al., 2011; Krassovsky et al., 2012). In C. elegans, the flanking nucleosomes are positioned closer together than in yeast, presumably because there are no sequence-specific DNA-binding proteins between the centromeric and the flanking nucleosomes.

Thus, the dispersed centromeric sites in C. elegans holocentromeres consist of a cenH3 nucleosome that is associated with 60–100 bp of DNA flanked by two well-positioned nucleosomes, a chromatin pattern with striking similarities to budding yeast point centromeres.

CenH3 peaks coincide with HOT sites

Budding yeast point centromeres assemble on a genetically defined sequence (Clarke and Carbon, 1980). To determine whether the C. elegans centromeric sites have common sequence properties, we searched for motifs using MEME (Bailey et al., 2009). We found a 15-nt GA-repeat-rich motif that was common to 297 of the 80-bp cores of the centromeric sites (Figure 6A). A weaker, but similar GA-rich motif was common to all 707 centromeric sites (Figure 6—figure supplement 1A). The 15-nt motif also matched more than 60,000 sites in the genome that are not associated with high cenH3 signal and so is not sufficient to determine cenH3 occupancy.

Figure 6 (with 1 supplement)

Centromeres coincide with transcription factor hotspots.

(A) MEME motif for the 80-bp cores of centromeric sites (left) and the 80-bp cores of high occupancy target (HOT) sites (right). (B) Heatmaps and average plots of cenH3 ChIP, insoluble chromatin and CENP-C ChIP signal within a 2-kb window around HOT sites, illustrating that HOT sites are highly occupied by cenH3, insoluble chromatin and CENP-C. Each line of the heatmaps represents an individual HOT site. Heatmaps are sorted by decreasing signal.

https://doi.org/10.7554/eLife.02025.020

GAGA-rich sequences are well-characterized targets for the GAGA factor (GAF/Trl) in D. melanogaster (van Steensel et al., 2003). However, we could not identify a GAF/Trl homologue in the C. elegans genome. Instead, the motif we identified is almost identical to the motif associated with C. elegans high occupancy target (HOT) sites (Figure 6A). These sites were uncovered by the modENCODE consortium through binding-site analysis of 22 transcription factors and operationally defined as sites that are bound by ≥15 transcription factors (Gerstein et al., 2010; Niu et al., 2011). The sequences at these sites do not contain DNA motifs of known C. elegans transcription factors and are therefore expected to bind transcription factors with low affinity. We found that 117 out of 248 HOT sites coincided with cenH3 sites (Figure 6—figure supplement 1B; hypergeometric p=8.6 × 10⁻¹⁶¹). Although this degree of overlap is striking, the actual overlap of cenH3 sites and transcription factor hotspots is likely to be much larger, given that only 22 transcription factors have been used for the definition of HOT sites, whereas there are 934 predicted transcription factors encoded in the C. elegans genome (Reece-Hoyes et al., 2005; Gerstein et al., 2010). We also found that HOT sites show a high signal for cenH3 ChIP, insoluble chromatin, CENP-C ChIP (Figure 6B) and well-positioned flanking nucleosomes (Figure 6—figure supplement 1C). Moreover, the cenH3 ChIP signal is reduced in KNL-2-depleted animals (Figure 6—figure supplement 1D,E). These data suggest that HOT sites and centromeric sites share a similar chromatin landscape and are targeted by both cenH3 nucleosomes and transcription factors.
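For readers unfamiliar with the hypergeometric test used for the overlap of HOT sites and cenH3 sites, the following is a minimal sketch of the upper-tail probability. The function name `hypergeom_upper_tail` is hypothetical. The size of the universe of candidate sites is not stated in the text, so it is left as a parameter and the sketch does not attempt to reproduce the reported p-value.

```python
from math import comb


def hypergeom_upper_tail(k, universe, successes, draws):
    """P(X >= k) for a hypergeometric draw without replacement.

    k:         observed overlap (e.g. HOT sites that coincide with cenH3 sites)
    universe:  total number of candidate sites (an assumption; not given in the text)
    successes: number of sites in the universe that are cenH3 sites
    draws:     number of HOT sites
    """
    upper = min(successes, draws)
    # Exact integer arithmetic; only the final ratio is converted to a float.
    tail = sum(
        comb(successes, i) * comb(universe - successes, draws - i)
        for i in range(k, upper + 1)
    )
    return tail / comb(universe, draws)


# Toy example: 10 candidate sites, 4 of them "centromeric", 3 drawn,
# probability that at least 2 of the drawn sites are centromeric.
toy_p = hypergeom_upper_tail(2, 10, 4, 3)
```

With the counts reported here, the call would take k = 117, successes = 707 and draws = 248, together with whatever universe size is judged appropriate for the genome segmentation.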

Transcription factor occupancy accompanies loss of cenH3 upon exit from the cell cycle

Embryonic cells contain both cenH3 and transcription factors, thus complicating the analysis of the chromatin landscape at centromeric sites. To probe the chromatin landscape in cenH3-depleted cells, we analyzed our previously published data for affinity-purified adult muscle cells, where cenH3 protein is below detection levels (Figure 7A), and cenH3 mRNA is significantly depleted (Bayesian t-test; q = 0.0036) (Steiner et al., 2012). These samples were MNase-digested >10 min to enrich for mononucleosomes (Figure 2—figure supplement 1, right panel). Centromeric nucleosomes are unstable under these MNase conditions, and associated fragments are expected to be depleted (Figure 2). Despite differences between chromatin preparations from embryos for native ChIP input and whole nuclear DNA extraction from adults for MNase-seq, the overall nucleosome landscapes obtained were very similar (Figure 7—figure supplement 1A). In total adult samples, which contain nuclei from dividing germline and embryonic cells, we found a depletion of signal at centromeric sites and well-positioned nucleosomes flanking the sites, reminiscent of the chromatin landscape in embryos (Figure 7B, black line). In muscle nuclei, the flanking nucleosomes were also present (Figure 7B, red line), and protected a very similar size range of fragments as in the total nuclei sample (Figure 7C, left panel). However, the centromeric sites in the muscle nuclei sample were occupied by MNase-stable particles (Figure 7B, red line) that protect sub-nucleosome-size fragments (Figure 7C, right panel) and likely represent non-nucleosomal DNA-binding proteins.

Figure 7 (with 1 supplement)

Transcription factor occupancy upon loss of cenH3 as cells exit the cell cycle.

(A) CenH3 in adult germline, intestine and muscle. Staining of a worm section with anti-cenH3, anti-NPP-9 (nuclear pores; staining control) and DAPI are shown. Germ cells in diakinesis are marked with asterisks, muscle cells with arrowheads in the merge. Scale bar is 3 µm. (B) Average MNase-seq signal within a 2-kb window around centromeric sites for adult total and adult muscle nuclei, illustrating that centromeric sites remain occupied in cenH3-depleted cells. MNase digestion >10 min. (C) MNase fragment size distribution at flanking nucleosomes and centromeric sites for total adult and adult muscle nuclei, illustrating that at least a sub-population of the particles occupying the centromeric sites are not canonical nucleosomes. (D) Heatmap and average plot of HLH-1 ChIP signal from adults within a 2-kb window around cenH3 sites. HLH-1 is a HOT site transcription factor. (E) Heatmaps and average plots of ChIP signal for another nine of the transcription factors used to define HOT sites within a 2-kb window around cenH3 sites, data from Gerstein et al. (Gerstein et al., 2010). All transcription factors in (E) were profiled in the third larval instar except ALR-1 (second larval instar) and PHA-4 (adults). Differences between ChIP and input are plotted in (D and E). Each line of the heatmaps represents an individual cenH3 site. Heatmaps are sorted by decreasing signal.

https://doi.org/10.7554/eLife.02025.022

To test directly whether centromeric sites become occupied by transcription factors upon depletion of cenH3, we profiled the muscle-specific transcription factor HLH-1 in young adults by X-ChIP-seq. HLH-1 is the C. elegans myoD homologue and is required for proper myogenesis (Krause, 1995). It is exclusively expressed in the same cells that have been used for the muscle chromatin profiling (Figure 7—figure supplement 1B). We found that HLH-1 is enriched at the majority of centromeric sites (Figure 7D). We also analyzed the previously published transcription factor ChIP-seq datasets for binding at centromeric sites (Gerstein et al., 2010). We found that all tested transcription factors profiled in larval instar 2 or later stages, when few somatic cells are still dividing, are enriched at centromeric sites (Figure 7E).

These results show that if cenH3 is depleted, centromeric sites are not occupied by canonical nucleosomes, but are bound by sub-nucleosome-size particles at least some of which are known transcription factors. As centromere function is needed only in dividing cells, this possibility is consistent with the observation that some HOT sites have post-mitotic functions as enhancers (Kvon et al., 2012; Chen et al., 2013).

Discussion

Holocentric chromosomes are polycentric

Holocentricity is a common mode of chromosome organization, having evolved from monocentricity at least 13 times, including organisms as diverse as nematodes, moths, and sedges (Melters et al., 2012). Based on cytological observations, two very different models for holocentricity were proposed more than 60 years ago: diffuse centromeres and polycentromeres (Schrader, 1947). We use high-resolution mapping of centromeric nucleosomes to demonstrate that the polycentromere model is correct for C. elegans, and that C. elegans holocentromeres consist of about 100 centromeric sites on each chromosome. The number of discrete centromeric sites exceeds the observed number of microtubule attachments, which has been estimated to be ∼100 for all six chromosomes by electron tomography (O’Toole et al., 2003). It is possible that only about 15% of the centromeric sites identified in this study are attached in each mitotic cell, which seems reasonable given that we analyzed a diverse cell population. The availability of multiple centromeric sites might reflect redundancy to assure faithful segregation in every cell cycle, although it is also possible that some sites are rendered inaccessible for cenH3 nucleosomes in some cell lineages due to changes in expression profiles during development.

Polycentromere and diffuse centromere mechanisms might not be mutually exclusive, insofar as a large fraction of cenH3 is incorporated into broad domains that cover half of the genome. It has previously been reported that these domains anti-correlate with transcription (Gassmann et al., 2012), and we show that they also anti-correlate with H3.3 and correlate with H3K9me3. These correlations all indicate that cenH3 is preferentially located in regions of low nucleosome turnover, and the intrinsic instability of cenH3 nucleosomes may contribute to losing it from ‘open’ chromatin (Conde e Silva, 2007). The domains are in part shaped by transcription in the germline and are present in the early embryo; however, there is no significant RNA Pol II-dependent transcription during the first two rounds of embryonic cell division. In contrast, H3.3 is deposited in the germline and has a well-established role in the inheritance of chromatin states (Ooi et al., 2006; Ooi et al., 2010; Jullien et al., 2012). Specifically, H3.3 is retained both in mature sperm and oocytes, suggesting that it transmits epigenetic information through both the maternal and the paternal germline. The maintenance of H3.3 in sperm might explain how the domain pattern is established on paternal chromatin upon fertilization, even though cenH3 is not maintained in mature sperm (Gassmann et al., 2012). However, because these domains do not align with the kinetochore, they probably do not have a direct centromere function, although they might serve as cenH3 ‘reservoirs’, in parallel with the suggestion that Drosophila transcription factor hotspots might serve as transcription factor reservoirs (Moorman et al., 2006).

Point centromeres are the building block of polycentromeres

We have found that individual centromeric sites resemble budding yeast point centromeres: a single cenH3 nucleosome flanked by two well-positioned nucleosomes. Point centromeres are genetically defined in budding yeast, and satellite repeats help position centromeric nucleosomes at regional centromeres in many species. However, neo-centromeres can form on sequences that are not normally linked to centromeres (Marshall et al., 2008; Shang et al., 2013). These observations suggest that centromeric nucleosomes are inherited in a sequence-independent way, so that it might seem surprising that a distinct sequence motif is associated with C. elegans centromeric sites. However, given that the motif is short and its abundance in the genome by far exceeds the number of centromeric sites, there is no evidence that it is a direct target for cenH3-nucleosome loading. Rather, the DNA at these sites might disfavor the formation of canonical nucleosomes, allowing centromeric nucleosomes to form in these ‘gaps’ in the chromatin landscape. A similar model for worm holocentromeres had been proposed by Gu and Fire (Gu and Fire, 2010) based on finding ∼120-bp ‘holes’ in the nucleosome landscape that could fit small nucleosomes the size of those previously shown for Drosophila cenH3 (Dalal et al., 2007).

In C. elegans, virtually any piece of DNA injected into the gonad will concatamerize, acquire a centromere and segregate with varying efficiency (Stinchcomb et al., 1985; Mello et al., 1991; Yuen et al., 2011). Opportunistic assembly of centromeric nucleosomes at sites of accessible DNA predicts that cenH3 will be loaded onto any fragment of DNA that contains accessible stretches. The fact that new extrachromosomal arrays initially partition passively and only acquire segregation competence after a few cell cycles indicates that centromeric competence can be acquired epigenetically in C. elegans (Yuen et al., 2011), consistent with an opportunistic gap-filling model.

A role for transcription factors in holocentromere maintenance?

We found that centromeric sites coincide with HOT sites, which are occupied by many transcription factors without having high binding affinity for any of them. When cells exit the cell cycle and cenH3 is no longer expressed, ‘holes’ in the chromatin landscape might open up and allow HOT site transcription factors to bind by mass action. Although many HOT site transcription factors are cell type-specific, their non-specific binding to holes vacated by cenH3 nucleosomes would result in high HOT site transcription factor occupancy in multiple cell-types. Indeed, we found that cenH3 is lost in adult muscle cells, but that centromeric sites remain occupied, in part by the muscle-specific transcription factor HLH-1 and presumably also by other HOT site transcription factors. Virtually all transcription factors profiled in postembryonic tissues (larval instar 2 and later stages) were found at many, if not all centromeric sites. Replacement of cenH3 nucleosomes by transcription factors at HOT sites upon exit from the cell cycle may then result in their reported enhancer activity (Kvon et al., 2012; Chen et al., 2013).

CenH3 protein has been shown to turn over completely during the mitotic cell cycle, to disappear during the pachytene stage of meiotic prophase, to reappear when nuclei progress into diplotene and to be absent from mature sperm (Gassmann et al., 2012). These observations imply that the centromeric sites need to be marked during certain stages of the cell cycle in order to be repopulated at later stages. The coincidence of cenH3 and HOT sites raises the possibility that low-affinity binding of transcription factors by mass action prevents encroachment of nucleosomes and thus helps to maintain holocentric sites over the course of development.

Our results resolve the long-standing question of whether holocentromeres are polycentric or diffuse. We show that C. elegans holocentromeres are organized as dispersed point centromeres, consistent with the polycentromere model. Our discovery of the coincidence of centromeric sites with transcription factor hotspots points to a possible mechanism for centromeric site selection and maintenance.

Materials and methods

Worm culture and RNAi

We used the standard wild-type strain N2 and OP64 grown at 20°C. Synchronized populations were cultured on peptone-rich plates seeded with E. coli strain NA22. To deplete KNL-2 by RNAi, synchronized populations were grown on NA22 until fourth larval instar (L4), washed in M9 buffer and transferred to bacteria expressing dsRNA that targets knl-2 for 24 hr. Embryos were harvested from adults by sodium hypochlorite treatment.

Immunofluorescence microscopy

Worms were decapitated with a razor blade in M9 buffer, freeze cracked on dry ice and fixed 2 min in methanol and 4 min in acetone at −20°C. Samples were incubated with anti-cenH3 (rabbit) (Buchwitz et al., 1999) and anti-NPP-9 (mouse) (Sheth et al., 2010) antibodies overnight at 4°C and with Cy3 donkey anti-rabbit and DyLight 488 donkey anti-mouse antibodies (Jackson ImmunoResearch) for 1 hr at 37°C. Washes were carried out with phosphate buffered saline containing 1% Tween-20 (PBS-T) throughout. Samples were incubated in PBS-T containing 0.01 mg/ml 4′,6-diamidino-2-phenylindole (DAPI) before being mounted. Images were acquired using a Nikon Eclipse 90i microscope (60x lens).

Native ChIP

N2 embryos were treated in 0.1 U/ml chitinase (Sigma) for 30–60 min and washed with buffer A (15 mM Tris–HCl pH7.5, 2 mM MgCl₂, 340 mM sucrose, 0.2 mM spermine, 0.5 mM spermidine, 0.5 mM phenylmethylsulfonyl fluoride [PMSF]). Nuclei were isolated using a glass Dounce homogenizer with 15 strokes each of the loose- and tight-fitting inserts in buffer A supplemented with 0.1% Triton X-100 and 0.25% NP-40 substitute. The homogenate was diluted five times with buffer A, debris was removed by spinning at 100×g for 2 min and nuclei were pelleted by spinning at 1000×g for 10 min. Nuclei were transferred to 1 ml 10 mM Tris pH7.5, 2 mM MgCl₂, 0.5 mM PMSF and pre-warmed for 5 min at 37°C. CaCl₂ to a final concentration of 2 mM and 0.1 units of micrococcal nuclease (MNase; Sigma–Aldrich) were added. After 1, 2, 5 or 10 min the reaction was stopped by the addition of ethylenediaminetetraacetic acid (EDTA) to a final concentration of 30 mM. A light MNase digestion corresponding to 2 min in this time-course experiment was used for all other experiments unless otherwise noted. Chromatin was solubilized by cavitation using needle extraction (4 times 20 gauge, 4 times 26 gauge), a protocol modified from Jin and Felsenfeld, 2007. Soluble chromatin was collected by spinning at 1000×g for 5 min and the supernatant was pooled with additional chromatin solubilized by incubating the pellet in 10 mM Tris pH7.5, 10 mM EDTA, 0.1% Triton X-100, 0.5 mM PMSF for 4 hr at 4°C. The remaining pellet was retained as the insoluble chromatin fraction. Soluble chromatin fractions were combined, NaCl adjusted to 100 mM, debris removed by spinning 4 times at maximum speed for 5 min and pre-cleared by incubation with Dynabeads protein A (Invitrogen). From this input fraction, cenH3 was isolated by incubation with 4 µl anti-cenH3 antibody overnight and Dynabeads protein A for 2 hr. Beads were washed three times in 10 mM Tris pH7.5, 100 mM NaCl, 10 mM EDTA, 0.1% Triton X-100, 0.5 mM PMSF and twice in 10 mM Tris pH7.5, 100 mM NaCl, 10 mM EDTA, 0.5 mM PMSF. Chromatin was treated with RNase and Proteinase K, and DNA was isolated with phenol:chloroform and precipitated with ethanol in the presence of glycogen.

Crosslinked ChIP

For CENP-C ChIP, nuclei were prepared and MNase treated as for native ChIP, except that MNase incubation was done in HM2 (50 mM HEPES pH7.4, 2 mM MgCl₂, 0.5 mM PMSF) for 2 min. MNase was inactivated by addition of EGTA to 5 mM and nuclei were washed once in HM2. Chromatin was crosslinked in 1% formaldehyde for 10 min. Crosslinking was quenched by adding glycine to 125 mM for 10 min. Nuclei were washed with HM2 and lysed in 50 mM Tris pH7.5, 10 mM EDTA, 1% SDS, 0.5 mM PMSF by vortexing for 2 min. The lysate was diluted to 20 mM Tris pH7.5, 150 mM NaCl, 0.1% SDS, 1% Triton X-100, 2 mM EDTA, 0.5 mM PMSF and sonicated with a Sonic Dismembrator Model 500 (Fisher Scientific) for 40 s at 30% amplitude. Debris removal, pre-clearing and CENP-C ChIP were done as for native ChIP, with the antibody from Moore and Roth, 2001. Beads were washed twice with 20 mM Tris pH7.5, 150 mM NaCl, 0.1% SDS, 1% Triton X-100, 2 mM EDTA, once each in 20 mM Tris pH7.5, 500 mM NaCl, 0.1% SDS, 1% Triton X-100, 2 mM EDTA and 20 mM Tris pH7.5, 250 mM LiCl, 1% sodium deoxycholate, 1% NP-40 substitute, 2 mM EDTA and once in TE. Crosslinks were reversed overnight at 65°C, chromatin was treated with RNase and Proteinase K, and DNA was isolated with phenol:chloroform and precipitated with ethanol in the presence of glycogen.

For HLH-1 ChIP, OP64 worms were washed in PBS, ground under liquid nitrogen and resuspended in PBS containing 1x Complete Protease Inhibitor Cocktail (Roche). Proteins were crosslinked with 1% formaldehyde for 15 min, the reaction was quenched with 125 mM glycine for 10 min, the volume increased to 50 ml, and chromatin pelleted by spinning at 2000×g for 10 min. The pellet was washed again in PBS, resuspended in 50 mM Tris pH7.5, 10 mM EDTA, 1% SDS containing 1x Complete Protease Inhibitor Cocktail (Roche), incubated 10 min at room temperature, diluted to 20 mM Tris pH7.5, 150 mM NaCl, 0.1% SDS, 1% Triton X-100, 2 mM EDTA containing 1x Complete Protease Inhibitor Cocktail (Roche) and sonicated with a Sonic Dismembrator Model 500 (Fisher Scientific) for 4 min at 30% amplitude. Debris was removed by spinning twice at maximum speed for 5 min. The extract was incubated with an anti-FLAG M2 antibody (Sigma) overnight at 4°C. Protein G beads pre-blocked with BSA and yeast tRNA were added for 4 hr. Beads were washed and DNA isolated as described for CENP-C ChIP above.

Illumina sequencing and data analysis

Libraries were prepared using a modified Illumina paired-end library protocol as described in Henikoff et al., 2011. Cluster generation, followed by 25 rounds of paired-end sequencing on an Illumina HiSeq 2000, was performed by the FHCRC Genomics Shared Resource.

After processing and base-calling by Illumina software, paired-end reads were mapped to the C. elegans genome release WS220 using Novoalign (http://www.novocraft.com) with default parameters, except that each multiple hit was mapped to one site chosen at random (Novoalign parameter -r Random). The number of inserts aligned to each 10-bp interval of the genome was counted, and the interval counts were normalized by dividing by the total number of counts for all intervals, and then scaled by multiplying by the number of bases in the genome. We considered fragments >140 bp to represent nucleosomes, the in silico equivalent of excising a gel slice around the ∼150-bp size range from an MNase-digested chromatin ladder and extracting the DNA for single-end sequencing. As we use a modified paired-end sequencing protocol to include all fragments >25 bp (Henikoff et al., 2011), we can accomplish the size cut more precisely by mapping only the reads in the nucleosome size range. Simple repeat regions were downloaded from www.wormbase.org and excluded from all analyses.
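The normalization described above can be sketched as follows. This is an illustrative outline only: the function names `size_class` and `normalize_counts` are hypothetical, the size-class boundaries are those given in the Figure 5 legend, and how a fragment is assigned to 10-bp intervals is not specified in the text, so the sketch starts from already-tallied interval counts.

```python
def size_class(fragment_length):
    """Assign a fragment to a size class using the Figure 5 legend boundaries."""
    if 141 <= fragment_length <= 500:
        return "nucleosomal"
    if 21 <= fragment_length <= 140:
        return "small"
    return None  # outside both size classes


def normalize_counts(interval_counts, genome_size_bp):
    """Divide each interval count by the total and scale by the genome size.

    interval_counts: raw insert counts per 10-bp interval (tallying is assumed
                     to have been done upstream).
    genome_size_bp:  number of bases in the genome.
    """
    total = sum(interval_counts)
    if total == 0:
        raise ValueError("no counts to normalize")
    scale = genome_size_bp / total
    return [count * scale for count in interval_counts]


# Toy example: four 10-bp intervals covering a 40-bp "genome".
toy_normalized = normalize_counts([1, 3, 0, 4], 40)
```

Because every sample is scaled to the same total, normalized counts from input, ChIP and insoluble chromatin can be compared or subtracted directly.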

To call peaks, given the discrete nature of the sites of high cenH3 signal, we set a threshold and considered all the features with higher counts as peaks. For cenH3 peak calling, input counts were subtracted from cenH3 ChIP counts for two biological replicates (2 min MNase), and peaks that exceeded 30 counts in at least one of the biological replicates were considered positive. For CENP-C peak calling, input counts were subtracted from CENP-C ChIP counts, and peaks that exceeded 20 counts were considered positive. For insoluble chromatin peak calling, peaks that exceeded 100 counts were considered positive.
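The thresholding scheme above can be sketched as follows. This is a simplified illustration rather than the code used for the analysis: the function name `call_peaks` is hypothetical, each replicate is assumed to have its own matched input track, and merging consecutive above-threshold intervals into a single peak is an assumption about how features were delimited.

```python
def call_peaks(chip_replicates, input_replicates, threshold, bin_size=10):
    """Return (start, end) peak coordinates from input-subtracted counts.

    chip_replicates:  list of per-interval normalized ChIP count lists.
    input_replicates: list of matching per-interval input count lists (assumed).
    threshold:        counts that the difference must exceed (e.g. 30 for cenH3).
    bin_size:         interval width in bp (10 bp in this study).
    """
    n_bins = len(chip_replicates[0])
    # An interval is positive if ChIP minus input exceeds the threshold
    # in at least one replicate.
    above = [
        any(chip[i] - inp[i] > threshold
            for chip, inp in zip(chip_replicates, input_replicates))
        for i in range(n_bins)
    ]
    peaks = []
    start = None
    for i, flag in enumerate(above):
        if flag and start is None:
            start = i  # open a new peak
        elif not flag and start is not None:
            peaks.append((start * bin_size, i * bin_size))  # close the peak
            start = None
    if start is not None:
        peaks.append((start * bin_size, n_bins * bin_size))
    return peaks


# Toy example with two replicates and the cenH3 threshold of 30 counts.
toy_chip = [[5, 40, 50, 5, 5, 5], [5, 5, 5, 5, 45, 5]]
toy_input = [[2, 3, 4, 2, 2, 2], [1, 1, 1, 1, 2, 1]]
toy_peaks = call_peaks(toy_chip, toy_input, threshold=30)
```

The same function would apply to CENP-C with a threshold of 20; for insoluble chromatin, where no input subtraction is described, an all-zero input track with a threshold of 100 would reproduce the stated rule.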

To normalize against input, we used log₂-ratios only to compare to previously published array data. Otherwise, we consider input reads a separate “blank” experiment that we subtract from the ChIP counts.

Data availability

The following data sets were generated

1. Steiner FA
2. Henikoff S
(2013) Holocentromeres are dispersed point centromeres localized at transcription factor hotspots
ID GSE44412. Publicly available at the Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/).

http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE44412

References

1. Ahmad K
2. Henikoff S
(2002) The histone variant H3.3 marks active chromatin by replication-independent nucleosome assembly
Molecular Cell 9:1191–1200.

https://doi.org/10.1016/S1097-2765(02)00542-7
1. Albertson DG
2. Thomson JN
(1982) The kinetochores of Caenorhabditis elegans
Chromosoma 86:409–428.

https://doi.org/10.1007/BF00292267
1. Bailey TL
2. Boden M
3. Buske FA
4. Frith M
5. Grant CE
6. Clementi L
7. Ren J
8. Li WW
9. Noble WS
(2009) MEME SUITE: tools for motif discovery and searching
Nucleic Acids Research 37:W202–W208.

https://doi.org/10.1093/nar/gkp335
1. Buchwitz BJ
2. Ahmad K
3. Moore LL
4. Roth MB
5. Henikoff S
(1999) A histone-H3-like protein in C. elegans
Nature 401:547–548.

https://doi.org/10.1038/44062
(2007) CENP-A-containing nucleosomes: easier disassembly versus exclusive centromeric localization
Journal of Molecular Biology 370:555–573.
1. Camahort R
2. Shivaraju M
3. Mattingly M
4. Li B
5. Nakanishi S
6. Zhu D
7. Shilatifard A
8. Workman JL
9. Gerton JL
(2009) Cse4 is part of an octameric nucleosome in budding yeast
Molecular Cell 35:794–805.

https://doi.org/10.1016/j.molcel.2009.07.022
(2010) Dual recognition of CENP-A nucleosomes is required for centromere assembly
The Journal of Cell Biology 189:1143–1155.

https://doi.org/10.1083/jcb.201001013
1. Cheeseman IM
2. Niessen S
3. Anderson S
4. Hyndman F
5. Yates JR III
6. Oegema K
7. Desai A
(2004) A conserved protein network controls assembly of the outer kinetochore and its ability to sustain tension
Genes & Development 18:2255–2268.

https://doi.org/10.1101/gad.1234104
1. Chen RA
2. Down TA
3. Stempor P
4. Chen QB
5. Egelhofer TA
6. Hillier LW
7. Jeffers TE
8. Ahringer J
(2013) The landscape of RNA polymerase II transcription initiation in C. elegans reveals promoter and enhancer architectures
Genome Research 23:1339–1347.

https://doi.org/10.1101/gr.153668.112
1. Clarke L
2. Carbon J
(1980) Isolation of a yeast centromere and construction of functional small circular chromosomes
Nature 287:504–509.

https://doi.org/10.1038/287504a0
1. Dalal Y
2. Wang H
3. Lindsay S
4. Henikoff S
(2007) Tetrameric structure of centromeric nucleosomes in interphase Drosophila cells
PLOS Biology 5:e218.

https://doi.org/10.1371/journal.pbio.0050218
(2010) Genome-wide kinetics of nucleosome turnover determined by metabolic labeling of histones
Science 328:1161–1164.

https://doi.org/10.1126/science.1186777
(1991) In vivo genomic footprint of a yeast centromere
Molecular and Cellular Biology 11:154–165.
(2008) Heterochromatin and RNAi are required to establish CENP-A chromatin at centromeres
Science 319:94–97.

https://doi.org/10.1126/science.1150944
1. Furuyama S
2. Biggins S
(2007) Centromere identity is specified by a single centromeric nucleosome in budding yeast
Proceedings of the National Academy of Sciences of the United States of America 104:14706–14711.

https://doi.org/10.1073/pnas.0706985104
(2011) Induced ectopic kinetochore assembly bypasses the requirement for CENP-A nucleosomes
Cell 145:410–422.

https://doi.org/10.1016/j.cell.2011.03.031
1. Gassmann R
2. Rechtsteiner A
3. Yuen KW
4. Muroyama A
5. Egelhofer T
6. Gaydos L
7. Barron F
8. Maddox P
9. Essex A
10. Monen J
11. Ercan S
12. Lieb JD
13. Oegema K
14. Strome S
15. Desai A
(2012) An inverse relationship to germline transcription defines centromeric chromatin in C. elegans
Nature 484:534–537.

https://doi.org/10.1038/nature10973
1. Gerstein MB
2. Lu ZJ
3. Van Nostrand EL
4. Cheng C
5. Arshinoff BI
6. Liu T
7. Yip KY
8. Robilotto R
9. Rechtsteiner A
10. Ikegami K
11. Alves P
12. Chateigner A
13. Perry M
14. Morris M
15. Auerbach RK
16. Feng X
17. Leng J
18. Vielle A
19. Niu W
20. Rhrissorrakrai K
21. Agarwal A
22. Alexander RP
23. Barber G
24. Brdlik CM
25. Brennan J
26. Brouillet JJ
27. Carr A
28. Cheung MS
29. Clawson H
30. Contrino S
31. Dannenberg LO
32. Dernburg AF
33. Desai A
34. Dick L
35. Dosé AC
36. Du J
37. Egelhofer T
38. Ercan S
39. Euskirchen G
40. Ewing B
41. Feingold EA
42. Gassmann R
43. Good PJ
44. Green P
45. Gullier F
46. Gutwein M
47. Guyer MS
48. Habegger L
49. Han T
50. Henikoff JG
51. Henz SR
52. Hinrichs A
53. Holster H
54. Hyman T
55. Iniguez AL
56. Janette J
57. Jensen M
58. Kato M
59. Kent WJ
60. Kephart E
61. Khivansara V
62. Khurana E
63. Kim JK
64. Kolasinska-Zwierz P
65. Lai EC
66. Latorre I
67. Leahey A
68. Lewis S
69. Lloyd P
70. Lochovsky L
71. Lowdon RF
72. Lubling Y
73. Lyne R
74. MacCoss M
75. Mackowiak SD
76. Mangone M
77. McKay S
78. Mecenas D
79. Merrihew G
80. Miller DM 3rd
81. Muroyama A
82. Murray JI
83. Ooi SL
84. Pham H
85. Phippen T
86. Preston EA
87. Rajewsky N
88. Rätsch G
89. Rosenbaum H
90. Rozowsky J
91. Rutherford K
92. Ruzanov P
93. Sarov M
94. Sasidharan R
95. Sboner A
96. Scheid P
97. Segal E
98. Shin H
99. Shou C
100. Slack FJ
101. Slightam C
102. Smith R
103. Spencer WC
104. Stinson EO
105. Taing S
106. Takasaki T
107. Vafeados D
108. Voronina K
109. Wang G
110. Washington NL
111. Whittle CM
112. Wu B
113. Yan KK
114. Zeller G
115. Zha Z
116. Zhong M
117. Zhou X
118. modENCODE Consortium
119. Ahringer J
120. Strome S
121. Gunsalus KC
122. Micklem G
123. Liu XS
124. Reinke V
125. Kim SK
126. Hillier LW
127. Henikoff S
128. Piano F
129. Snyder M
130. Stein L
131. Lieb JD
132. Waterston RH
(2010) Integrative analysis of the Caenorhabditis elegans genome by the modENCODE project
Science 330:1775–1787.

https://doi.org/10.1126/science.1196914
1. Goldberg AD
2. Banaszynski LA
3. Noh KM
4. Lewis PW
5. Elsaesser SJ
6. Stadler S
7. Dewell S
8. Law M
9. Guo X
10. Li X
11. Wen D
12. Chapgier A
13. DeKelver RC
14. Miller JC
15. Lee YL
16. Boydston EA
17. Holmes MC
18. Gregory PD
19. Greally JM
20. Rafii S
21. Yang C
22. Scambler PJ
23. Garrick D
24. Gibbons RJ
25. Higgs DR
26. Cristea IM
27. Urnov FD
28. Zheng D
29. Allis CD
(2010) Distinct factors control histone variant H3.3 localization at specific genomic regions
Cell 140:678–691.

https://doi.org/10.1016/j.cell.2010.01.003
1. Gu SG
2. Fire A
(2010) Partitioning the C. elegans genome by nucleosome modification, occupancy, and positioning
Chromosoma 119:73–87.

https://doi.org/10.1007/s00412-009-0235-3
1. Hasson D
2. Panchenko T
3. Salimian KJ
4. Salman MU
5. Sekulic N
6. Alonso A
7. Warburton PE
8. Black BE
(2013) The octamer is the major form of CENP-A nucleosomes at human centromeres
Nature Structural & Molecular Biology 20:687–695.

https://doi.org/10.1038/nsmb.2562
(2011) Epigenome characterization at single base-pair resolution
Proceedings of the National Academy of Sciences of the United States of America 108:18318–18323.

https://doi.org/10.1073/pnas.1110731108
1. Henikoff S
2. Furuyama T
(2012) The unconventional structure of centromeric nucleosomes
Chromosoma 121:341–352.

https://doi.org/10.1007/s00412-012-0372-y
1. Henikoff S
2. Henikoff JG
(2012) “Point” centromeres of Saccharomyces harbor single centromere-specific nucleosomes
Genetics 190:1575–1577.

https://doi.org/10.1534/genetics.111.137711
1. Hori T
2. Shang WH
3. Takeuchi K
4. Fukagawa T
(2013) The CCAN recruits CENP-A to the centromere and forms the structural core for kinetochore assembly
The Journal of Cell Biology 200:45–60.

https://doi.org/10.1083/jcb.201210106
1. Jin C
2. Felsenfeld G
(2007) Nucleosome stability mediated by histone variants H3.3 and H2A.Z
Genes & Development 21:1519–1529.

https://doi.org/10.1101/gad.1547707
1. Jullien J
2. Astrand C
3. Szenker E
4. Garrett N
5. Almouzni G
6. Gurdon JB
(2012) HIRA dependent H3.3 deposition is required for transcriptional reprogramming following nuclear transfer to Xenopus oocytes
Epigenetics & Chromatin 5:17.

https://doi.org/10.1186/1756-8935-5-17
1. Kagansky A
2. Folco HD
3. Almeida R
4. Pidoux AL
5. Boukaba A
6. Simmer F
7. Urano T
8. Hamilton GL
9. Allshire RC
(2009) Synthetic heterochromatin bypasses RNAi and centromeric repeats to establish functional centromeres
Science 324:1716–1719.

https://doi.org/10.1126/science.1172026
1. Kato H
2. Jiang J
3. Zhou BR
4. Rozendaal M
5. Feng H
6. Ghirlando R
7. Xiao TS
8. Straight AF
9. Bai Y
(2013) A conserved mechanism for centromeric nucleosome recognition by centromere protein CENP-C
Science 340:1110–1113.

https://doi.org/10.1126/science.1235532
(2011) Chromatin particle spectrum analysis: a method for comparative chromatin structure analysis using paired-end mode next-generation DNA sequencing
Nucleic Acids Research 39:e26.

https://doi.org/10.1093/nar/gkq1183
(2012) Tripartite organization of centromeric chromatin in budding yeast
Proceedings of the National Academy of Sciences of the United States of America 109:243–248.

https://doi.org/10.1073/pnas.1118898109
1. Krause M
(1995) MyoD and myogenesis in C. elegans
BioEssays: News and Reviews in Molecular, Cellular and Developmental Biology 17:219–228.

https://doi.org/10.1002/bies.950170308
(2012) HOT regions function as patterned developmental enhancers and have a distinct cis-regulatory signature
Genes & Development 26:908–913.

https://doi.org/10.1101/gad.188052.112
1. Lacoste N
2. Woolfe A
3. Tachiwana H
4. Garea AV
5. Barth T
6. Cantaloube S
7. Kurumizaka H
8. Imhof A
9. Almouzni G
(2014) Mislocalization of the centromeric histone variant CenH3/CENP-A in human cells depends on the chaperone DAXX
Molecular Cell 53:631–644.

https://doi.org/10.1016/j.molcel.2014.01.018
(2013) Centromere-like regions in the budding yeast genome
PLOS Genetics 9:e1003209.
(2009) Efficient yeast ChIP-Seq using multiplex short-read DNA sequencing
BMC Genomics 10:37.
1. Liu T
2. Rechtsteiner A
3. Egelhofer TA
4. Vielle A
5. Latorre I
6. Cheung MS
7. Ercan S
8. Ikegami K
9. Jensen M
10. Kolasinska-Zwierz P
11. Rosenbaum H
12. Shin H
13. Taing S
14. Takasaki T
15. Iniguez AL
16. Desai A
17. Dernburg AF
18. Kimura H
19. Lieb JD
20. Ahringer J
21. Strome S
22. Liu XS
(2011) Broad chromosomal domains of histone modification patterns in C. elegans
Genome Research 21:227–236.

https://doi.org/10.1101/gr.115519.110
(2011) Overlapping regulation of CenH3 localization and histone H3 turnover by CAF-1 and HIR proteins in Saccharomyces cerevisiae
Genetics 187:9–19.

https://doi.org/10.1534/genetics.110.123117
1. Maddox PS
2. Hyndman F
3. Monen J
4. Oegema K
5. Desai A
(2007) Functional genomics identifies a Myb domain-containing protein family required for assembly of CENP-A chromatin
The Journal of Cell Biology 176:757–763.

https://doi.org/10.1083/jcb.200701065
1. Malik HS
2. Henikoff S
(2009) Major evolutionary transitions in centromere complexity
Cell 138:1067–1082.

https://doi.org/10.1016/j.cell.2009.08.036
1. Marshall OJ
2. Chueh AC
3. Wong LH
4. Choo KH
(2008) Neocentromeres: new insights into centromere structure, disease development, and karyotype evolution
American Journal of Human Genetics 82:261–282.

https://doi.org/10.1016/j.ajhg.2007.11.009
(1991) Efficient gene transfer in C. elegans: extrachromosomal maintenance and integration of transforming sequences
The EMBO Journal 10:3959–3970.
(2012) Holocentric chromosomes: convergent evolution, meiotic adaptations, and genomic analysis
Chromosome Research: an International Journal on the Molecular, Supramolecular and Evolutionary Aspects of Chromosome Biology 20:579–593.

https://doi.org/10.1007/s10577-012-9292-1
(2005) Genome-scale profiling of histone H3.3 replacement patterns
Nature Genetics 37:1090–1097.

https://doi.org/10.1038/ng1637
1. Moore LL
2. Roth MB
(2001) HCP-4, a CENP-C-like protein in Caenorhabditis elegans, is required for resolution of sister centromeres
The Journal of Cell Biology 153:1199–1208.

https://doi.org/10.1083/jcb.153.6.1199
1. Moorman C
2. Sun LV
3. Wang J
4. de Wit E
5. Talhout W
6. Ward LD
7. Greil F
8. Lu XJ
9. White KP
10. Bussemaker HJ
11. van Steensel B
(2006) Hotspots of transcription factor colocalization in the genome of Drosophila melanogaster
Proceedings of the National Academy of Sciences of the United States of America 103:12027–12032.

https://doi.org/10.1073/pnas.0605003103
1. Niu W
2. Lu ZJ
3. Zhong M
4. Sarov M
5. Murray JI
6. Brdlik CM
7. Janette J
8. Chen C
9. Alves P
10. Preston E
11. Slightham C
12. Jiang L
13. Hyman AA
14. Kim SK
15. Waterston RH
16. Gerstein M
17. Snyder M
18. Reinke V
(2011) Diverse transcription factor binding features revealed by genome-wide ChIP-seq in C. elegans
Genome Research 21:245–254.

https://doi.org/10.1101/gr.114587.110
1. Oegema K
2. Desai A
3. Rybina S
4. Kirkham M
5. Hyman AA
(2001) Functional analysis of kinetochore assembly in Caenorhabditis elegans
The Journal of Cell Biology 153:1209–1226.

https://doi.org/10.1083/jcb.153.6.1209
(2010) A native chromatin purification system for epigenomic profiling in Caenorhabditis elegans
Nucleic Acids Research 38:e26.

https://doi.org/10.1093/nar/gkp1090
(2006) Histone H3.3 variant dynamics in the germline of Caenorhabditis elegans
PLOS Genetics 2:e97.

https://doi.org/10.1371/journal.pgen.0020097
(2003) Morphologically distinct microtubule ends in the mitotic centrosome of Caenorhabditis elegans
The Journal of Cell Biology 163:451–456.

https://doi.org/10.1083/jcb.200304035
1. Polizzi C
2. Clarke L
(1991) The chromatin structure of centromeres from fission yeast: differentiation of the central core that correlates with function
The Journal of Cell Biology 112:191–201.

https://doi.org/10.1083/jcb.112.2.191
(2011) CENP-C is a structural platform for kinetochore assembly
Current Biology: CB 21:399–405.

https://doi.org/10.1016/j.cub.2011.02.005
(2005) A compendium of Caenorhabditis elegans regulatory transcription factors: a resource for mapping transcription regulatory networks
Genome Biology 6:R110.

https://doi.org/10.1186/gb-2005-6-13-r110
1. Schrader F
(1935) Notes on the mitotic behavior of long chromosomes
Cytologia 6:422–430.

https://doi.org/10.1508/cytologia.6.422
1. Schrader F
(1947) The role of the kinetochore in the chromosomal evolution of the heteroptera and homoptera
Evolution 1:134–142.

https://doi.org/10.2307/2405489
1. Shang WH
2. Hori T
3. Martins NM
4. Toyoda A
5. Misu S
6. Monma N
7. Hiratani I
8. Maeshima K
9. Ikeo K
10. Fujiyama A
11. Kimura H
12. Earnshaw WC
13. Fukagawa T
(2013) Chromosome engineering allows the efficient isolation of vertebrate neocentromeres
Developmental Cell 24:635–648.

https://doi.org/10.1016/j.devcel.2013.02.009
1. Sheth U
2. Pitt J
3. Dennis S
4. Priess JR
(2010) Perinuclear P granules are the principal sites of mRNA export in adult C. elegans germ cells
Development 137:1305–1314.

https://doi.org/10.1242/dev.044255
(2012) Cell-type-specific nuclei purification from whole animals for genome-wide expression and chromatin profiling
Genome Research 22:766–777.

https://doi.org/10.1101/gr.131748.111
1. Stinchcomb DT
2. Shaw JE
3. Carr SH
4. Hirsh D
(1985) Extrachromosomal DNA transformation of Caenorhabditis elegans
Molecular and Cellular Biology 5:3484–3496.
(1992) A low copy number central sequence with strict symmetry and unusual chromatin structure in fission yeast centromere
Molecular Biology of the Cell 3:819–835.

https://doi.org/10.1091/mbc.3.7.819
1. Teves SS
2. Henikoff S
(2011) Heat shock reduces stalled RNA polymerase II and nucleosome turnover genome-wide
Genes & Development 25:2387–2397.

https://doi.org/10.1101/gad.177675.111
1. The C. elegans Sequencing Consortium
(1998) Genome sequence of the nematode C. elegans: a platform for investigating biology
Science 282:2012–2018.
(2003) Genomewide analysis of Drosophila GAGA factor target genes reveals context-dependent DNA binding
Proceedings of the National Academy of Sciences of the United States of America 100:2580–2585.

https://doi.org/10.1073/pnas.0438000100
1. Yuen KW
2. Nabeshima K
3. Oegema K
4. Desai A
(2011) Rapid de novo centromere formation occurs independently of heterochromatin protein 1 in C. elegans embryos
Current Biology: CB 21:1800–1807.

https://doi.org/10.1016/j.cub.2011.09.016
1. Zhang T
2. Talbert PB
3. Zhang W
4. Wu Y
5. Yang Z
6. Henikoff JG
7. Henikoff S
8. Jiang J
(2013) The CentO satellite confers translational and rotational phasing on cenH3 nucleosomes in rice centromeres
Proceedings of the National Academy of Sciences of the United States of America 110:E4875–E4883.

https://doi.org/10.1073/pnas.1319548110

Article and author information

Author details

Florian A Steiner

Basic Sciences Division, Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, United States

Contribution
FAS, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article

Competing interests
The authors declare that no competing interests exist.

Steven Henikoff

Basic Sciences Division, Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, United States

Contribution
SH, Conception and design, Analysis and interpretation of data, Drafting or revising the article

For correspondence
steveh@fhcrc.org

Competing interests
The authors declare that no competing interests exist.

Funding

Howard Hughes Medical Institute (Henikoff)

Florian A Steiner
Steven Henikoff

Swiss National Science Foundation (PBSKP3-124362)

Florian A Steiner

National Institutes of Health (U01-HG004274)

Florian A Steiner
Steven Henikoff

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We thank Jorja Henikoff for data analysis; Christine Codomo, Terri Bryson and Aaron Hernandez for reagent preparation; Srinivas Ramachandran, Sivakanthan Kasinathan and Christopher Weber for help with the data analysis; James Priess for providing reagents and lab space; Mark Roth and Mike Morrison for providing antibodies; and Sue Biggins, Takehito Furuyama, James Priess, Peter Skene, Paul Talbert, Christopher Weber and Gabriel Zentner for comments on the manuscript. Some strains were provided by the CGC, which is funded by the NIH Office of Research Infrastructure Programs (P40 OD010440). Sequence data are in the Gene Expression Omnibus (GEO) database under accession number GSE44412.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.