An acetylation-mediated chromatin switch governs H3K4 methylation read-write capability
Abstract
In nucleosomes, histone N-terminal tails exist in dynamic equilibrium between free/accessible and collapsed/DNA-bound states. The latter state is expected to impact histone N-termini availability to the epigenetic machinery. Notably, H3 tail acetylation (e.g. K9ac, K14ac, K18ac) is linked to increased H3K4me3 engagement by the BPTF PHD finger, but it is unknown if this mechanism has a broader extension. Here, we show that H3 tail acetylation promotes nucleosomal accessibility to other H3K4 methyl readers, and importantly, extends to H3K4 writers, notably methyltransferase MLL1. This regulation is not observed on peptide substrates yet occurs on the cis H3 tail, as determined with fully-defined heterotypic nucleosomes. In vivo, H3 tail acetylation is directly and dynamically coupled with cis H3K4 methylation levels. Together, these observations reveal an acetylation ‘chromatin switch’ on the H3 tail that modulates read-write accessibility in nucleosomes and resolves the long-standing question of why H3K4me3 levels are coupled with H3 acetylation.
Editor's evaluation
This is a fundamental study revealing that cis H3 tail acetylation promotes nucleosome accessibility to histone methyl readers and writers. The findings are supported by compelling evidence providing an explanation for long-observed mechanisms of chromatin crosstalk.
https://doi.org/10.7554/eLife.82596.sa0Introduction
In the epigenetic landscape, histone proteins are often variably chemically modified by ‘writer’ enzymes (Jenuwein and Allis, 2001; Strahl and Allis, 2000). Writer-installed post-translational modifications (PTMs) can then be recognized by ‘reader’ proteins and/or removed by ‘eraser’ enzymes. This interplay of PTMs comprises the ‘histone code,’ and has a central function in regulating chromatin organization and activity. For example, methylated/acylated lysine or methylated arginine residues of histones can recruit transcription factors to activate or repress transcription (Strahl and Allis, 2000; Su and Denu, 2016); mitotically phosphorylated serine/threonine residues can regulate reader binding established at earlier stages of the cell cycle (Rossetto et al., 2012) or ubiquitinated lysine can impact the maintenance of DNA methylation (Vaughan et al., 2021). As the complex language of histone PTMs is dissected, it has become clear that multivalent interactions with reader proteins can influence chromatin structure and DNA accessibility, thereby regulating gene transcription and other DNA-templated events (Su and Denu, 2016; Taylor and Young, 2021; Young et al., 2010). In this manner, combinatorial PTMs can more effectively engage different chromatin-binding modules, promoting distinct outcomes versus either PTM alone.
The bulk of chromatin PTM research has employed histone peptides, even though histones exist in vivo in a heteromeric complex with DNA (i.e. the nucleosome). Recent work, however, is making it increasingly clear that studying histone PTM engagement in the nucleosome context provides a more accurate understanding of the histone code. Particularly, the highly charged histone tails interact directly with nucleosomal DNA, restricting access to PTM recognition by reader proteins (Ghoneim et al., 2021; Marunde et al., 2022a; Morrison et al., 2018). Studies with BPTF PHD suggest acetylation releases the H3 N-terminal tail from the nucleosome surface, such that H3K4me3 becomes more readily engaged by the PHD finger (Marunde et al., 2022a; Morrison et al., 2018).
Considerable research effort has focused on dissecting the direct (and multivalent) engagement of chromatin via histone PTM-reader protein interactions. However, less appreciated are any indirect effects of PTMs on histone tail accessibility/nucleosome dynamics (e.g. via charge neutralization). In this report, we demonstrate enhanced nucleosome binding by a range of H3K4 readers when the histone tail is concomitantly acetylated (one or more of K9ac, K14ac, and K18ac). Furthermore, from in vitro enzymatic assays, we found that neighboring acetylation of the cis H3 tail is a prerequisite switch that enables the MLL1 complex (MLL1C: MLL1 SET domain, WDR5, RbBP5, Ash2L, and DPY30) (Rao and Dou, 2015) to robustly methylate H3K4. Consistent with this observation, mass spectrometric proteomic analyses of mammalian cells in a timed response to sodium butyrate (a broad-spectrum lysine deacetylase (KDAC) inhibitor) revealed a tight correlation of H3K4 methylation with cis acetylation. Our findings define a critical aspect of chromatin regulation: i.e., PTM crosstalk through acetylation-mediated tail accessibility. The findings also provide a molecular basis for the long-standing connection between H3K4 methylation and H3 acetylation in multiple eukaryotes (Garcia et al., 2007; Nightingale et al., 2007; Taverna et al., 2007), and resolve the directionality of these correlations: cis hyperacetylation of the H3 tail precedes, and is largely a prerequisite for, H3K4 methylation. Thus, the establishment of sites of H3K4me3 and activation of transcription occurs by a sequence of modifications of the same histone molecule.
Results
PHD finger readers show narrowed selectivity for histone tail PTMs on mononucleosomes versus peptides
How histone readers engage nucleosomes is an extensively researched area of chromatin biology. Most investigators characterize reader binding with PTM-defined histone peptides, although the domains often display a refined preference for similarly modified nucleosomes (Marunde et al., 2022b; Marunde et al., 2022a; Morgan et al., 2021; Morrison et al., 2018). To further assess this potential, we used the dCypher approach (Jain et al., 2020; Marunde et al., 2022b; Morgan et al., 2021; Weinberg et al., 2021) to measure the interactions of three PHD readers (from KDM7A, DIDO1, and MLL5: the queries) with PTM-defined peptides and nucleosomes (the potential targets). As might be expected (Jain et al., 2020), each GST-PHD fusion showed a preference for H3K4-methylated peptides and, particularly, for higher methyl states (i.e. KDM7A bound me2/me3, DIDO1 bound me1/me2/me3, and MLL1 bound me1/me2/me3; Figure 1A). In contrast, each GST-PHD reader was restricted to H3K4me3 over the lower methyl states on nucleosomes (Figure 1B) and displayed weaker relative binding (EC50rel: calculated as in Methods) to this PTM (Figure 1—figure supplement 1C–D). On further examination, we observed no impact of co-incident acetylation on H3K4me3 binding in the peptide context (Figure 1A: compare H3K4me3 to H3K4me3K9acK14acK18ac [hereafter H3K4me3triac]). In stark contrast, the binding of each GST-PHD reader to nucleosomal H3K4me3 was dramatically enhanced (~10–15 fold) by co-incident acetylation (i.e. H3K4me3triac; Figure 1B and Figure 1—figure supplement 1A–B). Additionally, there was no significant reader domain interaction with either nucleosome lacking H3K4 methylation or 147X601 DNA (nucleosomal DNA) alone (Figure 1—figure supplement 1B). A similar observation has also been made for the BPTF PHD domain (Marunde et al., 2022a; Morrison et al., 2018), suggesting the potential for a general mechanism. This led us to consider the possibility that histone tail lysine acetylation (Kac) might function beyond the recruitment of readers, and perhaps also impact H3K4 writers.
H3 N-terminal acetylation enhances MLL1C-mediated methylation of nucleosomal H3K4
To investigate if acetylation might enable a more catalytically accessible H3 N-terminus, we performed enzymatic assays with the MLL1 core complex (MLL1C; responsible for H3K4me3) (Rao and Dou, 2015; Sha et al., 2020) and defined nucleosome substrates ± accompanying acetylation (H3K9acK14acK18ac; hereafter H3triac) (see Methods). In an endpoint assay at constant enzyme and substrate concentrations (and [methyl-3H]-SAM donor), we observed a significant increase in net methylation when the H3 tail was also acetylated (Figure 2A). As expected, methylation by MLL1C sequentially decreased towards H3K4 mono-, di-, and tri-methylated nucleosomal substrates, being undetectable on H3K4me3 (which also confirmed MLL1C targeting of this specific residue). Despite this, methyl group incorporation to each H3K4 methyl state substrate was consistently enhanced by accompanying H3triac (Figure 2A). Of note, we also tested the viability of methyllysine analogs (MLAs) (Simon and Shokat, 2012) ± H3triac as MLL1C substrates and observed no activity, indicating their unsuitability for such studies (Figure 2—figure supplement 1D–E).
We next measured the steady-state methylation kinetics of MLL1C towards nucleosomes with each H3K4 methyl state ± H3triac, and again observed that acetylation increased methyltransferase activity (Figure 2B). Using an extra sum-of-squares F-test, methylation for the H3triac nucleosomes was indicative of positive cooperativity because of better fit (p=0.0216) to the Hill equation (Weiss, 1997) (compare Figure 2B and Figure 2—figure supplement 1C; Table 1 and Supplementary file 1). Because of the low level of enzymatic activity towards the unacetylated nucleosomes we could not make a statistically significant comparison between the Hill and Michaelis-Menten fits. There have been limited studies of MLL1C activity on nucleosomes (Park et al., 2019; Patel et al., 2011; Xue et al., 2019), so an overlooked potential allostery is understandable given the many possible interactions between this enzyme complex and substrate (Lee et al., 2021; Park et al., 2019).
On examining enzymatic parameters in detail, we noted that although overall kcat was ~17 fold greater for homotypic H3triac (over unmodified, H3 • H3) nucleosomes, the K0.5 values (substrate concentration at half-maximal velocity/half-saturation for an allosterically regulated enzyme) were indistinguishable (Table 1). Therefore, although MLL1C catalytic efficiency toward H3triac nucleosomes was enhanced by an increase in kcat, this catalytic efficiency was not driven by K0.5, which suggested no increase in relative binding affinity. To further examine this, we returned to the dCypher approach to examine potential binding between the MLL1C query and a selection of nucleosome targets: H3 • H3, H3N∆32 (lacking the first 32 residues of H3), Tailless (trypsin-digested nucleosomes to remove N- and C-terminal histone tails), and H3triac. At 50 mM NaCl, we observed no compelling difference in MLL1C binding to any of these targets (Figure 2C and Figure 2—figure supplement 1E). This is agreed with structural studies where binding between MLL1C and the nucleosome occurs primarily through interactions with DNA and, to a lesser degree, the H4 N-terminal tail (Lee et al., 2021; Park et al., 2019). Thus, the increased H3K4 methylation observed when the H3 N-terminal tail was acetylated is not due to enhanced MLL1C-nucleosome binding. Instead, H3 acetylation likely released the histone tail from the nucleosome, thereby increasing the apparent H3K4 concentration for MLL1C and enhancing methylation.
To definitively explore if the MLL1C methylation of H3K4 in H3triac nucleosomes was responding to cis and/or trans tail acetylation, we synthesized heterotypic substrates with only one H3K4 residue available in the cis or trans H3ac context (i.e. [H3triac • H3K4me3] vs. [H3 • H3tetraac]). Using these substrates with MLL1C, we observed >threefold enhanced methylation (over [H3 • H3] or [H3 • H3K4me3]) when H3 tail acetylation was cis but no significant impact when trans (Figure 2D). This would support a model where H3 acetylation on the same tail releases H3K4 for methylation by MLL1C and appear to exclude a significant contribution from trans-tail mechanisms.
Cellular level of H3K4 methylation is coupled to H3 N-terminal tail hyperacetylation
The above data suggested a molecular model for how the H3 N-terminal tail, via cis acetylation, becomes available for H3K4 reader binding or enzymatic modification in vitro. To determine if such acetylation could function as an accessibility switch in vivo, we developed a novel targeted middle-down mass spectrometry method to provide a single molecule quantitative measure of histone tail modification. We applied this method to acid-extracted histones from asynchronous MCF-7 breast cancer cells to measure the relationship between H3K4 methylation and tail acetylation on the same H3 proteoforms (Holt et al., 2021; Smith et al., 2013). As expected (Garcia et al., 2007; Peach et al., 2012; Young et al., 2009), the absolute amounts of H3K4me3 and higher Kac states (3ac, 4ac, and 5ac) across adjacent lysine residues were extremely low (Figure 3—figure supplement 1A). Nonetheless, H3K4me3 (<1% of total H3) was strictly associated with molecules that contained multiple acetylations (also <1% of total H3; Figure 3A). Given this relationship, we next addressed the hypothesis that increased lysine acetylation may release the H3 tail for more effective H3K4 methylation (i.e. acetylation precedes methylation). We treated MCF-7 cells with the KDAC inhibitor sodium butyrate and collected samples at multiple time points to measure the levels of H3K4 methylation with cis acetylation. H3 poly-acetylation rapidly increased upon butyrate treatment (Figure 3—figure supplement 1B and Supplementary file 2), as expected (Holt et al., 2019; Young et al., 2009). We observed an increase in all H3K4 methyl states relative to the unmodified state concomitant with increasing states of H3 acetylation (Figure 3B–C and Supplementary file 2), while H3K4me3 levels most dramatically increased in tandem with the 5ac H3 state (Figure 3D). An example of tandem mass spectra at each acetyl degree, showing the C4+1 ion series from which K4 stoichiometry is measured, is shown in Figure 3C. These findings support a direct link, where acetylation releases nucleosome-bound H3 tails to localized H3K4 methyltransferases for subsequent methylation (Figure 4).
To explore whether other H3 lysine methyl marks are similarly affected by H3 acetylation, we examined the co-occurrence of H3K9me1-me2-me3 with increasing H3 acetyl states ± butyrate treatment in asynchronous HEK-293 cells (Figure 3—figure supplement 1C–D; Supplementary file 3). In contrast to findings with H3K4me3, H3K9me3 did not accumulate with hyperacetylated H3 proteoforms (i.e. 3ac, 4ac, and 5ac). Recent in vitro studies showed tested H3K9 methyltransferases have increased activity to partially acetylated substrates (Trush et al., 2022), so our MS data would suggest this is not a common in vivo mechanism: explanations could include that K9 was generally acetylated (and thus occupied) during histone tail release (all penta-acetylated (5ac) H3 proteoforms consisted of acetylation at H3K9, K14, and K18 at minimum (Figure 3, Figure 3—figure supplement 1E and Supplementary file 4)); or that H3K9 methyltransferases, in contrast to H3K4 methyltransferases, were not generally localized to regions that will accumulate butyrate-induced hyperacetylation.
Taken together, these findings demonstrate a specific cis-dependent H3ac regulatory switch that functions to control H3K4 methylation output (Figure 4). We posit that such a ‘chromatin-switch’ is important to the ability of cells to translate short-term acetylation signals at gene promoters to longer-term heritable marks of epigenetic memory (Greer et al., 2014; Hörmanseder et al., 2017; Muramoto et al., 2010; Ng et al., 2003).
Discussion
While previous investigations identified a link between H3K4 methylation and H3 acetylation in diverse species (Garcia et al., 2007; Nightingale et al., 2007; Strahl et al., 1999; Taverna et al., 2007; Young et al., 2009) the molecular basis for this link was unknown, and we posit the H3 acetyl ‘chromatin switch’ defined herein is conserved across eukaryotes. Our new understanding of the dynamic structure of nucleosome histone tails, alternating between collapsed (i.e. nucleosome-bound) and accessible forms (Marunde et al., 2022a; Morrison et al., 2018), has made more plausible the notion that tail availability could be driven by combinatorial cis acetylation to directly promote H3K4 methylation. Histone tail lysine acetylation (Kac) can directly recruit residue-specific readers, e.g., bromodomains (Musselman et al., 2012), but acetylation also neutralizes the positive charge on lysine residues and relieves their interaction with negatively charged DNA (i.e. altering the histone tail-DNA binding equilibrium) (Marunde et al., 2022a; Morrison et al., 2018). In the nucleosome context, this decreased histone-DNA binding supports the increased engagement of reader domains that have no direct affinity for the Kac. Conversely, isolated histone tail peptides are ‘constitutively open’ (no DNA to engage), and thus not subject to this mode of regulation.
In this study, we showed that hyperacetylation of the H3 N-terminal tail promoted rapid accumulation of H3K4 methylation in cis, most likely by increasing the availability of the substrate residue to the MLL1C active site (Figure 4). This finding was supported by our in vitro enzymatic and in vivo mass spectrometric analyses. Methylation assays with MLL1C revealed significantly enhanced enzyme activity towards nucleosome substrates with co-incident acetylated (H3triac) over unmodified H3 (Figure 2A–B; Table 1). dCypher assays demonstrated that the acetylation-mediated increase in H3K4 methylation does not involve stabilized interactions between MLL1C and the nucleosome (Figure 2D). And providing major insight, heterotypic nucleosomes containing two distinct PTM-defined forms of histone H3 definitively showed that MLL1C activity was only enhanced in a cis H3 N-terminal triacetylation context (Figure 2D). In agreement with these in vitro findings, middle-down mass spectrometry showed that H3 hyperacetylation and H3K4 methylation co-occurred on the same histone tails in actively cycling cells; furthermore, upon butyrate treatment, H3K4 methylation increased for the most highly acetylated proteoforms (Figure 3). The in vivo relationship required higher degrees of acetylation (preferring at least four acetyl groups per molecule: e.g. H3K9acK14acK18acK23acK27ac) for the most effective conversion to the H3K4me3 state. This could be a function of yet to be explained in vivo acetylation hierarchies by KATs that are outside the scope of this study but will be important to resolve. Together with binding studies that identify the positive impact of cis H3 tail hyperacetylation on H3K4 reader engagement (Figure 1 and Figure 1—figure supplement 1; Marunde et al., 2022a; Morrison et al., 2018), our findings suggest a molecular switch that governs when and where the histone H3 N-terminus is available for H3K4-related transactions. Such a switch could be used to establish and heritably maintain the location and function of transcriptional promoters across the genome.
A continued observation from this study is that the binding preference of readers (in this case PHD-fingers that engage histone H3K4) narrows on nucleosomes relative to histone peptides (Figure 1 and Figure 1—figure supplement 1; Marunde et al., 2022a; Morgan et al., 2021). Such differences highlight the importance of using a more representative target to identify the most likely/physiologically relevant interactions. However, we also add to the literature confirming the importance of a nucleosome substrate for enzymatic studies (Strelow et al., 2016; Stützer et al., 2016). In this regard our steady-state kinetic methylation assays with MLL1C revealed an intriguing (and previously undescribed) positive cooperativity with its preferred nucleosome substrates (Figure 2B vs. Figure 2—figure supplement 1C). It is important to consider the suggestion that allosteric factors could regulate interactions between MLL1C and the nucleosome, especially since previous kinetic analyses of this [enzyme: substrate] pair did not address such behavior (Park et al., 2019; Xue et al., 2019). However, positive cooperativity was not evident at the level of [MLL1C: nucleosome] binding, which was independent of substrate identity (i.e. ±H3triac) at ionic conditions similar to the catalytic assay (Figure 2C). Still to be determined are the signals that drive and regulate this cooperativity and its function in MLL1-catalyzed methylation, especially in a higher-order chromatin context.
Taken together, our study highlights a previously unrecognized regulatory mechanism for how writers might engage the histone H3 tail in vivo. Although this work focused on H3K4, it will be important to ascertain the consequence of acetylation (or acylation)-mediated changes in accessibility and the function of modifiers and readers of the other lysines on H3, as well as on the other core histones (H2A, H2B, and H4). Underscoring the need for such studies, we note recent in vitro analyses employing unmodified nucleosomes sequentially targeted by purified KATs and KMTs suggesting that the acetylation landscape can impact multiple methyltransferases (Trush et al., 2022). While that study reported enhanced G9a activity towards H3K9 as a function of p300-mediated histone acetylation, our mass spectrometric analysis from cells did not show a connection between H3K9me3 and H3 hyperacetylation (Figure 3—figure supplement 1C–D). The field will require a more detailed analysis of the in vivo contributions of various KATs/KDACs to regulate histone tail accessibility (in cis and/or trans) for other chromatin-modifying enzymes to further uncover the molecular details of any sequential histone code.
Finally, we note that multiple studies have identified H3K4 methylation and H3 acetylation as active marks because of their co-occupancy on the promoters and gene bodies of transcribed genes (Santos-Rosa et al., 2002; Strahl et al., 1999; Wozniak and Strahl, 2014). Our findings agree with these observations, but also uncover a previously unrecognized mechanism of H3 cross-talk that impinges on fundamental functions of H3 acetylation and K4 methylation in gene regulation. While this work shows the impact of cis tail H3 acetylation on the H3K4 writer MLL1, we note a companion study highlighting how the resulting PTM signature (cis >> trans-H3K4me3triac) promotes BPTF-nucleosome engagement (Marunde et al., 2022a). Given its central importance, we predict this mechanism may be a target for dysregulation in human disease.
Methods
Expression and purification of GST-tagged PHD reader proteins
Expression constructs for GST-tagged PHD domains from KDM7A (Uniprot #Q6ZMT4; residues 1–100), DIDO1 (Uniprot #Q9BTC0; residues 250–340), and MLL5 (Uniprot #Q8IZD2; residues 100–180) were synthesized in a pGEX-4T-1 vector (BioMatik Corporation) and provided by Dr. Mark T. Bedford (UT MD Anderson Cancer Center). Recombinant proteins were expressed and purified as described (Jain et al., 2020). See Supplementary file 5 for details about the constructs.
Expression, purification, and assembly of the MLL1 core complex (MLL1C)
Methods for the expression, purification, and assembly of the MLL1 core complex (MLL1C: MLL1 SET domain, WDR5, RbBP5, Ash2L, and DPY30) were adapted from published protocols (Usher et al., 2021). A polycistronic recombinant expression construct containing the MLL1 SET domain (Uniprot Q03164; residues 3745–3969), WDR5 (Uniprot P61964; residues 2–334), RbBP5 (Uniprot Q15291; residues 1–538), and Ash2L (Uniprot Q9UBL3-3; residues 1–534) in pST44 vector (aka. MWRA construct) was a kind gift from Dr. Song Tan (Tan et al., 2005). WDR5 was cloned with an N-terminal hexahistidine (6HIS) tag and TEV protease site to enable purification by immobilized metal affinity chromatography (IMAC), and tag removal by enzymatic cleavage. See Supplementary file 5 for details about the constructs. Rosetta pLysS E. coli cells were transformed with the plasmid and grown on LB plates with 50 μg/mL carbenicillin. Single colonies were used to inoculate 50 mL starter cultures of Terrific Broth (TB) containing 50 μg/mL carbenicillin and grown at 37 °C for 16 hr. This culture was transferred to 1 L of TB+ carbenicillin and grown to OD600 ~0.6 (37°C, 200 RPM, ~4 hr). Recombinant protein expression was induced with 1 mM Isopropyl β-D-1-thiogalactopyranoside (IPTG, Sigma; 16 °C, 200 RPM, 20 hr). Cells were harvested by centrifugation (5000 RPM, 4 °C) and flash frozen in liquid nitrogen.
Frozen cell pellets were resuspended in 50 mL of lysis buffer (50 mM Tris-HCl pH 7.5, 300 mM NaCl, 30 mM imidazole, 3 mM dithiothreitol, and 1 µM ZnCl2) containing 0.5 mg/mL lysozyme (Sigma), 250 U Pierce Universal Nuclease (Thermo Fisher), and an EDTA-free protease inhibitor cocktail (Roche) and rotated at 4 °C for 1 hr. The resultant mixture was then sonicated [five cycles of 30 s on/30 s off at 50% output] and centrifuged at 4 °C, 15,000 RPM for 35 min. The clarified lysate has flowed over a 5 mL HisTrap nickel column (Cytiva) using an AKTA Pure FPLC (Cytiva) at 0.5 mL/min; all FPLC steps were conducted at 4 °C. Unbound molecules were removed with 20 column volumes of wash buffer (WB: 50 mM Tris-HCl, pH 7.5, 300 mM NaCl, 30 mM imidazole, 3 mM DTT, and 1 µM ZnCl2 at 2 mL/min). The 6HIS-tagged MWRA was eluted in a 15-column volume linear gradient from WB to Elution Buffer (WB+ 500 mM imidazole) at 2 mL/min. Fractions containing 6HIS-tagged MWRA were identified by SDS-PAGE, pooled, and supplemented with 6HIS-tagged TEV protease (purified as described: Nautiyal and Kuroda, 2018) at a 1:100 enzyme to substrate molar ratio to cleave the 6HIS-tag on WDR5. This mixture was dialyzed against three changes of WB (each 2 L for at least 4 hr at 4 °C) and 6HIS-TEV removed from cleaved MWRA via IMAC (Usher et al., 2021). MWRA flow-through protein solution was concentrated to ~15 mL using a 30 kDa MWCO centrifugal filter (EMD Millipore), ensuring not to concentrate to where the solution became yellow/cloudy and viscous. MWRA complex was resolved over a HiLoad 16/60 Superdex 200 pg gel filtration (GF) column (Cytiva) pre-equilibrated in 20 mM Tris-HCl, pH 7.5, 300 mM NaCl, 1 mM TCEP, and 1 µM ZnCl2. Fractions containing stoichiometric MWRA sub-complex were identified by SDS-PAGE, pooled, and concentrated to ~15 mL.
HisDPY30 was expressed, purified, and cleaved to remove the 6HIS-tag as described (Patel et al., 2009; Usher et al., 2021). A twofold molar excess of DPY30 was added to the MWRA sub-complex and incubated on ice for 1 hr. Following incubation, the resulting MLL1C was isolated by gel filtration as described for MWRA, with fractions containing the stoichiometric complex identified by SDS-PAGE, pooled, concentrated to ~10 µM, and flash frozen. For dCypher experiments with MLL1C, the 6HIS-tag was retained on DPY30.
Peptides
All peptides were synthesized at the UNC peptide synthesis core facility (RRID: SCR_017837), using Fmoc solid phase synthesis, on an automated peptide synthesizer (PTI Symphony or CEM Liberty Blue). The peptides were purified by preparative RP-HPLC and characterized by MALDI-TOF MS and analytical HPLC.
PTM-defined nucleosomes
All PTM-defined nucleosomes (Supplementary file 5; homotypic unless stated otherwise) were from the dNuc or versaNuc portfolios (EpiCypher). PTMs were confirmed by mass-spectrometry and immunoblotting (if an antibody was available) (Goswami et al., 2021; Marunde et al., 2022b; Marunde et al., 2022a; Weinberg et al., 2019).
Nucleosomes with methyllysine analogs (MLA) at H3K4 were created by the versaNuc approach (Marunde et al., 2022a). In brief, histone H3 peptides (aa1-31; A29L) containing K4C and any PTMs of interest were site-specifically reacted with the corresponding haloalkylamine: (2-bromoethyl) trimethyl ammonium bromide for KCme3; 2-chloro-N,N-dimethyl-ethylamine hydrochloride for KCme2; or 2-chloroethyl(methyl)ammonium chloride for (KCme1) under SN2 reaction conditions as previously (Simon, 2010; Simon and Shokat, 2012), and purified for individual ligation to an H3 tailless nucleosome precursor (H3.1NΔ32 assembled on 147 bp 5’ biotinylated 601 DNA; #16–0016). The resulting nucleosomes (assembled at 50–100 μg scale) contained minimal free DNA (<5%), undetectable levels of peptide precursor, and ≥90% fully-defined full-length H3.1 (e.g. Figure 2—figure supplement 1D–E).
Heterotypic nucleosomes were assembled from PTM-defined histone octamers containing N-terminally bridged H3 dimers, and the bridge was removed in a scarless manner by approaches to be described elsewhere (Manuscript in Preparation). Heterotypic identity was confirmed at all synthesis steps by analyses additional to those used for homotypics, including Nuc-MS on representative final nucleosomes (Schachner et al., 2021). Heterotypic nomenclature describes each PTM-defined histone in the nucleosome, such that [H3K4me3K9acK14acK18ac • H3] vs. [H3K4me3 • H3K9acK14acK18ac] contain the same total PTM complement but distributed cis or trans on the H3 N-termini.
dCypher assays
dCypher binding assays with PTM-defined nucleosomes were performed under standard conditions that titrate query (e.g. epitope-tagged reader domain(s)) to a fixed concentration of target (e.g. biotinylated PTM-defined nucleosome) with the appropriate Alpha Donor and Acceptor beads (Perkin Elmer) (Jain et al., 2020; Marunde et al., 2022b; Weinberg et al., 2019). Binding curves [query: target] were generated using a non-linear 4PL curve fit in Prism 9.0 (GraphPad). For each query, the relative EC50 (EC50rel) and hillslope values were derived from the best binding target. EC50rel is the half-maximal signal for the specified target. Where necessary, we excluded query concentration values determined to be beyond a query’s hook point (signal inhibition due to query exceeding bead saturation). The EC80rel was selected as the optimal probing concentration for discovery screens because of the robust signal-to-background and to provide the best opportunity to bind targets without saturating the primary target signal. To compute EC80rel values, we used the formula ECFrel = (F/(100 – F))1/H × EC50rel; F=80; and H=hillslope.
Briefly, 5 μL of GST-tagged reader domain was incubated with 5 μL of 10 nM biotinylated nucleosomes (e.g. EpiCypher #16–9001) for 30 min at room temperature in 20 mM HEPES pH 7.5, 250 mM NaCl, 0.01% BSA, 0.01% NP-40, 1 mM DTT in a 384-well plate. A mix of 10 uL of 2.5 μg/mL glutathione acceptor beads (PerkinElmer, AL109M) and 5 μg/mL streptavidin donor beads (PerkinElmer, 6760002) was prepared in 20 mM HEPES pH 7.5, 250 mM NaCl, 0.01% BSA, 0.01% NP-40 and added to each well. The plate was incubated at room temperature in subdued lighting for 60 min, and AlphaLISA signal was measured on a PerkinElmer 2104 EnVision (680 nm laser excitation, 570 nm emission filter ± 50 nm bandwidth). Each binding interaction was performed in duplicate in a 20 µL mix in 384 well plates.
MLL1C binding assays (Figure 2C) were performed as above except using Nickel-chelate acceptor beads (10 μg/mL; Perkin Elmer AL108M), streptavidin donor beads (20 μg/mL; Perkin Elmer) and modified assay buffer (20 mM Tris pH 7.5 + 50 mM NaCl, 0.01% BSA, 0.01% NP-40, and 1 mM DTT); [NaCl] was optimized via a titration assay and 50 mM chosen for subsequent analyses.
In vitro methylation assays
Methylation assays (Shinsky et al., 2015) were performed at 15 °C for 3 hr using purified MLL1C enzyme and nucleosome substrate in a reaction volume of 20 µL [in 50 mM HEPES, pH 8.0, 1 mM DTT, 1 µM ZnCl2; 10 µM of 9:1 S-adenosyl-L-methionine (SAM) p-toluenesulfonate salt (Sigma) to S-adenosyl-L-[methyl-3H]-methionine ([methyl-3H]-SAM) (PerkinElmer)]. Concentrations of MLL1C and NaCl were optimized from 2D-titration methylation assays at [0, 4, and 40 nM MLL1C] and [0, 50, and 300 mM NaCl] with 2 µg of the chicken oligo-nucleosome substrate (Figure 2—figure supplement 1A–B; Morris et al., 2007). It is notable that, across a range of concentrations, MLL1C stability decreases as temperature increases and methyltransferase activity has been reported to be enhanced in sub-physiological NaCl concentrations (Namitz et al., 2019; Shinsky et al., 2015). For endpoint methylation assays, 4 nM MLL1C and 100 nM nucleosome substrates were tested as above. For steady-state kinetics, 4 nM MLL1C was incubated with a nucleosome substrate titration (0, 23.4, 35.1, 52.7, 79, 119, 178, 267, and 400 nM) and reactions quenched with 5 µL of 5 X SDS loading dye. For steady-state kinetic assays, 0.5 µg bovine serum albumin was added to each reaction after quenching to act as a loading guide. To analyze methylation, quenched reactions were resolved by 15% Tris-Glycine SDS-PAGE. Gels were stained with Coomassie dye and bands containing mononucleosomes were excised (with serum albumin as a supporting lane marker) and incubated in a solution of 50% Solvable (PerkinElmer) and 50% water at 50 °C for 3 hr. Mixture and gel slices were then combined with 10 mL of Hionic-Fluor scintillation fluid (PerkinElmer), dark-adapted overnight, and radioactivity measured on a Liquid Scintillation Counter (Beckman Coulter).
Middle-down mass spectrometry of MCF-7 and HEK-293 cells ± KDAC inhibition
MCF-7 breast cancer cells (ATCC HTB-22) were grown in MEM (Gibco) supplemented with 10% fetal bovine serum (VWR), 100 I.U. penicillin, 100 µg/mL streptomycin (Corning), 0.01 mg/mL human recombinant insulin (Gibco), and 5 µg/mL plasmocin (Invivogen) at 37 °C and 5% CO2. HEK-293 (ATCC CRL-1573) was grown in DMEM (Gibco) supplemented with 10% fetal bovine serum (Corning), 100 I.U. penicillin, and 100 µg/mL streptomycin (Gibco) at 37 °C and 5% CO2. Both cell lines were authenticated via STR profiling and confirmed to be Mycoplasma negative.
For mass spectrometric (MS) analysis ± KDAC inhibition, cells were cultured in 150 mm dishes to ~80% confluence and treated with 5 mM sodium butyrate (or equivalent volume of water) in triplicate for 0, 10, 20, 30, 60, 120 min. Cells were washed with cold PBS (11.9 mM phosphates, 137 mM NaCl, 2.7 mM KCl) to remove residual sodium butyrate, harvested by scraping, and flash frozen in liquid nitrogen. Histones were acid extracted after nuclei isolation as described (Holt et al., 2021). Isolated histones were resuspended in 85 µL 5% acetonitrile, 0.2% trifluoroacetic acid (TFA), and resolved by offline high-performance liquid chromatography (HPLC) as described (Holt et al., 2021). Reverse Phase HPLC fractionation was performed with a U3000 HPLC system (Thermo Fisher) with a 150 × 2.1–mm Vydac 218TP 3 µm C18 column (HiChrom # 218TP3215), at a flow rate of 0.2 mL/min using a linear gradient from 25 – 60% B in 60 min. The composition of buffers used was A: 5% acetonitrile and 0.2% TFA and B: 95% acetonitrile and 0.188% TFA. After chromatographic separation and fraction collection, histone H3.1 was selected, diluted in 100 mM ammonium acetate (pH = 4), and digested with Glu-C protease (Roche) at 10:1 protein:enzyme for 1 hr at room temperature prior to mass spectrometric analysis.
The digested samples were diluted to 2 µg/µL. Online HPLC was performed on a U3000 RSLC nano Pro-flow system using a C3 column (Zorbax 300 SB-C3, 5 µm; Agilent). Samples were maintained at 4 °C and 1 µL injected for each analysis using a microliter pickup. A linear 70 min gradient of 4–15% B was used (Buffer A: 2% acetonitrile, 0.1% formic acid and Buffer B: 98% acetonitrile, and 0.1% formic acid) with a flow rate of 0.2 µL/min. The column eluant was introduced into an Orbitrap Fusion Lumos mass spectrometer (Thermo Fisher) by nano-electrospray ionization. A static spray voltage of 1900 V and an ion transfer tube temperature of 320 °C were set for the source.
A Fusion Lumos mass spectrometer was used to generate MS data. The 9th charge state of histone H3 was targeted for analysis. MS1 analysis was acquired in the orbitrap with a normal mass range and a 60 k resolution setting in positive ion mode. An Automatic Gain Control (AGC) target of 5.0E5 with a 200 ms maximum injection time, three microscans, and a scan range of 585–640 m/z was used to identify desired ions for fragmentation. MS2 acquisition was performed in both orbitrap and ion trap modes. Both modes used electron transfer dissociation (ETD), a reaction time of 18ms, and an injection time of 200 ms. A normal scan range was used for the orbitrap mode with a resolution setting of 30 k and an AGC target of 5.0E5, with two micro scans. A narrow scan range of 470–530 m/z, targeting ions indicative of K4 modification states, was used for the ion trap mode MS2 with an ACG target of 3.0E4, quadrupole isolation, maximum injection time of 100ms, and eight microscans.
These MS methods were used with two technical replicates per biological replicate (n=3). An MS ion trap mode with a targeted mass list was used to increase sensitivity to identify known low-abundance K4me3-containing proteoforms. The ion trap MS2 spectra were averaged for each H3 acetyl state based on known retention times, and the intensities of ions indicative of the K4 methylation states were manually recorded. Retention times for each acetyl state were approximated as 0ac 35–40 min, 1ac 45–55 min, 2ac 55–60 min, 3ac 62–68 min, 4ac 69–73 min, and 5ac 74–78 min. Precursor mass was used as an additional confirmation and filter of the correct acetyl state. For scans yielding low signal and high noise (i.e. 5ac at 0 min butyrate treatment), data were manually curated before averaging. Within acetyl states, the relative proportion of fragment ions for unmodified, mono-, di-, and tri-methylation of the H3K4 ion at respective m/z of 474, 488, 502, and 516 were recorded per MS run. Values were averaged across replicates of the same conditions and normalized to one hundred percent. A two-tailed t-test was used for significance. Raw MS data is available at (ftp://massive.ucsd.edu/MSV000089089/, ftp://massive.ucsd.edu/MSV000091578/).
The MS method used here is highly targeted to most effectively address the mechanistic or single molecule link between H3 acetylation degree and H3K4 occupancy. The strategy used prioritizes the optimization of efficient selection of acetyl degree and of the signal-to-noise for the C4+1 ion series. This is to the exclusion of other information that can typically be derived from untargeted approaches. For example, because trapping mass spectrometers are limited in the number of ions, we dispose of unnecessary ions to the gas phase to enrich the C4 ions series. This method provides a direct measure of the stoichiometry of K4un, K4me1, K4me2, and K4me3 within an acetyl state.
Data availability
Raw MS data is publicly available and has been uploaded to the UCSD MassIVE database (MSV000089089 and MSV000091578). All analyzed data are reported in the manuscript and Supporting Files.
References
-
Histone tail Conformations: A fuzzy affair with DNATrends in Biochemical Sciences 46:564–578.https://doi.org/10.1016/j.tibs.2020.12.012
-
High-throughput quantitative top-down Proteomics: Histone H4Journal of the American Society for Mass Spectrometry 30:2548–2560.https://doi.org/10.1007/s13361-019-02350-z
-
Identification of histone H3 lysine 36 acetylation as a highly conserved histone modificationJournal of Biological Chemistry 282:7632–7640.https://doi.org/10.1074/jbc.M607909200
-
Perceiving the epigenetic landscape through Histone readersNature Structural & Molecular Biology 19:1218–1227.https://doi.org/10.1038/nsmb.2436
-
Cross-talk between Histone modifications in response to Histone Deacetylase inhibitorsJournal of Biological Chemistry 282:4408–4416.https://doi.org/10.1074/jbc.M606773200
-
Cryo-EM structure of the human Mll1 core complex bound to the NucleosomeNature Communications 10:5540.https://doi.org/10.1038/s41467-019-13550-2
-
On the mechanism of multiple Lysine methylation by the human mixed lineage leukemia Protein-1 (Mll1) core complexThe Journal of Biological Chemistry 284:24242–24256.https://doi.org/10.1074/jbc.M109.014498
-
Hijacked in cancer: the Kmt2 (MLL) family of MethyltransferasesNature Reviews. Cancer 15:334–346.https://doi.org/10.1038/nrc3929
-
Insights on the regulation of the MLL/Set1 family Histone MethyltransferasesBiochimica et Biophysica Acta. Gene Regulatory Mechanisms 1863:194561.https://doi.org/10.1016/j.bbagrm.2020.194561
-
Biochemical reconstitution and Phylogenetic comparison of human Set1 family core complexes involved in Histone methylationThe Journal of Biological Chemistry 290:6361–6375.https://doi.org/10.1074/jbc.M114.627646
-
Installation of site-specific methylation into Histones using methyl Lysine analogsCurrent Protocols in Molecular Biology Chapter 21:mb2118s90.https://doi.org/10.1002/0471142727.mb2118s90
-
Proteoform: a single term describing protein complexityNature Methods 10:186–187.https://doi.org/10.1038/nmeth.2369
-
The use of Nucleosome substrates improves binding of SAM analogs to Setd8Journal of Biomolecular Screening 21:786–794.https://doi.org/10.1177/1087057116656596
-
Reading the combinatorial histone languageACS Chemical Biology 11:564–574.https://doi.org/10.1021/acschembio.5b00864
-
The Pst44 Polycistronic expression system for producing protein complexes in Escherichia coliProtein Expression and Purification 40:385–395.https://doi.org/10.1016/j.pep.2004.12.002
-
Combinations of Histone post-Translational modificationsThe Biochemical Journal 478:511–532.https://doi.org/10.1042/BCJ20200170
-
Enzymatic Nucleosome Acetylation selectively affects activity of Histone Methyltransferases in vitroBiochimica et Biophysica Acta. Gene Regulatory Mechanisms 1865:194845.https://doi.org/10.1016/j.bbagrm.2022.194845
-
Probing multiple enzymatic methylation events in real time with NMR spectroscopyBiophysical Journal 120:4710–4721.https://doi.org/10.1016/j.bpj.2021.09.034
-
Chromatin regulation through Ubiquitin and Ubiquitin-like Histone modificationsTrends in Biochemical Sciences 46:258–269.https://doi.org/10.1016/j.tibs.2020.11.005
-
The Hill equation revisited: uses and misusesFASEB Journal 11:835–841.https://doi.org/10.1096/fasebj.11.11.9285481
-
Hitting the "mark": interpreting Lysine methylation in the context of active transcriptionBiochimica et Biophysica Acta 1839:1353–1361.https://doi.org/10.1016/j.bbagrm.2014.03.002
-
High throughput characterization of Combinatorial Histone codesMolecular & Cellular Proteomics 8:2266–2284.https://doi.org/10.1074/mcp.M900238-MCP200
-
The significance, development and progress of high-throughput Combinatorial Histone code analysisCellular and Molecular Life Sciences 67:3983–4000.https://doi.org/10.1007/s00018-010-0475-7
Article and author information
Author details
Funding
American Cancer Society (PF-20-149-01-DMC)
- Kanishk Jain
National Institute of General Medical Sciences (R35GM126900)
- Brian D Strahl
National Institute of General Medical Sciences (R01GM139295)
- Nicolas L Young
National Cancer Institute (T32CA217824)
- Kanishk Jain
National Institute of General Medical Sciences (P01AG066606)
- Nicolas L Young
National Cancer Institute (R01CA193235)
- Nicolas L Young
National Cancer Institute (R01CA140522)
- Michael S Cosgrove
National Cancer Institute (R43CA236474)
- Michael-Christopher Keogh
National Institute of General Medical Sciences (R44GM117683)
- Zu-Wen Sun
National Cancer Institute (R44CA214076)
- Michael-Christopher Keogh
National Institute of General Medical Sciences (R44GM116584)
- Zu-Wen Sun
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
KJ is supported by a Postdoctoral Training Fellowship from the National Institutes of Health (NIH; T32CA217824) to the UNC Lineberger Cancer Center and a Postdoctoral Fellowship from the American Cancer Society (PF-20-149-01-DMC). This work was also supported by NIH grants to NLY (R01GM139295, P01AG066606, and R01CA193235), MSC (R01CA140522), EpiCypher (R43CA236474, R44GM117683, R44CA214076, and R44GM116584), and to BDS (R35GM126900). We thank colleagues for the generous supply of materials (see Methods) and members of the Cosgrove, Young, EpiCypher, and Strahl labs for helpful discussions and suggestions.
Copyright
© 2023, Jain et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 2,260
- views
-
- 378
- downloads
-
- 20
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Biochemistry and Chemical Biology
- Cell Biology
Activation of the Wnt/β-catenin pathway crucially depends on the polymerization of dishevelled 2 (DVL2) into biomolecular condensates. However, given the low affinity of known DVL2 self-interaction sites and its low cellular concentration, it is unclear how polymers can form. Here, we detect oligomeric DVL2 complexes at endogenous protein levels in human cell lines, using a biochemical ultracentrifugation assay. We identify a low-complexity region (LCR4) in the C-terminus whose deletion and fusion decreased and increased the complexes, respectively. Notably, LCR4-induced complexes correlated with the formation of microscopically visible multimeric condensates. Adjacent to LCR4, we mapped a conserved domain (CD2) promoting condensates only. Molecularly, LCR4 and CD2 mediated DVL2 self-interaction via aggregating residues and phenylalanine stickers, respectively. Point mutations inactivating these interaction sites impaired Wnt pathway activation by DVL2. Our study discovers DVL2 complexes with functional importance for Wnt/β-catenin signaling. Moreover, we provide evidence that DVL2 condensates form in two steps by pre-oligomerization via high-affinity interaction sites, such as LCR4, and subsequent condensation via low-affinity interaction sites, such as CD2.
-
- Biochemistry and Chemical Biology
- Physics of Living Systems
For drugs to be active they have to reach their targets. Within cells this requires crossing the cell membrane, and then free diffusion, distribution, and availability. Here, we explored the in-cell diffusion rates and distribution of a series of small molecular fluorescent drugs, in comparison to proteins, by microscopy and fluorescence recovery after photobleaching (FRAP). While all proteins diffused freely, we found a strong correlation between pKa and the intracellular diffusion and distribution of small molecule drugs. Weakly basic, small-molecule drugs displayed lower fractional recovery after photobleaching and 10- to-20-fold slower diffusion rates in cells than in aqueous solutions. As, more than half of pharmaceutical drugs are weakly basic, they, are protonated in the cell cytoplasm. Protonation, facilitates the formation of membrane impermeable ionic form of the weak base small molecules. This results in ion trapping, further reducing diffusion rates of weakly basic small molecule drugs under macromolecular crowding conditions where other nonspecific interactions become more relevant and dominant. Our imaging studies showed that acidic organelles, particularly the lysosome, captured these molecules. Surprisingly, blocking lysosomal import only slightly increased diffusion rates and fractional recovery. Conversely, blocking protonation by N-acetylated analogues, greatly enhanced their diffusion and fractional recovery after FRAP. Based on these results, N-acetylation of small molecule drugs may improve the intracellular availability and distribution of weakly basic, small molecule drugs within cells.