The RNA chaperone Hfq is an Sm protein that facilitates base pairing between bacterial small RNAs (sRNAs) and mRNAs involved in stress response and pathogenesis. Hfq possesses an intrinsically disordered C-terminal domain (CTD) that may tune the function of the Sm domain in different organisms. In Escherichia coli, the Hfq CTD increases kinetic competition between sRNAs and recycles Hfq from the sRNA-mRNA duplex. Here, de novo Rosetta modeling and competitive binding experiments show that the acidic tip of the E. coli Hfq CTD transiently binds the basic Sm core residues necessary for RNA annealing. The CTD tip competes against non-specific RNA binding, facilitates dsRNA release, and prevents indiscriminate DNA aggregation, suggesting that this acidic peptide mimics nucleic acid to auto-regulate RNA binding to the Sm ring. The mechanism of CTD auto-inhibition predicts the chaperone function of Hfq in bacterial genera and illuminates how Sm proteins may evolve new functions.https://doi.org/10.7554/eLife.27049.001
Host factor for RNA phage Qβ replication (Hfq) is found in most sequenced bacterial genomes (Sun et al., 2002) and plays a well characterized role in post-transcriptional regulation by small non-coding RNA (sRNA) (Gottesman et al., 2006; Storz et al., 2011). Regulation by Hfq and sRNAs is important for controlling the expression of metabolic, stress-response and virulence genes in many genera (Feliciano et al., 2016). Hfq binds sRNA and facilitates interactions between sRNAs and their mRNA targets (Zhang et al., 2002; Moller et al., 2002). To chaperone sRNA target recognition, Hfq must select its substrates from a large pool of nucleic acid in the cell and efficiently dissociate from its products at the end of each RNA annealing cycle (Rajkowitsch et al., 2007).
E. coli Hfq contains an Sm-like domain (residues 7–65) that oligomerizes into a homohexameric ring with two sequence-specific RNA-binding faces. The proximal face of the ring is highly conserved and binds to uridines (Zhang et al., 2002; Schumacher et al., 2002) at the 3’-ends of bacterial small non-coding RNA (sRNA) that resemble a classic Sm binding site (Zhou et al., 2014). In E. coli and many Gram negative bacteria, the distal face of Hfq binds to AAN triplet repeats (Mikulecky et al., 2004; Link et al., 2009) found in mRNA leaders (Link et al., 2009; Soper et al., 2011) and certain sRNAs (Schu et al., 2015; Małecka et al., 2015). In addition to these sequence-specific RNA binding sites, arginine-rich basic patches at the rim of the E. coli Hfq hexamer interact with the sRNA body (Zhang et al., 2002; Otaka et al., 2011; Sauer et al., 2012; Ishikawa et al., 2012; Zhang et al., 2013) and facilitate annealing with target mRNAs (Panja et al., 2013; Zheng et al., 2016).
Like many RNA binding proteins, Hfq also possesses intrinsically disordered domains that have the potential to modulate the function of the core Sm ring. The E. coli Hfq Sm domain is flanked by a short, disordered, N-terminal domain (NTD; residues 1–6), which protrudes from the proximal face of the hexamer, and a longer disordered C-terminal domain (CTD; residues 66–102), which extends from the rim (Beich-Frandsen et al., 2011a; Vincent et al., 2012). NMR chemical shift perturbations from a comparison of full-length Hfq (Hfq102) and a truncated variant lacking the CTD (Hfq65) suggested that some part of the CTD contacts residues on the rim of the hexamer, although the specificity of these proposed contacts was uncertain since they occur near where the CTD protrudes from the ring (Beich-Frandsen et al., 2011a; Vincent et al., 2012).
The functional importance of the CTD for sRNA regulation has also been unclear, owing to the conflicting results of different studies (Sonnleitner et al., 2004; Olsen et al., 2010; Večerek et al., 2008; Salim et al., 2012). Using a combination of biophysical and genetic approaches, we recently showed that the CTD displaces RNA from the rim and proximal face of Hfq (Santiago-Frangos et al., 2016), with two important consequences. First, release of annealed dsRNA from the arginine-rich rim is accelerated, increasing Hfq turnover. Second, kinetic competition between sRNAs is increased, which allows dominant sRNAs to bind to Hfq and accumulate in the cell, while weaker competitors are degraded (Santiago-Frangos et al., 2016). The latter creates a hierarchy of sRNA regulation that depends on the CTD.
The mechanism by which the CTD displaces RNA from the core (Sm domain and NTD) of Hfq is unknown. No common sequence motifs have been identified in the CTD (Sun et al., 2002; Vincent et al., 2012; Weichenrieder, 2014; Sobrero and Valverde, 2012; Fortas et al., 2015; Updegrove et al., 2016), which varies in length and composition across bacteria (Attia et al., 2008; Schilling and Gerischer, 2009; Baba et al., 2010). This diversity is characteristic of disordered peptides, which rapidly evolve via non-conservative substitutions and indels (Liu et al., 2008; Brown et al., 2010; Light et al., 2013). Two models could explain the displacement of RNA by CTDs in E. coli Hfq. The ‘polymer brush’ model suggests the CTDs passively obstruct RNA binding sites. This model is attractive since it depends only on the length and flexibility of the CTD. In contrast, the ‘nucleic acid mimic’ model suggests that the CTDs specifically bind to basic core residues and actively compete against nucleic acids. Given the divergence of CTD and core sequences, this model predicts that CTD auto-regulation is likely in some Hfq clades but not others.
In this study, we use de novo modeling and biophysical experiments to determine the mechanism by which the CTD regulates Hfq activity. We propose that the acidic CTD tip transiently binds the rim of E. coli Hfq and makes distributed interactions with basic residues, thereby modulating RNA and DNA binding and RNA annealing. Applying our modeling procedure to Hfqs from other bacteria demonstrates that stable interactions between the acidic CTD and the basic rim correlate with the importance of Hfq for sRNA regulation in that host. Thus, our proposed mechanism of CTD auto-inhibition provides a basis for predicting the function of Hfq in different bacteria. Our approach may be useful for predicting the sequence–function relationship of disordered domains in other partially disordered proteins.
To search for conserved features or amino acid motifs amongst the highly heterogeneous Hfq CTDs, we first built a phylogenetic tree (Figure 1—figure supplement 1) from the multiple sequence alignment of nearly 1000 non-redundant sequences (see Methods). The cluster containing E. coli Hfq contained many other Hfq variants previously identified as functional in RNA annealing (Zheng et al., 2016) or sRNA regulation (Gottesman et al., 2006). Therefore, we examined the sequence logo of this cluster of 222 Hfqs in more detail (Figure 1A).
The start of the CTD region is delineated by a proline at position 64 of E. coli Hfq that is strongly conserved across all clades. Additionally, an arginine at the beginning of the CTD (position 66 in E. coli) that packs against the lateral edge of the Hfq hexamer (Beich-Frandsen et al., 2011a; Sauter et al., 2003; Dimastrogiovanni et al., 2014) is strongly conserved. Although the middle linker region of the CTD lacks conserved motifs (Figure 1—figure supplement 1), the C-terminus is rich in acidic residues, corresponding to the sequence DSEETE in E. coli. Noting that most Hfq clusters containing a basic patch on the rim also end in acidic residues, we hypothesized that the CTD tip binds the rim. Because the basic patch is essential for sRNA binding and annealing, direct interaction between the CTD tip and the Hfq core could explain the previously observed auto-inhibition of the CTD (Santiago-Frangos et al., 2016).
To determine whether the acidic tip of the E. coli Hfq CTD could interact with basic residues in the core, we used Rosetta FloppyTail (Kleiger et al., 2009), a de novo modeling approach for disordered regions of proteins. We updated the original FloppyTail algorithm to model multiple disordered regions simultaneously and to ensure adequate sampling of backbone degrees of freedom (see Materials and methods and Figure 1—figure supplement 2). Then, we generated and analyzed ~30,000 models of the full-length E. coli Hfq hexamer. In the lowest energy (1%) subset of models, the acidic CTD residues (D97, E99, E100, and E102) frequently interact with basic residues on the rim (R16, R17, R19 and K47) and in the NTD (K3) (Figure 1B, top). By contrast, K31 on the distal face is not predicted to be contacted by the CTD, although K31 is highly accessible. This bias accords with prior observations that the CTD does not displace RNA from Hfq’s distal face (Santiago-Frangos et al., 2016). As anticipated for a disordered domain, no single conformation dominated the ensemble of models (Figure 1—figure supplement 3). Rather, the acidic CTD tip was found to bind to various combinations of residues in the basic patch (Figure 1C, inset).
To confirm we were not simply observing the non-specific collapse of the disordered CTD onto the core, or enriching interactions between highly solvent-accessible polar residues, we repeated our simulations using a mutant Hfq in which the acidic CTD residues were replaced with polar or basic side chains (D97R-E99N-E100K-E102N). These mutations drastically decreased the frequency of predicted interactions between the basic core residues and CTD residues at positions 97, 99, 100 and 102 in our simulations (Figure 1B, bottom), without increasing predicted interactions between this mutant CTD and solvent-accessible acidic residues on the Hfq core (D9, E18, E37 and D40).
To determine whether the CTD interacts with the rim as predicted by our models, we used fluorescence anisotropy to measure the affinity of core Hfq (Hfq65) for a fluorescently-labeled CTD peptide, CTD-FITC (Figure 2A and Figure 2—figure supplement 1). CTD-FITC lacks residues 65–72 to avoid contributions to binding from this region, which packs against the Sm domain as one strand of the β-sheet (Arluison et al., 2004). Hfq65 bound to CTD-FITC with a Kd of 2.9 µM Hfq monomer in low salt buffer (cyan in Figure 2B) and 22 µM in a higher salt buffer (Figure 2—figure supplement 2). These interaction strengths are meaningful even at higher ionic strength, because the effective concentration of each individual acidic CTD tip is roughly 350 μM in the full-length protein (see Materials and methods).
Binding of the CTD-FITC peptide to Hfq65 was weakened by mutations in the basic rim residues R16A, R19D and K47A (Figure 2B), which frequently interact with the CTD in our computational models (Figure 1B). Although we were not able to test the predicted interactions between NTD K3 and the CTD (see Materials and methods), K3 is also known to form electrostatic interactions with the RNA backbone (Dimastrogiovanni et al., 2014). In contrast, mutation of a surface-accessible polar residue (Q35A) close to the binding interface (Figure 1C, inset), slightly enhanced CTD binding (Figure 2B). Intriguingly, A35 is common in Hfq from γ-proteobacteria. Finally, a CTD peptide containing the mutated C-terminal tip (RSNKTN) was not able to bind Hfq65, confirming that the acidic residues on the CTD peptide are necessary for this interaction (grey in Figure 2B).
To determine how much core residues that bind the CTD contribute to Hfq’s RNA annealing activity, we compared the effect of rim mutations on the rate of base pairing between an RNA molecular beacon and the 16 nt Target RNA by stopped-flow fluorescence spectroscopy (Figure 2C) (Hopkins et al., 2011). In the absence of competition from the CTD, the rate of annealing in this assay depends only on interactions between the two RNAs and the Hfq core. As previously observed (Santiago-Frangos et al., 2016), Hfq65 is highly active in single-turnover annealing assays (Figure 2—figure supplement 3A). The observed annealing rate was most diminished by the loss of basic residues, especially the conserved R16A, and relatively unaffected by the mutation Q35A (Figure 2D). Similar results were obtained with Target-A18, which anchors to the distal face (Figure 2—figure supplement 3B). The average relative annealing rates of Hfq65 variants correlated well with the importance of each residue for CTD binding in vitro (Figure 2E), suggesting that the CTD peptide and the RNA interact with the same residues on the rim of Hfq.
The predictive value of our computational approach was validated by a direct correlation between the experimentally measured contribution (∆∆G°) of each core residue for CTD binding with the predicted Expected Energetic Contribution (EEC) of that core residue to interactions with the acidic CTD in silico (solid symbols and solid line, Figure 2E). EEC is defined as the average energy of a tail–core interaction multiplied by the average number of tail–core interactions per model. The absolute binding and simulated interaction energies cannot be directly compared because the peptide binding assay is performed in trans rather than in cis, and the Rosetta Energy does not account for entropic contributions to binding. Nevertheless, amino acids that most strongly impacted the free energy of CTD binding when mutated, also had larger contributions to CTD binding in silico (solid symbols and solid line, Figure 2E; linear regression p-value=0.078), and had stronger effects on Hfq65 RNA annealing activity in vitro (open symbols and dashed line, Figure 2E; linear regression p-value=0.020).
If the CTD peptide and RNA interact with the same basic Hfq residues, direct competition between the two could explain how the CTD triggers the release of annealed RNAs from Hfq (Santiago-Frangos et al., 2016) and why it increases the stringency of RNA or DNA binding. To examine whether nucleic acids are in competition against the CTD for binding to the Hfq core, we compared the ability of different nucleic acids to displace CTD-FITC from a preformed Hfq65·CTD-FITC complex (Figure 3A). The strength of competition was expressed as the concentration of nucleic acid needed to displace 50% of CTD-FITC from Hfq65, IC50.
Natural sRNAs strongly competed against the CTD-FITC peptide, in keeping with their strong (~10 nM) affinity for Hfq (Figure 3B). Short RNA and DNA oligomers that bind Hfq weakly were poorer competitors than natural sRNAs, as expected. In general, competition against the CTD peptide correlated with nucleic acid length, suggesting little sequence specificity, or that longer nucleic acids may occupy more than one basic patch (Figure 3C). However, minRCRB RNA, DNA1 and dsDNA2 deviated from this linear trend (dashed circles; Figure 3C). minRCRB, a stronger than expected competitor, consists of a stem-loop with a 5’-overhang, and has been shown to specifically bind to the rim (Santiago-Frangos et al., 2016; Dimastrogiovanni et al., 2014). Similarly, DNA1 possesses a stable minRCRB-like motif at its 5-end and was also a stronger competitor than expected based on its length. By contrast, the completely double-stranded dsDNA2 was a weaker competitor than expected, consistent with the CTD’s ability to displace annealed dsRNAs from the rim of Hfq (Santiago-Frangos et al., 2016).
The results above indicate that an interaction between the acidic CTD tip and the basic rim inhibits RNA binding to the basic patch and stimulates release of dsRNA. If this model is correct, shortening the CTD should increase the local concentration of the acidic CTD tip and exacerbate autoinhibition. Alternatively, if the CTD acts as a polymer brush, a shorter CTD should exhibit less autoinhibition because it will exclude less volume around the Hfq core. To test these predictions, we generated the mutant Hfq-sCTD, which lacks residues 86–96 (inclusive). Truncation of this non-conserved ‘linker’ region is predicted to increase the local concentration of the acidic tip around the Hfq core roughly three-fold, from roughly 350 μM per CTD in Hfq102, to ~1220 μM per CTD in Hfq-sCTD (Equation 4 in Materials and methods).
As previously shown (Santiago-Frangos et al., 2016), Hfq65, which lacks the CTD entirely, anneals the 16 nt Target and Target-U6 RNA about five times faster than full-length Hfq102 (100–60-fold vs. 20–10-fold; cyan and black in Figure 4A). In our model, this is because the CTD sweeps RNAs from the rim and proximal face of Hfq. By contrast, both proteins accelerate Target-A18 annealing roughly 100-fold compared to no Hfq, because this RNA remains anchored to the distal face of Hfq and resists CTD displacement (Santiago-Frangos et al., 2016). Importantly, the shortened CTD linker (Hfq-sCTD) decreased annealing rates relative to Hfq102 for all RNA targets (Figure 4A and Figure 4—figure supplement 1), suggesting that access to the basic patch was more restricted.
Next, we used steady-state fluorescence anisotropy to examine how a shorter CTD affects the release of annealed dsRNA from the Hfq core (Figure 4B). Binding of Hfq102, Hfq65 or Hfq-sCTD to FAM-labeled D16 RNA increased the anisotropy of FAM fluorescence, as expected (Figure 4D). The smaller anisotropy of the Hfq65·D16-FAM complex is due to its smaller hydrodynamic drag, since all Hfqs have similar affinities for D16-FAM (Table 1). When complementary R16 RNA was added to the Hfq102·D16-FAM complex, the anisotropy decreased since most of the annealed dsRNA dissociated from Hfq102 (Santiago-Frangos et al., 2016) (black, Figure 4B). Whereas, when complementary RNA was added to the Hfq65·D16-FAM complex, the anisotropy increased (Santiago-Frangos et al., 2016), because a large proportion of the dsRNA remained bound to Hfq65 (cyan, Figure 4C). When the same experiment was done for Hfq-sCTD, the anisotropy decreased even further than for Hfq102, suggesting that more dsRNA was released when the CTD is shortened. This smaller anisotropy cannot be explained by the slightly smaller molecular weight of Hfq-sCTD, since the maximum anisotropies of Hfq102 and Hfq-sCTD ternary complexes during equilibrium binding experiments were very similar.
As a control, we also measured the relative affinities of each Hfq for the dsRNA product (D16-FAM·R16) versus ssRNA substrate (D16-FAM), Krel = Kd(P)/Kd(S) (Table 1). The Krel of each protein corresponded to the efficiency of product release in the anisotropy experiment (Figure 4C and Table 3): Hfq102 had a high Krel = 7.5, Hfq65 had a low Krel = 2.3, and Hfq-sCTD had the highest Krel = 14.0. Thus, the number of non-conserved residues between the acidic tip and the core of Hfq dictates the stringency of CTD autoinhibition and the efficiency of dsRNA displacement, presumably by controlling the effective concentration of the acidic tip around the Hfq core.
To determine whether the above results on RNA oligomers apply to natural RNA substrates for Hfq, we examined the annealing of the Class II sRNA ChiX to the mRNA chiP via electrophoretic mobility shift assays (EMSAs). Because the mRNA targets of class II sRNAs interact with the rim of Hfq (Schu et al., 2015; Małecka et al., 2015), we reasoned that this sRNA-mRNA pair would be sensitive to displacement by the CTD. Low nanomolar amounts of ChiX and chiP anneal very slowly at 10°C, in the absence of Hfq (Figure 4—figure supplement 2). Whereas, Hfq102, Hfq65 and Hfq-sCTD all form a ternary complex with ChiX and chiP within 20 s (Figure 4D), reaching equilibrium in a few minutes (Figure 4—figure supplement 2). These results demonstrate that the CTD is not necessarily required for annealing longer natural RNAs. In addition, Hfq65 formed the most stable ChiX·Hfq·chiP ternary complex, with almost no ChiX-Hfq65 binary complex remaining after 20 s (Figure 4E). Less ternary complex was formed by Hfq102, and the least by Hfq-sCTD. These results are consistent with the idea that the CTD limits access of chiP mRNA to the rim of Hfq.
Hfq binds dsDNA and has been reported to associate with the bacterial chromosome (Kajitani et al., 1994; Takada et al., 1997; Azam and Ishihama, 1999; Updegrove et al., 2010; Jiang et al., 2015). Since the CTD peptide competes with DNA for binding to the Hfq core (Figure 3B) and inhibits binding of dsRNA and dsDNA more strongly than ssRNA (Table 1 and Figure 3), we asked whether the CTD modulates binding of Hfq to DNA. A change in DNA binding could alter the distribution of Hfq within the cell.
We quantified Hfq102 binding to linearized pUC19 DNA by measuring the change in DNA electrophoretic mobility in 1.5% agarose (Figure 5A,B). The mobility of the DNA-Hfq complexes decreased with added Hfq102, indicating increasing numbers of Hfq hexamers bound per DNA. The complexes exhibited uniform mobility at each protein concentration, however, except for a smear below the main band that may arise from dissociation of Hfq during electrophoresis (Figure 5C, top). This pattern is consistent with an equal (non-cooperative) distribution of Hfq102 between DNA molecules, although Hfq was previously suggested to bind DNA cooperatively (Cech et al., 2016). Cooperative binding to neighboring sites on the DNA would result in distinct bound and unbound populations of pUC19 (Tapias et al., 2000; Kozlov et al., 2010), which we do not observe here. The rim mutation R16A lowered the maximum gel retardation and increased the intensity of the ‘smear’, consistent with a reduced affinity of this mutant for dsDNA (Updegrove et al., 2010).
In contrast to the results with the full-length protein, we observed two behaviors when Hfq65 core interacted with pUC19 DNA. At low protein concentrations, the DNA was sparsely bound by Hfq65, resulting in a small mobility shift (Figure 5B). At higher protein concentrations, the DNA formed aggregates that were too large to enter the gel, resulting in a loss of signal (Figure 5B and Figure 5D, top). Aggregation of the DNA was confirmed by solubilization with Proteinase K or by pelleting assays (Figure 5—figure supplement 1). The rim mutation Hfq65-R16A rescued the formation of insoluble aggregates, confirming that dsDNA interacts with the basic rim and not the CTD itself, as previously proposed (Jiang et al., 2015). Moreover, the Hfq65-R16A complexes migrated more slowly than those formed by similar concentrations of Hfq102 (Figure 5C and Figure 5D, bottom), suggesting that Hfq65 and Hfq65-R16A bind pUC19 at higher densities than Hfq102. Thus, the CTD appears to limit Hfq102 binding to DNA, perhaps by maintaining a regular spacing between Hfq hexamers or by enforcing a dynamic equilibrium between bound and free protein. By contrast, when the Hfq core is exposed by deletion of the CTD, Hfq65 binds and aggregates dsDNA indiscriminately.
To examine which Hfq surfaces bind dsDNA, we challenged complexes of 0.5 μM Hfq102 hexamer and pUC19 DNA with 0–2 μM RNA or DNA oligomers that interact with different sites on Hfq (Figure 5E and Figure 5—figure supplement 2). A18 RNA that binds the distal face of Hfq did not compete for DNA binding (Figure 5E, top). However, when the 16 nt Target RNA that weakly interacts with the rim is appended to A18 (Target-A18), the oligomer strongly competed against DNA for binding to Hfq (Figure 5E). Target-U6 was also a good competitor, whereas the 16 nt Target sequence alone, which binds Hfq102 weakly at the rim (1 µM), was a poor competitor. DNA1 and minRCRB, which bind to the rim of Hfq (Santiago-Frangos et al., 2016; Dimastrogiovanni et al., 2014) (Figure 3C), but have low affinities for Hfq102, were weak competitors for DNA (Figure 5F). Finally, the sRNAs ChiX and RyhB were the strongest competitors, with competition saturating at a 1:1 ratio of sRNA to Hfq102 hexamer (Figure 5F).
Overall these data indicate that the CTD does not directly bind DNA as previously suggested (Updegrove et al., 2010; Jiang et al., 2015), but rather modulates the ability of the Hfq core to bind DNA, so that the extent of binding is limited and Hfq-DNA complexes remain soluble. Our data do not conflict with a low-resolution SANS model that suggests Hfq binds perpendicularly to the DNA duplex with a slight tilt (Jiang et al., 2015), but suggest that this occurs when the basic rim of Hfq interacts with the phosphate backbone of the DNA duplex. We note that the potent competition from sRNAs, which are more numerous than Hfq in the cell (Wagner, 2013), calls into question the hypothesis that E. coli Hfq regulates cellular processes via DNA binding (Sobrero and Valverde, 2012; Cech et al., 2016).
Our results on E. coli Hfq show that the strength and frequency of CTD–core interactions depend on the number of basic residues in the core, the acidic residues in the CTD, and the linker length. Thus, our proposed mechanism for CTD-core interactions can be used to predict how the degree of CTD autoinhibition may vary among bacterial Hfq’s. We applied our de novo modeling procedure to estimate the CTD–core interactions in four other bacterial Hfqs (Figure 6A) for which the genetic function and in vitro annealing activity have been previously characterized (Zheng et al., 2016; Bohn et al., 2007; Liu et al., 2010; Rochat et al., 2015; Christiansen et al., 2004; Oglesby-Sherrouse and Vasil, 2010). We examined low energy models of each Hfq hexamer, and compared how frequently acidic CTD residues interact with basic rim and NTD residues (‘on-target’) versus other core residues (‘off-target’) (see Materials and methods) (Figure 6B,C). This comparison was quantitatively expressed as the difference in the EEC of on-target and off-target interactions (∆EEC).
For E. coli Hfq, an active chaperone with a basic rim patch and long CTD, the CTD tip tended to interact with basic residues on the rim and NTD more often and more strongly than with other residues, resulting in ΔEEC = −1.11 ± 0.20 REU. This was also true for Listeria monocytogenes Hfq (ΔEEC = −1.08 ± 0.22 REU). In contrast, Bacillus subtilis (ΔEEC = 0.51 ± 0.14 REU) and Staphylococcus aureus (ΔEEC = 0.19 ± 0.05 REU) Hfq, which are inactive in our in vitro annealing assay (Zheng et al., 2016), did not exhibit specific CTD-core interactions. Finally, in models of full-length Pseudomonas aeruginosa Hfq, the CTD adopts an extended β conformation that wraps over the rim of the hexamer and places the C-terminal acidic residues near the weakly basic NTDs (ΔEEC = −0.30 ± 0.04 REU) (Figure 6C). In the absence of the NTD, however, the CTDs dock with R16 and K17 on the rim (ΔEEC = −0.50 ± 0.18 REU). Thus, Hfqs that do not anneal RNA in vitro tend to possess shorter, less acidic CTDs that form weaker and less frequent interactions with the basic rim and NTD in silico (Figure 6D). There is a similar trend between ∆EEC and the importance of Hfq for sRNA regulation in each bacterium (Zhang et al., 2013; Bohn et al., 2007; Liu et al., 2010; Rochat et al., 2015; Oglesby-Sherrouse and Vasil, 2010; Tsui et al., 1994; Nielsen et al., 2010; Rochat et al., 2012).
In the above examples, both the CTD and the core co-vary between different species. We next asked whether the CTD conferred specificity or strength to CTD-core interactions. We modeled an Hfq chimera consisting of the highly basic E. coli Sm core, fused to the shorter and slightly less acidic B. subtilis CTD. In our models, the B. subtilis CTD contacted K3 in the NTD and R17 on the rim more frequently than E. coli CTD (Figure 6—figure supplement 1A), but contacted R16 and R19, which are functionally very important (Figure 1 and Figure 2), less frequently than E. coli CTD (Figure 6—figure supplement 1A). This was corroborated with fluorescence anisotropy results showing that E. coli Hfq65 binds a BsCTD-FITC peptide about three times more weakly than its own CTD (8.7 µM vs. 2.9 µM; Figure 6—figure supplement 1B). Although we have shown that a foreign CTD can bind the core of E. coli Hfq, the ‘specificity’ of this interaction may have been lost.
We previously found that the flexible CTD of E. coli Hfq sweeps RNAs from the proximal and rim surfaces of the Hfq ring by an unknown mechanism (Santiago-Frangos et al., 2016). Because the mechanism was not known, it was not possible to predict whether other bacterial Hfq CTDs, which are highly variable in sequence composition and length, would perform similar functions. Here, we have used computational models and experiments to show that the acidic tip of the CTD directly displaces RNA from basic patches on the rim of Hfq. The CTD’s mimicry of nucleic acids is supported by direct competition between nucleic acids and the CTD for binding the Hfq core, and stronger autoinhibition when the linker connecting the acidic tip to the core is shortened (Hfq-sCTD). The good agreement between the modeled CTD–core contacts and the contributions of individual residues to the measured CTD binding energies and to RNA annealing validates our modeling approach, and further suggests that nucleic acids and the acidic tip of the CTD interact with the same residues in the Hfq core. As expected for a nucleic acid mimic, CTD·core interactions are dominated by electrostatics and exhibit a salt dependence (Figure 2 and Figure 2—figure supplement 2) similar to that seen for the autoregulatory CTD of HTLV-1 nucleocapsid (Qualley et al., 2010).
Our results show that competition between the CTD and RNA improves the efficiency of E. coli Hfq’s chaperone activity while increasing the stringency of substrate selection. In our model, sRNA and mRNA substrates are recruited through specific interactions with the proximal or distal face of the Sm ring. When complementary RNA segments engage one or more basic patches on the rim of Hfq, these interactions favor nucleation and zippering of the double helix (Panja et al., 2013; Panja et al., 2015). Transient interactions between the CTD and the rim leads to the displacement of the dsRNA product, preventing strand dissociation and recycling Hfq. In support of the model, we observe that Hfq102 binds dsRNA less strongly than single-stranded RNA, whereas Hfq65 binds them more similarly (Table 1).
We propose that the CTD makes Hfq a more selective RNA binding protein by inhibiting access to its rim. Single-stranded nucleic acids compete with the CTD approximately in proportion to length, suggesting that nucleic acid binding to the Hfq rim has low sequence-specificity. Consequently, short RNAs that only bind the rim, such as the 16 nt Target, weakly compete with the CTD and are poor substrates for annealing. By contrast, sRNAs or mRNAs that specifically bind the proximal or distal face of Hfq strongly compete against the CTD, and gain access to the basic patch on the rim. We previously showed that the CTD increases competition among E. coli sRNAs, resulting in different levels of sRNA accumulation in the cell (Santiago-Frangos et al., 2016). It remains to be shown whether the CTD also increases the stringency of target site selection. The six CTDs, which are disordered and mobile (Beich-Frandsen et al., 2011b), exclude a large volume around the core of E. coli Hfq (Figure 1C). This excluded volume is expected to limit the number of sRNAs that may bind E. coli Hfq at any one time, perhaps further increasing the stringency of RNA selection.
The hyper-variability of the Hfq CTD among different bacteria points to a continuous optimization of autoinhibition and binding selectivity, possibly in response to the acquisition or evolution of novel sRNA-mRNA regulatory pairs (Peer and Margalit, 2014). A balance of interactions at the rim of Hfq is needed, since a CTD that inhibits RNA binding too strongly may adversely affect interactions with genuine RNA substrates (Figure 4). Conversely, our DNA binding results show that an exposed basic patch can bind DNA (and RNA) indiscriminately, hinting that a basic patch necessitates co-evolution with a ‘protective’ CTD. Intriguingly, most Hfqs contain acidic sequences at the end of the CTD (Figure 1—figure supplement 1), despite a general bias toward basic residues at protein C-termini (Berezovsky et al., 1999) and within intrinsically disordered domains (Williams et al., 2001; Lise and Jones, 2005). Additionally, the CTDs of E. coli Hfq are long enough to contact the rims of neighboring monomers (Figure 6—figure supplements 2–3), which may explain the contribution of the CTD to hexamer stability (Vincent et al., 2012), and inter-hexamer contacts (Figure 2—figure supplement 1).
Computational modeling provided atomic-scale insight to the accessible conformations of the disordered N- and C- termini of E. coli Hfq (Figure 1C). Our experimentally validated EEC metric (Figure 2E) defines residue–residue interactions more accurately with respect to experimental data than commonly used distance cutoffs (Cα–Cα and Cβ–Cβ) (Kleiger et al., 2009; Fischer et al., 2006). This de novo modeling strategy was able to identify frequent and specific CTD–rim interactions in Lm and Pa Hfq, which act in sRNA regulation and annealing (Panja et al., 2013; Zheng et al., 2016; Bohn et al., 2007; Liu et al., 2010; Rochat et al., 2015; Rochat et al., 2012), but not for Bs and Sa Hfq (Figure 6D), in agreement with in vitro experiments. This suggests that the FloppyTail algorithm could be generally useful for predicting the interactions of disordered regions with ordered domains.
Many RNA and DNA binding proteins contain disordered or flexible domains that have been implicated in cooperativity, autoinhibition and liquid phase separation (Trudeau et al., 2013; Varadi et al., 2015; Järvelin et al., 2016). Hfq is an example of an emerging paradigm of autoregulation of nucleic acid binding by nucleic acid mimic peptides. Other examples in which a disordered CTD autoinhibits RNA or DNA binding include HTLV-1 NC (Qualley et al., 2010), E. coli gyrase (Tretter and Berger, 2012), E. coli ssDNA binding protein (Kozlov et al., 2010) and mammalian high-mobility group B1 (Watson et al., 2007). Unlike HTLV-1 NC, which also remodels RNA, the Hfq CTD gives rise to dynamic cycling of bound RNAs needed to chaperone sRNA-mRNA interactions. Our modeling procedure could be utilized to screen disordered domains found in kinases, such as myosin light chain kinases and protein kinase C (Kobe and Kemp, 1999), and nucleic acid binding proteins from all kingdoms of life (Trudeau et al., 2013; Adams, 2003; Ward et al., 2004; Wang et al., 2016).
All Hfq gene sequences were taken from Uniprot (UniProt Consortium, 2015). 5359 sequences were aligned using the G-INS-1 algorithm on MAFFT webservers (Yamada et al., 2016). This alignment was reduced using CD-HIT (Li and Godzik, 2006) and Max-Align (Gouveia-Oliveira et al., 2007). An unrooted, neighbor-joining tree of the remaining 985 non-redundant, representative, sequences was made on MAFFT webservers (Yamada et al., 2016). Sequence logos of re-aligned sequences from chosen clusters were generated using WebLogo (Crooks et al., 2004).
The crystal structures of E. coli (1HK9) (Sauter et al., 2003), P. aeruginosa (1U1S) (Nikulin et al., 2005), L. monocytogenes (4NL2) (Kovach et al., 2014), B. subtilis (3HSB) (Someya et al., 2012), and S. aureus (1KQ1) (Schumacher et al., 2002) Hfqs were used as starting points for the computational modeling. All crystal structures contained the hexameric form of Hfq, except for 4NL2, for which we generated the biologically relevant hexamer using the reported symmetry operations. Missing residues were appended or prepended to the crystal structures in the following manner. First, on a single subunit, absent N-terminal residues were prepended and all N-terminal residues predicted to be disordered (Buchan et al., 2013; Jones and Cozzetto, 2015) were initialized in an extended conformation, with backbone dihedral angles set to: and. Since the Hfq hexamer is C6 symmetric, the modified subunit could be symmetrized to all other subunits. The same process was repeated to append C-terminal residues, except the base of the tail (residues 64–69 in 1HK9, 1U1S, and 1KQ1, 66–71 in 4NL2, and 62–67 in 3HSB) was ‘kinked’ to point proximally as in the 1HK9 structure. For the RSNKTN tail mutant, side chains were mutated using the PyMOL ‘mutate’ function. The structures with extended termini were ‘relaxed’ with constraints, using Rosetta (Conway et al., 2014), to eliminate energetically unfavorable atomic clashes, before modeling.
A modified version of the FloppyTail algorithm (Kleiger et al., 2009) was used to model the disordered termini (see Appendix 1 for step-by-step protocol). The source code is freely available to academic users through the RosettaCommons: www.rosettacommons.org. The FloppyTail algorithm generates hypothetical, low-energy conformations of disordered regions through two stages of modeling: (i) low-resolution modeling, where side chains are represented as single pseudo-atom centroids, with aggressive sampling of backbone conformational space and gradient-based minimization, and (ii) all-atom modeling, where all side-chain atoms are restored, with fine sampling of backbone conformational space, side-chain optimization, and minimization. We adapted the original algorithm to permit simultaneous modeling of multiple disordered termini and to more extensively sample conformation space. In our simulations, any Hfq region predicted to be disordered was allowed to move and approximately 500 backbone moves (changes in ϕ/ψ angles) were attempted per disordered residue (Kleiger et al., 2009) attempted ~25 backbone moves per disordered residue). Non-disordered residues had no backbone motion, but were permitted to sample side-chain conformations. In total, simulations were used to generate ~30,000 hypothetical structures for each species’ Hfq.
PyRosetta (Chaudhury et al., 2010) was used to evaluate the energies of pairwise residue–residue interactions. Pairwise energies were computed with the talaris2014 energy function (O’Meara et al., 2015), comprised of terms capturing van der Waals, solvation, hydrogen bonding and electrostatic interactions. If a pairwise energy was unfavorable (0 or greater), we did not consider it for further analysis.
In our analysis, we considered a set of core residues and a set of tail residues . We calculated the average number of tail interactions for a single core residue, , by counting the number of pairwise interactions, with a lower energy than the threshold, between x and every residue in the and dividing by the total number of CTDs:
To compute a standard deviation for the average number of interactions with core residue , we used bootstrap resampling as described in Chaudhury et al. (2011). We resampled, with replacement, our set of models for times and re-computed (same as Equation 1, but using the resampled set of models), acquiring a standard deviation: .
In addition, we calculated the average energy for each interaction above the threshold (between one core residue, , and a set of tail residues, ):
The standard deviation for the interaction energy was computed without bootstrap resampling; the energy has a distribution within a set of models, whereas the presence of an interaction is binary and only varies when the models are resampled. We compute the standard deviation as:
Finally, EEC was computed over a set of basic core residues, by multiplying the average tail–core interaction energy by the average number of interactions per model and summing:
Standard deviation for EEC was computed by assuming that the standard deviations of the above values are independent: .
|Species||Core ()||Basic core (,for EEC)||Acidic tail ()|
|E. coli||1–65||3, 16, 17, 19, 47||97, 99, 100, 102|
|P. aeruginosa||1–65||3, 5, 16, 17, 19, 47||94, 97|
|L. monocytogenes||1–65||2, 16, 17, 19, 35||100, 102|
|B. subtilis||1–65||2, 16, 17, 37||100, 102|
|S. aureus||1–65||10, 16, 41||65, 67, 99, 101, 102|
The intrinsically disordered CTD linker was represented by a worm-like chain model (Kratky and Porod, 1949) with a statistical chain segment of 35 Å or 10 residues, which is twice the persistence length of 15–20 Å for a random coil polypeptide chain (Krigbaum and Hsu, 1975; Damaschun et al., 1991; Damaschun et al., 1993; Kellermayer et al., 1997). The disordered linker region (Schumacher et al., 2002; Beich-Frandsen et al., 2011a; Dimastrogiovanni et al., 2014), was assumed to begin at residue 71 because the first few residues of the CTD tend to pack along the core (Arluison et al., 2004). The last five residues of the CTD constitute the acidic tip. The local concentration of the acidic tip on a single CTD was calculated for full-length Hfq (Hfq102) and for Hfq-sCTD:
In which C is the concentration of the acidic tip, Vtail is the total volume the acidic tip can access around the center of mass of a single Hfq hexamer, Vcore is the inaccessible volume of the Hfq core, and NA is Avogadro’s number. Vtail is estimated as a sphere with radius of 105 Å for Hfq102, and a radius of 70 Å for Hfq-sCTD. Vcore is estimated as a cylinder with radius 31.5 Å and height 25 Å (Sauter et al., 2003).
Untagged E. coli Hfq102, Hfq-sCTD, Hfq65, Hfq65-Q35A, Hfq65-K47A, Hfq65-R19D and Hfq65-R16A were over-expressed in E. coli BL21(DE3)Δhfq::cat-sacB cells grown in 1 L LB-Miller media (10 g/L Tryptone, 10 g/L NaCl, 5 g/L yeast extract) supplemented with 100 µg/mL ampicillin. Plasmids for over-expression of mutant Hfq proteins were created by site-directed mutagenesis of pET21b-Hfq (Zhang et al., 2002). The purification method has been previously described (Santiago-Frangos et al., 2016). In brief, resuspended cell lysates of Hfq102 and Hfq-sCTD variants were clarified by heat denaturation and untagged Hfq was purified via Ni2+-affinity. Lysates of Hfq65 variants were further clarified by ammonium sulfate precipitation after heat treatment, and the protein purified by hydrophobic interaction chromatography. Finally, all Hfq variants were purified by cation-exchange chromatography to remove nucleic acids (Figure 2—figure supplement 1).
The sequences of RNA and DNA substrates are listed in Table 2. Synthetic Target RNAs, molecular beacon (Panja and Woodson, 2012a), A18, D16-FAM and R16 have been previously described (Hopkins et al., 2009). minRCRB RNA (IDT) was reduced with TCEP (tris(2-carboxyethyl)phosphine) and purified by denaturing PAGE before labeling with Cy3-maleimide (GE Healthcare), as previously described (Santiago-Frangos et al., 2016). The extent of labeling was estimated from the absorbance at 260 and 552 nm. The sRNAs ChiX, RprA, DsrA and RyhB and mRNA chiP were transcribed in vitro as previously described (Lease and Woodson, 2004). pUC19 plasmid (NEB) for Hfq binding assays was isolated from transformed DH5α cells (NEB) using Plasmid Maxi kit (QIAGEN) and digested with EcoRI (NEB) and HindIII (NEB) and purified by phenol-chloroform extraction followed by ethanol precipitation.
To measure binding of CTD-FITC, CTDpos-FITC, or BsCTD-FITC peptides (Table 2) to Hfq65 or Hfq65 mutants, the fluorescence polarization of FITC-labeled peptide was measured 3 min after the addition of 0–33.3 µM Hfq65. Anisotropy measurements were normalized to the average anisotropy in the absence of Hfq. Samples were prepared in a 100 µL cuvette containing 100 µL 50 mM Tris·HCl pH 7.5, 45 nM CTD-FITC or CTDpos-FITC, at 30°C. Fluorescence polarization with grating correction factor was measured using a Horiba Fluorolog-3 (L-format) with single excitation and emission monochromators at 495 nm and 515 nm respectively (5 nm slit widths). Titrations were performed in duplicate and the curves were fit to a single-site binding isotherm:
in which Ka is the association constant.
Although residue K3 in the NTD was observed to bind the acidic CTD in silico, the contribution of K3 to in vitro binding could not be determined because neither Hfq65-K3S nor Hfq65-K3Q formed stable proteins.
To measure the displacement of CTD-FITC from Hfq65·CTD-FITC complexes by nucleic acids, samples were prepared in a 100 µL cuvette containing 1 µM CTD-FITC and 1.67 µM Hfq65 hexamer so that roughly 50% of CTD-FITC peptides were bound at the start of the experiment. The polarization of CTD-FITC was measured 3 min after the addition of increasing amounts of RNA or DNA as above. Competition curves were fit to:
where minY and maxY are the minimum and maximum anisotropy values measured, and IC50 is the concentration of nucleic acid which displaced 50% of the bound CTD-FITC from Hfq65.
Binding constants for D16-FAM or minRCRB-Cy3 (5 nM) were measured in TNK buffer (10 mM Tris·HCl, pH 7.5, 50 mM NaCl, 50 mM KCl) at 30°C by FAM fluorescence anisotropy as described before (Hopkins et al., 2009). To measure the affinity of Hfq for D16-FAM·R16 dsRNA complex, 50 nM of both RNAs were mixed and allowed to equilibrate at 30°C for 10 min before titration with Hfq. Annealing kinetics of molecular beacon (50 nM) to either Target or Target-A18 RNA (100 nM) by 0–200 nM Hfq hexamer, in 1X TNK (10 mM Tris·HCl pH 7.5, 50 mM NaCl, 50 mM KCl) buffer at 30°C was measured by stopped-flow fluorescence spectroscopy as described previously (Soper et al., 2011; Panja and Woodson, 2012b). Annealing progress curves were fit to single or double-exponential rate equations.
To measure RNA binding and release from unlabeled Hfq102, Hfq-sCTD and Hfq65 by anisotropy, the polarization of D16-FAM was recorded every 20 s for ≥ 3 min after each addition, as previously described (Santiago-Frangos et al., 2016). Samples were prepared in a 500 µL cuvette containing 50 nM D16-FAM in TNK buffer at 30°C, with additions of 50 nM Hfq102, 50 nM Hfq-sCTD, or 50 nM Hfq65 (binding phase), followed by 50 nM R16 RNA (annealing and release phase), and finally 400 nM of the ssDNA competitor, DNA2 (stimulated release phase). RNA binding and release experiments were done in triplicate for each Hfq variant. The molar fractions of released dsRNA product D16-FAM·R16 (χdr), remaining ternary complex D16-FAM·Hfq·R16 (χhdr) and binary complex Hfq·D16-FAM (χhd) at the end of the ‘annealing and release phase’ of the experiment were calculated from
where rAP is the anisotropy measured at the end of the ‘annealing and release’ phase in the above annealing experiments, rMdr is the average anisotropy of D16-FAM·R16 complex during the ‘stimulated release’ phase that is indistinguishable from its anisotropy without Hfq, rMhdr is the maximum anisotropy of Hfq·D16-FAM·R16 complex from equilibrium binding experiments, and rMhd is the maximum anisotropy of Hfq·D16-FAM complex from equilibrium binding experiments. Using the conservation of mass, χdr + χhdr + χhd=1, and the relative Kd values for Hfq binding to D16Fl and D16Fl·R16, χhd=Krel * χhdr, yields an expression for the molar fraction of ternary complex:
Samples (10 µL) containing linear pUC19 (0.145 nmol bp), 0–3.333 µM Hfq hexamer, in 40 mM Tris-HCl pH 7.5, 0.14 mM EDTA, 35 mM NH4Cl, 3.7% (v/v) glycerol, 0.05% (w/v) bromophenol blue were incubated at 25°C for 30 min. 2 µL of each reaction was loaded into a sample well of a 15 × 8 cm agarose gel (1.5% (w/v) Seakem LE agarose (Lonza) in 1X TAE (40 mM Tris, 20 mM acetate, 1 mM EDTA, pH 8.0). Electrophoresis was carried out in the cold room (4°C) at 4 V/cm for 6.5 hr. Hfq was dissociated from the bound complexes by soaking agarose gels in 150 mL TBE (89 mM Tris, 89 mM borate, 2 mM EDTA, pH 8.3) and 1 M NaCl, for 30 min at 25°C, at 85 rpm. The gels were washed twice with 150 mL TBE for 10 min, stained with 1X SYBR Gold (Invitrogen) in 150 mL TBE for 45 min, and washed twice with 150 mL TBE for 10 min before imaging on a Typhoon 9410 (GE Healthcare) via excitation at 488 nm and using a 555 nm bandpass 30 emission filter. The fluorescence intensity was measured on a line from the bottom edge of the well through the middle of the lane to visualize the migration of pUC19. For RNA competition experiments, 10 µL samples were prepared as above with 0–2 µM RNA or DNA competitor and 0 or 0.5 µM Hfq102 hexamer.
The C-terminal domain of Escherichia coli Hfq increases the stability of the hexamerEuropean Journal of Biochemistry 271:1258–1265.https://doi.org/10.1111/j.1432-1033.2004.04026.x
Moraxella catarrhalis expresses an unusual Hfq proteinInfection and Immunity 76:2520–2530.https://doi.org/10.1128/IAI.01652-07
Expression, crystallization and preliminary crystallographic analysis of RNA-binding protein Hfq (YmaH) from Bacillus subtilis in complex with an RNA aptamerActa Crystallographica Section F Structural Biology and Crystallization Communications 66:563–566.https://doi.org/10.1107/S1744309110009942
Structural analysis of full-length Hfq from Escherichia coliActa Crystallographica Section F Structural Biology and Crystallization Communications 67:536–540.https://doi.org/10.1107/S174430911100786X
Structural insights into the dynamics and function of the C-terminus of the E. coli RNA chaperone HfqNucleic Acids Research 39:4900–4915.https://doi.org/10.1093/nar/gkq1346
Amino acid composition of protein termini are biased in different mannersProtein Engineering, Design and Selection 12:23–30.https://doi.org/10.1093/protein/12.1.23
Comparing models of evolution for ordered and disordered proteinsMolecular Biology and Evolution 27:609–621.https://doi.org/10.1093/molbev/msp277
Scalable web services for the PSIPRED Protein Analysis WorkbenchNucleic Acids Research 41:W349–W357.https://doi.org/10.1093/nar/gkt381
The Escherichia Coli Hfq Protein: An Unattended DNA-Transactions RegulatorFrontiers in Molecular Biosciences 3:36.https://doi.org/10.3389/fmolb.2016.00036
The RNA-binding protein Hfq of Listeria monocytogenes: role in stress tolerance and virulenceJournal of Bacteriology 186:3355–3362.https://doi.org/10.1128/JB.186.11.3355-3362.2004
Acid denatured apo-cytochrome c is a random coil: evidence from small-angle X-ray scattering and dynamic light scatteringBiochimica et Biophysica Acta (BBA) - Protein Structure and Molecular Enzymology 1078:289–295.https://doi.org/10.1016/0167-4838(91)90571-G
Hfq: a multifaceted RNA chaperone involved in virulenceFuture Microbiology 11:137–151.https://doi.org/10.2217/fmb.15.128
Assessing methods for identifying pair-wise atomic contacts across binding interfacesJournal of Structural Biology 153:103–112.https://doi.org/10.1016/j.jsb.2005.11.005
New insight into the structure and function of Hfq C-terminusBioscience Reports 35:1–9.https://doi.org/10.1042/BSR20140128
Small RNA regulators and the bacterial response to stressCold Spring Harbor Symposia on Quantitative Biology 71:1–11.https://doi.org/10.1101/sqb.2006.71.016
Effect of salt and RNA structure on annealing and strand displacement by HfqNucleic Acids Research 37:6205–6213.https://doi.org/10.1093/nar/gkp646
Rapid binding and release of Hfq from ternary complexes during RNA annealingNucleic Acids Research 39:5193–5202.https://doi.org/10.1093/nar/gkr062
Effects of Hfq on the conformation and compaction of DNANucleic Acids Research 43:4332–4341.https://doi.org/10.1093/nar/gkv268
Regulation of the Escherichia coli hfq gene encoding the host factor for phage Q betaJournal of Bacteriology 176:531–534.https://doi.org/10.1128/jb.176.2.531-534.1994
Regulation of single-stranded DNA binding by the C termini of Escherichia coli single-stranded DNA-binding (SSB) proteinJournal of Biological Chemistry 285:17246–17252.https://doi.org/10.1074/jbc.M110.118273
Diffuse small-angle scattering of X-rays in colloid systemsJournal of Colloid Science 4:35–70.https://doi.org/10.1016/0095-8522(49)90032-X
Cycling of the Sm-like protein Hfq on the DsrA small regulatory RNAJournal of Molecular Biology 344:1211–1223.https://doi.org/10.1016/j.jmb.2004.10.006
Protein expansion is primarily due to indels in intrinsically disordered regionsMolecular Biology and Evolution 30:2645–2653.https://doi.org/10.1093/molbev/mst157
Sequence patterns associated with disordered regions in proteinsProteins: Structure, Function, and Bioinformatics 58:144–150.https://doi.org/10.1002/prot.20279
Escherichia coli Hfq has distinct interaction surfaces for DsrA, rpoS and poly(A) RNAsNature Structural & Molecular Biology 11:1206–1214.https://doi.org/10.1038/nsmb858
Combined covalent-electrostatic model of hydrogen bonding improves structure prediction with RosettaJournal of Chemical Theory and Computation 11:609–622.https://doi.org/10.1021/ct500864r
C-terminally truncated derivatives of Escherichia coli Hfq are proficient in riboregulationJournal of Molecular Biology 404:173–182.https://doi.org/10.1016/j.jmb.2010.09.038
Hfq proximity and orientation controls RNA annealingNucleic Acids Research 40:8690–8697.https://doi.org/10.1093/nar/gks618
Hexamer to monomer equilibrium of E. coli Hfq in solution and its impact on RNA annealingJournal of Molecular Biology 417:406–412.https://doi.org/10.1016/j.jmb.2012.02.009
Conserved arginines on the rim of Hfq catalyze base pair formation and exchangeNucleic Acids Research 41:7536–7546.https://doi.org/10.1093/nar/gkt521
A conserved small RNA promotes silencing of the outer membrane protein YbfMMolecular Microbiology 72:566–577.https://doi.org/10.1111/j.1365-2958.2009.06688.x
Paradoxical suppression of small RNA activity at high Hfq concentrations due to random-order bindingNucleic Acids Research 43:8502–8515.https://doi.org/10.1093/nar/gkv777
Sm-like proteins in Eubacteria: the crystal structure of the Hfq protein from Escherichia coliNucleic Acids Research 31:4091–4098.https://doi.org/10.1093/nar/gkg480
The Acinetobacter baylyi Hfq gene encodes a large protein with an unusual C terminusJournal of Bacteriology 191:5553–5562.https://doi.org/10.1128/JB.00490-09
Alternative Hfq-sRNA interaction modes dictate alternative mRNA recognitionThe EMBO Journal 34:2557–2573.https://doi.org/10.15252/embj.201591569
The bacterial protein Hfq: much more than a mere RNA-binding factorCritical Reviews in Microbiology 38:276–299.https://doi.org/10.3109/1040841X.2012.664540
Functional effects of variants of the RNA chaperone HfqBiochemical and Biophysical Research Communications 323:1017–1023.https://doi.org/10.1016/j.bbrc.2004.08.190
Predicted structure and phyletic distribution of the RNA-binding protein HfqNucleic Acids Research 30:3662–3671.https://doi.org/10.1093/nar/gkf508
DNA binding properties of the hfq gene product of Escherichia coliBiochemical and Biophysical Research Communications 236:576–579.https://doi.org/10.1006/bbrc.1997.7013
Mechanisms for defining supercoiling set point of DNA gyrase orthologs: I. A nonconserved acidic C-terminal tail modulates Escherichia coli gyrase activityThe Journal of Biological Chemistry 287:18636–18644.https://doi.org/10.1074/jbc.M112.345678
UniProt: a hub for protein informationNucleic Acids Research 43:204.
E. coli DNA associated with isolated Hfq interacts with Hfq's distal surface and C-terminal domainBiochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms 1799:588–596.https://doi.org/10.1016/j.bbagrm.2010.06.007
The C-terminal domain of Escherichia coli Hfq is required for regulationNucleic Acids Research 36:133–143.https://doi.org/10.1093/nar/gkm985
Characterization of Vibrio cholerae Hfq provides novel insights into the role of the Hfq C-terminal regionJournal of Molecular Biology 420:56–69.https://doi.org/10.1016/j.jmb.2012.03.028
Cooperation of Escherichia coli Hfq hexamers in DsrA bindingGenes & Development 25:2106–2117.https://doi.org/10.1101/gad.16746011
Prediction and functional analysis of native disorder in proteins from the three kingdoms of lifeJournal of Molecular Biology 337:635–645.https://doi.org/10.1016/j.jmb.2004.02.002
Mapping intramolecular interactions between domains in HMGB1 using a tail-truncation approachJournal of Molecular Biology 374:1286–1297.https://doi.org/10.1016/j.jmb.2007.09.075
Mutations in interaction surfaces differentially impact E. coli Hfq association with small RNAs and their mRNA targetsJournal of Molecular Biology 425:3678–3697.https://doi.org/10.1016/j.jmb.2013.01.006
Arginine patch predicts the RNA annealing activity of Hfq from gram-negative and gram-positive bacteriaJournal of Molecular Biology 428:2259–2264.https://doi.org/10.1016/j.jmb.2016.03.027
Ben F LuisiReviewing Editor; University of Cambridge, United Kingdom
In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.
Thank you for submitting your work entitled "Acidic C-terminal domains autoregulate the RNA chaperone Hfq" for consideration by eLife. Your article has been favorably evaluated by a Senior Editor and four reviewers, one of whom is a member of our Board of Reviewing Editors. The reviewers have opted to remain anonymous.
Our decision has been reached after consultation between the reviewers. Based on these discussions and the individual reviews below, we regret to inform you that your work will not be considered further for publication in eLife.
All of the reviewers are in agreement the results are interesting and provide mechanistic insights into how the natively unstructured region of the Hfq chaperone could act in an auto-inhibitory manner to drive RNA dissociation once duplexes had formed. There is, however, a consensus that the level of conceptual advance in the manuscript is limited, in light of the excellent earlier publication from your laboratory last year on this system. Moreover, the reviewers raise numerous points concerning the data and its interpretation, requiring additional experimental work. We hope that the detailed reviews, appended below, will be of help to strengthen the experimental results so that the manuscript will become suitable for submission to another journal.
In this manuscript, Santiago-Frangos and coworkers study the function of the acidic amino acid-rich CTD tail of E. coli Hfq using a combination of modeling, phylogenetic analysis, and biochemical approaches. The main conclusion of this manuscript is that it clarifies the mechanism of Hfq with respect to two competing hypotheses. It has been shown that the CTD can displace RNA from the rim and proximal face of Hfq. This could occur either by a "polymer brush" in which the CTD sweeps the RNA from the surface or by the CTD functioning as a "nucleic acid mimic" to bind the positively charged Hfq core. The authors provide strong evidence for the latter mechanism. The work is clearly described and expertly carried out.
Importantly, the authors propose that the consequence of this mechanism is that it may promote specificity by allowing the CTD to compete with non-specific RNA or DNA binding. In addition, the authors propose that the Hfq CTD is co-evolving with the importance of Hfq functioning as a RNA chaperone in diverse species.
While I think that the specifics of Hfq mechanism would be of interest to a small audience, the implication of these findings for understanding the evolution and use of sRNA regulation in bacteria is great. It is likely that many in the eLife readership would be interested in that aspect of the work and it is well-suited for the broad readership of eLife.
My major concern is over the role of electrostatics in the CTD interaction. This interaction appears to be governed primarily by negatively charged amino acids (or the phosphate backbone of RNA) interacting with positively charged amino acids. In part my concern stems from the low salt concentrations used in the CTD binding assays (just 50 mM Tris), and I don't believe salt is considered in the FloppyTail predictions. RNA binding and annealing was carried out at higher (but still lower than physiological levels) salt at 10 mM Tris, 50 mM NaCl, 50 mM KCl. Plasmid binding was then carried out at yet a third salt condition.
I'd be curious if CTD binding and displacement still occurs at the 50 mM NaCl/50 mM KCl condition. At minimum, a short discussion of the role of electrostatics in these types of protein/nucleic acid and protein/protein interactions and influence of solution ions would help clarify this issue.
Woodson and coworkers have studied the role of acidic regions in the bacterial RNA chaperone Hfq. By Rosetta modeling and binding studies the authors propose that the C-terminal acidic tail of the Hfq Sm fold interacts with the Sm core domain and thereby affects the RNA chaperone activity, where the Sm core domain normally helps to stabilize sRNA-mRNA duplex formation. It has been previously shown that the Hfq enhances chaperone activity, however, the molecular mechanisms are unclear, and "polymer brush" vs. "nucleic acid mimicking" models have been proposed. The authors propose that the CTD can directly bind to positively charged residues at the rim of Hfq hexamers, in support of the latter model.
The manuscript is clear and well written, the combination of computational modelling and experimental probing is interesting, but has limitations as molecular details are inferred in a more descriptive fashion while a clear experimental picture of the structural details is not provided.
• The authors have used competition binding experiment with FITC-labeled peptides. These are proxies but do not resemble the reality with respect to the fact that in Hfq the CTD will form an intramolecular interaction, which involves much less entropy loss upon binding compared to the peptide titrated in trans. Thus, the apparent affinities and the "quantitative" correlation between experimental and computational Kd’s has an offset, which should be discussed. Also can the authors exclude any artifacts from the presence of the hydrophobic/aromatic FITC label?
• Preferably, the authors should study the intermolecular interaction with an experimental method. NMR comes to mind as this can reveal the binding site and provide semi-quantitative affinities, for example by comparing Hfq hexamers with and w/o the CTD present and also with peptide titrations in trans (with the caveat mentioned above). These experiments are technically feasible and it is not clear why this has not been considered. The shortened CTD linker constructs could also be well studied in this context and provide an experimental estimation of the local concentration effect by monitoring NMR chemical shifts.
• The model suggests a rather non-specific charged interaction for the binding and biochemical and functional consequences. In the absence of a more precise structural picture for the interactions as suggested above, further support for the non-specific charge-charge interaction and the role of the length in modulating local concentrations should be provided. Artificially designed CTDs could be tested, where the number of negatively charged residues can be controlled and the role of the preceding linker "flexibility" can be explored by replacing the native "linker" with Gly-Ser stretches of varying lengths. These experimental data should be useful to correlate experimental and computational findings and exclude that additional regions in the CTD could play an important role.
In this work, Woodson and colleagues explore the mechanism of the C-terminal domain regulation of Hfq RNA binding. They propose two distinct models and design and carry out experiments that distinguish between them, allowing them to conclude that the CTD displaces RNA from the core by the "nucleic acid mimic" model rather than the "polymer brush" model. The function of the CTD has been a longstanding mystery and source of controversy in the Hfq literature (Updegrove et al., Curr Opin Microbiol 2016; Wagner, RNA Biol 2013). However, the central question that this paper answers is one of mechanism, rather than function. The regulatory function of the CTD had already been established by a previous study published by the Woodson lab (Santiago-Frangos et al., PNAS 2016). This paper expands on those results by providing mechanistic details, but does not offer significantly novel findings regarding the CTD function. Therefore, while this paper contributes to our detailed understanding of Hfq, it is unlikely that it would be of substantial interest to the broad audience of eLife.
In addition to this assessment of significance, I have concerns regarding the analysis of the data, which may help the authors if they resubmit to a different journal. Most of the figures, while including a cartoon of a kinetic scheme, do then show data fits that do not accord with the scheme. For example, Figure 2B data are fit to Hill curves, but the described scheme involves binding of a single Hfq variant to a peptide and should therefore use a standard isotherm (Hill coefficient constrained to 1). There are similar ad hoc fits in Figures 3, 4, and 5. The lack of an overall consistent kinetic/thermodynamic framework for computing thermodynamic equilibria and kinetic relaxation curves lowers my confidence in the data analysis and conclusions, including comparisons of contact frequencies in Rosetta simulations to empirical fits and key conclusions based on the shortened CTD construct.
Santiago-Frangos and colleagues describe evidence for a previously unknown role of the C-terminal extension of the Hfq RNA chaperone protein in auto-inhibition. Using a combination of computational molecular modelling and in vitro binding assays, the authors investigate how acidic residues at the C-terminal tip of Hfq interact with the positive nucleic acid binding residues within the core of the Hfq protein. This interaction is predicted to have a role in preventing non-specific interactions with nucleic acids and in accelerating release of annealed RNA products. The manuscript is well written and the conclusions are logical and well presented.https://doi.org/10.7554/eLife.27049.026
- Sarah A Woodson
- Jeffrey J Gray
- Jeliazko R Jeliazkov
- Andrew Santiago-Frangos
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
The authors thank Steven Lewis (Cyrus Bio), Susan Gottesman (NCI) and Agata Groszewska (Adam Mickiewicz University, Poznan) for helpful discussion, and Nadim Majdalani (NCI) for the gift of E. coli strain NM694 hfq–.
- Ben F Luisi, Reviewing Editor, University of Cambridge, United Kingdom
© 2017, Santiago-Frangos et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.