Recognition of the small regulatory RNA RydC by the bacterial Hfq protein
Abstract
Bacterial small RNAs (sRNAs) are key elements of regulatory networks that modulate gene expression. The sRNA RydC of Salmonella sp. and Escherichia coli is an example of this class of riboregulators. Like many other sRNAs, RydC bears a ‘seed’ region that recognises specific transcripts through base-pairing, and its activities are facilitated by the RNA chaperone Hfq. The crystal structure of RydC in complex with E. coli Hfq at a 3.48 Å resolution illuminates how the protein interacts with and presents the sRNA for target recognition. Consolidating the protein–RNA complex is a host of distributed interactions mediated by the natively unstructured termini of Hfq. Based on the structure and other data, we propose a model for a dynamic effector complex comprising Hfq, small RNA, and the cognate mRNA target.
https://doi.org/10.7554/eLife.05375.001eLife digest
A crucial step in the production of proteins is the translation of messenger RNA molecules. Other RNA molecules called small RNAs are also involved in this process: these small RNAs bind to the messenger RNA molecules to either increase or decrease the production of proteins.
Bacteria and other microorganisms use small RNA molecules to help them respond to stress conditions and to changes in their environment, such as fluctuations in temperature or the availability of nutrients. The ability to rapidly adapt to these changes enables bacteria to withstand harmful conditions and to make efficient use of resources available to them.
Many small RNA molecules use a protein called Hfq to help them interact with their target messenger RNAs. In some cases this protein protects the small RNA molecules when they are not bound to their targets. Hfq also helps the small RNA to bind to the messenger RNA, and then recruits other enzymes that eventually degrade the complex formed by the different RNA molecules.
Previous research has shown that six Hfq subunits combine to form a ring-shaped structure and has also provided some clues about the way in which Hfq can recognise a short stretch of a small RNA molecule, but the precise details of the interaction between them are not fully understood. Now Dimastrogiovanni et al. have used a technique called X-ray crystallography to visualize the interaction between Hfq and a small RNA molecule called RydC.
These experiments reveal that a particular region of RydC adopts a structure known as a pseudoknot and that this structure is critical for the interactions between the RydC molecules and the Hfq ring. Dimastrogiovanni et al. find that one RydC molecule interacts with one Hfq ring, and they identify the contact points between the RydC molecule and different regions of the Hfq ring.
Based on this information, Dimastrogiovanni et al. propose a model for how the RydC:Hfq complex is likely to interact with a messenger RNA molecule. The next step will be to test this model in experiments.
https://doi.org/10.7554/eLife.05375.002Introduction
The expression of genetic information is controlled and synchronised through intricate regulatory networks. In bacteria, the control of gene expression post-transcription is mediated in part by small RNAs (sRNAs), and their contributions enrich the computational complexity and repertoire of regulatory circuits (Beisel and Storz, 2011). Bacterial sRNAs are typically 50 to 300 nucleotides in length (Storz et al., 2011), and each control expression of a distinct set of target mRNAs, which they recognise with high specificity and apparent precision. One of the major facilitators of sRNA activity in bacteria is the protein Hfq, which promotes pairing of an sRNA with its target mRNA in solution (Møller et al., 2002; Vogel and Luisi, 2011; Panja and Woodson, 2012; De Lay et al., 2013). Indeed, the kinetics of target pairing seem an essential aspect of sRNA action in vivo, as many of these sRNAs affect rates of translation or decay, either positively or negatively depending on target and context (Storz et al., 2004; Fröhlich and Vogel, 2009; Desnoyers and Massé, 2012; Papenfort et al., 2013). It can be envisaged how Hfq acts as a catalyst for such recognition in vivo, but the question naturally arises how sRNAs generally achieve precision, accuracy, and speed in producing their effects and avoid undesired off-target consequences. The key to understanding these fundamental processes of sRNA-based regulation is to determine how RNAs are bound and presented by Hfq and other effector proteins.
The previous crystal structures of truncated Hfq variants have provided clues as to how the protein recognises short stretches of single-stranded RNA. Hfq bears an α + β fold that is the signature of the highly conserved family of Sm/Lsm proteins that draws members from all domains of life (Kambach et al., 1999; Wilusz and Wilusz, 2013). Like other proteins of this extensive group, the bacterial Hfq self-assembles into a ring-like architecture. Hfq of Escherichia coli and other eubacteria forms a compact hexameric toroid that presents two structurally non-equivalent concave surfaces for molecular recognition; these faces are referred to as the proximal and distal faces, with the former exposing an N-terminal α-helix (Schumacher et al., 2002; Link et al., 2009). Crystallographic studies have identified interactions of short RNA polymers with either of these two surfaces and have inferred sequence preferences that have been corroborated and refined by mutagenesis and solution binding studies (Schumacher et al., 2002; Link et al., 2009; Sauer and Weichenrieder, 2011; Robinson et al., 2013; Peng et al., 2014). According to these results, the proximal site interacts preferentially with uridine-rich sequences, while the distal site favours the binding of ARN or ARNN motifs (with R being a purine and N any nucleotide) (Schumacher et al., 2002; Link et al., 2009; Sauer and Weichenrieder, 2011). In addition to the distal and proximal faces, the torus-shaped Hfq hexamer bears a convex circumferential rim that has recently been identified as a surface contributing to RNA binding (Sauer et al., 2012; Zhang et al., 2013). Structural studies of the hetero-heptameric Sm assembly from the mammalian splicing machinery have identified RNA binding surfaces that share some similarities with the bacterial counterpart (Leung et al., 2011).
Bacterial sRNAs may recognize their cognate target mRNAs in different ways. In some sRNAs, a target-recognition region is presented at or near the 5′-end (Papenfort et al., 2010). For another subset of sRNAs, several pairing regions are non-continuous and each can independently bind distinct targets (Beisel and Storz, 2011; Shao et al., 2013). In both cases, the sRNA recognition site (referred to as the ‘seed’) and the target can have imperfect complementarity of various lengths with interactions being as short as 6 base-pairs, as seen for example in the SgrS/ptsG mRNA pair from E. coli (Kawamoto et al., 2006). To fully understand the mechanism of sRNA mediated regulation, it is important to address the questions of how the intricate RNA folds are recognised, how the seeds are presented, and how the pairing of sRNAs with mRNAs is facilitated. As the sRNAs, mRNAs, and their complexes can all be remodelled upon binding Hfq, the rules for recognition are likely to be highly dependent on context.
In this study, we describe the structure of the sRNA RydC in complex with the full-length Hfq protein of E. coli. Previous studies demonstrated that RydC is involved in biofilm regulation (Bordeau and Felden, 2014) as well as in the control of membrane stability through the positive regulation of an isoform of the cfa mRNA encoding cyclopropane fatty acid synthase (Fröhlich et al., 2013). RydC is proposed to have a pseudoknot fold that exposes a 5′ seed sequence, and the in vivo stability of RydC requires Hfq (Antal et al., 2005; Fröhlich et al., 2013; Bordeau and Felden, 2014). The crystal structure of the RydC–Hfq complex, together with biochemical and in vivo data, defines key interactions and suggests a hypothetical model for how the sRNA and mRNA targets might be matched through Hfq binding.
Results
RydC interaction with Hfq and pseudoknot structure is essential for its stability in vivo
Despite its low abundance in comparison with other sRNAs under standard growth conditions, RydC has been repeatedly recovered with the Hfq protein in pull-down assays (Zhang et al., 2003; Sittka et al., 2008; Chao et al., 2012). A distinctive feature of RydC is its predicted pseudoknot fold which is highly conserved and whose disruption by point mutations renders the RNA unstable in vivo (Fröhlich et al., 2013). A double mutant of RydC (G37C and G39C within helix 1; hereafter, RydC-S1) was predicted computationally to form a distinct structure with one short stem-loop at the 5′-end and a strong terminator hairpin at the 3′-end (Figure 1A).
To assess the molecular mechanism underlying the intrinsic instability of the RydC-S1 mutant, we compared its turnover rate to the wild-type RNA in the presence and absence of the chaperone Hfq (Figure 1B/C). To this end, both RydC and RydC-S1 were expressed in Salmonella under the control of the constitutive PL promoter from high-copy plasmids. At exponential growth (OD600 of 1), transcription was stopped by the addition of rifampicin, and RNA levels were monitored at various time-points after treatment. As previously shown (Fröhlich et al., 2013), RydC is very stable in wild-type cells (half-life (t1/2) >32 min), while RydC-S1 decayed rapidly (t1/2 ∼2 min). In contrast, the half-life of RydC is already markedly reduced in hfq mutant cells (t1/2 ∼4 min) and there is some further acceleration of the decay rate for RydC-S1, matching that of RydC in the absence of Hfq. This observation led us to assume that RydC stability depends on both the integrity of the pseudoknot fold and the ability to associate with the Hfq protein. Both the wild-type RydC and RydC-S1 mutant bind Hfq in electrophoretic mobility shift assays with in vitro synthesized RNA although the affinity of the mutant is lower than for the wild-type RydC (Figure 1—figure supplement 1A). Competition experiments suggest that the binding sites for the RydC and RydC-S1 overlap at least partially (Figure 1—figure supplement 1B). This suggests that the destabilisation in vivo might result mostly from the integrity of the pseudoknot fold, but to some degree, the Hfq binding alone may inhibit the degradation of RydC from initiating in single-stranded regions as suggested for other sRNAs (Moll et al., 2003; Saramago et al., 2014).
Overview of the structure of the RydC–Hfq complex
The strong association of RydC with Hfq both in vivo and in vitro as well as its highly compact structure made it an ideal candidate for co-crystallizations. Co-crystals of full length E. coli Hfq and Salmonella RydC were obtained that diffracted to 3.48 Å resolution, and the crystal structure of the RydC–Hfq complex was solved by molecular replacement using a model of the structured hexameric core of E. coli Hfq (i.e., lacking the C-terminal residues beyond amino acid 65). Unbiased, interpretable density for the RNA was apparent in the early maps calculated from the positioned protein hexamer, and a model for most of RydC could be fitted into the electron density (Figure 2—figure supplement 1 shows a map in which the RNA was omitted from the refinement and phase calculations). As the crystal structure is limited to a resolution of 3.48 Å, the map does not provide unequivocal alignment of RydC sequence with structure; however, the path of the majority of the RNA and the duplex regions could be modelled confidently. The asymmetric unit of the crystal comprises one full Hfq hexamer and one RydC (Figure 2). The fold of the RydC is consistent with the pseudoknot structure predicted from solution studies and sequence alignments (Antal et al., 2005; Fröhlich et al., 2013). The RNA bridges two adjacent Hfq hexamers in the crystal lattice, effectively forming a distorted sandwich with a wedge-shape (Figure 3A). The 3′-end of the RNA interacts with the proximal face of the principle hexamer, while the 5′-end interacts with the upper portion of the lateral surface of another hexamer in the asymmetric unit (see below for details, paragraph ‘Interactions of RydC with the lateral surface of Hfq’).
The organisation of the RNA in the crystal would suggest that a single RydC molecule could form a closed complex with two sandwiching Hfq hexamers. However, solution studies using analytical ultracentrifugation (AUC) and multi-angle laser light scattering (SEC-MALS) indicate that the RydC can form a stable 1:1 complex with the Hfq hexamer under various buffer conditions when the components are present in unitary stoichiometry (Figure 2—figure supplements 2–3). When Hfq is present at twice the concentration of RydC under low ionic strength conditions, some 2:1 Hfq:RydC species can be identified; moreover, a higher order complex can be observed by electrophoretic mobility shifts when Hfq is in excess over RNA (results not shown). These observations suggest that the stoichiometry of Hfq/RNA complexes is very sensitive to solution conditions. In vivo, it seems unlikely that one RydC can find two Hfq hexamers when the protein is limiting to total RNA (Wagner, 2013), and we suggest that the 1:1 complex is likely to represent the physiologically relevant species. We propose that the 2:1 sandwiching complex we observe in the crystal is not the biologically relevant species, but that it encompasses the full set of interactions that would form in a closed 1:1 RydC:Hfq complex. We suggest that the contacts made by RydC with the neighbouring Hfq hexamers can form on the proximal face of the same, single hexamer to form an idealised 1:1 complex. This hypothetical model will be elaborated on further below.
Interactions of the U-rich tail of RydC with the proximal face of Hfq
The 3′-end of RydC has a U-rich tail that - together with a strong hairpin structure - facilitates Rho-independent transcription termination and which is an important determinant for recognition of many sRNAs (Otaka et al., 2011; Sauer and Weichenrieder, 2011; Ishikawa et al., 2012). The RydC/Hfq structure reveals that the U-rich tail is bound in a recessed channel on the proximal face of Hfq. The electron density is not well defined in this region, and there may be disorder or multiple conformations of the polyU tail, but the features do show that uridines U61, U62, and U64 (as well as C63) each forms an aromatic base stacking interaction with the ring of F42 (Figure 3C; distances provided in Figure 3—figure supplement 1). K56 forms hydrogen bonds with the O2 group of the uracils, consistent with the importance of this residue in binding RNA on the proximal face (Mikulecky et al., 2004). The model indicates that there are likely to be hydrogen bonds between the uridine bases and Q41 and Q8, as well as between the furanose 2′ OH group with the imidazole group of H57. These interactions are consistent with the crystal structure of Salmonella Typhimurium Hfq/polyU (Sauer and Weichenrieder, 2011; PDB code 2YLC) and with fluorescence quenching experiments for binding U6 (Robinson et al., 2013). Similar interactions are seen in the crystal structure of Staphylococcus aureus Hfq in complex with AUUUUUG (Schumacher et al., 2002; PDB code 1KQ2) and E. coli Hfq in complex with AUUUUUA (Wang et al., 2011, 2013; PDB code 4HT9). In principle, the RNA can visit each of the uridine pockets following a clockwise or counter-clockwise path, but the polarity seems to be consistently clockwise in the reference frame that views the proximal face. The RNA exits the groove to engage on the rim of the proximal face of Hfq through numerous interactions with protein side- and main-chain atoms, as we describe further below.
Interactions of RydC with the lateral surface of Hfq
On the exposed rim of the proximal face, U24 stacks onto F39 and U23 packs onto U24 (Figure 4A). The U46/U47 bases form similar interactions with the rim of Hfq in a neighbouring asymmetric unit (Figure 4B). U24 and U47 contact N13 and likely make a hydrogen bonding interaction. Both U23/U24 and U46/U47 pairs are highly conserved in an alignment of the available RydC sequences (Fröhlich et al., 2013). The U24/U25 and U46/U47 interactions with the rim are consolidated by the N-terminal residues G4 and Q5 and likely include amide backbone and lysine side chain interactions (Figure 4A,B; distances provided in Figure 4—figure supplement 1).
Although highly conserved, the U23/U24 and U46/U47 pairs are not required for the base-pairing to the RydC target, cfa mRNA, because a chimeric RNA assembled from the RydC 5′-end (nt. 1 to 13) and the 3′-end of the sRNA MicA is as efficient in target gene regulation as the wild-type RydC (Fröhlich et al., 2013). To address the function of the highly conserved U residues, three mutant variants of RydC were constructed (Figure 5A). RydC-LI (U23G U24C), RydC-LII (U46C U47G), as well as RydC-LI/II (U23G U24C U46C U47G) were expressed from a constitutive promoter in wild-type Salmonella cells, and RNA stability was assessed by rifampicin treatment (Figure 5B). Compared to wild-type RydC (t1/2 ∼32 min), mutant RydC-LII had a reduced half-life (t1/2 ∼11 min). Both variants RydC-LI and the double mutant RydC-LI/LII were even less stable (t1/2 ∼2 min) and especially the RydC-LI and RydC-LI/LII variants were less potent than the wild-type to regulate the reporter target Cfa–GFP (Figure 5C). The discrepancy between steady-state levels and regulatory potential observed between the different RydC variants could be the result of alternative binding patterns via LI and LII leading to changes in RNA orientation on Hfq. Thus, while functional conclusions may be limited at this point, our in vivo data suggest a requirement of the conserved U pairs in the loops of the RydC pseudoknot structure for RNA stability and indicate a functional importance for the interactions observed in the crystal structure of the complex.
The observed U23/U24 and U46/U47 interactions with the rim region may account for the findings from solution studies that E. coli Hfq has greater affinity for 16-mer polyU compared to the 6-mer U6, because the longer polymer could form the contacts in the recessed core with the U-rich 3′-end and simultaneously interact with the rim through F39 and N13. The U23/U24 and U46/U47 interactions may also account for the partial fluorescence quenching observed for F39 in the presence of U6 (Robinson et al., 2013), because excess U6 may bind to the rim and interact with F39 in a similar way to the contacts observed for U23/U24 and U46/47 pairs of RydC.
Earlier studies identified residues R16, R17, R19, and K47 as forming a lateral RNA binding surface in Salmonella Hfq (Sauer et al., 2012; highlighted in cyan in Figure 6C). These residues are part of an arginine patch that is proposed to interact with both the sRNA and the mRNA molecules (Sauer et al., 2012). In our structure, R16 and R17 make phosphate backbone contacts near U23 and U24 (Figure 4A), while R19 and K47 are not interacting with the RNA. Residues R16 and R17 belonging to Hfq in the neighbouring crystal unit cell are also involved in the binding of the 5′-end of RydC (Figure 3B). The nucleotide stretch 8–12, representing the last portion of the RydC seed region, is in fact kept in an extended conformation by the interaction with helical region 8–17 from a symmetry related Hfq molecule (Figure 3B). This may account for observations from previous studies showing that mutations on this region of the rim do not impact on the ability of Hfq to bind sRNA sequences but rather on its capacity to stimulate annealing of sRNA with an mRNA target (Panja et al., 2013).
The RNA mediates numerous other interactions with neighbouring Hfq hexamers. One of the contacts mimics interactions of the A-A-N repeats on the distal face (Figure 6—figure supplement 1). A more extensive and potentially more interesting interaction occurs between the duplex region of RydC and the rim surface of an Hfq hexamer in a neighbouring asymmetric unit (Figure 6A). The residues mediating this interaction are R19, R66, P21, T49, Q33, and Q35 (Figure 6B).
To explore the contribution of the potential rim contacts to mediate the interaction between RydC and cfa, an Hfq mutant in the lateral rim (where residues R19, P21, Q33, Q35, T49, R66 have been mutated to A) was expressed and purified, and its binding rates and affinities were estimated from interferometry (ForteBio Octet Red96). For this experiment, the 5′ region of cfa (cfa1, nucleotides from TSS1 to −72 relative to the AUG, 139 nts, Fröhlich et al., 2013) was prepared with biotin attached to the 5′-end, which allowed the mRNA fragment to be immobilised on a streptavidin sensor surface. As shown in Figure 6D, both the wild-type protein (HfqWT) and the Hfq rim mutant (HfqRim) have similar affinities and association rate constants for the immobilised cfa.
In the presence of RydC, the association of Hfq with cfa changes. The binding affinity of HfqWT for cfa decreases by a factor of roughly 5, suggesting that the formation of the Hfq–sRNA complex affects the Hfq affinity for the mRNA fragment. Moreover, the presence of RydC decreases by an order of magnitude the association rate constant of Hfq for cfa. This could result from cfa being bound in a slightly different way in the Hfq/cfa complex as compared to the Hfq/RydC/cfa ternary complex. Strikingly, in the presence of RydC, the HfqRim mutant no longer has detectable binding to cfa. This finding suggests that the RydC interacts differently with HfqWT and the HfqRim lateral surface mutant in a way that perturbs cfa binding. Thus, the rim interactions shown in Figure 6A,B,C are crucial for forming a productive sRNA:Hfq:mRNA assembly.
The N- and C-termini of Hfq interact distributively with RydC
The C-terminal regions of Hfq, beyond residue 70, are generally disordered, but density could be observed for some of the protomers into which the model could be partially extended. Although poorly ordered, the C-terminal tails appear to be making distributive contacts over the surface of RydC. The N-terminal regions of three Hfq protomers, exposed on the proximal face of the protein, also interact with RydC, mainly in the vicinity of nucleotides 23–25, 51–52, and 59.
To explore if the observed interactions between the Hfq C-termini and RydC might be important in vivo, we compared the stabilities of the sRNA in the presence of wild-type Hfq and an Hfq mutant truncated at residue 70 (Figure 7). We observed that the half-life of RydC was greatly reduced in the absence of the C-terminal end (t1/2 ∼5 min) and was comparable with the half-life observed for an hfq null mutant (t1/2 ∼4 min). However, upon initial decay of a subfraction of RydC, the remaining population appears to be more stable and is only degraded at a rate comparable to RNA in the wild-type background. In vitro binding experiments show that the affinity of purified Hfq truncated at residue 66 (Hfq66) for binding cfa is similar to that of the wild-type protein; however, the presence of RydC decreases substantially the binding of Hfq66 (Figure 6—figure supplement 2). The curve for Hfq66/RydC binding to cfa could not be modelled, but the affinity is likely to be reduced by two orders of magnitude compared with the binding of Hfq66 alone to cfa. This reduction is much greater than the fivefold decrease in affinity when comparing the mRNA binding of the wild-type Hfq and its Hfq/RydC complex (Figure 6D). These observations suggest that the C-terminal tails are important for facilitating the formation of the sRNA:mRNA:Hfq ternary complex.
The C-terminal tails appear to influence the rate of Hfq–RNA complex formation, as indicated by the order-of-magnitude lower on-rate of Hfq66 compared to HfqWT for binding cfa (Figure 6D and Figure 6—figure supplement 2). Moreover, the interactions of the tails with sRNA may explain the puzzling observation that the HfqWT–RydC has a lower kon than HfqWT for association with cfa. In this case, we envisage that the tails, sequestered with the sRNA, are no longer available to ‘fish’ for and accelerate association with the mRNA partner.
Discussion
The crystal structure of RydC/Hfq suggests key elements of molecular recognition of this effector complex. We observe interactions of the 3′ polyU tract with the recessed pore in the Hfq proximal face and interactions of U–U dinucleotide steps in single-stranded regions with the surface near the rim. Based on results from in vivo crosslinking and RNA sequencing from enterohemorrhagic E. coli, it has been proposed that the consensus Hfq binding site on many sRNAs includes a U–U dinucleotide associated with an unpaired region (Tree et al., 2014). Our results are consistent with this prediction and provide a structural rationalisation: the unpaired region is required to engage the U–U step on the rim. Interactions with the rim may account for the capacity of longer U-tracts to bind Hfq more strongly (Murina et al., 2013; Panja et al., 2013), since these are anticipated to bridge between the rim and the 3′-polyU-binding pore. The rim might also be the binding site for the internal U-rich region in SgrS that is required for stable association with Hfq (Ishikawa et al., 2012).
The crystal structure suggests that one RydC can be sandwiched between two Hfq hexamers, and a similar model has been proposed for interaction of Hfq with the sRNA DsrA (Wang et al., 2011). However, solution data indicate that the stoichiometry of the complex is sensitive to buffer conditions and that a 1:1 complex is preferred at higher salt concentrations, but a 2:1 Hfq hexamer:RydC complex can be formed in lower ionic strength buffer (Figure 2—figure supplement 2). We propose that the 1:1 complex is the physiologically relevant species in vivo and that the sandwiching complex we observe in the crystal is likely to mimic a folding intermediate en route to forming a closed 1:1 RydC:Hfq complex. Accordingly, the contacts formed by the RydC with the neighbouring Hfq hexamers can both occur on the proximal face of the same, single hexamer to form the 1:1 complex (shown schematically in Figure 8).
Solution studies indicate that the C-termini of Vibrio cholerae and E. coli Hfq are natively unstructured (Beich-Frandsen et al., 2011; Vincent et al., 2012). For most of the Hfq subunits in the crystal structure studied here, the C-terminal regions beyond residue 77 are disordered, but broken density in the vicinity is likely to be due to multiple conformations of the C-terminal region. These regions make consolidating interactions with the RNA and appear to be distributed over the exposed surfaces of the nucleic acid. A structural condensation of the C-terminal domains with sRNA binding is also consistent with data from X-ray and neutron solution scattering (Henderson et al., 2013). Both the structural data as well as in vivo experiments suggest that the C-terminal end of Hfq contributes to the association with RydC. The distributive interactions of the C-terminal domain are not expected to provide specificity to the recognition but may enhance the stability of equilibrium complexes as well as boosting the association rates by increasing the effective molecular encounter radius. The distributive interaction may account for the finding that deletion of the C-terminal extension decreases binding to mRNAs (Večerek et al., 2008) and that the C-terminal tail can bind to some RNA species (Robinson et al., 2013). The same might be true for yeast Lsm1 mutants with C-terminal truncations that have reduced affinity for RNA (Chowdhury et al., 2012).
These results are in agreement with a study that compared the distribution of Salmonella RNAs associating with Hfq variants of different species (Sittka et al, 2009). Similar to several orthologues of various bacterial species, Hfq of the archaeon Methanocaldococcus jannaschii lacks the unstructured C-terminus. M. jannaschii Hfq is generally able to complement a Salmonella hfq null mutant; however, the distribution of associated sRNAs recovered from pull-down experiments varies greatly compared to the wild-type. While a number of Salmonella sRNAs (including, for example, the Hfq-dependent SroC, InvR, and GcvB) are not reduced in their cellular abundance, a second class of sRNAs is strongly affected by the absence of the C-terminal end. This latter group includes RydC and also several sRNAs highly abundant in the wild-type (e.g., RprA, SdsR, and ArcZ), and it is tempting to speculate that this subpopulation of sRNAs is associated with Hfq through supporting interactions with the C-terminal tail. The differing roles of the tails for different sRNAs may account for the apparently conflicting results reported in the literature on the effects of the C-terminus on sRNA binding and activity (Olsen et al., 2010). Finally, it should be noted that the C-terminal tails have different lengths in different Hfq homologs, and it is likely that their role differs in different species (Vincent et al., 2012).
As a general principle, the interactions of natively unstructured regions with RNA may be important in other nucleic acid/protein complexes (Tompa and Csermely, 2004; Kucera et al., 2011; Jonas and Izaurralde, 2013), and they can contribute to cooperative binding (Hunter and Anderson, 2009; Motlagh et al., 2014). We envisage that these interactions would help to keep the sRNA and mRNA in proximity without constraining them, so that the mRNA might travel along the distal face while the sRNA was held in a more fixed conformation on the proximal face. In this way, the complex can glide along an mRNA until the seed and target region meet. Once this occurs, they will close like a zipper to form a short duplex that could be engaged on the convex rim of the Hfq. The mismatch of molecular symmetries and the high flexibility of both the N- and C-termini of Hfq ensure that the complex will be intrinsically heterogeneous in conformation, but highly dynamic.
Based on the current crystal structure, including the lattice interactions observed, and on other available structural and biophysical data, we propose a speculative model for the ternary complex of Hfq/sRNA and mRNA target (Figure 8). The model shares some similarity to the schematic proposed by Panja et al. (2013) and predicts that the 5′ seed will be exposed and available for interaction with the target to form a duplex region that binds to the convex rim of the Hfq hexamer. We envisage that a different model will be required to explain the interactions of sRNA using recognition regions that are internal and not in the 5′ seed. One prediction is that the base-pairing sites in this group of sRNAs will be presented near the rim of Hfq. This model awaits testing.
Materials and methods
Expression and purification of Hfq
Request a detailed protocolWild-type E. coli Hfq was expressed and purified as previously described by Bandyra et al. (2012). Hfq mutants were expressed from pBAD vector in Top10 Δhfq strain (JVS-02001) to avoid hetero-hexamer formation. The mutant proteins were purified with the same protocol used for the wild-type Hfq, with a HiTrap Heparin purification step added (buffer A: 50 mM Tris, pH 8.0, 100 mM NaCl, 100 mM KCl; buffer B: buffer A + 1 M NaCl). The presence of mutations or truncations was confirmed by mass spectrometry analyses.
Preparation of RydC RNA
Request a detailed protocolFor Salmonella RydC RNA in vitro transcription (IVT), a DNA template was amplified by PCR from pKF42-1 using primers RydC_T7_fwd and RydC_T7_rev (Table 1).
Transcription mixtures containing DNA-template, ribonucleotides (rNTPs), T7 RNA polymerase, DTT (dithiothreitol), and RNase Out (Invitrogen, UK) were incubated in transcription buffer (400 mM Tris (pH 7.9/8.0), 250 mM MgCl2, 20 mM spermidine) for 4 hr at 37°C, and then Turbo DNase (Ambion) was added to digest the template. The IVT product was separated on a 10% polyacrylamide gel containing 7 M urea. The band corresponding to the transcribed RydC was visualized by UV-shadowing, excised from the gel, and the RNA was electro-eluted overnight using an EluTrap system (Whatman, UK). The RNA sample was concentrated by ultrafiltration through a Vivaspin 500 concentrator (5 kDa cut-off).
Crystallization of Hfq–RydC complex
Request a detailed protocolThe RNA–protein complex was prepared by mixing purified Hfq6 and RydC in a ratio of 2:1, the Hfq hexamer concentration being 15 mg/ml. Crystals of the Hfq–RydC complex were grown by sitting drop vapour diffusion after adding an equal volume of crystallization buffer (0.2 M trisodium citrate, 0.1 M sodium cacodylate (pH 6.5), 15% vol/vol isopropanol) to the protein–RNA mixture. Crystals were harvested using 25% PEG400 as cryoprotectant and flash frozen in liquid nitrogen. X-ray data were collected on station I24 at Diamond Light source. Data were collected at 100 K at a wavelength of 0.9778 Å.
Images obtained from the two best crystals having the same space group and cell dimensions were merged and scaled together, before proceeding with the structure determination.
Crystal structure determination
Request a detailed protocolThe structure was solved by molecular replacement with PHASER (McCoy et al., 2007) using residues 5–66 of E. coli Hfq as the search model. Electron density for RNA was apparent in the early maps. Maps were improved with density modification with PARROT using histogram matching with tRNA/aminoacyl-tRNA synthase experimental structure factors as the reference distribution. This structure has a similar solvent content and RNA/protein mass ratio as the RydC/Hfq crystals, and density modification with this method improved the map and visualisation of the RNA. The structure was refined using BUSTER (Bricogne et al., 2011) and with jelly body restraints in refinement with REFMAC (Murshudov et al., 2011). The model was built using COOT (Emsley et al., 2010). The protein and RNA stereochemistry were validated by using both Coot validation tools and Procheck from the CCP4 suite (Laskowski et al., 1993; Emsley et al., 2010). The Ramachandran plot of the model places 92.9% of residues in most favoured regions, 5.4% in additional allowed regions, 1.6% in generously allowed regions, and none in disallowed regions. X-ray data collection and refinement statistics are summarized in Table 2. Figures were prepared using PYMOL (DeLano, 2006). The coordinates and structure factors have been deposited in the PDB with accession code 4v2s.
Binding rates and affinity measurements
Request a detailed protocolKinetic measurements with Bio-Layer Interferometry were performed using an Octet RED96 equipped with Streptavidin sensors (ForteBio, UK) on 96-well plates. The experiment was performed in the binding buffer (25 mM Tris, pH 7.5, 50 mM NaCl, 50 mM KCl, 1 mM MgCl2, 1 mM DTT), which was also used to prepare all dilutions, for dissociation and neutralisation. A fragment of cfa mRNA (cfa1, nucleotides from TSS1 to −72 relative to the AUG, 139 nts, Fröhlich et al., 2013) was in vitro transcribed from a PCR template (JVO-4558*JVO-9044 on pKF31-1) and labelled with biotin in an IVT reaction using fivefold excess of GMP-biotin (TriLink Biotechnologies, San Diego CA, USA) over GTP. 5′ biotin labelled cfa fragment was immobilised on the biosensor that was subsequently submerged into 10 μM solution of maltose binding protein (MBP) labelled with biotin. The binding of wild-type and mutant Hfq were assayed at 0, 5, 10, 25, 75, 150, 250, and 500 nM protein over 400 s, in the absence or presence of 1 μM chemically synthesized RydC (Dharmacon, GE Healthcare, UK). The dissociation was monitored over 300 s and was followed by regeneration of the sensors using 1 M MgCl2. Another set of tips was saturated with MBP-biotin and the measurements were then repeated for all Hfq and Hfq–RNA concentration series. The data were fitted with Data Analysis software (ForteBio) with a 1:1 binding model. The plots were prepared with Profit (Quantum Soft, Switzerland) using the following equation for response fit:
Y = Rm × Xn/(Kdn + Xn),
where Y is the observed binding, X is the molar concentration of the ligand, Rm is the maximum specific binding, and n is the Hill’s coefficient.
DNA/RNA oligonucleotides and plasmids
Request a detailed protocolSequences of all oligonucleotides employed in this study are listed in Table 1. All plasmids used in this study are summarized in Table 3.
For plasmids expressing loop mutants of RydC from the PL-promoter, plasmid pKF42-1 served as template for PCR amplification with primer pairs JVO-10915/JVO-10916 (pPL-RydC-LI; pKF224-9), JVO-10913/JVO-10914 (pPL-RydC-LII; pKF223), and the linear fragments were purified and self-ligated. Similarly, pPL-RydC-LI/LII (pKF225) was constructed by self-ligation of a PCR product of JVO-10915/10,916 using pKF223 as template. Competent E. coli TOP10 were used for all cloning purposes.
Bacterial strains and growth conditions
Request a detailed protocolFor the in vivo analyses, bacteria were grown aerobically in L-Broth (LB) medium at 37°C. Where appropriate, liquid and solid media were supplemented with antibiotics at the following concentrations: 100 µg/ml ampicillin, 50 µg/ml kanamycin, and 20 µg/ml chloramphenicol.
A complete list of bacterial strains employed in this study is provided in Table 4. Salmonella enterica serovar Typhimurium strain SL1344 (JVS-0007) is referred to as wild-type strain and was used for mutant construction. Single mutant derivatives were constructed by the λRed recombinase one-step inactivation method. To obtain JVS-10701 (SL1344 hfq70::STOP), Salmonella cells carrying the pKD46 helper plasmid were transformed with a DNA fragment amplified from pKD4 using JVO-10909/JVO-10910. Subsequently, the KanR cassette of λRed-derived mutants was eliminated by transformation with the FLP recombinase expression plasmid pCP20 (Datsenko and Wanner, 2000). Phage P22 transduction was used to transfer chromosomal modifications to a fresh Salmonella wild-type background as well as to obtain strains carrying multiple mutations.
Protein sample analysis
Request a detailed protocolTo prepare whole-cell samples for Western blot analyses, bacteria were collected by centrifugation (16,000×g; 2 min; 4°C), and pellets were resuspended in 1X protein loading buffer (Fermentas, UK) to a final concentration of 0.01 OD/µl. Samples corresponding to 0.1 OD were loaded per lane and resolved by SDS-PAGE, after which proteins were transferred to PVDF membranes as described in Sittka et al. (2007). GFP fusion proteins and GroEL were detected using commercially available antibodies (GFP: 1:5000; mouse; Roche # 11814460001 and GroEL: 1:10,000; rabbit; Sigma-Aldrich G6532). Anti-mouse or anti-rabbit secondary antibodies conjugated with horseradish peroxidase (1:10,000; GE Healthcare) were used in all cases. Signals were visualized using the Western Lightning reagent (PerkinElmer, Waltham MA, USA) and an ImageQuant LAS 4000 CCD camera (GE Healthcare).
Determination of RNA half-life
Request a detailed protocolTo monitor RNA half-life of RydC mutant variants, cells were grown to an OD600 of 1 and treated with rifampicin (final concentration: 500 µg/ml) to abrogate transcription. RNA samples were withdrawn at indicated time-points, and RNA decay was determined by Northern blot analysis as previously described (Urban and Vogel, 2007; Fröhlich et al., 2012). RydC and its variants were detected with the universal oligo JVO-4364; 5S RNA served as loading control (JVO-0322).
Electrophoretic mobility shift assay (EMSA)
Request a detailed protocolFormation of complexes between sRNAs and Hfq in vitro was analysed by gel shift assays. For RNA in vitro synthesis, ∼200 ng of template DNA carrying a T7 promoter sequence was amplified by PCR (RydC: JVO-4721*JVO-4722 on pKF42-1; RydC-S1: JVO-5165*JVO-4722 on pKF60-1) and reverse transcribed and 5′-end-labelled as described previously (Fröhlich et al., 2013). Labelled RNA RydC or RydC-S1 (4 pmol) was denatured (95°C, 2 min), chilled on ice for 5 min, and supplemented with 1× structure buffer and 1 μg yeast RNA. Upon addition of purified Hfq (concentration as indicated in the figure legends) or Hfq dilution buffer (control; 1× structure buffer, 1% (vol/vol) glycerol, 0.1% Triton X-100), samples were incubated at 37°C for 10 min. Unlabelled competitor RNA was added at the indicated concentrations, and samples were incubated for additional 10 min. Prior to loading, reactions were mixed with native loading buffer (50% glycerol, 0.5× TBE, 0.02% (wt/vol) bromophenol blue) and separated by native PAGE (6% PAA). Gels were dried and signals were determined on a Typhoon FLA 7000 phosphorimager.
References
-
A small bacterial RNA regulates a putative ABC transporterThe Journal of Biological Chemistry 280:7901–7908.https://doi.org/10.1074/jbc.M413071200
-
Structural insights into the dynamics and function of the C-terminus of the E. coli RNA chaperone HfqNucleic Acids Research 39:4900–4915.https://doi.org/10.1093/nar/gkq1346
-
One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR productsProceedings of the National Academy of Sciences of USA 97:6640–6645.https://doi.org/10.1073/pnas.120163297
-
DeLano ScientificDeLano Scientific, San Carlos, CA, USA, www.pymol.org.
-
Bacterial small RNA-based negative regulation: Hfq and its accomplicesThe Journal of Biological Chemistry 288:7996–8003.https://doi.org/10.1074/jbc.R112.441386
-
Features and development of CootActa Crystallographica Section D, Biological Crystallography 66:486–501.https://doi.org/10.1107/S0907444910007493
-
A small RNA activates CFA synthase by isoform-specific mRNA stabilizationThe EMBO Journal 32:2963–2979.https://doi.org/10.1038/emboj.2013.222
-
A conserved RpoS-dependent small RNA controls the synthesis of major porin OmpDNucleic Acids Research 40:3623–3640.https://doi.org/10.1093/nar/gkr1156
-
Activation of gene expression by small RNACurrent Opinion in Microbiology 12:674–682.https://doi.org/10.1016/j.mib.2009.09.009
-
An intrinsically disordered C terminus allows the La protein to assist the biogenesis of diverse noncoding RNA precursorsProceedings of the National Academy of Sciences of USA 108:1308–1313.https://doi.org/10.1073/pnas.1017085108
-
PROCHECK: a program to check the stereochemical quality of protein structuresJournal of Applied Crystallography 26:283–291.https://doi.org/10.1107/S0021889892009944
-
Structure of Escherichia coli Hfq bound to polyriboadenylate RNAProceedings of the National Academy of Sciences of USA 106:19292–19297.https://doi.org/10.1073/pnas.0908744106
-
Phaser crystallographic softwareJournal of Applied Crystallography 40:658–674.https://doi.org/10.1107/S0021889807021206
-
Escherichia coli Hfq has distinct interaction surfaces for DsrA, rpoS and poly(A) RNAsNature Structural & Molecular Biology 11:1206–1214.https://doi.org/10.1038/nsmb858
-
Hfq binds ribonucleotides in three different RNA-binding sitesActa Crystallographica Section D, Biological Crystallography 69:1504–1513.https://doi.org/10.1107/S090744491301010X
-
REFMAC5 for the refinement of macromolecular crystal structuresActa Crystallographica Section D, Biological Crystallography 67:355–367.https://doi.org/10.1107/S0907444911001314
-
C-terminally truncated derivatives of Escherichia coli Hfq are proficient in riboregulationJournal of Molecular Biology 404:173–182.https://doi.org/10.1016/j.jmb.2010.09.038
-
PolyU tail of rho-independent terminator of bacterial small RNAs is essential for Hfq actionProceedings of the National Academy of Sciences of USA 108:13059–13064.https://doi.org/10.1073/pnas.1107050108
-
Conserved arginines on the rim of Hfq catalyze base pair formation and exchangeNucleic Acids Research 41:7536–7546.https://doi.org/10.1093/nar/gkt521
-
Hfq proximity and orientation controls RNA annealingNucleic Acids Research 40:8690–8697.https://doi.org/10.1093/nar/gks618
-
Evidence for an autonomous 5′ target recognition domain in an Hfq-associated small RNAProceedings of the National Academy of Sciences of USA 107:20435–20440.https://doi.org/10.1073/pnas.1009784107
-
Structural model of an mRNA in complex with the bacterial chaperone HfqProceedings of the National Academy of Sciences of USA 111:17134–17139.https://doi.org/10.1073/pnas.1410114111
-
Mapping Hfq-RNA interaction surfaces using tryptophan fluorescence quenchingNucleic Acids Research 42:2736–2749.https://doi.org/10.1093/nar/gkt1171
-
The role of RNases in the regulation of small RNAsCurrent Opinion in Microbiology 18:105–115.https://doi.org/10.1016/j.mib.2014.02.009
-
Small RNA to the lateral surface of Hfq hexamers and structural rearrangements upon mRNA target recognitionProceedings of the National Academy of Sciences of USA 109:9396–9401.https://doi.org/10.1073/pnas.1202521109
-
Structural basis for RNA 3’-end recognition by HfqProceedings of the National Academy of Sciences of USA 108:13065–13070.https://doi.org/10.1073/pnas.1103420108
-
The RNA chaperone Hfq is essential for the virulence of Salmonella typhimuriumMolecular Microbiology 63:193–217.https://doi.org/10.1111/j.1365-2958.2006.05489.x
-
Controlling mRNA stability and translation with small, noncoding RNAsCurrent Opinion in Microbiology 7:140–144.https://doi.org/10.1016/j.mib.2004.02.015
-
Regulation by small RNAs in bacteria: expanding frontiersMolecular Cell 43:880–891.https://doi.org/10.1016/j.molcel.2011.08.022
-
Translational control and target recognition by Escherichia coli small RNAs in vivoNucleic Acids Research 35:1018–1037.https://doi.org/10.1093/nar/gkl1040
-
The C-terminal domain of Escherichia coli Hfq is required for regulationNucleic Acids Research 36:133–143.https://doi.org/10.1093/nar/gkm985
-
Characterization of Vibrio cholerae Hfq provides novel insights into the role of the Hfq C-terminal regionJournal of Molecular Biology 420:56–69.https://doi.org/10.1016/j.jmb.2012.03.028
-
Hfq and its constellation of RNANature Reviews Microbiology 9:578–589.https://doi.org/10.1038/nrmicro2615
-
Hfq-bridged ternary complex is important for translation activation of rpoS by DsrANucleic Acids Research 41:5938–5948.https://doi.org/10.1093/nar/gkt276
-
Cooperation of Escherichia coli Hfq hexamers in DsrA bindingGenes & Development 25:2106–2117.https://doi.org/10.1101/gad.16746011
-
Mutations in interaction surfaces differentially impact E. coli Hfq association with small RNAs and their mRNA targetsJournal of Molecular Biology 425:3678–3697.https://doi.org/10.1016/j.jmb.2013.01.006
-
Global analysis of small RNA and mRNA targets of HfqMolecular Microbiology 50:1111–1124.https://doi.org/10.1046/j.1365-2958.2003.03734.x
Article and author information
Author details
Funding
Wellcome Trust (RG61065)
- Ben F Luisi
The funder had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
This work was supported by the Wellcome Trust (BFL) and the Bavarian BioSysNet programme (JV). We thank the beamline staff at Diamond Light Source for help and advice with data collection. We thank Len Packman for mass spectrometry analysis of proteins and Barbara Plaschke for excellent technical assistance. We thank Tommaso Moschetti and Katherine Stott for advice for the binding experiments, Kiyoshi Nagai for helpful comments, Steven Hardwick for help and advice with the AUC, and Tristan Croll for analysis of the stereochemistry of the refined model.
Copyright
© 2014, Dimastrogiovanni et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 3,285
- views
-
- 553
- downloads
-
- 99
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Structural Biology and Molecular Biophysics
Peroxisome proliferator-activated receptor gamma (PPARγ) is a nuclear receptor transcription factor that regulates gene expression programs in response to ligand binding. Endogenous and synthetic ligands, including covalent antagonist inhibitors GW9662 and T0070907, are thought to compete for the orthosteric pocket in the ligand-binding domain (LBD). However, we previously showed that synthetic PPARγ ligands can cooperatively cobind with and reposition a bound endogenous orthosteric ligand to an alternate site, synergistically regulating PPARγ structure and function (Shang et al., 2018). Here, we reveal the structural mechanism of cobinding between a synthetic covalent antagonist inhibitor with other synthetic ligands. Biochemical and NMR data show that covalent inhibitors weaken—but do not prevent—the binding of other ligands via an allosteric mechanism, rather than direct ligand clashing, by shifting the LBD ensemble toward a transcriptionally repressive conformation, which structurally clashes with orthosteric ligand binding. Crystal structures reveal different cobinding mechanisms including alternate site binding to unexpectedly adopting an orthosteric binding mode by altering the covalent inhibitor binding pose. Our findings highlight the significant flexibility of the PPARγ orthosteric pocket, its ability to accommodate multiple ligands, and demonstrate that GW9662 and T0070907 should not be used as chemical tools to inhibit ligand binding to PPARγ.
-
- Structural Biology and Molecular Biophysics
SCARF1 (scavenger receptor class F member 1, SREC-1 or SR-F1) is a type I transmembrane protein that recognizes multiple endogenous and exogenous ligands such as modified low-density lipoproteins (LDLs) and is important for maintaining homeostasis and immunity. But the structural information and the mechanisms of ligand recognition of SCARF1 are largely unavailable. Here, we solve the crystal structures of the N-terminal fragments of human SCARF1, which show that SCARF1 forms homodimers and its epidermal growth factor (EGF)-like domains adopt a long-curved conformation. Then, we examine the interactions of SCARF1 with lipoproteins and are able to identify a region on SCARF1 for recognizing modified LDLs. The mutagenesis data show that the positively charged residues in the region are crucial for the interaction of SCARF1 with modified LDLs, which is confirmed by making chimeric molecules of SCARF1 and SCARF2. In addition, teichoic acids, a cell wall polymer expressed on the surface of gram-positive bacteria, are able to inhibit the interactions of modified LDLs with SCARF1, suggesting the ligand binding sites of SCARF1 might be shared for some of its scavenging targets. Overall, these results provide mechanistic insights into SCARF1 and its interactions with the ligands, which are important for understanding its physiological roles in homeostasis and the related diseases.