Structural basis of transcription arrest by coliphage HK022 Nun in an Escherichia coli RNA polymerase elongation complex
Abstract
Coliphage HK022 Nun blocks superinfection by coliphage λ by stalling RNA polymerase (RNAP) translocation specifically on λ DNA. To provide a structural framework to understand how Nun blocks RNAP translocation, we determined structures of Escherichia coli RNAP ternary elongation complexes (TECs) with and without Nun by single-particle cryo-electron microscopy. Nun fits tightly into the TEC by taking advantage of gaps between the RNAP and the nucleic acids. The C-terminal segment of Nun interacts with the RNAP β and β’ subunits inside the RNAP active site cleft as well as with nearly every element of the nucleic acid scaffold, essentially crosslinking the RNAP and the nucleic acids to prevent translocation, a mechanism supported by the effects of Nun amino acid substitutions. The nature of Nun interactions inside the RNAP active site cleft suggests that RNAP clamp opening is required for Nun to establish its interactions, explaining why Nun acts on paused TECs.
https://doi.org/10.7554/eLife.25478.001Introduction
Bacteriophages are the most common and diverse entities in the biosphere. They represent the simplest systems that are capable of self-replication. Many of the paradigms of transcription regulation were originally discovered studying bacteriophages (Ptashne, 2005). Bacteriophage replication inside host bacterial cells involves coopting the cellular transcription machinery. Generally, this is achieved through small, phage-encoded proteins that bind to the host RNA polymerase (RNAP) and alter its properties. Elucidating these processes provides insight into the mechanism and regulation of bacterial transcription itself.
The 13 kDa Nun protein of coliphage HK022 has one known function, to exclude superinfection by the rival coliphage λ (Robert et al., 1987) by arresting host Escherichia coli (Eco) RNAP transcription specifically on λ DNA (Hung and Gottesman, 1997; Vitiello et al., 2014). HK022 Nun competes with, and antagonizes the function of, the λ N antitermination protein (Robert et al., 1987). Like λ N, Nun achieves specificity by binding to the cis-acting boxB RNA hairpin within nut (N-utilization) RNA sequences in λ nascent transcripts through an N-terminal arginine-rich motif, or ARM (Chattopadhyay et al., 1995; Faber et al., 2001; Schärpf et al., 2000). Unlike λ N, which suppresses transcription termination at terminators downstream of the nut sites, Nun provokes premature downstream termination (Robert et al., 1987; Robledo et al., 1991).
After localization to specific ternary elongation complexes (TECs) containing the λ nut sequence, the C-terminal segment of Nun interacts with the TEC to block RNAP translocation (forward or reverse) at intrinsic pause sites without affecting RNAP catalytic activity and without dissociating the complex (Hung and Gottesman, 1997; Vitiello et al., 2014). Nun-mediated transcription termination in vivo depends on the function of the Mfd protein, which can remove the arrested TECs from the DNA template (Washburn et al., 2003). At normal in vivo expression levels, Nun acts only on TECs containing the λ nut sequence, and Nun function is dependent on NusA, NusB, NusE, and NusG (Robert et al., 1987; Robledo et al., 1991). At high expression levels in vivo, Nun is toxic to Eco cells as it acts to arrest transcription elongation by the host RNAP nonspecifically (Uc-Mass et al., 2008). Nun alone arrests Eco TECs without λ N nut RNA elements and independent of Nus factors when present at high concentration in vitro (Hung and Gottesman, 1997).
We used single-particle cryo-electron microscopy (cryoEM) to determine structures of Eco TECs (4.1 and 4.4 Å resolution) and a Nun/TEC complex (3.7 Å resolution). The structures, combined with the biochemical effects of amino acid substitutions in Nun, explain how Nun blocks both forward and reverse translocation of the RNAP on the nucleic acids and also provides insight into why Nun acts primarily on paused TECs.
Results
Nun acts on an Eco TEC assembled from a minimized nucleic acid scaffold
To assemble Eco TECs suitable for structure determination and sensitive to Nun, we modified a 65-nucleotide (nt) nucleic acid scaffold (na-scaffold) sequence shown previously to respond to Nun in vitro (Vitiello et al., 2014) by shortening the DNA template to 29-nt (−15 to +14, where +1 denotes the position corresponding to the 3’-end of the pre-translocated RNA transcript) and engineering a stable, 10-nt transcription bubble (−9 to +1) to accommodate a 9-nt post-translocated RNA transcript by introducing non-complementarity in the DNA non-template strand (nt-strand; Figure 1A). The final na-scaffold included a six base pair (bp) upstream DNA duplex (−15 to −10) and a 13 bp downstream DNA duplex (+2 to +14).
Eco TECs assembled from purified Eco core RNAP and the na-scaffold with a 9-nt post-translocated RNA transcript (TEC-9A, since the RNA is 9-nt in length and contains A at the 3’-end; Figure 1A , 9A RNA) were active in transcription elongation (Figure 1B, lane 2). Under the same conditions in the presence of a saturating concentration of Nun (20 µM), essentially 100% of the post-translocated 9A transcript was extended to 10C as expected (Vitiello et al., 2014), since this reaction did not require RNAP translocation on the template (Figure 1B, compare lanes 3 and 4). However, further extension of 10C was efficiently inhibited by Nun (Figure 1B, lane 4). We conclude that Nun alone efficiently binds the Eco TEC-9A and inhibits RNAP translocation, indicating that our reconstituted system captures Nun function.
The Nun:TEC stoichiometry is 1:1
Prior crosslinking analyses left open the possibility of a Nun:TEC stoichiometry of 1:1 or 2:1 (Mustaev et al., 2016). To address this, we performed native mass spectrometry analysis of the TEC with and without addition of Nun. Two major charge-state series corresponding to experimental masses of 383,971 ± 8 Da and 396,705 ± 6 Da were observed in the mass spectrum of Nun/TEC (Figure 1—figure supplement 1). The lower mass (384.0 kDa) matches the expected mass of the fully assembled TEC within the accuracy of the measurement (0.05%). Importantly, the difference between the two measured masses is 12,734 Da, which is close to the expected mass of Nun (12,732 Da), revealing that the 396.7-kDa assembly consists of TEC with one Nun bound.
Structure determination of the Eco TEC ± Nun
We used single-particle cryoEM to determine structures of Eco TECs with and without Nun. Preliminary cryoEM analyses led to the following insights: (1) TEC-9A exhibited disorder in the β-flap domain, possibly due to the absence of upstream RNA filling the RNA exit channel underneath (Lane and Darst, 2010a); (2) Density attributable to Nun was weak and fragmented, possibly due to dissociation during sample preparation, resulting in low occupancy. We used 20A RNA (Figure 1A) to assemble TEC-20A for cryoEM structure determination to fill the RNA exit channel with the upstream single-stranded RNA (ssRNA, −10 to −20; Figure 1A) and to visualize how the RNAP disrupts the RNA:DNA hybrid and directs the upstream ssRNA into the RNA exit channel (Korzheva et al., 1998; Richardson, 1975). TEC-20A, like TEC-9A, is active in elongation and Nun-mediated arrest (Figure 1—figure supplement 2A). We used mild glutaraldehyde crosslinking to stabilize the Nun/TEC complex (Figure 1—figure supplement 2B), resulting in high Nun occupancy.
We present three cryoEM reconstructions, glutaraldehyde crosslinked Nun/TEC-20A (X-Nun/TEC-20A; 3.7 Å overall resolution; Figure 1C, Figure 1—figure supplement 2; Supplementary file 1), glutaraldehyde crosslinked TEC-20A (X-TEC-20A; 4.1 Å overall resolution; Figure 1—figure supplement 3; Supplementary file 1), and TEC-20A without glutaraldehyde crosslinking (4.4 Å overall resolution; Figure 1—figure supplement 4; Supplementary file 1). Superposition of all three structures by RNAP Cα positions resulted in root mean square deviations of <0.7 Å over at least 2750 Cα’s (Supplementary file 2). From the structural correspondence between TEC-20A and X-TEC-20A, we conclude that the crosslinked structures are representative of the uncrosslinked solution structures (i.e. the crosslinking did not trap rare conformations; Figure 1—figure supplement 5A). Moreover, most, if not all of the Nun molecules were crosslink free (note the presence of apparently stoichiometric non-crosslinked Nun evident in the sample; Figure 1—figure supplement 2B), making it unlikely that the crosslinking introduced non-native contacts. From the structural correspondence between TEC-20A, X-TEC-20A, and X-Nun/TEC-20A, we conclude that the TEC accommodates Nun without global conformational changes (Figure 1—figure supplement 5B). Localized conformational changes will be discussed below.
The overall resolution of the X-Nun/TEC-20A cryoEM density map was 3.7 Å (Figure 1—figure supplement 2F) but local resolution calculations indicate much of the central core of the structure, including the active site, the 9 bp RNA/DNA hybrid, and much of the visible part of Nun, was determined to 3.3–3.6 Å resolution (Figure 1—figure supplement 6). The overall resolution of the X-TEC-20A density map was 4.1 Å (Figure 1—figure supplement 3F), but the central core of the structure was determined to 3.6–4.0 Å resolution (Figure 1—figure supplement 7).
Eco TEC structure
In TEC-20A, the downstream duplex DNA enters the active site cleft and begins unwinding at +1 (Figure 2A). The nt-strand G at +1 binds in a G-specific pocket of the β-subunit (Figure 2—figure supplement 1A) as observed in initiation complex structures (Bae et al., 2015; Zhang et al., 2012). A G at this position of the na-scaffold facilitates Nun-mediated translocation arrest (Figure 2—figure supplement 1B). The t-strand passes over the bridge helix (BH) to position the +1 t-strand base in the RNAP active site. The 9-bp RNA/DNA hybrid is in the post-translocated state, with the 3’-end of the RNA at −1 (Figure 2A, Figure 2—figure supplement 2).
At the upstream edge of the RNA/DNA hybrid, conserved structural features of the RNAP limit the extent of the hybrid and provide tracks to confine and direct the ssRNA and t-strand DNA away from each other (Figures 2A and 3). The β’lid directly blocks the upstream extension of the RNA/DNA hybrid (Bernecky et al., 2016; Vassylyev et al., 2007; Westover et al., 2004). Conserved hydrophobic residues V253 and L255 (Figure 3—figure supplement 1) interact with the face of the last upstream RNA/DNA bp at −9, possibly providing a ‘greasy’ surface for the bases to slip past during translocation. V253 appears to help track the upstream ssRNA towards the RNA exit channel (to the left of the β’lid in Figure 3). The RNA track is buttressed on the opposite side by the conserved βswitch 3 (Figures 2A and 3; [Lane and Darst, 2010a]). Conserved R259 of theβ’lid (Figure 3—figure supplement 1) directs the t-strand DNA away from the RNA (to the right of the β’lid in Figure 3), which is buttressed on the opposite side by the β’rudder (Figures 2A and 3). Conserved D256 (Figure 3—figure supplement 1), at the top of the β’lid, forms a salt bridge with conserved R841 descending from the βflap, which covers the top of the RNA exit channel (Figures 2A and 3). This and other interactions between the βflap and elements of the β’ subunit seal off the nucleic acid tracks, preventing the upstream ssRNA and t-strand DNA from slipping over the top of the β’lid and inappropriately displacing the nt-strand DNA to form an extended RNA/DNA hybrid (Naryshkina et al., 2006; Toulokhonov and Landick, 2006).
The -10 t-strand nucleotide base pairs with the nt-strand DNA to begin the upstream DNA duplex with no single-stranded gap in the t-strand DNA (Figure 3). Density for the upstream DNA duplex is weak and noisy, indicative of mobility. The density maps and refinement results indicate that the -10 bp is distorted (Figure 3—figure supplement 2A), consistent with the conclusions of (Turtola and Belogurov, 2016). Upstream of the β’lid, conserved R47 of the β’zipper (Figure 3—figure supplement 1) interacts with the nt-strand -12 backbone phosphate (Figure 3), helping to set the path of the upstream DNA duplex, which exits the complex without any steric clashes, subtending an angle of about 110° with the entering downstream duplex DNA (Figure 3—figure supplement 3).
Nun interacts with RNAP, DNA, and RNA
Although the X-Nun/TEC-20A complex contained full-length Nun (Figure 1—figure supplement 1), only the C-terminal 23 residues (residues 87–109), which contain all known determinants for the translocation arrest activity of Nun (Henthorn and Friedman, 1996; Kim and Gottesman, 2004; Watnick et al., 2000), were visible in the structure (Figure 4A). Nun forms an extended structure that worms its way into the TEC active site cleft, interacting with the β’zipper, then squeezing through the nt-strand DNA track between the β’lid and the β’rudder, then between the β’rudder and the RNA/DNA hybrid. The very C-terminal segment of Nun (residues 102–109) binds in a pre-existing pocket next to the β’rudder between the downstream duplex DNA and the RNA/DNA hybrid (Figure 2B).
The overall structure of the TEC is largely unchanged (Supplementary file 2, Figure 1—figure supplement 5B), but Nun expands the narrow space between the β’lid, the β’rudder, and the beginning of the upstream duplex DNA by pushing the upstream DNA duplex outwards (away from the RNAP) about 3.5 Å, a full bp step (Figure 3—figure supplement 2). Nun residue H93 stacks on the exposed downstream face of the nt-strand DNA -10 base while R94 reaches to the major groove edge of the -10 bp, where it forms a hydrogen-bond (H-bond) with the O4 atom of the t-strand -10T (Figure 4B). This interaction is not likely to be sequence-specific since Nun R94 has the reach and flexibility to form a H-bond with a H-acceptor on the major groove edge of either the nt-strand or t-strand base. The presence of Nun in the vicinity of the upstream DNA duplex accounts for the Nun-dependent upstream toe-print (Vitiello et al., 2014).
Nun then squeezes between the β’rudder and the RNA/DNA hybrid (Figure 4C), displacing some α-carbons of the rudder about 3 Å and altering some rudder/nucleic acid interactions. In particular, in the TEC, β’R322 of the rudder makes polar interactions with the -6 and -7 RNA nucleotides, but in the Nun/TEC complex, β’R322 is displaced and forms an Arg-His pair with Nun H98 (Heyda et al., 2010). Nun H98 in turn forms H-bonds with the 2’-OH of the -7 and -8 RNA nucleotides (Figure 4C). We tested the validity of this observation by replacing the RNA -7A with deoxy-A (so disrupting the Nun H98/-7 RNA 2’-OH interaction), which did not affect elongation in the absence of Nun in the C10 extension assay, but significantly reduced the transcription arrest efficiency of Nun (Figure 4—figure supplement 1). The new position of β’R322 is stabilized by a salt bridge with β’D264 at the base of the lid. This configuration is important since substitutions β’R322H or β’D264G lead to Nun resistance in vivo (Robledo et al., 1991). Along with H98, nearby Nun residues N97 and H100 complete a network of polar interactions with the RNA, the t-strand DNA, and the β’rudder (Figure 4C).
The negatively-charged C-terminal carboxylate of Nun (S109) forms a salt bridge with Nun R102, resulting in an 8-residue non-covalent circle (Figure 4A). The substitution Nun R102A significantly reduced termination efficiency (Kim and Gottesman, 2004). Nun residues from N101 to the C-terminus form a network of polar interactions with the β-subunit, the RNA (-6), the downstream duplex DNA (+4t, +5t), and the nt-strand ssDNA (-1nt) (Figure 4D). The latter interaction (Figure 2—figure supplement 1A) explains why a two nucleotide nt-strand non-complementary 5’-overhang at the downstream fork of the transcription bubble is necessary for optimal Nun function (Figure 2—figure supplement 1C). In addition to RNAP substitutions β’R322H and D264G, the substitution β’D329G leads to Nun resistance (Robledo et al., 1991).β’D329 forms part of a pocket in the RNAP structure that accommodates Nun W108 (Figure 4D). The structure is consistent with previous findings that the C-terminus of Nun is within 15–20 Å of the downstream duplex DNA (Watnick and Gottesman, 1999). The location of the Nun C-terminus within the vicinity of the downstream duplex DNA, combined with the finding that normal Nun activity required an aromatic residue at Nun position 108 led to the proposal that Nun-mediated translocation arrest was facilitated by the intercalation of Nun W108 into the downstream DNA (Watnick and Gottesman, 1999) but this is not observed in the structure.
A segment of Nun encompassing residues 44 to the C-terminus (residue 109) was found to crosslink to both the 5’- and 3’-ends of RNA transcripts (Mustaev et al., 2016). Nun 44-109 crosslinks to the RNA 5’-end when the crosslinking moiety is placed at the -9 to -14 positions. These positions would not be able to crosslink to the Nun C-terminus (residues 87-109) visible in our structure, but Nun residues 44–86 exit the TEC near the RNA exit channel (Figure 1C) and could easily reach the upstream RNA, assuming some breathing/opening of the RNA exit channel.
Nun crosslinks to the RNA 3-‘end in a 2-nucleotide backtracked complex (Mustaev et al., 2016). Structures of both bacterial and eukaryotic TECs backtracked by 1-nucleotide place the backtracked nucleotide (+2 position) in the so-called ‘P’-site (Sekine et al., 2015; Wang et al., 2009). A eukaryotic 2-nucleotide backtracked complex shows the first backtracked nucleotide (+2) again in the P-site, while the second (3’) backtracked nucleotide (+3) is disordered, consistent with molecular dynamics simulations showing the +3 nucleotide sampling a large range of conformations (Wang et al., 2009). Thus, although the closest atom of Nun to the RNAP active site Mg2+ is 27 Å away, the flexible nature of the 2-nucleotide backtracked complex would allow the crosslinking moiety on the base of the RNA 3’-nucleotide to reach Nun.
Nun mutants retain significant translocation arrest activity
Based on our structural analyses, we generated single (and one double) amino acid substitutions in Nun and tested their activity in vitro. We substituted residues that appeared to make important interactions, including H93A and R94A (Figure 4B), N97D, H98A, and H100A (Figure 4C), and R102A, R102K, K106A, K107A, K106A/K107A, and W108A (Figure 4D). Nun N103 does not appear to make important interactions in the structure (Figure 4A), so the N103A substitution served as a control. The Nun substitutions that we tested here included all of the Nun mutants that have been identified as abrogating Nun activity in vivo (Kim and Gottesman, 2004; Watnick and Gottesman, 1999; Watnick et al., 2000). We determined the ‘fractional Nun activity’ for each mutant (at 20 µM Nun) with the in vitro C10 extension assay (Figure 1B) and normalized the results; 0 Nun activity corresponds to no Nun, 1.0 corresponds to the activity of wt-Nun at a saturating concentration (20 µM; Figure 5A). The Nun mutants displayed a wide range of activities (Figure 5—figure supplement 1), from 21 ± 7(SEM)% for H98A to 96 ± 4% for N103A (essentially wild-type, consistent with our structural analysis).
For the functional mechanism of a transcription factor such as Nun, we can consider two extremes: (1) Binding of the factor to RNAP and the activity of the factor in modulating RNAP function may be inseparable, or (2) binding and activity may be uncoupled. Many examples of the latter case exist; for example, single amino acid substitutions of GreA or GreB severely abrogate their activity in stimulating endonucleolytic transcript cleavage but do not have a noticeable effect on RNAP binding (Laptenko et al., 2003; Opalka et al., 2003; Rutherford et al., 2007; Sosunova et al., 2003). To assess the differential effects of the Nun amino acid substitutions on RNAP binding vs. activity in stalling translocation, we used the C10 extension assay to measure the effect of the Nun mutants as a function of their concentration. The resulting isotherms (Figure 5A) yielded the effective dissociation constant (Kd) and the fractional Nun activity at saturation (normalized with respect to wt-Nun; Figure 5B). The effective Kd reveals the effect of the particular Nun substitution on RNAP binding, while the maximum activity reflects the effect of the substitution on Nun activity.
The binding and activity of the Nun mutants are not strictly correlated (Figure 5B), indicating that some substitutions have a strong effect on binding but not on activity (R94A), while others affect binding to a lesser degree but have a stronger effect on activity (H98A). Nevertheless, each of the tested mutants show significant translocation arrest activity (59% to 80% of wt-Nun) when assayed at saturating concentration despite widely varying effects on RNAP binding (Figure 5B).
Discussion
Nun takes advantage of TEC structural requirements dictated by translocation
Efficient transcription elongation requires that the na-scaffold slide freely with respect to the RNAP at each step of translocation in a nucleic acid sequence-independent manner. As a consequence, the RNAP does not form tight, cognate interactions with most of the na-scaffold in the TEC (Figure 6). Nun establishes its interactions with the TEC by taking advantage of the loose RNAP/na-scaffold interactions, insinuating itself into the TEC in pre-existing gaps between the RNAP and the upstream duplex DNA (Figure 6A), the t-strand DNA connecting the upstream duplex DNA with the RNA/DNA hybrid (Figure 6B), and in a pocket next to the rudder between the RNA/DNA hybrid and the downstream duplex DNA (Figure 6C). The Nun/TEC interactions induce some small, localized conformational changes (such as in the upstream duplex, Figure 3—figure supplement 2, or in the rudder, Figures 2 and 6C), but do not cause large-scale changes in the TEC (Supplementary file 2; Figure 1—figure supplement 5B).
Mechanism of Nun translocation arrest
Our results reveal that the functionally critical C-terminal segment of Nun binds to the TEC in an extended conformation, interacting with conserved, functionally important elements of the RNAP (β’zipper, β’lid, β’rudder) as well as with the β subunit (Figure 4). At the same time, Nun interacts with nearly every element of the na-scaffold: the upstream duplex DNA, the RNA/DNA hybrid, the single-stranded nt-strand DNA within the transcription bubble, and the downstream duplex DNA (Figures 4 and 7). With this network of Nun/RNAP and Nun/na-scaffold interactions, Nun essentially mediates extensive crosslinking between the RNAP and the na-scaffold, preventing sliding of the na-scaffold with respect to the RNAP (i.e. preventing translocation). Note that this mechanism explains how Nun arrests both forward and reverse translocation (Vitiello et al., 2014). The finding that Nun establishes multiple interactions with both the RNAP and na-scaffold explains how single amino acid substitutions in Nun still retain significant translocation arrest function (Figure 5) since the disruption of one or a few interactions still leaves many other interactions that the TEC must overcome in order to translocate (Figure 7). Nun does not contain a functional center of critical amino acids for its translocation arrest activity; all of the Nun/TEC interactions contribute to its function.
In phage HK022 lysogens, the concentration of Nun ([Nun]) in the Eco cell is likely to be only ~100 nM (King et al., 2000). The Kd of wt-Nun for a non-specific TEC (i.e. not containing λ nut boxB RNA) is ~300 nM (Figure 5), while the Kd for a TEC containing a boxB RNA hairpin would be expected to be lower. Nun function is stimulated in vivo and in vitro by NusA, NusB, NusE, and NusG, but only in the presence of λ nut boxA RNA (Hung and Gottesman, 1995). The mechanism of Nus factor stimulation of Nun activity is unknown, but the Nus factors likely stabilize Nun binding to λ nut TECs. These observations indicate that [Nun] in phage HK022 lysogens is precisely tuned to allow Nun to target λ nut TECs but not cause general translocation arrest throughout the Eco genome (Figure 7B).
Nun mutants H98A, K106A/K107A, and W108A have been shown previously to be significantly (K106A/K107A) or totally (H98A, W108A) defective in Nun function both in vivo and in vitro (Kim and Gottesman, 2004; Watnick and Gottesman, 1999; Watnick et al., 2000). These results appear to contradict our results that, at saturating [Nun], these mutants exhibit significant translocation arrest activity (H98A, 59%; W108A, 66%; K106A/K107A, >70%; Figure 5, Figure 5—figure supplement 1). However, in previous studies, In vivo function was assessed using a λ pL-nutL-lacZ chromosomal transcriptional fusion (Wilson et al., 1997) with very low expression levels of the Nun mutants. In vitro transcription reactions included between 50–200 nM Nun mutants. In our study, significant activity of the Nun mutants was observed at saturating [Nun] >250 µM (Figure 5). Thus, the previous studies assessed mutant Nun activity at [Nun] too low to overcome the binding defects introduced by the mutations.
Nun and transcriptional pausing
In the final state of the Nun/TEC complex determined here (Figure 1C), Nun forms extensive interactions with both the RNAP clamp domain, the na-scaffold, as well as with elements of the β subunit apposing the clamp (Figure 7), tightening and immobilizing normally mobile elements of the TEC, suggesting why the X-Nun/TEC structure was determined to higher resolution than either the X-TEC or the TEC structures (Figure 1—figure supplements 2–4). Nun fits tightly into the TEC without causing major conformational changes in the TEC by taking advantage of pre-existing gaps between the RNAP and the na-scaffold (Figure 6), but examination of the structure suggests that it would not be possible for Nun to establish itself in this binding mode with the clamp domain tightly closed; clamp opening appears to be required for Nun binding.
There is no clear consensus sequence for Nun-mediated translocation arrest, consistent with the Nun/TEC structure. Nevertheless, Nun does not arrest translocation at random sites on the DNA; Nun preferentially stalls translocation at pre-existing intrinsic transcription pause sites (Hung and Gottesman, 1995). Structural (Weixlbaumer et al., 2013) and functional (Hein et al., 2014) analysis of paused transcription complexes suggests that pausing is linked with a propensity for clamp opening. This provides an explanation for the action of Nun at intrinsic pause sites, since clamp opening is likely required for Nun entry. In addition to clamp opening, TEC pausing provides Nun with a sufficiently long-lived stationary target to establish its extensive network of interactions, which is likely a slow process (Hung and Gottesman, 1995).
Materials and methods
Protein expression and purification
Request a detailed protocolEco RNAP lacking the αCTDs was prepared as described previously (Twist et al., 2011). Glycerol was added to the purified RNAP to 15% (v/v), and the sample was aliquoted and flash-frozen in liquid nitrogen. The aliquots were stored at -80°C until use.
Full-length HK022 Nun was cloned into the pET21d vector (Novagen/EMD Millipore, Billerica, MA) without any tag, and transformed into the Eco BL21-AI (arabinose inducible) strain. HK022 Nun was expressed by autoinduction system due to its high toxicity to cells (Studier, 2005). The transformed cells were inoculated into non-inducing media (MDAG-135 containing 100 µg/ml ampicillin) and grown overnight. The overnight culture was added to inducing media (ZYM-5052 + 0.05% arabinose+100 µg/ml ampicillin) at a ratio of 1:500 and grown for 16 hr at 37°C. The harvested cells were resuspended in 50 mM Na-phosphate, pH 7.9, 200 mM NaCl, 1 mM DTT, Complete Protease Inhibitor (Roche, Branford, CT) buffer, and lysed in a continuous flow French Press (Avestin, Ottawa, Ontario, Canada). The lysate was centrifuged and the supernatant was loaded onto a HiTrapSP column (GE Healthcare Life Sciences, Pittsburgh, PA). The protein was washed with 50 mM Na-phosphate, pH 7.9, 200 mM NaCl, 1 mM DTT, and eluted with a NaCl gradient to 1 M. The eluted protein was diluted with 50 mM Na-phosphate, pH 7.9, to lower the NaCl concentration to 0.1 M and then loaded onto a MonoS column (GE Healthcare Life Sciences). The protein was eluted with a salt gradient, and the eluted protein was purified by gel filtration chromatography on a HiLoad Superdex75 column (GE Healthcare Life Sciences) with TGE buffer [10 mM Tris-HCl, pH 8, 0.1 mM EDTA, 5% (v/v) glycerol] containing 500 mM NaCl and 1 mM DTT. Nun single or double mutants were generated by overlap or inverse PCR and purified using the same procedure as wild-type Nun. Each Nun mutant was confirmed by mass spectrometry. In the case of Nun K106A/K107A, mass spectrometry showed that the initial sample was >80% T-Nun (Nun with 13 C-terminal amino acids proteolytically removed; (Watnick and Gottesman, 1998). T-Nun elutes earlier on the MonoS column, so fractions from the tail-end of the Nun-K106A/K107A MonoS elution peak were analyzed by mass spectrometry and found to be ~50% T-Nun. T-Nun is totally inactive and does not bind RNAP (Watnick and Gottesman, 1998), so does not compete with Nun binding, so for the Nun-K106A/K107A assays (Figure 5—figure supplement 1) we adjusted the concentration to compensate. The finding that Nun-K106A/K107A is highly susceptible to in vivo protease cleavage to yield inactive T-Nun may explain the previous result that Nun-K106A/K107A was inactive in vivo (Kim and Gottesman, 2004).
Nucleic acid preparation
Request a detailed protocolSynthetic DNA oligonucleotides were obtained from Integrated DNA Technologies (Coralville, IA), RNA oligonucleotides from GE Healthcare Dharmacon (Lafayette, CO). The nucleic acids were dissolved in RNase-free water (Ambion/ThermoFisher Scientific, Waltham, MA) at 1 mM concentration. To assemble the TEC, template DNA and RNA were mixed at 1:1 ratio, annealed by incubating at 95°C for 2 min, 75°C for 2 min, 45°C for 5 min, and then decreasing the temperature by 2°C for 2 min until reaching 25°C. The annealed template DNA:RNA hybrid was stored at - 20°C before use.
TEC preparation for CryoEM
Request a detailed protocolPurified Eco RNAP was buffer-exchanged over the Superose 6 INCREASE (GE Healthcare Life Sciences) column into 20 mM Tris-HCl, pH 8.0, 150 mM KCl, 5 mM MgCl2, 5 mM DTT. The eluted protein was mixed with template DNA:RNA hybrid at a molar raio of 1:1.3 and incubated for 15 min at room temperature. Non-template DNA was added and incubated for 10 min. The complex was concentrated by centrifugal filtration (VivaProducts, Littleton, MA) to 3 mg/ml RNAP concentration before grid preparation.
Glutaraldehyde Cross-linked TEC preparation for CryoEM
View detailed protocolPurified Eco RNAP was buffer-exchanged over the Superose 6 INCREASE (GE Healthcare Life Sciences) column into 20 mM HEPES, pH 7.5, 150 mM KCl, 5 mM MgCl2, 5 mM DTT. The eluted protein was mixed with template DNA:RNA hybrid at a molar raio of 1:1.3 and incubated for 15 min at room temperature. Non-template DNA was added and incubated for 10 min. The concentration of TEC was adjusted to ~0.5 µM then glutaraldehyde was added to a final concentration of 0.005% (w/v). The mixture was incubated for 7 min at room temperature and the crosslinking was quenched by adding Tris-HCl, pH 8.0, to 0.1 M. The mixture was centrifuged for 10 min at 4°C, concentrated by cetrifugal filtration, and purified by gel filtration on the Superose6 INCREASE column (GE Healthcare Life Sciences) equilibrated in 20 mM Tris-HCl, pH 8.0, 150 mM KCl, 5 mM MgCl2, 5 mM DTT. The eluted complex was concentrated to 3 mg/ml by centrifugal filtration (VivaProducts) before grid preparation.
Glutaraldehyde Cross-linked Nun/TEC Preparation for CryoEM
View detailed protocolPurified Eco RNAP and HK022 Nun were buffer-exchanged over the Superose 6 INCREASE column (GE Healthcare Life Sciences) equilibrated in 20 mM HEPES, pH 7.5, 150 mM KCl, 5 mM MgCl2, 5 mM DTT. The TEC was assembled as before, and HK022 Nun was added at a concentration of >15 µM. Final steps of crosslinking and purification were the same as those for the TEC sample preparation.
CryoEM grid preparation
Request a detailed protocolBefore freezing, 8 mM CHAPSO was added to the samples. C-flat (Protochips, Morrisville, NC) CF-1.2/1.3 400 mesh copper grids or 300 mesh gold grids were glow-charged for 15 s prior to the application of 3.5 µl of the complex sample (2.0–3.0 mg/ml protein concentration), and plunge-freezing in liquid ethane using a Vitrobot mark IV (FEI, Hillsboro, OR) with 100% chamber humidity at 22°C.
CryoEM data acquisition and processing
Request a detailed protocolThe grids were imaged using a 300 keV Titan Krios (FEI) equipped with a K2 Summit direct electron detector (Gatan, Pleasanton, CA). Images were recorded with Serial EM (Mastronarde, 2005) in super-resolution counting mode with a super resolution pixel size of 0.65 Å and a defocus range of 0.8 to 2.6 µm. Data were collected with a dose of 8 to 10 electrons/physical pixel/s (1.3 Å pixel size at the specimen). Images were recorded with a 15 s exposure and 0.3 s subframes (50 total frames) to give a total dose of 71 to 89 electrons/Å2.
Dose fractionated subframes were 2 × 2 binned (giving a pixel size of 1.3 Å), aligned and summed using Unblur (Grant and Grigorieff, 2015). The contrast transfer function was estimated for each summed image using CTFFIND4 (Rohou and Grigorieff, 2015). From the summed images, approximately 10,000 particles were manually picked and subjected to 2D classification in RELION (Scheres, 2012). Projection averages of the most populated classes were used as templates for automated picking in RELION. Autopicked particles were manually inspected, then subjected to 2D classification in RELION specifying 100 classes. Poorly populated classes were removed, resulting in datasets of 470,000 particles (X-Nun/TEC), 281,000 particles (X-TEC), and 305,000 particles (TEC). These particles were individually aligned using direct-detector-align_lmbfgs software (Rubinstein and Brubaker, 2015). Particles were 3D autorefined in RELION using a model of Eco core RNAP (PDB ID 4LJZ with△1.1σ70 removed; Bae et al., 2013) low-pass filtered to 60 Å resolution using EMAN2 (Tang et al., 2007) as an initial 3D template. With this initial model, 3D classifications were performed without alignment with a soft mask excluding flexible regions of the complex for more robust classification. The soft mask excluded the αII subunit, βi4, βi9, the βflap, β’i6, the β’jaw, and single-stranded nucleic acids. Among the 3D classes, the best-resolved class was 3D autorefined without the mask, and post-processed in RELION. Local resolution caculations were performed using blocres (Cardone et al., 2013).
Model building and refinement
Request a detailed protocolTo build initial models, Eco core enzyme (PDB ID 4LJZ with △1.1σ70 removed; Bae et al., 2013), DNA:RNA hybrid and downstream duplex DNA (PDB ID 2O5J; Vassylyev et al., 2007), and nt-strand single-stranded DNA (PDB ID 4XLN; Bae et al., 2015) were fit into the electron density maps using Chimera (Pettersen et al., 2004). These initial models were real-space refined using Phenix (Adams et al., 2010). In the real-space refinement, domains in the core and nucleic acids were rigid-body refined, then subsequently refined with secondary structure restraints. Single-stranded RNA and Nun were built de novo in Coot, and real-space refined with the previously refined model.
Transcription assays
Request a detailed protocolTranscription assays were done as described previously (Vitiello et al., 2014). Briefly, synthetic RNA was 5’-labeled with 32P-γ-ATP using T4 polynucleotide kinase (NEB) and annealed with t-strand DNA as before. His-tagged Eco RNAP core was added to the radio-labeled DNA:RNA hybrid, incubated for 15 min at room temperature, and nt-strand DNA was added to the mixture. After 10 min incubation, the assembled TEC was added to Ni-NTA beads in transcription buffer (20 mM Tris-HCl, pH 8.0, 40 mM KCl, 5 mM MgCl2, 2 mM β-mercaptoethanol) and incubate for 20 min with shaking. Then, the immobilized TECs were washed with transcription buffer, transcription buffer containing 1 M KCl, then transcription buffer again. The TECs were then eluted with transcription buffer containing 200 mM imidazole (pH 8.0) with 0.1 mg/ml acetylated BSA. The eluted complexes were mixed with Nun (20 µM final concentration unless otherwise noted) or transcription buffer and used for transcription assays. The reactions were initiated by adding 1 mM or 0.1 mM rNTPs to the eluted TECs, and stopped by adding 2X stop buffer after 1 min or 5 min. Transcription products were resolved by 20% denaturing poly-acrylamide gel electrophoresis. The gels were exposed to phosphor screens and scanned using a Typhoon 9400 (GE Healthcare Life Sciences).
Native mass spectrometry
Request a detailed protocolThe samples were buffer-exchanged into native MS buffer (500 mM ammonium acetate, 0.01% Tween-20) using Zeba microspin desalting columns (ThermoFisher Scientific) with a 40 kDa molecular weight cut-off. An aliquot (2–3 µL) of the buffer-exchanged sample was loaded into an in-house fabricated gold-coated quartz capillary and introduced into an Exactive Plus EMR instrument (ThermoFisher Scientific) using a static nanospray source. The EMR instrument was calibrated using cesium iodide. The MS parameters used included spray voltage, 1.0–1.3 kV; capillary temperature, 100°C; in-source dissociation, 10 V; S-lens RF level, 200; resolving power, 17,500 at m/z of 200 corresponding to 64 ms analyzer transient duration; AGC target, 5 × 105; number of microscans, 5; maximum injection time, 200 ms; high energy collision (HCD), 150 V for minimal gas-phase dissociation; injection flatapole, 8 V; interflatapole, 4 V; bent flatapole, 4 V; ultrahigh vacuum pressure, 8–9 × 10−10 mbar; total number of scans, 100. RAW files were processed and deconvolution was performed manually using Thermo Xcalibur Qual Browser (version 3.0.63). Experimental masses were reported as average mass ± standard deviation across all the calculated mass values obtained within the charge-state distribution. The accuracies of the measured masses were calculated as [(experimental mass – theoretical mass)/theoretical mass] and ranged from 0.005% to 0.06%.
Accession numbers
Request a detailed protocolThe cryoEM density maps have been deposited in the EM Data Bank with accession codes EMD-8584 (X-Nun/TEC), EMD-8585 (X-TEC), and EMD-8586 (TEC). Atomic coordinates have been deposited in the Protein Data Bank with accession codes 5UP6 (X-Nun/TEC), 5UPA (X-TEC), and 5UPC (TEC).
Data availability
-
CryoEM structure of HK022 Nun - E. coli RNA polymerase elongation complexPublicly available at the RCSB Protein Data Bank (accession no: 5UP6).
-
CryoEM structure of HK022 Nun - E. coli RNA polymerase elongation complexPublicly available at the EMBL-EBI Protein Data Bank in Europe (accession no: EMD-8584).
-
CryoEM structure of crosslinked E.coli RNA polymerase elongation complexPublicly available at the RCSB Protein Data Bank (accession no: 5UPA).
-
CryoEM structure of crosslinked E.coli RNA polymerase elongation complexPublicly available at the EMBL-EBI Protein Data Bank in Europe (accession no: EMD-8585).
-
CryoEM structure of E.coli RNA polymerase elongation complexPublicly available at the RCSB Protein Data Bank (accession no: 5UPC).
-
CryoEM structure of E.coli RNA polymerase elongation complexPublicly available at the EMBL-EBI Protein Data Bank in Europe (accession no: EMD-8586).
-
Crystal Structure Analysis of the E.coli holoenzymePublicly available at the RCSB Protein Data Bank (accession no: 4LJZ).
-
Crystal structure of the T. thermophilus RNAP polymerase elongation complex with the NTP substrate analogPublicly available at the RCSB Protein Data Bank (accession no: 2O5J).
References
-
PHENIX: a comprehensive Python-based system for macromolecular structure solutionActa Crystallographica Section D Biological Crystallography 66:213–221.https://doi.org/10.1107/S0907444909052925
-
One number does not fit all: mapping local variations in resolution in cryo-EM reconstructionsJournal of Structural Biology 184:226–236.https://doi.org/10.1016/j.jsb.2013.08.002
-
MolProbity: all-atom structure validation for macromolecular crystallographyActa Crystallographica Section D Biological Crystallography 66:12–21.https://doi.org/10.1107/S0907444909042073
-
The structure of the coliphage HK022 nun protein-lambda-phage boxB RNA complex. implications for the mechanism of transcription terminationJournal of Biological Chemistry 276:32064–32070.https://doi.org/10.1074/jbc.M102975200
-
RNA polymerase pausing and nascent-RNA structure formation are linked through clamp-domain movementNature Structural & Molecular Biology 21:794–802.https://doi.org/10.1038/nsmb.2867
-
Attractive interactions between side chains of histidine-histidine and histidine-arginine-based cationic dipeptides in waterThe Journal of Physical Chemistry B 114:8744–8749.https://doi.org/10.1021/jp101031v
-
Transcription termination by phage HK022 nun is facilitated by COOH-terminal lysine residuesJournal of Biological Chemistry 279:13412–13417.https://doi.org/10.1074/jbc.M313206200
-
Mechanistic model of the elongation complex of Escherichia coli RNA polymeraseCold Spring Harbor Symposia on Quantitative Biology 63:337–346.https://doi.org/10.1101/sqb.1998.63.337
-
Molecular evolution of multisubunit RNA polymerases: structural analysisJournal of Molecular Biology 395:686–704.https://doi.org/10.1016/j.jmb.2009.10.063
-
Molecular evolution of multisubunit RNA polymerases: sequence analysisJournal of Molecular Biology 395:671–685.https://doi.org/10.1016/j.jmb.2009.10.062
-
Automated electron microscope tomography using robust prediction of specimen movementsJournal of Structural Biology 152:36–51.https://doi.org/10.1016/j.jsb.2005.07.007
-
The role of the largest RNA polymerase subunit lid element in preventing the formation of extended RNA-DNA hybridJournal of Molecular Biology 361:634–643.https://doi.org/10.1016/j.jmb.2006.05.034
-
UCSF chimera--a visualization system for exploratory research and analysisJournal of Computational Chemistry 25:1605–1612.https://doi.org/10.1002/jcc.20084
-
Regulation of transcription: from lambda to eukaryotesTrends in Biochemical Sciences 30:275–279.https://doi.org/10.1016/j.tibs.2005.04.003
-
Attachment of nascent RNA molecules to superhelical DNAJournal of Molecular Biology 98:565–579.https://doi.org/10.1016/S0022-2836(75)80087-8
-
Escherichia coli mutations that block transcription termination by phage HK022 nun proteinJournal of Molecular Biology 220:613–619.https://doi.org/10.1016/0022-2836(91)90104-E
-
CTFFIND4: fast and accurate defocus estimation from electron micrographsJournal of Structural Biology 192:216–221.https://doi.org/10.1016/j.jsb.2015.08.008
-
Alignment of cryo-EM movies of individual particles by optimization of image translationsJournal of Structural Biology 192:188–195.https://doi.org/10.1016/j.jsb.2015.08.007
-
RELION: implementation of a bayesian approach to cryo-EM structure determinationJournal of Structural Biology 180:519–530.https://doi.org/10.1016/j.jsb.2012.09.006
-
Antitermination in bacteriophage lambda. the structure of the N36 peptide-boxB RNA complexEuropean Journal of Biochemistry 267:2397–2408.https://doi.org/10.1046/j.1432-1327.2000.01251.x
-
Protein production by auto-induction in high density shaking culturesProtein Expression and Purification 41:207–234.https://doi.org/10.1016/j.pep.2005.01.016
-
EMAN2: an extensible image processing suite for electron microscopyJournal of Structural Biology 157:38–46.https://doi.org/10.1016/j.jsb.2006.05.009
-
The role of the lid element in transcription by E. coli RNA polymeraseJournal of Molecular Biology 361:644–658.https://doi.org/10.1016/j.jmb.2006.06.071
-
Overexpression of phage HK022 nun protein is toxic for Escherichia coliJournal of Molecular Biology 380:812–819.https://doi.org/10.1016/j.jmb.2008.05.030
-
Role of E.coli transcription-repair coupling factor mfd in Nun-mediated transcription terminationJournal of Molecular Biology 329:655–662.https://doi.org/10.1016/S0022-2836(03)00465-0
-
Translational repression by a transcriptional elongation factorGenes & Development 11:2204–2213.https://doi.org/10.1101/gad.11.17.2204
-
Structural basis of transcription initiationScience 338:1076–1080.https://doi.org/10.1126/science.1227786
Article and author information
Author details
Funding
National Institutes of Health (R35 GM118130)
- Seth A Darst
National Institutes of Health (R01 GM037219)
- Max E Gottesman
National Institutes of Health (P41 GM103314)
- Brian T Chait
Public Health Research Institute Research Support grant
- Arkady Mustaev
National Institutes of Health (P41 GM109824)
- Brian T Chait
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank K Uryu and D Acehan at The Rockefeller University Electron Microscopy Resource Center for help with EM sample preparation, and M Ebrahim and J Sotiris at The Rockefeller University Cryo-EM Resource Center for help with data collection. This work was supported by a Public Health Research Institute Research Support grant (AM) and NIH grants P41 GM103314 and P41 GM109824 to BTC, R01 GM037219 to MG, and R35 GM118130 to SAD.
Copyright
© 2017, Kang et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 3,747
- views
-
- 786
- downloads
-
- 117
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Biochemistry and Chemical Biology
- Structural Biology and Molecular Biophysics
Dynamic conformational and structural changes in proteins and protein complexes play a central and ubiquitous role in the regulation of protein function, yet it is very challenging to study these changes, especially for large protein complexes, under physiological conditions. Here, we introduce a novel isobaric crosslinker, Qlinker, for studying conformational and structural changes in proteins and protein complexes using quantitative crosslinking mass spectrometry. Qlinkers are small and simple, amine-reactive molecules with an optimal extended distance of ~10 Å, which use MS2 reporter ions for relative quantification of Qlinker-modified peptides derived from different samples. We synthesized the 2-plex Q2linker and showed that the Q2linker can provide quantitative crosslinking data that pinpoints key conformational and structural changes in biosensors, binary and ternary complexes composed of the general transcription factors TBP, TFIIA, and TFIIB, and RNA polymerase II complexes.
-
- Biochemistry and Chemical Biology
- Stem Cells and Regenerative Medicine
Human induced pluripotent stem cells (hiPSCs) have great potential to be used as alternatives to embryonic stem cells (hESCs) in regenerative medicine and disease modelling. In this study, we characterise the proteomes of multiple hiPSC and hESC lines derived from independent donors and find that while they express a near-identical set of proteins, they show consistent quantitative differences in the abundance of a subset of proteins. hiPSCs have increased total protein content, while maintaining a comparable cell cycle profile to hESCs, with increased abundance of cytoplasmic and mitochondrial proteins required to sustain high growth rates, including nutrient transporters and metabolic proteins. Prominent changes detected in proteins involved in mitochondrial metabolism correlated with enhanced mitochondrial potential, shown using high-resolution respirometry. hiPSCs also produced higher levels of secreted proteins, including growth factors and proteins involved in the inhibition of the immune system. The data indicate that reprogramming of fibroblasts to hiPSCs produces important differences in cytoplasmic and mitochondrial proteins compared to hESCs, with consequences affecting growth and metabolism. This study improves our understanding of the molecular differences between hiPSCs and hESCs, with implications for potential risks and benefits for their use in future disease modelling and therapeutic applications.