DNA replication requires that the duplex genomic DNA strands be separated, a function that is implemented by ring-shaped hexameric helicases in all Domains. Helicases are composed of two domains, an N-terminal DNA binding domain (NTD) and a C-terminal motor domain (CTD). Replication is controlled by loading of helicases at origins of replication, activation to preferentially encircle one strand, and then translocation to begin separation of the two strands. Using a combination of site-specific DNA footprinting, single-turnover unwinding assays, and unique fluorescence translocation monitoring, we have been able to quantify the binding distribution and the translocation orientation of Saccharolobus (formerly Sulfolobus) solfataricus MCM on DNA. Our results show that both the DNA substrate and the C-terminal winged-helix (WH) domain influence the orientation but that translocation on DNA proceeds N-first.
https://doi.org/10.7554/eLife.46096.001
The hexameric MCM complex is conserved throughout archaeal and eukaryotic species as the DNA helicase that unwinds the duplex genome, providing leading and lagging strand templates for replication. The MCM proteins themselves are bilobal with an N-terminal domain (NTD) that acts to stabilize binding to single-strand DNA (ssDNA), a C-terminal domain (CTD) that contains the conserved AAA+ (ATPases associated with diverse cellular activities) motor domain that provides energy for translocation and DNA unwinding, and a winged-helix (WH) domain for DNA binding (Trakselis, 2016). DNA unwinding proceeds by encircling and translocating along the leading strand in the 3’−5’ direction, while sterically excluding the lagging strand template (Kelman et al., 1999; Chong et al., 2000; Bochman and Schwacha, 2009).
In eukaryotes, six homologous proteins comprise the MCM2-7 heterohexameric complex (Yuan et al., 2016). MCM2-7 interacts with Cdc45 and the GINS heterotetramer (Psf1, 2, 3, Sld5) to form the active unwinding CMG complex (Moyer et al., 2006). GINS binds primarily to the AAA+ CTD of MCM5, bringing in Cdc45 to interact with and close the interface with MCM2, aligning the motor domains into a proper configuration for activity (Costa et al., 2011). Archaea have a single MCM protein that is equally homologous to the six eukaryotic MCM2-7 proteins (Makarova et al., 2012; Goswami et al., 2015), and in contrast to eukaryotes, the archaeal MCM helicase is active on its own in vitro and does not require accessory proteins for robust DNA unwinding (Chong et al., 2000; Marinsek et al., 2006). Archaeal MCM forms a homohexameric complex but can also interact with orthologs of Cdc45 (RecJ) and the GINS (GINS23, GINS15) complex to stimulate the helicase activity further (Yoshimochi et al., 2008; Lang and Huang, 2015; Xu et al., 2016), although Cdc45 (i.e. GAN) is not required for viability in the euryarchaeon Thermococcus kodakarensis, possibly implicating this protein as a redundant nuclease for Okazaki fragment maturation (Burkhart et al., 2017).
Loading of the MCM complexes onto and encircling of double-stranded DNA (dsDNA) origins have been a subject of intense experimentation (Remus et al., 2009; Ticau et al., 2015; Frigola et al., 2017; Ticau et al., 2017), but the consensus origin loaded double hexamer state places the NTDs together with the CTDs facing outwards (Sun et al., 2013). This head-to-head orientation achieved during initiation is also conserved with the analogous Large-T antigen of SV40 virus (Valle et al., 2000; Gomez-Lorenzo et al., 2003). There are still remaining questions as to how the MCM or CMG complex goes from encircling dsDNA to selecting one of the ssDNA strands for translocation. Recent data in the eukaryotic system shows this will involve cyclin dependent kinases (CDK) firing factors, additional components including MCM10, and ATP hydrolysis by MCM subunits to untwist the dsDNA to give two independent CMGs that have encircled opposing strands (Douglas et al., 2018). Because the CMG complex translocates 3’ to 5’, the selection of one strand over the other will dictate whether the two hexamers dissociate from each other or pass over each other for elongation. These two mechanisms will be distinguished by whether the CTD or the NTD, respectively, are leading the way for unwinding. In yeast, the N-first mechanism of CMG translocation has been confirmed, which involves a physical passing of each helicase to regulate origin firing before establishing the replisome progression complex (RPC) (Georgescu et al., 2017; Douglas et al., 2018), but this has not been confirmed in other species that contain MCM.
The binding orientation of the single archaeal MCM hexamer bound on fork DNA has been shown previously to place the CTD near the fork junction, suggesting a C-first mechanism of unwinding (McGeoch et al., 2005; Rothenberg et al., 2007; Costa et al., 2014). An X-ray structure of an NTD construct of an archaeal MCM shows ssDNA binding orthogonal to the central channel consistent with either N-first or C-first translocation (Froelich et al., 2014). C-first translocation for MCM was analogous to the orientation of the homohexameric Escherichia coli (E. coli) DnaB, which although it has opposite unwinding polarity (5’−3’), also places its CTD RecA motor domain near the duplex region (Jezewska et al., 1998). This is now directly challenged by the cryoEM data from higher order eukaryotic systems (Georgescu et al., 2017). The strong homology between the archaeal and eukaryotic DNA replication systems would not suggest significant differences in translocation and unwinding mechanisms of the MCM complexes (Barry and Bell, 2006).
This report characterizes both the distribution of archaeal MCM binding to the ssDNA regions of fork DNA and the translocation orientation of the MCM complex during active unwinding to compare the mechanistic properties between Domains. Many studies have focused on examining the static structural features of helicase binding to DNA or the mechanistic aspects of DNA translocation and unwinding polarity, but few have simultaneously examined both. Using multiple site-specific DNA footprinting techniques, the orientation population distribution of the DNA fork bound Saccharolobus (formerly Sulfolobus [Sakai and Kurosawa, 2018]) solfataricus (Sso) MCM complex was determined. We show that SsoMCM can bind both strands of fork DNA in multiple orientations, complicating interpretations; however, the NTD adjacent to the duplex region (N@duplex) on a 3’-long arm fork is significantly favored, providing more insight into the productive orientation. Binding to fork DNA is affected by the WH domain at the C-terminus, which influences the binding orientation. Deletion of the WH domain results in a loss of orientation specificity on 3’-long arm fork substrates mimicking the initial stages of helicase activation. Single-turnover DNA unwinding experiments reveal the stoichiometry of productively bound SsoMCM orientations that are influenced by the WH domain and correlate with an N-first translocation and unwinding mechanism. Finally, presteady-state fluorescence resonance energy transfer (FRET) experiments that directly monitor the translocation and unwinding direction of productive SsoMCM complexes confirm an N-first mode of unwinding.
Previously, our group and others have shown that SsoMCM loads onto fork DNA with the CTD towards the duplex (C@duplex) binding orientation (McGeoch et al., 2005; Rothenberg et al., 2007; Costa et al., 2014); however, its active translocation orientation has yet to be determined. This C@duplex binding orientation has been used to speculate that MCM also translocates in a C-first orientation (Remus et al., 2009; Graham et al., 2011; Zhou et al., 2012; Bell and Botchan, 2013; Costa et al., 2013; Costa et al., 2014; Miller and Enemark, 2015; Martinez et al., 2017). However, more recent evidence has shown that when assembled within a leading strand holoenzyme complex, yeast MCM2-7 helicase assembles with the NTD leading the way (N-first) (Georgescu et al., 2017). To quantify the binding orientation distribution of SsoMCM on model fork substrates more specifically, we utilized two separate and specific DNA cleavage strategies.
Single free cysteines within the CTD, at either C642 or C682, were utilized by mutating the other to alanine, releasing an inherent disulfide (McGeoch et al., 2005). Either cysteine was then labelled independently with the photoactivatable crosslinker, 4-azidophenacyl bromide (APB). APB attachment at C682 (C642A mutant) provided the greatest signal shift in mobility when crosslinked to DNA (Figure 1—figure supplement 1). APB crosslinking to DNA bases is generally non-specific after activation by UV light (Pendergrast et al., 1992; Nodelman et al., 2017), yet we detected significant crosslinking and subsequent ssDNA cleavage even in the absence of direct UV light (Figure 1—figure supplement 2). This was dependent on specific attachment of APB to SsoMCM (lanes 5 vs. 3 or 4), and it was enhanced after exposure to UV light and alkaline digestion (lanes 9–11). Overall, SsoMCM-APB had many cut sites along the length of both ssDNA substrates favoring positioning at the middle of the ssDNA substrate, implicating nonspecific binding orientation at multiple positions.
To further investigate the orientation of SsoMCM on equal arm fork DNA, APB (for crosslinking/digestion) or FeBABE (for a localized hydroxyl radical Fenton footprinting reaction; Owens et al., 1998) were conjugated at C682 using SsoMCM(C642A) mutant. Cleavage could be induced specifically with UV light/NaOH (APB) (Figure 1A or D) or hydrogen peroxide and ascorbic acid (FeBABE) (Figure 1B or E) on two separate forks labelled with 5’-Cy3 or 3’-Cy5 at the duplex end. In all situations, multiple cleavage sites were detected on the ssDNA region of the labelled strand (indicated by arrows), suggesting different orientation populations and positioning of SsoMCM. SsoMCM can bind 3’ or 5’ ssDNA arms with similar affinities to fork DNA, however when noncomplementary 3’ and 5’ fork arms are available, there is a preference for binding/encircling the 3’-arm (Rothenberg et al., 2007). SsoMCM has a significantly lower binding efficiency (~4 fold less) for duplex DNA over fork substrates measured at the single molecule level (Rothenberg et al., 2007), essentially eliminating the possibility of SsoMCM encircling the duplex region and contributing significantly to cutting the ssDNA arms. Furthermore, anisotropy experiments performed with SsoMCM and duplex DNA also show a larger dissociation constant (Kd) over fork substrates (Figure 1—figure supplement 3), suggesting that SsoMCM preferentially binds ssDNA arms of the fork DNA. Moreover, stoichiometric (~1:1 MCM6:DNA) concentration ratios were maintained throughout to promote binding to the highest affinity site and limit nonspecific binding to the duplex region. To test this directly, DNaseI footprinting experiments and Electrophoretic Mobility Shift Assays (EMSA) were performed and confirmed complete DNA binding without protection of the duplex region (Figure 1—figure supplement 4). 
Previously, we have shown that the 5’-excluded strand is protected from ssDNA nuclease digestion upon SsoMCM binding (Graham et al., 2011) and that titration of large amounts of SsoMCM on fork substrates does not compete off the external excluded strand to favor two hexamers binding (Carney and Trakselis, 2016). Therefore, the predominant bound species is a stoichiometric single SsoMCM hexamer encircling one ssDNA arm and interacting with the other on the exterior surface, but other minor populations also exist.
Using either cleavage agent, there is evidence for footprinting of the CTD of SsoMCM towards the duplex end (C@duplex) or the free ends (N@duplex) for either labelled substrate. Cleavage can occur on the encircled strand or the excluded strand consistent with the flexibility of the WH domain to interact with either strand at the fork junction. We quantified and compared the relative footprinting of the CTD delineated by the midpoint of the ssDNA region (Figure 1C and F). The midpoint of a ssDNA arm was selected for quantification based on a void in cleavage there and the strong preference for binding ssDNA over duplex DNA at stoichiometric concentrations to describe only binary binding orientations. For either agent (APB or FeBABE), there was a significant ~3:1 preference for placing the CTD closer to the duplex region (C@duplex) independent of which strand is labelled.
Although footprinting on equal arm fork DNA favors C@duplex, it is probable that some proportion of SsoMCM is encircling the 5’-arm, complicating our analysis and interpretation. Therefore, asymmetric arm fork DNA substrates that have a 3’-long arm with different length (0 nucleotide (nt) or eight nt) 5’-arms were designed. Fluorescence anisotropy binding experiments show that SsoMCM binds a 5’-long arm substrate with similar affinity to 3’-long arm substrates (Figure 1—figure supplement 3). Some archaeal species have a MCM central channel that can occupy both ss and dsDNA (Fletcher et al., 2003; Pape et al., 2003). Therefore, SsoMCM when loaded onto the 3’-long arm fork substrate containing a 0 nt 5’-arm has the possibility of being translocated over the duplex DNA region and then cleaving outside of our boundaries. In order to overcome this, substrates were designed with an 8 nt short 5’-arm. This length was designed to be long enough to prevent translocation over duplex DNA and short enough to prevent helicase loading onto the 5’-arm. It has been previously shown that archaeal MCM requires > 16 nts for productive binding/unwinding (Haugland et al., 2006).
Therefore, these orientation mapping experiments were repeated with APB labelled at C682 but limiting the 5’-arm to eight nts to enforce encircling of the 3’-arm. APB footprinting studies of the 3’-long arm substrate instead show that there is nearly a 1.5:1 preference of placing the NTD closer to the duplex region (N@duplex) (Figure 2A–B). There is a significant increase and reversed preference for orientating N@duplex for the 3’-long arm fork substrate over the equal arm fork substrate (Figure 1C). This suggests that the 5’-long arm either impacts the helicase orientation or that multiple populations of helicases can exist bound on either the 3’- or 5’-arm of the equal arm fork. Therefore, we repeated APB mapping experiments on an opposite 5’-long arm substrate with a shorter 8 nt 3’-arm (Figure 2C–D). Here, the footprinting orientations were reversed, with a >3:1 preference for C@duplex (Figure 2D). Therefore, on these long arm fork DNA substrates, SsoMCM can bind either the 3’- or 5’-arm in both orientations, but the preferred 3’-5’ polarity is CTD-NTD.
The WH domain at the C-terminus of SsoMCM is suggested as a substrate recognition or localization domain (Aravind et al., 2005). Moreover, the WH domain in both archaea and eukaryotes is considered important for determining MCM helicase loading and initiation during replication (Samson et al., 2016a; Martinez et al., 2017; Goswami et al., 2018) and mediates DNA binding (Gaudier et al., 2007). Thus, we hypothesized that the WH domain may have regulatory effect on directing the orientation of SsoMCM helicase on DNA. To determine this, we utilized SsoMCM-WH mutant (aa 1–612) with two separate cysteine mutations at the CTD (G452C and S456C) (Figure 3A). Footprinting experiments were repeated with APB labelled at either C452 or C456 of SsoMCM–WH on equal arm (Figure 3B) or 3’-long arm (Figure 3D) substrates. The results show a loss of orientation specificity (Figure 3C and E) compared with Figure 1C or 2B.
As shown above, SsoMCM-WH is likely bound on the equal arm fork DNA in at least four populations (two orientations and on either strand). SsoMCM WT on equal arm fork substrates (Figure 1C) specifically loads C@duplex, but when SsoMCM-WH binds the same substrate, it loses its binding preference (Figure 3C). When APB footprinting experiments were repeated with the 3’-long arm substrate, there was a complete loss of orientation specificity for both mutants (Figure 3E). These results show that the WH domain of SsoMCM influences the binding orientation of this helicase on equal arm fork DNA to place C@duplex but that this WH domain is less important when engaging ssDNA for translocation.
Previously, multiple reports have shown that the fraction unwound by SsoMCM generally hovers between 0.3 and 0.5 depending on the substrate and conditions (Barry et al., 2007; Graham et al., 2011; Graham et al., 2018). The proportion of SsoMCM bound in a productive orientation and state can be determined in a single-turnover DNA unwinding experiment. Single-turnover unwinding conditions were initiated by the simultaneous addition of a 20-fold excess of unlabelled ssDNA and ATP to a prebound SsoMCM/DNA complex. The proportion of productive translocating SsoMCM hexamers will correlate with the total unwound DNA fraction. Different Cy3 or Cy5 labelled DNA substrates comprised of equal 30 nt fork arms or asymmetric 30 and 8 nt arms were used for unwinding experiments with WT SsoMCM (Figure 4A). The fork DNA substrate has four possible SsoMCM binding orientations (N@duplex or C@duplex on either the 5’ or 3’-arms) and unwinds 0.26 ± 0.01 fraction of DNA. Instead, restricting loading to only the 3’-long arm (8 nt 5’-arm) with only two possible orientations significantly increased the unwound fraction to 0.54 ± 0.03. When experiments were repeated with 0 nt at the 5’ end, there was 2-fold decrease in unwound product confirming that SsoMCM can translocate over the duplex region of the substrates in the absence of any 5’-arm (Figure 4—figure supplement 1). Background unwinding on the 5’-long arm (with 0 or 8 nt 3’-arm) displays only 0.08 ± 0.01 or 0.13 ± 0.01 fraction unwound, respectively (Figure 4—figure supplement 1). Therefore, an 8 nt 3’-arm is not long enough to facilitate unwinding to any significant degree. Hence, the 3’-long arm (with 8 nt 5’-arm) fork substrate enables the most productive fraction of SsoMCM helicases competent for unwinding.
As the WH domain was shown above to influence the binding orientation, DNA unwinding was repeated on the fork and 3’-long arm with SsoMCM-WH. Previously, deletion of the WH motif had no effect on DNA binding affinity but significantly increased DNA unwinding in a steady-state experiment (Barry et al., 2007). The –WH mutant showed a significant increase in the unwound product with the fork (0.35 ± 0.01) (Figure 4B) but a slight decrease with the 3’-long arm (0.46 ± 0.01) (Figure 4C) compared with WT. An increased amount of unwound product with the fork substrate suggests a loss in specificity for SsoMCM orientation and correlates with the near equal N@duplex and C@duplex cleavage mapping (Figure 3C). The slight decrease in unwound product with the 3’-long arm correlates with the fraction of N@duplex mapped for WT (0.57 ± 0.03) (Figure 2B) or –WH (0.52 ± 0.01) (Figure 3E) on the same substrate. Therefore, the flexible WH domain influences the population distribution of binding SsoMCM on fork DNA.
Further comparison of DNA unwinding and footprinting results can lead to the identification of the proportion of SsoMCM bound in a productive orientation. The fraction unwound for the equal arm fork substrate, 0.26 ± 0.01 (Figure 4A), corresponds with a similar footprinting ratio of 0.23 ± 0.03 for N@duplex (Figure 1C) implicating an N-first translocation orientation. The fraction unwound for the 3’-long arm fork substrate, 0.54 ± 0.03 (Figure 4A), also corresponds with a footprinting ratio of 0.57 ± 0.03 for N@duplex (Figure 2B) again correlating with an N-first translocation mechanism.
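The comparison above can be checked numerically. The following is a minimal Python sketch, not part of the original analysis, that tests whether each unwound fraction and the matching N@duplex footprinting fraction agree within their combined uncertainty. Treating the reported ± values as independent errors that add in quadrature is an assumption made here for illustration, and the function name `agree_within_error` is hypothetical.

```python
import math

def agree_within_error(a, a_err, b, b_err):
    """Return (difference, combined_error, agrees) for two measurements.

    Assumption: the two errors are independent, so they add in quadrature.
    """
    diff = abs(a - b)
    combined = math.sqrt(a_err ** 2 + b_err ** 2)
    return diff, combined, diff <= combined

# Values quoted in the text: (unwound fraction, N@duplex footprinting fraction)
comparisons = {
    "equal arm fork": ((0.26, 0.01), (0.23, 0.03)),
    "3'-long arm fork": ((0.54, 0.03), (0.57, 0.03)),
}

for name, ((unwound, u_err), (n_frac, n_err)) in comparisons.items():
    diff, combined, ok = agree_within_error(unwound, u_err, n_frac, n_err)
    print(f"{name}: difference {diff:.3f}, combined error {combined:.3f}, agree: {ok}")
```

Under this assumption both substrates give a difference smaller than the combined error, which is the sense in which the unwound and N@duplex fractions correspond.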
To more directly monitor orientation and translocation, we turned to fluorescence assays. Steady-state FRET experiments were designed to qualitatively detect SsoMCM binding to forked DNA in a stalled and loaded state from the duplex region. The DNA substrate contains a biotin on the translocating strand (nine bases from the duplex junction) that when bound with streptavidin has been shown to inhibit DNA unwinding (Graham et al., 2011) (Figure 5A). A fluorescein-dT (FAM) is placed six nts beyond the biotin on the complementary strand and is used to detect FRET upon binding SsoMCM labelled at either the N-terminus or C682 with Cy3. SsoMCM is able to bind to this substrate in multiple orientations on either the 30mer 3’- or 20mer 5’-strands that will give drastically different FRET signals. The absolute FRET values will depend on the exact spatial location of Cy3 at the N or C-termini and the relative binding orientation distributions. The labelling of SsoMCM was controlled by dye stoichiometry and reaction time to give 0.4–0.6 Cy3 labels per SsoMCM protein. This puts on average 2–3 Cy3 molecules in the SsoMCM hexamer. The experimental FAM quenching result shows an overall small but significant quenching in fluorescence at 518 nm for both labelled SsoMCMs (Figure 5B) that is consistent with multiple binding populations. The distance of the FAM dye to Cy3 labels near the duplex is modelled to be less than the R0 value for this dye pair (~60 Å) and should be quenched vastly more for one construct over the other if there is a binding preference for either C@duplex or N@duplex. However, results from Figure 1 and 2 indicate that multiple binding orientations predominate favoring C@duplex when there is a long 5’-arm. Qualitatively, the larger quenching for the C-terminally labelled SsoMCM is consistent with a greater distribution that places C@duplex on this semi equal arm fork substrate (Figure 1).
In order to more directly monitor the orientation directionality during unwinding, we changed the experiment setup to monitor presteady-state fluorescence changes in a stopped-flow instrument capable of monitoring loading and translocation of the helicase at 57°C. The 5’-arm was shortened to seven nts to limit loading on that strand and distinguish between translocation orientations solely on the 3’-arm. Binding/loading of SsoMCM labelled at C682 or the N-terminus with Cy3 to fork DNA bound by streptavidin showed similar double exponential increases in Cy3 sensitization with rates of 0.57 ± 0.03 s−1 and 0.030 ± 0.001 s−1 or 0.65 ± 0.02 s−1 and 0.043 ± 0.01 s−1, respectively. (Figure 6—figure supplement 1). Exclusion of ATP in the experiment did not significantly change the exponential results, 0.62 ± 0.01 s−1 and 0.043 ± 0.001 s−1, for N-terminally labelled SsoMCM showing that binding is independent of nucleotide as shown previously (McGeoch et al., 2005). Increases in fluorescence are noted for both N and C-terminal labelled SsoMCM consistent with both orientations bound.
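For readers less familiar with the kinetic model, a double exponential increase can be written as F(t) = F0 + A1(1 − exp(−k1·t)) + A2(1 − exp(−k2·t)). The Python sketch below evaluates this form with the two rate constants reported above for Cy3 at C682. The amplitudes and offset are placeholder values chosen only for illustration, since they are not given in the text, and the function names are hypothetical.

```python
import math

def double_exponential(t, k1, k2, a1=1.0, a2=1.0, f0=0.0):
    """Signal at time t (s) for two independent first-order phases.

    a1, a2, and f0 are illustrative placeholders, not values from the study.
    """
    return f0 + a1 * (1.0 - math.exp(-k1 * t)) + a2 * (1.0 - math.exp(-k2 * t))

def half_time(k):
    """Half-time (s) of a first-order phase with rate constant k (s^-1)."""
    return math.log(2) / k

# Rate constants reported above for Cy3 at C682 (s^-1)
k_fast, k_slow = 0.57, 0.030

print(f"fast phase half-time: {half_time(k_fast):.1f} s")
print(f"slow phase half-time: {half_time(k_slow):.1f} s")
for t in (0, 1, 10, 100):
    print(t, round(double_exponential(t, k_fast, k_slow), 3))
```

The two phases are separated by more than an order of magnitude in time (half-times of roughly 1 s and 23 s), which is what allows them to be resolved as distinct exponentials.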
When we preloaded SsoMCM on DNA and instead initiated translocation with ATP in the second syringe, we could monitor directionality of movement by FRET up to the streptavidin block. The design of this experiment relies on the longitudinal length of SsoMCM (>85 Å), the loading orientation (N@duplex or C@duplex), the placement of dyes at the N or C-terminal ends, and the known 3’−5’ unwinding polarity. Although MCM helicases have been shown to displace streptavidin from biotin on the translocating strand, the rate of this displacement is on the order of hours. Our experimental time courses for these assays are 5 min, where no streptavidin displacement was shown previously (Graham et al., 2011). Therefore, upon addition of ATP, the MCM helicase will translocate into the duplex region (~9 nts) and stall at the streptavidin block, resulting in an increased FRET value only for a fluorescent label on the leading face. The length and sequence of the duplex (36 bases) were designed such that separation of ~9 base pairs would not result in a thermodynamically unstable intermediate at 57°C.
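The distance argument in this design can be illustrated with the standard Förster relation, E = 1 / (1 + (r/R0)^6). The following Python sketch is an illustration only: it uses the approximate R0 of 60 Å quoted above for this dye pair and treats the hexamer length (>85 Å) as a rough lower bound on the dye separation for a label on the trailing face, which ignores the real geometry of the complex. The function name is hypothetical.

```python
def fret_efficiency(r, r0=60.0):
    """Forster transfer efficiency for donor-acceptor distance r (same units as r0)."""
    return 1.0 / (1.0 + (r / r0) ** 6)

# Distances in angstroms; 85 is the minimum hexamer length quoted in the text.
for r in (30.0, 60.0, 85.0, 120.0):
    print(f"r = {r:5.1f} A  ->  E = {fret_efficiency(r):.3f}")
```

At r = R0 the efficiency is 0.5 by definition, and it falls to roughly 0.1 at 85 Å, which is why a label on the trailing face is expected to contribute little signal on stalling.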
When the Cy3 is labelled at C682, translocation N-first would show a minimal to no increase in fluorescence because of the large distance spanning the length of the SsoMCM hexamer; whereas translocation C-first will show a large increase in FRET upon stalling at the streptavidin block. When the stopped-flow experiment was performed, an initial increase (0.53 ± 0.05 s−1) within the first 10 s was noted followed by a slower and more significant decrease in fluorescence (1.1 ± 0.2×10−3 s−1) (Figure 6A). The first faster increase is consistent with more SsoMCM molecules being bound to the DNA template upon addition of ATP (Figure 6—figure supplement 1). The second slower change is consistent with dissociation, but not with C-first translocation.
Conversely, when Cy3 is located at the N-terminus, translocation C-first would show little to no change; whereas, translocation N-first would show a large increase in FRET. In the stopped-flow experiment with N-terminal labelled Cy3, there was an initial increase (0.26 ± 0.02 s−1) similar to that seen with the label at C682, followed by a larger and slower increase (1.5 ± 0.4×10−3 s−1) in fluorescence (Figure 6B). The second rate in both experiments (at 57°C) is consistent with the translocation/unwinding rate of SsoMCM at 60°C (Figure 4A). The single turnover unwinding rate for the 3’-long arm substrate (Figure 4A) is 0.07 ± 0.01 min−1 (or ~ 1.1 ± 0.16 x 10−3 s−1) and is for complete separation of the duplex. The second exponential rate in these presteady-state experiments is extremely similar to the single-turnover experiments and only measures translocation up to nine nts or one fourth of the duplex.
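The unit conversion and the fraction of duplex traversed quoted above follow from simple arithmetic, sketched here in Python for transparency. The helper name `per_min_to_per_s` is hypothetical; the input values are taken from the text.

```python
def per_min_to_per_s(rate_per_min):
    """Convert a first-order rate constant from min^-1 to s^-1."""
    return rate_per_min / 60.0

rate = per_min_to_per_s(0.07)   # single-turnover unwinding rate
error = per_min_to_per_s(0.01)  # its reported uncertainty
print(f"{rate:.2e} +/- {error:.1e} s^-1")

# Translocation to the streptavidin block covers 9 nt of the 36 base duplex.
fraction_of_duplex = 9 / 36
print(f"fraction of duplex traversed: {fraction_of_duplex:.2f}")
```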
When stopped-flow experiments were performed in the absence of streptavidin, similar initial increases are shown for both C682 (0.27 ± 0.01 s−1) and N-terminal (0.21 ± 0.01 s−1) labelled SsoMCM, but now slower and similar decreases are shown for both labelled constructs (0.85 ± 0.05×10−3 s−1 and 1.0 ± 0.5×10−3 s−1), respectively, consistent with unwinding past the biotin and FAM (Figure 6C and D). SsoMCM is known to unwind over small adducts such as biotin on the translocating strand (Graham et al., 2011) and movement past the FAM label on the excluded strand for both labelled SsoMCMs would result in an increase followed by a larger decrease in FRET upon strand separation that would be stochastically blurred in this time scale.
The fluorescent DNA substrates for the presteady-state FRET experiments were varied to limit duplex length (to 20 bp) and lengthen the single-strand region (to 80 bases) to reduce the possibility of binding and translocating on duplex DNA and complicating our interpretation. Biotin was incorporated four nucleotides prior to the duplex region where a FAM label was placed. Translocation of SsoMCM along the ssDNA region would stall when streptavidin was included prior to reaching the duplex but close enough to elicit an increase in FRET when SsoMCM is labelled on the leading face with Cy3.
When stopped-flow experiments were repeated with this substrate that included a long 3’-tail, FRET only increased significantly when SsoMCM was labelled on the N-terminus with Cy3 and when ATP was included (Figure 7A). An initial increase was noted for all experiments consistent with more complex formation as also seen in Figure 6. The second rate constant, 5.4 ± 0.2×10−3 s−1, represents ssDNA translocation by SsoMCM. The ssDNA translocation rate is ~5 fold greater than when DNA unwinding is required for a FRET increase (Figure 6B). Similar experiments with a long 5’-tail showed minimal changes in FRET (Figure 7B). However, when SsoMCM is labelled at C682 with Cy3, there is a small decrease in FRET (at a similar ssDNA translocation rate of 2.0 ± 0.3×10−3 s−1) consistent with the C-terminus moving away from the FAM label in a 3’ to 5’ manner. No significant change was noted when Cy3 was labelled at the N-terminus on this substrate.
Therefore, our results show that SsoMCM can be organized on fork DNA in both orientations with particular probabilities depending on the presence of the excluded strand and the C-terminal WH domain, but translocation and unwinding proceeds N-first in the 3’−5’ direction.
SsoMCM translocates in the 3’−5’ direction; however, the orientation during translocation with respect to N or C-first has come under question. Binding assays for archaeal and yeast MCMs on fork or ssDNA show a global orientation preference for the C@duplex (McGeoch et al., 2005; Costa et al., 2014); however, higher order complexes that include additional yeast replisome components orientate CMG with N@duplex, instead suggesting an N-first translocation mechanism (Georgescu et al., 2017). This has also been recently confirmed with an X-ray structure of SsoMCM bound to ssDNA in an N-first conformation (Meagher et al., 2019). The Costa and Diffley laboratories have provided some guidance that synergizes these two seemingly opposing results as intermediates during the loading, activation, and translocation steps (Douglas et al., 2018) that we can better explain mechanistically here.
According to our footprinting assays, there is evidence for placing either the CTD or NTD of SsoMCM towards the duplex end. Our site-specific footprinting experiments show that SsoMCM has a 3:1 preference for binding equal arm fork DNA with C@duplex, essentially consistent with our previous results (McGeoch et al., 2005; Rothenberg et al., 2007). When 3’-long arm fork DNA was used instead, there was a total reversal in orientation preference to 1.5:1 for binding N@duplex. Therefore, we suggest that a large proportion of the C@duplex in the equal arm fork DNA must have been contributed by SsoMCM encircling the 5’-strand DNA (Figure 1D–F) but cutting the 3’-strand in proximity. Comparing relative intensities of footprinting for the 5’-strand on the equal arm fork substrate (Figure 1D) to the 5’-long arm substrate (Figure 2C), there is a higher intensity for equal arm fork DNA, confirming that SsoMCM loaded on the 3’-encircled strand can cut the excluded strand owing to the flexibility rendered by the WH domain.
According to previous studies, the C-terminal WH domain of MCM is important for loading the helicase at origins (Samson and Bell, 2016b), but the overall DNA binding affinity of SsoMCM is not impaired with the WH deletion (Barry et al., 2007). Therefore, SsoMCM constructs with a deleted WH domain should have no preference for binding DNA in a particular orientation. Footprinting studies with SsoMCM-WH show an almost 1:1 nonselection of loading onto equal arm or asymmetric arm fork substrates in either orientation, suggesting that along with the DNA polarity, the WH domain influences the orientation of SsoMCM at the loaded state.
Coupling footprinting fractions with single-turnover unwinding experiments helped determine the orientation fraction of SsoMCM involved in active unwinding. The unwinding fraction for equal arm fork DNA substrate is 0.26 and a fractional preference of 0.23 for binding with N@duplex suggesting an N-first translocation orientation. Although SsoMCM has a preference for loading on the 3’-arm of a fork substrate (Rothenberg et al., 2007), we now show a significant population bound to the 5’-arm, however, SsoMCM bound to the 5’-arm is not productive with these substrates. Therefore, a 3’-long arm fork DNA substrate was used to restrict binding/loading onto only the translocating strand. On this substrate, SsoMCM has a 0.57 fractional preference for binding with N@duplex and also corresponds with single-turnover unwinding fraction of 0.54 corroborating an N-first unwinding translocation orientation and 3’−5’ polarity.
This SsoMCM loaded state (C@duplex) is analogous to an initial double hexamer converting to encircling one strand and excluding the other (Figure 8). Based on the accepted structure of the MCM double hexamer loaded onto dsDNA origins, the NTDs interact in a head-to-head conformation (Remus et al., 2009; Li et al., 2015). From that state, there are two possible mechanisms for encircling either the 5’−3’ or 3’−5’ strands (Abid Ali et al., 2017); however, in each case, the individual hexamers are still initially orientated in a C@duplex orientation when both DNA strands are present. Once the excluded strand is melted and displaced outside of the central channel, it can engage with the exterior surface of MCM in a steric exclusion and wrapping (SEW) mechanism (Graham et al., 2011). This preloaded and sequestered state is what we have detected in this report using footprinting studies on equal-arm fork DNA (C@duplex). Interestingly, SsoMCM may have a higher affinity for bubble substrates over fork or ssDNA substrates (Pucci et al., 2004), which may be achieved through direct double hexamer interactions and/or alternative binding configurations with the bubble region to promote conformational activation.
From there, translocation may proceed in the N-first mode, with the hexamers bypassing each other, as has recently been observed (Georgescu et al., 2017) and indirectly detected (Douglas et al., 2018), or in a C-first mode upon separation, as had been speculated (McGeoch et al., 2005; Rothenberg et al., 2007; Costa et al., 2014; Trakselis et al., 2017). Our presteady-state FRET experiments were performed to directly detect the orientation of the SsoMCM hexamer during active translocation and unwinding. Using this approach, we could directly monitor the translocation orientation between the NTD of SsoMCM and DNA to verify an N-first translocation mechanism.
The combined results from footprinting, single-turnover unwinding, and presteady-state FRET studies now all support an N-first translocation/unwinding mechanism for SsoMCM. Our results agree with the second pathway (Figure 8, ii) for translocation after loading at an origin, in which two hexamers that have converted to encircling only one DNA strand have to bypass each other to proceed N-first. Similarly, the AAA+ papillomavirus E1 helicase, which also translocates with 3’−5’ polarity, employs a strand exclusion mechanism to unwind DNA proceeding N-first (Enemark and Joshua-Tor, 2006; Lee et al., 2014). As suggested previously, this would provide an inherent physical control mechanism for DNA unwinding to regulate precise elongation timing (Li and O'Donnell, 2018). If pathway i) is incorrectly selected, the N-first 3’−5’ translocation mechanism would inherently block unwinding and render those loaded MCM origins inactive. The consequences of this nonproductive orientation cannot be determined from our current experiments.
The sole selection and encircling of one strand over the other and the conformational steps necessary within the MCM double hexamer remain to be determined and are actively being pursued by a number of laboratories. Some insight into strand selection has been gleaned from a closer examination of the CMG assembly and activation process in eukaryotes (Douglas et al., 2018), where ATP binding initiates CMG hexamer separation and early origin melting, in which DNA becomes underwound in preparation for ssDNA selection. Whether archaeal GINS and Cdc45 influence the binding population orientation on model forks remains to be determined, but the translocation orientation of N-first confirmed here will remain unchanged. Based on cryo-EM structures of the T7 replisome (Gao et al., 2019) and CMG (Georgescu et al., 2017) that include ssDNA, it is likely that a helical conformation of DNA will contact multiple subunits in the interior of the hexamer, both to engage the one DNA strand to be encircled and for translocation. How the other excluded ssDNA strand slides out between subunits is not yet known but may include contributions of Cdc45 and MCM10 in eukaryotes to remodel CMG and engage that excluded strand on the exterior surface for stability (Petojevic et al., 2015; Mayle et al., 2019).
ATP was obtained from Invitrogen (Carlsbad, CA). Azidophenacyl bromide (APB) was from Sigma-Aldrich (St. Louis, MO). 1-(p-Bromoacetamidobenzyl) ethylenediamine N, N,N (Fe-BABE) was from Dojindo (Rockville, MD). Streptavidin was from Invitrogen (Carlsbad, CA). Cy3 succinimidyl ester and maleimide were from ThermoFisher (Pittsburgh, PA). DNaseI was from NEB (Ipswich, MA). All other materials were from commercial sources and were analytical grade or better. Helicase buffer was used in all unwinding and binding reactions and consists of 125 mM potassium acetate, 25 mM Tris acetate (pH 7.5), and 10 mM magnesium acetate. DNA primers and substrates (Supplementary file 1) were all synthesized by Sigma-Aldrich (St. Louis, MO) or IDT (Coralville, IA) and gel purified using the crush and soak method (Maniatis et al., 1989). Preformed fork substrates: equal arm (DNA164/165), 3’-long arm/5’-(n)nts (n = 0; DNA165/189, n = 8; DNA165/171), 5’-long arm/3’-(n)nts (n = 0; DNA164/190, n = 8; DNA164/172), duplex (DNA180/188), DNA14-B/179 F, DNA14-B/182 F, DNA60-F/202-B and DNA204-F/203-B were heated to 95°C and cooled at a rate of 1°C/min to room temperature in a PCR instrument.
A cysteine was introduced into SsoMCM (1-612) (-WH) at G452C or S456C using a standard QuikChange protocol (Agilent, Santa Clara, CA) with KAPA HiFi DNA polymerase (KAPA Biosystems, Woburn, MA) with oligos in Supplementary file 1. Mutations were initially confirmed by silent mutations to create unique restriction sites and then by the DNA Sequencing Facility at The University of Texas at Austin (Austin, TX). SsoMCM full-length (WT, C642A, and C682A) or 1–612 (-WH: WT, G452C, G456C) were purified as previously described (McGeoch et al., 2005; Graham et al., 2011). Briefly, autoinduced SsoMCM was heat-treated at 70°C for 20 min, and the supernatant was applied to MonoQ, heparin, and S-200 gel filtration columns using an AKTA Pure (GE Healthsciences) to isolate the purified hexameric species.
APB was dissolved in 100% DMF at a concentration of 40 mM and then diluted to 4 mM in 20 mM Tris pH 7.5, 75 mM NaCl, 10% glycerol and 20% DMF, in the dark. APB was then added to a sample of SsoMCM protein (~10 µM monomer) containing a single cysteine in full length (at either C642 or C682) or in –WH (at either C452 or C456) (in 20 mM Tris [pH 7.5], 75 mM NaCl, 10% glycerol), to achieve a final concentration of 4 mM APB and 1% DMF. Labelling proceeded for 2–3 hr at room temperature. APB-labelled SsoMCM was incubated with fluorescent (Cy3 or Cy5 as indicated) fork DNA (150 nM) for 10–20 min in 1x CB buffer (20 mM TrisOAc, 25 mM KOAc, 10 mM MgOAc, 0.1 mg/ml BSA, 1 mM DTT) in 50 µl volumes (maintaining ~1:1 MCM6:DNA ratio). For cross-linking, samples were transferred to silanized cover slips and UV irradiated for 15 s before adding 150 µl of post irradiation buffer (20 mM Tris-HCl [pH 8.0], 0.2% SDS, 50 mM NaCl), vortexed, and placed at 70°C for 20 min. Next, 1 µl of 10 mg/ml salmon sperm DNA, 30 µl of 3.0 M NaOAc, and 750 µl of ice cold 100% ethanol were added, and the samples were vortexed and left on ice for 1–2 hr at −80°C. Samples were then spun in a microfuge at 4°C, 12,000 rpm for 30 min. The supernatant was discarded, and the pellet was washed twice with ice cold 70% ethanol. Ethanol was removed and the pellets were air dried by inverting on the bench for 1 hr and then resuspended by vortexing in 100 µl of 20 mM NH4OAc, 2% SDS, 0.1 mM EDTA pH 8.0. Samples were spun in a microfuge at room temperature for 10 min. Supernatants were transferred into fresh tubes and placed in a heat block at 90°C for 2 min. Then, 1 µl of 2 M NaOH was added, and the samples were vortexed briefly and incubated at 90°C for 20 min. After incubation, samples were pulse spun; 101 µl of 20 mM Tris-HCl pH 8.0, 1 µl of 2 M HCl, 1 µl of 2 M MgCl2, and 480 µl of 100% ethanol were added; and the samples were vortexed and placed at −80°C for 1–2 hr. The samples were pelleted in a microfuge at 4°C for 30 min, washed two times with ice cold 70% ethanol, and air dried on the bench for 1 hr.
The DNA pellet was resuspended with 5 µl of 40% glycerol loading buffer containing Orange G dye for gel loading, run on a 20% TBE-PAGE (native PAGE), and visualized on a Typhoon FLA 9000 imager (GE Healthsciences).
SsoMCM proteins containing a single cysteine at either 642 or 682 for full-length or at 452 or 456 for -WH were dialyzed overnight at 4°C into conjugation buffer (30 mM MOPS, 100 mM NaCl, 1 mM EDTA, 5% glycerol, pH 8.0). Conjugation was performed by mixing 400 µM of FeBABE with 20 µM SsoMCM and incubating at 37°C for 1 hr in the dark. After the 1 hr incubation, the FeBABE-protein conjugate sample was dialyzed against the cutting buffer (50 mM MOPS, 120 mM NaCl, 0.1 mM EDTA, 10 mM MgCl2, 10% glycerol). The FeBABE-protein conjugate was then mixed with fluorescent DNA (150 nM, as indicated) and incubated at room temperature for 30 min, maintaining ~1:1 MCM6:DNA ratio. Next, 2.5 µl of ascorbic acid solution (40 mM ascorbic acid, 10 mM EDTA, pH 8.0) was added and the mixture was vortexed for 2–3 s; H2O2 solution (40 mM H2O2, 10 mM EDTA) was then added immediately and the mixture was vortexed for 2–3 s. The reaction mixture was then incubated for 30 s and quenched by adding Orange G dye loading buffer with 40% glycerol. The samples were electrophoresed on a 20% TBE-PAGE gel and visualized on a Typhoon FLA 9000 imager (GE Healthsciences). Calculation of both the APB and FeBABE footprinting was performed by quantifying the relative density (minus background) for the labelled strand, divided at the midpoint on the ssDNA arm according to the following equation
A standard two-tailed, equal variance Student’s t-test was used to determine significant differences of C@duplex versus N@duplex. P-values are reported for each experimental condition.
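As an illustration of the comparison described above, the following minimal sketch computes the pooled-variance (equal variance) Student's t statistic for two sets of replicate fractions. The function name and the replicate values are hypothetical placeholders, not data from this study. The two-tailed P-value would then be read from a t distribution with the returned degrees of freedom, for example with `scipy.stats.ttest_ind(..., equal_var=True)`.

```python
import math
import statistics


def pooled_t_statistic(sample_a, sample_b):
    """Return (t, degrees_of_freedom) for an equal-variance two-sample t-test."""
    n_a, n_b = len(sample_a), len(sample_b)
    mean_a, mean_b = statistics.mean(sample_a), statistics.mean(sample_b)
    # Sample variances (n - 1 denominator), pooled across both groups.
    var_a, var_b = statistics.variance(sample_a), statistics.variance(sample_b)
    dof = n_a + n_b - 2
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / dof
    std_err = math.sqrt(pooled_var * (1.0 / n_a + 1.0 / n_b))
    return (mean_a - mean_b) / std_err, dof


# Hypothetical replicate fractions (illustrative only, not measured values).
c_at_duplex = [0.70, 0.75, 0.80]
n_at_duplex = [0.20, 0.25, 0.30]
t_stat, dof = pooled_t_statistic(c_at_duplex, n_at_duplex)
print(f"t = {t_stat:.2f}, degrees of freedom = {dof}")
```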
Single turnover helicase unwinding assays were assembled in helicase buffer with 15 nM concentration of fluorescent forked DNA (as indicated) incubated with 2 µM SsoMCM (WT or WH mutant) at 60°C for 5 min before initiating with 2 mM ATP and a 300 nM ssDNA trap (unlabelled strand with the same sequence as the fluorescently labelled strand). Three different fork DNA substrates with a 20 bp duplex region with either Cy3 or Cy5 labels at the duplex end and either 30 nt equal arms or 30 and 8 nt asymmetric arms were used. Unwinding reactions were quenched using an equal volume of quench solution (1.6% SDS, 50% glycerol, 0.1% w/v bromophenol blue, 100 mM EDTA) and an additional 300 nM ssDNA trap at various times. Reactions were placed on ice until loading and were electrophoresed on native 20% TBE-PAGE. The gels were visualized on a Typhoon FLA 9000 imager (GE Healthsciences). The fraction unwound was calculated using the equation:
where $I_{S}(t)$ and $I_{D}(t)$ are the intensities of the single- and double-stranded bands, respectively, at time t; subscripts 0 and b indicate equivalent counts at t = 0 and in the boiled sample, respectively. The fraction unwound was fit to a single exponential equation as a function of time according to:
where C is a constant for the amplitude, A is the amplitude change, and k is the rate (min−1). The amplitude change denotes the fraction of productive and processive unwinding complexes.
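The two calculations described above can be sketched as follows. Because the display equations are not reproduced in this text, the normalization used here (the single-stranded fraction at time t, corrected for the t = 0 sample and scaled to the boiled sample) and the rising exponential form F(t) = A(1 − e^(−kt)) + C are assumed common forms, not confirmed expressions from the study. All function names and numerical values are hypothetical.

```python
import math


def fraction_unwound(ss_t, ds_t, ss_0, ds_0, ss_b, ds_b):
    """Background-corrected fraction unwound at time t.

    ss_* and ds_* are single- and double-stranded band intensities at
    time t, at t = 0 (suffix _0), and in the boiled sample (suffix _b).
    Assumed normalization: (f_t - f_0) / (f_b - f_0).
    """
    f_t = ss_t / (ss_t + ds_t)
    f_0 = ss_0 / (ss_0 + ds_0)
    f_b = ss_b / (ss_b + ds_b)
    return (f_t - f_0) / (f_b - f_0)


def single_exponential(t, amplitude, rate, offset):
    """Assumed form: F(t) = A * (1 - exp(-k * t)) + C."""
    return amplitude * (1.0 - math.exp(-rate * t)) + offset


# Hypothetical band intensities (arbitrary units).
f = fraction_unwound(ss_t=50, ds_t=50, ss_0=10, ds_0=90, ss_b=90, ds_b=10)
print(f"fraction unwound = {f:.2f}")

# Hypothetical fit parameters; the curve rises from C toward A + C.
for minutes in (0, 1, 5, 30):
    print(minutes, round(single_exponential(minutes, 0.5, 0.4, 0.0), 3))
```

Under this assumed form, the plateau reached at long times equals A + C, so A reports the fraction of complexes that unwind productively in a single turnover.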
Anisotropy experiments were performed using a Cary Eclipse Spectrophotometer (Agilent, Santa Clara, CA) in CB buffer. The four forked DNA substrates (with equal arms or asymmetric arms) and the duplex substrate, labelled at the duplex end with either Cy3 at the 5’ end or Cy5 at the 3’ end, were annealed as described above. Anisotropy measurements were made at each concentration following a 2 min incubation after protein addition. Anisotropy values were collected with a 0.5 s integration time for three consecutive readings. Final values from at least three independent experiments were averaged and fit to a cooperative binding equation:
in which Y is the measured anisotropy, Amax is the maximal anisotropy, and n is the Hill coefficient. Fits were performed using Kaleidagraph (Synergy Software, v 4.2).
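A cooperative binding model of this kind is often written as a Hill equation. The sketch below assumes the form Y = Amax·[P]^n / (Kd^n + [P]^n), with Y taken as the baseline-subtracted anisotropy. The apparent dissociation constant `k_d`, the function name, and all parameter values are assumptions introduced for illustration and are not taken from the study.

```python
def hill_anisotropy(protein_conc, a_max, k_d, n):
    """Assumed cooperative binding form: Y = Amax * P^n / (Kd^n + P^n)."""
    p_n = protein_conc ** n
    return a_max * p_n / (k_d ** n + p_n)


# Hypothetical parameters (not fitted values from this study).
A_MAX, K_D, HILL_N = 0.30, 1.0, 2.0  # anisotropy units, µM, unitless
for conc in (0.0, 0.5, 1.0, 2.0, 8.0):
    y = hill_anisotropy(conc, A_MAX, K_D, HILL_N)
    print(f"[SsoMCM] = {conc:4.1f} µM -> Y = {y:.3f}")
```

In this form, the signal is half-maximal when the protein concentration equals `k_d`, and values of n above 1 steepen the transition, which is how cooperativity appears in the titration.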
DNaseI footprinting experiments were performed in stoichiometric MCM6:DNA concentration ratios. Equal arm forked DNA substrates (DNA164-5/DNA165) labelled at the duplex end with Cy5 were incubated with SsoMCM in 1x CB buffer for 15 min at room temperature in 10 µl reaction volumes to facilitate protein-DNA complex formation. The complexes were then digested by 0.1 U/µl DNaseI in 1x DNaseI reaction buffer incubated at 37°C for 30 s. Reactions were then quenched with 5 mM EDTA and heating to 75°C for 10 min. An equal volume of 100% formamide was added, and the samples were separated on a 20% denaturing PAGE gel.
EMSAs were performed in stoichiometric MCM6:DNA concentration ratios. Equal arm forked DNA substrates (DNA164-5/DNA165) labelled at the duplex end with Cy5 were incubated with SsoMCM in 1x CB buffer for 15 min at room temperature in 10 µl reaction volumes to facilitate protein-DNA complex formation. Then, 2 µl of loading buffer (30% v/v glycerol) was added to the reaction prior to being resolved on 5% native PAGE.
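Because SsoMCM concentrations may be quoted per monomer or per hexamer, a stoichiometric MCM6:DNA ratio implies a simple conversion, sketched below. The helper names are hypothetical, and the 150 nM DNA value is borrowed from the cross-linking reactions purely as an example.

```python
SUBUNITS_PER_HEXAMER = 6  # SsoMCM assembles as a homohexamer


def hexamer_conc(monomer_conc):
    """Convert a monomer concentration to the equivalent hexamer concentration."""
    return monomer_conc / SUBUNITS_PER_HEXAMER


def monomer_needed(dna_conc, hexamers_per_dna=1.0):
    """Monomer concentration giving the requested hexamer:DNA ratio."""
    return dna_conc * hexamers_per_dna * SUBUNITS_PER_HEXAMER


# Example: a 1:1 MCM6:DNA ratio with 150 nM DNA (all values in nM).
print(monomer_needed(150.0))
# Example: 500 nM monomer expressed as hexamer.
print(round(hexamer_conc(500.0), 1))
```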
Stopped-flow fluorescence experiments were performed on an Applied Photophysics (Leatherhead, UK) SX.20MV in fluorescence mode at a constant temperature of 57°C.
DNA14 was annealed to either DNA179 or DNA182 to generate two fork substrates with a 30 base 3’-arm and a 20 or 7 base 5’-arm; DNA60 was annealed to DNA202 to give a 3’-long tail substrate; or DNA204 was annealed to DNA203 to give a 5’-long tail substrate. SsoMCM(C642A) was labelled at the N-terminus or at C682 with Cy3 as described previously (McGeoch et al., 2005). Final concentrations of components after mixing were SsoMCM (500 nM or 83 nM hexamer), DNA (50–63 nM), streptavidin (0 or 188 nM), and ATP (0.5 mM), unless indicated otherwise. The samples were excited at 490 nm, and a 570-nm-cutoff filter was used to collect 4000 oversampled data points detecting only Cy3 emission over single or split-time bases. The slits were set at 3 mm for both excitation and emission. At least seven traces were averaged for each experiment, and experiments were repeated multiple times on multiple occasions. The observed averaged traces were fit to one, two, or three exponentials using the supplied software. Below is the equation for a double exponential fit:
where a is the amplitude change, k is the exponential rate, t is time, and C is a constant for the amplitude.
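The one, two, or three exponential models mentioned above can be written as a single sum of exponentials. The sketch below assumes the decaying form F(t) = Σ a_i·e^(−k_i·t) + C; the sign convention, the function name, and the parameter values are assumptions for illustration rather than the expression used by the instrument software.

```python
import math


def multi_exponential(t, terms, offset):
    """Sum of exponentials: F(t) = sum(a_i * exp(-k_i * t)) + C.

    `terms` is a list of (a_i, k_i) pairs; one, two, or three pairs give
    single, double, or triple exponential models.
    """
    return sum(a * math.exp(-k * t) for a, k in terms) + offset


# Hypothetical double exponential parameters (amplitudes, rates in s^-1).
double_terms = [(0.6, 5.0), (0.3, 0.5)]
for seconds in (0.0, 0.1, 1.0, 10.0):
    print(seconds, round(multi_exponential(seconds, double_terms, 1.0), 4))
```

With this convention, the signal starts at the sum of the amplitudes plus C and relaxes to C, with each phase contributing on its own time scale of 1/k_i.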
All data generated or analyzed during the study are included in the manuscript and supporting files.
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
We acknowledge the Baylor Molecular Bioscience Center (MBC) for providing instrumentation and resources aiding this project. We thank Alessandro Costa and Gregory Bowman for helpful discussions.
© 2019, Perera and Trakselis
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.