SARS-CoV-2 S protein:ACE2 interaction reveals novel allosteric targets
Abstract
The spike (S) protein is the main handle for SARS-CoV-2 to enter host cells via surface angiotensin-converting enzyme 2 (ACE2) receptors. How ACE2 binding activates proteolysis of S protein is unknown. Here, using amide hydrogen–deuterium exchange mass spectrometry and molecular dynamics simulations, we have mapped the S:ACE2 interaction interface and uncovered long-range allosteric propagation of ACE2 binding to sites necessary for host-mediated proteolysis of S protein, critical for viral host entry. Unexpectedly, ACE2 binding enhances dynamics at a distal S1/S2 cleavage site and flanking protease docking site ~27 Å away while dampening dynamics of the stalk hinge (central helix and heptad repeat [HR]) regions ~130 Å away. This highlights that the stalk and proteolysis sites of the S protein are dynamic hotspots in the prefusion state. Our findings provide a dynamics map of the S:ACE2 interface in solution and also offer mechanistic insights into how ACE2 binding is allosterically coupled to distal proteolytic processing sites and viral–host membrane fusion. Thus, protease docking sites flanking the S1/S2 cleavage site represent alternate allosteric hotspot targets for potential therapeutic development.
Introduction
The COVID-19 pandemic caused by the SARS-CoV-2 virus has sparked extensive efforts to map molecular details of its life cycle to drive vaccine and therapeutic discovery (Bar-Zeev and Inglesby, 2020). SARS-CoV-2 belongs to the family Coronaviridae, which includes other human pathogens such as the common cold-causing viruses (hCoV-OC43, HKU, and 229E), SARS, and MERS-CoV (Corman et al., 2018; St-Jean et al., 2004; Lau et al., 2006; Warnes et al., 2015). SARS-CoV-2 has an ~30 kb positive-stranded RNA genome with 14 open reading frames, encoding four structural proteins – spike (S) protein, membrane (M) protein, envelope (E) protein, and nucleocapsid (N) protein – as well as 16 non-structural proteins and 9 accessory proteins (Su et al., 2016; Andersen et al., 2020; Tan et al., 2005). An intact SARS-CoV-2 virion consists of a nucleocapsid core containing genomic RNA within a lipid–protein envelope forming a spherical structure of diameter ~100 nm (Ke et al., 2020). The viral envelope is decorated with S, M, and E proteins (Ke et al., 2020). The prefusion S protein is a club-shaped homotrimeric class I viral fusion protein that has distinctive ‘head’ and ‘stalk’ regions (Figure 1A).
A characteristic feature of SARS-CoV-2 is that upon host entry, its prefusion S protein is proteolyzed by host proteases into constituent S1 and S2 subunits. The S1 subunit comprises an N-terminal domain (NTD) and a receptor binding domain (RBD) that interacts with the host receptor angiotensin-converting enzyme-2 (ACE2) (Lan et al., 2020; Hoffmann et al., 2020) to initiate viral entry into the target cell (Yan et al., 2020). The defining virus–host interaction for entry is therefore that mediated by the viral S protein with the host ACE2 receptor (Lan et al., 2020). Binding to ACE2 primes the S protein for proteolysis by host furin proteases at the S1/S2 cleavage site (Walls et al., 2017; Vankadari, 2020). The S2 subunit consists of seven constituent domains harboring the membrane fusion machinery of the virus. These comprise the fusion peptide (FP), heptad repeat 1 (HR1), central helix (CH), heptad repeat 2 (HR2), connector domain (CD), transmembrane domain (TM), and cytoplasmic tail (CT) domain (Walls et al., 2020; Wrapp et al., 2020). Extensive structural studies (Ke et al., 2020; Walls et al., 2020; Fan et al., 2020; Turoňová et al., 2020) have captured S protein of coronaviruses in distinct open (PDB: 6VYB) (Walls et al., 2020) and closed (PDB: 6VXX) (Walls et al., 2020) conformational states relative to the orientation of the RBD. These structures additionally reveal distinct orientations of the ectodomain (ECD) in pre- and postfusion states and highlight the intrinsic ensemble nature of the S protein in solution. The S2 subunit promotes host–viral membrane fusion and viral entry (Figure 1B).
Despite extensive cryo-electron microscopy (cryo-EM) studies, a map of the S:ACE2 interface in solution is lacking, and how ACE2 binding to the RBD primes enhanced proteolytic processing at the S1/S2 site remains unknown. Amide hydrogen/deuterium exchange mass spectrometry (HDXMS), together with molecular dynamics (MD) simulations, offers a powerful combined approach for describing virus protein conformational dynamics and breathing (Lim et al., 2017a) and for mapping protein–protein interactions between host receptors and viruses (Lim et al., 2017b). Here, we describe the dynamics of free S protein and the S:ACE2 complex, which reveal allosteric effects of ACE2 binding-induced conformational changes at the distal stalk and at protease docking sites flanking the S1/S2 cleavage site. Our studies uncover distal ‘hotspots’ critical for the first step of SARS-CoV-2 infection that thereby represent novel targets beyond the RBD for therapeutic intervention.
Results and discussion
Subunit-specific dynamics and domain motions of S protein trimer
Structural snapshots of the ACE2 binding interface with the SARS-CoV-2 S protein have previously been described for the RBD alone (Lan et al., 2020; Wrapp et al., 2020; Ali and Vijayan, 2020; Chan et al., 2020; Wang et al., 2020a). In this study, we have expanded this to map interactions and dynamics of ACE2 binding with a larger S protein construct, S (1–1208), lacking only the C-terminal membrane spanning helices. Mutations at the S1/S2 cleavage site (PRRAR motif substituted by PGSAS motif) and at residues 986–987 (KV substituted by PP) were engineered (Wrapp et al., 2020) to block host cell-mediated S protein proteolysis during expression and purification (Figure 2—figure supplement 1). S (1–1208), ACE2, and RBD eluted as trimers, dimers, and monomers, respectively, on size-exclusion chromatography (Figure 2—figure supplement 1, Figure 3—figure supplement 1, and Figure 5—figure supplement 1). S protein hereafter in the text denotes S (1–1208). Isolated RBD constructs showed high-affinity binding to ACE2 (Figure 3—figure supplement 1, Figure 5—figure supplement 1).
HDXMS of S protein alone was next carried out as described in 'Materials and methods'. Pepsin proteolysis of the S protein generated 317 peptides with high signal to noise ratios, yielding a primary sequence coverage of ~87% (Figure 2—figure supplement 2). S protein is highly glycosylated (at least 22 sites have been predicted and characterized on S protein) (Watanabe et al., 2020). Of these, 20 sites are predicted to be N-linked glycosylation modifications. We obtained peptides spanning 12 of the 20 predicted glycosylation sites. None of these peptides were glycosylated, making deuterium exchange of non-glycosylated peptides the focus of this study.
HDXMS results were overlaid onto integrative models of the full-length S protein trimer built using experimental structures of prefusion S ECD in the open conformation (PDB ID: 6VSB) (Wrapp et al., 2020) and HR2 domain from SARS S protein as templates. A deuterium exchange heat map (t = 1 and 10 min) revealed the stalk region to show the greatest relative deuterium exchange (Figure 2A). This is consistent with earlier studies showing at least 60° sweeping motions of the three identified hinge regions of the stalk (Turoňová et al., 2020). This was further verified via all-atom MD simulations of the S protein model embedded in a viral model membrane, which showed significant motions of the S protein ECD resulting from the high flexibility of the stalk region (Figure 2B), combined with large atomic fluctuations around the HR2 domain, compared to the rest of the protein (Figure 2—figure supplement 3, Figure 2—figure supplement 4).
Interestingly, the deuterium exchange heat map also showed highest relative exchange in the S2 subunit (Figure 2—figure supplement 3) and helical segments of the stalk, while peptides spanning the FP showed relatively lower deuterium exchange overall. Individually, S1 and S2 subunits showed different intrinsic deuterium exchange kinetics, where average relative fractional deuterium uptake (RFU) at early deuterium exchange time points of S1 subunit (~0.25) was lower than the average RFU (~0.35) for the S2 subunit (Figure 2—figure supplement 3, source data – Figure 2—source data 1). Furthermore, peptides connecting the RBD to the rest of the S protein showed greater deuterium exchange, suggesting a ‘hinge’ role for this segment to facilitate RBD adopting an ensemble of open and closed conformational states (Figure 2C). Indeed, in our simulations of the S protein (Figure 2B), the RBD oriented initially in an ‘up’ conformation and exhibited spontaneous motion toward the ‘down’ conformation relative to the hinge region (Figure 2D, Figure 2—figure supplement 4A). Interestingly, a part of the receptor binding motif, specifically residues 476–486, exhibited a higher degree of flexibility based on its average atomic fluctuations (Figures 2B and 3B), suggesting a role for the ACE2 receptor in stabilizing S protein dynamics and priming it for host furin proteolysis.
The NTD of the S protein showed low overall RFU (~0.2), consistent with its well-structured arrangement of β-sheets connected by loops (Figures 1B and 2C). Importantly, certain regions showed significantly higher deuterium exchange (~0.4), of which two loci (136–143, 243–265) span the dynamic interdomain interactions with the RBD. This is supported by the high per-residue root mean square fluctuations (RMSFs) and large principal motions observed for residues 249–259 during simulations (Figure 2C, Figure 2—figure supplement 4C). One locus (291–303) at the C-terminal end of the NTD connecting to the RBD showed high deuterium exchange, indicating high relative motions of the two domains. The RBD (Figure 2D) showed an overall higher deuterium exchange (RFU ~0.35), with the peptides spanning the hinge regions (318–336) showing greatest deuterium exchange (~0.6). Peptides spanning residues 351–375 and 432–452 showed significantly increased deuterium uptake, and these correspond to the NTD interdomain interaction sites. Interestingly, certain loci of the RBD at the ACE2 interface (453–467, 491–510) showed higher intrinsic exchange.
Overall, the S2 subunit showed variable deuterium exchange across the constituent domains (Figure 2E, Figure 2—figure supplement 3). Interestingly, peptides spanning the region directly C-terminal to the S1/S2 cleavage site showed the greatest deuterium exchange (0.6). Congruently, our MD simulations revealed the unstructured loop housing the S1/S2 cleavage site (residues 677–689) to be highly dynamic (Figure 2—figure supplement 4), with RMSFs reaching >1.0 nm. It is important to note that the S1/S2 cleavage site has been abrogated in the construct of the S protein used in this study to block proteolytic processing into S1 and S2 subunits during expression in host cells. We observed lower deuterium exchange (and lower RMSF values) at peptides forming the CH and CD, suggesting their function as the central stable core of prefusion S. In contrast, peptides spanning hinge segments and heptad repeats (HR1 and HR2) showed high exchange and RMSF values, reflecting the S protein’s ensemble properties encompassing prefusion, fusion, and postfusion conformations in solution.
Domain-specific and global effects of ACE2 binding to the RBD
Comparative HDXMS of the S protein and S:ACE2 complex showed large-scale changes in S protein upon ACE2 binding. The RBD forms the main interaction site on S protein for ACE2. We therefore set out to comparatively map HDXMS of ACE2:RBD interface of an isolated MBP fusion construct of the RBD (‘RBDisolated’) (Figure 3C, Figure 3—figure supplement 2—source data 1 Supplementary file 1: Table S2) with S:ACE2 complex (Figure 4A, B). A list of peptides common to RBDisolated and S protein (‘RBDS’) showed differences in deuterium exchange only at interdomain interfaces within individual monomers and trimer interaction sites in the S protein (Supplementary file 1: Table S3). Several RBDS peptides showed decreased exchange upon complexation with ACE2 (Figure 3). These include peptides 340–359, 400–420, 432–452, and 487–502 in the RBDS:ACE2 complex (Figure 4). Sites showing deuterium exchange protection are consistent with the RBD:ACE2 interface described by X-ray crystallography (PDB: 6M0J) (Lan et al., 2020). Further, HDXMS revealed the core of this interface to be contributed by peptides 340–359, 400–420, 432–452, and 491–510 (Figure 4A, D, Figure 2—figure supplement 3). Interestingly, loci showing large-magnitude differences in deuterium exchange correlate to certain mutational hotspots (Wang et al., 2020b).
A closer examination of the RBDisolated:ACE2 interface by HDXMS also revealed decreased exchange in peptides spanning these regions (Figure 3). However, the magnitude of deuterium exchange protection was significantly more in RBDisolated than in RBDS, potentially reflecting the higher flexibility in the full-length S trimer relative to free RBD, interfering with ACE2 binding. High-resolution structures of RBD:ACE2 reveal the core of the RBD interface to be formed by amino acids Y449, Y453, N487, Y489, G496, T500, G502, Y505, L455, F456, F486, Q493, Q498, and N501 (Wang et al., 2020a). These correspond to peptide 448–501 from S protein and RBDisolated in our HDXMS study.
Cryo-EM studies have shown that each RBD in the trimeric S protein can adopt an open conformation irrespective of other RBDs, indicating an absence of cooperativity between the three RBDs within a trimer (Ke et al., 2020). Therefore, we compared the deuterium exchange profiles of RBDisolated with RBDS and observed differences in dynamics imposed by quaternary contacts (Figure 3). Overall, the loci with high and low deuterium exchange profiles were similar when compared between RBDisolated and RBDS, both at the disordered ACE2 receptor binding region and the folded regions at the N- and C-termini. In solution, RBDS toggles between open and closed conformations, resulting in an average readout of deuterium exchange measurements.
ACE2 binding to RBDisolated and RBDS resulted in similar effects: we observed deuterium exchange protection at the peptides spanning the known binding interface of the RBD. Notably, increased deuterium exchange was observed at the hinge region (Figure 3D), indicating allosteric conformational changes associated with restricted interconversion between the open and closed states. The destabilization/local unfolding observed at the hinge region upon ACE2 binding therefore enables the RBD to maintain the open conformation. Small molecules and biologics that target the hinge region to lock the RBD in the closed state could thus be of high therapeutic value.
ACE2 binding to RBD is allosterically propagated to the S1/S2 cleavage site and HR
Unexpectedly, ACE2 binding at the RBD induced large-scale changes in deuterium exchange in distal regions of the S protein. Some of the peptides in the stalk of S protein showed decreased exchange in the S:ACE2 complex (Figure 4C,D). This indicates that ACE2 receptor interactions stabilized the hinge dynamics in the S protein. Decreased exchange was also seen at distal sites in the S2 subunit, localized at the FP locus and CH. Interestingly, increased exchange was seen in multiple peptides flanking the S1/S2 cleavage site, in the HR1 domain, and critically at the S1/S2 cleavage site itself (Figure 4D). Even though the protease cleavage site is abrogated in the construct used in this study, we still observed increased dynamics as inferred by the higher relative deuterium exchange at the S1/S2 locus. Furthermore, this region exhibited high RMSF values during simulations (Figure 2—figure supplement 4B). These results indicate that ACE2 binding induces allosteric enhancement of dynamics at this locus, providing mechanistic insights into the conformational switch from the prefusion to the fusogenic intermediate. Differences in deuterium exchange between free S protein and the S:ACE2 complex show stabilization at the ACE2 interacting site and local destabilization at peptides juxtaposed to the S1/S2 cleavage site and HR1 (peptides 931–938). This suggests that ACE2 binding allosterically primes HR1 and other high exchanging regions flanking the S1/S2 cleavage site for enhanced furin protease binding and cleavage. Importantly, these results suggest that the S1/S2 cleavage site is a critical hotspot for S protein dynamic transitions that facilitate SARS-CoV-2 entry into the host, and therefore represents a new target for inhibitory therapeutics against the virus.
Dynamics of RBD:ACE2 and S:ACE2 protein interactions provides insights for viral–host entry
Considering the indispensable role of ACE2 binding in SARS-CoV-2 infection, it is crucial to assess the effects of S protein and RBD binding on ACE2 dynamics (Figure 5, Figure 5—figure supplements 1–3, Supplementary file 1: Table S4). We therefore mapped the corresponding binding sites of RBD, both isolated and within the S protein, onto ACE2. The S:ACE2 complex represents the prefusion pre-cleavage state wherein full-length S protein is bound to the ACE2 receptor (Figure 1B, ii), while the RBDisolated:ACE2 complex represents the post-furin cleavage product formed by the S1 subunit and ACE2 (Figure 1B, iii). Previous studies have shown that 14 key amino acids of RBD interact with ACE2, wherein mutations at six sites resulted in higher binding affinity of SARS-CoV-2 (Li et al., 2005). SARS-CoV-2 adopted a different binding mode to ACE2 as a superior strategy for infection compared to SARS-CoV-1. A crystal structure of RBDisolated:ACE2 complex has identified 24 key ACE2 residues, spanning across peptides 16–45, 79–83, 325–330, 350–357, and R393 (Towler et al., 2004). While most of these residues are conserved in binding to both SARS-CoV-1 and SARS-CoV-2, R393 and residues 325–330 are unique to SARS-CoV-1 interaction (Wang et al., 2020b). Interestingly, we observed increased deuterium exchange at these residues in the S:ACE2 complex compared to ACE2 alone (Figure 5C). Identifying the intrinsic dynamics and allosteric changes upon binding could guide development of therapeutic antibodies and small molecule drugs.
Simulations of the ACE2 dimer complexed with the B0AT1 amino acid transporter (PDB: 6M1D) (Yan et al., 2020) in a model epithelial membrane revealed a large motion of the peptidase domain, which recognizes the S protein RBD, with respect to the transmembrane and juxtamembrane domains (Figure 5—figure supplement 3). This large motion is reminiscent of the flexible tilting displayed by the S protein ECD itself, suggesting that both S protein and ACE2 have adaptable hinges that allow for orientational freedom of the domains involved in recognition. To understand how S protein binding affects ACE2 dynamics, we performed HDXMS experiments of monomeric ACE2 alone, S:ACE2 and RBD:ACE2 complexes (Figure 5, Figure 5—figure supplement 2) and mapped the deuterium exchange values onto a deletion construct of ACE2 (PDB: 1R42) (Towler et al., 2004; Figure 5, Figure 5—figure supplement 2). We observed a reduction in deuterium exchange across both RBDisolated:ACE2 and larger S:ACE2 complexes compared to free ACE2 (Figure S8B and S8C). Differences in deuterium exchange between RBDisolated:ACE2 complex and free ACE2 showed that RBD binding stabilizes ACE2 globally, specifically large differences at the binding site (peptides 21–29, 30–39, and 75–92), and also at distal regions (peptides 121–146, 278–292, and 575–586) from the RBD binding site of ACE2 (Figure 5E). Cryo-EM studies have shown that a dimeric full-length ACE2 receptor can stably bind to one trimer of the S protein (Yan et al., 2020).
Conclusions
Here, a combination of HDXMS and MD simulations provides a close-up of S protein dynamics in the prefusion, ACE2-bound, and other associated conformations. Our results reveal the energetics of the S:ACE2 complex interface. ACE2 binding to the isolated RBD and S protein alike leads to binding and stabilization. Interestingly, ACE2 binding to the RBD induces global conformational changes across the entire S trimer. Importantly, the stalk region undergoes dampening of conformational motions while showing increased deuterium exchange at the proteolytic processing sites. This study may help in explaining how mutations in emerging strains in the ongoing COVID-19 outbreak might alter dynamics and allostery of ACE2 binding and offer a mechanistic basis for altered infectivities observed in emerging strains. Sites on S protein showing altered deuterium exchange describe allosteric propagation of ACE2 binding and represent novel cryptic targets for therapeutic small molecule inhibitor/antibody discovery.
Materials and methods
Materials
Mass spectrometry grade acetonitrile, formic acid, and water were from Fisher Scientific (Waltham, MA); deuterium oxide was from Cambridge Isotope Laboratories (Tewksbury, MA). All reagents and chemicals were research grade or higher and obtained from Merck-Sigma-Aldrich (St. Louis, MO).
Methods
Transient expression and purification of recombinant SARS-CoV-2 spike, RBD, and ACE2 receptor
A near-full-length S protein of SARS-CoV-2 (1–1208; Wuhan-Hu-1; GenBank: QHD43416.1), excluding TM and CT, was codon optimized for mammalian cell expression and cloned into pTT5 expression vector with a twin strep tag at the C-terminus (Twist Biosciences, Singapore). Mutations were introduced into this construct at two sites: (i) RRAR motif at the S1/S2 cleavage site (682–685) was substituted by GSAS and (ii) KV motif (986–987) was substituted with two prolines. A gene encoding SARS-CoV-2-RBD (319–591 of SARS-CoV-2 spike) (BioBasic, Singapore) was cloned into the expression vector pHLmMBP-10 as a fusion protein with N-terminal mMBP and C-terminal hexahistidine tags. A gene encoding human ACE2 (residues 21–597) fused to a C-terminal Fc-tag (BioBasic, Singapore) was cloned into vector pHL-sec between the signal peptide and C-terminal 6xHis tag. S (1–1208) was expressed in HEK293-6E using polyethylenimine as the transfection reagent, while the isolated RBD (‘RBDisolated’) and ACE2 constructs were expressed in Expi293F using the Expi293 System. Culture supernatant was harvested on day 7 for HEK293-6E expression and day 5 for Expi293F expression. S protein was affinity purified using Strep-TactinXT column (IBA), RBD protein was affinity purified using cOmplete His-Tag Purification column (Merck, Darmstadt, Germany), and ACE2 receptor was affinity purified using HiTrap MabSelect SuRe column (GE Healthcare, Chicago, IL, USA). Purified proteins were concentrated and buffer exchanged into phosphate buffered saline (PBS) using VivaSpin, and the purity was assessed by denaturing polyacrylamide gel electrophoresis (Figure 2—figure supplement 1A, Figure 3—figure supplement 1A, and Figure 5—figure supplement 1A). Cell lines obtained commercially are listed in the key resources table and were tested for contamination by Mycoplasma species.
Characterization of RBD:ACE2 receptor binding
Interactions between recombinant purified MBP-RBD and ACE2 receptor (Figure 3—figure supplement 1A and Figure 5—figure supplement 1A) were confirmed by enzyme-linked immunosorbent assay. To test binding activity of ACE2, 96-well maxisorp plates were coated with 100 µL of 27.2 nM MBP-RBD diluted in PBS at 4°C for 16 hr and blocked with 350 µL of 4% skimmed milk in PBST (0.05% Tween 20 in PBS) at room temperature for 1.5 hr. This was followed by 1 hr incubation with ACE2 (100 µL) at varying concentrations and detection with 100 µL of goat-anti-human IgG Fc HRP diluted at 1:5000 in 2% skimmed milk in PBST for 1 hr. Plates were washed three times in PBST after each incubation step above. After 5 min incubation with 100 µL of 3,3′,5,5′-tetramethylbenzidine, the reaction was stopped with 100 µL of 1 M H2SO4 and absorbance at 450 nm (A450) was recorded. A similar protocol was adopted for the quality testing of MBP-RBD: it was coated at variable concentrations in PBS at 4°C for 16 hr and blocked at room temperature for 1.5 hr. This was followed by 1 hr incubation with 10.4 nM ACE2 (100 µL) diluted in blocking buffer. Detection, plate washing, and color development steps were performed in the same manner as described above. Data represent the average of three replicates, shown with error bars, and were plotted using GraphPad Prism 5 (San Diego, CA).
Deuterium exchange
S protein (8 µM), ACE2 (52 µM), and RBD (67 µM) solubilized in PBS (pH 7.4) were incubated at 37°C in PBS buffer reconstituted in D2O (99.90%), resulting in a final D2O concentration of 90%. S:ACE2 and RBDisolated:ACE2 complexes (KD of ~15 and ~150 nM, respectively) (Wrapp et al., 2020) were pre-incubated at 37°C for 30 min in a 1:1 molar ratio to achieve >90% binding prior to each hydrogen–deuterium exchange reaction. Deuterium labeling was performed for 1, 10, and 100 min for the isolated construct of RBD, free ACE2, and the RBDisolated:ACE2 complex. For isolated S protein and the S:ACE2 complex, 1 and 10 min labeling times were used. Pre-chilled quench solution (1.5 M GnHCl and 0.25 M Tris(2-carboxyethyl) phosphine-hydrochloride) was added to the deuterium exchange reaction mixture to lower the pHread to ~2.5 and the temperature to ~4°C. Next, the quenched reaction was incubated on ice (4°C) for 1 min, followed by online pepsin digestion.
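The link between the quoted KD values and the >90% binding target can be illustrated with the standard 1:1 binding quadratic. The following is a minimal sketch only: the function name and the total concentrations used in the example (8 µM and 50 µM) are illustrative assumptions, not values reported for the pre-incubation step.

```python
import math


def fraction_bound(p_total, l_total, kd):
    """Fraction of the limiting partner in complex for simple 1:1 binding.

    All three inputs must share the same concentration unit.
    Solves [PL]^2 - (P + L + KD)[PL] + P*L = 0 for the physical root.
    """
    b = p_total + l_total + kd
    complex_conc = (b - math.sqrt(b * b - 4.0 * p_total * l_total)) / 2.0
    return complex_conc / min(p_total, l_total)


# Hypothetical total concentrations in uM (assumed for illustration only).
examples = (
    ("S:ACE2, KD ~15 nM", 8.0, 8.0, 0.015),
    ("RBD:ACE2, KD ~150 nM", 50.0, 50.0, 0.150),
)
for label, p, l, kd in examples:
    print(f"{label}: fraction bound = {fraction_bound(p, l, kd):.3f}")
```

Because the bound fraction falls as the total concentrations approach the KD, the same calculation can be repeated with the actual working concentrations to check whether the >90% criterion holds after dilution.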
Mass spectrometry and peptide identification
Approximately 100 pmol of quenched sample was injected onto a chilled nanoUPLC HDX sample manager (Waters, Milford, MA). The injected samples were subjected to online digestion using an immobilized Enzymate BEH pepsin column (2.1 × 30 mm) (Waters, Milford, MA) in 0.1% aqueous formic acid at 100 μL/min. Simultaneously, the proteolyzed peptides were trapped in a 2.1 × 5 mm C18 trap (ACQUITY BEH C18 VanGuard Pre-column, 1.7 μm, Waters, Milford, MA). Following pepsin digestion, the proteolyzed peptides were eluted using an acetonitrile gradient of 8–40% in 0.1% formic acid at a flow rate of 40 µL/min into a reverse phase column (ACQUITY UPLC BEH C18 Column, 1.0 × 100 mm, 1.7 μm, Waters, Milford, MA) pumped by a nanoACQUITY Binary Solvent Manager (Waters, Milford, MA). Electrospray ionization was used to ionize peptides sprayed into a SYNAPT G2-Si mass spectrometer (Waters, Milford, MA), and data were acquired in HDMSE mode. A flow rate of 5 µL/min was used to continually inject 200 fmol μL−1 of [Glu1]-fibrinopeptide B ([Glu1]-Fib) as lockspray reference mass.
For identification of the resolved and eluted peptides, HDMSE method was used with ion-mobility settings 600 m/s wave velocity and 197 m/s transfer wave velocity. Low collision energies of 4 and 2 V were used for trap and transfer, respectively, while high collision energy was ramped from 20 to 45 V. A constant 25 V cone voltage was used, and mass spectra within 50–2000 Da were acquired for 10 min with mass spectrometer operated in positive ion mode.
Undeuterated protein samples were used to identify sequences from mass spectra data (in HDMSE mode) using ProteinLynx Global Server (PLGS) v3.0. Peptide identification search was performed against a separate sequence database of each protein sequence, along with its respective affinity purification tag sequences. PLGS search parameters selected for peptide sequence identification were (i) no specific protease and (ii) variable N-linked glycosylation modification. Additional cutoff filters applied included (i) minimum intensity = 2500, (ii) minimum products per amino acid = 0.2, and (iii) a precursor ion mass tolerance of <10 ppm in DynamX v.3.0 (Waters, Milford, MA), and peptides were confirmed for pepsin cleavage specificity as described (Hamuro et al., 2008). Peptides independently identified under the specified conditions and present in at least two out of three undeuterated sample replicates were retained for HDXMS analysis. S protein contains 22 variable glycosylation sites (Watanabe et al., 2020), out of which we identified peptides spanning 12 glycosylation sites in our sample (Figure 2—figure supplement 2). However, none of these peptides showed glycosylation. For ACE2, we obtained peptides overlapping four glycosylation sites (Figure 5—figure supplement 3).
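The cutoff filters and the two-out-of-three replicate rule described above can be expressed as a short filter. This is a sketch under stated assumptions, not a reproduction of the DynamX workflow: the `PeptideHit` record, its field names, and the helper functions are hypothetical, and only the three thresholds and the replicate rule come from the text.

```python
from dataclasses import dataclass


@dataclass
class PeptideHit:
    """Hypothetical record for one identified peptide in one replicate."""
    sequence: str
    intensity: float
    n_products: int          # matched product (fragment) ions
    observed_mass: float     # Da
    theoretical_mass: float  # Da


def ppm_error(hit):
    """Signed precursor mass error in parts per million."""
    return (hit.observed_mass - hit.theoretical_mass) / hit.theoretical_mass * 1e6


def passes_filters(hit, min_intensity=2500, min_products_per_aa=0.2, max_ppm=10.0):
    """Apply the three cutoffs quoted in the text to a single hit."""
    return (
        hit.intensity >= min_intensity
        and hit.n_products / len(hit.sequence) >= min_products_per_aa
        and abs(ppm_error(hit)) < max_ppm
    )


def retained_peptides(replicates, min_replicates=2):
    """Keep sequences that pass the filters in at least `min_replicates` replicates."""
    counts = {}
    for hits in replicates:
        for seq in {h.sequence for h in hits if passes_filters(h)}:
            counts[seq] = counts.get(seq, 0) + 1
    return sorted(seq for seq, n in counts.items() if n >= min_replicates)
```

A typical call would pass a list of three per-replicate hit lists and receive the sorted sequences retained for HDXMS analysis.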
RFU is the ratio of the number of deuterons exchanged to the total number of exchangeable amides of the peptide. Centroid masses of undeuterated reference spectra were subtracted from equivalent spectra of deuterium exchanged peptides to calculate the average deuterons exchanged for each peptide. Deuterium exchange plots, relative deuterium exchange, and difference plots were generated by DynamX v.3.0. The N-terminus and all prolines in each peptide were excluded for estimation of exchangeable amides per peptide (Hoofnagle et al., 2003). Deuterium exchange experiments for two biological replicates and technical triplicates of S protein and the S:ACE2 complex were carried out. Average deuterium exchange measurements between the two biological replicates were within ±0.3 Da (Supplementary file 1: Table S5, S6) (Houde et al., 2011). While deuterium exchange values are not corrected for back exchange, fully deuterated S protein samples were used to measure deuterium back exchange. A list of peptides with back exchange values is shown in Supplementary file 1: Table S7. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD023138.
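The RFU definition above reduces to a small calculation. The sketch below follows the stated rules (centroid mass difference divided by the number of exchangeable amides, excluding the N-terminal residue and all prolines); the function names, the example peptide, and its centroid masses are made up for illustration, and no back-exchange correction is applied, as in the text.

```python
def exchangeable_amides(sequence):
    """Count backbone amides that can report exchange.

    The N-terminal residue and every proline are excluded.
    """
    return sum(1 for residue in sequence[1:] if residue != "P")


def deuterons_exchanged(centroid_deuterated, centroid_undeuterated):
    """Average deuterons taken up, from centroid masses in Da."""
    return centroid_deuterated - centroid_undeuterated


def rfu(centroid_deuterated, centroid_undeuterated, sequence):
    """Relative fractional uptake: deuterons exchanged per exchangeable amide."""
    n_amides = exchangeable_amides(sequence)
    if n_amides == 0:
        raise ValueError("peptide has no exchangeable amides")
    return deuterons_exchanged(centroid_deuterated, centroid_undeuterated) / n_amides


# Hypothetical peptide and centroid masses (illustrative values only).
peptide = "YQPYRVVVL"
free_state = rfu(1138.05, 1135.60, peptide)
complex_state = rfu(1137.00, 1135.60, peptide)
print(exchangeable_amides(peptide), round(free_state, 2), round(complex_state, 2))
```

In a difference plot, the quantity of interest is the deuterons exchanged in the complex minus that in the free protein for the same peptide and time point; a negative value indicates protection upon binding.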
Modeling and MD simulations
An integrative model of full-length SARS-CoV-2 S protein was built using Modeller v.9.21 (Šali and Blundell, 1993). The cryo-EM structure of prefusion S ECD in the open conformation (PDB: 6VSB) (Wrapp et al., 2020) was used as the template for the ECD, with missing loops on the NTD and the C-terminus of the ECD modeled based on the cryo-EM structure of S ECD in the closed conformation resolved at a higher resolution (PDB: 6XR8) (Cai et al., 2020). The nuclear magnetic resonance (NMR) structure of the SARS S HR2 domain (PDB: 2FXP) (Hakansson-McReynolds et al., 2006) was used as the template for the HR2 domain, while the TM domain was modeled using the NMR structure of the HIV-1 gp-41 TM domain (PDB: 5JYN) (Dev et al., 2016). Ten models were built and subjected to stereochemical assessment using the discrete optimized protein energy (DOPE) score (Eramian et al., 2006) and Ramachandran analysis (Ramachandran et al., 1963). The model with the lowest DOPE score and the smallest number of Ramachandran outliers was chosen. Palmitoylation was performed at three cysteine residues (C1236, C1240, and C1243) on the CT domain based on a study showing its importance in SARS S protein function (Petit et al., 2007). The S protein model was then embedded into a model membrane representing the endoplasmic reticulum–Golgi intermediate compartment (ERGIC) (van Meer, 1998), where coronaviruses are known to assemble in a bud form (Krijnse-Locker et al., 1994; Klumperman et al., 1994). The ERGIC model membrane was built using CHARMM-GUI Membrane Builder (Lee et al., 2019).
All-atom MD simulation was performed for 200 ns using GROMACS 2018 (University of Groningen, Netherlands) (Abraham et al., 2015) and the CHARMM36 force field (Huang and MacKerell, 2013). The system was solvated with 590,742 TIP3P water molecules and 0.15 M NaCl, achieved by adding 3235 Na+ and 2103 Cl- ions. Minimization and equilibration were performed following standard CHARMM-GUI protocols (Lee et al., 2016). Equilibration comprised six steps: the first two used a 1 fs integration time step for 125 ps, while the last four used a 2 fs time step for 250 ps. With each step, the positional and dihedral restraints imposed on the protein and lipid molecules were gradually reduced by lowering the force constants from 1000 (step 1) to 0 kJ mol−1 nm−2 (step 6). Temperature and pressure were maintained at 310 K and 1 atm, respectively, using the Berendsen thermostat and barostat during equilibration. In the subsequent production run, the temperature was maintained using the Nosé–Hoover thermostat (Nosé, 1984; Hoover, 1985) and the pressure was maintained via semi-isotropic coupling to the Parrinello–Rahman barostat (Parrinello and Rahman, 1981). Electrostatics were calculated using the smooth particle mesh Ewald method (Essmann et al., 1995) with a real-space cutoff of 1.2 nm, and the van der Waals interactions were truncated at 1.2 nm with force switch smoothing between 1.0 and 1.2 nm. Covalent bonds involving hydrogen atoms were constrained using the LINCS algorithm (Hess et al., 1997), and a 2 fs integration time step was employed. Snapshots of the trajectory were saved every 100 ps. To assess whether the system was properly equilibrated, we calculated domain-specific root mean square deviations (RMSDs) of the Cα atoms following least-squares fitting (Figure 2—figure supplement 5). For all three domains tested (NTD, RBD, and HR2) in all three chains of the S protein, the RMSD reached a plateau after around 50 ns.
We also calculated RMSF profiles using 20 ns trajectory windows along the simulations. As with the RMSD, the per-residue RMSF values for all three domains converged after the first three windows (60 ns).
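A windowed RMSF calculation of this kind can be sketched in plain Python as follows. The sketch assumes that the frames have already been superimposed on a reference structure and that each frame is a list of (x, y, z) tuples. With snapshots saved every 100 ps, a 20 ns window would correspond to 200 frames. The function names and the toy trajectory are hypothetical, and the analysis in the study was performed with GROMACS rather than with this code.

```python
import math


def rmsf(frames):
    """Per-atom RMSF for frames[t][i] = (x, y, z), assumed already superimposed."""
    n_frames = len(frames)
    n_atoms = len(frames[0])
    result = []
    for i in range(n_atoms):
        # Mean position of atom i over the window.
        mean = [sum(f[i][k] for f in frames) / n_frames for k in range(3)]
        # Mean squared deviation from that mean position.
        msd = sum(
            sum((f[i][k] - mean[k]) ** 2 for k in range(3)) for f in frames
        ) / n_frames
        result.append(math.sqrt(msd))
    return result


def windowed_rmsf(frames, frames_per_window):
    """RMSF profile for each consecutive, non-overlapping window of frames."""
    last_start = len(frames) - frames_per_window
    return [
        rmsf(frames[start:start + frames_per_window])
        for start in range(0, last_start + 1, frames_per_window)
    ]


# Toy trajectory (hypothetical): atom 0 is static, atom 1 oscillates along x.
toy = [[(0.0, 0.0, 0.0), (1.0 if t % 2 else -1.0, 0.0, 0.0)] for t in range(8)]
profiles = windowed_rmsf(toy, frames_per_window=4)
print(profiles)
```

Convergence can then be judged by comparing the profiles of successive windows, for example by checking when the per-residue differences between consecutive windows stop decreasing.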
For simulations of the ACE2 receptor, the cryo-EM structure of the ACE2-B0AT1 complex in the open conformation (PDB: 6M1D) (Yan et al., 2020) was used. The ACE2-B0AT1 complex was embedded into a model membrane representing the epithelial cell membrane (Jia et al., 2005; Sampaio et al., 2011). The system was solvated with 314,442 TIP3P water molecules and 0.15 M NaCl salt (1868 Na+ and 1300 Cl- ions). Minimization, equilibration, and production runs were performed using the protocols described above. Principal component analysis and RMSF analyses were performed using GROMACS, and simulations were visualized in VMD (University of Illinois at Urbana-Champaign, USA) (Humphrey et al., 1996).
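The idea behind the principal component analysis can be illustrated with a small, dependency-free Python sketch. It builds the covariance matrix of the fitted coordinates and extracts only the leading eigenvector by power iteration. This is a substitute technique chosen for brevity and is not the procedure used by the GROMACS analysis tools, which diagonalize the full covariance matrix. The function name and the toy data are hypothetical.

```python
import math


def first_principal_component(frames, n_iter=200):
    """Leading eigenvector and eigenvalue of the coordinate covariance matrix.

    frames[t] is a flat sequence of coordinates for one (already fitted) snapshot.
    """
    n, dim = len(frames), len(frames[0])
    mean = [sum(f[j] for f in frames) / n for j in range(dim)]
    centered = [[f[j] - mean[j] for j in range(dim)] for f in frames]
    cov = [
        [sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(dim)]
        for a in range(dim)
    ]
    vec = [1.0 / math.sqrt(dim)] * dim  # arbitrary normalized start vector
    for _ in range(n_iter):
        nxt = [sum(cov[a][b] * vec[b] for b in range(dim)) for a in range(dim)]
        norm = math.sqrt(sum(x * x for x in nxt))
        if norm == 0.0:
            break  # no variance in the data
        vec = [x / norm for x in nxt]
    # Rayleigh quotient gives the variance along the converged direction.
    eigenvalue = sum(
        vec[a] * sum(cov[a][b] * vec[b] for b in range(dim)) for a in range(dim)
    )
    return vec, eigenvalue


# Toy data (hypothetical): one particle moving mainly along x.
toy_frames = [(-2.0, 0.1, 0.0), (-1.0, -0.1, 0.0), (1.0, -0.1, 0.0), (2.0, 0.1, 0.0)]
pc1, variance = first_principal_component(toy_frames)
print(pc1, variance)
```

For a real trajectory the covariance matrix has dimension 3N × 3N for N selected atoms, so a dedicated linear algebra routine is preferable to this pure-Python loop.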
Data availability
All data generated or analysed during this study are included in the manuscript and supporting files. Source data files have been provided for Figures 2, 3, 4 and 5.
References
- The proximal origin of SARS-CoV-2. Nature Medicine 26:450–452. https://doi.org/10.1038/s41591-020-0820-9
- Hosts and sources of endemic human coronaviruses. Advances in Virus Research 100:163–188. https://doi.org/10.1016/bs.aivir.2018.01.001
- A composite score for predicting errors in protein structure models. Protein Science 15:1653–1666. https://doi.org/10.1110/ps.062095806
- A smooth particle mesh Ewald method. The Journal of Chemical Physics 103:8577–8593. https://doi.org/10.1063/1.470117
- Solution structure of the severe acute respiratory syndrome-coronavirus heptad repeat 2 domain in the prefusion state. Journal of Biological Chemistry 281:11965–11971. https://doi.org/10.1074/jbc.M601174200
- Specificity of immobilized porcine pepsin in H/D exchange compatible conditions. Rapid Communications in Mass Spectrometry 22:1041–1046. https://doi.org/10.1002/rcm.3467
- LINCS: a linear constraint solver for molecular simulations. Journal of Computational Chemistry 18:1463–1472. https://doi.org/10.1002/(SICI)1096-987X(199709)18:12<1463::AID-JCC4>3.0.CO;2-H
- Protein analysis by hydrogen exchange mass spectrometry. Annual Review of Biophysics and Biomolecular Structure 32:1–25. https://doi.org/10.1146/annurev.biophys.32.110601.142417
- Canonical dynamics: equilibrium phase-space distributions. Physical Review A 31:1695–1697. https://doi.org/10.1103/PhysRevA.31.1695
- The utility of hydrogen/deuterium exchange mass spectrometry in biopharmaceutical comparability studies. Journal of Pharmaceutical Sciences 100:2071–2086. https://doi.org/10.1002/jps.22432
- CHARMM36 all-atom additive protein force field: validation based on comparison to NMR data. Journal of Computational Chemistry 34:2135–2145. https://doi.org/10.1002/jcc.23354
- VMD: visual molecular dynamics. Journal of Molecular Graphics 14:33–38. https://doi.org/10.1016/0263-7855(96)00018-5
- Coronavirus HKU1 and other coronavirus infections in Hong Kong. Journal of Clinical Microbiology 44:2063–2071. https://doi.org/10.1128/JCM.02614-05
- CHARMM-GUI input generator for NAMD, GROMACS, AMBER, OpenMM, and CHARMM/OpenMM simulations using the CHARMM36 additive force field. Journal of Chemical Theory and Computation 12:405–413. https://doi.org/10.1021/acs.jctc.5b00935
- CHARMM-GUI Membrane Builder for Complex Biological Membrane Simulations with Glycolipids and Lipoglycans. Journal of Chemical Theory and Computation 15:775–786. https://doi.org/10.1021/acs.jctc.8b01066
- Conformational changes in intact dengue virus reveal serotype-specific expansion. Nature Communications 8:14339. https://doi.org/10.1038/ncomms14339
- A molecular dynamics method for simulations in the canonical ensemble. Molecular Physics 52:255–268. https://doi.org/10.1080/00268978400101201
- Polymorphic transitions in single crystals: A new molecular dynamics method. Journal of Applied Physics 52:7182–7190. https://doi.org/10.1063/1.328693
- Stereochemistry of polypeptide chain configurations. Journal of Molecular Biology 7:95–99. https://doi.org/10.1016/S0022-2836(63)80023-6
- Comparative protein modelling by satisfaction of spatial restraints. Journal of Molecular Biology 234:779–815. https://doi.org/10.1006/jmbi.1993.1626
- Human respiratory coronavirus OC43: genetic stability and neuroinvasion. Journal of Virology 78:8824–8834. https://doi.org/10.1128/JVI.78.16.8824-8834.2004
- Epidemiology, genetic recombination, and pathogenesis of coronaviruses. Trends in Microbiology 24:490–502. https://doi.org/10.1016/j.tim.2016.03.003
- ACE2 X-ray structures reveal a large hinge-bending motion important for inhibitor binding and catalysis. Journal of Biological Chemistry 279:17996–18007. https://doi.org/10.1074/jbc.M311191200
- Lipids of the Golgi membrane. Trends in Cell Biology 8:29–33. https://doi.org/10.1016/S0962-8924(97)01196-3
- Structure of furin protease binding to SARS-CoV-2 spike glycoprotein and implications for potential targets and virulence. The Journal of Physical Chemistry Letters 11:6655–6663. https://doi.org/10.1021/acs.jpclett.0c01698
Article and author information
Funding
National Medical Research Council (WBS#R-571-000-081-213 Establishment of assays for drug screening and virus characterization of the newly emerged novel coronavirus (2019-nCoV) which is also known as the Wuhan coronavirus)
- Paul A MacAry
A*STAR Bioinformatics Institute
- Peter J Bond
National University of Singapore
- Palur V Raghuvamsi
Ministry of Education - Singapore (MOE2017-T2-A40-112)
- Nikhil K Tulsian
- Ganesh S Anand
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank Dr. Lu Gan, Dept. of Biological Sciences, National University of Singapore, and Sean Braet, Theresa Buckley and Varun Venkatakrishnan, Dept. of Chemistry, the Pennsylvania State University, for helpful discussions. Additionally, we thank reviewers and a reader for their feedback. We thank the Protein Production Platform of Nanyang Technological University for their help in making the RBD and ACE2 expression constructs and small-scale protein expression tests. HDXMS experiments were carried out as a fee for service at the Singapore National Laboratory for Mass Spectrometry (SingMass) funded by NRF, Singapore. PVR was supported by a research scholarship from the National University of Singapore, Singapore. NKT was supported by a research grant from the Ministry of Education, Singapore, awarded to GSA (MOE2017-T2-A40-112). This work was supported by BII of A*STAR. Simulations were performed on the petascale computer cluster ASPIRE-1 at the National Supercomputing Centre of Singapore (NSCC) and the A*STAR Computational Resource Centre (A*CRC).
Copyright
© 2021, Raghuvamsi et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.