Comprehensive fitness landscape of SARS-CoV-2 Mpro reveals insights into viral resistance mechanisms
Abstract
With the continual evolution of new strains of SARS-CoV-2 that are more virulent, transmissible, and able to evade current vaccines, there is an urgent need for effective anti-viral drugs SARS-CoV-2 main protease (Mpro) is a leading target for drug design due to its conserved and indispensable role in the viral life cycle. Drugs targeting Mpro appear promising but will elicit selection pressure for resistance. To understand resistance potential in Mpro, we performed a comprehensive mutational scan of the protease that analyzed the function of all possible single amino acid changes. We developed three separate high-throughput assays of Mpro function in yeast, based on either the ability of Mpro variants to cleave at a defined cut-site or on the toxicity of their expression to yeast. We used deep sequencing to quantify the functional effects of each variant in each screen. The protein fitness landscapes from all three screens were strongly correlated, indicating that they captured the biophysical properties critical to Mpro function. The fitness landscapes revealed a non-active site location on the surface that is extremely sensitive to mutation making it a favorable location to target with inhibitors. In addition, we found a network of critical amino acids that physically bridge the two active sites of the Mpro dimer. The clinical variants of Mpro were predominantly functional in our screens, indicating that Mpro is under strong selection pressure in the human population. Our results provide predictions of mutations that will be readily accessible to Mpro evolution and that are likely to contribute to drug resistance. This complete mutational guide of Mpro can be used in the design of inhibitors with reduced potential of evolving viral resistance.
Data availability
Next generation sequencing data has been deposited to the NCBI short read archive (PRJNA842255). Tabulated raw counts of all variants in all replicates are included in Figure 2 - source data 1. Figure 2 - source data 1, Figure 4 - source data 1, Figure 4 - source data 2, and Figure 5 - source data 1 contain the data used to generate all the figures.
-
Comprehensive fitness landscape of SARS-CoV-2 Mpro in S. cerevisiae - raw sequence readsNCBI Short Read Archive, PRJNA842255.
Article and author information
Author details
Funding
Novartis Institutes for BioMedical Research
- Julia M Flynn
- Neha Samant
- Gily Schneider-Nachum
- Nese Kurt Yilmaz
- Celia A Schiffer
- Daniel NA Bolon
DTB, SAM, and DD are employees of Novartis Institutes for Biomedical Research and were involved in study design, data interpretation, and preparation of this manuscript.
Reviewing Editor
- C Brandon Ogbunugafor, Yale University, United States
Version history
- Preprint posted: January 26, 2022 (view preprint)
- Received: January 28, 2022
- Accepted: June 17, 2022
- Accepted Manuscript published: June 20, 2022 (version 1)
- Version of Record published: July 26, 2022 (version 2)
Copyright
© 2022, Flynn et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 2,252
- Page views
-
- 495
- Downloads
-
- 28
- Citations
Article citation count generated by polling the highest count across the following sources: PubMed Central, Crossref, Scopus.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Epidemiology and Global Health
- Medicine
- Microbiology and Infectious Disease
eLife has published the following articles on SARS-CoV-2 and COVID-19.
-
- Biochemistry and Chemical Biology
- Evolutionary Biology
Evolution can tinker with multi-protein machines and replace them with simpler single-protein systems performing equivalent functions in an equally efficient manner. It is unclear how, on a molecular level, such simplification can arise. With ancestral reconstruction and biochemical analysis, we have traced the evolution of bacterial small heat shock proteins (sHsp), which help to refold proteins from aggregates using either two proteins with different functions (IbpA and IbpB) or a secondarily single sHsp that performs both functions in an equally efficient way. Secondarily single sHsp evolved from IbpA, an ancestor specialized in strong substrate binding. Evolution of an intermolecular binding site drove the alteration of substrate binding properties, as well as the formation of higher-order oligomers. Upon two mutations in the α-crystallin domain, secondarily single sHsp interacts with aggregated substrates less tightly. Paradoxically, less efficient binding positively influences the ability of sHsp to stimulate substrate refolding, since the dissociation of sHps from aggregates is required to initiate Hsp70-Hsp100-dependent substrate refolding. After the loss of a partner, IbpA took over its role in facilitating the sHsp dissociation from an aggregate by weakening the interaction with the substrate, which became beneficial for the refolding process. We show that the same two amino acids introduced in modern-day systems define whether the IbpA acts as a single sHsp or obligatorily cooperates with an IbpB partner. Our discoveries illuminate how one sequence has evolved to encode functions previously performed by two distinct proteins.