Coevolution-based prediction of key allosteric residues for protein function regulation

  1. Juan Xie
  2. Weilin Zhang
  3. Xiaolei Zhu
  4. Minghua Deng
  5. Luhua Lai (corresponding author)
  1. Peking University, China
  2. Anhui Agricultural University, China

Abstract

Allostery is fundamental to many biological processes. Because allosteric regulation acts at a distance, it is difficult to predict how allosteric mutations, modifications and effector binding affect protein function. In protein engineering, remote mutations cannot be rationally designed without large-scale experimental screening. Allosteric drugs have attracted much attention because of their high specificity and their potential to overcome existing drug-resistant mutations. However, optimizing allosteric compounds remains challenging. Here, we developed a novel computational method, KeyAlloSite, to predict allosteric sites and to identify key allosteric residues (allo-residues) based on the evolutionary coupling model. We found that protein allosteric sites are more strongly coupled to the orthosteric site than non-functional sites are. We further inferred key allo-residues by pairwise comparison of the differences in evolutionary coupling scores between each residue in the allosteric pocket and the functional site. Our predicted key allo-residues agree with previous experimental studies of typical allosteric proteins such as BCR-ABL1, Tar and PDZ3, as well as with key cancer mutations. We also showed that KeyAlloSite can predict key allosteric residues that are distant from the catalytic site yet important for enzyme catalysis. Our study demonstrates that weak coevolutionary couplings contain important information about the allosteric regulation of protein function. KeyAlloSite can be applied to studying the evolution of protein allosteric regulation, designing and optimizing allosteric drugs, and performing functional protein design and enzyme engineering.
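
The idea of ranking pocket residues by their evolutionary coupling to the functional site can be illustrated with a minimal sketch. This is not the KeyAlloSite implementation: the abstract does not specify the scoring rule, so the mean-coupling score, the pairwise "wins" count, and the function names `pocket_coupling_strength` and `rank_pocket_residues` are all assumptions made for illustration. The input `ec` is assumed to be a symmetric residue-by-residue matrix of evolutionary coupling scores computed elsewhere.

```python
import numpy as np


def pocket_coupling_strength(ec, pocket, orthosteric):
    """Mean coupling of each pocket residue to the orthosteric residues.

    Assumption: averaging is used here for illustration only; the
    published method may aggregate couplings differently.
    """
    ec = np.asarray(ec, dtype=float)
    # Sub-matrix with rows = pocket residues, columns = orthosteric residues.
    return ec[np.ix_(pocket, orthosteric)].mean(axis=1)


def rank_pocket_residues(ec, pocket, orthosteric):
    """Rank pocket residues by how many other pocket residues they out-score.

    Hypothetical scoring rule: a residue "wins" a pairwise comparison when
    its mean coupling to the orthosteric site is strictly larger.
    """
    strength = pocket_coupling_strength(ec, pocket, orthosteric)
    # diff[i, j] = strength[i] - strength[j] for every pair of pocket residues.
    diff = strength[:, None] - strength[None, :]
    wins = (diff > 0).sum(axis=1)
    # Stable sort keeps the input order for ties.
    order = np.argsort(-wins, kind="stable")
    return [(pocket[i], float(strength[i]), int(wins[i])) for i in order]


# Toy data (invented for illustration): 6 residues, pocket = 0..2,
# orthosteric site = residues 4 and 5.
toy_ec = np.zeros((6, 6))
for (i, j), score in {(0, 4): 0.1, (0, 5): 0.1,
                      (1, 4): 0.5, (1, 5): 0.7,
                      (2, 4): 0.2, (2, 5): 0.2}.items():
    toy_ec[i, j] = toy_ec[j, i] = score

ranking = rank_pocket_residues(toy_ec, pocket=[0, 1, 2], orthosteric=[4, 5])
for residue, strength, wins in ranking:
    print(residue, round(strength, 3), wins)
```

In this toy case the residue with the largest mean coupling to the orthosteric site is ranked first. A real analysis would start from coupling scores inferred from a multiple sequence alignment and would need a statistical criterion for calling a residue "key".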

Data availability

All data that support the results of this study are included in the manuscript, supplementary files, and the GitHub repository (https://github.com/huilan1210/KeyAlloSite). Source Data files have been provided for all figures except Figure 6 and Figure 1-figure supplement 1.

The following previously published data sets were used

Article and author information

Author details

  1. Juan Xie

    Center for Quantitative Biology, Peking University, Beijing, China
    Competing interests
    The authors declare that no competing interests exist.
  2. Weilin Zhang

    College of Chemistry and Molecular Engineering, Peking University, Beijing, China
  3. Xiaolei Zhu

    School of Sciences, Anhui Agricultural University, Hefei, China
  4. Minghua Deng

    Center for Quantitative Biology, Peking University, Beijing, China
  5. Luhua Lai

    College of Chemistry and Molecular Engineering, Peking University, Beijing, China
    For correspondence
    lhlai@pku.edu.cn
    ORCID: 0000-0002-8343-7587

Funding

National Key R&D Program of China (2022YFA1303700)

  • Luhua Lai

National Natural Science Foundation of China (21633001, 22237002)

  • Luhua Lai

Chinese Academy of Medical Sciences (2021-I2M-5-014)

  • Luhua Lai

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. Shozeb Haider, University College London, United Kingdom

Version history

  1. Received: July 13, 2022
  2. Preprint posted: July 26, 2022
  3. Accepted: February 16, 2023
  4. Accepted Manuscript published: February 17, 2023 (version 1)
  5. Version of Record published: March 2, 2023 (version 2)

Copyright

© 2023, Xie et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Cite this article

Juan Xie, Weilin Zhang, Xiaolei Zhu, Minghua Deng, Luhua Lai (2023)
Coevolution-based prediction of key allosteric residues for protein function regulation
eLife 12:e81850.
https://doi.org/10.7554/eLife.81850
