Cell Signaling: Learning from ancestors

Applying ancestral sequence reconstruction techniques to protein kinases reveals the mutations that underlie different modes of activation.
  1. Suk ho Hong
  2. Neel H Shah  Is a corresponding author
  1. Columbia University, United States

A major goal of biological research is to understand the evolutionary histories of organisms and genes. One way to do this is to study biological entities that have become extinct, as this can provide insights into the form and function of their present-day descendants. This is perhaps best exemplified by paleogenetics and paleoproteomics research, where DNA and protein molecules from ancient biological samples are extracted and sequenced to help us better understand the evolutionary relationships between different species. Indeed, the analysis of biomolecules from our hominid ancestors has revealed significant new insights into the origins and diversity of our species (Warren, 2019). Unfortunately, DNA and proteins are degraded over time, which means that this approach can only be applied to evolutionary events from the past one million years.

Many large gene and protein families have evolved through rounds of gene duplication and functional specialization over hundreds of millions of years, which puts them beyond the reach of paleogenetics and paleoproteomics. How, then, might we use evolution to dissect the specialized properties of individual proteins in a family? One approach, called ancestral sequence reconstruction, involves using a statistical model to analyze the sequences of closely-related proteins from different organisms and generate plausible sequences for their ancestors (Hochberg and Thornton, 2017). Actual protein samples based on these sequences can then be made in the laboratory and compared to naturally occurring proteins.

Ancestral sequence reconstruction has been applied to a variety of protein families to understand, at the molecular level, how closely-related proteins have evolved distinct biochemical properties. For example, this method was previously used to examine how individual kinases – enzymes that modify other proteins through a process called phosphorylation – select different target molecules (Howard et al., 2014). Now, in eLife, the same group, led by Liam Holt at New York University (NYU) – including Dajun Sang as first author – reports on the application of ancestral sequence reconstruction to study the evolution of a subfamily of kinases called the MAP kinases (Sang et al., 2019).

Most eukaryotic organisms have hundreds of different protein kinases – humans have over 500 (Manning et al., 2002) – and many kinases need to be phosphorylated themselves in order to become active (Nolen et al., 2004). Some kinases can phosphorylate and activate themselves, through a process called autophosphorylation, while others are dependent on another kinase to be phosphorylated. Many members of the MAP kinase family autophosphorylate, but ERK1 and ERK2 (referred to as ERK1/2) cannot autophosphorylate efficiently. The molecular characteristics that prevent them from doing so were previously unknown.

Sang et al. compiled MAP kinase sequences from a variety of organisms and used ancestral reconstruction to predict the sequences of their common ancestors (Sang et al., 2019). They used standard biochemical techniques to produce proteins with the predicted sequences, and showed that the predicted common ancestor of ERK1/2 could not autophosphorylate efficiently, whereas other ancestral MAP kinases could (Figure 1). Two mutations were found when the sequence of the ERK1/2 common ancestor was compared to the sequences of the other ancestral MAP kinases. One was an amino acid substitution near the spine connecting different regions of the protein; the other was an amino acid deletion that shortened a flexible loop near the catalytic cleft in the kinases. Together, these two mutations suppress the ability of ERK1/2 and their common ancestor to autophosphorylate (Figure 1). Notably, Sang et al. – who are based at Memorial Sloan Kettering, the Icahn School of Medicine and Yale – also showed that reinserting the deleted amino acid in the flexible loop in human ERK1 relieved its dependence on other kinases for its activation in cells.

The evolution of different regulatory properties in MAP kinases.

A mock phylogenetic tree (left) shows the evolution of ERK1/2 and other MAP kinases. ERK1/2, and their common ancestor (dark blue) cannot efficiently activate themselves through autophosphorylation. More ancient ancestors in the MAP kinase family (light blue) are capable of efficient autophosphorylation. A cartoon diagram (right) highlights the structural properties that differentiate MAP kinases that are capable of autophosphorylation (light blue) from those that cannot autophosphorylate themselves efficiently (dark blue). All protein kinases have a two-lobe structure with a catalytic cleft in the middle. Different parts of the kinase are connected by a spine. The loop in front of the catalytic cleft has to shift position for the enzyme to become active. This is driven by phosphorylation of that loop, either by another kinase or through autophosphorylation (shown as pink residues in the inactive form of the enzyme becoming red residues in the active form, with a concomitant change in the shape of the loop). Sang et al. have identified two mutations that could explain why ERK1/2 and their common ancestor (bottom right, dark blue) are different from other MAP kinases (top right, light blue): i) they have a polar amino acid (yellow, bottom) rather than a hydrophobic amino acid (orange, top) at a site near the spine of the kinase; ii) a loop above the catalytic cleft is one amino acid shorter than in other MAP kinases. It is thought that these two mutations disrupt the geometry and flexibility of the catalytic cleft, altering the ability of the kinase to autophosphorylate.

To explain how these evolutionary sequence alterations resulted in a change in autophosphorylation ability, Sang et al. performed computer simulations of the internal motions of ERK2, with and without the ancestral insertion and substitution. These simulations revealed that the overall flexibility of ERK2 increased when it had ancestor-like sequence features. Sang et al. postulate that increased flexibility in the mutant kinase allows it to more readily adopt a shape compatible with autophosphorylation.

The two mutations reported in the latest work have intriguing implications for kinases in general. Alterations to the flexible loop have been observed in cancer-associated variants of several distantly related kinases (BRAF, HER2, and EGFR; Foster et al., 2016), and mutations at the spine-proximal position are associated with excessive activation and drug resistance in a variety of kinases (Azam et al., 2008). Although ERK1/2 are intimately embedded within oncogenic signaling pathways, mutations at these positions have not been found in those kinases in human cancers. Further analysis of kinase evolutionary history, juxtaposed with cancer genome sequencing, is likely to reveal other conserved mutational hotspots that have facilitated the evolution of divergent properties across protein kinases.


Article and author information

Author details

  1. Suk ho Hong

    Suk ho Hong is in the Department of Chemistry, Columbia University, New York, United States

    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-6024-0685
  2. Neel H Shah

    Neel H Shah is in the Department of Chemistry, Columbia University, New York, United States

    For correspondence
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-1186-0626

Publication history

  1. Version of Record published: August 13, 2019 (version 1)


© 2019, Hong and Shah

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.


  • 1,603
    Page views
  • 137
  • 0

Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Suk ho Hong
  2. Neel H Shah
Cell Signaling: Learning from ancestors
eLife 8:e49976.
  1. Further reading

Further reading

    1. Biochemistry and Chemical Biology
    2. Epidemiology and Global Health
    Takashi Sasaki, Yoshinori Nishimoto ... Yasumichi Arai
    Research Article

    Background: High levels of circulating adiponectin are associated with increased insulin sensitivity, low prevalence of diabetes, and low body mass index (BMI); however, high levels of circulating adiponectin are also associated with increased mortality in the 60-70 age group. In this study, we aimed to clarify factors associated with circulating high-molecular-weight (cHMW) adiponectin levels and their association with mortality in the very old (85-89 years old) and centenarians.

    Methods: The study included 812 (women: 84.4%) for centenarians and 1,498 (women: 51.7%) for the very old. The genomic DNA sequence data were obtained by whole genome sequencing or DNA microarray-imputation methods. LASSO and multivariate regression analyses were used to evaluate cHMW adiponectin characteristics and associated factors. All-cause mortality was analyzed in three quantile groups of cHMW adiponectin levels using Cox regression.

    Results: The cHMW adiponectin levels were increased significantly beyond 100 years of age, were negatively associated with diabetes prevalence, and were associated with SNVs in CDH13 (p = 2.21 × 10-22) and ADIPOQ (p = 5.72 × 10-7). Multivariate regression analysis revealed that genetic variants, BMI, and high-density lipoprotein cholesterol (HDLC) were the main factors associated with cHMW adiponectin levels in the very old, whereas the BMI showed no association in centenarians. The hazard ratios for all-cause mortality in the intermediate and high cHMW adiponectin groups in very old men were significantly higher rather than those for all-cause mortality in the low level cHMW adiponectin group, even after adjustment with BMI. In contrast, the hazard ratios for all-cause mortality were significantly higher for high cHMW adiponectin groups in very old women, but were not significant after adjustment with BMI.

    Conclusions: cHMW adiponectin levels increased with age until centenarians, and the contribution of known major factors associated with cHMW adiponectin levels, including BMI and HDLC, varies with age, suggesting that its physiological significance also varies with age in the oldest old.

    Funding: This study was supported by grants from the Ministry of Health, Welfare, and Labour for the Scientific Research Projects for Longevity; a Grant-in-Aid for Scientific Research (No 21590775, 24590898, 15KT0009, 18H03055, 20K20409, 20K07792, 23H03337) from the Japan Society for the Promotion of Science; Keio University Global Research Institute (KGRI), Kanagawa Institute of Industrial Science and Technology (KISTEC), Japan Science and Technology Agency (JST) Research Complex Program 'Tonomachi Research Complex' Wellbeing Research Campus: Creating new values through technological and social innovation (JP15667051), the Program for an Integrated Database of Clinical and Genomic Information from the Japan Agency for Medical Research and Development (No. 16kk0205009h001, 17jm0210051h0001, 19dk0207045h0001); the medical-welfare-food-agriculture collaborative consortium project from the Japan Ministry of Agriculture, Forestry, and Fisheries; and the Biobank Japan Program from the Ministry of Education, Culture, Sports, and Technology.

    1. Biochemistry and Chemical Biology
    2. Structural Biology and Molecular Biophysics
    Nina Gubensäk, Theo Sagmeister ... Tea Pavkov-Keller
    Research Article

    The seventh pandemic of the diarrheal cholera disease, which began in 1960, is caused by the Gram-negative bacterium Vibrio cholerae. Its environmental persistence provoking recurring sudden outbreaks is enabled by V. cholerae's rapid adaption to changing environments involving sensory proteins like ToxR and ToxS. Located at the inner membrane, ToxR and ToxS react to environmental stimuli like bile acid, thereby inducing survival strategies e.g. bile resistance and virulence regulation. The presented crystal structure of the sensory domains of ToxR and ToxS in combination with multiple bile acid interaction studies, reveals that a bile binding pocket of ToxS is only properly folded upon binding to ToxR. Our data proposes an interdependent functionality between ToxR transcriptional activity and ToxS sensory function. These findings support the previously suggested link between ToxRS and VtrAC-like co-component systems. Besides VtrAC, ToxRS is now the only experimentally determined structure within this recently defined superfamily, further emphasizing its significance. In-depth analysis of the ToxRS complex reveals its remarkable conservation across various Vibrio species, underlining the significance of conserved residues in the ToxS barrel and the more diverse ToxR sensory domain. Unravelling the intricate mechanisms governing ToxRS's environmental sensing capabilities, provides a promising tool for disruption of this vital interaction, ultimately inhibiting Vibrio's survival and virulence. Our findings hold far-reaching implications for all Vibrio strains that rely on the ToxRS system as a shared sensory cornerstone for adapting to their surroundings.