Protein Evolution: Influenza evolution navigates stability valleys

  1. Mary M Rorick  Is a corresponding author
  2. Mercedes Pascual  Is a corresponding author
  1. University of Michigan, United States

An individual’s phenotype is shaped by a multitude of interactions: those among genes and those with the environment. This means that a mutation’s fitness, and likelihood of evolutionary success, is largely a function of the specific genetic background and environment in which it resides. For example, a mutation that is advantageous within one genome may be detrimental or even lethal in another. This occurs because the effects of a gene on an organism’s phenotype often depend on the actions of many other genes—a phenomenon known as epistasis. Complex epistatic interactions limit the pathways along which evolution can proceed by natural selection. Now, in eLife, Lizhi Gong, Marc Suchard and Jesse Bloom—at the Fred Hutchinson Cancer Research Center and the University of California Los Angeles—provide new insights into a remarkably simple mechanism that can explain many of the interactions that appear to be shaping the evolutionary path of a human influenza virus protein (Gong et al., 2013).

Several decades ago, John Maynard Smith and others predicted that mutation fitness effects should be highly contingent on the genomic context in which they occur, and they described these predictions using the concept of a fitness landscape. They observed that when the selective effect of a point mutation is highly dependent on a protein’s genotype, the fitness landscape is not smooth with a single peak; rather, it is rough and contains various local peaks, with valleys in between. Because natural selection avoids these valleys and instead climbs local peaks in the fitness landscape, a rough terrain implies path dependency: not all evolutionary trajectories lead to the same place. Out of this work came the important prediction that evolution proceeds in a way that is strongly constrained by the order in which mutations occur (Maynard Smith, 1970; Kimura, 1985).

Recently, it has become possible to study these processes experimentally. Data sets of genetic sequences for fast-evolving viruses sampled for many time points make it possible to recapitulate with confidence the chain of mutations that separates any two protein variants. Moreover, advances in site-directed mutagenesis allow historical protein variants to be recreated, so that their fitness can be compared with that of existing proteins.

Gong et al. use computational modelling to infer the most likely evolutionary paths between two variants of the influenza nucleoprotein: one collected in 1968 and the other in 2007. In most cases, they are also able to determine the order in which the mutations occurred. Next, they introduce each of these mutations, one at a time, into the original protein sequence. They also recreate the intermediate proteins that arose at each stage of the evolutionary path. By comparing the fitness of each mutated protein with that of the corresponding natural intermediate in which the mutation first appeared, they manage to identify three mutations that would have been deleterious for the original protein, but have no adverse impacts on the natural intermediates: such mutations are said to be epistatically constrained.

Prior studies have anecdotally demonstrated the existence of such mutations in natural protein evolutionary paths. However, Gong et al. go further by investigating nearly all naturally occurring mutations along a relatively long evolutionary path (consisting of 39 steps), and by examining the causes of these epistatic interactions. They determine the stability of each nucleoprotein variant by measuring its melting temperature, with higher melting temperatures corresponding to more stable proteins (Figure 1). They find that—with one exception—the combined effect of a group of mutations on the fitness of a protein can be explained in terms of the individual effect of each mutation on the stability of the protein.

Protein stability determines the accessibility of multiple possible evolutionary paths separating two protein variants.

The results of Gong et al. suggest that increasing the stability of a protein (y axis) above a certain threshold has no effect on fitness, which means that protein evolution can occur along multiple alternative routes. However, fitness decreases rapidly when stability drops below the threshold (red region), in which case evolution will select for mutations that increase stability. Four sites within a protein are shown, with two possible states for each site, indicated by the presence/absence of the asterisk. Single point mutations are indicated by solid arrows, double mutations are indicated by dashed arrows. Consider the case where a* and c* are mutations that help a protein to evade the immune system and, like the great majority of random mutations, they reduce the stability of the protein relative to the parent protein (abcd). However, mutations that increase stability, such as b* and d*, can compensate for mutations that reduce stability, which means that several proteins containing a* and/or c* can remain above the stability threshold required for optimal fitness.

Gong et al. show that all three of the epistatically constrained mutations reduce the stability of the protein, and that all were preceded by, or occurred almost concurrently with, one or more mutations that increase stability. This lends support to the classic models for how evolution should occur when the fitness landscape contains various valleys and peaks (Figure 1; Maynard Smith, 1970; Kimura, 1985). In other words, mutations that increase the stability of the influenza nucleoprotein enable it to tolerate other mutations that decrease its stability, and this phenomenon appears to account for most of the epistasis they observe.

The results also suggest that once the protein reaches a threshold level of stability, mutations that increase the stability further have no effect on fitness. This is in contrast to the results of a related study (DePristo et al., 2005), and it suggests that extra stability might accumulate through neutral evolutionary processes. Epistatically constrained mutations also seem to have a disproportionately large role in helping proteins escape detection by the immune system. This is suggested by the fact that the three epistatically constrained mutations in influenza nucleoprotein occur more frequently in immune system targets than do the other mutations in the evolutionary path.

The results of Gong et al. paint the following picture of nucleoprotein evolution: natural selection maintains stability above the minimum threshold required for a protein to be viable, but evolution allows for occasional mutations that increase the stability well above this threshold. While this extra stability has no immediate consequences for fitness, it does make a wide range of destabilizing mutations newly accessible to the protein, some of which could potentially contribute to immune escape. Selection will favour these variants if the stability of the protein remains above the threshold. However, if protein stability falls below the threshold, selection for stabilizing mutations will occur.

The work of Gong, Suchard and Bloom helps us move toward a better understanding of how epistasis shapes protein evolution. The insights gained from empirical studies such as these will aid the development of conceptual and computational models of evolution. This information will be particularly relevant for rapidly evolving viruses in the new field of ‘phylodynamics’, at the interface between phylogenetics and epidemiology (Grenfell et al., 2004). A tantalizing possibility is that mechanisms similar to the ones presented by Gong et al. underlie the immune escape of the protein hemagglutinin—a surface antigen for influenza subtype H3N2 and a major target of the human immune system (Koelle et al., 2006). Computational models in the field of phylodynamics have so far largely focused on mutations that influence immune escape but have little or no impact on other biological functions: there is a clear need to study the evolution of viruses while considering that each mutation may influence both the basic functioning of a protein as well as its appearance to the immune system. How evolution mediates such trade-offs is an open area of research that is informed by this study.

Gong et al. use a parent protein with the minimum stability naturally observed, and so their search for epistatically constrained mutations may be biased toward those that are destabilizing. Thus, it remains an open question whether or not other types of epistatic interactions are also important in shaping the evolutionary path of the influenza nucleoprotein, and of proteins in general.


Article and author information

Author details

  1. Mary M Rorick

    Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, United States
    For correspondence
    Competing interests
    The authors declare that no competing interests exist.
  2. Mercedes Pascual

    Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, United States
    For correspondence
    Competing interests
    The authors declare that no competing interests exist.

Publication history

  1. Version of Record published: May 14, 2013 (version 1)


© 2013, Rorick and Pascual

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.


  • 559
    Page views
  • 69
  • 1

Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Mary M Rorick
  2. Mercedes Pascual
Protein Evolution: Influenza evolution navigates stability valleys
eLife 2:e00842.
  1. Further reading

Further reading

    1. Ecology
    2. Evolutionary Biology
    Hannah J Williams, Vivek H Sridhar ... Amanda D Melin
    Review Article

    Groups of animals inhabit vastly different sensory worlds, or umwelten, which shape fundamental aspects of their behaviour. Yet the sensory ecology of species is rarely incorporated into the emerging field of collective behaviour, which studies the movements, population-level behaviours, and emergent properties of animal groups. Here, we review the contributions of sensory ecology and collective behaviour to understanding how animals move and interact within the context of their social and physical environments. Our goal is to advance and bridge these two areas of inquiry and highlight the potential for their creative integration. To achieve this goal, we organise our review around the following themes: (1) identifying the promise of integrating collective behaviour and sensory ecology; (2) defining and exploring the concept of a ‘sensory collective’; (3) considering the potential for sensory collectives to shape the evolution of sensory systems; (4) exploring examples from diverse taxa to illustrate neural circuits involved in sensing and collective behaviour; and (5) suggesting the need for creative conceptual and methodological advances to quantify ‘sensescapes’. In the final section, (6) applications to biological conservation, we argue that these topics are timely, given the ongoing anthropogenic changes to sensory stimuli (e.g. via light, sound, and chemical pollution) which are anticipated to impact animal collectives and group-level behaviour and, in turn, ecosystem composition and function. Our synthesis seeks to provide a forward-looking perspective on how sensory ecologists and collective behaviourists can both learn from and inspire one another to advance our understanding of animal behaviour, ecology, adaptation, and evolution.

    1. Evolutionary Biology
    John S Favate, Kyle S Skalenko ... Premal Shah
    Research Article

    Changes in an organism’s environment, genome, or gene expression patterns can lead to changes in its metabolism. The metabolic phenotype can be under selection and contributes to adaptation. However, the networked and convoluted nature of an organism’s metabolism makes relating mutations, metabolic changes, and effects on fitness challenging. To overcome this challenge, we use the long-term evolution experiment (LTEE) with E. coli as a model to understand how mutations can eventually affect metabolism and perhaps fitness. We used mass spectrometry to broadly survey the metabolomes of the ancestral strains and all 12 evolved lines. We combined this metabolic data with mutation and expression data to suggest how mutations that alter specific reaction pathways, such as the biosynthesis of nicotinamide adenine dinucleotide, might increase fitness in the system. Our work provides a better understanding of how mutations might affect fitness through the metabolic changes in the LTEE and thus provides a major step in developing a complete genotype–phenotype map for this experimental system.