Seasonal Influenza: The challenges of vaccine strain selection

New measures of influenza virus fitness could improve vaccine strain selection through more accurate forecasts of the evolution of the virus.
  1. Amanda C Perofsky  Is a corresponding author
  2. Martha I Nelson  Is a corresponding author
  1. Division of International Epidemiology and Population Studies, Fogarty International Center, National Institutes of Health, United States

Scientists have known since the 1940s that influenza vaccines that perform well one year can be rendered ineffective after the influenza virus mutates. However, despite decades of investment in global surveillance, pathogen sequencing technologies and basic research (Figure 1), vaccines for seasonal influenza have the lowest and most variable performance of any vaccine licensed for use in the United States (CDC, 2016). Now, in eLife, John Huddleston of the Fred Hutchinson Cancer Research Center (FHCRC) and the University of Washington, Trevor Bedford of the FHCRC, and colleagues in the United States, United Kingdom, Japan, Australia and Switzerland present an open-source framework that synthesizes a decade’s worth of innovations in bioinformatics and technology to advance data-driven vaccine design (Huddleston et al., 2020).

Advances in influenza research and vaccine effectiveness (for A/H3N2) from the 2004/05 flu season onwards.

The effectiveness of vaccines for seasonal influenza (A/H3N2) is highly variable and has been less than 10% in some years (teal bars). The H3N2 vaccine strain is shown for seasons when it was changed from the previous season. Black stars indicate seasons where the vaccine strain mismatched circulating H3N2 viruses (; yellow stars indicate seasons in which H3N2 vaccine strains acquired mutations during passage in eggs. Research advances are listed at the top of the figures and are color coded as follows: surveillance in red; experimental approaches in blue; sequencing approaches in purple; computational approaches in green. Advances in understanding the structure of hemagglutinin (Knossow et al., 1984; Wiley and Skehel, 1987) and predicting the evolution of H3 (Bush et al., 1999) occurred before the period shown in the figure. Point estimates of vaccine effectiveness are taken from the following references: Skowronski et al., 2005 (04/05); Skowronski et al., 2007 (05/06); Skowronski et al., 2009 (06/07); Belongia et al., 2011 (07/08); Skowronski et al., 2010 (08/09); Treanor et al., 2012 (10/11); Ohmit et al., 2014 (11/12); McLean et al., 2015 (12/13);; Gaglani et al., 2016 (13/14); Flannery et al., 2016 (14/15); Jackson et al., 2017 (15/16); Flannery et al., 2019 (16/17); Rolfes et al., 2019 (17/18); Flannery et al., 2020 (18/19); estimates were not available during the 2009/10 A/H1N1 pandemic.

Influenza A and influenza B viruses cause seasonal epidemics every winter. Seasonal influenza A viruses include two different subtypes, H1N1 and H3N2, where H and N (short for hemagglutinin and neuraminidase) are proteins found on the surface of the virus. The human immune system protects the body against influenza infection by producing antibodies that can recognize these proteins. However, the influenza virus mutates frequently, including at sites that affect the immune system's ability to detect the virus. This process – called 'antigenic drift' – helps the virus infect new hosts and spread in populations that previously had immunity to influenza. Indeed, antigenic drift can lead to new strains of the virus that completely displace the currently circulating strains in a matter of months.

To keep pace with antigenic drift, the composition of influenza vaccines must be updated continually. Influenza vaccines contain three or four components that protect against various strains representing the different subtypes. Scientists convene twice a year at the World Health Organization (WHO) to predict which strains will have the highest fitness and therefore dominate the next year's flu season. H3N2 viruses evolve particularly fast and unpredictably compared to other seasonal flu viruses. Because the composition of the vaccine has to be decided a year in advance to allow doses to be manufactured, H3N2 vaccine strains have failed to match naturally circulating strains in six of the past fifteen flu seasons (Figure 1).

For decades, vaccine strain selection has been primarily informed by data from 1950s-era serological assays, which provide a phenotypic measure of how immune systems exposed to recently circulating viruses would see a novel strain. However, the assays have certain disadvantages – they are labor intensive, inconsistent across labs, not publicly available, and difficult to interpret or scale up. This means that these phenotypic measures are only available for a small subset of viruses. To remedy this issue, Huddleston et al. use a phylogenetic model (which includes available serological data and sequence data as inputs) to make predictions for the thousands of strains for which serological information is not available (Bedford et al., 2014; Neher et al., 2016; Smith et al., 2004).

Huddleston et al. compare how antigenic phenotypes from serological assays perform against five newer measures of virus fitness in forecasting future H3N2 virus populations, and find that two of their models provide better forecasts than WHO vaccine strain selections. Moreover, they have now integrated their forecasts for H3N2 into, an open-source platform that scientists and policymakers use to track the real-time evolution of a wide range of pathogens (Hadfield et al., 2018; Neher and Bedford, 2015). Nextstrain provides a platform to make influenza vaccine strain selection more data-driven, systematic and transparent, and to allow new forecasting methods to be integrated as they show promise.

How does one predict the fitness of an influenza virus? Most mutations are harmful for influenza viruses, except for a subset of beneficial mutations that lead to antigenic drift. For decades researchers have relied on a list of sites in the genome where seemingly beneficial mutations occur to measure antigenic drift and viral fitness (Bedford et al., 2014; Bush et al., 1999; Shih et al., 2007). However, Huddleston et al. find that serological assays (Neher et al., 2016) continue to be more useful than sequence-onlybased measures when making forecasts of future virus populations. Measures of viral fitness based on genetic sequences could not accurately predict H3N2 evolution in recent years due to the emergence of multiple co-circulating strains and the sudden decline of a dominant strain in 2019. While no method predicts the right vaccine strain every time, serology-based methods appear to outperform other approaches.

Over time, alternative approaches to measuring virus fitness will continue to be refined and may become integrated into vaccine strain selection. For example, Huddleston et al. could not include a new serological assay based on virus neutralization in their framework as data from this assay were only available over a short period of time, but it could be integrated as data accrue. Other incremental improvements could be beneficial when used in combination with serological data. For example, how fast a strain is spreading globally can be measured from branching patterns in the phylogenetic tree (Neher et al., 2014). 'Mutational load' (that is, the total number of mutations in sites unrelated to immune detection) provides a simple inverse measure of viral fitness (Luksza and Lässig, 2014), while a technique called deep mutational scanning measures whether experimentally induced mutations have beneficial or harmful effects (Lee et al., 2018), However, as with other sequence-based approaches, the fact that mutations have different effects in different genetic backgrounds may be a disadvantage.

Going forward, the COVID-19 pandemic could disrupt the ecology of flu viruses in the years ahead, and it will be interesting to observe how predictive models fare in a highly perturbed system with no historical precedent. SARS-CoV-2 viruses may also experience post-pandemic strain turnover that requires periodic updates to any COVID-19 vaccine, and it should be possible to adapt platforms built for influenza forecasting to make forecasts for SARS-CoV-2 and other pathogens.


Disclaimer: The conclusions of this study do not necessarily represent the views of the NIH or the US government.


    1. Skowronski D
    2. Gilbert M
    3. Tweed S
    4. Petric M
    5. Li Y
    6. Mak A
    7. McNabb G
    8. De Serres G
    Effectiveness of vaccine against medical consultation due to laboratory-confirmed influenza: results from a sentinel physician pilot project in british Columbia, 2004-2005
    Canada Communicable Disease Report 31:161–168.

Article and author information

Author details

  1. Amanda C Perofsky

    Amanda C Perofsky is in the Division of International Epidemiology and Population Studies, Fogarty International Center, National Institutes of Health, Bethesda, United States

    For correspondence
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-7341-9193
  2. Martha I Nelson

    Martha I Nelson is in the Division of International Epidemiology and Population Studies, Fogarty International Center, National Institutes of Health, Bethesda, United States

    For correspondence
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-4814-0179

Publication history

  1. Version of Record published: October 13, 2020 (version 1)


This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.


  • 1,832
    Page views
  • 156
  • 4

Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Amanda C Perofsky
  2. Martha I Nelson
Seasonal Influenza: The challenges of vaccine strain selection
eLife 9:e62955.

Further reading

    1. Evolutionary Biology
    Paul C Sereno, Nathan Myhrvold ... Lauren L Conroy
    Research Article

    A predominantly fish-eating diet was envisioned for the sail-backed theropod dinosaur Spinosaurus aegyptiacus when its elongate jaws with subconical teeth were unearthed a century ago in Egypt. Recent discovery of the high-spined tail of that skeleton, however, led to a bolder conjecture that S. aegyptiacus was the first fully aquatic dinosaur. The ‘aquatic hypothesis’ posits that S. aegyptiacus was a slow quadruped on land but a capable pursuit predator in coastal waters, powered by an expanded tail. We test these functional claims with skeletal and flesh models of S. aegyptiacus. We assembled a CT-based skeletal reconstruction based on the fossils, to which we added internal air and muscle to create a posable flesh model. That model shows that on land S. aegyptiacus was bipedal and in deep water was an unstable, slow-surface swimmer (<1 m/s) too buoyant to dive. Living reptiles with similar spine-supported sails over trunk and tail are used for display rather than aquatic propulsion, and nearly all extant secondary swimmers have reduced limbs and fleshy tail flukes. New fossils also show that Spinosaurus ranged far inland. Two stages are clarified in the evolution of Spinosaurus, which is best understood as a semiaquatic bipedal ambush piscivore that frequented the margins of coastal and inland waterways.

    1. Computational and Systems Biology
    2. Evolutionary Biology
    Guillem Santamaria, Chen Liao ... Joao B Xavier
    Research Article Updated

    Microbes have disproportionate impacts on the macroscopic world. This is in part due to their ability to grow to large populations that collectively secrete massive amounts of secondary metabolites and alter their environment. Yet, the conditions favoring secondary metabolism despite the potential costs for primary metabolism remain unclear. Here we investigated the biosurfactants that the bacterium Pseudomonas aeruginosa makes and secretes to decrease the surface tension of surrounding liquid. Using a combination of genomics, metabolomics, transcriptomics, and mathematical modeling we show that the ability to make surfactants from glycerol varies inconsistently across the phylogenetic tree; instead, lineages that lost this ability are also worse at reducing the oxidative stress of primary metabolism on glycerol. Experiments with different carbon sources support a link with oxidative stress that explains the inconsistent distribution across the P. aeruginosa phylogeny and suggests a general principle: P. aeruginosa lineages produce surfactants if they can reduce the oxidative stress produced by primary metabolism and have excess resources, beyond their primary needs, to afford secondary metabolism. These results add a new layer to the regulation of a secondary metabolite unessential for primary metabolism but important to change physical properties of the environments surrounding bacterial populations.