Seasonal Influenza: The challenges of vaccine strain selection
Scientists have known since the 1940s that influenza vaccines that perform well one year can be rendered ineffective after the influenza virus mutates. However, despite decades of investment in global surveillance, pathogen sequencing technologies and basic research (Figure 1), vaccines for seasonal influenza have the lowest and most variable performance of any vaccine licensed for use in the United States (CDC, 2016). Now, in eLife, John Huddleston of the Fred Hutchinson Cancer Research Center (FHCRC) and the University of Washington, Trevor Bedford of the FHCRC, and colleagues in the United States, United Kingdom, Japan, Australia and Switzerland present an open-source framework that synthesizes a decade’s worth of innovations in bioinformatics and technology to advance data-driven vaccine design (Huddleston et al., 2020).
Influenza A and influenza B viruses cause seasonal epidemics every winter. Seasonal influenza A viruses include two different subtypes, H1N1 and H3N2, where H and N (short for hemagglutinin and neuraminidase) are proteins found on the surface of the virus. The human immune system protects the body against influenza infection by producing antibodies that can recognize these proteins. However, the influenza virus mutates frequently, including at sites that affect the immune system's ability to detect the virus. This process – called 'antigenic drift' – helps the virus infect new hosts and spread in populations that previously had immunity to influenza. Indeed, antigenic drift can lead to new strains of the virus that completely displace the currently circulating strains in a matter of months.
To keep pace with antigenic drift, the composition of influenza vaccines must be updated continually. Influenza vaccines contain three or four components that protect against various strains representing the different subtypes. Scientists convene twice a year at the World Health Organization (WHO) to predict which strains will have the highest fitness and therefore dominate the next year's flu season. H3N2 viruses evolve particularly fast and unpredictably compared to other seasonal flu viruses. Because the composition of the vaccine has to be decided a year in advance to allow doses to be manufactured, H3N2 vaccine strains have failed to match naturally circulating strains in six of the past fifteen flu seasons (Figure 1).
For decades, vaccine strain selection has been primarily informed by data from 1950s-era serological assays, which provide a phenotypic measure of how immune systems exposed to recently circulating viruses would see a novel strain. However, the assays have certain disadvantages – they are labor intensive, inconsistent across labs, not publicly available, and difficult to interpret or scale up. This means that these phenotypic measures are only available for a small subset of viruses. To remedy this issue, Huddleston et al. use a phylogenetic model (which includes available serological data and sequence data as inputs) to make predictions for the thousands of strains for which serological information is not available (Bedford et al., 2014; Neher et al., 2016; Smith et al., 2004).
Huddleston et al. compare how antigenic phenotypes from serological assays perform against five newer measures of virus fitness in forecasting future H3N2 virus populations, and find that two of their models provide better forecasts than WHO vaccine strain selections. Moreover, they have now integrated their forecasts for H3N2 into nextstrain.org, an open-source platform that scientists and policymakers use to track the real-time evolution of a wide range of pathogens (Hadfield et al., 2018; Neher and Bedford, 2015). Nextstrain provides a platform to make influenza vaccine strain selection more data-driven, systematic and transparent, and to allow new forecasting methods to be integrated as they show promise.
How does one predict the fitness of an influenza virus? Most mutations are harmful for influenza viruses, except for a subset of beneficial mutations that lead to antigenic drift. For decades researchers have relied on a list of sites in the genome where seemingly beneficial mutations occur to measure antigenic drift and viral fitness (Bedford et al., 2014; Bush et al., 1999; Shih et al., 2007). However, Huddleston et al. find that serological assays (Neher et al., 2016) continue to be more useful than sequence-onlybased measures when making forecasts of future virus populations. Measures of viral fitness based on genetic sequences could not accurately predict H3N2 evolution in recent years due to the emergence of multiple co-circulating strains and the sudden decline of a dominant strain in 2019. While no method predicts the right vaccine strain every time, serology-based methods appear to outperform other approaches.
Over time, alternative approaches to measuring virus fitness will continue to be refined and may become integrated into vaccine strain selection. For example, Huddleston et al. could not include a new serological assay based on virus neutralization in their framework as data from this assay were only available over a short period of time, but it could be integrated as data accrue. Other incremental improvements could be beneficial when used in combination with serological data. For example, how fast a strain is spreading globally can be measured from branching patterns in the phylogenetic tree (Neher et al., 2014). 'Mutational load' (that is, the total number of mutations in sites unrelated to immune detection) provides a simple inverse measure of viral fitness (Luksza and Lässig, 2014), while a technique called deep mutational scanning measures whether experimentally induced mutations have beneficial or harmful effects (Lee et al., 2018), However, as with other sequence-based approaches, the fact that mutations have different effects in different genetic backgrounds may be a disadvantage.
Going forward, the COVID-19 pandemic could disrupt the ecology of flu viruses in the years ahead, and it will be interesting to observe how predictive models fare in a highly perturbed system with no historical precedent. SARS-CoV-2 viruses may also experience post-pandemic strain turnover that requires periodic updates to any COVID-19 vaccine, and it should be possible to adapt platforms built for influenza forecasting to make forecasts for SARS-CoV-2 and other pathogens.
Note
Disclaimer: The conclusions of this study do not necessarily represent the views of the NIH or the US government.
References
-
Enhanced genetic characterization of influenza A(H3N2) viruses and vaccine effectiveness by genetic group, 2014-2015Journal of Infectious Diseases 214:1010–1019.https://doi.org/10.1093/infdis/jiw181
-
Influenza vaccine effectiveness in the United States during the 2016-2017 seasonClinical Infectious Diseases 68:1798–1806.https://doi.org/10.1093/cid/ciy775
-
Nextstrain: real-time tracking of pathogen evolutionBioinformatics 34:4121–4123.https://doi.org/10.1093/bioinformatics/bty407
-
Influenza vaccine effectiveness in the United States during the 2015-2016 seasonNew England Journal of Medicine 377:534–543.https://doi.org/10.1056/NEJMoa1700153
-
Influenza vaccine effectiveness in the United States during 2012-2013: variable protection by age and virus typeJournal of Infectious Diseases 211:1529–1540.https://doi.org/10.1093/infdis/jiu647
-
Effects of influenza vaccination in the United States during the 2017-2018 influenza seasonClinical Infectious Diseases 69:1845–1853.https://doi.org/10.1093/cid/ciz075
-
Effectiveness of vaccine against medical consultation due to laboratory-confirmed influenza: results from a sentinel physician pilot project in british Columbia, 2004-2005Canada Communicable Disease Report 31:161–168.
-
Component-specific effectiveness of trivalent influenza vaccine as monitored through a sentinel surveillance network in Canada, 2006-2007The Journal of Infectious Diseases 199:168–179.https://doi.org/10.1086/595862
-
The structure and function of the hemagglutinin membrane glycoprotein of influenza virusAnnual Review of Biochemistry 56:365–394.https://doi.org/10.1146/annurev.bi.56.070187.002053
Article and author information
Author details
Publication history
Copyright
This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Metrics
-
- 2,417
- views
-
- 189
- downloads
-
- 6
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Evolutionary Biology
With the availability of high-quality full genome polymorphism (SNPs) data, it becomes feasible to study the past demographic and selective history of populations in exquisite detail. However, such inferences still suffer from a lack of statistical resolution for recent, for example bottlenecks, events, and/or for populations with small nucleotide diversity. Additional heritable (epi)genetic markers, such as indels, transposable elements, microsatellites, or cytosine methylation, may provide further, yet untapped, information on the recent past population history. We extend the Sequential Markovian Coalescent (SMC) framework to jointly use SNPs and other hyper-mutable markers. We are able to (1) improve the accuracy of demographic inference in recent times, (2) uncover past demographic events hidden to SNP-based inference methods, and (3) infer the hyper-mutable marker mutation rates under a finite site model. As a proof of principle, we focus on demographic inference in Arabidopsis thaliana using DNA methylation diversity data from 10 European natural accessions. We demonstrate that segregating single methylated polymorphisms (SMPs) satisfy the modeling assumptions of the SMC framework, while differentially methylated regions (DMRs) are not suitable as their length exceeds that of the genomic distance between two recombination events. Combining SNPs and SMPs while accounting for site- and region-level epimutation processes, we provide new estimates of the glacial age bottleneck and post-glacial population expansion of the European A. thaliana population. Our SMC framework readily accounts for a wide range of heritable genomic markers, thus paving the way for next-generation inference of evolutionary history by combining information from several genetic and epigenetic markers.
-
- Computational and Systems Biology
- Evolutionary Biology
There is growing interest in designing multidrug therapies that leverage tradeoffs to combat resistance. Tradeoffs are common in evolution and occur when, for example, resistance to one drug results in sensitivity to another. Major questions remain about the extent to which tradeoffs are reliable, specifically, whether the mutants that provide resistance to a given drug all suffer similar tradeoffs. This question is difficult because the drug-resistant mutants observed in the clinic, and even those evolved in controlled laboratory settings, are often biased towards those that provide large fitness benefits. Thus, the mutations (and mechanisms) that provide drug resistance may be more diverse than current data suggests. Here, we perform evolution experiments utilizing lineage-tracking to capture a fuller spectrum of mutations that give yeast cells a fitness advantage in fluconazole, a common antifungal drug. We then quantify fitness tradeoffs for each of 774 evolved mutants across 12 environments, finding these mutants group into classes with characteristically different tradeoffs. Their unique tradeoffs may imply that each group of mutants affects fitness through different underlying mechanisms. Some of the groupings we find are surprising. For example, we find some mutants that resist single drugs do not resist their combination, while others do. And some mutants to the same gene have different tradeoffs than others. These findings, on one hand, demonstrate the difficulty in relying on consistent or intuitive tradeoffs when designing multidrug treatments. On the other hand, by demonstrating that hundreds of adaptive mutations can be reduced to a few groups with characteristic tradeoffs, our findings may yet empower multidrug strategies that leverage tradeoffs to combat resistance. More generally speaking, by grouping mutants that likely affect fitness through similar underlying mechanisms, our work guides efforts to map the phenotypic effects of mutation.