Within-host evolutionary dynamics of seasonal and pandemic human influenza A viruses in young children

  1. Alvin X Han  Is a corresponding author
  2. Zandra C Felix Garza
  3. Matthijs RA Welkers
  4. René M Vigeveno
  5. Nhu Duong Tran
  6. Thi Quynh Mai Le
  7. Thai Pham Quang
  8. Dinh Thoang Dang
  9. Thi Ngoc Anh Tran
  10. Manh Tuan Ha
  11. Thanh Hung Nguyen
  12. Quoc Thinh Le
  13. Thanh Hai Le
  14. Thi Bich Ngoc Hoang
  15. Kulkanya Chokephaibulkit
  16. Pilaipan Puthavathana
  17. Van Vinh Chau Nguyen
  18. My Ngoc Nghiem
  19. Van Kinh Nguyen
  20. Tuyet Trinh Dao
  21. Tinh Hien Tran
  22. Heiman FL Wertheim
  23. Peter W Horby
  24. Annette Fox
  25. H Rogier van Doorn
  26. Dirk Eggink
  27. Menno D de Jong
  28. Colin A Russell  Is a corresponding author
  1. Department of Medical Microbiology & Infection Prevention, Amsterdam University Medical Center, Netherlands
  2. National Institute of Hygiene and Epidemiology, Viet Nam
  3. Ha Nam Centre for Disease Control, Viet Nam
  4. Children's Hospital 2, Viet Nam
  5. Children's Hospital 1, Viet Nam
  6. Vietnam National Children's Hospital, Viet Nam
  7. Siriraj Hospital, Mahidol University, Thailand
  8. Hospital for Tropical Diseases, Viet Nam
  9. National Hospital for Tropical Diseases, Viet Nam
  10. Oxford University Clinical Research Unit, Viet Nam
  11. Radboud Medical Centre, Radboud University, Netherlands
  12. Nuffield Department of Medicine, University of Oxford, United Kingdom
  13. Peter Doherty Institute for Infection and Immunity, University of Melbourne, Australia
  14. WHO Collaborating Centre for Reference and Research on Influenza, Australia
  15. Centre for Infectious Disease Control, National Institute for Public Health and the Environment, Netherlands
8 figures, 2 tables and 6 additional files

Figures

Figure 1 with 5 supplements
Genetic diversity of within-host influenza A virus populations.

Box plots summarizing the number of intra-host single-nucleotide variants (iSNVs; median, interquartile range [IQR], and whiskers extending within median ±1.5 × IQR) identified in samples with …

Figure 1—figure supplement 1
Sequence coverage across all influenza gene segments and samples.

Black line plots the mean coverage for a sliding window of 50 base pairs (stepsize = 25 base pairs). The interquartile range is shaded in dark pink while the full range is denoted in light pink. (A) …

Figure 1—figure supplement 2
Frequencies of nucleotide variants found in A/H3N2 viral reads sequenced from overlapping amplicons.

Each circle represents a nucleotide variant site (with frequency estimated between 0.02 and 0.98) found in reads attributed to at least two different amplicons (at least 100× coverage for each …

Figure 1—figure supplement 3
Maximum-likelihood phylogeny of putative majority (consensus) and minority whole-genome sequences (by concatenating all eight gene segments) of A/H3N2 virus samples.

Tip names are given in the format: ‘Patient ID_Days since symptom onset_putative consensus or minority sequence.’ The tree is rooted to the A/Brisbane/10/2007 virus (H3N2_Bris07; EPI_ISL_103644). …

Figure 1—figure supplement 4
Maximum-likelihood phylogeny of putative majority (consensus) and minority whole-genome sequences (by concatenating all eight gene segments) of H1N1pdm09 virus samples.

Tip names are given in the format: ‘Patient ID_Days since symptom onset_putative consensus or minority sequence.’ The tree is rooted to the A/California/04/2009 virus (H1N1pdm09_Cali09; …

Figure 1—figure supplement 5
Pearson’s correlation between the first day of oseltamivir treatment administered to patients and the last day on which viral samples with cycle threshold (CT) values ≤35 were collected.

Time is measured by number of days since symptom onset. Each point represents a patient included in this study who was treated with oseltamivir (Supplementary file 4).

Figure 2 with 4 supplements
Box plots (median, interquartile range [IQR], and whiskers extending within median ±1.5 × IQR) summarizing the empirical within-host evolutionary rates of (A) seasonal A/H3N2 viruses and (B) pandemic A/H1N1pdm09 viruses.

Top panel shows the evolutionary rate of individual gene segments over all timepoints (rg) while the bottom panel depicts the genome-wide evolutionary rate (rt) for each day since symptom onset. All …

Figure 2—figure supplement 1
Box plots (median, interquartile range [IQR], and whiskers extending within median ±1.5 × IQR) summarizing the empirical within-host evolutionary rates (rg,t) of different H3N2 viral gene segments.

All rates are stratified by substitution type (synonymous – blue; nonsynonymous – red; stop codon – gray). Wilcoxon signed-rank tests were performed to assess if the paired synonymous and …

Figure 2—figure supplement 2
Box plots (median, interquartile range [IQR], and whiskers extending within median ±1.5 × IQR) summarizing the empirical within-host evolutionary rates (rg,t) of different H1N1pdm09 viral gene segments.

All rates are stratified by substitution type (synonymous – blue; nonsynonymous – red; stop codon – gray). Wilcoxon signed-rank test was not performed here due to the low number of samples collected …

Figure 2—figure supplement 3
Linear regression of within-host synonymous and nonsynonymous evolutionary rates of within-host A/H3N2 virus samples.

Each plotted line is the linearly regressed line to the evolutionary rates computed for each A/H3N2-infected individual. Based on our findings, we expect that synonymous rates correlate negatively …

Figure 2—figure supplement 4
Box plots (median, interquartile range [IQR], and whiskers extending within median ±1.5 × IQR) summarizing the empirical daily within-host evolutionary rates of seasonal A/H3N2 viruses.

Variants that could potentially be PCR artifacts were removed (i.e., those found under the 75th percentile [6.3%] of frequency range of variants located in overlapping amplicons but were only …

Figure 3 with 2 supplements
Histogram of the mean number of minority intra-host single-nucleotide variants (iSNVs) identified per sample across all.

(A) A/H3N2 and (B) A/H1N1pdm09 virus specimens, sorted by frequency bins of 5% and substitution type (synonymous – blue; nonsynonymous – red; stop-codon – gray).

Figure 3—figure supplement 1
Histogram of the mean number of minority intra-host single-nucleotide variants (iSNVs) identified per sample across all H3N2 viral gene segments across all samples sorted by frequency bins of 5% and substitution type (synonymous – blue; nonsynonymous – red; gray – stop codon).
Figure 3—figure supplement 2
Histogram of the mean number of minority intra-host single-nucleotide variants (iSNVs) identified across all H1N1pdm09 viral gene segments across all samples sorted by frequency bins of 5% and substitution type (synonymous – blue; nonsynonymous – red; stop-codon – gray).
Figure 4 with 9 supplements
Intra-host single-nucleotide variants in within-host IAV populations.

(A) Breakdown of intra-host single-nucleotide variants (iSNVs) identified in seasonal A/H3N2 virus samples. The top panels plot the nucleotide positions where iSNVs were found in at least two …

Figure 4—figure supplement 1
Plots of intra-host hemagglutinin (HA) amino acid variants in A/H3N2-infected individuals.

Top panel shows the number of subjects where nonsynonymous variants were found in the respective protein site. Different canonical antigenic sites of the HA protein are colored (HA numbering based …

Figure 4—figure supplement 2
Plots of intra-host neuraminidase (NA) amino acid variants in A/H3N2-infected individuals.

Top panel shows the number of subjects where nonsynonymous variants were found in the respective protein site. Bottom panel plots selected as well as parallel amino acid mutations found in multiple …

Figure 4—figure supplement 3
Plots of intra-host nucleoprotein (NP) amino acid variants in A/H3N2-infected individuals.

Top panel shows the number of subjects where nonsynonymous variants were found in the respective protein site. Bottom panel plots selected as well as parallel amino acid mutations found in multiple …

Figure 4—figure supplement 4
Plots of M2 protein intra-host amino acid variants in A/H3N2-infected individuals.

Top panel shows the number of subjects where nonsynonymous variants were found in the respective protein site. Bottom panel plots selected as well as parallel amino acid mutations found in multiple …

Figure 4—figure supplement 5
Plots of hemagglutinin (HA) intra-host amino acid variants in A/H1N1pdm09-infected individuals.

Top panel shows the number of subjects where nonsynonymous variants were found in the respective protein site. Different canonical antigenic sites of the HA protein are colored (HA numbering based …

Figure 4—figure supplement 6
Plots of neuraminidase (NA) intra-host amino acid variants in A/H1N1pdm09-infected individuals.

Top panel shows the number of subjects where nonsynonymous variants were found in the respective protein site. Bottom panel plots selected as well as parallel amino acid mutations found in multiple …

Figure 4—figure supplement 7
Plots of M2 protein intra-host amino acid variants in A/H1N1pdm09-infected individuals.

Top panel shows the number of subjects where nonsynonymous variants were found in the respective protein site. Bottom panel plots selected as well as parallel amino acid mutations found in multiple …

Figure 4—figure supplement 8
Frequency distributions of intra-host single-nucleotide variants (iSNVs) below the 2% variant calling threshold found in nucleotide positions NP-1150 and M-917 that encode for amino acid sites NP-384 and M2-77, respectively.

All A/H3N2 virus samples collected from all patients with site coverage above the 100× are included. The distributions were compared to that of neighboring sites, ±10 nucleotide positions adjacent …

Figure 4—figure supplement 9
Plots of within-host recurring A/H3N2 amino acid variants NP-G384R and M2-R77* based on variant calls and frequencies after remapping sample reads to their respective sample consensus sequence.
The trimeric and monomeric crystal structures of nucleoprotein (PDB: 3ZDP) (Chenavas et al., 2013) of influenza A viruses.

Amino acid sites with potentially linked epistatic amino acid variants as tabulated in Table 1 are separately colored, with their corresponding positions annotated on the monomeric structure.

Figure 6 with 1 supplement
Evolutionary rates computed from forward-time Monte Carlo within-host simulations for different mean deleterious effects (sd-) of nonsynonymous mutations.

We assumed that synonymous mutations are neutral for all simulations. (A) Neutral expectation where all nonsynonymous mutations are neutral (fneu,NS=100%). We tested our hypotheses where the majority of …

Figure 6—figure supplement 1
Number of virions (N) against replicative generation (t) based on a target cell-limited within-host model.

Blue line with markers denotes the population size computed from the model. When N>107 virions, we assumed that N remained constant at 107 (pink dashed line) to reduce computational costs of …

Appendix 1—figure 1
Genetic diversity of within-host influenza A virus populations as estimated by nucleotide diversity π statistic.

Box plots summarizing the π statistic (intra-host single-nucleotide variants [iSNVs]; median, interquartile range [IQR], and whiskers extending within median ±1.5 × IQR) computed for samples with …

Appendix 1—figure 2
A/H1N1pdm09 virus household transmission pairs.

(A) Schematic of A/H1N1pdm09 virus household transmission pairs identified by epidemiological linkage and plotted based on timing of sample collection (left panel) Intra-host single-nucleotide …

Tables

Table 1
FoldX stability predictions of likely linked nonsynonymous minority variants found in A/H3N2 nucleoprotein.

The mean ΔΔG and standard deviation (SD) values reported are based on the results of five distinct simulations. Variants with mean ΔΔG<-0.46 kcal/mol are deemed to be stabilizing while destabilizing mutants …

ΔΔG(kcal/mol)
VariantsMeanSD
G384R−3.840.06
M426I2.610.05
G384R,M426I−0.420.06
G102R4.870.00
G384R,G102R0.760.09
A493T11.960.30
G384R,A493T5.560.19
V197I−3.110.02
S353Y−1.970.68
V197I,S353Y−4.480.14
Table 2
Parameter values used in the within-host model.
ParameterMeaningValue (units)Source
-Number of hours per replicative generation6 hrAssumption
rAverage number of virions produced by an infected cell100 virionsFrensing et al., 2016
C0Initial target cell population size4 × 108 virionsHadjichrysanthou et al., 2016
dPer capita decay rateTwo per-generationAssumption
R0Within-host basic reproduction number5Hadjichrysanthou et al., 2016
µPer-site, per-generation mutation rate3 × 10−5 per-site, per-generationMcCrone et al., 2020

Additional files

Supplementary file 1

Mean number of nonsynonymous (NS), synonymous (S), and stop codon (Stop) variants per sample for each gene segment as well as the corresponding NS/S ratio.

https://cdn.elifesciences.org/articles/68917/elife-68917-supp1-v2.xlsx
Supplementary file 2

Potentially linked nonsynonymous variants in within-host A/H1N1pdm09 and A/H3N2 virus samples.

Sample names are given in the format of 'Patient ID_Days since symptom onset.' Both linkage disequilibrium (LD) and the normalized LD' measures are tabulated alongside the inferred maximum-likelihood haplotype frequencies (q10 and q01 are the haplotype frequencies with variant i or ii only while q11 is the frequency of haplotypes encoding both variants).

https://cdn.elifesciences.org/articles/68917/elife-68917-supp2-v2.xlsx
Supplementary file 3

A/H3N2 segment-specific primers.

https://cdn.elifesciences.org/articles/68917/elife-68917-supp3-v2.xlsx
Supplementary file 4

Patients metadata (provided as an Excel file).

https://cdn.elifesciences.org/articles/68917/elife-68917-supp4-v2.xlsx
Supplementary file 5

Acknowledgment table of reference sequences downloaded from GISAID.

https://cdn.elifesciences.org/articles/68917/elife-68917-supp5-v2.xlsx
Transparent reporting form
https://cdn.elifesciences.org/articles/68917/elife-68917-transrepform-v2.docx

Download links