1. Epidemiology and Global Health
Download icon

Mathematical modeling of the West Africa Ebola epidemic

  1. Jean-Paul Chretien Is a corresponding author
  2. Steven Riley
  3. Dylan B George
  1. Division of Integrated Biosurveillance, Armed Forces Health Surveillance Center, United States
  2. School of Public Health, Imperial College London, United Kingdom
  3. Biomedical Advanced Research and Development Authority, United States
Research Article
Cited
19
Views
2,643
Comments
0
Cite as: eLife 2015;4:e09186 doi: 10.7554/eLife.09186

Abstract

As of November 2015, the Ebola virus disease (EVD) epidemic that began in West Africa in late 2013 is waning. The human toll includes more than 28,000 EVD cases and 11,000 deaths in Guinea, Liberia, and Sierra Leone, the most heavily-affected countries. We reviewed 66 mathematical modeling studies of the EVD epidemic published in the peer-reviewed literature to assess the key uncertainties models addressed, data used for modeling, public sharing of data and results, and model performance. Based on the review, we suggest steps to improve the use of modeling in future public health emergencies.

https://doi.org/10.7554/eLife.09186.001

eLife digest

The outbreak of Ebola that started in West Africa in late 2013 has caused at least 28,000 illnesses and 11,000 deaths. As the outbreak progressed, global and local public health authorities scrambled to contain the spread of the disease by isolating those who were ill, putting in place infection control processes in health care settings, and encouraging the public to take steps to prevent the spread of the illness in the community. It took a massive investment of resources and personnel from many countries to eventually bring the outbreak under control.

To determine where to allocate people and resources during the outbreak, public health authorities often turned to mathematical models created by scientists to predict the course of the outbreak and identify interventions that could be effective. Many groups of scientists created models of the epidemic using publically available data or data they obtained from government officials or field studies. In some instances, the models yielded valuable insights. But with various groups using different methods and data, the models didn’t always agree on what would happen next or how best to contain the epidemic.

Now, Chretien et al. provide an overview of Ebola mathematical modeling during the epidemic and suggest how future efforts may be improved. The overview included 66 published studies about Ebola outbreak models. Although most forecasts predicted many more cases than actually occurred, some modeling approaches produced more accurate predictions, and several models yielded valuable insights. For example, one study found that focusing efforts on isolating patients with the most severe cases of Ebola would help end the epidemic by substantially reducing the number of new infections. Another study used real-time airline data to predict which traveler screening strategies would be most efficient at preventing international spread of Ebola. Furthermore, studies that obtained genomic data showed how specific virus strains were transmitted across geographic areas.

Chretien et al. argue that mathematical modeling efforts could be more useful in future pubic health emergencies if modelers cooperated more, and suggest the collaborative approach of weather forecasters as a good example to follow. Greater data sharing and the creation of standards for epidemic modeling would aid better collaboration.

https://doi.org/10.7554/eLife.09186.002

Introduction

On March 23, 2014, the Ministry of Health Guinea notified the World Health Organization (WHO) of a rapidly evolving outbreak of Ebola virus disease (EVD), now believed to have begun in December 2013. The epidemic spread through West Africa and reached Europe and the United States. As of November 4, 2015, WHO reported more than 28,000 cumulative cases and 11,000 deaths in Guinea, Liberia, and Sierra Leone, where transmission had been most intense (World Health Organization, 2016).

As the emergency progressed, researchers developed mathematical models of the epidemiological dynamics. Modelers have assessed ongoing epidemics previously, but the prominence of recent EVD work, enabled by existing research programs for infectious disease modeling (National Institutes of Health, 2016a; National Institutes of Health, 2016b) and online availability of EVD data via WHO (World Health Organization, 2016), Ministries of Health of affected countries, or modelers who transcribed and organized public WHO or Ministry of Health data (Rivers C) may be unprecedented. The efforts for this outbreak have been numerous and diverse, with major media incorporating modeling results in many pieces throughout the outbreak. U.S. Government decision making has benefited from modeling results at key moments during the response (Robinson R).

We draw on this vigorous response of the epidemiological modeling community to the EVD epidemic to review (Moher et al., 2009) the application of modeling to public health emergencies, and identify lessons to guide the modeling response to future emergencies.

Results

Overview of modeling applications

We identified 66 publications meeting inclusion criteria (Figure 1).

Models addressed 6 key uncertainties about the EVD epidemic: transmissibility, typically represented by the reproduction number (R, the average number of people each infected person infects; assessed in 41 publications); effectiveness of various interventions that had been or might be implemented (in 29 publications); epidemic forecast (in 29 publications); regional or international spreading patterns or risk (in 15 publications); phylogenetics of EVD viruses (in 9 publications); and feasibility of conducting vaccine trials in West Africa (in 2 publications) (Table 1, Supplementary file 1).

Table 1

Overview of modeling publications on the 2013-present EVD epidemic.

https://doi.org/10.7554/eLife.09186.004
Ref.Date of latest EVD dataDate publishedEVD data was pre-existing and publicUncertainties addressed
RInterventionsForecastSpreadPhylogeneticsClinical trials
Baize et al., 2014 3/20/144/16/14No*
Dudas and Rambaut, 2014 3/20/145/2/14Yes*
Alizon et al., 2014 6/18/1412/13/14Yes**
Gire et al., 2014 6/18/148/28/14No**
Stadler et al., 2014 6/18/1410/6/14Yes*
Volz and Pond, 2014 6/18/1410/24/14Yes*
Pandey et al., 2014 8/7/1410/30/14Yes***
Gomes et al., 2014 8/9/149/2/14Yes***
Valdez et al., 2015 8/15/147/20/15Yes****
Merler et al., 2015 8/16/141/7/15Yes****
Rainisch et al., 2015 8/16/142/18/15Yes*
Althaus, 2014 8/20/149/2/14Yes*
Fisman et al., 2014 8/22/149/8/14Yes**
Nishiura and Chowell, 2014 8/26/149/11/14Yes**
Poletto et al., 2014 8/27/1410/23/14Yes**
Meltzer et al., 2014 8/28/149/26/14Yes***
Agusto et al., 2015 8/29/144/23/15Yes**
Althaus, 2015 8/31/144/19/15Yes*
Scarpino et al., 2014 8/31/1412/15/14Yes*
Weitz and Dushoff, 2015 8/31/143/4/15Yes**
Drake et al., 2015 9/2/1410/30/14Yes***
Towers et al., 2014 9/8/149/18/14Yes**
Bellan et al., 2014 9/14/1410/14/14Yes*
Chowell et al., 2015 9/14/141/19/15Yes*
Cooper et al., 2015 9/14/144/14/15Yes*
Read et al., 2015 9/14/1411/12/14Yes**
WHO Ebola Response Team, 2014 9/14/149/23/14No***
Faye et al., 2015 9/16/141/23/15No**
Bogoch et al., 2015 9/21/1410/21/14Yes**
Yamin et al., 2014 9/22/1410/28/14No**
Lewnard et al., 2014 9/23/1410/24/14Yes***
Webb et al., 2015 9/23/141/30/15Yes***
Shaman et al., 2014 9/28/1410/27/14Yes**
Chowell et al., 2014 10/1/1411/20/14Yes**
Fasina et al., 2014 10/1/1410/9/14Yes*
Khan et al., 2015 10/1/142/24/15Yes*
Rivers et al., 2014 10/5/1410/16/14Yes***
Xia et al., 2015 10/7/149/8/15Yes**
Majumder et al., 2014 10/11/144/28/15Yes**
Kiskowski, 2014 10/15/1411/13/14Yes**
Fisman and Tuite, 2014 10/18/1411/21/14Yes***
Althaus et al., 2015 10/20/141/15/15Yes***
Simon-Loriere et al., 2015 10/25/146/24/15No*
Rainisch et al., 2015 10/31/146/16/15Yes*
Fast et al., 2015 11/1/145/15/15Yes*
Kucharski et al., 2015 11/1/142/18/15Yes***
Tong et al., 2015 11/11/145/13/15No**
Hoenen et al., 2015 11/21/143/26/15No*
Cope et al., 2014 12/3/1412/10/14Yes**
White et al., 2014 12/3/141/30/15Yes***
WHO Ebola Response Team, 2015 12/14/1412/24/14No**
Chowell et al., 2014 12/17/141/21/15Yes*
Siettos et al., 2014 12/21/143/9/15Yes**
Park et al., 2015 12/26/146/18/15No*
Nadhem and Nejib, 2015 12/30/146/14/15Yes*
Camacho et al., 2015 1/18/152/10/15Yes**
Carroll et al., 2015 1/31/156/17/15No**
Bellan et al., 2015 2/9/154/15/15Yes**
Barbarossa et al., 2015 2/13/157/21/15Yes***
Kugelman et al., 2015 2/14/156/12/15No*
Cleaton et al., 2015 2/28/159/3/15Yes*
Wang and Zhong, 2015 3/18/153/24/15Yes*
Toth et al., 2015 3/31/157/14/15Yes**
Dong et al., 2015 4/3/159/5/15Yes**
Browne et al., 2015 4/12/155/14/15Yes**
Zinszer et al., 2015 5/13/159/1/15Yes*

The number of publications with models to estimate R increased rapidly early in the epidemic, along with those including intervention, forecasting, and regional and international spread models; the growth rate of publications with phylogenetic modeling applications and clinical trial models increased later in the epidemic (Figure 2).

Cumulative number of modeling applications by date of most recent EVD data used.

The figure includes 125 modeling applications across the 66 publications.

https://doi.org/10.7554/eLife.09186.005

Of the 125 models reported across the studies, 74% included mechanistic assumptions about disease transmission (e.g., compartmental, agent-based, or phylogenetic models), while 26% were purely phenomenological (Supplementary file 2).

Data sources

For 54 (82%) of the 66 publications, the only EVD data used was pre-existing and publicly-available (Table 1). Typically, these were aggregate case data posted online by the WHO or affected countries, or Ebola virus genetic data released previously during the epidemic. Twelve studies used original EVD epidemiological data (Baize et al., 2014; WHO Ebola Response Team, 20142015; Faye et al., 2015; Yamin et al., 2014) or genomic data (Baize et al., 2014; Gire et al., 2014; Simon-Loriere et al., 2015; Tong et al., 2015; Hoenen et al., 2015; Park et al., 2015; Carroll et al., 2015; Kugelman et al., 2015).

Examples of additional data used for some modeling applications include official reports of social mobilization efforts (Fast et al., 2015), media reports of case clusters (Cleaton et al., 2015), media reports of events that may curtail or aggravate transmission (Majumder et al., 2014), and international air travel data (Gomes et al., 2014; Poletto et al., 2014; Read et al., 2015; Bogoch et al., 2015; Rainisch et al., 2015; Cope et al., 2014). Several studies incorporated spatial data on EVD cases into models of regional EVD spread (Gire et al., 2014; Merler et al., 2015; Rainisch et al., 2015; Tong et al., 2015; Carroll et al., 2015; Zinszer et al., 2015).

Data and results sharing

Of the 12 studies that collected original EVD data, 9 released those data before or at the time of publication (8 with Ebola virus genetic data deposited in GenBank (Baize et al., 2014; Gire et al., 2014; Simon-Loriere et al., 2015; Tong et al., 2015; Hoenen et al., 2015; Park et al., 2015; Carroll et al., 2015; Kugelman et al., 2015) and 1 with detailed epidemiological data in the online publication (Yamin et al., 2014). Many publications used results from the WHO Ebola Response Team investigations (WHO Ebola Response Team, 2014; 2015) (for example, estimates of the generation time, case fatality rate, or other epidemiological parameters as model inputs), but the detailed epidemiological data from these studies, to date, are not publicly available.

Accumulation of shared EVD data over successive studies was evident especially in the phylogenetic analyses. For example, all phylogenetic studies published after release of the initial Ebola virus sequences by (Baize et al., 2014) (Guinea) and (Gire et al., 2014) (Sierra Leone) incorporated those sequence data.

Across all studies, the publication lag (defined as date of most recent EVD data used to date of online publication) was almost 3 months (median [interquartile range] = 85 [30–157] days). The lag varied across modeling applications, and was considerably shorter in studies that included models to estimate R (median = 58 days for publications with R estimation versus 118 days for others) or to forecast (median = 50 versus 125 days) (Figure 3).

Publication lag by type of modeling application.

The vertical red and turquoise lines indicate the median lag for publications including and not including, respectively, the type of modeling application.

https://doi.org/10.7554/eLife.09186.006

Lags were longest for studies with phylogenetic and clinical trials applications (median = 125 and 108 days, respectively), although there were fewer publications with these models.

Modeling results: R and forecasts

Forty-one publications characterized epidemic dynamics using epidemiological (N=36), genomic (N=4), or news report data (N=1). Twenty-four of these provided estimates of the basic reproduction number (R0) for Guinea, Liberia, Sierra Leone, or West Africa, using epidemiological or genomic data (Figure 4, Supplementary file 3).

R0 estimates by type of model input data.

Aggregate, case counts released by the WHO or Ministries of Health; Line-level, individual-level data from epidemiological investigations; Genomic, Ebola virus sequence data. The Figure excludes an outlier estimate of 8.33 for Sierra Leone (Fisman et al., 2014).

https://doi.org/10.7554/eLife.09186.007

There were 16 country-specific estimates of R0 for Guinea, Liberia, or Sierra Leone that used EVD epidemiological data (aggregate or line-level) and provided 95% confidence or credible intervals (CIs). Median CI width was about 85% smaller for models that used cumulative EVD counts (N=11 models in 5 publications) than for models that used disaggregated EVD case data, such as weekly counts (N=5 models in 3 publications) (Figure 5).

R0 estimates and CIs by type of epidemiological input data.

Disaggregated data typically were weekly counts. Top row: Vertical lines indicate 95% CIs. Bottom row: Horizontal bars indicate median CI width.

https://doi.org/10.7554/eLife.09186.008

Although CIs were also narrower for models when deterministic rather than stochastic methods were used to estimate parameter uncertainty, all of the deterministic results came from a single study (Figure 6).

R0 estimates and CIs by model fitting method.

Top row: Vertical lines indicate 95% CIs. Bottom row: Horizontal bars indicate median CI width.

https://doi.org/10.7554/eLife.09186.009

Fifteen publications provided numerical forecasts of cumulative EVD incidence for West African countries. Of 22 models that assumed no additional response measures beyond those implemented at the time (i.e., 'status quo' assumptions), 18 overestimated the future number of cases (Figure 7, Supplementary file 4).

Accuracy of cumulative incidence forecasts.

Accuracy is shown as the ratio of predicted incidence to incidence subsequently reported by the WHO. 'Dampening' refers to various approaches to restrict the growth of forecasted incidence over time. Top row: Accuracy by date of forecast. Bottom row: Accuracy by forecast lead time ('Horizon'). The Figure excludes one forecast with horizon > 1 year (Fisman and Tuite, 2014).

https://doi.org/10.7554/eLife.09186.010

In multivariate analysis, forecast error was lower for forecasts made later in the outbreak (14% reduction in mean absolute percentage error [MAPE] per week, P<0.001), higher for forecasts with longer time horizons (29% increase in MAPE per week, P<0.01), and lower for forecasts that used decay terms, spatially heterogeneous contact patterns, or other methods that served to constrain projected incidence growth (90% reduction in MAPE, P<0.01). Country and number of parameters in the model were not statistically significant predictors of forecast accuracy.

Discussion

We identified 66 modeling publications during approximately 18 months of the EVD response that assessed trends in the intensity of transmission, effectiveness of control measures, future case counts, regional and international spreading risk, Ebola virus phylogenetic relationships and recent evolutionary dynamics, and feasibility of clinical trials in West Africa. We found a heavy dependence on public data for EVD modeling, and identified factors that might have influenced model performance. To our knowledge, this review is one of the most comprehensive assessments of mathematical modeling applied to a single real-world public health emergency.

An important caveat of our review is that it only captures published results. We are aware of additional EVD epidemiological investigations and modeling not yet published. Some modelers providing direct support to operational response efforts have not published results, possibly because of operational demands.

Also, we could not account comprehensively for the sources of variation across studies. For example, studies that estimated R0 using the same data sources at about the same time reported varied results. Such variation may, in part, reflect the problem of identifiability, with different R0 estimates possible for models that perform equally well depending on other parameter values (Weitz and Dushoff, 2015). Ideally, an investigation into this heterogeneity would include implementation of models in a common testing environment.

Our review suggests several possible steps for improving the application of epidemiological modeling during public health emergencies. First, agreement on community best practices could improve the quality of modeling support to decision-makers. For example, our analysis is consistent with simulation studies showing underestimation of uncertainty in estimating R0 with cumulative (as opposed to disaggregated) incidence data, and supports the recommendation to use disaggregated data and stochastic models (King et al., 2015). Additionally, incidence forecasts provided reasonable prospective estimates several weeks forward in time during the initial phase; however, given available data and methodologies these forecasts became progressively more inaccurate as they projected dynamics beyond several weeks. Validation of incidence forecasts against other relevant data, such as hospital admissions and contacts identified, also could provide evidence that the assumptions are sound.

The 2014 onwards ebola outbreak in West Africa clearly highlights the need for a better understanding of how increasing awareness of severe infections within a community decreases their transmissibility even in the absence of specific interventions. Advancing methodological approaches to capture this effect, such as dampening approaches, might help account for behavioral changes, interventions, contact heterogeneity, or other factors that can be expected in a public health emergency which likely will improve forecasting accuracy. Establishing best practices within the community will allow decision-makers the ability to more quickly accept methodologies and results that have been generated via these best practices. Hence, decisions based on these results can happen more quickly.

Second, modeling coordination could facilitate direct comparison of modeling results, identifying issues on which diverse approaches agree and areas of greater uncertainty. Epidemiological modelers might learn from comparison initiatives in modeling of influenza (Centers for Disease Control and Prevention, 2013), dengue (US Department of Commerce), and HIV (HIV modeling consortium); and in other fields such as climate forecasting Intergovernmental Panel on Climate Change, 2010). For epidemiological application, an ensemble approach should preserve methodological diversity to exploit the full range of state-of-the-art modeling methods, but include enough standardization to enable cross-model comparison. Establishing an initial architecture for a coordinated, ensemble effort now could assist the response to EVD, and future public health emergencies.

Perhaps most importantly, outbreak modeling efforts would be much more fruitful if data and analytical results could be made available more quickly to all interested parties (Yozwiak et al., 2015). The publication timelines for academic journals typically will not be consistent with decision-making needs during public health emergencies like the EVD epidemic, where the epidemiological situation was highly dynamic and the usefulness of data and forecasts time-constrained. Establishing mechanisms for modelers without special access to the official epidemiological teams to share interim results would expand the evidence base for response decision-making. Ideally, data should be made available online in machine-readable form to facilitate use in analyses. Modelers and other analysts expended enormous effort during the EVD epidemic transcribing data posted online in pdf documents.

New norms for data-sharing during public health emergencies (World Health Organization, 2015) would remove the most obvious hurdle for model comparison. The current situation where groups either negotiate bilaterally with individual countries or work exclusively with global health and development agencies is understandable, but highly ineffective. The EVD outbreak highlights again – after the 2003 Severe Acute Respiratory Syndrome epidemic and 2009 influenza A (H1N1) pandemic – that an independent, well-resourced global data observatory could greatly facilitate the public health response in many ways, not least of which would be the enablement of rapid, high quality, and easily comparable disease-dynamic studies.

Materials and methods

For this review, we adapted the PRISMA methodology (Moher et al., 2009) to identify quantitative modeling studies of the 2013-present West Africa EVD epidemic. We searched PubMed on September 24, 2015, for publications in English since December 1, 2013, using the term ‘Ebola’ in any field. We reviewed all returned abstracts and selected ones for confirmatory, full-text review that mentioned use of quantitative models to characterize or predict epidemic dynamics or evaluate interventions. We included studies that met this criterion in full-text review.

We excluded studies of clinical prediction models, viral or physiological function models, ecological niche models, animal reservoir models, and publications that did not use data from the 2013-present West Africa EVD epidemic.

For included publications, we recorded the geographic settings, date of most recent EVD data used and date of publication, type of EVD data used, questions the models addressed, modeling approaches, and key results, including estimates of the basic reproduction number (R0) and forecasts of future EVD incidence provided in the main text of the publications. To assess forecast accuracy, we compared predictions of models made under ‘status quo’ assumptions (i.e., without explicit inclusion of additional interventions or behavioral changes) to EVD incidence data subsequently released by the WHO (World Health Organization, 2016), using the WHO figures dated soonest after the forecast target date.

References

  1. 1
  2. 2
  3. 3
  4. 4
    Estimating the reproduction number of ebola virus (eBOV) during the 2014 outbreak in West Africa
    1. CL Althaus
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.91afb5e0f279e7f29e7056095255b288.
  5. 5
  6. 6
  7. 7
  8. 8
  9. 9
  10. 10
  11. 11
  12. 12
    Temporal changes in ebola transmission in sierra leone and implications for control requirements: a real-time modelling study
    1. A Camacho
    2. A Kucharski
    3. Y Aki-Sawyerr
    4. MA White
    5. S Flasche
    6. M Baguelin
    7. T Pollington
    8. JR Carney
    9. R Glover
    10. E Smout
    11. A Tiffany
    12. WJ Edmunds
    13. S Funk
    (2015)
    PLoS Currents, 7, 10.1371/currents.outbreaks.406ae55e83ec0b5193e30856b9235ed2.
  13. 13
    Temporal and spatial analysis of the 2014–2015 ebola virus outbreak in west africa
    1. MW Carroll
    2. DA Matthews
    3. JA Hiscox
    4. MJ Elmore
    5. G Pollakis
    6. A Rambaut
    7. R Hewson
    8. I García-Dorival
    9. JA Bore
    10. R Koundouno
    11. S Abdellati
    12. B Afrough
    13. J Aiyepada
    14. P Akhilomen
    15. D Asogun
    16. B Atkinson
    17. M Badusche
    18. A Bah
    19. S Bate
    20. J Baumann
    21. D Becker
    22. B Becker-Ziaja
    23. A Bocquin
    24. B Borremans
    25. A Bosworth
    26. JP Boettcher
    27. A Cannas
    28. F Carletti
    29. C Castilletti
    30. S Clark
    31. F Colavita
    32. S Diederich
    33. A Donatus
    34. S Duraffour
    35. D Ehichioya
    36. H Ellerbrok
    37. MD Fernandez-Garcia
    38. A Fizet
    39. E Fleischmann
    40. S Gryseels
    41. A Hermelink
    42. J Hinzmann
    43. U Hopf-Guevara
    44. Y Ighodalo
    45. L Jameson
    46. A Kelterbaum
    47. Z Kis
    48. S Kloth
    49. C Kohl
    50. M Korva
    51. A Kraus
    52. E Kuisma
    53. A Kurth
    54. B Liedigk
    55. CH Logue
    56. A Lüdtke
    57. P Maes
    58. J McCowen
    59. S Mély
    60. M Mertens
    61. S Meschi
    62. B Meyer
    63. J Michel
    64. P Molkenthin
    65. C Muñoz-Fontela
    66. D Muth
    67. EN Newman
    68. D Ngabo
    69. L Oestereich
    70. J Okosun
    71. T Olokor
    72. R Omiunu
    73. E Omomoh
    74. E Pallasch
    75. B Pályi
    76. J Portmann
    77. T Pottage
    78. C Pratt
    79. S Priesnitz
    80. S Quartu
    81. J Rappe
    82. J Repits
    83. M Richter
    84. M Rudolf
    85. A Sachse
    86. KM Schmidt
    87. G Schudt
    88. T Strecker
    89. R Thom
    90. S Thomas
    91. E Tobin
    92. H Tolley
    93. J Trautner
    94. T Vermoesen
    95. I Vitoriano
    96. M Wagner
    97. S Wolff
    98. C Yue
    99. MR Capobianchi
    100. B Kretschmer
    101. Y Hall
    102. JG Kenny
    103. NY Rickett
    104. G Dudas
    105. CE Coltart
    106. R Kerber
    107. D Steer
    108. C Wright
    109. F Senyah
    110. S Keita
    111. P Drury
    112. B Diallo
    113. H de Clerck
    114. M Van Herp
    115. A Sprecher
    116. A Traore
    117. M Diakite
    118. MK Konde
    119. L Koivogui
    120. N Magassouba
    121. T Avšič-Županc
    122. A Nitsche
    123. M Strasser
    124. G Ippolito
    125. S Becker
    126. K Stoecker
    127. M Gabriel
    128. H Raoul
    129. A Di Caro
    130. R Wölfel
    131. P Formenty
    132. S Günther
    (2015)
    Nature 524:97–101.
    https://doi.org/10.1038/nature14594
  14. 14
  15. 15
  16. 16
    Is west africa approaching a catastrophic phase or is the 2014 ebola epidemic slowing down? different models yield different answers for liberia
    1. G Chowell
    2. L Simonsen
    3. C Viboud
    4. Y Kuang
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.b4690859d91684da963dc40e00f3da81.
  17. 17
    The western africa ebola virus disease epidemic exhibits both global exponential and local polynomial growth rates
    1. G Chowell
    2. C Viboud
    3. JM Hyman
    4. L Simonsen
    (2014)
    PLoS Currents, 7, 10.1371/currents.outbreaks.8b55f4bad99ac5c5db3663e916803261.
  18. 18
  19. 19
  20. 20
    Assessment of the risk of ebola importation to australia
    1. RC Cope
    2. P Cassey
    3. GJ Hugo
    4. JV Ross
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.aa0375fd48a92c7c9422aa543a88711f.
  21. 21
  22. 22
  23. 23
    Phylogenetic analysis of guinea 2014 EBOV ebolavirus outbreak
    1. G Dudas
    2. A Rambaut
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.84eefe5ce43ec9dc0bf0670f7b8b417d.
  24. 24
  25. 25
    The role of social mobilization in controlling ebola virus in lofa county, liberia
    1. SM Fast
    2. S Mekaru
    3. JS Brownstein
    4. TA Postlethwaite
    5. N Markuzon
    (2015)
    PLoS Currents, 7, 10.1371/currents.outbreaks.c3576278c66b22ab54a25e122fcdbec1.
  26. 26
  27. 27
    Early epidemic dynamics of the west african 2014 ebola outbreak: estimates derived with a simple two-parameter model
    1. D Fisman
    2. E Khoo
    3. A Tuite
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.89c0d3783f36958d96ebbae97348d571.
  28. 28
    Projected impact of vaccination timing and dose availability on the course of the 2014 west african ebola epidemic
    1. D Fisman
    2. A Tuite
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.06e00d0546ad426fed83ff24a1d4c4cc.
  29. 29
  30. 30
    Assessing the international spreading risk associated with the 2014 west african ebola outbreak
    1. MF Gomes
    2. A Pastore Y Piontti
    3. L Rossi
    4. D Chao
    5. I Longini
    6. ME Halloran
    7. A Vespignani
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.cd818f63d40e24aef769dda7df9e0da5.
  31. 31
  32. 32
  33. 33
  34. 34
  35. 35
  36. 36
    A three-scale network model for the early growth dynamics of 2014 west africa ebola epidemic
    1. MA Kiskowski
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.c6efe8274dc55274f05cbcb62bbe6070.
  37. 37
  38. 38
  39. 39
  40. 40
    Estimation of MERS-coronavirus reproductive number and case fatality rate for the spring 2014 saudi arabia outbreak: insights from publicly available data
    1. MS Majumder
    2. C Rivers
    3. E Lofgren
    4. D Fisman
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.98d2f8f3382d84f390736cd5f5fe133c.
  41. 41
    Estimating the future number of cases in the ebola epidemic--liberia and sierra leone, 2014-2015
    1. MI Meltzer
    2. CY Atkins
    3. S Santibanez
    4. B Knust
    5. BW Petersen
    6. ED Ervin
    7. ST Nichol
    8. IK Damon
    9. ML Washington
    (2014)
    Morbidity and Mortality Weekly Report. Surveillance Summaries (Washington, D.C. : 2002) 63 Suppl 3:1–14.
  42. 42
  43. 43
  44. 44
  45. 45
  46. 46
  47. 47
    Early transmission dynamics of ebola virus disease (eVD), West Africa, march to august 2014
    1. H Nishiura
    2. G Chowell
    (2014)
    Euro Surveillance : Bulletin Européen Sur Les Maladies Transmissibles = European Communicable Disease Bulletin 19:20894.
    https://doi.org/10.2807/1560-7917.ES2014.19.36.20894
  48. 48
  49. 49
  50. 50
  51. 51
  52. 52
  53. 53
  54. 54
  55. 55
    Modeling the impact of interventions on an epidemic of ebola in sierra leone and liberia
    1. CM Rivers
    2. ET Lofgren
    3. M Marathe
    4. S Eubank
    5. BL Lewis
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.4d41fe5d6c05e9df30ddce33c66d084c.
  56. 56
  57. 57
    Epidemiological and viral genomic sequence analysis of the 2014 ebola outbreak reveals clustered transmission
    1. SV Scarpino
    2. A Iamarino
    3. C Wells
    4. D Yamin
    5. M Ndeffo-Mbah
    6. NS Wenzel
    7. SJ Fox
    8. T Nyenswah
    9. FL Altice
    10. AP Galvani
    11. LA Meyers
    12. JP Townsend
    (2015)
    Clinical Infectious Diseases, 60, 10.1093/cid/ciu1131.
  58. 58
    Inference and forecast of the current west african ebola outbreak in guinea, sierra leone and liberia
    1. J Shaman
    2. W Yang
    3. S Kandula
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.3408774290b1a0f2dd7cae877c8b8ff6.
  59. 59
    Modeling the 2014 ebola virus epidemic – agent-based simulations, temporal analysis and future predictions for liberia and sierra leone
    1. C Siettos
    2. C Anastassopoulou
    3. L Russo
    4. C Grigoras
    5. E Mylonakis
    (2015)
    PLoS Currents, 7, 10.1371/currents.outbreaks.8d5984114855fc425e699e1a18cdc6c9.
  60. 60
  61. 61
    Insights into the early epidemic spread of ebola in sierra leone provided by viral sequence data
    1. T Stadler
    2. D Kühnert
    3. DA Rasmussen
    4. L du Plessis
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.02bc6d927ecee7bbd33532ec8ba6a25f.
  62. 62
  63. 63
  64. 64
    Temporal variations in the effective reproduction number of the 2014 west africa ebola outbreak
    1. S Towers
    2. O Patterson-Lomba
    3. C Castillo-Chavez
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.9e4c4294ec8ce1adad283172b16bc908.
  65. 65
  66. 66
  67. 67
    Phylodynamic analysis of ebola virus in the 2014 sierra leone epidemic
    1. E Volz
    2. S Pond
    (2014)
    PLoS Currents, 6, 10.1371/currents.outbreaks.6f7025f1271821d4c815385b08f5f80e.
  68. 68
  69. 69
    A model of the 2014 ebola epidemic in west africa with contact tracing
    1. G Webb
    2. C Browne
    3. X Huo
    4. O Seydi
    5. M Seydi
    6. P Magal
    (2015)
    PLoS Currents, 7, 10.1371/currents.outbreaks.846b2a31ef37018b7d1126a9c8adf22a.
  70. 70
  71. 71
    Projected treatment capacity needs in sierra leone
    1. RA White
    2. E MacDonald
    3. BF de Blasio
    4. K Nygård
    5. L Vold
    6. J-A Røttingen
    (2014)
    PLoS Currents, 10.1371/currents.outbreaks.3c3477556808e44cf41d2511b21dc29f.
  72. 72
  73. 73
  74. 74
  75. 75
  76. 76
  77. 77
  78. 78
  79. 79

Decision letter

  1. Mark Jit
    Reviewing Editor; London School of Hygiene & Tropical Medicine, and Public Health England, United Kingdom

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your work entitled "Improving epidemiological modeling coordination for Ebola and other public health emergencies" for peer review at eLife. Your submission has been favorably evaluated by Prabhat Jha (Senior editor), and three reviewers, one of whom (Mark Jit) is a member of our Board of Reviewing Editors. Another of the reviewers, Marco Ajelli, has also agreed to reveal their identity.

The reviewers have discussed the reviews with one another and the Reviewing editor has drafted this decision to help you prepare a revised submission.

The reviewers all agreed that this is a timely, comprehensive and useful review of modelling studies that have described and/or forecasted the Ebola epidemic in West Africa. We all enjoyed reading the paper, and agreed that it made some important points about good modelling practice in general. In particular, we agree that data sharing and complete access to best available epidemiological data (especially patient databases) for all modelling groups is of paramount importance in the case of future outbreaks of infectious diseases.

We believe that the way the information and discussion was presented could be improved, and by doing so would enhance the influence that the paper may have. In particular:

1) More detail could be provided on how Ebola modelling was actually used to support decision making both internationally and for individual national governments. Many of the recommendations in the Discussion are predicated on the premise that the modelling results were actually useful to inform policy (or if they weren't, could be made useful by improving the modelling process). Some historical discussion on what happened in practice would help here. Currently only a single US-centric reference is provided.

2) The paper needs a proper Materials and methods section providing details about the search terms/strategy, inclusion/exclusion criteria and information extracted from the papers. Given that this is a review, compliance with PRISMA guidelines would normally be expected. If a non-standard approach to searching was used, this needs to be justified and the method explained.

3) Each section of the paper seems to contain a mixture of (i) data directly extracted from the reviewed papers, (ii) direct inferences from these data, (iii) more general discussion points around the inferences and (iv) broader discussion (e.g. recommendations for standardisation and funding for future outbreak response). While each section is helpful, the way the paper is written makes it difficult to distinguish the provenance of the different points made. For instance, there are statements like "Modelers welcomed the release of these data, but expended considerable effort curating them for analysis", "accurate epidemic forecasting for Ebola proved to be quite challenging, especially for long time horizons, because the behavior of people infected and at risk was unpredictable", "incomplete information from the field can make it difficult to determine if failure to end the epidemic reflects sub-optimal implementation of effective measures, or over-optimistic assumptions about their effectiveness" where it is unclear whether these are statements drawn from the reviewed papers, or based on the authors' own experiences.

Using a traditional separation into Results and Discussion may also help this as most scientists are used to the idea of drawing immediate inferences from data, and then suggesting broader implications of these inferences. The Discussion section may be lengthy and structured if necessary. The broader discussion around funding etc. which is not really an extrapolation from the data (however welcome the points may be) should probably be given a section of its own.

4) Table 2 shows a very large variation in the baseline scenario forecast, spanning orders of magnitude. The authors seem to suggest that this stems from different assumptions used about epidemic growth forecasting e.g. random mixing, incidence decay or spatial/sociodemographic structuring. It would be useful if you could explicitly structure the presentation of results to show whether these different assumptions had an effect on the outbreak size. This should at minimum be done in the visual presentation but if some sort of meta-analysis/meta-regression could help that would be even better.

5) Additional information about the spatial structure (if any) of the models reviewed would be useful. Spatial structure in these models ranges from homogeneous (non-spatial models) to detailed agent-based models such as the one developed by Merler et al. Lancet ID.

6) Some models employed single streams of data (e.g., all case counts by country) while other models employed multiple data streams (e.g., all cases and cases among health-care workers such as the multi-type branching process model by Drake et al.). This would be useful to highlight.

7) The paper makes the point that there are differences between the results of various modelling groups because of different methodologies used. However, in some cases modelling methodology has been improperly used, and this would be useful to highlight. In particular the following areas could be given attention:

a) Parameter estimation. In Box 1, you have underlined that the different models analyzed in this review have a remarkably different number of "unknown" parameters to be estimated. This represents a very important topic that could be given more space in this manuscript). For example:

A clear distinction should be made between estimated and imputed parameters. For instance the number of estimated parameters in Pandey et al. (2014) is 4, while the other parameters are somehow imputed from the literature. That said, still the 4 unknown parameters estimated in Pandey et al. (2014) are not univocally identifiable by using as input the dataset considered in that work.

In Rivers et al. (2014) about 10 (surely not all unequivocally identifiable) parameters have been estimated on the basis of case counts only.

b) Uncertainty. A good modelling practice (which is not always followed in the reviewed papers) is the assessment of the uncertainty due to i) the stochasticity of the infection transmission process, and ii) the uncertainty in parameter estimates. In particular, as regards the uncertainty in parameter estimates, methods providing estimates of the parameters distribution (e.g., MCMC) should be preferred with respect to methods providing point estimate of the parameter values only (e.g., MSE, ML). This would avoid (or at least limit) the spread in the community of biased findings never confirmed on the field or by subsequent studies (e.g., the estimate that the main driver of the Ebola epidemic was the transmission during traditional burials found by Pandey et al. (2014) which was based on a point estimate made through a MSE fitting procedure).

c) Model validation. Another major point is that most (though not all) the reviewed modelling studies lack in model validation against other epidemiologically relevant quantities (e.g., ETU admissions, number of individuals included in contact tracing). Validation against other quantities is a key indicator to understand whether the model is able to explain the reason of an observed pattern of infection spread or it is just able to reproduce case counts "by chance". In fact, although it is welcome to show that the model is in agreement with the next few incidence points after stopping model calibration, modelers should not limit model validation to that.

8) It would also be good to clarify that the reviewed modelling studies can be divided in two categories: the ones aimed at explaining the observed pattern, for instance by accounting for actual data on intervention measures, and those that do not provide any explanation for the observed pattern (e.g., the model in Fisman, Khoo and Tuite (2014)). In fact, the point here is not to fit some data points, but to understand the mechanisms behind the observed infection spread as their knowledge is key for predictions and preparedness.

9) The importance for modelling groups of having access to online epidemiological data in an easy to export format (e.g., CSV file) could be highlighted. In fact, the Ministries of Health of the three most affected countries provided PDF files only, some of which were images, thus implying remarkable efforts by modelers to readily use such datasets. Moreover, the case of Guinea was even worse as, at the beginning of the epidemic, there was not an official website of the Guinean Ministry of Health with their reports uploaded.

https://doi.org/10.7554/eLife.09186.015

Author response

1) More detail could be provided on how Ebola modelling was actually used to support decision making both internationally and for individual national governments. Many of the recommendations in the Discussion are predicated on the premise that the modelling results were actually useful to inform policy (or if they weren't, could be made useful by improving the modelling process). Some historical discussion on what happened in practice would help here. Currently only a single US-centric reference is provided.

We recast the manuscript as more of a standard scoping literature review, with less emphasis on anecdotal evidence of the practical utility of modeling during the outbreak. We mostly have only our personal experiences to draw on for such evidence; the revision draws much more heavily on the peer-reviewed literature (please see response to Comment 2 below).

2) The paper needs a proper Materials and methods section providing details about the search terms/strategy, inclusion/exclusion criteria and information extracted from the papers. Given that this is a review, compliance with PRISMA guidelines would normally be expected. If a non-standard approach to searching was used, this needs to be justified and the method explained.

We revised the manuscript as a PRISMA-compliant literature review.

3) Each section of the paper seems to contain a mixture of (i) data directly extracted from the reviewed papers, (ii) direct inferences from these data, (iii) more general discussion points around the inferences and (iv) broader discussion (e.g. recommendations for standardisation and funding for future outbreak response). While each section is helpful, the way the paper is written makes it difficult to distinguish the provenance of the different points made. For instance, there are statements like "Modelers welcomed the release of these data, but expended considerable effort curating them for analysis", "accurate epidemic forecasting for Ebola proved to be quite challenging, especially for long time horizons, because the behavior of people infected and at risk was unpredictable", "incomplete information from the field can make it difficult to determine if failure to end the epidemic reflects sub-optimal implementation of effective measures, or over-optimistic assumptions about their effectiveness" where it is unclear whether these are statements drawn from the reviewed papers, or based on the authors' own experiences. Using a traditional separation into Results and Discussion may also help this as most scientists are used to the idea of drawing immediate inferences from data, and then suggesting broader implications of these inferences. The Discussion section may be lengthy and structured if necessary. The broader discussion around funding etc. which is not really an extrapolation from the data (however welcome the points may be) should probably be given a section of its own.

We agree that the original organization was non-standard and confusing. The revision is structured as a standard research paper, with Introduction, Materials and methods, Results, and Discussion. The broader implications and recommendations are included in the Discussion.

4) Table 2 shows a very large variation in the baseline scenario forecast, spanning orders of magnitude. The authors seem to suggest that this stems from different assumptions used about epidemic growth forecasting e.g. random mixing, incidence decay or spatial/sociodemographic structuring. It would be useful if you could explicitly structure the presentation of results to show whether these different assumptions had an effect on the outbreak size. This should at minimum be done in the visual presentation but if some sort of meta-analysis/meta-regression could help that would be even better.

We replaced the table with a figure that identifies several factors that may have influenced forecasting results in a clearer and more compelling way. Figure 6 shows the association between more accurate forecasts and earlier forecasts, shorter time horizons, and approaches that constrained predicted epidemic growth (for which we suggest the term “dampening”). We also performed a regression to quantify these effects, reported in the Results.

5) Additional information about the spatial structure (if any) of the models reviewed would be useful. Spatial structure in these models ranges from homogeneous (non-spatial models) to detailed agent-based models such as the one developed by Merler et al. Lancet ID.

We identified modeling studies that incorporated spatial data (see Results, subsection “Data sources”).

6) Some models employed single streams of data (e.g., all case counts by country) while other models employed multiple data streams (e.g., all cases and cases among health-care workers such as the multi-type branching process model by Drake et al.). This would be useful to highlight.

We added two paragraphs in the Results section on sources of data (Results, subsection “Data sources”) and hope this addresses the spirit of the excellent comment, which we interpret as a call for more thorough characterization of the types of model input data.

7) The paper makes the point that there are differences between the results of various modelling groups because of different methodologies used. However, in some cases modelling methodology has been improperly used, and this would be useful to highlight. In particular the following areas could be given attention: a) Parameter estimation. In Box 1, you have underlined that the different models analyzed in this review have a remarkably different number of "unknown" parameters to be estimated. This represents a very important topic that could be given more space in this manuscript). For example: A clear distinction should be made between estimated and imputed parameters. For instance the number of estimated parameters in Pandey et al. (2014) is 4, while the other parameters are somehow imputed from the literature. That said, still the 4 unknown parameters estimated in Pandey et al. (2014) are not univocally identifiable by using as input the dataset considered in that work. In Rivers et al. (2014) about 10 (surely not all unequivocally identifiable) parameters have been estimated on the basis of case counts only.

We provided the number of parameters for models estimating R, and distinguished between estimated and imputed values (Supplementary file 3).

b) Uncertainty. A good modelling practice (which is not always followed in the reviewed papers) is the assessment of the uncertainty due to i) the stochasticity of the infection transmission process, and ii) the uncertainty in parameter estimates. In particular, as regards the uncertainty in parameter estimates, methods providing estimates of the parameters distribution (e.g., MCMC) should be preferred with respect to methods providing point estimate of the parameter values only (e.g., MSE, ML). This would avoid (or at least limit) the spread in the community of biased findings never confirmed on the field or by subsequent studies (e.g., the estimate that the main driver of the Ebola epidemic was the transmission during traditional burials found by Pandey et al. (2014) which was based on a point estimate made through a MSE fitting procedure).

We added a detailed analysis of how use of cumulative versus disaggregated input data, and deterministic versus stochastic approaches to parameter estimation, may have influenced the reported uncertainty of R estimates (see Results, subsection “Modeling results: R and forecasts” as well as Figure 4 and Figure 5).

c) Model validation. Another major point is that most (though not all) the reviewed modelling studies lack in model validation against other epidemiologically relevant quantities (e.g., ETU admissions, number of individuals included in contact tracing). Validation against other quantities is a key indicator to understand whether the model is able to explain the reason of an observed pattern of infection spread or it is just able to reproduce case counts "by chance". In fact, although it is welcome to show that the model is in agreement with the next few incidence points after stopping model calibration, modelers should not limit model validation to that.

We added this excellent point to the Discussion (paragraph four):

“Validation of incidence forecasts against other relevant data, such as hospital admissions and contacts identified, also could provide evidence that the assumptions are sound.”

8) It would also be good to clarify that the reviewed modelling studies can be divided in two categories: the ones aimed at explaining the observed pattern, for instance by accounting for actual data on intervention measures, and those that do not provide any explanation for the observed pattern (e.g., the model in Fisman, Khoo and Tuite (2014)). In fact, the point here is not to fit some data points, but to understand the mechanisms behind the observed infection spread as their knowledge is key for predictions and preparedness.

We added this distinction to our characterization of the models (see Results, subsection “Overview of modeling applications”):

“Of the 125 models reported across the studies, 74% included mechanistic assumptions about disease transmission (e.g., compartmental, agent-based, or phylogenetic models), while 26% were purely phenomenological (Supplementary file 2).”

9) The importance for modelling groups of having access to online epidemiological data in an easy to export format (e.g., CSV file) could be highlighted. In fact, the Ministries of Health of the three most affected countries provided PDF files only, some of which were images, thus implying remarkable efforts by modelers to readily use such datasets. Moreover, the case of Guinea was even worse as, at the beginning of the epidemic, there was not an official website of the Guinean Ministry of Health with their reports uploaded.

We’re grateful for the chance to address this very important point, and added the recommendation to the Discussion (paragraph seven):

“Ideally, data should be made available online in machine-readable form to facilitate use in analyses. Modelers and other analysts expended enormous effort during the EVD epidemic transcribing data posted online in PDF documents.”

https://doi.org/10.7554/eLife.09186.016

Article and author information

Author details

  1. Jean-Paul Chretien

    1. Department of Defense, Division of Integrated Biosurveillance, Armed Forces Health Surveillance Center, Silver Spring, United States
    Contribution
    J-PC, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article
    For correspondence
    1. JPChretien@gmail.com
    Competing interests
    The authors declare that no competing interests exist.
  2. Steven Riley

    1. MRC Centre for Outbreak Analysis and Modelling, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London, United Kingdom
    Contribution
    SR, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article
    Competing interests
    The authors declare that no competing interests exist.
  3. Dylan B George

    1. Department of Health and Human Services, Biomedical Advanced Research and Development Authority, Washington, United States
    Contribution
    DBG, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article
    Competing interests
    The authors declare that no competing interests exist.

Funding

No external funding was received for this work.

Acknowledgements

We thank the reviewers for excellent comments that improved the manuscript. The views expressed are those of the authors and do not necessarily reflect the views of any part of the US Government.

Reviewing Editor

  1. Mark Jit, Reviewing Editor, London School of Hygiene & Tropical Medicine, and Public Health England, United Kingdom

Publication history

  1. Received: June 2, 2015
  2. Accepted: November 19, 2015
  3. Accepted Manuscript published: December 8, 2015 (version 1)
  4. Version of Record published: February 16, 2016 (version 2)

Copyright

© 2015, Chretien et al

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 2,643
    Page views
  • 617
    Downloads
  • 19
    Citations

Article citation count generated by polling the highest count across the following sources: Crossref, Scopus, PubMed Central.

Comments

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Download citations (links to download the citations from this article in formats compatible with various reference manager tools)

Open citations (links to open the citations from this article in various online reference manager services)

Further reading

    1. Genomics and Evolutionary Biology
    2. Microbiology and Infectious Disease
    Kevin S Bonham et al.
    Research Article