Research Article

Mapping the endemicity and seasonality of clinical malaria for intervention targeting in Haiti using routine case data

Curtin University, Australia
Telethon Kids Institute, Perth Children’s Hospital, Australia
Clinton Health Access Initiative, United States
Tulane University School of Public Health and Tropical Medicine, United States
Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, United Kingdom
Swiss Tropical and Public Health Institute, Switzerland
Division of Global Health Protection, Centers for Disease Control and Prevention, United States
Programme National de Contrôle de la Malaria/MSPP, Haiti
Division of Parasitic Diseases and Malaria, Centers for Disease Control and Prevention, United States
Direction d’Epidémiologie de Laboratoire et de la Recherche, Haiti
Institute for Disease Modelling, United States

Jun 1, 2021

Open access

Abstract

Towards the goal of malaria elimination on Hispaniola, the National Malaria Control Program of Haiti and its international partner organisations are conducting a campaign of interventions targeted to high-risk communities prioritised through evidence-based planning. Here we present a key piece of this planning: an up-to-date, fine-scale endemicity map and seasonality profile for Haiti informed by monthly case counts from 771 health facilities reporting from across the country throughout the 6-year period from January 2014 to December 2019. To this end, a novel hierarchical Bayesian modelling framework was developed in which a latent, pixel-level incidence surface with spatio-temporal innovations is linked to the observed case data via a flexible catchment sub-model designed to account for the absence of data on case household locations. These maps have focussed the delivery of indoor residual spraying and focal mass drug administration in the Grand’Anse Department in South-Western Haiti.

Introduction

Malaria transmission in Haiti is endemic and poses a significant public health problem with a total of 8828 cases (presumed and confirmed) reported in 2019 (World Health Organization, 2019). However, in relative terms, transmission rates are low: blood stage prevalence of Plasmodium falciparum (Pf) is in many areas below 1% (Lucchi et al., 2014) and the dominant local vector (Anopheles albimanus) is inefficient (being primarily zoophilic and exophagic [Frederick et al., 2016]). Malaria elimination is a national priority and an ambition of the National Malaria Control Program of Haiti (or PNCM; abbreviated from its official name in French: Programme National de Contrôle de la Malaria). To this end, the PNCM has built a working strategy around improvements to the surveillance and management system operating nationally through the health facility and community health worker (CHW) network, and the delivery of information and interventions targeted at sub-national administrative regions hosting identified transmission foci (Boncy et al., 2015; Druetz et al., 2018). To precisely geo-locate transmission foci and develop an evidence-based risk stratification, the PNCM has collaborated with the Malaria Atlas Project (MAP) and partners. In this study, we describe an important component of this collaboration: the construction of a national-level endemicity map and seasonality profile informed by routine case reports from health facilities.

Recent years have seen great progress in the adoption of Bayesian methods for probabilistic map-making (known as model-based geostatistics; Diggle et al., 1998) among the infectious disease and global health research community (Bhatt et al., 2015; Osgood-Zimmerman et al., 2018; Zouré et al., 2014; Karagiannis-Voules et al., 2015). The standard form of this technique is an extension to the generalised linear model whereby geographic covariates based on high-resolution satellite imaging (e.g., land surface temperature; digital elevation; nighttime lights) are combined additively with a Gaussian process representing spatially correlated residuals. A suitable link function then provides a non-linear transformation to the mean of the presumed sampling distribution for the geo-located, point response data (e.g., prevalence; incidence; presence/absence), often geographically precise to the scale of individual villages and sometimes even households. In the case of malaria, these methods provide benchmark estimates of transmission intensity (World Health Organization, 2019; Weiss et al., 2019; Battle et al., 2019) for much of sub-Saharan Africa where (1) routine case reporting data have historically been highly incomplete and/or subject to problematic sources of bias (Rowe et al., 2009; Alegana et al., 2020) and (2) the prevalence of malaria is sufficiently high that national-level parasite surveys can readily be powered to resolve spatial variation (Alegana et al., 2017a). In low transmission settings such as Haiti, transmission typically becomes increasingly focalised, and the low prevalence of patent parasitaemia forces community parasite surveys towards very intensive (viz. expensive) sampling designs to achieve confident spatial stratifications. 
Spatio-temporal risk modelling from data deriving from a routine passive surveillance process, such as the reporting of health facility case counts, may thus be a more effective means of describing the heterogeneity of malaria in this setting.

There are many challenges to overcome to achieve accurate, fine-scale disease mapping from health facility case data. Foremost of these is that the case count from a given facility represents the aggregation of observable case incidence across all households within an area of unknown extent: the health facility catchment. Extension of the model-based geostatistics (MBG) framework requires the development of a sophisticated sub-model linking the fine-scale disease process with the aggregate data (Wilson and Wakefield, 2020; Taylor et al., 2018; Sturrock et al., 2014), including a representation of health facility choice and attendance (Duncan et al., 2016; Nelli et al., 2020). Further challenges include a lack of information regarding spatio-temporal variations in treatment seeking propensities across the studied communities (Alegana et al., 2017b; Alegana et al., 2012; Battle et al., 2016; Karyana et al., 2016) and in the diagnostic practices operating at each health facility (Bastiaens et al., 2014). Validation of model outputs from this class of ‘down-scaling’ models is also uniquely challenging; the hold-out of aggregate response data is of limited utility for testing fine-scale accuracy, since only ancillary point-level observations can overcome the potential for ‘ecological fallacy’ (Wakefield and Smith, 2016). Complementary to research in this direction is the development of survey methodologies and analysis strategies for alternative diagnostic technologies. For example, serological tools that quantify immune responses to particular malaria antigens can reveal whether or not an individual has ever carried a Pf parasite infection (Corran et al., 2007; Helb et al., 2015), effectively targeting a higher prevalence objective (i.e., lifetime exposure history instead of current infection status) to gain statistical power from lower sampling variance at the expense of temporal precision.
Data of this nature have been gathered in Haiti and used in various ways to assist with malaria risk stratification (Oviedo et al., 2020).

Here we present the results of a bespoke analysis designed to uncover the characteristic spatial pattern of contemporary malaria endemicity in Haiti and its spatio-temporal seasonality profile using a geostatistical model informed by routine case incidence reports at monthly cadence assembled from across the country over a 6-year period (2014–2019 inclusive). The methodological framework developed for this purpose is described in detail, and model validation against a school-based serological survey is also presented. Finally, we discuss the use of these maps to focus the delivery of indoor residual spraying (IRS) and mass drug administration in the Grand’Anse department in South-Western Haiti.

Results

Fine-scale endemicity surface

Our geostatistical model-based estimate for the contemporary spatial pattern of annual malaria endemicity in Haiti is displayed at 1 × 1 km resolution in Figure 1. Figure 1A shows our posterior geometric mean estimate of the clinical incidence rate in units of cases per 1000 person-years-observed (PYO), with the fitted data (representative case totals) illustrated by the scaled circles overlaid for those facilities with non-zero case counts. The corresponding clinical incidence surface (i.e., incidence rate multiplied by population) is shown in Figure 1B, and a summary of the model-based uncertainties (namely, the pixel-wise standard deviation in our predictions in the logarithm of the clinical incidence rate) is shown in Figure 1C. As described in the Materials and methods section below, the representative case totals against which the model was fitted were constructed algorithmically by a procedure designed to (1) clean the data of epidemic fluctuations, (2) impute missing months of data for facilities with reporting gaps, (3) standardise reports towards a diagnostic benchmark of diagnosis by rapid diagnostic test (RDT), and (4) de-trend earlier years of data towards 2019 transmission levels. Details of the spatial and spatio-temporal covariates, treatment seeking surface, and population map, which are leveraged towards resolution of the endemicity surface below the health facility catchment scale, are also provided in the Materials and methods section.

Figure 1

The contemporary spatial pattern of malaria endemicity in Haiti (2019) based on reported health facility case counts from 2014 to 2019.

(A) The (pointwise) posterior (geometric) mean of the clinical incidence rate of malaria in Haiti at 1 × 1 km resolution based on our model fit to ‘representative’ annual case totals constructed from the health facility dataset. The grey-shaded regions have zero mapped population density, so we do not predict malaria risk in those areas. The boundaries and names (in light capital letters) of the 10 administrative departments of Haiti are marked for reference, as is the location of the capital city, Port-au-Prince. (B) The (pointwise) posterior (geometric) mean of the clinical incidence count (total annual cases): this is the product of the risk surface in (A) with the population surface. (C) A visualisation of the model-based uncertainty in these fine-scale predictions, shown here in terms of the (pointwise) standard deviation in the logarithm of the predicted case incidence rate.
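The three pixel-level summaries in Figure 1 can be illustrated with a minimal sketch, assuming that posterior draws of the incidence rate (in cases per 1000 PYO) are available for a pixel. The function name, the draws, and the population value below are hypothetical and are not taken from the fitted model.

```python
import math

def summarise_pixel(rate_draws, population):
    """Summarise posterior draws of the incidence rate (cases per 1000 PYO)
    for a single pixel with a known (fixed) population."""
    logs = [math.log(r) for r in rate_draws]
    mean_log = sum(logs) / len(logs)
    # Geometric mean of the rate = exponential of the mean log-rate (panel A).
    geo_mean_rate = math.exp(mean_log)
    # Expected annual cases: the rate is per 1000 persons (panel B).
    geo_mean_count = geo_mean_rate * population / 1000.0
    # Standard deviation of the log-rate over the draws (panel C).
    sd_log = math.sqrt(sum((x - mean_log) ** 2 for x in logs) / len(logs))
    return geo_mean_rate, geo_mean_count, sd_log

# Hypothetical draws for one pixel holding 500 people.
rate, count, sd = summarise_pixel([1.0, 10.0, 100.0], population=500)
```

Because the population is treated as fixed within a pixel, the geometric mean of the case count is simply the geometric mean of the rate scaled by the population.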

Two additional visualisations of the inferred clinical incidence distribution in Haiti are provided in Figures 2 and 3. In Figure 2, we present exceedance and non-exceedance maps at the thresholds of 1 case per 1000 PYO and 1 case per 10,000 PYO, respectively; these illustrations summarise the posterior probability that the incidence rate in each pixel lies above (or, conversely, below) each threshold, and have been identified in previous work on disease mapping as useful summaries for policy-makers (Giorgi et al., 2018). In Figure 3, we illustrate the aggregate counts of the population-at-risk by department and commune using the same threshold as in our exceedance map; that is, the total number of individuals in each administrative unit estimated to live in areas subject to a case incidence rate above 1 case per 1000 PYO. The first administrative division of Haiti comprises 10 departments, and for reference, the names of these are marked (in light capital letters) in Figure 1A.

Figure 2

Exceedance and non-exceedance maps for clinical malaria incidence in Haiti (2019) at policy-relevant thresholds.

(A) The posterior probability that the clinical incidence rate exceeds 1 case per 1000 PYO in each pixel under our geostatistical model. (B) The posterior probability that the clinical incidence rate does not exceed 1 case per 10,000 PYO in each pixel under our geostatistical model.
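As a minimal sketch of how such exceedance summaries can be obtained from posterior samples, the probability for a pixel may be approximated by the fraction of draws falling above (or at or below) the threshold. The function names and the draws below are hypothetical; rates are expressed in cases per 1000 PYO, so 1 case per 10,000 PYO corresponds to 0.1.

```python
def exceedance_probability(rate_draws, threshold):
    """Fraction of posterior draws strictly above the threshold."""
    return sum(1 for r in rate_draws if r > threshold) / len(rate_draws)

def non_exceedance_probability(rate_draws, threshold):
    """Fraction of posterior draws at or below the threshold."""
    return sum(1 for r in rate_draws if r <= threshold) / len(rate_draws)

# Hypothetical posterior draws for one pixel (cases per 1000 PYO).
draws = [0.05, 0.4, 0.9, 1.2, 2.5]
p_above = exceedance_probability(draws, 1.0)      # threshold: 1 per 1000 PYO
p_below = non_exceedance_probability(draws, 0.1)  # threshold: 1 per 10,000 PYO
```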

Figure 3

Predicted population-at-risk of clinical malaria for Haiti (2019) by commune and department.

(A) The posterior median estimate of the number of individuals in each commune (the third largest sub-national administrative level in Haiti) living in areas subjected to a clinical incidence rate above 1 case per 1000 PYO. (B) The same but aggregated at the level of departments (the largest sub-national administrative level in Haiti).

These probabilistic maps of clinical incidence reveal a high degree of heterogeneity in the disease burden due to malaria in Haiti. Large areas of the country – in particular, in the northern departments of Nord-Ouest, Nord, and Nord-Est, and along the Chaîne de la Selle mountain range tracing the border of the Ouest and Sud-Est departments – are essentially malaria free with fewer than one in 10,000 individuals expected to experience clinical malaria each year. Yet there remain a number of high burden communities with clinical incidence rates 500 times greater than this benchmark. These high burden communities are located primarily along the coastline and rivers of the Tiburon peninsula containing the Grand’Anse, Sud, and Nippes departments, with populations-at-risk (defined as those living in an area of malaria incidence greater than one case per 1000 PYO) of 322,693 (95% CI: 280,707–372,057), 322,956 (95% CI: 202,462–392,047), and 108,077 (95% CI: 61,620–147,288), respectively. An additional area of lower but still substantial burden lies within the central river valley joining the Artibonite and Centre departments, with populations-at-risk of 174,766 (95% CI: 97,196–313,169) and 166,938 (95% CI: 95,070–281,816).
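The population-at-risk definition above lends itself to a short sketch: for each posterior draw, sum the population of every pixel in an administrative unit whose rate exceeds the threshold, then summarise the resulting totals by their median and quantiles. The code assumes joint posterior draws are stored per pixel; all names and numbers are hypothetical and do not reproduce the estimates quoted in the text.

```python
def population_at_risk(draws_by_pixel, population, threshold=1.0):
    """Per-draw population living in pixels whose rate exceeds the threshold.

    draws_by_pixel[i][d] is the rate (cases per 1000 PYO) in pixel i for draw d;
    population[i] is the number of residents of pixel i.
    """
    n_draws = len(draws_by_pixel[0])
    return [
        sum(pop for rates, pop in zip(draws_by_pixel, population)
            if rates[d] > threshold)
        for d in range(n_draws)
    ]

def quantile(values, q):
    """Quantile by linear interpolation between order statistics."""
    s = sorted(values)
    pos = q * (len(s) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(s) - 1)
    return s[lo] + (pos - lo) * (s[hi] - s[lo])

# Hypothetical unit with three pixels and four posterior draws.
rates = [[0.5, 1.5, 2.0, 0.8], [3.0, 2.0, 1.1, 4.0], [0.1, 0.2, 0.05, 1.2]]
pops = [100, 200, 50]
totals = population_at_risk(rates, pops)
median_par = quantile(totals, 0.5)
interval = (quantile(totals, 0.025), quantile(totals, 0.975))
```

Summing within each draw before taking quantiles keeps the spatial correlation between pixels, which is why the interval cannot be obtained by adding pixel-wise intervals.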

The broad credible intervals around the estimation of these populations-at-risk reflect in large part the systematic uncertainties of the de-trending, RDT-standardising, and imputation components of our model, which contribute a substantial variance to inference of the absolute clinical incidence rate, but less so to its relative spatial distribution. Inspection of the uncertainty summary in Figure 1C indicates regions of particularly high variance corresponding to (1) the Chaîne de la Selle mountain range and (2) patches along the borders of the Artibonite department and in Nord and Nord-Ouest departments. The explanation for the former is simply population sparsity (i.e., sampling noise) combined with the extreme elevation (i.e., covariate slope uncertainty); however, since this terrain is believed to be an unlikely habitat for the local Anopheles species (principally A. albimanus) (Frederick et al., 2016), it is probably well-classified as low risk. The explanation for the latter is the ongoing use of microscopic diagnosis at similar frequency to RDT diagnosis at a minority of health facilities in these areas (most used RDT diagnosis near-exclusively in 2019), which leads to a higher contribution here from uncertainty in our standardisation procedure. The accuracy of microscopic diagnosis in Haiti has previously been characterised as inadequate (Landman et al., 2015; Weppelmann et al., 2018), hence the importance of attempting to adjust statistically for diagnostic type.

The estimation of fine-scale spatial patterns below the extent of health facility catchments is driven within our model by the suite of ancillary environmental covariates. The adopted model structure treats these as linear predictors having slopes that vary spatially with a certain degree of smoothness (as described in detail in the Materials and methods section). The most important covariates under the fitted model for the annual malaria incidence rate are highlighted in Figure 4, which shows the dominant positive and negative covariate in each pixel. It is interesting to note that of the four most important covariates over the entire country, three are ‘topological’ in nature – elevation, accessibility, and road presence/absence – and only one is climatic (potential evapotranspiration). However, it is essential not to interpret these results as indicative of importance in a causal sense; Figure 4 is presented purely to provide insight into the structure of the fitted regression model. A method for ranking variables in a causal framework has recently been applied to the modelling of malaria case count data from health facilities in Madagascar and the results were shown to be very different to a regression-based variable selection method (Arambepola et al., 2020). Note also that the spatially varying slope parameter fitted to each covariate may even change sign in different parts of the country under our modelling framework. For instance, a negative slope is assigned to penalise high elevations in the Chaîne de la Selle mountain range of the Ouest and Sud-Est departments, while in Grand’Anse, a positive slope is assigned to boost the predicted incidence along the (low-lying) coastal fringe.

Figure 4

The dominant covariates in fine-scale prediction of the case incidence rate for Haiti (2019).

The colour of each pixel corresponds to the covariate with (A) greatest positive impact (in terms of increasing the local estimate of malaria risk) and (B) greatest negative impact (in terms of decreasing the local estimate of malaria risk), upon the predicted incidence rate in accordance with the legend. Of the 12 total spatial covariates offered to the model, only the eight shown here appear among the most dominant in at least one pixel.
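One way to picture how a dominant covariate could be identified in a pixel is sketched below. It assumes that the impact of a covariate is measured by the product of its local slope and its (standardised) value in the linear predictor; this definition, the function name, and the numbers are illustrative assumptions rather than the procedure used to produce Figure 4.

```python
def dominant_covariates(slopes, values):
    """Return (most positive, most negative) covariate names for one pixel.

    slopes[name] is the local slope and values[name] the standardised covariate
    value; the contribution to the linear predictor is their product (assumed).
    """
    contrib = {name: slopes[name] * values[name] for name in slopes}
    top = max(contrib, key=contrib.get)
    bottom = min(contrib, key=contrib.get)
    # Report None when no covariate pushes the prediction in that direction.
    return (top if contrib[top] > 0 else None,
            bottom if contrib[bottom] < 0 else None)

# Hypothetical pixel at high elevation with good road access.
slopes = {"elevation": -0.8, "accessibility": 0.3, "pet": 0.5}
values = {"elevation": 1.5, "accessibility": 2.0, "pet": -0.4}
positive, negative = dominant_covariates(slopes, values)
```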

Health facility catchments

The structure of the catchment model used here (see the Materials and methods section for details) allows for patients in any given location to split their attendance between multiple neighbouring facilities according to the relative travel time to each (fixed input) and a relative attractiveness weight (free parameter learnt during fitting). The resulting catchments may thus have overlapping boundaries, which avoids unrealistic structural effects – such as a systematic under-estimation of city health facility patient populations when commuters may otherwise be erroneously assigned to exclusively visit suburban facilities – but can be a challenge for visualisation. In Figure 5, we present one type of visualisation of the fitted catchment model: a flow diagram indicating the inferred movement paths connecting the latent (unobserved) case household locations to the reported case counts at health facilities. The accumulated number of cases on each path is represented by a varying line thickness; facilities for which no malaria cases are reported are also indicated without attached flows, for reference. Aside from illustrating the degree of overlap between catchments inherent to our chosen model structure, the visualisations in Figure 5 highlight the role of the travel-time distances (based on the human movement friction surface of Weiss et al., 2018) in shaping these catchments – the connections between inferred case locations and their attended health facilities directly reflect the network of roads linking the settlements of this region.
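The attendance split described above can be sketched as follows. The functional form is an assumption made purely for illustration: each facility receives a score equal to its attractiveness weight multiplied by an exponential decay in travel time, and scores are normalised to shares. The decay scale `decay_minutes` and all names are hypothetical; the form actually used is given in the Materials and methods section.

```python
import math

def attendance_shares(travel_minutes, attractiveness, decay_minutes=30.0):
    """Share of a pixel's patients attending each facility (illustrative form)."""
    scores = [w * math.exp(-t / decay_minutes)
              for t, w in zip(travel_minutes, attractiveness)]
    total = sum(scores)
    return [s / total for s in scores]

def expected_facility_cases(pixel_cases, travel, attractiveness):
    """Accumulate the expected case count at each facility over all pixels.

    pixel_cases[i] is the latent case count in pixel i and travel[i][j] the
    travel time in minutes from pixel i to facility j.
    """
    totals = [0.0] * len(attractiveness)
    for cases, times in zip(pixel_cases, travel):
        for j, share in enumerate(attendance_shares(times, attractiveness)):
            totals[j] += cases * share
    return totals

# Two hypothetical pixels and two facilities; the second is twice as attractive.
facility_cases = expected_facility_cases(
    pixel_cases=[10.0, 4.0],
    travel=[[10.0, 40.0], [30.0, 30.0]],
    attractiveness=[1.0, 2.0],
)
```

Because every pixel distributes all of its cases, the facility totals sum to the total latent case count, and catchments overlap wherever more than one facility receives a non-negligible share.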

Figure 5

Flow paths from predicted malaria case household locations to health facilities based on our catchment model for treatment seeking in Haiti (2019).

Each health facility is assigned a random colour and the flow of patients from households to health facilities predicted under our posterior mean catchment model and case incidence surface is illustrated by the colour-matched (semi-transparent) lines of logarithmically proportional thickness, for regions of interest: (A) in Grand’Anse (tip of the Tiburon peninsula); (B) along the Artibonite River in the central valley; and (C) in Port-au-Prince and its surrounding settlements. Note that the flows shown here are modelled at a discretised 1 × 1 km resolution, far coarser than that of the hill-shading relief and coastline shapefiles used in plotting; no journeys by sea are allowed in our least cost path model.

Seasonality profile

Our model-based estimates of the fine-scale spatial pattern of month-specific variations in the incidence rate of clinical malaria in Haiti are illustrated in Figure 6. For each calendar month, we present the offset (from the annual mean) in the logarithm of the clinical incidence rate surface at 1 × 1 km resolution, as fit to the monthly case counts at each health facility in our representative dataset. The dominant signal is a uniphasic seasonal profile evident across most of the country, and most notably the central valley, with cases rising during October–November to a peak in December–January and then declining rapidly from February to a low during April–May. A small number of locations – notably one hotspot on the northern coast of Grand’Anse near the town of Jeremie – show a tentative suggestion of a biphasic character with a smaller, second peak in June. However, it is possible that this is an artefact of the recent epidemic outbreak in this area that has not been entirely resolved by our cleaning and imputation procedure for constructing a dataset of representative (endemic character) case counts.
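The monthly offset can be illustrated with a minimal sketch, assuming that the annual reference level is the mean of the twelve monthly log-rates, so that the offsets sum to zero. That convention, the function name, and the rates below are assumptions chosen to mimic the uniphasic profile described above.

```python
import math

def monthly_log_offsets(monthly_rates):
    """Offset of each month's log-rate from the mean of the monthly log-rates."""
    logs = [math.log(r) for r in monthly_rates]
    annual_mean_log = sum(logs) / len(logs)
    return [x - annual_mean_log for x in logs]

# Hypothetical rates for January..December (cases per 1000 PYO per month),
# peaking in December-January and lowest in April-May.
rates = [3.0, 2.0, 1.2, 0.6, 0.6, 0.9, 1.0, 1.0, 1.2, 1.8, 2.4, 3.0]
offsets = monthly_log_offsets(rates)
peak_month = offsets.index(max(offsets)) + 1    # first month attaining the peak
trough_month = offsets.index(min(offsets)) + 1  # first month attaining the low
```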

Figure 6

The typical fine-scale spatial pattern of month-specific variations in the incidence of clinical malaria in Haiti based on reported health facility case counts from 2014 to 2019.

The (pointwise) posterior mean of the seasonal effect on the logarithm of the predicted case incidence rate is illustrated for each calendar month based on the third stage of our inference procedure: the spatio-temporal geostatistical model with fixed catchment sub-model fitted to the representative monthly case counts constructed at health facility level.

The role of the spatio-temporal covariates that help to shape the estimated seasonality patterns in our geospatial regression model (see the Materials and methods section for details) is explored in Figure 7, in which the covariate having the greatest positive influence on the monthly offset in any month is indicated in Figure 7A, while the covariate having the greatest negative influence is indicated in Figure 7B. In the high incidence areas of the central valley and the Grand’Anse, the most important positive covariate in a predictive sense is the enhanced vegetation index (EVI) lagged by 2 months, while the most important negative covariate is the land surface temperature (LST) lagged by 1 month in the former, and the EVI unlagged in the latter. Again, note that although these covariates may plausibly reflect physical drivers of malaria incidence in Haiti, we caution against this direct interpretation as the fitted model is designed from a predictive framework rather than one of causal inference. Moreover, all of these climatic and vegetation cover covariates are themselves highly co-linear, so upon exclusion of one there is typically another able to be identified as providing an explanatory contribution of similar magnitude within the regression model.

Figure 7

The dominant covariates in fine-scale prediction of month-specific variations in the incidence of clinical malaria in Haiti (2014–2019).

In each pixel, the colour key indicates the covariate having the greatest (A) positive or (B) negative influence on the monthly incidence offset in any month. The lags denoted here are in units of months prior.

Validation against a serology dataset

The empirical spatial pattern of malaria exposure history amongst children in the Integrated Transmission Assessment Surveys (TAS) for lymphatic filariasis and malaria (Knipes et al., 2017) is illustrated in Figure 8A. The TAS program used serological markers of long-term malaria exposure – apical membrane antigen (AMA) and merozoite surface protein (MSP) antigenic responses – to characterise malaria endemicity in school-aged children. The symbols plotting the TAS results in Figure 8 are colour-coded with respect to a (non-geospatial) Bayesian estimate of the median underlying sero-prevalence (positivity by either antigenic response, or both) at the location of each school surveyed. These estimates may be compared visually in a geographic context against our predicted clinical incidence rate surface in Figure 1A. In Panel B, we present an alternative graphical comparison via a scatter plot (magenta ‘crosses’) with the uncertainties in each metric shown as error bars. The median-to-median correlation between these two metrics of transmission intensity has a Pearson coefficient of 0.426 (95% CI: 0.353–0.499), reflecting a positive but noisy relationship. Of course, since the TAS sample size per school is generally small (<30), the credible intervals around these point estimates of sero-prevalence are correspondingly broad. When considered in addition to the uncertainties surrounding the fine-scale predictions from our health facility-based model, it is likely that much of the width in this scatter plot derives from random (sampling) noise. Aggregating the TAS sites in bins of similar predicted clinical incidence rate reveals a much tighter relationship, for which a simple linear regression of the logit of sero-prevalence against the (natural) logarithm of incidence yields a slope of 0.34 (i.e., $\operatorname{logit}\, p_{\mathrm{AMA\,or\,MSP}} \propto 0.34 \times \log I$).
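The binned regression can be sketched as an ordinary least-squares fit of logit sero-prevalence on log incidence. The data below are synthetic and constructed to lie exactly on a line of slope 0.34 with an arbitrary intercept of −3; they are not the TAS data, and the function names are hypothetical.

```python
import math

def logit(p):
    """Log-odds of a proportion p in (0, 1)."""
    return math.log(p / (1.0 - p))

def ols_slope_intercept(x, y):
    """Slope and intercept of the least-squares line y = intercept + slope * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Synthetic bins: incidence in cases per 1000 PYO, sero-prevalence generated
# from logit(p) = -3 + 0.34 * log(I) purely for illustration.
log_incidence = [math.log(i) for i in (0.1, 1.0, 10.0, 50.0)]
seroprev = [1.0 / (1.0 + math.exp(-(-3.0 + 0.34 * x))) for x in log_incidence]
slope, intercept = ols_slope_intercept(log_incidence, [logit(p) for p in seroprev])
```

On this scale a slope of 0.34 means that a ten-fold increase in incidence raises the log-odds of sero-positivity by about 0.34 × ln 10 ≈ 0.78.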

Figure 8

Model validation against the estimated proportion of school children testing positive to serological markers of past malaria exposure in the TAS dataset (2014–2016).

(A) The spatial location of each school sampled in the TAS study is illustrated here with the colour of the plotting symbol (filled circle), indicating the estimated sero-prevalence at that site. In this case, sero-positivity is defined as being classified positive for either the MSP antigenic response, the AMA antigenic response, or both. (B) Comparison of the estimated sero-prevalence (using a simple Bayesian beta-binomial model) from the TAS schools data against the predicted case incidence rate from our full geospatial model fit to the representative health facility-level data. The 95% credible interval in each metric for each school location is illustrated by the purple lines. The median estimated sero-prevalence for sites grouped in a series of bins by predicted case incidence is overlaid in blue, along with the associated line of best fit.
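A simple beta-binomial estimate of school-level sero-prevalence can be sketched as below. The uniform Beta(1, 1) prior is an assumption (the prior is not stated in this section), and the quantiles are approximated by Monte Carlo draws with a fixed seed; the function name and the counts are hypothetical.

```python
import random

def seroprevalence_posterior(n_positive, n_tested, n_draws=20000, seed=1):
    """Posterior median and 95% credible interval for sero-prevalence.

    Assumes a Beta(1, 1) prior, giving a Beta(1 + k, 1 + n - k) posterior.
    """
    rng = random.Random(seed)
    a = 1 + n_positive
    b = 1 + n_tested - n_positive
    draws = sorted(rng.betavariate(a, b) for _ in range(n_draws))

    def pick(q):
        # Nearest lower order statistic; adequate for a large number of draws.
        return draws[int(q * (n_draws - 1))]

    return pick(0.5), pick(0.025), pick(0.975)

# Hypothetical school: 3 sero-positive children out of 25 tested.
median, lower, upper = seroprevalence_posterior(3, 25)
```

With fewer than 30 children per school the interval is wide, which is consistent with the broad error bars noted for Panel B.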

Visual comparison of Figure 8A with Figure 1A indicates that the most interesting discrepancy concerns the presence of some moderate (and in one case, high) sero-prevalence schools in the Nord department, where our predicted clinical incidence rate from the health facility dataset is everywhere rather low. We suspect that this is a reflection of a strong decline in transmission intensity in this region over the period 2014–2016, seen in the rapid decline in cases reported and hence the strong de-trending in our model towards the 2019 case counts – although it is not possible to unambiguously distinguish changes to the health reporting system from genuine transmission intensity trends from the available data. An important point raised by this comparison is that the incidence surfaces presented here should be understood as reflecting the current state of transmission subject to the recent history of anti-malarial interventions, rather than as a reflection of the inherent environmental receptivity under a zero intervention scenario.

Comparison against models with naïve imputation and naïve catchment structure

A minimal alternative approach to risk mapping from routine case data that has been explored in the past is to impute missing case reports using the empirical mean over non-missing months (on a per-facility basis) and to attribute cases from each facility to the households for which that facility is the nearest treatment option, ignoring differences in the diagnostic method (microscopy vs RDT). Following this procedure for the case data from 2019, we recover the risk map shown in Figure 9A. The effective ‘resolution’ of this map is to the size of each naïve catchment area, which tends to be smaller in towns and cities and larger in remote, rural areas. Compared with our preferred model-based risk map (Figure 1), the result here would suggest that transmission in the Grand’Anse is dominated by hotspots sharply concentrated on the townships of Abricots, Bonbon, Anse-d’Hainault, and Les Irois, with transmission intensity above 100 cases per 1000 PYO in each, rather than being spread more evenly throughout the coastal settlements and rural communities of this peninsula. The correlation coefficient of this risk map against the TAS sero-prevalence data is just 0.301 (95% CI: 0.215–0.369), compared against 0.426 (95% CI: 0.353–0.499) for our preferred model. A second version of the naïve catchment risk map is shown in Figure 9B, this time after using our Step one and Step two models (see Materials and methods) for de-trending, imputing, and diagnostic correcting the raw case data. Overall, this adjustment improves the correlation against the TAS sero-prevalences to 0.331 (95% CI: 0.257–0.398).
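The naïve procedure described above is simple enough to sketch directly: missing months are filled with the facility's own mean over reported months, each pixel is assigned wholly to its nearest facility, and the annual cases are divided by the resulting catchment population. The function names and example numbers are hypothetical, and travel time is assumed here as the measure of nearness.

```python
def impute_with_facility_mean(monthly_counts):
    """Replace missing months (None) with the mean of the reported months."""
    observed = [c for c in monthly_counts if c is not None]
    if not observed:
        return list(monthly_counts)  # nothing to base an imputation on
    mean = sum(observed) / len(observed)
    return [mean if c is None else c for c in monthly_counts]

def nearest_facility(travel_times):
    """Index of the facility with the smallest travel time from a pixel."""
    return min(range(len(travel_times)), key=travel_times.__getitem__)

def naive_catchment_rates(annual_cases, pixel_pops, pixel_travel):
    """Cases per 1000 PYO for each facility's nearest-facility catchment."""
    catchment_pop = [0.0] * len(annual_cases)
    for pop, times in zip(pixel_pops, pixel_travel):
        catchment_pop[nearest_facility(times)] += pop
    return [1000.0 * c / p if p > 0 else None
            for c, p in zip(annual_cases, catchment_pop)]

# Hypothetical facility with three missing months, plus a second facility.
counts = [2, None, 4, 0, None, 6, 3, 3, None, 1, 5, 0]
annual_first = sum(impute_with_facility_mean(counts))
rates = naive_catchment_rates(
    annual_cases=[annual_first, 10.0],
    pixel_pops=[1000, 3000, 500],
    pixel_travel=[[5, 20], [15, 10], [8, 9]],
)
```

Every pixel in a catchment receives the same rate under this scheme, which is why the effective resolution of the resulting map is the catchment rather than the pixel.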

Figure 9

Risk stratification maps for 2019 produced under a naïve catchment model in which patients attend only their nearest facility.

(A) The raw case data is used with a crude imputation by way of per-facility empirical means excluding missing months; (B) the case data has now been imputed, de-trended, and microscopy-to-RDT converted.

Discussion

The fine-scale mapping of malaria incidence and its seasonality profile in Haiti achieved through our fitting of a Bayesian geospatial regression framework with catchment sub-model to the 2014–2019 health facility case reports brings a greatly refined understanding of the elimination challenge on this side of Hispaniola. We see that the communities suffering from the highest annual average rates of clinical malaria (above 50 cases per 1000 PYO) in 2019 are those along the coastline and valleys of the Grand’Anse and Sud departments. Additional pockets of low-to-moderate endemicity (1–10 cases per 1000 PYO) are located in the central valley spanning the Artibonite and Centre departments, in some coastal communities of the Nippes, Sud-Est, and Nord-Est departments, and surrounding Port-au-Prince in the Ouest department; the latter accounting for a substantial proportion of the total cases each year owing to the size of the population in this area. The Nord and Nord-Est departments have lower incidence rates (below 1 case per 1000 PYO), and some areas can yet be confidently predicted as extremely low (below 1 case per 10,000 PYO). Against these broad variations between departments, there exists substantial heterogeneity in the clinical incidence rate of malaria at rather small scales within departments, which in a predictive sense can be explained within our modelling approach by differences in accessibility, elevation, road presence/absence, and potential evapotranspiration. The clinical incidence of malaria in Haiti is also highly seasonal with a strong uniphasic seasonality pattern at maximum during December–January and minimum during April–May.

These results have already proven useful for planning a number of the public health interventions that will be required to achieve malaria elimination in Haiti. These maps have been used to derive epidemiologically relevant operational units for targeting packages of interventions in five priority communes in Grand’Anse. Operational units were ranked by the strength of transmission (quantified after further post-processing in terms of the reproduction number under control, R_c) to help determine those that would receive targeted mass drug administration (tMDA) together with IRS in 2018, and are again being used in planning this year (2020). Serological data were subsequently used to refine this ordering, and an eventual re-definition of operational unit boundaries was made to follow natural logistical divisions such as rivers and major roads. For planning purposes such as these, it is clear that these fine-scale probabilistic maps offer a more nuanced stratification than the categorical risk maps at the commune level produced, e.g., for the World Malaria Report (World Health Organization, 2019), by direct summary of the available case counts divided by areal population totals – and one that is far superior to the dichotomous risk maps based solely on elevation (at a threshold of 500 m) that have, anecdotally, been used in past decision making.

In this context, it is important to again emphasise certain caveats of our analysis, which point towards topics for future research and data gathering. Of particular concern is the lack of information regarding potential spatial variations in treatment seeking behaviour across the country. A recent study of community attitudes towards malaria treatment (Druetz et al., 2018) confirmed that some Haitians will seek care for febrile illness outside the national health care system, such as at a traditional healer or at a private health care provider not reporting to the national network. At present, we have attempted only to adjust for the possible effect of travel-time distance on the absolute treatment seeking propensity, using a model calibrated to an African setting (Alegana et al., 2012); clearly, this deserves refinement if local data can be gathered through a dedicated survey questionnaire.

It is known that seasonal migrations of agricultural workers or other large itinerant groups have a potential to introduce spurious effects into modelled case incidence rates unless explicitly accounted for via a dynamic population denominator (Zu Erbach-Schoenberg et al., 2016). As high-fidelity human movement data is not currently available for Haiti, we cannot yet model this aspect directly and can only hope that a substantial proportion of any such unmodelled variation is absorbed implicitly within the random effects terms of our statistical model. Interestingly, an earlier study in which a regression model was built to predict short-term human movement from internal migration data (Sorichetta et al., 2016) has indicated that in relative terms the Ouest department containing Port-au-Prince is more strongly connected to all other departments than any of those departments are connected with each other independently, although the magnitude of this connectivity in absolute terms remains unknown. It is worth emphasising here that our model aims only to map where people at risk of malaria illness reside, which may not necessarily be the same as where they contracted their infection. The higher risk communities identified in our modelling are primarily in remote and rural areas, in which people are unlikely to regularly commute long distances from their place of residence for work or leisure. However, in terms of absolute case numbers it was shown in Figure 1B that there are a substantial number of people in the vicinity of Port-au-Prince presenting to health facilities with clinical malaria. Whether these infections were contracted locally or elsewhere – and what role seasonal migration and/or travel plays in sustaining transmission wherever it occurs – is not informed by the present dataset.

A final caveat on our analysis concerns the limitations of the catchment sub-model. Our introduction of a gravity-style representation with overlapping catchments based on travel-time distances constitutes a substantial effort towards constructing a realistic representation of patient behaviours, especially in comparison with the vanilla alternatives of Euclidean (‘as-the-crow-flies’) distances and/or hard (tessellation-style) boundaries – yet there is no doubt that our model is still a profound simplification. It remains for future research to establish how much of a limitation this is in terms of our ability to accurately downscale health facility data to pixel level – though at least our comparison against the TAS serological data suggests we are on the right track – and whether there are any simple improvements to the model structure that should be made (such as a refinement of the coefficient of preference decay on travel-time distance, currently fixed at −2; i.e., inverse-square decay). Already we have begun work (van den Hoogen et al., in prep.) to explore risk mapping under more complex catchment sub-models in a focus region of the Artibonite department where partial case tracing of febrile patients (from health facility to patient household location) has been performed. The catchments we have begun to reconstruct in the Artibonite case tracing study do confirm a general dependence on travel time, but they also reveal instances in which clusters of patients travel far beyond their nearest facility to seek care. We do not have data on specific factors, which might help to explain this behaviour, though anecdotal examples that appear in the literature suggest possible explanations, e.g., lower income patients may be avoiding a facility that illegally charges for anti-malarial medication (Druetz et al., 2018).

Another direction we are exploring to further refine our risk maps is the inclusion of information from alternative malaria metrics such as the sero-positivity rate, used here only for model validation. Important to note is that, although our current validation model treats the underlying sero-prevalence at each site as an independent random variable, one can readily apply the same principles of model-based geostatistics to refine sero-prevalence estimates via spatial covariates and spatially correlated noise models (Ashton et al., 2015). While we have not taken this step here to avoid any artificial shrinkage of our validation set towards the health facility dataset through a common model structure with the same covariates, it is easily done. More challenging is to develop appropriate methods for the simultaneous modelling of multiple data types. Indeed, this is an active topic of research within geospatial statistics – both in regard to linking point data with areal data (Richardson and Best, 2003; Moraga et al., 2017) and in regard to sharing information between multiple disease metrics (for the same disease or even different diseases [Held et al., 2005]) – and is a direction we are pursuing for further refinement of our incidence maps in the Grand’Anse department (Amratia et al., in prep.).

In conclusion, the analyses and results of this paper demonstrate that point of care case counts can be used to generate programmatically useful maps of clinical incidence rates providing fine-scale risk stratifications. A spatio-temporal seasonality profile can also be determined when data are available at monthly intervals. This information can be used to refine the spatial and/or temporal targeting of high-burden areas for anti-malarial interventions such as tMDA, IRS, and long-lasting insecticidal net delivery. These outputs are readily updatable as additional facility data are made available and will be valuable in defining residual transmission foci as the final stages of elimination near.

Materials and methods

Response data and covariates

Our primary dataset consisted of monthly counts of confirmed malaria cases – i.e., patients seeking care for febrile illness with patent parasitaemia detected via RDT or microscopy – for each of 771 geo-located health facilities reporting at least once in 2019. These 771 facilities are a sub-set of the 1191 facilities in a master reporting file compiled by the PNCM of Haiti with assistance from the Clinton Health Access Initiative (CHAI); those that did not report on malaria cases or testing at least once in 2019 were assumed here either to have closed or to no longer offer malaria test and treat services to febrile outpatients. Since 2016, CHWs attached to certain health facilities have been proactively seeking cases in the local community, and we add these cases to the reported totals of ordinary patient visits for those facilities. The reporting period covered by this dataset begins with January 2014 and finishes with December 2019 and the overall completeness of reporting among the sub-set of 771 facilities is 77.5%, with 537 facilities reporting in at least 61 of the 72 months (69.6%). It is believed that all health facilities operating in Haiti through 2019 are included in the master reporting file, although it is not certain that all facilities excluded from the sub-set studied in the present analysis represent genuine closures as opposed to circumstances of sudden, sustained reporting failure, or conversely that all included facilities were indeed open through all of 2019. A number of the health facilities in our dataset lie within the same city or village, being separated by a distance comparable to, or less than, the target resolution (1 × 1 km) of our mapping. After imputing missing monthly case reports for all 771 facilities reporting in 2019 (as described in stage one of our inference procedure below), we reduce the subsequent model complexity by aggregating nearly co-located facilities using a hierarchical clustering algorithm. 
In this way, a total of 450 ‘aggregate pseudo-facilities’ are formed which we will simply continue to refer to as ‘health facilities’.
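
The aggregation of nearly co-located facilities can be illustrated with a minimal Python sketch. The text does not state which linkage rule or distance cut-off was used, so the single-linkage rule, the 1 km threshold (chosen here to match the target map resolution), the function names, and the toy coordinates below are all assumptions made for illustration only.

```python
import math

def cluster_facilities(coords_km, threshold_km=1.0):
    """Single-linkage clustering cut at a distance threshold (assumed rule).

    Facilities connected by a chain of pairwise distances <= threshold_km
    end up in the same cluster. Returns one integer label per facility.
    """
    n = len(coords_km)
    parent = list(range(n))

    def find(a):
        # Follow parent pointers to the cluster root, compressing the path.
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a

    for a in range(n):
        for b in range(a + 1, n):
            if math.dist(coords_km[a], coords_km[b]) <= threshold_km:
                root_a, root_b = find(a), find(b)
                if root_a != root_b:
                    parent[root_b] = root_a

    roots = {}
    labels = []
    for a in range(n):
        labels.append(roots.setdefault(find(a), len(roots)))
    return labels

def aggregate_cases(labels, monthly_cases):
    """Sum monthly case counts over the facilities in each cluster."""
    n_months = len(monthly_cases[0])
    totals = [[0] * n_months for _ in range(max(labels) + 1)]
    for label, row in zip(labels, monthly_cases):
        for t, count in enumerate(row):
            totals[label][t] += count
    return totals

# Hypothetical projected coordinates (km) and two months of case counts.
coords = [(0.0, 0.0), (0.4, 0.3), (5.0, 5.0), (5.2, 5.1), (20.0, 0.0)]
cases = [[1, 2], [3, 0], [0, 1], [2, 2], [5, 5]]
labels = cluster_facilities(coords, threshold_km=1.0)
pseudo_facility_cases = aggregate_cases(labels, cases)
```

In this toy example the five facilities collapse to three pseudo-facilities, with case counts summed within each group.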

A suite of high-resolution satellite imaging products was introduced as covariates within our statistical modelling, namely: accessibility to cities (Weiss et al., 2018), aridity index (Trabucco and Zomer, 2009), distance to water (bespoke), elevation (Farr et al., 2007), EVI (Huete et al., 1999), land cover classification (forest, grass savannah, urban/barren, and woody savannah; Friedl et al., 2010), LST (day and day–night difference; Wan et al., 2002), OpenStreetMap (2016 road presence/absence; Haklay and Weber, 2008), potential evapotranspiration (Trabucco and Zomer, 2009), slope (Farr et al., 2007), tasselled-cap brightness (TCB; Kauth and Thomas, 1976), tasselled-cap wetness (TCW; Kauth and Thomas, 1976), and topographic wetness index (Farr et al., 2007). All products were downloaded from their respective online repositories, gap-filled (where necessary), and registered to a common grid. The EVI, LST, TCB, and TCW products were summarised to 2014–2019 annual averages and average monthly offsets, while the remainder were used as static covariates. The High Resolution Settlement Layer from the Connectivity Lab at Facebook (URL: https://www.ciesin.columbia.edu/data/hrsl/) provides the population denominator for our model, and the Weiss et al., 2018 friction surface is used to build travel-time maps from each 1 × 1 km pixel to each health facility with assistance from the malariaAtlas R package (Pfeffer et al., 2018). Serological prevalence observations (AMA and MSP antigens) for 24,514 children aged 6 and 7 years in 820 schools from the Integrated TAS datasets (Knipes et al., 2017), sampled from across Haiti between November 2014 and August 2016, were used for validation of the spatial trends revealed in the annual incidence outputs.

Stepwise modelling approach designed for robustness against unmodelled sources of noise

Case counts of clinical malaria from health facilities in low-resource settings have traditionally been considered an unreliable and challenging source of data with which to map risk and/or evaluate the efficacy of interventions, owing to spatial and temporal variabilities in reporting completeness and accuracy, testing rates and methods, access to care, and treatment seeking behaviours (Alegana et al., 2020; Afrane et al., 2013; Oduro et al., 2016; Ohiri et al., 2016). In Haiti in particular, the specificity of local microscopy-based diagnosis has been shown to be sub-optimal (Landman et al., 2015; Weppelmann et al., 2018) and increasing the proportion of diagnoses made by RDT has been a key pillar of recent reforms to case management (Boncy et al., 2015) – the impact of which is clearly seen in the reported health facility case counts (Weppelmann et al., 2018). Fortunately, in this study, we have access to data on the relative rates of RDT and microscopy testing by health facility and month, allowing for explicit modelling of this previously identified systematic effect. Spatial variation in access to care is another systematic effect that we attempt to model given our access to a high-resolution travel-time covariate (Weiss et al., 2018). However, we must acknowledge that there are likely many other important confounding factors about which we have very little supporting data. Likewise, the spatio-temporal dynamics of epidemic fluctuations in malaria incidence are challenging to separate from the signal of endemic transmission intensity via a generative (forward-modelling) framework. The stepwise inference framework described below is designed to limit the impact of such factors on our model-based estimates while negotiating a pragmatic trade-off between the theoretical advantages of building an explicit representation of each conceivable error term and the computational advantages of model parsimony.

Our inference of the fine-scale case incidence rate and seasonality profile of clinical malaria in Haiti is thus modularised in three distinct stages. In the first stage, a pair of statistical models is used to impute missing data and de-trend the reported monthly case counts from 2014 to 2019 at each facility towards an RDT-standardised 2019 level. These are then folded (over years) and median filtered by month to produce a year of ‘representative’ case data designed to reflect endemic transmission intensity. Model-based uncertainties from this procedure are propagated through to the subsequent stages of our analysis by sampling multiple versions of this representative dataset from its modular Bayesian posterior. In the second stage, a fine-scale, spatial-only geostatistical model with a flexible catchment sub-model is fit to (each modular posterior draw of) the representative dataset to estimate the annual average incidence rate at pixel-level and the attractiveness of each health facility. Fine-scale mapping in this step is assisted by our suite of high-resolution covariates and an over-dispersed sampling distribution is adopted to represent additional variation in the reported counts beyond that accommodated naturally by our core model. Again the statistical uncertainties are propagated forward via modular posterior sampling. In the third and final stage, a fine-scale spatio-temporal geostatistical model is fit (conditional on the previously fitted catchment sub-model) to explain the residual seasonal variation about (each modular posterior sample of) this baseline risk surface in the (corresponding sample of) representative monthly case data.

Although the primary motivation for introducing these ‘cuts’ (Plummer, 2015) in our inferential approach is, as noted above, to focus on endemic transmission, promote model parsimony, and improve computational feasibility in model fitting, it is worth noting that such contained modularisation can also guard against the magnification of systematic errors between components due to a misspecification in one of them (Jacob, 2017). The following sections give further details on each of the three stages.

Constructing a year of representative case data

The first step in this stage of analysis was to impute values for the fraction of tests performed by microscopy (as opposed to RDT) in those health facilities missing these data in certain months. To this end, we introduce a non-spatially structured model in which the expected proportion of tests conducted by microscopy in each month for each facility is predicted as the inverse logit transformation of a three part temporal spline (covering January 2014 to December 2019) plus intercept. The spline coefficients are assigned a Bayesian shrinkage structure in which the mean of each and the between-facility variation are learned jointly across facilities. The precise structure of this model is described in standard hierarchical Bayesian notation in the box for Model 1 below.

$$
\begin{array}{l}
N_{\text{mic},jt} \sim \text{Binom}\left(p_{\text{mic},jt},\, N_{\text{tested},jt}\right) \quad \text{(where mic and RDT case totals both non-missing)} \\
\text{logit}\, p_{\text{mic},jt} = a_{j} \times \beta_{\text{spline}(1)}(t) + b_{j} \times \beta_{\text{spline}(2)}(t) + c_{j} \times \beta_{\text{spline}(3)}(t) + d_{j} \\
a_{j} \sim \text{Normal}(a_{\text{mean}}, \sigma_{\text{shrinkage}}^{2}), \quad b_{j} \sim \text{Normal}(b_{\text{mean}}, \sigma_{\text{shrinkage}}^{2}) \\
c_{j} \sim \text{Normal}(c_{\text{mean}}, \sigma_{\text{shrinkage}}^{2}), \quad \log \sigma_{\text{shrinkage}} \sim \text{Normal}(-1, 1^{2}) \\
a_{\text{mean}},\, b_{\text{mean}},\, c_{\text{mean}},\, d_{j} \sim \text{ImproperUniform}
\end{array}
$$

Model 1

Facility-level model with non-spatially-structured Bayesian shrinkage for the estimation of the month and facility-specific propensity to conduct malaria diagnosis by microscopy rather than RDT.

Our de-trending model then takes the form of a point-indexed geostatistical regression on the case counts, $\text{cases}_{jt}$, at each facility in each month (where available), computed with respect to a latent incidence surface using the associated populations under a naïve catchment sub-model as a base rate factor. For the latter, we propose that the population in a given pixel will split its attendance between neighbouring health facilities in inverse proportion to the square of travel-time distance from pixel to facility. In mathematical notation, our naïve catchment matrix, $C_{i \to j}^{*}$, which gives the proportion of residents in pixel $i$ who attend health facility $j$, is constructed as $C_{i \to j}^{*} \propto \frac{1}{T_{i \to j}^{2}}$ using travel-time distances, $T_{i \to j}$, computed from the Weiss et al. friction surface (Weiss et al., 2018). We distinguish this formulation (the naïve sub-model) from the more flexible version introduced in the subsequent analysis stages in which an ‘attractiveness’ weight, $W_{j}$, is learnt for each facility during fitting. This weight represents the impact of unknown factors that might influence attendance preference, such as differences in the availability of staff, the cost of treatment, and perceptions about the quality of care offered. Multiplication of the naïve catchment sub-model against the high-resolution population map for Haiti (while assuming, for now, universal access to treatment) gives crude population denominators for each facility, adequate for this temporally focussed inference step.
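
As a minimal illustration of the naïve catchment construction, the Python sketch below normalises inverse-square travel-time weights across facilities for each pixel and then multiplies by pixel populations to obtain crude facility denominators. It assumes that every pixel's attendance proportions sum to one over all facilities considered and that all travel times are strictly positive; the function names and the toy numbers are hypothetical.

```python
def naive_catchment(travel_time):
    """Build C*[i][j] proportional to 1 / T[i][j]^2, normalised over facilities.

    travel_time[i][j] is the travel time from pixel i to facility j and is
    assumed to be strictly positive.
    """
    catchment = []
    for row in travel_time:
        weights = [1.0 / (t * t) for t in row]
        total = sum(weights)
        catchment.append([w / total for w in weights])
    return catchment

def catchment_population(catchment, population):
    """Crude denominator for each facility: sum over pixels of C*[i][j] * pop[i]."""
    n_facilities = len(catchment[0])
    return [
        sum(catchment[i][j] * population[i] for i in range(len(catchment)))
        for j in range(n_facilities)
    ]

# Two pixels, two facilities (travel times in minutes; values are made up).
T = [[10.0, 20.0],
     [30.0, 30.0]]
pop = [100.0, 50.0]
C_naive = naive_catchment(T)
crude_denominators = catchment_population(C_naive, pop)
```

Here the first pixel sends four times as many residents to the facility that is half as far away in travel time, while the second pixel splits evenly; the facility denominators sum to the total population.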

The statistical structure of our de-trending model comprised a five-part temporal spline across the 72 months of data with spatially varying coefficients and a spatially varying intercept, as well as a (cyclical) annual seasonality term, as described using hierarchical Bayesian notation in the box for Model 2 below. The mean (log) incidence surface is composed of a spatial-only Gaussian process term and a separable (Kronecker product) spatio-temporal Gaussian process with circularity (over the calendar months) in the temporal dimension (an exponential kernel on the circle). Model fitting was performed in the Template Model Builder (TMB) and Integrated Nested Laplace Approximation (INLA) packages (Kristensen, 2015; Lindgren and Rue, 2015) for R using a Laplace approximation over the random field components and over-dispersion terms, and with posterior approximation over the remaining hyper-parameters represented by a Multivariate Normal matched to the curvature at the empirical Bayes estimate. The suitability of this higher level approximation was confirmed by comparing the (Laplace approximation based) marginal likelihoods at a series of draws from the Multivariate Normal against their densities under this proposal distribution. As this is an expensive operation, we did not calculate and use these factors for importance weighting of our full set of approximate posterior samples, relying instead on the nested Normal formulation.
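
The comparison of marginal likelihoods against proposal densities described above amounts to inspecting importance weights. A minimal Python sketch of one such check follows; the use of the effective sample size as the summary diagnostic, the function name, and the toy log-density values are assumptions for illustration and are not taken from the analysis code.

```python
import math

def importance_diagnostics(log_target, log_proposal):
    """Normalised importance weights and effective sample size (ESS).

    log_target[k]:   log (unnormalised) posterior density at draw k
    log_proposal[k]: log density of the Multivariate Normal proposal at draw k
    """
    log_w = [lt - lq for lt, lq in zip(log_target, log_proposal)]
    shift = max(log_w)  # subtract the maximum for numerical stability
    w = [math.exp(x - shift) for x in log_w]
    total = sum(w)
    normalised = [x / total for x in w]
    ess = 1.0 / sum(x * x for x in normalised)
    return normalised, ess

# Proposal matches the target up to a constant: weights are equal.
weights_good, ess_good = importance_diagnostics([-3.0, -1.0, -2.0], [-2.5, -0.5, -1.5])
# Proposal badly mismatched: one draw dominates.
weights_bad, ess_bad = importance_diagnostics([0.0, -50.0, -50.0], [0.0, 0.0, 0.0])
```

An ESS close to the number of draws suggests the Normal approximation is adequate, whereas an ESS near one indicates that a few draws dominate and the approximation is poor.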

Finally, for each posterior draw, we impute the missing case reports with predicted case numbers and divide the completed case–month matrix by the exponentiated $f_{(k)}(\text{loc}_{j}) \times \beta_{\text{spline},t}^{(k)}$ and $p_{\text{mic},jt} \times \text{miceffect}_{t}$ terms to de-trend these numbers towards an RDT-standardised 2019 benchmark. To reduce the impact of any unmodelled factors contributing short-term temporal fluctuations to the case reports, we then wrap our 6 years of imputed and de-trended data around the calendar year to construct (from each posterior draw) a single year of representative data from the median in each month.
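
The folding and median-filtering step can be sketched in a few lines of Python for a single facility and a single posterior draw. The function name and the synthetic series are hypothetical; the sketch assumes the series starts in January and spans a whole number of years.

```python
from statistics import median

def representative_year(monthly_series):
    """Fold a multi-year monthly series over the calendar year.

    Returns 12 values, each the median across years of the (imputed,
    de-trended) counts for that calendar month.
    """
    if len(monthly_series) % 12 != 0:
        raise ValueError("series must cover a whole number of years")
    return [median(monthly_series[month::12]) for month in range(12)]

# Synthetic 72-month series with one outbreak-like spike in June of year one.
series = [10 * year + month for year in range(6) for month in range(12)]
series[5] = 1000
representative = representative_year(series)
```

Because the median is taken across years within each calendar month, a single anomalous month shifts the representative value only modestly, which is the robustness property motivating this step.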

$$
\begin{array}{l}
\text{cases}_{jt} \sim \text{NegBin}\left(\text{mean} = I_{jt} \times \text{approx catchment pop}_{jt},\ \text{overdispersion factor} = \sigma\right) \quad \text{(where case data non-missing)} \\
\log I_{jt} = c + f_{\text{intercept}}(\text{loc}_{j}) + f_{(1)}(\text{loc}_{j}) \times \beta_{\text{spline},t}^{(1)} + f_{(2)}(\text{loc}_{j}) \times \beta_{\text{spline},t}^{(2)} + f_{(3)}(\text{loc}_{j}) \times \beta_{\text{spline},t}^{(3)} \\
\quad + f_{(4)}(\text{loc}_{j}) \times \beta_{\text{spline},t}^{(4)} + f_{(5)}(\text{loc}_{j}) \times \beta_{\text{spline},t}^{(5)} + f_{\text{seasonal}}(\text{loc}_{j}, \text{mod}(t, 12)) \\
\quad + p_{\text{mic},jt} \times \text{miceffect}_{t} \\
f_{\text{intercept}}(\cdot) \sim \text{GaussianProcess}(\text{range}_{\text{intercept}}, \text{scale}_{\text{intercept}}) \\
f_{(1)}(\cdot), f_{(2)}(\cdot), f_{(3)}(\cdot), f_{(4)}(\cdot), f_{(5)}(\cdot) \sim \text{GaussianProcess}(\text{range}_{\text{spline}}, \text{scale}_{\text{spline}}) \\
f_{\text{seasonal}}(\cdot) \sim \text{GaussianProcess}(\text{range}_{\text{seasonal time}}, \text{scale}_{\text{seasonal time}}) \otimes \text{GaussianProcess}(\text{range}_{\text{seasonal}}, \text{scale}_{\text{seasonal}}) \\
\text{miceffect}_{t} = \mu + f_{\text{mic}}(t), \quad f_{\text{mic}}(\cdot) \sim \text{AR}_{1}(\text{scale}_{\text{mic}}, \text{ARpar}_{\text{mic}}) \\
\log \text{range}_{\text{intercept}} \sim \text{Normal}(-1, 1^{2}), \quad \log \text{range}_{\text{spline}},\ \log \text{range}_{\text{seasonal}},\ \log \text{range}_{\text{seasonal time}} \sim \text{Normal}(1, 1^{2}) \\
\log \text{scale}_{\text{intercept}} \sim \text{Normal}(2, 1^{2}), \quad \log \text{scale}_{\text{spline}},\ \log \text{scale}_{\text{seasonal}},\ \log \text{scale}_{\text{seasonal time}} \sim \text{Normal}(-1, 1^{2}) \\
\log \text{scale}_{\text{mic}} \sim \text{Normal}(-1, 1^{2}), \quad \text{logit}\, \text{ARpar}_{\text{mic}} \sim \text{Normal}(2, 1^{2}) \\
\log \sigma \sim \text{Normal}(-1, 1^{2}), \quad c \sim \text{ImproperUniform}
\end{array}
$$

Model 2

Point-level geostatistical model for approximate case incidence rate at each health facility location used for de-trending (and imputing) the raw monthly case counts towards the production of an RDT-standardised 2019 benchmark.

Fine-scale prediction of annual incidence surface

The second stage of our inference procedure is to fit a pixel-level geostatistical model with full catchment sub-model to the annual totals at facility level in (each modular posterior draw of) the 12 months of representative counts. On removing the temporal dimension from consideration, it becomes computationally feasible to allow flexible health facility attractiveness weights in the catchment sub-model and to perform the aggregation of the latent cases from pixel level to facility via this sub-model self-consistently during fitting. In this sense the adopted model structure is at least one step more ambitious than other comparable, multi-scale geospatial models for fine-scale disease mapping from areal-averaged data (Wilson and Wakefield, 2020; Taylor et al., 2018). Another extension is that we have adopted a spatially varying coefficient (slope) model (Gelfand et al., 2003) to describe the relationship between our static, environmental covariates and the log incidence rate. The motivation for this is to limit our exposure to bias in this implicit ecological regression (Wakefield and Smith, 2016) due to unmodelled factors, such as the potential role of human movement between regions and spatial variations in the dominant anopheline species. Both of these could lead to differences in the relationship between environmental variables and the case incidence rate amongst the human populations resident in different areas of the country. A decision was made not to attempt to learn a shrinkage hyper-parameter acting on the static covariate slopes in order to avoid exposure to over-shrinkage given that the aggregate dataset may be thought of as inherently under-powered for learning slopes relative to a comparable point-level dataset of similar design and size. 
Previous applications of fine-scale modelling to aggregate malaria datasets (Sturrock et al., 2014; Alegana et al., 2016) used aggressive covariate selection approaches, which retained far fewer environmental variables than are typically found to be important for prediction at this scale based on point prevalence surveys (Bhatt et al., 2015; Weiss et al., 2015).

A lack of data on treatment seeking behaviours for malaria patients in Haiti has previously been identified as a core knowledge gap (Keating et al., 2008). As our primary interest here concerns the recovery of accurate spatial patterns of malaria incidence, we are less worried about the overall rate of treatment seeking (which studies in African settings suggest is rarely below 30% for acute febrile illness [Alegana et al., 2017b]) than about the possibility of spatial variation. Studies of treatment seeking behaviour in both low- and high-resource settings indicate a tendency for treatment seeking rates to decline with increasing travel-time distance from the nearest point of care (Alegana et al., 2017b; Ensor and Cooper, 2004). However, very little decline is seen until beyond 100 min travel time in well-studied settings (such as Namibia [Alegana et al., 2012]), and at the 1 × 1 km resolution of our map making 96.4% of pixels with non-zero population density lie within this distance from their nearest health facility. For this reason, we do not anticipate a strong spatial variation in treatment seeking rates across the country due to this effect, but we have nevertheless constructed an access distance-dependent treatment seeking probability map (following the Namibian example, with maximum treatment seeking probability of 65%) as a first-order approximation.

The complete Bayesian model used in this stage is described in hierarchical notation in the box for Model 3 below. Once again, a combination of the TMB and INLA packages is used to fit this model with a Laplace approximation over the random field and the (logarithm of) catchment attractiveness weights, and with a Multivariate Normal approximation in the remaining hyper-parameters centred on the empirical Bayes estimator.

$$
\begin{array}{l}
\text{annual representative cases}_{j} \sim \text{NegBin}\left(\text{mean} = \text{expected cases}_{j},\ \text{overdispersion factor} = \sigma\right) \\
\text{expected cases}_{j} = \sum_{i} C_{i \to j} \times \text{population}_{i} \times I_{i} \times \text{treatment seeking prob}_{i} \\
C_{i \to j} \propto \frac{W_{j}}{T_{i \to j}^{2}}, \quad \log W_{j} \sim \text{Normal}(0, 0.5^{2}) \\
\log I_{i} = c + X_{\text{static}}^{'} \left(\beta_{\text{static}} + f_{\text{static}}(\text{loc}_{i})\right) + f_{\text{intercept}}(\text{loc}_{i}) \\
f_{\text{intercept}}(\cdot) \sim \text{GaussianProcess}(\text{range}_{\text{int}}, \text{scale}_{\text{int}}) \\
\beta_{\text{static},k} \sim \text{Normal}(0, 1^{2}), \quad f_{\text{static},k}(\cdot) \sim \text{GaussianProcess}(\text{range}_{\text{covs}}, \text{scale}_{\text{covs}}) \\
\log \text{scale}_{\text{static}},\ \log \text{scale}_{\text{covs}} \sim \text{Normal}(-1, 1^{2}), \quad \log \text{range}_{\text{static}},\ \log \text{range}_{\text{covs}} \sim \text{Normal}(1, 1^{2}) \\
\log \sigma \sim \text{Normal}(-1, 1^{2}), \quad c \sim \text{ImproperUniform}
\end{array}
$$

Model 3

Catchment-based geostatistical model for annual case count at each health facility location used to produce our baseline clinical incidence rate surface.
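
The aggregation from pixel level to facility level in Model 3 can be sketched in plain Python as follows. This is an illustration of the expected-cases sum only, not of the fitting procedure: the attractiveness weights, incidence rates, and treatment seeking probabilities are supplied as fixed toy values rather than inferred, and the function names are hypothetical. As above, catchment proportions are assumed to sum to one over facilities for each pixel.

```python
def weighted_catchment(travel_time, attractiveness):
    """C[i][j] proportional to W[j] / T[i][j]^2, normalised over facilities."""
    catchment = []
    for row in travel_time:
        weights = [attractiveness[j] / (t * t) for j, t in enumerate(row)]
        total = sum(weights)
        catchment.append([w / total for w in weights])
    return catchment

def expected_cases(catchment, population, incidence, treatment_seeking):
    """Expected annual cases at facility j:
    sum over pixels i of C[i][j] * population[i] * I[i] * treatment_seeking[i]."""
    n_facilities = len(catchment[0])
    return [
        sum(
            catchment[i][j] * population[i] * incidence[i] * treatment_seeking[i]
            for i in range(len(catchment))
        )
        for j in range(n_facilities)
    ]

# Toy inputs: two pixels, two facilities (all values are made up).
T = [[10.0, 20.0],
     [30.0, 30.0]]
W = [1.0, 4.0]            # facility attractiveness weights
pop = [1000.0, 500.0]     # residents per pixel
inc = [0.05, 0.01]        # cases per person-year
seek = [0.65, 0.50]       # probability of seeking treatment
C = weighted_catchment(T, W)
mu = expected_cases(C, pop, inc, seek)
```

With the second facility four times as attractive, it draws an equal share from the first pixel despite being twice as far away in travel time, which shows how the weights can offset the inverse-square decay.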

Spatio-temporal modelling of seasonal fluctuations in case incidence

In the third and final stage of our inference procedure, we hold fixed the health facility attractiveness weights, baseline incidence surface, and annual (i.e., spatial) over-dispersion factors learnt in the previous step. This allows us (at the limit of our computational resources; 128 GB RAM) to model the seasonal variations in incidence at fine scale in a spatio-temporal geostatistical regression against the monthly case counts in (each modular posterior draw of) the representative dataset. The model structure for the seasonality term is the same as that used in the first stage: a separable (Kronecker product) spatio-temporal Gaussian process with circularity (over the calendar months) in the temporal dimension (an exponential kernel on the circle). Due to computational limitations, a spatially varying slope model was infeasible for the dynamic covariates, hence an ordinary linear regression structure was used instead; again with a fixed, limited amount of prior shrinkage in deference to the limited power provided by the aggregate data. Posterior sampling was conducted exactly as described for stages one and two above with implementation in TMB and INLA. The full Bayesian hierarchy is described in the box for Model 4 below.

$$
\begin{array}{l}
\text{monthly reduced cases}_{jt} \sim \text{NegBin}\left(\text{mean} = \text{expected cases}_{jt},\ \text{overdispersion factor} = \sigma\right) \\
\text{expected cases}_{jt} = \sum_{i} C_{i \to j} \times \text{population}_{i} \times I_{it} \times \text{treatment seeking prob}_{i} \\
C_{i \to j} \propto \frac{W_{j}}{T_{i \to j}^{2}}, \quad \log W_{j},\ \log I_{\text{baseline}} = \text{fixed from earlier fit} \\
\log I_{it} = c + \log I_{\text{baseline}} + X_{\text{temporal}}^{'} \beta_{\text{temporal}} + f_{\text{seasonal}}(\text{loc}_{i}, t) \\
f_{\text{seasonal}}(\cdot) \sim \text{GaussianProcess}(\text{range}_{\text{seasonal time}}, \text{scale}_{\text{seasonal time}}) \\
\quad \otimes\ \text{GaussianProcess}(\text{range}_{\text{seasonal}}, \text{scale}_{\text{seasonal}}) \\
\beta_{\text{temporal}} \sim \text{Normal}(0, 1^{2}), \quad \log \sigma \sim \text{Normal}(-1, 1^{2}) \\
\log \text{scale}_{\text{seasonal}},\ \log \text{scale}_{\text{seasonal time}} \sim \text{Normal}(-1, 1^{2}), \quad \log \text{range}_{\text{seasonal}},\ \log \text{range}_{\text{seasonal time}} \sim \text{Normal}(1, 1^{2}) \\
c \sim \text{ImproperUniform}
\end{array}
$$

Model 4

Catchment-based geostatistical model for representative monthly case count at each health facility location used to produce our seasonality profile.
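
To make the separable seasonal covariance concrete, the Python sketch below builds an exponential kernel over the 12 calendar months using the shorter way round the circle as the distance, and combines it with a spatial kernel by a Kronecker product. The choice of arc distance on the circle, the exponential form of the spatial kernel, the parameter values, and the function names are assumptions for illustration; the spatial component in the fitted model may be represented differently.

```python
import math

def circular_exponential_kernel(n_months=12, time_range=2.0, scale=1.0):
    """Exponential covariance over calendar months with wrap-around distance."""
    kernel = []
    for a in range(n_months):
        row = []
        for b in range(n_months):
            gap = abs(a - b)
            gap = min(gap, n_months - gap)  # December and January are adjacent
            row.append(scale ** 2 * math.exp(-gap / time_range))
        kernel.append(row)
    return kernel

def exponential_kernel(coords, space_range, scale=1.0):
    """Exponential covariance between locations (assumed spatial form)."""
    return [
        [scale ** 2 * math.exp(-math.dist(p, q) / space_range) for q in coords]
        for p in coords
    ]

def kronecker(A, B):
    """Kronecker product of two matrices stored as lists of rows."""
    return [[a * b for a in row_a for b in row_b] for row_a in A for row_b in B]

K_time = circular_exponential_kernel(12, time_range=2.0)
K_space = exponential_kernel([(0.0, 0.0), (3.0, 4.0)], space_range=5.0)
K_full = kronecker(K_time, K_space)  # covariance over (month, location) pairs
```

In this ordering, row `12`-month index `m` and location index `s` map to position `m * n_locations + s` of the joint covariance, and the wrap-around distance ensures that the correlation between December and January equals that between any other pair of adjacent months.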

The R and TMB code used for running this analysis is provided for reference as Supplementary Information.

Data availability

The routine case data at health facility level are the property of the Haitian Programme National de Contrôle de la Malaria and are not to be made publicly available at this level of resolution to respect privacy. However, the administrative level summaries are published through the World Malaria Report each year. All covariates used are publicly available through their respective online portals. The TAS serology data used in validation are available upon request as per https://www.nature.com/articles/s41598-020-65419-w#data-availability.

References

1. Afrane YA
2. Zhou G
3. Githeko AK
4. Yan G
(2013) Utility of health facility-based malaria data for malaria surveillance
PLOS ONE 8:e54305.

https://doi.org/10.1371/journal.pone.0054305
- PubMed
- Google Scholar
1. Alegana VA
2. Wright JA
3. Pentrina U
4. Noor AM
5. Snow RW
6. Atkinson PM
(2012) Spatial modelling of healthcare utilisation for treatment of fever in Namibia
International Journal of Health Geographics 11:6.

https://doi.org/10.1186/1476-072X-11-6
- PubMed
- Google Scholar
(2016) Advances in mapping malaria for elimination: fine resolution modelling of Plasmodium falciparum incidence
Scientific Reports 6:13.

https://doi.org/10.1038/srep29628
- PubMed
- Google Scholar
1. Alegana VA
2. Wright J
3. Bosco C
4. Okiro EA
5. Atkinson PM
6. Snow RW
7. Tatem AJ
8. Noor AM
(2017a) Malaria prevalence metrics in low- and middle-income countries: an assessment of precision in nationally-representative surveys
Malaria Journal 16:475.

https://doi.org/10.1186/s12936-017-2127-y
(2017b) Treatment-seeking behaviour in low- and middle-income countries estimated using a Bayesian model
BMC Medical Research Methodology 17:67.

https://doi.org/10.1186/s12874-017-0346-0
(2020) Routine data for malaria morbidity estimation in Africa: challenges and prospects
BMC Medicine 18:1–13.

https://doi.org/10.1186/s12916-020-01593-y
Preprint
(2020) Nonparametric causal feature selection for spatiotemporal risk mapping of malaria incidence in Madagascar
arXiv.

https://arxiv.org/abs/2001.07745
1. Ashton RA
2. Kefyalew T
3. Rand A
4. Sime H
5. Assefa A
6. Mekasha A
7. Edosa W
8. Tesfaye G
9. Cano J
10. Teka H
11. Reithinger R
12. Pullan RL
13. Drakeley CJ
14. Brooker SJ
(2015) Geostatistical modeling of malaria endemicity using serological indicators of exposure collected through school surveys
The American Journal of Tropical Medicine and Hygiene 93:168–177.

https://doi.org/10.4269/ajtmh.14-0620
(2014) Scale-up of malaria rapid diagnostic tests and artemisinin-based combination therapy: challenges and perspectives in sub-Saharan Africa
PLOS Medicine 11:e1001590.

https://doi.org/10.1371/journal.pmed.1001590
1. Battle KE
2. Bisanzio D
3. Gibson HS
4. Bhatt S
5. Cameron E
6. Weiss DJ
7. Mappin B
8. Dalrymple U
9. Howes RE
10. Hay SI
11. Gething PW
(2016) Treatment-seeking rates in malaria endemic countries
Malaria Journal 15:20.

https://doi.org/10.1186/s12936-015-1048-x
1. Battle KE
2. Lucas TCD
3. Nguyen M
4. Howes RE
5. Nandi AK
6. Twohig KA
7. Pfeffer DA
8. Cameron E
9. Rao PC
10. Casey D
11. Gibson HS
12. Rozier JA
13. Dalrymple U
14. Keddie SH
15. Collins EL
16. Harris JR
17. Guerra CA
18. Thorn MP
19. Bisanzio D
20. Fullman N
21. Huynh CK
22. Kulikoff X
23. Kutz MJ
24. Lopez AD
25. Mokdad AH
26. Naghavi M
27. Nguyen G
28. Shackelford KA
29. Vos T
30. Wang H
31. Lim SS
32. Murray CJL
33. Price RN
34. Baird JK
35. Smith DL
36. Bhatt S
37. Weiss DJ
38. Hay SI
39. Gething PW
(2019) Mapping the global endemicity and clinical burden of Plasmodium vivax, 2000-17: a spatial and temporal modelling study
The Lancet 394:332–343.

https://doi.org/10.1016/S0140-6736(19)31096-7
1. Bhatt S
2. Weiss DJ
3. Cameron E
4. Bisanzio D
5. Mappin B
6. Dalrymple U
7. Battle K
8. Moyes CL
9. Henry A
10. Eckhoff PA
11. Wenger EA
12. Briët O
13. Penny MA
14. Smith TA
15. Bennett A
16. Yukich J
17. Eisele TP
18. Griffin JT
19. Fergus CA
20. Lynch M
21. Lindgren F
22. Cohen JM
23. Murray CLJ
24. Smith DL
25. Hay SI
26. Cibulskis RE
27. Gething PW
(2015) The effect of malaria control on Plasmodium falciparum in Africa between 2000 and 2015
Nature 526:207–211.

https://doi.org/10.1038/nature15535
1. Boncy PJ
2. Adrien P
3. Lemoine JF
4. Existe A
5. Henry PJ
6. Raccurt C
7. Brasseur P
8. Fenelon N
9. Dame JB
10. Okech BA
11. Kaljee L
12. Baxa D
13. Prieur E
14. El Badry MA
15. Tagliamonte MS
16. Mulligan CJ
17. Carter TE
18. Beau de Rochars VM
19. Lutz C
20. Parke DM
21. Zervos MJ
(2015) Malaria elimination in Haiti by the year 2020: an achievable goal?
Malaria Journal 14:237.

https://doi.org/10.1186/s12936-015-0753-9
1. Corran P
2. Coleman P
3. Riley E
4. Drakeley C
(2007) Serology: a robust indicator of malaria transmission intensity?
Trends in Parasitology 23:575–582.

https://doi.org/10.1016/j.pt.2007.08.023
(1998) Model‐based geostatistics
Journal of the Royal Statistical Society 47:299–350.

https://doi.org/10.1111/1467-9876.00113
1. Druetz T
2. Andrinopoulos K
3. Boulos LM
4. Boulos M
5. Noland GS
6. Desir L
7. Lemoine JF
8. Eisele TP
(2018) "Wherever doctors cannot reach, the sunshine can": overcoming potential barriers to malaria elimination interventions in Haiti
Malaria Journal 17:393.

https://doi.org/10.1186/s12936-018-2553-5
(2016) Bayesian spatiotemporal modelling for identifying unusual and unstable trends in mammography utilisation
BMJ Open 6:e010253.

https://doi.org/10.1136/bmjopen-2015-010253
1. Ensor T
2. Cooper S
(2004) Overcoming barriers to health service access: influencing the demand side
Health Policy and Planning 19:69–79.

https://doi.org/10.1093/heapol/czh009
1. Farr TG
2. Rosen PA
3. Caro E
4. Crippen R
5. Duren R
6. Hensley S
7. Kobrick M
8. Paller M
9. Rodriguez E
10. Roth L
11. Seal D
12. Shaffer S
13. Shimada J
14. Umland J
15. Werner M
16. Oskin M
17. Burbank D
18. Alsdorf D
(2007) The shuttle radar topography mission
Reviews of Geophysics 45:RG2004.

https://doi.org/10.1029/2005RG000183
1. Frederick J
2. Saint Jean Y
3. Lemoine JF
4. Dotson EM
5. Mace KE
6. Chang M
7. Slutsker L
8. Le Menach A
9. Beier JC
10. Eisele TP
11. Okech BA
12. Beau de Rochars VM
13. Carter KH
14. Keating J
15. Impoinvil DE
(2016) Malaria vector research and control in Haiti: a systematic review
Malaria Journal 15:376.

https://doi.org/10.1186/s12936-016-1436-x
1. Friedl MA
2. Sulla-Menashe D
3. Tan B
4. Schneider A
5. Ramankutty N
6. Sibley A
7. Huang X
(2010) MODIS collection 5 global land cover: algorithm refinements and characterization of new datasets
Remote Sensing of Environment 114:168–182.

https://doi.org/10.1016/j.rse.2009.08.016
(2003) Spatial modeling with spatially varying coefficient processes
Journal of the American Statistical Association 98:387–396.

https://doi.org/10.1198/016214503000170
1. Giorgi E
2. Osman AA
3. Hassan AH
4. Ali AA
5. Ibrahim F
6. Amran JGH
7. Noor AM
8. Snow RW
(2018) Using non-exceedance probabilities of policy-relevant malaria prevalence thresholds to identify areas of low transmission in Somalia
Malaria Journal 17:88.

https://doi.org/10.1186/s12936-018-2238-0
1. Haklay M
2. Weber P
(2008) OpenStreetMap: user-generated street maps
IEEE Pervasive Computing 7:12–18.

https://doi.org/10.1109/MPRV.2008.80
1. Helb DA
2. Tetteh KK
3. Felgner PL
4. Skinner J
5. Hubbard A
6. Arinaitwe E
7. Mayanja-Kizza H
8. Ssewanyana I
9. Kamya MR
10. Beeson JG
11. Tappero J
12. Smith DL
13. Crompton PD
14. Rosenthal PJ
15. Dorsey G
16. Drakeley CJ
17. Greenhouse B
(2015) Novel serologic biomarkers provide accurate estimates of recent Plasmodium falciparum exposure for individuals and communities
PNAS 112:E4438–E4447.

https://doi.org/10.1073/pnas.1501705112
1. Held L
2. Natário I
3. Fenton SE
4. Rue H
5. Becker N
(2005) Towards joint disease mapping
Statistical Methods in Medical Research 14:61–82.

https://doi.org/10.1191/0962280205sm389oa
(1999)
MODIS vegetation index (MOD13)

Algorithm Theoretical Basis Document 3:213.
Preprint
1. Jacob PE
(2017) Better together? Statistical learning in models made of modules
arXiv.

https://arxiv.org/abs/1708.08719
1. Karagiannis-Voules DA
2. Biedermann P
3. Ekpo UF
4. Garba A
5. Langer E
6. Mathieu E
7. Midzi N
8. Mwinzi P
9. Polderman AM
10. Raso G
11. Sacko M
12. Talla I
13. Tchuenté LA
14. Touré S
15. Winkler MS
16. Utzinger J
17. Vounatsou P
(2015) Spatial and temporal distribution of soil-transmitted helminth infection in sub-Saharan Africa: a systematic review and geostatistical meta-analysis
The Lancet Infectious Diseases 15:74–84.

https://doi.org/10.1016/S1473-3099(14)71004-7
1. Karyana M
2. Devine A
3. Kenangalem E
4. Burdarm L
5. Poespoprodjo JR
6. Vemuri R
7. Anstey NM
8. Tjitra E
9. Price RN
10. Yeung S
(2016) Treatment-seeking behaviour and associated costs for malaria in Papua, Indonesia
Malaria Journal 15:536.

https://doi.org/10.1186/s12936-016-1588-8
Book
1. Kauth R
2. Thomas G
(1976)
The Tasseled Cap—A Graphic Description of the Spectral-Temporal Development of Agricultural Crops as Seen by Landsat

Purdue University.
(2008) A description of malaria-related knowledge, perceptions, and practices in the Artibonite Valley of Haiti: implications for malaria control
The American Journal of Tropical Medicine and Hygiene 78:262–269.

https://doi.org/10.4269/ajtmh.2008.78.262
1. Knipes AK
2. Lemoine JF
3. Monestime F
4. Fayette CR
5. Direny AN
6. Desir L
7. Beau de Rochars VE
8. Streit TG
9. Renneker K
10. Chu BK
11. Chang MA
12. Mace KE
13. Won KY
14. Lammie PJ
(2017) Partnering for impact: integrated transmission assessment surveys for lymphatic filariasis, soil transmitted helminths and malaria in Haiti
PLOS Neglected Tropical Diseases 11:e0005387.

https://doi.org/10.1371/journal.pntd.0005387
1. Kristensen K
(2015)
Template model builder TMB

Journal of Statistical Software 70:1–21.
1. Landman KZ
2. Jean SE
3. Existe A
4. Akom EE
5. Chang MA
6. Lemoine JF
7. Mace KE
(2015) Evaluation of case management of uncomplicated malaria in Haiti: a national health facility survey, 2012
Malaria Journal 14:394.

https://doi.org/10.1186/s12936-015-0901-2
1. Lindgren F
2. Rue H
(2015) Bayesian spatial modelling with R-INLA
Journal of Statistical Software 63:i19.

https://doi.org/10.18637/jss.v063.i19
1. Lucchi NW
2. Karell MA
3. Journel I
4. Rogier E
5. Goldman I
6. Ljolje D
7. Huber C
8. Mace KE
9. Jean SE
10. Akom EE
11. Oscar R
12. Buteau J
13. Boncy J
14. Barnwell JW
15. Udhayakumar V
(2014) PET-PCR method for the molecular detection of malaria parasites in a national malaria surveillance study in Haiti, 2011
Malaria Journal 13:462.

https://doi.org/10.1186/1475-2875-13-462
(2017) A geostatistical model for combined analysis of point-level and area-level data using INLA and SPDE
Spatial Statistics 21:27–41.

https://doi.org/10.1016/j.spasta.2017.04.006
(2020) Achieving explanatory depth and spatial breadth in infectious disease modelling: integrating active and passive case surveillance
Statistical Methods in Medical Research 29:1273–1287.

https://doi.org/10.1177/0962280219856380
1. Oduro AR
2. Maya ET
3. Akazili J
4. Baiden F
5. Koram K
6. Bojang K
(2016) Monitoring malaria using health facility based surveys: challenges and limitations
BMC Public Health 16:354.

https://doi.org/10.1186/s12889-016-2858-7
1. Ohiri K
2. Ukoha NK
3. Nwangwu CW
4. Chima CC
5. Ogundeji YK
6. Rone A
7. Reich MR
(2016) An assessment of data availability, quality, and use in malaria program decision making in Nigeria
Health Systems & Reform 2:319–330.

https://doi.org/10.1080/23288604.2016.1234864
1. Osgood-Zimmerman A
2. Millear AI
3. Stubbs RW
4. Shields C
5. Pickering BV
6. Earl L
7. Graetz N
8. Kinyoki DK
9. Ray SE
10. Bhatt S
11. Browne AJ
12. Burstein R
13. Cameron E
14. Casey DC
15. Deshpande A
16. Fullman N
17. Gething PW
18. Gibson HS
19. Henry NJ
20. Herrero M
21. Krause LK
22. Letourneau ID
23. Levine AJ
24. Liu PY
25. Longbottom J
26. Mayala BK
27. Mosser JF
28. Noor AM
29. Pigott DM
30. Piwoz EG
31. Rao P
32. Rawat R
33. Reiner RC
34. Smith DL
35. Weiss DJ
36. Wiens KE
37. Mokdad AH
38. Lim SS
39. Murray CJL
40. Kassebaum NJ
41. Hay SI
(2018) Mapping child growth failure in Africa between 2000 and 2015
Nature 555:41–47.

https://doi.org/10.1038/nature25760
1. Oviedo A
2. Knipes A
3. Worrell C
4. Fox LM
5. Desir L
6. Fayette C
7. Javel A
8. Monestime F
9. Mace K
10. Chang MA
11. Udhayakumar V
12. Lemoine JF
13. Won K
14. Lammie PJ
15. Rogier E
(2020) Combination of serological, antigen detection, and DNA data for Plasmodium falciparum provides robust geospatial estimates for malaria transmission in Haiti
Scientific Reports 10:8443.

https://doi.org/10.1038/s41598-020-65419-w
1. Pfeffer DA
2. Lucas TCD
3. May D
4. Harris J
5. Rozier J
6. Twohig KA
7. Dalrymple U
8. Guerra CA
9. Moyes CL
10. Thorn M
11. Nguyen M
12. Bhatt S
13. Cameron E
14. Weiss DJ
15. Howes RE
16. Battle KE
17. Gibson HS
18. Gething PW
(2018) malariaAtlas: an R interface to global malariometric data hosted by the Malaria Atlas Project
Malaria Journal 17:352.

https://doi.org/10.1186/s12936-018-2500-5
1. Plummer M
(2015) Cuts in Bayesian graphical models
Statistics and Computing 25:37–43.

https://doi.org/10.1007/s11222-014-9503-z
1. Richardson S
2. Best N
(2003) Bayesian hierarchical models in ecological studies of health-environment effects
Environmetrics 14:129–147.

https://doi.org/10.1002/env.571
1. Rowe AK
2. Kachur SP
3. Yoon SS
4. Lynch M
5. Slutsker L
6. Steketee RW
(2009) Caution is required when using health facility-based data to evaluate the health impact of malaria control efforts in Africa
Malaria Journal 8:209.

https://doi.org/10.1186/1475-2875-8-209
(2016) Mapping internal connectivity through human migration in malaria endemic countries
Scientific Data 3:160066.

https://doi.org/10.1038/sdata.2016.66
(2014) Fine-scale malaria risk mapping from routine aggregated case data
Malaria Journal 13:421.

https://doi.org/10.1186/1475-2875-13-421
(2018) Continuous inference for aggregated point process data
Journal of the Royal Statistical Society: Series A 181:1125–1150.

https://doi.org/10.1111/rssa.12347
Conference
1. Trabucco A
2. Zomer RJ
(2009)
Global aridity index (global-aridity) and global potential evapo-transpiration (global-PET) geospatial database

CGIAR Consortium for Spatial Information.
Book
1. Wakefield JC
2. Smith TR
(2016)
Ecological modeling: General issues

In: Lawson A. B, Banerjee S, Haining R. P, editors. Handbook of Spatial Epidemiology. Taylor & Francis. pp. 1–702.
1. Wan Z
2. Zhang Y
3. Zhang Q
4. Li Z
(2002) Validation of the land-surface temperature products retrieved from Terra Moderate Resolution Imaging Spectroradiometer data
Remote Sensing of Environment 83:163–180.

https://doi.org/10.1016/S0034-4257(02)00093-7
1. Weiss DJ
2. Mappin B
3. Dalrymple U
4. Bhatt S
5. Cameron E
6. Hay SI
7. Gething PW
(2015) Re-examining environmental correlates of Plasmodium falciparum malaria endemicity: a data-intensive variable selection approach
Malaria Journal 14:68.

https://doi.org/10.1186/s12936-015-0574-x
1. Weiss DJ
2. Nelson A
3. Gibson HS
4. Temperley W
5. Peedell S
6. Lieber A
7. Hancher M
8. Poyart E
9. Belchior S
10. Fullman N
11. Mappin B
12. Dalrymple U
13. Rozier J
14. Lucas TCD
15. Howes RE
16. Tusting LS
17. Kang SY
18. Cameron E
19. Bisanzio D
20. Battle KE
21. Bhatt S
22. Gething PW
(2018) A global map of travel time to cities to assess inequalities in accessibility in 2015
Nature 553:333–336.

https://doi.org/10.1038/nature25181
1. Weiss DJ
2. Lucas TCD
3. Nguyen M
4. Nandi AK
5. Bisanzio D
6. Battle KE
7. Cameron E
8. Twohig KA
9. Pfeffer DA
10. Rozier JA
11. Gibson HS
12. Rao PC
13. Casey D
14. Bertozzi-Villa A
15. Collins EL
16. Dalrymple U
17. Gray N
18. Harris JR
19. Howes RE
20. Kang SY
21. Keddie SH
22. May D
23. Rumisha S
24. Thorn MP
25. Barber R
26. Fullman N
27. Huynh CK
28. Kulikoff X
29. Kutz MJ
30. Lopez AD
31. Mokdad AH
32. Naghavi M
33. Nguyen G
34. Shackelford KA
35. Vos T
36. Wang H
37. Smith DL
38. Lim SS
39. Murray CJL
40. Bhatt S
41. Hay SI
42. Gething PW
(2019) Mapping the global prevalence, incidence, and mortality of Plasmodium falciparum, 2000-17: a spatial and temporal modelling study
The Lancet 394:322–331.

https://doi.org/10.1016/S0140-6736(19)31097-9
(2018) Elimination or more accurate estimation? Investigation of trends in malaria diagnoses in the Ouest Department of Haiti from 2008 to 2017
PLOS ONE 13:e0198070.

https://doi.org/10.1371/journal.pone.0198070
1. Wilson K
2. Wakefield J
(2020) Pointless spatial modeling
Biostatistics 21:e17–e32.

https://doi.org/10.1093/biostatistics/kxy041
Report
1. World Health Organization
(2019) World Malaria Report 2019
WHO.

https://www.who.int/publications/i/item/9789241565721
1. Zouré HG
2. Noma M
3. Tekle AH
4. Amazigo UV
5. Diggle PJ
6. Giorgi E
7. Remme JH
(2014) The geographic distribution of onchocerciasis in the 20 participating countries of the African Programme for Onchocerciasis Control: (2) pre-control endemicity levels and estimated number infected
Parasites & Vectors 7:326.

https://doi.org/10.1186/1756-3305-7-326
(2016) Dynamic denominators: the impact of seasonally varying population numbers on disease incidence estimates
Population Health Metrics 14:35.

https://doi.org/10.1186/s12963-016-0106-0

Article and author information

Author details

Ewan Cameron
1. Curtin University, Perth, Australia
2. Telethon Kids Institute, Perth Children’s Hospital, Perth, Australia
Contribution
Data curation, Formal analysis, Supervision, Visualization, Methodology, Writing - original draft, Writing - review and editing

For correspondence
dr.ewan.cameron@gmail.com

Competing interests
No competing interests declared

ORCID iD: 0000-0002-8842-3811
Alyssa J Young
1. Clinton Health Access Initiative, Boston, United States
2. Tulane University School of Public Health and Tropical Medicine, New Orleans, United States
Contribution
Data curation, Validation, Investigation, Project administration, Writing - review and editing

Competing interests
No competing interests declared
Katherine A Twohig

Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom

Contribution
Data curation, Validation, Project administration

Competing interests
No competing interests declared
Emilie Pothin
1. Clinton Health Access Initiative, Boston, United States
2. Swiss Tropical and Public Health Institute, Basel, Switzerland
Contribution
Data curation, Investigation, Methodology

Competing interests
No competing interests declared
Darlene Bhavnani

Clinton Health Access Initiative, Boston, United States

Contribution
Data curation, Investigation, Project administration, Writing - review and editing

Competing interests
No competing interests declared
Amber Dismer

Division of Global Health Protection, Centers for Disease Control and Prevention, Atlanta, United States

Contribution
Data curation, Validation, Investigation, Methodology, Writing - review and editing

Competing interests
No competing interests declared
Jean Baptiste Merilien

Programme National de Contrôle de la Malaria/MSPP, Port-au-Prince, Haiti

Contribution
Data curation, Validation, Investigation, Project administration

Competing interests
No competing interests declared
Karen Hamre

Division of Parasitic Diseases and Malaria, Centers for Disease Control and Prevention, Atlanta, United States

Contribution
Validation, Investigation, Writing - review and editing

Competing interests
No competing interests declared
Phoebe Meyer

Clinton Health Access Initiative, Boston, United States

Contribution
Data curation, Validation, Investigation, Writing - review and editing

Competing interests
No competing interests declared
Arnaud Le Menach

Clinton Health Access Initiative, Boston, United States

Contribution
Data curation, Supervision, Validation, Investigation, Project administration, Writing - review and editing

Competing interests
No competing interests declared
Justin M Cohen

Clinton Health Access Initiative, Boston, United States

Contribution
Supervision, Funding acquisition, Project administration, Writing - review and editing

Competing interests
No competing interests declared
Samson Marseille
1. Programme National de Contrôle de la Malaria/MSPP, Port-au-Prince, Haiti
2. Direction d’Epidémiologie de Laboratoire et de la Recherche, Port-au-Prince, Haiti
Contribution
Data curation, Validation, Writing - review and editing

Competing interests
No competing interests declared
Jean Frantz Lemoine

Programme National de Contrôle de la Malaria/MSPP, Port-au-Prince, Haiti

Contribution
Data curation, Funding acquisition, Validation, Investigation, Project administration, Writing - review and editing

Competing interests
No competing interests declared
Marc-Aurèle Telfort

Programme National de Contrôle de la Malaria/MSPP, Port-au-Prince, Haiti

Contribution
Data curation, Validation, Project administration, Writing - review and editing

Competing interests
No competing interests declared
Michelle A Chang

Division of Parasitic Diseases and Malaria, Centers for Disease Control and Prevention, Atlanta, United States

Contribution
Funding acquisition, Validation, Project administration, Writing - review and editing

Competing interests
No competing interests declared
Kimberly Won

Division of Parasitic Diseases and Malaria, Centers for Disease Control and Prevention, Atlanta, United States

Contribution
Data curation, Writing - review and editing

Competing interests
No competing interests declared
Alaine Knipes

Division of Parasitic Diseases and Malaria, Centers for Disease Control and Prevention, Atlanta, United States

Contribution
Writing - review and editing

Competing interests
No competing interests declared
Eric Rogier

Division of Parasitic Diseases and Malaria, Centers for Disease Control and Prevention, Atlanta, United States

Contribution
Data curation, Validation, Investigation, Project administration, Writing - review and editing

Competing interests
No competing interests declared
Punam Amratia

Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom

Contribution
Formal analysis, Validation, Investigation, Methodology, Writing - review and editing

Competing interests
No competing interests declared
Daniel J Weiss
1. Curtin University, Perth, Australia
2. Telethon Kids Institute, Perth Children’s Hospital, Perth, Australia
Contribution
Resources, Data curation, Writing - review and editing

Competing interests
No competing interests declared
Peter W Gething
1. Curtin University, Perth, Australia
2. Telethon Kids Institute, Perth Children’s Hospital, Perth, Australia
Contribution
Resources, Supervision, Funding acquisition, Methodology, Writing - review and editing

Competing interests
No competing interests declared
Katherine E Battle

Institute for Disease Modelling, Seattle, United States

Contribution
Data curation, Formal analysis, Validation, Methodology, Project administration, Writing - review and editing

For correspondence
kbattle@idmod.org

Competing interests
No competing interests declared

Funding

Bill and Melinda Gates Foundation (OPP1152978)

Ewan Cameron
Katherine A Twohig
Punam Amratia
Daniel J Weiss
Peter W Gething
Katherine E Battle

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We thank Carl Fayette and Franck Monestime with IMA World Health for assistance with the implementation of field surveys, and Katherine Pendleton, Blaise Tschirhart, Divya Sukumar, Ashraf Patel, Namratha Kolur, and Camelia Herman for assistance with laboratory serology data collection.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.