From multiplicity of infection to force of infection in sparsely sampled high-transmission Plasmodium falciparum populations

eLife Assessment

The ability to estimate the force of infection for Plasmodium falciparum from other more directly measurable epidemiological quantities would contribute to malaria epidemiology. The authors propose a method to accomplish this using genetic data from the var genes of the Pf genome and novel applications of existing methods from queueing theory. After revising the manuscript, this is a useful contribution to the field and the authors provide solid evidence to support it.

https://doi.org/10.7554/eLife.100076.4.sa0

Significance of the findings:

Useful: Findings that have focused importance and scope


Strength of evidence:

Solid: Methods, data and analyses broadly support the claims with only minor weaknesses


During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate).

Abstract

High multiplicity of infection (MOI), the number of genetically distinct parasite strains co-infecting a host, characterizes falciparum malaria and other infectious diseases under high transmission. High MOI in Plasmodium falciparum accompanies high prevalence of asymptomatic infection despite high exposure, creating a large transmission reservoir that challenges intervention. This pattern is enabled by parasite immune evasion through extensive antigenic diversity. The force of infection (FOI), the number of new infections acquired by an individual host over a given time interval, is the dynamic counterpart of MOI and a key epidemiological parameter for monitoring antimalarial interventions. FOI is difficult and costly to measure, especially in high-transmission regions, requiring cohort studies or model-based inference from repeated cross-sectional surveys. Here, we apply queuing theory to estimate FOI from MOI with two approaches: a two-moment approximation and Little’s Law. We illustrate these methods using MOI estimates obtained under sparse sampling schemes with the ‘varcoding’ approach. Both methods rely on infection duration data from naive malaria therapy patients and are therefore suitable for subpopulations with limited immunity, such as toddlers. We evaluate their performance using output from a stochastic agent-based model and apply the methods to an interrupted time-series study in northern Ghana, before and immediately after a three-round transient indoor residual spraying intervention. By accounting for sampling limitations with a Bayesian framework and bootstrap imputation, both methods yield good and replicable FOI estimates across various simulated scenarios. Their application to the surveys of 1- to 5-year-old children in Ghana indicates a larger than 70% reduction in annual FOI immediately after intervention.

Introduction

Despite substantial intervention efforts, falciparum malaria in high-transmission regions remains a major public health concern, causing mortality among young children and a considerable economic burden, particularly in sub-Saharan Africa (World Health Organization, 2023). Thus, it remains important to robustly evaluate the effects of intervention efforts in these regions, including on transmission intensity. The force of infection (FOI), defined as the number of new Plasmodium falciparum infections acquired by an individual host over a given time interval, is a key metric reflecting the risk of infection and clinical episodes (Mueller et al., 2012). Whereas other metrics may describe the relationship between transmission intensity and the burden of malaria illness on global or continental scales (Carneiro et al., 2010; Beier et al., 1994), FOI can relate local variation in malaria burden to transmission (Mueller et al., 2012). Although FOI is a key epidemiological parameter for malaria surveillance, it remains difficult, expensive, and labor-intensive to accurately measure, whether directly through cohort studies or indirectly through the fitting of epidemiological models. As molecular tools for parasite genomics become more readily available, they enable new approaches. In particular, molecular advances provide a basis for estimating a sister ‘static’ quantity, the multiplicity of infection (MOI), defined as the number of genetically distinct parasite strains that co-infect a single human host (Zhong et al., 2018; Chang et al., 2017; Ruybal-Pesántez et al., 2022; Tiedje et al., 2022; Labbé et al., 2023). We can therefore ask whether it is possible to go further and, on the basis of MOI, obtain FOI, its dynamical counterpart expressed as a rate.

Early efforts to directly measure FOI included clearing infections and observing the time to re-infection (Macdonald, 1950; Pull and Grab, 1974; Msuya and Curtis, 1991; Alonso et al., 2004). Molecular approaches now enable the genotyping of individual parasite infections (Mueller et al., 2012; Hofmann et al., 2017), but differentiating new infections from the temporary absence of an old infection in the peripheral blood and its subsequent re-emergence (Daubersies et al., 1996) remains challenging due to the low resolution of polymorphic markers and the complex within-host dynamics of malaria infection. Determining molecular FOI remains challenging, labor-intensive, and costly, requiring close long-term monitoring and genotyping of a large cohort.

Alternatives to direct measurements involve cross-sectional surveys with FOI estimated by fitting simple epidemiological models (Bekessy et al., 1976; Muench, 1959; Smith and Vounatsou, 2003; Felger et al., 2012; Mugenyi et al., 2017). These model-fitting procedures require empirical data sampled regularly and frequently, such as age-stratified large cohorts sampled six times a year, to account for FOI and infection duration heterogeneity (Laishram et al., 2012; Simpson et al., 2002; Langhorne et al., 2008; Childs and Buckee, 2015; Chang et al., 2016; Ashley and White, 2014), which is influenced by the interplay between host immunity and the antigenic composition of infections (Piper et al., 1999; Molineaux et al., 2002; Doolan et al., 2009; Barry et al., 2007). Moreover, these approaches may face identifiability issues with model parameters (Mugenyi et al., 2017). Hence, these indirect measurements share limitations with direct ones.

Due to the described challenges, FOI has not become a readily available epidemiological quantity across geographical locations and times. In contrast, various approaches have been proposed to estimate MOI from clinical samples using size-polymorphic antigenic markers, microsatellites, and panels of biallelic single nucleotide polymorphisms (Felger et al., 1994; Konaté et al., 1999; Anderson et al., 2000; Daniels et al., 2008; Chang et al., 2017). Because Plasmodium parasites reproduce asexually during haploid stages within human hosts (Guttery et al., 2012), polymorphic genotypes indicate multiclonal infection. An alternative approach, termed varcoding, leverages the extreme diversity of the var multigene family, which encodes the major variant surface antigen (VSA) during blood-stage infection, and the resulting zero or extremely low similarity between repertoires with respect to their var gene composition shaped by immune selection. As methods for disentangling the molecular complexity of natural parasite infections across various transmission settings emerge and mature, MOI becomes more commonly and easily surveyed across space and time. Although MOI remains one of the most frequently used genetic metrics of parasite transmission (Arnot, 1998; Sondo et al., 2020), it is by definition a number and not a rate.

A natural correlation exists between MOI and FOI, mediated by infection duration, which offers an opportunity to convert MOI into FOI. This conversion has been challenging because MOI estimates are often obtained from sparse sampling schemes, such as single-time-point surveys at the end of the wet (high-transmission) and dry (low-transmission) seasons (Tiedje et al., 2022; Abukari et al., 2019). These MOI estimates have been useful (Earland et al., 2019; Lee et al., 2006) but the sparse sampling scheme behind them has limited their translation into transmission rates.

In this work, we propose using these MOI estimates for FOI inference with two mathematical modeling frameworks based on queuing theory (Choi et al., 2005; Little and Graves, 2008). The two methods require infection duration values, for which we relied on data from naive malaria therapy patients with neurosyphilis (Collins and Jeffery, 1999; Maire et al., 2006). Consequently, our approach is suited for FOI inference for subpopulations with a similar immune profile and the highest vulnerability, for example, infants or toddlers. We evaluate the methods through numerical simulation of an extended stochastic agent-based model (ABM) (He et al., 2018; Zhan et al., 2024) for both closed and open systems with constant or seasonal transmission. We consider both homogeneous and heterogeneous transmission, with the latter including a high-risk group of hosts that receives the majority of the infectious bites. We also examine different statistical distributions for the times between local transmission events. We incorporate limitations representative of those encountered in the collection of field data into the sampling of simulation output, including under-sampling of var genes, missing data, and antimalarial drug treatment. We address these limitations in MOI estimates with a Bayesian framework and an imputation bootstrap approach. Both methods provide good and replicable FOI estimates across simulated scenarios. After validating with simulations, we apply the two methods to empirical data from an interrupted time-series study in Bongo District, northern Ghana, that involved a three-round transient indoor residual spraying (IRS) intervention (Tiedje et al., 2022; Tiedje et al., 2025). We focus on children aged 1–5 years whose immune profiles are closer to naive patients than the rest of the population, an aspect we discuss later. 
We then explore the relationship between FOI and another commonly used surrogate for transmission intensity, the entomological inoculation rate (EIR), defined as the number of infectious bites received by an individual over a given time period (Shaukat et al., 2010). This relationship underscores the challenges of relating measures of transmission intensity to malaria burden at local scales and achieving substantial reductions in transmission in high-transmission regions.

Results

The Bayesian formulation of the varcoding method, combined with the bootstrap imputation approach, effectively addresses sampling limitations often encountered in collecting field data for MOI estimates

Because our FOI inference relies on MOI estimates, we first investigate the impact of various sampling limitations on these estimates. We use the Bayesian formulation of the varcoding method and the bootstrap imputation approach (Materials and methods) to address sampling limitations often encountered in collecting field data for MOI estimates: under-sampling or imperfect detection of var genes, missing data, antimalarial drug treatment, and their combination. For this investigation, we utilize simulation output from an ABM of malaria transmission with known true MOI values (He et al., 2018; Zhan et al., 2024). Key assumptions and processes for the ABM, and the experimental design for simulation output are summarized in Appendix 1—Simulation data, with illustrations in Appendix 1—figures 1 and 2.

Our results indicate that MOI estimates obtained using the Bayesian formulation of the varcoding method and the bootstrap imputation approach closely match true MOI values in most cases. To assess the difference between estimated and true MOI distributions, we use the Cramer–von Mises and Anderson–Darling tests. The Cramer–von Mises test quantifies the sum of the squared differences between cumulative distribution functions, while the Anderson–Darling test, a modification of the former, gives more weight to the tails of distributions. Most p-values are non-significant (>0.05), indicating insufficient evidence to conclude that the estimated and true distributions differ, with few exceptions in pre-IRS or low-coverage IRS scenarios. The Bayesian formulation of varcoding tends to underestimate MOI because it assumes that each co-infecting strain contributes a distinct set of var genes. In practice, limited overlap among co-infecting strains reduces the number of var genes detected per individual relative to this expectation, thereby leading to systematic underestimation of MOI. This underestimation bias can be more pronounced in certain high-transmission situations where many hosts have a high true MOI, such as the aforementioned exceptions in pre-IRS or low-coverage IRS scenarios. Consequently, this underestimation in MOI leads to an underestimation of FOI estimates, as described in the next section. Detailed results of both tests are provided in Supplementary file 1—MOImethodsPerformance.xlsx.
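As a rough illustration of the comparison described above, the sketch below computes a two-sample Cramer–von Mises statistic as a scaled sum of squared differences between two empirical cumulative distribution functions. The p-value comes from a simple permutation scheme, which is a substitution chosen here to keep the example dependency-free and is not necessarily the procedure used for Supplementary file 1. The function names and the number of permutations are hypothetical.

```python
import random
from bisect import bisect_right


def ecdf(sorted_sample, x):
    """Empirical CDF of a sorted sample, evaluated at x."""
    return bisect_right(sorted_sample, x) / len(sorted_sample)


def cvm_statistic(a, b):
    """Two-sample Cramer-von Mises statistic.

    Sums the squared ECDF differences over all pooled observations and
    scales by n*m/(n+m)^2.
    """
    sa, sb = sorted(a), sorted(b)
    n, m = len(sa), len(sb)
    squared_gaps = sum((ecdf(sa, x) - ecdf(sb, x)) ** 2 for x in sa + sb)
    return n * m / (n + m) ** 2 * squared_gaps


def permutation_pvalue(a, b, n_perm=200, seed=0):
    """Permutation p-value (assumed scheme): share of label shuffles whose
    statistic is at least as large as the observed one."""
    rng = random.Random(seed)
    observed = cvm_statistic(a, b)
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if cvm_statistic(pooled[:len(a)], pooled[len(a):]) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)
```

Here `a` would hold the true MOI values and `b` the estimated ones for the same simulated survey. A large p-value means only that the test found no evidence of a difference, which matches the cautious reading given above.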

The MOI estimates from the different surveys in Ghana and from the simulated outputs are not Poisson-distributed (Appendix 1—Test of deviation from Poisson homogeneity in MOI estimates, Supplementary file 2—deviationFromPoissonTest.xlsx) (Potthoff and Whittinghill, 1966; Lloyd-Smith et al., 2005). This deviation suggests that infection arrivals depart from a homogeneous Poisson process. In addition, infection durations often deviate from an exponential distribution, violating the assumptions under which Poisson MOI arises. Together, these departures complicate conversion from MOI to FOI in the presence of a finite host carrying capacity, but the proposed methods are flexible enough to remain applicable (Materials and methods—Inferring FOI from MOI estimates).
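A quick way to see what a departure from Poisson looks like in count data is the variance-to-mean ratio, which is close to 1 for Poisson counts and larger under overdispersion. The sketch below is a simplified check and not the Potthoff–Whittinghill test cited above; the function names are hypothetical.

```python
from statistics import mean, variance


def dispersion_index(counts):
    """Sample variance divided by sample mean (about 1 for Poisson counts)."""
    return variance(counts) / mean(counts)


def dispersion_statistic(counts):
    """(n - 1) * variance / mean.

    Under a homogeneous Poisson model this is approximately chi-square
    distributed with n - 1 degrees of freedom.
    """
    return (len(counts) - 1) * dispersion_index(counts)
```

Comparing `dispersion_statistic` against a chi-square reference with n - 1 degrees of freedom would give a p-value; that step is left out because it needs a chi-square distribution function that the Python standard library does not provide.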

The two-moment approximation and Little’s Law methods give good and replicable estimates for FOI across various simulated scenarios

We begin with a homogeneous exposure risk scenario for seasonal transmission in a closed system (Appendix 1—figure 2A–C, and Appendix 1—Simulation data). The times between local transmission events follow a Gamma distribution (Appendix 1—figure 2C). We infer FOI using the true MOI values and the MOI estimates obtained via the Bayesian formulation or the bootstrap imputation approach, with each method accounting for the specific sampling limitations applied in that scenario, whether a single limitation or all limitations combined. Details on deriving the confidence intervals are provided in Appendix 1—Confidence intervals for FOI inference.
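The logic of the Little's Law conversion can be sketched in a few lines: the mean number of concurrent infections (mean MOI) equals the arrival rate of infections (FOI) times the mean infection duration, so FOI is mean MOI divided by mean duration. The toy simulation below is an assumption-laden stand-in for the ABM: arrivals are Poisson, durations are Gamma-distributed with an arbitrary shape of 2, there is no carrying capacity, and the 150-day mean duration is a placeholder rather than the value derived from the malaria therapy data.

```python
import random


def foi_from_littles_law(moi_values, mean_duration_days):
    """Little's Law, L = lambda * W, solved for the arrival rate lambda.

    L is the mean MOI and W the mean infection duration in days; the result
    is rescaled to infections per host per year.
    """
    mean_moi = sum(moi_values) / len(moi_values)
    return 365.0 * mean_moi / mean_duration_days


def simulate_moi_snapshot(foi_per_year, mean_duration_days, n_hosts,
                          horizon_days=3000.0, seed=1):
    """Toy queue (assumed): Poisson arrivals, Gamma(shape=2) durations,
    no capacity limit. Returns, for each host, the number of infections
    still active at `horizon_days`."""
    rng = random.Random(seed)
    rate_per_day = foi_per_year / 365.0
    shape = 2.0
    scale = mean_duration_days / shape  # Gamma mean = shape * scale
    snapshot = []
    for _ in range(n_hosts):
        t, active = 0.0, 0
        while True:
            t += rng.expovariate(rate_per_day)  # next infection arrival
            if t > horizon_days:
                break
            if t + rng.gammavariate(shape, scale) > horizon_days:
                active += 1  # infection has not yet cleared at survey time
        snapshot.append(active)
    return snapshot


# Hypothetical check: simulate with a known FOI, then recover it from MOI.
moi = simulate_moi_snapshot(foi_per_year=5.0, mean_duration_days=150.0,
                            n_hosts=2000)
estimated_foi = foi_from_littles_law(moi, mean_duration_days=150.0)
```

With a single cross-sectional snapshot per host, `estimated_foi` should land close to the simulated value of 5, which mirrors on a small scale the validation against known FOI in the simulated scenarios.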

Across pre-IRS and three IRS coverage levels, the 95% confidence intervals and the bootstrap distributions of FOI estimates are narrow. FOI estimates based on the true MOI values, as well as those based on MOI estimates obtained under (1) the missing data limitation and (2) the antimalarial treatment limitation, each accounted for by the bootstrap imputation approach, closely match the true FOI values (Figure 1). FOI estimates based on MOI estimates obtained under (1) the under-sampling or imperfect detection of var genes and (2) all sampling limitations combined, accounted for by the Bayesian formulation alone in the former case and by the Bayesian formulation together with the bootstrap imputation approach in the latter, show slight underestimations (Figure 1), due to the underestimation of MOI described in the previous section.
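A non-parametric bootstrap of the kind summarized in the figures can be sketched as follows: hosts are resampled with replacement, the FOI estimate is recomputed for each resample, and quantiles of the resulting distribution are reported. The estimator inside the loop is the simple Little's Law ratio, used here only as a placeholder, and the mean duration argument is hypothetical; the intervals reported in this work follow Appendix 1—Confidence intervals for FOI inference.

```python
import random


def bootstrap_foi(moi_values, mean_duration_days, n_boot=200, seed=0):
    """Sorted bootstrap distribution of the annual FOI estimate."""
    rng = random.Random(seed)
    n = len(moi_values)
    estimates = []
    for _ in range(n_boot):
        resample = rng.choices(moi_values, k=n)  # hosts drawn with replacement
        estimates.append(sum(resample) / n / mean_duration_days * 365.0)
    estimates.sort()
    return estimates


def quantile(sorted_values, q):
    """Nearest order statistic for quantile q in [0, 1]."""
    idx = round(q * (len(sorted_values) - 1))
    return sorted_values[idx]
```

Calling `quantile(est, 0.05)`, `quantile(est, 0.5)`, and `quantile(est, 0.95)` on the output `est` of `bootstrap_foi` gives the three inner values drawn in each boxplot, with `est[0]` and `est[-1]` as the minimum and maximum.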

Figure 1

Confidence intervals for estimated mean FOI values in simulated scenarios of homogeneous exposure risk, before and during IRS interventions at three different coverage levels.

The times between local transmission events follow a Gamma distribution, with seasonal transmission in a closed system. FOI estimates are derived from true MOI values and MOI estimates obtained through the Bayesian formulation or the bootstrap imputation approach correcting for all or individual sampling limitations. The true mean FOI per host per year is computed by dividing the total number of infections acquired by the population by the total number of hosts in the population. Confidence intervals are estimated from 200 bootstrap replicates using non-parametric bootstrap analysis. Each boxplot shows minimum, 5% quantile, median, 95% quantile, and maximum values.

To quantify the difference between inferred and true FOI values, we check if the true FOI lies within the bootstrap distribution and calculate the relative deviation, which is defined as the true FOI value minus the median of the bootstrap distribution for the estimate, normalized by the true FOI value. These details are in Supplementary file 3—FOImethodsPerformance.xlsx.
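Written out, with $B$ bootstrap replicates of the FOI estimate (the symbols are notation introduced here for illustration), the relative deviation is

$$
\text{relative deviation} = \frac{\mathrm{FOI}_{\mathrm{true}} - \operatorname{median}\left(\widehat{\mathrm{FOI}}^{*}_{1}, \ldots, \widehat{\mathrm{FOI}}^{*}_{B}\right)}{\mathrm{FOI}_{\mathrm{true}}}
$$

so a positive value indicates underestimation of the true FOI and a negative value indicates overestimation.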

We continue with a heterogeneous exposure risk scenario in which a high-risk group ($\frac{2}{3}$ of the population) receives approximately 94% of bites, while a low-risk group ($\frac{1}{3}$ of the population) receives the remainder (Appendix 1—figure 2C). Transmission is seasonal and the system is semi-open (Appendix 1—figure 2A, B and Appendix 1—Simulation data). The times between local transmission events are Gamma-distributed (Appendix 1—figure 2C). As before, FOI estimates across pre-IRS and three IRS coverage levels show narrow 95% confidence intervals and bootstrap distributions close to true FOI values (Figure 2), with slight underestimations for FOI estimates based on MOI estimates corrected for under-sampling or imperfect detection of var genes and those corrected for all sampling limitations.
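To put the exposure split of this scenario in per-capita terms, taking the approximate 94% share at face value, the ratio of bites received per host in the high-risk group to bites received per host in the low-risk group is

$$
\frac{0.94 / (2/3)}{0.06 / (1/3)} = \frac{1.41}{0.18} \approx 7.8
$$

that is, a host in the high-risk group receives roughly eight times as many bites as a host in the low-risk group.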

Figure 2

Confidence intervals for estimated mean FOI values in simulated scenarios of heterogeneous exposure risk, before and during IRS interventions at three different coverage levels.

The times between local transmission events follow a Gamma distribution, with seasonal transmission in a semi-open system. FOI estimates are derived from true MOI values and MOI estimates obtained through the Bayesian formulation of the varcoding method or the bootstrap imputation approach correcting for all or individual sampling limitations. The true mean FOI per host per year is computed by dividing the total number of infections acquired by the population by the total number of hosts in the population. Confidence intervals are estimated from 200 bootstrap replicates using non-parametric bootstrap analysis. Each boxplot shows minimum, 5% quantile, median, 95% quantile, and maximum values.

The performance of the two methods across additional simulated scenarios is shown in Appendix 1—figures 7–18.

The two-moment approximation and Little’s Law methods give replicable FOI estimates for empirical surveys conducted in the Bongo District of northern Ghana

After validating with simulations, we apply the two methods to empirical surveys in the Bongo District of northern Ghana. We first derive MOI estimates for these surveys. Because the detection power of PCR is high but imperfect, we consider three levels of detectability, in which 0% (high detectability), 5% (mid detectability), or 10% (low detectability) of PCR-negative individuals are assumed to carry infection (Materials and methods—The under-sampling of infections in empirical surveys).

Antimalarial treatment, sought in response to symptoms or perceived transmission risk, can impact the duration of an ongoing infection and may therefore violate the assumption underlying the two methods, which rely on infection duration data from naive malaria therapy patients with neurosyphilis. We address this issue by either excluding treated individuals from the analysis or by discarding their infection status and MOI estimates, instead sampling from non-treated individuals with MOI >0. Since the latter samples non-zero MOIs for these treated and uninfected individuals, it results in an upper bound for FOI estimates. Note that in this latter case, we do not assume that the MOI distribution for treated individuals is the same as that for untreated individuals. Rather, we aim to estimate what their MOI would have been, and consequently, determine what the FOI per individual per year in the combined population would be, had these individuals not received antimalarial treatment. Further details can be found in Materials and methods—Antimalarial drug treatment of infections in empirical surveys.
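The two ways of handling treated individuals can be sketched as below. The record layout (dictionaries with `moi` and `treated` keys) and the function names are hypothetical, and a single random draw per treated individual stands in for the full bootstrap imputation described in Materials and methods.

```python
import random


def exclude_treated(records):
    """Option 1: keep only the MOI values of non-treated individuals."""
    return [r["moi"] for r in records if not r["treated"]]


def resample_treated(records, seed=0):
    """Option 2: discard the observed MOI of each treated individual and
    draw a replacement from non-treated individuals with MOI > 0."""
    rng = random.Random(seed)
    donors = [r["moi"] for r in records if not r["treated"] and r["moi"] > 0]
    return [rng.choice(donors) if r["treated"] else r["moi"] for r in records]
```

Because option 2 never assigns a zero MOI to a treated individual, it pushes the aggregated MOI upward, which is why the text treats the resulting FOI as an upper bound.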

Next, we apply the two-moment approximation and Little’s Law methods to derive FOI estimates from empirical MOI estimates. Both methods yield replicable FOI estimates. The FOI estimates are similar across the three PCR sensitivity levels and the two approaches for handling treated individuals. The 95% confidence intervals and the full sampling distributions from bootstrap analysis are concentrated (Figure 3, Figure 3—figure supplements 1–3). Notably, there is a significant reduction in FOI, exceeding 70%, indicating that the three-round IRS intervention, although transient, was highly effective.

Figure 3 with 3 supplements

Confidence intervals for the estimated mean FOI values in Ghana surveys before and immediately after a transient three-round IRS intervention.

(A) The estimated FOI values when excluding treated individuals from the analysis. (B) The estimated FOI values when discarding the infection status and MOI estimates of treated individuals and sampling from non-treated ones with MOI >0. Since this case samples non-zero MOIs for these treated and uninfected individuals, it results in an upper bound for FOI estimates. Confidence intervals are estimated from 200 bootstrap replicates using non-parametric bootstrap analysis. Each boxplot shows minimum, 5% quantile, median, 95% quantile, and maximum values. The value of $c$ is set to 30. FOI estimates with other values of $c$ can be found in Figure 3—figure supplements 1–3.

The inferred FOI and directly measured EIR from the Ghana surveys align with the relationship between these two quantities in previous studies

We plot the measured annual EIR against the estimated annual FOI, and the transmission efficiency (the ratio between FOI and EIR) against the measured annual EIR from previous field studies summarized by Smith et al., 2010 (Figure 4). In these studies, FOI estimation was based on fitting a simple epidemiological model to age-stratified prevalence data from cross-sectional parasitological studies (Smith et al., 2010; Pull and Grab, 1974), while EIR was estimated using various methods such as exit bait collection, human bait collection, pyrethrum spray collection, night bite collection, and outdoor resting collection (Smith et al., 2010). The yellow line indicates the functional curve fitted to these data points (Materials and methods—Conversion between FOI and the EIR), initially proposed by Smith et al., 2010.

Figure 4

The saturation in FOI with increasing EIR and their non-linear relationship from previous field studies.

(A) and (B) present our empirical estimates (with $c = 30$) when excluding treated individuals from the analysis. (C) and (D) show our estimates when discarding the infection status and MOI estimates of treated individuals and instead sampling from non-treated ones with MOI >0. Since this case samples non-zero MOIs for these treated and uninfected individuals, it results in an upper bound for FOI estimates. The black points represent paired EIR–FOI values from the literature, as summarized by Smith et al., 2010, with crosses indicating instances where multiple estimates or ranges were reported or estimated for the same location. The yellow curve represents the best fit to these paired EIR–FOI values (Smith et al., 2010). The purple hollow diamond and plus represent the Ghana data, showing our FOI estimates using the two methods and the EIR measured in the field by the entomological team (Tiedje et al., 2022).

The data show that mean annual FOI values are consistently below an empirical limit of 20. Due to the highly non-linear relationship between FOI and EIR, there is no single constant factor to convert FOI to EIR (or vice versa) across different settings. In high-transmission regions, transmission efficiency is extremely low: annual EIR can reach a few hundred to a thousand, while annual FOI ranges from about 5 to 10. For future applications, when using our proposed methods to estimate FOI from MOI under sparse sampling schemes, we can rely on this functional curve to convert FOI estimates to corresponding EIR values. There is, however, an inherently high variance in this conversion. Overall, our paired EIR (measured directly by the entomological team in Ghana; Tiedje et al., 2022) and FOI estimates align well with previous studies, indicating the consistency of our methods.
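The scale of this inefficiency is easy to check with the ranges quoted above. In the sketch below, the value of 300 stands in for "a few hundred" and is an illustrative assumption, as is the function name; none of the pairs is a measured data point.

```python
def transmission_efficiency(annual_foi, annual_eir):
    """Infections acquired per infectious bite received (FOI / EIR)."""
    if annual_eir <= 0:
        raise ValueError("annual EIR must be positive")
    return annual_foi / annual_eir


# Illustrative (FOI, EIR) pairs spanning the ranges quoted in the text.
illustrative_pairs = [(5.0, 300.0), (10.0, 300.0), (5.0, 1000.0), (10.0, 1000.0)]
efficiencies = [transmission_efficiency(foi, eir)
                for foi, eir in illustrative_pairs]
```

Across these pairs the efficiency runs from 0.005 to about 0.033, so the implied conversion factor between FOI and EIR varies several-fold even within the high-transmission range, consistent with the absence of a single constant factor.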

The variance inferred by the two-moment approximation method reflects transmission intensity and heterogeneity across individuals

We focused primarily on the mean FOI in both simulation outputs and empirical data. However, the two-moment approximation method also yields estimates of the variance in infection interarrival times. When examining the inferred variance across simulated scenarios, we find that the estimated individual-level mean FOI, when aggregated across the local population, robustly reflects the total number of infections accumulated within that population. This relation holds across a wide range of assumptions, including seasonality, system openness, heterogeneity in exposure risks, and the distribution of infection interarrival times. In other words, it remains robust regardless of the magnitude of variance in FOI.

The inferred variance is most significantly correlated with the inferred mean FOI. Specifically, a smaller mean FOI is associated with a larger variance (Appendix 1—figure 19). Overall, seasonal runs exhibit greater variance than non-seasonal runs. Runs with heterogeneous transmission (Appendix 1—figure 2C) have higher variance compared to homogeneous transmission runs. These findings align with expectations, as both seasonality and transmission heterogeneity increase the dispersion of FOI and inter-arrival times of infections.

Discussion

Building on estimates of the MOI under sparse sampling schemes, we demonstrate the feasibility of converting these values into estimates of the FOI. In various simulation scenarios using an extended stochastic ABM, the two-moment approximation and Little’s Law methods provide good and replicable FOI estimates based on MOI estimates obtained with the Bayesian formulation of the varcoding method and the bootstrap imputation approach, despite common sampling limitations.

Both methods tend to slightly underestimate FOI because they rely on MOI estimates that are themselves biased downward. The current MOI estimation procedure, specifically the Bayesian formulation of the varcoding method, does not correct for the limited overlap of var genes between co-infecting strains. This limited overlap reduces the number of var genes identified per individual, while the Bayesian formulation implicitly assumes that each co-infecting strain contributes a unique set of var genes, thereby introducing a downward bias in the MOI estimates.

Studies have shown parasite co-transmission from single mosquito bites in high-transmission regions (Nkhoma et al., 2020; Wong et al., 2017), predominantly based on clinical or symptomatic infections (Andolina et al., 2021; Lindblade et al., 2013), although asymptomatic cases constitute the majority of the malaria transmission reservoir in these regions. Co-transmitted recombinant parasites, more closely related to each other than parasites from different bites, further reduce the number of identified var genes (Wong et al., 2022; Nkhoma et al., 2020; Wong et al., 2017) per individual. Low or variable parasite densities due to factors like small blood volumes, genomic DNA quality, clinical status, and within-host dynamics also affect MOI estimation (Peyerl-Hoffmann et al., 2001; Bruce et al., 2000; Okell et al., 2012; Farnert et al., 1997; Färnert et al., 2008; Barry et al., 2021; Hergott et al., 2024). These issues are common to all measures of MOI and direct measures of FOI. We did not correct for these factors when estimating MOI and FOI for the Ghana surveys. In the future, the Bayesian formulation could be extended to account for these confounders in estimating the number of co-infecting strains.

Our proposed methods leverage infection duration data from malaria therapy patients with neurosyphilis who had no prior malaria exposure, making them well-suited for FOI inference in highly vulnerable subpopulations with similarly naive immune profiles. For our Ghana surveys, we focus on children between 1 and 5 years of age, as their immune profiles are closer to those of naive patients than those of older individuals. In the pre-IRS phase of the Ghana surveys, an estimated mean FOI of about 5 per host per year suggests that a 4-year-old child would have experienced around 20 infections, which makes them appear far from naive. However, the extreme documented diversity of var genes (Tiedje et al., 2025) means that even with 20 infections, a 4-year-old may have developed immunity to only a small fraction of the total antigenic diversity encoded by these genes. Consequently, they are not as immunologically experienced as it might initially seem. Moreover, studies have shown that long-lived infections in older children and adults can persist for months or even years, including during the dry season. This persistence is driven by high antigenic variation of var genes and associated incomplete immunity. Additionally, parasites can skew PfEMP1 expression to produce less adhesive erythrocytes, enhancing splenic clearance, reducing virulence, and maintaining extended periods of subclinical parasitemia (Andrade et al., 2024; Tran et al., 2013; Zhang and Deitsch, 2022). The impact of immunity on infection duration with age for falciparum malaria remains a challenging open question.

We recognize the limitations that this aspect of infection duration introduces in the FOI estimation. To reduce mis-specification in infection duration and fully utilize our proposed methods, future data collection and sampling could prioritize subpopulations with minimal prior infection and an immune profile similar to that of naive adults, such as infants and toddlers. As these individuals are also the most vulnerable, prioritizing them aligns with the short-term priority of all intervention efforts: to monitor and protect the most vulnerable individuals from severe symptoms and death.

The application of both methods is often framed in terms of long time-series or multiple realizations of the same process. However, empirical surveys often rely on sparse sampling schemes with at most a small number of observations per host. We therefore approximate the stationary queue length distribution at the population level by aggregating MOI estimates across sampled individuals, rather than relying on time averaging. In doing so, any individual-level heterogeneity in transmission is not explicitly modeled in the inference and is instead subsumed into the aggregated MOI distribution. The resulting FOI estimates, combined with demographic information on population size, provide an estimate of the total number of P. falciparum infections acquired by the population per year. We evaluated the impact of individual heterogeneity due to transmission on FOI inference using simulations. Even for significant heterogeneity among individuals, our methods show performance comparable to that of homogeneous scenarios. Additionally, our methods perform similarly for both non-seasonal and seasonal transmission scenarios.
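The aggregation step described above amounts to pooling one MOI value per sampled host into an empirical distribution and then scaling the per-host FOI by the population size. A minimal sketch, with hypothetical function names and no claim to reproduce the inference code used in this work:

```python
from collections import Counter


def moi_distribution(moi_values):
    """Empirical MOI distribution pooled across sampled hosts.

    Used as a stand-in for the stationary queue-length distribution.
    """
    counts = Counter(moi_values)
    n = len(moi_values)
    return {k: counts[k] / n for k in sorted(counts)}


def total_infections_per_year(foi_per_host_per_year, population_size):
    """Population-level number of infections acquired per year."""
    return foi_per_host_per_year * population_size
```

For example, a mean FOI of 5 per host per year in a hypothetical subpopulation of 1000 children corresponds to 5000 infections acquired per year.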

After validating our methods with simulations, we applied them to surveys in the Bongo District of northern Ghana, a high-transmission endemic region where estimating MOI and related FOI has been challenging. The three-round transient IRS intervention proved strong and effective, resulting in a significant reduction in FOI of more than 70%.

Our Ghana surveys lack direct FOI measurements, which prevents us from directly evaluating our methods as we did with simulation outputs. Empirical MOI–FOI pairs from cohort studies are still lacking, and direct FOI measurements are prone to errors due to challenges in differentiating new infections from the temporary absence and re-emergence of old infections. These challenges arise from the low resolution of polymorphic markers used in cohort studies and the complexity of within-host dynamics. Alternative approaches fit epidemiological models to densely sampled cross-sectional surveys that also lack direct FOI measurements, thereby precluding direct validation of the inferred FOI estimates. In these approaches, model parameterization likewise relies on capturing certain epidemiological quantities, such as prevalence or incidence, much as in this work, where we selected FOI values that maximize the likelihood of observing given MOI distributions. Additionally, we paired our estimated FOI values for the Ghana surveys with independently measured EIR (Tiedje et al., 2022). We demonstrated a reasonable alignment between our paired EIR–FOI values and the general relationship from previous studies. We acknowledge, however, that our validation for field data is indirect and further complicated by high variance in the relationship between EIR and FOI from previous studies.

The FOI estimates obtained through these methods go beyond describing basic malaria epidemiology and evaluating intervention outcomes. FOI for naive hosts is a fundamental parameter for epidemiological models. The FOI of non-naive hosts is typically a function of their immune status, body size, and the FOI of naive hosts. Additionally, the FOI estimates can inform process-based models for the population dynamics of complex infectious diseases, serving as priors for parameterizing and validating more complex agent- or equation-based models, in a way that reduces computational cost, improves efficiency, and minimizes identifiability issues.

A key characteristic of malaria transmission is the saturation of FOI at high transmission and the highly non-linear relationship between FOI and EIR. While EIR can reach values of several hundred to a thousand per year, annual FOI typically saturates below 20 (Smith et al., 2010). Different choices of field measures of EIR and FOI cannot account for this drastic difference in magnitude. Transmission is highly inefficient in high-transmission regions with high annual EIRs. The difference between these two quantities is mediated primarily by immunity or within-host dynamics, measurement bias, and heterogeneous transmission (Donovan et al., 2007; Macdonald, 1950; John et al., 2005; Doolan and Martinez-Alier, 2006). Mathematical models commonly use the probability of transmission from an infectious mosquito bite to bridge FOI and EIR, as a general parameter encapsulating a variety of processes.

FOI saturation poses significant challenges to intervention efforts in high-transmission endemic regions. In these areas, intervention efforts must dramatically reduce EIR by several orders of magnitude to bring FOI below saturation levels. In other words, high-coverage interventions are needed to achieve any noticeable impact on individual exposure risk. Theoretical models suggest a sharp non-linear transition toward sustainable low transmission or elimination, influenced by the high antigenic diversity of P. falciparum (de Roos et al., 2023; Zhan et al., 2024). The same molecular information from var gene sequence data used here to estimate MOI and FOI in the Bongo District underlies estimates of this diversity (Zhan et al., 2024).

The proposed methods are applicable to evaluate transmission intensity in pathogens exhibiting multi-genomic infection due to large antigenic diversity, including those encoding such variation with multigene families (Deitsch et al., 2009). Easier estimation of changes in transmission intensity should enhance the efficiency and evaluation of control programs across a broader range of infectious diseases.

Materials and methods

Key resources table

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information
Software, algorithm	R 3.6.1	R Development Core Team (2019)	RRID:SCR_001905

High genetic diversity of var and the associated strain structure of limiting similarity

We briefly describe the biology of the malaria parasite P. falciparum that underpins our MOI estimation procedure, the Bayesian formulation of the varcoding method.

In high-transmission endemic regions, human hosts remain susceptible to malaria re-infection throughout their lifetime (Doolan et al., 2009). High asymptomatic prevalence and high MOI result from high transmission rates and incomplete host immunity due to the parasite’s high antigenic variation (Deitsch et al., 2009). Parasites achieve this variation and evade the immune system by encoding key VSAs using multigene families (Deitsch et al., 2009). One important multigene family in the malaria parasite P. falciparum is known as var, which encodes PfEMP1 (Plasmodium falciparum erythrocyte membrane protein 1), the major VSA during the blood stage of infection (Zhang and Deitsch, 2022; Baruch et al., 1995; Smith et al., 1995; Su et al., 1995). Each parasite carries 50–60 var genes across its chromosomes, encoding different variants of this protein, which are expressed largely sequentially (Appendix 1—Simulation data, subsection ‘An extended var model,’ sub-subsection ‘Within-host dynamics’).

Empirical sequencing of var genes focuses on the DBLα tag, a conserved ~450 bp region encoding the immunogenic Duffy-binding-like alpha domain of PfEMP1 (Tiedje et al., 2025; Ruybal-Pesántez et al., 2022; Ruybal-Pesántez et al., 2017; Day et al., 2017). Bioinformatic analyses of a large database of exon 1 sequences of var genes revealed a predominantly 1-to-1 DBLα-var relationship, meaning each DBLα tag typically represents a unique var gene (Tan et al., 2023). Hereafter, we use DBLα types and var genes interchangeably.

In high-transmission endemic regions, local parasite populations exhibit a vast pool of var gene variants, ranging from thousands to tens of thousands (Day et al., 2017; Tiedje et al., 2022). These variants are generated primarily through mitotic recombination, but also through meiotic recombination, mutation, and host/mosquito vector migration (Claessens et al., 2014; Frank et al., 2008; Freitas-Junior et al., 2000; Bopp et al., 2013). This large pool, combined with negative frequency-dependent selection mediated by hosts’ specific immunity, results in the limited overlap of var genes among individual repertoires (individual parasite genomes) and isolates (sets of individual parasite genomes co-infecting individual hosts) (Day et al., 2017; He et al., 2018). Major groups of var genes are classified based on their 5′-flanking region, called ups, which controls gene expression: upsA and upsB/C (non-upsA) (Rask et al., 2010). Non-upsA DBLα sequences are ~20 times more diverse and less conserved among repertoires than the upsA DBLα sequences. Hence our MOI estimation leverages non-upsA DBLα types, as detailed in the following section.

Bayesian formulation of the ‘varcoding’ method for MOI estimation

The limited overlap of var repertoires allows MOI estimation based on the number of non-upsA DBLα types identified from an isolate. The original varcoding method assumes a constant repertoire length, that is, number of non-upsA DBLα types in a parasite genome, to convert the number of types identified in an isolate to the estimated MOI. This method does not account for the measurement error (Appendix 1—figure 2D) in this length introduced by the under-sampling or imperfect detection of var genes in an infection. We recently extended this method to a Bayesian formulation that considers this error and provides a posterior distribution of MOI values for each sampled individual (Tiedje et al., 2025). We documented the steps of this Bayesian formulation, compared two ways of obtaining population-level MOI distribution (either pooling the maximum a posteriori MOI estimates or calculating a mixture distribution), and examined the impact of different priors (Tiedje et al., 2025). In our analyses here, we provide the estimated population-level MOI distribution obtained from a mixture distribution using a uniform prior for individuals.
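The two ways of building a population-level MOI distribution mentioned above can be illustrated with a minimal Python sketch. It assumes that each sampled individual's posterior over MOI is already available as a row of probabilities, and that the mixture gives every individual equal weight; the function name and the toy posteriors are hypothetical and are not taken from the original analysis.

```python
import numpy as np

def population_moi_distributions(posteriors):
    """Return (mixture, map_pooled) population-level MOI distributions.

    posteriors: array of shape (n_hosts, max_moi); row i is the posterior
    probability that host i has MOI = 1, 2, ..., max_moi.
    """
    post = np.asarray(posteriors, dtype=float)
    post = post / post.sum(axis=1, keepdims=True)  # renormalise each row
    n_hosts, max_moi = post.shape

    # Mixture distribution: equal-weight average of individual posteriors
    # (equal weighting is an assumption of this sketch).
    mixture = post.mean(axis=0)

    # MAP pooling: keep each host's most probable MOI, then tabulate.
    map_index = post.argmax(axis=1)
    map_pooled = np.bincount(map_index, minlength=max_moi) / n_hosts
    return mixture, map_pooled

# Toy posteriors for three hosts over MOI = 1, 2, 3 (illustrative values only).
toy = [[0.7, 0.2, 0.1],
       [0.1, 0.6, 0.3],
       [0.2, 0.3, 0.5]]
mixture, map_pooled = population_moi_distributions(toy)
```

The mixture retains each individual's posterior uncertainty, whereas MAP pooling discards it, which is one way to read the difference between the two options.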

Empirical surveys from Ghana

We use empirical data from an interrupted time-series study conducted in Bongo District, northern Ghana. This study involves four age-stratified cross-sectional surveys of ~2000 participants each, conducted between 2012 and 2016. The study assessed the impacts of a transient three-round IRS intervention, combined with long-lasting insecticidal nets (LLINs), on the asymptomatic P. falciparum reservoir (Tiedje et al., 2017; Tiedje et al., 2022; Tiedje et al., 2025). Surveys were conducted at the end of the wet/high-transmission season (i.e., October) or the dry/low-transmission season (i.e., May/June). The study consists of two phases: (1) Pre-IRS: two surveys before the IRS intervention (Survey 1 in October 2012; Survey 2 in May/June 2013); and (2) Immediately post-IRS: two surveys immediately following the three-round IRS intervention (Survey 3 in October 2015; Survey 4 in May/June 2016) (Appendix 1—figure 2E). Details on the study area, study population, malaria control interventions (IRS and LLINs), inclusion/exclusion criteria, data collection/generation procedures have been previously described (Tiedje et al., 2017; Tiedje et al., 2022; Tiedje et al., 2025).

The under-sampling of infections in empirical surveys

The empirical MOI estimates in various epidemiological studies, including ours in Bongo District, northern Ghana, rely on microscopy-positive individuals (Tiedje et al., 2025; Tiedje et al., 2022). Due to microscopy’s limited sensitivity, a significant fraction of individuals who carry infections are undetected. A subset of Ghana surveys also include submicroscopic infections detected by PCR (Tiedje et al., 2025; Tiedje et al., 2022), which is significantly more sensitive and can detect a higher fraction, if not 100%, of individuals with P. falciparum infections. Using surveys with both microscopy and PCR detection, we estimate conversion factors of 0.76 for untreated children and 0.67 for antimalarial-treated children aged 1–5 years.

For surveys with both detection methods, we directly calculate the number of microscopy-negative but PCR-positive individuals. For surveys with only microscopy data, we use the estimated conversion factors to estimate the number of microscopy-negative but PCR-positive individuals. Additionally, we account for the high but not exactly known sensitivity of PCR by assuming its detectability ranges from relatively low (10% of all PCR-negative individuals carrying undetected infections) to perfect (no PCR-negative individuals carrying undetected infections). We calculate the number of individuals with infections undetected by PCR for each sensitivity level.

After estimating the number of individuals with undetected infections (both microscopy-negative but PCR-positive and PCR-negative but infected), we sample from existing MOI estimates of microscopy-positive individuals not under antimalarial treatment (see the following section) to represent the missing MOI data.

Similarly, for individuals who are microscopy-positive but lacking var information due to factors like low DNA quality, we sample values from existing MOI estimates of microscopy-positive individuals not under antimalarial treatment to represent the missing MOI data.

We assume that microscopy-negative but PCR-positive children aged 1–5 years and microscopy-positive children aged 1–5 years have similar MOI distributions. This assumption is suggested by our analysis of Ghana surveys, which shows no clear relationship between parasitemia levels and MOI (or the number of var genes detected within an individual host, on the basis of which our MOI values were estimated) (Appendix 1—figure 3). We scale the parasitemia levels and the number of non-upsA var genes or MOI estimates before performing the regression. Parasitemia levels underlie the difference in detection sensitivity between PCR and microscopy.

This lack of a clear relationship can be attributed to several factors. One factor is immune regulation of parasite density, where host immunity may limit parasite density without reducing the diversity of co-infecting strains (Eldh et al., 2020), leading to individuals with low parasitemia but high MOI. Another factor is asynchronous parasite dynamics, where different parasite clones replicate asynchronously (Farnert et al., 1997), resulting in varied parasite densities that do not directly correlate with the number of distinct strains present. This could explain why individuals with low parasitemia still exhibit multiple strains. Lastly, competition among parasite strains suppresses the growth of individual clones, lowering parasite densities while maintaining high strain diversity, thus reducing the expected correlation between MOI and parasitemia (Sondo et al., 2019; Earland et al., 2019).

Antimalarial drug treatment of infections in empirical surveys

Individuals may seek and receive antimalarial treatment in response to symptoms or perceived transmission risk. In our surveys in the Bongo District of northern Ghana, over 50% of children aged 1–5 years responded that they had received an antimalarial treatment in the previous 2 weeks (i.e., participants that reported they were sick, sought treatment, and were provided with an antimalarial treatment) in the wet/high-transmission survey before IRS (i.e., Survey 1, Appendix 1—figure 2E; Tiedje et al., 2022). This fraction is significantly lower for the dry/low-transmission survey before IRS and the surveys collected immediately after IRS (i.e., Survey 2–4, Appendix 1—figure 2E; Tiedje et al., 2022).

Disentangling the effect of drug treatment on measurements like infection duration is challenging. Since our methods use infection duration data from naive malaria therapy patients with neurosyphilis, drug treatment can potentially violate the assumption that infection durations in surveyed hosts resemble those of these patients. We propose two solutions: (1) exclude treated individuals from the analysis; (2) remove treated individuals’ samples and use a bootstrap imputation approach based on the remaining population. Specifically, we sample from the MOI estimates of untreated microscopy-positive individuals to represent MOI estimates for treated individuals, which corrects for individuals who have used antimalarial drugs and show either no infection (MOI = 0) or infection (MOI > 0). Hence, this solution provides an upper bound for FOI estimates. Numerical simulations show our bootstrap imputation approach is robust even with a significant fraction of treated individuals, as seen in our Ghana surveys.

The final distribution of MOI estimates at the population level includes values for microscopy-positive individuals, imputed values for individuals with missing MOI information or false negatives, imputed values for treated individuals (for the second solution), and true zeros for uninfected individuals. This distribution is used for FOI inference.
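As an illustration of how such a population-level distribution might be assembled, the following Python sketch resamples, with replacement, from the MOI estimates of untreated microscopy-positive individuals to fill in missing values and then appends true zeros. The function name, the argument names, and the toy counts are hypothetical; the number of individuals to impute is taken as given here, since it depends on the detection and treatment corrections described above.

```python
import numpy as np

def impute_population_moi(observed_moi, n_to_impute, n_true_zero, rng):
    """Assemble a population-level vector of MOI values.

    observed_moi : MOI estimates for untreated microscopy-positive hosts
    n_to_impute  : hosts whose MOI is missing (undetected infections,
                   missing var data, or treated hosts under solution 2)
    n_true_zero  : hosts considered truly uninfected (MOI = 0)
    rng          : numpy random Generator, passed in for reproducibility
    """
    observed = np.asarray(observed_moi)
    # Bootstrap imputation: draw with replacement from the observed values.
    imputed = rng.choice(observed, size=n_to_impute, replace=True)
    zeros = np.zeros(n_true_zero, dtype=observed.dtype)
    return np.concatenate([observed, imputed, zeros])

rng = np.random.default_rng(1)
observed = np.array([1, 1, 2, 3, 5])   # toy MOI estimates, not survey data
population = impute_population_moi(observed, n_to_impute=4, n_true_zero=3, rng=rng)
```

Repeating the draw with different seeds would give a set of imputed populations over which downstream FOI estimates can be compared.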

Inferring FOI from MOI estimates

Malaria transmission in relation to queuing theory

In a cohort of individuals acquiring and clearing infections independently, infections occur as a homogeneous Poisson process with a rate equal to the mean FOI, and each infection has an exponentially distributed duration. At equilibrium, MOI follows a Poisson distribution with a mean equal to the mean FOI divided by the mean clearance rate (Dietz et al., 1974). In practice, individuals are often capacity-limited, such that they can only carry up to a certain number of concurrent infections due to within-host competition or immune regulation. New infections that arrive when a host is already at capacity can be simply blocked rather than queued or delayed, resulting in an MOI that follows a Poisson distribution truncated at the carrying capacity and normalized. However, infection arrivals frequently deviate from a homogeneous Poisson process because of factors such as seasonality and heterogeneity in exposure risk, and infection duration may also deviate from an exponential distribution. Consequently, the MOI distribution is often overdispersed. In this case, no simple analytical relationship exists between the MOI distribution and the mean FOI or clearance rate. We formally test for deviations from Poisson homogeneity against a negative binomial alternative in both simulated and empirical MOI distributions, as detailed in Appendix 1—Test of deviation from Poisson homogeneity in MOI estimates and Supplementary file 2—deviationFromPoissonTest.xlsx. Notably, infection durations observed in malaria-naive patients from historical neurosyphilis treatment studies, where patients were intentionally infected with malaria, also exhibit a standard deviation substantially different from their mean (Collins and Jeffery, 1999; Maire et al., 2006) (see details below).
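The idealized capacity-limited case described above (Poisson arrivals, exponential durations, blocked arrivals at capacity) can be written down directly. The following Python sketch computes the truncated and renormalized Poisson distribution of MOI; the function name and the numerical values in the example are hypothetical and serve only to illustrate the formula.

```python
import math

def truncated_poisson_moi(foi, clearance_rate, capacity):
    """MOI distribution for the idealized capacity-limited case.

    The untruncated mean is foi / clearance_rate; the distribution is cut
    off at `capacity` and renormalized. Returns a list p with p[n] = P(MOI = n).
    """
    mean = foi / clearance_rate
    weights = [mean ** n / math.factorial(n) for n in range(capacity + 1)]
    total = sum(weights)
    return [w / total for w in weights]

# Illustrative values only: 5 infections per year, clearance rate 2.5 per year.
pmf = truncated_poisson_moi(foi=5.0, clearance_rate=2.5, capacity=30)
mean_moi = sum(n * p for n, p in enumerate(pmf))
```

With a capacity far above the mean, as in this example, the truncation has almost no effect and the mean MOI is close to the ratio of FOI to clearance rate.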

Despite these complexities, the qualitative relationship between MOI and FOI is intuitive when information on infection duration is known. Higher FOI values should correspond to higher MOI values. Less variable FOI values should result in narrower or more concentrated MOI distributions, whereas more variable FOI values should lead to more spread-out MOI distributions.

The process of acquiring infectious bites is structurally analogous to the systems studied in stochastic queuing theory, which relates queue length to the intensity of arrivals, priority schedules, and service and waiting times. Queuing systems, typically modeled with differential equations (the Kolmogorov equations), comprise three main components: queue length, the intensity of arrivals, and service times. Knowing any two allows for the inference of the third.

In malaria transmission, hosts resemble service facilities composed of a collection of servers, with each infection akin to a customer. Just as service facilities have a carrying capacity for the number of customers they can serve simultaneously, hosts have a carrying capacity for blood-stage infections, which limits the maximum number of infections they can harbor (Appendix 1—figure 4). Empirical MOI estimates provide information on queue length. Hypothetically, knowing the service times, that is, infection durations, can help infer the intensity of arrivals, that is, the rate at which hosts acquire infections or FOI.

However, determining infection durations is challenging in endemic areas where multi-genomic infections are common. Popular polymorphic markers often fail to distinguish between co-infecting strains (Argyropoulos et al., 2023), complicating the tracking of the emergence and clearance of individual strains from the peripheral blood. Complex within-host dynamics further complicate tracking unless daily sampling is conducted, which is impractical in real settings. Frequent ectopic recombination of var genes complicates assigning genes to specific chromosomal locations. This difficulty in phasing compromises the integrity of individual strains, making it hard to isolate them and track their first appearance and subsequent clearance in blood over time. Additionally, infection duration varies widely across age groups, geographical locations, and sampling times (Childs and Buckee, 2015; Chang et al., 2016; Ashley and White, 2014; Bretscher et al., 2011).

We therefore propose focusing on FOI inference in subpopulations with naive or near-naive immune profiles. Their infection duration can be approximated by that of naive hosts, as seen in a historical medical study of neurosyphilis patients intentionally infected with malaria as a treatment. Between 1940 and 1963, 318 syphilis patients were infected with a single strain of P. falciparum (Collins and Jeffery, 1999; Maire et al., 2006), and data on fever and parasite counts in the blood were recorded. Since these patients had no prior P. falciparum infections, the documented infection duration reflects that of naive infections.

In the Ghana surveys, we focus on children aged 1–5 years, who have accumulated far fewer infections and less immune memory compared to older individuals, an aspect we discuss in the Discussion section. We treat these children as nearly naive and approximate their infection duration with that of naive hosts. Using their MOI estimates, we infer FOI for these children using the following two methods from queuing theory.

A two-moment approximation for a queue of finite capacity

Analysis of multi-server models is challenging, with exact results available only for specific cases, such as the previously mentioned $M / M / c / k$ models. In these models, $M$ represents exponential inter-arrival and service times, $c$ is the number of servers, and $k$ is the maximum queue capacity, including both customers being served and those waiting. Additional models like $M / G / c / c$ and $G I / M / c / c + r$ queues also require exponential distributions for either inter-arrival or service times, where $G$ and $G I$ represent generic random variables and independent and identically distributed (i.i.d.) generic random variables, respectively. These models can often be too restrictive for real-world scenarios as well.

We examine a two-moment approximation method introduced by Kim and Cha (Choi et al., 2005). This method considers the $G I / G / c / c + r$ queue, where inter-arrival times ( $G I$ ) and service times ( $G$ ) of customers are independent sequences of i.i.d. general random variables $A$ and $S$ , respectively. There are $c$ (≥1) identical servers in parallel and $r$ (≥0) waiting places. In this framework, overdispersion in MOI arises from temporally structured (non-Poisson) acquisition processes that generate bursts of infection, compounded by non-exponential infection durations, among otherwise homogeneous hosts.

Let $N$ denote the number of customers in the system at an arbitrary time, and let $N^{A}$ ( $N^{D}$ ) denote the number of customers that an arriving customer finds (that a departing customer leaves behind) in steady state. Customers who arrive to find $c + r$ customers in the system depart immediately, leaving those $c + r$ customers behind. Let $P_{n}$ , $P_{n}^{A}$ , and $P_{n}^{D}$ denote the probabilities that $N = n$ , $N^{A} = n$ , and $N^{D} = n$ , respectively, for $0 \leq n \leq c + r$ . These probabilities are expressed in terms of the following quantities:

a = E (A) = \frac{1}{λ}

b = E (S)

a_{n}^{D} = E [A_{n}^{D}], 0 \leq n \leq c + r

b_{n}^{A} = E [S_{n}^{A}], 0 \leq n \leq c + r

b_{n}^{D} = E [S_{n}^{D}], 0 \leq n \leq c + r

where $A_{n}^{D}$ , $0 \leq n \leq c + r$ , is the residual inter-arrival time at the departure instant of a customer who leaves behind $n$ customers in the system, and $S_{n}^{A} (S_{n}^{D})$ , $1 \leq n \leq c + r$ , is the residual service time of a randomly chosen busy server at the arrival instant (the departure instant) of a customer who finds (leaves behind) $n$ customers in the system. From these definitions, we have $a_{c + r}^{D} = a$ and $b_{c + r}^{D} = b_{c + r}^{A}$ . We set $b_{0}^{A} = b_{0}^{D} = 0$ . We assume that all the above quantities are well defined and finite.

Using Theorems 4.3.19 and 4.3.43 from Heyman, 1985, the steady-state queue-length distribution can be derived:

P_{n}^{A} = P_{n}^{D} = P_{0}^{A} Π_{i = 0}^{n - 1} \frac{λ_{i}}{μ_{i + 1}}, 1 \leq n \leq c + r

P_{n} = P_{n}^{A} γ_{n}, 0 \leq n \leq c + r

where

\frac{1}{μ_{i}} = \begin{cases} b - i (a - a_{i - 1}^{D}) + (i - 1) (b_{i - 1}^{A} - b_{i - 1}^{D}), & 1 \leq i \leq c \\ - c (a - a_{i - 1}^{D}) + b_{i - 1}^{A} + (c - 1) (b_{i - 1}^{A} - b_{i - 1}^{D}), & c + 1 \leq i \leq c + r \end{cases}

\frac{1}{λ_{i}} = \begin{cases} (i + 1) (a_{i + 1}^{D} + b_{i + 1}^{A} - b_{i + 1}^{D}), & 0 \leq i \leq c - 2 \\ c a_{i + 1}^{D} + b_{i + 1}^{A} - b + (c - 1) (b_{i + 1}^{A} - b_{i + 1}^{D}), & c - 1 \leq i \leq c + r - 2 \\ c a, & i = c + r - 1 \end{cases}

γ_{i} = \begin{cases} λ a_{0}^{D}, & i = 0 \\ λ [μ_{i} \frac{(a - a_{i - 1}^{D})}{λ_{i - 1}} + a_{i}^{D}], & 1 \leq i \leq c + r \end{cases}

And, by normalization, $\sum_{n = 0}^{c + r} P_{n}^{A} = 1$ :

P_{0}^{A} = (1 + \sum_{n = 1}^{c + r} Π_{i = 0}^{n - 1} \frac{λ_{i}}{μ_{i + 1}})^{- 1}

Since quantities $a_{n}^{D}$ , $b_{n}^{A}$ , and $b_{n}^{D}$ are difficult to compute in general, Kim and Cha propose an approximation to the exact expression which replaces these unknown arrival- and departure-average quantities by their corresponding (well-known) time-average counterparts, which are exact for exponential inter-arrival and service times. That is:

a_{n}^{D} \approx a_{R} = \frac{E [A^{2}]}{2 E [A]} = \frac{(1 + c_{A}^{2}) a}{2}, 0 \leq n \leq c + r - 1

b_{n}^{A} (b_{n}^{D}) \approx b_{R} = \frac{E [S^{2}]}{2 E [S]} = \frac{(1 + c_{S}^{2}) b}{2}, 1 \leq n \leq c + r - 1

where $c_{X}^{2} = \frac{\mathrm{Var} [X]}{(E [X])^{2}}$ is the squared coefficient of variation of a random variable $X$ with distribution function $F$ .

Therefore, a two-moment approximation for the steady-state queue-length distribution is:

{\tilde{P}}_{n}^{A} = {\tilde{P}}_{n}^{D} = {\tilde{P}}_{0}^{A} Π_{i = 0}^{n - 1} \frac{{\tilde{λ}}_{i}}{{\tilde{μ}}_{i + 1}}, 1 \leq n \leq c + r

{\tilde{P}}_{n} = {\tilde{P}}_{n}^{A} {\tilde{γ}}_{n}, 0 \leq n \leq c + r

where

\frac{1}{{\tilde{μ}}_{i}} = \begin{cases} b - i (a - a_{R}), & 1 \leq i \leq c \\ - c (a - a_{R}) + b_{R}, & c + 1 \leq i \leq c + r \end{cases}

\frac{1}{{\tilde{λ}}_{i}} = \begin{cases} (i + 1) a_{R}, & 0 \leq i \leq c - 2 \\ c a_{R} + b_{R} - b, & c - 1 \leq i \leq c + r - 2 \\ c a, & i = c + r - 1 \end{cases}

{\tilde{γ}}_{i} = \begin{cases} λ a_{R}, & i = 0 \\ λ [{\tilde{μ}}_{i} \frac{(a - a_{R})}{{\tilde{λ}}_{i - 1}} + a_{R}], & 1 \leq i \leq c + r - 1 \\ λ [{\tilde{μ}}_{i} \frac{(a - a_{R})}{{\tilde{λ}}_{i - 1}} + a], & i = c + r \end{cases}

And, by normalization, $\sum_{n = 0}^{c + r} {\tilde{P}}_{n}^{A} = 1$ :

{\tilde{P}}_{0}^{A} = (1 + \sum_{n = 1}^{c + r} Π_{i = 0}^{n - 1} \frac{{\tilde{λ}}_{i}}{{\tilde{μ}}_{i + 1}})^{- 1}
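The approximate formulas above translate fairly directly into code. The following Python sketch is one possible transcription, under the assumption that the inputs are the mean and squared coefficient of variation of inter-arrival and service times; the function and argument names are hypothetical, and the positivity check is an addition of this sketch rather than part of the method as described.

```python
import numpy as np

def two_moment_queue_length(a, ca2, b, cs2, c, r=0):
    """Approximate steady-state distribution of N for a GI/G/c/c+r queue.

    a, ca2 : mean and squared coefficient of variation of inter-arrival times
    b, cs2 : mean and squared coefficient of variation of service times
    c, r   : number of servers and of waiting places
    Returns an array p with p[n] approximating P(N = n), n = 0..c+r.
    """
    lam = 1.0 / a
    a_r = (1.0 + ca2) * a / 2.0   # time-average residual inter-arrival time
    b_r = (1.0 + cs2) * b / 2.0   # time-average residual service time
    k = c + r

    inv_mu = np.zeros(k + 1)      # inv_mu[i] = 1 / mu_i for i = 1..k
    for i in range(1, k + 1):
        inv_mu[i] = b - i * (a - a_r) if i <= c else -c * (a - a_r) + b_r

    inv_lam = np.zeros(k)         # inv_lam[i] = 1 / lambda_i for i = 0..k-1
    for i in range(k):
        if i == k - 1:
            inv_lam[i] = c * a
        elif i <= c - 2:
            inv_lam[i] = (i + 1) * a_r
        else:
            inv_lam[i] = c * a_r + b_r - b

    # Guard added in this sketch: the approximated rates must be positive.
    if np.any(inv_mu[1:] <= 0) or np.any(inv_lam <= 0):
        raise ValueError("approximate rates are not positive for these inputs")

    # Arrival-average probabilities: product of lambda_i / mu_{i+1}, normalized.
    ratios = inv_mu[1:] / inv_lam
    p_arrival = np.concatenate([[1.0], np.cumprod(ratios)])
    p_arrival /= p_arrival.sum()

    # Time-average probabilities: P_n = P^A_n * gamma_n.
    gamma = np.empty(k + 1)
    gamma[0] = lam * a_r
    for i in range(1, k + 1):
        last_term = a if i == k else a_r
        gamma[i] = lam * ((a - a_r) * inv_lam[i - 1] / inv_mu[i] + last_term)
    return p_arrival * gamma

# Illustrative inputs only (time in years): exponential and non-exponential cases.
p_exponential = two_moment_queue_length(a=0.2, ca2=1.0, b=0.4, cs2=1.0, c=30)
p_general = two_moment_queue_length(a=0.2, ca2=2.0, b=0.4, cs2=0.5, c=30)
```

When both squared coefficients of variation equal 1, the residual times reduce to the means and the output coincides with a Poisson distribution truncated at $c + r$ , which offers a simple sanity check on any implementation.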

Likelihood formulation and parameter estimation

We vary the mean and variance parameters for inter-arrival times across wide ranges (Supplementary file 4—meanAndVarianceParams.xlsx). For each mean and variance combination, we calculate the steady-state queue length distribution, that is, the probability distribution of MOI, using the two-moment approximation method. The goal is to identify the parameter combination that minimizes the negative log-likelihood (or maximizes the likelihood) of observed MOI distributions from simulated outputs or Ghana surveys:

\begin{matrix} \underset{μ, σ}{argmin} - \ln L (m; μ, σ) \end{matrix}

with the likelihood defined as follows:

L (m; μ, σ) = Π_{i = 1}^{N} P (m_{i}; μ, σ)

where $N$ is the number of individual hosts, $m_{i}$ is the MOI estimate for individual $i$ , $m$ is the vector of MOI estimates for all hosts, $μ$ and $σ$ represent the mean and variance parameters, respectively, and $P$ is the steady-state queue length distribution from the two-moment approximation method with specific mean and variance parameter values, as defined in the previous section.

The shape of the negative log likelihood for both simulated outputs and Ghana surveys is concave upwards around the trough, signifying a clear minimum point (Appendix 1—figure 5). We tested the impact of different grid value choices on the FOI inference results by refining the grid to include more points, ensuring the FOI inference results are consistent. Specifically, we reduce the grid width for the mean parameter to half and a quarter of the original width, and for the variance parameter to half, a quarter, an eighth, and a sixteenth of the original width. The FOI inference results remain either unchanged or within a 1% deviation from those based on the original grid width (Appendix 1—figure 6).
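The grid search over mean and variance parameters can be sketched generically in Python. In the sketch below, the queue-length model is passed in as a function; to keep the example short and self-contained, the model used in the demonstration is a hypothetical stand-in (a truncated Poisson that ignores the variance argument), not the two-moment approximation itself. All names, grid values, and the toy MOI counts are assumptions made for illustration.

```python
import math
import numpy as np

def stand_in_moi_pmf(mean_interarrival, var_interarrival,
                     mean_duration=0.5, capacity=30):
    """Hypothetical placeholder model: truncated Poisson MOI distribution.

    var_interarrival is accepted but ignored; a real analysis would call
    the two-moment approximation here instead.
    """
    m = float(mean_duration) / float(mean_interarrival)
    w = np.array([m ** n / math.factorial(n) for n in range(capacity + 1)])
    return w / w.sum()

def grid_search_mle(moi, pmf_fn, means, variances):
    """Return (negative log-likelihood, mean, variance) at the grid minimum."""
    moi = np.asarray(moi, dtype=int)
    best = (math.inf, None, None)
    for mu in means:
        for var in variances:
            pmf = pmf_fn(mu, var)
            nll = -np.sum(np.log(pmf[moi]))   # -ln L = -sum_i ln P(m_i)
            if nll < best[0]:
                best = (nll, float(mu), float(var))
    return best

# Toy MOI values for 100 hosts (illustrative, not survey data).
toy_moi = [0] * 14 + [1] * 27 + [2] * 27 + [3] * 18 + [4] * 9 + [5] * 4 + [6] * 1
means = np.linspace(0.1, 0.5, 41)      # mean inter-arrival time, in years
variances = [0.01, 0.05, 0.1]          # unused by the stand-in model
best_nll, best_mean, best_var = grid_search_mle(
    toy_moi, stand_in_moi_pmf, means, variances)
foi_estimate = 1.0 / best_mean         # arrivals per host per year
```

Because the stand-in ignores the variance, the likelihood surface in this demonstration is flat along that axis; with a model that uses both parameters, the same loop would locate a minimum in both dimensions, and refining the grid around it would test the stability of the estimate.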

Details for deriving confidence intervals for the estimated parameters are provided in Appendix 1—Confidence intervals for FOI inference.

Choice of $c$ and $r$ for the $G I / G / c / c + r$ queue in the two-moment approximation method

When applying the two-moment approximation method, values for the number of parallel servers ( $c$ ) and waiting places ( $r$ ) need to be specified. Since MOI is defined exclusively for blood-stage infections, $r$ is set to 0 by default. The parameter $c$ corresponds to the carrying capacity of blood-stage infections. The maximum MOI observed in empirical data from Bongo, based on the varcoding method, is 20. Certain factors which reduce the number of var genes identified in an individual, and thus affect MOI estimation, are not explicitly accounted for in the current MOI estimation (see Discussion), so the actual carrying capacity could be higher. For simplicity, we assume the value of $c$ to be 30 in the simulation. Provided that $c$ is kept consistent across simulations and the two-moment approximation method, this choice should not affect FOI inference. For empirical surveys from Bongo, we set $c$ to 25, 30, 40, and 60 to systematically investigate its impact on FOI inference results. The FOI inference results are similar across these values (Figure 3, Figure 3—figure supplements 1–3).

In general, the choice of $c$ depends on the maximum MOI observed in a given empirical dataset under high transmission. To account for factors that may lead to underestimation of MOI, $c$ should be set higher than the observed maximum MOI. Since Bongo District of northern Ghana is a high-transmission endemic region, we expect the range of its $c$ to be applicable to other empirical datasets.

The mean arrival rate of infection from Little’s Law

The second method uses Little’s Law (Little and Graves, 2008), which describes a relationship between the three main components of queuing systems. This law states that the average number of items in a queuing system, $L$ , equals the average arrival rate, $λ$ , multiplied by the average waiting time of an item in the system, $W$ . Reformulating Little’s Law for malaria transmission, the average arrival rate of infection $λ$ equals the average number of blood-stage infections present in an individual $L$ divided by the average duration of infection $W$ :

λ = \frac{L}{W}

The relationship is simple and general, holding true regardless of the number of servers (carrying capacity of blood-stage infections in hosts), the service time distribution (infection duration distribution), the distribution of inter-arrival times, the order of service, or the queue structure.
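As a worked illustration of this relationship, the following Python sketch computes an annual FOI from a vector of MOI values and a mean infection duration. The function name, the toy MOI values, and the duration of 146 days are hypothetical, chosen only to make the arithmetic easy to follow; they are not estimates from the malaria therapy data.

```python
import numpy as np

def foi_from_littles_law(moi, mean_duration_days, days_per_year=365.0):
    """Annual FOI from Little's Law: lambda = L / W.

    moi                : MOI values across sampled hosts, zeros included
    mean_duration_days : mean infection duration W, in days
    """
    mean_moi = float(np.mean(moi))               # L, average queue length
    return mean_moi / mean_duration_days * days_per_year

# Toy example: mean MOI of 2 and a mean duration of 146 days.
annual_foi = foi_from_littles_law([0, 1, 2, 3, 4], mean_duration_days=146.0)
```

Here a mean MOI of 2 divided by a duration of 146 days and scaled to a year gives about 5 infections per host per year.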

Population-level MOI distribution substitutes for time-series or repeated observations

Both the two-moment approximation and Little’s Law rely on quantities defined under the stationary distribution of MOI—either the full distribution or its moments. In practice, this requires that the observed data provide a reasonable approximation to the stationary MOI distribution. Under sparse sampling schemes, each host contributes at most two observations, corresponding to two cross-sectional surveys conducted at the end of the wet/high-transmission and dry/low-transmission seasons; individuals may or may not be observed in both surveys. As a result, time averaging is not feasible. This limitation is common in empirical data, although it can be overcome in numerical simulations that generate complete temporal trajectories.

Nonetheless, a population-level queue length distribution can be obtained from both simulation outputs and empirical data by aggregating MOI estimates across sampled individuals. We use this distribution as a proxy for the steady-state queue length distribution of MOI across individuals. In the ABM, individual-level heterogeneity in transmission may be incorporated into the data-generating process depending on the scenario. The impact of such heterogeneity on FOI inference is assessed using these simulation outputs (Appendix 1—figure 2C). The performance of our methods across various simulated scenarios is reported and discussed in the Results and Discussion sections.

Conversion between FOI and the EIR

As an indirect proxy for transmission intensity, malariologists typically measure EIR by counting the number of infectious bites a human host receives within a fixed time interval (Shaukat et al., 2010). EIR is considered a standard metric of malaria transmission. Although both FOI and EIR reflect transmission intensity, FOI directly concerns detectable blood-stage infections, while EIR pertains to human-infectious vector contact rates. FOI is defined as the rate at which a host acquires infections, with the focus specifically on blood-stage strains for three reasons. First, only blood-stage infections are detectable in all direct measures of FOI. Second, quantities used in indirect model-fitting approaches for estimating FOI are also based on or reflect these blood-stage strains/infections. Third, only these blood-stage strains/infections are transmissible to other individuals and therefore affect disease dynamics.

Studies comparing annual P. falciparum EIR and FOI estimates from age-stratified prevalence in cross-sectional parasitological studies have found significantly different magnitudes for these two quantities (Smith et al., 2010). The number of blood-stage infections per infectious bite (FOI/EIR) is referred to as transmission efficiency. Multiple studies indicate that malaria transmission is inefficient in high-intensity settings, and the reasons for this have been debated. Potential causes include heterogeneous biting, immunity or within-host dynamics, and measurement bias.

We utilize a functional curve with empirically derived parameters under the assumption of heterogeneous transmission, describing the highly non-linear relationship between reported EIR–FOI pairs (Smith et al., 2010). The functional curve, with FOI $h$ , EIR $E$ , and the corresponding parameters $b = 0.55$ , $α = 4.6$ , and $t = 43$ days, is as follows:

h = \frac{\log (1 + α b E t)}{α t}

The EIR–FOI pairs (Smith et al., 2010) and the functional curve provide a basis for converting between these two quantities. EIR data for a subset of surveys in Bongo District, northern Ghana, were obtained (Tiedje et al., 2022). Combined with FOI estimates from our two proposed methods, we generate an EIR–FOI pair for empirical surveys in Bongo District. This enables us to evaluate whether our EIR–FOI pair aligns with historical data and the functional curve with the best-fit parameter values.
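The conversion in both directions can be sketched as follows. The parameter values are those quoted above; the function names are hypothetical, and the sketch assumes that $E$ and $h$ are both expressed per day so that they are consistent with $t$ in days. The inverse is obtained by solving the functional curve for $E$.

```python
import math

# Parameter values quoted in the text for the functional curve.
B = 0.55
ALPHA = 4.6
T_DAYS = 43.0


def foi_from_eir(eir_per_day, b=B, alpha=ALPHA, t=T_DAYS):
    """FOI h for a given EIR E: h = log(1 + alpha*b*E*t) / (alpha*t)."""
    return math.log1p(alpha * b * eir_per_day * t) / (alpha * t)


def eir_from_foi(foi_per_day, b=B, alpha=ALPHA, t=T_DAYS):
    """Inverse of the curve: E = (exp(alpha*t*h) - 1) / (alpha*b*t)."""
    return math.expm1(alpha * t * foi_per_day) / (alpha * b * t)
```

The logarithm makes the curve saturate: doubling the EIR less than doubles the FOI, which is how the function captures declining transmission efficiency at high intensity.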

Appendix 1

The measurement error

We incorporate a measurement error model (Appendix 1—figure 2D) into the sampling of simulation outputs and MOI estimation for both simulation outputs and empirical data. This model accounts for the under-sampling or imperfect detection of var genes in the field. By subsampling var genes per strain, we reduce the number of distinct var genes available per host. This model is based on the repertoire size distribution derived from molecular sequences of infections expected to be monoclonal (MOI = 1), that is, hosts presumed to be infected by a single P. falciparum strain because they had 45 or fewer non-upsA DBLα types. These molecular sequences were collected during six cross-sectional surveys conducted from 2012 to 2016 in Bongo District, northern Ghana (Appendix 1—figure 2E; Tiedje et al., 2025; Tiedje et al., 2022; Pilosof et al., 2019).
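One way such per-strain subsampling could be implemented is sketched below. It assumes the empirical repertoire size distribution is available as a plain list of observed sizes; the function name, the data layout, and the example sizes are hypothetical.

```python
import random


def subsample_host(strains, empirical_sizes, rng):
    """Return the set of distinct var genes detected in one host.

    strains         -- list of strains, each a list of var gene identifiers
    empirical_sizes -- observed repertoire sizes from putatively monoclonal
                       infections (hypothetical placeholder values here)
    rng             -- random.Random instance, passed in for reproducibility
    """
    detected = set()
    for genes in strains:
        # Draw how many genes of this strain are detected, capped at its size.
        k = min(rng.choice(empirical_sizes), len(genes))
        detected.update(rng.sample(genes, k))
    return detected


# Hypothetical host carrying two strains of 45 genes each with no shared genes.
rng = random.Random(0)
host = [list(range(45)), list(range(45, 90))]
detected_genes = subsample_host(host, [30, 38, 42], rng)
```

Drawing the detected size independently for each strain means that the number of missed genes grows with MOI, which is why the error matters most for heavily co-infected hosts.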

Appendix 1—figure 1

Agent-based model for falciparum malaria transmission.

(A) The stochastic model tracks infection history and specific immune memory of individual hosts to variant surface antigens encoded by *var* genes. At transmission events, a donor and a recipient host are randomly selected. Transmission occurs if the donor host has blood-stage infections, and the recipient host has not reached carrying capacity of infections in its liver. Each parasite genome in the donor host is transmitted to a mosquito with a probability of 1/(number of genomes) multiplied by the transmissibility of the currently expressed gene. Each parasite genome carries 45 *var* genes, with each gene represented by a linear combination of two epitopes (depicted by different shapes), with many possible variants each (alleles, depicted by different colors). (B) During the sexual stage within mosquitoes, different parasite genomes can exchange *var* genes through meiotic recombination, generating novel recombinant repertoires. The recipient host can receive either recombinant genomes or original genomes. (C) When a repertoire is successfully transmitted to a recipient host, it undergoes a 7-day dormant liver stage before entering the blood stage, where *var* genes are sequentially expressed. If the host has no immunity against either epitope of a given *var* gene, its expression lasts 7 days (and either 7.5 or 8 days in additional simulations). Immunity to one of the two epitopes reduces the expression by approximately half, while complete immunity to both epitopes leads to immediate clearance of the gene product. An infection ends either when all *var* genes in the repertoire have been expressed or recognized, or, alternatively, with a certain probability, before the full repertoire is exhausted (Appendix 1—Simulation data, subsection ‘An extended *var* model,’ sub-subsection ‘Within-host dynamics’). 
(D) During the asexual blood stage of infection, *var* genes within the same genome can swap their two epitope alleles through mitotic (ectopic) recombination, generating new epitopes with a certain probability. (E) *Var* genes can also mutate their epitopes to create new genes.

Appendix 1—figure 2

Simulation design, transmission scenarios, under-sampling or imperfect detection of *var* genes, and the empirical survey design from Bongo District, northern Ghana.

(A) Each simulation comprises three stages: a ‘pre-IRS’ period where local transmission reaches a semi-stationary state, followed by a 3-year ‘IRS’ intervention period (transient IRS) which reduces transmission rate, and a ‘post-IRS’ period where transmission rates return to original levels. After transmission initialization, closed systems do not receive migrant genomes from the regional pool. Semi-open systems explicitly model two local populations connected by migration. Regionally open systems continually receive migrant genomes from the regional pool throughout the simulation. This figure was adapted from Zhan et al., 2024 (Figure 1) (CC BY 4.0 license). The copyright holder has granted permission to publish under a CC BY 4.0 license. (B) Transmission intensity or effective contact rate varies seasonally across the pre-, during-, and post-intervention periods. We simulate three levels of perturbation corresponding to approximately 20% (low-coverage IRS), 40–45% (mid-coverage IRS), and 65–75% (high-coverage IRS) reductions in transmission. Under non-seasonal transmission, the transmission intensity remains constant throughout the year, decreases only during IRS, and then returns to its original level once IRS ends. (C) We examine different statistical distributions for times between local transmission events: exponential and Gamma. We consider homogeneous and heterogeneous exposure risks. In the latter, $\frac{2}{3}$ of the population are high-risk, receiving approximately 94% of all bites, while the remaining population receives the rest. (D) The measurement error is depicted as a histogram showing the number of non-upsA (i.e., upsB and upsC) DBLα types per repertoire from putatively ‘monoclonal’ infections, characterized by having 45 or fewer non-upsA DBLα types. These sequences were collected during six cross-sectional surveys conducted from 2012 to 2016 in Bongo District. This measurement error represents under-sampling or imperfect detection of *var* genes.
(E) The study consists of four age-stratified cross-sectional surveys in Bongo District, Ghana, conducted at the end of wet/high-transmission seasons (blue circles) and dry/low-transmission seasons (gold circles). Two phases are covered: (1) Pre-IRS: Survey 1 (S1) in October 2012 and Survey 2 (S2) in May/June 2013; (2) Right post-IRS: Survey 3 (S3) in October 2015 and Survey 4 (S4) in May/June 2016. IRS was implemented with widespread LLIN usage distributed between 2010 and 2012 and again in 2016 (Tiedje et al., 2025; Gogue et al., 2020). This figure was adapted from Tiedje et al., 2022 (Figure 1) (CC BY 4.0 license). The copyright holder has granted permission to publish under a CC BY 4.0 license.

Appendix 1—figure 3

The relationship between the parasitemia level of the individual (measured in parasites per µl) and (A) the number of non-upsA *var* genes per isolate/individual, or (B) MOI estimates from the Bayesian formulation of the *var*coding method.

There is a lack of association between the x- and y-axis variables among both untreated and antimalarial drug-treated individuals. We scale the parasitemia levels and the number of non-upsA *var* genes or MOI estimates before performing the regression.

Appendix 1—figure 4

Schematic illustration of (A) systems in queuing theory and (B) malaria transmission.

Appendix 1—figure 5

The shape of the negative log likelihood for (A) a simulation run (pre-IRS) with Gamma-distributed times between local transmission events in a seasonal, semi-open system with heterogeneous exposure risk, and (B) Ghana pre-IRS surveys (Survey 1 and 2) with $c$ = 30 and mid PCR detectability.

We remove the infinite and extremely large values of the negative log likelihood, and plot the rest to improve visualization.

Appendix 1—figure 6

The impact of grid value choices on the results of FOI inference in either simulated outputs or Ghana data.

By further reducing the grid width to include more combinations of the mean and variance values of inter-arrival times, the FOI inference results remain either unchanged or deviate by no more than 1% from those based on the original grid width.

Simulation data

An extended var model

Overview

To evaluate the performance of the two proposed queuing theory methods for FOI inference across different transmission settings, we use a computational model (He et al., 2018; Zhan et al., 2024), detailed in this section and illustrated in Appendix 1—figure 1. This model is an agent-based (individual-based), discrete-event, and continuous-time stochastic model in which all known possible future events are stored in a single event queue along with their putative times, which may be fixed or drawn from a probability distribution with a certain rate. Event rates are chosen based on malaria epidemiology literature, field studies, and in vitro/in vivo values. When an event occurs, it can cause the addition or removal of future events in the queue or the modification of their rates, resulting in a recalculation of putative times. This approach is implemented using the next-reaction method (Gibson and Bruck, 2000), an optimization of the Gillespie first-reaction method (Gillespie, 1976).
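The idea of a single event queue ordered by putative time can be illustrated with a generic priority-queue loop. This is a simplified sketch and not the next-reaction method itself: it assumes independent recurring event types with exponential waiting times and omits the rate updates and dependency tracking that the full method requires. The event names and rates are hypothetical.

```python
import heapq
import random


def run_events(rates, t_end, seed=0):
    """Simulate recurring events and return the time-ordered (time, name) log.

    rates -- dict mapping an event name to its rate (events per unit time)
    t_end -- simulation horizon
    """
    rng = random.Random(seed)
    # One putative time per event type, kept in a min-heap keyed by time.
    queue = [(rng.expovariate(rate), name) for name, rate in rates.items()]
    heapq.heapify(queue)
    log = []
    while queue:
        time, name = heapq.heappop(queue)  # earliest putative event
        if time > t_end:
            break  # every remaining event is later still
        log.append((time, name))
        # Reschedule this event type with a fresh exponential waiting time.
        heapq.heappush(queue, (time + rng.expovariate(rates[name]), name))
    return log


# Hypothetical rates per day for two event types.
event_log = run_events({"transmission": 2.0, "host_death": 0.1}, t_end=10.0)
```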

Individual human hosts die and are immediately replaced with newborns who have no immunity. The age structure of the human host population follows a truncated exponential distribution with a mean age of 30 years and a maximum age of 80 years. Individual infections and the immune history of each host are tracked. Evolutionary mechanisms, such as mitotic/ectopic recombination and mutation, are explicitly modeled.

At the beginning of each simulation, a small number of hosts are randomly selected and infected with distinct parasite genomes, assembled from a pool of var genes, to initiate local transmission. Mosquito vectors are not explicitly represented as agents in the model; instead, we consider an effective contact rate (referred to as the transmission rate, which under some assumptions is effectively equivalent to vectorial capacity), which determines the times of local transmission events. At these times, a donor and a recipient host are randomly selected, and successful transmission occurs only if the donor has blood-stage infections and the liver stage of the recipient is below the specified carrying capacity. The times between transmission events can follow various distributions, such as exponential and Gamma.

Var repertoire and gene structure

We assume a parasite repertoire size of 45, based on the median number of non-upsA DBLα sequences identified in our 3D7 laboratory isolate (Ruybal-Pesántez et al., 2022; Tiedje et al., 2022). The classification of var genes into upsA and non-upsA (upsB and upsC) types is based on their semi-conserved upstream promoter sequences (ups) (Gardner et al., 2002; Kraemer et al., 2007; Lavstsen et al., 2003; Rask et al., 2010). Although each parasite carries both types of var genes in a fairly constant proportion (Buckee and Recker, 2012; Rask et al., 2010; Ruybal-Pesántez et al., 2022; Ruybal-Pesántez et al., 2017; Tiedje et al., 2025), we focus on the more diverse and less conserved non-upsA sequences for MOI estimation. Despite functional differences, the groupings do not necessarily correlate with function, as there is within-group functional heterogeneity and cross-group functional similarity (Claessens et al., 2012; Kaestli et al., 2006; Rottmann et al., 2006). The mechanism behind the fairly constant proportion of these groups in empirical samples across different times and locations remains unclear, so we simplify by considering only the non-upsA type.

Each gene is modeled as a linear combination of two epitopes (alleles), based on the empirical description of the two hypervariable subregions in the var tag region amplified from field isolates (Larremore et al., 2013).

Ectopic recombination

We model ectopic recombination among genes within the same genome during the asexual stage inside the human host. This process is a major mechanism of var gene diversification, occurring during both sexual and asexual stages (Claessens et al., 2014). For simplicity, we focus on the asexual stage, where two randomly selected genes from a strain undergo recombination. The breakpoint location is chosen randomly. Under normal recombination, alleles of the two genes are swapped with a probability of creating new alleles. Under conversion, the second gene remains unchanged. In this implementation, we assume all events result in normal recombination rather than gene conversion.

Newly recombined genes have a probability of being functional (i.e., viable), influenced by the similarity of their parental genes and the breakpoint locations (Drummond et al., 2005):

P = ρ^{\frac{x (d - x)}{d - 1}}

where $ρ$ represents recombination tolerance, $d$ is the genetic distance between the two parental genes, and $x$ is the genetic distance between the offspring gene and one of the two parental genes.
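The behaviour of this expression can be checked with a short sketch. The function name and the example values are hypothetical, and raising an error for $d \leq 1$, where the exponent is undefined, is a choice made for this sketch rather than a rule stated in the text.

```python
def functional_probability(rho, d, x):
    """Probability that a recombinant gene is functional: rho ** (x*(d - x)/(d - 1)).

    rho -- recombination tolerance, between 0 and 1
    d   -- genetic distance between the two parental genes
    x   -- genetic distance between the offspring and one parental gene
    """
    if d <= 1:
        raise ValueError("the exponent is undefined for d <= 1")
    if not 0 <= x <= d:
        raise ValueError("x must lie between 0 and d")
    return rho ** (x * (d - x) / (d - 1))


# Hypothetical values: tolerance 0.8, parents 10 units apart.
p_identical_to_parent = functional_probability(0.8, 10, 0)
p_midpoint = functional_probability(0.8, 10, 5)
```

The exponent vanishes when the offspring is identical to either parent ($x = 0$ or $x = d$) and is largest at the midpoint, so recombinants that differ most from both parents are the least likely to be viable.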

Non-functional offspring genes replace their parental ones. Because they are not expressed and are deactivated immediately, they shorten the infection duration of the strain that carries them, thereby reducing its fitness.

Meiotic recombination

Meiotic recombination occurs between strains during sexual replication inside the mosquito vector. While we do not explicitly model mosquitoes, we represent meiotic recombination between genomes at the time of a transmission event.

When multiple strains, denoted by $m$ ( $m > 1$ ), co-infect a donor host, a Bernoulli trial is conducted for each strain to determine if it will be transmitted via the contact event, with the success probability equal to its transmissibility. Each strain’s transmissibility depends on the currently expressed gene’s transmissibility. Co-infection reduces the transmissibility of each strain by a factor of $m$ . Only a subset $n$ of these strains, where $n \leq m$ , is selected. In nature, this subset would co-infect a mosquito vector.

To simulate meiotic recombination, each of the $n$ strains which are to be transmitted from the mosquito vector to a recipient human host is obtained by drawing two parental strains from the pool of $n$ with replacement. If the two parental strains are the same, the original strain is transmitted. If different, they recombine, and the recombinant strain is transmitted. The probabilities of transmitting the original or recombinant strain are $\frac{1}{n}$ and $1 - \frac{1}{n}$ , respectively.

Given that orthologous gene pairs between two parental strains are often unknown, we implement meiotic recombination by randomly selecting genes from the pooled genes of the two strains. This assumption is reasonable as physical locations of var genes can be mobile through ectopic recombination and gene conversions. The resulting offspring strains share some fraction of their var genes with the parental strains.
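The parent-drawing and gene-pooling steps described above can be sketched as follows. The function name and data layout are hypothetical. As a simplification, the sketch samples positions from the concatenated parental gene lists, so a gene shared by both parents could appear twice in an offspring; how such duplicates are handled in the actual model is not specified here.

```python
import random


def transmit_strains(strains, repertoire_size, rng):
    """Return n offspring strains produced from the n strains selected for transmission.

    strains         -- list of n parental strains, each a list of var genes
    repertoire_size -- number of genes per offspring strain
    rng             -- random.Random instance
    """
    n = len(strains)
    offspring = []
    for _ in range(n):
        # Draw two parents with replacement; they coincide with probability 1/n.
        i, j = rng.randrange(n), rng.randrange(n)
        if i == j:
            offspring.append(list(strains[i]))  # original strain is transmitted
        else:
            pooled = list(strains[i]) + list(strains[j])
            offspring.append(rng.sample(pooled, repertoire_size))  # recombinant
    return offspring


# Hypothetical example: three strains of five genes each.
parents = [[f"s{k}g{i}" for i in range(5)] for k in range(3)]
children = transmit_strains(parents, 5, random.Random(42))
```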

Within-host dynamics

Each strain is individually tracked through its entire life cycle, encompassing the liver stage and asexual blood stage in the human host, and the sexual stage in the mosquito. As we do not explicitly model mosquitoes, we delay the expression of each strain in the recipient host by 7 days to account for the sexual stage, for example, the time required for gametocytes to develop into sporozoites within mosquitoes. Additionally, we delay the expression of each strain by another 7 days to account for the liver stage, for example, the time required for parasites to be released as merozoites into the bloodstream to invade red blood cells. Hence, after a total of 14 days, the active asexual blood-stage and the expression of the var repertoire begin.

The var genes within an infection are expressed sequentially (Deitsch and Dzikowski, 2017; Zhang and Deitsch, 2022). During this process, the host is considered infectious with the currently active strain, and only these blood-stage strains are transmissible to another host. The deactivation rate of each gene is governed by the host’s variant-specific immunity. When a gene is actively expressed, the host’s immune system ‘checks’ whether either of its two epitopes has been encountered previously either during past infections or earlier in the current infection through genes that have already been expressed. If both epitopes are novel to the host, it takes approximately seven days for the immune system to mount an effective antibody response, after which the expressed variant rapidly declines and is cleared (Gatton and Cheng, 2004). Prior exposure to one epitope shortens the expression duration by about half, whereas prior exposure to both epitopes results in immediate deactivation. Thus, the active period of a gene is proportional to the number of its epitopes unseen by the host. Once a gene is deactivated, its two epitopes are added to the host’s immune memory, and the next new gene in the repertoire becomes active. The strain is cleared once its entire var gene repertoire has been expressed or recognized. Consequently, the total infection duration of a given repertoire is proportional to the total number of previously unseen epitopes across its var genes. Immunity to a given epitope wanes over time (Collins et al., 1964; Collins et al., 1968), requiring re-exposure for maintenance.
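The proportionality between infection duration and the number of unseen epitopes can be made concrete with a small sketch. It assumes 7 days of expression for a gene with two novel epitopes and ignores the waning of immunity and the liver-stage delay. The function name and the epitope labels are hypothetical.

```python
def infection_duration(repertoire, immune_memory, naive_days=7.0):
    """Return (total blood-stage duration in days, updated immune memory).

    repertoire    -- ordered list of genes, each a pair (epitope_1, epitope_2)
    immune_memory -- set of epitopes the host has already seen
    """
    memory = set(immune_memory)
    total = 0.0
    for epitope_1, epitope_2 in repertoire:
        # Count epitopes of this gene that are new to the host (0, 1, or 2).
        unseen = (epitope_1 not in memory) + (epitope_2 not in memory)
        total += naive_days * unseen / 2  # none: immediate, one: half, two: full
        memory.update((epitope_1, epitope_2))  # epitopes join memory after expression
    return total, memory


# Hypothetical repertoire of 45 genes with no shared epitopes.
repertoire = [(f"a{i}", f"b{i}") for i in range(45)]
naive_duration, _ = infection_duration(repertoire, set())
```

For a fully naive host and 45 genes with distinct epitopes, this rule gives 45 × 7 = 315 days of blood-stage infection, and the duration shortens as immune memory accumulates.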

We also consider a variation on the within-host rules in addition to the baseline assumption that an infection clears only after the parasite has exhausted its entire var gene repertoire. This modification is motivated by biological evidence that clearance can occur earlier for several reasons, including stochastic extinction before full repertoire exhaustion. Even if some var genes remain unexpressed, an infection may terminate once parasite densities fall to very low levels due to demographic stochasticity. Such declines in parasite densities can arise from non-variant-specific immune mechanisms or from cross-immunity among var genes that share sequence similarity or epitopes (Crompton et al., 2014; Holding and Recker, 2015; Langhorne et al., 2008), both of which can substantially reduce parasite numbers. Here, we focus on the latter. To capture the possibility of early termination, we implemented a simple scenario in which there is a small probability of clearing the current infection while any given var gene, whether non-final or final, is being expressed. This probability is determined by the host’s pre-existing immunity to the two epitopes (alleles) of that gene, thereby capturing in a parsimonious manner the effects of cross-immunity among sequence- or allele-sharing var genes in reducing parasitemia. Specifically, it is modeled as a Bernoulli draw whose success probability equals the immunity level against the gene (0 for no immunity to either epitope, 0.5 for immunity to one epitope, and 1 for immunity to both epitopes) multiplied by a constant factor of 0.025. Thus, the probability scales with pre-existing variant-specific immunity to the gene but remains small overall, while introducing additional variance into the emergent distribution of total infection duration across hosts.
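The early-clearance rule amounts to one Bernoulli draw per expressed gene. A minimal sketch, with hypothetical function names, is:

```python
import random

EARLY_CLEARANCE_FACTOR = 0.025  # constant factor stated in the text


def early_clearance_probability(n_epitopes_recognized):
    """Success probability of the draw for a gene with 0, 1, or 2 recognized epitopes."""
    if n_epitopes_recognized not in (0, 1, 2):
        raise ValueError("a gene has exactly two epitopes")
    immunity_level = n_epitopes_recognized / 2  # 0, 0.5, or 1
    return immunity_level * EARLY_CLEARANCE_FACTOR


def clears_early(n_epitopes_recognized, rng):
    """Bernoulli draw: True if the infection terminates while this gene is expressed."""
    return rng.random() < early_clearance_probability(n_epitopes_recognized)
```

The resulting probabilities are 0, 0.0125, and 0.025 per gene, so early termination never occurs while a host is expressing a gene to which it is fully naive.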

We note that in the ABM simulations, we do not use empirical estimates of infection duration from the historical neurosyphilis treatment studies of immunologically naive individuals as direct inputs. Instead, infection duration emerges from the within-host dynamics described in the previous paragraph. In our simulations, we assume deactivation times of 7, 7.5, or 8 days for a gene to which the host is naive. These values are consistent with the duration of each successive parasitemia peak observed in Plasmodium falciparum infections, each peak corresponding primarily to the expression of a single var gene. These mean expression durations, in combination with the within-host rules described previously, produce distributions of infection duration across naive hosts and those aged 1–5 years that span means and variances above and below, but collectively encompassing, values comparable to the historical clinical data from naive neurosyphilis patients treated with P. falciparum malaria. We provide a set of illustrative supplementary figures showing that the simulated distributions of infection duration overlap with and closely resemble the empirical distribution from the historical clinical data (Appendix 1—figures 20–25).

We acknowledge that the ABM cannot capture all mechanisms and complexities of within-host malaria dynamics, many of which remain poorly understood. Nonetheless, the model generates a range of distributions of infection duration spanning distributions that closely match those from the historical clinical data. Because the queueing-theory methods rely on the mean and variance of infection duration to infer FOI, this range of scenarios provides an appropriate basis for evaluating their performance.

We consider carrying capacities for both liver- and blood-stage infection (Tiedje et al., 2025), adopting a value of 30 for both based on reasons detailed in Materials and Methods—Choice of $c$ and $r$ for the $G I / G / c / c + r$ queue in the two-moment approximation method. When the number of liver-stage strains reaches the specified carrying capacity, the host no longer receives additional infections if selected as the recipient host for transmission events. Similarly, when the number of blood-stage strains reaches the carrying capacity, liver-stage strains are not released into the bloodstream and fail to transition to the blood-stage, effectively being lost.

Details of the parameters and their values are summarized in Supplementary file 5.

Experimental designs

Each simulation runs for either 200 years (closed systems) or 150 years (open systems) to reach a semi-stationary state before introducing transmission-reducing interventions. We simulate malaria dynamics with parameters representative of high-transmission endemic regions, considering both constant and seasonal transmission, and across different spatial configurations, including closed, semi-open, and regionally open systems.

We consider indoor residual spraying (IRS), which involves applying insecticide to the internal walls and ceilings of homes (World Health Organization, 2015) in the field. IRS effectively reduces the mosquito population, thereby decreasing the transmission rate. We simulate three temporary IRS interventions with varying coverages: low (reducing transmission by around 20%), mid (reducing transmission by around 40–45%), and high (reducing transmission by around 65–75%). Each IRS intervention lasts for 3 years.

Seasonality is represented as a scaling constant multiplied by a temporal vector of 360 days, which represents the daily number of mosquitoes over a year. This temporal vector (Pilosof et al., 2019) is derived from a deterministic model of mosquito population dynamics (White et al., 2011). The model, originally developed for Anopheles gambiae, includes a set of ordinary differential equations describing the dynamics of four mosquito stages: eggs, larvae, pupae, and adults. Seasonality is implemented via density dependence at the egg and larva stages as a function of rainfall (availability of breeding sites). For our purposes, the values of the effective contact rate are more critical than the absolute number of daily mosquitoes. Essentially, we have a basic vector that represents the number of mosquitoes throughout the year and a scaling constant that encapsulates all other parameters related to vectorial capacity. The product of this temporal vector and the scaling constant results in the effective contact rate.
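The product of the temporal vector and the scaling constant can be written in a few lines. The sinusoidal vector below is a hypothetical stand-in for the 360-day output of the mosquito population model, and the scaling constant is an arbitrary illustrative value.

```python
import math


def effective_contact_rate(daily_mosquitoes, scaling_constant):
    """Return the daily effective contact rate: scaling constant times abundance."""
    return [scaling_constant * m for m in daily_mosquitoes]


# Hypothetical seasonal abundance over a 360-day year (not the ODE model output).
daily_mosquitoes = [
    100.0 * (1.0 + 0.8 * math.sin(2 * math.pi * day / 360)) for day in range(360)
]
rates = effective_contact_rate(daily_mosquitoes, 0.002)
```

Under this representation, an intervention that lowers transmission by a given percentage can be expressed as a further multiplicative factor on the same vector.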

In closed systems, migration is discontinued after the initial seeding of local transmission from a regional pool of var genes. In semi-open systems, two individual populations are explicitly coupled via migration. Regionally open systems involve a local population with migration from a regional pool, which acts as a proxy for regional parasite diversity, that is, diversity from the aggregated individual populations in the region. Because each parasite genome is a repertoire with a given number of var genes, migrant genomes are assembled by randomly sampling var genes from the regional pool.

Transmission can be homogeneous or heterogeneous across individual hosts. For heterogeneity, we consider two groups of human hosts: a high-risk group, which receives approximately 94% of the bites, and a low-risk group, which receives the remaining fraction. The high-risk group is twice as large as the low-risk group (2:1 ratio).
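The per-capita consequence of this split follows from simple arithmetic, sketched below with a hypothetical helper function. The weights are expressed relative to a population average of 1.

```python
def per_capita_bite_weights(high_fraction=2 / 3, high_bite_share=0.94):
    """Return per-capita biting weights (high-risk, low-risk), population mean = 1."""
    w_high = high_bite_share / high_fraction
    w_low = (1 - high_bite_share) / (1 - high_fraction)
    return w_high, w_low


w_high, w_low = per_capita_bite_weights()
relative_risk = w_high / w_low
```

With two-thirds of hosts receiving 94% of bites, a high-risk host is bitten about 1.41 times as often as the population average and a low-risk host about 0.18 times as often, so the per-capita exposure of the high-risk group is roughly 7.8 times that of the low-risk group.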

Test of deviation from Poisson homogeneity in MOI estimates

Many studies have explored whether a count dataset deviates significantly from a homogeneous Poisson distribution. We use the Potthoff–Whittinghill ‘index of dispersion’ test (Potthoff and Whittinghill, 1966; Lloyd-Smith et al., 2005), which is asymptotically locally most powerful against the negative binomial alternative. For a dataset $X$ with $N$ elements, the statistic is $\frac{(N - 1) \, \mathrm{var}(X)}{\mathrm{mean}(X)}$ and its asymptotic distribution is chi-squared with $N - 1$ degrees of freedom. The p-value is the probability mass of the chi-squared ( $N - 1$ ) distribution to the right of the test statistic, that is, the probability that a variance at least as large as the observed one would arise by chance under a Poisson distribution.
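A self-contained sketch of the test is given below. To avoid external dependencies, the chi-squared upper tail is approximated with the Wilson–Hilferty cube-root normal approximation; this is a substitution made for the sketch, and an exact chi-squared survival function from a statistics library would normally be preferred. The function name and example data are hypothetical, and the sample variance (denominator $N - 1$) is assumed.

```python
import math
from statistics import mean, variance


def index_of_dispersion_test(counts):
    """Return (statistic, approximate upper-tail p-value) for a list of counts."""
    data = [float(c) for c in counts]
    n = len(data)
    if n < 2:
        raise ValueError("need at least two observations")
    m = mean(data)
    if m == 0:
        raise ValueError("mean of the counts must be positive")
    statistic = (n - 1) * variance(data) / m  # (N - 1) * var(X) / mean(X)
    df = n - 1
    # Wilson-Hilferty: (chi2/df)^(1/3) is roughly normal with the moments below.
    z = ((statistic / df) ** (1 / 3) - (1 - 2 / (9 * df))) / math.sqrt(2 / (9 * df))
    p_value = 0.5 * math.erfc(z / math.sqrt(2))  # upper tail of standard normal
    return statistic, p_value


# Hypothetical overdispersed MOI-like counts: half zeros, half tens.
stat, p = index_of_dispersion_test([0] * 50 + [10] * 50)
```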

We summarize the p-values of the Potthoff–Whittinghill ‘index of dispersion’ test in Supplementary file 2—deviationFromPoissonTest.xlsx. The p-values are close to 0, indicating a significant deviation from a Poisson distribution for both simulated outputs and empirical surveys in Bongo District. A few exceptions occur in high-coverage IRS scenarios in the simulated outputs, where prevalence is very low and most infections are monoclonal.

Reducing stochastic impact in sampling processes

To mitigate the effects of stochasticity in various sampling processes within the analysis, we perform 200 realizations, including the incorporation of the measurement error model into the simulation output. Each realization results in a slightly different collection of subsampled var genes per host, potentially leading to minor variations in individual MOI estimates. To impute MOI estimates for treated individuals or those with missing data, we conduct 200 samplings from available non-treated individuals. We then calculate a final population-level MOI distribution, weighted across these 200 realizations or samplings. This approach reduces the impact of extreme single-sampling variations on MOI estimates and FOI inference.

Confidence intervals for FOI inference

Non-parametric bootstrap

Bootstrap datasets are generated by resampling with replacement from the original population-level MOI distribution, that is, the collection of individual MOI estimates, using individual-level MOI as the unit of the bootstrap sampling. We run 200 bootstrap replicates, as this number has been tested and shown to achieve a coefficient of variation comparable to that obtained with a higher number of replicates (Efron and Tibshirani, 1994). For each bootstrap dataset, we determine the maximum likelihood estimates for FOI. The 200 replicates thus produce a bootstrap sampling distribution for FOI estimates. We calculate the skewness of these distributions. The majority fall within the range of –0.5 to 0.5, with a few exceptions falling within the range of 0.5 to 0.75. Therefore, we consider them fairly symmetric and do not apply a skewness adjustment to ensure good coverage. Detailed skewness values can be found in Supplementary file 6—FOIBootstrapSkewness.xlsx.
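The resampling scheme can be sketched as follows. For brevity, the sketch uses the Little’s Law estimator in place of the maximum likelihood step and reports a simple percentile interval without skewness adjustment; both are assumptions of this illustration, as are the function name and the example data.

```python
import random
from statistics import mean


def bootstrap_foi_interval(moi, mean_duration_days, n_boot=200, seed=1):
    """Return a 95% percentile interval for annual FOI from individual MOI estimates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Resample individuals with replacement, keeping the sample size fixed.
        resample = rng.choices(moi, k=len(moi))
        # Stand-in estimator: Little's Law, rescaled from per day to per year.
        estimates.append(365.0 * mean(resample) / mean_duration_days)
    estimates.sort()
    lower = estimates[int(0.025 * (n_boot - 1))]
    upper = estimates[int(0.975 * (n_boot - 1))]
    return lower, upper


# Hypothetical MOI estimates for ten individuals and a 200-day mean duration.
example_moi = [1, 2, 2, 3, 4, 5, 1, 6, 3, 2]
ci_lower, ci_upper = bootstrap_foi_interval(example_moi, 200.0)
```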

A comparison of the performance of the Bayesian formulation and the original varcoding method based on the simulation output

We compare the performance of the original varcoding method and its Bayesian formulation against true MOI values using the Cramer–von Mises and Anderson–Darling tests.

MOI estimates from the varcoding method often differ significantly from true MOI values (p-value <0.05). Similarly, there is frequently a statistically significant difference between MOI estimates from the varcoding method and those from the Bayesian formulation. However, differences between MOI estimates from the Bayesian formulation and true MOI values are generally not statistically significant.

Therefore, the Bayesian formulation improves upon the original varcoding method, which assumes a constant repertoire size to estimate MOI. This improvement is expected, as the Bayesian formulation accounts for variation in repertoire size due to var gene under-sampling.

Both methods perform well in low-transmission settings, where true MOI values are low. In moderate and high-transmission scenarios, the varcoding method remains reasonably effective, but the Bayesian formulation demonstrates a notable improvement in capturing higher MOI values, where measurement error is more pronounced. This improvement is significant in the high-transmission endemic Bongo District of Ghana, where our empirical MOI estimates were derived.

The documented test results are included in Supplementary file 7—BayesianImprovement.xlsx.

Overall, both methods tend to underestimate MOI. While the Bayesian formulation provides a more accurate and robust estimation by addressing the imperfect detection of var genes, neither method accounts for other factors that may reduce the number of var genes detected per individual (see Discussion).

Appendix 1—figure 7

True and estimated FOI by the two-moment and Little’s Law methods for additional simulated scenarios of homogeneous exposure risk.

The times between local transmission events are Gamma-distributed, with non-seasonal transmission in a closed system. The true mean FOI per host per year is calculated by dividing the total number of infections acquired by the population by the total number of hosts in the population. Confidence intervals are estimated from 200 bootstrap replicates using non-parametric bootstrap analysis. Each boxplot shows minimum, 5% quantile, median, 95% quantile, and maximum values.

Appendix 1—figure 8

True and estimated FOI by the two-moment and Little’s Law methods for additional simulated scenarios of heterogeneous exposure risk.

The times between local transmission events are Gamma-distributed, with non-seasonal transmission in a semi-open system. The true mean FOI per host per year is calculated by dividing the total number of infections acquired by the population by the total number of hosts in the population. Confidence intervals are estimated from 200 bootstrap replicates using non-parametric bootstrap analysis. Each boxplot shows minimum, 5% quantile, median, 95% quantile, and maximum values.

Appendix 1—figure 9

Appendix 1—figure 10

True and estimated FOI by the two-moment and Little’s Law methods for additional simulated scenarios of homogeneous exposure risk.

The times between local transmission events are Gamma-distributed, with non-seasonal transmission in a regionally open system. The true mean FOI per host per year is calculated by dividing the total number of infections acquired by the population by the total number of hosts in the population. Confidence intervals are estimated from 200 bootstrap replicates using non-parametric bootstrap analysis. Each boxplot shows minimum, 5% quantile, median, 95% quantile, and maximum values.

Appendix 1—figure 11

Appendix 1—figure 12

As in Figure 1, we present confidence intervals for the estimated mean FOI values; all aspects of the simulation setup are identical except that infections are allowed to clear stochastically before full repertoire exhaustion.

Specifically, while any *var* gene, whether non-final or final, is being expressed, there is a small probability of infection clearance that depends on the host’s pre-existing immunity to that gene’s epitopes (Appendix 1—Simulation data, subsection ‘An extended *var* model,’ sub-subsection ‘Within-host dynamics’).

Appendix 1—figure 13

As in Figure 2, we present confidence intervals for the estimated mean FOI values; all aspects of the simulation setup are identical except that infections are allowed to clear stochastically before full repertoire exhaustion.

Specifically, while any *var* gene, whether non-final or final, is being expressed, there is a small probability of infection clearance that depends on the host’s pre-existing immunity to that gene’s epitopes (Appendix 1—Simulation data, subsection ‘An extended *var* model,’ sub-subsection ‘Within-host dynamics’).

Appendix 1—figure 14

As in Appendix 1—figure 7, we present confidence intervals for the estimated mean FOI values; all aspects of the simulation setup are identical except that infections are allowed to clear stochastically before full repertoire exhaustion.

Specifically, while any *var* gene, whether non-final or final, is being expressed, there is a small probability of infection clearance that depends on the host’s pre-existing immunity to that gene’s epitopes (Appendix 1—Simulation data, subsection ‘An extended *var* model,’ sub-subsection ‘Within-host dynamics’).

Appendix 1—figure 15

As in Appendix 1—figure 8, we present confidence intervals for the estimated mean FOI values; all aspects of the simulation setup are identical except that infections are allowed to clear stochastically before full repertoire exhaustion.

Specifically, while any *var* gene, whether non-final or final, is being expressed, there is a small probability of infection clearance that depends on the host’s pre-existing immunity to that gene’s epitopes (Appendix 1—Simulation data, subsection ‘An extended *var* model,’ sub-subsection ‘Within-host dynamics’).

Appendix 1—figure 16

As in Appendix 1—figure 9, we present confidence intervals for the estimated mean FOI values; all aspects of the simulation setup are identical except that infections are allowed to clear stochastically before full repertoire exhaustion.

Specifically, while any *var* gene, whether non-final or final, is being expressed, there is a small probability of infection clearance that depends on the host’s pre-existing immunity to that gene’s epitopes (Appendix 1—Simulation data, subsection ‘An extended *var* model,’ sub-subsection ‘Within-host dynamics’).

Appendix 1—figure 17

As in Appendix 1—figure 10, we present confidence intervals for the estimated mean FOI values; all aspects of the simulation setup are identical except that infections are allowed to clear stochastically before full repertoire exhaustion.

Specifically, while any *var* gene, whether non-final or final, is being expressed, there is a small probability of infection clearance that depends on the host’s pre-existing immunity to that gene’s epitopes (Appendix 1—Simulation data, subsection ‘An extended *var* model,’ sub-subsection ‘Within-host dynamics’).

Appendix 1—figure 18

As in Appendix 1—figure 11, we present confidence intervals for the estimated mean FOI values; all aspects of the simulation setup are identical except that infections are allowed to clear stochastically before full repertoire exhaustion.

Specifically, while any *var* gene, whether non-final or final, is being expressed, there is a small probability of infection clearance that depends on the host’s pre-existing immunity to that gene’s epitopes (Appendix 1—Simulation data, subsection ‘An extended *var* model,’ sub-subsection ‘Within-host dynamics’).

Appendix 1—figure 19

Estimated standard deviation of the inter-arrival times using the two-moment approximation method across different simulation scenarios and field data from Bongo District, Ghana.

Appendix 1—figure 20

Comparison of the distribution of infection durations among naive hosts during the pre-IRS phase in simulated seasonal, semi-open systems where times between local transmission events follow a Gamma distribution, versus historical clinical data from neurosyphilis patients treated with *Plasmodium falciparum*.

In the simulations, each infection can clear before all of its *var* genes have been expressed and recognized. Specifically, during the expression of any gene, whether non-final or final, there is a small probability of infection clearance that depends on the host’s pre-existing immunity to that gene’s epitopes (Appendix 1—Simulation data, subsection ‘An extended *var* model,’ sub-subsection ‘Within-host dynamics’).

Appendix 1—figure 21

As in Appendix 1—figure 20, we compare here the distribution of infection durations for the same simulation conditions with those from the historical clinical data, but show the results for children aged 1–5 years rather than naive hosts in the simulation.

Appendix 1—figure 22

Comparison of the distribution of infection durations among naive hosts during the pre-IRS phase in simulated non-seasonal, semi-open systems where times between local transmission events follow a Gamma distribution, versus historical clinical data from neurosyphilis patients treated with *Plasmodium falciparum*.

In the simulations, each infection can clear before all of its *var* genes have been expressed and recognized. Specifically, during the expression of any gene, whether non-final or final, there is a small probability of infection clearance that depends on the host’s pre-existing immunity to that gene’s epitopes.

Appendix 1—figure 23

As in Appendix 1—figure 22, we compare here the distribution of infection durations under the same simulation conditions with those from the historical clinical data, but show the results for children aged 1–5 years rather than naive hosts in the simulation.

Appendix 1—figure 24

Comparison of the distribution of infection durations among naive hosts during the pre-IRS phase in simulated non-seasonal, regionally open systems where times between local transmission events follow a Gamma distribution, versus historical clinical data from neurosyphilis patients treated with *Plasmodium falciparum*.

In the simulations, each infection can clear before all of its *var* genes have been expressed and recognized. Specifically, during the expression of any gene, whether non-final or final, there is a small probability of infection clearance that depends on the host’s pre-existing immunity to that gene’s epitopes.

Appendix 1—figure 25

As in Appendix 1—figure 24, we compare here the distribution of infection durations under the same simulation conditions with those from the historical clinical data, but show the results for children aged 1–5 years rather than naive hosts in the simulation.

Data availability

The sequences utilized in this study are publicly available in GenBank under BioProject number PRJNA396962. All data associated with this study, including de-identified individual participant data, are available in the manuscript, appendices, and on GitHub (https://github.com/UniMelb-Day-Lab/FOI_Pf_Ghana; copy archived at Tiedje, 2026). Redistribution or reuse of the participant metadata provided in this GitHub repository requires proper attribution and prior approval. Researchers interested in further use of these data should contact the Malaria Reservoir Study Team, represented by Prof. Karen Day (karen.day@unimelb.edu.au), a co-author of this work, to discuss how these data will be utilized for academic or research purposes and, if appropriate, to identify opportunities for collaboration. This contact ensures that the ethical standards of the study are maintained, while fostering responsible data stewardship and collaboration. The simulation code and analysis scripts are available at https://github.com/qzhan321/FOI (copy archived at Zhan, 2026).

The following previously published data sets were used

1. Malaria Reservoir Study Team
(2016) NCBI BioProject
ID PRJNA396962. Plasmodium falciparum (malaria parasite P. falciparum).

https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA396962

References

1. Abukari Z
2. Okonu R
3. Nyarko SB
4. Lo AC
5. Dieng CC
6. Salifu SP
7. Gyan BA
8. Lo E
9. Amoah LE
(2019) The diversity, multiplicity of infection and population structure of P. falciparum parasites circulating in asymptomatic carriers living in high and low malaria transmission settings of Ghana
Genes 10:434.

https://doi.org/10.3390/genes10060434
1. Alonso PL
2. Sacarlal J
3. Aponte JJ
4. Leach A
5. Macete E
6. Milman J
7. Mandomando I
8. Spiessens B
9. Guinovart C
10. Espasa M
11. Bassat Q
12. Aide P
13. Ofori-Anyinam O
14. Navia MM
15. Corachan S
16. Ceuppens M
17. Dubois M-C
18. Demoitié M-A
19. Dubovsky F
20. Menéndez C
21. Tornieporth N
22. Ballou WR
23. Thompson R
24. Cohen J
(2004) Efficacy of the RTS,S/AS02A vaccine against Plasmodium falciparum infection and disease in young African children: randomised controlled trial
Lancet 364:1411–1420.

https://doi.org/10.1016/S0140-6736(04)17223-1
1. Anderson TJC
2. Haubold B
3. Williams JT
4. Estrada-Franco JG
5. Richardson L
6. Mollinedo R
7. Bockarie M
8. Mokili J
9. Mharakurwa S
10. French N
11. Whitworth J
12. Velez ID
13. Brockman AH
14. Nosten F
15. Ferreira MU
16. Day KP
(2000) Microsatellite markers reveal a spectrum of population structures in the malaria parasite Plasmodium falciparum
Molecular Biology and Evolution 17:1467–1482.

https://doi.org/10.1093/oxfordjournals.molbev.a026247
1. Andolina C
2. Rek JC
3. Briggs J
4. Okoth J
5. Musiime A
6. Ramjith J
7. Teyssier N
8. Conrad M
9. Nankabirwa JI
10. Lanke K
11. Rodriguez-Barraquer I
12. Meerstein-Kessel L
13. Arinaitwe E
14. Olwoch P
15. Rosenthal PJ
16. Kamya MR
17. Dorsey G
18. Greenhouse B
19. Drakeley C
20. Staedke SG
21. Bousema T
(2021) Sources of persistent malaria transmission in a setting with effective malaria control in eastern Uganda: a longitudinal, observational cohort study
The Lancet. Infectious Diseases 21:1568–1578.

https://doi.org/10.1016/S1473-3099(21)00072-4
1. Andrade CM
2. Carrasquilla M
3. Dabbas U
4. Briggs J
5. van Dijk H
6. Sergeev N
7. Sissoko A
8. Niangaly M
9. Ntalla C
10. LaVerriere E
11. Skinner J
12. Golob K
13. Richter J
14. Cisse H
15. Li S
16. Hendry JA
17. Asghar M
18. Doumtabe D
19. Farnert A
20. Ruppert T
21. Neafsey DE
22. Kayentao K
23. Doumbo S
24. Ongoiba A
25. Crompton PD
26. Traore B
27. Greenhouse B
28. Portugal S
(2024) Infection length and host environment influence on Plasmodium falciparum dry season reservoir
EMBO Molecular Medicine 16:2349–2375.

https://doi.org/10.1038/s44321-024-00127-w
1. Argyropoulos DC
2. Tan MH
3. Adobor C
4. Mensah B
5. Labbé F
6. Tiedje KE
7. Koram KA
8. Ghansah A
9. Day KP
(2023) Performance of SNP barcodes to determine genetic diversity and population structure of Plasmodium falciparum in Africa
Frontiers in Genetics 14:1071896.

https://doi.org/10.3389/fgene.2023.1071896
1. Arnot D
(1998) Unstable malaria in Sudan: the influence of the dry season. Clone multiplicity of Plasmodium falciparum infections in individuals exposed to variable levels of disease transmission
Transactions of the Royal Society of Tropical Medicine and Hygiene 92:580–585.

https://doi.org/10.1016/s0035-9203(98)90773-8
1. Ashley EA
2. White NJ
(2014) The duration of Plasmodium falciparum infections
Malaria Journal 13:500.

https://doi.org/10.1186/1475-2875-13-500
1. Barry AE
2. Leliwa-Sytek A
3. Tavul L
4. Imrie H
5. Migot-Nabias F
6. Brown SM
7. McVean GAV
8. Day KP
(2007) Population genomics of the immune evasion (var) genes of Plasmodium falciparum
PLOS Pathogens 3:e34.

https://doi.org/10.1371/journal.ppat.0030034
1. Barry A
2. Awandu SS
3. Tiono AB
4. Grignard L
5. Bousema T
6. Collins KA
(2021) Improved detectability of Plasmodium falciparum clones with repeated sampling in incident and chronic infections in Burkina Faso
The American Journal of Tropical Medicine and Hygiene 106:664–666.

https://doi.org/10.4269/ajtmh.21-0493
1. Baruch DI
2. Pasloske BL
3. Singh HB
4. Bi X
5. Ma XC
6. Feldman M
7. Taraschi TF
8. Howard RJ
(1995) Cloning the P. falciparum gene encoding PfEMP1, a malarial variant antigen and adherence receptor on the surface of parasitized human erythrocytes
Cell 82:77–87.

https://doi.org/10.1016/0092-8674(95)90054-3
1. Beier JC
2. Oster CN
3. Onyango FK
4. Bales JD
5. Sherwood JA
6. Perkins PV
7. Chumo DK
8. Koech DV
9. Whitmire RE
10. Roberts CR
11. Diggs CL
12. Hoffman SL
(1994) Plasmodium falciparum incidence relative to entomologic inoculation rates at a site proposed for testing malaria vaccines in western Kenya
The American Journal of Tropical Medicine and Hygiene 50:529–536.

https://doi.org/10.4269/ajtmh.1994.50.529
(1976) Estimation of incidence and recovery rates of Plasmodium falciparum parasitaemia from longitudinal data
Bulletin of the World Health Organization 54:685–693.
1. Bopp SER
2. Manary MJ
3. Bright AT
4. Johnston GL
5. Dharia NV
6. Luna FL
7. McCormack S
8. Plouffe D
9. McNamara CW
10. Walker JR
11. Fidock DA
12. Denchi EL
13. Winzeler EA
(2013) Mitotic evolution of Plasmodium falciparum shows a stable core genome but recombination in antigen families
PLOS Genetics 9:e1003293.

https://doi.org/10.1371/journal.pgen.1003293
1. Bretscher MT
2. Maire N
3. Chitnis N
4. Felger I
5. Owusu-Agyei S
6. Smith T
(2011) The distribution of Plasmodium falciparum infection durations
Epidemics 3:109–118.

https://doi.org/10.1016/j.epidem.2011.03.002
1. Bruce MC
2. Galinski MR
3. Barnwell JW
4. Donnelly CA
5. Walmsley M
6. Alpers MP
7. Walliker D
8. Day KP
(2000) Genetic diversity and dynamics of Plasmodium falciparum and P. vivax populations in multiply infected children with asymptomatic malaria infections in Papua New Guinea
Parasitology 121:257–272.

https://doi.org/10.1017/s0031182099006356
1. Buckee CO
2. Recker M
(2012) Evolution of the multi-domain structures of virulence genes in the human malaria parasite, Plasmodium falciparum
PLOS Computational Biology 8:e1002451.

https://doi.org/10.1371/journal.pcbi.1002451
(2010) Age-patterns of malaria vary with severity, transmission intensity and seasonality in sub-Saharan Africa: a systematic review and pooled analysis
PLOS ONE 5:e8988.

https://doi.org/10.1371/journal.pone.0008988
(2016) Variation in infection length and superinfection enhance selection efficiency in the human malaria parasite
Scientific Reports 6:26370.

https://doi.org/10.1038/srep26370
1. Chang HH
2. Worby CJ
3. Yeka A
4. Nankabirwa J
5. Kamya MR
6. Staedke SG
7. Dorsey G
8. Murphy M
9. Neafsey DE
10. Jeffreys AE
11. Hubbart C
12. Rockett KA
13. Amato R
14. Kwiatkowski DP
15. Buckee CO
16. Greenhouse B
(2017) THE REAL McCOIL: A method for the concurrent estimation of the complexity of infection and SNP allele frequency for malaria parasites
PLOS Computational Biology 13:e1005348.

https://doi.org/10.1371/journal.pcbi.1005348
1. Childs LM
2. Buckee CO
(2015) Dissecting the determinants of malaria chronicity: why within-host models struggle to reproduce infection dynamics
Journal of the Royal Society, Interface 12:20141379.

https://doi.org/10.1098/rsif.2014.1379
1. Choi DW
2. Kim NK
3. Chae KC
(2005) A two-moment approximation for the GI/G/c queue with finite capacity
INFORMS Journal on Computing 17:75–81.

https://doi.org/10.1287/ijoc.1030.0058
1. Claessens A
2. Adams Y
3. Ghumra A
4. Lindergard G
5. Buchan CC
6. Andisi C
7. Bull PC
8. Mok S
9. Gupta AP
10. Wang CW
11. Turner L
12. Arman M
13. Raza A
14. Bozdech Z
15. Rowe JA
(2012) A subset of group A-like var genes encodes the malaria parasite ligands for binding to human brain endothelial cells
PNAS 109:E1772–E1781.

https://doi.org/10.1073/pnas.1120461109
(2014) Generation of antigenic diversity in Plasmodium falciparum by structured rearrangement of Var genes during mitosis
PLOS Genetics 10:e1004812.

https://doi.org/10.1371/journal.pgen.1004812
(1964) Fluorescent antibody studies in human malaria. II. Development and persistence of antibodies to Plasmodium falciparum
The American Journal of Tropical Medicine and Hygiene 13:256–260.

https://doi.org/10.4269/ajtmh.1964.13.256
(1968) Studies on the persistence of malarial antibody response
American Journal of Epidemiology 87:592–598.

https://doi.org/10.1093/oxfordjournals.aje.a120849
1. Collins WE
2. Jeffery GM
(1999) A retrospective examination of sporozoite- and trophozoite-induced infections with Plasmodium falciparum in patients previously infected with heterologous species of Plasmodium: effect on development of parasitologic and clinical immunity
The American Journal of Tropical Medicine and Hygiene 61:36–43.

https://doi.org/10.4269/tropmed.1999.61-036
1. Crompton PD
2. Moebius J
3. Portugal S
4. Waisberg M
5. Hart G
6. Garver LS
7. Miller LH
8. Barillas-Mury C
9. Pierce SK
(2014) Malaria immunity in man and mosquito: insights into unsolved mysteries of a deadly infectious disease
Annual Review of Immunology 32:157–187.

https://doi.org/10.1146/annurev-immunol-032713-120220
1. Daniels R
2. Volkman SK
3. Milner DA
4. Mahesh N
5. Neafsey DE
6. Park DJ
7. Rosen D
8. Angelino E
9. Sabeti PC
10. Wirth DF
11. Wiegand RC
(2008) A general SNP-based molecular barcode for Plasmodium falciparum identification and tracking
Malaria Journal 7:223.

https://doi.org/10.1186/1475-2875-7-223
(1996) Rapid turnover of Plasmodium falciparum populations in asymptomatic individuals living in a high transmission area
The American Journal of Tropical Medicine and Hygiene 54:18–26.

https://doi.org/10.4269/ajtmh.1996.54.18
1. Day KP
2. Artzy-Randrup Y
3. Tiedje KE
4. Rougeron V
5. Chen DS
6. Rask TS
7. Rorick MM
8. Migot-Nabias F
9. Deloron P
10. Luty AJF
11. Pascual M
(2017) Evidence of strain structure in Plasmodium falciparum var gene repertoires in children from Gabon, West Africa
PNAS 114:E4103–E4111.

https://doi.org/10.1073/pnas.1613018114
(2009) Common strategies for antigenic variation by bacterial, fungal and protozoan pathogens
Nature Reviews. Microbiology 7:493–503.

https://doi.org/10.1038/nrmicro2145
1. Deitsch KW
2. Dzikowski R
(2017) Variant gene expression and antigenic variation by malaria parasites
Annual Review of Microbiology 71:625–641.

https://doi.org/10.1146/annurev-micro-090816-093841
(2023) An immune memory-structured SIS epidemiological model for hyperdiverse pathogens
PNAS 120:e2218499120.

https://doi.org/10.1073/pnas.2218499120
(1974) A malaria model tested in the African savannah
Bulletin of the World Health Organization 50:347–357.
(2007) Uninfected mosquito bites confer protection against infection with malaria parasites
Infection and Immunity 75:2523–2530.

https://doi.org/10.1128/IAI.01928-06
1. Doolan DL
2. Martinez-Alier N
(2006) Immune response to pre-erythrocytic stages of malaria parasites
Current Molecular Medicine 6:169–185.

https://doi.org/10.2174/156652406776055249
(2009) Acquired immunity to malaria
Clinical Microbiology Reviews 22:13–36.

https://doi.org/10.1128/CMR.00025-08
(2005) On the conservative nature of intragenic recombination
PNAS 102:5380–5385.

https://doi.org/10.1073/pnas.0500729102
1. Earland D
2. Buchwald AG
3. Sixpence A
4. Chimenya M
5. Damson M
6. Seydel KB
7. Mathanga DP
8. Taylor TE
9. Laufer MK
(2019) Impact of multiplicity of Plasmodium falciparum infection on clinical disease in Malawi
The American Journal of Tropical Medicine and Hygiene 101:412–415.

https://doi.org/10.4269/ajtmh.19-0093
Book
1. Efron B
2. Tibshirani RJ
(1994) An Introduction to the Bootstrap
Chapman and Hall/CRC.

https://doi.org/10.1201/9780429246593
1. Eldh M
2. Hammar U
3. Arnot D
4. Beck H-P
5. Garcia A
6. Liljander A
7. Mercereau-Puijalon O
8. Migot-Nabias F
9. Mueller I
10. Ntoumi F
11. Ross A
12. Smith T
13. Sondén K
14. Vafa Homann M
15. Yman V
16. Felger I
17. Färnert A
(2020) Multiplicity of asymptomatic Plasmodium falciparum infections and risk of clinical malaria: a systematic review and pooled analysis of individual participant data
The Journal of Infectious Diseases 221:775–785.

https://doi.org/10.1093/infdis/jiz510
(1997) Daily dynamics of Plasmodium falciparum subpopulations in asymptomatic children in a holoendemic area
The American Journal of Tropical Medicine and Hygiene 56:538–547.

https://doi.org/10.4269/ajtmh.1997.56.538
1. Färnert A
2. Lebbad M
3. Faraja L
4. Rooth I
(2008) Extensive dynamics of Plasmodium falciparum densities, stages and genotyping profiles
Malaria Journal 7:241.

https://doi.org/10.1186/1475-2875-7-241
1. Felger I
2. Tavul L
3. Kabintik S
4. Marshall V
5. Genton B
6. Alpers M
7. Beck HP
(1994) Plasmodium falciparum: extensive polymorphism in merozoite surface antigen 2 alleles in an area with endemic malaria in Papua New Guinea
Experimental Parasitology 79:106–116.

https://doi.org/10.1006/expr.1994.1070
1. Felger I
2. Maire M
3. Bretscher MT
4. Falk N
5. Tiaden A
6. Sama W
7. Beck HP
8. Owusu-Agyei S
9. Smith TA
(2012) The dynamics of natural Plasmodium falciparum infections
PLOS ONE 7:e45542.

https://doi.org/10.1371/journal.pone.0045542
1. Frank M
2. Kirkman L
3. Costantini D
4. Sanyal S
5. Lavazec C
6. Templeton TJ
7. Deitsch KW
(2008) Frequent recombination events generate diversity within the multi-copy variant antigen gene families of Plasmodium falciparum
International Journal for Parasitology 38:1099–1109.

https://doi.org/10.1016/j.ijpara.2008.01.010
(2000) Frequent ectopic recombination of virulence factor genes in telomeric chromosome clusters of P. falciparum
Nature 407:1018–1022.

https://doi.org/10.1038/35039531
1. Gardner MJ
2. Hall N
3. Fung E
4. White O
5. Berriman M
6. Hyman RW
7. Carlton JM
8. Pain A
9. Nelson KE
10. Bowman S
11. Paulsen IT
12. James K
13. Eisen JA
14. Rutherford K
15. Salzberg SL
16. Craig A
17. Kyes S
18. Chan M-S
19. Nene V
20. Shallom SJ
21. Suh B
22. Peterson J
23. Angiuoli S
24. Pertea M
25. Allen J
26. Selengut J
27. Haft D
28. Mather MW
29. Vaidya AB
30. Martin DMA
31. Fairlamb AH
32. Fraunholz MJ
33. Roos DS
34. Ralph SA
35. McFadden GI
36. Cummings LM
37. Subramanian GM
38. Mungall C
39. Venter JC
40. Carucci DJ
41. Hoffman SL
42. Newbold C
43. Davis RW
44. Fraser CM
45. Barrell B
(2002) Genome sequence of the human malaria parasite Plasmodium falciparum
Nature 419:498–511.

https://doi.org/10.1038/nature01097
1. Gatton ML
2. Cheng Q
(2004) Investigating antigenic variation and other parasite-host interactions in Plasmodium falciparum infections in naïve hosts
Parasitology 128:367–376.

https://doi.org/10.1017/s0031182003004608
1. Gibson MA
2. Bruck J
(2000) Efficient exact stochastic simulation of chemical systems with many species and many channels
The Journal of Physical Chemistry A 104:1876–1889.

https://doi.org/10.1021/jp993732q
1. Gillespie DT
(1976) A general method for numerically simulating the stochastic time evolution of coupled chemical reactions
Journal of Computational Physics 22:403–434.

https://doi.org/10.1016/0021-9991(76)90041-3
1. Gogue C
2. Wagman J
3. Tynuv K
4. Saibu A
5. Yihdego Y
6. Malm K
7. Mohamed W
8. Akplu W
9. Tagoe T
10. Ofosu A
11. Williams I
12. Asiedu S
13. Richardson J
14. Fornadel C
15. Slutsker L
16. Robertson M
(2020) An observational analysis of the impact of indoor residual spraying in Northern, Upper East, and Upper West Regions of Ghana: 2014 through 2017
Malaria Journal 19:242.

https://doi.org/10.1186/s12936-020-03318-1
(2012) Sexual development in Plasmodium: lessons from functional analyses
PLOS Pathogens 8:e1002404.

https://doi.org/10.1371/journal.ppat.1002404
(2018) Networks of genetic similarity reveal non-neutral processes shape strain structure in Plasmodium falciparum
Nature Communications 9:1817.

https://doi.org/10.1038/s41467-018-04219-3
1. Hergott DEB
2. Owalla TJ
3. Staubus WJ
4. Seilie AM
5. Chavtur C
6. Balkus JE
7. Apio B
8. Lema J
9. Cemeri B
10. Akileng A
11. Chang M
12. Egwang TG
13. Murphy SC
(2024) Assessing the daily natural history of asymptomatic Plasmodium infections in adults and older children in Katakwi, Uganda: a longitudinal cohort study
The Lancet Microbe 5:e72–e80.

https://doi.org/10.1016/S2666-5247(23)00262-8
1. Heyman DP
(1985) Queues and point processes (Peter Franken, Dieter König, Ursula Arndt and Volker Schmidt)
SIAM Review 27:275–276.

https://doi.org/10.1137/1027083
1. Hofmann NE
2. Karl S
3. Wampfler R
4. Kiniboro B
5. Teliki A
6. Iga J
7. Waltmann A
8. Betuela I
9. Felger I
10. Robinson LJ
11. Mueller I
(2017) The complex relationship of exposure to new Plasmodium infections and incidence of clinical malaria in Papua New Guinea
eLife 6:e23708.

https://doi.org/10.7554/eLife.23708
1. Holding T
2. Recker M
(2015) Maintenance of phenotypic diversity within a set of virulence encoding genes of the malaria parasite Plasmodium falciparum
Journal of the Royal Society, Interface 12:20150848.

https://doi.org/10.1098/rsif.2015.0848
1. John CC
2. Moormann AM
3. Pregibon DC
4. Sumba PO
5. Mchugh MM
6. Narum DL
7. Lanar DE
8. Schluchter MD
9. Kazura JW
(2005) Correlation of high levels of antibodies to multiple pre-erythrocytic Plasmodium falciparum antigens and protection from infection
The American Journal of Tropical Medicine and Hygiene 73:222–228.

https://doi.org/10.4269/ajtmh.2005.73.222
1. Kaestli M
2. Cockburn IA
3. Cortés A
4. Baea K
5. Rowe JA
6. Beck HP
(2006) Virulence of malaria is associated with differential expression of Plasmodium falciparum var gene subgroups in a case-control study
The Journal of Infectious Diseases 193:1567–1574.

https://doi.org/10.1086/503776
(1999) Variation of Plasmodium falciparum msp1 block 2 and msp2 allele prevalence and of infection complexity in two neighbouring Senegalese villages with different transmission conditions
Transactions of the Royal Society of Tropical Medicine and Hygiene 93 Suppl 1:21–28.

https://doi.org/10.1016/s0035-9203(99)90323-1
1. Kraemer SM
2. Kyes SA
3. Aggarwal G
4. Springer AL
5. Nelson SO
6. Christodoulou Z
7. Smith LM
8. Wang W
9. Levin E
10. Newbold CI
11. Myler PJ
12. Smith JD
(2007) Patterns of gene recombination shape var gene repertoires in Plasmodium falciparum: comparisons of geographically diverse isolates
BMC Genomics 8:45.

https://doi.org/10.1186/1471-2164-8-45
1. Labbé F
2. He Q
3. Zhan Q
4. Tiedje KE
5. Argyropoulos DC
6. Tan MH
7. Ghansah A
8. Day KP
9. Pascual M
(2023) Neutral vs. non-neutral genetic footprints of Plasmodium falciparum multiclonal infections
PLOS Computational Biology 19:e1010816.

https://doi.org/10.1371/journal.pcbi.1010816
1. Laishram DD
2. Sutton PL
3. Nanda N
4. Sharma VL
5. Sobti RC
6. Carlton JM
7. Joshi H
(2012) The complexities of malaria disease manifestations with a focus on asymptomatic malaria
Malaria Journal 11:29.

https://doi.org/10.1186/1475-2875-11-29
(2008) Immunity to malaria: more questions than answers
Nature Immunology 9:725–732.

https://doi.org/10.1038/ni.f.205
(2013) A network approach to analyzing highly recombinant malaria parasite genes
PLOS Computational Biology 9:e1003268.

https://doi.org/10.1371/journal.pcbi.1003268
(2003) Sub-grouping of Plasmodium falciparum 3D7 var genes based on sequence analysis of coding and non-coding regions
Malaria Journal 2:27.

https://doi.org/10.1186/1475-2875-2-27
1. Lee SA
2. Yeka A
3. Nsobya SL
4. Dokomajilar C
5. Rosenthal PJ
6. Talisuna A
7. Dorsey G
(2006) Complexity of Plasmodium falciparum infections and antimalarial drug efficacy at 7 sites in Uganda
The Journal of Infectious Diseases 193:1160–1163.

https://doi.org/10.1086/501473
(2013) The silent threat: asymptomatic parasitemia and malaria transmission
Expert Review of Anti-Infective Therapy 11:623–639.

https://doi.org/10.1586/eri.13.45
Book
1. Little JDC
2. Graves SC
(2008) Little’s law
In: Chhajed D, Lowe TJ, editors. Building Intuition, International Series in Operations Research & Management Science. Springer. pp. 81–100.

https://doi.org/10.1007/978-0-387-73699-0_5
(2005) Superspreading and the effect of individual variation on disease emergence
Nature 438:355–359.

https://doi.org/10.1038/nature04153
1. Macdonald G
(1950) The analysis of malaria parasite rates in infants
Tropical Diseases Bulletin 47:915–938.
1. Maire N
2. Smith T
3. Ross A
4. Owusu-Agyei S
5. Dietz K
6. Molineaux L
(2006) A model for natural immunity to asexual blood stages of Plasmodium falciparum malaria in endemic areas
The American Journal of Tropical Medicine and Hygiene 75:19–31.

https://doi.org/10.4269/ajtmh.2006.75.19
(2002) Malaria therapy reinoculation data suggest individual variation of an innate immune response and independent acquisition of antiparasitic and antitoxic immunities
Transactions of the Royal Society of Tropical Medicine and Hygiene 96:205–209.

https://doi.org/10.1016/s0035-9203(02)90308-1
1. Msuya FH
2. Curtis CF
(1991) Trial of pyrethroid impregnated bednets in an area of Tanzania holoendemic for malaria Part 4. Effects on incidence of malaria infection
Acta Tropica 49:165–171.

https://doi.org/10.1016/0001-706x(91)90035-i
1. Mueller I
2. Schoepflin S
3. Smith TA
4. Benton KL
5. Bretscher MT
6. Lin E
7. Kiniboro B
8. Zimmerman PA
9. Speed TP
10. Siba P
11. Felger I
(2012) Force of infection is key to understanding the epidemiology of Plasmodium falciparum malaria in Papua New Guinean children
PNAS 109:10030–10035.

https://doi.org/10.1073/pnas.1200841109
Book
1. Muench H
(1959) Catalytic Models in Epidemiology
Harvard University Press.

https://doi.org/10.4159/harvard.9780674428928
(2017) Estimating age-time-dependent malaria force of infection accounting for unobserved heterogeneity
Epidemiology and Infection 145:2545–2562.

https://doi.org/10.1017/S0950268817001297
1. Nkhoma SC
2. Trevino SG
3. Gorena KM
4. Nair S
5. Khoswe S
6. Jett C
7. Garcia R
8. Daniel B
9. Dia A
10. Terlouw DJ
11. Ward SA
12. Anderson TJC
13. Cheeseman IH
(2020) Co-transmission of related malaria parasite lineages shapes within-host parasite diversity
Cell Host & Microbe 27:93–103.

https://doi.org/10.1016/j.chom.2019.12.001
(2012) Factors determining the occurrence of submicroscopic malaria infections and their relevance for control
Nature Communications 3:1237.

https://doi.org/10.1038/ncomms2241
(2001) Genetic diversity of Plasmodium falciparum and its relationship to parasite density in an area with different malaria endemicities in West Uganda
Tropical Medicine & International Health 6:607–613.

https://doi.org/10.1046/j.1365-3156.2001.00761.x
1. Pilosof S
2. He Q
3. Tiedje KE
4. Ruybal-Pesántez S
5. Day KP
6. Pascual M
(2019) Competition for hosts modulates vast antigenic diversity to generate persistent strain structure in Plasmodium falciparum
PLOS Biology 17:e3000336.

https://doi.org/10.1371/journal.pbio.3000336
(1999) Plasmodium falciparum: analysis of the antibody specificity to the surface of the trophozoite-infected erythrocyte
Experimental Parasitology 91:161–169.

https://doi.org/10.1006/expr.1998.4368
1. Potthoff RF
2. Whittinghill M
(1966) Testing for homogeneity. II. The Poisson distribution
Biometrika 53:183–190.

https://doi.org/10.1093/biomet/53.1-2.183
1. Pull JH
2. Grab B
(1974) A simple epidemiological model for evaluating the malaria inoculation rate and the risk of infection in infants
Bulletin of the World Health Organization 51:507–516.
(2010) Plasmodium falciparum erythrocyte membrane protein 1 diversity in seven genomes – divide and conquer
PLOS Computational Biology 6:e1000933.

https://doi.org/10.1371/journal.pcbi.1000933
1. Rottmann M
2. Lavstsen T
3. Mugasa JP
4. Kaestli M
5. Jensen ATR
6. Müller D
7. Theander T
8. Beck H-P
(2006) Differential expression of var gene groups is associated with morbidity caused by Plasmodium falciparum infection in Tanzanian children
Infection and Immunity 74:3904–3911.

https://doi.org/10.1128/IAI.02073-05
1. Ruybal-Pesántez S
2. Tiedje KE
3. Tonkin-Hill G
4. Rask TS
5. Kamya MR
6. Greenhouse B
7. Dorsey G
8. Duffy MF
9. Day KP
(2017) Population genomics of virulence genes of Plasmodium falciparum in clinical isolates from Uganda
Scientific Reports 7:11810.

https://doi.org/10.1038/s41598-017-11814-9
1. Ruybal-Pesántez S
2. Tiedje KE
3. Pilosof S
4. Tonkin-Hill G
5. He Q
6. Rask TS
7. Amenga-Etego L
8. Oduro AR
9. Koram KA
10. Pascual M
11. Day KP
(2022) Age-specific patterns of DBLα var diversity can explain why residents of high malaria transmission areas remain susceptible to Plasmodium falciparum blood stage infection throughout life
International Journal for Parasitology 52:721–731.

https://doi.org/10.1016/j.ijpara.2021.12.001
(2010) Using the entomological inoculation rate to assess the impact of vector control on malaria parasite transmission and elimination
Malaria Journal 9:122.

https://doi.org/10.1186/1475-2875-9-122
(2002) Population dynamics of untreated Plasmodium falciparum malaria within the adult human host during the expansion phase of the infection
Parasitology 124:247–263.

https://doi.org/10.1017/s0031182001001202
(1995) Switches in expression of Plasmodium falciparum var genes correlate with changes in antigenic and cytoadherent phenotypes of infected erythrocytes
Cell 82:101–110.

https://doi.org/10.1016/0092-8674(95)90056-x
1. Smith T
2. Vounatsou P
(2003) Estimation of infection and recovery rates for highly polymorphic parasites when detectability is imperfect, using hidden Markov models
Statistics in Medicine 22:1709–1724.

https://doi.org/10.1002/sim.1274
1. Smith DL
2. Drakeley CJ
3. Chiyaka C
4. Hay SI
(2010) A quantitative analysis of transmission efficiency versus intensity for malaria
Nature Communications 1:108.

https://doi.org/10.1038/ncomms1107
1. Sondo P
2. Derra K
3. Lefevre T
4. Diallo-Nakanabo S
5. Tarnagda Z
6. Zampa O
7. Kazienga A
8. Valea I
9. Sorgho H
10. Ouedraogo JB
11. Guiguemde TR
12. Tinto H
(2019) Genetically diverse Plasmodium falciparum infections, within-host competition and symptomatic malaria in humans
Scientific Reports 9:127.

https://doi.org/10.1038/s41598-018-36493-y
1. Sondo P
2. Derra K
3. Rouamba T
4. Nakanabo Diallo S
5. Taconet P
6. Kazienga A
7. Ilboudo H
8. Tahita MC
9. Valéa I
10. Sorgho H
11. Lefèvre T
12. Tinto H
(2020) Determinants of Plasmodium falciparum multiplicity of infection and genetic diversity in Burkina Faso
Parasites & Vectors 13:427.

https://doi.org/10.1186/s13071-020-04302-z
(1995) The large diverse gene family var encodes proteins involved in cytoadherence and antigenic variation of Plasmodium falciparum-infected erythrocytes
Cell 82:89–100.

https://doi.org/10.1016/0092-8674(95)90055-1
1. Tan MH
2. Shim H
3. Chan Y
4. Day KP
(2023) Unravelling var complexity: Relationship between DBLα types and var genes in Plasmodium falciparum
Frontiers in Parasitology 1:1006341.

https://doi.org/10.3389/fpara.2022.1006341
1. Tiedje KE
2. Oduro AR
3. Agongo G
4. Anyorigiya T
5. Azongo D
6. Awine T
7. Ghansah A
8. Pascual M
9. Koram KA
10. Day KP
(2017) Seasonal variation in the epidemiology of asymptomatic Plasmodium falciparum infections across two catchment areas in Bongo District, Ghana
The American Journal of Tropical Medicine and Hygiene 97:199–212.

https://doi.org/10.4269/ajtmh.16-0959
1. Tiedje KE
2. Oduro AR
3. Bangre O
4. Amenga-Etego L
5. Dadzie SK
6. Appawu MA
7. Frempong K
8. Asoala V
9. Ruybal-Pesántez S
10. Narh CA
11. Deed SL
12. Argyropoulos DC
13. Ghansah A
14. Agyei SA
15. Segbaya S
16. Desewu K
17. Williams I
18. Simpson JA
19. Malm K
20. Pascual M
21. Koram KA
22. Day KP
(2022) Indoor residual spraying with a non-pyrethroid insecticide reduces the reservoir of Plasmodium falciparum in a high-transmission area in northern Ghana
PLOS Global Public Health 2:e0000285.

https://doi.org/10.1371/journal.pgph.0000285
1. Tiedje KE
2. Zhan Q
3. Ruybal-Pesántez S
4. Tonkin-Hill G
5. He Q
6. Tan MH
7. Argyropoulos DC
8. Deed SL
9. Ghansah A
10. Bangre O
11. Oduro AR
12. Koram KA
13. Pascual M
14. Day KP
(2025) Measuring changes in Plasmodium falciparum census population size in response to sequential malaria control interventions
eLife 12:RP91411.

https://doi.org/10.7554/eLife.91411
Software
1. Tiedje K
(2026) FOI_Pf_Ghana, version swh:1:rev:dae7918eba4b6e4bbe7f75100a5a7ce77f41f63b
Software Heritage.

https://archive.softwareheritage.org/swh:1:dir:35b7d4d5f1cae18a1393a20fe616139933a52a05;origin=https://github.com/UniMelb-Day-Lab/FOI_Pf_Ghana;visit=swh:1:snp:3ec661cc540530d132e8d925b39c8a44ae1de3b8;anchor=swh:1:rev:dae7918eba4b6e4bbe7f75100a5a7ce77f41f63b
1. Tran TM
2. Li S
3. Doumbo S
4. Doumtabe D
5. Huang CY
6. Dia S
7. Bathily A
8. Sangala J
9. Kone Y
10. Traore A
11. Niangaly M
12. Dara C
13. Kayentao K
14. Ongoiba A
15. Doumbo OK
16. Traore B
17. Crompton PD
(2013) An intensive longitudinal cohort study of Malian children and adults reveals no evidence of acquired immunity to Plasmodium falciparum infection
Clinical Infectious Diseases 57:40–47.

https://doi.org/10.1093/cid/cit174
(2011) Modelling the impact of vector control interventions on Anopheles gambiae population dynamics
Parasites & Vectors 4:153.

https://doi.org/10.1186/1756-3305-4-153
1. Wong W
2. Griggs AD
3. Daniels RF
4. Schaffner SF
5. Ndiaye D
6. Bei AK
7. Deme AB
8. MacInnis B
9. Volkman SK
10. Hartl DL
11. Neafsey DE
12. Wirth DF
(2017) Genetic relatedness analysis reveals the cotransmission of genetically related Plasmodium falciparum parasites in Thiès, Senegal
Genome Medicine 9:5.

https://doi.org/10.1186/s13073-017-0398-0
1. Wong W
2. Volkman S
3. Daniels R
4. Schaffner S
5. Sy M
6. Ndiaye YD
7. Badiane AS
8. Deme AB
9. Diallo MA
10. Gomis J
11. Sy N
12. Ndiaye D
13. Wirth DF
14. Hartl DL
(2022) R_H: a genetic metric for measuring intrahost Plasmodium falciparum relatedness and distinguishing cotransmission from superinfection
PNAS Nexus 1:pgac187.

https://doi.org/10.1093/pnasnexus/pgac187
Report
1. World Health Organization
(2015) Indoor residual spraying: an operational manual for indoor residual spraying (IRS) for malaria transmission control and elimination
World Health Organization.
Report
1. World Health Organization
(2023) World malaria report 2023
World Health Organization.
1. Zhan Q
2. He Q
3. Tiedje KE
4. Day KP
5. Pascual M
(2024) Hyper-diverse antigenic variation and resilience to transmission-reducing intervention in falciparum malaria
Nature Communications 15:7343.

https://doi.org/10.1038/s41467-024-51468-6
Software
1. Zhan Q
(2026) FOI, version swh:1:rev:5e950240ae254393db13bd99c1cc7eacc1fa0972
Software Heritage.

https://archive.softwareheritage.org/swh:1:dir:d357f5528502f0531c33f0f35a41b96629d3682e;origin=https://github.com/qzhan321/FOI;visit=swh:1:snp:3e87b76f9f52cf255513c96f16ec3b2bb92f3442;anchor=swh:1:rev:5e950240ae254393db13bd99c1cc7eacc1fa0972
1. Zhang X
2. Deitsch KW
(2022) The mystery of persistent, asymptomatic Plasmodium falciparum infections
Current Opinion in Microbiology 70:102231.

https://doi.org/10.1016/j.mib.2022.102231
1. Zhong D
2. Koepfli C
3. Cui L
4. Yan G
(2018) Molecular approaches to determine the multiplicity of Plasmodium infections
Malaria Journal 17:172.

https://doi.org/10.1186/s12936-018-2322-5

Article and author information

Author details

Qi Zhan

Committee on Genetics, Genomics and Systems Biology, The University of Chicago, Chicago, United States

Present address
Division of Infectious Diseases, Stanford University School of Medicine, Stanford, United States

Contribution
Conceptualization, Data curation, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing – original draft, Project administration, Writing – review and editing

For correspondence
qz1111@stanford.edu

Competing interests
No competing interests declared

ORCID iD: 0000-0001-7959-817X

Kathryn E Tiedje

Department of Microbiology and Immunology, Bio21 Institute and The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, Australia

Contribution
Resources, Data curation, Writing – review and editing

Competing interests
No competing interests declared

ORCID iD: 0000-0003-3305-0533

Karen P Day

Department of Microbiology and Immunology, Bio21 Institute and The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, Australia

Contribution
Supervision, Funding acquisition, Writing – review and editing

Competing interests
No competing interests declared

ORCID iD: 0000-0002-6115-6135

Mercedes Pascual

1. Department of Biology and Department of Environmental Studies, New York University, New York, United States
2. Santa Fe Institute, Santa Fe, United States

Contribution
Conceptualization, Supervision, Funding acquisition, Writing – original draft, Writing – review and editing

For correspondence
mp6774@nyu.edu

Competing interests
No competing interests declared

ORCID iD: 0000-0003-3575-7233

Funding

Fogarty International Center (R01-TW009670)

Karen P Day
Mercedes Pascual

National Institute of Allergy and Infectious Diseases (R01-AI149779)

Karen P Day
Mercedes Pascual

The funders had no role in study design, data collection, and interpretation, or the decision to submit the work for publication.

Acknowledgements

We wish to thank the participants, communities, and the Ghana Health Service in Bongo District, Ghana, for their willingness to participate in the study of empirical data. We would like to thank the field teams in Bongo for their technical assistance in the field, as well as the laboratory personnel at the Navrongo Health Research Centre for their expertise and for undertaking the sample collections and parasitological assessments. This research was supported by Fogarty International Center at the National Institutes of Health through the joint NIH-NSF-NIFA Ecology and Evolution of Infectious Diseases award R01-TW009670 to K.P.D. and M.P.; and the National Institute of Allergy and Infectious Diseases, National Institutes of Health through the joint NIH-NSF-NIFA Ecology and Evolution of Infectious Diseases award R01-AI149779 to K.P.D. and M.P. We appreciate the support of the Research Computing Center at the University of Chicago through the computational resources of the Midway cluster.

Ethics

The study was reviewed/approved by the ethics committees at the Navrongo Health Research Centre, Ghana (NHRC IRB-131), Noguchi Memorial Institute for Medical Research, Ghana (NMIMR-IRB CPN 089/11-12; NMIMR-IRB CPN 066/20-21), The University of Chicago, United States (IRB14-1495; IRB19-0760; IRB21-0417), New York University, United States (IRB-FY2024-8572), and The University of Melbourne, Australia (Project IDs 13433, 31586, 21649). Individual informed consent was obtained in the local language (i.e., Gurene) from each participant enrolled by signature or thumbprint, accompanied by the signature of an independent witness. For children <18 years of age, a parent or guardian provided consent. In addition, all children between the ages of 12 and 17 provided assent. Details on the study area, study population, inclusion/exclusion criteria, and data collection procedures have been previously described (Tiedje et al., 2017; Tiedje et al., 2022).

Version history

Preprint posted: May 27, 2024
Sent for peer review: July 2, 2024
Reviewed Preprint version 1: September 9, 2024
Reviewed Preprint version 2: May 15, 2025
Reviewed Preprint version 3: March 6, 2026
Version of Record published: June 23, 2026

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.100076. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Cite this article

Qi Zhan, Kathryn E Tiedje, Karen P Day, Mercedes Pascual (2026)
From multiplicity of infection to force of infection in sparsely sampled high-transmission Plasmodium falciparum populations
eLife 13:RP100076.
https://doi.org/10.7554/eLife.100076.4

Categories and tags

Research organism

P. falciparum
