Strong isolation by distance and evidence of population microstructure reflect ongoing Plasmodium falciparum transmission in Zanzibar

  1. Sean V Connelly  Is a corresponding author
  2. Nicholas F Brazeau
  3. Mwinyi Msellem
  4. Billy E Ngasala
  5. Ozkan Aydemir
  6. Varun Goel
  7. Karamoko Niaré
  8. David J Giesbrecht
  9. Zachary R Popkin-Hall
  10. Chris Hennelly
  11. Zackary Park
  12. Ann M Moormann
  13. John M Ong'echa
  14. Robert Verity
  15. Safia Mohammed
  16. Shija J Shija
  17. Lwidiko E Mhamilawa
  18. Ulrika Morris
  19. Andreas Mårtensson
  20. Jessica T Lin
  21. Anders Björkman
  22. Jonathan J Juliano
  23. Jeffrey A Bailey
  1. MD-PhD Program, University of North Carolina at Chapel Hill, United States
  2. Research Division, Ministry of Health, United Republic of Tanzania
  3. Department of Parasitology and Medical Entomology, Muhimbili University of Health and Allied Sciences, United Republic of Tanzania
  4. Global Health and Migration Unit, Department of Women's and Children's Health, Uppsala University, Sweden
  5. Department of Medicine, University of Massachusetts Chan Medical School, United States
  6. Carolina Population Center, University of North Carolina at Chapel Hill, United States
  7. Department of Pathology and Laboratory Medicine, Brown University, United States
  8. Institute for Global Health and Infectious Diseases, School of Medicine, University of North Carolina at Chapel Hill, United States
  9. Division of Infectious Diseases, Department of Medicine, School of Medicine, University of North Carolina at Chapel Hill, United States
  10. Center for Global Health Research, Kenya Medical Research Institute, Kenya
  11. MRC Centre for Global Infectious Disease Analysis, Imperial College London, United Kingdom
  12. Zanzibar Malaria Elimination Program (ZAMEP), United Republic of Tanzania
  13. Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Sweden
  14. Department of Global Public Health, Karolinska Institute, Sweden
  15. Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, United States
  16. Curriculum in Genetics and Molecular Biology, University of North Carolina at Chapel Hill, United States

eLife assessment

Connelly and colleagues provide convincing genetic evidence that importation from mainland Tanzania is a major source of Plasmodium falciparum lineages currently circulating in Zanzibar. This study also reveals ongoing local malaria transmission and occasional near-clonal outbreaks in Zanzibar. Overall, the article effectively highlights the role of human movements in maintaining residual malaria transmission in an area targeted for intensive control interventions over the past decades and provides clear and valuable information for epidemiologists and public health professionals.

https://doi.org/10.7554/eLife.90173.3.sa0

Abstract

Background:

The Zanzibar archipelago of Tanzania has become a low-transmission area for Plasmodium falciparum. Despite being considered an area of pre-elimination for years, achieving elimination has been difficult, likely due to a combination of imported infections from mainland Tanzania and continued local transmission.

Methods:

To shed light on these sources of transmission, we applied highly multiplexed genotyping utilizing molecular inversion probes to characterize the genetic relatedness of 282 P. falciparum isolates collected across Zanzibar and in Bagamoyo district on the coastal mainland from 2016 to 2018.

Results:

Overall, parasite populations on the coastal mainland and Zanzibar archipelago remain highly related. However, parasite isolates from Zanzibar exhibit population microstructure due to the rapid decay of parasite relatedness over very short distances. This, along with highly related pairs within shehias, suggests ongoing low-level local transmission. We also identified highly related parasites across shehias that reflect human mobility on the main island of Unguja and identified a cluster of highly related parasites, suggestive of an outbreak, in the Micheweni district on Pemba island. Parasites in asymptomatic infections demonstrated higher complexity of infection than those in symptomatic infections, but have similar core genomes.

Conclusions:

Our data support importation as a main source of genetic diversity and contribution to the parasite population in Zanzibar, but they also show local outbreak clusters where targeted interventions are essential to block local transmission. These results highlight the need for preventive measures against imported malaria and enhanced control measures in areas that remain receptive to malaria reemergence due to susceptible hosts and competent vectors.

Funding:

This research was funded by the National Institutes of Health, grants R01AI121558, R01AI137395, R01AI155730, F30AI143172, and K24AI134990. Funding was also contributed from the Swedish Research Council, Erling-Persson Family Foundation, and the Yang Fund. RV acknowledges funding from the MRC Centre for Global Infectious Disease Analysis (reference MR/R015600/1), jointly funded by the UK Medical Research Council (MRC) and the UK Foreign, Commonwealth & Development Office (FCDO), under the MRC/FCDO Concordat agreement and is also part of the EDCTP2 program supported by the European Union. RV also acknowledges funding by Community Jameel.

Introduction

Malaria cases in Tanzania comprise 3% of globally reported cases, but transmission is heterogeneous, with the coastal mainland witnessing declining but substantial transmission of Plasmodium falciparum (Alegana et al., 2021; World Health Organization, 2022). On the other hand, the archipelago of Zanzibar is a pre-elimination setting, with low-level seasonal transmission (Björkman et al., 2019). This is largely due to the routine implementation of a combination of effective control measures, including robust vector control and routine access to effective antimalarials (Björkman et al., 2019). Despite these efforts, malaria has been difficult to eliminate from the archipelago. There are several reasons this may be the case: (1) frequent importation of malaria from moderate- or high-transmission regions of mainland Tanzania and Kenya (Björkman et al., 2019; Le Menach et al., 2011; Lipner et al., 2011; Monroe et al., 2019; Morgan et al., 2020; Tatem et al., 2009); (2) ongoing local transmission due to residual vector capacity despite strong vector control (Björkman et al., 2019); and (3) a reservoir of asymptomatic infections (Björkman and Morris, 2020; Björkman et al., 2019).

Parasite genomics has the potential to help us better understand malaria epidemiology by uncovering population structure and gene flow, providing insight into the changes in the parasite population including how parasites move between regions (Neafsey et al., 2021). Genomics has previously been used to study importation and transmission chains in other low-transmission settings in Africa and elsewhere (Chang et al., 2019; Fola et al., 2023; Morgan et al., 2020; Patel et al., 2014; Roh et al., 2019; Sane et al., 2019). Previously, we had investigated the importation of malaria into Zanzibar from the mainland using whole-genome sequencing, showing highly similar populations within the mainland and within the archipelago, but also identifying highly related parasite pairs between locations, suggesting a role for importation (Morgan et al., 2020). However, this work lacked sufficient samples to assess transmission of parasites within Zanzibar. The larger and spatially rich sample set analyzed in this article offers an opportunity for more refined analyses of transmission across Zanzibar and how parasites are related to those from coastal mainland.

A panel of molecular inversion probes (MIPs), a highly multiplexed genotyping assay, were designed in a previous study to target single-nucleotide polymorphisms (SNPs) throughout the P. falciparum genome (Aydemir et al., 2018). We leveraged this assay to investigate the genetic epidemiology of parasites in the coastal mainland and Zanzibar utilizing 391 samples collected from cross-sectional surveys of both asymptomatic infections and symptomatic, uncomplicated malaria cases during 2016–2018. Specifically, we use identity by descent (IBD) analyses to compare the genetic relatedness of mainland and Zanzibari parasites, and investigate the geography/spatial relationships of genetically related parasites on the archipelago. We further characterize how the genetic complexity of infections (COIs) differs by clinical status and describe patterns of antimalarial drug resistance polymorphisms in the parasite populations. In this low-transmission setting, these analyses characterize fine-scale local parasite populations that contribute to continued transmission within the region, highlighting a key barrier to malaria elimination in the Zanzibar archipelago.

Methods

Samples from coastal Tanzania (178) and Zanzibar (213) were previously sequenced through multiple studies (Table 1, Figure 1—figure supplement 1). These samples include 213 dried blood spots (DBS) collected in Zanzibar between February 2016 and September 2017, coming from cross-sectional surveys of asymptomatic individuals (n = 70) and an in vivo efficacy study of artesunate-amodiaquine (ASAQ) with single low-dose primaquine (SLDP) in pediatric uncomplicated malaria patients in the western and central districts of Unguja island and Micheweni district on Pemba island (n = 143) (Msellem et al., 2020). These samples were geolocalized to shehias, the lowest geographic governmental designation of land in Zanzibar, across its two main islands, Unguja and the northern region of Pemba (Figure 1—figure supplement 1). Mainland Tanzania samples were collected in rural Bagamoyo district, where malaria transmission persists, and residents frequently travel to Dar es Salaam, the major port from where travelers depart for Zanzibar. Of the mainland Bagamoyo samples, 138 were whole blood collected from 2015 to 2017 as part of an in vivo efficacy study of artemether-lumefantrine (AL) in pediatric uncomplicated malaria patients (Topazian et al., 2022), and the remaining 40 samples were leukodepleted blood collected in 2018 from asymptomatic but RDT-positive children who participated in a study investigating the transmission of P. falciparum to colony-reared mosquitoes. This project leveraged MIP data from SRA, including PRJNA926345, PRJNA454490, PRJNA545345, and PRJNA545347.

Table 1
Blood samples from Zanzibar and coastal Tanzania.
DescriptionLocation (district)DatesClinical status*Sample sizeAge range (yr)# in genome-wide analysis# in drug resistance analysis
Community cross-sectional surveysZanzibar (multiple)2016A702–702152
In vivo efficacy study of artesunate-amodiaquine (ASAQ) with single low-dose primaquine (SLDP) in pediatric uncomplicated malaria patientsZanzibar (multiple)2017S1432–60117134
Study of transmission of Plasmodium falciparum to colony-reared mosquitoesMainland Tanzania (Bagamoyo)2018A407–16340
Parasite clearance study of artemether-lumefantrine (AL)Mainland Tanzania (Bagamoyo)2018S1382–11110123
  1. *

    Asymptomatic (A) or symptomatic (S).

In order to place coastal Tanzanian and Zanzibari samples in the context of African P. falciparum population structure across multiple regions, MIP data from 147 whole blood samples collected in Ahero District, Kenya, from the same parasite clearance study were used (Topazian et al., 2022) in conjunction with a subset of data from 2537 samples genotyped for a study of the 2013 Demographic Health Survey of the Democratic Republic of the Congo (DRC), which included samples from DRC, Ghana, Tanzania, Uganda, and Zambia (Verity et al., 2020; see Figure 1—figure supplement 2).

MIP sequencing

Sequence data for the coastal Tanzanian and Zanzibari samples were generated in a similar fashion across studies. Chelex-extracted DNA from DBS and QIAGEN Miniprep (QIAGEN, Germantown, MD)-extracted DNA from leukodepleted blood were used in MIP captures, which were then sequenced as previously described (Aydemir et al., 2018; Verity et al., 2020). Control mixtures of four strains of genomic DNA from P. falciparum laboratory lines were also sequenced as described previously (Verity et al., 2020). We utilized two MIP panels, one being a genome-wide SNP MIP panel and the second being a panel with the known drug resistance markers in P. falciparum (Verity et al., 2020). These libraries were sequenced on Illumina Nextseq 500 instrument using 150 bp paired end sequencing with dual indexing using Nextseq 500/550 Mid-output Kit v2.

MIP variant calling and filtering

MIP sequencing data was processed using MIPTools (v0.4.0, https://github.com/bailey-lab/MIPTools; Hathaway, 2024), which first merges reads and removes errors and unique molecular identifier (UMI) redundancy with MIPWrangler (Aydemir, unpublished). For the genome-wide panel, variant calling was performed using FreeBayes within MIPTools, for a pooled continuous sample that was filtered for a minimum UMI depth of 10, a within-sample allele frequency threshold of 0.01, and a minimum alternate read count of 2 to obtain 5174 variant SNP sites. Utilizing bcftools (version 1.15.1), the samples and loci were filtered to only the known targeted SNPs, requiring a minor allele frequency threshold of 0.01, a sample missingness threshold of 10%, and loci missingness threshold of 15%. After filtering and subsetting to biallelic sites, 282 samples were left at 1270 loci. The final numbers of samples used for analysis by group are shown in Table 1. Sequencing coverage estimates for loci are shown in Figure 1—figure supplement 3.

For the drug resistance panel, variant calling was performed as above, with additional FreeBayes parameters of a haplotype length of 3 and using the 30 best alleles at a given locus. Three aggregate amino acid summary tables were created with reference amino acid UMI counts, alternate amino acid UMI counts, and the coverage depth for each variant, for a total of 309 samples at 2265 SNPs. We focused analysis on the following key known and putative drug resistance molecular marker genes and corresponding mutations: P. falciparum (Pf) chloroquine resistance transporter (Pfcrt: C72S, M74I, N75E, K76T, T93S, H97Y, F145I, I218F, A220S, Q271E, N326S, M343L, C350R, G353V, I356T, R371I), Pf multidrug resistance 1 (Pfmdr1: N86Y, Y184F, S1034C, N1042D, D1246Y), Pf dihydrofolate reductase (Pfdhfr: A16V, N51I, C59R, S108N, I164L), Pf dihydropteroate synthase (Pfdhps: S436A, S436F, A437G, K540E, A581G, A613T, A613S), Pf cytochrome b (Pfcytb: Y268N,Y268S,Y268C), and Pf kelch 13 (Pfk13: P441L, F446I, G449A, N458Y, C469F, C469Y, M476I, A481V, Y493H, R515K, P527H, N537I, N537D, G538V, R539T, I543T, P553L, R561H, V568G, P574L, C580Y, R622I, A675V) (World Health Organization, 2020). Prevalence was calculated separately in Zanzibar or mainland Tanzania for each polymorphism by the number of samples with alternative genotype calls for this polymorphism over the total number of samples genotyped, and an exact 95% confidence interval using the Pearson–Klopper method was calculated for each prevalence.

Analysis of population relatedness and structure

To investigate genetic relatedness of parasites across regions, IBD estimates were assessed using the within-sample major alleles (coercing samples to monoclonal by calling the dominant allele at each locus) and estimated utilizing a maximum likelihood approach using the inbreeding_mle function from the MIPanalyzer package (Verity et al., 2020). This approach has previously been validated as a conservative estimate of IBD (Verity et al., 2020). Next, principal component analysis (PCA) was performed to query the comparative genetic variation of the samples by utilizing the genome-wide SNP panel. We pruned 51 samples that had a pairwise IBD of >0.90 to one randomly selected sample as a representative of the clonal population to avoid clonal structure from dominating the analysis. Within-sample allele frequencies were calculated, with an imputation step replacing missing values with the median per each locus, and PCA was performed using the prcomp function (Verity et al., 2020; R version 4.2.1).

To include geographic information with querying genetic variation, discriminant analysis of principal components (DAPC) was used (Jombart, 2008). Pseudohaplotypes were created by pruning the genotype calls at all loci for each sample into a single haplotype, and redundant haplotypes were removed (282 reduced to 272 with unique pseudohaplotypes). DAPC was conducted at the district level, and samples from districts with less than five samples (272 samples to 270 samples in six districts) were retained (Figure 1—figure supplement 4B). For the main DAPC (Figure 1B), highly related isolates were pruned to a single representative infection (272 reduced to 232) and then included districts with at least five samples (232 reduced to 228 samples in five districts). The DAPC was performed using the adegenet package (Jombart, 2008) with the first 80 PCs based on the cross-validation function xvalDapc. To perform K-means clustering, a cluster K of 1 was assigned to the mainland samples while the kmeans package was used to find the optimal K to cluster the Zanzibar shehias by latitude and longitude. The K-means clustering experiment was used to cluster a continuous space of geographic coordinates in order to compare genetic relatedness in different regions. We selected K = 4 as the inflection point based on the elbow plot (Figure 2—figure supplement 1) and based the number to obtain sufficient subsections of Zanzibar to compare genetic relatedness.

Figure 1 with 4 supplements see all
Parasites between Zanzibar and coastal mainland Tanzania are highly related but microstructure within Zanzibar is apparent.

(A) Principal component analysis (PCA) comparing parasites from symptomatic vs. asymptomatic patients from coastal Tanzania and Zanzibar. Clusters with an identity by descent (IBD) value of > 0.90 were limited to a single representative infection to prevent local structure of highly related isolates within shehias from driving clustering. (B) A discriminant analysis of principal components (DAPC) was performed utilizing isolates with unique pseudohaplotypes, pruning highly related isolates to a single representative infection. Districts were included with at least five isolates remaining to have sufficient samples for the DAPC. For plotting the inset map, the district coordinates (e.g., mainland, Kati, etc.) were calculated from the averages of the shehia centroids within each district.

To investigate how genetic relatedness varies as the distance between pairs increases, isolation by distance was performed across all of Zanzibar and within the islands of Unguja and Pemba. The greater circle (GC) distances between each shehia centroid were calculated (within shehia distances were equal to 0) to find the distance between each shehia in geographic space. Distances were then binned at increments reflecting the max GC distances between regions, which was smallest in Pemba at 12 km and much larger for both Unguja (58 km) and all of Zanzibar (135 km). Within each binned group, the mean IBD with 95% CIs was plotted. For graphing IBD connections at the between and within shehia level, an IBD threshold of 0.25 (half-siblings) or greater was used (Figure 4—figure supplements 1 and 2). In graphing IBD connections at larger distances between islands or between coastal mainland Tanzania and Zanzibar, a between IBD value of 0.125 (quarter-siblings) or greater was used (Figure 4—figure supplements 3 and 4). These plots were created utilizing ggraph in R with the nodes being samples and the edges being IBD estimates.

COI, or the number of parasite clones in a given sample, was determined using THE REAL McCOIL (v2) categorical method (Chang et al., 2017) and the 95% CI was calculated utilizing a nonparametric bootstrap. Fws statistic, which is used to compare the diversity within and between samples in a population, was calculated in R version 4.2.1 through the formula, (1Hw)/Hp, where Hw is the within-sample heterozygosity and Hp is the heterozygosity across the population, and 95% CIs were calculated utilizing a nonparametric bootstrap.

Results

Zanzibari falciparum parasites were closely related to coastal mainland parasites but showed higher within- than between-population IBD and evidence of microstructure on the archipelago

To examine geographic relatedness, we first used PCA. Zanzibari parasites are highly related to other parasites from East Africa and more distantly related to Central and West African isolates (Figure 1—figure supplement 2). PCA of 232 coastal Tanzanian and Zanzibari isolates, after pruning 51 samples with an IBD of >0.9 to one representative sample, demonstrates little population differentiation (Figure 1A).

However, after performing K-means clustering of shehias in Zanzibar and mainland Tanzania, parasites within each population show more highly related pairs within their respective clusters than between clusters (Figure 2). Comparisons of parasite pairs between Zanzibar and coastal Tanzania showed no pairs with an IBD >0.20 (Figure 2, Figure 4—figure supplement 3). Similarly, no pairs with an IBD of ≥0.20 were present in pairwise comparisons between Unguja and Pemba (Figure 4—figure supplement 4).

Figure 2 with 1 supplement see all
Coastal Tanzania and Zanzibari parasites have more highly related pairs within their given region than between regions.

K-means clustering of shehia coordinates was performed using geographic coordinates of all shehias present from the sample population to generate five clusters (colored boxes). All shehias were included to assay pairwise identity by descent (IBD) between differences throughout Zanzibar. K-means cluster assignments were converted into interpretable geographic names Pemba, Unguja North (Unguja_N), Unguja Central (Unguja_C), Unguja South (Unguja_S), and mainland Tanzania (Mainland). Pairwise comparisons of within-cluster IBD (column 1 of IBD distribution plots) and between-cluster IBD (columns 2–5 of IBD distribution plots) were done for all clusters. All IBD values > 0 were plotted for each comparison. In general, within-cluster IBD had more pairwise comparisons containing high IBD identity.

To further assess the differentiation within the parasite population in Zanzibar, we conducted DAPC according to the districts of origin for each isolate. Parasites differentiated geographically, with less variation near the port of Zanzibar town and more differentiation in isolates collected in districts further from the port (Figure 1B). This underlying microstructure is also supported by classic isolation by distance analysis (Figure 3, Figure 3—figure supplement 1). Isolation by distance analysis across all of Zanzibar and within Unguja showed rapid decay of relatedness over very short geographic distances (Figure 3A and B). Interestingly in Pemba, mean IBD remained at a similar relatively high level even at longer distances (Figure 3C).

Figure 3 with 1 supplement see all
Isolation by distance is shown between all Zanzibari parasites (A), only Unguja parasites (B), and only Pemba parasites (C).

Samples were analyzed based on geographic location. Zanzibar (N = 136) (A), Unguja (N = 105) (B), or Pemba (N = 31) (C) and greater circle (GC) distances between pairs of parasite isolates were calculated based on shehia centroid coordinates. These distances were binned at 4 km increments out to 12 km. Identity by descent (IBD) beyond 12 km is shown in Figure 3—figure supplement 1. The maximum GC distance for all of Zanzibar was 135 km, 58 km on Unguja, and 12 km on Pemba. The mean IBD and 95% CI are plotted for each bin.

Within Zanzibar, parasite clones are shared within and between shehias, suggesting local outbreaks

Among the sample pairs in Zanzibar that are highly related (IBD of ≥0.25), we see different patterns of genetic relatedness suggesting common local and short-distance transmission of clones and occasional long-distance transmission (Figure 4). In Unguja (Figure 4A), we see multiple identical or near-identical parasite pairs shared over longer distances, suggesting longer distance gene flow, as well as multiple shehias containing highly related pairs. In northern Pemba, there is one large cluster of highly related parasites shared within and between six shehias (Figure 4B). Network analysis (Figure 4C) for all sample pairs with an IBD of >0.25 from these shehias illustrates this, with pairs linked by yellow lines showing the highest IBD. The largest network represents two highly related clusters (groups linked by yellow edges, mean IBD of 0.99) connected by a highly related intermediate (FMH42), suggesting that the clusters are related through parasites that have recombined while on the archipelago. FMH42 links the lower cluster with pairwise IBD of 0.65 and the upper cluster with a pairwise IBD of 0.27. These symptomatic isolates collected from February 2016 to September 2017 in northern Pemba likely derive from sustained transmission from a seeding event.

Figure 4 with 4 supplements see all
Highly related pairs span long distances across Zanzibar.

Sample pairs were filtered to have identity by descent (IBD) estimates of ≥ 0.25. Within shehia pairwise IBD estimates are shown within Unguja (A) and Pemba (B) as single points, with dark green representing the greatest degree of IBD. Shehias labeled with black dots do not have within IBD estimates of ≥ 0.25. Between shehia IBD reflects pairs of parasites with IBD ≥0.25, with the color of the connecting arc representing the degree of IBD and yellow representing maximal connectivity. Panel (C) shows the network of highly related pairs (IBD ≥ 0.25) within and between the six northern Pemba shehias (note: Micheweni is a shehia in Micheweni district). Samples (nodes) are colored by shehia and IBD estimates (edges) are represented on a continuous scale with increasing width and yellow-shading indicating higher IBD.

Network analysis of within shehia pairwise IBD sharing in Unguja again shows that there is close relatedness on this small geographic scale (Figure 4—figure supplement 1). A cluster of four isolates in the Shakani shehia on Unguja island with pairwise IBDs of 0.99 likely reflects ongoing transmission within Shakani, with similar connections in Bambi and Dimani. Meanwhile, a few distant connections likely reflect the extent of human mobility on the island (Figure 4A). Similar within-district networks on the mainland are shown in Figure 4—figure supplement 2.

Compared to symptomatic infections, asymptomatic infections demonstrate greater genetic complexity, especially in coastal Tanzania

Asymptomatic infections were compared to roughly contemporaneously collected isolates from those presenting with acute, uncomplicated malaria. Asymptomatic infections demonstrated greater COI than symptomatic infections on both the coastal mainland (mean COI 2.5 vs 1.7, p<0.05, Wilcoxon–Mann–Whitney test) and in Zanzibar (mean COI 2.2 vs 1.7, p=0.05, Wilcoxon–Mann–Whitney test) (Figure 5A). A similar pattern was seen when evaluating Fws, which measures the diversity within a sample compared to the population, with lower Fws in asymptomatic samples consistent with higher within-host complexity, with a more pronounced difference on the mainland (Figure 5B). Despite these differences, parasites from asymptomatic and symptomatic infections tended to cluster together in PCA, suggesting that their core genomes are genetically similar and do not vary based on clinical status (Figure 1A).

Complexity of infection (COI) and Fws metric shows a higher COI and lower Fws in asymptomatic than symptomatic infections in both mainland Tanzania and Zanzibar isolates.

COI (A) was estimated using the REAL McCOIL’s categorical method (Chang et al., 2017). The mean COI for asymptomatic was greater than symptomatic infections for all regions; MAIN-A: 2.5 (2.1–2.9), MAIN-S: 1.7 (1.6–1.9), p<0.05, Wilcoxon–Mann–Whitney test and ZAN-A: 2.2 (1.7–2.8), ZAN-S: 1.7 (1.5–1.9), p=0.05, Wilcoxon–Mann–Whitney test. Fws (B) was estimated utilizing the formula, (1-Hw)/Hp, where Hw is the within-sample heterozygosity and Hp is the heterozygosity across the population. Mean Fws was less in asymptomatic than symptomatic samples; MAIN-A: 0.67 (0.6–0.7), MAIN-S: 0.85 (0.8–0.9), p<0.05, Wilcoxon–Mann–Whitney test and ZAN-A: 0.73 (0.6–0.8), ZAN-S: 0.84 (0.8–0.9), p=0.05, Wilcoxon–Mann–Whitney test. A nonparametric bootstrap was applied to calculate the mean and 95% CI from the COI and Fws values.

Drug resistance mutations did not vary between populations

The prevalence of the drug resistance genotypes was quite similar in Zanzibar and coastal Tanzania (Table 2). The frequencies of five mutations associated with sulfadoxine/pyrimethamine resistance (Pfdhfr: N51I, C59R, S108N, Pfdhps: A437G, K540E) were quite high, with prevalences at or above 0.90. Pfcrt mutations associated with chloroquine and amodiaquine resistance (M74I, N75E, K76T) were all present at approximately 0.05 prevalence (Djimdé et al., 2001; Holmgren et al., 2006). For Pfmdr1, wild-type N86 and D1246 were dominant at 0.99 prevalence, which are associated with reduced susceptibility to lumefantrine (Sisowath et al., 2005). No World Health Organization-validated or candidate polymorphism in Pfk13 associated with artemisinin resistance was found.

Table 2
Drug resistance polymorphism prevalence in Zanzibar and coastal mainland Tanzania.
MutationZanzibarMainland Tanzania
Mutant allele prevalence*CI# Genotyped samples Mutant allele prevalence*CI# Genotyped samples
Pfcrt-M74I0.0540.026–0.0981840.0000–0.034106
Pfcrt-N75E0.0540.026–0.0981840.0000–0.034106
Pfcrt-K76T0.0540.026–0.0981840.0000–0.034106
Pfdhfr-A16V0.0000–0.0211730.0000–0.032112
Pfdhfr-N51I0.9770.943–0.9941770.9640.911–0.99112
Pfdhfr-C59R0.9710.934–0.9911740.9450.884–0.98109
Pfdhfr-S108N1.0000.98–11791.0000.965–1104
Pfdhfr-S108T0.0000–0.021790.0000–0.035104
Pfdhfr-I164L0.0000–0.021840.0000–0.03798
Pfdhps-A437G1.0000.98–11821.0000.968–1115
Pfdhps-K540E0.9550.913–0.981780.9640.91–0.99111
Pfdhps-A581G0.0440.019–0.0851810.1070.058–0.175122
Pfk13-K189N0.0230.006–0.0581740.0000–0.0490
Pfk13-K189T0.0780.042–0.131660.0950.042–0.17984
Pfmdr1-N86Y0.0110.001–0.041800.0080–0.044124
Pfmdr1-Y184F0.6440.57–0.7141800.5300.435–0.624115
Pfmdr1-D1246Y0.0110.001–0.0391840.0190.002–0.067105
Pfmdr2-I492V0.4300.357–0.5061790.4070.302–0.51886
  1. *

    Prevalence was calculated as described in the ‘Methods’.

  2. 95% CI of these polymorphisms were calculated using the Pearson–Klopper method.

  3. The number of genotyped samples per loci is also shown for each polymorphism.

Discussion

In this study, we leverage high-throughput targeted sequencing using MIPs to characterize the populations and the relationships of P. falciparum isolates in Zanzibar and coastal mainland Tanzania. The parasite populations appear to be highly related to each other (Figure 1A) when evaluated using SNPs in the core genome. Interestingly, within Zanzibar, structure could be observed, with parasites closer to the main ferry terminal in Zanzibar town clustered more closely with coastal mainland parasites (Figure 1B, Figure 1—figure supplement 1) compared to parasites that were more geographically distant. This, in combination with the evidence of rapid decline of genetic relatedness with distance on the archipelago (Figure 3), is consistent with population microstructure within the island chain. This microstructure within the archipelago is supported by K-means clustering where Zanzibari isolates show higher within-cluster than between-cluster IBD (Figure 2). It is also consistent with isolates with higher IBD (Figure 4A and B) in Unguja and Pemba compared to a maximum IBD of 0.20 between Zanzibar and coastal mainland Tanzania (Figure 4—figure supplement 3) or between Unguja and Pemba (Figure 4—figure supplement 4). Parasite populations within the very low transmission region of Zanzibar may be more isolated than expected, allowing them to differentiate from each other. This may be indicative of very effective local malaria control, yet with continued micro-transmission remaining. Thus, directly targeting local malaria transmission, including the asymptomatic reservoir which contributes to sustained transmission (Barry et al., 2021; Sumner et al., 2021), may be an important focus for ultimately achieving malaria control in the archipelago (Björkman and Morris, 2020). Currently, a reactive case detection program within index case households is being implemented, but local transmission continues and further investigation into how best to control this is warranted (Mkali et al., 2023).

Despite the overall genetic similarity between archipelago populations, we did not find parasite pairs with high levels of IBD between the coastal mainland and Zanzibar, with the highest being 0.20. While this level still represents a significant amount of genetic sharing, similar to a cousin, the lack of higher levels does not allow us to identify specific importation events. This is largely due to the study design, which is based on convenience sampling, the relatively low numbers of samples, and lack of sampling from all mainland travel hubs (Bisanzio et al., 2023). Sampling was also denser in Unguja compared to Pemba. On the other hand, we see clear transmission of highly related parasites within each population (IBD > 0.99). In Zanzibar, highly related parasites mainly occur in the range of 20–30 km. These results are similar to our previous work using whole-genome sequencing of isolates from Zanzibar and mainland Tanzania, showing increased within-population IBD compared to between-population IBD (Morgan et al., 2020). The network of highly related P. falciparum parasites from six shehias in north Pemba provides an excellent example of likely recent near-clonal transmission, consistent with an outbreak (Figure 4C). A recent study investigating population structure in Zanzibar also found local population microstructure in Pemba (Holzschuh et al., 2023). Furthermore, both studies found near-clonal parasites within the same district, Micheweni, and found population microstructure over Zanzibar. Overall, given the findings of microstructure with significant local sharing of highly related strains, these small clusters still potentially drive much of the malaria transmission that occurs within the archipelago through routine human movement or mosquito travel between locales (Huestis et al., 2019). Less frequent longer distance transmission events also occur, likely due to longer range human migration within the islands.

Asymptomatic parasitemia has been shown to be common in falciparum malaria around the globe and has been shown to have increasing importance in Zanzibar (Lindblade et al., 2013; Morris et al., 2015). What underlies the biology and prevalence of asymptomatic parasitemia in very low transmission settings where antiparasite immunity is not expected to be prevalent remains unclear (Björkman and Morris, 2020). Similar to a few previous studies, we found that asymptomatic infections had a higher COI than symptomatic infections across both the coastal mainland and Zanzibar parasite populations (Collins et al., 2022; Kimenyi et al., 2022; Sarah-Matio et al., 2022). Other studies have found lower COI in severe vs. mild malaria cases (Robert et al., 1996) or no significant difference between COI based on clinical status (Conway et al., 1991; Earland et al., 2019; Kun et al., 1998; Lagnika et al., 2022; Tanabe et al., 2015). In Zambia, one study suggested that infections that cause asymptomatic infection may be genetically different from those that cause symptomatic infection (Searle et al., 2017). However, this study included samples collected over different time periods and relied on a low-density genotyping assay that only investigated the diversity of 24 SNPs across the genome. Here, based on SNPs throughout the core genome, we did not see differential clustering of asymptomatic or symptomatic infections in Zanzibar or the mainland (Figure 1A), suggesting that these parasite populations remain similar when comparing clinical status. However, this genotyping approach does not address potential variation in the many hypervariable gene families that encode genes known to be associated with pathogenesis (e.g., var, rifin, and stevor genes) and does not address differences in expression of genes associated with pathogenesis that may reflect differences in the populations. Investigation with other methods, such as long-read genome sequencing and transcriptional profiling, would be needed to address these differences.

While mutations for partial artemisinin resistance were not observed in K13, other antimalarial-resistant mutations of concern were observed. Validated drug resistance mutations linked to sulfadoxine/pyrimethamine resistance (Pfdhfr-N51I, Pfdhfr-C59R, Pfdhfr-S108N, Pfdhps-A437G, Pfdhps-K540E) were found at high prevalence (Table 2). Prevalence of polymorphisms associated with amodiaquine resistance (Pfcrt-K76T, Pfmdr1-N86Y, Pfmdr1-Y184F, Pfmdr1-D1246Y) was seen at similar proportions as previous reports (Msellem et al., 2020). The wild-type Pfmdr1-N86 was dominant in both mainland and archipelago populations, concerning reduced lumefantrine susceptibility. Although polymorphisms associated with artemisinin resistance did not appear in this population, continued surveillance is warranted given the emergence of these mutations in East Africa and reports of rare resistance mutations on the coast consistent with the spread of emerging Pfk13 mutations (Moser et al., 2021).

Overall, parasites between Zanzibar and coastal mainland Tanzania remain highly related, but population microstructure on the island reflects ongoing low-level transmission in Zanzibar, partially driven by asymptomatic infections that potentially constitute a long-term reservoir. This is likely the result of the continued pressure on the population through the implementation of effective control measures. In this study, parasite genomics allows us to parse differences in parasite populations and reveals substructure in an area of low-transmission intensity. A recent study identified ‘hotspot’ shehias, defined as areas with comparatively higher malaria transmission than other shehias, near the port of Zanzibar town and in northern Pemba (Bisanzio et al., 2023). These regions overlapped with shehias in this study with high levels of IBD, especially in northern Pemba (Figure 4). These areas of substructure represent parasites that differentiated in relative isolation and are thus important locales to target intervention to interrupt local transmission (Bousema et al., 2012). While a field cluster-randomized control trial in Kenya targeting these hotspots did not confer much reduction of malaria outside of the hotspot (Bousema et al., 2016), if areas are isolated pockets, which genetic differentiation can help determine, targeted interventions in these areas are likely needed, potentially through both mass drug administration and vector control (Morris et al., 2018; Okell et al., 2011). Such strategies and measures preventing imported malaria could accelerate progress toward zero malaria in Zanzibar.

Data availability

Parasite sequence data is available through SRA (BioProject PRJNA926345). Code used for analysis is available at: https://github.com/sconnelly007/TAN_MIP (copy archived at Connelly, 2024).

The following data sets were generated
The following previously published data sets were used

References

  1. Book
    1. Lipner E
    2. Law M
    3. Barnett E
    4. Keystone J
    5. von Sonnenburg F
    6. Loutan L
    7. Prevots D
    8. Klion A
    9. Nutman T
    (2011)
    Travel patterns and imported Plasmodium falciparum rates among Zanzibar residents
    In: Lipner E, Law M, editors. Hospitality and Health. Apple Academic Press. pp. 78–92.
  2. Report
    1. World Health Organization
    (2020)
    Report on antimalarial drug efficacy, resistance and response: 10685years of surveillance (2010-2019)
    World Health Organization.
  3. Report
    1. World Health Organization
    (2022)
    World malaria report 2022
    World Health Organization.

Article and author information

Author details

  1. Sean V Connelly

    MD-PhD Program, University of North Carolina at Chapel Hill, Chapel Hill, United States
    Contribution
    Conceptualization, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing – original draft, Writing – review and editing
    For correspondence
    sean_connelly@med.unc.edu
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-7330-7340
  2. Nicholas F Brazeau

    MD-PhD Program, University of North Carolina at Chapel Hill, Chapel Hill, United States
    Contribution
    Conceptualization, Formal analysis, Methodology, Writing – review and editing
    Competing interests
    No competing interests declared
  3. Mwinyi Msellem

    Research Division, Ministry of Health, Zanzibar, United Republic of Tanzania
    Contribution
    Resources, Data curation, Writing – review and editing
    Competing interests
    No competing interests declared
  4. Billy E Ngasala

    1. Department of Parasitology and Medical Entomology, Muhimbili University of Health and Allied Sciences, Dar es Salaam, United Republic of Tanzania
    2. Global Health and Migration Unit, Department of Women's and Children's Health, Uppsala University, Uppsala, Sweden
    Contribution
    Conceptualization, Resources, Funding acquisition, Writing – review and editing
    Competing interests
    No competing interests declared
  5. Ozkan Aydemir

    Department of Medicine, University of Massachusetts Chan Medical School, Worcester, United States
    Contribution
    Conceptualization, Data curation, Formal analysis, Writing – review and editing
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-3458-6527
  6. Varun Goel

    Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, United States
    Contribution
    Formal analysis, Writing – review and editing
    Competing interests
    No competing interests declared
  7. Karamoko Niaré

    Department of Pathology and Laboratory Medicine, Brown University, Providence, United States
    Contribution
    Formal analysis, Writing – review and editing
    Competing interests
    No competing interests declared
  8. David J Giesbrecht

    Department of Pathology and Laboratory Medicine, Brown University, Providence, United States
    Contribution
    Investigation, Writing – review and editing
    Competing interests
    No competing interests declared
  9. Zachary R Popkin-Hall

    Institute for Global Health and Infectious Diseases, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, United States
    Contribution
    Investigation, Writing – review and editing
    Competing interests
    No competing interests declared
  10. Chris Hennelly

    Institute for Global Health and Infectious Diseases, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, United States
    Contribution
    Investigation, Writing – review and editing
    Competing interests
    No competing interests declared
  11. Zackary Park

    Division of Infectious Diseases, Department of Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, United States
    Contribution
    Investigation, Writing – review and editing
    Competing interests
    No competing interests declared
  12. Ann M Moormann

    Department of Medicine, University of Massachusetts Chan Medical School, Worcester, United States
    Contribution
    Conceptualization, Funding acquisition, Writing – review and editing
    Competing interests
    No competing interests declared
  13. John M Ong'echa

    Center for Global Health Research, Kenya Medical Research Institute, Kisumu, Kenya
    Contribution
    Conceptualization, Resources, Funding acquisition, Writing – review and editing
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-3928-6774
  14. Robert Verity

    MRC Centre for Global Infectious Disease Analysis, Imperial College London, London, United Kingdom
    Contribution
    Conceptualization, Methodology, Writing – review and editing
    Competing interests
    No competing interests declared
  15. Safia Mohammed

    Zanzibar Malaria Elimination Program (ZAMEP), Zanzibar, United Republic of Tanzania
    Contribution
    Resources, Writing – review and editing
    Competing interests
    No competing interests declared
  16. Shija J Shija

    Zanzibar Malaria Elimination Program (ZAMEP), Zanzibar, United Republic of Tanzania
    Contribution
    Resources, Writing – review and editing
    Competing interests
    No competing interests declared
  17. Lwidiko E Mhamilawa

    1. Department of Parasitology and Medical Entomology, Muhimbili University of Health and Allied Sciences, Dar es Salaam, United Republic of Tanzania
    2. Global Health and Migration Unit, Department of Women's and Children's Health, Uppsala University, Uppsala, Sweden
    Contribution
    Conceptualization, Resources, Data curation, Writing – review and editing
    Competing interests
    No competing interests declared
  18. Ulrika Morris

    Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden
    Contribution
    Conceptualization, Resources, Writing – review and editing
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-4734-5754
  19. Andreas Mårtensson

    Global Health and Migration Unit, Department of Women's and Children's Health, Uppsala University, Uppsala, Sweden
    Contribution
    Conceptualization, Resources, Data curation, Supervision, Investigation, Writing – original draft, Project administration, Writing – review and editing
    Competing interests
    No competing interests declared
  20. Jessica T Lin

    Division of Infectious Diseases, Department of Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, United States
    Contribution
    Conceptualization, Supervision, Funding acquisition, Investigation, Writing – original draft, Project administration, Writing – review and editing
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-4516-723X
  21. Anders Björkman

    1. Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden
    2. Department of Global Public Health, Karolinska Institute, Stockholm, Sweden
    Contribution
    Conceptualization, Resources, Supervision, Funding acquisition, Investigation, Writing – original draft, Project administration, Writing – review and editing
    Competing interests
    No competing interests declared
  22. Jonathan J Juliano

    1. Division of Infectious Diseases, Department of Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, United States
    2. Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, United States
    3. Curriculum in Genetics and Molecular Biology, University of North Carolina at Chapel Hill, Chapel Hill, United States
    Contribution
    Conceptualization, Resources, Data curation, Supervision, Funding acquisition, Investigation, Writing – original draft, Project administration, Writing – review and editing
    Competing interests
    No competing interests declared
    Additional information
    co-senior authors
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-0591-0850
  23. Jeffrey A Bailey

    Department of Pathology and Laboratory Medicine, Brown University, Providence, United States
    Contribution
    Conceptualization, Resources, Data curation, Supervision, Funding acquisition, Investigation, Writing – original draft, Project administration, Writing – review and editing
    Competing interests
    No competing interests declared
    Additional information
    co-senior authors
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-6899-8204

Funding

National Institutes of Health (R01AI121558)

  • Jonathan J Juliano
  • Jeffrey A Bailey

National Institutes of Health (R01AI137395)

  • Billy E Ngasala
  • Zackary Park
  • Lwidiko E Mhamilawa
  • Jessica T Lin
  • Jonathan J Juliano
  • Jeffrey A Bailey

National Institutes of Health (R01AI155730)

  • Sean V Connelly
  • Billy E Ngasala
  • Varun Goel
  • Robert Verity
  • Jessica T Lin
  • Anders Björkman
  • Jonathan J Juliano
  • Jeffrey A Bailey

National Institutes of Health (F30AI143172)

  • Nicholas F Brazeau

National Institutes of Health (K24AI134990)

  • Jonathan J Juliano

Swedish Research Council

  • Ulrika Morris
  • Andreas Mårtensson
  • Anders Björkman

Erling-Persson Family Foundation

  • Anders Björkman

Yang Biomedical Scholars Fund

  • Jonathan J Juliano

Community Jameel

  • Robert Verity

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We thank the communities and participants who took part in these studies. We also thank Abebe Fola for his assistance. We acknowledge the following institutional funding - MRC Centre for Global Infectious Disease Analysis, jointly funded by the UK MRC and the UK FCDO, under the MRC/FCDO Concordat agreement and is also part of the EDCTP2 programme supported by the EU.

Ethics

All participants provided informed consent for analysis and publication. The IRBs of University of North Carolina at Chapel Hill (15-1989, 17-0166, 18-1090), Muhimbili University of Health and Allied Sciences, Zanzibar Medical Research Ethical Committee and the Regional Ethics Review Board, Stockholm, Sweden gave ethical approval for this work.

Version history

  1. Preprint posted:
  2. Sent for peer review:
  3. Reviewed Preprint version 1:
  4. Reviewed Preprint version 2:
  5. Version of Record published:

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.90173. This DOI represents all versions, and will always resolve to the latest one.

Copyright

© 2023, Connelly et al.

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 454
    views
  • 36
    downloads
  • 1
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Sean V Connelly
  2. Nicholas F Brazeau
  3. Mwinyi Msellem
  4. Billy E Ngasala
  5. Ozkan Aydemir
  6. Varun Goel
  7. Karamoko Niaré
  8. David J Giesbrecht
  9. Zachary R Popkin-Hall
  10. Chris Hennelly
  11. Zackary Park
  12. Ann M Moormann
  13. John M Ong'echa
  14. Robert Verity
  15. Safia Mohammed
  16. Shija J Shija
  17. Lwidiko E Mhamilawa
  18. Ulrika Morris
  19. Andreas Mårtensson
  20. Jessica T Lin
  21. Anders Björkman
  22. Jonathan J Juliano
  23. Jeffrey A Bailey
(2024)
Strong isolation by distance and evidence of population microstructure reflect ongoing Plasmodium falciparum transmission in Zanzibar
eLife 12:RP90173.
https://doi.org/10.7554/eLife.90173.3

Share this article

https://doi.org/10.7554/eLife.90173

Further reading

    1. Epidemiology and Global Health
    Xiaoning Wang, Jinxiang Zhao ... Dong Liu
    Research Article

    Artificially sweetened beverages containing noncaloric monosaccharides were suggested as healthier alternatives to sugar-sweetened beverages. Nevertheless, the potential detrimental effects of these noncaloric monosaccharides on blood vessel function remain inadequately understood. We have established a zebrafish model that exhibits significant excessive angiogenesis induced by high glucose, resembling the hyperangiogenic characteristics observed in proliferative diabetic retinopathy (PDR). Utilizing this model, we observed that glucose and noncaloric monosaccharides could induce excessive formation of blood vessels, especially intersegmental vessels (ISVs). The excessively branched vessels were observed to be formed by ectopic activation of quiescent endothelial cells (ECs) into tip cells. Single-cell transcriptomic sequencing analysis of the ECs in the embryos exposed to high glucose revealed an augmented ratio of capillary ECs, proliferating ECs, and a series of upregulated proangiogenic genes. Further analysis and experiments validated that reduced foxo1a mediated the excessive angiogenesis induced by monosaccharides via upregulating the expression of marcksl1a. This study has provided new evidence showing the negative effects of noncaloric monosaccharides on the vascular system and the underlying mechanisms.

    1. Epidemiology and Global Health
    2. Microbiology and Infectious Disease
    Amanda C Perofsky, John Huddleston ... Cécile Viboud
    Research Article

    Influenza viruses continually evolve new antigenic variants, through mutations in epitopes of their major surface proteins, hemagglutinin (HA) and neuraminidase (NA). Antigenic drift potentiates the reinfection of previously infected individuals, but the contribution of this process to variability in annual epidemics is not well understood. Here, we link influenza A(H3N2) virus evolution to regional epidemic dynamics in the United States during 1997—2019. We integrate phenotypic measures of HA antigenic drift and sequence-based measures of HA and NA fitness to infer antigenic and genetic distances between viruses circulating in successive seasons. We estimate the magnitude, severity, timing, transmission rate, age-specific patterns, and subtype dominance of each regional outbreak and find that genetic distance based on broad sets of epitope sites is the strongest evolutionary predictor of A(H3N2) virus epidemiology. Increased HA and NA epitope distance between seasons correlates with larger, more intense epidemics, higher transmission, greater A(H3N2) subtype dominance, and a greater proportion of cases in adults relative to children, consistent with increased population susceptibility. Based on random forest models, A(H1N1) incidence impacts A(H3N2) epidemics to a greater extent than viral evolution, suggesting that subtype interference is a major driver of influenza A virus infection ynamics, presumably via heterosubtypic cross-immunity.