Common host variation drives malaria parasite fitness in healthy human red cells

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

The replication of Plasmodium falciparum parasites within red blood cells (RBCs) causes severe disease in humans, especially in Africa. Deleterious alleles like hemoglobin S are well-known to confer strong resistance to malaria, but the effects of common RBC variation are largely undetermined. Here, we collected fresh blood samples from 121 healthy donors, most with African ancestry, and performed exome sequencing, detailed RBC phenotyping, and parasite fitness assays. Over one-third of healthy donors unknowingly carried alleles for G6PD deficiency or hemoglobinopathies, which were associated with characteristic RBC phenotypes. Among non-carriers alone, variation in RBC hydration, membrane deformability, and volume was strongly associated with P. falciparum growth rate. Common genetic variants in PIEZO1, SPTA1/SPTB, and several P. falciparum invasion receptors were also associated with parasite growth rate. Interestingly, we observed little or negative evidence for divergent selection on non-pathogenic RBC variation between Africans and Europeans. These findings suggest a model in which globally widespread variation in a moderate number of genes and phenotypes modulates P. falciparum fitness in RBCs.

Introduction

Malaria caused by the replication of Plasmodium falciparum parasites in red blood cells (RBCs) kills hundreds of thousands of children each year (WHO, 2019). In each 48-hr cycle of blood-stage malaria, parasites must deform RBC membranes to invade them (Koch, 2017; Kariuki et al., 2020); consume hemoglobin and tolerate the resulting oxidative stress (Francis et al., 1997); multiply to displace half the RBC volume (Hanssen et al., 2012); and remodel the RBC membrane to avoid immune detection (Zhang, 2015). Consequently, genetic disorders that alter aspects of RBC biology are well-known to influence malaria susceptibility (Kwiatkowski, 2005). For example, sickle cell trait impairs parasite growth by altering hemoglobin polymerization at low oxygen tension (Pasvol et al., 1978; Archer et al., 2018), while deficiency of the G6PD enzyme involved in oxidative stress tolerance is thought to make parasitized RBCs more susceptible to breakdown (Ruwende and Hill, 1998). Aside from these diseases, however, the genetic basis of RBC susceptibility to malaria remains mostly unknown.

Large genome-wide association studies (GWAS) have identified a few dozen loci that collectively explain up to 11% of the heritability of the risk of severe versus uncomplicated malaria (Timmann et al., 2012; Malaria Genomic Epidemiology Network, 2014; Band et al., 2015; Leffler et al., 2017; Ndila et al., 2018; Malaria Genomic Epidemiology Network, 2019). About 10 of the highest-confidence GWAS signals, including 6 loci known from earlier methods (Allison, 1954; Field et al., 1994; Ruwende and Hill, 1998; Lell et al., 1999; Rowe, 2007; Cao and Galanello, 2010; Galanello and Cao, 2011), are in or near genes expressed predominantly in RBCs. One new GWAS variant has since been shown to regulate expression of the ATP2B4 calcium channel (Zámbó et al., 2017) and to be associated with RBC dehydration (Li et al., 2013b), although a functional link between ATP2B4 and P. falciparum replication has yet to be demonstrated. Additional GWAS discoveries of RBC variation important for malaria are not expected without massive increases in sample size (Boyle et al., 2017; Malaria Genomic Epidemiology Network, 2019), in part because of the large number of hypotheses tested. Severe malaria is a complex phenotype that combines many factors from RBCs, the vascular endothelium, the immune system, the parasite, and the environment (Mackinnon et al., 2005; de Mendonça et al., 2012). Alternate approaches are therefore needed to discover more genetic variation that impacts the replication of malaria parasites in human RBCs.

Heritable RBC phenotypes like mean cell volume (MCV), hemoglobin content (HGB/MCH), and antigenic blood type vary widely within and between human populations (Whitfield et al., 1985; Evans et al., 1999; Pilia et al., 2006; Lo et al., 2011; Cooling, 2015; Canela-Xandri et al., 2018). Large GWAS conducted mostly in Europeans have demonstrated that many blood cell phenotypes are shaped by hundreds of small-effect loci distributed throughout the genome, consistent with polygenic or omnigenic models of complex trait genetics (van der Harst et al., 2012; Astle et al., 2016; Chami et al., 2016; Boyle et al., 2017; Chen et al., 2020; Vuckovic et al., 2020). Certain blood phenotypes like average hemoglobin levels, hematocrit, and RBC membrane fragility are also known to differ between African and European populations, although the differences are typically small in magnitude (Garn, 1981; Perry et al., 1992; Beutler and West, 2005; Kanias et al., 2017; Page et al., 2021). This variation across populations can largely be explained by a few RBC disease alleles that have been widely selected across Africa for their protective effects on malaria (Beutler and West, 2005; Lo et al., 2011; Kanias et al., 2017). Despite the importance of these population-specific variants, a much larger number of common variants with small individual effects on RBC phenotypes are expected to be globally widespread (Biddanda et al., 2020; Chen et al., 2020). It remains untested whether this extensive phenotypic and genetic diversity in RBCs influences malaria susceptibility and, if so, whether it has been shaped by malaria selection.

Here, we approach these questions by performing exome sequencing and extensive RBC phenotyping on blood samples from a diverse human cohort of 122 individuals. We show that P. falciparum fitness varies widely among donor cells in vitro, with the distribution of parasite phenotypes in ‘healthy’ RBCs overlapping those from RBCs carrying classic disease alleles. We apply LASSO variable selection to identify a small set of genes and phenotypes that strongly predict parasite fitness outside of the context of RBC disease, highlighting RBC dehydration and membrane properties as key to modulating P. falciparum fitness. We find little evidence that non-pathogenic alleles or phenotypes that confer parasite protection are associated with African ancestry, perhaps because P. falciparum is susceptible to RBC variation that exists for other selective or demographic reasons. Overall, these findings advance our understanding of the origin and function of common RBC variation and suggest new targets for therapeutic intervention for malaria.

Results

Many healthy blood donors with African ancestry carry alleles for RBC disease

We collected blood samples from 121 donors with no known history of blood disorders, most of whom self-identified as having recent African ancestry (Figure 1A). As a positive control, we also sampled a patient with hereditary elliptocytosis (HE), a polygenic condition characterized by extremely fragile RBC membranes that strongly inhibit P. falciparum growth (Schulman et al., 1990; Facer, 1995; Dhermy et al., 2007; Gallagher, 2013). We performed whole-exome sequencing (Figure 1—source data 1), both to check for the presence of known RBC disease alleles and to confirm the population genetic ancestry of our donors. A principal component analysis of more than 35,000 exomic single-nucleotide polymorphisms (SNPs) showed that most donors fell along a continuum from African to European ancestry, as defined by data from the 1000 Genomes Project (Figure 1A). Pairwise kinship coefficients demonstrated that all donors were unrelated, apart from a six-member family with unique ancestry (Figure 1A, light borders). We found that 16% of the healthy donors carried pathogenic hemoglobin alleles (Figure 1B), including 5 heterozygotes for hemoglobin S (HbAS), 4 heterozygotes for hemoglobin C (HbAC), and 11 individuals with one or two copies of an HBA2 deletion causing α-thalassemia (Galanello and Cao, 2011). We also scored eight polymorphisms in G6PD that have been functionally associated with various degrees of G6PD deficiency (Yoshida et al., 1971; Clarke et al., 2017) and found that 32% of the study population carried at least one, including 12 of the 20 donors with hemoglobinopathies. Among those with wild-type hemoglobin, we identified 1 individual with polymorphisms associated with severe G6PD deficiency (>60% loss of function) and 23 with polymorphisms associated with mild to medium deficiency (<42% loss of function). We detected no alleles linked to other monogenic RBC disorders, including β-thalassemia or xerocytosis (Cao and Galanello, 2010; Glogowska et al., 2017). We therefore classified the remaining 68 unrelated donors as ‘non-carriers’ of known disease alleles for the purposes of this work.

Figure 1

Download asset Open asset

Overview of blood donors and study design.

(A) PCA of genetic variation across 35,759 unlinked exome SNPs. Donors from this study are plotted on coordinate space derived from 1000 Genomes reference populations. Points with white borders represent six related individuals, five of whom were excluded from the study. All exome variants passing quality filters are available in Figure 1—source data 1. (B) Over a third of donors carried alleles for RBC disorders linked to *Plasmodium falciparum* resistance. Individuals with >1 disease allele were classified by their most severe condition. non-carrier: Donor without any of the following alleles or conditions. G6PD⁻_low: Mild to medium G6PD deficiency (<42% loss of function). G6PD⁻_high: Severe G6PD deficiency (>60% loss of function). −α/αα: heterozygous HBA2 deletion, or alpha thalassemia minima. −α/−α: homozygous HBA2 deletion, or α-thalassemia trait. HbAC: heterozygous HBB:E7K, or hemoglobin C trait. HbAS: heterozygous HBB:E7V, or sickle cell trait. HE: hereditary elliptocytosis. (C) Two components of *P. falciparum* fitness were measured with flow cytometry at three timepoints. Invasion is the change in parasitemia as schizonts egress from maintenance RBCs (green) and invade fresh acceptor RBCs from the blood donors (purple). Growth is the multiplication rate from a complete parasite cycle in the fresh acceptor RBCs. (D) RBC phenotypes were measured using complete blood counts with RBC indices, osmotic fragility tests, and ektacytometry on fresh samples. This figure was partially created with Biorender.com. RBC, red blood cell; SNP, single-nucleotide polymorphism.

Figure 1—source data 1 Individual genotypes, population frequencies, and protein annotations for exome variants passing quality filters (N~160,000).: https://cdn.elifesciences.org/articles/69808/elife-69808-fig1-data1-v2.zip
Download elife-69808-fig1-data1-v2.zip

P. falciparum replication rates vary widely among non-carrier RBCs

To determine the variation in P. falciparum fitness among samples with different genotypes, we performed invasion and growth assays with two parasite strains. The genome reference strain 3D7, which was originally isolated from a European, has been continuously cultured in academic labs for at least 40 years (Walliker et al., 1987; Moser, 2020). Th.026.09 is a drug-resistant strain collected from Senegal in 2009 that is minimally adapted to lab culture (Daniels et al., 2012). These divergent strains were selected in an attempt to balance biological realism with reliable in vitro data.

We observed a wide range of P. falciparum growth rates among RBC samples, especially among non-carriers that lacked known disease alleles (Figure 2A–C). Each strain’s growth rate is defined here as parasite multiplication over a full 48-hr cycle in donor RBCs (Figure 1C), with the mean value for non-carriers set to 100% after normalization. Briefly, we used a repeated control RBC sample (Figure 2, gray points) and other batch-specific factors to correct for variation in parasite growth across multiple experiments (Figure 2—figure supplement 1). Among non-carriers, growth rates ranged from 64% to 136% for 3D7 (SD=17.7%) and 76% to 128% for Th.026.09 (SD=10.6%) (Figure 2A–B). Per-sample growth rates were strongly correlated between the two strains (Figure 2C, R²=0.69, p<3×10^–16) and positively correlated when measured in different weeks (p=0.35, Figure 2—figure supplement 2), demonstrating that these data capture meaningful variation among donor RBCs. Furthermore, as expected (Friedman, 1978; Ifediba et al., 1985; Greene, 1993; Facer, 1995), we detected reductions in mean growth rate for both strains in RBCs carrying known disease alleles. These included individuals with α-thalassemia trait (3D7 p=0.027; Th.026.09 p=0.077), HbAS (3D7 p=1.05×10^–7; Th.026.09 p=1.2×10^–4), and the single carriers of HE and severe G6PD deficiency. Notably, the wide distribution of growth rates for non-carrier RBCs had considerable overlap with the growth rates in carrier RBCs. Only the HbAS and HE samples fell entirely outside the non-carrier range. This observation implies the existence of previously unknown RBC variation that impacts P. falciparum growth, which may have cumulative effect sizes comparable to known disease alleles.

Figure 2 with 2 supplements see all

Download asset Open asset

*Plasmodium falciparum* replication rate varies widely among donor RBCs.

(**A, B**) Growth of *P. falciparum* lab strain 3D7 (A) or clinical isolate Th.026.09 (B) over a full 48-hr cycle in donor RBCs (see Figure 1C). Growth is presented relative to the average non-carrier rate after correction for batch effects (Figure 2—figure supplement 1; see Materials and methods), including comparison to a repeated RBC control shown in gray. Each carrier group was compared to unrelated non-carriers using Student’s t-test, except in cases where N=1, where asterisks instead indicate the percentile of the non-carrier distribution. Repeated measurements of 11 donors are shown in Figure 2—figure supplement 2. (C) Per-sample growth rates are correlated between the two *P. falciparum* strains. (**D–F**) As in (**A–C**) but for *P. falciparum* invasion efficiency (see Figure 1C). R² and p-values are derived from OLS regression. *p<0.1; **p<0.05; *******p<0.01. RBC, red blood cell.

We observed a similarly wide range in the efficiency of P. falciparum invasion into donor RBCs (Figure 2D–F). Invasion is defined here as the fold-change in parasitemia over the first 24 hours of the assay, when parasites previously maintained in standard culture conditions egressed and invaded new donor RBCs (Figure 1C). Among non-carriers, invasion rates ranged from 70% to 143% for 3D7 (SD=14.9%) and 41% to 193% for Th.026.09 (SD=29.1%) (Figure 2D–E). Compared to growth rates, no disease alleles conferred protection against invasion that was extreme enough to fall outside the broad non-carrier range. HbAC was associated with an 11% decrease in 3D7 invasion (p=0.008), while α-thalassemia trait was associated with a 22% increase in Th.026.09 invasion (p=0.091). Only HE had a strong effect on the invasion efficiency of both strains. The correlation of invasion efficiencies between strains was weaker than for growth (Figure 2F, R²=0.10, p=6×10^–4), potentially reflecting strain-specific differences in the pathways used for invasion (Wright and Rayner, 2014). However, we also observed greater batch effects (Figure 2—figure supplement 1) and greater variability between repeated samples (Figure 2—figure supplement 2) for invasion than for growth, suggesting that invasion is influenced by greater experimental noise.

RBC phenotypes vary widely among non-carriers

To assess phenotypic variation across donor RBCs, we measured 22 common indices of RBC size and hemoglobin content from complete blood counts using an ADVIA hematology analyzer (Figure 3A–D; Figure 3—figure supplements 1–2). Mean cellular volume (MCV) and hemoglobin mass (MCH) are closely related traits, which can be represented together as cellular hemoglobin concentration (CHCM) or the fraction of RBCs with ‘normal’ hemoglobin and volume indices (M5). As expected, each known disease allele was associated with a distinct set of RBC abnormalities (Figure 3—figure supplement 3). These included elevated CHCM for HbAC (p=0.033), consistent with dehydration, and very low MCV (p=6.8×10^–5) and MCH (p=2.5×10^–7) for α-thalassemia trait (−α/−α), consistent with microcytic anemia (Galanello and Cao, 2011). RBCs from the HE patient also had very low MCV and MCH, reflecting the membrane breakage and volume loss characteristic of this disease. For all these phenotypic measures, we also observed broad distributions in non-carriers that overlapped the distributions of most carriers (Figure 3A–D; Figure 3—figure supplements 1–2). Notably, the breadth of the non-carrier distribution for each phenotype was large (e.g., 24 fl range for MCV) compared to the average difference between Africans and Europeans (e.g., 3–5 fl; Beutler and West, 2005; Lo et al., 2011). This wide diversity and substantial overlap between non-carrier and carrier traits suggest that healthy RBCs exist on the same phenotypic continuum as RBCs carrying known disease alleles.

Figure 3 with 5 supplements see all

Download asset Open asset

Red cell phenotypes that are abnormal in carriers also vary widely among non-carriers.

(**A–D**) Red cell indices were measured by an ADVIA hematology analyzer. Additional indices are shown in Figure 3—figure supplement 1. MCV: mean corpuscular (RBC) volume; MCH: mean cellular hemoglobin; CHCM: cellular hemoglobin concentration; M5: fraction of RBCs with normal volume and normal hemoglobin (see Figure 3—figure supplement 2). Statistical tests as in Figure 2. (**E, F**) Osmotic fragility curves. Fragility is defined as the NaCl concentration at which 50% of RBCs lyse (see Figure 3—figure supplement 4). (**G, H**) Ektacytometry curves characterize RBC deformability and dehydration under salt stress (Figure 3—figure supplement 5). A heatmap of all phenotypes by carrier status is available in Figure 3—figure supplement 3. RBC, red blood cell.

We observed similar patterns of variation in RBC membrane fragility (Figure 3E–F; Figure 3—figure supplement 4) and membrane deformability (Figure 3G–H; Figure 3—figure supplement 5), as measured with osmotic fragility tests and osmotic gradient ektacytometry. Both sets of curves represent RBC tolerance to osmotic stress, which can result in swelling and lysis (fragility, Figure 3E–F) or dehydration and decreased deformability (O_hyper, Figure 3G–H). Specific hemoglobinopathies were associated with moderate to strong reductions in fragility, deformability, and/or resistance to loss of deformability when dehydrated (Figure 3—figure supplements 3–5). HE cells were both extremely fragile and extremely non-deformable. In non-carriers, the distributions for all membrane measures were wide, continuous, and overlapped the distribution of most carriers (Figure 3E–H). Overall, these data demonstrate that multiple phenotypic alterations associated with RBC disease alleles are also present in non-carrier RBCs.

Non-carrier variation in RBC phenotypes predicts P. falciparum replication rate

To identify sets of phenotypes associated with P. falciparum replication in non-carrier RBCs, we used a machine learning method called LASSO (Least Absolute Shrinkage and Selection Operator) that performs regularization and variable selection (Tibshirani, 1994). Briefly, LASSO shrinks the regression coefficients for some possible predictors to zero to obtain a subset of predictors (in this case, phenotypes) that minimizes prediction error. This method is well-suited for data sets where predictors are correlated, as are RBC size, hemoglobin, and membrane dynamics; and for cases where the number of possible predictors is large compared to the number of measurements. To validate RBC phenotypes associated with P. falciparum replication by LASSO, we performed k-folds cross-validation (CV) on train and test sets derived from 10,000 divisions of the non-carrier data in 10-folds (see Materials and methods). To further control for overfitting, we also applied the same procedure to 1000 random permutations of the parasite data. Finally, for each trait selected by LASSO in at least 40% of training sets, we applied univariate OLS regression to estimate the sign of its effect on all measured components of parasite fitness. The highest-confidence results from this analysis are summarized in Figure 4A, with complete details provided in Figure 4—source data 1.

Figure 4 with 1 supplement see all

Download asset Open asset

RBC phenotypes predict *Plasmodium falciparum* fitness in non-carriers.

(A) Phenotypes selected by LASSO in at least 40% of train data sets (blue shading; see Materials and methods) in at least one of four models of parasite replication (columns). Each model was trained on ~90% of the data (**B, C**) and tested on the remaining 10% (**B, C**). (+/−) shows the direction of effect if the phenotype was significantly correlated (p<0.1) with the parasite fitness component in a separate, univariate linear model (Figure 4—figure supplement 1; ). MCV: mean RBC volume (fl).MCH: mean corpuscular hemoglobin (pg/RBC). O₅₀: Osmotic fragility (mM NaCl; see Figure 3—figure supplement 4). DI_max: Maximum membrane deformability (arbitrary units; see Figure 3—figure supplement 5). O_hyper: Tendency to resist osmotic dehydration and loss of deformability. M4: fraction of RBCs with normal volume and low hemoglobin (see Figure 3—figure supplement 2). M6: fraction of RBCs with normal volume and high hemoglobin. M8: fraction of RBCs with low volume and normal hemoglobin. CHCM: cellular hemoglobin concentration mean (g/dl). MCHC: mean corpuscular hemoglobin concentration (g/dl). PLT: platelet number (×10³/µl). MPV: mean platelet volume (fl). RBC: red cell number (×10⁶/µl). HCT: hematocrit, or the fraction of blood volume composed of RBCs. RDW: red cell distribution width (%). (**B, C**) Variance in parasite fitness explained by RBC phenotypes in LASSO models. Dashed lines indicate average R² for the measured test data. Each histogram shows the same procedure on 1000 permutations of the measured test data. RBC, red blood cell.

Figure 4—source data 1 Association statistics for individual phenotypic predictors with non-zero LASSO support.: https://cdn.elifesciences.org/articles/69808/elife-69808-fig4-data1-v2.xlsx
Download elife-69808-fig4-data1-v2.xlsx

P. falciparum fitness in non-carrier RBCs was strongly predicted by variation in traits related to volume, hemoglobin, deformability, and dehydration (Figure 4A). Among 25 tested phenotypes, the most strongly predictive was the ektacytometry parameter O_hyper, which represents a cell’s tendency to retain deformability in the face of dehydration (Figure 3—figure supplement 4). In univariate models, non-carrier RBCs with the largest O_hyper values—that is, those that retained more deformability when dehydrated—supported 22–46% faster parasite growth (3D7 p=0.003, Th.026.09 p=0.007) and 31–83% more effective invasion (3D7 p=0.008, Th.026.09 p=0.005) than RBCs with the smallest O_hyper values, which quickly lost deformability. Consistent with this result, P. falciparum replication was inhibited in RBCs that were more dehydrated at baseline (e.g., with higher CHCM; 3D7 invasion p=0.006, 3D7 growth p=0.024, Th.026.09 invasion p=0.004, Th.026.09 growth p=0.26). Parasites also grew faster in RBCs with larger mean volume (MCV; 3D7 p=0.001, Th.026.09 p=0.0009); a greater mass of hemoglobin per cell (MCH; 3D7 p=0.071, Th.026.09 p=0.016); and more deformable membranes (DI_max; 3D7 p=0.0005, Th.026.09 p=0.008). 3D7 growth was also reduced in RBCs with more fragile membranes (O₅₀; p=0.005). Additional phenotypes related to platelets and RBC density were selected for some models, but the direction of their effects was unclear when they were considered individually (Figure 4A, Figure 4—source data 1). These results indicate that common, non-pathogenic variation in RBC size, membrane dynamics, and other correlated traits have meaningful effects on P. falciparum replication rate in RBCs.

Taken together, the non-carrier phenotypes selected by LASSO from training data (N~61, Figure 4A) explained 3–9% of the variation in parasite growth in separate test data (N~7, Figure 4B; 3D7 p=0.008 and RMSE=18.0%; Th.026.09 p=0.079 and RMSE=10.8%). This fraction was significantly greater than expected from random permutations, which were centered on R²=0 in the test data (Figure 4B). Notably, prediction error was greater for individuals with parasite growth values farther away from the mean. In contrast, for invasion, RBC phenotypes did not explain more variation in the test data than expected from permutations (Figure 4C; 3D7 p=0.79 and RMSE=15.0%; Th.026.09 p=0.53 and RMSE=29.5%). All phenotype models were less predictive for the clinical isolate Th.026.09 than the lab strain 3D7, perhaps because clinical isolates are less adapted to laboratory conditions. Overall, these results demonstrate that multiple, variable phenotypes impact P. falciparum susceptibility in healthy RBCs. Non-carrier cells that are less hospitable to parasites share specific traits with RBCs that carry disease alleles, including smaller size, decreased deformability, and an increased tendency to lose deformability when dehydrating.

Common RBC alleles predict P. falciparum replication in non-carriers

Next, we tested whether non-carrier genotypes derived from exome sequencing could improve our predictions of P. falciparum replication rate. With a sample of 68 unrelated non-carriers, we lacked the power to perform the many thousands of tests that are typical in large genetic association studies (Fadista et al., 2016). Instead, our study design focused on 23 RBC proteins previously associated with malaria (Figure 5—source data 1), which we hypothesized are enriched for common variants impacting P. falciparum fitness, as compared to random control sets of RBC proteins. We used the same LASSO procedure described above to test 106 unlinked genetic variants (pairwise r²<0.1) in these 23 RBC proteins, along with RBC phenotypes, for association with P. falciparum fitness in non-carriers. To test for the effects of population structure, we also included the top 10 principal components (PCs) from 1000 Genomes as possible predictors. Notably, PC1 is equivalent to the exome-wide fraction of African ancestry, as determined by ADMIXTURE with K=4 from the 1000 Genomes reference populations (see Materials and methods). We again compared these results to permuted data, as well as to 1,000 sets of 23 genes drawn at random from the RBC proteome (Figure 5—source data 2).

Taken together, genotypes and phenotypes selected by LASSO explained 7–15% of the variation in parasite growth rate in the test data (Figure 5B; 3D7 p=0.012 and RMSE=16.5%; Th.026.09 p=0.063 and RMSE=10.6%). Prediction error was greater for donors with parasite values farther away from the mean, though this trend was weaker than for phenotype-only models. The variance explained by models using real genotype and phenotype data was significantly larger than expected from permutation (Figure 5—figure supplement 1A) and random sets of RBC genes (Figure 5B), suggesting that the 23 malaria-related genes contain variation that influences P. falciparum development.

Figure 5 with 6 supplements see all

Download asset Open asset

Common variation in malaria-associated genes predicts *Plasmodium falciparum* fitness in non-carrier RBCs.

(A) Variants in 23 malaria-related genes (Figure 5—source data 1) and genetic PCs selected by LASSO in at least >40% of train data sets. Each model was trained on ~90% of the measured data (**B C**) and tested on the remaining 10% (**B C**). The following genes had no associated variants in non-carriers: *CD55, EPB41, FPN, G6PD, GYPA, GYPE, HBA1/2, HBB,* and HP. *The only significant PC association was driven by a single East Asian donor (Figure 5—figure supplement 5). (**B, C**) Variance in parasite fitness explained by LASSO models including 23 malaria-related genes, the top 10 PCs, and RBC phenotypes. Dashed lines indicate average R² for models using the measured test data. Each histogram shows R² for models including variants from 23 random genes in the RBC proteome (Figure 5—source data 2) instead of malaria-related genes. All predictors with non-zero LASSO support are shown in Figure 5—source data 3. Additional histograms from permuted data are shown in Figure 5—figure supplement 1. The variance explained by variants undiscovered by previous GWAS is shown in Figure 5—figure supplement 4. GWAS, genome-wide association studies; PC, principal component; RBC, red blood cell.

Figure 5—source data 1 Twenty-three RBC genes with strong links to malaria in the literature.: https://cdn.elifesciences.org/articles/69808/elife-69808-fig5-data1-v2.xlsx
Download elife-69808-fig5-data1-v2.xlsx
Figure 5—source data 2 Proteins present in mature RBCs. This list was derived from the Red Blood Cell Collection database (rbcc.hegelab.org) using a medium-confidence filter.: https://cdn.elifesciences.org/articles/69808/elife-69808-fig5-data2-v2.csv
Download elife-69808-fig5-data2-v2.csv
Figure 5—source data 3 All genetic and phenotypic predictors with non-zero LASSO support. Growth predictors selected in at least 40% of train data sets are indicated in bold. Genetic predictors are summarized in Figure 5A. NA indicates predictors that were only present as singletons in the smaller invasion data set.: https://cdn.elifesciences.org/articles/69808/elife-69808-fig5-data3-v2.xlsx
Download elife-69808-fig5-data3-v2.xlsx

Nearly all of the 32 polymorphisms selected by LASSO in growth models occurred in (1) ion channel proteins, which regulate RBC hydration; (2) components of the flexible RBC membrane backbone; or (3) red cell plasma membrane proteins, including known invasion receptors (Figure 5A). In the first category, the highly polymorphic ion channel PIEZO1 contained seven polymorphisms associated with small (<3.7%) to moderate (31%) reductions of P. falciparum growth rate. In practice, the smallest effect size that could be reliably determined for an allele with our data was ±3.7% (Figure 5—source data 3). The microsatellite variant PIEZO1-E756del, which has been a focus of several recent studies (Ilboudo et al., 2018; Ma et al., 2018; Rooks et al., 2019; Nguetse et al., 2020), predicted a moderate reduction in Th.026.09 growth (–7.9%, p=0.01) but was not related to RBC dehydration in these data (Figure 5—figure supplement 2). For 3D7, we also detected one growth-associated variant in ATP2B4 (–5.9%, p=0.075), which encodes the primary RBC calcium channel PMCA4b. This variant tags an ATP2B4 haplotype implicated by GWAS in protection from severe malaria and many RBC phenotypes (van der Harst et al., 2012; Li et al., 2013b; Lessard et al., 2017; Lin et al., 2020, Timmann et al., 2012, Zámbó et al., 2017). Notably, however, this variant has never before been functionally demonstrated to be associated with P. falciparum fitness.

SPTA1 and SPTB, which encode the flexible spectrin backbone of RBCs, contained several variants associated with the growth of at least one P. falciparum strain, as did the structural linker genes ANK1, SLC4A1, and EPB42 (Figure 5A). We also identified a total of 10 polymorphisms in ABCB6, GYPB, GYPC, CR1, CD44, and basigin (BSG) that were associated with P. falciparum growth. These plasma membrane proteins have all been previously implicated in P. falciparum invasion by genetic deficiency studies (Mayer et al., 2009; Crosnier et al., 2011; Egan et al., 2015; Egan et al., 2018), and in some cases, studies of natural polymorphisms (Nagayasu et al., 2001; Leffler et al., 2017). Notably, two of the variants identified here are synonymous quantitative trait loci (QTL) for CD44 splicing (rs35356320) and BSG expression (rs4682) (GTEx Consortium et al., 2017), further supporting the possibility that they are functional. No associated variants were detected in the other 10 tested genes, including 3 hemoglobin proteins, G6PD, 2 glycophorins, CD55, EPB41, FPN, and HP. Taken together, these data demonstrate that dozens of host genetic variants shape the phenotypic distribution of red cell susceptibility to P. falciparum in non-carriers.

Eighteen of the 32 variants selected by LASSO were synonymous, which was not significantly different from the input set of 106 variants (p=0.72, two-sided binomial test). Over half of the growth-associated variants have previously been associated with gene expression traits, GWAS traits, or GWAS loci through linkage (Figure 5—source data 3), suggesting that they indeed tag functional polymorphisms. Novel variants nonetheless contribute substantially to the predictive power of these models (Figure 5—figure supplement 4), and nearly all the variants are novel in terms of association with P. falciparum growth rate.

In contrast to growth, models of invasion that included genotypic predictors were no more accurate than expected by chance (Figure 5C, p≥0.3; Figure 5—figure supplement 1B, p≥0.15). However, six of the nine RBC invasion receptors contained variants associated with growth (Figure 5A), including a SNP in glycophorin B (GYPB) that has been linked to malaria risk in Brazil (Tarazona-Santos et al., 2011). These patterns likely stem from experimental noise in our measure of invasion (Figure 1C; Figure 2C and F; Figure 2—figure supplements 1–2), though we note that our definition of growth involves a re-invasion event (Figure 1C).

No PCs of population structure were significantly associated with P. falciparum growth rate (Figure 5—source data 3), including the PC that distinguishes Africans from other populations (PC1, Figure 1A). One PC was selected by LASSO for 3D7 growth, but this association was driven by a single donor with East Asian ancestry and relatively high susceptibility (Figure 5—figure supplement 5). We note that the unique ancestry and extreme phenotypes of the six-member family (Figure 5—figure supplement 6) would have driven additional correlations if family members had not been excluded from the LASSO models. Although the present study is limited by sample size, these associations between global genetic PCs and P. falciparum growth suggest that additional functional variants remain to be discovered in many populations.

African ancestry does not predict P. falciparum resistance in red cells

Based on evidence from balanced disease alleles like HbAS, it has been suggested that anti-malarial selection has shaped polygenic red cell phenotypes in African populations as a whole (Goheen et al., 2016; Kanias et al., 2017; Ma et al., 2018; Page et al., 2021). We tested this hypothesis by examining the correlation between African ancestry and P. falciparum fitness in non-carrier RBCs (Figure 6A–D). Surprisingly, we found no evidence that these traits were related, apart from a positive relationship between African ancestry and invasion rate of Th.026.09, the clinical Senegalese strain (p=0.004, R²=0.13, Figure 6D). To understand this result, we next examined how key RBC phenotypes identified in this study (Figure 4A) vary with African ancestry (Figure 6F–H; Figure 6—figure supplement 1). We found that greater African ancestry predicts reduced osmotic fragility (p=1.2×10^–6), reduced RBC dehydration (CHCM p=0.009; MCHC p=0.089), and a greater fraction of ‘overhydrated’ RBCs with normal volume and low hemoglobin (M4 p=0.041). All of these traits actually predict greater red cell susceptibility to P. falciparum (Figure 4A), although together they explain less than 13% of the non-carrier variation in 3D7 growth. The remaining key phenotypes do not vary with African ancestry, which may explain why African ancestry itself is only weakly associated with P. falciparum fitness in non-carrier RBCs (Figure 6A–D).

Figure 6 with 1 supplement see all

Download asset Open asset

Little evidence of widespread selection in Africa for slower *Plasmodium falciparum* replication, protective alleles, or protective phenotypes in non-carriers.

(**A–D**) Parasite replication versus the exome-wide fraction of African ancestry in non-carriers, determined with ADMIXTURE by comparison to 1000 Genomes reference populations. R² and p-values are shown for OLS regression. (E) Alleles in 23 malaria-related genes that predict slower *P. falciparum* growth in non-carriers (Figure 5A) are not enriched for higher frequencies in Africa versus Europe. Effect sizes are shown for one allele copy for 3D7 or Th.026.09 growth, whichever was greater. Effect sizes were determined from additive models except for three alleles that appeared overdominant (Figure 5—figure supplement 3). F_ST was calculated from African and European samples in gnomAD (see Materials and methods). HbAS and the HBA2 deletion are shown for comparison. (**F–H**) RBC phenotypes associated with *P. falciparum* growth versus the exome-wide fraction of African ancestry in non-carriers. Slower *P. falciparum* growth in RBCs is predicted by greater fragility (F), greater dehydration (G), and lower O_hyper (H) (Figure 4A). Additional phenotypes are shown in Figure 6—figure supplement 1.

Next, we used allele frequency data from over 54,000 individuals in the gnomAD collection (Karczewski et al., 2020) to test whether the polymorphisms we associated with P. falciparum growth occur at different frequencies in African and European populations. Geographical differences in malaria selection are sometimes hypothesized to have increased the frequency of hundreds or thousands of undiscovered anti-malarial alleles in Africa (Mackinnon et al., 2005; Williams, 2006), as has been shown for several variants causing common RBC disorders (Kariuki and Williams, 2020). To address this hypothesis for non-carrier variation, we calculated F_ST between Africans and Europeans for 22 alleles with protective effects large enough to be specified in our sample (≥3.7%; Figure 5—source data 3). We found that 11 of these protective alleles (50%) are more common in Africans, which is not more than expected by chance (p=0.5, one-sided binomial test). The three protective variants with the largest absolute F_ST values are all more common in Europeans, including a synonymous SPTA1 allele with GWAS associations to several RBC and white blood cell traits. Two protective PIEZO1 variants are more common in Africans, including E756del and a synonymous variant of large effect. Overall, however, we find no evidence that African populations are enriched for non-pathogenic RBC polymorphisms or phenotypes associated with impaired P. falciparum growth in vitro.

Discussion

Healthy RBCs harbor extensive phenotypic and proteomic variation, both within and between human populations. In this study, we demonstrate that this variation modulates a wide range of RBC susceptibility to P. falciparum parasites. Our findings add to a growing understanding of the genetic and phenotypic basis of RBC resistance to P. falciparum, especially for RBCs that lack population-specific disease alleles. These findings suggest new targets for future malaria interventions, in addition to challenging assumptions about the role of malaria selection in shaping human RBC diversity.

Exponential replication of P. falciparum is a significant driver of malaria disease progression (Bejon et al., 2007). Therefore, the ample variation that we observed in this trait in vitro could be relevant for clinical outcomes in endemic regions. Growth inhibition from HbAS, for example, reduces the risk of death from malaria by reducing parasite density in the blood (Allison, 1954; Luzzatto, 2012). While HbAS has a uniquely extreme effect size, we found a threefold range of parasite replication rates among non-carrier RBCs that share substantial overlap with RBCs carrying other protective variants. Although the physiologically complex basis of severe malaria (Okwa, 2012) makes it difficult to estimate the precise contribution of RBC factors to severe malaria risk, the genotypes and phenotypes we have associated with P. falciparum fitness may contribute to malaria susceptibility.

We have shown here that widespread, ‘normal’ variation in RBC hydration and deformability traits are associated with P. falciparum fitness in non-carrier RBCs. Interestingly, the protective phenotypes we detect in non-carrier RBCs are also present in carriers, albeit to a stronger degree (Clark et al., 1983; Mockenhaupt, 2000; Pengon et al., 2018). These results are consistent with experimental manipulations that reduce P. falciparum growth, such as chemical or genetic dehydration of RBCs (Tiffert et al., 2005; Ma et al., 2018). They are also consistent with the protective effect conferred by Dantu, a rare glycophorin variant associated with increased membrane tension (Field et al., 1994; Leffler et al., 2017; Kariuki et al., 2020). Our data expand upon these prior findings by demonstrating for the first time that common, healthy phenotypic variation in RBC traits contributes meaningfully to P. falciparum growth.

In the last decade, several association studies have explored the genetic basis of common variation in RBC traits using large, mostly European cohorts (van der Harst et al., 2012; Astle et al., 2016; Chami et al., 2016; Canela-Xandri et al., 2018; Chen et al., 2020; Vuckovic et al., 2020). These studies agree that the broad distribution of RBC phenotypes in humans is shaped by a large number of common alleles, similar to other complex traits (Boyle et al., 2017). Although the effects of most individual alleles are likely too small to be considered pathogenic on their own, different combinations of alleles may underlie the broad phenotypic variation observed in non-carriers. We cannot rule out the possibility that some extreme phenotypes could be better explained by the presence of large-effect ‘disease’ alleles that remain undiscovered. In particular, our study was not powered to detect rare alleles, which could be an important source of missing heritability (Génin, 2020; Kierczak, 2021). Some RBC phenotypes are also shaped by environmental variation, such as diet and time of day (England et al., 1976; Sennels et al., 2011), which likely diminishes correlations between repeated samples. Although this study cannot distinguish among these explanations for phenotypic variation among non-carrier RBCs, it does suggest that this broad variation is both healthy and functional.

In our linear models of P. falciparum growth, phenotypic variation among RBCs was outperformed by genetic variation in a small number of RBC proteins. This result implies the existence of protective RBC phenotypes that we did not measure (or did not measure with sufficient accuracy), such as quantitative proteomic, transcriptomic, and metabolomic traits that could be addressed by future studies. Approximately half of the polymorphisms we identified are non-synonymous and may therefore exert direct effects on phenotypes like RBC membrane structure or ion transport. The other half of associated polymorphisms were synonymous, which could be linked to coding variants but could also have direct effects on mRNA transcription, splicing, and stability (Sauna and Kimchi-Sarfaty, 2011). Indeed, silent and coding SNPs are equally likely to be associated with human disease (Chen et al., 2010), and many synonymous sites experience strong selection (Supek et al., 2014; Machado et al., 2020). Synonymous SNPs that impact splicing, like rs35356320 in CD44, may also impact protein structure. Some other conceivable RBC phenotypes, such as the dynamics of membrane modification during P. falciparum development, may only become evident in more detailed time course experiments. The true number of RBC phenotypes that impact P. falciparum may be effectively infinite (Kinsler et al., 2020), making it useful in practice that genetic variation is more predictive of parasite growth.

One reason that our study could identify genetic associations with a modest sample size was because we focused on a relatively well-defined component of a larger disease that lends itself to controlled, in vitro experiments. Another important explanation is our use of LASSO variable selection on a restricted set of polymorphisms in genes with strong existing links to malaria (Flynn et al., 2017). Focusing our tests on a limited number of hypotheses obviated the need to meet an exome-wide significance threshold, while still allowing for the discovery of novel alleles. This approach relies directly on prior knowledge (Figure 5—source data 1) and cannot readily be expanded to explicitly test large numbers of anonymous genetic variants. However, testing fewer hypotheses that are more likely to be true helps ensure that ‘significant’ results are reliable (Ioannidis, 2005). Exome-wide data were still critical in this study for assessing population structure, as well as for performing permutation tests that confirmed an enrichment of signal in our 23 focal genes. However, future studies with many more than 68 non-carriers will be required to discover additional associations in unknown genes, non-genic regulatory variation, and any alleles with smaller effects. It is also important to note that genetic linkage complicates the identification of the exact functional polymorphisms in any population sample (Sohail et al., 2019); as in GWAS, we cannot rule out that some associated variants are merely linked to the true functional variants. Indeed, about half of our associated variants occur in linkage blocks containing other SNPs associated with RBC traits by GWAS. In this way, our evidence most strongly supports the conclusion that 13 specific RBC genes are strongly enriched for polymorphisms with impacts on P. falciparum growth.

The associations we observed for parasite growth were stronger and more significant than the associations for parasite invasion. While batch effects clearly played a role, this may also be due to missing invasion data in 10 non-carrier samples (see Materials and methods) that reduced statistical power. Both technical and biological reasons may drive the relatively greater noise observed in our invasion data. For example, invasion success may depend on the length of time spent outside the incubator during assay set-up as well as the genotypes of both donor and acceptor red cells. The reproducibility of our invasion data is also constrained by low and variable starting parasitemia and a 24hr time point, which could be substantially improved in future studies using live-cell imaging focused on invasion. Despite these limitations, our ‘growth’ measurement includes a ‘re-invasion’ event and our growth and invasion measurements are correlated. RBC deformability and dehydration are associated with both fitness components, and SNPs in several canonical invasion receptors are only associated with growth. The invasion data also allow us to highlight unique and interesting trends in the Senegalese clinical strain and in carriers of hemoglobin C.

We also observed weaker associations for the clinical strain Th.026.09 than for the lab strain 3D7. These strains display large differences in absolute growth rate, possibly because Th.026.09 carries costly alleles for drug resistance and 3D7 has had decades longer to adapt to lab conditions (Walliker et al., 1987; Daniels et al., 2012; Moser, 2020). Interestingly, African ancestry predicted higher invasion only for Th.026.09, which might indicate that this strain is better adapted to African RBCs. Despite these differences, we showed that normalized fitness values were significantly correlated between the two strains across donors. Several RBC phenotypes and genotypes that predicted fitness in one strain were also replicated in the other. These results suggest that our findings may be generalizable across divergent strains of P. falciparum, although future studies would benefit by testing many more lab strains and clinical isolates.

One of the unique aspects of our study is the participation of individuals with a range of African ancestry, defined by similarity to donors from five 1000 Genomes reference populations. We found that African ancestry was unexpectedly associated with RBC phenotypes that improved parasite fitness, particularly for Th.026.09. In the future, it would be very interesting to test for local parasite adaptation to human RBCs using P. falciparum strains and RBC samples from around the globe. We also found that the total set of polymorphisms associated with P. falciparum growth by LASSO are not enriched in African populations included in the gnomAD database of human variation. Notably, a recent test of data from a large GWAS for severe malaria (Malaria Genomic Epidemiology Network, 2019) was also unable to demonstrate that natural selection has driven many malaria-protective alleles to higher frequencies in African versus European populations. Therefore, for the total set of alleles detectable in this study, we offer at least four possible explanations for this unexpected result. First, compared to large-effect disease alleles, the majority of non-pathogenic variants may not have had sufficient time to increase in frequency since P. falciparum began expanding in humans some 5000–10,000 years ago (Sundararaman et al., 2016; Otto et al., 2018). Second, the complexity of severe malaria could mean that the variants discovered here do not substantially impact disease outcome, especially relative to known disease variants. Third, the variants discovered here may have pleiotropic effects on other phenotypes, which are themselves subject to other selective pressures besides malaria resistance. Finally, human adaptation may be too local to detect with coarse-grain sampling of sub-Saharan African genetic diversity (e.g., Pankratov et al., 2020). Overall, however, our data suggest that few RBC alleles remain to be discovered that are both particularly common in Africa and have large effects on P. falciparum proliferation in RBCs.

More broadly, these data show that it may be inaccurate to make assumptions about RBC susceptibility to P. falciparum based on a person’s race or continental ancestry. These kinds of hypotheses (Williams, 2006; Goheen et al., 2016; Kanias et al., 2017; Ma et al., 2018) are based on well-known examples of balanced disease alleles, which are notable exceptions to the overwhelming genetic similarity of all human populations (Rosenberg et al., 2002; Novembre and Di Rienzo, 2009). In our data, RBC variation that is associated with reduced P. falciparum fitness is clearly not limited to individuals with recent African ancestry. This result is an important reminder that >90% of the total genetic variation among humans occurs within populations, rather than across them (Lewontin, 1972; Rosenberg, 2011); and that the majority of common genetic variation is shared among all human populations (Biddanda et al., 2020).

In conclusion, this study demonstrates that substantial phenotypic and genetic diversity in healthy human RBCs impacts the replication of malaria parasites. Whether or not this diversity is shaped by malaria selection, a better understanding of how P. falciparum biology is impacted by natural RBC variation could help lead to new therapies for one of humanity’s most important infectious diseases.

Materials and methods

Key resources table

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information
Biological sample (Homo sapiens)	Primary whole blood samples	This paper		Freshly drawn from de-identified human subjects into CPDA tubes (IRB #40479)
Strain, strain background (Plasmodium falciparum)	3D7	PMID:3299700; Obtained from Walter and Eliza Hall Institute, Melbourne, Australia
Strain, strain background (P. falciparum)	Th026.09	PMID:22430961; Gift from Daouda Ndiaye and Sarah Volkman, Senegal
Commercial assay or kit	DNeasy Blood and Tissue Kit	QIAGEN
Commercial assay or kit	KAPA Hyperplus Kit	Roche
Commercial assay or kit	SeqCap EZ Prime Exome Kit	Roche
Sequence-based reagent	Primers amplifying PIEZO1 exon 17	PMID:32265284
Software, algorithm	bwa mem	http://arxiv.org/abs/1303.3997	0.7.17-r1188
Software, algorithm	GATK	https://gatk.broadinstitute.org/hc/en-us	4.0.0.0
Software, algorithm	vcftools	doi:10.1093/bioinformatics/btr330	0.1.15
Software, algorithm	ANNOVAR	PMID:20601685	2018-04-16
Software, algorithm	PLINK	PMID:17701901	v1.90b6.8 64-bit
Software, algorithm	ADMIXTURE	PMID:21682921	1.3.0
Software, algorithm	R	https://www.R-project.org/	3.5.1
Other	SYBR Green I nucleic acid stain	Invitrogen	S7563
Other	Drabkin’s Reagent	Ricca Chemical	2660–32

Sample collection and preparation

Request a detailed protocol

One-hundred and twenty-one subjects with no known history of RBC disorders were recruited to donate blood at the Stanford Clinical and Translational Research Unit. This study size was designed to sample multiple individuals carrying alleles of moderate frequency (5% or higher). Written informed consent was obtained from each subject and/or their parent as part of a protocol approved by the Stanford University Institutional Review Board (#40479). To help control for weekly batch effects, subject 1111 donated fresh blood for each parasite assay. Eleven other subjects donated blood on at least 2 different weeks, constituting biological replicates. Whole blood samples from a HE patient were obtained from Dr. Bertil Glader under a separate approved protocol (Stanford IRB #14004) that permitted sample sharing among researchers. All samples were de-identified upon collection by labeling with a random four-to-six digit code. Two samples were eventually removed from analysis based on a failed sequencing library (6449KD) and history of stem cell transplant (8715).

Whole blood was drawn into CPDA tubes and spun down within 36 hr to separate serum, buffy coat, and RBCs. RBCs were washed and stored in RPMI-1640 medium (Sigma-Aldrich) supplemented with 25 mM HEPES, 50 mg/L hypoxanthine, and 2.42 mM sodium bicarbonate at 4°C. Buffy coat was transferred directly to cryotubes and stored at –80°C.

Exome sequencing and genotype calling

Request a detailed protocol

Genomic DNA was isolated from frozen buffy coats using a DNeasy Blood and Tissue Kit (QIAGEN). Libraries were prepared using a KAPA Hyperplus Kit (Roche) and hybridized to human exome probes using the SeqCap EZ Prime Exome Kit (Roche). The resulting exome libraries were sequenced with paired-end 150 bp Illumina reads on the HiSeq or NextSeq platforms at Admera Health (South Plainfield, NJ).

Reads were aligned to the hg38 human reference genome using bwa mem (Li, 2013a), yielding an average coverage of 42X across targeted exome regions (excluding sample 6449KD). Variants were called using GATK best practices (Van der Auwera et al., 2013) and hard filtered with the following parameters: QD<2.0, FS>60.0, ReadPosRankSum<–2.5, SOR>2.5, MQ<55.0, MQRankSum<–1.0, and DP<500. To minimize the effects of sequencing errors, variants not present in 1000 Genomes, dbSNP_138, or the Mills indel collection (Mills et al., 2006) were discarded. Variants that were significantly more frequent in our sample than in gnomAD African and European populations (Karczewski et al., 2020) were also discarded, in order to avoid false associations from miscalled variants. We also excluded singleton variants from all association analyses, potentially including some variants unique to other populations. With the remaining variants, we calculated kinship coefficients among all pairs of donors using vcftools --relatedness2. Only the six members of the known family had pairwise coefficients >0.044, confirming that no other donors were related.

PIEZO1 E756del was genotyped via PCR and Sanger sequencing according to a previously published protocol (Nguetse et al., 2020). To call deletion variants that cause α-thalassemia in the paralogous genes HBA2 and HBA1, we extracted reads from each.bam file that lacked any mismatches or soft-clipping and had MAPQ≥13 (i.e., <5% chance of mapping error). Coverage with these well-mapped reads was calculated over the 73 and 81 bp of unique sequence in HBA2 and HBA1 and normalized to each sample’s exome-wide coverage. To determine which samples has unusually low coverage, we formed an ad hoc reference panel of seven donors who were unlikely to carry deletion alleles based on their normal MCH, MCV, and HGB and >96% exome-wide European ancestry (Weatherall, 2001). We called heterozygous HBA2 deletions when normalized coverage across three unique regions of the HBA2 gene was below the minimum reference value. Similarly, we called homozygous HBA2 deletions when normalized coverage across three unique regions of the HBA2 gene was less than half of the minimum reference value. This approach resulted in an estimated HBA2 copy number of 2.0 in the reference panel, 0.95 in eight putative heterozygotes and 0.12 in four putative homozygotes. The same method produced no evidence of HBA1 deletion in any sample.

Variant classification and linkage pruning

Request a detailed protocol

Exonic variants in RefSeq genes were identified using ANNOVAR (Wang et al., 2010). Variants were classified into three categories: those within 23 malaria-related genes (Figure 5—source data 1); those within 887 other RBC proteins (Figure 5—source data 2) derived with a medium-confidence filter from the Red Blood Cell Collection database (rbcc.hegelab.org); and those within any other gene.

Linkage between all pairs of bi-allelic, exonic variants in our 121 genotyped samples was calculated using the --geno-r2 and --interchrom-geno-r2 functions in vcftools (Danecek et al., 2011). Variants in RBC genes that shared r² >0.1 with any variant in the 23-gene set were removed. Within the 23-gene set and RBC-gene set separately, non-carrier variants were ranked by the p-values of their OLS regression with all four parasite measures. Then, one variant was removed from each pair with r²>0.1, prioritizing retention in the following order: greater significance across models; non-synonymous protein change; higher frequency in our sample; and finally by random sampling. We report results from additive genetic models (genotypes coded 0/1/2), which performed as well or better than recessive (0/0/2) and dominant (0/2/2) models. For three variants, overdominant models (0/1/–) provided the best fit and were used to estimate effect sizes (Figure 5—figure supplement 3).

Population analysis

Request a detailed protocol

The population ancestry of our donors was assessed by comparison with African, European, East Asian, and South Asian reference populations from the 1000 Genomes Project (Auton et al., 2015). Briefly, variants called from an hg38 alignment of the 1000 Genomes data (Lowy-Gallego et al., 2019) were filtered for concordance with the variants genotyped in this study. The --indep-pairwise command in PLINK (Purcell et al., 2007) was used to prune SNPs with r²>0.1 with any other SNP in a 50-SNP sliding window, producing 35,759 unlinked variants. These variants were analyzed in both PLINK --pca and in ADMIXTURE (Alexander and Lange, 2011) with K=4 for the 121 genotyped individuals in this study, alongside 2458 individuals from 1000 Genomes. Pan-African and pan-European allele frequencies were obtained from gnomAD v3 (Karczewski et al., 2020). F_ST for specific alleles was calculated as (H_T−H_S)/H_T and then polarized, such that positive values indicate variants more common in Africa.

P. falciparum culture and assays

Request a detailed protocol

Our 3D7 strain of P. falciparum was obtained from the Walter and Eliza Hall Institute (Melbourne, Australia) and routinely cultured in human erythrocytes obtained from the Stanford Blood Center. Th.026.09 is a clinical strain isolated from a patient in Senegal in 2009 and kindly provided by Daouda Ndiaye and Sarah Volkman. 3D7 is drug-sensitive and has been lab-adapted for over 40 years, whereas Th.026.09 is drug-resistant and minimally lab-adapted (Walliker et al., 1987; Daniels et al., 2012; Moser, 2020). 3D7 was maintained at 2% hematocrit in RPMI-1640 supplemented with 25 mM HEPES, 50 mg/L hypoxanthine, 2.42 mM sodium bicarbonate, and 4.31 mg/ml Albumax (Invitrogen) at 37°C in 5% CO₂ and 1% O₂. Th.026.09 was maintained in the same conditions, except that half the Albumax was replaced with heat-inactivated human AB serum.

Parasite growth and invasion assays were performed using schizont-stage parasites isolated from routine culture using a MACS magnet (Miltenyi). Parasites were added at ~0.5% initial parasitemia to fresh erythrocytes suspended at 1% hematocrit in complete RPMI, as above. Parasites were cultured in each erythrocyte sample for 3–5 days in triplicate 100 µl wells. Parasitemia was determined as the average of the three technical replicates, excluding single outlier points, on day 0, day 1 (24 hr), day 3 (72 hr), and in some cases day 5 (120 hr). The fraction of infected RBCs was measured by staining with SYBR Green one nucleic acid stain (Invitrogen, Thermo Fisher Scientific, Eugene, OR) at 1:2000 dilution in PBS/0.3% BSA for 20 min, followed by flow cytometry analysis on a MACSQuant flow cytometer (Miltenyi). Raw invasion rate was defined as the day 1 parasitemia divided by the day 0 parasitemia; raw growth rate was defined as the day 3 (or day 5) parasitemia divided by the day 1 (or day 3) parasitemia. Day 0 parasitemia was not measured in weeks 1–3, so invasion rate estimates are absent for these samples (N=58 unrelated non-carriers with invasion data). The parasite assays failed for both strains in week 9 and for Th.026.09 in week 10, and so were repeated in weeks 10 and 11 with RBCs that had been stored for 1 or 2 weeks.

To correct for batch effects, including substantial week-to-week variation in P. falciparum replication rate, we extracted the residuals from a linear regression of the raw parasite values against up to four significantly related batch variables: (1) the raw values for control donor 1111 each week; (2) the parasitemia measured at the previous time point; (3) the age in weeks of the RBCs being measured; and (4) the experimenter performing the assays. Notably, there was no additional effect of ‘Week’ or the length of the experiment (i.e., 3 or 5 days) once the above variables were regressed out. To convert these residuals (mean 0%) to relative percentages (mean 100%), we first trained linear models for growth and invasion in each strain with data from control donor 1111 and carriers with extreme parasite values (HbAS and HE for growth; G6PD⁻_high and HE for invasion). For these models, relative percentages were calculated by normalizing the raw multiplication rates in these samples to the raw multiplication rate in the 1111 control from that week. These linear models were used to convert residuals to relative percentages for all samples. Finally, the relative percentages were arithmetically adjusted so that the mean invasion and growth values for non-carriers was 100%. Code for this normalization is available at https://github.com/emily-ebel/RBC (copy archived at swh:1:rev:31f953428a4ec5f0fa83201085ada0a0995facb2), Ebel, 2021.

Red cell phenotyping and normalization

Request a detailed protocol

Complete blood count (CBC) data for RBCs, reticulocytes, and platelets were obtained with an ADVIA 120 hematology analyzer (Siemens, Laguna Hills, CA) at the Red Cell Laboratory at Children’s Hospital Oakland Research Institute. These data were: RBC, HGB, HCT, MCV, MCH, MCHC, CHCM, RDW, HDW, PLT, MPV, Reticulocyte number and percentage, and the fraction of RBCs in each of nine cells of the RBC matrix (see Figure 3—figure supplement 2). Systematic biases were evident for some measures in certain weeks, but data from control donor 1111 were not available for all weeks. Therefore, CBC data were normalized such that the median value for non-carrier samples was equal across weeks.

Osmotic fragility tests were performed in duplicate by incubating 20 μl of washed erythrocytes for 5 min in 500 μl solutions of NaCl in 14 concentrations: 7.17, 6.14, 5.73, 5.32, 4.91, 4.50, 4.30, 4.09, 3.89, 3.68, 3.27, 3.07, 2.66, and 2.46 g/L. Tubes were spun for 5 min at 1000 g and 100 μl of supernatant was transferred to a 96-well plate. Hemoglobin concentration was determined by adding 100 μl of Drabkin’s reagent (Ricca Chemical) to each well and measuring absorbance at OD_540nm with a Synergy H1 Plate Reader (Biotek). Relative lysis was determined by normalizing to the maximum hemoglobin concentration in the 14-tube series for each sample. After outlier points were manually removed, sigmoidal osmotic fragility curves were estimated under a self-starting logistic model in the nls package in R. Curves were summarized by the relative tonicity at which 50% lysis occurred (see Figure 3—figure supplement 4) and normalized within weekly batches, such that this value was equal for control sample 1111 across weeks.

Osmotic gradient ektacytometry (Clark et al., 1983; Kuypers, 1990) was performed at the Red Cell Laboratory at Children’s Hospital Oakland Research Institute. Red cell deformability estimates across a gradient of NaCl concentrations were fitted to a 20-parameter polynomial model to generate a smooth curve, which was manually verified to closely fit the data. Each curve was summarized with three standard points (Figure 3—figure supplement 5; Clark et al., 1983), which were normalized such that the median x- and y-values of the three points was equal for non-carrier samples across weeks.

Statistical analysis

Request a detailed protocol

Student’s t-test was used to compare trait values between non-carriers and carriers where N>1. Given our modest sample sizes and the expected noise in parasite data, we defined statistical significance as p<0.1. Where N=1 (i.e., for G6PD⁻_high and HE), significance was assessed with the percentile of the non-carrier distribution. For all comparisons of two continuous variables, OLS linear regression was performed with the lm function in R unless otherwise specified. Adjusted R² values are reported.

LASSO regression (Tibshirani, 1994; Chatterjee, 2013) was performed in a k-folds CV framework with the glmnet and caret packages in R. For each of 1000 iterations, we used the createFolds function with k=10 to split the non-carrier data into 10-folds of roughly equal size. Each fold was used as a ‘test set’ for a LASSO model trained on the remaining nine folds. For each of the 1000 iterations in which 10-folds were created, we collected 10 sets of predictors from the 10 train sets; one average R² value for the 10 train sets; and one average R² for the 10 test sets. Each set of 1000 resulting R² values were normally distributed, and their average is reported in Figures 4 and 5. The fraction of k-folds CV support per predictor is based on 10,000 total train models (1000 iterations*10 folds each) and is reported in Figure 4A and Figure 5—source data 3.

To perform LASSO with each training set, we used the cv.glmnet function with α=1. This function split the train data into 10 folds 11 times, the first to estimate a lambda sequence and the rest to compute the fit with one fold omitted. The lambda value that produced minimal error in the training data was then used to predict values in the independent test data described above. Since cv.glmnet selects folds at random, we performed this procedure five times for each train/test set (which we term ‘internal cross-validation’). We retained R² values and selected predictors from the median model of these five internal CVs. Internal CVs did not otherwise contribute to the k-folds CV support reported in the main text.

To assess the significance of each LASSO result, we applied the same modeling procedure to 1000 data sets with randomly permuted parasite values, which preserved the original correlations among RBC predictors. We performed 10 iterations of fold creation for each permuted data set and retained the average R² for each set of 10-folds, which generated 1000 fold-averaged R² values for train sets and 1000 fold-averaged R² values for train sets. Significance was determined by the percentile of the permuted distribution in which the real data fell. We also applied this same procedure to 1000 sets of 23 genes chosen at random from the RBC proteome (Figure 5—source data 2).

We noticed that LASSO effect size estimates for each predictor varied considerably across models. Therefore, we used univariate OLS regression on all non-carrier data (excluding five of the six family members) to estimate the effect size of each predictor selected at least once by LASSO. OLS p-values are reported as a measure of confidence in these effect size estimates, with p<0.1 considered sufficient evidence to report the effect size. However, because OLS regression was only performed for variants pre-selected by LASSO, these p-values cannot be interpreted on their own as evidence of significant associations.

We compared groups of selected genetic variants using the binom.test function in R. For synonymous alleles, we used the proportion of synonymous alleles in the input set of 106 variants (53%) as the null hypothesis. For allele frequencies in Africa and Europe, we categorized protective variants as more common (to any absolute degree) among Africans (N=21,042) or non-Finnish Europeans (N=32,399) in the gnomAD database. The null hypothesis was that 50% of the alleles would be more common in Africans.

Data availability

All data generated or analyzed during this study are included in the manuscript and supporting files. Source data files have been provided for Figures 1, 4, and 5 and other raw data and normalization scripts are available at https://github.com/emily-ebel/RBC (copy archived at https://archive.softwareheritage.org/swh:1:rev:31f953428a4ec5f0fa83201085ada0a0995facb2).

The following data sets were generated

1. Ebel ER
2. Kuypers FA
3. Lin C
4. Petrov DA
5. Egan ES
(2020) NCBI BioProject
ID PRJNA683732. Exome Sequencing from Participants in RBC/Malaria Study.

https://www.ncbi.nlm.nih.gov/bioproject/PRJNA683732/

References

1. Alexander DH
2. Lange K
(2011) Enhancements to the admixture algorithm for individual ancestry estimation
BMC Bioinformatics 12:246.

https://doi.org/10.1186/1471-2105-12-246
- PubMed
- Google Scholar
1. Allison AC
(1954) Protection afforded by sickle-cell trait against subtertian malareal infection
British Medical Journal 1:290–294.

https://doi.org/10.1136/bmj.1.4857.290
- PubMed
- Google Scholar
(2018) Resistance to Plasmodium falciparum in sickle cell trait erythrocytes is driven by oxygen-dependent growth inhibition
PNAS 115:7350–7355.

https://doi.org/10.1073/pnas.1804388115
- PubMed
- Google Scholar
1. Astle WJ
2. Elding H
3. Jiang T
4. Allen D
5. Ruklisa D
6. Mann AL
7. Mead D
8. Bouman H
9. Riveros-Mckay F
10. Kostadima MA
11. Lambourne JJ
12. Sivapalaratnam S
13. Downes K
14. Kundu K
15. Bomba L
16. Berentsen K
17. Bradley JR
18. Daugherty LC
19. Delaneau O
20. Freson K
21. Garner SF
22. Grassi L
23. Guerrero J
24. Haimel M
25. Janssen-Megens EM
26. Kaan A
27. Kamat M
28. Kim B
29. Mandoli A
30. Marchini J
31. Martens JHA
32. Meacham S
33. Megy K
34. O’Connell J
35. Petersen R
36. Sharifi N
37. Sheard SM
38. Staley JR
39. Tuna S
40. van der Ent M
41. Walter K
42. Wang SY
43. Wheeler E
44. Wilder SP
45. Iotchkova V
46. Moore C
47. Sambrook J
48. Stunnenberg HG
49. Di Angelantonio E
50. Kaptoge S
51. Kuijpers TW
52. Carrillo-de-Santa-Pau E
53. Juan D
54. Rico D
55. Valencia A
56. Chen L
57. Ge B
58. Vasquez L
59. Kwan T
60. Garrido-Martín D
61. Watt S
62. Yang Y
63. Guigo R
64. Beck S
65. Paul DS
66. Pastinen T
67. Bujold D
68. Bourque G
69. Frontini M
70. Danesh J
71. Roberts DJ
72. Ouwehand WH
73. Butterworth AS
74. Soranzo N
(2016) The Allelic Landscape of Human Blood Cell Trait Variation and Links to Common Complex Disease
Cell 167:1415–1429.

https://doi.org/10.1016/j.cell.2016.10.042
- PubMed
- Google Scholar
(2015) A global reference for human genetic variation
Nature 526:68–74.

https://doi.org/10.1038/nature15393
- PubMed
- Google Scholar
(2015) A novel locus of resistance to severe malaria in a region of ancient balancing selection
Nature 526:253–257.

https://doi.org/10.1038/nature15390
- PubMed
- Google Scholar
1. Bejon P
2. Berkley JA
3. Mwangi T
4. Ogada E
5. Mwangi I
6. Maitland K
7. Williams T
8. Scott JAG
9. English M
10. Lowe BS
11. Peshu N
12. Newton CRJC
13. Marsh K
(2007) Defining childhood severe falciparum malaria for intervention studies
PLOS Medicine 4:e251.

https://doi.org/10.1371/journal.pmed.0040251
- PubMed
- Google Scholar
1. Beutler E
2. West C
(2005) Hematologic differences between African-Americans and whites: the roles of iron deficiency and alpha-thalassemia on hemoglobin levels and mean corpuscular volume
Blood 106:740–745.

https://doi.org/10.1182/blood-2005-02-0713
- PubMed
- Google Scholar
(2020) A variant-centric perspective on geographic patterns of human allele frequency variation
eLife 9:e60107.

https://doi.org/10.7554/eLife.60107
- PubMed
- Google Scholar
(2017) An Expanded View of Complex Traits: From Polygenic to Omnigenic
Cell 169:1177–1186.

https://doi.org/10.1016/j.cell.2017.05.038
- PubMed
- Google Scholar
(2018) An atlas of genetic associations in UK Biobank
Nature Genetics 50:1593–1599.

https://doi.org/10.1038/s41588-018-0248-z
- PubMed
- Google Scholar
1. Cao A
2. Galanello R
(2010) Beta-thalassemia
Genetics in Medicine 12:61–76.

https://doi.org/10.1097/GIM.0b013e3181cd68ed
- PubMed
- Google Scholar
1. Chami N
2. Chen MH
3. Slater AJ
4. Eicher JD
5. Evangelou E
6. Tajuddin SM
7. Love-Gregory L
8. Kacprowski T
9. Schick UM
10. Nomura A
11. Giri A
12. Lessard S
13. Brody JA
14. Schurmann C
15. Pankratz N
16. Yanek LR
17. Manichaikul A
18. Pazoki R
19. Mihailov E
20. Hill WD
21. Raffield LM
22. Burt A
23. Bartz TM
24. Becker DM
25. Becker LC
26. Boerwinkle E
27. Bork-Jensen J
28. Bottinger EP
29. O’Donoghue ML
30. Crosslin DR
31. de Denus S
32. Dubé MP
33. Elliott P
34. Engström G
35. Evans MK
36. Floyd JS
37. Fornage M
38. Gao H
39. Greinacher A
40. Gudnason V
41. Hansen T
42. Harris TB
43. Hayward C
44. Hernesniemi J
45. Highland HM
46. Hirschhorn JN
47. Hofman A
48. Irvin MR
49. Kähönen M
50. Lange E
51. Launer LJ
52. Lehtimäki T
53. Li J
54. Liewald DCM
55. Linneberg A
56. Liu Y
57. Lu Y
58. Lyytikäinen LP
59. Mägi R
60. Mathias RA
61. Melander O
62. Metspalu A
63. Mononen N
64. Nalls MA
65. Nickerson DA
66. Nikus K
67. O’Donnell CJ
68. Orho-Melander M
69. Pedersen O
70. Petersmann A
71. Polfus L
72. Psaty BM
73. Raitakari OT
74. Raitoharju E
75. Richard M
76. Rice KM
77. Rivadeneira F
78. Rotter JI
79. Schmidt F
80. Smith AV
81. Starr JM
82. Taylor KD
83. Teumer A
84. Thuesen BH
85. Torstenson ES
86. Tracy RP
87. Tzoulaki I
88. Zakai NA
89. Vacchi-Suzzi C
90. van Duijn CM
91. van Rooij FJA
92. Cushman M
93. Deary IJ
94. Velez Edwards DR
95. Vergnaud AC
96. Wallentin L
97. Waterworth DM
98. White HD
99. Wilson JG
100. Zonderman AB
101. Kathiresan S
102. Grarup N
103. Esko T
104. Loos RJF
105. Lange LA
106. Faraday N
107. Abumrad NA
108. Edwards TL
109. Ganesh SK
110. Auer PL
111. Johnson AD
112. Reiner AP
113. Lettre G
(2016) EXOME genotyping identifies pleiotropic variants associated with red blood cell traits
American Journal of Human Genetics 99:8–21.

https://doi.org/10.1016/j.ajhg.2016.05.007
- PubMed
- Google Scholar
Preprint
1. Chatterjee S
(2013) Assumptionless Consistency of the Lasso
arXiv.

http://arxiv.org/abs/1303.5817
- Google Scholar
1. Chen R
2. Davydov EV
3. Sirota M
4. Butte AJ
(2010) Non-synonymous and synonymous coding snps show similar likelihood and effect size of human disease association
PLOS ONE 5:e13574.

https://doi.org/10.1371/journal.pone.0013574
- PubMed
- Google Scholar
1. Chen MH
2. Raffield LM
3. Mousas A
4. Sakaue S
5. Huffman JE
6. Moscati A
7. Trivedi B
8. Jiang T
9. Akbari P
10. Vuckovic D
11. Bao EL
12. Zhong X
13. Manansala R
14. Laplante V
15. Chen M
16. Lo KS
17. Qian H
18. Lareau CA
19. Beaudoin M
20. Hunt KA
21. Akiyama M
22. Bartz TM
23. Ben-Shlomo Y
24. Beswick A
25. Bork-Jensen J
26. Bottinger EP
27. Brody JA
28. van Rooij FJA
29. Chitrala K
30. Cho K
31. Choquet H
32. Correa A
33. Danesh J
34. Di Angelantonio E
35. Dimou N
36. Ding J
37. Elliott P
38. Esko T
39. Evans MK
40. Floyd JS
41. Broer L
42. Grarup N
43. Guo MH
44. Greinacher A
45. Haessler J
46. Hansen T
47. Howson JMM
48. Huang QQ
49. Huang W
50. Jorgenson E
51. Kacprowski T
52. Kähönen M
53. Kamatani Y
54. Kanai M
55. Karthikeyan S
56. Koskeridis F
57. Lange LA
58. Lehtimäki T
59. Lerch MM
60. Linneberg A
61. Liu Y
62. Lyytikäinen LP
63. Manichaikul A
64. Martin HC
65. Matsuda K
66. Mohlke KL
67. Mononen N
68. Murakami Y
69. Nadkarni GN
70. Nauck M
71. Nikus K
72. Ouwehand WH
73. Pankratz N
74. Pedersen O
75. Preuss M
76. Psaty BM
77. Raitakari OT
78. Roberts DJ
79. Rich SS
80. Rodriguez BAT
81. Rosen JD
82. Rotter JI
83. Schubert P
84. Spracklen CN
85. Surendran P
86. Tang H
87. Tardif JC
88. Trembath RC
89. Ghanbari M
90. Völker U
91. Völzke H
92. Watkins NA
93. Zonderman AB
94. VA Million Veteran Program
95. Wilson PWF
96. Li Y
97. Butterworth AS
98. Gauchat JF
99. Chiang CWK
100. Li B
101. Loos RJF
102. Astle WJ
103. Evangelou E
104. van Heel DA
105. Sankaran VG
106. Okada Y
107. Soranzo N
108. Johnson AD
109. Reiner AP
110. Auer PL
111. Lettre G
(2020) Trans-ethnic and ancestry-specific blood-cell genetics in 746,667 individuals from 5 global populations
Cell 182:1198–1213.

https://doi.org/10.1016/j.cell.2020.06.045
- PubMed
- Google Scholar
(1983) Osmotic gradient ektacytometry: Comprehensive characterization of red cell volume and surface maintenance
Blood 61:899–910.

https://doi.org/10.1182/blood.V61.5.899.899
- PubMed
- Google Scholar
1. Clarke GM
2. Rockett K
3. Kivinen K
4. Hubbart C
5. Jeffreys AE
6. Rowlands K
7. Jallow M
8. Conway DJ
9. Bojang KA
10. Pinder M
11. Usen S
12. Sisay-Joof F
13. Sirugo G
14. Toure O
15. Thera MA
16. Konate S
17. Sissoko S
18. Niangaly A
19. Poudiougou B
20. Mangano VD
21. Bougouma EC
22. Sirima SB
23. Modiano D
24. Amenga-Etego LN
25. Ghansah A
26. Koram KA
27. Wilson MD
28. Enimil A
29. Evans J
30. Amodu OK
31. Olaniyan S
32. Apinjoh T
33. Mugri R
34. Ndi A
35. Ndila CM
36. Uyoga S
37. Macharia A
38. Peshu N
39. Williams TN
40. Manjurano A
41. Sepúlveda N
42. Clark TG
43. Riley E
44. Drakeley C
45. Reyburn H
46. Nyirongo V
47. Kachala D
48. Molyneux M
49. Dunstan SJ
50. Phu NH
51. Quyen NN
52. Thai CQ
53. Hien TT
54. Manning L
55. Laman M
56. Siba P
57. Karunajeewa H
58. Allen S
59. Allen A
60. Davis TM
61. Michon P
62. Mueller I
63. Molloy SF
64. Campino S
65. Kerasidou A
66. Cornelius VJ
67. Hart L
68. Shah SS
69. Band G
70. Spencer CC
71. Agbenyega T
72. Achidi E
73. Doumbo OK
74. Farrar J
75. Marsh K
76. Taylor T
77. Kwiatkowski DP
78. MalariaGEN Consortium
(2017) Characterisation of the opposing effects of g6pd deficiency on cerebral malaria and severe malarial anaemia
eLife 6:e15085.

https://doi.org/10.7554/eLife.15085
- PubMed
- Google Scholar
1. Cooling L
(2015) Blood groups in infection and host susceptibility
Clinical Microbiology Reviews 28:801–870.

https://doi.org/10.1128/CMR.00109-14
- PubMed
- Google Scholar
1. Crosnier C
2. Bustamante LY
3. Bartholdson SJ
4. Bei AK
5. Theron M
6. Uchikawa M
7. Mboup S
8. Ndir O
9. Kwiatkowski DP
10. Duraisingh MT
11. Rayner JC
12. Wright GJ
(2011) Basigin is a receptor essential for erythrocyte invasion by plasmodium falciparum
Nature 480:534–537.

https://doi.org/10.1038/nature10606
- PubMed
- Google Scholar
(2011) The variant call format and vcftools
Bioinformatics 27:2156–2158.

https://doi.org/10.1093/bioinformatics/btr330
- PubMed
- Google Scholar
1. Daniels R
2. Ndiaye D
3. Wall M
4. McKinney J
5. Séne PD
6. Sabeti PC
7. Volkman SK
8. Mboup S
9. Wirth DF
(2012) Rapid, field-deployable method for genotyping and discovery of single-nucleotide polymorphisms associated with drug resistance in Plasmodium falciparum
Antimicrobial Agents and Chemotherapy 56:2976–2986.

https://doi.org/10.1128/AAC.05737-11
- PubMed
- Google Scholar
(2012) The host genetic diversity in malaria infection
Journal of Tropical Medicine 2012:940616.

https://doi.org/10.1155/2012/940616
- PubMed
- Google Scholar
(2007) Spectrin-based skeleton in red blood cells and malaria
Current Opinion in Hematology 14:198–202.

https://doi.org/10.1097/MOH.0b013e3280d21afd
- PubMed
- Google Scholar
Software
1. Ebel ER
(2021) RBC, version swh:1:rev:31f953428a4ec5f0fa83201085ada0a0995facb2
Software Heritage.

https://archive.softwareheritage.org/swh:1:dir:936eeefd426d26efb71dfd49bea1ccafaa03ac3f;origin=https://github.com/emily-ebel/RBC;visit=swh:1:snp:4e53482ed5ff50b01ed3179d405ec1ac9387b7a5;anchor=swh:1:rev:31f953428a4ec5f0fa83201085ada0a0995facb2
1. Egan ES
2. Jiang RHY
3. Moechtar MA
4. Barteneva NS
5. Weekes MP
6. Nobre LV
7. Gygi SP
8. Paulo JA
9. Frantzreb C
10. Tani Y
11. Takahashi J
12. Watanabe S
13. Goldberg J
14. Paul AS
15. Brugnara C
16. Root DE
17. Wiegand RC
18. Doench JG
19. Duraisingh MT
(2015) Malaria. A forward genetic screen identifies erythrocyte CD55 as essential for plasmodium falciparum invasion
Science 348:711–714.

https://doi.org/10.1126/science.aaa3526
- PubMed
- Google Scholar
1. Egan ES
2. Weekes MP
3. Kanjee U
4. Manzo J
5. Srinivasan A
6. Lomas-Francis C
7. Westhoff C
8. Takahashi J
9. Tanaka M
10. Watanabe S
11. Brugnara C
12. Gygi SP
13. Tani Y
14. Duraisingh MT
(2018) Erythrocytes lacking the langereis blood group protein abcb6 are resistant to the malaria parasite plasmodium falciparum
Communications Biology 1:45.

https://doi.org/10.1038/s42003-018-0046-2
- PubMed
- Google Scholar
(1976) Microcytosis, anisocytosis and the red cell indices in iron deficiency
British Journal of Haematology 34:589–597.

https://doi.org/10.1111/j.1365-2141.1976.tb03605.x
- PubMed
- Google Scholar
(1999) Genetic and environmental causes of variation in basal levels of blood cells
Twin Research 2:250–257.

https://doi.org/10.1375/136905299320565735
- PubMed
- Google Scholar
1. Facer CA
(1995) Erythrocytes carrying mutations in spectrin and protein 4.1 show differing sensitivities to invasion by plasmodium falciparum
Parasitology Research 81:52–57.

https://doi.org/10.1007/BF00932417
- PubMed
- Google Scholar
(2016) The (in)famous GWAS P-value threshold revisited and updated for low-frequency variants
European Journal of Human Genetics 24:1202–1205.

https://doi.org/10.1038/ejhg.2015.269
- PubMed
- Google Scholar
(1994) Glycophorin variants and plasmodium falciparum: Protective effect of the dantu phenotype in vitro
Human Genetics 93:148–150.

https://doi.org/10.1007/BF00210600
- PubMed
- Google Scholar
(2017) On the sensitivity of the lasso to the number of predictor variables
Statistical Science. Institute of Mathematical Statistics 32:88–105.

https://doi.org/10.1214/16-STS586
- Google Scholar
(1997) Hemoglobin metabolism in the malaria parasite Plasmodium falciparum
Annual Review of Microbiology 51:97–123.

https://doi.org/10.1146/annurev.micro.51.1.97
- PubMed
- Google Scholar
1. Friedman MJ
(1978) Erythrocytic mechanism of sickle cell resistance to malaria
PNAS 75:1994–1997.

https://doi.org/10.1073/pnas.75.4.1994
- PubMed
- Google Scholar
1. Galanello R
2. Cao A
(2011) Alpha-thalassemia
Genetics in Medicine 13:83–88.

https://doi.org/10.1097/GIM.0b013e3181fcb468
- Google Scholar
1. Gallagher PG
(2013) Abnormalities of the erythrocyte membrane
Pediatric Clinics of North America 60:1349–1362.

https://doi.org/10.1016/j.pcl.2013.09.001
- PubMed
- Google Scholar
1. Garn SM
(1981)
Lower hematocrit levels in blacks are not due to diet or socioeconomic factors

Pediatrics 67:580.
- PubMed
- Google Scholar
1. Génin E
(2020) Missing heritability of complex diseases: case solved?
Human Genetics 139:103–113.

https://doi.org/10.1007/s00439-019-02034-4
- PubMed
- Google Scholar
1. Glogowska E
2. Schneider ER
3. Maksimova Y
4. Schulz VP
5. Lezon-Geyda K
6. Wu J
7. Radhakrishnan K
8. Keel SB
9. Mahoney D
10. Freidmann AM
11. Altura RA
12. Gracheva EO
13. Bagriantsev SN
14. Kalfa TA
15. Gallagher PG
(2017) Novel mechanisms of piezo1 dysfunction in hereditary xerocytosis
Blood 130:1845–1856.

https://doi.org/10.1182/blood-2017-05-786004
- PubMed
- Google Scholar
1. Goheen MM
2. Wegmüller R
3. Bah A
4. Darboe B
5. Danso E
6. Affara M
7. Gardner D
8. Patel JC
9. Prentice AM
10. Cerami C
(2016) Anemia offers stronger protection than sickle cell trait against the erythrocytic stage of falciparum malaria and this protection is reversed by iron supplementation
EBioMedicine 14:123–130.

https://doi.org/10.1016/j.ebiom.2016.11.011
- PubMed
- Google Scholar
1. Greene LS
(1993) G6pd deficiency as protection againstfalciparum malaria: An epidemiologic critique of population and experimental studies
American Journal of Physical Anthropology 36:153–178.

https://doi.org/10.1002/ajpa.1330360609
- Google Scholar
(2017) Genetic effects on gene expression across human tissues
Nature 550:204–213.

https://doi.org/10.1038/nature24277
- PubMed
- Google Scholar
1. Hanssen E
2. Knoechel C
3. Dearnley M
4. Dixon MWA
5. Le Gros M
6. Larabell C
7. Tilley L
(2012) Soft x-ray microscopy analysis of cell volume and hemoglobin content in erythrocytes infected with asexual and sexual stages of Plasmodium falciparum
Journal of Structural Biology 177:224–232.

https://doi.org/10.1016/j.jsb.2011.09.003
- PubMed
- Google Scholar
(1985) Plasmodium falciparum in vitro: Diminished growth in hemoglobin h disease erythrocytes
Blood 65:452–455.

https://doi.org/10.1182/blood.V65.2.452.452
- PubMed
- Google Scholar
(2018) A common functional piezo1 deletion allele associates with red blood cell density in sickle cell disease patients
American Journal of Hematology 93:E362–E365.

https://doi.org/10.1002/ajh.25245
- PubMed
- Google Scholar
1. Ioannidis JPA
(2005) Why most published research findings are false
PLOS Medicine 2:e124.

https://doi.org/10.1371/journal.pmed.0020124
- PubMed
- Google Scholar
1. Kanias T
2. Lanteri MC
3. Page GP
4. Guo Y
5. Endres SM
6. Stone M
7. Keating S
8. Mast AE
9. Cable RG
10. Triulzi DJ
11. Kiss JE
12. Murphy EL
13. Kleinman S
14. Busch MP
15. Gladwin MT
(2017) Ethnicity, sex, and age are determinants of red blood cell storage and stress hemolysis: Results of the REDS-III Rbc-omics study
Blood Advances 1:1132–1141.

https://doi.org/10.1182/bloodadvances.2017004820
- PubMed
- Google Scholar
1. Karczewski KJ
2. Francioli LC
3. Tiao G
4. Cummings BB
5. Alföldi J
6. Wang Q
7. Collins RL
8. Laricchia KM
9. Ganna A
10. Birnbaum DP
11. Gauthier LD
12. Brand H
13. Solomonson M
14. Watts NA
15. Rhodes D
16. Singer-Berk M
17. England EM
18. Seaby EG
19. Kosmicki JA
20. Walters RK
21. Tashman K
22. Farjoun Y
23. Banks E
24. Poterba T
25. Wang A
26. Seed C
27. Whiffin N
28. Chong JX
29. Samocha KE
30. Pierce-Hoffman E
31. Zappala Z
32. O’Donnell-Luria AH
33. Minikel EV
34. Weisburd B
35. Lek M
36. Ware JS
37. Vittal C
38. Armean IM
39. Bergelson L
40. Cibulskis K
41. Connolly KM
42. Covarrubias M
43. Donnelly S
44. Ferriera S
45. Gabriel S
46. Gentry J
47. Gupta N
48. Jeandet T
49. Kaplan D
50. Llanwarne C
51. Munshi R
52. Novod S
53. Petrillo N
54. Roazen D
55. Ruano-Rubio V
56. Saltzman A
57. Schleicher M
58. Soto J
59. Tibbetts K
60. Tolonen C
61. Wade G
62. Talkowski ME
63. Genome Aggregation Database Consortium
64. Neale BM
65. Daly MJ
66. MacArthur DG
(2020) The mutational constraint spectrum quantified from variation in 141,456 humans
Nature 581:434–443.

https://doi.org/10.1038/s41586-020-2308-7
- PubMed
- Google Scholar
1. Kariuki SN
2. Marin-Menendez A
3. Introini V
4. Ravenhill BJ
5. Lin Y-C
6. Macharia A
7. Makale J
8. Tendwa M
9. Nyamu W
10. Kotar J
11. Carrasquilla M
12. Rowe JA
13. Rockett K
14. Kwiatkowski D
15. Weekes MP
16. Cicuta P
17. Williams TN
18. Rayner JC
(2020) Red blood cell tension protects against severe malaria in the Dantu blood group
Nature 585:579–583.

https://doi.org/10.1038/s41586-020-2726-6
- PubMed
- Google Scholar
1. Kariuki SN
2. Williams TN
(2020) Human genetics and malaria resistance
Human Genetics 139:801–811.

https://doi.org/10.1007/s00439-020-02142-6
- PubMed
- Google Scholar
1. Kierczak M
(2021) The contribution of rare whole genome sequencing variants to plasma protein levels and to the missing heritability
Research Square 1:625433/v1.

https://doi.org/10.21203/rs.3.rs-625433/v1
- Google Scholar
Preprint
(2020) A Genotype-Phenotype-Fitness Map Reveals Local Modularity and Global Pleiotropy of Adaptation
bioRxiv.

https://doi.org/10.1101/2020.06.25.172197
- Google Scholar
1. Koch M
(2017) Plasmodium falciparum erythrocyte-binding antigen 175 triggers a biophysical change in the red blood cell that facilitates invasion
PNAS 114:4225–4230.

https://doi.org/10.1073/pnas.1620843114
- Google Scholar
1. Kuypers FA
(1990)
Use of ektacytometry to determine red cell susceptibility to oxidative stress

The Journal of Laboratory and Clinical Medicine 116:527–534.
- Google Scholar
1. Kwiatkowski DP
(2005) How malaria has affected the human genome and what human genetics can teach us about malaria
American Journal of Human Genetics 77:171–192.

https://doi.org/10.1086/432519
- PubMed
- Google Scholar
1. Leffler EM
2. Band G
3. Busby GBJ
4. Kivinen K
5. Le QS
6. Clarke GM
7. Bojang KA
8. Conway DJ
9. Jallow M
10. Sisay-Joof F
11. Bougouma EC
12. Mangano VD
13. Modiano D
14. Sirima SB
15. Achidi E
16. Apinjoh TO
17. Marsh K
18. Ndila CM
19. Peshu N
20. Williams TN
21. Drakeley C
22. Manjurano A
23. Reyburn H
24. Riley E
25. Kachala D
26. Molyneux M
27. Nyirongo V
28. Taylor T
29. Thornton N
30. Tilley L
31. Grimsley S
32. Drury E
33. Stalker J
34. Cornelius V
35. Hubbart C
36. Jeffreys AE
37. Rowlands K
38. Rockett KA
39. Spencer CCA
40. Kwiatkowski DP
41. Malaria Genomic Epidemiology Network
(2017) Resistance to malaria through structural variation of red blood cell invasion receptors
Science 356:1140–1152.

https://doi.org/10.1126/science.aam6393
- PubMed
- Google Scholar
1. Lell B
2. May J
3. Schmidt-Ott RJ
4. Lehman LG
5. Luckner D
6. Greve B
7. Matousek P
8. Schmid D
9. Herbich K
10. Mockenhaupt FP
11. Meyer CG
12. Bienzle U
13. Kremsner PG
(1999) The role of red blood cell polymorphisms in resistance and susceptibility to malaria
Clinical Infectious Diseases 28:794–799.

https://doi.org/10.1086/515193
- PubMed
- Google Scholar
1. Lessard S
2. Gatof ES
3. Beaudoin M
4. Schupp PG
5. Sher F
6. Ali A
7. Prehar S
8. Kurita R
9. Nakamura Y
10. Baena E
11. Ledoux J
12. Oceandy D
13. Bauer DE
14. Lettre G
(2017) An erythroid-specific atp2b4 enhancer mediates red blood cell hydration and malaria susceptibility
The Journal of Clinical Investigation 127:3065–3074.

https://doi.org/10.1172/JCI94378
- PubMed
- Google Scholar
1. Lewontin RC
(1972) The apportionment of human diversity’, in evolutionary biology
Taylor and Francis 6:381–398.

https://doi.org/10.1007/978-1-4684-9063-3_14
- Google Scholar
Preprint
1. Li H
(2013a) Aligning Sequence Reads, Clone Sequences and Assembly Contigs with BWA-MEM
arXiv.

http://arxiv.org/abs/1303.3997
- Google Scholar
1. Li J
2. Glessner JT
3. Zhang H
4. Hou C
5. Wei Z
6. Bradfield JP
7. Mentch FD
8. Guo Y
9. Kim C
10. Xia Q
11. Chiavacci RM
12. Thomas KA
13. Qiu H
14. Grant SFA
15. Furth SL
16. Hakonarson H
17. Sleiman PMA
(2013b) GWAS of blood cell Traits identifies novel associated loci and epistatic interactions in caucasian and african-american children
Human Molecular Genetics 22:1457–1464.

https://doi.org/10.1093/hmg/dds534
- PubMed
- Google Scholar
(2020) LDTRAIT: An online tool for identifying published phenotype associations in linkage disequilibrium
Cancer Research 80:3443–3446.

https://doi.org/10.1158/0008-5472.CAN-20-0985
- PubMed
- Google Scholar
1. Lo KS
2. Wilson JG
3. Lange LA
4. Folsom AR
5. Galarneau G
6. Ganesh SK
7. Grant SFA
8. Keating BJ
9. McCarroll SA
10. O’Donnell CJ
11. Palmas W
12. Tang W
13. Tracy RP
14. Reiner AP
15. Lettre G
(2011) Genetic association analysis highlights new loci that modulate hematological trait variation in Caucasians and african Americans
Human Genetics 129:307–317.

https://doi.org/10.1007/s00439-010-0925-1
- PubMed
- Google Scholar
(2019) Variant calling on the GRCH38 assembly with the data from phase three of the 1000 Genomes project
Wellcome Open Research 4:50.

https://doi.org/10.12688/wellcomeopenres.15126.2
- PubMed
- Google Scholar
1. Luzzatto L
(2012) Sickle cell anaemia and malaria
Mediterranean Journal of Hematology and Infectious Diseases 4:e2012065.

https://doi.org/10.4084/MJHID.2012.065
- PubMed
- Google Scholar
1. Ma S
2. Cahalan S
3. LaMonte G
4. Grubaugh ND
5. Zeng W
6. Murthy SE
7. Paytas E
8. Gamini R
9. Lukacs V
10. Whitwam T
11. Loud M
12. Lohia R
13. Berry L
14. Khan SM
15. Janse CJ
16. Bandell M
17. Schmedt C
18. Wengelnik K
19. Su AI
20. Honore E
21. Winzeler EA
22. Andersen KG
23. Patapoutian A
(2018) Common piezo1 allele in african populations causes rbc dehydration and attenuates plasmodium infection
Cell 173:443–455.

https://doi.org/10.1016/j.cell.2018.02.047
- PubMed
- Google Scholar
(2020) Pervasive strong selection at the level of codon usage bias in Drosophila melanogaster
Genetics 214:511–528.

https://doi.org/10.1534/genetics.119.302542
- PubMed
- Google Scholar
(2005) Heritability of malaria in Africa
PLOS Medicine 2:20340.

https://doi.org/10.1371/journal.pmed.0020340
- PubMed
- Google Scholar
1. Malaria Genomic Epidemiology Network
(2014) Reappraisal of known malaria resistance loci in a large multicenter study
Nature Genetics 46:1197–1204.

https://doi.org/10.1038/ng.3107
- PubMed
- Google Scholar
1. Malaria Genomic Epidemiology Network
(2019) Insights into malaria susceptibility using genome-wide data on 17,000 individuals from Africa, Asia and Oceania
Nature Communications 10:5732.

https://doi.org/10.1038/s41467-019-13480-z
- PubMed
- Google Scholar
1. Mayer DCG
2. Cofie J
3. Jiang L
4. Hartl DL
5. Tracy E
6. Kabat J
7. Mendoza LH
8. Miller LH
(2009) Glycophorin B is the erythrocyte receptor of Plasmodium falciparum erythrocyte-binding ligand, EBL-1
PNAS 106:5348–5352.

https://doi.org/10.1073/pnas.0900878106
- PubMed
- Google Scholar
1. Mills RE
2. Luttig CT
3. Larkins CE
4. Beauchamp A
5. Tsui C
6. Pittard WS
7. Devine SE
(2006) An initial map of insertion and deletion (INDEL) variation in the human genome
Genome Research 16:1182–1190.

https://doi.org/10.1101/gr.4565806
- PubMed
- Google Scholar
1. Mockenhaupt FP
(2000) Anaemia in pregnant Ghanaian women: importance of malaria, iron deficiency, and haemoglobinopathies’, Transactions of the Royal Society of Tropical Medicine and Hygiene
Royal Society of Tropical Medicine and Hygiene 94:477–483.

https://doi.org/10.1016/S0035-9203(00)90057-9
- Google Scholar
1. Moser KA
(2020) ‘strains used in whole organism Plasmodium falciparum vaccine trials differ in genome structure, sequence, and immunogenic potential’, genome medicine 2020 12:1
BioMed Central 12:1–17.

https://doi.org/10.1186/S13073-019-0708-9
- Google Scholar
1. Nagayasu E
2. Ito M
3. Akaki M
4. Nakano Y
5. Kimura M
6. Looareesuwan S
7. Aikawa M
(2001) CR1 density polymorphism on erythrocytes of falciparum malaria patients in Thailand
The American Journal of Tropical Medicine and Hygiene 64:1–5.

https://doi.org/10.4269/ajtmh.2001.64.1.11425154
- PubMed
- Google Scholar
1. Ndila CM
2. Uyoga S
3. Macharia AW
4. Nyutu G
5. Peshu N
6. Ojal J
7. Shebe M
8. Awuondo KO
9. Mturi N
10. Tsofa B
11. Sepúlveda N
12. Clark TG
13. Band G
14. Clarke G
15. Rowlands K
16. Hubbart C
17. Jeffreys A
18. Kariuki S
19. Marsh K
20. Mackinnon M
21. Maitland K
22. Kwiatkowski DP
23. Rockett KA
24. Williams TN
25. MalariaGEN Consortium
(2018) Human candidate gene polymorphisms and risk of severe malaria in children in Kilifi, Kenya: A case-control association study
The Lancet 5:e333–e345.

https://doi.org/10.1016/S2352-3026(18)30107-8
- PubMed
- Google Scholar
1. Nguetse CN
2. Purington N
3. Ebel ER
4. Shakya B
5. Tetard M
6. Kremsner PG
7. Velavan TP
8. Egan ES
(2020) A common polymorphism in the mechanosensitive ion channel piezo1 is associated with protection from severe malaria in humans
PNAS 117:9074–9081.

https://doi.org/10.1073/pnas.1919843117
- PubMed
- Google Scholar
1. Novembre J
2. Di Rienzo A
(2009) Spatial patterns of variation due to natural selection in humans
Nature Reviews. Genetics 10:745–755.

https://doi.org/10.1038/nrg2632
- PubMed
- Google Scholar
Conference
1. Okwa OO
(2012) InTech
Malaria parasites, malaria parasites.

https://doi.org/10.5772/1477
- Google Scholar
1. Otto TD
2. Gilabert A
3. Crellen T
4. Böhme U
5. Arnathau C
6. Sanders M
7. Oyola SO
8. Okouga AP
9. Boundenga L
10. Willaume E
11. Ngoubangoye B
12. Moukodoum ND
13. Paupy C
14. Durand P
15. Rougeron V
16. Ollomo B
17. Renaud F
18. Newbold C
19. Berriman M
20. Prugnolle F
(2018) Genomes of all known members of a plasmodium subgenus reveal paths to virulent human malaria
Nature Microbiology 3:687–697.

https://doi.org/10.1038/s41564-018-0162-2
- PubMed
- Google Scholar
(2021) Multiple-ancestry genome-wide association study identifies 27 loci associated with measures of hemolysis following blood storage
The Journal of Clinical Investigation 131:146077.

https://doi.org/10.1172/JCI146077
- PubMed
- Google Scholar
1. Pankratov V
2. Montinaro F
3. Kushniarevich A
4. Hudjashov G
5. Jay F
6. Saag L
7. Flores R
8. Marnetto D
9. Seppel M
10. Kals M
11. Võsa U
12. Taccioli C
13. Möls M
14. Milani L
15. Aasa A
16. Lawson DJ
17. Esko T
18. Mägi R
19. Pagani L
20. Metspalu A
21. Metspalu M
(2020) Differences in local population history at the finest level: The case of the estonian population
European Journal of Human Genetics 28:1580–1591.

https://doi.org/10.1038/s41431-020-0699-4
- PubMed
- Google Scholar
(1978) Cellular mechanism for the protective effect of haemoglobin S against P. falciparum malaria
Nature 274:701–703.

https://doi.org/10.1038/274701a0
- PubMed
- Google Scholar
(2018) Hematological parameters and red blood cell morphological abnormality of glucose-6-phosphate dehydrogenase deficiency co-inherited with thalassemia
Hematology/Oncology and Stem Cell Therapy 11:18–24.

https://doi.org/10.1016/j.hemonc.2017.05.029
- PubMed
- Google Scholar
1. Perry GS
2. Byers T
3. Yip R
4. Margen S
(1992) Iron nutrition does not account for the hemoglobin differences between blacks and whites
The Journal of Nutrition 122:1417–1424.

https://doi.org/10.1093/jn/122.7.1417
- PubMed
- Google Scholar
1. Pilia G
2. Chen W-M
3. Scuteri A
4. Orrú M
5. Albai G
6. Dei M
7. Lai S
8. Usala G
9. Lai M
10. Loi P
11. Mameli C
12. Vacca L
13. Deiana M
14. Olla N
15. Masala M
16. Cao A
17. Najjar SS
18. Terracciano A
19. Nedorezov T
20. Sharov A
21. Zonderman AB
22. Abecasis GR
23. Costa P
24. Lakatta E
25. Schlessinger D
(2006) Heritability of cardiovascular and personality traits in 6,148 Sardinians
PLOS Genetics 2:e132.

https://doi.org/10.1371/journal.pgen.0020132
- PubMed
- Google Scholar
1. Purcell S
2. Neale B
3. Todd-Brown K
4. Thomas L
5. Ferreira MAR
6. Bender D
7. Maller J
8. Sklar P
9. de Bakker PIW
10. Daly MJ
11. Sham PC
(2007) PLINK: A tool set for whole-genome association and population-based linkage analyses
American Journal of Human Genetics 81:559–575.

https://doi.org/10.1086/519795
- PubMed
- Google Scholar
1. Rooks H
2. Brewin J
3. Gardner K
4. Chakravorty S
5. Menzel S
6. Hannemann A
7. Gibson J
8. Rees DC
(2019) A gain of function variant in piezo1 (E756DEL) and sickle cell disease
Haematologica 104:e91–e93.

https://doi.org/10.3324/haematol.2018.202697
- PubMed
- Google Scholar
(2002) Genetic structure of human populations
Science 298:2381–2385.

https://doi.org/10.1126/science.1078311
- PubMed
- Google Scholar
1. Rosenberg NA
(2011) A population-genetic perspective on the similarities and differences among worldwide human populations
Human Biology 83:659–684.

https://doi.org/10.3378/027.083.0601
- PubMed
- Google Scholar
1. Rowe JA
(2007) Blood group O protects against severe Plasmodium falciparum malaria through the mechanism of reduced rosetting
PNAS 104:17471–17476.

https://doi.org/10.1073/pnas.0705390104
- Google Scholar
1. Ruwende C
2. Hill A
(1998) Glucose-6-phosphate dehydrogenase deficiency and malaria
Journal of Molecular Medicine 76:581–588.

https://doi.org/10.1007/s001090050253
- PubMed
- Google Scholar
1. Sauna ZE
2. Kimchi-Sarfaty C
(2011) Understanding the contribution of synonymous mutations to human disease
Nature Reviews. Genetics 12:683–691.

https://doi.org/10.1038/nrg3051
- PubMed
- Google Scholar
1. Schulman S
2. Roth EF Jr
3. Cheng B
4. Rybicki AC
5. Sussman II
6. Wong M
7. Wang W
8. Ranney HM
9. Nagel RL
10. Schwartz RS
(1990) Growth of Plasmodium falciparum in human erythrocytes containing abnormal membrane proteins
PNAS 87:7339–7343.

https://doi.org/10.1073/pnas.87.18.7339
- PubMed
- Google Scholar
(2011) Diurnal variation of hematology parameters in healthy young males: The Bispebjerg study of diurnal variations
Scandinavian Journal of Clinical and Laboratory Investigation 71:532–541.

https://doi.org/10.3109/00365513.2011.602422
- PubMed
- Google Scholar
1. Sohail M
2. Maier RM
3. Ganna A
4. Bloemendal A
5. Martin AR
6. Turchin MC
7. Chiang CW
8. Hirschhorn J
9. Daly MJ
10. Patterson N
11. Neale B
12. Mathieson I
13. Reich D
14. Sunyaev SR
(2019) Polygenic adaptation on height is overestimated due to uncorrected stratification in genome-wide association studies
eLife 8:e39702.

https://doi.org/10.7554/eLife.39702
- PubMed
- Google Scholar
1. Sundararaman SA
2. Plenderleith LJ
3. Liu W
4. Loy DE
5. Learn GH
6. Li Y
7. Shaw KS
8. Ayouba A
9. Peeters M
10. Speede S
11. Shaw GM
12. Bushman FD
13. Brisson D
14. Rayner JC
15. Sharp PM
16. Hahn BH
(2016) Genomes of cryptic chimpanzee plasmodium species reveal key evolutionary events leading to human malaria
Nature Communications 7:11078.

https://doi.org/10.1038/ncomms11078
- PubMed
- Google Scholar
(2014) Synonymous mutations frequently act as driver mutations in human cancers
Cell 156:1324–1335.

https://doi.org/10.1016/j.cell.2014.01.051
- PubMed
- Google Scholar
1. Tarazona-Santos E
2. Castilho L
3. Amaral DRT
4. Costa DC
5. Furlani NG
6. Zuccherato LW
7. Machado M
8. Reid ME
9. Zalis MG
10. Rossit AR
11. Santos SEB
12. Machado RL
13. Lustigman S
(2011) Population genetics of GYPB and association study between GYPB*S/s polymorphism and susceptibility to P. falciparum infection in the Brazilian Amazon
PLOS ONE 6:e16123.

https://doi.org/10.1371/journal.pone.0016123
- PubMed
- Google Scholar
1. Tibshirani R
(1994)
Regression shrinkage and selection via the lasso

JSTOR 58:267–288.
- Google Scholar
1. Tiffert T
2. Lew VL
3. Ginsburg H
4. Krugliak M
5. Croisille L
6. Mohandas N
(2005) The hydration state of human red blood cells and their susceptibility to invasion by Plasmodium falciparum
Blood 105:4853–4860.

https://doi.org/10.1182/blood-2004-12-4948
- PubMed
- Google Scholar
1. Timmann C
2. Thye T
3. Vens M
4. Evans J
5. May J
6. Ehmen C
7. Sievertsen J
8. Muntau B
9. Ruge G
10. Loag W
11. Ansong D
12. Antwi S
13. Asafo-Adjei E
14. Nguah SB
15. Kwakye KO
16. Akoto AOY
17. Sylverken J
18. Brendel M
19. Schuldt K
20. Loley C
21. Franke A
22. Meyer CG
23. Agbenyega T
24. Ziegler A
25. Horstmann RD
(2012) Genome-wide association study indicates two novel resistance loci for severe malaria
Nature 489:443–446.

https://doi.org/10.1038/nature11334
- PubMed
- Google Scholar
1. Van der Auwera GA
2. Carneiro MO
3. Hartl C
4. Poplin R
5. Del Angel G
6. Levy-Moonshine A
7. Jordan T
8. Shakir K
9. Roazen D
10. Thibault J
11. Banks E
12. Garimella KV
13. Altshuler D
14. Gabriel S
15. DePristo MA
(2013) From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline
Current Protocols in Bioinformatics 43:bi1110s43.

https://doi.org/10.1002/0471250953.bi1110s43
- PubMed
- Google Scholar
1. van der Harst P
2. Zhang W
3. Mateo Leach I
4. Rendon A
5. Verweij N
6. Sehmi J
7. Paul DS
8. Elling U
9. Allayee H
10. Li X
11. Radhakrishnan A
12. Tan S-T
13. Voss K
14. Weichenberger CX
15. Albers CA
16. Al-Hussani A
17. Asselbergs FW
18. Ciullo M
19. Danjou F
20. Dina C
21. Esko T
22. Evans DM
23. Franke L
24. Gögele M
25. Hartiala J
26. Hersch M
27. Holm H
28. Hottenga J-J
29. Kanoni S
30. Kleber ME
31. Lagou V
32. Langenberg C
33. Lopez LM
34. Lyytikäinen L-P
35. Melander O
36. Murgia F
37. Nolte IM
38. O’Reilly PF
39. Padmanabhan S
40. Parsa A
41. Pirastu N
42. Porcu E
43. Portas L
44. Prokopenko I
45. Ried JS
46. Shin S-Y
47. Tang CS
48. Teumer A
49. Traglia M
50. Ulivi S
51. Westra H-J
52. Yang J
53. Zhao JH
54. Anni F
55. Abdellaoui A
56. Attwood A
57. Balkau B
58. Bandinelli S
59. Bastardot F
60. Benyamin B
61. Boehm BO
62. Cookson WO
63. Das D
64. de Bakker PIW
65. de Boer RA
66. de Geus EJC
67. de Moor MH
68. Dimitriou M
69. Domingues FS
70. Döring A
71. Engström G
72. Eyjolfsson GI
73. Ferrucci L
74. Fischer K
75. Galanello R
76. Garner SF
77. Genser B
78. Gibson QD
79. Girotto G
80. Gudbjartsson DF
81. Harris SE
82. Hartikainen A-L
83. Hastie CE
84. Hedblad B
85. Illig T
86. Jolley J
87. Kähönen M
88. Kema IP
89. Kemp JP
90. Liang L
91. Lloyd-Jones H
92. Loos RJF
93. Meacham S
94. Medland SE
95. Meisinger C
96. Memari Y
97. Mihailov E
98. Miller K
99. Moffatt MF
100. Nauck M
101. Novatchkova M
102. Nutile T
103. Olafsson I
104. Onundarson PT
105. Parracciani D
106. Penninx BW
107. Perseu L
108. Piga A
109. Pistis G
110. Pouta A
111. Puc U
112. Raitakari O
113. Ring SM
114. Robino A
115. Ruggiero D
116. Ruokonen A
117. Saint-Pierre A
118. Sala C
119. Salumets A
120. Sambrook J
121. Schepers H
122. Schmidt CO
123. Silljé HHW
124. Sladek R
125. Smit JH
126. Starr JM
127. Stephens J
128. Sulem P
129. Tanaka T
130. Thorsteinsdottir U
131. Tragante V
132. van Gilst WH
133. van Pelt LJ
134. van Veldhuisen DJ
135. Völker U
136. Whitfield JB
137. Willemsen G
138. Winkelmann BR
139. Wirnsberger G
140. Algra A
141. Cucca F
142. d’Adamo AP
143. Danesh J
144. Deary IJ
145. Dominiczak AF
146. Elliott P
147. Fortina P
148. Froguel P
149. Gasparini P
150. Greinacher A
151. Hazen SL
152. Jarvelin M-R
153. Khaw KT
154. Lehtimäki T
155. Maerz W
156. Martin NG
157. Metspalu A
158. Mitchell BD
159. Montgomery GW
160. Moore C
161. Navis G
162. Pirastu M
163. Pramstaller PP
164. Ramirez-Solis R
165. Schadt E
166. Scott J
167. Shuldiner AR
168. Smith GD
169. Smith JG
170. Snieder H
171. Sorice R
172. Spector TD
173. Stefansson K
174. Stumvoll M
175. Tang WHW
176. Toniolo D
177. Tönjes A
178. Visscher PM
179. Vollenweider P
180. Wareham NJ
181. Wolffenbuttel BHR
182. Boomsma DI
183. Beckmann JS
184. Dedoussis GV
185. Deloukas P
186. Ferreira MA
187. Sanna S
188. Uda M
189. Hicks AA
190. Penninger JM
191. Gieger C
192. Kooner JS
193. Ouwehand WH
194. Soranzo N
195. Chambers JC
(2012) Seventy-five genetic loci influencing the human red blood cell
Nature 492:369–375.

https://doi.org/10.1038/nature11677
- PubMed
- Google Scholar
1. Vuckovic D
2. Bao EL
3. Akbari P
4. Lareau CA
5. Mousas A
6. Jiang T
7. Chen MH
8. Raffield LM
9. Tardaguila M
10. Huffman JE
11. Ritchie SC
12. Megy K
13. Ponstingl H
14. Penkett CJ
15. Albers PK
16. Wigdor EM
17. Sakaue S
18. Moscati A
19. Manansala R
20. Lo KS
21. Qian H
22. Akiyama M
23. Bartz TM
24. Ben-Shlomo Y
25. Beswick A
26. Bork-Jensen J
27. Bottinger EP
28. Brody JA
29. van Rooij FJA
30. Chitrala KN
31. Wilson PWF
32. Choquet H
33. Danesh J
34. Di Angelantonio E
35. Dimou N
36. Ding J
37. Elliott P
38. Esko T
39. Evans MK
40. Felix SB
41. Floyd JS
42. Broer L
43. Grarup N
44. Guo MH
45. Guo Q
46. Greinacher A
47. Haessler J
48. Hansen T
49. Howson JMM
50. Huang W
51. Jorgenson E
52. Kacprowski T
53. Kähönen M
54. Kamatani Y
55. Kanai M
56. Karthikeyan S
57. Koskeridis F
58. Lange LA
59. Lehtimäki T
60. Linneberg A
61. Liu Y
62. Lyytikäinen LP
63. Manichaikul A
64. Matsuda K
65. Mohlke KL
66. Mononen N
67. Murakami Y
68. Nadkarni GN
69. Nikus K
70. Pankratz N
71. Pedersen O
72. Preuss M
73. Psaty BM
74. Raitakari OT
75. Rich SS
76. Rodriguez BAT
77. Rosen JD
78. Rotter JI
79. Schubert P
80. Spracklen CN
81. Surendran P
82. Tang H
83. Tardif JC
84. Ghanbari M
85. Völker U
86. Völzke H
87. Watkins NA
88. Weiss S
89. VA Million Veteran Program
90. Cai N
91. Kundu K
92. Watt SB
93. Walter K
94. Zonderman AB
95. Cho K
96. Li Y
97. Loos RJF
98. Knight JC
99. Georges M
100. Stegle O
101. Evangelou E
102. Okada Y
103. Roberts DJ
104. Inouye M
105. Johnson AD
106. Auer PL
107. Astle WJ
108. Reiner AP
109. Butterworth AS
110. Ouwehand WH
111. Lettre G
112. Sankaran VG
113. Soranzo N
(2020) The polygenic and monogenic basis of blood traits and diseases
Cell 182:1214–1231.

https://doi.org/10.1016/j.cell.2020.08.008
- PubMed
- Google Scholar
1. Walliker D
2. Quakyi IA
3. Wellems TE
4. McCutchan TF
5. Szarfman A
6. London WT
7. Corcoran LM
8. Burkot TR
9. Carter R
(1987) Genetic analysis of the human malaria parasite Plasmodium falciparum
Science 236:1661–1666.

https://doi.org/10.1126/SCIENCE.3299700
- PubMed
- Google Scholar
1. Wang K
2. Li M
3. Hakonarson H
(2010) ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data
Nucleic Acids Research 38:e164.

https://doi.org/10.1093/nar/gkq603
- PubMed
- Google Scholar
1. Weatherall DJ
(2001) Phenotype-genotype relationships in monogenic disease: Lessons from the thalassaemias
Nature Reviews. Genetics 2:245–255.

https://doi.org/10.1038/35066048
- PubMed
- Google Scholar
(1985) Genetic and environmental influences on the size and number of cells in the blood
Genetic Epidemiology 2:133–144.

https://doi.org/10.1002/gepi.1370020204
- PubMed
- Google Scholar
Website
1. WHO
(2019) Malaria eradication: Benefits, future scenarios and feasibility. Executive summary, who strategic advisory group on malaria eradication
Accessed August 7, 2021.

https://www.who.int/publications/i/item/who-cds-gmp-2019-10
1. Williams TN
(2006) Human red blood cell polymorphisms and malaria
Current Opinion in Microbiology 9:388–394.

https://doi.org/10.1016/j.mib.2006.06.009
- PubMed
- Google Scholar
1. Wright GJ
2. Rayner JC
(2014) Plasmodium falciparum erythrocyte invasion: Combining function with immune evasion
PLOS Pathogens 10:e1003943.

https://doi.org/10.1371/journal.ppat.1003943
- PubMed
- Google Scholar
(1971)
Human glucose-6-phosphate dehydrogenase variants bulletin of the World Health Organization

World Health Organization 45:243–253.
- PubMed
- Google Scholar
1. Zámbó B
2. Várady G
3. Padányi R
4. Szabó E
5. Németh A
6. Langó T
7. Enyedi Á
8. Sarkadi B
(2017) Decreased calcium pump expression in human erythrocytes is connected to a minor haplotype in the atp2b4 gene
Cell Calcium 65:73–79.

https://doi.org/10.1016/j.ceca.2017.02.001
- PubMed
- Google Scholar
1. Zhang Y
(2015) Multiple stiffening effects of nanoscale knobs on human red blood cells infected with Plasmodium falciparum malaria parasite
PNAS 112:6068–6073.

https://doi.org/10.1073/pnas.1505584112
- Google Scholar

Article and author information

Author details

Emily R Ebel
1. Department of Biology, Stanford University, Stanford, United States
2. Department of Pediatrics, Stanford University School of Medicine, Stanford, United States
Contribution
Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Visualization, Writing – original draft, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-3244-4250
Frans A Kuypers

Children's Hospital Oakland Research Institute, Oakland, United States

Contribution
Investigation, Methodology, Resources, Writing – review and editing

Competing interests
No competing interests declared
Carrie Lin

Department of Pediatrics, Stanford University School of Medicine, Stanford, United States

Contribution
Investigation

Competing interests
No competing interests declared
Dmitri A Petrov

Department of Biology, Stanford University, Stanford, United States

Contribution
Conceptualization, Methodology, Resources, Supervision, Writing – review and editing

For correspondence
dpetrov@stanford.edu

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-3664-9130
Elizabeth S Egan
1. Department of Pediatrics, Stanford University School of Medicine, Stanford, United States
2. Department of Microbiology & Immunology, Stanford University School of Medicine, Stanford, United States
Contribution
Conceptualization, Investigation, Methodology, Resources, Supervision, Writing – review and editing

For correspondence
eegan@stanford.edu

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-2112-7700

Funding

Stanford University School of Medicine

Elizabeth S Egan

Stanford University School of Medicine

Elizabeth S Egan

Stanford Center for Computational, Evolutionary and Human Genomics

Emily R Ebel

National Institute of General Medical Sciences (5R35GM118165-05)

Dmitri A Petrov

National Science Foundation (DGE-1247312)

Emily R Ebel

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

The authors gratefully acknowledge the invaluable participation of all volunteer blood donors. Nick Bondy, Bertil Glader, Sandra Larkin, Brian Fleischer, Ashley Dunn, Talal Seddik, Trung Pham, David Vu, and Spectrum Child Health provided crucial assistance in donor coordination and sample processing. P. falciparum strain Th.026.09 was kindly provided by Daouda Ndiaye and Sarah Volkman. For quantitative advice, the authors thank Grant Kinsler, Jonathan Pritchard, Susan Holmes, and the Stanford Statistics Consulting Group. This study was primarily supported by a Pilot Early Career award from the Stanford Maternal Child Health Research Institute and a Gabilan Faculty Award from the Stanford University School of Medicine Office of Faculty Development and Diversity (ESE). ERE was an NSF Graduate Research Fellow (DGE-1247312) and received additional support from the Stanford Center for Computational, Evolutionary, and Human Genomics. DAP was funded through an NIH MIRA award 5R35GM118165-05. ESE is a Tashia and John Morgridge Endowed Faculty Scholar in Pediatric Translational Medicine through the Stanford Maternal Child Health Research Institute. Local blood samples were drawn at the Stanford Clinical and Translational Research Unit, which is supported by CTSA Grant UL1 TR001085.

Ethics

Human subjects: Written informed consent and consent to publish was obtained from each subject and/or their parent as part of a protocol approved by the Stanford University Institutional Review Board (#40479).

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.