Antibodies to repeat-containing antigens in Plasmodium falciparum are exposure-dependent and short-lived in children in natural malaria infections

  1. Madhura Raghavan
  2. Katrina L Kalantar
  3. Elias Duarte
  4. Noam Teyssier
  5. Saki Takahashi
  6. Andrew F Kung
  7. Jayant V Rajan
  8. John Rek
  9. Kevin KA Tetteh
  10. Chris Drakeley
  11. Isaac Ssewanyana
  12. Isabel Rodriguez-Barraquer
  13. Bryan Greenhouse  Is a corresponding author
  14. Joseph L DeRisi  Is a corresponding author
  1. University of California, San Francisco, United States
  2. Chan Zuckerberg Initiative, United States
  3. University of California, Berkeley, United States
  4. Infectious Diseases Research Collaboration, Uganda
  5. London School of Hygiene and Tropical Medicine, United Kingdom
  6. Chan Zuckerberg Biohub, United States
7 figures, 3 tables and 9 additional files

Figures

Figure 1 with 1 supplement
PhIP-seq overview and analysis pipeline.

Falciparome phage library displays the proteome of Plasmodium falciparum in 62-aa peptides with 25-aa step size on T7 phage and also includes variant sequences of many antigens, including major …

Figure 1—figure supplement 1
Pipeline for library construction.

Input sequences of different groups were filtered with CD-HIT to remove similar sequences with more than the indicated % identity in Table 2. The filtered sequences were then processed into peptides …

Figure 2 with 6 supplements
PhIP-seq with Falciparome captures known, novel antigens and relationships between age, exposure and breadth of seroreactive regions.

(a) Heatmap of Z-score enrichment over US controls for seroreactive peptides (rows) with >10% seropositivity across different age groups in the moderate and high exposure cohorts. Peptides are …

Figure 2—figure supplement 1
Histogram of read counts of Falciparome phage library.

Read counts corresponding to the 5th and 95th percentile in the distribution (indicated in blue) are within a 16-fold difference. Cumulative density plot of the distribution is shown in red.

Figure 2—figure supplement 2
Technical replicates are well correlated.

Top - Pearson correlation matrix of depth-adjusted read counts across all samples. Technical replicates are placed symmetrically on rows and columns. Bottom three - Representative scatter plots of …

Figure 2—figure supplement 3
Target peptides are enriched in a sample-specific manner.

Top panel - PhIP-seq with polyclonal anti-GFAP enriches for GFAP peptides and enrichment is specific to IP with anti-GFAP, but is observed rarely in the Ugandan cohort and US controls. Left - …

Figure 2—figure supplement 4
Moving threshold analysis to determine optimal thresholds for calling peptides as seroreactive based on minimum Z-score and enrichment in a minimum number of samples in a group.

Box plots of resultant number of seroreactive peptides for corresponding thresholds are shown for Ugandan samples and US controls. The final thresholds for calling seroreactivity were selected based …

Figure 2—figure supplement 5
Breadth of non-redundant seroreactive peptide groups per person across age and exposure.

All seroreactive peptides in each person were collapsed based on sequence similarity (sharing of 7mer identical motifs). The resulting number of non-redundant groups was used as a measure of …

Figure 2—figure supplement 6
Breadth of seroreactivity in the variable regions of RIFIN and PfEMP1.

Top - Box plot of number of domain variants seroreactive in the variable region V2 of RIFINs. Significantly different groups (KS test <0.05) are marked with an *. Bottom - Heatmap of proportion of …

Figure 3 with 1 supplement
Tiled design of library facilitates high resolution characterization of seroreactive regions.

(a) Examples of previously well-characterized antigens and (b) novel/previously under-characterized antigens identified in this dataset. Average percentage of people seropositive at each residue …

Figure 3—figure supplement 1
Comparison of high-resolution localization of seroreactive regions identified in this study with regions identified through a peptide-array approach.

Location of seroreactive peptides identified in this dataset (red bar) and seroreactive 15-mer peptides identified using a high-density peptide array (black bar) in Jaenisch et.al. (peptides with …

Figure 4 with 1 supplement
Repeat elements are more enriched in seroreactive peptides within seroreactive proteins than non-seroreactive peptides.

(a) Distribution of cumulative frequency of repeat elements per protein is significantly higher (KS test p-value <0.05) in the seroreactive protein set than a randomly sampled subset of …

Figure 4—figure supplement 1
Distribution of repeat indices of seroreactive and non-seroreactive peptides within hit proteins for different lengths and degeneracy of the repeating motif.

Left three: Conservative substitutions ([GA],[ST],[DE],[NQ],[RHK],[LVI],[YFW]) are allowed at all positions in the motif. Right three: Identical residues at all positions in the motif. For all six …

Figure 5 with 4 supplements
Breadth of seroreactive repeat-containing peptides, but not non-repeat peptides, increases with exposure in children.

(a) Breadth of seroreactive non-repeat peptides per person is not significantly different (KS-test p-value >0.05) between the two exposure settings within each age group. (b) Breadth of seroreactive …

Figure 5—figure supplement 1
Breadth of repeat-containing peptides per person using different repeat index thresholds for categorizing repeat-containing peptides.

Age groups showing significant difference between the two transmission settings are marked by * based on a KS-test p-value <0.05.

Figure 5—figure supplement 2
Seropositivity of individual seroreactive repeat elements increases with exposure in children, but not adults.

Each dot represents a seroreactive repeat element and seropositivity for the repeat element in a given group was calculated as the percent of people in that group enriching for any seroreactive …

Figure 5—figure supplement 3
Controlling for time since infection status, breadth of seroreactive repeat-containing peptides, but not non-repeat peptides, still shows an increase with exposure in children.

Groups showing significant difference between the two transmission settings are marked by * based on a KS-test p-value <0.05.

Figure 5—figure supplement 4
Breadth of seroreactive repeat-containing peptides, but not non-repeat peptides, wanes with increased time since infection in the moderate exposure setting in children.

Groups showing significant differences are marked by * based on a KS-test p-value <0.05.

Figure 6 with 5 supplements
Extensive sharing of motifs observed among seroreactive proteins, with the most shared with PfEMP1 family.

(a) Pipeline to identify inter-protein motifs (6-9aa) significantly enriched (FDR <0.001) in seroreactive peptides from different seroreactive proteins (different colors) over background. Background …

Figure 6—figure supplement 1
Biochemical characteristics of inter-protein motifs.

Top - Histogram of net charge and hydrophobicity index of the 911 inter-protein motifs (7-aa motifs with at least five identical residues and up to two conservative substitutions) in comparison to a …

Figure 6—figure supplement 2
Inter-protein motifs are associated with seroreactivity.

(a) Design of the tiled peptide library showing segments in Peptide 4 overlapping with neighboring peptides. Start and end amino acid positions of each peptide are marked at either ends. (b) …

Figure 6—figure supplement 3
Co-occurrence of reactivity to peptides containing inter-protein motifs from different proteins within individuals.

Each plot in orange depicts the Cumulative Distribution Function (CDF) for the proportion of people showing reactivity in >y proteins for the set of inter-proten motifs shared among n proteins. The …

Figure 6—figure supplement 4
Histogram of number of other seroreactive proteins with which a seroreactive protein shares inter-protein motifs.
Figure 6—figure supplement 5
Network of seroreactive proteins outside the PfEMP1 network.

(a) All seroreactive proteins except PfEMP1 (b) Proteins with >30% seropositivity.

Author response image 1

Tables

Table 1
Characteristics of the Ugandan cohorts.
RegionAge group (yrs)No. of peopleProportion positive for infection at the time of sample collectionTime since last infection (days) - median (IQR)Incidence of symptomatic malaria per year - median (IQR)Household annual EIR* (infective bites / person) - median (IQR)
Tororo2–3100.518.5 (0,85)5.8 (2.9,7.7)56 (33,148)
4–6300.660 (0,45)3.6 (2.6,4.8)59 (38,84)
7–11300.630 (0,45)2.3 (2,4.3)46 (30,110)
>18300.70 (0,45)1.2 (0.9,1.6)49 (35,94)
Kanungu2–3100.1155 (61,190)1.7 (0.9,2)4.3 (4, 14)
4–6300.2114 (43,289)1.5 (0.7, 2.3)7.3 (4.5, 15)
7–11300.13121 (41,263)1.5 (0.6, 2)5.2 (4, 7)
>18300.2109 (38, 223)1.1 (0.8, 1.3)6.8 (4.8, 15.4)
  1. *

    EIR – Entomological Inoculation Rate.

Table 2
Composition of Falciparome phage library.
Input sequences before collapsing on similarityIdentity threshold for collapsing byCD-HIT# Final collapsed Protein sequences
P. falciparum reference proteome3D7, IT (10,771 total)99%6372
P. falciparum variant sequences
  • PfEMP1 (431 from 3D7, IT, IGH, RAJ116, PFCLIN, IT4, DD2 genomes)

  • RIFIN (all 3D7+IT)

  • STEVOR (all 3D7+IT)

  • SURFIN (all 3D7+IT + 15)

  • AMA1 (2)

  • CSP (6)

  • MSPDBL1 (6)

  • MSPDBL2 (5)

  • PfMC2TM (all 3D7+IT)

100%
(90% for CSP)
1205
Other variantsP. reichnowi PfEMP1 (PFREICH)
Anopheles - CE5 (5), SG6 (5)
Anopheles salivary proteins53 proteins from 19 Anopheles species as described in Figure 1 of Arcà et al., 201798%708
Vaccine/Viral/Toxin sequences
  • Tetanus

  • Diphtheria

  • Pertussis

  • EBV

  • Measles

  • Mumps

  • Rubella

  • Polio

  • RotoAB

98%
(90% for RotoAB)
684
Laboratory positive controls
  • GFAP

  • GFP

  • Gephryn

  • MYC, NR1

  • Tubulin (alpha/beta)

98%11
TOTAL PROTEINS8,980
TOTAL PEPTIDES238,068
Key resources table
Reagent type (species) or resourceDesignationSource or referenceIdentifiersAdditional information
Strain, strain background (E. coli)BLT5403Novagen/EMD Millipore, T7 Select KitCat# 70550–3
Strain, strain background (T7 Bacteriophage)T7 vector arms, Packaging extractNovagen/EMD Millipore, T7 Select KitCat# 70550–3
Genetic reagent (T7 Bacteriophage library)FalciparomeMade in this studySee Materials and Methods
Biological sample (Humans)Ugandan cohort plasmaKamya et al., 2015, Rek et al., 2016; Yeka et al., 2015
Biological sample (Humans)US control plasmaNew York Blood Center
AntibodyAnti-Glial Fibrillary Associated Protein (rabbit, polyclonal)AgilentCat# Z033429-21 ug used
Peptide, recombinant proteinProtein A conjugated magnetic beadsInvitrogen/Thermo Fisher SciCat# 10008D
Peptide, recombinant proteinProtein G conjugated magnetic beadsInvitrogen/Thermo Fisher SciCat# 10009D
Peptide, recombinant proteinBSA Fraction VSigma-AldrichCat# 10735094001
Peptide, recombinant proteinT4 ligaseNew England BioCat# M0202S
Peptide, recombinant proteinPhusion DNA PolymeraseNew England BioCat# M0530L
Commercial assay or kitT7 Select 10-3b Cloning kitEMD MilliporeCat# 70550–3
Commercial assay or kitAmpure XP BeadsBeckman CoulterCat# A63881
Software, algorithmCD-HITFu et al., 2012; Li and Godzik, 2006http://weizhongli-lab.org/cd-hit/
Software, algorithmnumpyOpen Sourcehttps://doi.org/10.1109/MCSE.2011.37
Software, algorithmscipyOpen Sourcehttps://www.nature.com/articles/s41592-019-0686-2
Software, algorithmMatplotlibOpen Sourcehttps://ieeeexplore.ieee.org/document/4160265
Software, algorithmCutadaptMartin, 2011https://cutadapt.readthedocs.io/en/stable/
Software, algorithmCytoscapeShannon et al., 2003https://cytoscape.org

Additional files

Supplementary file 1

List of 9927 seroreactive peptides identified in this dataset with their sequences.

https://cdn.elifesciences.org/articles/81401/elife-81401-supp1-v2.xlsx
Supplementary file 2

Top 40 proteins with highest seropositivity and associated literature.

https://cdn.elifesciences.org/articles/81401/elife-81401-supp2-v2.zip
Supplementary file 3

List of top 100 proteins with highest seropositivity used for GO analysis.

https://cdn.elifesciences.org/articles/81401/elife-81401-supp3-v2.zip
Supplementary file 4

Seropositivity rate (proportion of people seropositive) for all 9927 seroreactive peptides across different groups in the two exposure settings.

https://cdn.elifesciences.org/articles/81401/elife-81401-supp4-v2.xls
Supplementary file 5

Seropositivity rate (proportion of people seropositive) for top repeat elements across different groups in the two exposure settings.

https://cdn.elifesciences.org/articles/81401/elife-81401-supp5-v2.xlsx
Supplementary file 6

List of inter-protein motifs and the proteins sharing them.

Motifs reported here are 7-mers with at least 5 identical amino acids and up to two conservative substitutions (and no wildcards).

https://cdn.elifesciences.org/articles/81401/elife-81401-supp6-v2.xlsx
Supplementary file 7

Table describing the number of interprotein motifs obtained with varied parameters for calling the motifs.

https://cdn.elifesciences.org/articles/81401/elife-81401-supp7-v2.xlsx
Supplementary file 8

Gene network file for interprotein motifs (7-mers with at least 5 identical amino acids and up to two conservative substitutions (and no wildcards)).

Can be visualized on Cytoscape.

https://cdn.elifesciences.org/articles/81401/elife-81401-supp8-v2.zip
MDAR checklist
https://cdn.elifesciences.org/articles/81401/elife-81401-mdarchecklist1-v2.pdf

Download links