(A) Overview of the single-cell RNAseq protocol. Steps in the original Smart-seq2 protocol (Picelli et al., 2013) that resulted in significant gains are highlighted in orange. (B) Relative numbers …
(A) Individually sorted P. falciparum and P. berghei cells from a mixed pool revealed no doublets and little contamination. (B) Distributions of numbers of genes identified as expressed in our three …
(A) Purified asexual late blood stage of GFP P. falciparum and mCherry P. berghei were mixed at a 1:1 ratio, inactivated in RNAlater, and sorted individually by flow cytometry, gated on respective …
There was no apparent over- or under-representation of GC-rich regions.
(A) A combination of Principal Components Analysis (PCA), k-means clustering and comparison to bulk RNA-seq datasets was used to classify 144 high-quality P. berghei single cells, and revealed three …
Stage-specific genes at different expression levels, were identified from RNA-seq data from (Otto et al., 2014) for (A) asexual stages, (B) male gametocytes and (C) female gametocytes. Mean FPKM …
(A) A combination of Principal Components Analysis (PCA), k-means clustering and comparison to bulk RNA-seq datasets was used to classify 191 high-quality P. falciparum gametocytes. A consensus of …
(A) Pseudotime ordering (using [Trapnell et al., 2014]) of the asexual cells in was in close agreement with bulk RNA-seq datasets (predicted stage = consensus; see Materials and methods). (B) …
PCA of 155 P. falciparum cells colored by pseudotime (A) or Monocle state (B); identified trajectory branches are displayed as circled numbers 1 and 2. (C) Differentially expressed genes were …
A shared set of 651 genes identified as following a sigmoidal expression pattern through the intraerythrocytic developmental cycle (see Materials and methods) are shown in both bulk transcriptome …
A heatmap showing logged, mean-normalised expression values for late asexual parasites from (Poran et al., 2017) ordered by pseudotime. Genes were ordered as for Figure 4—figure supplement 2A …
(A) Expression of Plasmodium ApiAP2 genes in asexual parasites. Orthologous genes are presented on the same rows. (B) A co-expression network for P. berghei was built using significant positive and …
(A) P. falciparum genes with >= 50% of their variance attributed to cell-cycle associated latent variable one vary in pseudotime. After removing variation associated with the cell cycle, 56 genes …
We found that only the first two latent variables explained at least 5% of variation in cell cycle genes (red line).
berghei and P. falciparum. (a) The heatmap shows gene expression levels for multigene family members differentially expressed between male and female P. berghei gametocytes. * gene variably …
berghei and P. falciparum, respectively. (A) Pir gene expression was highly variable across male gametocytes. In addition, more pir genes were expressed in males than females. These are distinct …
(A) Expression of Plasmodium ApiAP2 genes in sexual parasites. Orthologous genes are presented on the same rows. (B) A co-expression network for P. berghei was built using significant positive and …
Despite being very similar in identity (88% at the nucleotide level), most reads deriving from these transcripts map uniquely. It is notable here that there appears to be variable splicing of coding …
These data show that dropout rates within each cluster are generally very low and expression levels are high but cover a range of values. This makes it unlikely that all the genes in a cluster would …
Different combinations of the protocol were tested by sequencing. Initial trials were performed with 2 µl of lysis buffer, this was increased to 4 µl to augment capture efficiency. Permutations of …
Conditions tested | Protocol | SSII, V30, 30 cycles | SSII, T30, 30 cycles | SmSc, T30, 30 cycles | SSII, T30, 25 cycles | SmSc, T30, 25 cycles | SmSc, T30, 25 cycles | SmSc, T30, 25 cycles | SmSc, T30, 25 cycles |
---|---|---|---|---|---|---|---|---|---|
Cells | Sexual | Asexual | Asexual | Asexual | Asexual | Sexual | Mixed blood | Asexual | |
Species | Pf | Pf | Pf | Pf | Pf | Pf | Pb | Pf | |
Lysis buffer volume | 2 µl | ✓ | |||||||
4 µl | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
Oligo Dt (IDT) | Anchored 30 bp | ✓ | |||||||
Non-Anchored 30 bp | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
Reverse transcriptase | Superscript II (Life Technologies) 10U | ✓ | ✓ | ✓ | |||||
Smartscribe (Clontech) 5U | ✓ | ✓ | ✓ | ✓ | ✓ | ||||
Cycle number | 25 | ✓ | ✓ | ✓ | ✓ | ✓ | |||
30 | ✓ | ✓ | ✓ | ||||||
Sequencing machine | HiSeq | ✓ | ✓ | ✓ | |||||
MiSeq | ✓ | ✓ | ✓ | ✓ | ✓ | ||||
Sequencing results summary | % rRNA | 5.7 | 33.5 | 36.2 | 6.4 | 18.4 | 17.8 | 16.7 | 34.8 |
% coding genes | 4.4 | 11.3 | 39.3 | 10.5 | 33 | 51.7 | 49 | 40.5 | |
% other | 90 | 55.2 | 24.4 | 83.1 | 48.6 | 30.5 | 34.2 | 24.6 | |
Median genes detected for 50k reads | 25 | 84 | 145 | 174 | 181 | 502.5 | NA | NA | |
Total cells | 5 | 6 | 6 | 6 | 6 | 237 | 182 | 174 | |
Cells passing filters | NA | NA | NA | NA | NA | 191 | 144 | 161 | |
Median gene count | NA | NA | NA | NA | NA | 2011 | 1922.5 | 1793 |
Marker genes identifying P. berghei mixed stage k-means clusters.
Genes identified as variable in asexual stage parasites
(a) Clusters of P. berghei genes in pseudotime. (b) GO term enrichment for clusters of P. berghei genes in pseudotime. GO class: bp = biological process, mf = molecular function, cc = cellular component. (c) Clusters of P. falciparum genes in pseudotime. (d) GO term enrichment for clusters of P. falciparum genes in pseudotime. (e) P. falciparum genes identified as variant independently of the cell cycle. Cell cycle variance is the proportion of the variance for that gene associated with the first two latent variables and therefore the cell cycle. Technical variance is the proportion of variance for that gene attributed technical noise. Biological variance is the variance left over and attributable to cell-cycle-independent variation. (f) GO term enrichment for P. falciparum cell-cycle-independent genes. (g) P. berghei genes identified as variant independently of the cell cycle.
Highly variable genes and enriched functions in P. berghei and P. falciparum gametocytes.
(a) Genes identified as variable in P. berghei female gametocytes. The p and q values were calculated using M3Drop. (b) GO term enrichment amongst gene from (a). (c) Genes identified as variable in P. berghei male gametocytes. (d) GO term enrichment amongst gene from (c). (e) Genes identified as variable in P. falciparum female gametocytes. (f) GO term enrichment amongst gene from (e).
Gene expression data for multigene families.
(a) Gene expression data for pirs in P. berghei cells underlying Figure 6—figure supplement 1A. (b) Gene expression data for vars in P. falciparum cells underlying Figure 3b. (c) Multigene family members differentially expressed between P. berghei male and females gametocytes. (d) Multigene family members differentially expressed between P. falciparum male and females gametocytes, based on bulk RNA-seq data from Lasonder et al. (2016).
Samples sequenced in this study
(a) Description of samples generated with the initial, unmodified Smart-seq2 protocol. (b) Description of samples generated with variants of the Smart-seq2 protocol, e.g. differing numbers of PCR cycles and different reverse transcriptases. (c) Samples used to assess contamination of single cells due to lysis. (d) Description of samples for P. berghei mixed blood stages. Sc3_k4 = clustering results for SC3 clustering of all cells with k = 4, sc3_k3 = SC3 clustering of all cells with k = 3, sc3_sex_k3 = SC3 clustering of only male and female gametocytes with k = 3 (used to identify outliers). Hoo is the best correlated timepoint from the Hoo et al. (2016) microarray data for each cell. Otto is the best correlated timepoint from the Otto et al RNA-seq data (Otto et al., 2014) for each cell. Consensus is our consensus call between the clustering and the correlations against these bulk datasets. Pass_filter is TRUE if that cell passed our filtering criteria. (e) Description of samples for P. falciparum asexual parasites. Lopez is the best correlated timepoint from the López-Barragán et al. (2011) bulk RNA-seq data. Otto is the best correlated timepoint from the Otto et al. (2010) bulk RNA-seq data. Pseudotime state is the path within pseudotime identified by Monocle. This was used to filter out minor paths. Pass_filter is TRUE if that cell passed our filtering criteria. (f) Description of samples for P. falciparum gametocytes. Lasonder is the best correlated samples from Lasonder et al. (2016) bulk RNA-seq data.
Gene count tables for the three large datasets included in the study.
(a) Read counts for P. berghei mixed blood stages. (b) Read counts for P. falciparum asexual parasites. (c) Read counts for P. falciparum gametocytes