ImmCellTyper facilitates systematic mass cytometry data analysis for deep immune profiling

  1. Jing Sun  Is a corresponding author
  2. Desmond Choy
  3. Nicolas Sompairac
  4. Shirin Jamshidi
  5. Michele Mishto
  6. Shahram Kordasti  Is a corresponding author
  1. Centre for Inflammation Biology and Cancer Immunology & Peter Gorer Department of Immunobiology, King's College London, United Kingdom
  2. School of Cancer and Pharmaceutical Sciences, King’s College London, United Kingdom
  3. Research Group of Molecular Immunology, Francis Crick Institute, United Kingdom
  4. Haematology Department, Guy’s Hospital, United Kingdom
  5. Department of Clinical and Molecular Sciences, Università Politecnica delle Marche, Italy
8 figures, 3 tables and 1 additional file

Figures

Figure 1 with 2 supplements
Schematic diagram of BinaryClust framework.

Semi-supervised classification is first performed on selected markers in the user-defined marker expression matrix to classify and annotate major cell types. Population-of-interest can be further extracted and explored using unsupervised clustering methods followed by differential analysis. Created with BioRender.com.

Figure 1—figure supplement 1
Manual hierarchical gating strategy for main cell linages from human peripheral blood mononuclear cell (PBMC) samples (myeloproliferative neoplasm [MPN] dataset, n = 9).

All FCS files were cleaned up to remove doublets, normalisation beads and debris (please refer to the methods for standard clean-up procedure), and pre-gated for CD45+ leukocytes. (A) Serial bi-axial scatter plots representing the gating diagram for T cell subsets (CD4 T cells, CD8 T cells, and gamma delta T cells), NK cells and dendritic cells based on the indicated phenotypic markers. (B) Serial bi-axial scatter plot indicating the gating strategy to isolate monocytes and B cells from leukocytes. All manual gating was done using Cytobank platform (https://premium.cytobank.org/cytobank/).

Figure 1—figure supplement 2
Clean-up procedure of cytometry by time-of-flight (CyTOF) data using Cytobank.
Figure 2 with 1 supplement
Agreement evaluation comparing manual gating and BinaryClust in myeloproliferative neoplasm (MPN) cohort (n = 9).

Manual gating of B cells, CD4 T cells, CD8 T cells, dendritic cells, NK cells, monocytes, and gamma delta T cells were performed by two independent experts using Cytobank, and mean values of the population percentages were calculated to compare with BinaryClust results. Each dot represents one patient sample. (A) Scatter plot showing the correlation between the two methods, with the red line indicating perfect agreement (correlation coefficient = 1). (B) Bland–Altman plots of the two measurement methods among all the cell populations, with the black line suggesting the mean observed difference and red dotted lines indicating limits of agreement (1.96× standard deviations).

Figure 2—figure supplement 1
Agreement evaluation between ImmCellTyper and manual gating in influenza dataset (n = 11).

Manual gating was performed using Cytobank and exported manually. (A) Correlation plots between ImmCellTyper results and manual gating results concerning percentages in CD4 T cells, gamma delta T cells, dendritic cells, NK cells, CD8 T cells, and B cells, with red line indicating perfect agreement (correlation coefficient = 1). (B) Bland–Altman plots of the two measurements in the indicated populations, with black line suggesting mean difference between measurements and dotted red line indicating limits of agreement (1.96× standard deviations). (C) Calculation of precision, recall, F-measure for ImmCellTyper method in comparison to manual gating in the indicated cell populations.

Figure 3 with 1 supplement
Comparison of manual gating (manual1 and manual2), BinaryClust, and flowSOM clustering results in myeloproliferative neoplasm (MPN) cohort (n = 9).

Interaction plots showing the individual measurement (percentage) of each study participant with indicated colours by different methods across main cell lineages (B cells, CD8 T cells, gamma delta T cells, NK cells, dendritic cells, monocytes, and CD4 T cells); analysis of variance (ANOVA) was used for statistical testing, and significance was marked by asterisk. *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001.

Figure 3—figure supplement 1
Boxplots of the indicated cell percentages generated by different methods.

Statistical significance was marked by asterisk. *p < 0.05, **p < 0.01, ***p < 0.001.

Comparison of BinaryClust, flowSOM, and linear discriminant analysis (LDA) on speed.

Bar chart showing runtime (in seconds) of the three methods in three different datasets.

Figure 5 with 1 supplement
Cell type characterisation and visualisation using ImmCellTyper pipeline in myeloproliferative neoplasm (MPN) dataset (n = 9).

(A) Intensity distribution of selected phenotypic markers used for BinaryClust classification, coloured by sample_id. (B) Pre-defined expression classification matrix for the MPN dataset, ‘+’ indicates positive, ‘-’ indicates negative, and ‘A’ suggests ‘any’. (C) Proportion of the main cell lineages of all cells in the concatenated FCS files after classification. (D) Median marker expression heatmap of BinaryClust classification results. (E) UMAP plot of random downsample of 2000 cells per patient coloured by main cell types based on BinaryClust classification (left) and manual gating results (right). (F) UMAP plots coloured by normalised expression of indicated markers (CD3, CD4, CD8a, CD20, CD19, CD14, and CD56) across 2000 cells per sample.

Figure 5—figure supplement 1
Comparison of different clustering methods in MPN cohort.

(A) UMAP plots of normalised expression of indicated markers (CD66b, HLADR, TCRgd, CD20, CD16, and CD161) across 2000 cells per sample in myeloproliferative neoplasm (MPN) dataset. (B) FlowSOM clustering was performed on the same dataset to compare with BinarClust, k = 20 was chosen followed by manual annotation of each cluster. UMAP plot was projected with merged flowSOM clusters with biological annotation (downsample 2000 cells per sample). (C) The corresponding median marker expression heatmap after flowSOM clustering and annotation.

Applying ImmCellTyper pipeline on COVID-19 patient dataset (n = 82) published by Chevrier et al., 2021.

(A) Marker intensity distribution of selected phenotypic markers used for BinaryClust classification, coloured by disease severity (n = 22 healthy individuals, 28 mild COVID-19 patients, and 38 severe COVID-19 patients). (B) Pre-defined marker expression classification matrix used for BinaryClust. (C) t-Distributed stochastic neighbour embedding (t-SNE) plots, with 1000 cells per sample, were coloured by the main cell types generated by BinaryClust and faceted by different study groups. (D) The corresponding median marker expression heatmap of BinaryClust results for the COVID-19 dataset.

Figure 7 with 1 supplement
Quantification and statistical analysis comparing the study conditions in COVID-19 dataset (n = 82).

(A) Stacked histogram of main cell type composition per individual generated by BinaryClust, and grouped by study conditions (healthy, mild, and severe). (B) Boxplots representing cell abundance frequencies among the study conditions, faceted by different main cell types. (C) State marker expression intensities with comparison of the study groups across the main cell types. (D) Clusters of monocytes and neutrophils were extracted from the whole cells for downstream interrogation. t-Distributed stochastic neighbour embedding (t-SNE) plots with random downsample of 1000 monocyte cells and (E) neutrophils per sample were coloured by study conditions and Phenograph clustering results (k = 60), respectively. Statistical significance was marked by asterisk. *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001.

Figure 7—figure supplement 1
Expression heatmaps of monocytes and neotrophils in COVID-19 dataset.

Marker expression heatmap of (A) monocytes and (B) neutrophils generated by Phenograph clustering.

Overall schematic outline of the ImmCellTyper workflow with description for each step.

Tables

Table 1
Precision, recall, F-measure, and ARI of indicated clustering methods.
Gated populationCountsCluster cell countsTrue positivePrecisionRecallF-measureAverage F-measureARIAverage ARI
BinaryClust2CD4 T cells1,270,0411,232,8321,226,7121.000.970.980.930.950.91
CD8 T cells601,995601,639582,4020.970.970.970.95
NK cells389,261380,999376,5610.990.970.980.97
Monocytes576,624625,735568,9240.910.990.950.92
B cells397,279380,170374,5010.990.940.960.95
Dendritic cells135,89486,01284,1300.980.620.760.74
TCRgd T cells87,19473,59973,5621.000.840.910.91
flowSOMCD4 T cells1,270,0411,268,9291,226,7120.970.970.970.700.900.66
(k = 20)CD8 T cells601,995692,374571,3440.830.950.880.81
NK cells389,261393,646365,4030.930.940.930.91
Monocytes576,624792,447568,2840.720.990.830.73
B cells397,279405,886390,0770.960.980.970.96
Dendritic cells135,89420,83218,2540.880.130.230.22
TCRgd T cells87,19419,72049670.250.060.090.08
LDACD4 T cells634,733665,481632,7000.951.000.970.930.930.90
CD8 T cells301,138327,469299,3090.910.990.950.92
NK cells194,470202,110187,2590.930.960.940.92
Monocytes288,338301,387282,8360.940.980.960.94
B cells198,582211,090195,9080.930.990.960.94
Dendritic cells68,15290,45661,7850.680.910.780.75
TCRgd T cells43,40543,06439,9760.930.920.920.92
  1. ARI, Adjusted Rand Index; LDA, linear discriminant analysis.

Key resources table
Reagent type (species) or resourceDesignationSource or referenceIdentifiersAdditional information
AntibodyMaxpar Direct Immune Profiling AssayStandard BiotoolsCat# 201325
Antibodyanti-CD95
(RRID:AB_314546,
mouse, monoclonal)
BiolegendCat# 3056071:200, 1.5 μl
Antibodyanti-TIGIT
(mouse, monoclonal)
Standard BiotoolsCat# 201406 (Maxpar Direct T cell expansion panel 2)1:300, 1 μl
Antibodyanti-PD1
(mouse, monoclonal)
Standard BiotoolsCat# 201406 (Maxpar Direct T cell expansion panel 2)1:200, 1.5 μl
Antibodyanti-ICOS
(hamster, monoclonal)
Standard BiotoolsCat# 201406 (Maxpar Direct T cell expansion panel 2)1:200, 1.5 μl
Antibodyanti-TIM3
(mouse, monoclonal)
Standard BiotoolsCat# 201406 (Maxpar Direct T cell expansion panel 2)1:300, 1 μl
Antibodyanti-OX40
(mouse monoclonal)
Standard BiotoolsCat# 201406 (Maxpar Direct T cell expansion panel 2)1:300, 1 μl
Antibodyanti-CXCR4
(mouse, monoclonal)
Standard BiotoolsCat# 201406 (Maxpar Direct T cell expansion panel 2)1:300, 1 μl
Genetic reagent; Recombinant DNA reagentPierce 16% Formaldehyde (methanol-free)Thermo FisherCat# 28906
Genetic reagent; Recombinant DNA reagentTrypan blue solution (0.4%)GibcoCat# 15250-061
Genetic reagent; Recombinant DNA reagentHuman TruStain FcXBiolegendCat# 422302
Genetic reagent; Recombinant DNA reagentEQ Four Element Calibration beadsStandard BiotoolsCat# 201078
Appendix 1—table 1
CyTOF antibody panel for MPN cohort.
AntibodyCloneMetal tagManufacturer
Anti-CD45HI3089YStandard BioTools
Live/dead IndicatorN/A103RhStandard BioTools
Anti-CD95DX2APCBiolegend
Anti-APCAPC003106CdStandard BioTools
Anti-CD196G034E3141PrStandard BioTools
Anti-OX40ACT35142NdStandard BioTools
Anti-CD1236H6143NdStandard BioTools
Anti-CD19HIB19144NdStandard BioTools
Anti-CD4RPA-T4145NdStandard BioTools
Anti-CD8aRPA-T8146NdStandard BioTools
Anti-CD11cBu15147SmStandard BioTools
Anti-CD163G8148NdStandard BioTools
Anti-CD45ROUCHL1149SmStandard BioTools
Anti-CD45RAHI100150NdStandard BioTools
Anti-CD161HP-3G10151EuStandard BioTools
Anti-CD194L291H4152SmStandard BioTools
Anti-CD25BC96153EuStandard BioTools
Anti-CD27O323154SmStandard BioTools
Anti-CD57HCD57155GdStandard BioTools
Anti-CD183G025H7156GdStandard BioTools
Anti-CD185J252D4158GdStandard BioTools
Anti-TIM-3F38-2E2159TbStandard BioTools
Anti-CD28CD28.2160GdStandard BioTools
Anti-CD38HB-7161DyStandard BioTools
Anti-CD56NCAM16.2163DyStandard BioTools
Anti-TCRgdB1164DyStandard BioTools
Anti-PD-1EH12.2H7165HoStandard BioTools
Anti-CD294BM16166ErStandard BioTools
Anti-CD197G043H7167ErStandard BioTools
Anti-CD1463D3168ErStandard BioTools
Anti-ICOSC398.4A169TmStandard BioTools
Anti-CD3UCHT1170ErStandard BioTools
Anti-CD202H7171YbStandard BioTools
Anti-CD66bG10F5172YbStandard BioTools
Anti-HLA-DRLN3173YbStandard BioTools
Anti-IgDIA6-2174YbStandard BioTools
Anti-CXCR412G5175LuStandard BioTools
Anti-CD127A019D5176YbStandard BioTools
Anti-TIGITMBSA43209BiStandard BioTools

Additional files

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Jing Sun
  2. Desmond Choy
  3. Nicolas Sompairac
  4. Shirin Jamshidi
  5. Michele Mishto
  6. Shahram Kordasti
(2024)
ImmCellTyper facilitates systematic mass cytometry data analysis for deep immune profiling
eLife 13:RP95494.
https://doi.org/10.7554/eLife.95494.3