High-content high-resolution microscopy and deep learning assisted analysis reveals host and bacterial heterogeneity during Shigella infection

Ana T. López-Jiménez author has email address
Dominik Brokatzky
Kamla Pillay
Tyrese Williams
Gizem Özbaykal Güler
Serge Mostowy author has email address

Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, United Kingdom

https://doi.org/10.7554/eLife.97495.1

Open access
Copyright information

Figures and data

High-content imaging captures the heterogeneity of S. flexneri infection
(A) Representative high-resolution microscopy image of HeLa cells infected with S. flexneri expressing mCherry acquired with the high-content microscope ZEISS CellDiscoverer7. Scale bar, 10 µm. (B) Enlarged view of insets in A, highlighting a S. flexneri bacteria polymerising an actin tail (1) and entrapped in a septin cage (2), scale bar, 1 µm. (C) Percentage of infected cells at 3 h.p.i. of 7 independent experiments. Graph represents mean = 11.49% ± SD = 4.36%. (D) Percentage of infected cells harbouring different bacterial doses, graph represents mean ± SD. (E) Distance of the bacteria to the centroid of the host cell. Graph represents the median (8.74 µm) and interquartile range (6.51 – 11.05 µm). Values from n = 4990 cells from 7 independent experiments. (F) Distance of individual bacteria to the centroid of the host cell depending on its bacterial load. Graph represents the median and interquartile range. Kruskal-Wallis test and Dunn’s multiple comparisons test. (G) Area of cells depending on infection. Graph represents the median area (uninfected: 3031 µm², infected: 4148 µm²) and interquartile range (unifected: 2308 – 4169 µm², infected: 3062 – 5752 µm²). Values from n = 9469 uninfected cells and n = 1031 infected cells from 7 independent experiments. (H) Area of infected cells depending on its bacterial load. Graph represents the median and interquartile range. Kruskal-Wallis test and Dunn’s multiple comparisons test. (I) Nuclear area of cells depending on infection. Graph represents the median area (uninfected: 1551 µm², infected: 1897 µm²) and interquartile range (unifected: 1209 - 1950 µm², infected: 1486 - 2333 µm²). Values from n = 9469 uninfected cells and n = 1031 infected cells from 7 independent experiments. (J) Nuclear area of infected cells depending on its bacterial load. Graph represents the median and interquartile range. Kruskal-Wallis test and Dunn’s multiple comparisons test.

S. flexneri infection reduces host DNA synthesis and host protein synthesis. (A) Representative microscopy image of HeLa cells infected with S. flexneri expressing mCherry at 3 h.p.i, after 1h incubation with Click-IT Edu. Asterisks label the nucleus of infected cells with reduced Edu incorporation. Contour of selected cells is delimited with a dashed line. Scale bar, 10 µm. (B) Edu acquisition in uninfected cells is bimodal. Graph represents the normalised median of the ratio between the fluorescence intensity of Edu and Hoechst (1), and interquartile range (0.5924 – 31.22). Values from n = 8941 cells from 3 independent experiments. An arbitrary threshold of 5 is selected to define cells in S-phase. (C) Frequency of cells in S-phase (as defined in B) depending on infection. Graph represents mean ± SD (uninfected: 34.45% ± 9.78%, infected: 35.89% ± 4.69%). Student’s t-test. (D) Edu acquisition in cells in S-phase depending on infection. Graph representes the normalised median of the ratio between the fluorescence intensity of Edu and Hoechst (uninfected: 75.14, infected: 44.73), and interquartile range (uninfected: 27.89 – 131.0, infected: 22.05 – 93.36), from 3065 uninfected and 552 infected cells. Mann-Whitney U test. (E) Edu acquisition in cells in S-phase depending on bacterial load. Graph represents the median and interquartile range. Kruskal-Wallis test and Dunn’s multiple comparisons test. (F) Representative microscopy image of HeLa cells infected with S. flexneri expressing mCherry at 3 h.p.i, after 1h incubation with Click-IT AHA. Asterisks label the infected cells with reduced AHA incorporation. Scale bar, 10 µm. (G). AHA acquisition in host cells depending on infection. Graph representes the normalised median of AHA fluorescence intensity (uninfected: 1, infected: 0.86), and interquartile range (uninfected: 0.7850 – 2.694, infected: 0.6207 – 1.241), from 36 uninfected and 233 infected cells. Mann-Whitney U test. (H) AHA acquisition in infected cells depending on bacterial load. Graph represents the median and interquartile range. Kruskal-Wallis test and Dunn’s multiple comparisons test.

Microscopy pipeline to automatically identify SEPT7-S. flexneri interactions
(A) Diagram representing the steps involved in the imaging and analysis pipeline. After infection, fixation and staining, high number of images are automatically acquired using the high-content microscope. Scale bar, 10 µm. These are then processed to segment bacteria based on their fluorescence, so that a square field containing the bacteria in the centre is cropped and saved for subsequent analysis. A deep learning model based on CNN is applied using Jupiter notebooks and the Python Keras library hosted on Colab to sort individual, isolated bacteria. Several processing steps are applied using Python to remove noise (band-pass filtering by difference of Gaussian and mean filters), normalise the data across datasets for comparison, and remove signal away from the bacteria which is irrelevant for the identification of SEPT7 assemblies. Scale bar, 1 µm. Finally, a similar second deep learning model based on CNN is applied to identify SEPT7 interacting with bacteria.

Training of a deep learning algorithm to identify single isolated bacteria cells and SEPT7 associated S. flexneri
(A) Table summarizing the annotated dataset used for the training process. The labels of the two classification classes used were “Single bacteria” and “Clump”. The annotated images were randomly split into two groups, one used as training dataset (comprising 80% of the images) and another used as validation dataset (with the remaining 20%). A separate annotated dataset was used for testing. (B) Representative Airyscan images of bacteria used for each class. Scale bar, 1 µm. (C) Architecture of the deep learning algorithm used for training. Each raw describes the characteristics of the sequential transformation steps applied. Conv2D stands for 2D Convolution. (D, E) Accuracy and Loss as metrics to represent the training process over subsequent epochs (entire passing of the training data through the algorithm). Accuracy increases and loss decreases for both training and validation datasets, indicating a good fit of the model to the data. Vertical dashed grey line indicates Early Stopping, or epoch value for the minimum validation Loss. (F) Confusion matrix performed on an annotated test dataset not used previously for the training or validation process, indicating the percentage of predictions that were correct or wrong for each class. (G) Precision, recall and f1 score as metrics that summarize the performance of the model or classifier. (H) Table summarizing the annotated dataset used for the training process. The labels of the two classification classes used were “Septin” and “Negative”. The annotated images were randomly split into three groups, one used as training dataset (comprising 75% of the images), another used as validation dataset (with 15%) and a last one used as test dataset (with 10%). Due to the data being very imbalanced (15% natural frequency of SEPT7 associated bacteria), the images comprised in the Negative class were under-sampled as indicated in the table. (I) Representative Airyscan images of bacteria used for each class. Scale bar, 1 µm. (J) Architecture of the deep learning algorithm used for training. Each raw describes the characteristics of the sequential transformation steps applied. Conv2D stands for 2D Convolution, Batch_norm stands for Batch normalisation. (K, L) Accuracy and Loss as metrics to represent the training process over subsequent epochs. Accuracy increases and loss decreases for both training and validation datasets, indicating a good fit of the model to the data. Vertical dashed grey line indicates Early Stopping, or epoch value for the minimum validation Loss. (M) Confusion matrix performed on the test dataset not used previously for the training or validation process, indicating the percentage of predictions that were correct or wrong for each class. (N) Precision, recall and f1 score as metrics that summarize the performance of the model or classifier.

Septin assemblies associated to S. flexneri are heterogeneous
(A) Selected examples of the training dataset to showcase the diverse morphology of septin assemblies associated to bacteria. Frequent morphologies include (a) ring like structures around the bacteria, (b) smooth tight recruitment around the entire bacterial surface, (c) loose structures more distant from the bacterial surface, (d) septin recruitment to both bacterial poles, (e) septin recruitment to a single bacterial pole. Scale bar, 1 µm. (B) Intensity profiles of the cytosolic bacterial mCherry and the SEPT7 recruitment, as indicated in the white and orange dashed lines, respectively. (C) Pie chart representing the relative frequency of 855 S. flexneri-SEPT7 associations depicted in A.

S. flexneri associated with SEPT7 are metabolically active
(A) Representative microscopy image of HeLa cells infected with S. flexneri expressing mCherry at 3 h.p.i, after 1h incubation with Click-IT Edu. Scale bar 10 µm. (B) Enlarged view of insets in A, highlighting a S. flexneri associated or not to SEPT7 structures. Scale bar, 1 µm. (C) Frequency of the number of SEPT7-associated bacteria identified using deep learning from experiments in A, from n= 1024 from 3 independent replicates. Graph represents mean ± SD (9.23% ± 2.62%). (D) Normalised ratio between Edu signal and mCherry in individual bacteria. Graph representes the median (negatives: 1, septin associated: 1.059), and interquartile range (negatives: 0.8185 – 1.207, septin associated: 0.8573 – 1.487). Mann-Whitney U test. (E) Representative microscopy image of HeLa cells infected with S. flexneri expressing mCherry at 3 h.p.i, after 1h incubation with Click-IT AHA. Scale bar 10 µm. (F) Enlarged view of insets in E, highlighting a S. flexneri associated or not to SEPT7 assemblies. Scale bar, 1 µm. (G) Frequency of the number of SEPT7-associated bacteria identified using deep learning from experiments in E, from n = 1872 bacteria from 3 independent replicates. Graph represents mean ± SD (7.01% ± 2.41%). (D) Normalised AHA signal in individual bacteria in A, B. Graph representes the median (negatives: 1, SEPT7 associated: 1.055), and interquartile range (negatives: 0.6986 – 1.345, SEPT7 associated: 0.7837 – 1.497). Mann-Whitney U test.

S. flexneri associated with SEPT7 have an active T3SS
(A) Representative microscopy image of HeLa cells infected with S. flexneri expressing the TSAR reporter at 3 h.p.i. Scale bar 10 µm. Arroheads highlight TSAR+ bacteria. Selected infected cells contours are delimited with dashed lines. (B) Distribution of TSAR signal intensity in individual bacteria. Graph represents the normalised median of the ratio between TSAR and mCherry (1), and interquartile range (0.7827 – 1.369). Values from n = 3654 bacterial cells from 3 independent experiments. An arbitrary threshold of 1.5 is selected to define TSAR+ S. flexneri. (C) Intensity of TSAR signal in individual bacteria depending on infection load of host cells. Graph represents the normalised median and interquartile range. Kruskal-Wallis test and Dunn’s multiple comparisons test. (D) Representative microscopy image of HeLa cells infected with S. flexneri expressing the TSAR reporter at 3 h.p.i. Scale bar 10 µm. (E) Enlarged view of insets in D, highlighting a S. flexneri associated or not to SEPT7 assemblies. Scale bar, 1 µm. (F) Frequency of the number of SEPT7-associated bacteria identified using deep learning from experiments in F, from n= 5030 bacteria from 4 independent replicates. Graph represents mean ± SD (9.044% ± 5.582%). (G) Percentage of TSAR+ S. flexneri depending on their association to SEPT7. Graph represents mean ± SD (negatives: 2.214 ± 1.603, septin associated: 17.74 ± 5.138). Student’s t-test. (H) Normalised ratio between TSAR signal and mCherry in individual bacteria. Graph representes the median (negatives: 1, SEPT7 associated: 1.122), and interquartile range (negatives: 0.7674 – 1.286, SEPT7 associated: 0.9047 – 1.511). Mann-Whitney U test.

S. flexneri infection induces host cell size changes
(A) Hoestch fluorescence intensity of sum-projected nuclei as proxy of nuclear volume. Graph represents normalised median (uninfected: 1, infected: 1.294) and interquartile range (uninfected: 0.7588 – 1.287, infected: 0.9534 – 1.878) of n=2071 uninfected cells and n=195 infected cells from 3 independent experiments. Mann-Whitney U test. (B – C) Cell and nuclear area in presence and absence of infection and Latrunculin B. Graph represents the median and interquartile range. Kruskal-Wallis test and Dunn’s multiple comparisons test.

Click-IT Edu and Click-IT AHA specifically label de novo DNA and protein synthesis in Hela cells
(A) Representative widefield image of HeLa cells after Click-IT Edu reaction in the presence or absence of Aphidicolin to inhibit eukaryotic DNA replication. Scale bar, 10 µm. (B) Quantification of de novo DNA synthesis in single HeLa cells in A as the ratio of nuclear Click-IT Edu signal to Hoechst. Graph represents the normalised median (untreated: 1, Aphidicolin: 0.7667) and interquartile range (untreated: 0.8474 – 6.571, Aphidicolin: 0.6816 – 0.8543) from 15828 untreated and 14943 treated cells from 3 independent experiments. Mann-Whitney U test. (C) Representative Airyscan confocal images of HeLa cells after Click-IT AHA reaction in the presence or absence of Cycloheximide to block eukaryotic translational elongation. Scale bar, 10 µm. (D) Quantification of cytoplasmic Click-IT AHA signal of the HeLa cells in A. Graph represents normalised median (untreated: 1, Cycloheximide: 0.3531) and interquartile range (untreated :0.6794 – 1.401, Cycloheximide: 0.2504 – 0.6637) of n=268 untreated and n= 258 Cycloheximide treated cells from 3 independent experiments. Mann-Whitney U test.

Additional examples of the annotated datasets for deep learning
(A) Representative images of multiple bacteria associated to SEPT7. Scale bar, 1 µm. (B) Representative images of the training and validation dataset to identify single bacteria versus clumps. Scale bar, 5 µm. (C) Representative images of the training and validation dataset to identify SEPT7-positive versus SEPT7-negative bacteria, after processing. Scale bar, 5 µm.

Probability maps of SEPT7 around S. flexneri
(A, C) Median of the signal for mCherry and SEPT7 in both datasets (SEPT7-positive and SEPT7-negative), centered at the bacterial midbody or bacterial pole, respectively. Only bacteria with width ≥ 2.5 µm were selected to ensure bacteria were mostly transversal to the acquisition axis. Scale bar, 1 µm. (B,C) Quantification of SEPT7 signal and S. flexneri mCherry signal in the midbody projection depicted in A and transversal dash line indicated in B.

Click-IT Edu and Click-IT AHA specifically label de novo DNA and protein translation in S. flexneri
(A) Representative Airyscan image of mCherry S. flexneri after Click-IT Edu reaction in the presence or absence of Nalidixic acid to block bacterial DNA replication. Scale bar, 1 µm. (B) Quantification of Click-IT Edu signal of bacteria in C measured by FACS. Graph represents normalised mean ± SD. One-way ANOVA and Tukey’s multiple comparisons test. (C) Representative Airyscan confocal images of HeLa cells infected with mCherry expressing S. flexneri after Click-IT Edu reaction in the presence or absence of Nalidixic acid to inhibit bacterial DNA nuclear replication. Scale bar, 10 µm. (D) Quantification of de novo DNA synthesis in single bacteria in E as the ratio of nuclear Click-IT signal to Hoechst. Graph represents normalised median (untreated: 1, Nalidixic acid: 0.0646) and interquartile range (untreated: 0.3589 – 1.924, Nalidixic acid: 0.0350 – 0.3080) of 9310 untreated and 4368 Nalidixic acid treated bacteria from 3 independent experiments. Mann-Whitney U test. (E) Representative Airyscan confocal images of HeLa cells infected with mCherry expressing S. flexneri after Click-IT AHA reaction in the presence or absence of Rifampicin to block prokaryotic translation. Scale bar, 10 µm; insets, 1 µm. (F) Quantification of cytoplasmic Click-IT AHA signal of individual S. flexneri cells in C. Graph represents normalised median (untreated: 1, Rifampicin: 0.8263) and interquartile range (untreated: 0.7520 – 1.350, Rifampicin: 0.6143 – 1.100) of n=2309 untreated and n=345 Rifampicin treated bacteria from 3 independent experiments.

TSAR labels T3SS activity specifically
(A) Representative Airyscan confocal images of S. flexneri harbouring the TSAR plasmid grown in broth in the presence or absence of congo red to induce the expression of the T3SS. Scale bar, 1 µm. (B) Quantification of the TSAR dependent GFP signal of bacteria in A measured by FACS. Graph represents the normalised mean ± SD (untreated: 1 ± 0, congo red: 155.4 ± 125.2) from 4 independent experiments. (C) Representative examples of TSAR-positive S. flexneri associated to SEPT7. C stands for “cage”, P stands for “protrusion”. Scale bar 5 µm.

Sign up for email alerts