An improved bacterial single-cell RNA-seq reveals biofilm heterogeneity
eLife Assessment
This work introduces an important new method for depleting ribosomal RNA from bacterial single-cell RNA sequencing libraries, demonstrating its applicability for studying heterogeneity in microbial biofilms. The findings provide convincing evidence for a distinct subpopulation of cells at the biofilm base that upregulates PdeI expression. Future studies exploring the functional relationship between PdeI and c-di-GMP levels, along with the roles of co-expressed genes within the same cluster, could further enhance the depth and impact of these conclusions.
https://doi.org/10.7554/eLife.97543.4.sa0Important: Findings that have theoretical or practical implications beyond a single subfield
- Landmark
- Fundamental
- Important
- Valuable
- Useful
Convincing: Appropriate and validated methodology in line with current state-of-the-art
- Exceptional
- Compelling
- Convincing
- Solid
- Incomplete
- Inadequate
During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife Assessments
Abstract
In contrast to mammalian cells, bacterial cells lack mRNA polyadenylated tails, presenting a hurdle in isolating mRNA amidst the prevalent rRNA during single-cell RNA-seq. This study introduces a novel method, ribosomal RNA-derived cDNA depletion (RiboD), seamlessly integrated into the PETRI-seq technique, yielding RiboD-PETRI. This innovative approach offers a cost-effective, equipment-free, and high-throughput solution for bacterial single-cell RNA sequencing (scRNA-seq). By efficiently eliminating rRNA reads and substantially enhancing mRNA detection rates (up to 92%), our method enables precise exploration of bacterial population heterogeneity. Applying RiboD-PETRI to investigate biofilm heterogeneity, distinctive subpopulations marked by unique genes within biofilms were successfully identified. Notably, PdeI, a marker for the cell-surface attachment subpopulation, was observed to elevate cyclic diguanylate (c-di-GMP) levels, promoting persister cell formation. Thus, we address a persistent challenge in bacterial single-cell RNA-seq regarding rRNA abundance, exemplifying the utility of this method in exploring biofilm heterogeneity. Our method effectively tackles a long-standing issue in bacterial scRNA-seq: the overwhelming abundance of rRNA. This advancement significantly enhances our ability to investigate the intricate heterogeneity within biofilms at unprecedented resolution.
Introduction
Biofilms, contributing to approximately 80% of chronic and recurrent microbial infections in the human body (Costerton et al., 1999), are complex microbial ecosystems characterized by a diverse array of bacterial cells existing in various physiological states (Costerton et al., 1999; Evans et al., 2023; Stewart and Franklin, 2008). This heterogeneity within biofilms is influenced by multiple factors, including bacterial interactions (such as competition, symbiosis, and parasitism) that contribute to stable community structures (Flemming et al., 2023; Shokeen et al., 2021), environmental factors (like host environment, aquatic conditions, and nutrient concentrations), and spatial organization. Bacteria occupying different positions within the biofilm perform distinct roles (Qian et al., 2022): some mediate material exchange between cells and with the extracellular matrix, others facilitate complex communication systems between bacteria and with the host through signaling molecules, while certain bacteria participate in various energy conversion processes. This intricate division of labor not only contributes to bacterial heterogeneity but also enhances the biofilm’s overall resistance to various stresses (Momeni, 2018). The resulting cellular and functional diversity reflects the complex nature of the biofilm ecosystem, allowing it to adapt and persist in challenging environments. However, the study of biofilms faces significant limitations, primarily stemming from challenges in investigating heterogeneity within a bacterial population (Cheng et al., 2023; Spormann, 2008). Single-cell RNA-seq emerges as a promising avenue for addressing this heterogeneity (Shapiro et al., 2013; Tang et al., 2009; Blattman et al., 2020; Imdahl et al., 2020; Kuchina et al., 2021; Ma et al., 2023; Wang et al., 2023; McNulty et al., 2023; Lu et al., 2023). Expending on established protocols for cell fixation and permeabilization which facilitate in-cell barcoding while avoiding cell lysis, combinatorial barcoding-based bacterial scRNA-seq techniques, such as prokaryotic expression profiling by tagging RNA in situ and sequencing (PETRI-seq) (Blattman et al., 2020) and microbial split-pool ligation transcriptomics (microSPLiT) (Kuchina et al., 2021), have been developed. Nevertheless, these methods encounter challenges in terms of low transcript recovery rates due to overwhelmingly abundant rRNA, restricting the comprehensive analysis of within-population heterogeneity. In comparison to mammalian cells (Maynard et al., 2020), the absence of mRNA polyadenylated tails in bacteria necessitates an alternative approach for isolating mRNA (~5%) from the significantly more abundant rRNA (~95%). Here, by integrating a ribosomal RNA-derived cDNA depletion protocol (RiboD) into a PETRI-seq, we developed RiboD-PETRI-seq that efficiently eliminates rRNA reads, thereby significantly improving mRNA detection rates and enabling exploration of within-population heterogeneity.
Results
In the RiboD protocol, we designed a set of probe primers that spans all regions of the bacterial rRNA sequence (Supplementary file 1). The core principle behind our probe design is twofold: the 3'-end of the probes is reverse complementary to the r-cDNA sequences, allowing for specific recognition of r-cDNA, while the 5'-end complements a biotin-labeled universal primer. This design enables the probes to be bound to magnetic beads, facilitating the separation of r-cDNA-probe-bead complexes from the rest of the library. Following template switching and RNaseH treatment on the barcoded cDNA from lysed cells to eliminate hybridized RNA, the library of probe primers and biotin-labeled universal primers is introduced to facilitate adequate hybridization. Pre-treated Streptavidin magnetic beads are then added to the hybridized rRNA-derived cDNA. The mRNA-derived cDNA remains in the supernatant and is collected for subsequent library construction and sequencing (Figure 1A). To assess the efficiency of single-cell capture in RiboD-PETRI, we calculated the multiplet frequency (Blattman et al., 2020) using a Poisson distribution based on our sequencing results (see details in Materials and methods). The multiplet frequency for RiboD-PETRI ranges from 1.16% to 3.35% (Supplementary file 2), indicating the technique’s capability to effectively capture transcriptomes at the single-cell level.

Development of RiboD-PETRI and validation of its technical performance in studying population heterogeneity.
(A) Graphic summary of the RiboD-PETRI method illustrating the incorporation of RiboD after cell pooling and lysis in PETRI-seq. The RiboD protocol is represented by the dashed-line box. In this box, first, we perform template-switching oligonucleotides (TSOs) in the mixture of heterozygous chain, then we remove the RNA strand using RNaseH, at this point the system contains r-cDNA and m-cDNA single-stranded mixture. Then we add the r-cDNA probe, which specifically binds to the r-cDNA. The probes are then bound to magnetic beads, allowing the r-cDNA-probe-bead complexes to be separated from the rest of the library. And then we remove the r-cDNA that is attached to the probe by Streptavidin magnetic beads. We then performed amplification of the libraries and sent them for sequencing. We designed separate probe sets for Escherichia coli, Caulobacter crescentus, and Staphylococcus aureus. Each set was specifically constructed to be reverse complementary to the r-cDNA sequences of its respective bacterial species. This species-specific approach ensures high efficiency and specificity in rRNA depletion for each organism. (B) Comparison of non-rRNA (tRNA, mRNA, and other non-rRNA) and rRNA unique molecular identifier (UMI) counts ratio among different bacterial scRNA-seq methods. Data from PETRI-seq (E. coli), MicroSPLIT-seq (E. coli), M3-seq (E. coli) cited from previous studies. Error bars represent standard deviations of biological replicates. The ‘ΔΔ’ label represents the RiboD-PETRI protocol. The ‘Ctrl’ label represents the classic PETRI-seq protocol we performed. (C) Comparison of UMI counts per cell between RiboD-PETRI (Supplementary file 7) and PETRI (Supplementary file 8) at the same unsaturated sequencing depth. (D) Assessment of the effect of rRNA depletion on transcriptional profiles. The Pearson correlation coefficient (r) of UMI counts per gene (log2 UMIs) between RiboD-PETRI (Supplementary file 7) and PETRI (Supplementary file 9) was calculated for 3790 out of 4141 total genes, excluding those with zero counts in either library. Each point represents a gene. (E) Evaluation of the correlation between RiboD-PETRI (Supplementary file 7) data and bulk RNA-seq (Supplementary file 10) results. The Pearson correlation coefficient (r) of UMI counts per gene (log2 UMIs) among RiboD-PETRI data and the reads per gene (log2 reads) of bulk RNA-seq data was calculated for 3814 out of 4141 total genes, excluding those with zero counts in either library. Each point represents a gene. All data presented in C, D, E were from our own sequencing experiments.
-
Figure 1—source code 1
Related to Figure 1.
- https://cdn.elifesciences.org/articles/97543/elife-97543-fig1-code1-v1.zip
-
Figure 1—source data 1
Related to Figure 1.
- https://cdn.elifesciences.org/articles/97543/elife-97543-fig1-data1-v1.xls
To assess the performance of RiboD-PETRI, we designed a comprehensive assessment of rRNA depletion efficiency under diverse physiological conditions, specifically contrasting exponential and stationary phases. This approach allows us to understand how these different growth states impact rRNA depletion efficacy. Additionally, we included a variety of bacterial species, encompassing both gram-negative and gram-positive organisms, to ensure that our findings are broadly applicable across different types of bacteria. By incorporating these variables, we aim to provide insights into the robustness and reliability of the RiboD-PETRI method in various biological contexts. The results highlight a substantial enhancement in rRNA-derived cDNA depletion, with mRNA ratio increases from 8.2% (Ctrl, the PETRI-seq we performed) to 81% (ΔΔ, RiboD-PETRI) for E. coli from exponential phase, from 10% (Ctrl) to 92% (ΔΔ) for S. aureus from stationary phase, and from 3.9% (Ctrl) to 54% (ΔΔ) for C. crescentus from exponential phase (Figure 1B; Supplementary file 3). Additionally, we compared our findings with other reported methods (Figure 1B; Supplementary file 4). The original PETRI-seq (Blattman et al., 2020) protocol, which does not include an rRNA depletion step, exhibited an mRNA detection rate of approximately 5%. The MicroSPLiT-seq (Kuchina et al., 2021) method, which utilizes poly A polymerase for mRNA enrichment, achieved a detection rate of 7%. Similarly, M3-seq (Wang et al., 2023) and BacDrop-seq (Ma et al., 2023), which employ RNaseH to digest rRNA post-DNA probe hybridization in cells, reported mRNA detection rates of 65% and 61%, respectively. MATQ-DASH (Homberger et al., 2023), which utilizes Cas9-mediated targeted rRNA depletion, yielded a detection rate of 30%. smRandom-seq utilizes a CRISPR-based rRNA depletion technique, reduced the rRNA proportion from 83% to 32%, increasing the mRNA proportion from 16% to 63% (Xu et al., 2023). BaSSSh-seq’s employs a rational probe design for efficient rRNA depletion, though specific efficiency was not reported (Korshoj and Kielian, 2024). Among these, RiboD-PETRI demonstrated superior performance in mRNA detection while requiring the least sequencing depth. With equivalent sequencing depth, RiboD-PETRI demonstrates a significantly enhanced unique molecular identifier (UMI) counts detection rate compared to PETRI-seq alone (Figure 1C). This method recovered approximately 20,175 cells (92.6% recovery rate) with ≥15 UMIs per cell with a median UMI count of 42 per cell, which was significantly higher than PETRI-seq’s recovery rate of 17.9% with a median UMI count of 20 per cell (Figure 1—figure supplement 1A and B), indicating the number of detected mRNA per cell increased prominently. Notably, this enhancement was achieved while maintaining mRNA profiles consistent with non-depleted samples (r=0.93; Figure 1D) and show a significant correlation with profiles from the traditional bulk RNA-seq method (r=0.84; Figure 1E).
We subsequently investigated the transcriptome coverage of RiboD-PETRI across different physiological states and bacterial species. For exponential phase E. coli cells, we sequenced a library with 60,000 cells, recovering approximately 30,004 cells (50% recovery), each with ≥15 UMIs (Figure 2A, Figure 1—figure supplement 1C). This analysis revealed 99.86% transcriptome-wide gene coverage across the cell population. The method achieved an average of 128.8 UMIs per single cell, with a median UMI count of 102 per cell. Further examination of high-quality cells showed varying levels of detection: the top 1000, 5000, and 10,000 cells exhibited median UMI counts of 462, 259, and 193, respectively (Figure 2B), and median gene detection of 362, 236, and 188, respectively (Figure 2C). These high-performing cells demonstrate the upper limits of the method’s capabilities. For stationary phase S. aureus cells, we sequenced a library with 30,000 cells, recovering approximately 9982 cells (33.3% recovery), each with ≥15 UMIs (Figure 2—figure supplement 1A). Analysis showed 99.96% transcriptome-wide gene coverage across the cell population. At the single-cell level, we observed an average of 153.8 UMIs and a median of 142 UMIs. Top high-quality cells exhibited the following median UMI counts: 378 (top 1000 cells), 207 (top 5000 cells), and 167 (top 8000 cells) (Figure 2—figure supplement 1B). These cells also demonstrated median gene detection of 308, 194, and 158 genes, respectively (Figure 2—figure supplement 1C). For exponential phase C. crescentus cells, we sequenced a library with 30,000 cells, recovering approximately 13,897 cells (46.3% recovery), each with ≥15 UMIs (Figure 2—figure supplement 1G). Analysis showed 99.64% transcriptome-wide gene coverage across the cell population. At the single-cell level, we observed an average of 439.7 UMIs and a median of 182 UMIs. Top high-quality cells demonstrated the following median UMI counts: 2190 (top 1000 cells), 662 (top 5000 cells), and 225 (top 10,000 cells) (Figure 2—figure supplement 1H). These cells also exhibited median gene detection of 1262, 529, and 219 genes, respectively (Figure 2—figure supplement 1I). These results underscore RiboD-PETRI’s ability to capture a wide range of transcripts across varying cell qualities and species, providing a comprehensive view of gene expression at the single-cell level.

Comprehensive analysis of single-cell mRNA transcriptomic profiles in exponential phase E. coli using RiboD-PETRI.
(A) The number of unique molecular identifiers (UMIs) detected per cell in recovered cells in exponential period E. coli (≥15 UMIs/cell). The cells are ranked from highest to lowest based on the number of detected UMIs, and cells with ≥15 UMIs are selected for plotting. The median number of UMIs is calculated for these selected cells. (B) Distribution of mRNA UMIs captured per cell in RiboD-PETRI data of exponential period E. coli, presented as violin plots showing the upper quartile, median, and lower quartile lines. The cells are ranked from highest to lowest based on the number of UMIs detected. Then, specific numbers of cells (indicated above the panel) are selected for plotting. The median number of UMIs is calculated for these selected cells. (C) The number of genes detected per cell in exponential period E. coli. The cells are ranked from highest to lowest based on the number of genes detected. Then, specific numbers of cells (indicated above the panel) are selected for plotting. The median number of genes is calculated for these selected cells. (D) Uniform Manifold Approximation and Projection (UMAP) visualization of E. coli bacteria during the exponential phase. Data were filtered for cells with UMIs between 200 and 5000, resulting in 1464 cells. Each dot represents a cell. (E) Heatmap illustrating the normalized gene expression levels of marker genes in different clusters of exponential period E. coli. Marker genes with relatively high expression levels are depicted in yellow, while lower expression levels are shown in purple. Each row represents a gene, and each column represents a cell. (F) Functional enrichment analysis of marker genes of exponential period E. coli in cluster 2. Marker genes were selected based on screening criteria of p-value <0.001 and log2 fold change (FC)>0.2. The color blocks in these figures represent the p-values of the data points. The color scale ranges from red to blue. Red colors indicate smaller p-values, suggesting higher statistical significance and more reliable results. Blue colors indicate larger p-values, suggesting lower statistical significance and less reliable results. Count is the number of genes enriched into this pathway. (G) Expression levels of marker genes in cluster 2 during the 3 hr exponential period of E. coli overlaid on the UMAP plot. Cells with high expression levels are depicted in blue. Marker genes were selected based on a p-value greater than 0.001 and a log2 FC greater than 3. (H) Principal component analysis (PCA) performed on screened data of exponential phase E. coli. The resulting scatterplots show heterogeneity among the populations, with each point representing a cell. (I) Distribution of UMIs on the UMAP results for exponential phase E. coli. UMAP results reveal heterogeneity among populations, with each point representing a cell and color shading indicating UMI counts (Supplementary file 11).
-
Figure 2—source code 1
Source code for Figure 2 and Figure 2—figure supplement 2.
- https://cdn.elifesciences.org/articles/97543/elife-97543-fig2-code1-v1.zip
-
Figure 2—source code 2
Source code for Figure 2—figure supplement 1, Figure 2—figure supplement 3 and Figure 2—figure supplement 4.
- https://cdn.elifesciences.org/articles/97543/elife-97543-fig2-code2-v1.zip
-
Figure 2—source data 1
Related to Figure 2.
- https://cdn.elifesciences.org/articles/97543/elife-97543-fig2-data1-v1.xls
Our results affirm RiboD-PETRI’s reliability in capturing the bacterial single-cell transcriptome, providing ample coverage and sensitivity for various species. To provide a thorough evaluation of our sequencing depth and library quality, we performed sequencing saturation analysis on our sequencing samples. The findings reveal that our sequencing saturation is greater than 90% (Figure 1—figure supplement 1D–F), indicating that our sequencing depth is sufficient to capture the diversity of most transcripts.
We further investigated its ability to consistently identify within-population heterogeneity across different bacterial species and growth conditions. In the exponential phase of E. coli, we recovered 1464 cells and identified three major subpopulations (Figure 2D), with 17 cells (1.2%) in a unique subpopulation characterized by pentose and glucuronate interconversions (Figure 2E and F) and the marker genes of cluster 2 included yfgM, glpG, scpA, elyC, ptsA, carB, actP, and pgpB (Figure 2G). For the expression levels of marker gene shown in Figure 2E, violin plots have been created to offer a more comprehensive view of the distribution across different cell populations (Figure 2—figure supplement 2). In stationary phase S. aureus cells, we recovered 9386 cells and found six major subpopulations (Figure 2—figure supplement 1D), with 437 cells (4.7%) in a distinct subpopulation named cluster 4. The marker genes of cluster 4 included KQ76-13335, KQ76-00740, and KQ76-11725 (Figure 2—figure supplement 3). In the stationary phase of C. crescentus cells, we recovered 5728 cells and identified four major subpopulations (Figure 2—figure supplement 1J), with 603 cells (10.5%) in a unique subpopulation named cluster 3. The marker genes of cluster 3 included CCNA-00259, CCNA-03402, CCNA-02361, and CCNA-03119 (Figure 2—figure supplement 4). These findings highlight RiboD-PETRI’s consistent ability to unveil within-population heterogeneity across different cell physiology and bacterial species (Figure 2H and I, Figure 2—figure supplement 1E, F, K, L), crucial for understanding bacterial population complexity. While RiboD-PETRI consistently detects potential heterogeneity, further experimental validation would be required to confirm the biological significance of the observations.
We next focused on exploring biological heterogeneity of a biofilm at the early stage of development by utilizing the static biofilm system (Merritt et al., 2011). E. coli cells were cultured in microtiter dishes overnight, adhered cells were fixed for RiboD-PETRI processing in duplicate experiments. For these two replicates, we sequenced libraries containing 20,000 and 40,000 cells, recovering 5244 and 11,344 cells, which corresponded to recovery rates of 26% and 28%, respectively. The correlations between detected reads and UMIs were found to be 0.87 and 0.90 for the two replicates, respectively (Figure 3—figure supplement 1A). The median UMI counts for the recovered cells were 34 and 52 (Figure 3A). After screening, the final datasets comprised 1621 and 3999 cells for the two replicates, respectively. While no significant batch effects were observed, we applied batch correction as a precautionary measure (Figure 3—figure supplement 1B–D). In replicate 1, each cell was sequenced with an average of 1563 reads, while in replicate 2, the average was 2034 reads (Supplementary file 5), yielding median UMI counts of 283.5 and 239 per cell, respectively (Figure 3B). For gene detection, the median counts were 219 and 193 genes per cell for the respective replicates (Figure 3C). Additionally, UMAP visualization was employed to illustrate the distribution of cellular UMI numbers, revealing heterogeneity among populations that was independent of UMI counts (Figure 3—figure supplement 1E and F). Unsupervised clustering analysis identified four major subpopulations in each replicate, with a consistently identified rare subpopulation (2.6%/2.1%) as cluster 2, driven by cell envelope genes (Figure 3D–F). Marker genes for this cluster included yffO, lptE, rdgB, pdeI, sstT, fixA, yjjG, rlmI, accC, and yaiA (Figure 3G, H and Figure 3—figure supplement 2).

Single-cell transcriptomic analysis and characterization of static E. coli biofilm using RiboD-PETRI.
(A–F, H) RiboD-PETRI data from static E. coli biofilm (E. coli 24 hr static culture) (Supplementary files 12 and 13). RiboD-PETRI data of static E. coli biofilm were screened for cells with unique molecular identifiers (UMIs) between 100 and 2000, resulting in 1621 and 3999 cells. (A) The number of UMIs detected per cell in recovered cells in Static E. coli biofilms (≥15 UMIs/cell). The cells are ranked from highest to lowest based on the number of detected UMIs, and cells with ≥15 UMIs are selected for plotting. (B) Distribution of mRNA UMIs captured per cell in RiboD-PETRI data of static E. coli biofilm. (C) The number of genes detected per cell in static E. coli biofilm. (D) UMAP visualization of static E. coli biofilm, revealing two small populations of heterogeneous cells in clusters 2 and 3. (E) Inferred expression levels of marker genes from static E. coli biofilm of E. coli across different clusters. (F) Enrichment pathways for marker genes of static E. coli biofilm data in cluster 2, selected based on screening criteria of p-value<0.001 and log2 fold change (FC)>0.2. The color blocks in these figures represent the p-values of the data points. (G and H) Dot plot displaying scaled expression levels of marker genes in different clusters of E. coli in exponential phase (G) and E. coli in static E. coli biofilm (H). These genes were markers of static E. coli biofilms in cluster 2, identified with screening criteria of p-value<0.001 and log2 FC>3. Dot size represents the percentage expression of the gene in the cluster, while color indicates the average expression level normalized from 0 to 1 across all clusters for each gene.
-
Figure 3—source code 1
Source code for Figure 3, Figure 3—figure supplement 1 and Figure 3—figure supplement 2.
- https://cdn.elifesciences.org/articles/97543/elife-97543-fig3-code1-v1.zip
-
Figure 3—source data 1
Related to Figure 3.
- https://cdn.elifesciences.org/articles/97543/elife-97543-fig3-data1-v1.xls
PdeI, identified among marker genes, was predicted as a phosphodiesterase enzyme hydrolyzing c-di-GMP, a vital bacterial second messenger (Yu et al., 2023; Li et al., 2023; Figure 4A and B). However, our comprehensive structural analysis revealed a more complex and novel role for PdeI. While PdeI contains an intact EAL domain typically associated with c-di-GMP degradation, it also possesses a divergent GGDEF domain, generally linked to c-di-GMP synthesis (Figure 4—figure supplement 1). This dual-domain architecture suggested potential complex regulatory roles. To validate PdeI’s function, we created a PdeI-BFP fusion construct under the native pdeI promoter, integrated with a ratiometric c-di-GMP sensing system (Vrabioiu and Berg, 2022) in E. coli. Confocal microscopy revealed PdeI as a membrane protein (Figure 4C). Single-cell level monitoring showed cell-to-cell variability in c-di-GMP levels and PdeI expression, with a positive correlation observed (Figure 4D), indicating PdeI upregulated c-di-GMP synthesis rather than degradation. This finding was confirmed by high-pressure liquid chromatography-tandem mass spectrometry (HPLC-MS/MS), which showed an approximately 11-fold increase in c-di-GMP concentration in the PdeI overexpression strain compared to the control strain (Figure 4E). These results align with previous studies showing that a point mutation (G412S) in PdeI’s divergent GGDEF domain in a strain lacking PdeH, the major phosphodiesterase in E. coli, resulted in decreased c-di-GMP levels (Reinders et al., 2016). Our additional experiments with a PdeI(G412S)-BFP mutation strain showed constant c-di-GMP levels despite increasing BFP fluorescence, serving as a proxy for PdeI(G412S) expression levels (Figure 4D). These results, combined with the presence of a CHASE (cyclases/histidine kinase-associated sensory) domain in PdeI, strongly suggest that PdeI functions as a membrane-associated sensor that integrates environmental signals with c-di-GMP production under complex regulatory mechanisms. This discovery challenges the initial prediction of PdeI as solely a phosphodiesterase and highlights its novel role as a c-di-GMP synthetase, contributing significantly to our understanding of bacterial signaling pathways. It’s worth noting that while the other marker genes in this cluster are co-expressed, our analysis indicates that they do not have a significant impact on biofilm formation or a direct relationship with c-di-GMP or PdeI.

Functional investigation of marker gene pdeI in static E. coli biofilm.
(A, B) Uniform Manifold Approximation and Projection (UMAP) plots showing the distribution of pdeI in single-cell data of exponential period E. coli (A) and static E. coli biofilm (B). Each dot represents a cell colored by normalized expression levels of genes. (C) Subcellular localization of PdeI-GFP and GFP. Scale bar, 1 μm. (D) c-di-GMP levels (R–1 score) in E. coli cells with different BFP, PdeI-BFP, PdeI(G412S)-BFP expression levels (low or high), under the control of the pdeI native promoter, in static E. coli biofilm. c-di-GMP levels are measured using the c-di-GMP sensor system integrated into E. coli cells. R–1 score was determined using the fluorescent intensity of mVenusNB and mScarlet-I in the system. The fluorescent intensity is measured by flow cytometry (n>50). (E) Determination of cellular concentrations of c-di-GMP by high-pressure liquid chromatography-tandem mass spectrometry (HPLC-MS/MS) in cells overexpressing PdeI under the control of arabinose promoter, with 0.002% arabinose induction for 2 hr (n=3). (F, G) Localization of PdeI-high cells in the biofilm matrix. Cells expressing PdeI-BFP under the control of the pdeI native promoter were grown in a glass-bottom cell culture dish and stained with SYTO 24 for bacterial DNA. Cells expressing BFP under the control of arabinose promoter, with 0.00001% arabinose induction for 24 hr in a glass-bottom cell culture dish and stained with SYTO 24 for bacterial DNA. (H, I) Heterogeneous expression of PdeI in single-cell data of exponential period E. coli (H) and E. coli in static E. coli biofilm (E. coli 24 hr static culture) (I). Biofilm cells with high or low expression levels of PdeI-BFP were sorted by flow cytometry. (J) Persister counting assay using 150 μg/ml ampicillin on cells with high or low expression levels of BFP, PdeI-BFP, and PdeI(G412S)-BFP from static E. coli biofilm, sorted by flow cytometry (n=3). These strains were under the control of the pdeI native promoter. (K) Time-lapse images of the persister assay observed under a microscope. Static biofilm cells of the PdeI-GFP strain were spotted on a gel pad and treated with 150 μg/ml ampicillin in Luria broth (LB). Images were captured over 6 hr at 37°C, followed by the replacement of fresh LB to allow persister cell resuscitation. Scale bar, 2 μm. Error bars represent standard deviations of biological replicates. Significance was ascertained by unpaired Student’s t-test. Statistical significance is denoted as *p<0.05, **p<0.01, ***p<0.001, and ****p<0.0001.
-
Figure 4—source data 1
Related to Figure 4.
- https://cdn.elifesciences.org/articles/97543/elife-97543-fig4-data1-v1.xls
Confocal laser scanning microscopy provided further insights into the spatial distribution of PdeI-positive cells within the biofilm structure. In the PdeI-BFP fusion strain, PdeI-BFP-positive cells, characterized by elevated c-di-GMP levels, were predominantly located at the bottom of the static biofilm (Figure 4F). This localization corresponds to the region of cell-surface attachment, aligning with our hypothesis that PdeI functions as a membrane-associated sensor integrating environmental signals with c-di-GMP production through complex regulatory mechanisms (Lacanna et al., 2016). In contrast, in the control strain where BFP was expressed alone under arabinose-induced promoter, BFP-positive cells were observed to be distributed throughout the entire biofilm community (Figure 4G). This distinct spatial distribution pattern between the PdeI-BFP fusion and the BFP-only control strains provides compelling evidence for PdeI’s specific role in biofilm formation, particularly at the biofilm-surface interface. The concentration of PdeI-positive cells at the bottom of the biofilm suggests that PdeI may be especially crucial in the initial stages of biofilm formation, potentially responding to surface-associated cues to modulate c-di-GMP levels and promote attachment. The uniform distribution of BFP in the control strains suggests that the localization of PdeI-BFP is not affected by BFP labeling. These observations further underscore the complex and nuanced role of PdeI in bacterial signaling and biofilm development, highlighting the importance of considering cell-to-cell heterogeneity in understanding the function of regulatory proteins in microbial communities.
To investigate the association of the PdeI-high cluster with bacterial drug tolerance in the early stages of biofilm development, we isolated PdeI-high cells using flow cytometry (Figure 4H, I) and subjected them to an ampicillin antibiotic killing assay to determine their persister frequency. Our results revealed that the PdeI-high population produced a significantly higher ratio of persister cells (~7.3%) compared to the whole biofilm population (~0.6%). Notably, cells expressing high levels of BFP alone or PdeI(G412S)-BFP showed no increase in persister ratios (Figure 4J). This finding suggests that the increased persistence is specifically linked to PdeI activity. Time-lapse imaging during the antibiotic killing process consistently demonstrated that persisters primarily originated from PdeI-GFP-positive cells (Figure 4K and Video 1). These PdeI-GFP-positive cells, displaying characteristics of dormancy, survived ampicillin treatment for 6 hr without visible growth or division. Upon antibiotic removal and replacement with fresh growth medium, the PdeI-GFP-positive persister cells resumed activity, elongating, dividing, and forming new microcolonies (Figure 4K and Video 1). This dynamic behavior provides direct visual evidence of the persister phenotype associated with PdeI-high cells. These findings strongly suggest that c-di-GMP, a molecule whose intracellular levels are upregulated by PdeI, plays a significant role in generating a persister subpopulation during the early stages of biofilm development. The mechanism by which elevated c-di-GMP levels contribute to antibiotic tolerance may involve modulation of cellular metabolism or activation of stress response pathways, leading to a state of dormancy that enables survival under antibiotic stress. This discovery not only enhances our understanding of the link between biofilm formation and antibiotic tolerance but also identifies PdeI as a potential target for strategies aimed at combating persistent bacterial infections.
Time-lapse images of the persister assay using cells with different PdeI-BFP.
Discussion
In this study, we introduce RiboD-PETRI, an enhanced bacterial scRNA-seq method that offers a cost-effective (Supplementary file 6), equipment-free, and high-throughput solution. By incorporating a probe hybridization-based rRNA-derived cDNA depletion protocol, our approach efficiently removes rRNA reads and significantly improves mRNA detection rates, enabling a more comprehensive exploration of within-population heterogeneity. At $0.0049 per cell, RiboD-PETRI is substantially more economical than the original PETRI-seq ($0.056 per cell), making it an attractive option for budget-conscious researchers. The method demonstrates improved mRNA detection, versatility across various bacterial species and growth conditions, preservation of transcriptome profiles consistent with non-depleted samples (r=0.93) and traditional bulk RNA-seq methods (r=0.84), high transcriptome coverage (>99%), and robust single-cell resolution with median UMI counts ranging from 102 to 182 per cell across different species and conditions.
The application of RiboD-PETRI to investigate biofilm heterogeneity exemplifies its potential for exploring complex biological systems. Our analysis of early-stage biofilm development uncovered a rare subpopulation (2.1–2.6%) characterized by cell envelope genes, including the previously uncharacterized gene pdeI. Further investigation revealed PdeI’s novel role as a c-di-GMP synthetase rather than a phosphodiesterase, challenging initial predictions and contributing significantly to our understanding of bacterial signaling pathways. Moreover, we demonstrated that PdeI-high cells exhibit increased antibiotic tolerance, with a significantly higher proportion of persister cells compared to the general biofilm population. This finding establishes a link between elevated c-di-GMP levels, regulated by PdeI, and the generation of antibiotic-tolerant subpopulations during early biofilm development. The spatial distribution of PdeI-positive cells at the bottom of the static biofilm, corresponding to the cell-surface attachment region, supports our hypothesis that PdeI functions as a membrane-associated sensor integrating environmental signals with c-di-GMP production. This localization pattern suggests PdeI’s crucial role in the initial stages of biofilm formation, potentially responding to surface-associated cues to modulate c-di-GMP levels and promote attachment. While other marker genes in this cluster are co-expressed, our analysis indicates they do not significantly impact biofilm formation or directly relate to c-di-GMP or PdeI.
In conclusion, RiboD-PETRI represents a significant advancement in bacterial scRNA-seq methodology. Its ability to uncover hidden variations within bacterial populations, as demonstrated in our biofilm analysis, underscores its potential impact on advancing our understanding of microbial behavior and population dynamics. By providing a cost-effective and efficient tool for exploring bacterial heterogeneity, RiboD-PETRI opens new avenues for research in microbiology, potentially leading to novel insights into antibiotic resistance, biofilm formation, and other critical areas of bacterial biology.
Materials and methods
Reagent type (species) or resource | Designation | Source or reference | Identifiers | Additional information |
---|---|---|---|---|
Strain, strain background (Escherichia coli) | MG1655 | Yale Genetic Stock Center | CGSC#6300 | |
Strain, strain background (Caulobacter crescentus) | NA1000 | Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences | NCBI accession number CP001340 | |
Strain, strain background (Staphylococcus aureus) | ATCC 25923 | ATCC | ATCC 25923 | |
Strain, strain background (Escherichia coli) | MG1655 pBAD::gfp | This paper | Figure legends and Materials and methods section | |
Strain, strain background (Escherichia coli) | MG1655 p(pdeI promoter)::pdeI-gfp | This paper | Figure legends and Materials and methods section | |
Strain, strain background (Escherichia coli) | MG1655 p(pdeI promoter)::pdeI-bfp | This paper | Figure legends and Materials and methods section | |
Strain, strain background (Escherichia coli) | MG1655 Δara pBAD::pdeI | This paper | Figure legends and Materials and methods section | |
Strain, strain background (Escherichia coli) | MG1655 Δara pBAD::vector | This paper | Figure legends and Materials and methods section | |
Strain, strain background (Escherichia coli) | MG1655 p(pdeI promoter)::bfp | This paper | Figure legends and Materials and methods section | |
Strain, strain background (Escherichia coli) | MG1655 p(pdeI promoter)::pdeI(G412S)-bfp | This paper | Figure legends and Materials and methods section | |
Strain, strain background (Escherichia coli) | MG1655 p(pdeI promoter)::bfp p15A::c-di-GMP-sensor | This paper | Figure legends and Materials and methods section | |
Strain, strain background (Escherichia coli) | MG1655 p(pdeI promoter)::pdeI-bfp p15A::c-di-GMP-sensor | This paper | Figure legends and Materials and methods section | |
Strain, strain background (Escherichia coli) | MG1655 p(pdeI promoter)::pdeI(G412S)-bfp p15A::c-di-GMP-sensor | This paper | Figure legends and Materials and methods section | |
Strain, strain background (Escherichia coli) | MG1655 Δara pBAD::bfp | This paper | Figure legends and Materials and methods section | |
Recombinant DNA reagent | p15A::c-di-GMP-sensor | This paper | p15A ori | |
Recombinant DNA reagent | pBAD::vector | This paper | Arabinose-induction | |
Recombinant DNA reagent | pBAD::gfp | This paper | Arabinose-induction | |
Recombinant DNA reagent | pBAD::bfp | This paper | Arabinose-induction | |
Recombinant DNA reagent | p(pdeI promoter)::bfp | This paper | pdeI native promoter induction | |
Recombinant DNA reagent | p(pdeI promoter)::pdeI-bfp | This paper | pdeI native promoter induction | |
Recombinant DNA reagent | p(pdeI promoter)::pdeI(G412S)-bfp | This paper | pdeI native promoter induction | |
Recombinant DNA reagent | p(pdeI promoter)::pdeI-gfp | This paper | pdeI native promoter induction | |
Recombinant DNA reagent | p(pdeI promoter)::pdeI | This paper | pdeI native promoter induction | |
Recombinant DNA reagent | pBAD::pdeI-gfp | This paper | Arabinose-induction | |
Sequence-based reagent | P-pdeI-F | This paper | PCR primers | AATTGTCTGATTCGTTACCAACTGACCGTACTGGCGTTC |
Sequence-based reagent | P-pdeI-R | This paper | PCR primers | TTGCTGCTGCCTCGGCTTCTAGCTCTTTTACTAATTTTCCACTTTTATCCCAGG |
Sequence-based reagent | pdeI-F | This paper | PCR primers | GGCTAACAGGAGGAATTAACCATGCTGAGTTTATACGAAAAGATAAAGATAAG |
Sequence-based reagent | pdeI-R | This paper | PCR primers | GCTGGAGACCGTTTAAACTCACTACTCTTTTACTAATTTTCCACTTTTATCCC |
Sequence-based reagent | pBAD-R | This paper | PCR primers | TTGGTAACGAATCAGACAATTGAC |
Sequence-based reagent | pBAD-F | This paper | PCR primers | TGAGTTTAAACGGTCTCCAGC |
Sequence-based reagent | pBAD-R2 | This paper | PCR primers | GGTTAATTCCTCCTGTTAGCCC |
Sequence-based reagent | Bfp-F | This paper | PCR primers | CGAGGCAGCAGCAAAGGCCCTAGAAGGTGGATCCGGCGGTTCTAG |
Sequence-based reagent | Gfp-F | This paper | PCR primers | CTAGAAGCCGAGGCAGCAGCAAAGGCCCTAGAAATGAGTAAAGGAGAAGAACTTTTCAC |
Sequence-based reagent | G412S-F | This paper | PCR primers | GAAGCGGTGTTTAGTGTTGATG |
Sequence-based reagent | G412S-R | This paper | PCR primers | CATCAACACTAAACACCGCTTC |
Sequence-based reagent | P-bfp-R | This paper | PCR primers | GTTAATACATTTAACAAAATAACTATCTGA |
Sequence-based reagent | P-bfp-F | This paper | PCR primers | ATAGTTATTTTGTTAAATGTATTAACGGTGGATCCGGCGGTTCT |
Sequence-based reagent | UP-F | This paper | PCR primers | CATGAATTCTGGCGACGATTTCG |
Sequence-based reagent | UP-R | This paper | PCR primers | GTTAATACATTTAACAAAATAACTATCTGA |
Sequence-based reagent | ccdB-F | This paper | PCR primers | CACAGCGTTCAGATAGTTATTTTGTTAAATGTATTAACTCTAGAGCGACGCCAGACG |
Sequence-based reagent | ccdB-R | This paper | PCR primers | CTGTAAGTACGAACTTATTGATTCTGGACATACGTAAATTACGCCCCGCCCTGCCAC |
Sequence-based reagent | Down-F | This paper | PCR primers | TTTACGTATGTCCAGAATCAATAAGTTCGTACTTAC |
Sequence-based reagent | Down-R | This paper | PCR primers | ATCTTCGTCAAAGGATTTTCTGCCC |
Sequence-based reagent | UP2-R | This paper | PCR primers | ATCTTTTCGTATAAACTCAGCATGTTAATACATTTAACAAAATAACTATCTGAA |
Sequence-based reagent | pdeI-G412S-F | This paper | PCR primers | ATGCTGAGTTTATACGAAAAGATAAAGAT |
Sequence-based reagent | pdeI-G412S-R | This paper | PCR primers | CTTATTGATTCTGGACATACGTAAACTACTCTTTTACTAATTTTCCACT |
Sequence-based reagent | Down2-F | This paper | PCR primers | TTTACGTATGTCCAGAATCAATAAGTTCGTACTTAC |
Commercial assay or kit | KAPA HIFI hotStart ReadyMix PCR Kits | KAPA | Cat#2602 | |
Commercial assay or kit | VAHTS Universal DNA Library Prep Kit | Vazyme | Cat#NR603 | |
Commercial assay or kit | Bacteria RNA Extraction Kit | Vazyme | Cat#R403-01 | |
Commercial assay or kit | Ribo-off rRNA Depletion Kit (Bacteria) | Vazyme | Cat#N407 | |
Commercial assay or kit | 2× MultiF Seamless Assembly Mix | ABclonal | Cat#RK21020 | |
Commercial assay or kit | VAHTS Universal DNA Library Prep Kit for Illumina V3 | Vazyme | Cat#ND607 | |
Commercial assay or kit | ABScript III RT Master Mix for qPCR with gDNA Remover | ABclonal | Cat#RK20429 | |
Commercial assay or kit | SUPERase-In RNase Inhibitor | Invitrogen | Cat#AM2696 | |
Chemical compound, drug | Streptavidin Magnetic Beads | Thermo Fisher | Cat#88816 | |
Chemical compound, drug | Syto 24 dye | Invitrogen | Cat#S7559 | |
Chemical compound, drug | Arabinose | Sigma | Cat#V900920 | |
Chemical compound, drug | Ampicillin | Sangon Biotech | Cat#A610028 | |
Chemical compound, drug | Chloramphenicol | Sangon Biotech | Cat#A600118 | |
Chemical compound, drug | Kanamycin | Sangon Biotech | Cat#A600286 | |
Software, algorithm | Fiji | GitHub | https://fiji.sc/; RRID:SCR_002285 | |
Software, algorithm | FlowJo | Treestar, Inc | https://www.flowjo.com/ |
Resource availability
Request a detailed protocolFurther information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Yingying Pu (yingyingpu@whu.edu.cn).
Materials availability
Request a detailed protocolPlasmids generated in this study are available from the lead contact upon request.
Bacterial strains and growth conditions
The bacterial strains used in this study included E. coli strains MG1655, C. crescentus NA1000, and S. aureus strain ATCC 25923. E. coli cultures were grown in Luria broth (LB) medium. For the biofilm setup, bacterial cultures were grown overnight. The next day, we diluted the culture 1:100 in a Petri dish. We added 2 ml of LB medium to the dish. If the bacteria contain a plasmid, the appropriate antibiotic needs to be added to LB. The Petri dish was then incubated statically in a growth chamber for 24 hr. After incubation, we performed imaging directly under the microscope. The Petri dishes used were glass-bottom dishes from Biosharp (catalog number BS-20-GJM), allowing for direct microscopic imaging without the need for cover slips or slides. This setup allowed us to grow and image the biofilms in situ, providing a more accurate representation of their natural structure and composition. C. crescentus strain NA1000 was grown in peptone yeast extract (PYE) medium. And S. aureus strain ATCC 25923 was grown in Mueller-Hinton Broth (MHB) medium. All bacterial strains were routinely grown at 37°C and 220 rpm. To maintain plasmids, when necessary, media were supplemented with chloramphenicol (25 µg/ml) or kanamycin sulfate (50 µg/ml). For arabinose-induction system expression experiments, 0.002% or 0.00002% arabinose was supplemented in the medium.
Strains construction
Request a detailed protocolThe construction of recombinant plasmids was performed using the 2× MultiF Seamless Assembly Mix (ABclonal, RK21020). For the detection of c-di-GMP levels using c-di-GMP sensor and the detection of persister, the PdeI gene, pdeI, along with its native promoter (250 bp), was fused with either gfp or bfp and cloned into the pBAD backbone. The original promoter region of the pBAD vector was removed to avoid any potential interference. This construction allows the expression of the BFP, PdeI-BFP, and PdeI(G412S)-BFP fusion proteins to be driven by pdeI’s native promoter, thus maintaining its physiological control mechanisms. And the BFP coding sequence was fused to the pdeI gene to create the PdeI-BFP fusion construct. Besides, for membrane localization and localization in biofilm community, the pdeI-gfp and pdeI-bfp with native promoter of pdeI were cloned into the pBAD backbone, and the original promoter region of the pBAD vector was removed. For the control group, bfp and gfp was cloned into the pBAD backbone under the control of arabinose-induction system. For HPLC-MS/MS analysis, pdeI and empty vector were cloned into the pBAD backbone, induced by arabinose. GFP and BFP were used in different experiments. GFP was used for imaging and time-lapse imaging to observe persister cell growth. BFP was used for cell sorting and detecting the proportion of persister cells. For the c-di-GMP sensor (Addgene: #182291), the plasmid origin was replaced with the p15A ori.
RiboD-PETRI
Cell preparation
Request a detailed protocolE. coli MG1655 cells were cultured overnight and subsequently diluted at a ratio of 1:100 into fresh LB medium and grown statically for 24 hr at 37°C. For 3 hr exponential period E. coli sample, E. coli MG1655 cells were grown overnight and then diluted 1:100 into fresh LB medium and grown for 3 hr at 37°C and 220 rpm. C. crescentus strain NA1000 cells were grown overnight and then diluted 1:100 into fresh MHB medium and grown for 9 hr at 37°C and 220 rpm. And S. aureus strain ATCC 25923 cells were grown overnight and then diluted 1:100 into fresh PYE medium and grown for 3 hr at 37°C and 220 rpm. All the culture was vigorously shaken using a vortex, and the cells were then centrifuged at 5000×g for 2 min at 4°C. The pellet was resuspended in 2 ml of ice-cold 4% formaldehyde (F8775, MilliporeSigma, diluted into PBS). These suspensions were rotated at 4°C for 16 hr.
Cell permeabilization
Request a detailed protocol1 ml of fixed cells were centrifuged at 5000×g for 5 min at 4°C, then resuspended in 1 ml washing buffer (100 mM Tris-HCl pH 7.0, 0.02 U/μl SUPERase-In RNase Inhibitor, AM2696, Invitrogen). After another centrifugation at 5000×g for 5 min at 4°C, the supernatant was removed. The pellet was then resuspended in 250 μl permeabilization buffer (0.04% Tween-20 in PBS-RI, PBS with 0.01 U/μl SUPERase-In RNase Inhibitor) and incubated on ice for 3 min. 1 ml cold PBS-RI was added, and the cells were centrifuged at 5000×g for 5 min at 4°C. The pellet was resuspended in 250 μl Lysozyme Mix (250 μg/ml Lysozyme or 5 μg/ml Lysostaphin for S. aureus in TEL-RI buffer, comprising 100 mM Tris pH 8.0 [AM9856, Invitrogen], 50 mM EDTA [AM9261, Invitrogen], and 0.1 U/μl SUPERase In RNase Inhibitor). The samples were incubated at 37°C and mixed gently every minute. Then 1 ml cold PBS-RI was added immediately, and cells were centrifuged at 5000×g for 5 min at 4°C. The cells underwent another wash with 1 ml cold PBS-RI. Subsequently, cells were resuspended in 40 μl DNaseI-RI buffer (4.4 μl 10× reaction buffer, 0.2 μl SUPERase In RNase inhibitor, 35.4 μl H2O), followed by addition of 4 μl DNaseI (AMPD1, MilliporeSigma). The samples were incubated for 30 min at room temperature and mixed gently every 5 min. 4 μl Stop Solution was added, and the samples were incubated for 10 min at 50°C with gentle mixing every minute. Following centrifugation at 5000×g for 10 min at 4°C, cells were washed twice with 0.5 ml cold PBS-RI. Finally, cells were resuspended in 200 μl cold PBS-RI, and their count and integrity were assessed using the ACEA NovoCyte flow cytometer with a 100× oil immersion lens.
Primer preparation
Request a detailed protocolFor the first round of reverse transcription reaction, round 2 and round 3 ligation reactions, all primers design and preparation as previously described (Blattman et al., 2020). All primers were purchased from Sangon Biotech (Supplementary file 1). For ligation primers preparation, mixtures were prepared as follows: 31.1 μl each R2 primer (100 μM), 28.5 μl SB83 (100 μM), and 21.4 μl H2O were splitted to 2.24 μl for one sample. Mixtures containing 63.2 μl each R3 primer (70 μM) and 58 μl SB8 (70 μM) were splitted to 3.49 μl for one sample. Before use, ligation primers were incubated as follows: 95°C for 3 min, then decreasing the temperature to 20°C at a ramp speed of −0.1 °C/s, 37°C for 30 min. For blocking mix preparation, 50 μl primer SB84 (400 μM) and 80 μl primer SB81 (400 μM) were incubated as follows: 94°C for 3 min, then decreasing the temperature to 25°C at a ramp speed of −0.1 °C/s, 4°C for keeping. Round 2 blocking primers were mixed as follows: 37.5 μl 400 μM SB84, 37.5 μl 400 μM SB85, 25 μl 10× T4 ligase buffer, 150 μl H2O. Round 3 blocking primers were mixed as follows: 72 μl 400 μM SB81, 72 μl 400 μM SB82, 120 μl 10× T4 ligase buffer, 336 μl H2O, 600 μL 0.5 M EDTA.
Round 1 RT reaction
Request a detailed protocolAbout 3×107 cells were introduced into an RT reaction mix composed of 240 μl 5× RT buffer, 24 μl dNTPs (N0447L, NEB), 12 μl SUPERase In RNase Inhibitor, and 24 μl Maxima H Minus Reverse Transcriptase (EP0753, Thermo Fisher Scientific), 132 μl PEG8000 (50%). Nuclease-free water was added to achieve a total reaction volume of 960 μl, and the mixture was thoroughly mixed by vortexing. Subsequently, 8 μl of the reaction mixture was dispensed into each well of a 96-well plate, where 2 μl of each RT primer had been added previously. The sealed 96-well plate was inverted repeatedly for thorough mixing, followed by a brief spin. The plate was then incubated as follows: 50°C for 10 min, 8°C for 12 s, 15°C for 45 s, 20°C for 45 s, 30°C for 30 s, 42°C for 6 min, 50°C for 16 min, and finally held at 4°C. After the RT process, all 96 reactions were pooled into one tube. 75 μl of 0.5% Tween-20 was added, and the reactions were incubated on ice for 3 min. Cells were centrifuged at 7000×g for 10 min at 4°C and then resuspended in 0.4 ml PBS-RI. Thirty-two microliters of 0.5% Tween-20 was added, and the cells underwent centrifugation at 7000×g for 10 min at 4°C.
Round 2 ligation reaction
Request a detailed protocolCells were resuspended in 500 μl 1× T4 ligase buffer, followed by the addition of 107.5 μl PEG8000, 37.5 μl 10× T4 ligase buffer, 16.7 μl SUPERase In RNase Inhibitor, 5.6 μl BSA, and 27.9 μl T4 ligase (M0202L, NEB). The reaction solution was thoroughly mixed by vortexing. Subsequently, 5.76 μl of the reaction mixture was dispensed into each well of a 96-well plate, where 2.24 μl of each round 2 ligation primer had been added previously. The sealed 96-well plate was inverted repeatedly for thorough mixing and then subjected to a short spin. The plate was incubated at 37°C for 45 min. Following this, 2 μl of round 2 blocking mix was added to each well and incubated at 37°C for an additional 45 min. All 96 reactions were pooled into one tube after incubation.
Round 3 ligation reaction
Request a detailed protocolA mixture comprising 89 μl H2O, 26 μl PEG8000, 46 μl 10× T4 ligase buffer, and 12.65 μl T4 ligase was prepared and thoroughly mixed by vortexing. Subsequently, 8.51 μl of the reaction mixture was dispensed into each well of a 96-well plate, where 3.49 μl of each round 3 ligation primer had been added previously. The sealed 96-well plate was inverted repeatedly for thorough mixing and then subjected to a brief spin. The plate was incubated at 37°C for 45 min. Following this, 10 μl of round 3 blocking mix was added to each well and incubated at 37°C for an additional 45 min. All 96 reactions were combined into one tube after incubation.
Cells lysis
Request a detailed protocol42 μl of 0.5% Tween-20 was added, and cells were centrifuged at 7000×g for 10 min at 4°C. The cells underwent two washes using 200 μl TEL-RI containing 0.01% Tween-20, each time centrifuged at 7,000×g for 10 min at 4°C. Subsequently, cells were resuspended in 30 μl TEL-RI buffer. Cell counting and integrity checks were performed using the ACEA NovoCyte flow cytometer with a 100× oil immersion lens. A moderate amount of cells was then added to the lysis buffer (50 mM Tris pH 8.0, 25 mM EDTA, 200 mM NaCl, 0.5% Triton X-100), and 5 μl of proteinase K (AM2548, Invitrogen) was introduced. Samples were incubated at 55°C for 60 min and gently mixed every minute.
Library construction
Request a detailed protocolTo facilitate template switching, lysates were purified with VAHTS DNA Clean Beads (N411, Vazyme) at a ratio of 2.0×, and cDNA was eluted in 12 μl of water. The purified cDNA was then combined with 4 μl of 5× RT buffer, 1 μl of dNTPs (N0447L, NEB), 0.5 μl of SUPERase In RNase Inhibitor, 0.5 μl of Maxima H Minus Reverse Transcriptase, and 2 μl of the TSO (Picelli et al., 2013) primer (100 mM, Supplementary file 1). This reaction solution underwent incubation as follows: 25°C for 30 min, 42°C for 90 min, 85°C for 5 min, and then held at 4°C. Subsequently, 1 μl of RNaseH was added, and the reaction solution was incubated at 37°C for 30 min. The cDNA was purified once again with VAHTS DNA Clean Beads at a ratio of 2.0× and eluted in 13 μl of H2O. The integrity of the cDNA was assessed using primers TSO-2 and R1 or R2 or R3 by qPCR (Supplementary file 1).
Ribosomal RNA-derived cDNA depletion
Request a detailed protocolWe developed a set of cDNA probe primers to selectively deplete r-cDNA (Supplementary file 1). These probe primers possess the ability to specifically hybridize with r-cDNA and also hybridize with a biotin-labeled universal primer. In the reaction, 5 μl of r-cDNA probe primers (10 μM), 2.5 μl of 10× hybridization buffer (Tris-HCl pH 8.0 100 mM, NaCl 500 mM, EDTA pH 8.0 10 mM), and 5 μl of biotin primer (10 μM) were added to 12.5 μl of purified cDNA. The reaction solution underwent incubation as follows: 95°C for 2 min, followed by a temperature decrease to 20°C at a ramp speed of −0.1 °C/s, and then held at 37°C for 30 min. Subsequently, 20 μl of Streptavidin magnetic beads (BEAVER, 22307) was washed twice using 1 ml of 1× B&W buffer (Tris-HCl pH 7.5 10 mM, EDTA 1 mM, NaCl 1 M, Tween-20 0.05%) and resuspended in 25 μl of 2× B&W buffer. Twenty-five microliters of washed Streptavidin magnetic beads were added to 25 μl of annealed cDNA. The reaction solution was incubated at room temperature for 30 min with gentle mixing per minute. Following this, the reaction solution tube was placed into a magnetic stand to collect the supernatant. The cDNA depleted of r-cDNA was purified using VAHTS DNA Clean Beads at a ratio of 2.0× and eluted in 12.5 μl of H2O. The depletion of r-cDNA could be repeated using the above protocol, and ultimately, the cDNA was eluted in 20 μl of H2O. We designed separate probe sets for E. coli, C. crescentus, and S. aureus. Each set was specifically constructed to be reverse complementary to the r-cDNA sequences of its respective bacterial species. This species-specific approach ensures high efficiency and specificity in rRNA depletion for each organism.
Library amplification and sequencing
Request a detailed protocolTo the 20 μl cDNA solution, the following components were added: 2.4 μl R3 primer (10 mM, Supplementary file 1), 2.4 μl TSO-2 primer (10 mM, Supplementary file 1), 40 μl 2× KAPA HIFI mix (KAPA, 2602), 1.6 μl SYBR Green (25×), 0.8 μl MgCl2 (0.1 M), and 12.8 μl H2O. This PCR solution was placed in a thermocycler and incubated with the following parameters: 98°C for 45 s, followed by cycling of 98°C for 15 s, 60°C for 30 s, and 72°C for 60 s. Cycling continued on a qPCR machine until the reaction approached saturation. PCR products were then purified using VAHTS DNA Clean Beads at a ratio of 0.9× and eluted in 25 μl of H2O. Finally, the purified PCR products underwent end repair and adaptor ligation using the VAHTS Universal DNA Library Prep Kit for Illumina V3 (Vazyme, ND607).
Bulk RNA-seq library construction
Request a detailed protocolTotal RNA of the samples was extracted utilizing the Bacteria RNA Extraction Kit (R403-01, Vazyme). Subsequently, the RNA underwent mRNA enrichment (N407, Vazyme), fragmentation, cDNA synthesis, and library preparation using the VAHTSTM Total RNA-seq (H/M/R) Library Prep Kit for Illumina (NR603, Vazyme).
Bioinformatics analysis methods
Single-cell analysis
Request a detailed protocolThe sequencing data underwent processing into matrices using scripts and a pipeline as previously described (Blattman et al., 2020) in Python 2.7.15, with some modifications (the detailed original code and all the data were deposited in the GEO repository). After the count tables were made, subsequent analysis of single-cell data was conducted using Seurat (Hao et al., 2021) package (version 4.3.0; http://satijalab.org/seurat/) in R (https://www.r-project.org/). Since there were two replicates of static E. coli biofilm, these two datasets were merged into one SeuratObject and batch effects were removed. However, the samples for exponential period E. coli, S. aureus, and C. crescentus only had one sample, so they did not need this process. At the beginning of doing the scRNA-seq analysis, we screened the data of all samples. For preprocessing of static E. coli biofilm data, cells were filtered with UMI per cell more than 100 and less than 2000 for replicate 1 and replicate 2 to obtain 1621 and 3999 cells, respectively. For data of exponential period E. coli, the data was screened for cells with UMIs greater than 200 and less than 5000 to obtain 1464 cells. The screening criteria of S. aureus were cells with UMIs greater than 15 and less than 1000 and genes greater than 30 (1000>UMIs>15, gene counts>30). The screening criteria of C. crescentus were cells with UMIs greater than 200 and less than 5000 and gene counts greater than 30 (5000>UMIs>200, gene counts>30). After screening, all the data were normalized using a scale factor of 10,000 through a global-scaling normalization method called ‘LogNormalize’. Highly variable features were then identified, returning 500 features per dataset. Then we combine the data of the two replicates of static E. coli biofilm into a single SeuratObject by FindIntegrationAnchors and IntegrateData functions. Then all the data underwent scaling using the ScaleData function, followed by dimension reduction through principal component analysis. To avoid subtle batch effects influencing downstream analyses, we removed batch effects using RunHarmony (Korsunsky et al., 2019) for the two replicates of static E. coli biofilm data. Then a graph-based clustering approach was employed in all data to identify clusters of gene expression programs using the Louvain algorithm (Seurat 4.3.0). The dims we chose were 6. And the resolution was 0.3 for C. crescentus and S. aureus or 0.4 for E. coli data. Marker genes for each cluster were computed using the Wilcoxon rank-sum test. Specifically, marker genes for each cluster were initially obtained using the FindMarkers function of Seurat. Then we performed pathway enrichment analysis of marker gene by clusterProfiler function (Yu et al., 2012) within R. For transcriptome-wide gene coverage across the cell population, we counted the number of genes expressing at least one UMI. Then we calculated the percentage of these genes out of all the genes in each bacterium.
Comparison of scRNA-seq with bulk RNA-seq
Request a detailed protocolThe bulk RNA-seq clean data reads were mapped to the E. coli MG1655 k12 genome (EnsemblBacteria Taxonomy ID: 511145) using the BWA aligner software (v0.7.17-r1188, https://github.com/lh3/bwa: Li, 2024). Sam files were converted to bam files using samtools (v1.9). The mapping results were counted by featureCounts (https://subread.sourceforge.net/; Liao et al., 2024) to generate expression results. Single-cell and bulk transcriptomes of E. coli were compared by computing the Pearson correlation of log2 reads per gene of bulk RNA-seq and log2 UMI per gene of scRNA-seq.
Sequencing saturation of the libraries
Request a detailed protocolTo assess sequencing saturation, we generated five subsamples from the single-cell sequencing data, representing 20%, 40%, 60%, 80%, and 100% of the total data. Each subsample was analyzed independently following the previously described single-cell sequencing data analysis process. We then created gene expression matrices for each subsample based on the analysis results. The number of UMIs or genes was counted for each cell detected in each subsample. Next, we sorted the cells in descending order based on their UMI counts or gene counts and selected different numbers of cells from this sorted list, starting from those with the highest UMI or gene counts. For each selection of cells, we calculated the median number of UMIs or genes. Finally, we created a line graph representing the median UMI counts and genes using GraphPad Prism 9 software, allowing us to visualize the sequencing saturation across the different subsamples.
In addition, we used the saturation calculation method of 10x Genomics to further detect the saturation of the data. The formula for calculating this metric is as follows:
Sequencing Saturation = 1 - (n_deduped_reads / n_reads). Given the differences between RiboD-PETRI and 10x Genomics datasets, we have adapted the calculation as follows:
n_deduped_reads: The number of UMIs as a measure of unique reads.
n_reads: The total number of confidently mapped reads.
Multiplet frequency determination
Request a detailed protocolDetermination of the multiplet frequency was essential in assessing the efficiency of single-cell capture in RiboD-PETRI. This frequency is defined as the probability that a non-empty barcode corresponds to more than one cell. To calculate it, we used a Poisson distribution-based approach involving several key steps. Initially, we calculated the proportion of barcodes corresponding to zero cells using the formula p(0) = . Then, we determined the proportion for one cell, p(1)= , and derived the proportions for more than zero cells and more than one cell . These values allow for the calculation of the multiplet frequency as . The parameter λ plays a vital role in this model, representing the ratio of the number of cells to the total number of possible barcode combinations.
Flow cytometry sorting of bacteria and analysis
Request a detailed protocolAll samples were measured using a Beckman CytoFLEX SRT flow cytometer with a 70 μm nozzle, using normal saline as sheath fluid. During the 24 hr static biofilm growth phase, strains labeled with BFP, PdeI-BFP, PdeI(G412S)-BFP, or c-di-GMP sensors were washed and resuspended in sterile PBS. Microorganisms were identified based on forward scatter and side scatter parameters. Cells were sorted into distinct groups according to their fluorescence intensity, with V450 used for BFP, FITC for mVenusNB, and ECD for mScarlet-I. The resulting data were subsequently analyzed using FlowJo V10 software (Tree Star, Inc).
Antibiotic killing and persister counting assay
Request a detailed protocolCells sorted by flow cytometry were resuspended in fresh LB supplemented with 150 μg/ml ampicillin. The suspension was then incubated at 37°C for 3 hr with continuous shaking at 220 rpm. To determine the initial cell count, an aliquot of the cell suspension was taken before the ampicillin challenge, serially diluted, and plated on LB agar plates for colony-forming unit (CFU) enumeration. These plates were incubated overnight at 37°C. Following the ampicillin challenge, cells were harvested by centrifugation, washed once with sterile PBS to remove residual antibiotic, and resuspended in fresh PBS. This suspension was then serially diluted and plated on LB agar plates for post-challenge CFU counts. The persister ratio was calculated as the number of CFUs after ampicillin challenge divided by the number of CFUs before challenge. All experiments were performed in triplicate, with the results presented as means ± standard deviations of three independent biological replicates.
Microscopy
Bright-field and fluorescence imaging
Request a detailed protocolInverted microscopes, specifically the Nikon Eclipse Ti2 and Leica Stellaris 5 WLL, were employed for imaging, utilizing different lasers for illumination: a 405 nm laser for BFP and a 488 nm laser for GFP. Fluorescence emission signals were captured using an sCMOS camera (pco.edge 4.2 bi). Dedicated filter sets corresponding to the spectral characteristics of each fluorophore were utilized. Image analysis was performed with ImageJ software (Fiji). For the analysis of the c-di-GMP sensor, the ratio of mVenusNB to mScarlet-I (R) displayed a negative correlation with c-di-GMP concentration. Consequently, the value of R−1 demonstrated a positive correlation with c-di-GMP concentration.
Time-lapse imaging
Request a detailed protocolTo investigate the processes of antibiotic killing and bacterial resuscitation, cells labeled with PdeI-GFP during the 24 hr static growth phase were collected and washed twice with PBS. These cells were then imaged on a gel pad composed of 3% low melting temperature agarose in PBS, which was prepared as a gel island in the center of the FCS3 chamber. The cells were observed under either bright-field or epifluorescence illumination. Following the imaging, the gel pad was surrounded by LB containing 150 μg/ml ampicillin, and the cells were incubated for 6 hr at 35°C. Fresh LB was subsequently flushed in, and the growth medium was refreshed every 3 hr, ensuring sufficient recovery time for the cells.
Determination of c-di-GMP concentration by HPLC-MS/MS
Request a detailed protocolThe determination of c-di-GMP concentration by HPLC-MS/MS involved a series of steps. Initially, MG1655 Δara pBAD::pdeI and MG1655 Δara pBAD::empty-vector strains were grown to mid-exponential growth phase, followed by induction with 0.002% arabinose. After a 2 hr incubation period, cells were harvested and washed with PBS. The washed cells were then rapidly frozen using liquid nitrogen. Simultaneously, another portion of washed cells was stained with SYTO 24 and quantified using flow cytometry. The determination of c-di-GMP concentration was conducted by Wuhan Lixinheng Technology Co. Ltd. through HPLC-MS/MS. In the experiment, first, for cell samples, addition of 500 μl of extract solvent (precooled at –20°C, acetonitrile-methanol water, 2:2:1), the samples were vortexed for 30 s, homogenized at 38 Hz for 4 min, and sonicated for 5 min in ice-water bath. The homogenate and sonicate circle were repeated for three times, followed by incubation at –20°C for 1 hr and centrifugation at 12,000 rpm and 4°C for 15 min. An 80 μl aliquot of the clear supernatant was transferred to an auto-sampler vial for LC-MS/MS analysis. The UHPLC separation was carried out using an Waters ACQUITY H-class plus UPLC System, equipped with Agilent ZORBAX Eclipse Plus C18 column (2.1 mm × 150 mm, 1.8 μm). An Waters Xevo TQ-XS triple quadrupole mass spectrometer, equipped with an electrospray ionization interface, was applied for assay development. All strains were assayed in biological triplicates, and the measured values were converted into intracellular c-di-GMP concentrations (pg) per cell.
Quantification and statistical analysis
Request a detailed protocolStatistical analysis was conducted using GraphPad Prism 9 software for Windows. The significance of results was determined using a two-tailed Student’s t-test. Error bars in the data representation indicate the standard deviations of the mean from a minimum of three independent experiments. A significance threshold of p<0.05 was applied throughout the analysis. To denote significant differences in the results, a system of asterisks was used: * for p<0.05, ** for p<0.01, *** for p<0.001, and **** for p<0.0001. This comprehensive approach ensured a thorough and statistically sound analysis of the c-di-GMP concentration in the studied bacterial strains.
Data availability
Sequencing data have been deposited in GEO under accession codes GSE260458. All data generated or analysed during this study are included in the manuscript and supporting files; source data files have been provided for Figures 1—4 and Figure 1—figure supplement 1, Figure 2—figure supplement 1 and Figure 3—figure supplement 1.
-
NCBI Gene Expression OmnibusID GSE260458. An improved bacterial single-cell RNA-seq reveals biofilm heterogeneity.
-
NCBI Gene Expression OmnibusID GSE151940. Microbial single-cell RNA sequencing by split-pool barcoding.
-
NCBI Gene Expression OmnibusID GSE141018. Prokaryotic Single-Cell RNA Sequencing by In Situ Combinatorial Indexing.
-
NCBI Gene Expression OmnibusID GSE231935. Massively-parallel Microbial mRNA Sequencing (M3-Seq) reveals heterogenous behaviors in bacteria at single-cell resolution.
-
NCBI Gene Expression OmnibusID GSE119888. Single-cell RNA-seq reports growth condition-specific global transcriptomes of individual bacteria.
-
NCBI Gene Expression OmnibusID GSE180237. Bacterial droplet-based single-cell RNA-seq reveals antibiotic-associated heterogeneous cellular states.
References
-
Prokaryotic single-cell RNA sequencing by in situ combinatorial indexingNature Microbiology 5:1192–1201.https://doi.org/10.1038/s41564-020-0729-6
-
The biofilm matrix: multitasking in a shared spaceNature Reviews. Microbiology 21:70–86.https://doi.org/10.1038/s41579-022-00791-0
-
c-di-GMP inhibits the DNA binding activity of H-NS in SalmonellaNature Communications 14:7502.https://doi.org/10.1038/s41467-023-43442-5
-
scRNA-seq data analysis method to improve analysis performanceIET Nanobiotechnology 17:246–256.https://doi.org/10.1049/nbt2.12115
-
Probe-based bacterial single-cell RNA sequencing predicts toxin regulationNature Microbiology 8:934–945.https://doi.org/10.1038/s41564-023-01348-4
-
Growing and analyzing static biofilmsCurrent Protocols in Microbiology 22:1.https://doi.org/10.1002/9780471729259.mc01b01s22
-
Division of labor: how microbes split their responsibilityCurrent Biology 28:R697–R699.https://doi.org/10.1016/j.cub.2018.05.024
-
Marine biofilms: diversity, interactions and biofoulingNature Reviews. Microbiology 20:671–684.https://doi.org/10.1038/s41579-022-00744-7
-
Expression and genetic activation of cyclic Di-GMP-specific phosphodiesterases in Escherichia coliJournal of Bacteriology 198:448–462.https://doi.org/10.1128/JB.00604-15
-
Single-cell sequencing-based technologies will revolutionize whole-organism scienceNature Reviews. Genetics 14:618–630.https://doi.org/10.1038/nrg3542
-
Physiology of microbes in biofilmsCurrent Topics in Microbiology and Immunology 322:17–36.https://doi.org/10.1007/978-3-540-75418-3_2
-
Physiological heterogeneity in biofilmsNature Reviews. Microbiology 6:199–210.https://doi.org/10.1038/nrmicro1838
-
mRNA-Seq whole-transcriptome analysis of a single cellNature Methods 6:377–382.https://doi.org/10.1038/nmeth.1315
-
Gas and light: triggers of c-di-GMP-mediated regulationFEMS Microbiology Reviews 47:fuad034.https://doi.org/10.1093/femsre/fuad034
Article and author information
Author details
Funding
National Natural Science Foundation of China (31970089)
- Yingying Pu
National Key Research and Development Program of China (2021YFC2701602)
- Yingying Pu
Science Fund for Distinguished Young Scholars of Hunan Province (2022CFA077)
- Yingying Pu
Fundamental Research Funds for the Central Universities (2042022dx0003)
- Yingying Pu
Major Project of Guangzhou National Laboratory (GZNL2024A01023)
- Yingying Pu
Natural Science Foundation of Yunnan Province (202001BB050005)
- Chunming Guo
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank Prof. Fan Bai (Peking University) for valuable discussions. We thank Drs. Jidong Xing and Ziyang Liu for support in bioinformatics. We also thank the members of our laboratory for helpful discussion. This work is supported by the grants to YP from the National Key R&D Program of China (2021YFC2701602), the National Natural Science Foundation of China (31970089), Science Fund for Distinguished Young Scholars of Hubei Province (2022CFA077),
Major Project of Guangzhou National Laboratory (GZNL2024A01023), and the Fundamental Research Funds for the Central Universities (2042022dx0003). This work is also supported by CG from the Natural Science Foundation of Yunnan Province of China (202001BB050005). We also thank all the staff in the Core Facilities of Medical Research Institute at Wuhan University and the Core Facilities at School of Life Sciences at Peking University for their technical support.
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
- Reviewed Preprint version 3:
- Version of Record published:
Cite all versions
You can cite all versions using the DOI https://doi.org/10.7554/eLife.97543. This DOI represents all versions, and will always resolve to the latest one.
Copyright
© 2024, Yan, Liao et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,551
- views
-
- 118
- downloads
-
- 0
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Microbiology and Infectious Disease
Influenza A virus transcribes viral mRNAs from the eight segmented viral genome when it infects. The kinetics of viral transcription, nuclear export of viral transcripts, and their potential variation between the eight segments are poorly characterised. Here, we introduce a statistical framework for estimating the nuclear export rate of each segment from a snapshot of in situ mRNA localisation. This exploits the cell-to-cell variation at a single time point observed by an imaging-based in situ transcriptome assay. Using our model, we revealed the variation in the mRNA nuclear export rate of the eight viral segments. Notably, the two influenza viral antigens hemagglutinin and neuraminidase were the slowest segments in the nuclear export, suggesting the possibility that influenza A virus uses the nuclear retention of viral transcripts to delay the expression of antigenic molecules. Our framework presented in this study can be widely used for investigating the nuclear retention of nascent transcripts produced in a transcription burst.
-
- Microbiology and Infectious Disease
Bacterial pathogens employ epigenetic mechanisms, including DNA methylation, to adapt to environmental changes, and these mechanisms play important roles in various biological processes. Pseudomonas syringae is a model phytopathogenic bacterium, but its methylome is less well known than that of other species. In this study, we conducted single-molecule real-time sequencing to profile the DNA methylation landscape in three model pathovars of P. syringae. We identified one Type I restriction–modification system (HsdMSR), including the conserved sequence motif associated with N6-methyladenine (6mA). About 25–40% of the genes involved in DNA methylation were conserved in two or more of the strains, revealing the functional conservation of methylation in P. syringae. Subsequent transcriptomic analysis highlighted the involvement of HsdMSR in virulent and metabolic pathways, including the Type III secretion system, biofilm formation, and translational efficiency. The regulatory effect of HsdMSR on transcription was dependent on both strands being fully 6mA methylated. Overall, this work illustrated the methylation profile in P. syringae and the critical involvement of DNA methylation in regulating virulence and metabolism. Thus, this work contributes to a deeper understanding of epigenetic transcriptional control in P. syringae and related bacteria.