Peptidoglycan (PGN), a net-like polymer constituted by muropeptides, provides protection for microorganisms and has been a major target for antibiotics for decades. Researchers have explored host-microbiome interactions through PGN recognition systems and discovered key muropeptides modulating host responses. However, most common characterization techniques for muropeptides are labor-intensive and require manual analysis of mass spectra due to the complex cross-linked PGN structures. Each species has unique moiety modifications and inter-/intra-bridges, which further complicates the structural analysis of PGN. Here, we developed a high-throughput automated muropeptide analysis (HAMA) platform leveraging tandem mass spectrometry and in silico muropeptide MS/MS fragmentation matching to comprehensively identify muropeptide structures, quantify their abundance, and infer PGN cross-linking types. We demonstrated the effectiveness of HAMA platform using well-characterized PGNs from E. coli and S. aureus and further applied it to common gut bacteria including Bifidobacterium, Bacteroides, Lactobacillus, Enterococcus, and Akkermansia muciiniphila. Specifically, we found that the stiffness and strength of the cell envelopes may correspond to the lengths and compositions of interpeptide bridges within Bifidobacterium species. In summary, the HAMA framework exhibits an automated, intuitive, and accurate analysis of PGN compositions, which may serve as a potential tool to investigate the post-synthetic modifications of saccharides, the variation in interpeptide bridges, and the types of cross-linking within bacterial PGNs.
This study reports a new approach to determine the architecture of peptidoglycan (PG), the primary component of the bacterial cell wall, validating the pipeline through an architectural analysis of several members of the human gut microbiota. The technique is potentially valuable for this sub-field as it would enable researchers interested in peptidoglycan in a range of organisms to easily assess muropeptide composition in an easy, automated manner. However, there is some uncertainty about whether the pipeline was fully automated. Additionally, the use of the technique to support that PG cross-bridge length is a determinant of cell wall stiffness resulted in incomplete evidence that would need substantiating, more direct support.
Peptidoglycan (PGN) is an essential and distinct component of bacterial cell walls that envelop the cytoplasmic membrane.1 It maintains a defined bacterial shape, preserves cell integrity by counteracting the internal osmotic pressure, and acts as a scaffold for other cell wall- anchored components such as proteins and teichoic acids.2 Peptidoglycan provides a protective barrier for microorganisms against environmental threats, contributing to its structural rigidity. Generally, PGN is a net-like polymeric structure composed of liner glycan chains cross-linked by short peptides. The PGN-repeat unit (so-called muropeptide) comprises N-acetylglucosamine (GlcNAc) and N-acetylmuramic acid (MurNAc) linked by β1,4-glycosidic bonds and a peptide stem attached to lactyl moiety of MurNAc. The stem peptide generally contains L-alanine, D-iso- glutamic acid (amidated to D-iso-glutamine in most Gram-positive bacteria), meso- diaminopimelic acid (mDAP) in Gram-negative bacteria, L-lysine in Gram-positive bacteria, D- alanine, and D-alanine sequentially. In PGN architecture, portions of the stem peptide stem from adjacent glycan strands are connected through transpeptidation, forming a multi-layered, three- dimensional (3-D) meshwork.3 Although this main structure is well-conserved among bacteria, the mature peptidoglycan structure remains heterogeneous and species-specific. This structural diversity is attributed to chemical modifications of the sugar backbone (O-acetylation, N- deacetylation, and N-glycolylation), variations in the type of di-amino acid at position 3 of the stem peptide, variations in the interpeptide bridge, and patterns of cross-linking (4-3 linkage, 3-3 linkage, and 2-4 linkages).4, 5
In recent years, characterizing muropeptides has become increasingly important, not only because peptidoglycan is a target for antibiotic drug design, but also because of the various roles muropeptides play as signaling molecules involved in microbial interaction, antimicrobial release, and host innate immunity.6–9 The gut microbiota, consisting of thousands of bacteria, is a source of peptidoglycans in organisms. The signaling functions of gut microbiota-derived PGN fragments in host-microbiota interactions have been extensively studied in inflammation, metabolism, autoimmune diseases, and brain development.10–13 Therefore, analyzing the PGN structural composition and its chemical modifications is critical to understand the PGN recognition processes and subsequent activation mechanisms in the immune system.14
Nowadays, analytical methods for peptidoglycan architecture include HPLC, LC-MS, solid- state NMR, and AFM imaging, enabling a comprehensive study of PGN structure from chemical composition to 3-D architecture.5 Online ultra-performance liquid chromatography coupled with electrospray ionization– tandem mass spectrometry (UPLC-ESI-MS/MS) is the primary strategy for muropeptide structural identification, requiring minimal sample amounts and time.15–17 However, data processing, such as peak annotation and MS/MS spectra alignment, still relies on the prior knowledge of experienced biological chemists. Previous work published by Bern et al. demonstrated the analysis of muropeptides of Clostridium difficile with MS/MS fragmentation spectra by Byonic peptide identification software.18 Another innovative tool, PGFinder, created by Patel et al., shows consistent and reproducible data analysis of Gram-negative bacterial PGN with MS1 library searching.19 Nevertheless, an automated analysis based on MS/MS identification is certainly needed for a thorough PGN investigation of complex muropeptide structures, particularly for gut microbial peptidoglycans.
Here, we developed an in-house, high-throughput automated muropeptide analysis (HAMA) platform that simplifies muropeptide structures to sequence format, mimicking the proteomic “bottom-up” approach. We also established a comprehensive in silico MS/MS fragmentation database for muropeptide identification. Using this platform coupled with high-resolution mass spectrometry, we revealed the peptidoglycan compositions of common gut bacteria, including Bifidobacterium, Bacteroides, Lactobacillus, Enterococcus, and Akkermansia muciiniphila.
Material and methods
Bacterial strains and cell culture
Bacteria and their respective media used in this work were listed in Supplementary Table 1. Anaerobic bacteria such as Bifidobacterium, Bacteroides, Lactobacillus, Enterococcus, and Akkermansia muciiniphila were cultured anaerobically by using an anaerobic workstation (Whitley DG 250, Don Whitley Scientific Limited, England).
Peptidoglycan extraction and Mutanolysin Digestion
The extraction of peptidoglycan was performed using a previously described method, with some modifications.9, 16 Ten milliliters of overnight cultures that have reached the stationary phase were harvested and lysed by 0.5% SDS solution in 0.1 M Tris/HCl (pH 7.0). After boiled for 30 min at 100 ℃ and thoroughly washed with H2O three times, the pellets were resuspended in 750 mL of H2O and sonicated for 30 min. The suspensions were spiked with 750 mL of 0.1 M Tris/HCl solution (pH 7.0) and treated with DNase and RNase at 37 ℃ in a shaker for 1 hr. After the removal of residual nucleic acids, the suspensions were subsequently treated with pronase to digest cell wall-bound proteins (final concentration of 1 mg/mL for 16 h at the same conditions). The pronase-treated cell walls were washed twice with H2O before wall teichoic acids were released by 1 mL of 1 N HCl incubation for 4 h. Insoluble peptidoglycans were washed with H2O until pH 5-6 and resuspended in 200- 500 μL of 12.5 mM sodium dihydrogen-phosphate buffer (pH 5.5) to an OD600 of 3.0. One hundred microliters of insoluble peptidoglycan suspensions were digested overnight with 1 μL of mutanolysin solution (1.000 U/mL in H2O). Following mutanolysin inactivation (100 ℃ boiling for 3 min) and centrifugation (5 min at 10,000 rpm), the supernatants were spiked with 50 μL of reduction solution (10 mg/mL sodium borohydride in 250 mM borate buffer at pH 9.0). After 20 min incubation at room temperature, the reduction reaction was stopped by adding 1 μL of phosphoric acid (98%), resulting in pH 3-4. Reduced muropeptide samples were subsequently analyzed by UPLC-MS/MS.
UPLC-MS/MS analysis of muropeptides
UPLC-MS/MS analysis was performed by a Dionex UltiMate 3000 UHPLC system coupled to a Q-Exactive Plus hybrid quadrupole-orbitrap mass spectrometer (Thermo Scientific). Soluble muropeptides were separated by an Acquity UPLC CSH C18 column (130 Å, 1.7 μm, 2.1 mm × 100 mm), with a solvent A (0.1% formic acid in water) and solvent B (0.1% formic acid in acetonitrile) as the mobile phase. A flow rate of 0.25 mL/min and 1% mobile phase B were applied in column condition. The injection volume of each sample was 5 μL. An elution gradient was run for 25 min: starting with 1% solvent B for 0-2 min; 1-20% B (linear), 2-15 min, 20-95% B (linear), 15-17 min; 95% B, 17-19 min; 95-1% B (linear), 19-21 min; 1% B, 21-25 min for re-equilibration. The column temperature was controlled at 52 ℃ throughout the whole analysis program. A Heat electrospray ionization (HESI) source was operated in positive mode, with parameters automatically optimized under a flow rate of 250 μL/min (a capillary temperature of 300 ℃, a probe temperature of 300 ℃, and a spray voltage of 3.5 kV). Data-dependent acquisition (DDA) mode was used in the instrument. A mass range of MS1 acquisition was from 400 to 2,000 m/z at a resolution of 70,000 and the top 5 most abundant ions were subjected to higher-energy collision-induced dissociation (HCD) fragmentation with Δm/z 3 isolation window, stepped normalized collision energy (NCE) at 20, 25, and 35 (A.U.), and dynamic exclusion time of 4 sec.
Data processing and data analysis
LC-MS/MS raw data was converted to an mzXML format using MSConvert (ProteoWizard) and then processed by HAMA for mass deconvolution and PGN identification.
Mutanolysin digestion assay
Purified peptidoglycan was adjusted to OD600 1.0 in sodium dihydrogen-phosphate buffer and digested with 5 μL of mutanolysin (1.000 U/mL in H2O). Absorbance was measured at OD600 of time points at 0, 3, 6, 9, and 16 hr in a Synergy H1 microplate reader (BioTek) with constant orbital shaking at 37 °C.
Immobilization of cultured bacteria for AFM imaging
The method of bacteria immobilization was performed as described previously, with some modifications.20 Overnight cultured bacteria were washed twice and diluted to OD600 0.3 with 1x PBS. Freshly cleaved Si wafers were coated by 500 μL of 0.1% (w/v) poly-l-lysine in H2O and left overnight at room temperature. Afterward, the substrates were rinsed five times with ultrapure H2O and dried with a nitrogen flow. One hundred microliters of the bacteria suspension were dropped on a silicon substrate, left for 10-20 min, and then rinsed three times in H2O bath. The bacteria were dried onto the substrate with flowing nitrogen and rehydrated again in 1x PBS bath for liquid imaging.
Atomic force microscopy (AFM) imaging
Cultured bacteria images were acquired in 1x PBS bath at room temperature by AFM (Dimension Icon, Bruker) with ScanAsystTM. Data were collected by the PeakForce Quantitative NanoMechanics (QNM) mode with qp-CONT probes (spring constant: 0.25 N/m, resonant frequency: 30 kHz, NANOSENSORSTM). Images were acquired at a scan rate of 25 μm/s, an applied force of 500 pN, and with a resolution of 256 × 256 pixels per image frame. For mechanical analysis, the approaching part of the curves were fitted with the Derjaguin-Muller-Toporov (DMT) model. DMT modulus of each bacterium was recorded with 16 × 16 curves on top of the bacteria with the area of 250 × 250 nm2.
HAMA platform: a High-throughput Automated Muropeptide Analysis for Identification of PGN Fragments
The complex composition and structure of PGN make their structural analysis and identification a time-consuming and challenging process. To address this, we developed a high- throughput and automated platform for PGN structural analysis based on a proteomic “bottom- up” approach. Our strategy is to simplify the structure of muropeptides into sequences by which MS/MS patterns of b- and y- ions could be generated in silico for CID/HCD fragmentation spectra matching. The overview of peptidoglycan analysis in this work is depicted in Fig. 1a. Briefly, bacterial peptidoglycan was extracted, purified, and hydrolyzed by mutanolysin, and the muropeptide products were then analyzed by UPLC-MS/MS. The automated identification of these muropeptides was accomplished using the HAMA platform, a MATLAB-based software with a user-friendly graphic user interface (Supplementary Fig. 1) that includes three parts: DBuilder, Analyzer, and Viewer.
DBuilder for Species-specific muropeptide database construction. In order to construct muropeptide databases in silico, inspired by the well-developed glyco-proteomics analysis, we named each residue in disaccharide peptide with a letter and simplified the muropeptide structure into a sequence format. The representation letters of monosaccharides and amino acids are listed in Supplementary Table 2. Dashes and parentheses were used to represent peptide/glycosidic bonds and discriminate certain subunits, respectively. DBuilder combined the input options of three parts (GlcNAc, MurNAc, and peptide) to generate a comprehensive database that contains monomeric, dimeric, and trimeric muropeptides, as shown in Fig. 1b. Herein, we constructed PGN multimers using two types of polymerization events: transglycosylation connected via glycosidic bonds, and transpeptidation linked through peptide bonds. In DBuilder, the type of peptide linkage used was the common 4-3 cross-link, which could be achieved either directly through a covalent bond between the penultimate D-Ala and the third residue in the acceptor peptide stem, or via an interpeptide bridge. We also considered that the terminal D-Ala (at the fifth position) in the donor stem was not allowed during the transpeptidation reaction. In addition, choosing more possible modifications (anhydro MurNAc residues, deacetyl GlcNAc residues, acetyl MurNAc residues, and amidated iso-Glu) to construct a PGN database could lead to severe mass coincidence issues for MS1 searching in the Analyzer. Hence, we set a maximum modification number of six to avoid generating multi-modified muropeptide candidates. Finally, the species-specific muropeptide database was outputted to a *.csv format file, including muropeptide sequences and corresponding chemical formula and theoretical monoisotopic mass.
Analyzer for analysis of muropeptide MS raw files. Analyzer is an integrated tool for processing MS data and identifying muropeptides. The flowchart is shown in Fig. 1c. First, the species-specific database, MS raw file (mzXML format),21 and parameters were loaded into Analyzer. Parameters were originally set with MS1 range m/z 400-2000, retention time range 2- 12 min, mass tolerance of 10 ppm (orbitrap mass spectrometry data), intensity threshold of 1e5, etc., which could be input by users for different experimental conditions. Then, the mzXML file was processed sequentially by peak picking, deisotoping, feature selection, and deconvolution. The observed masses were searched against the loaded database within 10 ppm tolerance. However, in cases where the observed mass matched more than one inferred sequence in MS1 searching, Analyzer compared the MS/MS spectra of those MS1 matched features to in silico MS/MS fragmentation spectra of corresponding inferred candidates through cosine similarity and matched peak score (MPS= the number of the matched peaks/the number of the predicted peaks) calculation. The observed mass was then identified as the sequence with the highest matching score. Lastly, all identified muropeptides were merged into an Excel spreadsheet reporting charge state, molecular weight, retention time, peak intensity, peak area, sequence, main scan number, cosine similarity, score, etc. Analyzer also outputted a base peak chromatogram with peak annotations and a result file (in Matlab data) for Viewer input. The entire analysis was completed within a few minutes.
Viewer for annotation and identification display. Viewer, a visualization tool that allows users to browse extracted ion chromatograms (XICs) of the identified muropeptides and visualize the in-silico MS/MS matching spectra annotated with b- and y- ions. In the muropeptide spectra match (PSM) page of each identified muropeptide, Viewer lists all inferred candidates and visualizes their individual MS/MS spectral matches to clearly demonstrate the process of scoring in Analyzer. Additionally, Viewer provides a data sorting function that allows users to classify the identification list by molecular weight, peak area, score (in ascending or descending order), and sequence (in alphabetical order).
Demonstration of the HAMA Platform Using Well-characterized PGNs of E. coli and S. aureus
As a proof-of-concept, we demonstrated the HAMA platform using well-characterized peptidoglycans from Gram-negative Escherichia coli DH5α and Gram-positive Staphylococcus aureus SA113. The typical PGN subunit of E. coli is GlcNAc – MurNAc – L-Ala – D-iGlu – mDAP– D-Ala – D-Ala, represented as a sequence of B-M-l(-A-E-H-A-A) in the HAMA platform. For S. aureus, the classic PGN structure is a disaccharide-pentapeptide with the Gly5 interpeptide bridge branching from the lysine, which is simplified as B-M-l(-A-Q-(G-G-G-G-G-)K-A-A. Based on the analyzed input dataset, approximately 70% of the peak area in the base peak chromatograms was assigned to muropeptide signals, which allowed for a comprehensive PGN mapping (Fig. 2a, c). The HAMA platform successfully identified E. coli and S. aureus muropeptides, and their XICs and MS/MS matching spectra could be visualized in Viewer (Fig. 2b, d). In total, 28 and 89 unique muropeptides were identified in PGNs of E. coli and S. aureus, respectively. Table 1 lists overviews of the main muropeptide peaks in the chromatograms. Upon examining the entire identification results, we found that some muropeptides were eluted in multiple retention times due to the existence of a few abundant stereoisomers. Additionally, a few monosaccharide muropeptides identified as the loss of GlcNAc could be recognized as in- source fragments by exact co-elution.
In the E. coli data, the most abundant muropeptides were disaccharide-tetrapeptide (Tetra monomer) and disaccharide-tetrapeptide-disaccharide-tetrapeptide (Tetra-Tetra dimer). We also identified two low-abundant muropeptides, B-M-l(-A-E-H-G) and B-M-l(-A-E-H-A-G), in which the fourth and fifth Ala are individually substituted with a Gly. This unique composition has been reported in strain E. coli Nissle 1917 from the previous analysis,16 but not for DH5α strain used in our study. In the output of S. aureus peptidoglycan, the elution profile was the same as what has been previously known: the most abundant monomers were disaccharide- pentapeptide with a (Gly)5 bridge and disaccharide-pentapeptide with a Gly bridge. The most abundant cross-linked dimer was disaccharide-tetrapeptide-disaccharide-pentapeptide, which contains a total of ten Gly residues.16 We also characterized the known modifications and structural variations within S. aureus peptidoglycan, such as O-acetylation of MurNAc, the presence of D-iGlu (non-amidated) in the stem peptides, and the length variation of the interpeptide bridge. The high-throughput analysis allowed for the identification of monomeric muropeptides consisting of one to nine Gly residues and dimeric muropeptides containing a total of five to fourteen Gly residues in a single analysis. However, due to the insufficient structural information in the peptide backbone provided by the HCD fragmentation spectra, the exact number of Gly residues harbored in each PGN unit could not be determined. Hence, further careful analysis and manual verification are required to confirm the identity of identified muropeptides, particularly for low-score dimers and trimers. Collectively, these results provide valuable insights into the PGN compositions and architectures.
Characterizing Gut Bacterial PGN Compositions and Resolving Isomeric Muropeptides
Over the past two decades, an increasing number of gut microbial species have been found to be associated with human health. In addition, emerging evidence has supported the notion that gut microbial muropeptides work as signaling molecules that mediate host−microbiome interactions in metabolism, gut homeostasis, and immunity.22 However, little has been discussed about the structures of PGN fragments due to the diversity and complexity of gut bacterial PGNs and the lack of an efficient analytical tool.14 To date, most of the structural information on gut bacterial peptidoglycan comes from reports published between 1970 to 2000,23–25 whereas that of the health-promoting gut microbes discovered in the last two decades have been rarely reported. Therefore, we utilized the HAMA platform to investigate the specific PGN structures of several common gut bacterial species, including Bifidobacterium, Bacteroides, Lactobacillus, Enterococcus, and Akkermansia muciiniphila. We collected LC-MS/MS data of purified peptidoglycans from ten gut bacterial species and identified those with the self-built species- specific muropeptide databases. Overall, the base peak chromatograms of each species showed approximately 70% of peaks were annotated as muropeptides, and these elution profiles reflected the actual composition and organization within the peptidoglycan architecture. Additionally, the structural compositions of these purified peptidoglycans are consistent with previous reports, listed in Table 2. The muropeptide profiles of the species in this study are summarized in Supplementary Table 3-12.
In the identification results of Gram-positive gut bacterial peptidoglycans, the main muropeptide sequence is GlcNAc – MurNAc – L-Ala – D-iGln – L-Lys– D-Ala – D-Ala, in which the second amino acid of the stem peptide is usually amidated to D-iGln from D-iGlu. This chemical modification is done by the MurT or GatD biosynthetic enzyme and is supposed to control the PGN cross-linking levels, which has recently been demonstrated using labeled PGN stem mimics in certain species.26 Another unique feature is L-ornithine (L-Orn) at the third position of the stem peptide. Unlike L-lysine, which is a usual feature of Gram-positive peptidoglycans, L-Orn is a non-proteinogenic amino acid and has been found in the peptidoglycan of certain Bifidobacterium species before.23 As anticipated, we identified the L- Orn-harbored muropeptides in B. bifidum and B. longum peptidoglycans. The most species- specific variation among Bifidobacterial PGN was the architecture of the interpeptide bridge, such as Gly, Ser-Asp, and Ser-Ala-Thr-Ala, which corresponded to B. breve, B. bifidum, and B. longum species. This structural diversity arouses our interest in the relationship between the length of an interpeptide bridge and the physical property of the bacterial cell envelope, which will be discussed in a later section.
In the peptidoglycans of E. faecium and L. acidophilus (Supplementary Table 4-5), we observed the monomeric structure of GlcNAc – MurNAc – L-Ala – D-iGln – L-Lys– D-Ala – D- Ala, which harbored an interpeptide bridge of asparagine (D-Asn) or aspartate (D-Asp), shortened as B-M-l(-A-Q-(N-)K-A-A) or B-M-l(-A-Q-(D-)K-A-A). In this part, we found that both B-M-l(-A-E-(N-)K) and B-M-l(-A-Q-(D-)K) had identical molecular weights since the mass difference (+ 0.984 Da) between Asn (N) and Asp (D) is the same as between Gln (Q) and Glu (E). This kind of isomeric muropeptides makes identification more complicated, but it can still be addressed by MS/MS in silico fragmentation matching under an appropriate separation chromatography. For example, two disaccharide-tripeptides separately eluted at 6.18 min and 7.02 min were found as structural isomers that existed in E. faecium peptidoglycan (Fig. 3 and Supplementary Table 4). Through in silico MS/MS fragmentation matching, these two isomers can be discriminated and identified as the sequences of B-M-l(-A-E-(N-)K) and B-M-l(-A-Q-(D-)K), respectively. To validate the correctness of the automated platform, we also extracted the experimental data of those isomers and manually inspected the MS/MS spectra (Supplementary Fig. 2). However, this strategy did not work for the identification of E. faecalis peptidoglycan whose interpeptide bridge is composed of two alanine residues (Supplementary Table 3). In this case, two structurally isomeric muropeptides, a disaccharide-tripeptide with a bridge (B-M-l(-A- Q-(A-A-)K)) and a disaccharide-pentapeptide (B-M-l(-A-Q-K-A-A)), have similar in silico MS/MS fragmentation patterns, which often leads to misidentification. Nevertheless, the retention time still gave us information to identify them. We annotated the peak at a retention time of 3.66 min as disaccharide-pentapeptide (B-M-l(-A-Q-K-A-A)) by comparing it to an identical sequence that appeared in L. acidophilus peptidoglycan with a similar retention time (3.61 min). Therefore, the other peak at a retention time of 6.39 min could be determined as disaccharide-tripeptide with a bridge (B-M-l(-A-Q-(A-A-)K)), which had a longer retention time than the linear structure one.
Apart from Gram-positive bacteria, we also analyzed the peptidoglycan of several anaerobic Gram-negative bacteria, including Bacteroides fragilis, Bacteroides ovatus, Bacteroides thetaiotaomicron, and Akkermansia muciniphila (Table 2 and Supplementary Table 9-12). Structurally, the general stem peptide of Gram-negative peptidoglycan is L-Ala – D-iGlu – m- DAP– D-Ala – D-Ala, with diaminopimelic acid in the third position being a representative feature. The peptidoglycan structures we identified were consistent with previously published ones, with O-acetyl-MurNAc found in Bacteroides species and N-deacetyl-GlcNAc (GlcN) in Akkermansia muciniphila.27, 28 The output analysis showed that around 56-66% of the total muropeptides in Bacteroides species’ peptidoglycans contained O-acetylated MurNAc, while approximately 87% of the total muropeptides in A. muciniphila peptidoglycan contained de-N- acetylated GlcNAc. The high occurrence of N-deacetylation in A. muciniphila peptidoglycan suggests that A. muciniphila might possess a homolog of oxidative stress-induced PGN deacetylase (PgdA) found in Helicobacter pylori.29–31 Chemical modifications to the disaccharide backbone are known to provide resistance to lysozyme and protect bacteria against enzymatic attack from the host innate immune system.32, 33
Inferring PGN Cross-linking Types Based on Identified PGN Fragments
We not only identified peptidoglycan structures but also were dedicated to exploring the type of peptide cross-linking within those gut bacterial peptidoglycan. The substrates and catalyzed enzymes involved in peptide cross-linking have been targets for antibiotic development and antimicrobial resistance studies.34 In general, there are two types of PGN cross-linkage: 4-3 cross-links generated by D,D-transpeptidases (Ddts) and 3-3 cross-links created by L,D- transpeptidases (Ldts) (see Fig. 4a). As the enzyme names imply, the substrates that Ddts and Ldts bind to are terminated as D,D-stereocenters and L,D-stereocenters, which structurally means pentapeptides and tetrapeptides. During D,D-transpeptidation, Ddts first remove the terminal (fifth) D-Ala residue of the pentapeptide stem and form an intermediate. Then, the intermediate cross-links the NH2 group in the third position of the neighboring acceptor stem, forming a 4-3 cross-link. Following this distinctive rule of PGN biosynthesis, the possible combinations of 4-3 cross-linked dimers (Donor-Acceptor) are disaccharide-tetrapeptide– disaccharide-pentapeptide (D45) and disaccharide-tetrapeptide–disaccharide-tetrapeptide (D44). During L,D-transpeptidation, Ldts cleave the terminal (fourth) D-Ala residue of the donor tetrapeptide stem and generate a peptide bond between the third residue and the NH2 group in the third position of adjacent peptide.35 The possible structures of 3-3 cross-linked dimers are disaccharide-tripeptide–disaccharide-tetrapeptide (D34) and disaccharide-tetrapeptide– disaccharide-tripeptide (D33). Hence, we could infer the possible PGN cross-linkage types and the involved enzymes based on the main PGN fragments.
Take Bifidobacterium breve ATCC 15700, CSCC 1900, and ATCC 15698 as examples. The LC-MS profiles showed noticeable differences in muropeptide compositions among the three strains (Fig. 4b). The relative muropeptide compositions of each strain are presented in a heatmap (Fig. 4c). Obviously, the main muropeptides of B. breve ATCC 15700 are disaccharide- tripeptide with an interpeptide bridge (M3b), disaccharide-tetrapeptide with an interpeptide bridge (M4b), and disaccharide-tripeptide–disaccharide-tetrapeptide (D34), whereas the CSCC 1900 strain showed great abundances in disaccharide-tetrapeptide with an interpeptide bridge (M4b), disaccharide-pentapeptide with an interpeptide bridge (M5b), and disaccharide- tetrapeptide–disaccharide-tetrapeptide (D44). These results suggested that 3-3 cross-links and 4- 3 cross-links might be predominant in the peptidoglycans of ATCC 15700 and CSCC 1900 strains, respectively. In the case of ATCC 15698 strain, its peptidoglycan likely contains both types of cross-links since the abundances of M3b and M4b, as well as D34 and D44, are almost equivalent.
Broad-spectrum β-Lactams are known to inhibit D,D-transpeptidases, such as penicillin- binding proteins (PBPs). However, L,D-transpeptidases are generally insensitive to β-Lactams and offer alternative cross-links in the PGNs.36 Numerous studies have investigated the role of L,D-transpeptidases in the maintenance and remodeling of mature peptidoglycan in organisms such as Enterococcus faecium, Clostridium difficile, Escherichia coli, and Mycobacterium tuberculosis.37–42 Based on the muropeptide compositional analysis mentioned above, we found high abundances of M3/M3b monomer and D34 dimer in the peptidoglycans of E. faecalis, E. faecium, L. acidophilus, B. breve, B. longum, and A. muciniphila, which may be the PGN products catalyzed by Ldts. While the homologs of Ldts in L. acidophilus, B. breve, B. longum, and A. muciniphila have been supported by genome sequence evidence,43–46 biochemical evidence is needed to confirm the existence of L,D-transpeptidases in those species.
Proposed Bridge Length-dependent Cell Envelope Stiffness in B. longum and B. breve
Peptidoglycan, a protective exoskeleton around cells, provides structural integrity to the cell. The porosity of the PGN scaffold, which is defined by the degree of cross-link, might affect the delivery and diffusion of drugs and signaling molecules to the cell membrane. Thus, modifications to PGN structure are thought to significantly influence bacterial cell mechanics.47, 48 Previous evidence has indicated that a high PGN cross-linking level enhances the stiffness of the cell wall material in Gram-positive bacteria.49 The PGN-lattice architecture based on the interpeptide bridge length has been investigated using solid-state NMR in S. aureus.50 However, the effect of the interpeptide bridge length variants on the porosity and stiffness of bacterial cell envelope has little been discussed. Interestingly, the identification table presented above showed that Bifidobacterium peptidoglycans have different architecture of the interpeptide bridges among species: a tetrapeptide bridge (Ser-Ala-Thr-Ala) found in Bifidobacterium longum and a monopeptide bridge (Gly) found in Bifidobacterium breve (Supplementary Fig. 3). We wondered whether the length of interpeptide bridges may be related to the bacterial cell envelope’s mechanical properties and hypothesized that the cross-linking with shorter bridges may form a tighter meshwork in peptidoglycan layers (Fig. 5a).
In order to explore the above relevance, we selected three B. breve strains and three B. longum strains, whose cell morphology is similar, as research objects. The proportion of muropeptide composition of these species is summarized in Fig. 5b. A slightly higher abundance of peptide-linked dimers in B. breve than in B. longum implies a higher degree of PGN cross- linking level in B. breve. Next, we directly measured the stiffness (defined by Young’s modulus, E) of these two species using the single-cell atomic force microscopy technique (AFM). Representative data obtained on the different strains are presented in Fig. 5c. For each cell, a high-resolution elasticity map was recorded on top of the cell in the central region to avoid edge effects. The values of Young’s modulus were extracted and subjected to statistical analysis, shown in Fig. 5d. Overall for the cell envelope properties, B. longum displayed smaller values of stiffness than B. breve, demonstrating softer cell envelope properties with one-fifth lower values of Young’s modulus. As anticipated, these data suggested the B. breve harboring shorter interpeptide bridges corresponded to a higher cell stiffness. Furthermore, a slower hydrolysis rate was observed in purified PGN of B. breve than in that of B. longum, implying that the cell wall of B. breve is a harder exoskeleton (Supplementary Fig. 4). Interestingly, among three B. breve strains, the average stiffness of ATCC 15700 is higher than the other strains, likely due to the more abundant 3-3 cross-linkages in its peptidoglycan. Computational modeling suggested a more rigid stem peptide in the conformation of L,D-cross-linking, implying the 3-3 cross- linkages strengthen the PNG layer.36, 51 These results might support our implication that a tight peptidoglycan network woven with shorter peptides gives the bacterium a stiffer cell wall, proposed as an idea of bridge length-dependent cell envelope.
The purpose of this study was to develop an automated platform for identifying and analyzing the muropeptide compositions of gut bacterial PGNs. Using high-resolution MS data, we characterized PGN structures of common gut microbes, investigated their PGN cross-linking types, and evaluated the effect of bridge-length variants on the stiffness of bacterial cell walls.
In early 1970s, the amino acid compositions of PGN stem peptides were characterized in many bacteria species, including certain gut microorganisms. At that time, PGNs were purified and hydrolyzed under harsh acidic conditions and then separated by two-dimensional paper chromatography to determine the amino acid sequence. Based on this characterization work, Schleifer and Kandler proposed a bacterial PGN classification system that is still used today.23 In the Bifidobacterium and Lactobacillus genera, only type A PGNs were identified, representing 4- 3 cross-linking through interpeptide bridges.25 However, our study of B. breve, B. longum, and L. acidophilus revealed a significant abundance of M3b and D34 in their muropeptide compositions, implying the incorporation of 3-3 cross-linkage type within PGN network. This cross-linking type is catalyzed by penicillin-insensitive L,D-transpeptidases. We speculate that this finding may be due to the comprehensive mass spectrometric approaches we used. Furthermore, we adopted the enzymatic method using mutanolysin to cleave the β-N- acetylmuramyl-(1,4)-N-acetylglucosamine linkage, which is crucial for obtaining the complete PGN subunit that preserves the original peptide linkage. Importantly, our HAMA platform provides a powerful tool for mapping peptidoglycan architecture and generating structural information on the PGN biosynthesis system, which is closely related to the antibiotic resistance of bacteria.
In most Gram-positive bacteria, the interpeptide bridges vary depending on the species and are likely associated with the PGN architectures. Although the chemical structure of PGN units has been extensively studied, the physical structure and 3-D architecture remain open questions.52, 53 A previous review by Kim et al. suggested that the length of interpeptide bridges is a key factor in determining different PGN architectures and proposed three PGN-bridge length-dependent architectures based on the orientations of cross-linked peptide stems.50 The parallel-stem model, which has the smallest pore size and the highest cross-linking level, is suited for bacteria with long-length bridges. The perpendicular-stem model, with intermediate pore size, is proposed for bacteria with intermediate-length bridges, while the antiparallel-stem model, which has the largest pore size, is for bacteria without bridges. Notably, our study provides observations regarding the relationship between the bridge length and the stiffness of bacterial cell walls in Bifidobacterium species (Fig. 5). B. longum, which harbors tetrapeptide bridges (Ser-Ala-Thr-Ala) corresponds to lower stiffness, whereas B. breve, whose PGN is cross- linked by monopeptide bridges (Gly), has higher stiffness. This finding implies that the PGN architecture with shorter cross-linked bridges corresponds to a more rigid skeleton. This is not surprising as Bifidobacterium species are known for their thick peptidoglycan layers, which account for 30-70% of the cell wall.35 Meanwhile, other biomolecules in the cell wall also influence bacterial stiffness, such as lipoprotein, wall teichoic acids, and lipoteichoic acids.54–57 Future studies on the effect of quantitatively changing the PGN-bridge length on stiffness will require intensive research, considering the cross-linking densities, glycan strand lengths, and PGN architecture models. These studies may provide further insight into the physical structure and architecture of PGN, which would contribute to a better understanding of the factors determining bacterial cell wall properties.
In the HAMA platform, simplifying muropeptide structures into sequences has facilitated the generation of in silico MS/MS fragments for spectra matching, enabling the structural resolving of isomeric muropeptides. However, there are several limitations to our study design. Our species-specific PGN database is built based on known structures and modifications reported previously. While the identified muropeptides cover approximately 70% of peak area in the base peak chromatograms, certain low-abundant muropeptides composed of uncoded amino acids or saccharides required additional MS/MS identification with manual analysis. For peptide-linked multimers, the current DBuilder only builds up 4-3 cross-linked sequences because including the 3-3 cross-link type results in severe misidentifications, as it produces similar in silico MS/MS fragmentation patterns as the 4-3 cross-links (about 88-90% similarity). However, we can infer possible PGN cross-linkages based on the type of PGN fragments obtained from hydrolysis.58 The primary limitation is that MS/MS spectra generated by HCD fragmentation carry mostly structural information from disaccharide moiety (glyco-oxonium ions) without providing sufficient peptide fragment ions to derive peptide sequence,59 making it difficult to accurately identify multimers and determine the position of modifications. Nevertheless, accurate monoisotopic masses acquired from high-resolution mass spectrometry are still informative in peptidoglycan compositional components. Alternative approaches can be considered to elucidate branched structures of muropeptides, such as breaking down PGN with cell wall amidases (cleaving the amide bond between MurNAc and L-Ala residue) to target stem peptide analysis,60, 61 acquiring MS3 spectra to obtain fragmentation maps, or using electron transfer dissociation (ETD) fragmentation mode to leave more peptide fragments.18 Additionally, strategies commonly used in glycopeptide identification, such as multi-fragmentation modes (CID, HCD, and ETD), could also be applied to PGN identification to provide more information on peptide sequences and disaccharide structures.62 To achieve a more comprehensive identification of muropeptides, future maneuvers are proposed for an expanded database, innovative in-silico fragmentation patterns, and improved MS/MS spectra acquisition.
HAMA is an innovative automated platform that constructs species-specific muropeptide databases, validates structural identification using in-silico MS/MS analysis, and provides visualized results for efficient and reliable identification of muropeptides, greatly reducing the time-consuming task of manual interpretation of LC-MS/MS data. The HAMA platform has the potential to be a valuable tool for various research fields, including microbiology, pathology, molecular biology, and immunology. Its applications in identifying activation ligands for antimicrobial resistance studies, characterizing key motifs recognized by pattern recognition receptors for host-microbiota immuno-interaction research, and mapping peptidoglycan in cell wall architecture studies. With the ease and efficiency offered by HAMA, we believe that muropeptide analysis will become more accessible and contribute to a deeper understanding of cell wall biology.
This research was supported by Ministry of Science and Technology (MOST), R.O.C. (Grants MOST 108-2636-M-002-008-, 109-2636-M-002-005-, and 110-2636-M-002-014-). Y.-C. H. acknowledged the financial support of “The Program of Research Performance Enhancement via Students Entering Ph.D. Programs Straight from an Undergraduate/Master’s Program” from National Taiwan University. The instrument support from the NTU Mass Spectrometry Platform was acknowledged.
The authors declare no competing financial interest.
Supplementary information includes supplementary figures (MS/MS spectra) and supplementary tables (lists of automated identification of gut bacterial muropeptides). The Source Data underlying Figs. 4–5 is provided as a separate Source Data file. The HAMA software package is available at: https://drive.google.com/drive/folders/17LVGOm-LEHzNmw7nPULXMEZvg_cnO3k1?usp=share_link
- 1.The bacterial cell envelopeCold Spring Harbor Perspect. Biol 2
- 2.Peptidoglycan structure and architectureFEMS Microbiol Rev 32:149–167
- 3.Peptidoglycan Muropeptides: Release, Perception, and Functions as Signaling MoleculesFront Microbiol 10
- 4.Peptidoglycan structureMethods in microbiology 38:101–129
- 5.Elucidating Peptidoglycan Structure: An Analytical Toolset: (Trends in Microbiology 27, 607-622; 2019)Trends Microbiol 27:653–654
- 6.Peptidoglycan recognition by the innate immune systemNat Rev Immunol 18:243–254
- 7.Resistance to antibiotics targeted to the bacterial cell wallProtein Sci 23:243–259
- 8.Peptidoglycan molecular requirements allowing detection by Nod1 and Nod2J Biol Chem 278:41702–41708
- 9.Impacts of Penicillin Binding Protein 2 Inactivation on β-Lactamase Expression and Muropeptide Profile in Stenotrophomonas maltophiliamSystems 2:e00077–17
- 10.Muramyl Dipeptide-Based Postbiotics Mitigate Obesity-Induced Insulin Resistance via IRF4Cell Metab 25:1063–1074
- 11.Host recognition of bacterial muramyl dipeptide mediated through NOD2. Implications for Crohn’s diseaseJ Biol Chem 278:5509–5512
- 12.Antibody neutralization of microbiota-derived circulating peptidoglycan dampens inflammation and ameliorates autoimmunityNat Microbiol 4:766–773
- 13.The bacterial peptidoglycan-sensing molecule Pglyrp2 modulates brain development and behaviorMol Psychiatry 22:257–266
- 14.Bacterial Peptidoglycan Fragments Differentially Regulate Innate Immune SignalingACS Cent Sci 7:688–696
- 15.Ultra-Sensitive, High- Resolution Liquid Chromatography Methods for the High-Throughput Quantitative Analysis of Bacterial Cell Wall Chemistry and StructureMethods Mol Biol 1440:11–27
- 16.From cells to muropeptide structures in 24 h: peptidoglycan mapping by UPLC-MSSci Rep 4
- 17.Separation and quantification of muropeptides with high-performance liquid chromatographyAnal Biochem 172:451–464
- 18.Towards an automated analysis of bacterial peptidoglycan structureAnal Bioanal Chem 409:551–560
- 19.PGFinder, a novel analysis pipeline for the consistent, reproducible, and high-resolution structural analysis of bacterial peptidoglycansElife 10
- 20.The architecture of the Gram-positive bacterial cell wallNature 582:294–297
- 21.A cross- platform toolkit for mass spectrometry and proteomicsNat. Biotechnol 30:918–920
- 22.Uptake, recognition and responses to peptidoglycan in the mammalian hostFEMS Microbiol. Rev 45
- 23.Peptidoglycan types of bacterial cell walls and their taxonomic implicationsBacteriol Rev 36:407–477
- 24.The structure of the cell wall peptidoglycan of Bifidobacterium bifidum var. pennsylvanicusArch. Biochem. Biophys 143:204–211
- 25.Bifidobacteria cell wall-derived exo-polysaccharides, lipoteichoic acids, peptidoglycans, polar lipids and proteins - their chemical structure and biological attributesInt J Biol Macromol 147:333–349
- 26.D-Transpeptidase Specific Probe Reveals Spatial Activity of Peptidoglycan Cross-LinkingACS Chem Biol 14:2185–2196
- 27.Identification and characterization of O-acetylpeptidoglycan esterase: a novel enzyme discovered in Neisseria gonorrhoeaeBiochemistry 45:839–851
- 28.Peptidoglycan from Akkermansia muciniphila MucT: chemical structure and immunostimulatory properties of muropeptidesGlycobiology 32:712–719
- 29.Chemical biology of peptidoglycan acetylation and deacetylationBioorg Chem 54:44–50
- 30.Oxidative stress-induced peptidoglycan deacetylase in Helicobacter pyloriJ Biol Chem 284:6790–6800
- 31.Peptidoglycan deacetylation in Helicobacter pylori contributes to bacterial survival by mitigating host immune responsesInfect Immun 78:4660–4666
- 32.Modifications to the peptidoglycan backbone help bacteria to establish infectionInfect Immun 79:562–570
- 33.Mechanistic Pathways for Peptidoglycan O-Acetylation and De-O-AcetylationFront. Microbiol 9
- 34.Peptidoglycan Cross- Linking Preferences of Staphylococcus aureus Penicillin-Binding Proteins Have Implications for Treating MRSA InfectionsJ Am Chem Soc 139:9791–9794
- 35.Clostridium difficile has an original peptidoglycan structure with a high level of N- acetylglucosamine deacetylation and mainly 3-3 cross-linksJ Biol Chem 286:29053–29062
- 36.LD-transpeptidases: the great unknown among the peptidoglycan cross-linkersFebs j 289:4718–4730
- 37.Balance between two transpeptidation mechanisms determines the expression of beta-lactam resistance in Enterococcus faeciumJ Biol Chem 277:35801–35807
- 38.Structure of Enterococcus faeciuml,d-transpeptidase acylated by ertapenem provides insight into the inactivation mechanismACS Chem Biol 8:1140–1146
- 39.D- transpeptidases from gram-positive bacteria producing different peptidoglycan chemotypesJ Biol Chem 282:13151–13159
- 40.Peptidoglycan Cross-Linking Activity of l,d-Transpeptidases from Clostridium difficile and Inactivation of These Enzymes by β-LactamsAntimicrob Agents Chemother 62:e01607–1617
- 41.Identification of the L,D- transpeptidases for peptidoglycan cross-linking in Escherichia coliJ Bacteriol 190:4782–4785
- 42.The peptidoglycan of stationary-phase Mycobacterium tuberculosis predominantly contains cross-links generated by L,D-transpeptidationJ Bacteriol 190:4360–4366
- 43.Complete Genome Sequence of the Newly Developed Lactobacillus acidophilus Strain With Improved Thermal AdaptabilityFront Microbiol 12
- 44.Complete genome sequence of Bifidobacterium breve JCM 1192(T) isolated from infant fecesJ Biotechnol 210:81–82
- 45.A library of human gut bacterial isolates paired with longitudinal multiomics data enables mechanistic microbiome researchNat Med 25:1442–1452
- 46.46. Ottman, N. A. Host immunostimulation and substrate utilization of the gut symbiont Akkermansia muciniphila. PhD thesis, Wageningen University, Wageningen, 2015.
- 47.Bacterial Cell MechanicsBiochemistry 56:3710–3724
- 48.Peptidoglycan Compositional Analysis of Enterococcus faecalis Biofilm by Stable Isotope Labeling by Amino Acids in a Bacterial CultureBiochemistry 57:1274–1283
- 49.Reduction of the peptidoglycan crosslinking causes a decrease in stiffness of the Staphylococcus aureus cell envelopeBiophys J 107:1082–1089
- 50.Peptidoglycan architecture of Gram-positive bacteria by solid-state NMRBiochim Biophys Acta 1848:350–362
- 51.Structural constraints and dynamics of bacterial cell wall architectureFront Microbiol 6
- 52.Architecture of peptidoglycan: more data and more modelsTrends Microbiol 18:59–66
- 53.Different walls for rods and balls: the diversity of peptidoglycanMol. Microbiol 91:862–874
- 54.Lipoprotein Lpp regulates the mechanical properties of the E. coli cell envelopeNat. Commun 11
- 55.Mechanical Genomics Identifies Diverse Modulators of Bacterial Cell StiffnessCell Syst 2:402–411
- 56.Compound-gene interaction mapping reveals distinct roles for Staphylococcus aureus teichoic acidsProc Natl Acad Sci U S A 111:12510–12515
- 57.D-alanylation of lipoteichoic acids confers resistance to cationic peptides in group B streptococcus by increasing the cell wall densityPLoS Pathog 8
- 58.Peptidoglycan compositional analysis of Mycobacterium smegmatis using high- resolution LC-MSSci Rep 12
- 59.Open-source Platform for Mass Spectrometry-based Glycoproteomics Data AnalysisMol Cell Proteomics 16:2032–2047
- 60.Peptidoglycan branched stem peptides contribute to Streptococcus pneumoniae virulence by inhibiting pneumolysin releasePLoS Pathog 11
- 61.Activity of the major staphylococcal autolysin AtlFEMS Microbiol. Lett 259:260–268
- 62.The benefits of hybrid fragmentation methods for glycoproteomics. TrACTrends Anal. Chem 108:260–268