Evolution and regulation of microbial secondary metabolism
Abstract
Microbes have disproportionate impacts on the macroscopic world. This is in part due to their ability to grow to large populations that collectively secrete massive amounts of secondary metabolites and alter their environment. Yet, the conditions favoring secondary metabolism despite the potential costs for primary metabolism remain unclear. Here we investigated the biosurfactants that the bacterium Pseudomonas aeruginosa makes and secretes to decrease the surface tension of surrounding liquid. Using a combination of genomics, metabolomics, transcriptomics, and mathematical modeling we show that the ability to make surfactants from glycerol varies inconsistently across the phylogenetic tree; instead, lineages that lost this ability are also worse at reducing the oxidative stress of primary metabolism on glycerol. Experiments with different carbon sources support a link with oxidative stress that explains the inconsistent distribution across the P. aeruginosa phylogeny and suggests a general principle: P. aeruginosa lineages produce surfactants if they can reduce the oxidative stress produced by primary metabolism and have excess resources, beyond their primary needs, to afford secondary metabolism. These results add a new layer to the regulation of a secondary metabolite unessential for primary metabolism but important to change physical properties of the environments surrounding bacterial populations.
Editor's evaluation
This important study proposes how Pseudomonas aeruginosa is able to produce costly secondary metabolites. By combining a wide variety of experimental and computational approaches and studying 31 isolates in several growth environments, they present highly compelling evidence that secondary metabolites are produced in this species when it is not suffering from oxidative stress linked to its primary metabolism. This paper helps to understand pathogen metabolism, and more generally, how environmental conditions can allow for the evolution of costly collective behavior.
https://doi.org/10.7554/eLife.76119.sa0Introduction
Microorganisms touch practically every human activity, in harmful and beneficial ways: they cause infections (Morens et al., 2004) and help us digest food (Rowland et al., 2018), make bioterrorism weapons (Henderson, 1999) and synthesize life-saving drugs (Demain and Fang, 2000), contribute to climate change (Bardgett et al., 2008), and clean our wastewater (Kartal et al., 2010). The outsize impact of microbes on the macroscopic world that surrounds them comes from their ability to multiply to populations of billions that cooperate to transform chemicals in vast amounts. Many of the chemicals that they release are compounds unessential to growth, development, and division: they are secondary metabolites. But despite their importance, it remains unclear why microbes invest in secondary metabolites (Cavalier-Smith, 1992; Kell et al., 1995).
Secondary metabolites enable collective traits like cell-cell signaling (Dufour and Rao, 2011), microbial warfare (Demain and Fang, 2000), pathogenesis (Nicas and Iglewski, 1985), and nutrient scavenging (Albelda-Berenguer et al., 2019). These collective traits bring fitness advantages to a population (Maplestone et al., 1992) but by diverting precursors, cofactors, and energy away from primary metabolism they can also bring a cost to the individual cell (Yan et al., 2018). How did secondary metabolites evolve and how are they maintained by natural selection?
Social evolution theory provides an answer. A secondary metabolite that brings a fitness cost C to the cell and a benefit B to the population can be favored by natural selection if rB >C, where r is the relatedness among different genotypes across the population (West et al., 2006). This means that secondary metabolite with a high cost can still be evolved if relatedness r is very high, such as in monoclonal populations. But population mixing and mutation, which are frequent in a microbial world (Andersen et al., 2015; Andersen et al., 2018), mean that microbial populations can have a low relatedness, and a low r limits the maximum cost allowed for secondary metabolites.
Social evolution theory therefore predicts that secondary pathways should have a low cost (Cavalier-Smith, 1992). But many secondary products need multiple enzymes for their biosynthesis (Osbourn, 2010), and their burden on primary metabolism can be hard to avoid. Microorganisms have sophisticated molecular networks that can sense environmental stimuli and regulate the expression of secondary pathways to lower their cost (Brakhage, 2013; Bibb, 2005; Bayram et al., 2016; Lind et al., 2015). Experiments in laboratory culture show, for example, that microbes can regulate the production of secondary metabolites depending on environmental conditions such as the type and quantity of the nutrients provided, especially the carbon source (Ruiz et al., 2010), that they can restrict production to specific growth phases, typically the stationary phase (Malik, 1980), and that many secondary metabolites are controlled by quorum sensing (Whiteley et al., 1999), a type of cell-cell communication that regulates genes in a density-dependent way. Understanding how microbes regulate their secondary metabolites complements the evolutionary explanation by providing a different level of explanation—molecular mechanism—of the same biological phenomenon (Bateson and Laland, 2013).
Here we investigate the evolution and regulation of biosurfactants that the bacterium Pseudomonas aeruginosa secretes to decrease the surface tension of the surrounding liquid (Abdel-Mawgoud et al., 2010). These biosurfactants are secondary metabolites, and they enable spectacular collective behaviors like swarming (Tremblay et al., 2007; Deforet, 2022), climbing over walls (Yang et al., 2021), killing microbial competitors (Sotirova et al., 2008), and breaching the epithelial barriers of a host (Zulianello et al., 2006). These collective feats, impossible to achieve by a single bacterium, can benefit the population. But a P. aeruginosa cell produces 20% or more of its dry weight in biosurfactants (Guerra-Santos et al., 1984), which is potentially a huge cost (Xavier et al., 2011). A full understanding of P. aeruginosa surfactant secretion requires both evolutionary and molecular explanations (Yan et al., 2019).
The first step in the biosynthesis of rhamnolipid surfactants is catalyzed by the RhlA enzyme, which takes a metabolic intermediate from fatty acid synthesis, β-hydroxyacyl-acyl carrier protein (β-hydroxyacyl-ACP), and produces 3-(3-hydroxyalkanoyloxy) alkanoic acids (HAAs) (Zhu and Rock, 2008). The RhlB and RhlC enzymes each add one unit of rhamnose to the HAAs to produce mono-rhamnolipids and di-rhamnolipids (Chong and Li, 2017). The expression of the rhlAB operon is regulated by a network that includes at least three quorum sensing signals (Mukherjee et al., 2018; Latifi et al., 1996) and is conditional to the nutrients in the media, requiring a high carbon-to-nitrogen or carbon-to-iron ratio (Xavier et al., 2011; Boyle et al., 2015; Mellbye and Schuster, 2014). This regulatory strategy—called metabolic prudence—delays the expression of the secondary biosynthetic genes to times when the population is large enough, but also checks if the individual cell has sufficient carbon source to make the product at low or no cost to its fitness (Xavier et al., 2011). Metabolic prudence regulates the expression of other secondary metabolites in P. aeruginosa besides surfactants (Mellbye and Schuster, 2014) and may provide a general evolutionary explanation for secondary metabolites in other microbes as well (Bruger and Waters, 2015). Compared to the non-regulated constitutive production, metabolic prudence lowers the C/B ratio and stabilizes biosurfactant synthesis—even when r is low—by preventing cheaters from increasing in frequency (de Vargas Roditi et al., 2013). But this evolutionary explanation alone cannot explain why the surfactant production varies widely among strains, even when they are closely related (Müller et al., 2011), and even though the genes encoding the secondary pathway—rhlA, rhlB, and rhlC—are highly conserved (Germer et al., 2020). A better understanding of the molecular processes driving surfactant secretion can shed light on the evolution of regulation of this secondary metabolite.
Here we investigate surfactant secretion among diverse P. aeruginosa strains that we isolated from infected patients. We start by focusing on the role of glycerol as a carbon source, a relevant nutrient for P. aeruginosa infection (Scoffield and Silo-Suh, 2016; Abdel-Mawgoud et al., 2014) that puts a substantial strain on Pseudomonas primary metabolism (Poblete-Castro et al., 2020). Then, we investigate other carbon sources to search for a general explanation: we show that the lineages that lost their ability to make the surfactants tend to grow not less but slower compared to lineages that can make surfactants. Using metabolomics and mathematical modeling, we link the inability to reduce oxidative stress produced by primary metabolism with the lack of surfactant secretion. Our results add a new layer to the regulation of surfactant biosynthesis and help explain the inconsistent variation of this secondary metabolite that P. aeruginosa secretes to alter the surface tension of its surrounding environment.
Results
Surfactant production varies inconsistently across the phylogenetic tree
We investigated the phenotypic variability among clinical isolates in a diverse panel of 31 P. aeruginosa strains, including 28 isolates from infected patients (Yan et al., 2017) and the 3 types of strains PAO1, PA14, and PA7, all of which have their genomes sequenced. We grew each strain in synthetic media using glycerol as the sole carbon source and ammonium sulfate as the sole nitrogen source, producing a carbon-to-nitrogen molar ratio of 7.0. Then we used the drop collapse assay (Figure 1A) to classify each strain as surfactant absent, moderate, or high. Previous work had shown that glycerol enables the biosynthesis and secretion of surfactants in the strain PA14 (Boyle et al., 2015), a finding that we confirmed here with the wild-type PA14 and its isogenic mutants in the rhamnolipid synthesis. As expected, the wild type caused drop collapse indicating surfactant secretion, the ΔrhlA mutant did not, indicating loss of surfactants, and the complementation of the ΔrhlA with the L-arabinose inducible PBADrhlAB restored the drop collapse but only when L-arabinose was added, as expected (Figure 1B). Among the clinical isolates, however, there was a wide variability of surfactant phenotypes. The strongest producers (n=17) showed phenotypes similar to the wild-type PA14, while non-producers (n=8) showed no surfactant secretion similar to the ΔrhlA mutant; moderate producers (n=6) showed a phenotype in between. There was no obvious relation between the body site infected in the patient and the surfactant phenotype (Figure 1C). There was also no coherent distribution with the isolates’ phylogeny, which we built using the sequence variation in core genes, also supported by a poor correlation in Moran’s I test of p=0.14. Ancestral reconstruction of the surfactant secretion trait supports that the ability to make surfactants from glycerol was present in the common ancestor of all strains (Figure 1—figure supplement 2). Importantly, the genes rhlA, rhlB, and rhlC, the genes encoding the secondary biosynthesis pathway, were present in all 31 isolates, and their sequence was 100% conserved. This indicates that the absence of surfactant secretion was not due to a loss of the biosynthetic pathway. Instead, we hypothesized that the phenotypic variation was due to the life histories of the strains: some lineages may have lost their ability to rewire their regulatory network to overcome the specific selective pressures that they had faced during their evolution.
We further investigated whether variation elsewhere in the genome—particularly in the accessory genome (the set of genes missing in at least one strain)—could explain the phenotype differences. Among the 8290 accessory genes, 354 are only absent in those non-producers but present in all mild and strong producers (Supplementary file 1). In a principal component analysis (PCA) of the presence-absence profiles, there was no visible grouping of isolates by their surfactant phenotype (Figure 1—figure supplement 2), which is consistent with the lack of association with the phylogenetic tree built from the core genome.
Surfactant secretion lost in strains with slower exponential growth
To search for a general explanation, we conducted analyses that showed producers have faster primary metabolisms in the same media of drop collapse assay. This finding was unexpected because, if anything, secondary metabolism should divert resources from primary metabolism (Yan et al., 2018): strains that produce surfactants would grow slower, not faster. We started by noting differences among the growth curves of the 31 strains, as assessed by the dynamics of the population optical density at 600 nm (OD600). The maxima that the population densities reached showed no obvious difference in producers vs non-producers, but the cultures did vary widely in the shape of their lag phase and the exponential phase (Figure 2A). The shape of the growth curve, as a whole, was not predictive of surfactant secretion because hierarchical clustering using the entire time series failed to separate producers and non-producers (Figure 2—figure supplement 1). Instead, the slope of the exponential growth phase was steeper in producers. This was also shown using non-negative matrix factorization (Lee and Seung, 1999), which could decompose each growth curve as a weighted sum of three basis functions (i.e. features). This analysis showed that—although the growth curves of the producers (orange lines; including mild and strong producers) and non-producers (blue lines) largely overlapped (Figure 2A)—producers had higher weights for base 1, which agrees with a faster growth rate in the exponential phase (Figure 2B). This was also confirmed by dividing each growth curve into three phases (Figure 2C and Figure 2—figure supplement 2) and defining seven quantitative features to characterize each growth phase (Figure 2D and Supplementary file 2). Random forest classification revealed that the top two features associated with surfactant production were the maximum and the average specific growth rates in phase I, the exponential growth phase (Figure 2E). Features of phase III (the decay phase), such as the starting value of OD600 in phase III which represents the final biomass produced by different strains in phase II, were among the least significant.
These analyses indicate that the ability to make surfactants, which typically peak when growth slows down from exponential (Latifi et al., 1996; Boyle et al., 2015; Mellbye and Schuster, 2014), is associated with the speed of the exponential phases, and hence, an efficient primary metabolism on glycerol. This suggests that the non-producer lineages had particular life histories that are selected for specific metabolic adaptations, and that those adaptations had the collateral damage of impacting their ability to grow on glycerol as a sole carbon source.
Metabolomics shows differences in intracellular metabolomes associated with loss of surfactant biosynthesis
We then searched the composition of the intracellular metabolome for differences that distinguish producer and non-producer lineages. We extracted the metabolites from all our strains except for two non-producers (M55212 and F23197), which grew too slowly during the transition between phase I and phase II when rhamnolipid production begins (Materials and methods). Liquid chromatography-mass spectrometry (LC-MS)revealed 67 metabolites whose identities are known, and their abundances varied significantly across the tested strains (Figure 3—figure supplement 1A–B). Hierarchical clustering and PCA (Figure 3—figure supplement 2A–B) showed a reasonable separation between producers and non-producers. Fitting the data using the orthogonal projections to latent structures-discriminant analysis (OPLS-DA) (Cloarec et al., 2005) confirmed that there is a significant association between the broad metabolome and surfactant secretion (Figure 3A, R2=0.82, Q2=0.66, p-value = 5e−4). A Mann-Whitney U test revealed the 15 metabolites whose abundances differed most significantly between producers and non-producers. Then, we used those 15 metabolites to identify pathways perturbed in non-producers using the FELLA algorithm for pathway enrichment analysis (Picart-Armada et al., 2018; Figure 3—figure supplement 3, Supplementary file 3). The most perturbed pathways were the TCA cycle and amino acid metabolism (Figure 3B). Three out of six metabolites in the TCA cycle—fumarate, succinate, and citrate—were significantly changed: fumarate and malate had positive loadings, which indicated higher levels in producers. Citrate, succinate, and to a lesser extent cis-aconitate and alpha-ketoglutarate had negative loadings, which indicated higher levels in non-producers (Figure 3B). Pyruvate remained relatively constant across all the strains (Figure 3—figure supplement 1A), implying that the differential responses in the TCA cycle were independent from the changes in its upstream central carbon metabolism.
Besides the TCA cycle metabolites, most of the annotated compounds in the metabolism of branched chain amino acids (leucine/isoleucine, valine) and sulfur-containing amino acids (cysteine/methionine) were more abundant in non-producers than producers (Figure 3—figure supplement 3). A notable exception was formylmethionine (fMet), which was lower in non-producers. To place this observation in context, we reanalyzed previous metabolomics data (Boyle et al., 2017) from an experiment where we had compared PA14 and its ΔrhlA mutant. Interestingly, the ΔrhlA mutant also had lower levels of fMet compared with the wild type (Figure 3—figure supplement 4). The ΔrhlA mutant grows just as fast in glycerol as the wild type (van Ditmarsch and Xavier, 2011), which means that the link between lower fMet and lack of surfactant secretion is not simply due to a growth defect.
Theory formalizes the link between oxidative stress, slower growth, and lack of surfactants
Several theoretical arguments support that non-producers might have slower primary metabolisms because they suffer from higher oxidative stress than the producers. First, the major differences include TCA intermediates (Figure 3B). The TCA cycle harbors five enzymes with Fe-S clusters, aconitase A, aconitase B, succinate dehydrogenase (SDH), fumarase A, and fumarase B (Py and Barras, 2010), which make them especially vulnerable to oxidative stress (Figure 3C). If non-producers are less successful at removing the oxidative stress from growth in glycerol, then the stress accumulated could reduce flux through the TCA cycle, which would slow down growth. The accumulation of succinate and depletion of fumarate in non-producers could be due to a reduced activity of SDH under oxidative stress: SDH is a membrane-bound dehydrogenase linked to the respiratory chain—a major site of ROS production in the cell—and also a member of the TCA cycle that catalyzes the oxidation of succinate into fumarate (Hederstedt and Rutberg, 1981). Since SDH contains [2Fe-2S], [3Fe-4S], and [4Fe-4S] clusters (Ayala-Castro et al., 2008), ROS that damages Fe-S clusters could decrease SDH activity in vivo.
Second, we saw that non-producers also included intermediate metabolites in amino acid biosynthetic pathways which are also substrates of Fe-S containing enzymes (Figure 3B). For example, the glutamate synthetase has a large chain (encoded by gltB) and a small chain (encoded by gltD), and both subunits contain Fe-S clusters that catalyze the production of glutamate from oxoglutarate and glutamine (Curti et al., 1996). Non-producers had slightly higher oxoglutarate and glutamine as well as lower glutamate than producers.
Finally, we used a whole-genome reconstruction to model the intracellular metabolic fluxes during growth in the glycerol medium (Nogales et al., 2017; Nogales et al., 2020). The rhamnolipids are produced when carbon is in excess (Boyle et al., 2015; Mellbye and Schuster, 2014) and, in the simulations, rhamnolipids secretion occurred when C:N flux ratio >6.3 (Figure 4—figure supplement 1). We then set C:N to 10.0, which exceeds the minimum threshold, and we simulated various levels of redox stress level by changing the flux through three redox molecules—NADH (reduced nicotinamide adenine dinucleotide), NADPH (reduced nicotinamide adenine dinucleotide phosphate), and GSH (reduced glutathione)—responsible for the bulk of cellular electron transfer and the main sources of reactive oxygen species (ROS) (Xiao and Loscalzo, 2020). Although redox stress is ultimately caused by high ROS levels, not fluxes, and ROS was not an explicit variable in our model, we assumed that ROS could perturb intracellular metabolic fluxes, particularly the fluxes of redox molecules, and we then asked how cell growth and surfactant synthesis responded to these ROS-mediated indirect perturbations.
For all three redox molecules, we found that the maximum growth rate was maintained at an intermediate flux range (redox homeostasis; Figure 4A–C, upper panels, white area). The growth rate decreased gradually as the flux of redox molecules changed in either direction away from this homeostatic interval (gray shading). The reason for the gradual decrease in growth rate is that deficiency or excess of these redox molecules which participate in the redox control of a great variety of biological processes disturbs the primary metabolism for growth. Importantly, before the growth starts slowing down due to an excess in redox species there was an abrupt shutdown in the overflow on carbon into many central carbon metabolites and—importantly— in the secretion of the HAAs, mono- and di-rhamnolipids that makeup the surfactants (Figure 4A–C, lower panels). The shutdown of overflow reactions under excess of redox molecules mimics the metabolic state in the non-producer strains who lose metabolic plasticity to overflow metabolites without compromising growth (Vemuri et al., 2006). When we forced these secretion fluxes to zero—to simulate a ΔrhlA mutant—the simulation did not predict a growth benefit. This result agrees with data showing that the ΔrhlA mutant has no growth benefit in glycerol media compared with wild type (van Ditmarsch and Xavier, 2011).
This also suggests that regaining surfactant synthesis should not restore fast growth in glycerol to non-producers, which agrees with the idea that loss of surfactant secretion is a consequence—not a cause—of the slower growth in non-producers. Notably, most of the overflowed metabolites are carbon-rich molecules, including HAA, mono-, and di-rhamnolipid (Xavier, 2011). Collectively, these theoretical arguments support a link between the ability to make surfactants and oxidative stress: fast growth and surfactant secretion are both impacted by the accumulation of ROS produced by primary metabolism.
Non-producers are worse at reducing oxidative stress
Our model suggested that non-producers suffer from higher oxidative stress. Pseudomonas have a variety of antioxidant enzymes including catalases, glutathione reductases, NADH peroxidases, and cysteine-based peroxidases (Mishra and Imlay, 2012), which in PA14 and PAO1 were only made when the growth rate maintained its maximum value under excess carbon (Xavier et al., 2011; Boyle et al., 2015; Mellbye and Schuster, 2014). Isogenic mutants lacking some of those could make surfactants (Vinckx et al., 2010). If the slower growth in the clinical non-producer strains is due to impairments in some of those mechanisms to reduce oxidative stress, then this could affect surfactant biosynthesis.
To test this hypothesis, we compared the dynamics of hydrogen peroxide (H2O2) during growth in glycerol (Figure 4D and E). H2O2 is a representative ROS that can diffuse freely between cells and the environment; we can access its dynamics by measuring the fluorescence by the Amplex assay, where the difference between cell cultures and baseline reveals each strain’s net ability to reduce H2O2 (Zhao et al., 2012). All strains except for M1608, one of the non-producers, quenched the signal over the first 18 hr (Figure 4E). Consistent with our hypothesis, the non-producers (blue lines) removed less H2O2 than moderate producers (green lines) and strong producers (red lines). To factor out the possibility that the non-producers performed worse due to lower cell density, we calculated the H2O2 removal rate per cell. We observed a similar relative trend of H2O2 degradation between non-producers and producers (Figure 4F): a linear mixed-effect model quantified the difference that mild and strong producers have in per-cell H2O2 removal rates, which was in both cases significantly higher than in non-producers (p<0.001; Figure 4G). This is consistent with the notion that non-producers suffer more from oxidative species, which can be caused by high production or slow degradation.
To identify the genes that would explain the varying tolerance to oxidative stresses, we performed a Spearman correlation between gene presence or absence and the time averaged H2O2 removal rate per cell. No gene passed the significance threshold (p<0.05) after Benjamini-Hochberg correction for multiple hypothesis testing. It is possible that oxidative stress tolerance is a system-level phenotype that depends on the interactions among many genes in multiple pathways. Alternatively, the phenotype can be determined by a small number of, but different genes for different strains. For example, the worst strain in this assay, M1608, lacks katE (Supplementary file 1), a catalase that degrades H2O2 (Mulvey et al., 1988), and another non-producer, F5677, lacks the redox sensor soxR (Hidalgo et al., 1998).
RNAseq reveals oxidative stress response in non-producers
To investigate transcriptomic differences associated with the surfactant phenotype, we selected 11 representative strains (6 strong producers, 3 mild producers, and 2 non-producers), in proportion to the number in each category. We then assessed their transcriptomics in Phase II, which is typically when producers start their surfactant biosynthesis (van Ditmarsch and Xavier, 2011). A PCA of the gene expression profiles could not cluster the strains according to their phenotypes (Figure 5—figure supplement 1). We then turned to RLQ analysis (R, L, and Q are explained in Figure 5A) to find both genes (Figure 5B) and pathways (Figure 5C) associated with rhamnolipid production.
As expected, the genes in the biosynthesis of rhamnose (rmlA, rmlB, rmlC, rmlD), rhamnolipid biosynthesis (rhlA, rhlB, rhlC, rhlR, rhlI), and quorum sensing (lasA, lasB, lasI, pqsABCDEH) were significantly less expressed in the non-producers (Figure 5B, Figure 5—figure supplement 2). Interestingly, in vitro data shows that the expression of genes from the above list, including lasI, lasR, rhlI, rhlR, pqsA, pqsR, in P. aeruginosa PAO1, can be significantly reduced by quorum sensing quencher and oxidative stress by exogenous H2O2 (Mohamed et al., 2020), supporting that the loss of the phenotype is linked to oxidative stress. More importantly, the zwf gene encoding glucose 6-phosphate dehydrogenase (G6PDH) was expressed higher in non-producers (Figure 5—figure supplement 2). G6PDH, a major source of NADPH that contributes to antioxidant defenses by reducing oxidized glutathione, was expressed higher, supporting our hypothesis that non-producers experience higher oxidative stress. Other genes, including the peroxide-sensing transcriptional regulator (oxyR), catalases (katA, katB), alkyl hydroperoxide reductases (ahpC, ahpF), glutaredoxin (grx), glutathione reductase (gor), and thioredoxins (trxA, trxB1, trxB2) were not highly expressed in the producers (Figure 5—figure supplement 2). We speculate that the stress levels may occur locally within the cells (e.g. within SDH) so that the oxidative stress response is also local. Another possibility is that the antioxidant defense may be regulated at the metabolic level. A notable example is biosynthesis of phenazine, whose genes (e.g. phzA1/B1/B2) are highly expressed in the producers but not in the non-producers. It was previously shown that the phenazine pyocyanin plays a role in the reduction of oxidative stress as it can restore growth of the oxyR mutant in rich medium (LB) and undergoes H2O2 oxidation (Vinckx et al., 2010).
The RLQ analysis also revealed the top five pathways enriched specifically in each phenotype group: strong producers were enriched in acarbose and validamycin biosynthesis (pau00525), phenazine biosynthesis (pau00405), polyketide sugar unit biosynthesis (pau00523), starch and sucrose metabolism (pau00500), and pentose and glucuronate interconversions (pau00040) (Supplementary file 4). By contrast, non-producers were enriched in the biosynthesis of siderophore group nonribosomal peptides (pau01053), ascorbate and aldarate metabolism (pau00053), monobactam biosynthesis (pau00261), D-glutamine and D-glutamate metabolism (pau00471), and sulfur metabolism (pau00920).
Exogenous oxidative stress impacts expression of surfactant biosynthetic genes
So far, we have shown data supporting that surfactant producers can reduce ROS better than non-producers. This explains why strains that grow slower in glycerol also tend to produce less surfactants from that carbon source, likely due to the impact of ROS on growth. It is even possible that the ability to direct metabolic fluxes to surfactant secretion is part of a multifarious response to oxidative stress. Previous work in an engineered strain of P. putida showed that inducing surfactant synthesis reduces production of the final electron acceptor CO2, suggesting that directing metabolic flux to surfactant secretion could indeed serve as an alternative electron sink (Tiso et al., 2016).
To investigate the link with oxidative stress further, we measured the expression of the rhamnolipid synthesis operon rhlAB to extrinsic oxidative stress using PA14 genetically engineered with a reporter fusion (PrhlABgfp). This strain allows tracking the expression of rhlAB through a dynamic readout of green fluorescence (Figure 6). A PA14 carrying gfp under a constitutive promoter, PA1/04/03gfp (Lambertsen et al., 2004), was used in parallel to account for growth effects on the gfp dynamics. We then added H2O2 at different doses always at the time when the culture started entering the exponential phase of growth in glycerol. In both PA14::PrhlABgfp and PA14::PA1/04/03gfp, the start of the exponential phase was increasingly delayed as the concentration of H2O2 was larger (Figure 6A and C), but once the exponential growth started there was no difference in the growth rate under different H2O2 concentrations (Figure 6E). This indicates that PA14 can restore normal operation of its primary metabolism after removal of the extrinsic stress.
The apparent removal of H2O2 during the lag phase may be achieved by metabolic rewiring that affects rhamnolipid production in the late exponential phase (phase II). Similar to growth dynamics, the promoter activity of both mutants was also delayed, and the time delay is longer at higher H2O2 concentration (Figure 6B and D). We determined that the ratio of the median PrhlAB expression relative to the median PA1/04/03 increased linearly with the H2O2 concentration (Figure 6F, orange). Since PA1/04/03 measures the growth dependence of promoter activity (e.g. changes in number of RNA polymerases and ribosomes), the increased ratio showed that extrinsic redox stress can induce rhlAB expression via a growth-independent mechanism that likely involves transcriptional activators.
We next compared the above results obtained using PA14 wild-type strain and the ΔrhlA mutant. The gfp dynamics showed that in the absence of H2O2 the ΔrhlA strain still attempts to express rhlAB at a similar level as the wild type (p=0.3). We observed no decrease in the growth rate with increasing H2O2 concentration (Figure 6E), which indicates that the lack of surfactant biosynthesis had no detrimental impacts on the primary metabolism. Similar to the PA14 wild-type strain, H2O2 increased the expression of the rhlAB promoter relative to the constitutive promoter in the ΔrhlA mutant (Figure 6F, blue), but the baseline promoter activity increased significantly compared to the wild type (1.14×, p<0.01). Taken together, these results indicate that the genes for surfactant biosynthesis can be induced by extrinsic redox stress, and that the induction is stronger if surfactant synthesis fails to start.
Mathematical model predicts impact of carbon sources on surfactant production
All analyses so far were done in glycerol, a carbon source relevant for P. aeruginosa rhamnolipid production in bioremediation (Zhao et al., 2021) and clinical settings (Scoffield and Silo-Suh, 2016; Abdel-Mawgoud et al., 2014). But could different environments lead strains that are non-producers in glycerol to produce surfactants, and vice-versa? To address this question, we developed a predictive model from data and then sought out to test its predictions experimentally.
The model is a function that takes as input a given carbon source and predicts whether a strain produces surfactants from that source. To parameterize the model we obtained a dataset using the BIOLOG phenotype microarrays PM1 and PM2a; these two arrays together contain 190 distinct carbon sources in separate wells of two 96-well microtiter plates. We used the same composition as in the glycerol media (minus the glycerol). Then, we grew each of the 31 strains in each of the 190 nutrients and negative controls (wells without carbon source) for 24 hr (shorter than 48 hr used previously to facilitate high throughput). We measured the final absorbance (OD600) to quantify the ability to grow in that nutrient (Figure 7A; Figure 7—figure supplement 1A). The data showed that D-glucose (also known as dextrose) was the best carbon source for growth. Also, we saw 40 carbon sources (including the non-metabolizable isomer of D-glucose, L-glucose) in which none of the strains could grow on (OD600 <0.05).
Then we assessed the ability of each strain to secrete surfactants from each carbon source and gave a score of 0 (absent), 1 (moderate), or 2 (high) using the drop collapse assay. To avoid experimentalist bias, the drop collapse was done by an experimentalist blind to the surfactant secretion score that we had determined previously in glycerol. The final dataset totaled N=6014 data points (31 strains grown on 194 conditions each, including the 190 carbon sources and 4 negative controls)—a sizeable dataset that we used to parameterize the model.
As with any high throughput approach, the BIOLOG data came with a loss of precision. We could assess that imprecision first by looking at three positive controls—tween20, tween40, and tween80—which come in the PM1 BIOLOG plate and are themselves surfactants, which means that they should produce a score of 3 regardless of surfactant secretion. The error of failing to identify collapse in these compounds was 4.2%. Then, we collected additional BIOLOG data on the PA14 ΔrhlA, which cannot produce surfactants. The rate of false identification of collapse in this strain was 1.3%. Finally, we used the fact that the PM1 plates contain glycerol in one of the wells to compare those results with our previous surfactant secretion scores (Figure 1). The assay correctly determined 8/8 non-producers, which amount to a 100% specificity rate. It failed to detect 10/23 producers, which amount to a 43% sensitivity rate. Taken together, these results show that BIOLOG assay was mildly sensitive but highly specific: while it may miss some surfactant secretion cases, it is unlikely to overestimate the ability of a specific sugar to induce secretion.
In the model, we encoded each carbon source as a set of hot-encoded binary variables . We included another covariate important to determine the surfactant production outcome: the OD600 after at 24 hr (covariate ), which quantifies whether that strain could grow on that nutrient. The model therefore had parameters, each represented by the Greek letter with an index () where is the number of nutrient conditions tested ( represents the parameter, or effect size, for the OD600, and each to represents the parameters or effect sizes for each nutrient). We used the LASSO (Tibshirani, 1996) to impose an L1-penalization for the parameter λ (see equation in Figure 7). Then we used threefold cross-validation, which splits the dataset into training and testing in repeated iterations and uses the test loss (the error rate of the model) to tune the penalty parameter. As common practice with LASSO, we choose the penalty value that was one standard error above the one that produced the minimum loss in the test: λ=9 × 10–3 (Figure 7—figure supplement 1B). This penalty shrank all but 11 parameters to 0 (Figure 7—figure supplement 1C). The covariates with non-zero effect sizes included OD600 and 10 carbon sources (Figure 7D). The top three compounds were—as expected—the positive controls tween80, tween40, and tween20. This provided a sanity check that the model worked as intended. D-glucose came in fourth place: the only carbon source that the model deemed better than glycerol at favoring P. aeruginosa surfactant production.
To validate the glucose result, we grew all the 31 strains in minimal medium using D-glucose as the sole carbon source, now using media that we made ourselves from scratch as opposed to the high throughput—but less precise—BIOLOG plates and we did each test in triplicate for added rigor. Consistent with our model, glucose improved surfactant secretion overall, showing better secretion seven strains relatively to the secretion we had seen in glycerol (Figure 7C); this included three strains that were non-producers in glycerol but turned into producers in glucose. Interestingly, the switch also went in the opposite direction: four strains had worse surfactant scores in D-glucose than in glycerol. This suggests a complex strain-to-strain variability in the metabolic features of P. aeruginosa. Nonetheless, the data showed that—at least for some strains—glucose, a carbon source which should impose less burden on primary metabolism (Guerra-Santos et al., 1984), could free up resources for secondary metabolism.
To test this idea further, we tested another carbon source, succinate, which P. aeruginosa can grow on (Figure 7A) but which—according to our model—should not improve secretion relatively to glycerol. Succinate enters the carbon metabolism directly through the TCA cycle, and compared to glycerol and glucose, the initial TCA cycle flux would be faster and generate more ROS level that would burden primary metabolism and impair surfactant synthesis. Indeed, none of the strains secreted surfactants in succinate (Figure 7C).
To investigate how oxidative stress might impact expression rhlAB in glucose we added H2O2 and monitored PrhlABgfp fluorescence in the PA14 wild type and ΔrhlA strains. As described previously (Figure 6), we used the same constitutive PA1/04/03gfp as the baseline for comparison. Phase I growth rate was unaffected, indicating that both strains could still remove oxidative stress effectively (Figure 7D). As in glycerol, the rhlAB expression levels showed the same higher baseline level in the ΔrhlA compared to the wild type. But, contrasting with the results from glycerol, D-glucose did not increase rhlAB expression with increasing H2O2 in either strain (Figure 7E). Taken together, these results support that carbon sources that place a lower burden on primary metabolism leave resources available to both deal with extrinsic oxidative stress and sustain surfactant production.
Discussion
Here we investigated the evolution and regulation of surfactant secretion using diverse isolates of P. aeruginosa. Our results suggest a generalizable principle: that a strain can use its secondary metabolic pathways to produce surfactants if it can meet its primary requirements of energy production, biomass synthesis, and reduction of oxidative stress (Figure 8A). The lineages that retained their ability to make surfactants from glycerol were also those capable of reducing the oxidative stress created by primary metabolism and could overflow any excess carbon to the secondary metabolic pathway (Figure 8B). The inconsistent distribution across the phylogenetic tree was possibly due to the life histories of each lineage. Non-producing strains showed impaired abilities to reduce oxidative stress, a slower exponential growth in that carbon source (indicating a deficient primary metabolism), and perturbed levels of TCA cycle metabolites consistent with oxidative damage and accumulated intermediates of amino acid synthesis (also consistent with a slower primary metabolism). Those burdens may be what prevented the strains from making surfactants as their resources were prioritized for primary metabolism (Figure 8C). This was consistent with the observation the OD600 at the end of exponential growth was not a good distinguisher for producers and non-producers, indicating that their final yield of biomass production was unaffected, while their exponential growth was.
What were the selective pressures that led to the diversity of surfactant secretion phenotypes? The answer to this question is less clear. The strains uncapable of making surfactants from glycerol-lacked genes (Supplementary file 1) that suggested a particular path which led to their loss of phenotype. Importantly, though, all three genes of the secondary biosynthetic pathway (rhlA, rhlB, rhlC) were intact, and some of those strains could make surfactants in D-glucose. Absence of a selective pressure to produce surfactants from glycerol in the life histories of non-producer lineages could account for these results. But another explanation is that the impaired ability to reduce oxidative stress in glycerol is a maladaptive consequence of specific adaptations experienced by those lineages. Since the strains were isolated from hospitalized patients—a host-associated environment where we may expect oxidative stresses imposed by the immune system in their fight against pathogens (Zhu et al., 2019) and by antibiotic treatment (Mohamed et al., 2020)—it is unlikely that the selection was for a worse response to oxidative stress.
Experiments in PAO1 and clinical isolates had shown previously that P. aeruginosa decreases expression of the key quorum-sensing genes (lasI, lasR, rhlI, rhlR, pqsA, and pqsR) in response to oxidative stress caused by exogenous H2O2 (Mohamed et al., 2020). Considering that quorum sensing regulates many genes (Whiteley et al., 1999), shutting down surfactant biosynthesis could be part of a broader response to reduce oxidative stress and save resources for primary metabolism. For PA14, a strain capable of reducing the stress produced by glycerol metabolism, adding an extrinsic oxidative stress induced expression of rhlAB in glycerol, though not in glucose. ΔrhlA increased the baseline expression in both glycerol and glucose but did not impact the rate of rhlAB increase with H2O2, suggesting an intricate regulation. Also, the lack of rhlA did not impact growth—in glycerol or glucose—compared to the wild type, even in the highest H2O2 concentrations tested. This is consistent with our flux-balance modeling of Pseudomonas metabolism, where shutting down rhamnolipid secretion did not impact the growth rate because the excess carbon could be alternatively secreted in other forms such as acetate (Figure 4A).
We did detect nuanced consequences of deleting surfactant biosynthesis in PA14: previous data shows that the ΔrhlA isogenic mutant of PA14 has lower levels of fMet (Boyle et al., 2017) which, interestingly, also happened in non-producer isolates analyzed here. fMet plays a role in translation initiation, and in the quality control mechanisms that degrade misfolded proteins (Piatkov et al., 2015). Bacillus subtilis lacking formyl-methionine transferase—the enzyme attaching a formyl group to methionine loaded on tRNAfmet—is more sensitive to H2O2 and defective in swarming (Cai et al., 2017). This link between fMet, oxidative stress, and the loss of rhamnolipid synthesis is intriguing and warrants further research. Also, the ΔrhlA mutant had higher levels of gamma-glutamylcysteine (Figure 3—figure supplement 4), the immediate precursor of glutathione. Since glutathione is a well-known antioxidant that protects cells from oxidative damage (Ezraty et al., 2017), the result could indicate a mild oxidative stress. The finding that ΔrhlA expressed rhlAB at a higher level than the wild type—in glycerol and in glucose—suggests a futile attempt by this mutant to synthesize more surfactants. The fatty acid biosynthesis pathway that provides precursors for surfactant biosynthesis regenerates NAD(P)+ from NAD(P)H. ΔrhlA could perturb this pathway causing minor increases in oxidative stress.
Our analysis adds to the model of metabolic prudence, which provided a social evolutionary explanation for biosurfactant secretion. The model of metabolic prudence says that the microbial cell can lower the cost of a cooperative public good by delaying expression to times when it has excess carbon source (Xavier et al., 2011). A detailed study suggested that the strain PA14, needs to regulate rhlAB expression to carefully split the flux of carbon between biomass production and rhamnolipid synthesis (Boyle et al., 2015). Here, comparing diverse strains reminds us that heterotrophs like P. aeruginosa use organic compounds like glycerol, glucose, and succinate as carbon source for biomass, but also as an energy source. That energy fuels essential processes, including the reduction of oxidative stress, which ends up competing with secondary metabolism. This adds molecular detail to complement the evolutionary explanation and advance our understanding of P. aeruginosa surfactants.
More generally, our analysis highlights the importance of balancing the individual costs and population benefits of secondary metabolism. There are many products of secondary metabolism that contribute to the outsize impact that microbes have on the macroscopic world. Products of one microbial species can impact other species—cooperatively or competitively—potentially amplifying microbial functions in diverse communities such as the human microbiome (Xavier, 2011). The surfactants of P. aeruginosa are only one instance of the multitude of microbial secondary metabolism (Demain, 2014). But their analysis suggests a general principle: natural selection will favor sophisticated regulation that conciliates the individual-level costs and the population-level benefits of secondary metabolism.
Materials and methods
Media
Glycerol synthetic media were prepared with 800 ml of Milipore water, 200 ml of 5× minimal salts buffer, 1 ml of 1 M magnesium sulphate, 0.1 ml of calcium sulphate with 0.5 gN/l nitrogen (ammonium sulphate), iron at 5 μM (iron III sulphate), and 3.0 gC/l glycerol, respectively. 5× stock minimal salts buffer was prepared with 12 g of Na2HPO4 (anhydrous), 15 g of KH2PO4 (anhydrous), and 2.5 g of NaCl into 1 l water.
Growth curve assays
Request a detailed protocolThe clinical isolates were inoculated in 3 mL of lysogen broth (LB) and incubated at 37°C overnight with shaking at 250 rpm. 500 μL of cell culture was pelleted and washed three times with PBS. 0.0025 units were inoculated into 3.0 g of carbon/liter of medium in glycerol minimal medium (7.7 g/l of glycerol) in BD Falcon (BD Biosciences, San Jose, CA) 96-well flat-bottom plates, with 150 μL of suspension per well. The plate was incubated during 48 hr at 37°C in a Tecan Infinite M1000 or Tecan Infinite M1000 Pro plate reader (Männedorf, Switzerland), with an orbital shaking of 4 mm of amplitude. OD600 was measured in 10 min intervals.
Rhamnolipid production quantification
Request a detailed protocolThe production of rhamnolipids was assessed by drop collapse assay (Jain et al., 1991; Chen et al., 2007). We placed 50 μL of the culture’s supernatant on a polystyrene surface (the lid of a 96-well plate). The presence of rhamnolipids decreases the surface tension of the liquid, making the drop collapse. We considered a strain rhamnolipid non-producer if the drop does not spread, a mild-producer if spreads, but not enough to pass over the lid’s circle that corresponds to the wells, and producer if spreads over it.
Pangenome analysis
Request a detailed protocolThe pangenomes of our clinical P. aeruginosa isolates were identified using the roary tool (Page et al., 2015) using the output of genome annotation from prokka (Seemann, 2014). The core genes need to have at least 95% identity using blastp and to be present in at least 99% of all genomes, and then were aligned with the MAFFT method (Katoh et al., 2002).
Phylogenetic analysis
Request a detailed protocolThe phylogenetic tree of clinical P. aeruginosa strains was constructed from core genomes as previously described (Yan et al., 2017). Moran’s I test was carried out using the ape package in R (Paradis and Schliep, 2019). The ancestral state of swarming and rhamnolipid production was reconstructed using corHMM package (Beaulieu et al., 2013).
Growth curve analysis
Request a detailed protocolWe first normalized the growth curves conducted in different experimental batches (different microtiter plates) using the PA14 inoculated in the same plate as the reference for normalization. The plate-specific coefficients were obtained by solving a linear regression model ‘log(OD of PA14)~Plate + Time’ (the Wilkinson notation) using Matlab function fitlm, where Plate is plate ids and Time is time points (both are categorical variables). Then the growth curves in each plate were normalized by multiplying the adjustment coefficient of the plate. These normalized growth curves were then smoothed by Savitzky-Golay filter (sgolayfilt function in Matlab). The growth phases (phase I, II, III) were determined by the following rules: (phase I) first-order derivative >0 and second-order derivative >0; (phase II) first-order derivative >0 and second-order derivative <0; (phase III) first-order derivative <0. The mean and maximum specific growth rate for each growth phase were estimated from the best-fit model among three models of bacterial growth curves: (Morens et al., 2004) exponential model ; (Rowland et al., 2018) Zwietering-Logistic model ; (Henderson, 1999) Zwietering-Gompertz model , where are the parameters to fit. The goodness of fit for each model was evaluated by the value of the coefficient of determination, , where is the growth curve data observed at time point t and . Both non-negative factorization and random forest regression were performed in python using sklearn.decomposition.NMF and sklearn.ensemble.RandomForestClassifier, respectively.
Metabolite extraction
Request a detailed protocolAll P. aeruginosa strains were grown until the end of exponential phase of growth in glycerol minimal medium. Bacteria was then loaded into 0.25-μm nylon membranes (Millipore) using vacuum, transferred to pre-warmed hard agar plates with the same medium composition and incubated at 37°C during 2.5 hr. The filters were then passed to 35-mm polystyrene dishes (Falcon) with 1 mL of 2:2:1 methanol:acetonitrile:H2O quenching buffer and incubated there during 15 min on dry ice. Cells were removed by scraping, and the lysate containing quenching buffer was transferred to 1.5 mL tubes and centrifuged at 16,000 rpm for 10 min at 4°C. Supernatant transferred to fresh tubes and stored at –80°C.
Metabolomic data preprocessing
Request a detailed protocolThe extracts were profiled using LC-MS, identifying a total of 92 compounds (Supplementary file 5). Some compounds contained missing values. These missing values in metabolite abundance can be (Morens et al., 2004) truly missing; (Rowland et al., 2018) present in a sample but its level is below detection limit; (Henderson, 1999) present in a sample at a level above the detection limit but missing due to failure of algorithms in data processing. Here we assume that a metabolite with missing values in all three replicates is truly missing in the sample and removed from our analysis (Supplementary file 5). However, if the missing values were only found in one or two replicates, the missing values were imputed by the average of the non-missing values. After that imputation, all compounds remaining with missing values were removed (Supplementary file 5).
The peak areas were normalized using cross-contribution compensating multiple standard normalization (Redestig et al., 2009) with NormalizeMets R package (De Livera et al., 2018). This method relies on the use of multiple internal standards. Since LC-MS lacks such internal standards, we used instead a set of metabolites assumed to be constant across all the strains. They were selected with a Kuskal-Wallis test, adjusting the p-value with Benjamini-Hochberg method. The ones with a p-value above 0.05 were considered constant (pyruvate, methylglyoxal, (S)–2-acetolactate, tyramine, D-glucose, (S)-lactate, N-acetyl-L-glutamate 5-semialdehyde, 4-aminobutyraldehyde, and glycine), therefore after the normalization step they were removed (indicated in red, Supplementary file 5A). The processed area peaks for all metabolites are included in Supplementary file 5.
Unsupervised analysis of metabolomic data
Request a detailed protocolThe hierarchical clustering of the normalized metabolomic data was performed using gplots R package (Warnes et al., 2015), with Euclidean distance and Ward’s aggregation method. The clustering was performed with all metabolites, including 16 unidentified metabolites that we could not identify (indicated in black, Figure 2—figure supplement 2). We included these 16 unidentified compounds in the clustering analysis because they were detected in all strains and therefore were not artifacts of the LC-MS. Nonetheless, because they were unidentified, we removed them from Figure 3—figure supplement 2 and from the downstream analyses. Two other compounds, fumarate and guanosine, which were only putative initially, were bioinformatically confirmed as all other compounds with the same molecular weight are produced by enzymatic reactions missing in our clinical isolates. The PCA was done with R’s base method, using also the normalized data and previously removing the 16 metabolites with unknown identity. The plot was obtained using ggplot2 R package (Wickham, 2016).
Metabolic pathway enrichment
Request a detailed protocolThe differential metabolites between rhamnolipid producers and non-producers were determined by a Mann-Whitney test, with p-values adjusted with Benjamini-Hochberg method and a significance level of 0.05. These compounds were fed to FELLA algorithm (Picart-Armada et al., 2018; Picart-Armada et al., 2017). FELLA retrieves a graph describing the relationships among pathway, module, enzyme, reaction, and compounds of P. aeruginosa strain PA14 from the KEGG database. The graph was then used as an input network of differential compounds for its diffusion algorithms (Vandin et al., 2011). The output of FELLA consists of all subnetworks of the entries predicted to have a high connectivity with the differential compounds. We filtered the subnetwork entries to keep only metabolic pathways. The entries shown in Supplementary file 3 are the ones with a significant probability of receiving part of the simulated flux.
OPLS-DA model
Request a detailed protocolOPLS-DA model of metabolomics data was built using ropls R package (Thévenot et al., 2015), fixing the number of orthogonal components to 3. R2 and Q2, key parameters for assessing the validity of the model, were assessed with sevenfold cross-validation. The significance of the model was determined by permutation test (n=2000). The p-value corresponds to the proportion of Q2perm above Q2. With a p-value below 0.05 we considered the model significant. The loadings of the predictive component of the model were extracted to determine how each metabolite contributes to the separation according to the phenotype.
Genome-scale modeling
Request a detailed protocolCustom Python codes were developed with the COBRApy package (Ebrahim et al., 2013) to carry out all metabolic flux modeling and simulations in the paper. Since iJN1411 model was developed for P. putida, we removed genes and associated reactions that are missing in all our strains but present in the iJN1411 model. The futile cycles involving NADH, NADPH, and GSH were also removed. The modified iJN1411 model was further expanded by adding rhamnolipid biosynthesis pathway involving 9 new metabolites and 12 new reactions.
The boundary fluxes of the model were set to mimic the composition of the glycerol minimum medium. For C:N=10, the lower bounds of glycerol and ammonium fluxes were set to –10 and –3, respectively. For C:N=3, the lower bounds were set to –3 and –3, respectively. The flux unit is mmol/gDW/hr throughout the paper. To constrain the total producing flux of NADH (the same for NADPH and GSH) at a certain value , we first defined a binary variable for each NADH-involving reaction to indicate whether NADH is produced by this reaction. Given the stoichiometric coefficient of NADH in this reaction () and its flux value (), the mathematical constraints for was set by for and otherwise. Therefore, the constraint that equalizes the total NADH producing flux and a constant is simply . However, both and are variables and such quadratic constraint has not yet been supported by COBRApy. We overcame this difficulty by defining and linearized the product with the following two inequalities: and , where and are the lower and upper bounds of . The two constraints ensure that when and when . The minimum/maximum flux values of byproduct secretion were simulated by flux variability analysis at maximum growth rate.
Detection of hydrogen peroxide (H2O2)
Request a detailed protocolThe H2O2 level in the extracellular medium was quantified with Amplex Red Hydrogen Peroxide/Peroxidase Assay Kit (Invitrogen, Carlsbad, USA, Catalog no. A22188). 500 μL of overnight growth in LB medium cell suspension were spun down, washed twice in PBS, and resuspended into glycerol medium at OD 1. Each reaction was done in a final volume of 100 μL, with a final concentration of 50 μM of Amplex Red reagent, 0.1 U/mL of HRP (Horseradish Peroxidase), and 0.2 cell OD, in glycerol synthetic medium, in BD Falcon (BD Biosciences, San Jose, CA) 96-well flat-bottom plates. The first column of the plate corresponded to the reaction without cells (the volume was substituted by PBS), and the last column instead of cells contained H2O2 (final concentration 10 μM). OD600 and fluorescence 530/590 nm were measured in 10 min intervals (48 hr 37°C).
RNA-seq and data analysis
Request a detailed protocolThe P. aeruginosa strains were inoculated into LB and grown with agitation for 16 hr. The overnight culture was diluted in glycerol synthetic medium the next day and grew to late exponential phase to harvest RNA. Cells were harvested by centrifuge and then immediately resuspended in RNAprotect (QIAGEN). The RNA was extracted using the RNeasy Mini Kit from QIAGEN. The standard RNA sequencing was done by Genewiz (Genewiz, Inc; South Plainfield, NJ) using pair-end indices of 150 bp read length on an Illumina HiSeq platform. The raw fastq files first went through quality check by fastQC (Andrews, 2010), and aligned with STAR (Dobin et al., 2013). The final output was generated using DEseq2 (Love et al., 2014) package in R. The read counts for each gene are included in Supplementary file 6. The SRA accession number for the RNAseq data is listed in Supplementary file 7.
RLQ analysis
Request a detailed protocolTo test for the functional associations between genes and rhamnolipid production, we adopted the RLQ analysis (ade4 package in R) that was developed for identifying the associations between traits and environmental variables. Here we applied it to RNAseq analysis to understand the main co-structures between gene functions and phenotypes mediated gene expression. The theory of RLQ analysis is described in detail elsewhere (Thioulouse et al., 2018). Briefly, it builds on the simultaneous ordination of three tables: an R table (categorical) describing rhamnolipid production (strong-, mild-, and non-producers) for 11 selected isolates (each replicate was treated as an independent observation), an L table (continuous) describing the RNAseq counts of 1474 core genes shared among the 11 isolates, and a Q table (binary) describing 120 KEGG pathways for the 1474 genes. We first applied corresponding analysis (CA) to table L and Q and PCA to table R. In the CA of R and Q, each strain and gene were weighted by their total RNA expression levels, respectively. The three separate ordinations are then used as inputs to the rlq function. The output of the function includes coefficients for the gene functions and phenotypes (loadings) as well as the scores of strains (out of phenotypes) and scores of genes (out of gene functions). RLQ analysis maximizes the squared cross-covariances between the two sets of scores weighted by gene expressions.
Hydrogen peroxide (H2O2) degradation
Request a detailed protocolH2O2 was removed from environment by cells which degrade H2O2 intracellularly. The cumulative H2O2 removal curve was calculated by subtracting the values of emission of the wells containing each strain from the values of emission of the wells without cells or H2O2. Then they were normalized to the relative scale by dividing their values by the corresponding values (at the same time points) of the control strain (PA14) in the same experiment (plate). These normalized curves were then smoothed by the Savitzky-Golay filter (sgolayfilt function in Matlab). The H2O2 removal rate per cell was calculated by dividing the first-order derivative of the cumulative removal curves by OD values. To estimate the effect of rhamnolipid production on per-cell H2O2 removal rate, we developed a linear mixed-effect model that extends the simple linear regression model by considering both fixed and random effects. The model was specified by the Wilkinson notation ‘RemovalRate ~Time + RHL + (1|CurveID)’, where RemovalRate is per-cell H2O2 removal rate, Time is time point (categorical variable), RHL is categorical rhamnolipid production (non-producer, mild-producer, and strong-producer), and CurveID is the unique ID for each strain and replicate. The expression ‘(1|CurveID)’ indicates that CurveID was modeled as random effects. The model was solved by fitlme in Matlab.
Promoter activity and growth rate comparison
Request a detailed protocolTwo P. aeruginosa strains carrying chromosomal insertions of the reporter fusion PrhlABgfp (Lequette and Greenberg, 2005) and PA1/04/03gfp (a constitutive promoter, to be used as internal standard) (Lambertsen et al., 2004) in a PA14 background (Xavier et al., 2011), respectively, were inoculated into LB, and overnight cultures were harvested and washed with PBS three times. Cells were diluted to OD600=0.1 in synthetic media and monitored using TECAN plate reader for both OD600 and GFP fluorscence. Upon cells entering the exponential phase, H2O2 was added to the cells at a concentration in the assay of 0, 10, 20, 50, and 100 mM, continuing growth monitoring over 48 hr. After identifying growth phase II, the promoter activity at each timepoint within this interval was calculated as: . The ratio of promoter activity in each experiment was extracted as Rp = . Similarly, the growth rate was calculated as , and the ratio of strains with different promoters was calculated as . The effect of H2O2 was estimated by fitglme in Matlab as: log(R)~a0+ [H2O2]+, where R is the measured ratio of either promoter activity or growth rate, and is denoted as the random effect of experiments. This procedure was used too for performing the promoter activity and growth rate analyses in PA14 ∆rhlA mutant.
BIOLOG assay to obtain data for predictive modeling
Request a detailed protocolWe grew each of the P. aeruginosa stains in the BIOLOG phenotype microarrays PM1 and PM2a from Biolog Inc (Hayward, California, USA). We prepared minimal media without any carbon source added (800 ml of Milipore water, 200 ml of 5× minimal salts buffer, 1 ml of 1 M magnesium sulphate, 0.1 ml of calcium sulphate with 0.5 gN/l nitrogen [ammonium sulphate], iron at 5 μM [iron III sulphate]; 5× stock minimal salts buffer was prepared with 12 g of Na2HPO4 [anhydrous], 15 g of KH2PO4 [anhydrous], and 2.5 g of NaCl into 1 l water). We grow 5 mL bacterial cultures for 12–16 hr at 250 rpm and 37 ℃, then aliquoted washed 1.5 mL in media without carbon source and adjusted the initial OD by further dilution. We added 150 ul of inoculated media onto each well of a BIOLOG plate and incubated for 24 hr overnight shaking at 250 rpm at 37℃. After that we measured the endpoint OD and assessed surfactant secretion using the drop collapse assay.
Data availability
Sequencing data have been deposited in SRA, in the bioproject accession number PRJNA253624. Each individual sample has a file accession number listed in Supplementary file 7. The additional dataset is provided through Dryad. The source code for generating the figures of this study is available from: https://github.com/guisantagui/code_PA_paper (copy archived at swh:1:rev:cf190909e0f87db785ec4803bdd6e1a7cb854ad9); https://github.com/liaochen1988/Source_code_for_Pseudomonas_Metabolomics_Paper (copy archived at swh:1:rev:dcc50627d31d66a9f25a4d6b60294321087edf50); https://github.com/Jinyuan1998/PA_metabolomics_rhamnolipids_SourceCode (copy archived at swh:1:rev:9c8f53d6d21bd089c605f8b55f72f04774fd31bd).
-
NCBI BioProjectID PRJNA253624. RNAseq files.
-
Dryad Digital RepositoryEvolution and regulation of microbial secondary metabolism.https://doi.org/10.5061/dryad.7sqv9s4tg
References
-
Rhamnolipids: diversity of structures, microbial origins and rolesApplied Microbiology and Biotechnology 86:1323–1336.https://doi.org/10.1007/s00253-010-2498-2
-
Siderophores: from natural roles to potential applicationsAdvances in Applied Microbiology 106:193–225.https://doi.org/10.1016/bs.aambs.2018.12.001
-
Fe-S cluster assembly pathways in bacteriaMicrobiology and Molecular Biology Reviews 72:110–125.https://doi.org/10.1128/MMBR.00034-07
-
Tinbergen’s four questions: an appreciation and an updateTrends in Ecology & Evolution 28:712–718.https://doi.org/10.1016/j.tree.2013.09.013
-
Regulation of secondary metabolism in streptomycetesCurrent Opinion in Microbiology 8:208–215.https://doi.org/10.1016/j.mib.2005.02.016
-
Metabolism and the evolution of social behaviorMolecular Biology and Evolution 34:2367–2379.https://doi.org/10.1093/molbev/msx174
-
Regulation of fungal secondary metabolismNature Reviews. Microbiology 11:21–32.https://doi.org/10.1038/nrmicro2916
-
Sharing the sandbox: evolutionary mechanisms that maintain bacterial cooperationF1000Research 4:F1000 Faculty Rev-1504.https://doi.org/10.12688/f1000research.7363.1
-
The application of a high throughput analysis method for the screening of potential biosurfactants from natural sourcesJournal of Microbiological Methods 70:503–510.https://doi.org/10.1016/j.mimet.2007.06.006
-
Microbial production of rhamnolipids: opportunities, challenges and strategiesMicrobial Cell Factories 16:0753-2.https://doi.org/10.1186/s12934-017-0753-2
-
Glutamate synthase: a complex iron-sulphur flavoproteinBiochemical Society Transactions 24:95–99.https://doi.org/10.1042/bst0240095
-
The natural functions of secondary metabolitesAdvances in Biochemical Engineering/Biotechnology 69:1–39.https://doi.org/10.1007/3-540-44964-7_1
-
Importance of microbial natural products and the need to revitalize their discoveryJournal of Industrial Microbiology & Biotechnology 41:185–201.https://doi.org/10.1007/s10295-013-1325-z
-
Multilevel selection analysis of a microbial social traitMolecular Systems Biology 9:684.https://doi.org/10.1038/msb.2013.42
-
Star: ultrafast universal RNA-seq alignerBioinformatics 29:15–21.https://doi.org/10.1093/bioinformatics/bts635
-
Secondary metabolites and other small molecules as intercellular pathogenic signalsFEMS Microbiology Letters 314:10–17.https://doi.org/10.1111/j.1574-6968.2010.02154.x
-
Oxidative stress, protein damage and repair in bacteriaNature Reviews. Microbiology 15:385–396.https://doi.org/10.1038/nrmicro.2017.26
-
Exploiting the natural diversity of rhla acyltransferases for the synthesis of the rhamnolipid precursor 3- (3-hydroxyalkanoyloxy) alkanoic acidApplied and Environmental Microbiology 86:e02317-19.https://doi.org/10.1128/AEM.02317-19
-
Pseudomonas aeruginosa biosurfactant production in continuous culture with glucose as carbon sourceApplied and Environmental Microbiology 48:301–305.https://doi.org/10.1128/aem.48.2.301-305.1984
-
Succinate dehydrogenase -- a comparative reviewMicrobiological Reviews 45:542–555.https://doi.org/10.1128/mr.45.4.542-555.1981
-
The looming threat of bioterrorismScience 283:1279–1282.https://doi.org/10.1126/science.283.5406.1279
-
A drop-collapsing test for screening surfactant-producing microorganismsJournal of Microbiological Methods 13:271–279.https://doi.org/10.1016/0167-7012(91)90064-W
-
MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transformNucleic Acids Research 30:3059–3066.https://doi.org/10.1093/nar/gkf436
-
Pheromones, social behaviour and the functions of secondary metabolism in bacteriaTrends in Ecology & Evolution 10:126–129.https://doi.org/10.1016/S0169-5347(00)89013-8
-
Mini-tn7 transposons for site-specific tagging of bacteria with fluorescent proteinsEnvironmental Microbiology 6:726–732.https://doi.org/10.1111/j.1462-2920.2004.00605.x
-
Microbial secondary metabolismTrends in Biochemical Sciences 5:68–72.https://doi.org/10.1016/0968-0004(80)90071-7
-
Physiological framework for the regulation of quorum sensing-dependent public goods in Pseudomonas aeruginosaJournal of Bacteriology 196:1155–1164.https://doi.org/10.1128/JB.01223-13
-
Why do bacteria use so many enzymes to scavenge hydrogen peroxide?Archives of Biochemistry and Biophysics 525:145–160.https://doi.org/10.1016/j.abb.2012.04.014
-
Evaluation of rhamnolipid production capacity of Pseudomonas aeruginosa PAO1 in comparison to the rhamnolipid over-producer strains DSM 7108 and DSM 2874Applied Microbiology and Biotechnology 89:585–592.https://doi.org/10.1007/s00253-010-2901-z
-
The contribution of exoproducts to virulence of Pseudomonas aeruginosaCanadian Journal of Microbiology 31:387–392.https://doi.org/10.1139/m85-074
-
Roary: rapid large-scale prokaryote pan genome analysisBioinformatics 31:3691–3693.https://doi.org/10.1093/bioinformatics/btv421
-
FELLA: an R package to enrich metabolomics dataBMC Bioinformatics 19:538.https://doi.org/10.1186/s12859-018-2487-5
-
Biochemistry, genetics and biotechnology of glycerol utilization in Pseudomonas speciesMicrobial Biotechnology 13:32–53.https://doi.org/10.1111/1751-7915.13400
-
Building Fe-S proteins: bacterial strategiesNature Reviews. Microbiology 8:436–446.https://doi.org/10.1038/nrmicro2356
-
Gut microbiota functions: metabolism of nutrients and other food componentsEuropean Journal of Nutrition 57:1–24.https://doi.org/10.1007/s00394-017-1445-8
-
Production of microbial secondary metabolites: regulation by the carbon sourceCritical Reviews in Microbiology 36:146–167.https://doi.org/10.3109/10408410903489576
-
Glycerol metabolism promotes biofilm formation by Pseudomonas aeruginosaCanadian Journal of Microbiology 62:704–710.https://doi.org/10.1139/cjm-2016-0119
-
Prokka: rapid prokaryotic genome annotationBioinformatics 30:2068–2069.https://doi.org/10.1093/bioinformatics/btu153
-
BookMultivariate Analysis of Ecological Data with Ade4New York, NY: Springer.https://doi.org/10.1007/978-1-4939-8850-1
-
Regression shrinkage and selection via the LASSOJournal of the Royal Statistical Society 58:267–288.https://doi.org/10.1111/j.2517-6161.1996.tb02080.x
-
Creating metabolic demand as an engineering strategy in Pseudomonas putida-rhamnolipid synthesis as an exampleMetabolic Engineering Communications 3:234–244.https://doi.org/10.1016/j.meteno.2016.08.002
-
Self-produced extracellular stimuli modulate the Pseudomonas aeruginosa swarming motility behaviourEnvironmental Microbiology 9:2622–2630.https://doi.org/10.1111/j.1462-2920.2007.01396.x
-
Algorithms for detecting significantly mutated pathways in cancerJournal of Computational Biology 18:507–522.https://doi.org/10.1089/cmb.2010.0265
-
Overflow metabolism in Escherichia coli during steady-state growth: transcriptional regulation and effect of the redox ratioApplied and Environmental Microbiology 72:3653–3661.https://doi.org/10.1128/AEM.72.5.3653-3661.2006
-
Social evolution theory for microorganismsNature Reviews. Microbiology 4:597–607.https://doi.org/10.1038/nrmicro1461
-
Social interaction in synthetic and natural microbial communitiesMolecular Systems Biology 7:483.https://doi.org/10.1038/msb.2011.16
-
Metabolic responses to reductive stressAntioxidants & Redox Signaling 32:1330–1347.https://doi.org/10.1089/ars.2019.7803
-
Bow-tie signaling in c-di-GMP: machine learning in a simple biochemical networkPLOS Computational Biology 13:e1005677.https://doi.org/10.1371/journal.pcbi.1005677
-
Photooxidation of Amplex red to resorufin: implications of exposing the Amplex red assay to lightFree Radical Biology & Medicine 53:1080–1087.https://doi.org/10.1016/j.freeradbiomed.2012.06.034
Article and author information
Author details
Funding
National Institutes of Health (U01 AI124275)
- Joao B Xavier
National Institutes of Health (R01 AI137269)
- Joao B Xavier
Fundação para a Ciência e a Tecnologia (UIDB/04046/2020)
- Francisco Rodrigues Pinto
Fundação para a Ciência e a Tecnologia (UIDP/04046/2020)
- Francisco Rodrigues Pinto
European Research Council (734790)
- Francisco Rodrigues Pinto
Fundação para a Ciência e a Tecnologia (SFRH/BD/142899/2018)
- Guillem Santamaria
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
This work was supported by NIH grants U01 AI124275 and R01 AI137269-01 to JBX, FRP and GS were partially supported by UIDB/04046/2020 and UIDP/04046/2020 Centre grants from FCT, Portugal (to BioISI) and by the LungCARD project (Grant Agreement n°: 734790, H2020-MSCA-RISE-2016). GS is the recipient of a fellowship from BioSys PhD programme PD65-2012 (Ref SFRH/BD/142899/2018) from FCT (Portugal). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Copyright
© 2022, Santamaria, Liao et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 2,277
- views
-
- 373
- downloads
-
- 15
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Computational and Systems Biology
Changes in chondrocyte gene expression can contribute to the development of osteoarthritis (OA), and so recognition of the regulative processes during chondrogenesis can lead to a better understanding of OA. microRNAs (miRNAs) are key regulators of gene expression in chondrocytes/OA, and we have used a combined experimental, bioinformatic, and systems biology approach to explore the multiple miRNA–mRNA interactions that regulate chondrogenesis. A longitudinal chondrogenesis bioinformatic analysis identified paralogues miR-199a-5p and miR-199b-5p as pro-chondrogenic regulators. Experimental work in human cells demonstrated alteration of miR-199a-5p or miR-199b-5p expression led to significant inverse modulation of key chondrogenic genes and extracellular matrix production. miR-199a/b-5p targets FZD6, ITGA3 and CAV1 were identified by inhibition experiments and verified as direct targets by luciferase assay. The experimental work was used to generate and parameterise a multi-miRNA 14-day chondrogenesis kinetic model to be used as a repository for the experimental work and as a resource for further investigation of this system. This is the first multi-miRNA model of a chondrogenesis-based system, and highlights the complex relationships between regulatory miRNAs, and their target mRNAs.
-
- Computational and Systems Biology
Early-career researchers in the Global South have to overcome obstacles that are not found in high-income countries, but in Morocco at least, the future is looking brighter than the past.