Musashi-1 can down-regulate translation in bacteria.

a) Overview of the biotechnological development. In mammals, MSI-1 binds to the 3’ UTR of its target mRNA to repress translation. Here, the M. musculus gene coding for MSI-1 was moved to E. coli (transgenesis) to implement a synthetic regulation system at the level of translation. b) Schematic of the synthetic gene circuit engineered in E. coli. A truncated version of MSI-1 (termed MSI-1*) was expressed from the PLlac promoter to be induced with lactose (or IPTG) in a genetic background over-expressing LacI. sfGFP was used as a reporter expressed from a constitutive promoter (J23119) and under the control of a suitable RNA motif recognized by MSI-1* in the N-terminal coding region of the transcript (viz., located after the start codon). The activity of MSI-1* could in turn be allosterically inhibited by oleic acid. In electronic terms, this circuit implements an IMPLY logic gate. The inset shows the predicted secondary structure of the N-terminal coding region of the reporter mRNA. Within the motif (blue shaded), the consensus recognition sequences (RUnAGU) are bolded and the minimal cores (UAG) are marked in red. System implemented with pRM1+ and pREP6. c) Dose-response curve of the system using lactose as inducer (up to 1 mM). MSI-1* down-regulated sfGFP expression by 2.5-fold. The inset shows the dynamic range of the response using lactose or IPTG (1 mM), showing a statistically significant regulation in both cases (Welch’s t-test, two-tailed P < 0.05). d) Transfer function of the system (between sfGFP and MSI-1*). The inset shows the dose-response curve of eBFP2 expressed from the PLlac promoter (proxy of MSI-1* expression) with lactose. e) Scatter plot of the dynamic response of the system in the Crick space (translation rate vs. transcription rate). The dose-response curve of mScarlet expressed from the J23119 promoter with lactose was used to perform the decomposition (vertical line fitted to 48 AU/h). The inset shows the growth rate of the cells for each induction condition (horizontal line fitted to 0.55 h-1). In all cases, points correspond to experimental data, while solid lines come from adjusted mathematical models. Error bars correspond to standard deviations (n = 3). f) Probability-based histograms of sfGFP expression from single-cell data for different lactose concentrations, showing a statistically significant regulation (one-way ANOVA test, P < 10-4). The inset shows the percentage of cells in the ON state (sfGFP expressed), according to a specified threshold, for each lactose concentration. AU, arbitrary units.

Mechanistic characterization of the Musashi-1-mRNA interaction.

a) Sequences and predicted secondary structures of the different RNA motif variants for MSI-1 binding analyzed in this work. Point-mutations indicated in red. Three-dimensional representations of the RRM1 and RNA motif are also shown. Within the RRM1, the region that recognizes the RNA is shown in blue. b) Dynamic range of the response of the different genetic systems using lactose (1 mM), showing a statistically significant regulation in all cases (Welch’s t-test, two-tailed P < 0.05; although some mutants present a small fold change). c) Schematic of the heliX biosensor platform. A double-strand DNA nanolever was immobilized on a gold electrode of the chip. The nanolever carried a fluorophore in one end and the RNA motif for MSI-1 binding in the other. Binding between MSI-1* (injected analyte) and RNA led to a fluorescence change, whose monitoring in real time served to extract the kinetic constants that characterize the interaction. d) Scatter plot of the experimentally-determined kinetic constants of association and dissociation between the protein and the RNA for all systems (original and 5 mutants). Means and deviations calculated in log scale (geometric). e) Correlation between the maximal sfGFP expression level (in absence of lactose) and the translation rate predicted with RBS calculator. Linear regression performed. f) Correlation between the fold change in sfGFP expression and the dissociation constant (KD). Deviations calculated by propagation. Linear regression performed (vs. 1/KD). Blue shaded areas indicate 95% confidence intervals. In all cases, error bars correspond to standard deviations (n = 3). AU, arbitrary units.

A mathematical model captures the dynamic response of the system.

a) Schematics of gene regulation at different levels with proteins that bind to nucleic acids (DNA or RNA). On the left, schematic of transcription regulation (e.g., LacI regulating MSI-1* expression). On the right, schematic of translation regulation (e.g., MSI-1* regulating sfGFP expression). A general mathematical expression (grey shaded) was derived to calculate the fold change in protein expression as a function of the regulator concentration (R), the association and dissociation rates (kON and kOFF), the elongation leakage fraction (E), and the nucleic acid degradation rate (o). b) Heatmap of the fold change as a function of kONR and kOFF (i.e., the first-order kinetic rates that characterize the protein-DNA/RNA interaction) when o = 0 and E = 0.1. This would correspond to transcription regulation. c) Heatmap of the fold change when o = 0.14 min-1 and E = 0.1. This would correspond to translation regulation. d) Total red fluorescence of the cell population (ΣmScarlet) over time without and with 1 mM lactose. In this case, the cell growth rate was fitted to 0.80 h-1. e) Total green fluorescence of the cell population (ΣsfGFP) over time without and with 1 mM lactose. The inset shows the dynamic response for different lactose concentrations. f) Correlation between the experimental values of ΣsfGFP at different times and for different lactose concentrations and the predicted values from a mathematical model that accounts for population growth and gene regulation. Data for t > 2 h. Linear regression performed. g) Ratio of total green and red fluorescence as a proxy of cellular sfGFP expression over time. Ratio not represented at early times due to the high error obtained given the low number of cells present in the culture (grey shaded area). Deviations calculated by propagation. In all cases, points correspond to experimental data, while solid lines come from an adjusted mathematical model. Error bars correspond to standard deviations (n = 3). AU, arbitrary units.

mRNA redesign to enhance the down-regulation by Musashi-1.

a) Dynamic range of the response of two redesigned genetic systems using lactose (1 mM), showing a statistically significant regulation only in the first case (Welch’s t-test, two-tailed P < 0.05). The predicted secondary structures of the N-terminal coding regions of the reporter mRNAs are shown on the right. Redesign 1 (red1) was implemented with pREP4b and redesign 2 (red2) with pREP4b3x, which contains three MSI-1 binding sites. These stem-loop structures are less stable than the original one. b) Dose-response curve of another redesigned genetic system (redesign 3, red3) using lactose as inducer (up to 1 mM). MSI-1* down-regulated sfGFP expression by 8.6-fold (Welch’s t-test, two-tailed P < 0.05). The inset shows the scatter plot of the dynamic response in the Crick space (translation rate vs. transcription rate; vertical line fitted to 27 AU/h). The predicted secondary structure of the N-terminal coding region of the reporter mRNA is shown on the right; the mRNA contains two MSI-1 binding sites (blue shaded). In the 5’ UTR, the binding site is formed by two RUnAGU repeats that flank the RBS without forming secondary structure. In the N-terminal coding region, the binding site is the original one. The minimal cores (UAG) are marked in red. Redesign 3 was implemented with pREP7. Points correspond to experimental data, while the solid line comes from an adjusted mathematical model. In all cases, error bars correspond to standard deviations (n = 3). c) Probability-based histograms of sfGFP expression from single-cell data for different lactose concentrations (redesign 3), showing a statistically significant regulation (one-way ANOVA test, P < 10-4). The inset shows the percentage of cells in the ON state (sfGFP expressed), according to a specified threshold, for each lactose concentration. AU, arbitrary units.

Oleic acid inhibits the regulatory activity of Musashi-1 in bacteria.

a) Three-dimensional structural schematic of the allosteric regulation. RRM1 of MSI-1 is shown alone, in complex with the RNA motif, and in complex with oleic acid. b) Gel electrophoresis mobility shift assay to test the allosteric inhibition of MSI-1* with oleic acid. A purified MSI-1* protein (45 μM), the RNA motif as a label-free sRNA molecule (11 μM), and oleic acid (1 mM) were mixed in a combinatorial way in vitro. On the left, nucleic acid-stained gel. On the right, protein-stained gel (Coomassie). The different formed species are indicated. M denotes molecular marker (GeneRuler ultra-low range DNA ladder, 10-300 bp, Thermo). BSA was used as a control. c) On the top, images of E. coli colonies harboring pRM1+ and pREP6. Bacteria were seeded in LB-agar plates with suitable inducers (1 mM lactose or 1 mM lactose + 20 mM oleic acid). Fluorescence and bright field images are shown. On the bottom, schematics of the working modes of the synthetic gene circuit according to the different induction conditions. d) Quantification of the green fluorescence of the colonies from panel c (denoted by ΣsfGFP as it is from populations; n = 5). e) Images of E. coli colonies harboring pRM1+ and pREP7. f) Quantification of the green fluorescence of the colonies from panel e. Oleic acid produced a statistically significant response in both cases (Welch’s t-test, two-tailed P < 10-4). AU, arbitrary units.

Applications of Musashi-1 for a fine expression control in bacteria.

a) Overview of the regulatory utility of MSI-1*. It could i) regulate the expression of a given enzyme belonging to a polycistronic operon for a metabolic pathway control, ii) be exploited together with transcription factors to implement combinatorial regulations following the genetic information flow, envisioning biosensing applications, and iii) regulate noise in protein expression with the aim of producing cell populations less disperse, especially for bacterial delivery applications in animals. b) Schematic of a new synthetic gene circuit engineered in E. coli. MSI-1* was always expressed from the PLlac promoter to be induced with lactose in a genetic background over-expressing LacI. sfGFP and mScarlet were used as reporters, both expressed from the PLtet promoter to be induced with aTC in a genetic background over-expressing TetR. In this bicistronic operon, only sfGFP was under the control of a suitable RNA motif recognized by MSI-1* in the leader region of the transcript (original motif or redesign 3). In electronic terms, this circuit implements a NIMPLY logic gate considering sfGFP as the output. System implemented with pRM1+ and pREP6α or pREP7α. c) Dynamic range of the response using lactose (1 mM) and aTC (100 ng/mL) in a combinatorial way. aTC significantly activated the expression of the operon (Welch’s t-test, two-tailed P < 0.05) and lactose, through the action of MSI-1*, significantly down-regulated sfGFP expression in a specific way (data for pREP7α; Welch’s t-test, two-tailed P < 0.05). Error bars correspond to standard deviations (n = 3). d) Probability-based histograms of sfGFP expression from single-cell data for different inducer concentrations. On the left, pRM1+ and pREP6α with 100 ng/mL aTC and 1 mM lactose (i) or 15 ng/mL aTC (ii). On the right, pRM1+ and pREP7α with 100 ng/mL aTC and 1 mM lactose (iii) or 30 ng/mL aTC (iv). The mean expression and Fano factor are shown. AU, arbitrary units.