Deciphering the regulatory genome of Escherichia coli, one hundred promoters at a time
Abstract
Advances in DNA sequencing have revolutionized our ability to read genomes. However, even in the most well-studied of organisms, the bacterium Escherichia coli, for ≈ 65% of promoters we remain ignorant of their regulation. Until we crack this regulatory Rosetta Stone, efforts to read and write genomes will remain haphazard. We introduce a new method, Reg-Seq, that links massively parallel reporter assays with mass spectrometry to produce a base-pair-resolution dissection of more than 100 E. coli promoters in 12 growth conditions. We demonstrate that the method recapitulates known regulatory information. Then, we examine regulatory architectures for more than 80 promoters which previously had no known regulatory information. In many cases, we also identify which transcription factors mediate their regulation. This method clears a path for highly multiplexed investigations of the regulatory genome of model organisms, with the potential of moving to an array of microbes of ecological and medical relevance.
Data availability
Sequencing data has been deposited in the SRA under accession nos. PRJNA599253 and PRJNA603368. Mass spectrometry data is deposited in the CalTech data repository at doi:10.22002/d1.1336. Model files and inferred information footprints are deposited in the CalTech data repository at doi:10.22002/D1.1331. Processed sequencing data sets and analysis software are available in the GitHub repository available at https://doi.org/10.5281/zenodo.3953312.
- RNAseq data for the Reg-Seq project. Short Read Archive, PRJNA599253.
- Mass Spectrometry data for the Reg-Seq project. CalTech Data, 10.22002/d1.1336.
- Sequencing Data for mapping mutated constructs. Short Read Archive, PRJNA603368.
Article and author information
Funding
National Institutes of Health (Director's Pioneer Award)
- Rob Phillips
National Institutes of Health (National Research Service Award, 5T32GM007616-38)
- Suzannah M Beeler
National Institutes of Health (Maximizing Investigators Research Award)
- Rob Phillips
Howard Hughes Medical Institute (International Student Research Fellowship)
- Nathan M Belliveau
National Institutes of Health (1S10OD02001301)
- Annie Moradian
National Institutes of Health (1S10OD02001301)
- Michael J Sweredoski
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Copyright
© 2020, Ireland et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.