Deciphering the regulatory genome of Escherichia coli, one hundred promoters at a time

  1. William T Ireland
  2. Suzannah M Beeler
  3. Emanuel Flores-Bautista
  4. Nicholas S McCarty
  5. Tom Röschinger
  6. Nathan M Belliveau
  7. Michael J Sweredoski
  8. Annie Moradian
  9. Justin B Kinney
  10. Rob Phillips  Is a corresponding author
  1. California Institute of Technology, United States
  2. California Institute of Technology, United States
  3. Cold Spring Harbor Laboratory, United States

Abstract

Advances in DNA sequencing have revolutionized our ability to read genomes. However, even in the most well-studied of organisms, the bacterium Escherichia coli, for ≈ 65% of promoters we remain ignorant of their regulation. Until we crack this regulatory Rosetta Stone, efforts to read and write genomes will remain haphazard. We introduce a new method, Reg-Seq, that links massively-parallel reporter assays with mass spectrometry to produce a base pair resolution dissection of more than 100 E. coli promoters in 12 growth conditions. We demonstrate that the method recapitulates known regulatory information. Then, we examine regulatory architectures for more than 80 promoters which previously had no known regulatory information. In many cases, we also identify which transcription factors mediate their regulation. This method clears a path for highly multiplexed investigations of the regulatory genome of model organisms, with the potential of moving to an array of microbes of ecological and medical relevance.

Data availability

Sequencing data has been deposited in the SRA under accession no.PRJNA599253 and PRJNA603368Mass spectrometry data is deposited in the CalTech data repository at doi:10.22002/d1.1336Model files and inferred information footprints are deposited in the CalTech data repository at doi:10.22002/D1.1331Processed sequencing data sets and analysis software are available in the GitHub repository available at https://doi.org/10.5281/zenodo.3953312

The following data sets were generated

Article and author information

Author details

  1. William T Ireland

    Physics, California Institute of Technology, Pasadena, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-0971-2904
  2. Suzannah M Beeler

    Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-1930-4827
  3. Emanuel Flores-Bautista

    Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Nicholas S McCarty

    Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, United States
    Competing interests
    The authors declare that no competing interests exist.
  5. Tom Röschinger

    Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, United States
    Competing interests
    The authors declare that no competing interests exist.
  6. Nathan M Belliveau

    Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-1536-1963
  7. Michael J Sweredoski

    Proteome Exploration Laboratory, Division of Biology and Biological Engineering, Beckman Institute, California Institute of Technology, Pasadena, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-0878-3831
  8. Annie Moradian

    Proteome Exploration Laboratory, Division of Biology and Biological Engineering, Beckman Institute, California Institute of Technology, Pasadena, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-0407-2031
  9. Justin B Kinney

    Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-1897-3778
  10. Rob Phillips

    Department of Bioengineering, California Institute of Technology, Pasadena, United States
    For correspondence
    phillips@pboc.caltech.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-3082-2809

Funding

National Institutes of Health (Director's Pioneer Award)

  • Rob Phillips

National Institutes of Health (National Research Service Award,5T32GM007616-38)

  • Suzannah M Beeler

National Institutes of Health (Maximizing Investigators Research Award)

  • Rob Phillips

Howard Hughes Medical Institute (International Student Research Fellowship)

  • Nathan M Belliveau

National Institutes of Health (1S10OD02001301)

  • Annie Moradian

National Institutes of Health (1S10OD02001301)

  • Michael J Sweredoski

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

© 2020, Ireland et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 7,508
    views
  • 833
    downloads
  • 51
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. William T Ireland
  2. Suzannah M Beeler
  3. Emanuel Flores-Bautista
  4. Nicholas S McCarty
  5. Tom Röschinger
  6. Nathan M Belliveau
  7. Michael J Sweredoski
  8. Annie Moradian
  9. Justin B Kinney
  10. Rob Phillips
(2020)
Deciphering the regulatory genome of Escherichia coli, one hundred promoters at a time
eLife 9:e55308.
https://doi.org/10.7554/eLife.55308

Share this article

https://doi.org/10.7554/eLife.55308

Further reading

    1. Cell Biology
    2. Physics of Living Systems
    Marta Urbanska, Yan Ge ... Jochen Guck
    Research Article

    Cell mechanical properties determine many physiological functions, such as cell fate specification, migration, or circulation through vasculature. Identifying factors that govern the mechanical properties is therefore a subject of great interest. Here, we present a mechanomics approach for establishing links between single-cell mechanical phenotype changes and the genes involved in driving them. We combine mechanical characterization of cells across a variety of mouse and human systems with machine learning-based discriminative network analysis of associated transcriptomic profiles to infer a conserved network module of five genes with putative roles in cell mechanics regulation. We validate in silico that the identified gene markers are universal, trustworthy, and specific to the mechanical phenotype across the studied mouse and human systems, and demonstrate experimentally that a selected target, CAV1, changes the mechanical phenotype of cells accordingly when silenced or overexpressed. Our data-driven approach paves the way toward engineering cell mechanical properties on demand to explore their impact on physiological and pathological cell functions.

    1. Physics of Living Systems
    M Julia Maristany, Anne Aguirre Gonzalez ... Jerelle A Joseph
    Research Article

    Proteins containing prion-like low complexity domains (PLDs) are common drivers of the formation of biomolecular condensates and are prone to misregulation due to amino acid mutations. Here, we exploit the accuracy of our residue-resolution coarse-grained model, Mpipi, to quantify the impact of amino acid mutations on the stability of 140 PLD mutants from six proteins (hnRNPA1, TDP43, FUS, EWSR1, RBM14, and TIA1). Our simulations reveal the existence of scaling laws that quantify the range of change in the critical solution temperature of PLDs as a function of the number and type of amino acid sequence mutations. These rules are consistent with the physicochemical properties of the mutations and extend across the entire family tested, suggesting that scaling laws can be used as tools to predict changes in the stability of PLD condensates. Our work offers a quantitative lens into how the emergent behavior of PLD solutions vary in response to physicochemical changes of single PLD molecules.