Characterization of sequence determinants of enhancer function using natural genetic variation

  1. Marty G Yang
  2. Emi Ling
  3. Christopher J Cowley
  4. Michael E Greenberg  Is a corresponding author
  5. Thomas Vierbuchen  Is a corresponding author
  1. Harvard University, United States
  2. Rockefeller University, Howard Hughes Medical Institute, United States
  3. Memorial Sloan Kettering Cancer Center, United States

Abstract

Sequence variation in enhancers that control cell type-specific gene transcription contributes significantly to phenotypic variation within human populations. However, it remains difficult to predict precisely the effect of any given sequence variant on enhancer function due to the complexity of DNA sequence motifs that determine transcription factor (TF) binding to enhancers in their native genomic context. Using F1-hybrid cells derived from crosses between distantly related inbred strains of mice, we identified thousands of enhancers with allele-specific TF binding and/or activity. We find that genetic variants located within the central region of enhancers are most likely to alter TF binding and enhancer activity. We observe that the AP-1 family of TFs (Fos/Jun) are frequently required for binding of TEAD TFs and for enhancer function. However, many sequence variants outside of core motifs for AP-1 and TEAD also impact enhancer function, including sequences flanking core TF motifs and AP-1 half sites. Taken together, these data represent one of the most comprehensive assessments of allele-specific TF binding and enhancer function to date and reveal how sequence changes at enhancers alter their function across evolutionary timescales.

Data availability

We submitted our data to GEO, and it is now accessible via GSE193728.

The following data sets were generated
The following previously published data sets were used

Article and author information

Author details

  1. Marty G Yang

    Department of Neurobiology, Harvard University, Boston, United States
    Competing interests
    The authors declare that no competing interests exist.
  2. Emi Ling

    Department of Neurobiology, Harvard University, Boston, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-5287-0284
  3. Christopher J Cowley

    Laboratory of Mammalian Cell Biology and Development, Rockefeller University, Howard Hughes Medical Institute, New York, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Michael E Greenberg

    Department of Neurobiology, Harvard University, Boston, United States
    For correspondence
    meg@hms.harvard.edu
    Competing interests
    The authors declare that no competing interests exist.
  5. Thomas Vierbuchen

    Developmental Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, United States
    For correspondence
    vierbuct@mskcc.org
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-5690-5680

Funding

NIH Office of the Director (T32EY00711030)

  • Marty G Yang

NIH Office of the Director (T32AG000222)

  • Marty G Yang

National Science Foundation (DGE0946799)

  • Emi Ling

National Science Foundation (DGE1144152)

  • Emi Ling

NIH Office of the Director (R01 NS115965)

  • Michael E Greenberg

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Ethics

Animal experimentation: All animal experiments were approved by the National Institutes of Health and the Harvard Medical School Institutional Animal Care and Use Committee and were conducted in compliance with the relevant ethical regulations (Protocol # IS00000074-3)

Copyright

© 2022, Yang et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 2,791
    views
  • 505
    downloads
  • 13
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Marty G Yang
  2. Emi Ling
  3. Christopher J Cowley
  4. Michael E Greenberg
  5. Thomas Vierbuchen
(2022)
Characterization of sequence determinants of enhancer function using natural genetic variation
eLife 11:e76500.
https://doi.org/10.7554/eLife.76500

Share this article

https://doi.org/10.7554/eLife.76500

Further reading

    1. Chromosomes and Gene Expression
    2. Neuroscience
    Robyn D Moir, Emilio Merheb ... Ian M Willis
    Research Article

    Pathogenic variants in subunits of RNA polymerase (Pol) III cause a spectrum of Polr3-related neurodegenerative diseases including 4H leukodystrophy. Disease onset occurs from infancy to early adulthood and is associated with a variable range and severity of neurological and non-neurological features. The molecular basis of Polr3-related disease pathogenesis is unknown. We developed a postnatal whole-body mouse model expressing pathogenic Polr3a mutations to examine the molecular mechanisms by which reduced Pol III transcription results primarily in central nervous system phenotypes. Polr3a mutant mice exhibit behavioral deficits, cerebral pathology and exocrine pancreatic atrophy. Transcriptome and immunohistochemistry analyses of cerebra during disease progression show a reduction in most Pol III transcripts, induction of innate immune and integrated stress responses and cell-type-specific gene expression changes reflecting neuron and oligodendrocyte loss and microglial activation. Earlier in the disease when integrated stress and innate immune responses are minimally induced, mature tRNA sequencing revealed a global reduction in tRNA levels and an altered tRNA profile but no changes in other Pol III transcripts. Thus, changes in the size and/or composition of the tRNA pool have a causal role in disease initiation. Our findings reveal different tissue- and brain region-specific sensitivities to a defect in Pol III transcription.

    1. Biochemistry and Chemical Biology
    2. Chromosomes and Gene Expression
    Ting-Wen Chen, Hsiao-Wei Liao ... Chung-Te Chang
    Research Article

    The mRNA 5'-cap structure removal by the decapping enzyme DCP2 is a critical step in gene regulation. While DCP2 is the catalytic subunit in the decapping complex, its activity is strongly enhanced by multiple factors, particularly DCP1, which is the major activator in yeast. However, the precise role of DCP1 in metazoans has yet to be fully elucidated. Moreover, in humans, the specific biological functions of the two DCP1 paralogs, DCP1a and DCP1b, remain largely unknown. To investigate the role of human DCP1, we generated cell lines that were deficient in DCP1a, DCP1b, or both to evaluate the importance of DCP1 in the decapping machinery. Our results highlight the importance of human DCP1 in decapping process and show that the EVH1 domain of DCP1 enhances the mRNA-binding affinity of DCP2. Transcriptome and metabolome analyses outline the distinct functions of DCP1a and DCP1b in human cells, regulating specific endogenous mRNA targets and biological processes. Overall, our findings provide insights into the molecular mechanism of human DCP1 in mRNA decapping and shed light on the distinct functions of its paralogs.