Characterization of sequence determinants of enhancer function using natural genetic variation

  1. Marty G Yang
  2. Emi Ling
  3. Christopher J Cowley
  4. Michael E Greenberg  Is a corresponding author
  5. Thomas Vierbuchen  Is a corresponding author
  1. Harvard University, United States
  2. Rockefeller University, Howard Hughes Medical Institute, United States
  3. Memorial Sloan Kettering Cancer Center, United States

Abstract

Sequence variation in enhancers that control cell type-specific gene transcription contributes significantly to phenotypic variation within human populations. However, it remains difficult to predict precisely the effect of any given sequence variant on enhancer function due to the complexity of DNA sequence motifs that determine transcription factor (TF) binding to enhancers in their native genomic context. Using F1-hybrid cells derived from crosses between distantly related inbred strains of mice, we identified thousands of enhancers with allele-specific TF binding and/or activity. We find that genetic variants located within the central region of enhancers are most likely to alter TF binding and enhancer activity. We observe that the AP-1 family of TFs (Fos/Jun) are frequently required for binding of TEAD TFs and for enhancer function. However, many sequence variants outside of core motifs for AP-1 and TEAD also impact enhancer function, including sequences flanking core TF motifs and AP-1 half sites. Taken together, these data represent one of the most comprehensive assessments of allele-specific TF binding and enhancer function to date and reveal how sequence changes at enhancers alter their function across evolutionary timescales.

Data availability

We submitted our data to GEO, and it is now accessible via GSE193728.

The following data sets were generated
The following previously published data sets were used

Article and author information

Author details

  1. Marty G Yang

    Department of Neurobiology, Harvard University, Boston, United States
    Competing interests
    The authors declare that no competing interests exist.
  2. Emi Ling

    Department of Neurobiology, Harvard University, Boston, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-5287-0284
  3. Christopher J Cowley

    Laboratory of Mammalian Cell Biology and Development, Rockefeller University, Howard Hughes Medical Institute, New York, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Michael E Greenberg

    Department of Neurobiology, Harvard University, Boston, United States
    For correspondence
    meg@hms.harvard.edu
    Competing interests
    The authors declare that no competing interests exist.
  5. Thomas Vierbuchen

    Developmental Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, United States
    For correspondence
    vierbuct@mskcc.org
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-5690-5680

Funding

NIH Office of the Director (T32EY00711030)

  • Marty G Yang

NIH Office of the Director (T32AG000222)

  • Marty G Yang

National Science Foundation (DGE0946799)

  • Emi Ling

National Science Foundation (DGE1144152)

  • Emi Ling

NIH Office of the Director (R01 NS115965)

  • Michael E Greenberg

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. Stephen CJ Parker, University of Michigan, United States

Ethics

Animal experimentation: All animal experiments were approved by the National Institutes of Health and the Harvard Medical School Institutional Animal Care and Use Committee and were conducted in compliance with the relevant ethical regulations (Protocol # IS00000074-3)

Version history

  1. Preprint posted: December 18, 2021 (view preprint)
  2. Received: December 18, 2021
  3. Accepted: August 30, 2022
  4. Accepted Manuscript published: August 31, 2022 (version 1)
  5. Version of Record published: November 14, 2022 (version 2)

Copyright

© 2022, Yang et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 2,485
    views
  • 474
    downloads
  • 9
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Marty G Yang
  2. Emi Ling
  3. Christopher J Cowley
  4. Michael E Greenberg
  5. Thomas Vierbuchen
(2022)
Characterization of sequence determinants of enhancer function using natural genetic variation
eLife 11:e76500.
https://doi.org/10.7554/eLife.76500

Share this article

https://doi.org/10.7554/eLife.76500

Further reading

    1. Chromosomes and Gene Expression
    Rupam Choudhury, Anuroop Venkateswaran Venkatasubramani ... Axel Imhof
    Research Article

    Eukaryotic chromatin is organized into functional domains, that are characterized by distinct proteomic compositions and specific nuclear positions. In contrast to cellular organelles surrounded by lipid membranes, the composition of distinct chromatin domains is rather ill described and highly dynamic. To gain molecular insight into these domains and explore their composition, we developed an antibody-based proximity-biotinylation method targeting the RNA and proteins constituents. The method that we termed Antibody-Mediated-Proximity-Labelling-coupled to Mass Spectrometry (AMPL-MS) does not require the expression of fusion proteins and therefore constitutes a versatile and very sensitive method to characterize the composition of chromatin domains based on specific signature proteins or histone modifications. To demonstrate the utility of our approach we used AMPL-MS to characterize the molecular features of the chromocenter as well as the chromosome territory containing the hyperactive X-chromosome in Drosophila. This analysis identified a number of known RNA binding proteins in proximity of the hyperactive X and the centromere, supporting the accuracy of our method. In addition, it enabled us to characterize the role of RNA in the formation of these nuclear bodies. Furthermore, our method identified a new set of RNA molecules associated with the Drosophila centromere. Characterization of these novel molecules suggested the formation of R-loops in centromeres, which we validated using a novel probe for R-loops in Drosophila. Taken together, AMPL-MS improves the selectivity and specificity of proximity ligation allowing for novel discoveries of weak protein-RNA interactions in biologically diverse domains.

    1. Cancer Biology
    2. Chromosomes and Gene Expression
    Gregory Caleb Howard, Jing Wang ... William P Tansey
    Research Article

    The chromatin-associated protein WD Repeat Domain 5 (WDR5) is a promising target for cancer drug discovery, with most efforts blocking an arginine-binding cavity on the protein called the ‘WIN’ site that tethers WDR5 to chromatin. WIN site inhibitors (WINi) are active against multiple cancer cell types in vitro, the most notable of which are those derived from MLL-rearranged (MLLr) leukemias. Peptidomimetic WINi were originally proposed to inhibit MLLr cells via dysregulation of genes connected to hematopoietic stem cell expansion. Our discovery and interrogation of small-molecule WINi, however, revealed that they act in MLLr cell lines to suppress ribosome protein gene (RPG) transcription, induce nucleolar stress, and activate p53. Because there is no precedent for an anticancer strategy that specifically targets RPG expression, we took an integrated multi-omics approach to further interrogate the mechanism of action of WINi in human MLLr cancer cells. We show that WINi induce depletion of the stock of ribosomes, accompanied by a broad yet modest translational choke and changes in alternative mRNA splicing that inactivate the p53 antagonist MDM4. We also show that WINi are synergistic with agents including venetoclax and BET-bromodomain inhibitors. Together, these studies reinforce the concept that WINi are a novel type of ribosome-directed anticancer therapy and provide a resource to support their clinical implementation in MLLr leukemias and other malignancies.