Systematic analysis of naturally occurring insertions and deletions that alter transcription factor spacing identifies tolerant and sensitive transcription factor pairs
Abstract
Regulation of gene expression requires the combinatorial binding of sequence-specific transcription factors (TFs) at promoters and enhancers. Prior studies showed that alterations in the spacing between TF binding sites can influence promoter and enhancer activity. However, the relative importance of TF spacing alterations resulting from naturally occurring insertions and deletions (InDels) has not been systematically analyzed. To address this question, we first characterized the genome-wide spacing relationships of 73 TFs in human K562 cells as determined by ChIP-seq. We found a dominant pattern of a relaxed range of spacing between collaborative factors, including 45 TFs exclusively exhibiting relaxed spacing with their binding partners. Next, we exploited millions of InDels provided by genetically diverse mouse strains and human individuals to investigate the effects of altered spacing on TF binding and local histone acetylation. These analyses suggested that spacing alterations resulting from naturally occurring InDels are generally tolerated in comparison to genetic variants directly affecting TF binding sites. To experimentally validate this prediction, we introduced synthetic spacing alterations between PU.1 and C/EBPβ binding sites at six endogenous genomic loci in a macrophage cell line. Remarkably, collaborative binding of PU.1 and C/EBPβ at these locations tolerated changes in spacing ranging from 5-bp increase to >30-bp decrease. Collectively, these findings have implications for understanding mechanisms underlying enhancer selection and for the interpretation of non-coding genetic variation.
Data availability
All sequencing data generated during this study have been deposited in GEO under accession code GSE178080. For reviewer access, please go to https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE178080 and enter token inyjgyqcbrsnrwz into the box.
Article and author information
Author details
Funding
National Institutes of Health (DK091183)
- Christopher K Glass
National Institutes of Health (HL147835)
- Christopher K Glass
Leducq Transatlantic Network (16CVD01)
- Christopher K Glass
National Institutes of Health (T32DK007044)
- Thomas A Prohaska
American Heart Association (postdoctoral grant)
- Marten A Hoeksema
Netherlands Organization for Scientific Research (Rubicon grant)
- Marten A Hoeksema
Amsterdam Cardiovascular Sciences Institute (postdoctoral grant)
- Marten A Hoeksema
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Ethics
Animal experimentation: Bone marrow cells were isolated from femurs and tibias of Cas9-expressing transgenic mice (Jackson Laboratory, No.028555) housed at the University of California San Diego animal facility on a 12-hour/12-hour light/dark cycle with free access to normal chow food and water. All of the mice were handled according to approved institutional animal care and use committee (IACUC) protocols (S01015) of the University of California San Diego to minimize pain and suffering.
Copyright
© 2022, Shen et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 2,352
- views
-
- 261
- downloads
-
- 8
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Chromosomes and Gene Expression
In insects and mammals, 3D genome topology has been linked to transcriptional states yet whether this link holds for other eukaryotes is unclear. Using both ligation proximity and fluorescence microscopy assays, we show that in Saccharomyces cerevisiae, Heat Shock Response (HSR) genes dispersed across multiple chromosomes and under the control of Heat Shock Factor (Hsf1) rapidly reposition in cells exposed to acute ethanol stress and engage in concerted, Hsf1-dependent intergenic interactions. Accompanying 3D genome reconfiguration is equally rapid formation of Hsf1-containing condensates. However, in contrast to the transience of Hsf1-driven intergenic interactions that peak within 10–20 min and dissipate within 1 hr in the presence of 8.5% (v/v) ethanol, transcriptional condensates are stably maintained for hours. Moreover, under the same conditions, Pol II occupancy of HSR genes, chromatin remodeling, and RNA expression are detectable only later in the response and peak much later (>1 hr). This contrasts with the coordinate response of HSR genes to thermal stress (39°C) where Pol II occupancy, transcription, histone eviction, intergenic interactions, and formation of Hsf1 condensates are all rapid yet transient (peak within 2.5–10 min and dissipate within 1 hr). Therefore, Hsf1 forms condensates, restructures the genome and transcriptionally activates HSR genes in response to both forms of proteotoxic stress but does so with strikingly different kinetics. In cells subjected to ethanol stress, Hsf1 forms condensates and repositions target genes before transcriptionally activating them.
-
- Chromosomes and Gene Expression
The R-loop is a common transcriptional by-product that consists of an RNA-DNA duplex joined to a displaced strand of genomic DNA. While the effects of R-loops on health and disease are well established, there is still an incomplete understanding of the cellular processes responsible for their removal from eukaryotic genomes. Here, we show that a core regulator of chromosome architecture -the Smc5/6 complex- plays a crucial role in the removal of R-loop structures formed during gene transcription. Consistent with this, budding yeast mutants defective in the Smc5/6 complex and enzymes involved in R-loop resolution show strong synthetic interactions and accumulate high levels of RNA-DNA hybrid structures in their chromosomes. Importantly, we demonstrate that the Smc5/6 complex acts on specific types of RNA-DNA hybrid structures in vivo and promotes R-loop degradation by the RNase H2 enzyme in vitro. Collectively, our results reveal a crucial role for the Smc5/6 complex in the removal of toxic R-loops formed at highly transcribed genes and telomeres.