Cooperative interactions enable singular olfactory receptor expression in mouse olfactory neurons


Abstract

The monogenic and monoallelic expression of only one out of >1000 mouse olfactory receptor (OR) genes requires the formation of large heterochromatic chromatin domains that sequester the OR gene clusters. Within these domains, intergenic transcriptional enhancers evade heterochromatic silencing and converge into interchromosomal hubs that assemble over the transcriptionally active OR. The significance of this nuclear organization in OR choice remains elusive. Here, we show that transcription factors Lhx2 and Ebf specify OR enhancers by binding in a functionally cooperative fashion to stereotypically spaced motifs that defy heterochromatin. Specific displacement of Lhx2 and Ebf from OR enhancers resulted in pervasive, long-range, and trans downregulation of OR transcription, whereas pre-assembly of a multi-enhancer hub increased the frequency of OR choice in cis. Our data provide genetic support for the requirement and sufficiency of interchromosomal interactions in singular OR choice and generate general regulatory principles for stochastic, mutually exclusive gene expression programs.

https://doi.org/10.7554/eLife.28620.001

Introduction

The mammalian main olfactory epithelium (MOE) provides an extreme example of cellular diversity orchestrated by the seemingly stochastic, monogenic, and monoallelic expression of a single olfactory receptor (OR) gene. Each mature olfactory sensory neuron (mOSN) in the MOE expresses only one OR that is chosen from a pool of more than two thousand alleles (Buck and Axel, 1991; Chess et al., 1994). The basis of the regulation of OR gene expression is chromatin-mediated transcriptional silencing followed by the stochastic de-repression and, thereby, transcriptional activation of a single OR allele that prevents the de-repression of additional OR genes (Dalton and Lomvardas, 2015; Monahan and Lomvardas, 2015). OR gene clusters are assembled into constitutive heterochromatin at early stages of OSN differentiation (Magklara et al., 2011), a process that represses OR transcription and preserves the monogenic and stochastic nature of OR expression (Lyons et al., 2014). Heterochromatic silencing is reinforced by the interchromosomal convergence of OR loci to OSN-specific, highly compacted nuclear bodies that assure complete transcriptional silencing of ORs in mOSNs (Clowney et al., 2012). Consequently, OR gene activation requires de-silencing by lysine demethylase Lsd1 (Lyons et al., 2013) and spatial segregation of the single chosen OR allele towards euchromatic nuclear territories (Armelin-Correa et al., 2014; Clowney et al., 2012). Translation of the newly transcribed OR mRNA activates a co-opted arm of the unfolded protein response (Dalton et al., 2013) and induces a feedback signal (Lewcock and Reed, 2004; Serizawa et al., 2005; Shykind et al., 2004) that turns off Lsd1, preventing the de-silencing and activation of additional OR genes (Lyons et al., 2013).

In the context of this repressive chromatin environment, OR gene choice requires the action of intergenic enhancers that escape heterochromatic silencing and activate the transcription of their proximal ORs (Khan et al., 2011; Markenscoff-Papadimitriou et al., 2014; Serizawa et al., 2003). These euchromatic enhancer ‘islands’, which we named after Greek Islands, engage in interchromosomal interactions with each other, and with the transcriptionally active OR allele, forming a multi-enhancer hub for OR transcription outside of the repressive OR foci (Clowney et al., 2012; Lomvardas et al., 2006; Markenscoff-Papadimitriou et al., 2014). The convergence of multiple Greek Islands to the chosen OR allele suggests that strong, feedback-eliciting OR gene transcription may be achieved only in the context of a multi-enhancer hub (Markenscoff-Papadimitriou et al., 2014). Yet, the molecular mechanisms that specify Greek Islands in the context of OR heterochromatin and, thus, enable their elaborate interactions during OSN differentiation remain unknown.

Here, we present a detailed molecular characterization of the Greek Islands, which revealed a common genetic signature and occupancy by shared sequence-specific transcription factors, allowing us, for the first time, to incapacitate them as a whole. ChIP-seq studies of FAC-sorted mOSNs revealed that most of the previously characterized Greek Islands, and several newly identified islands, are bound by two transcription factors: Lhx2 and Ebf. Computational analysis of the co-bound ChIP-seq peaks from Greek Islands revealed stereotypically positioned Lhx2 and Ebf binding sites that together constitute a ‘composite’ binding motif that affords cooperative binding in vivo. This motif is highly enriched in Greek Islands relative to OR promoters and Lhx2/Ebf co-bound sites genome-wide. Considering the prevalence and specificity of this composite motif in Greek Islands, we designed a synthetic ‘fusion’ protein that binds to this consensus sequence and not to individual Lhx2 or Ebf motifs in vitro. We found that overexpression of this fusion protein in mOSNs eliminated chromatin accessibility at most Greek Islands, and resulted in strong transcriptional downregulation of every OR, regardless of their genomic distance, or even their chromosomal linkage to a Greek Island. Finally, partial pre-assembly of a Greek Island hub in cis, by insertion of an array of 5 Greek Islands next to the Greek Island Rhodes, significantly increased the frequency of expression of Rhodes-linked OR genes. These manipulations provide genetic support for the requirement of trans enhancement in OR gene expression, and are consistent with the sufficiency of a multi-enhancer hub formation for OR gene choice.

Results

Greek Islands are co-bound by Lhx2 and Ebf

Greek Islands share a characteristic chromatin modification signature and in vivo footprints for transcription factors Lhx2 and Ebf (Markenscoff-Papadimitriou et al., 2014). To test the predicted binding of Lhx2 and Ebf, we performed ChIP-seq experiments using crosslinked chromatin prepared from FAC-sorted mOSNs, the neuronal population that stably expresses ORs in a singular fashion. To isolate mOSNs we FAC-sorted GFP⁺ cells from the MOEs of Omp-IRES-GFP knock-in mice, as previously described (Magklara et al., 2011). The Ebf antibody we used for these experiments cross-reacts with all 4 Ebf proteins, Ebf1-4 (data not shown), which are all highly expressed in the MOE. Because of the genetic redundancy of the Ebf genes in the MOE (Wang et al., 2004), and because the 4 Ebf members form homo- and hetero-dimers with identical sequence specificity (Wang et al., 1997), we did not attempt to further distinguish between the 4 paralogues. For Lhx2 ChIP-seq studies we used a custom-made antibody (Roberson et al., 2001). The specificity of these antibodies is supported by motif analysis of the Lhx2 and Ebf ChIP-seq experiments, which revealed that the Lhx2 and Ebf binding sites are the most highly enriched motifs, respectively (Figure 1A). Genome-wide, we identified 9024 peaks for Ebf and 16,311 Lhx2 peaks, with 4792 peaks being co-bound by both proteins (Figure 1B). Despite the in vivo recognition of an essentially identical motif in pro/pre-B cells (Györy et al., 2012; Kong et al., 2016; Treiber et al., 2010b), where Ebf acts as master regulator of B-cell differentiation (Mandel and Grosschedl, 2010), there is little overlap between the genome-wide binding of Ebf in mOSNs and B-cell progenitors (data not shown).
Genes proximal to Lhx2 and Ebf co-bound sites in mOSNs are statistically enriched for functions related to olfactory transduction and axonogenesis (Figure 1—figure supplement 1), consistent with a combinatorial role of these transcription factors in OSN differentiation and function (Hirota and Mombaerts, 2004; Wang et al., 1993; Wang et al., 2004; Wang et al., 1997).

Figure 1 (with 5 supplements)


Greek Islands represent Lhx2 and Ebf co-bound regions residing in heterochromatic OR clusters.

(A) The top sequence motif identified for mOSN ChIP-seq peaks is shown above sequence motifs generated from previously reported Lhx2 (Folgueras et al., 2013) and Ebf (Lin et al., 2010) ChIP-seq data sets. mOSN ChIP-seq peaks were identified using HOMER and motif analysis was run on peaks present in both biological replicates. (B) Overlap between mOSN Lhx2 and Ebf bound sites genome-wide. See Figure 1—figure supplement 2 for analysis of ChIP-seq signal on Ebf and Lhx2 co-bound sites within OR clusters. (C) Overlap between mOSN Lhx2 and Ebf bound sites within OR clusters. For each factor, co-bound sites are significantly more frequent within OR clusters than in the rest of the genome (p=5.702e⁻⁹ for Lhx2, p=1.6e⁻¹⁵ for Ebf, Binomial test). See Figure 1—figure supplement 1 for gene ontology analysis of peaks bound by Lhx2 and Ebf. (D) mOSN ATAC-seq and ChIP-seq signal tracks for three representative OR gene clusters. Values are reads per 10 million. Below the signal tracks, OR genes are depicted in red and non-OR genes are depicted in blue. Greek Island locations are marked. *Anafi* is a newly identified Greek Island, located in a small OR cluster upstream of the *Sfaktiria* cluster. See also Figure 1—figure supplement 3 and Supplementary file 1. For ATAC-seq, pooled data is shown from 4 biological replicates; for ChIP-seq, pooled data is shown from 2 biological replicates. For H3K9me3 ChIP-seq, input control signal is subtracted from ChIP signal prior to plotting. (E) mOSN ATAC-seq or ChIP-seq signal across 63 Greek Islands. Each row of the heatmap shows an 8 kb region centered on a Greek Island. Regions of high signal are shaded red. Mean signal across all elements is plotted above the heatmap, values are reads per 10 million. All heatmaps are sorted in the same order, based upon ATAC-seq signal. See also Figure 1—figure supplement 3 and Supplementary file 1.
For ATAC-seq, pooled data is shown from 4 biological replicates, for ChIP-seq, pooled data is shown from 2 biological replicates. See Figure 1—figure supplement 4 for a comparison of newly and previously identified Greek Islands, and Figure 1—figure supplement 5 for RNA-seq analysis of ORs with Greek Islands near the TSS. (F) mOSN ATAC-seq and ChIP-seq signal tracks on OR genes. Each row of the heatmap shows an OR gene scaled to 4 kb as well as the 2 kb regions upstream and downstream. Plots and heatmap are scaled the same as in Figure 1E.

https://doi.org/10.7554/eLife.28620.002

Figure 1—source code 1. R code for analysis of ChIP-seq data from mOSNs.r: https://doi.org/10.7554/eLife.28620.008
Figure 1—source code 2. R code for analysis of RNA-seq data from mOSNs.r: https://doi.org/10.7554/eLife.28620.009
Figure 1—source data 1. Lhx2 ChIP-seq signal by peak type.txt: https://doi.org/10.7554/eLife.28620.010
Figure 1—source data 2. Ebf ChIP-seq signal by peak type.txt: https://doi.org/10.7554/eLife.28620.011
Figure 1—source data 3. Transcript level of ORs grouped by presence of Greek Island in Promoter.txt: https://doi.org/10.7554/eLife.28620.012

The apparent coordinated binding of Lhx2 and Ebf to genomic DNA is exaggerated within the boundaries of heterochromatic OR clusters where individually bound peaks are rare and have low signal. Specifically, there are 63 peaks that are co-bound by both Lhx2 and Ebf, 2 Ebf-only, and 51 Lhx2-only peaks (Figure 1C) in the ~36 Mb of OR clusters, a significantly higher rate of overlap than the rate observed genome-wide (p=1.5e⁻¹⁵ and p=5.7e⁻⁹ for Ebf and Lhx2, respectively, Binomial test). Notably, most Ebf and Lhx2 co-bound sites in OR clusters have much stronger ChIP signal than singly bound sites (Figure 1—figure supplement 2A). Several of these co-bound sites within OR clusters are among the regions of highest ChIP-seq signal in the genome, suggesting that they are bound in a large fraction of mOSNs (Figure 1—figure supplement 2B–C), whereas individually bound peaks barely pass our peak-calling threshold. Co-bound sites within OR clusters coincide with 21 of the 35 previously characterized Greek Islands (Supplementary file 1). For example, visual inspection of three Greek Islands, Crete, Sfaktiria and Lipsi, revealed strong Lhx2 and Ebf binding despite the high levels of flanking H3K9me3 on these OR clusters (Figure 1D). ATAC-seq analysis in the same cellular population revealed increased chromatin accessibility at the exact genomic location of the Lhx2 and Ebf ChIP-seq peaks, but very little accessibility across the rest of the OR cluster (Figure 1D). Each of these sites also exhibits a reduction of the heterochromatic modifications, H3K9me3 and H3K79me3, over the body of the element, and locally increased levels of the active enhancer mark H3K27ac (Figure 1—figure supplement 3A). Overall, this chromatin signature is shared by the full set of Ebf and Lhx2 co-bound sites within OR gene clusters (Figure 1E and Figure 1—figure supplement 3B).
Thus, Lhx2/Ebf co-bound sites that do not correspond to the original Greek Islands (Supplementary file 1) likely represent additional, less frequently active Islands that were only detected here due to the increased sensitivity of our mOSN-specific analysis (Anafi in Figure 1D and Figure 1—figure supplement 4 for comparison between old and new Islands). In contrast, Greek Islands from the original set that lack Ebf and Lhx2 binding in mOSNs also deviate from the characteristic ‘epigenetic’ signature obtained from whole MOE experiments (Supplementary file 1). Thus, these sites are likely to be functionally distinct or active in a different population of cells within the MOE, and are not included within our revised set of Greek Islands.
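
The enrichment of co-bound sites within OR clusters can be illustrated with a one-sided exact binomial test. The minimal sketch below uses only the peak counts quoted in the text. It assumes that the genome-wide fraction of each factor's peaks that are co-bound serves as the background probability, and that each factor's peaks within OR clusters are the trials. The original analysis may have been parameterized differently, so the printed values are not expected to match the reported p-values exactly. All function and variable names are illustrative.

```python
from math import comb


def binomial_upper_tail(k, n, p):
    """Exact P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


# Peak counts quoted in the text.
GENOME_WIDE = {"Ebf": 9024, "Lhx2": 16311}  # all peaks per factor
GENOME_CO_BOUND = 4792                      # peaks bound by both factors
CLUSTER_CO_BOUND = 63                       # co-bound peaks in OR clusters
CLUSTER_SINGLE = {"Ebf": 2, "Lhx2": 51}     # factor-only peaks in OR clusters


def cluster_enrichment(factor):
    """Return (background co-bound rate, one-sided p-value) for one factor.

    Assumption: the background rate is the genome-wide co-bound fraction.
    """
    background = GENOME_CO_BOUND / GENOME_WIDE[factor]
    n_cluster = CLUSTER_CO_BOUND + CLUSTER_SINGLE[factor]
    p_value = binomial_upper_tail(CLUSTER_CO_BOUND, n_cluster, background)
    return background, p_value


for factor in ("Ebf", "Lhx2"):
    rate, p_value = cluster_enrichment(factor)
    print(f"{factor}: background co-bound rate {rate:.3f}, one-sided p = {p_value:.2e}")
```

Under these assumptions, the far smaller p-value for Ebf reflects that almost every Ebf peak inside an OR cluster is also an Lhx2 peak.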

OR gene promoters are also significantly enriched for predicted Lhx2 and Ebf binding sites (Clowney et al., 2011; Michaloski et al., 2006; Plessy et al., 2012; Young et al., 2011), and mutations of individual Ebf and Lhx2 sites have been shown to reduce OR expression in vivo (Rothman et al., 2005). However, as a whole, OR gene promoters are inaccessible and not bound by these transcription factors in mOSNs (Figure 1F). Specifically, only 10 OR promoters show significant binding of Ebf and Lhx2 within 500 bp of the TSS. Interestingly, these 10 ORs are expressed at levels similar to the median of OR expression (Figure 1—figure supplement 5 and Supplementary file 1). Thus, detection of Lhx2 and Ebf binding on these peaks is not explained by the unusually frequent transcriptional activation of their proximal ORs.

OR identity does not affect Greek Island accessibility

Based on the observation that most OR promoters display a complete lack of chromatin accessibility and Lhx2/Ebf binding, we asked if these promoters are accessible to transcription factors only in the OSNs that transcribe them. We FAC-sorted OSNs that express the same OR allele, by isolating GFP⁺ cells from Olfr17-IRES-GFP (Gogos et al., 2000), Olfr151-IRES-tauGFP (Bozza et al., 2002), and Olfr1507-IRES-GFP (Shykind et al., 2004) knock-in mice (Figure 2A,B), and performed ATAC-seq (Buenrostro et al., 2013). As expected, the promoters of Olfr1507, Olfr17 and Olfr151 are highly accessible when these genes are transcriptionally active (Figure 2C), consistent with local chromatin de-compaction being a prerequisite for OR gene transcription (Magklara et al., 2011). We also detect an increase in transposase accessibility at the 3′ UTR of transcriptionally active OR alleles, an unusual feature that is not characteristic of most transcriptionally active genes in OSNs (Figure 2C, Figure 2—figure supplement 1).

Figure 2 (with 2 supplements)


Greek Island accessibility is independent of OR promoter choice.

(A) GFP fluorescence (green) in MOE tissue sections from adult mice bearing *Olfr17-IRES-GFP*, *Olfr151-IRES-tauGFP*, or *Olfr1507-IRES-GFP* alleles. Nuclei are stained with DAPI (blue). (B) Representative FACS data for *Olfr-IRES-GFP* mice. Data is shown from *Olfr151-IRES-GFP* mice. Viable (DAPI negative), GFP+ cells were collected for ATAC-seq. (C) ATAC-seq signal tracks from GFP+ cells sorted from *Olfr17-IRES-GFP* (red), *Olfr151-IRES-GFP* (blue), or *Olfr1507-IRES-GFP* (green) mice. Values are reads per 10 million. The region spanning each targeted OR is shown for all three lines. See also Figure 2—figure supplement 1. Pooled data is shown for 2 biological replicates. (D) ATAC-seq signal over Greek Islands is shown for mOSNs and each *Olfr-IRES-GFP* line. All samples are sorted by signal in mOSNs. A blue arrow marks the H Enhancer, which is the Greek Island proximal to *Olfr1507*. A blue asterisk marks *Kimolos*, the Greek Island proximal to *Olfr151*, which has the strongest change in signal relative to mOSNs. See also Figure 2—figure supplement 2. Pooled data is shown for 4 biological replicates for mOSNs, and 2 biological replicates for each *Olfr-IRES-GFP* sorted population. (E) MA-plots showing fold change in ATAC-seq signal for each sorted *Olfr-IRES-GFP* population compared to mOSNs. Peak strength (normalized reads in peak) and fold change are shown for all ATAC-seq peaks; peaks that are not significantly changed are black and peaks that are significantly changed (FDR < 0.001) are gold. Greek Islands are plotted as larger dots and are shown in red if significantly changed. *Kimolos* is marked with an asterisk in Olfr151 expressing cells, and H is marked with an arrow in Olfr1507 expressing cells. See also Figure 2—figure supplement 2.

https://doi.org/10.7554/eLife.28620.013

Figure 2—source code 1. R code for analysis of ATAC-seq data from OR-IRES-GFP.r: https://doi.org/10.7554/eLife.28620.016
Figure 2—source data 1. ATAC-seq MA plot of mOSN versus Olfr17-ires-GFP.txt: https://doi.org/10.7554/eLife.28620.017
Figure 2—source data 2. ATAC-seq MA plot of mOSN versus Olfr151-ires-tauGFP.txt: https://doi.org/10.7554/eLife.28620.018
Figure 2—source data 3. ATAC-seq MA plot of mOSN versus Olfr1507-ires-GFP.txt: https://doi.org/10.7554/eLife.28620.019

In contrast to the differences between active and silent OR promoters, the overall pattern of accessibility of the Greek Islands is very similar in OSN populations that have chosen different ORs (Figure 2D). Very few Greek Islands display significantly different accessibility in the three OSN populations when compared to mOSNs (Figure 2E), and most fluctuations represent small but uniform shifts in Greek Island accessibility. For example, the H enhancer, which is proximal to Olfr1507 and is required for Olfr1507 expression, has a relatively strong ATAC-seq signal in all four cell populations and is not significantly stronger in Olfr1507+ cells than in mOSNs (Figure 2D,E, Figure 2—figure supplement 2A). However, we do note some evidence for differential activity of Greek Islands. In particular, Kimolos, the Greek Island proximal to Olfr151, has relatively weak ATAC-seq signal in mOSNs and in Olfr17+ and Olfr1507+ OSNs, but exhibits a nearly 10-fold increase in signal in Olfr151-expressing cells (Figure 2D,E, Figure 2—figure supplement 2B). Thus, it appears that a large number of Greek Islands are broadly accessible in most OSNs, irrespective of the identity of the chosen OR allele, whereas OR promoters are accessible only in the OSNs in which they are active.
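
For readers unfamiliar with MA-plots, the two coordinates plotted for each ATAC-seq peak can be sketched as follows. This is a generic illustration rather than the analysis code used here: the pseudocount, the function name, and the example read counts are all hypothetical, and no significance testing is shown.

```python
from math import log2


def ma_values(reference, sample, pseudocount=1.0):
    """Return (A, M) for one peak from two normalized read counts.

    A is the mean log2 signal (peak strength) and M is the log2 fold change
    of `sample` relative to `reference`. The pseudocount is an assumption
    that keeps log2 defined for peaks with zero reads.
    """
    log_reference = log2(reference + pseudocount)
    log_sample = log2(sample + pseudocount)
    return (log_reference + log_sample) / 2, log_sample - log_reference


# Hypothetical normalized counts: (mOSNs, sorted Olfr-IRES-GFP population).
example_peaks = {
    "hypothetical_unchanged_peak": (120.0, 118.0),
    "hypothetical_gained_peak": (15.0, 159.0),
}

for name, (mosn_signal, sorted_signal) in example_peaks.items():
    a_value, m_value = ma_values(mosn_signal, sorted_signal)
    print(f"{name}: A = {a_value:.2f}, M = {m_value:.2f}")
```

In this representation, a peak whose signal is about 10-fold higher in the sorted population lies roughly 3.3 log2 units above the horizontal axis, while unchanged peaks cluster around M = 0.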

Proximity of Lhx2/Ebf motifs correlates with binding on Greek Islands

What mechanism allows binding of Lhx2 and Ebf on Greek Islands but not OR promoters in most OSNs? We hypothesized that additional factors may bind specifically on Greek Islands but not on OR promoters, providing the functional distinction between the two types of regulatory elements. Motif analysis of the Lhx2 and Ebf ChIP-seq peaks using HOMER (Heinz et al., 2010) did not reveal additional known DNA binding sites that are shared by a significant portion of Greek Islands, other than Lhx2 and Ebf. De novo motif analysis, however, uncovered a novel, ‘composite’ motif that corresponds to Lhx2 and Ebf sites positioned next to each other (Figure 3A). This composite Lhx2/Ebf motif is structurally very similar to the numerous heterodimeric motifs identified by an in vitro screen for sequences that are co-bound by a variety of transcription factor combinations (Jolma et al., 2015). A stringent Lhx2/Ebf composite motif, with score over 10 (see Materials and methods), is found in 35 of the 63 Greek Islands (Figure 3—figure supplement 1A, Supplementary file 2). This motif is significantly enriched in Greek Islands in comparison with OR promoters and with Lhx2/Ebf co-bound peaks outside of OR clusters (Figure 3B). In aggregate, the 43 strong composite motifs found in Greek Islands reside exactly at a local depletion of the ATAC-seq signal from mOSNs, consistent with in vivo occupancy of these sequences by transcription factors (Figure 3C) as previously described (Buenrostro et al., 2015).
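
To make the notion of a motif 'score' concrete, the sketch below scans a sequence with a position weight matrix and reports the best-scoring site as a sum of per-position log-odds against a uniform background. It is a simplified illustration of this general class of scoring, not the procedure described in the Materials and methods: the 6 bp matrix, its probabilities, and the test sequence are hypothetical, only the forward strand is scanned, and the scores are not on the same scale as the thresholds of 5 and 10 used for the composite motif.

```python
from math import log

BACKGROUND = 0.25  # assumed uniform base frequency


def column(consensus_base, strength=0.85):
    """Hypothetical matrix column favouring one base over the other three."""
    other = (1.0 - strength) / 3
    return {base: (strength if base == consensus_base else other) for base in "ACGT"}


# Hypothetical 6 bp matrix with consensus TAATTG (illustration only).
TOY_PWM = [column(base) for base in "TAATTG"]


def log_odds(pwm, site):
    """Sum of ln(p / background) over the positions of one candidate site."""
    return sum(log(col[base] / BACKGROUND) for col, base in zip(pwm, site))


def best_site(pwm, sequence):
    """Return (score, start) of the highest-scoring forward-strand window."""
    width = len(pwm)
    windows = range(len(sequence) - width + 1)
    return max((log_odds(pwm, sequence[i:i + width]), i) for i in windows)


score, start = best_site(TOY_PWM, "GGCTAATTGCC")
print(f"best site starts at position {start} with log-odds score {score:.2f}")
```

A cumulative distribution of such best-site scores across a set of regions, as in Figure 3B, then shows what fraction of regions contain a site above any chosen cut-off.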

Figure 3 (with 1 supplement)


Greek Islands have stereotypically proximal Lhx2 and Ebf motifs.

(A) Sequence logo of the Greek Island composite motif (center). The mOSN ChIP-seq derived Lhx2 and Ebf motif logos are positioned above and below the corresponding regions of the composite motif. (B) Cumulative distribution plot of the score of the best composite motif site found in each of the 63 Greek Islands. Also plotted are cumulative distributions for co-bound sites outside of OR clusters and OR gene promoters. A score of 10 was selected as a stringent threshold for motif identification, and a score of 5 was selected for permissive motif identification. This motif is significantly enriched in Greek Islands relative to co-bound sites outside of OR clusters at both of these score cut-offs (Binomial test). See also Supplementary file 2. (C) Plot of the density of ATAC-seq fragment ends in the vicinity of Greek Island composite motif sites scoring over 10. Plot shows mean signal and standard error in 5 bp windows centered on 43 composite motif sites (yellow). (D) Multiple alignment of composite motif sequences from Greek Islands together with 20 bp of flanking sequence. Each base is shaded by nucleotide identity: A = green, C = blue, G = yellow, T = red. Top panel depicts composites with score over 10 and bottom panel depicts composites with score between 5 and 10, together with a sequence logo of the motif present in those sequences. See Figure 3—figure supplement 1 for sequences of strong and weak Greek Island composite motifs. (E) As in (D), except purines are shaded red and pyrimidines are shaded blue. (F) For each site, the distance (in base pairs) between the closest Ebf-Lhx2 motif pair was determined. For each set of sites, the distribution of distances is shown as a boxplot. Sets of sites comprising Greek Islands with a strong composite motif, Greek Islands without a strong composite motif, Ebf and Lhx2 co-bound sites genome-wide, and OR gene promoters are compared. Sites without an Ebf motif are excluded.
The distribution of distances between Ebf and Lhx2 motifs was significantly smaller for Greek Islands without a composite motif than for Ebf and Lhx2 bound sites genome-wide (two-sample, one-sided Kolmogorov–Smirnov test). See also Supplementary file 2. n = 25 for Greek Islands with Composite Score greater than 10; n = 21 for Greek Islands with Composite Score less than 10; n = 3805 for Co-bound sites genome wide; n = 521 for OR promoters.

https://doi.org/10.7554/eLife.28620.020

Figure 3—source code 1. R code for Motif Analysis.r: https://doi.org/10.7554/eLife.28620.022
Figure 3—source data 1. Composite Motif Score Cumulative Distribution.txt: https://doi.org/10.7554/eLife.28620.023
Figure 3—source data 2. Motif Proximity.txt: https://doi.org/10.7554/eLife.28620.024

Visual inspection of the aligned composite motifs revealed that the Ebf site is less constrained to stretches of C and G bases than solitary Ebf motifs, and instead tolerates stretches of pyrimidines and purines that retain a highly stereotypic spacing from the Lhx2 site (Figure 3D,E, top panel). Recent observations suggested that the relative positioning of DNA binding motifs compensates for the fluctuation of individual nucleotides in vivo (Farley et al., 2016). Similarly, the positioning of transcription factors on the face of the DNA double helix, as determined by the spacing between transcription factor binding sites, is more important than the relative strength of individual binding sites for the assembly of the IFN beta enhanceosome (Merika et al., 1998; Thanos and Maniatis, 1995). Thus, we asked if composite motifs with lower scores, which, predominantly, have degenerate Ebf motifs (Figure 3—figure supplement 1B), still meet these stereotypic constraints. Indeed, despite increased fluctuation at the nucleotide level, the stereotypic distribution between purines and pyrimidines is retained in composites with score above 5 (Figure 3D,E bottom panel), with a new total of 55 out of 63 Greek Islands having a composite motif under this less stringent cutoff. Moreover, of the 28 Greek Islands that lack a strong composite, 20 have an Ebf site that is juxtaposed to an Lhx2 site. The distance between Ebf and Lhx2 sites in these Greek Islands is significantly shorter than the distance between Ebf and Lhx2 sites in OR promoters and in co-bound peaks outside of OR gene clusters (Figure 3F). In total, 61/63 islands contain a composite motif and/or very proximal Lhx2 and Ebf binding sites (Supplementary file 2). Thus, although Lhx2 and Ebf frequently bind at the same genomic targets genome-wide, their binding on Greek Islands is restricted to stereotypically proximal Lhx2 and Ebf motifs.
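
The motif-proximity comparison can be sketched in two steps: find the closest Ebf-Lhx2 motif pair within each site, then compare the resulting distance distributions between sets of sites. The code below is an illustrative sketch under simplifying assumptions. Motif positions are treated as single coordinates, the example values are hypothetical, and only the one-sided Kolmogorov–Smirnov statistic is computed, not its p-value.

```python
from bisect import bisect_right


def closest_pair_distance(ebf_positions, lhx2_positions):
    """Smallest distance (bp) between any Ebf and any Lhx2 motif in one site.

    Returns None when either motif is absent, mirroring the exclusion of
    sites that lack an Ebf motif.
    """
    if not ebf_positions or not lhx2_positions:
        return None
    return min(abs(e - l) for e in ebf_positions for l in lhx2_positions)


def ks_one_sided(expected_shorter, expected_longer):
    """One-sided two-sample KS statistic D+ = max_x [F_shorter(x) - F_longer(x)].

    A large D+ indicates that the first sample is shifted towards smaller
    values than the second.
    """
    first, second = sorted(expected_shorter), sorted(expected_longer)
    return max(
        bisect_right(first, x) / len(first) - bisect_right(second, x) / len(second)
        for x in first + second
    )


# Hypothetical motif coordinates within one site.
print(closest_pair_distance([100, 250], [90, 400]))

# Hypothetical distance distributions for two sets of sites.
island_distances = [3, 5, 8, 12]
genome_wide_distances = [20, 40, 60, 150]
print(ks_one_sided(island_distances, genome_wide_distances))
```

The alternative is one-sided because the question posed is directional: whether motif pairs in Greek Islands lie closer together than those in the comparison set.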

Lhx2 is essential for Ebf binding on Greek Islands

An immediate prediction of our computational analyses is that Lhx2 and Ebf bind cooperatively to composite DNA binding motifs. In addition, Lhx2 and Ebf binding to these stereotypically spaced motifs may result in synergistic recruitment of coactivators that cannot be recruited by the individually bound proteins. In either case of functional cooperativity, deletion of either Lhx2 or Ebf should abolish the binding of the other transcription factor on Greek Islands. To test this, we deleted Lhx2 from mOSNs, using a conditional Lhx2 allele (Mangale et al., 2008) that we crossed to Omp-IRES-Cre mice. Deletion of Lhx2 with Omp-IRES-Cre results in loss of Lhx2 immunofluorescence (IF) signal from mOSNs, while Lhx2 protein levels are unaffected in progenitor and immature OSNs (Figure 4A). To enrich for Lhx2 KO mOSNs in our analyses, we introduced the Cre-inducible fluorescent reporter tdTomato (Madisen et al., 2010) to our genetic strategy and we FAC-sorted Tomato⁺ Lhx2-/- mOSNs. RNA-seq of the FAC-sorted cells verifies the deletion of the floxed exons in mOSNs and the generation of a mutant Lhx2 mRNA that does not encode Lhx2 protein (Figure 4—figure supplement 1). Lhx2 gene deletion results in significant downregulation of OR gene expression (Figure 4B), a result consistent with the partial deletion of a different floxed Lhx2 allele from mOSNs (Zhang et al., 2016). Furthermore, upon Lhx2 deletion the Lhx2 ChIP-seq signal is depleted genome-wide and from the Greek Islands (Figure 4C,D). Importantly, deletion of Lhx2 in mOSNs results in loss of Ebf binding from Lipsi (Figure 4C) and from nearly all other Greek Islands (Figure 4E). ATAC-seq on the Lhx2 KO OSNs also shows strong reduction of ATAC-peaks from Greek Islands (Figure 4F), suggesting that Lhx2 and Ebf co-binding on Greek Islands is essential for their sustained accessibility in this heterochromatic environment.
Consistent with the role of composite motifs on cooperative Lhx2 and Ebf binding, the effects of Lhx2 deletion on Ebf binding are weaker at co-bound sites outside the OR clusters compared to Greek Islands (Figure 4G, Figure 4—figure supplement 2). Interestingly, the general downregulation of OR gene transcription upon Lhx2 deletion extends to ORs that do not have Lhx2 motifs on their promoters (Figure 4—figure supplement 3), suggesting that Lhx2 activates OR transcription predominantly through the Greek Islands.

Figure 4 (with 3 supplements)


Lhx2 is required for Ebf binding predominantly on Greek Islands.

(A) Lhx2 immunofluorescence (IF) (green) in MOE sections from 3-week-old control (*Lhx2fl/fl*) and *Lhx2* KO (*Omp-IRES-Cre; Lhx2fl/fl*) mice. Nuclei are stained with DAPI (blue). The Lhx2 immunoreactive cells on the basal layers of the MOE represent immature OSNs and progenitors that have not yet turned on OMP (and thus Cre) expression. See also Figure 4—figure supplement 1 for demonstration of the Cre induced deletion at the mRNA level. (B) MA-plot of OR transcript levels in FAC-sorted *Lhx2* KO mOSNs (*Omp-IRES-Cre; Lhx2fl/fl; tdTomato*) compared to FAC-sorted control mOSNs (*Omp-IRES-GFP*). Red dots correspond to OR genes with statistically significant transcriptional changes (adjusted p-value<0.05). Three biological replicates were included for control mOSNs and 2 biological replicates were included for *Lhx2* KO mOSNs. (C) ChIP-seq and ATAC-seq signal tracks from FAC-sorted control mOSNs (*Omp-IRES-GFP*) and *Lhx2* KO mOSNs (*Omp-IRES-Cre; Lhx2fl/fl; tdTomato*) for the OR cluster containing the Greek Island Lipsi. Values are reads per 10 million. For ATAC-seq, pooled data from 4 biological replicates for control mOSNs are compared to data from 2 biological replicates for *Lhx2* KO mOSNs. For ChIP, pooled data is shown from 2 biological replicates. (D–F) Heatmaps depicting Lhx2 and Ebf ChIP-seq and ATAC-seq signal across Greek Islands for FAC-sorted control and *Lhx2* KO mOSNs for the samples described in C. (G) Log2 fold change in normalized Ebf ChIP-seq signal in *Lhx2* KO mOSNs relative to control mOSNs for Greek Islands (red), compared to sites genome-wide that are bound by Ebf-only or both Ebf and Lhx2 in wild-type mOSNs. Fold change was calculated using data from 2 biological replicates each of control mOSNs and *Lhx2* KO mOSNs. See also Figure 4—figure supplement 2 for MA-plot showing data for all peaks in each set and Figure 4—figure supplement 3 for RNA-seq analysis of the effect of Lhx2 KO on ORs with and without a promoter Lhx2 motif.

https://doi.org/10.7554/eLife.28620.025

Figure 4—source code 1. R Code for analysis of ChIP-seq data from Lhx2KO mOSNs.r: https://doi.org/10.7554/eLife.28620.029
Figure 4—source code 2. R code for analysis of RNA-seq data from Lhx2KO mOSNs.r: https://doi.org/10.7554/eLife.28620.030
Figure 4—source data 1. RNA-seq MA plot of Olfr Expression in mOSNs versus Lhx2KO.txt: https://doi.org/10.7554/eLife.28620.031
Figure 4—source data 2. Effect of Lhx2KO on Ebf ChIPSeq signal.txt: https://doi.org/10.7554/eLife.28620.032
Figure 4—source data 3. MA-plot of Ebf ChIP-seq in control mOSNs versus Lhx2KO mOSNs.txt: https://doi.org/10.7554/eLife.28620.033
Figure 4—source data 4. Change in OR expression in Lhx2KO mOSNs versus promoter motifs.txt: https://doi.org/10.7554/eLife.28620.034

Inhibition of Greek Islands inhibits OR transcription

Our data suggest that composite motifs are an ideal target for genetic manipulations that could inhibit the function of Greek Islands as a whole. We reasoned that if we could fuse Lhx2 and Ebf DNA binding domains (DBD) at a proper distance, we could generate a DNA binding peptide that has high affinity for the composite but not for individual motifs. Because the DNA binding specificity of homeobox genes is low and is influenced by their partners (Chan et al., 1994; Passner et al., 1999), the Lhx2 DBD could be easily incorporated in this design. Ebf, however, has high affinity and specificity for its cognate palindromic motif, where it binds as a dimer (Hagman et al., 1993; Hagman et al., 1995; Travis et al., 1993; Wang and Reed, 1993; Wang et al., 1997). Crystal structure of an Ebf1 homodimer bound to DNA revealed that each DBD monomer contacts both halves of the palindromic motif and forms a clamp-like structure that likely stabilizes DNA binding (Treiber et al., 2010a). Thus, in order to reduce Ebf affinity for DNA without affecting its sequence specificity, we fused only one Ebf DBD to the Lhx2 DBD with various flexible linkers. Fusion of the two DNA binding domains with a 20aa protein linker generated a protein with affinity for the composite motif but not for individual Lhx2 and Ebf sites in vitro (Figure 5A). Competition experiments demonstrate that only unlabeled oligos containing the composite, and not individual Lhx2 or Ebf motifs, can compete off the binding of the fusion protein to the composite motif at up to 100x molar excess (Figure 5B,C). Remarkably, insertion of only 2 DNA bases between the Lhx2 and the Ebf binding sites on the composite motif impairs its ability to compete with the wild type composite (Figure 5C). Further increase of the distance between the two sites essentially eliminates any competitive advantage the composite motif had over the individual Lhx2 and Ebf sites (Figure 5C). 
Thus, the fusion of the Lhx2 DBD to a single Ebf DBD creates a novel DNA binding protein that recognizes the composite motif with sensitivity to the stereotypical distance of the two individual DNA binding sites.

Figure 5 (with 3 supplements)

Displacement of Lhx2 and Ebf from Greek Islands shuts off OR transcription.

(A) Electrophoretic Mobility Shift Assay (EMSA) for binding of in vitro translated protein to DNA probes containing either an Ebf site, an Lhx2 site, or a composite site. Binding of three versions of the Fusion protein with either 5, 10, or 20 amino acid linker peptides was compared to full length Lhx2 or full length Ebf1. (B) EMSA for sequence selectivity of in vitro translated proteins. Binding of Fusion protein (20 aa linker), Ebf1, and Lhx2 to composite motif probe was competed with a 20-fold molar excess of unlabeled oligo containing either an Lhx2 site, Ebf site, or composite site. (C) EMSA for motif-spacing selectivity of in vitro translated proteins. Binding of Fusion protein (20 aa linker) was competed with 100-fold molar excess of unlabeled oligo containing either wild type composite sequence or mutant composite generated by the insertion of 2–14 base pairs in two base pair increments. In the last two lanes the competitors are either a single Lhx2 or a single Ebf site. (D) Schematic illustrating the proposed dominant-negative activity of the fusion protein for composite motif sites. See also Figure 5—figure supplement 1 for depiction of the genetic strategy for mOSN overexpression. (E) ATAC-seq and RNA-seq signal tracks from FAC-sorted control mOSNs and Fusion protein-expressing mOSNs for the OR cluster containing the Greek Island Lipsi. ATAC-seq values are reads per 10 million. RNA-seq values are reads per million. For ATAC-seq, pooled data from 4 biological replicates for control mOSNs are compared to data pooled from 2 independent founders of the Fusion Protein transgene. For RNA-seq, representative tracks are shown for one of three biological replicates for control mOSNs and for one of 2 independent founders for the Fusion Protein transgene. (F) ATAC-seq signal across the Greek Islands for control mOSNs and Fusion protein-expressing mOSNs. 
Pooled data from 4 biological replicates for control mOSNs are compared to data pooled from 2 independent founders of the Fusion Protein transgene. See Figure 5—figure supplement 2 for the effect of Fusion Protein expression on Ebf and Lhx2 sites genome-wide. (G) MA-plot (Dudoit and Fridlyand, 2002) of OR transcript levels in FAC-sorted mOSNs expressing fusion protein (*Omp-IRES-tTA; tetO-Fusion-2a-mcherry*) compared to FAC-sorted control mOSNs (*Omp-IRES-GFP*). Red dots correspond to OR genes with statistically significant transcriptional changes (adjusted p-value<0.05). Three biological replicates were included for control mOSNs and data from 2 independent founders were included for the Fusion Protein transgene. See Figure 5—figure supplement 3 for analysis of the effect of Fusion Protein expression on ORs grouped by the presence of Ebf and Lhx2 promoter motifs. (H) Violin plot of Log2 fold change in transcript levels of ORs (red) in mOSNs expressing fusion protein compared to control mOSNs. ORs are compared to additional sets of genes: genes with Ebf and Lhx2 bound within 1 kb of the TSS, genes with Lhx2-only bound within 1 kb of the TSS, genes with Ebf-only bound within 1 kb of the TSS, and non-OR genes without Ebf or Lhx2 binding. (I) As in (H), with Log2 fold change in transcript levels shown as a heatmap for each set of genes.

https://doi.org/10.7554/eLife.28620.035

Figure 5—source code 1 R code for analysis of ATAC-seq data from Fusion Protein mOSNs.r.: https://doi.org/10.7554/eLife.28620.039
Figure 5—source code 2 R code for analysis of RNA-seq data from Fusion Protein mOSNs.r.: https://doi.org/10.7554/eLife.28620.040
Figure 5—source data 1 RNA-seq MA plot of Olfr Expression in mOSNs versus Fusion Protein expressing mOSNs.txt.: https://doi.org/10.7554/eLife.28620.041
Figure 5—source data 2 RNA-seq Log2 fold change in mOSNs versus Fusion Protein Expressing mOSNs.txt.: https://doi.org/10.7554/eLife.28620.042
Figure 5—source data 3 Change in ATAC-seq signal in Fusion Protein expressing mOSNs by peak type.txt.: https://doi.org/10.7554/eLife.28620.043
Figure 5—source data 4 Change in OR expression in Fusion Protein expressing mOSNs versus promoter motifs.txt.: https://doi.org/10.7554/eLife.28620.044

To express the fusion protein in the MOE, we generated a transgenic construct under the control of the tetO promoter. This transgene includes a bi-cistronic mCherry reporter using the 2A peptide (Kim et al., 2011) (Figure 5—figure supplement 1A), which allows isolation of the transgene-expressing OSNs by FACS. We analyzed two independent founders, which we crossed to Omp-IRES-tTA knock-in mice (Gogos et al., 2000), to obtain expression of the fusion protein specifically in mOSNs (Figure 5—figure supplement 1B). We hypothesized that the fusion protein would compete with endogenous Lhx2 and Ebf for binding on composite motifs, acting as a repressor of the Greek Islands (Figure 5D). Indeed, ATAC-seq analysis shows a strong reduction of ATAC-seq signal from the Greek Islands upon expression of the fusion protein in mOSNs (Figure 5E,F), suggesting the displacement of the heterochromatin-resisting transcription factors from OR enhancers. Unfortunately, both the Lhx2 and the Ebf antibodies we used in our ChIP-seq experiments cross-react with the DBDs of the fusion protein (data not shown); thus, we could not confirm by ChIP-seq their displacement from the Greek Islands. However, RNA-seq analysis of the FAC-sorted mCherry+ cells revealed a significant reduction of OR transcription as a whole (Figure 5E,G). Although the repressing effect of the fusion protein does not extend to non-OR genes residing outside of OR clusters (Figure 5E), it has a ubiquitous repressive effect on OR transcription (Figure 5G). In fact, of the 500 most significantly downregulated genes, 482 are ORs (p<1e-313, hypergeometric test). In agreement with this, genome-wide analysis shows that while ORs are homogeneously repressed by the fusion protein, genes containing Ebf-, Lhx2-, or Ebf and Lhx2-bound promoters are, on average, transcriptionally unaffected (Figure 5H,I). 
Consistently, the effects of fusion protein overexpression on the ATAC signal of promoters bound by Lhx2 and/or Ebf are much weaker than the effects on Greek Islands (Figure 5—figure supplement 2). Finally, it is worth noting that, similarly to the effects of the Lhx2 deletion, the repressive effects of the fusion protein on OR transcription do not depend on the presence of Ebf and Lhx2 motifs on OR promoters (Figure 5—figure supplement 3), supporting the Greek Island-mediated repressive effects of the fusion protein.
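The enrichment statistic quoted above (482 ORs among the 500 most downregulated genes) is small enough that a naive floating-point calculation can underflow. The following is a minimal sketch of a one-sided hypergeometric tail computed in log space. The background sizes `TOTAL_GENES` and `OR_GENES` are hypothetical placeholders, not values from this study, so the printed number illustrates the calculation and is not a reproduction of the reported p-value.

```python
import math


def log_comb(n, k):
    """Natural log of the binomial coefficient C(n, k)."""
    return math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)


def log10_hypergeom_tail(k, total, successes, draws):
    """log10 of P(X >= k) for X ~ Hypergeometric(total, successes, draws)."""
    lo = max(k, draws - (total - successes), 0)
    hi = min(successes, draws)
    if lo > hi:
        return float("-inf")
    log_terms = [
        log_comb(successes, i)
        + log_comb(total - successes, draws - i)
        - log_comb(total, draws)
        for i in range(lo, hi + 1)
    ]
    # Log-sum-exp keeps the sum stable when every term is tiny.
    peak = max(log_terms)
    log_p = peak + math.log(sum(math.exp(t - peak) for t in log_terms))
    return log_p / math.log(10)


# Hypothetical background sizes (assumptions, not taken from the article).
TOTAL_GENES = 20000
OR_GENES = 1100

log10_p = log10_hypergeom_tail(482, TOTAL_GENES, OR_GENES, 500)
print(f"log10(p) = {log10_p:.1f}")
```

Working with log10(p) rather than p itself avoids reporting an underflowed zero when the tail probability falls below the smallest representable double.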

Multi-enhancer hubs activate OR transcription

The widespread downregulation of OR gene expression detected in Lhx2 KO and fusion protein expressing mOSNs suggests that the effects of Greek Island inhibition extend over large genomic distances, or even across chromosomes. Visual inspection of an isolated OR cluster on chromosome 16, which does not contain a Greek Island and is over 15 Mb away from the closest OR cluster with a Greek Island, supports the strong downregulation of ORs in trans (Figure 6—figure supplement 1A). Genome-wide, for both Lhx2 KO and fusion protein expressing mOSNs, there is a uniform reduction in OR expression regardless of the presence of a Greek Island in a cluster (Figure 6A,B). There is also a uniform reduction of OR expression independently of the distance between the OR and the closest Greek Island, and this reduction occurs irrespective of the motif content of OR promoters (Figure 6C,D). Moreover, comparable downregulation was observed for the ORs with a Greek Island in the promoter region (distance = 1) and for ORs that lack a Greek Island in cis (distance set to 1e+08) (Figure 6C,D). Thus, functional incapacitation of Greek Islands by two distinct genetic manipulations results in specific but pervasive disruption of OR expression irrespective of OR promoter sequence, OR distance from a Greek Island, presence of a Greek Island within the OR cluster, or even presence of a Greek Island within the same chromosome.
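The distance convention used in this analysis (1 for an OR that overlaps a Greek Island, 1e+08 for an OR on a chromosome without one) can be sketched as follows. The function name, the tuple layout, and the toy coordinates are illustrative assumptions. In particular, treating "overlap" as any intersection of the two intervals is a simplification, and this is not the analysis code distributed with the article.

```python
def distance_to_nearest_island(or_gene, islands,
                               overlap_value=1, no_island_value=1e8):
    """Distance in bp from an OR to the closest Greek Island in cis.

    or_gene: (chrom, start, end); islands: iterable of (chrom, start, end).
    Overlapping features get `overlap_value`; ORs on a chromosome with no
    Greek Island get `no_island_value`, mirroring the plotting convention.
    """
    chrom, start, end = or_gene
    same_chrom = [(s, e) for c, s, e in islands if c == chrom]
    if not same_chrom:
        return no_island_value
    best = None
    for s, e in same_chrom:
        if s <= end and start <= e:  # intervals intersect
            return overlap_value
        gap = s - end if s > end else start - e
        best = gap if best is None else min(best, gap)
    return best


# Toy coordinates (hypothetical, for illustration only).
islands = [("chr1", 1000, 1500), ("chr1", 90000, 90400), ("chr2", 500, 900)]
print(distance_to_nearest_island(("chr1", 3000, 4000), islands))
print(distance_to_nearest_island(("chr16", 100, 200), islands))
```

Using fixed sentinel values keeps every OR on a single log-scaled distance axis, including those with no Greek Island on the same chromosome.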

Figure 6 (with 1 supplement)

Downregulation of OR expression over large genomic distances.

(A) MA-plot of OR transcript levels in FAC-sorted *Lhx2* KO (*Omp-IRES-Cre; Lhx2fl/fl; tdTomato*) mOSNs compared to FAC-sorted control mOSNs (*Omp-IRES-GFP*). Gold dots correspond to OR genes with statistically significant transcriptional changes. ORs in clusters without a Greek Island are shown as large dots, with significantly changed ORs in red. Three biological replicates were included for control mOSNs and 2 biological replicates were included for *Lhx2* KO mOSNs. (B) MA-plot of OR transcript levels in FAC-sorted Fusion protein expressing (*Omp-IRES-tTA; tetO-Fusion-2a-mcherry*) mOSNs compared to FAC-sorted control mOSNs (*Omp-IRES-GFP*). Gold dots correspond to OR genes with statistically significant transcriptional changes. ORs in clusters without a Greek Island are shown as large dots, with significantly changed ORs in red. Three biological replicates were included for control mOSNs and data from 2 independent founders were included for the Fusion Protein transgene. See Figure 6—figure supplement 1 for an example OR cluster without a Greek Island. (C) Plot of OR distance from a Greek Island compared to Log2 Fold change in *Lhx2* KO mOSNs. ORs overlapping a Greek Island have distance set to 1. ORs on a chromosome without a Greek Island have distance set to 1e+08. (D) Plot of OR distance from a Greek Island compared to Log2 Fold change in Fusion Protein expressing mOSNs. ORs overlapping a Greek Island have distance set to 1. ORs on a chromosome without a Greek Island have distance set to 1e+08.

https://doi.org/10.7554/eLife.28620.045

Figure 6—source code 1 R code for analysis of RNA-seq data from Lhx2KO mOSNs.r.: https://doi.org/10.7554/eLife.28620.047
Figure 6—source code 2 R code for analysis of RNA-seq data from Fusion Protein mOSNs.r.: https://doi.org/10.7554/eLife.28620.048
Figure 6—source data 1 RNA-seq MA-plot of OR expression in Lhx2KO versus presence of Greek Island.txt.: https://doi.org/10.7554/eLife.28620.049
Figure 6—source data 2 RNA-seq MA-plot of OR expression in Fusion Protein versus presence of Greek Island.txt.: https://doi.org/10.7554/eLife.28620.050
Figure 6—source data 3 OR expression in Lhx2KO versus promoter motifs and distance to Greek Island.txt.: https://doi.org/10.7554/eLife.28620.051
Figure 6—source data 4 OR expression in Fusion Protein mOSNs versus promoter motifs and distance to Greek Island.txt.: https://doi.org/10.7554/eLife.28620.052

If trans interactions between Greek Islands are essential for OR transcription and the formation of a multi-enhancer hub over a stochastically chosen OR allele is the low probability event responsible for singular OR choice, then increasing the number of Greek Islands in an OR cluster should increase the expression frequency of the ORs in that cluster. To test this prediction, we introduced, by homologous recombination, an array of 5 Greek Islands (Lipsi, Sfaktiria, Crete, H and Rhodes, hereafter termed LSCHR) next to the endogenous Rhodes, a Greek Island from chromosome 1 (Figure 7A). This array comprised the ATAC-seq accessible core of each Greek Island (392–497 bp) together with 50 bp of endogenous flanking sequence (Supplementary file 6). We chose Rhodes for this manipulation for two reasons. First, the ATAC-seq and ChIP-seq signals on Rhodes are among the strongest of the 63 Greek Islands, which, combined with the almost complete local depletion of H3K9me3, suggests that it is accessible and bound by Lhx2 and Ebf in the majority of mOSNs. Thus, any transcriptional changes observed by this manipulation could not be attributed to increased Lhx2 and Ebf binding on this locus. Second, there are no additional Greek Islands within a genomic distance of over 80 Mb on chromosome 1; thus, formation of a Greek Island hub over this cluster requires recruitment of unlinked OR enhancers. We therefore reasoned that Rhodes-proximal ORs would be more responsive to the insertion of additional enhancers next to their local Greek Island than ORs residing on chromosomes with multiple Greek Islands.

Figure 7 (with 1 supplement)

Multi-enhancer hubs activate OR transcription.

(A) Targeted insertion of 5 Greek Islands (*LSCHR*) adjacent to *Rhodes*. Coordinates are mm10. See Figure 7—figure supplement 1 for ChIP-qPCR analysis of Lhx2 binding to the inserted Greek Islands. (B) RT-qPCR of OR transcript levels in MOEs of 3 week old LSCHR mice and wild-type littermate controls. Transcript levels are expressed as quantity relative to *Adcy3*; error bars are SEM. ORs are grouped by presence inside or outside the OR cluster containing *Rhodes*, and within each group ORs are ordered by level of expression in wild-type mice. *p<0.05, **p<0.01, two-tailed Student's t-test. For wild-type mice n = 3, for LSCHR heterozygous and homozygous mice n = 4. (C) Fluorescent RNA in situ hybridization with probe for *Olfr12* (green) in *LSCHR* homozygous and wild-type littermate control MOE at 2 weeks of age. Nuclei are labeled with DAPI (blue). (D) Fluorescent RNA in situ hybridization with probe for *Olfr1410* (green) in *LSCHR* homozygous and wild-type littermate control MOE at 2 weeks of age. Nuclei are labeled with DAPI (blue).

https://doi.org/10.7554/eLife.28620.053

To test whether transcription factors bind the Greek Islands comprising the LSCHR knock-in allele in vivo, we performed ChIP-qPCR for Lhx2 from whole MOE of mice homozygous for the LSCHR knock-in and wild-type controls. When normalized to percent input, which adjusts for the increased copy number of each Greek Island, we observe strong binding of Lhx2 to the LSCHR Greek Islands that is similar to the binding observed in wild-type mice (Figure 7—figure supplement 1A). If a reference set of external control sites is instead used for normalization, we detect an approximately two-fold increase in LSCHR Greek Island signal in the input control and in the Lhx2 ChIP (Figure 7—figure supplement 1B,C), consistent with the increase of the number of alleles of the 5 Greek Islands in the knock-in mice. Taken together, these data suggest that insertion of a Greek Island array next to Rhodes does not significantly increase the frequency of Lhx2 binding to Rhodes or to the transgenic Greek Island alleles.
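To see why percent-input normalization absorbs a change in copy number, consider a minimal sketch of the commonly used percent-input formula for ChIP-qPCR. It assumes ideal amplification (a doubling per cycle). The Ct values and the 1% input fraction are hypothetical, and the exact calculation used for the figure supplement is not specified here.

```python
import math


def percent_input(ct_ip, ct_input, input_fraction):
    """Percent of input recovered in the IP, assuming 100% PCR efficiency.

    input_fraction: fraction of chromatin saved as input (e.g. 0.01 for 1%).
    """
    # Shift the input Ct to what it would be for 100% of the chromatin.
    adjusted_input_ct = ct_input - math.log2(1.0 / input_fraction)
    return 100.0 * 2.0 ** (adjusted_input_ct - ct_ip)


# Hypothetical Ct values for one Greek Island amplicon.
wild_type = percent_input(ct_ip=22.0, ct_input=25.0, input_fraction=0.01)

# Doubling the template copy number lowers both Ct values by one cycle,
# so the ratio of IP to input is unchanged.
knock_in = percent_input(ct_ip=21.0, ct_input=24.0, input_fraction=0.01)

print(wild_type, knock_in)
```

Because the extra copies appear in both the IP and the input, the ratio stays constant unless occupancy per copy changes. Normalizing to external control sites instead leaves the copy-number difference visible in both samples.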

Despite the minimal effects of LSCHR insertion on transcription factor binding, we detect significant transcriptional upregulation of the OR alleles in the Rhodes cluster. qPCR analysis of cDNA prepared from the whole MOE of LSCHR knock-in mice and wild-type littermates shows strong transcriptional upregulation of the ORs in the Rhodes cluster that is almost doubled in homozygote knock-in mice in comparison to heterozygote littermates (Figure 7B). ORs from different clusters and non-OR genes are not strongly upregulated by this manipulation; however, four of the ORs in the Rhodes cluster are upregulated by more than 8-fold in the homozygote knock-in mice (Figure 7B). In fact, Olfr1412, which is the most upregulated OR in the Rhodes cluster, approaches mRNA levels comparable to Olfr1507, the most highly expressed OR in the MOE (Figure 7B). RNA FISH experiments demonstrate that this transcriptional upregulation represents an increase in frequency of choice, rather than an increase of transcription rates in each cell (Figure 7C,D). ORs from different clusters do not appear significantly affected by this genetic manipulation, a result that is not surprising since the trans effects of this enhancer array would be distributed to more than 1000 OR genes.
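As a worked illustration of expressing transcript levels relative to a reference gene, the sketch below uses the standard delta-Ct approximation, which assumes a perfect doubling per cycle. Whether the RT-qPCR data here were quantified this way or against a standard curve is not stated, and the Ct values are hypothetical.

```python
def relative_quantity(ct_target, ct_reference):
    """Target abundance relative to a reference gene (delta-Ct method)."""
    return 2.0 ** (ct_reference - ct_target)


def fold_change(rq_test, rq_control):
    """Ratio of relative quantities between two genotypes."""
    return rq_test / rq_control


# Hypothetical Ct values: an OR versus the reference gene Adcy3.
rq_wild_type = relative_quantity(ct_target=25.0, ct_reference=20.0)
rq_homozygous = relative_quantity(ct_target=22.0, ct_reference=20.0)

print(fold_change(rq_homozygous, rq_wild_type))
```

In whole-tissue measurements such as these, a higher relative quantity cannot by itself distinguish more expressing cells from more transcript per cell, which is why the in situ hybridization data are needed to interpret the change.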

Discussion

In most cell types interchromosomal interactions are rare and thus far appear to represent technical or biological noise (Nagano et al., 2015), rather than to provide a reliable mechanism for gene regulation. Various studies suggest that the majority of genomic interactions are restricted within topologically associated domains (TADs) that show little variation between different tissues (Dixon et al., 2012). Specific genomic interactions between TADs are infrequent, and interactions between different chromosomes are even less prominent (Lieberman-Aiden et al., 2009; Rao et al., 2014). However, in certain biological contexts, specific interchromosomal interactions are readily detected by imaging and genomic approaches, or have been inferred genetically. For example, during X chromosome inactivation, there is a ‘chromosome kissing’ step that occurs just before one of the two chromosomes is inactivated (Bacher et al., 2006; Masui et al., 2011; Xu et al., 2006). During T and B cell differentiation, interchromosomal interactions regulate antigen receptor choice and cellular differentiation (Hewitt et al., 2008; Spilianakis et al., 2005). The stochastic induction of the human IFN beta gene by virus infection requires the formation of interchromosomal interactions between the IFN beta enhancer and NF-kappa B-bound Alu repeats (Apostolou and Thanos, 2008). Finally, stochastic photoreceptor choice in Drosophila ommatidia is determined by DNA elements that, genetically, appear to communicate in trans (Johnston and Desplan, 2014). Thus, although interchromosomal interactions may not be involved in gene regulation in most cell types, their stochastic and infrequent nature may be ideal for the execution of non-deterministic and mutually exclusive regulatory processes like OR gene choice (Dekker and Mirny, 2016).

The involvement of interchromosomal interactions in OR gene choice was first postulated by the demonstration that the prototypical OR enhancer, the H enhancer (Serizawa et al., 2003), interacts in trans with transcriptionally active ORs (Lomvardas et al., 2006). The significance of these interactions was questioned as deletion of the H enhancer affected the expression of only three proximal ORs (Fuss et al., 2007; Nishizumi et al., 2007). Subsequently, however, additional OR enhancers, the Greek Islands, were discovered, to a current total of 63 elements. The striking similarities between these elements with regard to the transcription factors that bind to them, combined with the demonstration that Greek Islands form a complex network of interchromosomal interactions (Markenscoff-Papadimitriou et al., 2014), suggested that extensive functional redundancy may mask the effects of single or even double (Khan et al., 2011) enhancer deletions in trans. The non-redundant role of Greek Islands for the expression of certain ORs in cis may be attributed to the inability of some OR promoters to recruit enhancers from other chromosomes, making them completely dependent on the presence of a proximal enhancer for this function. In other words, even if trans enhancement is required for the activation of every OR gene, a fraction of them may depend on the assistance of a local Island for the recruitment of trans enhancers. Such qualitative promoter differences are consistent with the observation that enhancer deletions affect only some ORs in a cluster, and with the fact that certain ORs can be expressed as transgenic minigenes (Vassalli et al., 2002), while others can be expressed as transgenes only in the presence of an enhancer in cis (Serizawa et al., 2003). 
The proposed redundant function of Greek Islands as trans enhancers may have facilitated the rapid evolution of this gene family, which expanded dramatically during the transition from aquatic to terrestrial life (Niimura and Nei, 2007). Activation of OR transcription by Greek Islands in trans allows the functional expression of newly evolved OR alleles in mOSNs without a requirement for physical linkage to an enhancer, a property fully compatible with gene expansion through retrotransposition, segmental duplication, and chromosomal translocation. Thus, OR gene activation through non-deterministic genomic interactions in trans may provide a mechanism that ‘shuffles the deck’ and assures that a newly evolved OR allele will be expressed at a frequency similar to that of the existing OR repertoire.

Global and trans action of OR enhancers

A correlation between the formation of interchromosomal Greek Island hubs and OR transcription was previously established by ectopic expression of Lamin B receptor (Lbr) in mOSNs, and by conditional deletion of transcriptional co-activator Bptf, either of which caused reduction of Greek Island interactions in trans and pervasive OR downregulation (Clowney et al., 2012; Markenscoff-Papadimitriou et al., 2014). However, these manipulations have more general consequences that extend beyond the regulation of Greek Island interaction. For example, ectopic Lbr expression in mOSNs caused a general rearrangement of nuclear topology and disrupted the aggregation of OR clusters, making it difficult to distinguish between the effects on interchromosomal OR clustering and interchromosomal Greek Island interactions. Deletion of Bptf, on the other hand, although it only disrupted interchromosomal associations between Greek Islands, also caused a developmental arrest in the OSNs that may, or may not, be related to the failure to activate OR expression.

To minimize indirect effects that may confound the interpretation of these manipulations, we targeted a common and highly specific genetic signature among Greek Islands, the composite motif. This DNA sequence constitutes a remarkable example of highly constrained and stereotypically distributed transcription factor binding motifs that is shared between most Greek Islands, and is highly enriched relative to OR promoters and co-bound sites genome-wide. Overexpression of a ‘synthetic’ fusion protein that specifically recognizes the composite motif eliminated ATAC-seq signal from Greek Islands in mOSNs, suggesting that it displaced the endogenous Lhx2 and Ebf proteins on most OR enhancers. Similar observations were made for the conditional Lhx2 deletion, which also reduced the chromatin accessibility of Greek Islands and abolished Ebf binding to these elements. The strong and specific downregulation of the OR transcriptome in both Lhx2 knockout and fusion protein expressing mOSNs clearly reveals the critical and ubiquitous role of the Greek Islands as key regulators of OR expression. The fact that these transcriptional effects extend to ORs that have neither a Greek Island in cis nor Lhx2/Ebf motifs on their promoters is consistent with the role of Greek Islands as trans OR gene enhancers. However, it should be noted that we cannot exclude alternative interpretations of our data. For example, it is possible that there are additional intergenic OR enhancers with bona fide composite sites that are only utilized in neurons that express specific OR genes. Although our analysis in the 3 purified OSN populations expressing ORs Olfr1507, Olfr17 and Olfr171 does not support the existence of OR-specific Greek Islands, we cannot exclude the possibility that Greek Islands with such restricted activity exist in other OSN sub-populations. 
Furthermore, although our ATAC-seq analysis suggests that fusion protein overexpression affects predominantly Greek Islands, indirect effects on OR transcription cannot be excluded. Taking all these caveats into account, the fact that distinct genetic manipulations that target the Greek Islands, and their genomic interactions, cause widespread downregulation of OR expression, provides strong genetic support for the requirement of interchromosomal interactions in OR gene choice.

Same transcription factors, different chromatin states

The experimental demonstration that every Greek Island is co-bound by Lhx2 and Ebf, the same transcription factors predicted to bind on most OR promoters, is unexpected because of the fundamentally different chromatin states of the two types of regulatory elements in mOSNs. OR promoters are inaccessible in the mixed mOSN population, and only upon FAC-sorting cells that express the same OR could we obtain evidence for OR promoter accessibility. In contrast, the enhancers of OR genes appear accessible and bound by Lhx2 and Ebf in a large fraction of mOSNs. The stereotypically proximal positioning of Lhx2 and Ebf motifs on OR enhancers emerged as the key determinant for these differences, since the functionally cooperative binding of Lhx2 and Ebf on proximal motifs in vivo appears to counteract the propagating properties of the surrounding heterochromatin. Notably, cooperative binding between ‘phased’ Lhx2 motifs was recently proposed as an explanation for the increased frequency of expression of OR transgenes under the control of an artificial promoter (D'Hulst et al., 2016), although in this case the local chromatin state is not known. Interestingly, the composite Lhx2/Ebf motif that we identified on Greek Islands is structurally very similar to the numerous heterodimeric motifs identified by an in vitro screen for sequences that are co-bound by a variety of transcription factors (Jolma et al., 2015). Thus, the solution that was adopted by intergenic OR enhancers to generate heterochromatin-resistant binding sites may be generally utilized by other transcription factors in a variety of genomic contexts and regulatory needs. In support of this, the striking stereotypy of Lhx2 and Ebf motifs in Greek Islands, also known as ‘rigid motif grammar’ (Long et al., 2016), is reminiscent of the constrained spacing of transcription factor binding sites in the IFN beta enhanceosome (Panne et al., 2007; Thanos and Maniatis, 1995).

A multi-enhancer hub for robust and singular OR expression

The concept that Greek Islands may have stronger affinity for Lhx2 and Ebf than OR promoters immediately provides a molecular solution for the need for a multi-enhancer hub for stable and robust OR transcription. In the event that an OR promoter becomes de-silenced and occupied by Lhx2 and Ebf, singular or weak binding by these transcription factors will be unstable, due to the competing forces of flanking heterochromatin. However, if an OR promoter is surrounded by multiple strong sites of cooperative binding, like the ones we detect at high frequency on the Greek Islands, then every time Lhx2 and Ebf fall off an OR promoter, they will be sequestered by local, high affinity sites, which may also act as a replenishing source for these transcription factors. In other words, interchromosomal Greek Island hubs may create local regions of high Lhx2 and Ebf concentration that are essential for continuous binding on the low affinity sites of a chosen OR promoter and for high transcription rates.

Thus, we propose a model whereby multiple, individually weak components function in a coordinated and hierarchical fashion to activate OR transcription. According to this model, first, cooperative interactions between Lhx2 and Ebf result in stable binding to Greek Islands, which prevents flanking heterochromatin from spreading and silencing these intergenic elements. Because composite motifs are specifically enriched on Greek Islands, similar cooperative interactions between Lhx2 and Ebf cannot protect OR promoters from heterochromatic silencing (Figure 8A). Second, cooperative interactions between Greek Islands assemble numerous Lhx2 and Ebf elements into a multi-chromosomal enhancer hub (Figure 8B). When this hub forms stable interactions with a stochastically chosen OR allele in trans, heterochromatin is displaced, and cooperative enhancer-promoter interactions mediate stable Lhx2 and Ebf binding on the promoter and, therefore, transcriptional activation (Figure 8C). These cooperative interactions may be direct, homotypic interactions between Lhx2 and Ebf or facilitated by coactivator or mediator proteins that are recruited by these transcription factors. In either scenario, the same fundamental principles of cooperativity and synergy that govern the genetic switch between lysis and lysogeny in the lambda bacteriophage (Ptashne, 2009), and promote the formation and function of the human IFN beta enhanceosome (Thanos and Maniatis, 1995), may also regulate the formation of a 3-dimensional enhanceosome responsible for OR gene choice.

Figure 8

A Hierarchical Model for OR gene choice.

(A) Lhx2 and Ebf bind in a functionally cooperative fashion on the composite motifs of the Greek Islands. Because these motifs are not juxtaposed in most OR promoters, Lhx2 and Ebf cannot overcome the heterochromatic silencing of OR promoters, thus their binding is restricted to the OR enhancers. (B) Lhx2/Ebf bound OR enhancers are not strong enough to activate proximal OR alleles on their own and to facilitate stable transcription factor binding on their promoters. (C) Lhx2/Ebf bound Greek Islands form an interchromosomal, multi-enhancer hub that recruits coactivators essential for the de-silencing of OR promoters and robust transcriptional activation of the OR allele that would be recruited to this hub.

https://doi.org/10.7554/eLife.28620.055

A multi-enhancer hub model explains why the few OR promoters that are bound by Lhx2 and Ebf in a large fraction of mOSNs are not transcribed at higher frequencies than most ORs. It also may explain why 63 OR genes, one for each Greek Island, are not simultaneously expressed in each mOSN: if numerous enhancers must cooperate for OR transcription, individual promoters, and even individual enhancer-promoter combinations, may not be sufficient for OR transcription. But what prevents the formation of numerous multi-enhancer hubs, which could then activate more than one OR allele at a time? The answer to this critical question may be found in the transcriptional phenotype of the Rhodes knock-in mice, in which 6 Greek Islands reside in tandem. In these mice, we detect a significant increase in the frequency of OR choice that is not caused by increased occupancy by Lhx2. Thus, the increased frequency of local OR choice is likely explained by a mechanism subsequent to transcription factor binding: either the more efficient recruitment of an unknown limiting coactivator, or the recruitment of additional trans enhancers, culminating in the assembly of a more complex, transcription-competent Greek Island hub. Regardless of the mechanism by which LSCHR increases the frequency of OR choice within the Rhodes cluster, the local ORs remain silent in the vast majority of the mOSNs. This implies the existence of a strong ‘thresholding’ mechanism in the ability of Greek Islands to activate OR transcription, such that even 6 Islands acting together are inadequate to drive expression in most mOSNs. Thus, even if multiple enhancer hubs were to form in an mOSN nucleus, only the ones that surpass a critical number of interacting Greek Islands would lead to the activation of OR transcription. 
These non-linear properties are reminiscent of ‘super-enhancers’ (SE), which exhibit steep concentration threshold requirements attributed to phase transition processes (Hnisz et al., 2017). Thus, an attractive model suggests that Greek Island hubs may undergo phase transition when they exceed a steep threshold of interacting Greek Islands. If this phase transition is required for OR transcription, hubs with lower complexity will not be transcriptionally competent, providing an elegant molecular solution to the singularity of OR gene choice.
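The thresholding argument can be made concrete with a toy calculation. The sketch below models the probability that a hub is transcriptionally competent as a steep Hill-type function of the number of interacting Greek Islands. The threshold of 10 islands and the Hill coefficient of 8 are arbitrary illustrative choices, not measured or inferred parameters, and the functional form itself is an assumption.

```python
def activation_probability(hub_size, threshold=10.0, hill=8.0):
    """Toy Hill-type response: chance that a hub of `hub_size` islands
    is transcriptionally competent. Parameters are purely illustrative."""
    n = float(hub_size) ** hill
    return n / (threshold ** hill + n)


# Hubs with few islands stay essentially silent, whereas hubs above the
# (hypothetical) threshold become competent almost all of the time.
for size in (1, 6, 10, 15):
    print(size, round(activation_probability(size), 4))
```

Under these assumed parameters, a hub of 6 islands is competent in only a small percentage of cases. That is qualitatively in line with the observation that ORs near the tandem array are chosen more often yet remain silent in most mOSNs.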

Such a thresholding mechanism may be less strict in immature OSNs and progenitors, where low-level OR co-expression is detected by single-cell RNA-seq (Hanchate et al., 2015; Saraiva et al., 2015; Tan et al., 2015). Similar low-level co-expression is detected in Lbr-expressing mOSNs, where the nuclear aggregation of OR clusters is prevented and the chromatin accessibility of OR genes is increased (Clowney et al., 2012). Thus, it is possible that the differentiation-dependent silencing and aggregation of heterochromatic OR clusters into condensed nuclear foci contribute to this ‘all or none’ transcriptional paradigm. In other words, the extreme silencing forces imposed by mOSNs on OR genes may result in extraordinary requirements for OR transcription, which can only be met by an activating multi-enhancer assembly of unprecedented complexity and possibly unique biochemical properties. Thus, even if more than one multi-enhancer hub could form in a nucleus, the number of transcription-competent hubs would be extremely limited, if not singular. Combined with the kinetic restrictions imposed by the OR-elicited feedback signal and a recently reported post-choice refinement process (Abdus-Saboor et al., 2016), our model provides a mechanistic solution for the singular choice of one out of >2000 OR alleles.

Greek Islands represent Lhx2 and Ebf co-bound regions residing in heterochromatic OR clusters.

Greek island accessibility is independent of OR promoter choice.

Greek Islands have stereotypically proximal Lhx2 and Ebf motifs.

Lhx2 is required for Ebf binding predominantly on Greek Islands.

Displacement of Lhx2 and Ebf from Greek Islands shuts off OR transcription.

Downregulation of OR expression over large genomic distances.

Multi-enhancer hubs activate OR transcription.

A Hierarchical Model for OR gene choice.

Author details

Kevin Monahan

Ira Schieren

Jonah Cheung

Alice Mumbey-Wafula

Edwin S Monuki

Stavros Lomvardas
