Regulatory Networks: Reality check for transposon enhancers
First described over forty years ago, an enhancer is a genetic sequence that can 'switch on' a far away gene in certain tissues or at key points during development by interacting with the promoter for that gene. While promoters are generally conserved between organisms, enhancers are often unique to a given species, suggesting that they have evolved more recently (reviewed in Long et al., 2016).
One source of species-specific enhancers might be transposable elements, DNA sequences that can copy themselves into, or simply move to, another location in the genome. Many of these elements, known as endogenous retrovirus-like elements (ERVs), are derived from retroviruses whose genetic material has permanently colonized the genome of their hosts. In humans and mice, over 40% of chromosomal DNA is made of transposable elements. Although the vast majority are no longer capable of jumping, they are responsible for much of the genomic diversity across species.
To successfully spread through the genome, these sequences contain their own regulatory components, including enhancers and promoters. Whether cells have then ‘domesticated’ transposable elements for their own advantage – and in particular, whether certain sequences can act as dispersed ‘controlling elements’ in regulatory gene networks – has been a topic of interest for half a century (Britten and Davidson, 1969; Chuong et al., 2017), with this concept gaining momentum in the last decade.
Indeed, genome-wide studies have revealed that transposable elements can show traits associated with enhancers, such as being able to bind to transcription factors or displaying characteristic epigenetic and chromatin features (Kunarso et al., 2010; Chuong et al., 2013). These discoveries have fuelled models in which transposable elements are being co-opted to act as enhancers.
Enhancer-like epigenetic features and binding sites for transcription factors are particularly common in regions of ERVs called long terminal repeats. Still, the evidence that these elements have enhancer activity remains largely correlative. As with any putative enhancer, the challenge is now to go beyond analyses which demonstrate correlations and towards studies that rigorously validate that transposable elements can work as enhancers (as discussed in Chuong et al., 2017). Now, in eLife, Miguel Branco and colleagues at Queen Mary University of London, including Christopher Todd as first author, report that such assessment is, indeed, critically needed (Todd et al., 2019).
The team examined families of ERVs whose long terminal repeats can bind to transcription factors and which show the classic epigenetic features associated with enhancers, such as open chromatin and certain histone modifications (Figure 1A). In particular, they focused on elements that had been reported to contain binding sites for key transcription factors which are specific to mouse embryonic or trophoblast stem cells (Kunarso et al., 2010; Chuong et al., 2013; Sundaram et al., 2017). This allowed Todd et al. to identify putative enhancers overlapping with long terminal repeats (roughly 630 elements in embryonic stem cells and 360 in trophoblast stem cells). These elements are called ‘TE+ enhancers’ to distinguish them from traditional ‘TE- enhancers’, which do not share sequences with transposable elements. Most putative TE+ enhancers in embryonic stem cells have already been described, but Todd et al. highlight that these are more specific to certain types of cells than TE- enhancers.
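As a rough illustration of the overlap step described above, the sketch below labels candidate enhancers as TE+ or TE- depending on whether they overlap an annotated long terminal repeat. The `Interval` type, the `classify_enhancers` helper and all coordinates are hypothetical and are not taken from Todd et al.; a real analysis would rely on genome-interval tools and on the authors' own criteria.

```python
from typing import NamedTuple


class Interval(NamedTuple):
    """A genomic interval (hypothetical type, half-open coordinates)."""
    chrom: str
    start: int  # 0-based, inclusive
    end: int    # exclusive


def overlaps(a: Interval, b: Interval) -> bool:
    """Return True if the two intervals share at least one base."""
    return a.chrom == b.chrom and a.start < b.end and b.start < a.end


def classify_enhancers(enhancers, ltrs):
    """Label each candidate enhancer 'TE+' if it overlaps any LTR, else 'TE-'."""
    labels = {}
    for enh in enhancers:
        hit = any(overlaps(enh, ltr) for ltr in ltrs)
        labels[enh] = "TE+" if hit else "TE-"
    return labels


# Made-up coordinates, for illustration only.
enhancers = [
    Interval("chr1", 100, 500),
    Interval("chr1", 2000, 2400),
    Interval("chr2", 100, 500),
]
ltrs = [Interval("chr1", 450, 900), Interval("chr2", 600, 900)]

labels = classify_enhancers(enhancers, ltrs)
for enh, label in labels.items():
    print(enh.chrom, enh.start, enh.end, label)
```

The pairwise comparison is quadratic and only suitable for small examples; with genome-scale annotations, a sorted sweep or an interval tree would be the more usual choice.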
Plasmid-based reporter assays work by inserting a sequence of interest into a plasmid, and evaluating its impact on the expression of a reporter gene; these experiments have already demonstrated that, in vitro, transposable elements with certain transcription factor binding motifs could play the role of enhancers (Sundaram et al., 2017). Looking at long terminal repeats in which such assays had highlighted a potential enhancer activity, Todd et al. found that, in situ in the genome, only a third of them had chromatin features that were compatible with an enhancer role (Figure 1B). This means that specific sequence features are not enough to predict whether a transposable element works as an enhancer when in the genome: the broader chromatin context in which the element is embedded likely influences whether enhancer-like features can appear.
Since enhancers can act over large distances, Todd et al. took advantage of their previously published promoter chromatin-capture data to identify which genes the putative TE+ enhancers could target. Compared to TE- enhancers, only about 40% of TE+ enhancers were found to physically interact with at least one gene promoter (Figure 1C). These target genes were expressed almost exclusively in embryonic or trophoblast stem cells, which is consistent with the epigenetic profile of TE+ enhancers. In contrast, TE- enhancers tended to interact with genes expressed in a broader range of tissues; this highlights that transposable elements acquire their enhancer-like features in ways that are specific to a cell type.
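The proportion quoted above can be thought of as a simple count: the number of putative enhancers with at least one promoter contact, divided by the total number of putative enhancers. The sketch below shows that calculation on made-up data; the enhancer identifiers, gene names and the `fraction_with_promoter_contact` helper are hypothetical, and the way contacts are called in the actual promoter-capture data is not modelled here.

```python
def fraction_with_promoter_contact(enhancer_ids, contacts):
    """Fraction of enhancers that contact at least one promoter.

    enhancer_ids: list of enhancer identifiers (hypothetical).
    contacts: iterable of (enhancer_id, promoter_gene) pairs (hypothetical).
    """
    if not enhancer_ids:
        return 0.0
    # Enhancers that appear in at least one contact pair.
    contacted = {enh for enh, _gene in contacts}
    hits = sum(1 for enh in enhancer_ids if enh in contacted)
    return hits / len(enhancer_ids)


# Made-up example: five putative TE+ enhancers, two of which contact a promoter.
te_plus = ["e1", "e2", "e3", "e4", "e5"]
contacts = [
    ("e1", "GeneA"),
    ("e1", "GeneB"),  # one enhancer may contact several promoters
    ("e4", "GeneC"),
    ("e9", "GeneD"),  # contact from an enhancer outside the TE+ set; ignored
]

fraction = fraction_with_promoter_contact(te_plus, contacts)
print(f"{fraction:.0%} of TE+ enhancers contact at least one promoter")
```

An enhancer is counted once however many promoters it contacts, which matches the "at least one gene promoter" wording.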
Finally, Todd et al. harnessed a combination of specific CRISPR-Cas9 deletions and widespread CRISPR interference (Gilbert et al., 2013) to test how TE+ enhancers influenced the expression of the genes they target. The results showed that deleting putative enhancers did not always affect gene expression. In addition, when 76 putative enhancers belonging to the RLTR13D6 family were disrupted in embryonic stem cells, only three of their target genes showed a significant reduction in transcription (Figure 1D,E). Chromatin features and exogenous plasmid-based assays can help to map new candidate enhancer regions, but the work of Branco’s group shows that, alone, these approaches are not enough to confirm enhancer function.
This low validation rate reflects several difficulties that emerge when assessing if sequences with tantalizing epigenomic characteristics are indeed enhancers (discussed in Halfon, 2019). Recent work in humans has demonstrated that primate-specific long terminal repeats are also used as enhancers in human embryonic stem cells (Fuentes et al., 2018; Pontis et al., 2019). Unlike the mouse experiments of Todd et al., the human studies yielded a much higher proportion of putative TE+ enhancers with an impact on gene transcription upon in situ targeting with CRISPR interference. It is not clear whether these differences are due to variations in techniques and significance thresholds, or because humans and mice recruit families of long terminal repeats with enhancer-like roles at a different pace. Nonetheless, this body of work strengthens the theory that transposable elements can act as enhancers, while also highlighting that careful, in situ evaluation is required before any candidate region is given a definite enhancer role.
References
- Chuong et al. (2017). Regulatory activities of transposable elements: from conflicts to benefits. Nature Reviews Genetics 18:71–86. https://doi.org/10.1038/nrg.2016.139
- Sundaram et al. (2017). Functional cis-regulatory modules encoded by mouse-specific endogenous retrovirus. Nature Communications 8:14550. https://doi.org/10.1038/ncomms14550
Copyright
© 2019, Brind'Amour and Mager
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.