DNA Methylation: Epigenetics in the wild
Individuals of the same species often carry the same genes but with slight differences. Each version of a gene is called an allele, and individuals with certain alleles can display certain traits or characteristics that will give them, within their local environment, a fitness advantage over individuals with different alleles. How genes underlie this ‘local adaptation’ and how natural selection shapes this process have been actively researched since the middle of the last century (Orr, 2005). In recent years, however, it has emerged that there are heritable traits that are not the direct result of differences in DNA sequences. These epigenetic variations can also provide the raw materials for natural selection to occur (Eitchen et al., 2014).
DNA molecules can be very long; as a result they are wrapped around proteins called histones so that they can be easily packed inside a cell's nucleus. Both DNA and the histone proteins can be chemically modified, and alleles with the same DNA sequence but different patterns of chemical modifications (called ‘epialleles’) can be passed between generations and contribute to complex traits or characteristics (Cortijo et al., 2014). Now in eLife, Magnus Nordborg and co-workers—from Austria, the US, the UK and Germany—have explored variations in DNA methylation among wild populations of a plant called Arabidopsis thaliana in Sweden to see how epigenetic variation is influenced by the local environment (Dubin et al., 2015).
Methylation is a chemical modification to DNA that inhibits the proliferation of selfish DNA elements (such as transposable elements) and helps regulate gene expression. Several proteins and enzymes work together in pathways to establish and maintain DNA methylation at sites with one of the following DNA sequences: CG, CHG (where H can be an A, T or C base) or CHH (Law and Jacobsen, 2010).
One pathway is responsible for ‘gene body methylation’, which involves the methylation of DNA within a large subset of genes, but only at CG sites. However, transposable elements can be methylated by multiple pathways: one pathway important to this study involves the enzyme CMT2, which methylates long or ‘deep’ transposable elements at CHG and CHH sites (Zemach et al., 2013; Stroud et al., 2014). In this case, ‘deep’ refers to transposable elements that are within tightly packed (or heterochromatic) regions of the genome.
Patterns of DNA methylation at regions within both genes and transposable elements vary extensively within and among natural Arabidopsis populations (Schmitz et al., 2013). However, the potential effects of this epigenetic variation on fitness and local adaptation remain unclear. Nordborg and co-workers—who include Manu Dubin, Pei Zhang, Dazhe Meng and Marie-Stanislas Remigereau as joint first authors—found that DNA methylation at CHH sites in transposable elements increases with temperature (Dubin et al., 2015, Figure 1A). Using CHH methylation as a trait, Nordborg and co-workers then conducted a genome-wide search and revealed that, in general, a lot of the variation in this trait could be explained by which allele the plant carried at a specific site called CMT2a (Figure 1B). This site is near a gene that encodes an enzyme called CMT2, which is known to methylate CHG and CHH sites in long transposable elements (Zemach et al., 2013; Stroud et al., 2014). All the plants examined carried one of two possible alleles (that differed by a single DNA base). Furthermore, plants with the less common of the two alleles—called the ‘non-reference allele’—typically had more CHH methylation than plants with the more common reference allele. Additional searches revealed another similar site nearby, called CMT2b. In this case plants with the rarer non-reference allele had less CHH methylation on average.
The two pairs of alleles were found in populations of Arabidopsis from both southern and northern Sweden, but the non-reference alleles were more common in southern regions. This may indicate that there is gene flow between populations, or that natural selection is still ‘in action’ and continues to select for one allele over the other but has not yet ‘fixed’ the alleles between populations. Together with other recent results (Shen et al., 2014), these latest findings indicate that the temperature-dependent CHH methylation is a flexible trait, and that certain alleles that encode the CMT2 enzyme may make plant genomes more responsive to environmental changes.
In addition to CHH methylation, Nordborg and co-workers also observed a correlation between gene body methylation and the latitude of origin (Figure 1C). Specifically, populations from northern regions had higher levels of gene body methylation. Genes that are more heavily methylated in the northern regions are expressed at higher levels compared to their less methylated counterparts in the south.
This work is a first step on the way to a full understanding of how environment and genetic makeup contribute to the variation in DNA methylation observed in wild populations. The work also suggests that genetic variation at enzymes involved in DNA methylation may provide some populations with an advantage to changing environmental conditions or seasons.
Future experiments, including moving wild plants between different populations and then assessing their fitness, would shed more light on DNA methylation and its role in local adaptation. Likewise, comparisons between individuals of different species could unveil other types of naturally occurring diversity and provide a wealth of genetic or genomic resources to help us better understand DNA methylation in the light of evolutionary biology. Furthermore, studies within a species could help determine how much variation in physical traits is controlled by DNA methylation variation as opposed to genetic variation. In this scenario each genome-wide difference in DNA methylation is used as a marker and tested for an association with the trait under study. However, differences in DNA methylation markers between different populations due to demographic factors would confound these studies, making it difficult to determine the underlying epigenetic variation that contributes to the traits.
Research into DNA methylation (and epigenetics in general) has only recently begun in the natural sciences. Understanding the relationship between an organism's fitness and its epigenetics, traits and environment represents a challenging, but fruitful, area of future research.
References
-
Mapping the epigenetic basis of complex traitsScience 343:1145–1148.https://doi.org/10.1126/science.1248127
-
Epigenetics: beyond chromatin modifications and complex genetic regulationPlant Physiology 165:933–947.https://doi.org/10.1104/pp.113.234211
-
Establishing, maintaining and modifying DNA methylation patterns in plants and animalsNature Reviews Genetics 11:204–220.https://doi.org/10.1038/nrg2719
-
The genetic theory of adaptation: a brief historyNature Reviews Genetics 6:119–127.https://doi.org/10.1038/nrg1523
-
Non-CG methylation patterns shape the epigenetic landscape in ArabidopsisNature Structural & Molecular Biology 21:64–72.https://doi.org/10.1038/nsmb.2735
Article and author information
Author details
Publication history
Copyright
© 2015, Bewick and Schmitz
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 2,379
- views
-
- 367
- downloads
-
- 8
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Genetics and Genomics
- Microbiology and Infectious Disease
The sustained success of Mycobacterium tuberculosis as a pathogen arises from its ability to persist within macrophages for extended periods and its limited responsiveness to antibiotics. Furthermore, the high incidence of resistance to the few available antituberculosis drugs is a significant concern, especially since the driving forces of the emergence of drug resistance are not clear. Drug-resistant strains of Mycobacterium tuberculosis can emerge through de novo mutations, however, mycobacterial mutation rates are low. To unravel the effects of antibiotic pressure on genome stability, we determined the genetic variability, phenotypic tolerance, DNA repair system activation, and dNTP pool upon treatment with current antibiotics using Mycobacterium smegmatis. Whole-genome sequencing revealed no significant increase in mutation rates after prolonged exposure to first-line antibiotics. However, the phenotypic fluctuation assay indicated rapid adaptation to antibiotics mediated by non-genetic factors. The upregulation of DNA repair genes, measured using qPCR, suggests that genomic integrity may be maintained through the activation of specific DNA repair pathways. Our results, indicating that antibiotic exposure does not result in de novo adaptive mutagenesis under laboratory conditions, do not lend support to the model suggesting antibiotic resistance development through drug pressure-induced microevolution.
-
- Computational and Systems Biology
- Genetics and Genomics
Enhancers and promoters are classically considered to be bound by a small set of transcription factors (TFs) in a sequence-specific manner. This assumption has come under increasing skepticism as the datasets of ChIP-seq assays of TFs have expanded. In particular, high-occupancy target (HOT) loci attract hundreds of TFs with often no detectable correlation between ChIP-seq peaks and DNA-binding motif presence. Here, we used a set of 1003 TF ChIP-seq datasets (HepG2, K562, H1) to analyze the patterns of ChIP-seq peak co-occurrence in combination with functional genomics datasets. We identified 43,891 HOT loci forming at the promoter (53%) and enhancer (47%) regions. HOT promoters regulate housekeeping genes, whereas HOT enhancers are involved in tissue-specific process regulation. HOT loci form the foundation of human super-enhancers and evolve under strong negative selection, with some of these loci being located in ultraconserved regions. Sequence-based classification analysis of HOT loci suggested that their formation is driven by the sequence features, and the density of mapped ChIP-seq peaks across TF-bound loci correlates with sequence features and the expression level of flanking genes. Based on the affinities to bind to promoters and enhancers we detected five distinct clusters of TFs that form the core of the HOT loci. We report an abundance of HOT loci in the human genome and a commitment of 51% of all TF ChIP-seq binding events to HOT locus formation thus challenging the classical model of enhancer activity and propose a model of HOT locus formation based on the existence of large transcriptional condensates.