DNA Methylation: Epigenetics in the wild

  1. Adam J Bewick
  2. Robert J Schmitz   Is a corresponding author
  1. University of Georgia, United States

Individuals of the same species often carry the same genes but with slight differences. Each version of a gene is called an allele, and individuals with certain alleles can display certain traits or characteristics that will give them, within their local environment, a fitness advantage over individuals with different alleles. How genes underlie this ‘local adaptation’ and how natural selection shapes this process have been actively researched since the middle of the last century (Orr, 2005). In recent years, however, it has emerged that there are heritable traits that are not the direct result of differences in DNA sequences. These epigenetic variations can also provide the raw materials for natural selection to occur (Eitchen et al., 2014).

DNA molecules can be very long; as a result they are wrapped around proteins called histones so that they can be easily packed inside a cell's nucleus. Both DNA and the histone proteins can be chemically modified, and alleles with the same DNA sequence but different patterns of chemical modifications (called ‘epialleles’) can be passed between generations and contribute to complex traits or characteristics (Cortijo et al., 2014). Now in eLife, Magnus Nordborg and co-workers—from Austria, the US, the UK and Germany—have explored variations in DNA methylation among wild populations of a plant called Arabidopsis thaliana in Sweden to see how epigenetic variation is influenced by the local environment (Dubin et al., 2015).

Methylation is a chemical modification to DNA that inhibits the proliferation of selfish DNA elements (such as transposable elements) and helps regulate gene expression. Several proteins and enzymes work together in pathways to establish and maintain DNA methylation at sites with one of the following DNA sequences: CG, CHG (where H can be an A, T or C base) or CHH (Law and Jacobsen, 2010).

One pathway is responsible for ‘gene body methylation’, which involves the methylation of DNA within a large subset of genes, but only at CG sites. However, transposable elements can be methylated by multiple pathways: one pathway important to this study involves the enzyme CMT2, which methylates long or ‘deep’ transposable elements at CHG and CHH sites (Zemach et al., 2013; Stroud et al., 2014). In this case, ‘deep’ refers to transposable elements that are within tightly packed (or heterochromatic) regions of the genome.

Patterns of DNA methylation at regions within both genes and transposable elements vary extensively within and among natural Arabidopsis populations (Schmitz et al., 2013). However, the potential effects of this epigenetic variation on fitness and local adaptation remain unclear. Nordborg and co-workers—who include Manu Dubin, Pei Zhang, Dazhe Meng and Marie-Stanislas Remigereau as joint first authors—found that DNA methylation at CHH sites in transposable elements increases with temperature (Dubin et al., 2015, Figure 1A). Using CHH methylation as a trait, Nordborg and co-workers then conducted a genome-wide search and revealed that, in general, a lot of the variation in this trait could be explained by which allele the plant carried at a specific site called CMT2a (Figure 1B). This site is near a gene that encodes an enzyme called CMT2, which is known to methylate CHG and CHH sites in long transposable elements (Zemach et al., 2013; Stroud et al., 2014). All the plants examined carried one of two possible alleles (that differed by a single DNA base). Furthermore, plants with the less common of the two alleles—called the ‘non-reference allele’—typically had more CHH methylation than plants with the more common reference allele. Additional searches revealed another similar site nearby, called CMT2b. In this case plants with the rarer non-reference allele had less CHH methylation on average.

DNA methylation responds to temperature changes.

(A) Methylation at CHH sites across the whole genome is higher in plants that are grown at higher temperatures. (B) Individual plants with different genotypes (that is, in plants with different combinations of alleles) for the gene that encodes the CMT2 enzyme show different levels of CHH methylation in long transposable elements. For example plants with non-reference (nr) alleles at CMT2a and reference (r) alleles at CMT2b display a high rate of CHH methylation. (C) Plants originating from northern regions of Sweden (high latitude of origin) have higher levels of gene-body methylation and overall gene expression than plants originally from southern Sweden.

The two pairs of alleles were found in populations of Arabidopsis from both southern and northern Sweden, but the non-reference alleles were more common in southern regions. This may indicate that there is gene flow between populations, or that natural selection is still ‘in action’ and continues to select for one allele over the other but has not yet ‘fixed’ the alleles between populations. Together with other recent results (Shen et al., 2014), these latest findings indicate that the temperature-dependent CHH methylation is a flexible trait, and that certain alleles that encode the CMT2 enzyme may make plant genomes more responsive to environmental changes.

In addition to CHH methylation, Nordborg and co-workers also observed a correlation between gene body methylation and the latitude of origin (Figure 1C). Specifically, populations from northern regions had higher levels of gene body methylation. Genes that are more heavily methylated in the northern regions are expressed at higher levels compared to their less methylated counterparts in the south.

This work is a first step on the way to a full understanding of how environment and genetic makeup contribute to the variation in DNA methylation observed in wild populations. The work also suggests that genetic variation at enzymes involved in DNA methylation may provide some populations with an advantage to changing environmental conditions or seasons.

Future experiments, including moving wild plants between different populations and then assessing their fitness, would shed more light on DNA methylation and its role in local adaptation. Likewise, comparisons between individuals of different species could unveil other types of naturally occurring diversity and provide a wealth of genetic or genomic resources to help us better understand DNA methylation in the light of evolutionary biology. Furthermore, studies within a species could help determine how much variation in physical traits is controlled by DNA methylation variation as opposed to genetic variation. In this scenario each genome-wide difference in DNA methylation is used as a marker and tested for an association with the trait under study. However, differences in DNA methylation markers between different populations due to demographic factors would confound these studies, making it difficult to determine the underlying epigenetic variation that contributes to the traits.

Research into DNA methylation (and epigenetics in general) has only recently begun in the natural sciences. Understanding the relationship between an organism's fitness and its epigenetics, traits and environment represents a challenging, but fruitful, area of future research.


Article and author information

Author details

  1. Adam J Bewick

    Department of Genetics, University of Georgia, Athens, United States
    Competing interests
    The authors declare that no competing interests exist.
  2. Robert J Schmitz

    Department of Genetics, University of Georgia, Athens, United States
    For correspondence
    Competing interests
    The authors declare that no competing interests exist.

Publication history

  1. Version of Record published: May 5, 2015 (version 1)


© 2015, Bewick and Schmitz

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.


  • 2,141
    Page views
  • 354
  • 5

Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Adam J Bewick
  2. Robert J Schmitz
DNA Methylation: Epigenetics in the wild
eLife 4:e07808.

Further reading

    1. Computational and Systems Biology
    2. Genetics and Genomics
    Stella Tamana, Maria Xenophontos ... Petros Kountouris
    Research Article Updated

    Haemoglobinopathies are the commonest monogenic diseases worldwide and are caused by variants in the globin gene clusters. With over 2400 variants detected to date, their interpretation using the American College of Medical Genetics and Genomics (ACMG)/Association for Molecular Pathology (AMP) guidelines is challenging and computational evidence can provide valuable input about their functional annotation. While many in silico predictors have already been developed, their performance varies for different genes and diseases. In this study, we evaluate 31 in silico predictors using a dataset of 1627 variants in HBA1, HBA2, and HBB. By varying the decision threshold for each tool, we analyse their performance (a) as binary classifiers of pathogenicity and (b) by using different non-overlapping pathogenic and benign thresholds for their optimal use in the ACMG/AMP framework. Our results show that CADD, Eigen-PC, and REVEL are the overall top performers, with the former reaching moderate strength level for pathogenic prediction. Eigen-PC and REVEL achieve the highest accuracies for missense variants, while CADD is also a reliable predictor of non-missense variants. Moreover, SpliceAI is the top performing splicing predictor, reaching strong level of evidence, while GERP++ and phyloP are the most accurate conservation tools. This study provides evidence about the optimal use of computational tools in globin gene clusters under the ACMG/AMP framework.

    1. Cancer Biology
    2. Genetics and Genomics
    Minsu Kang, Hee Young Na ... Jong Seok Lee
    Research Article

    We aimed to elucidate the evolutionary trajectories of gallbladder adenocarcinoma (GBAC) using multi-regional and longitudinal tumor samples. Using whole-exome sequencing data, we constructed phylogenetic trees in each patient and analyzed mutational signatures. A total of 11 patients including 2 rapid autopsy cases were enrolled. The most frequently altered gene in primary tumors was ERBB2 and TP53 (54.5%), followed by FBXW7 (27.3%). Most mutations in frequently altered genes in primary tumors were detectable in concurrent precancerous lesions (biliary intraepithelial neoplasia, BilIN), but a substantial proportion was subclonal. Subclonal diversity was common in BilIN (n=4). However, among subclones in BilIN, a certain subclone commonly shrank in concurrent primary tumors. In addition, selected subclones underwent linear and branching evolution, maintaining subclonal diversity. Combined analysis with metastatic tumors (n=11) identified branching evolution in 9 patients (81.8%). Of these, 8 patients (88.9%) had a total of 11 subclones expanded at least 7-fold during metastasis. These subclones harbored putative metastasis-driving mutations in cancer-related genes such as SMAD4, ROBO1, and DICER1. In mutational signature analysis, 6 mutational signatures were identified: 1, 3, 7, 13, 22, and 24 (cosine similarity >0.9). Signatures 1 (age) and 13 (APOBEC) decreased during metastasis while signatures 22 (aristolochic acid) and 24 (aflatoxin) were relatively highlighted. Subclonal diversity arose early in precancerous lesions and clonal selection was a common event during malignant transformation in GBAC. However, selected cancer clones continued to evolve and thus maintained subclonal diversity in metastatic tumors.