Meiotic Recombination: Birth and death of a protein

The ways in which recombination sites are determined during meiosis are becoming clearer following a phylogenomic analysis for 225 different species.
  1. Julie Clément
  2. Bernard de Massy  Is a corresponding author
  1. CNRS-Université de Montpellier, France

Each of our cells carry two almost identical copies of each of our chromosomes, one copy inherited from each parent. These small differences, which are mainly caused by mutations, make an important contribution to the genetic diversity observed in humans and other species. During meiosis, the chromosomes from each parent pair up and then swap segments of DNA: this process, which is known as meiotic recombination, is important for diversity and is essential for fertility (Hunter, 2015). An important scientific goal is to understand where recombination occurs on chromosomes, what molecular processes are involved, and how meiotic recombination affects the evolution of the genome.

Many components of the molecular machinery involved in meiotic recombination are similar in fungi, plants and animals. However, some features of meiotic recombination have not been conserved across species. For example, in all yeast, plant and vertebrate species studied to date, meiotic recombination happens at specific regions within the chromosomes known as hotspots. In flies and worms, on the other hand, it happens at many more locations within the chromosomes. Moreover, there are two main pathways that direct meiotic recombination to hotspots.

The hotspots in yeast, plant and some vertebrate species are located at regions of the genome that are easier to access, such as regulatory regions, transcription start sites or CpG islands (regions with a high frequency of CG dinucleotides; de Massy, 2013). These hotspots are stable over evolutionary time (Lam and Keeney, 2015; Singhal et al., 2015) and this pathway is likely ancestral among eukaryotes.

In contrast, the hotspots in many mammals, including humans, apes and mice, are found at locations where a protein called PRDM9 can bind to the DNA (Figure 1). In particular, the binding sites (and, therefore, the location of the hotspots) are determined by the amino acid sequence of the zinc finger domain of PRDM9 and are characterized by histone modifications mediated by the PR/SET domain of PRDM9 (Baudat et al., 2010; Myers et al., 2010; Parvanov et al., 2010). However, over the course of time the binding sites have rapidly accumulated mutations that can disrupt the PRDM9 binding. This is correlated with the fast evolution of the zinc finger domain, which ensures that PRDM9 can maintain its binding activity in the genome.

The evolution of PRDM9 and meiotic recombination.

The full-length PRDM9 protein contains four domains: KRAB, SSXRD, PR/SET and ZnF (short for Zinc finger). PRDM9 plays an important role in meiotic recombination but, over the course of evolution, many species have either completely lost this protein (light grey line in the simplified phylogenetic tree to the left), or now carry truncated versions of it (dark grey lines). In species with full-length PRDM9 (top row), recombination hotspots are located at sites where PRDM9 binds to the DNA in the genome, and three residues in the PR/SET domain that are essential for the recombination process (Y276, Y341 and Y357; Wu et al., 2013) are conserved: moreover, the locations of the hotspots change over time due to the rapid evolution of the ZnF domain. In species with truncated PRDM9 (next four rows), recombination hotspots are located at transcription start sites (TSS) or at CpG islands, the three residues in the PR/SET domain are not conserved, and the ZnF domain (if present) does not evolve rapidly. In species that lack PRDM9 (bottom row), recombination hotspots are located at transcription start sites (TSS) or at CpG islands.

It is still only partly understood why such a molecular strategy has evolved and why the PRDM9 gene is present in some vertebrates but lost in others (Ponting, 2011). Now, in eLife, Molly Przeworski of Columbia University and colleagues – including Zachary Baker and Molly Schumer as joint first authors – report the results of a series of experiments that shed light on some of these questions (Baker et al., 2017).

By analyzing the DNA and RNA sequences of 225 different species, Baker et al. demonstrate that PRDM9 is not specific to mammals, and they show that even distantly related animals have genes that produce their own equivalents of the PRDM9 protein. They go on to show that while the zinc finger domain evolves rapidly in these species, the other three domains in PRDM9 (the KRAB, SSRXD and PR/SET domains) are conserved (Figure 1). This suggests that the role of PRDM9 appeared very early during the evolution of vertebrates and has been maintained in these different lineages.

Baker et al. – who are based at Columbia, Harvard, Texas A&M and the CICHAZ research station in Mexico – also discovered that several species had independently lost the ability to make PRDM9, while others produced only truncated versions of the protein. For example, the version of PRDM9 found in teleost fish does not contain a domain called the KRAB domain, which is found in full-length PRDM9: moreover, the zinc finger domain is not evolving rapidly in these fish, and three of the amino acids involved in the catalytic activity of PR/SET are not conserved (Figure 1). It appears that the truncated PRDM9 proteins have no influence on the location of recombination hotspots.

Baker et al. investigated this hypothesis by analyzing recombination sites in swordtail fish hybrids between two species of fish belonging to the genus Xiphophorus. In these hybrids, in which PRDM9 lacks both the KRAB and SSXRD domains, meiotic recombination occurs preferentially at CpG islands, similar to what has been reported for dogs and birds, which completely lack the gene for PRDM9 (Figure 1; Auton et al., 2013; Axelsson et al., 2012; Singhal et al., 2015). This suggests that the ancestral pathway of recombination (that is, a pathway that existed before the emergence of PRDM9) is active in swordtail fish. Moreover, since the gene for PRDM9 is still under selection in these fish, it must have other roles that remain to be determined. Thus, two important new conclusions emerge from the work of Baker, Schumer, Przeworski and colleagues. First, the full length PRDM9 is required for directing recombination, and second, a shorter version of the protein may still have a relevant role.

Overall, there is also a clear divide between the two hotspot localization pathways. However, several questions remain open, and more research is needed to discover why PRDM9 has evolved in the first place, and why it is maintained in some species but not in others. Answers to these puzzles may come from a better understanding of the underlying molecular properties and how the different pathways have affected the pattern of genetic diversity and selection.

References

Article and author information

Author details

  1. Julie Clément

    Institut de Génétique Humaine, CNRS-Université de Montpellier, Montpellier, France
    Competing interests
    The authors declare that no competing interests exist.
  2. Bernard de Massy

    Institut de Génétique Humaine, CNRS-Université de Montpellier, Montpellier, France
    For correspondence
    Bernard.De-Massy@igh.cnrs.fr
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-0950-2758

Publication history

  1. Version of Record published: July 20, 2017 (version 1)

Copyright

© 2017, Clément et al.

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 1,902
    views
  • 233
    downloads
  • 3
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Julie Clément
  2. Bernard de Massy
(2017)
Meiotic Recombination: Birth and death of a protein
eLife 6:e29502.
https://doi.org/10.7554/eLife.29502

Further reading

    1. Computational and Systems Biology
    2. Genetics and Genomics
    Weichen Song, Yongyong Shi, Guan ning Lin
    Tools and Resources

    We propose a new framework for human genetic association studies: at each locus, a deep learning model (in this study, Sei) is used to calculate the functional genomic activity score for two haplotypes per individual. This score, defined as the Haplotype Function Score (HFS), replaces the original genotype in association studies. Applying the HFS framework to 14 complex traits in the UK Biobank, we identified 3619 independent HFS–trait associations with a significance of p < 5 × 10−8. Fine-mapping revealed 2699 causal associations, corresponding to a median increase of 63 causal findings per trait compared with single-nucleotide polymorphism (SNP)-based analysis. HFS-based enrichment analysis uncovered 727 pathway–trait associations and 153 tissue–trait associations with strong biological interpretability, including ‘circadian pathway-chronotype’ and ‘arachidonic acid-intelligence’. Lastly, we applied least absolute shrinkage and selection operator (LASSO) regression to integrate HFS prediction score with SNP-based polygenic risk scores, which showed an improvement of 16.1–39.8% in cross-ancestry polygenic prediction. We concluded that HFS is a promising strategy for understanding the genetic basis of human complex traits.

    1. Genetics and Genomics
    2. Immunology and Inflammation
    Jean-David Larouche, Céline M Laumont ... Claude Perreault
    Research Article

    Transposable elements (TEs) are repetitive sequences representing ~45% of the human and mouse genomes and are highly expressed by medullary thymic epithelial cells (mTECs). In this study, we investigated the role of TEs on T-cell development in the thymus. We performed multiomic analyses of TEs in human and mouse thymic cells to elucidate their role in T-cell development. We report that TE expression in the human thymus is high and shows extensive age- and cell lineage-related variations. TE expression correlates with multiple transcription factors in all cell types of the human thymus. Two cell types express particularly broad TE repertoires: mTECs and plasmacytoid dendritic cells (pDCs). In mTECs, transcriptomic data suggest that TEs interact with transcription factors essential for mTEC development and function (e.g., PAX1 and REL), and immunopeptidomic data showed that TEs generate MHC-I-associated peptides implicated in thymocyte education. Notably, AIRE, FEZF2, and CHD4 regulate small yet non-redundant sets of TEs in murine mTECs. Human thymic pDCs homogenously express large numbers of TEs that likely form dsRNA, which can activate innate immune receptors, potentially explaining why thymic pDCs constitutively secrete IFN ɑ/β. This study highlights the diversity of interactions between TEs and the adaptive immune system. TEs are genetic parasites, and the two thymic cell types most affected by TEs (mTEcs and pDCs) are essential to establishing central T-cell tolerance. Therefore, we propose that orchestrating TE expression in thymic cells is critical to prevent autoimmunity in vertebrates.