Genetics: The next step in Mendelian randomization

Expanding a statistical approach called Mendelian randomization to include multiple variables may help researchers to identify new molecular causes of specific traits.
  1. Matthias Weith
  2. Andreas Beyer (corresponding author)
  1. Cologne Excellence Cluster on Cellular Stress Responses in Age‐Associated Diseases, and the Institute for Biochemistry, University of Cologne, Germany
  2. Cologne Excellence Cluster on Cellular Stress Responses in Age‐Associated Diseases, the Faculty of Medicine and University Hospital of Cologne, the Center for Molecular Medicine Cologne, and the Institute for Genetics, University of Cologne, Germany

Understanding how variations in our genome influence our susceptibility to diseases is one of the most compelling research topics in the life sciences. Researchers have used genome-wide association studies – experiments that analyze the DNA sequences of multiple individuals – to identify statistical relationships between genetic variants and specific human traits, such as susceptibility to a disease or various body parameters.

Despite the success of this approach, major challenges persist. First, associations between variants that are located close to each other within the genome can make it difficult to determine which of these genetic changes are responsible for the phenotype of interest (a problem called linkage disequilibrium). Second, even if specific variants can be identified, it is often not straightforward to determine the molecular mechanism by which they impact the trait (Tam et al., 2019).

To overcome these difficulties, studies often include information about other modalities such as transcriptomes, proteins and metabolites (Emilsson et al., 2008; Fraser and Xie, 2009; Nicolae et al., 2010; Wainberg et al., 2019; Schadt, 2009; Suhre et al., 2011). Some ‘multi-omic’ studies use one modality, or ‘layer’, to confirm changes to another, such as confirming changes in levels of mRNA by measuring the respective protein product. However, there is a shortage of examples of mechanistic links between the different layers (Buccitelli and Selbach, 2020; Wörheide et al., 2021). Now, in eLife, Zoltán Kutalik, Eleonora Porcu and colleagues from the Swiss Institute of Bioinformatics and the University of Lausanne – including Chiara Auwerx as first author – report a new approach that uses a technique called Mendelian randomization to reveal a chain of molecular connections between the transcriptome, metabolome, and high-level physiological traits such as biomarkers associated with kidney health (Auwerx et al., 2023).

Mendelian randomization is considered to be an ‘experiment of nature’, as it uses variations already present in the genetic code to determine if exposure to certain conditions (such as the amount of cholesterol in the blood, or the expression level of a gene) affects a specific trait (for instance, increased susceptibility to heart disease). The genetic variants act as a proxy, or ‘instrument’, for exposures that are difficult or impossible to manipulate in the population being studied. Mediation analysis can then be applied to ask if the exposure is responsible for the effects of the instrumental variable on the trait of interest. However, it is necessary to proceed carefully (Sanderson et al., 2022): for example, the instrumental variable being used should not affect the trait of interest through any other mediator.
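As a minimal sketch of how a genetic instrument yields a causal estimate, the snippet below computes the Wald ratio for a single variant and a fixed-effect inverse-variance weighted (IVW) estimate across several independent variants. These are standard summary-statistic estimators chosen for illustration, not necessarily the estimator used by Auwerx et al.; the function names and all numbers are hypothetical.

```python
from math import sqrt


def wald_ratio(beta_exposure, beta_outcome):
    """Causal estimate from one instrument: variant-outcome effect / variant-exposure effect."""
    if beta_exposure == 0:
        raise ValueError("instrument has no effect on the exposure")
    return beta_outcome / beta_exposure


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect IVW estimate and its standard error across independent instruments."""
    weights = [1.0 / se ** 2 for se in se_outcome]
    numerator = sum(w * bx * by for w, bx, by in zip(weights, beta_exposure, beta_outcome))
    denominator = sum(w * bx ** 2 for w, bx in zip(weights, beta_exposure))
    return numerator / denominator, sqrt(1.0 / denominator)


# Hypothetical summary statistics for three independent variants.
bx = [0.20, 0.10, 0.30]  # variant -> exposure effects
by = [0.10, 0.05, 0.15]  # variant -> outcome effects (constructed as 0.5 * bx)
se = [0.02, 0.02, 0.03]  # standard errors of the variant -> outcome effects

estimate, std_error = ivw_estimate(bx, by, se)
print(f"IVW causal estimate: {estimate:.3f} (SE {std_error:.3f})")
```

The estimate is only interpretable as causal if the instrument assumptions discussed above hold, in particular that the variants influence the outcome solely through the exposure.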

The computational framework presented by Auwerx et al. integrates results from genome-wide association studies with data on genetic variants that affect the level of transcripts or the composition of metabolites. These variants are typically referred to as eQTL (short for expression quantitative trait loci) and mQTL (metabolite QTL), and can be derived from separate population cohorts, allowing researchers to tap into the vast resources of information that are already available.
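When eQTL, mQTL and genome-wide association statistics come from separate cohorts, the variant effects have to refer to the same allele before they can be combined. The sketch below shows one common harmonization step under simplifying assumptions: the record layout, the variant IDs and the choice to drop strand-ambiguous variants are all hypothetical and are not taken from Auwerx et al.

```python
# Allele pairs that read the same on both DNA strands and cannot be aligned safely.
AMBIGUOUS = {frozenset("AT"), frozenset("CG")}


def harmonize(exposure, outcome):
    """Align outcome effects to the effect allele used for the exposure.

    Both arguments map variant ID -> (effect_allele, other_allele, beta).
    Returns variant ID -> (beta_exposure, beta_outcome) for usable variants.
    """
    aligned = {}
    for variant, (ea_x, oa_x, beta_x) in exposure.items():
        if variant not in outcome:
            continue  # variant not measured in the second cohort
        if frozenset((ea_x, oa_x)) in AMBIGUOUS:
            continue  # strand-ambiguous variant: dropped in this sketch
        ea_y, oa_y, beta_y = outcome[variant]
        if (ea_y, oa_y) == (ea_x, oa_x):
            aligned[variant] = (beta_x, beta_y)
        elif (ea_y, oa_y) == (oa_x, ea_x):
            aligned[variant] = (beta_x, -beta_y)  # alleles swapped: flip the sign
        # any other allele combination is treated as a mismatch and skipped
    return aligned


# Hypothetical eQTL and mQTL summary statistics from two cohorts.
eqtl = {
    "rs1": ("A", "G", 0.3),
    "rs2": ("C", "T", -0.2),
    "rs3": ("A", "T", 0.1),
    "rs4": ("G", "A", 0.4),
}
mqtl = {
    "rs1": ("A", "G", 0.15),
    "rs2": ("T", "C", 0.1),
    "rs3": ("A", "T", 0.05),
}

harmonized = harmonize(eqtl, mqtl)
print(harmonized)
```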

First, causal links between transcripts and metabolites are established using overlapping mQTL and eQTL as instrumental variables. Causal effects of metabolites on traits of interest are then determined in the same manner using mQTL and genetic variants identified in genome-wide association studies. The next step in the framework builds purely on this established causality: transcripts that causally affect trait-modifying metabolites must also be causally linked to the same trait, resulting in transcript-metabolite-trait triplets (Figure 1). A statistical calculation, known as multivariable Mendelian randomization, is then performed on these triplets using the metabolite-associated variants as the instrumental variables. This determines what proportion of the change in the outcome results from the transcript impacting the trait directly (or via unknown mediators), and what proportion results from changes in the level of the metabolite mediating the relationship between them.
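The decomposition into direct and metabolite-mediated effects can be sketched as follows. This is a simplified illustration, not the implementation of Auwerx et al.: it uses unweighted least squares without an intercept, assumes independent instruments, and takes the mediated proportion as one minus the ratio of the direct effect to the total effect. The function names, the mix of instruments and all effect sizes are hypothetical.

```python
def mvmr_two_exposures(beta_t, beta_m, beta_y):
    """Least-squares fit (no intercept) of beta_y ~ direct * beta_t + via_m * beta_m.

    Returns (direct effect of the transcript, effect of the metabolite on the trait).
    """
    s_tt = sum(t * t for t in beta_t)
    s_mm = sum(m * m for m in beta_m)
    s_tm = sum(t * m for t, m in zip(beta_t, beta_m))
    s_ty = sum(t * y for t, y in zip(beta_t, beta_y))
    s_my = sum(m * y for m, y in zip(beta_m, beta_y))
    det = s_tt * s_mm - s_tm ** 2
    if abs(det) < 1e-12:
        raise ValueError("variant effects on transcript and metabolite are collinear")
    direct = (s_ty * s_mm - s_my * s_tm) / det
    metabolite_effect = (s_my * s_tt - s_ty * s_tm) / det
    return direct, metabolite_effect


def univariable_mr(beta_x, beta_y):
    """Unweighted single-exposure estimate: regression of beta_y on beta_x through the origin."""
    return sum(x * y for x, y in zip(beta_x, beta_y)) / sum(x * x for x in beta_x)


# Hypothetical variant effects, built so that the transcript changes the metabolite
# by 0.5 per unit, the metabolite changes the trait by 0.4, and the direct effect is 0.1.
beta_t = [0.30, 0.20, -0.25, 0.00, 0.00]    # variant -> transcript
beta_m = [0.15, 0.10, -0.125, 0.20, -0.10]  # variant -> metabolite
beta_y = [0.09, 0.06, -0.075, 0.08, -0.04]  # variant -> trait

direct, metabolite_effect = mvmr_two_exposures(beta_t, beta_m, beta_y)

# Total transcript effect, estimated from the first three variants only (the eQTLs).
total = univariable_mr(beta_t[:3], beta_y[:3])
mediated_proportion = 1.0 - direct / total
print(f"direct={direct:.2f} total={total:.2f} mediated={mediated_proportion:.2f}")
```

In practice the variant effects are estimated with error, so a real analysis would weight each variant by its precision and report uncertainty for the mediated proportion.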

Figure 1. Mendelian randomization with multiple variables.

In the first step, Mendelian randomization calculations establish causal links between: (i) transcripts (T; pink chains) and metabolites (M; green hexagons) using eQTL and mQTL as instrumental variables (IV; first row); (ii) metabolites and various phenotypes (Y, such as height), using mQTL and the genetic variants associated with the traits as instrumental variables (second row). These causal links are then overlapped to establish causal triplets (third row). The triplets are subsequently analyzed in another Mendelian randomization-based calculation, which evaluates the effect of the respective mQTL on the levels of the transcripts, metabolites and traits of the triplet (fourth row). From this multivariable Mendelian randomization (MVMR), the proportion of transcript changes that directly affect a trait, and the proportion that cause an effect via metabolites, can be inferred. eQTL: expression quantitative trait loci; mQTL: metabolite quantitative trait loci.

Image credit: Figure created using BioRender (CC BY 4.0).
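The overlap step shown in the third row of the figure amounts to joining two lists of causal links on the shared metabolite. A minimal sketch follows, assuming the links have already been declared significant by upstream Mendelian randomization analyses; the function name is hypothetical, and apart from the ANKH example discussed in the text the gene and metabolite names are placeholders.

```python
def build_triplets(transcript_to_metabolite, metabolite_to_trait):
    """Join transcript->metabolite and metabolite->trait links on the shared metabolite."""
    traits_by_metabolite = {}
    for metabolite, trait in metabolite_to_trait:
        traits_by_metabolite.setdefault(metabolite, []).append(trait)

    triplets = []
    for transcript, metabolite in transcript_to_metabolite:
        for trait in traits_by_metabolite.get(metabolite, []):
            triplets.append((transcript, metabolite, trait))
    return triplets


# Illustrative inputs: pairs that passed the first two Mendelian randomization steps.
transcript_links = [("ANKH", "citrate"), ("GENE_B", "metabolite_x")]
trait_links = [("citrate", "serum calcium"), ("metabolite_y", "height")]

triplets = build_triplets(transcript_links, trait_links)
print(triplets)
```

Only transcripts and traits that share a metabolite survive the join, which is why links with no counterpart in the other list produce no triplet.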

Auwerx et al. highlight an intriguing example of genetic variants affecting the transcription of a citrate-exporting protein encoded by a gene called ANKH that has been implicated in mineralization disorders. The resulting change to the export of citrate seems to affect the level of calcium present in the serum of individuals – a connection that was not detected when only transcript levels were correlated with the calcium trait.

By extending the Mendelian randomization approach to include two modalities (transcripts and metabolites), this new framework can detect causal relationships that could not be identified by comparing genome-wide association data to a single modality alone. It also provides new insights into how a transcript impacts the phenotype through metabolic changes. As multi-omics studies continue to grow in size, it is likely that even more advanced statistical approaches will become feasible in the future.

Article and author information

Author details

  1. Matthias Weith

    Matthias Weith is in the Cologne Excellence Cluster on Cellular Stress Responses in Age‐Associated Diseases, and the Institute for Biochemistry, University of Cologne, Cologne, Germany

    Competing interests
    No competing interests declared
    ORCID: 0000-0003-0804-4262
  2. Andreas Beyer

    Andreas Beyer is in the Cologne Excellence Cluster on Cellular Stress Responses in Age‐Associated Diseases, the Faculty of Medicine and University Hospital of Cologne, the Center for Molecular Medicine Cologne, and the Institute for Genetics, University of Cologne, Cologne, Germany

    For correspondence
    andreas.beyer@uni-koeln.de
    Competing interests
    No competing interests declared
    ORCID: 0000-0002-3891-2123

Copyright

© 2023, Weith and Beyer

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Cite this article

Matthias Weith, Andreas Beyer (2023) Genetics: The next step in Mendelian randomization. eLife 12:e86416. https://doi.org/10.7554/eLife.86416
