Integrative analysis of metabolite GWAS illuminates the molecular basis of pleiotropy and genetic correlation
Abstract
Pleiotropy and genetic correlation are widespread features in GWAS, but they are often difficult to interpret at the molecular level. Here, we perform GWAS of 16 metabolites clustered at the intersection of amino acid catabolism, glycolysis, and ketone body metabolism in a subset of UK Biobank. We utilize the well-documented biochemistry jointly impacting these metabolites to analyze pleiotropic effects in the context of their pathways. Among the 213 lead GWAS hits, we find a strong enrichment for genes encoding pathway-relevant enzymes and transporters. We demonstrate that the effect directions of variants acting on biology between metabolite pairs often contrast with those of upstream or downstream variants as well as the polygenic background. Thus, we find that these outlier variants often reflect biology local to the traits. Finally, we explore the implications for interpreting disease GWAS, underscoring the potential of unifying biochemistry with dense metabolomics data to understand the molecular basis of pleiotropy in complex traits and diseases.
Data availability
The source data and analyzed data have been deposited in Dryad. Code are available at the github link (https://github.com/courtrun/Pleiotropy-of-UKB-Metabolites). The raw individual level data are available through application to UK Biobank.
-
Pleiotropy of UK Biobank Metabolites [preliminary]Dryad Digital Repository, doi:10.5061/dryad.79cnp5hxs.
-
The UK Biobank resource with deep phenotyping and genomic dataUK Biobank http://www.ukbiobank.ac.uk/register-apply/.
Article and author information
Author details
Funding
Stanford Knight-Hennessy Scholars Program (Graduate Student Fellowship)
- Courtney J Smith
National Science Foundation (Graduate Student Fellowship)
- Courtney J Smith
National Institute of Health (5R01HG011432 and 5R01AG066490)
- Jonathan K Pritchard
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Ethics
Human subjects: All participants provided written informed consent and ethical approval was obtained from the North West Multi-Center Research Ethics Committee (11/NW/0382). The current analysis was approved under UK Biobank Project 24983 and 30418.
Copyright
© 2022, Smith et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 4,297
- views
-
- 734
- downloads
-
- 28
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Genetics and Genomics
RNAs in circulation carry sequence-specific regulatory information between cells in plant, animal, and host-pathogen systems. Such RNA can cross generational boundaries, as evidenced by somatic double-stranded RNA (dsRNA) in the nematode Caenorhabditis elegans silencing genes of matching sequence in progeny. Here we dissect the intergenerational path taken by dsRNA from parental circulation and discover that cytosolic import through the dsRNA importer SID-1 in the parental germline and/or developing progeny varies with developmental time and dsRNA substrates. Loss of SID-1 enhances initiation of heritable RNA silencing within the germline and causes changes in the expression of the sid-1-dependent gene sdg-1 that last for more than 100 generations after restoration of SID-1. The SDG-1 protein is enriched in perinuclear germ granules required for heritable RNA silencing but is expressed from a retrotransposon targeted by such silencing. This auto-inhibitory loop suggests how retrotransposons could persist by hosting genes that regulate their own silencing.
-
- Cell Biology
- Genetics and Genomics
Progression through the G1 phase of the cell cycle is the most highly regulated step in cellular division. We employed a chemogenetic approach to discover novel cellular networks that regulate cell cycle progression. This approach uncovered functional clusters of genes that altered sensitivity of cells to inhibitors of the G1/S transition. Mutation of components of the Polycomb Repressor Complex 2 rescued proliferation inhibition caused by the CDK4/6 inhibitor palbociclib, but not to inhibitors of S phase or mitosis. In addition to its core catalytic subunits, mutation of the PRC2.1 accessory protein MTF2, but not the PRC2.2 protein JARID2, rendered cells resistant to palbociclib treatment. We found that PRC2.1 (MTF2), but not PRC2.2 (JARID2), was critical for promoting H3K27me3 deposition at CpG islands genome-wide and in promoters. This included the CpG islands in the promoter of the CDK4/6 cyclins CCND1 and CCND2, and loss of MTF2 lead to upregulation of both CCND1 and CCND2. Our results demonstrate a role for PRC2.1, but not PRC2.2, in antagonizing G1 progression in a diversity of cell linages, including chronic myeloid leukemia (CML), breast cancer, and immortalized cell lines.