A sex-specific evolutionary interaction between ADCY9 and CETP
Abstract
Pharmacogenomic studies have revealed associations between rs1967309 in the adenylyl cyclase type 9 (ADCY9) gene and clinical responses to the cholesteryl ester transfer protein (CETP) modulator dalcetrapib, however, the mechanism behind this interaction is still unknown. Here, we characterized selective signals at the locus associated with the pharmacogenomic response in human populations and we show that rs1967309 region exhibits signatures of positive selection in several human populations. Furthermore, we identified a variant in CETP, rs158477, which is in long-range linkage disequilibrium with rs1967309 in the Peruvian population. The signal is mainly seen in males, a sex-specific result that is replicated in the LIMAA cohort of over 3,400 Peruvians. Analyses of RNA-seq data further suggest an epistatic interaction on CETP expression levels between the two SNPs in multiple tissues, which also differs between males and females. We also detected interaction effects of the two SNPs with sex on cardiovascular phenotypes in the UK Biobank, in line with the sex-specific genotype associations found in Peruvians at these loci. We propose that ADCY9 and CETP coevolved during recent human evolution due to sex-specific selection, which points towards a biological link between dalcetrapib’s pharmacogene ADCY9 and its therapeutic target CETP.
Data availability
- 1000 Genomes Project, GEUVADIS is freely available.- The Native American genetic dataset was shared to us upon request to the authors of the initial paper and through a data access agreement with Universidad de Antioquia (Prof. Omar Triana Chavez). We contacted bedoya.g@gmail.com and a.ruizlin@ucl.ac.uk to get access to the dataset and we completed a data access application form and signed a data access approval once approved. Applications for access to these data can be submitted at any time. These are considered on a rolling basis and a decision was given within 1 month of receipt. PhD student applicants must include their supervisors as a co-applicant and provide their full contact details. A publication list must be provided for the applicant, co-applicants and PhD supervisors where PhD students have applied to provide proof of competence in handling datasets of this size and nature. - UK Biobank was accessed through data access approval under the project number #15357 and #20168. Information to apply for data access can be found here: https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access - CARTaGENE biobank was accessed through data access approval under the project number #406713. Information to apply for data access can be found here: https://www.cartagene.qc.ca/en/researchers/access-request - GTEx v8 dataset was accessed through dbGaP under project number #19088 LIMAA dataset was accessed through dbGaP under the project number #26882. Information to apply for data access through dbGAP can be found here: https://dbgap.ncbi.nlm.nih.gov- RNA-sequencing of ADCY9-knocked-down HepG2 cell line data has been deposited under GSE174640 (embargo): https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE174640For reviewers/editors: Enter token ktaxgqkkrxupjkl into the box- Source data files and code for all main figures are available here: https://github.com/HussinLab/adcy9_cetp_Gamache_2021
-
RNA-sequencing of ADCY9-knocked-down HepG2 cell line (embargo)NCBI Gene Expression Omnibus, GSE174640.
-
LIMAAdbGaP project #26882.
-
Native American genetic dataset (NAGD)rom authors bedoya.g@gmail.com and a.ruizlin@ucl.ac.uk.
Article and author information
Author details
Funding
Institut de Cardiologie de Montréal
- Isabel Gamache
- Marc-André Legault
- Jean-Christophe Grenier
- Rocio Sanchez
- Eric Rhéaume
- Holly Trochet
- Jean-Claude Tardif
- Marie-Pierre Dubé
- Julie Hussin
Université de Montréal
- Isabel Gamache
Canadian Institutes of Health Research
- Marc-André Legault
Canada Research Chairs
- Jean-Claude Tardif
- Marie-Pierre Dubé
Fonds de Recherche du Québec - Santé
- Julie Hussin
Institut de Valorisation des Données
- Isabel Gamache
- Jean-Christophe Grenier
- Julie Hussin
Health collaboration acceleration fund from the Ministère de l'Économie et de l'Innovation of the Government of Quebec
- Jean-Claude Tardif
- Marie-Pierre Dubé
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Reviewing Editor
- Ellen Leffler, University of Utah, United States
Ethics
Human subjects: The analyses done in this study were approved by the different cohorts used. Participants in these cohorts gave their general consent for their data to be used for research purposes. All individual-level data was anonymized and no efforts were made by the authors to deanonymize or recontact any of the participants from the cohorts, in keeping with our agreements with the UK Biobank, CARTaGENE, dbGAP, 1000Genomes and the Native American Genetics dataset (Universidad de Antioquia).
Version history
- Received: April 7, 2021
- Preprint posted: May 14, 2021 (view preprint)
- Accepted: October 4, 2021
- Accepted Manuscript published: October 5, 2021 (version 1)
- Version of Record published: November 16, 2021 (version 2)
Copyright
© 2021, Gamache et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,039
- views
-
- 96
- downloads
-
- 6
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Computational and Systems Biology
- Genetics and Genomics
We propose a new framework for human genetic association studies: at each locus, a deep learning model (in this study, Sei) is used to calculate the functional genomic activity score for two haplotypes per individual. This score, defined as the Haplotype Function Score (HFS), replaces the original genotype in association studies. Applying the HFS framework to 14 complex traits in the UK Biobank, we identified 3619 independent HFS–trait associations with a significance of p < 5 × 10−8. Fine-mapping revealed 2699 causal associations, corresponding to a median increase of 63 causal findings per trait compared with single-nucleotide polymorphism (SNP)-based analysis. HFS-based enrichment analysis uncovered 727 pathway–trait associations and 153 tissue–trait associations with strong biological interpretability, including ‘circadian pathway-chronotype’ and ‘arachidonic acid-intelligence’. Lastly, we applied least absolute shrinkage and selection operator (LASSO) regression to integrate HFS prediction score with SNP-based polygenic risk scores, which showed an improvement of 16.1–39.8% in cross-ancestry polygenic prediction. We concluded that HFS is a promising strategy for understanding the genetic basis of human complex traits.
-
- Computational and Systems Biology
Revealing protein binding sites with other molecules, such as nucleic acids, peptides, or small ligands, sheds light on disease mechanism elucidation and novel drug design. With the explosive growth of proteins in sequence databases, how to accurately and efficiently identify these binding sites from sequences becomes essential. However, current methods mostly rely on expensive multiple sequence alignments or experimental protein structures, limiting their genome-scale applications. Besides, these methods haven’t fully explored the geometry of the protein structures. Here, we propose GPSite, a multi-task network for simultaneously predicting binding residues of DNA, RNA, peptide, protein, ATP, HEM, and metal ions on proteins. GPSite was trained on informative sequence embeddings and predicted structures from protein language models, while comprehensively extracting residual and relational geometric contexts in an end-to-end manner. Experiments demonstrate that GPSite substantially surpasses state-of-the-art sequence-based and structure-based approaches on various benchmark datasets, even when the structures are not well-predicted. The low computational cost of GPSite enables rapid genome-scale binding residue annotations for over 568,000 sequences, providing opportunities to unveil unexplored associations of binding sites with molecular functions, biological processes, and genetic variants. The GPSite webserver and annotation database can be freely accessed at https://bio-web1.nscc-gz.cn/app/GPSite.