A proteome-wide genetic investigation identifies several SARS-CoV-2-exploited host targets of clinical relevance
Abstract
Background:
The virus SARS-CoV-2 can exploit biological vulnerabilities (e.g. host proteins) in susceptible hosts that predispose to the development of severe COVID-19.
Methods:
To identify host proteins that may contribute to the risk of severe COVID-19, we undertook proteome-wide genetic colocalisation tests, and polygenic (pan) and cis-Mendelian randomisation analyses leveraging publicly available protein and COVID-19 datasets.
Results:
Our analytic approach identified several known targets (e.g. ABO, OAS1), but also nominated new proteins such as soluble Fas (colocalisation probability > 0.9, p = 1 x 10-4), implicating Fas-mediated apoptosis as a potential target for COVID-19 risk. The polygenic (pan) and cis-Mendelian randomisation analyses showed consistent associations of genetically predicted ABO protein with several COVID-19 phenotypes. The ABO signal is highly pleiotropic and a look-up of proteins associated with the ABO signal revealed that the strongest association was with soluble CD209. We demonstrated experimentally that CD209 directly interacts with the spike protein of SARS-CoV-2, suggesting a mechanism that could explain the ABO association with COVID-19.
Conclusions:
Our work provides a prioritised list of host targets potentially exploited by SARS-CoV-2 and is a precursor for further research on CD209 and FAS as therapeutically tractable targets for COVID-19.
Funding:
MAK, JSc, JH, AB, DO, MC, EMM, MG, ID were funded by Open Targets. J.Z. and T.R.G were funded by the UK Medical Research Council Integrative Epidemiology Unit (MC_UU_00011/4). JSh and GJW were funded by the Wellcome Trust Grant 206194. This research was funded in part by the Wellcome Trust [Grant 206194]. For the purpose of open access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.
Data availability
Summary data used for genetic analyses are publicly available (Sun et al can be downloaded from GWAS catalog https://www.ebi.ac.uk/gwas/downloads/summary-statistics and COVID-19 HGI summary statistics can be downloaded from their website https://www.covid19hg.org/results/). Data generated from our study are provided in the supplementary tables (pan-MR and cis-MR association results filtered at p < 0.05 and no filters applied to colocalisation results).
-
Genomic atlas of the human plasma proteomeGWAS Catalog, GCST005806.
-
Mapping the human genetic architecture of COVID-19 by worldwide meta-analysisCOVID GWAS meta-analysis results, release 4.
Article and author information
Author details
Funding
Wellcome Trust (Grant 206194)
- Mohd Anisul
- Jarrod Shilts
- Jeremy Schwartzentruber
- Gavin J Wright
- Maya Ghoussaini
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Reviewing Editor
- John W Schoggins, University of Texas Southwestern Medical Center, United States
Ethics
Human subjects: All institutions contributing cohorts to the COVID-19 Host Genetics Initiative and INTERVAL (Sun et al) study for proteomics received ethics approval from their respective research ethics review boards. All participants in the INTERVAL study provided informed consent before joining the INTERVAL study with approval from the National Research Ethics (11/EE/0538). Ethics statements of studies that contributed participant data to the COVID-19 Host Genetics Initiative are provided in Supplementary Table 1 of their recently published paper (https://www.nature.com/articles/s41586-021-03767-x).
Version history
- Preprint posted: March 17, 2021 (view preprint)
- Received: April 23, 2021
- Accepted: August 7, 2021
- Accepted Manuscript published: August 17, 2021 (version 1)
- Version of Record published: September 22, 2021 (version 2)
Copyright
© 2021, Anisul et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,783
- views
-
- 217
- downloads
-
- 22
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Epidemiology and Global Health
We discuss 12 misperceptions, misstatements, or mistakes concerning the use of covariates in observational or nonrandomized research. Additionally, we offer advice to help investigators, editors, reviewers, and readers make more informed decisions about conducting and interpreting research where the influence of covariates may be at issue. We primarily address misperceptions in the context of statistical management of the covariates through various forms of modeling, although we also emphasize design and model or variable selection. Other approaches to addressing the effects of covariates, including matching, have logical extensions from what we discuss here but are not dwelled upon heavily. The misperceptions, misstatements, or mistakes we discuss include accurate representation of covariates, effects of measurement error, overreliance on covariate categorization, underestimation of power loss when controlling for covariates, misinterpretation of significance in statistical models, and misconceptions about confounding variables, selecting on a collider, and p value interpretations in covariate-inclusive analyses. This condensed overview serves to correct common errors and improve research quality in general and in nutrition research specifically.
-
- Ecology
- Epidemiology and Global Health
Zoonotic disease dynamics in wildlife hosts are rarely quantified at macroecological scales due to the lack of systematic surveys. Non-human primates (NHPs) host Plasmodium knowlesi, a zoonotic malaria of public health concern and the main barrier to malaria elimination in Southeast Asia. Understanding of regional P. knowlesi infection dynamics in wildlife is limited. Here, we systematically assemble reports of NHP P. knowlesi and investigate geographic determinants of prevalence in reservoir species. Meta-analysis of 6322 NHPs from 148 sites reveals that prevalence is heterogeneous across Southeast Asia, with low overall prevalence and high estimates for Malaysian Borneo. We find that regions exhibiting higher prevalence in NHPs overlap with human infection hotspots. In wildlife and humans, parasite transmission is linked to land conversion and fragmentation. By assembling remote sensing data and fitting statistical models to prevalence at multiple spatial scales, we identify novel relationships between P. knowlesi in NHPs and forest fragmentation. This suggests that higher prevalence may be contingent on habitat complexity, which would begin to explain observed geographic variation in parasite burden. These findings address critical gaps in understanding regional P. knowlesi epidemiology and indicate that prevalence in simian reservoirs may be a key spatial driver of human spillover risk.