Knowledge synthesis of 100 million biomedical documents augments the deep expression profiling of coronavirus receptors
Abstract
The COVID-19 pandemic demands assimilation of all biomedical knowledge to decode mechanisms of pathogenesis. Despite the recent renaissance in neural networks, a platform for the real-time synthesis of the exponentially growing biomedical literature and deep omics insights is unavailable. Here, we present the nferX platform for dynamic inference from more than 45 quadrillion possible conceptual associations in unstructured text, and for triangulation with insights from single-cell RNA sequencing, bulk RNA-seq and proteomics across diverse tissue types. A hypothesis-free profiling of ACE2 suggests tongue keratinocytes, olfactory epithelial cells, airway club cells and respiratory ciliated cells as potential reservoirs of the SARS-CoV-2 receptor. We find the gut to be the putative hotspot of COVID-19, where a maturation-correlated transcriptional signature is shared among coronavirus receptors (ACE2, DPP4, ANPEP) in small intestine enterocytes. A holistic data science platform triangulating insights from structured and unstructured data holds potential for accelerating the generation of impactful biological insights and hypotheses.
Data availability
All data used in this manuscript were obtained from published and freely available sources online. A complete list of these can be found in Supplementary File 1.
Article and author information
Funding
No external funding was received for this work.
Copyright
© 2020, Venkatakrishnan et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- 4,137 views
- 526 downloads
- 47 citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.