Author-sourced capture of pathway knowledge in computable form using Biofactoid
Abstract
Making the knowledge contained in scientific papers machine-readable and formally computable would allow researchers to take full advantage of this information by enabling integration with other knowledge sources to support data analysis and interpretation. Here we describe Biofactoid, a web-based platform that allows scientists to specify networks of interactions between genes, their products, and chemical compounds, and then translates this information into a representation suitable for computational analysis, search and discovery. We also report the results of a pilot study to encourage the wide adoption of Biofactoid by the scientific community.
Data availability
All Biofactoid data are available under the Creative Commons CC0 public domain license. To download the data and code, please refer to the documentation on the Biofactoid GitHub repository (github.com/PathwayCommons/factoid). Software availability is described further in Materials and methods.
Article and author information
Funding
National Human Genome Research Institute (U41 HG006623)
- Jeffrey V Wong
- Max Franz
- Metin Can Siper
- Dylan Fong
- Funda Durupinar
- Christian Dallago
- Augustin Luna
- John M Giorgi
- Igor Rodchenkov
- Özgün Babur
- Emek Demir
- Gary D Bader
- Chris Sander
National Human Genome Research Institute (U41 HG003751)
- Jeffrey V Wong
- Max Franz
- Metin Can Siper
- Dylan Fong
- Funda Durupinar
- Christian Dallago
- Augustin Luna
- John M Giorgi
- Igor Rodchenkov
- Özgün Babur
- Emek Demir
- Gary D Bader
- Chris Sander
National Human Genome Research Institute (R01 HG009979)
- Max Franz
- Gary D Bader
National Institute of General Medical Sciences (P41 GM103504)
- Jeffrey V Wong
- Max Franz
- Metin Can Siper
- Dylan Fong
- Funda Durupinar
- Christian Dallago
- Augustin Luna
- John M Giorgi
- Igor Rodchenkov
- Özgün Babur
- Emek Demir
- Gary D Bader
- Chris Sander
Defense Advanced Research Projects Agency (Big Mechanism, ARO W911NF-14-C-0119)
- Metin Can Siper
- Funda Durupinar
- Özgün Babur
- John A Bachman
- Benjamin Gyori
- Emek Demir
Defense Advanced Research Projects Agency (Communicating with Computers, ARO W911NF-15-1-054)
- Metin Can Siper
- Funda Durupinar
- Özgün Babur
- John A Bachman
- Benjamin Gyori
- Emek Demir
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Ethics
Human subjects: Participants in user testing provided written consent to volunteer, to have their testing sessions recorded, and to have quotes from those sessions published.
Copyright
© 2021, Wong et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.