SIMMER employs similarity algorithms to accurately identify human gut microbiome species and enzymes capable of known chemical transformations

  1. Annamarie E Bustion
  2. Renuka R Nayak
  3. Ayushi Agrawal
  4. Peter J Turnbaugh
  5. Katie S Pollard  Is a corresponding author
  1. University of California, San Francisco, United States
  2. Gladstone Institutes, United States

Abstract

Bacteria within the gut microbiota possess the ability to metabolize a wide array of human drugs, foods, and toxins, but the responsible enzymes for these chemical events remain largely uncharacterized due to the time-consuming nature of current experimental approaches. Attempts have been made in the past to computationally predict which bacterial species and enzymes are responsible for chemical transformations in the gut environment, but with low accuracy due to minimal chemical representation and sequence similarity search schemes. Here, we present an in silico approach that employs chemical and protein Similarity algorithms that Identify MicrobioMe Enzymatic Reactions (SIMMER). We show that SIMMER accurately predicts the responsible species and enzymes for a queried reaction, unlike previous methods. We demonstrate SIMMER use cases in the context of drug metabolism by predicting previously uncharacterized enzymes for 88 drug transformations known to occur in the human gut. We validate these predictions on external datasets and provide an in vitro validation of SIMMER's predictions for metabolism of methotrexate, an anti-arthritic drug. After demonstrating its utility and accuracy, we made SIMMER available as both a command-line and web tool, with flexible input and output options for determining chemical transformations within the human gut. We present SIMMER as a computational addition to the microbiome researcher's toolbox, enabling them to make informed hypotheses before embarking on the lengthy laboratory experiments required to characterize novel bacterial enzymes that can alter human ingested compounds.

Data availability

Data generated and analyzed during this study are provided in Figures 2-10 source data files, Table 1 source data file, supplemental files, and at https://github.com/aebustion/SIMMER. Accession numbers of previously published datasets are provided in the Materials and Methods section. SIMMER code can either be run at the SIMMER website (https://simmer.pollard.gladstone.org/) or downloaded directly from the above-linked GitHub.

The following previously published data sets were used

Article and author information

Author details

  1. Annamarie E Bustion

    Pharmaceutical Sciences and Pharmacogenomics Graduate Program, University of California, San Francisco, San Francisco, United States
    Competing interests
    No competing interests declared.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-7380-3619
  2. Renuka R Nayak

    Department of Medicine, University of California, San Francisco, San Francisco, United States
    Competing interests
    No competing interests declared.
  3. Ayushi Agrawal

    Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, United States
    Competing interests
    No competing interests declared.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-2940-8926
  4. Peter J Turnbaugh

    Department of Microbiology and Immunology, University of California, San Francisco, San Francisco, United States
    Competing interests
    Peter J Turnbaugh, Reviewing editor, eLife.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-0888-2875
  5. Katie S Pollard

    Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, United States
    For correspondence
    kpollard@gladstone.ucsf.edu
    Competing interests
    Katie S Pollard, is a consultant for Phylagen Inc..
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-9870-6196

Funding

PhRMA Foundation (Predoctoral Fellowship)

  • Annamarie E Bustion

ARCS Foundation (Graduate Student Scholarship)

  • Annamarie E Bustion

UCSF Benioff Center for Microbiome Medicine (Trainee Pilot Award)

  • Annamarie E Bustion

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

© 2023, Bustion et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 2,001
    views
  • 173
    downloads
  • 11
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Annamarie E Bustion
  2. Renuka R Nayak
  3. Ayushi Agrawal
  4. Peter J Turnbaugh
  5. Katie S Pollard
(2023)
SIMMER employs similarity algorithms to accurately identify human gut microbiome species and enzymes capable of known chemical transformations
eLife 12:e82401.
https://doi.org/10.7554/eLife.82401

Share this article

https://doi.org/10.7554/eLife.82401

Further reading

    1. Biochemistry and Chemical Biology
    2. Genetics and Genomics
    Jiale Zhou, Ding Zhao ... Zhanjun Li
    Research Article

    5-Methylcytosine (m5C) is one of the posttranscriptional modifications in mRNA and is involved in the pathogenesis of various diseases. However, the capacity of existing assays for accurately and comprehensively transcriptome-wide m5C mapping still needs improvement. Here, we develop a detection method named DRAM (deaminase and reader protein assisted RNA methylation analysis), in which deaminases (APOBEC1 and TadA-8e) are fused with m5C reader proteins (ALYREF and YBX1) to identify the m5C sites through deamination events neighboring the methylation sites. This antibody-free and bisulfite-free approach provides transcriptome-wide editing regions which are highly overlapped with the publicly available bisulfite-sequencing (BS-seq) datasets and allows for a more stable and comprehensive identification of the m5C loci. In addition, DRAM system even supports ultralow input RNA (10 ng). We anticipate that the DRAM system could pave the way for uncovering further biological functions of m5C modifications.

    1. Biochemistry and Chemical Biology
    Meina He, Yongxin Tao ... Wenli Chen
    Research Article

    Copper is an essential enzyme cofactor in bacteria, but excess copper is highly toxic. Bacteria can cope with copper stress by increasing copper resistance and initiating chemorepellent response. However, it remains unclear how bacteria coordinate chemotaxis and resistance to copper. By screening proteins that interacted with the chemotaxis kinase CheA, we identified a copper-binding repressor CsoR that interacted with CheA in Pseudomonas putida. CsoR interacted with the HPT (P1), Dimer (P3), and HATPase_c (P4) domains of CheA and inhibited CheA autophosphorylation, resulting in decreased chemotaxis. The copper-binding of CsoR weakened its interaction with CheA, which relieved the inhibition of chemotaxis by CsoR. In addition, CsoR bound to the promoter of copper-resistance genes to inhibit gene expression, and copper-binding released CsoR from the promoter, leading to increased gene expression and copper resistance. P. putida cells exhibited a chemorepellent response to copper in a CheA-dependent manner, and CsoR inhibited the chemorepellent response to copper. Besides, the CheA-CsoR interaction also existed in proteins from several other bacterial species. Our results revealed a mechanism by which bacteria coordinately regulated chemotaxis and resistance to copper by CsoR.