Top-down machine learning approach for high-throughput single-molecule analysis

  1. David S White
  2. Marcel P Goldschen-Ohm
  3. Randall H Goldsmith (corresponding author)
  4. Baron Chanda (corresponding author)
  1. University of Wisconsin-Madison, United States
  2. University of Texas at Austin, United States

Abstract

Single-molecule approaches provide enormous insight into the dynamics of biomolecules, but adequately characterizing distributions of states and events often requires extensive sampling. Although emerging experimental techniques can generate such large datasets, existing analysis tools are not suited to processing the large volume of data obtained in high-throughput paradigms. Here, we present a new analysis platform (DISC) that accelerates unsupervised analysis of single-molecule trajectories. By merging model-free statistical learning with the Viterbi algorithm, DISC idealizes single-molecule trajectories up to three orders of magnitude faster, and with improved accuracy, compared to other commonly used algorithms. Further, we demonstrate the utility of the DISC algorithm for probing cooperativity between multiple binding events in the cyclic nucleotide binding domains of the HCN pacemaker channel. Given the flexible and efficient nature of DISC, we anticipate it will be a powerful tool for unsupervised processing of high-throughput data across a range of single-molecule experiments.
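For readers unfamiliar with the Viterbi step mentioned above, the following is a minimal Python sketch of how a noisy trajectory can be idealized into discrete states. It is not the DISC implementation. The function name `viterbi_idealize`, the Gaussian emission model with a shared `sigma`, the uniform initial distribution, and the symmetric transition matrix set by `p_stay` are all assumptions made for illustration, and the state levels in `means` are supplied by hand rather than learned from the data.

```python
import math


def viterbi_idealize(trace, means, sigma, p_stay=0.95):
    """Return the most likely state index for each sample of a 1-D trace.

    Illustrative assumptions (not taken from the paper): Gaussian
    emissions with a shared sigma, a uniform initial distribution, and
    a symmetric transition matrix with self-transition probability p_stay.
    """
    if not trace:
        return []
    n_states = len(means)
    if n_states == 1:
        return [0] * len(trace)

    log_stay = math.log(p_stay)
    log_switch = math.log((1.0 - p_stay) / (n_states - 1))

    def log_emit(x, k):
        # Gaussian log-density, dropping the constant shared by all states
        return -0.5 * ((x - means[k]) / sigma) ** 2

    # score[k] = best log-probability of any path ending in state k
    score = [log_emit(trace[0], k) for k in range(n_states)]
    backptr = []
    for x in trace[1:]:
        new_score, ptr = [], []
        for k in range(n_states):
            # Best predecessor state j for landing in state k
            best_j = max(
                range(n_states),
                key=lambda j: score[j] + (log_stay if j == k else log_switch),
            )
            trans = log_stay if best_j == k else log_switch
            new_score.append(score[best_j] + trans + log_emit(x, k))
            ptr.append(best_j)
        score = new_score
        backptr.append(ptr)

    # Trace back from the best final state
    state = max(range(n_states), key=lambda k: score[k])
    path = [state]
    for ptr in reversed(backptr):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


# Hypothetical two-level trace: low, then high, then low again
noisy = [0.1, -0.2, 0.05, 1.1, 0.9, 1.2, 0.0, 0.1]
states = viterbi_idealize(noisy, means=[0.0, 1.0], sigma=0.3)
```

With these hypothetical inputs, a sustained change in level is assigned to the second state, while a single noisy sample between two low samples is usually absorbed into the surrounding state, because each switch is penalized by the transition term.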

Data availability

Simulated and raw data, in addition to analysis scripts, are available at https://zenodo.org/record/3727917#.Xn0Fw9NKjq0 (DOI: 10.5281/zenodo.3727917).

Article and author information

Author details

  1. David S White

    Neuroscience, University of Wisconsin-Madison, Madison, United States
    Competing interests
    No competing interests declared.
  2. Marcel P Goldschen-Ohm

    Neuroscience, University of Texas at Austin, Austin, United States
    Competing interests
    No competing interests declared.
    ORCID: 0000-0003-1466-9808
  3. Randall H Goldsmith

    Chemistry, University of Wisconsin-Madison, Madison, United States
    For correspondence
    rhg@chem.wisc.edu
    Competing interests
    No competing interests declared.
  4. Baron Chanda

    Department of Neuroscience, University of Wisconsin-Madison, Madison, United States
    For correspondence
    chanda@wisc.edu
    Competing interests
    Baron Chanda, Reviewing editor, eLife.
    ORCID: 0000-0003-4954-7034

Funding

National Institute of Neurological Disorders and Stroke (NS-101723)

  • Baron Chanda

National Institute of Neurological Disorders and Stroke (NS-081320)

  • Baron Chanda

National Institute of Neurological Disorders and Stroke (NS-081293)

  • Baron Chanda

National Institute of General Medical Sciences (GM007507)

  • David S White

National Institute of General Medical Sciences (GM127957)

  • Randall H Goldsmith

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

© 2020, White et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Cite this article

David S White, Marcel P Goldschen-Ohm, Randall H Goldsmith, Baron Chanda (2020)
Top-down machine learning approach for high-throughput single-molecule analysis
eLife 9:e53357.
https://doi.org/10.7554/eLife.53357
