Top-down machine learning approach for high-throughput single-molecule analysis

  1. David S White
  2. Marcel P Goldschen-Ohm
  3. Randall H Goldsmith  Is a corresponding author
  4. Baron Chanda  Is a corresponding author
  1. University of Wisconsin-Madison, United States
  2. University of Texas at Austin, United States

Abstract

Single-molecule approaches provide enormous insight into the dynamics of biomolecules, but adequately sampling distributions of states and events often requires extensive sampling. Although emerging experimental techniques can generate such large datasets, existing analysis tools are not suitable to process the large volume of data obtained in high-throughput paradigms. Here, we present a new analysis platform (DISC) that accelerates unsupervised analysis of single-molecule trajectories. By merging model-free statistical learning with the Viterbi algorithm, DISC idealizes single-molecule trajectories up to three orders of magnitude faster with improved accuracy compared to other commonly used algorithms. Further, we demonstrate the utility of DISC algorithm to probe cooperativity between multiple binding events in the cyclic nucleotide binding domains of HCN pacemaker channel. Given the flexible and efficient nature of DISC, we anticipate it will be a powerful tool for unsupervised processing of high-throughput data across a range of single-molecule experiments.

Data availability

Simulated and raw data in addition to analysis scripts are available at https://zenodo.org/record/3727917#.Xn0Fw9NKjq0DOI: 10.5281/zenodo.3727917

The following data sets were generated

Article and author information

Author details

  1. David S White

    Neuroscience, University of Wisconsin-Madison, Madison, United States
    Competing interests
    No competing interests declared.
  2. Marcel P Goldschen-Ohm

    Neuroscience, University of Texas at Austin, Austin, United States
    Competing interests
    No competing interests declared.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-1466-9808
  3. Randall H Goldsmith

    Chemistry, University of Wisconsin-Madison, Madison, United States
    For correspondence
    rhg@chem.wisc.edu
    Competing interests
    No competing interests declared.
  4. Baron Chanda

    Department of Neuroscience, University of Wisconsin-Madison, Madison, United States
    For correspondence
    chanda@wisc.edu
    Competing interests
    Baron Chanda, Reviewing editor, eLife.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-4954-7034

Funding

National Institute of Neurological Disorders and Stroke (NS-101723)

  • Baron Chanda

National Institute of Neurological Disorders and Stroke (NS-081320)

  • Baron Chanda

National Institute of Neurological Disorders and Stroke (NS-081293)

  • Baron Chanda

National Institute of General Medical Sciences (GM007507)

  • David S White

National Institute of General Medical Sciences (GM127957)

  • Randall H Goldsmith

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

© 2020, White et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 3,998
    views
  • 515
    downloads
  • 35
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. David S White
  2. Marcel P Goldschen-Ohm
  3. Randall H Goldsmith
  4. Baron Chanda
(2020)
Top-down machine learning approach for high-throughput single-molecule analysis
eLife 9:e53357.
https://doi.org/10.7554/eLife.53357

Share this article

https://doi.org/10.7554/eLife.53357

Further reading

    1. Neuroscience
    2. Structural Biology and Molecular Biophysics
    Yangyu Wu, Yangyang Yan ... Fred J Sigworth
    Research Article

    We present near-atomic-resolution cryoEM structures of the mammalian voltage-gated potassium channel Kv1.2 in open, C-type inactivated, toxin-blocked and sodium-bound states at 3.2 Å, 2.5 Å, 3.2 Å, and 2.9 Å. These structures, all obtained at nominally zero membrane potential in detergent micelles, reveal distinct ion-occupancy patterns in the selectivity filter. The first two structures are very similar to those reported in the related Shaker channel and the much-studied Kv1.2–2.1 chimeric channel. On the other hand, two new structures show unexpected patterns of ion occupancy. First, the toxin α-Dendrotoxin, like Charybdotoxin, is seen to attach to the negatively-charged channel outer mouth, and a lysine residue penetrates into the selectivity filter, with the terminal amine coordinated by carbonyls, partially disrupting the outermost ion-binding site. In the remainder of the filter two densities of bound ions are observed, rather than three as observed with other toxin-blocked Kv channels. Second, a structure of Kv1.2 in Na+ solution does not show collapse or destabilization of the selectivity filter, but instead shows an intact selectivity filter with ion density in each binding site. We also attempted to image the C-type inactivated Kv1.2 W366F channel in Na+ solution, but the protein conformation was seen to be highly variable and only a low-resolution structure could be obtained. These findings present new insights into the stability of the selectivity filter and the mechanism of toxin block of this intensively studied, voltage-gated potassium channel.

    1. Structural Biology and Molecular Biophysics
    Kazi A Hossain, Lukasz Nierzwicki ... Giulia Palermo
    Research Article

    xCas9 is an evolved variant of the CRISPR-Cas9 genome editing system, engineered to improve specificity and reduce undesired off-target effects. How xCas9 expands the DNA targeting capability of Cas9 by recognising a series of alternative protospacer adjacent motif (PAM) sequences while ignoring others is unknown. Here, we elucidate the molecular mechanism underlying xCas9’s expanded PAM recognition and provide critical insights for expanding DNA targeting. We demonstrate that while wild-type Cas9 enforces stringent guanine selection through the rigidity of its interacting arginine dyad, xCas9 introduces flexibility in R1335, enabling selective recognition of specific PAM sequences. This increased flexibility confers a pronounced entropic preference, which also improves recognition of the canonical TGG PAM. Furthermore, xCas9 enhances DNA binding to alternative PAM sequences during the early evolution cycles, while favouring binding to the canonical PAM in the final evolution cycle. This dual functionality highlights how xCas9 broadens PAM recognition and underscores the importance of fine-tuning the flexibility of the PAM-interacting cleft as a key strategy for expanding the DNA targeting potential of CRISPR-Cas systems. These findings deepen our understanding of DNA recognition in xCas9 and may apply to other CRISPR-Cas systems with similar PAM recognition requirements.