Abstract

Multi-wavelength single-molecule fluorescence colocalization (CoSMoS) methods allow elucidation of complex biochemical reaction mechanisms. However, analysis of CoSMoS data is intrinsically challenging because of low image signal-to-noise ratios, non-specific surface binding of the fluorescent molecules, and analysis methods that require subjective inputs to achieve accurate results. Here, we use Bayesian probabilistic programming to implement Tapqir, an unsupervised machine learning method that incorporates a holistic, physics-based causal model of CoSMoS data. This method accounts for uncertainties in image analysis due to photon and camera noise, optical non-uniformities, non-specific binding, and spot detection. Rather than merely producing a binary 'spot/no spot' classification of unspecified reliability, Tapqir objectively assigns spot classification probabilities that allow accurate downstream analysis of molecular dynamics, thermodynamics, and kinetics. We both quantitatively validate Tapqir performance against simulated CoSMoS image data with known properties and also demonstrate that it implements fully objective, automated analysis of experiment-derived data sets with a wide range of signal, noise, and non-specific binding characteristics.

Data availability

All data generated or analyzed for this study will be available at https://github.com/ordabayevy/tapqir-overleaf. That repository also includes all Figures and Figure supplements and the scripts and data used to generate them. It also contains the Supplemental Data files and preprint manuscript text.

The following data sets were generated

Article and author information

Author details

  1. Yerdos A Ordabayev

    Department of Biochemistry, Brandeis University, Waltham, United States
    Competing interests
    The authors declare that no competing interests exist.
  2. Larry J Friedman

    Department of Biochemistry, Brandeis University, Waltham, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-4946-8731
  3. Jeff Gelles

    Department of Biochemistry, Brandeis University, Waltham, United States
    For correspondence
    gelles@brandeis.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-7910-3421
  4. Douglas L Theobald

    Department of Biochemistry, Brandeis University, Waltham, United States
    For correspondence
    dtheobald@brandeis.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-2695-8343

Funding

National Institute of General Medical Sciences (R01GM121384)

  • Jeff Gelles
  • Douglas L Theobald

National Institute of General Medical Sciences (R01GM081648)

  • Jeff Gelles

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. Ruben L Gonzalez Jr, Columbia University, United States

Version history

  1. Received: September 14, 2021
  2. Preprint posted: October 1, 2021 (view preprint)
  3. Accepted: March 19, 2022
  4. Accepted Manuscript published: March 23, 2022 (version 1)
  5. Version of Record published: June 9, 2022 (version 2)

Copyright

© 2022, Ordabayev et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 1,560
    views
  • 287
    downloads
  • 2
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Yerdos A Ordabayev
  2. Larry J Friedman
  3. Jeff Gelles
  4. Douglas L Theobald
(2022)
Bayesian machine learning analysis of single-molecule fluorescence colocalization images
eLife 11:e73860.
https://doi.org/10.7554/eLife.73860

Share this article

https://doi.org/10.7554/eLife.73860

Further reading

    1. Biochemistry and Chemical Biology
    2. Cell Biology
    Ya-Juan Wang, Xiao-Jing Di ... Ting-Wei Mu
    Research Article

    Protein homeostasis (proteostasis) deficiency is an important contributing factor to neurological and metabolic diseases. However, how the proteostasis network orchestrates the folding and assembly of multi-subunit membrane proteins is poorly understood. Previous proteomics studies identified Hsp47 (Gene: SERPINH1), a heat shock protein in the endoplasmic reticulum lumen, as the most enriched interacting chaperone for gamma-aminobutyric type A (GABAA) receptors. Here, we show that Hsp47 enhances the functional surface expression of GABAA receptors in rat neurons and human HEK293T cells. Furthermore, molecular mechanism study demonstrates that Hsp47 acts after BiP (Gene: HSPA5) and preferentially binds the folded conformation of GABAA receptors without inducing the unfolded protein response in HEK293T cells. Therefore, Hsp47 promotes the subunit-subunit interaction, the receptor assembly process, and the anterograde trafficking of GABAA receptors. Overexpressing Hsp47 is sufficient to correct the surface expression and function of epilepsy-associated GABAA receptor variants in HEK293T cells. Hsp47 also promotes the surface trafficking of other Cys-loop receptors, including nicotinic acetylcholine receptors and serotonin type 3 receptors in HEK293T cells. Therefore, in addition to its known function as a collagen chaperone, this work establishes that Hsp47 plays a critical and general role in the maturation of multi-subunit Cys-loop neuroreceptors.

    1. Biochemistry and Chemical Biology
    Hao Wang, Chen Ye ... Yan Li
    Research Article

    Bacterial exonuclease III (ExoIII), widely acknowledged for specifically targeting double-stranded DNA (dsDNA), has been documented as a DNA repair-associated nuclease with apurinic/apyrimidinic (AP)-endonuclease and 3′→5′ exonuclease activities. Due to these enzymatic properties, ExoIII has been broadly applied in molecular biosensors. Here, we demonstrate that ExoIII (Escherichia coli) possesses highly active enzymatic activities on ssDNA. By using a range of ssDNA fluorescence-quenching reporters and fluorophore-labeled probes coupled with mass spectrometry analysis, we found ExoIII cleaved the ssDNA at 5′-bond of phosphodiester from 3′ to 5′ end by both exonuclease and endonuclease activities. Additional point mutation analysis identified the critical residues for the ssDNase action of ExoIII and suggested the activity shared the same active center with the dsDNA-targeted activities of ExoIII. Notably, ExoIII could also digest the dsDNA structures containing 3′-end ssDNA. Considering most ExoIII-assisted molecular biosensors require the involvement of single-stranded DNA (ssDNA) or nucleic acid aptamer containing ssDNA, the activity will lead to low efficiency or false positive outcome. Our study revealed the multi-enzymatic activity and the underlying molecular mechanism of ExoIII on ssDNA, illuminating novel insights for understanding its biological roles in DNA repair and the rational design of ExoIII-ssDNA involved diagnostics.