Multi-wavelength single-molecule fluorescence colocalization (CoSMoS) methods allow elucidation of complex biochemical reaction mechanisms. However, analysis of CoSMoS data is intrinsically challenging because of low image signal-to-noise ratios, non-specific surface binding of the fluorescent molecules, and analysis methods that require subjective inputs to achieve accurate results. Here, we use Bayesian probabilistic programming to implement Tapqir, an unsupervised machine learning method that incorporates a holistic, physics-based causal model of CoSMoS data. This method accounts for uncertainties in image analysis due to photon and camera noise, optical non-uniformities, non-specific binding, and spot detection. Rather than merely producing a binary 'spot/no spot' classification of unspecified reliability, Tapqir objectively assigns spot classification probabilities that allow accurate downstream analysis of molecular dynamics, thermodynamics, and kinetics. We both quantitatively validate Tapqir performance against simulated CoSMoS image data with known properties and also demonstrate that it implements fully objective, automated analysis of experiment-derived data sets with a wide range of signal, noise, and non-specific binding characteristics.
All data generated or analyzed for this study will be available at https://github.com/ordabayevy/tapqir-overleaf. That repository also includes all Figures and Figure supplements and the scripts and data used to generate them. It also contains the Supplemental Data files and preprint manuscript text.
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
© 2022, Ordabayev et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Deep Mutational Scanning (DMS) is an emerging method to systematically test the functional consequences of thousands of sequence changes to a protein target in a single experiment. Because of its utility in interpreting both human variant effects and protein structure-function relationships, it holds substantial promise to improve drug discovery and clinical development. However, applications in this domain require improved experimental and analytical methods. To address this need, we report novel DMS methods to precisely and quantitatively interrogate disease-relevant mechanisms, protein-ligand interactions, and assess predicted response to drug treatment. Using these methods, we performed a DMS of the melanocortin-4 receptor (MC4R), a G-protein-coupled receptor (GPCR) implicated in obesity and an active target of drug development efforts. We assessed the effects of >6600 single amino acid substitutions on MC4R’s function across 18 distinct experimental conditions, resulting in >20 million unique measurements. From this, we identified variants that have unique effects on MC4R-mediated Gαs- and Gαq-signaling pathways, which could be used to design drugs that selectively bias MC4R’s activity. We also identified pathogenic variants that are likely amenable to a corrector therapy. Finally, we functionally characterized structural relationships that distinguish the binding of peptide versus small molecule ligands, which could guide compound optimization. Collectively, these results demonstrate that DMS is a powerful method to empower drug discovery and development.
Copper is an essential enzyme cofactor in bacteria, but excess copper is highly toxic. Bacteria can cope with copper stress by increasing copper resistance and initiating chemorepellent response. However, it remains unclear how bacteria coordinate chemotaxis and resistance to copper. By screening proteins that interacted with the chemotaxis kinase CheA, we identified a copper-binding repressor CsoR that interacted with CheA in Pseudomonas putida. CsoR interacted with the HPT (P1), Dimer (P3), and HATPase_c (P4) domains of CheA and inhibited CheA autophosphorylation, resulting in decreased chemotaxis. The copper-binding of CsoR weakened its interaction with CheA, which relieved the inhibition of chemotaxis by CsoR. In addition, CsoR bound to the promoter of copper-resistance genes to inhibit gene expression, and copper-binding released CsoR from the promoter, leading to increased gene expression and copper resistance. P. putida cells exhibited a chemorepellent response to copper in a CheA-dependent manner, and CsoR inhibited the chemorepellent response to copper. Besides, the CheA-CsoR interaction also existed in proteins from several other bacterial species. Our results revealed a mechanism by which bacteria coordinately regulated chemotaxis and resistance to copper by CsoR.