Active machine learning-driven experimentation to determine compound effects on protein patterns

  1. Armaghan W Naik
  2. Joshua D Kangas
  3. Devin P Sullivan
  4. Robert F Murphy (corresponding author)
  1. Carnegie Mellon University, United States

Abstract

High-throughput screening determines the effects of many conditions on a given biological target. Currently, estimating the effects of those conditions on other targets requires either strong modeling assumptions (e.g. similarities among targets) or separate screens. Ideally, data-driven experimentation could be used to learn accurate models for many conditions and targets without doing all possible experiments. We have previously described an active machine learning algorithm that can iteratively choose small sets of experiments to learn models of multiple effects. We now show that, with no prior knowledge and with liquid handling robotics and automated microscopy under its control, this learner accurately learned the effects of 48 chemical compounds on the subcellular localization of 48 proteins while performing only 29% of all possible experiments. The results represent the first practical demonstration of the utility of active learning-driven biological experimentation in which the set of possible phenotypes is unknown in advance.
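
To give a sense of scale, the experiment space and budget described above can be sketched in a few lines of Python. This is only an illustrative loop under stated assumptions, not the authors' learner: `run_experiment`, `choose_batch`, `active_loop`, the batch size of 96 and the toy ground truth are all hypothetical, and the selection rule is a simple coverage heuristic. A real active learner would instead score candidate experiments using a model fitted to the results collected so far.

```python
import random
from collections import Counter

N_COMPOUNDS = 48
N_PROTEINS = 48
TOTAL = N_COMPOUNDS * N_PROTEINS   # 2304 possible compound-protein experiments
BUDGET = round(0.29 * TOTAL)       # 668 experiments, about 29% of the grid


def run_experiment(compound, protein):
    """Hypothetical stand-in for the robot and microscope; returns a phenotype label."""
    # Toy ground truth (an assumption): some compounds shift some proteins.
    return "shifted" if compound % 6 == 0 and protein % 4 == 0 else "unchanged"


def choose_batch(observed, batch_size):
    """Pick unobserved experiments whose compound and protein are least covered."""
    row_counts = Counter(c for c, _ in observed)
    col_counts = Counter(p for _, p in observed)
    candidates = [
        (c, p)
        for c in range(N_COMPOUNDS)
        for p in range(N_PROTEINS)
        if (c, p) not in observed
    ]
    random.shuffle(candidates)  # random tie-breaking; the sort below is stable
    candidates.sort(key=lambda cp: row_counts[cp[0]] + col_counts[cp[1]])
    return candidates[:batch_size]


def active_loop(batch_size=96, seed=0):
    """Alternate between choosing a batch and running it until the budget is spent."""
    random.seed(seed)
    observed = {}
    while len(observed) < BUDGET:
        size = min(batch_size, BUDGET - len(observed))
        for c, p in choose_batch(observed, size):
            observed[(c, p)] = run_experiment(c, p)
    return observed


if __name__ == "__main__":
    results = active_loop()
    print(f"ran {len(results)} of {TOTAL} experiments")
```

The point of the sketch is the structure of the loop (choose a small batch, run it, update what is known, repeat) and the arithmetic of the budget, not the quality of the selection rule.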

Article and author information

Author details

  1. Armaghan W Naik

    Computational Biology Department, Center for Bioimage Informatics, Carnegie Mellon University, Pittsburgh, United States
    Competing interests
    The authors declare that no competing interests exist.
  2. Joshua D Kangas

    Computational Biology Department, Center for Bioimage Informatics, Carnegie Mellon University, Pittsburgh, United States
    Competing interests
    The authors declare that no competing interests exist.
  3. Devin P Sullivan

    Computational Biology Department, Center for Bioimage Informatics, Carnegie Mellon University, Pittsburgh, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Robert F Murphy

    Computational Biology Department, Center for Bioimage Informatics, Carnegie Mellon University, Pittsburgh, United States
    For correspondence
    murphy@cmu.edu
    Competing interests
    The authors declare that no competing interests exist.

Reviewing Editor

  1. Uwe Ohler, Duke, Germany

Version history

  1. Received: July 13, 2015
  2. Accepted: January 28, 2016
  3. Accepted Manuscript published: February 3, 2016 (version 1)
  4. Version of Record published: March 7, 2016 (version 2)

Copyright

© 2016, Naik et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Cite this article

  1. Armaghan W Naik
  2. Joshua D Kangas
  3. Devin P Sullivan
  4. Robert F Murphy
(2016)
Active machine learning-driven experimentation to determine compound effects on protein patterns
eLife 5:e10047.
https://doi.org/10.7554/eLife.10047
