
Dynamic decision policy reconfiguration under outcome uncertainty

  1. Krista Bond (corresponding author)
  2. Kyle Dunovan
  3. Alexis Porter
  4. Jonathan E Rubin
  5. Timothy Verstynen (corresponding author)
  1. Carnegie Mellon University, United States
  2. Northwestern University, United States
  3. University of Pittsburgh, United States
Research Article
Cite this article as: eLife 2021;10:e65540 doi: 10.7554/eLife.65540

Abstract

In uncertain or unstable environments, sometimes the best decision is to change your mind. To shed light on this flexibility, we evaluated how the underlying decision policy adapts when the most rewarding action changes. Human participants performed a dynamic two-armed bandit task that manipulated the certainty in relative reward (conflict) and the reliability of action-outcomes (volatility). Continuous estimates of conflict and volatility contributed to shifts in exploratory states by changing the rate of evidence accumulation (drift rate) and the amount of evidence needed to make a decision (boundary height), respectively. At the trialwise level, following a switch in the optimal choice, the drift rate plummets and the boundary height weakly spikes, leading to a slow exploratory state. We find that the drift rate drives most of this response, with an unreliable contribution of boundary height across experiments. Surprisingly, we find no evidence that pupillary responses are associated with decision policy changes. We conclude that humans show a stereotypical shift in their decision policies in response to environmental changes.
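The link between a lower drift rate, a slightly higher boundary, and slower, more variable choices can be illustrated with a minimal simulation of a two-boundary drift diffusion process. This is only a sketch under stated assumptions: the function names, the Euler step size, and all parameter values are hypothetical and are not taken from the study or its fitted models. The closed-form mean decision time used for comparison is the standard result for an unbiased process that starts midway between the boundaries.

```python
import math
import random


def simulate_ddm(drift, boundary, n_trials=500, dt=0.005, noise=1.0, seed=0):
    """Simulate a two-boundary drift diffusion process (Euler scheme).

    Evidence starts midway between 0 and `boundary` and accumulates until
    it crosses either bound. Returns (mean decision time in seconds,
    proportion of trials ending at the upper bound).
    """
    rng = random.Random(seed)
    step_sd = noise * math.sqrt(dt)  # standard deviation of one noise step
    total_time = 0.0
    upper_hits = 0
    for _ in range(n_trials):
        x = boundary / 2.0
        t = 0.0
        while 0.0 < x < boundary:
            x += drift * dt + rng.gauss(0.0, step_sd)
            t += dt
        total_time += t
        upper_hits += x >= boundary
    return total_time / n_trials, upper_hits / n_trials


def analytic_mean_decision_time(drift, boundary, noise=1.0):
    """Mean decision time for an unbiased process with nonzero drift."""
    return (boundary / (2.0 * drift)) * math.tanh(
        boundary * drift / (2.0 * noise ** 2)
    )


# Illustrative (assumed) parameter settings, not estimates from the paper:
# a stable regime with a high drift rate, and a post-switch regime in which
# the drift rate drops sharply and the boundary rises slightly.
stable = {"drift": 2.0, "boundary": 1.5}
post_switch = {"drift": 0.5, "boundary": 1.8}

for label, params in (("stable", stable), ("post-switch", post_switch)):
    mean_rt, p_upper = simulate_ddm(**params)
    expected = analytic_mean_decision_time(**params)
    print(f"{label}: simulated mean decision time {mean_rt:.3f} s, "
          f"analytic {expected:.3f} s, upper-bound choices {p_upper:.2f}")
```

Under these assumed values, the post-switch setting yields longer decision times and a choice distribution closer to chance, which is the qualitative signature of the slow exploratory state described above. The simulated times run slightly above the analytic values because the discrete time step lets evidence overshoot the boundary.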

Data availability

Behavioral data and their computational derivatives are available at https://github.com/kmbond/dynamic_decision_policy_reconfiguration. Code used to generate figures can be found here: https://github.com/kmbond/dynamic_decision_policy_reconfiguration/tree/master/revised_figure_nbs. Raw pupillometry data (DOI: 10.1184/R1/13543133), the features of the task-evoked pupillometry response (DOI: 10.1184/R1/13543067), and the principal components calculated from those features (DOI: 10.1184/R1/13543160) are available at https://kilthub.cmu.edu/projects/Dynamic_decision_policy_reconfiguration_under_outcome_uncertainty/96116.

Article and author information

Author details

  1. Krista Bond

    Department of Psychology, Carnegie Mellon University, Pittsburgh, United States
    For correspondence
    kbond@andrew.cmu.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID: 0000-0003-1492-6798
  2. Kyle Dunovan

    Department of Psychology, Carnegie Mellon University, Pittsburgh, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID: 0000-0002-7857-5133
  3. Alexis Porter

    Department of Psychology, Northwestern University, Evanston, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Jonathan E Rubin

    Department of Mathematics, University of Pittsburgh, Pittsburgh, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID: 0000-0002-1513-1551
  5. Timothy Verstynen

    Department of Psychology, Carnegie Mellon University, Pittsburgh, United States
    For correspondence
    timothyv@andrew.cmu.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID: 0000-0003-4720-0336

Funding

Air Force Research Laboratory (FA9550-18-1-0251)

  • Krista Bond
  • Timothy Verstynen

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Ethics

Human subjects: Neurologically healthy adults were recruited from the local university population. All procedures were approved by the Carnegie Mellon University Institutional Review Board (Approval Code: 2018_00000195; Funding: Air Force Research Laboratory, Grant Office ID: 180119). All research participants provided informed consent to participate in the study and consent to publish any research findings based on their provided data.

Reviewing Editor

  1. Redmond G O'Connell, Trinity College Dublin, Ireland

Publication history

  1. Received: December 7, 2020
  2. Accepted: December 23, 2021
  3. Accepted Manuscript published: December 24, 2021 (version 1)

Copyright

© 2021, Bond et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 329
    Page views
  • 62
    Downloads
  • 0
    Citations

Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.

