Reinforcement biases subsequent perceptual decisions when confidence is low: a widespread behavioral phenomenon
Abstract
Learning from successes and failures often improves the quality of subsequent decisions. Past outcomes, however, should not influence purely perceptual decisions after task acquisition is complete since these are designed so that only sensory evidence determines the correct choice. Yet, numerous studies report that outcomes can bias perceptual decisions, causing spurious changes in choice behavior without improving accuracy. Here we show that the effects of reward on perceptual decisions are principled: past rewards bias future choices specifically when previous choice was difficult and hence decision confidence was low. We identified this phenomenon in six datasets from four laboratories, across mice, rats, and humans, and sensory modalities from olfaction and audition to vision. We show that this choice-updating strategy can be explained by reinforcement learning models incorporating statistical decision confidence into their teaching signals. Thus, despite being suboptimal from the experimenter’s perspective, confidence-guided reinforcement learning optimizes behavior in uncertain, real-world situations.
Data availability
The data used in this study is available at http://dx.doi.org/10.6084/m9.figshare.4300043
Article and author information
Author details
Funding
Wellcome (106101)
- Armin Lak
Wellcome (213465)
- Armin Lak
National Institutes of Health (R01 MH110404)
- Naoshige Uchida
National Institutes of Health (R01MH097061 and R01DA038209)
- Naoshige Uchida
Wellcome (205093)
- Matteo Carandini
Deutsche Forschungsgemeinschaft (DO 1240/2-1 and DO 1240/3-1)
- Tobias H Donner
RIKEN-CBS
- Emily Hueske
- Susumu Tonegawa
JPB Foundation
- Emily Hueske
- Susumu Tonegawa
Howard Hughes Medical Institute
- Emily Hueske
- Susumu Tonegawa
German Academic Exchange Service
- Anne E Urai
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Ethics
Animal experimentation: The experimental procedures were approved by Institutional committees at Cold Spring Harbor Laboratory (for experiments on rats), MIT and Harvard University (for mice auditory experiments) and were in accordance with National Institute of Health standards (project ID: 18-14-11-08-1). Experiments on mice visual decisions were approved by the home Office of the United Kingdom (license 70/8021). Experiments in humans were approved by the ethics committee at the University of Amsterdam (project ID: 2014-BC-3376).
Human subjects: The ethics committee at the University of Amsterdam approved the study, and all observers gave their informed consent.project ID: 2014-BC-3376
Copyright
© 2020, Lak et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 7,765
- views
-
- 1,169
- downloads
-
- 91
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Neuroscience
As the global population ages, the prevalence of neurodegenerative disorders is fast increasing. This neurodegeneration as well as other central nervous system (CNS) injuries cause permanent disabilities. Thus, generation of new neurons is the rosetta stone in contemporary neuroscience. Glial cells support CNS homeostasis through evolutionary conserved mechanisms. Upon damage, glial cells activate an immune and inflammatory response to clear the injury site from debris and proliferate to restore cell number. This glial regenerative response (GRR) is mediated by the neuropil-associated glia (NG) in Drosophila, equivalent to vertebrate astrocytes, oligodendrocytes (OL), and oligodendrocyte progenitor cells (OPCs). Here, we examine the contribution of NG lineages and the GRR in response to injury. The results indicate that NG exchanges identities between ensheathing glia (EG) and astrocyte-like glia (ALG). Additionally, we found that NG cells undergo transdifferentiation to yield neurons. Moreover, this transdifferentiation increases in injury conditions. Thus, these data demonstrate that glial cells are able to generate new neurons through direct transdifferentiation. The present work makes a fundamental contribution to the CNS regeneration field and describes a new physiological mechanism to generate new neurons.
-
- Neuroscience
Memory consolidation during sleep depends on the interregional coupling of slow waves, spindles, and sharp wave-ripples (SWRs), across the cortex, thalamus, and hippocampus. The reuniens nucleus of the thalamus, linking the medial prefrontal cortex (mPFC) and the hippocampus, may facilitate interregional coupling during sleep. To test this hypothesis, we used intracellular, extracellular unit and local field potential recordings in anesthetized and head restrained non-anesthetized cats as well as computational modelling. Electrical stimulation of the reuniens evoked both antidromic and orthodromic intracellular mPFC responses, consistent with bidirectional functional connectivity between mPFC, reuniens and hippocampus in anesthetized state. The major finding obtained from behaving animals is that at least during NREM sleep hippocampo-reuniens-mPFC form a functional loop. SWRs facilitate the triggering of thalamic spindles, which later reach neocortex. In return, transition to mPFC UP states increase the probability of hippocampal SWRs and later modulate spindle amplitude. During REM sleep hippocampal theta activity provides periodic locking of reuniens neuronal firing and strong crosscorrelation at LFP level, but the values of reuniens-mPFC crosscorrelation was relatively low and theta power at mPFC was low. The neural mass model of this network demonstrates that the strength of bidirectional hippocampo-thalamic connections determines the coupling of oscillations, suggesting a mechanistic link between synaptic weights and the propensity for interregional synchrony. Our results demonstrate the presence of functional connectivity in hippocampo-thalamo-cortical network, but the efficacy of this connectivity is modulated by behavioral state.