Reinforcement biases subsequent perceptual decisions when confidence is low: a widespread behavioral phenomenon

University of Oxford, United Kingdom
MIT, United States
Doshisha University, Japan
Cold Spring Harbor Laboratory, United States
University Medical Center Hamburg-Eppendorf, Germany
University College London, United Kingdom
Massachusetts Institute of Technology, United States
Harvard University, United States

Apr 14, 2020

Open access
Copyright information

Abstract
Data availability
Article and author information
Metrics

Abstract

Learning from successes and failures often improves the quality of subsequent decisions. Past outcomes, however, should not influence purely perceptual decisions after task acquisition is complete since these are designed so that only sensory evidence determines the correct choice. Yet, numerous studies report that outcomes can bias perceptual decisions, causing spurious changes in choice behavior without improving accuracy. Here we show that the effects of reward on perceptual decisions are principled: past rewards bias future choices specifically when previous choice was difficult and hence decision confidence was low. We identified this phenomenon in six datasets from four laboratories, across mice, rats, and humans, and sensory modalities from olfaction and audition to vision. We show that this choice-updating strategy can be explained by reinforcement learning models incorporating statistical decision confidence into their teaching signals. Thus, despite being suboptimal from the experimenter’s perspective, confidence-guided reinforcement learning optimizes behavior in uncertain, real-world situations.

Data availability

The data used in this study is available at http://dx.doi.org/10.6084/m9.figshare.4300043

The following previously published data sets were used

(2017) Pupil-linked arousal is driven by decision uncertainty and alters serial choice bias
Figshare, 4300043.

http://dx.doi.org/10.6084/m9.figshare.4300043

Article and author information

Author details

Armin Lak

Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom

For correspondence
armin.lak@dpag.ox.ac.uk

Competing interests
No competing interests declared.

"This ORCID iD identifies the author of this article:" 0000-0003-1926-5458
Emily Hueske

Picower Institute, MIT, Cambridge, United States

Competing interests
No competing interests declared.
Junya Hirokawa

Graduate School of Brain Science, Doshisha University, Kyotanabe, Japan

Competing interests
No competing interests declared.

"This ORCID iD identifies the author of this article:" 0000-0003-1238-5713
Paul Masset

Cold Spring Harbor Laboratory, Cold Spring Harbor, United States

Competing interests
No competing interests declared.

"This ORCID iD identifies the author of this article:" 0000-0003-2001-7515
Torben Ott

Cold Spring Harbor Laboratory, Cold Spring Harbor, United States

Competing interests
No competing interests declared.
Anne E Urai

Cold Spring Harbor Laboratory, Cold Spring Harbor, United States

Competing interests
No competing interests declared.

"This ORCID iD identifies the author of this article:" 0000-0001-5270-6513
Tobias H Donner

Department of Neurophysiology and Pathophysiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany

Competing interests
Tobias H Donner, Reviewing editor, eLife.

"This ORCID iD identifies the author of this article:" 0000-0002-7559-6019
Matteo Carandini

UCL Institute of Ophthalmology, University College London, London, United Kingdom

Competing interests
No competing interests declared.

"This ORCID iD identifies the author of this article:" 0000-0003-4880-7682
Susumu Tonegawa

Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, United States

Competing interests
No competing interests declared.
Naoshige Uchida

Center for Brain Science, Harvard University, Cambridge, United States

Competing interests
Naoshige Uchida, Reviewing editor, eLife.

"This ORCID iD identifies the author of this article:" 0000-0002-5755-9409
Adam Kepecs

Cold Spring Harbor Laboratory, Cold Spring Harbor, United States

For correspondence
akepecs@wustl.edu

Competing interests
No competing interests declared.

Funding

Wellcome (106101)

Armin Lak

Wellcome (213465)

Armin Lak

National Institutes of Health (R01 MH110404)

Naoshige Uchida

National Institutes of Health (R01MH097061 and R01DA038209)

Naoshige Uchida

Wellcome (205093)

Matteo Carandini

Deutsche Forschungsgemeinschaft (DO 1240/2-1 and DO 1240/3-1)

Tobias H Donner

RIKEN-CBS

Emily Hueske
Susumu Tonegawa

JPB Foundation

Emily Hueske
Susumu Tonegawa

Howard Hughes Medical Institute

Emily Hueske
Susumu Tonegawa

German Academic Exchange Service

Anne E Urai

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Ethics

Animal experimentation: The experimental procedures were approved by Institutional committees at Cold Spring Harbor Laboratory (for experiments on rats), MIT and Harvard University (for mice auditory experiments) and were in accordance with National Institute of Health standards (project ID: 18-14-11-08-1). Experiments on mice visual decisions were approved by the home Office of the United Kingdom (license 70/8021). Experiments in humans were approved by the ethics committee at the University of Amsterdam (project ID: 2014-BC-3376).

Human subjects: The ethics committee at the University of Amsterdam approved the study, and all observers gave their informed consent.project ID: 2014-BC-3376

Copyright

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.