Human complex exploration strategies are enriched by noradrenaline-modulated heuristics
Abstract
An exploration-exploitation trade-off, the arbitration between sampling a lesser-known against a known rich option, is thought to be solved using computationally demanding exploration algorithms. Given known limitations in human cognitive resources, we hypothesised the presence of additional cheaper strategies. We examined for such heuristics in choice behaviour where we show this involves a value-free random exploration, that ignores all prior knowledge, and a novelty exploration that targets novel options alone. In a double-blind, placebo-controlled drug study, assessing contributions of dopamine (400mg amisulpride) and noradrenaline (40mg propranolol), we show that value-free random exploration is attenuated under the influence of propranolol, but not under amisulpride. Our findings demonstrate that humans deploy distinct computationally cheap exploration strategies and where value-free random exploration is under noradrenergic control.
Data availability
All necessary resources are publicly available at: https://github.com/MagDub.
Article and author information
Author details
Funding
Max-Planck-Gesellschaft
- Magda Dubois
Wellcome Sir Hendry Dale Fellowship (211155/Z/18/Z)
- Tobias U Hauser
Jacobs Foundation (2017-1261-04)
- Tobias U Hauser
Wellcome Trust Investigator Award (098362/Z/12/Z)
- Raymond J Dolan
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Reviewing Editor
- Thorsten Kahnt, Northwestern University, United States
Ethics
Human subjects: The study was approved by the UCL research committee (REC No 6218/002) and all subjects provided written informed consent
Version history
- Received: June 12, 2020
- Accepted: January 3, 2021
- Accepted Manuscript published: January 4, 2021 (version 1)
- Version of Record published: January 19, 2021 (version 2)
Copyright
© 2021, Dubois et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 3,025
- views
-
- 392
- downloads
-
- 35
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Neuroscience
Subpopulations of neurons in the subthalamic nucleus have distinct activity patterns that relate to the three hypotheses of the Drift Diffusion Model.
-
- Neuroscience
The placebo and nocebo effects highlight the importance of expectations in modulating pain perception, but in everyday life we don’t need an external source of information to form expectations about pain. The brain can learn to predict pain in a more fundamental way, simply by experiencing fluctuating, non-random streams of noxious inputs, and extracting their temporal regularities. This process is called statistical learning. Here, we address a key open question: does statistical learning modulate pain perception? We asked 27 participants to both rate and predict pain intensity levels in sequences of fluctuating heat pain. Using a computational approach, we show that probabilistic expectations and confidence were used to weigh pain perception and prediction. As such, this study goes beyond well-established conditioning paradigms associating non-pain cues with pain outcomes, and shows that statistical learning itself shapes pain experience. This finding opens a new path of research into the brain mechanisms of pain regulation, with relevance to chronic pain where it may be dysfunctional.