Value representations in the rodent orbitofrontal cortex drive learning, not choice
Abstract
Humans and animals make predictions about the rewards they expect to receive in different situations. In formal models of behavior, these predictions are known as value representations, and they play two very different roles. First, they drive choice: the expected values of available options are compared to one another, and the best option is selected. Second, they support learning: expected values are compared to rewards actually received, and future expectations are updated accordingly. Whether these different functions are mediated by different neural representations remains an open question. Here we employ a recently developed multi-step task for rats that computationally separates learning from choosing. We investigate the role of value representations in the rodent orbitofrontal cortex, a key structure for value-based cognition. Electrophysiological recordings and optogenetic perturbations indicate that these representations do not directly drive choice. Instead, they signal expected reward information to a learning process elsewhere in the brain that updates choice mechanisms.
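The two roles of expected value described in the abstract can be illustrated with a minimal sketch. This is not the computational model used in the paper; the function names `choose_best` and `update_value`, the learning rate `alpha`, and the simple delta-rule update are assumptions chosen only to make the distinction between choosing and learning concrete.

```python
def choose_best(values):
    """Choice role: compare the expected values of the options and
    return the index of the largest one (greedy selection)."""
    return max(range(len(values)), key=lambda i: values[i])


def update_value(values, option, reward, alpha=0.1):
    """Learning role: compare the expected value of the chosen option
    to the reward actually received, then move the expectation a
    fraction `alpha` (hypothetical learning rate) toward the reward."""
    prediction_error = reward - values[option]
    updated = list(values)  # copy so the caller's list is not mutated
    updated[option] += alpha * prediction_error
    return updated


# Illustrative use: two options with made-up expected values.
expected = [0.2, 0.8]
picked = choose_best(expected)                       # choice uses the values
expected = update_value(expected, picked, reward=0.0)  # learning updates them
```

In this sketch the same list of values feeds both functions. The question the abstract raises is whether the brain likewise uses one representation for both roles, or whether a region could supply values to the learning step without taking part in the choice step.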
Data availability
Data collected for the purpose of this paper will be posted on Figshare upon acceptance. Software used to analyze the data will be made available as a GitHub release. Software used for training rats and design files for constructing behavioral rigs are available on the Brody lab website.
Article and author information
Funding
National Institutes of Health (T-32 MH065214)
- Kevin J Miller
- Matthew M Botvinick
- Carlos D Brody
Princeton University (Harold W Dodds Fellowship)
- Kevin J Miller
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Ethics
Animal experimentation: All experimental procedures were performed in strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health, and were approved by the Princeton University Institutional Animal Care and Use Committee (protocol #1853).
Copyright
© 2022, Miller et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- 4,609 views
- 904 downloads
- 34 citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.