Signed and unsigned reward prediction errors dynamically enhance learning and memory

  1. Nina Rouhani  Is a corresponding author
  2. Yael Niv
  1. Princeton University, United States

Abstract

Memory helps guide behavior, but which experiences from the past are prioritized? Classic models of learning posit that events associated with unpredictable outcomes as well as, paradoxically, predictable outcomes, deploy more attention and learning for those events. Here, we test reinforcement learning and subsequent memory for those events, and treat signed and unsigned reward prediction errors (RPEs), experienced at the reward-predictive cue or reward outcome, as drivers of these two seemingly contradictory signals. By fitting reinforcement learning models to behavior, we find that both RPEs contribute to learning by modulating a dynamically changing learning rate. We further characterize the effects of these RPE signals on memory, and show that both signed and unsigned RPEs enhance memory, in line with midbrain dopamine and locus-coeruleus modulation of hippocampal plasticity, thereby reconciling separate findings in the literature.

Data availability

All data files and code for models, analysis and figures are publicly available at https://github.com/ninarouhani/2021_RouhaniNiv

The following data sets were generated

Article and author information

Author details

  1. Nina Rouhani

    Princeton Neuroscience Institute, Princeton University, Princeton, United States
    For correspondence
    nrouhani@princeton.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-2814-0462
  2. Yael Niv

    Princeton Neuroscience Institute, Princeton University, Princeton, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-0259-8371

Funding

Army Research Office (W911NF-14-1-0101)

  • Yael Niv

National Institute of Mental Health (R01MH098861)

  • Yael Niv

National Science Foundation (Graduate Student Fellowship)

  • Nina Rouhani

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Ethics

Human subjects: We obtained informed consent online; procedures were approved by Princeton University's Institutional Review Board (IRB #4452).

Copyright

© 2021, Rouhani & Niv

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 5,939
    views
  • 789
    downloads
  • 55
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Nina Rouhani
  2. Yael Niv
(2021)
Signed and unsigned reward prediction errors dynamically enhance learning and memory
eLife 10:e61077.
https://doi.org/10.7554/eLife.61077

Share this article

https://doi.org/10.7554/eLife.61077

Further reading

    1. Neuroscience
    Jan H Kirchner, Lucas Euler ... Julijana Gjorgjieva
    Research Article

    Dendritic branching and synaptic organization shape single-neuron and network computations. How they emerge simultaneously during brain development as neurons become integrated into functional networks is still not mechanistically understood. Here, we propose a mechanistic model in which dendrite growth and the organization of synapses arise from the interaction of activity-independent cues from potential synaptic partners and local activity-dependent synaptic plasticity. Consistent with experiments, three phases of dendritic growth – overshoot, pruning, and stabilization – emerge naturally in the model. The model generates stellate-like dendritic morphologies that capture several morphological features of biological neurons under normal and perturbed learning rules, reflecting biological variability. Model-generated dendrites have approximately optimal wiring length consistent with experimental measurements. In addition to establishing dendritic morphologies, activity-dependent plasticity rules organize synapses into spatial clusters according to the correlated activity they experience. We demonstrate that a trade-off between activity-dependent and -independent factors influences dendritic growth and synaptic location throughout development, suggesting that early developmental variability can affect mature morphology and synaptic function. Therefore, a single mechanistic model can capture dendritic growth and account for the synaptic organization of correlated inputs during development. Our work suggests concrete mechanistic components underlying the emergence of dendritic morphologies and synaptic formation and removal in function and dysfunction, and provides experimentally testable predictions for the role of individual components.

    1. Neuroscience
    Christian Thome, Jan Maximilian Janssen ... Maren Engelhardt
    Tools and Resources

    The axon initial segment (AIS) constitutes not only the site of action potential initiation, but also a hub for activity-dependent modulation of output generation. Recent studies shedding light on AIS function used predominantly post-hoc approaches since no robust murine in vivo live reporters exist. Here, we introduce a reporter line in which the AIS is intrinsically labeled by an ankyrin-G-GFP fusion protein activated by Cre recombinase, tagging the native Ank3 gene. Using confocal, superresolution, and two-photon microscopy as well as whole-cell patch-clamp recordings in vitro, ex vivo, and in vivo, we confirm that the subcellular scaffold of the AIS and electrophysiological parameters of labeled cells remain unchanged. We further uncover rapid AIS remodeling following increased network activity in this model system, as well as highly reproducible in vivo labeling of AIS over weeks. This novel reporter line allows longitudinal studies of AIS modulation and plasticity in vivo in real-time and thus provides a unique approach to study subcellular plasticity in a broad range of applications.