Memory over 24 hours was impaired in Parkinson's patients off, rather than on, dopaminergic medication during reinforcement learning, whereas dopamine did not affect positive and negative reinforcement, in contrast to previous studies.
fMRI evidence for off-task replay predicts subsequent replanning behavior in humans, suggesting that learning from simulated experience during replay helps update past policies in reinforcement learning.
The human brain is capable of implementing inverse reinforcement learning, where an observer infers the hidden reward structure of a decision problem solely through observing another individual take actions.
A mathematical model built around the assumption that the desire to maintain internal homeostasis drives the behavior of animals, by affecting their learning processes, can explain many real-world behaviors, including some that might otherwise appear irrational.
Confidence-dependent reinforcement learning is active and produces trial-to-trial choice updating even in well-learned perceptual decisions without explicit reward biases, across species and sensory modalities.
The finding that fly maggots, equipped with only 10,000 neurons, process reinforcement not only by value but also by specific quality reveals a basic operating principle of brains and challenges current models of memory organization.