The rational application of heuristic learning strategies and satisficing goals accounted for near-optimal decisions that combined reward and noisy visual information by well-trained monkeys.
Confidence-dependent reinforcement learning is active and produces trial-to-trial choice updating even in well-learned perceptual decisions without explicit reward biases, across species and sensory modalities.
Anne E Urai, Jan Willem de Gee ... Tobias H Donner
Choice history signals bias the interpretation of current sensory input, akin to shifting endogenous attention toward (or away from) the previously selected interpretation.
Residual activity from previous trials in a biophysical decision network model causes biases in choice behavior such that a previous response is more likely to be repeated.
In monkeys making decisions that balance noisy evidence and reward expectation, frontal cortical and caudate activity reflect different computational components that are related to the monkeys' strategy.