The hippocampus supports deliberation during value-based decisions

Abstract

Choosing between two items involves deliberation and comparison of the features of each item and its value. Such decisions take more time when choosing between options of similar value, possibly because these decisions require more evidence, but the mechanisms involved are not clear. We propose that the hippocampus supports deliberation about value, given its well-known role in prospection and relational cognition. We assessed the role of the hippocampus in deliberation in two experiments. First, using fMRI in healthy participants, we found that BOLD activity in the hippocampus increased as a function of deliberation time. Second, we found that patients with hippocampal damage exhibited more stochastic choices and longer reaction times than controls, possibly due to their failure to construct value-based or internal evidence during deliberation. Both sets of results were stronger in value-based decisions compared to perceptual decisions.

https://doi.org/10.7554/eLife.46080.001

Introduction

Some decisions involve more deliberation than others. Even seemingly simple decisions such as those that involve preferences between a pair of familiar items take more time when they involve a choice between options of similar subjective value. This simple observation holds across many kinds of decisions, whether they are based on perception of the environment—is the apple green or red? (Cassey et al., 2013; Gold and Shadlen, 2007; Ratcliff, 2002; Usher and McClelland, 2001)—or on internal values and preferences—do I prefer a green apple or a red one? (Basten et al., 2010; Hunt et al., 2012; Krajbich et al., 2010; Milosavljevic et al., 2010). One explanation for why such decisions take more time is that a commitment to a choice depends on the accumulation of evidence to a threshold, and when the evidence is weaker, more samples are required to reach such a threshold (Krajbich et al., 2010; Milosavljevic et al., 2010). This idea has been studied extensively in perceptual decisions about dynamic stimuli (e.g. moving dots) for which more time clearly provides more samples of external evidence, and therefore can improve the accuracy of the decision (Britten et al., 1996; Britten et al., 1993; Hanks et al., 2015; Mazurek et al., 2003; Newsome and Paré, 1988; Salzman et al., 1990). It is less clear why the same framework would apply to value-based decisions, which depend on internal evidence (Krajbich et al., 2010; Milosavljevic et al., 2010). In such cases, it is not known what the source of the evidence is and why more samples should be required to decide between options that are close in value.

We sought to understand the processes involved in deliberation when making value-based decisions. Our central hypothesis is that the hippocampus plays a key role in this deliberation process, contributing to the comparison between items and the construction of internal samples of evidence bearing on the decision.

This hypothesis is guided by several observations. First, extensive research demonstrates that the hippocampus is necessary for detailed and vivid prospection about future events (Addis and Schacter, 2008; Buckner, 2010; Hassabis et al., 2007; Klein et al., 2002; Race et al., 2011; Schacter et al., 2007). This sort of prospection is likely to guide value-based decisions because it allows a decision-maker to imagine the detailed outcome of each choice option. Second, and more broadly, the hippocampus is known to contribute to relational encoding (Cohen and Eichenbaum, 1993; Horner and Burgess, 2013), a term coined by Cohen and Eichenbaum (1993) to capture the essential role of the hippocampus across many cognitive processes that involve flexible comparison and association between distinct items and features (for reviews, see Barry and Maguire, 2019; Davachi, 2006; Eichenbaum, 2000; Eichenbaum, 2018; Konkel and Cohen, 2009; Palombo et al., 2015a; Shohamy and Turk-Browne, 2013). This relational function of the hippocampus is thought to underlie its well-known role in episodic memory, but the comparison of multiple dimensions of items and their relation to each other is also likely to help guide deliberation during decision making by supplying internal evidence about each option. Recent studies have indeed linked hippocampal-based mnemonic processes to choice behavior by demonstrating that the hippocampus is involved in decisions that explicitly depend on memory by requiring participants to use novel associations acquired in the experiment (Barron et al., 2013; Gluth et al., 2015; Wimmer and Shohamy, 2012). However, a critical open question remains about whether the hippocampus also contributes to seemingly simple decisions—between two highly familiar items—without the explicit demand to use memory.

We conducted two experiments to address this question. First, we conducted an fMRI study in healthy young participants while they made decisions based on well-established subjective value (fMRI; Experiment 1). We reasoned that if the hippocampus supports deliberation, then longer decision times should be related to more engagement of the hippocampus. Second, to test whether the hippocampus plays a causal role in resolving value-based decisions, we tested amnesic patients with damage to the hippocampus and surrounding medial temporal lobe (MTL) as well as age-, education-, and verbal IQ-matched healthy controls (Patients; Experiment 2). Although a choice between two familiar items is not typically thought to depend on the hippocampal memory system (Bartra et al., 2013; Padoa-Schioppa and Assad, 2006; Platt and Plassmann, 2014; Rangel and Clithero, 2014; Rangel et al., 2008), we reasoned that amnesic patients may nonetheless show differences in the way they deliberate about simple value-based decisions. Amnesic patients could take less time because their decisions involve less deliberation, or they could take more time because they try unsuccessfully to deliberate using evidence derived from relational mechanisms. In the latter case, the extra time would not improve their decisions.

In both experiments, participants performed a value-based decision task in which they made a series of choices between two familiar food items (Figure 1). The subjective value of each individual item was determined for each participant using an auction procedure in advance (see Materials and methods), so that we could systematically vary the difference in value between the two items (i.e. ∆Value) during the decision task (see also Grueschow et al., 2015; Krajbich et al., 2010; Milosavljevic et al., 2010; Polanía et al., 2015). The same participants also took part in a control condition in which they made perceptual decisions about the dominant color of a dynamic random dot display (Figure 1 and Figure 1—video 1). The perceptual comparison task solicits the same choice and reaction time behavior but is based on external sensory input.

Figure 1 with 1 supplement

Experimental tasks.

In Experiment 1, healthy participants were scanned with fMRI during three different tasks: a value-based decision task (top), a perceptual decision task (middle), and a memory recognition task (bottom). In the value-based decision task, participants were presented with 150 pairs of foods that differed on *∆Value* (based on a pre-task auction procedure for rating the items; see Materials and methods). Participants were told to choose the item that they preferred and that their choice on a randomly selected trial would be honored at the end of the experiment. In the perceptual decision task, participants were presented with 210 trials of a cloud of flickering blue and yellow dots that varied in the proportion of blue versus yellow (color coherence). Participants were told to determine whether the display was more blue or more yellow. In the recognition memory localizer task, participants underwent a standard recognition task using incidental encoding of everyday objects: first, they rated 100 objects (outside of the scanner); 48 hr later they were presented with a surprise memory test in the scanner, in which ‘old’ objects were intermixed with 100 ‘new’ objects, one at a time, and participants were asked to indicate whether each object was ‘old’ or ‘new’. In Experiment 2, amnesic patients with MTL damage and healthy controls performed variants of the value-based and perceptual decision tasks (see Materials and methods).

https://doi.org/10.7554/eLife.46080.002

In Experiment 1, we found that decision time in the value-based decision task was longer when the choice options were closer in value, as expected (Krajbich et al., 2010; Milosavljevic et al., 2010; Polanía et al., 2015). We also found that reaction times correlated with hippocampal BOLD activity, and this effect was localized to regions of the hippocampus that showed activity related to memory retrieval, independently identified in the same participants. In Experiment 2, we found that amnesic patients were somewhat more stochastic and much slower when making value-based decisions. Importantly, despite parallel behavioral findings in value-based decisions and perceptual decisions in the healthy controls, both the hippocampal BOLD effects and the impairments in patients were selective to the value-based decision task. Together, these findings establish a critical role for the hippocampus in value-based decisions about familiar choice options.

Results

We conducted two experiments to test the mechanisms underlying deliberation in value-based decisions. In the first experiment, we scanned healthy young participants with functional MRI while they performed value-based and perceptual decision tasks. In the second experiment, we tested behavior in amnesic patients with damage to the hippocampus and surrounding MTL as well as age-, education-, and verbal IQ-matched healthy control participants on slightly modified versions of these two decision tasks (see Materials and methods).

Experiment 1: functional MRI

Behavior in both decision tasks conforms to sequential sampling models

On the perceptual decision task, healthy young participants (n = 30) made more accurate decisions when the color was more biased toward blue or yellow (Figure 2A, top) and reaction times (RT) were longer for decisions between options that were more difficult to discriminate (i.e. color coherence near zero, Figure 2A, bottom). Similarly, on the value-based decision task, participants made decisions more consistent with their subjective valuation when ∆Value was larger (Figure 2B, top). RTs were longer for decisions between options for which the magnitude of ∆Value (|∆Value|) was smaller (Figure 2B, bottom). For both the perceptual and the value-based tasks, choices and RT were well described by drift diffusion models (Figure 2, solid lines). This observation is consistent with prior work (Krajbich et al., 2015; Ratcliff and McKoon, 2008; Shadlen and Kiani, 2013) and with the proposal that both types of decisions arise through a process of sequential sampling that stops when the accumulation of evidence satisfies a threshold or bound. The choice functions and range of RT were comparable in the two tasks, as were the goodness of fits (for model parameter estimates, see Figure 2—source data 1; for individual participant fits, see Figure 2—figure supplement 1). Some of the differences between the fits, apparent by eye, are attributed to the different scales of evidence strength in the two tasks (see Figure 2—figure supplement 2). We considered simpler parameterizations of the model, but the full model presented here produced a better fit compared to a model with no power law (BIC = 19.45), and a better fit compared to a model with no power law and flat bounds (BIC = 168.45).
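
The relationship between evidence strength, accuracy, and RT described above can be illustrated with a minimal simulation. This is a sketch under simplifying assumptions, not the model fitted in the paper: it uses flat symmetric bounds, a fixed non-decision time, and a drift that is set directly, whereas the fitted model includes collapsing bounds, variable non-decision time, and a power law on stimulus strength. All function names and parameter values here are hypothetical.

```python
import math
import random


def simulate_ddm(drift, bound=1.0, dt=0.005, t_nd=0.3, rng=None, max_t=20.0):
    """Simulate one trial of a drift diffusion process with flat, symmetric bounds.

    Returns (choice, rt): choice is +1 if the process ends above zero and -1
    otherwise; rt is the decision time plus a fixed non-decision time t_nd.
    """
    rng = rng or random.Random()
    x, t = 0.0, 0.0
    step_sd = math.sqrt(dt)  # unit-variance diffusion noise per second
    while abs(x) < bound and t < max_t:
        x += drift * dt + step_sd * rng.gauss(0.0, 1.0)
        t += dt
    return (1 if x > 0 else -1), t + t_nd


def summarize(drift, n_trials=400, seed=0):
    """Mean RT and proportion of choices in the direction of the drift."""
    rng = random.Random(seed)
    trials = [simulate_ddm(drift, rng=rng) for _ in range(n_trials)]
    mean_rt = sum(rt for _, rt in trials) / n_trials
    p_correct = sum(choice == 1 for choice, _ in trials) / n_trials
    return mean_rt, p_correct


# Weak evidence (small drift) versus strong evidence (large drift).
rt_weak, acc_weak = summarize(drift=0.2)
rt_strong, acc_strong = summarize(drift=2.0)
print(f"weak:   mean RT = {rt_weak:.2f} s, accuracy = {acc_weak:.2f}")
print(f"strong: mean RT = {rt_strong:.2f} s, accuracy = {acc_strong:.2f}")
```

In this simplified setting, weaker evidence yields slower and less accurate choices, which is the qualitative pattern shown for both tasks in Figure 2.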

Figure 2 with 3 supplements

Choices between options that are similar take more time for both perceptual and value-based decisions in Experiment 1.

Behavioral results from 30 young healthy participants for (A) perceptual and (B) value-based decisions. (A) Proportion of blue choices (top) and mean RT (bottom) plotted as a function of signed color coherence (the logarithm of the odds that a dot is plotted as blue). (B) Proportion of right item preference (top) and mean RT (bottom) plotted as a function of value difference (the subjective value of the item on the right side of the screen minus the subjective value of the item on the left) binned into eleven levels. Gray symbols are means (error bars are s.e.m.); solid black lines are fits to drift diffusion models. See Figure 2—figure supplement 1 for fits to data from individual participants. See Figure 2—figure supplement 3 for parameter recovery analysis.
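
For readers working with the trial-level data, the signed color coherence defined in this caption can be computed from the probability that a dot is blue. The sketch below assumes the natural logarithm, which the caption does not specify, and the function names are hypothetical.

```python
import math


def signed_coherence(p_blue):
    """Log odds that a dot is drawn blue; 0 when blue and yellow are equally likely."""
    return math.log(p_blue / (1.0 - p_blue))


def p_blue_from_coherence(coherence):
    """Inverse mapping (logistic function) from log odds back to probability."""
    return 1.0 / (1.0 + math.exp(-coherence))


for p in (0.5, 0.6, 0.9):
    c = signed_coherence(p)
    print(f"p_blue = {p:.2f} -> coherence = {c:+.3f} -> p_blue = {p_blue_from_coherence(c):.2f}")
```

Positive values indicate displays biased toward blue and negative values indicate displays biased toward yellow, so the magnitude serves as the measure of evidence strength.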

https://doi.org/10.7554/eLife.46080.004

Figure 2—source code 1 Jupyter notebook with analysis code and output for analyses performed on data from Experiment 1.: https://doi.org/10.7554/eLife.46080.008
Download elife-46080-fig2-code1-v2.ipynb
Figure 2—source data 1 Parameter estimates and goodness of fit measures for Experiment 1. κ is the drift rate. B₀ is the initial height of the bound. B_del is the delay before the bound starts decreasing. B₂ is the coefficient of the exponential term that governs the bound decrease. t_nd is the non-decision time. σ_tnd is the standard deviation of the non-decision time. μ₀ is the bias in drift rate. Plaw is the coefficient of the power law applied to the stimulus strength (color coherence for perceptual and ∆Value for value-based). NLL is negative log-likelihood of the parameters given the choice and RT data. R² choices is the McFadden pseudo-R² for choice data given color coherence (for perceptual) or ∆Value (for value-based). R² RT is the R² for RT data given color coherence (for perceptual) or ∆Value (for value-based).: https://doi.org/10.7554/eLife.46080.009
Download elife-46080-fig2-data1-v2.xlsx
Figure 2—source data 2 Trial-level data for the perceptual task in Experiment 1. The file contains seven columns: subject ID, signed color coherence, $p_{blue}$, response ('#3'=left, '$4'), reaction time, button order (one means blue requires a left response, two means blue requires a right response), and whether the participant chose blue.: https://doi.org/10.7554/eLife.46080.010
Download elife-46080-fig2-data2-v2.csv
Figure 2—source data 3 Trial-level data for the value-based task in Experiment 1. The file contains eight columns; subject ID, reaction time, whether the participant chose the item on the right side of the screen, the value placed on the item on the left side of the screen, the value placed on the item on the right side of the screen, the name of the image that appeared on the left, the name of the image that appeared on the right, and the participant’s response ('#3'=left, '$4').: https://doi.org/10.7554/eLife.46080.011
Download elife-46080-fig2-data3-v2.csv

Timing of value-based decisions is related to brain correlates of memory

We first conducted a whole-brain analysis to identify regions in the brain that show (i) an effect of RT: a correlation between RT and BOLD activity for the value-based task more so than for the perceptual task, and (ii) a memory effect: greater BOLD activity for successful retrieval of object memories (using the separate object-memory localizer task, see Materials and methods, Figure 3—figure supplement 1 and Figure 3—source data 1). Each of these analyses of the fMRI data (RT; memory retrieval) identified largely separate networks of brain regions (Figure 3—figure supplement 1 and Figure 3—figure supplement 3; Stark and Squire, 2001; Yarkoni et al., 2009). Critically, however, both showed significant effects in the hippocampus and, as shown in Figure 3 (and Figure 3—source data 2), the conjunction of these two effects revealed significant shared BOLD activity in the hippocampus. BOLD activity in memory-related hippocampal regions was more positively correlated with RT for value-based decisions than perceptual decisions, consistent with our hypothesis that deliberation associated with resolving preference relies on memory-related hippocampal mechanisms.

Figure 3 with 5 supplements

Deliberation time during value-based decisions is related to activation in the hippocampus.

The figure shows a representative slice at the level of the hippocampus. The map exploits all three tasks and shows a comparison of the effect of trial-by-trial RT on value-based decisions with perceptual decisions, localized (with a conjunction analysis) to regions of the brain that also show a memory-retrieval effect. The full map can be viewed at https://neurovault.org/collections/BOWMEEOR/images/56727. This effect in the hippocampus was replicated with a separate analysis controlling for potential confounds (e.g. mean value across items in a pair; Figure 3—figure supplement 3D). Coordinates reported in standard MNI space. Heatmap color bars range from z-stat = 2.3 to 3.2. The map was cluster corrected for familywise error rate at a whole-brain level with an uncorrected cluster-forming threshold of z = 2.3 and corrected extent threshold of p<0.05.

https://doi.org/10.7554/eLife.46080.012

Figure 3—source data 1 Activation table for map in Figure 3—figure supplement 1; successful memory retrieval: hits > correct rejections.: https://doi.org/10.7554/eLife.46080.019
Download elife-46080-fig3-data1-v2.xlsx
Figure 3—source data 2 Activation table for map in Figure 3; conjunction between RT effect on BOLD for value-based greater than perceptual with effect of successful memory recognition.: https://doi.org/10.7554/eLife.46080.018
Download elife-46080-fig3-data2-v2.xlsx
Figure 3—source data 3 Activation table for map in Figure 3—figure supplement 2A; overall main effect of value-based greater than perceptual decisions.: https://doi.org/10.7554/eLife.46080.020
Download elife-46080-fig3-data3-v2.xlsx
Figure 3—source data 4 Activation table for map in Figure 3—figure supplement 2B; the effect of RT on BOLD for value-based greater than perceptual decisions, restricted to trials for which the range in RT was matched between the two decision tasks.: https://doi.org/10.7554/eLife.46080.021
Download elife-46080-fig3-data4-v2.xlsx
Figure 3—source data 5 Activation table for map in Figure 3—figure supplement 3A; effect of value-based RT on BOLD.: https://doi.org/10.7554/eLife.46080.022
Download elife-46080-fig3-data5-v2.xlsx
Figure 3—source data 6 Activation table for map in Figure 3—figure supplement 3B; effect of perceptual RT on BOLD.: https://doi.org/10.7554/eLife.46080.023
Download elife-46080-fig3-data6-v2.xlsx
Figure 3—source data 7 Activation table for map in Figure 3—figure supplement 3C; value-based RT > perceptual RT.: https://doi.org/10.7554/eLife.46080.024
Download elife-46080-fig3-data7-v2.xlsx
Figure 3—source data 8 Activation table for maps in Figure 3—figure supplement 3E; Figure 3—figure supplement 3F; Figure 3—figure supplement 3G.: https://doi.org/10.7554/eLife.46080.025
Download elife-46080-fig3-data8-v2.xlsx
Figure 3—source data 9 Activation table for map in Figure 3—figure supplement 5: Modulated effect of the value of the chosen food.: https://doi.org/10.7554/eLife.46080.026
Download elife-46080-fig3-data9-v2.xlsx

We conducted a series of control analyses to consider possible alternative explanations for the differential hippocampal activation on value-based versus perceptual tasks. First, the hippocampal BOLD activity might be related simply to the fact that the value-based decision task makes more demands on memory because it depends on identifying objects. Indeed, a main effect of value-based versus perceptual decisions reveals differences in BOLD activity along the ventral stream and in the medial temporal lobe, including the hippocampus (Figure 3—figure supplement 2A and Figure 3—source data 3). However, if object identification were the reason for the RT effects, one would expect to find only a main effect of task—that is, an overall difference between the two tasks regardless of deliberation time—rather than a significant interaction between task and RT. The observation of both a main effect of task and an interaction with RT suggests that differences in object recognition do not account for the finding in the hippocampus. Second, we wondered whether the hippocampal BOLD activity in the value-based task could be related to the fact that for some participants there was a difference in the range of RT in the value-based task compared to the perceptual task. To test this, we repeated the analysis using only trials that shared the same range of RT on the two tasks (by participant). This analysis revealed a similar result (Figure 3—figure supplement 2B and Figure 3—source data 4), suggesting that the difference in the hippocampus is not related to differences in RT range.
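
The RT-range control can be sketched as follows. This is one plausible reading of "trials that shared the same range of RT on the two tasks (by participant)", namely keeping only trials whose RT falls inside the interval covered by both tasks; the exact procedure is an assumption, and the function names and example values are hypothetical.

```python
def shared_rt_range(rts_a, rts_b):
    """Return (low, high) bounds of the RT interval covered by both tasks."""
    low = max(min(rts_a), min(rts_b))
    high = min(max(rts_a), max(rts_b))
    if low > high:
        raise ValueError("the two tasks have no overlapping RT range")
    return low, high


def restrict_to_shared_range(rts_a, rts_b):
    """Keep only trials from each task whose RT lies within the shared range."""
    low, high = shared_rt_range(rts_a, rts_b)
    keep_a = [rt for rt in rts_a if low <= rt <= high]
    keep_b = [rt for rt in rts_b if low <= rt <= high]
    return keep_a, keep_b


# Hypothetical RTs (seconds) for one participant, not data from the study.
value_rts = [0.6, 0.9, 1.4, 2.2, 3.1]
perceptual_rts = [0.5, 0.8, 1.1, 1.9, 2.4]
value_kept, perceptual_kept = restrict_to_shared_range(value_rts, perceptual_rts)
print(value_kept)
print(perceptual_kept)
```

Applying the filter separately for each participant ensures that any remaining task difference in the RT effect cannot be attributed to one task simply spanning a wider range of RTs.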

A third possibility we considered was that the tasks differ in overall levels of difficulty. Indeed, RT is a function of the difficulty levels in each of the two tasks, but there is also variability in RT within each level of difficulty, allowing us to address questions about RT while controlling for difficulty. Therefore, we tested the possibility that difficulty accounted for more of the variance in hippocampal BOLD activity than RT by repeating the same analysis as in Figure 3 while controlling for the magnitude of color coherence and ∆Value, as well as other potential correlates of RT (e.g. mean of the pair of values; see Materials and methods). This analysis again revealed RT-related activity in the hippocampus that is greater for value-based than perceptual decisions, even after accounting for other correlates of RT, both within an anatomical ROI of bilateral hippocampus and at a whole-brain corrected level (Figure 3—figure supplement 3 and Figure 3—source data 5–8). The conjunction between the RT effect and the memory map was again found within the hippocampus ROI (Figure 3—figure supplement 3H). Finally, because our memory encoding task involved value judgments (see Materials and methods), we reran the conjunction analysis using an independent memory recognition localizer that was not specific to value-based encoding, instead using two independent meta-analysis maps from neurosynth.org based on the terms ‘autobiographical memory’ and ‘recollection’. The three-way conjunction between the differential effect of RT on BOLD and these two meta-analysis maps also shows overlap in the hippocampus (Figure 3—figure supplement 4).

Connectivity between hippocampus and parietal cortex increases with value-based decision time

The fMRI results suggest that BOLD activity in the hippocampus is related to the time it takes to make value-based decisions. We next explored the broader neural circuits that interact with the hippocampus during value-based decisions and how activity in such circuits varies with RT. We used a psychophysiological interaction (PPI) analysis to identify brain regions with activity that covaried in an RT-dependent manner with the activity of hippocampal ‘seed’ voxels, that is, those that exhibited RT-dependent activation on the value-based decision task and memory-related activation on the memory localizer task. The strongest RT-dependent correlation was between the hippocampus and the parietal cortex (superior parietal lobule and precuneus), showing that functional connectivity between the hippocampus and parietal cortex was greater for value-based decisions that took longer (Figure 4 and Figure 4—source data 1).
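
The core idea of a PPI design can be sketched in a few lines: the regressor of interest is the product of the seed time course (the physiological term) and the task variable (the psychological term, here RT). This is a deliberately simplified illustration that assumes both terms are already sampled on the same time grid. It omits hemodynamic convolution, deconvolution, and nuisance regressors, and it is not the pipeline used in the study; all names and values are hypothetical.

```python
def mean_center(values):
    """Subtract the mean so the product term is less confounded with the main effects."""
    m = sum(values) / len(values)
    return [v - m for v in values]


def ppi_regressors(seed_signal, rt_per_sample):
    """Build physiological, psychological, and interaction regressors.

    seed_signal: seed-region time course, one value per sample.
    rt_per_sample: RT of the trial associated with each sample.
    """
    if len(seed_signal) != len(rt_per_sample):
        raise ValueError("seed signal and RT series must have the same length")
    physio = mean_center(seed_signal)
    psych = mean_center(rt_per_sample)
    interaction = [p * q for p, q in zip(physio, psych)]
    return physio, psych, interaction


# Hypothetical toy series, not data from the study.
seed = [1.0, 2.0, 3.0, 4.0]
rts = [0.5, 0.5, 1.5, 1.5]
physio, psych, interaction = ppi_regressors(seed, rts)
print(interaction)
```

A target region whose activity loads on the interaction term, over and above the two main-effect terms, is one whose coupling with the seed changes with RT.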

Figure 4

Timing of value-based decisions is related to functional coupling between the hippocampus and parietal cortex.

Lateral (left) and medial (right) view of a semi-inflated surface of a template brain. PPI results were projected onto the cortical surface. There was a stronger correlation in activity between the hippocampus and the parietal cortex when value-based decisions took more time. The full map can be viewed at https://neurovault.org/collections/BOWMEEOR/images/129376. Heatmap color bars range from z-stat = 2.3 to 3.2. The map was cluster corrected for familywise error rate at a whole-brain level with an uncorrected cluster-forming threshold of z = 2.3 and corrected extent of p<0.05.

https://doi.org/10.7554/eLife.46080.027

Figure 4—source data 1 Activation table for map in Figure 4; PPI for value-based decision trials with hippocampus seed modulated by RT.: https://doi.org/10.7554/eLife.46080.028
Download elife-46080-fig4-data1-v2.xlsx

Experiment 2: behavior in amnesic patients

The fMRI data reveal that the timing of value-based decisions is related to BOLD activity in the hippocampus, suggesting a possible role for the hippocampus in the deliberation process. However, fMRI can only tell us about brain activity correlated with a mental process, leaving open the critical question of whether the hippocampus plays a direct, causal role in value-based decisions. Experiment 2 was designed to address this question by testing value-based decision making in patients with amnesia subsequent to damage to the hippocampus and nearby MTL structures.

Our overarching hypothesis is that the hippocampus contributes to value-based decisions by supporting the comparison of options, the simulation of outcomes, and the recollection of internal evidence. We therefore expected that damage to the hippocampus would impair this deliberation process. As noted earlier, we had no strong prediction regarding whether patients would show faster or slower RTs in general. We reasoned that slower RTs might reflect efforts to search for evidence to resolve decisions, whereas faster RTs might reflect choices that lack deliberative reasoning altogether. Patients with hippocampal damage are not known to have general impairments in valuation processes and the experiment only included food items that each patient fully recognized (see Materials and methods). Therefore, we expected that patients would make choices largely consistent with their subjective valuations. Finally, for the perceptual task, we expected the patients to show intact performance, consistent with the notion that the hippocampus is not needed to make decisions based on external evidence.

Timing of value-based decisions is impaired in amnesic patients

We tested six amnesic patients with damage to the hippocampus and surrounding MTL on the decision tasks from Experiment 1, slightly modified to accommodate the patient population (see Materials and methods). The patients have well-characterized memory impairments combined with intact verbal reasoning and IQ (see Table 1), and have participated in several prior studies (Foerde et al., 2013; Grilli and Verfaellie, 2016; Palombo et al., 2019; Palombo et al., 2015b). We compared the patients to fourteen age-, education-, and verbal IQ-matched healthy participants.

Table 1

Amnesic patient demographic and neuropsychological data.

https://doi.org/10.7554/eLife.46080.029

Patient #	Diagnosis	Gender	Age	Edu	WAIS-III VIQ	WAIS-III WMI	WMS-III GM	WMS-III VD	WMS-III AD	BNT	FAS	L-N sequence	Years since onset
P01	Hypoxic-ischemic	F	67	12	88	75	52	56	55	−1.3	−1.1	−2	27.29
P02	Status epilepticus + left temp. lobectomy	M	54	16	93	94	49	53	52	−4.6	−0.96	−1	29.17
P03	Hypoxic-ischemic	M	61	14	106	115	59	72	52	0.54	−0.78	1.33	24.18
P04	Hypoxic-ischemic	M	65	17	131	126	86	78	86	1.3	0.03	1.33	15.00
P05	Encephalitis	M	75	13	99	104	49	56	58	−0.11	−0.5	0.33	5.85
P06	Stroke	M	53	20	111	99	60	65	58	1.02	2.1	−0.33	3.45

Age in years at first session; Edu, education in years; WAIS-III, Wechsler Adult Intelligence Scale-III (Wechsler, 1997a); WMS-III, Wechsler Memory Scale-III (Wechsler, 1997b); VIQ, verbal IQ; WMI, working memory index; GM, general memory; VD, visual delayed; AD, auditory delayed; scores are age-adjusted such that a score of 100 is the age-adjusted mean with a standard deviation of 15; BNT, Boston Naming Test; FAS, verbal fluency test; L-N, Letter-Number Sequence. BNT, FAS and L-N scores were z-scored against normative data for each test.

On the perceptual decision task, both patients and healthy participants made more accurate decisions when the color was more strongly biased toward blue or yellow (Figure 5A, top). The RTs of both the patients and healthy participants were longer for decisions between options that were more difficult to discriminate (i.e. color coherence near zero, Figure 5A, bottom). Patients took about the same amount of time as healthy controls to make a perceptual decision and there were no significant differences between the groups on accuracy (i.e. slopes of the choice function in Figure 5A, p=0.28) or RT (interaction between |color coherence| and group on RT, p=0.18; and main effect of group on RT, p=0.41). Further, for both groups, choices and RTs were well-described by a drift diffusion model (Figure 5A, solid lines), suggesting that damage to the hippocampus did not impair the patients’ ability to make decisions that require sequential sampling of external evidence.

Figure 5 with 4 supplements

Amnesic patients exhibited more stochastic choices and longer reaction times on value-based decisions but not perceptual decisions.

(A) Proportion of blue choices (top) and mean RT (bottom) plotted as a function of signed color coherence, the logarithm of the odds that a dot is plotted as blue. Data from 14 healthy controls and six amnesic patients (2922 and 1246 trials, respectively). (B) Proportion of right-item preference (top) and mean RT (bottom) plotted as a function of value difference (right minus left) binned into 11 levels. Data from 14 healthy controls and six amnesic patients (2893 and 1118 trials, respectively). To further summarize these findings, we plot individual average speed-adjusted accuracy, calculated as average accuracy divided by average RT per participant during (C) perceptual decisions and (D) value-based decisions (here, accuracy is defined as choices that are consistent with the individuals’ initial value ratings). Circle symbols are data from amnesic patients (red) and healthy age-matched controls (black). Square symbols are group averages. Error bars are s.e.m. Curves are fits of a bounded drift diffusion model (see Materials and methods). See Figure 5—figure supplement 4 for fits to data from individual participants, Figure 5—source data 1 for model parameters fit to data from individual participants, and Figure 5—figure supplement 2 for consideration of an alternative model.

https://doi.org/10.7554/eLife.46080.030

Figure 5—source code 1 Jupyter notebook with analysis code and output for analyses performed on data from Experiment 2.: https://doi.org/10.7554/eLife.46080.035
Download elife-46080-fig5-code1-v2.ipynb
Figure 5—source data 1 Parameter estimates and goodness of fit measures for Experiment 2. κ is the drift rate. B₀ is the initial height of the bound. B_del is the delay before the bound starts decreasing. B₂ is the coefficient of the exponential term that governs the bound decrease. t_nd is the non-decision time. σ_tnd is the standard deviation of the non-decision time. μ₀ is the bias in drift rate. Plaw is the coefficient of the power law applied to the stimulus strength (color coherence for perceptual and ∆Value for value-based). NLL is negative log-likelihood of the parameters given the choice and RT data. R² choices is the McFadden pseudo-R² for choice data given color coherence (for perceptual) or ∆Value (for value-based). R² RT is the R² for RT data given color coherence (for perceptual) or ∆Value (for value-based).: https://doi.org/10.7554/eLife.46080.036
Download elife-46080-fig5-data1-v2.xlsx
Figure 5—source data 2 Trial-level data for the perceptual task in Experiment 2. The file contains eight columns: subject ID, group (healthy or amnesia), signed color coherence, $p_{blue}$, response (‘z’=left, ‘m’), reaction time, button order (one means blue requires a left response, two means blue requires a right response), and whether the participant chose blue.: https://doi.org/10.7554/eLife.46080.037
Download elife-46080-fig5-data2-v2.csv
Figure 5—source data 3. Trial-level data for the value-based task in Experiment 2. The file contains 12 columns: subject ID, group (healthy or amnesia), the name of the image that appeared on the left side of the screen, the name of the image that appeared on the right, the participant’s response (‘z’=left, ‘m’=right), reaction time, the value rating of the item on the left, the value rating of the item on the right, whether the participant chose the item on the right side of the screen, the z-scored value rating of the item on the left, the z-scored value rating of the item on the right, and ΔValue. https://doi.org/10.7554/eLife.46080.038
Download elife-46080-fig5-data3-v2.csv
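
As an illustration of how trial-level data of the kind described above might be read, the following Python sketch parses a few made-up rows and derives ΔValue (right minus left) and choice consistency for each trial. The column headers, the sample rows, and the use of z-scored ratings to compute ΔValue are assumptions made for illustration; the headers in the actual CSV file may differ, and ‘m’ is assumed here to indicate a right-hand response.

```python
import csv
import io

# Hypothetical header names and rows; the real file's headers may differ.
SAMPLE = """subject,group,response,rt,z_value_left,z_value_right
c01,healthy,m,1.21,-0.50,1.10
c01,healthy,z,1.85,0.30,0.10
a01,amnesia,m,2.40,0.80,-0.20
"""

def load_trials(text):
    """Parse trial rows and derive delta value and choice consistency."""
    trials = []
    for row in csv.DictReader(io.StringIO(text)):
        # Delta value is defined as right minus left.
        delta = float(row["z_value_right"]) - float(row["z_value_left"])
        # Assumption: 'm' is the right-hand key and 'z' the left-hand key.
        chose_right = row["response"] == "m"
        trials.append({
            "subject": row["subject"],
            "group": row["group"],
            "rt": float(row["rt"]),
            "delta_value": delta,
            "chose_right": chose_right,
            # Consistent = chose the item with the higher initial rating.
            "consistent": chose_right == (delta > 0),
        })
    return trials

trials = load_trials(SAMPLE)
for trial in trials:
    print(trial)
```

Trials with a ΔValue of exactly zero have no consistent answer; this sketch does not treat them specially, and a real analysis would need to decide how to handle them.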

In contrast, on the value-based decision task the amnesic patients’ performance diverged from that of healthy controls. Although the amnesic patients’ choices were clearly governed by ∆Value (red sigmoid function, Figure 5B top, simple effect of ΔValue on choices among amnesics, p<0.0001), their choices were more stochastic than those of the controls (flatter red sigmoid function, Figure 5B top, p=0.0008). This observation implies that the amnesic patients were not randomly guessing or forgetting the subjective value of the items but were less sensitive to their difference. Notably, the patients did not show any obvious differences in their use of the value rating scale nor in the resulting range of ΔValues (Figure 5—figure supplement 1). This implies that the flatter choice function is not explained by a difference in the use of the value rating scale but that the ∆Value derived from that scale had less purchase on their choices.
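
The distinction between a flatter choice function and random guessing can be made concrete with a logistic function of ΔValue. The sketch below is illustrative only: the logistic form and both slope values are assumptions chosen to show the shapes, not parameters fitted to the data.

```python
import math

def p_choose_right(delta_value, slope, bias=0.0):
    """Logistic choice function: probability of choosing the right-hand item."""
    return 1.0 / (1.0 + math.exp(-(bias + slope * delta_value)))

# Hypothetical slopes, chosen only to illustrate the shapes (not fitted values).
STEEP_SLOPE = 2.0  # choices tightly governed by the value difference
FLAT_SLOPE = 0.8   # still governed by the value difference, but noisier

for dv in (-2.0, -1.0, 0.0, 1.0, 2.0):
    steep = p_choose_right(dv, STEEP_SLOPE)
    flat = p_choose_right(dv, FLAT_SLOPE)
    print(f"dValue={dv:+.1f}  steep={steep:.3f}  flat={flat:.3f}")
```

With a slope of zero the function would sit at 0.5 for every ΔValue, which is what pure guessing would look like. A flatter but nonzero slope still tracks ΔValue, only less reliably at any given difference.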

The more striking difference between the two groups was observed on RT during value-based decisions: the amnesic patients were substantially slower than healthy controls (Figure 5B bottom, p=0.0004). These slower RTs were specific to the value-based compared to the perceptual decision task (p=0.002 for the interaction between task type and group on RT). In addition, their RTs were less driven by subjective value ratings (flatter red curve in Figure 5B bottom). This difference between amnesic patients and healthy controls was statistically reliable (p=0.015, interaction between |ΔValue| and group on RT in a mixed effects linear regression, see Materials and methods). In principle, slower decisions could be a sign of a speed-accuracy tradeoff favoring accuracy, but that does not appear to be the case, as the patients were both slower and less accurate (i.e. less consistent with initial subjective values) than the controls. To clarify this point, we calculated an index of efficiency (I_E) for each participant (average accuracy divided by the average RT). The index captures the extent to which additional time was used to resolve sources of uncertainty that contribute to stochastic choice behavior. For perceptual decisions, I_E did not differ between amnesic patients and healthy controls (Figure 5C, t_17.21 = 0.02, p=0.98, Welch’s t-test), presumably because the uncertainty originates in the stimulus and its noisy representation by sensory neurons (Britten et al., 1993; Mante et al., 2013; Shadlen and Newsome, 1998). For value-based decisions, I_E was significantly lower in the amnesic patients compared to controls (Figure 5D, t_11.84 = 4.2, p=0.0007, Welch’s t-test). This implies that whatever deliberative process the amnesics engaged in to reach their decisions, it was less efficient than the process used by the controls.
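
The efficiency index and the group comparison described above can be sketched in a few lines of Python. The per-participant values below are invented for illustration and are not the study's data. The function names are hypothetical, and the sketch computes only the Welch t statistic and its Welch–Satterthwaite degrees of freedom, which is why the degrees of freedom reported above are fractional.

```python
import math
from statistics import mean, variance

def efficiency_index(correct, rts):
    """I_E for one participant: average accuracy divided by average RT."""
    return mean(correct) / mean(rts)

def welch_t(a, b):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom."""
    va = variance(a) / len(a)  # squared standard error of group a
    vb = variance(b) / len(b)  # squared standard error of group b
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df

# One hypothetical participant: 3 of 4 choices consistent, mean RT of 1.5 s.
example_ie = efficiency_index([1, 1, 0, 1], [1.0, 2.0, 1.5, 1.5])

# Hypothetical per-participant I_E values (not taken from the study).
controls_ie = [0.60, 0.55, 0.65, 0.58]
patients_ie = [0.30, 0.35, 0.28]

t_stat, dof = welch_t(controls_ie, patients_ie)
print(f"I_E example = {example_ie:.2f}, t = {t_stat:.2f}, df = {dof:.2f}")
```

Because Welch's test does not assume equal variances, its degrees of freedom fall between the smaller group's n − 1 and the pooled n₁ + n₂ − 2, depending on how unequal the group variances and sizes are.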

To further characterize differences in the deliberative process between the groups, we evaluated an alternative to the drift-diffusion model. In this ‘heuristic model’, the decision maker makes (1) fast choices for items they like strongly, (2) fast choices for an item paired with one they dislike strongly, and (3) slow stochastic choices when the preference is not resolved by rules 1 and 2 (see Materials and methods and Figure 5—figure supplement 2). The model is representative of a class of alternatives that would account for RT and choice based on distinct rules—that is, a break from sequential sampling with optional stopping. While we found no support for this model in healthy controls (DDM performs better than this heuristic model, BIC = 537.5), at least one feature of the RTs from the amnesic patients is consistent with this model (Figure 5—figure supplement 2). This observation does not provide definitive support for the heuristic above, but it does suggest that the measurable differences between amnesics and controls in accuracy and RT may be related to a fundamental difference in how the amnesics resolve value-based preferences.
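
The three rules of the heuristic model can be sketched as follows. This is a minimal illustration under stated assumptions, not the model specified in Materials and methods: the like and dislike thresholds, the fixed fast and slow RTs, and the coin flip used for unresolved trials are all hypothetical.

```python
import random

def heuristic_trial(v_left, v_right, rng,
                    like=1.0, dislike=-1.0, fast_rt=0.8, slow_rt=2.0):
    """Return (choice, rt) for one trial under the three-rule heuristic.

    Thresholds and RTs are hypothetical placeholders.
    """
    # Rule 1: exactly one item is strongly liked -> choose it quickly.
    if (v_left >= like) != (v_right >= like):
        return ("left" if v_left >= like else "right"), fast_rt
    # Rule 2: exactly one item is strongly disliked -> choose the other quickly.
    if (v_left <= dislike) != (v_right <= dislike):
        return ("right" if v_left <= dislike else "left"), fast_rt
    # Rule 3: preference unresolved -> slow, stochastic choice
    # (assumed here to be an unbiased coin flip).
    return rng.choice(["left", "right"]), slow_rt

rng = random.Random(1)
print(heuristic_trial(1.5, 0.0, rng))   # rule 1 applies
print(heuristic_trial(-1.5, 0.2, rng))  # rule 2 applies
print(heuristic_trial(0.1, 0.2, rng))   # rule 3 applies
```

Unlike sequential sampling with optional stopping, RT in this scheme depends only on which rule fires, so it takes just two values rather than varying smoothly with |ΔValue|.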

Discussion

We found converging evidence from fMRI and patients pointing to a role for the hippocampus in deliberation between choice options in value-based decisions. In healthy participants, the time it took to resolve choices between two options was longer for near-value decisions and was correlated with BOLD activity in the hippocampus. Amnesic patients with damage to the hippocampus were just as fast as healthy controls to make perceptual decisions but took almost twice as much time to make value-based decisions. The additional time did not lead to better accuracy; in fact, the patients’ choices were less accurate (i.e. more stochastic, relative to the values they initially assigned to the items). Together, these findings link the timing of value-based decisions about highly familiar options to the hippocampus.

Value-based decisions between highly familiar choice options are typically assumed to rely on subjective value (Levy and Glimcher, 2012; Rangel et al., 2008; Tversky and Kahneman, 1986). Such value signals are thought to be supported by the ventromedial prefrontal cortex (vmPFC, Camille et al., 2011; Fellows, 2016; Fellows and Farah, 2007; Levy and Glimcher, 2011; Padoa-Schioppa and Assad, 2006). Yet even when the options differ greatly in subjective value, choosing between them involves a comparison that takes into account both options, their relation, and their predicted value (Houston et al., 1999; Tversky, 1972; Voigt et al., 2017). Resolving the choice between two options with similar value likely requires the generation of additional information (that is, evidence) to resolve the indecision. This evidence must come from internal sources and might involve multiple dimensions of comparison between the options. In that sense, it may seem obvious that deliberating between even highly familiar options is likely to involve the sort of relational mechanisms that the hippocampus is known to support.

Our findings suggest that the role of the hippocampus in value-based decisions is almost certainly more nuanced than memory retrieval of the value associated with each of the items. Prior work suggests that simple object-value associations do not depend on the hippocampus (Neubert et al., 2015; Reynolds et al., 2001; Rudebeck et al., 2008; Rushworth et al., 2011; Schultz et al., 1997; Vo et al., 2014). Moreover, it is not obvious why a simple associative memory process would account for longer deliberation times. Instead, we propose that the hippocampus contributes to deliberative processes during decision making. Specifically, we propose that deliberation may be served by the construction of value from internal evidence and engagement in the comparison between the options. Such a process is likely to also involve evaluation of alternatives and prospection about future hypothetical experiences. Prior work suggests that all these processes are likely to engage the hippocampus (Barron et al., 2013; Eichenbaum and Cohen, 2001; Schacter et al., 2007). Future work will be necessary to evaluate how these different processes interact and whether their unique contributions may differ under different circumstances.

Our findings extend recent results demonstrating a role for the hippocampus in value-based decisions under conditions in which value information has been experimentally manipulated to depend on retrieval of new associative memories (Barron et al., 2013; Gluth et al., 2015; Wimmer and Shohamy, 2012). Recent work has also characterized sampling processes during value-based decisions that are reliant on memory (Bornstein and Norman, 2017; Bornstein et al., 2017; Duncan and Shohamy, 2016). Our study builds on these findings to implicate the hippocampus functionally and establish a causal role for the hippocampus in decisions about familiar options for which value is known. One open question is whether this role varies as a function of the nature of the items under deliberation. For example, natural versus packaged items may vary in the extent to which perceptual features reveal their value; the color of an apple is revealing of its sweetness, the color of a package of chocolate perhaps less so. But ultimately, all such decisions depend on the transformation of external perceptual input to internal estimates of subjective value bearing on the relative desirability of the items. It is this deliberative process—beyond the simple item-value association—that we posit the hippocampus contributes to.

The pattern of behavior among the amnesic patients provides further insight into how and when the hippocampus is necessary for value-based decision making. We found that amnesic patients were somewhat less consistent in their decisions and that they took much longer to make them. A similar pattern has been shown recently in healthy older adults with mild memory deficits (Levin et al., 2018). As noted earlier, it is unlikely that amnesic patients simply cannot remember the value of the items, as their choices are not arbitrary. This suggests that the patients may be relying on degraded value signals that are coarser than those in controls. Studies of simple valuation have described general valence signals in neurons in orbitofrontal cortex, striatum, amygdala, and anterior cingulate cortex that could potentially drive these choices (Figure 3—figure supplement 5; Hayden et al., 2009; Hikosaka et al., 2014; Padoa-Schioppa and Assad, 2006; Platt and Plassmann, 2014; Saez et al., 2017). Interestingly, patients with vmPFC damage also show greater stochasticity in their choices (Camille et al., 2011; Fellows and Farah, 2007; Pelletier and Fellows, 2019), but do not display the slowing in RT during deliberation that we see in the patients with amnesia due to hippocampal damage. This finding and others (Jones and Wilson, 2005; Wikenheiser et al., 2017; Wimmer and Büchel, 2016) point to possible complementary roles for the hippocampus and the vmPFC in guiding value-based decisions (see also McCormick et al., 2018), with the hippocampus possibly supporting evidence-based construction of value and deliberation (Weilbächer and Gluth, 2016).

If patients resolve their choices by accessing a simpler form of value representation, then why do they take such a long time to reach decisions? We propose that this reflects the patients’ attempt to engage hippocampal relational mechanisms and their failure to do so. This conclusion is based on a detailed consideration of the relationship between time, accuracy, and choices. In particular, it may help to elaborate on an important difference between the decision processes at play in the value and perceptual decisions we studied. For both tasks, choice and RT were reconciled by fits to drift diffusion models, indicating that both perceptual and value-based decisions exhibit a systematic relationship in speed and accuracy as a function of difficulty. In the perceptual task, a sequence of samples of blue and yellow dots can be converted by the visual system to samples of evidence by spatially integrating blue or yellow (or the difference) across the stimulus aperture in sampling epochs governed by the temporal resolution of the color system, which is slower than the frame rate of the display. These samples arrive in series until the subject terminates the decision. The samples are independent, identically distributed random values drawn from a distribution with an expectation (i.e. mean) determined by the stimulus strength and a variance governed by the stochastic properties of the stimulus and the neurons that represent blue, yellow or blue minus yellow. The accumulation of these noisy samples is analogous to a deterministic drift plus diffusion.
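
The accumulation of independent, identically distributed noisy samples to a bound, as described above, can be simulated directly. The following Python sketch is a simplified illustration: it uses flat symmetric bounds, omits non-decision time, and all parameter values (drift rates, bound height, time step) are assumptions rather than fitted values. The model fit in this study additionally allows the bound to decrease over time.

```python
import random

def simulate_ddm(drift, bound=1.0, noise=1.0, dt=0.005, max_t=10.0, rng=None):
    """Simulate one decision: accumulate noisy evidence until a bound is hit.

    Returns (chose_upper, decision_time). Parameters are illustrative.
    """
    rng = rng or random.Random()
    x, t = 0.0, 0.0
    step_sd = noise * dt ** 0.5  # diffusion noise scales with sqrt(dt)
    while abs(x) < bound and t < max_t:
        # Each step adds a deterministic drift plus an independent noise sample.
        x += drift * dt + rng.gauss(0.0, step_sd)
        t += dt
    return x > 0, t

def summarize(drift, n_trials=300, seed=0):
    """Proportion of upper-bound (correct) choices and mean decision time."""
    rng = random.Random(seed)
    results = [simulate_ddm(drift, rng=rng) for _ in range(n_trials)]
    accuracy = sum(choice for choice, _ in results) / n_trials
    mean_rt = sum(t for _, t in results) / n_trials
    return {"accuracy": accuracy, "mean_rt": mean_rt}

hard = summarize(drift=0.2)  # weak evidence, e.g. low color coherence
easy = summarize(drift=3.0)  # strong evidence
print("hard:", hard)
print("easy:", easy)
```

With these settings the weak-evidence condition produces both slower and less accurate decisions than the strong-evidence condition, which is the systematic relationship between speed, accuracy, and difficulty that the model fits capture.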

As mentioned earlier, similar logic has been applied to value-based decisions (Krajbich and Rangel, 2011; Milosavljevic et al., 2010; Polanía et al., 2015), but the analogy breaks down at the nature of the evidence samples. One might posit that neurons that represent value provide the samples of evidence (Rangel et al., 2008; Rangel and Clithero, 2014; Sokol-Hessner et al., 2012). However, the stimulus provides only one sample of the objects, and there is no reason to think that the brain would then generate a sequence of independent samples of ∆Value (Shadlen and Shohamy, 2016). Instead, we reason that the comparison itself triggers constructive thought processes to provide samples of evidence that bear on evaluation of the items along a dimension. It is hard to imagine integrating these samples of ∆Value along different dimensions, although it is possible if they were converted to some common currency (e.g. Kira et al., 2015). It seems at least equally likely that each sample leads to a new internal estimate of preference, only to terminate if such a sample provides a sufficiently compelling preference. Although such a process involves no integration, the drift diffusion model can be fit to such a process well enough to render these alternatives indistinguishable (Ditterich, 2006). On this view, the longer RTs in the amnesic patients stem from their continued effort to generate evidence to resolve the comparison. Accordingly, the greater stochasticity in their choices possibly stems from the fact that they may fail to generate such evidence and ultimately fall back on a more rudimentary and noisier form of value representation to guide their choices. We are not committed to this specific interpretation and consider a simple heuristic strategy that accounts for some aspects of the data (see Figure 5—figure supplement 2).
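
The non-integrating alternative described above, in which each sample yields a fresh estimate of preference and the process stops as soon as one sample is sufficiently compelling, can be sketched as follows. This is one possible reading of that idea, not the authors' implementation: the Gaussian form of the samples, the criterion, and the time per sample are all assumptions.

```python
import random

def simulate_no_integration(mean_evidence, criterion=2.0, noise=1.0,
                            sample_dt=0.1, max_samples=1000, rng=None):
    """Draw independent samples; stop when a single sample exceeds the criterion.

    Nothing is accumulated across samples. Returns (chose_positive, time).
    """
    rng = rng or random.Random()
    for n in range(1, max_samples + 1):
        sample = rng.gauss(mean_evidence, noise)
        if abs(sample) >= criterion:
            return sample > 0, n * sample_dt
    # No compelling sample was ever generated: fall back on a guess.
    return rng.random() < 0.5, max_samples * sample_dt

def summarize(mean_evidence, n_trials=500, seed=0):
    """Proportion of positive (correct) choices and mean time to decide."""
    rng = random.Random(seed)
    results = [simulate_no_integration(mean_evidence, rng=rng)
               for _ in range(n_trials)]
    accuracy = sum(choice for choice, _ in results) / n_trials
    mean_rt = sum(t for _, t in results) / n_trials
    return {"accuracy": accuracy, "mean_rt": mean_rt}

hard = summarize(mean_evidence=0.1)  # options close in value
easy = summarize(mean_evidence=1.0)  # options far apart in value
print("hard:", hard)
print("easy:", easy)
```

Even without integration, harder comparisons take longer and end in less consistent choices, because a compelling sample is rarer when the options are close in value. This qualitative similarity to bounded accumulation is why choice and RT data alone may not separate the two accounts.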

One limitation of the present study is that we are unable to identify the specific hippocampal-based process that guides deliberation. We can only observe the manifestation of the process in RT and its associated changes in hippocampal BOLD activity or the effect of hippocampal damage. In future work, it will be useful to guide the dimensions of inquiry (e.g. saltiness) and/or construct memories associated with these dimensions that have discernible effects on BOLD activity. In this study, we deliberately avoided any possibility of biasing participants to adopt a memory-based strategy to resolve value preference, as we were interested in testing whether memory spontaneously contributed to such decisions without instruction or guidance.

The idea that memory supports construction of evidence to guide value-based decisions offers new insight into how decisions are made, as well as into the role of the hippocampus in guiding behavior. The finding that the hippocampus supports deliberation between choice options with similar subjective value addresses a challenge that has long puzzled economists and philosophers (often referred to as Buridan’s paradox; Chislenko, 2016; Sorensen, 2004). By linking the hippocampus to choice behavior, this finding also highlights the pervasive and broad role of the hippocampus in guiding actions and decisions. Research on the hippocampus has typically focused on its role in supporting the formation of conscious, declarative memories for episodes of one’s life. The current findings add to a growing shift in this point of view, suggesting that the hippocampus may serve a more general purpose in guiding behavior by providing behaviorally relevant input about relational associations to implicitly guide actions and decisions (Chun and Phelps, 1999; Eichenbaum and Cohen, 2001; Hannula et al., 2007; Olsen et al., 2016; Palombo et al., 2015a; Ryan et al., 2000; Schapiro et al., 2014; Shohamy and Turk-Browne, 2013; Wimmer and Shohamy, 2012).

Author details

Akram Bakkour
Daniela J Palombo
Ariel Zylberberg
Yul HR Kang
Allison Reid
Mieke Verfaellie
Michael N Shadlen
Daphna Shohamy