The hippocampus encodes delay and value information during delay-discounting decision making

Abstract

The hippocampus, a region critical for memory and spatial navigation, has been implicated in delay discounting, the decline in subjective reward value when a delay is imposed. However, how delay information is encoded in the hippocampus is poorly understood. Here, we recorded from CA1 of mice performing a delay-discounting decision-making task, where delay lengths, delay positions, and reward amounts were changed across sessions, and identified subpopulations of CA1 neurons that increased or decreased their firing rate during long delays. The activity of both delay-active and -suppressed cells reflected delay length, delay position, and reward amount; but manipulating reward amount differentially impacted the two populations, suggesting distinct roles in the valuation process. Further, genetic deletion of the N-methyl-D-aspartate (NMDA) receptor in hippocampal pyramidal cells impaired delay-discount behavior and diminished delay-dependent activity in CA1. Our results suggest that distinct subclasses of hippocampal neurons concertedly support delay-discounting decisions in a manner that is dependent on NMDA receptor function.

Introduction

Animals faced with multiple options optimize their decisions through a complex cost-benefit valuation. The introduction of a time delay decreases preference for the delayed option (delay discounting) (Ainslie, 1992; Ainslie, 1975), with the discount rate varying on an individual basis. People who are considered patient exhibit lower discount rates, whereas impatient (or impulsive) people exhibit higher discount rates. Further, higher discount rates have been shown to be related to various neuropsychological disorders (Bickel and Marsch, 2001; Chesson et al., 2006; Epstein et al., 2008; Luman et al., 2010; Odum et al., 2000; Weller et al., 2008). Although lesion studies have revealed a critical role for the hippocampus in delay discounting (Figner et al., 2010; Kalivas and Volkow, 2005; Peters and Büchel, 2011; Cheung and Cardinal, 2005; Mariano et al., 2009; McHugh et al., 2008), how this is reflected in hippocampal activity remains poorly understood.

Decades of study point to a critical role for the hippocampus in episodic memory (Scoville and Milner, 1957; Squire, 1992) and spatial navigation (Burgess et al., 2002; Ekstrom et al., 2003). Although much of the rodent hippocampal physiology literature has focused on the spatial code present in hippocampal place cell activity (Jung and McNaughton, 1993; O'Keefe and Dostrovsky, 1971; Wilson and McNaughton, 1993), subsequent work has demonstrated that the circuit is capable of encoding a variety of spatiotemporal features beyond the animal’s current position, including past and future trajectories (Ambrose et al., 2016; Foster and Wilson, 2006; Johnson et al., 2007; Johnson and Redish, 2007; Pfeiffer and Foster, 2013; Skaggs and McNaughton, 1996), the location of other animals or objects (Danjo et al., 2018; Omer et al., 2018), internal time (Kraus et al., 2013; MacDonald et al., 2011; Manns et al., 2007; Pastalkova et al., 2008), and various physical scales (Aronov et al., 2017; Terada et al., 2017).

When trying to understand the links between behavior and physiological data, several factors must be considered, including the variable(s) correlated with the activity, the regions or cell assemblies engaged and the mechanisms of representation on both the single-cell and population levels. To this end, studies combining imaging and recording with optogenetic manipulation and identification suggest that subsets of CA1 neurons can encode distinct features of a task (Cembrowski et al., 2016; Danielson et al., 2016). Consistent with these data, Gauthier and Tank (2018) recently identified a unique population of neurons that are active at reward sites, serving as ‘reward cells’. Although the modulation of reward-based activity has been well-investigated in relation to spatial context (Hölscher et al., 2003; Lee et al., 2012; Murty and Adcock, 2014; Ólafsdóttir et al., 2015; Peters and Büchel, 2010; Singer and Frank, 2009) or probability discounting (Tryon et al., 2017), the impact of delay (length/location) and reward amount, which both alone and together constitute the core computation for delay-based decision making, has not been examined. Further, since delay and reward both modulate value, the most important parameter for decision making, neurons encoding value information should respond in similar ways to changes in delay length increment/decrement and reward amount loss/gain, as can be seen in dopaminergic neurons in the ventral tegmental area (VTA) (Roesch et al., 2007).

There are at least two dissociable schemes of hippocampal coding for delay length. One operates at the population level: time cells (MacDonald et al., 2011; Pastalkova et al., 2008), a series of neurons whose distinct temporal receptive fields tile the delay period, form sequences that correspond to different delay lengths. The other is rate coding, in which individual neurons change their firing rate according to the delay length.
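The contrast between the two schemes can be illustrated with a toy model. This is only a sketch under stated assumptions: the Gaussian field shape, the linear rate function, and every parameter value (`width_s`, `peak_hz`, `baseline_hz`, `gain_hz_per_s`, the field centers) are hypothetical and are not taken from the recordings.

```python
import math


def time_cell_rate(t_s, center_s, width_s=1.5, peak_hz=10.0):
    """Hypothetical Gaussian temporal receptive field peaking `center_s` into the delay."""
    return peak_hz * math.exp(-0.5 * ((t_s - center_s) / width_s) ** 2)


def rate_coded_cell(delay_length_s, baseline_hz=2.0, gain_hz_per_s=0.2):
    """Hypothetical rate code: firing rate scales linearly with delay length."""
    return baseline_hz + gain_hz_per_s * delay_length_s


# Population scheme: fields centered every 2 s tile a 10 s delay (assumed layout).
centers_s = [1.0, 3.0, 5.0, 7.0, 9.0]


def most_active_cell(t_s):
    """Index of the time cell with the highest rate at time t_s (crude read-out of elapsed time)."""
    rates = [time_cell_rate(t_s, c) for c in centers_s]
    return rates.index(max(rates))


# The identity of the most active cell advances as the delay unfolds...
sequence = [most_active_cell(t) for t in (0.5, 2.5, 5.0, 7.4, 9.9)]
# ...whereas a rate-coding cell signals delay length through its firing rate alone.
rates_by_delay = {d: rate_coded_cell(d) for d in (5, 10, 20, 40)}
```

In the population scheme, elapsed time is read from which cell is active; in the rate scheme, delay length is read from how strongly a single cell fires.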

Here, we designed experiments to identify and characterize hippocampal neurons that are engaged during the delay of a delay-discounting task and to probe their sensitivity to changes in delay length, delay position, and reward amount. We recorded single-unit activity in CA1 of mice performing a delay-discounting version of the T-maze task (Zhang et al., 2018), and assessed changes in neural activity related to delay length, delay position, and reward size. We first examined the two schemes for encoding delay length, population coding and rate coding, and found that both schemes were employed by a significant fraction of CA1 neurons, including two populations that demonstrated increased or decreased delay-period activity. The activity of these distinct populations reflected delay length, delay position, and/or reward size; however, manipulation of reward size produced opposite responses in the two populations. Finally, we were able to identify a specific population of neurons that fit the criteria of value coding. These results suggest that distinct subpopulations of neurons in the hippocampus can make unique contributions to the valuation processes that are required for delay-based decisions.

Results

Behavioral profiling

We conducted a delay-based decision-making task in mice using the T-maze, in which mice chose between the right and left goal arms, one offering a small reward without delay and the other a large reward with delay (Figure 1A). In total, we employed five behavioral conditions, with each session consisting of about 10 trials within 30 min, with a 20 s inter-trial interval at the start zone.

Figure 1

Task design of the delay-based decision making in the T-maze.

(A) Schematic diagram for the experimental setup. Mice can choose the right or left arms assigned to obtain the small reward without delay or the large reward with delay, respectively. (B) Flow of the extension conditions. The delay lengths were extended sequentially. Red circles indicate the number of sugar pellets. (C) Percentages of large-reward choices as a function of delay length. Error bars indicate the standard error of the mean (SEM).

Figure 1—source data 1 Source Data File for Figure 1C.: https://cdn.elifesciences.org/articles/52466/elife-52466-fig1-data1-v2.xlsx

To examine the impact of delay on decision making, we changed the delay length sequentially. Once mice showed a preference for the large reward arm (>80%), we increased the length of delay in a stepwise fashion (e.g., 0, 5, 10, 20, and 40 s; Figure 1B; Figure 1—figure supplement 1). With the inclusion of a delay, preference for the large reward arm decreased as a function of delay length (Figure 1C). The whole schedule of experiments is shown in Figure 1—figure supplement 2.
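The decline in preference with delay can be made concrete with a standard hyperbolic discounting form, in which subjective value is amount / (1 + k × delay). The sketch below is illustrative only: the text does not commit to a particular discounting model, and the reward amounts (4 vs 1 pellets) and the discount rate `K` are hypothetical values chosen for the example.

```python
def discounted_value(amount: float, delay_s: float, k: float) -> float:
    """Hyperbolic discounting (an assumed model): value = amount / (1 + k * delay)."""
    return amount / (1.0 + k * delay_s)


def prefers_large_delayed(large: float, small: float, delay_s: float, k: float) -> bool:
    """True if the delayed large reward is still worth more than the immediate small one."""
    return discounted_value(large, delay_s, k) > discounted_value(small, 0.0, k)


# Hypothetical values: 4 vs 1 pellets and an assumed discount rate k (per second).
LARGE, SMALL, K = 4.0, 1.0, 0.2
DELAYS_S = [0, 5, 10, 20, 40]  # delay steps of the extension conditions

# Predicted choice at each delay step for this hypothetical animal.
choices = {d: prefers_large_delayed(LARGE, SMALL, d, K) for d in DELAYS_S}

# Delay at which both options have equal subjective value under this model.
indifference_delay_s = (LARGE / SMALL - 1.0) / K
```

Under these assumptions a larger `K` (a more impulsive animal) moves the indifference point to shorter delays, which is one way to read individual differences in the curve of Figure 1C.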

Delay-dependent neuronal activity in CA1

We recorded extracellular single units and local-field potentials (LFPs) from the CA1 region in a total of 28 mice (Figure 2—figure supplement 1) during delay-discount behavior, and classified cells as putative excitatory neurons or inhibitory neurons on the basis of the characteristics of their extracellular waveform (Figure 2—figure supplement 2; see 'Materials and methods'). We first analyzed LFP signals in the CA1 region during delay periods. Consistent with the active movement of the mice, sharp-wave/ripples (SWRs) were rarely observed (Z=2.19, p=0.03, for start and stem zones vs delay zone; Z=2.40, p=0.02, for delay zone vs goal zone; Mann-Whitney U test; Figure 2A and B) and the LFP was dominated by theta-range (7–11 Hz) activity (Figure 2C and D), suggesting that the circuit remained engaged during this phase. We then examined the activity of excitatory neurons (Table 1) during specific task events: the exit from the start zone (start), the entrance to and exit from the delay zone (delay), and the entrance to the goal zone (goal). In CA1, a subset of neurons exhibited delay-related activity, with firing rate rising during longer delays (>20 s) (Figure 3A), whereas a distinct subset fired only under short delay conditions, decreasing their firing rate as the delay length increased (Figure 3B).

Figure 2

LFP signals during the long delay were characterized by strong theta power and lack of SWRs.

(A) Sharp-wave/ripple events (SWRs) rarely occurred during the delay in the task (data from one session of delay 20 s extension conditions). Red asterisks indicate the locations of SWRs. Black and gray dots indicate the path of animal movements before (black) and after arriving at the goal (gray), respectively. (B) SWRs per session in specific experimental zones. The total number of SWRs for each zone was counted and color-coded according to individual animals (the average number of events acquired from 3 sessions of delay 20 s extension conditions). *, p<0.05, Mann-Whitney U test. (C) Spectrogram of the hippocampal CA1 region during the peri-delay period (averaged from three mice, six sessions in total of delay 20 s extension conditions). Green line: delay-onset; red line: estimated goal-onset. (D) Power spectrum density during 2 s at the beginning of the delay. Shaded area indicates ± SD.

Figure 2—source data 1 Source Data File for Figure 2B.: https://cdn.elifesciences.org/articles/52466/elife-52466-fig2-data1-v2.xlsx

Figure 3

Increased or decreased neuronal activity of CA1 cells during delay.

(A) An example of CA1 delay-active (delay-act) cells, which showed an increment in the firing rate as a function of delay length. Left, raster plots of the firing activity of the cells aligned with start-onset (top), delay-onset (middle) and goal-onset (bottom). Orange lines indicate start-onset. Green lines indicate delay-onset. Pale red lines indicate expected delay-offset. Red lines indicate goal-onset. Center, peristimulus time histograms (PSTHs) of the firing activity of the cells aligned with start-onset (top), delay-onset (middle) and goal-onset (bottom). Right, color-coded rate maps. The delayed arm was assigned to the right side with a large reward for this recording session. Red dots indicate the number of sugar pellets. (B) An example of CA1 delay-suppressed cells, which showed a decrement in the firing rate during delay. The delayed arm was assigned to the right side with a large reward for this recording session.

Table 1

The number of delay-active and delay-suppressed CA1 excitatory and inhibitory neurons recorded from all sessions.

Cell type	Delay-active	Delay-suppressed	Other	Total
Excitatory neurons	243	313	83	639
Inhibitory neurons	43	100	26	169

We next asked whether neurons significantly altered their firing rate during long delays compared with other phases of the task (see 'Materials and methods'). We found that across all task conditions, large numbers of neurons exhibited significant increases (CA1: 243/639 units: 38.0%) or decreases (CA1: 313/639: 48.9%) in their firing rates during the delay (Table 1). We termed these delay-active (delay-act) and delay-suppressed (delay-sup) neurons, respectively (Figure 4A). Comparison between the firing rates for short delays (5 s) and those for long delays (20–40 s) revealed that some delay-act and delay-sup cells exhibited significant elevation or reduction of firing rates for specific delay lengths (Figure 4B). At the population level, peak firing times of both CA1 delay-act and delay-sup cells were widely distributed across the time spent in the delay zone (Figure 4C). To assess the population activity of CA1 cells during the task, we examined the autocorrelation of the population vectors under long-delay conditions (Figure 4D). The population activity of both delay-act and delay-sup cells was clearly segmented into three periods (start, delay, and goal), with differential patterns of sustained activity in each. Similar population codes were found in inhibitory CA1 neurons (Figure 4—figure supplement 1). We next analyzed the population codes across the individual experimental conditions. We found a uniform-like distribution only under the both-side condition (Kolmogorov–Smirnov test; Salz et al., 2016; p=0.46 for the both-side condition; p<0.05 for all other conditions; Figure 4—figure supplement 2), whereas in the remaining three protocols activity was biased towards the early part of the delay.
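The population vector analysis described above can be sketched as follows: at each time bin, the firing rates of all neurons form one vector, and every pair of bins is compared by correlation. This is a minimal illustration, not the analysis code used in the study; the binning, the use of Pearson correlation, and the toy firing rates are assumptions.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0.0 or syy == 0.0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def population_vector_correlation(rates):
    """rates[i][t] is the firing rate of neuron i in time bin t.

    Returns a T x T matrix correlating the population vectors of every pair of bins.
    """
    n_bins = len(rates[0])
    vectors = [[neuron[t] for neuron in rates] for t in range(n_bins)]
    return [[pearson(vectors[a], vectors[b]) for b in range(n_bins)] for a in range(n_bins)]


# Toy data (hypothetical): 4 neurons x 6 bins; bins 0-1 = start, 2-3 = delay, 4-5 = goal.
toy_rates = [
    [5.0, 6.0, 1.0, 1.0, 0.0, 1.0],  # start-preferring
    [4.0, 5.0, 0.0, 1.0, 1.0, 0.0],  # start-preferring
    [0.0, 1.0, 6.0, 5.0, 1.0, 0.0],  # delay-preferring
    [1.0, 0.0, 1.0, 0.0, 6.0, 5.0],  # goal-preferring
]
pv_corr = population_vector_correlation(toy_rates)
```

With sustained, period-specific activity, bins within the same period correlate strongly and bins from different periods do not, which yields the block structure along the diagonal that the text describes as segmentation into start, delay, and goal.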

Figure 4

Delay-dependent firing patterns of CA1 delay-active and delay-suppressed cells.

(A) The distribution of delay-active and delay-suppressed cells aligned with the ratio of firing rate in the long delay period and in whole trials. (B) The ratio of mean firing rate during long delays (20 or 40 s) to that during short delay (5 s) for all neurons (base-10 log-transformed). Each dot indicates an individual neuron. Black dots indicate neurons that had a statistically significant difference in firing rate between short and long delay conditions (p<0.001). (C) Temporal patterns of firing rates in CA1 delay-active and delay-suppressed cells during delay. Top, color-coded temporal firing patterns. Neurons were ordered by the time of their peak firing rates. Bottom, temporal distribution of the peak firing rates of the neurons. Green lines indicate delay-onset. Pale red lines indicate expected delay-offset. (D) Correlation matrix of population vectors as a function of time for CA1 delay-active and delay-suppressed cells.

When we examined the activity of neurons sequentially recorded under all possible delays, we found that lengthening the delay dynamically altered activity, with a substantial fraction of units demonstrating a significant correlation between firing rate and delay length (mean firing rate, 20/58 [34.5%]; peak firing rate, 46/58 [79.3%] for delay-act; mean firing rate, 31/83 [37.3%]; peak firing rate, 60/83 [72.3%] for delay-sup; p=0.01, permutation test, for percentages; Table 2, Figure 5). This indicates that the hippocampus may encode delay length at the level of individual neurons. Peak firing rate may be the better indicator, because mean firing rates computed over delays of different lengths are normalized by different durations, which can distort the comparison. Further, decoding analysis using a support vector machine (SVM) confirmed that the population firing pattern can also predict delay length (classification into five different delay conditions) (Figure 5—figure supplement 1; see 'Decoding of delay length from population spike activity' in 'Materials and methods'). Taken together, these results suggest that the hippocampus may encode delay length using dual coding schemes.
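A per-neuron test of the relationship between firing rate and delay length might look like the sketch below. It is an illustration under assumptions: the text reports correlations and a permutation test on the percentages, but the exact procedure is given in 'Materials and methods', so the use of Pearson correlation, the shuffle-based p-value, the number of permutations, and the example firing rates are all hypothetical.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0.0 or syy == 0.0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def permutation_p_value(x, y, n_perm=2000, seed=0):
    """Two-sided p-value for the x/y correlation, obtained by shuffling y."""
    rng = random.Random(seed)
    observed = abs(pearson(x, y))
    shuffled = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(pearson(x, shuffled)) >= observed:
            hits += 1
    # Add-one correction keeps the estimate away from an impossible p = 0.
    return (hits + 1) / (n_perm + 1)


# Hypothetical single neuron: peak firing rate (Hz) on ten trials, two per delay length.
delay_s = [0, 0, 5, 5, 10, 10, 20, 20, 40, 40]
peak_rate_hz = [2.0, 2.5, 3.1, 2.8, 4.0, 4.4, 6.1, 5.7, 9.8, 10.3]

r = pearson(delay_s, peak_rate_hz)
p = permutation_p_value(delay_s, peak_rate_hz)
is_delay_correlated = p < 0.05
```

Counting the neurons for which `is_delay_correlated` holds, separately for peak and mean firing rate, would give proportions of the kind reported above.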

Figure 5

Delay-dependent firing patterns of CA1 delay-active and delay-suppressed cells.

(A) Scatter plots show correlations between firing rate during delay (upper, peak firing rate; lower, mean firing rate) and the delay length of five representative delay-act cells. Cells A and B show positive correlations in both peak and mean firing rate, whereas cell C shows a positive correlation in only one of the two measures. Cells D and E show negative correlations. (B) Distribution of correlation coefficients between firing rate (left, peak firing rate; right, mean firing rate) and delay length in delay-act cells. Dark color bars indicate statistically significant neurons (p<0.05), whereas bright color bars indicate neurons that do not reach statistical significance. (C) Scatter plots show correlations between firing rate during delay (upper, peak firing rate; lower, mean firing rate) and delay length for five representative delay-suppressed cells. (D) Distribution of correlation coefficients between firing rate (left, peak firing rate; right, mean firing rate) and delay length in delay-suppressed cells. Dark color bars indicate statistically significant neurons (p<0.05), whereas bright color bars indicate neurons that do not reach statistical significance.

Table 2

Full distribution of CA1 excitatory neurons for all of the tested conditions.

Test conditions	Delay responsiveness	Neuron number
Extension	Delay-active	58
	Delay-suppressed	83
	Other	36
Switched or both-side	Delay-active	155
	Delay-suppressed	191
	Other	34
Reward loss or gain	Delay-active	30
	Delay-suppressed	39
	Other	13

Given the learning-dependent development of hippocampal firing during the delay period (Gill et al., 2011), we also examined whether changes in the delay shifted the timing of firing. At the beginning of the daily recording session, about 10% (4/33) of delay-act cells initially fired after the animal reached the goal in the 0 s delay condition, then shifted their firing to the delay period once a delay was introduced (Figure 5—figure supplement 2). Interestingly, subsequent elimination of the delay did not result in a return to goal-related activity (Figure 5—figure supplement 2A). When we compared the firing of CA1 delay-act cells under identical short delay trials occurring before and after long-delay trial blocks (Figure 5—figure supplement 2B), we found that about 10% of neurons shifted positively or negatively (Figure 5—figure supplement 2C), indicating that the onset of firing in the CA1 neurons was influenced by the experience of waiting and/or learning of the delay.

Place-specific delay information is encoded in the majority of CA1 neurons

Given the robust place code present in the hippocampus, we next asked whether CA1 delay-act neurons were spatially selective. To this end, we switched the location of the delay and no-delay arms (switched conditions) or replicated the delay on the other side (both-side conditions) (Figure 6A, Figure 6—figure supplement 1), with corresponding changes in reward size. Under both conditions, the mice changed their behavior within several trials, with the preference for the large reward arm reaching about 70%. We then evaluated the side-selectivity of the delay activity by adding location as a variable in a three-way ANOVA (side, delay length, and timing; see 'Materials and methods') during switched and both-side trials (Figure 6A). Representative side-selective and -unselective excitatory neurons in CA1 are shown in Figure 6B. The percentage of side-selective neurons was high among both CA1 delay-act (114/155: 73.5%) and delay-sup (124/191: 64.9%) neurons (Figure 6C and Tables 2 and 3); however, more than a quarter of the neurons in both groups encoded delay independent of location.
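The caption of Figure 6C reports 95% Clopper-Pearson confidence intervals for these proportions. As a rough illustration of how such an exact binomial interval can be obtained, the sketch below inverts the binomial tail probabilities by bisection. The counts (114/155 and 124/191) come from the text; the bisection approach and the helper names are assumptions of this example and not necessarily how the published intervals were computed.

```python
import math


def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(math.comb(n, i) * p**i * (1.0 - p) ** (n - i) for i in range(k + 1))


def _bisect(f, lo, hi, iters=60):
    """Root of a monotonic f on [lo, hi], assuming f changes sign on the interval."""
    f_lo = f(lo)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def clopper_pearson(k, n, alpha=0.05):
    """Exact two-sided (1 - alpha) confidence interval for a binomial proportion k/n."""
    # Lower bound: the p at which observing k or more successes has probability alpha/2.
    lower = 0.0 if k == 0 else _bisect(
        lambda p: (1.0 - binom_cdf(k - 1, n, p)) - alpha / 2, 0.0, 1.0)
    # Upper bound: the p at which observing k or fewer successes has probability alpha/2.
    upper = 1.0 if k == n else _bisect(
        lambda p: binom_cdf(k, n, p) - alpha / 2, 0.0, 1.0)
    return lower, upper


act_ci = clopper_pearson(114, 155)  # side-selective delay-act neurons
sup_ci = clopper_pearson(124, 191)  # side-selective delay-sup neurons
```

Because the interval is exact rather than based on a normal approximation, it remains valid for small groups such as the inhibitory neurons in Table 3.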

Figure 6

Spatial-selective delay coding in CA1 neurons.

(A) Experimental conditions to investigate the location selectivity in delay-active neurons. The location of the delay zone was switched to the other side (switched conditions) or doubled to both sides (both-side conditions). (B) Example CA1 delay-active and delay-suppressed cells. Side-dependent and side-independent neurons are shown on the left and right, respectively. Top left, colored raster plots expressing relative firing rates. Green lines indicate delay-onset. Pale red lines indicate expected delay-offset. Top right, the conditions corresponding to the raster plots on the left. Red dots indicate the number of sugar pellets. Bottom left, peri-event time histograms showing the averaged firing rates. Magenta lines indicate the firing rate of the left choice with a 20 s delay. Black-filled histograms indicate the firing rate of the right choice with a 20 s delay. Bottom right, color-coded rate maps for the two conditions (normal delay and switch or both-side conditions). (C) Percentage of place-dependent and -independent CA1 delay-active and delay-suppressed neurons. Error bars indicate 95% Clopper-Pearson confidence intervals. **: p<0.01, Mann-Whitney U test.

Table 3

Distribution of side-dependent and side-independent, delay-active and delay-suppressed, CA1 excitatory and inhibitory neurons.

Delay responsiveness	Side-dependency	Cell types	N	%
Delay-active	Side-dependent	Excitatory neuron	114	73.5
	Side-dependent	Inhibitory neuron	12	60.0
	Side-independent	Excitatory neuron	41	26.5
	Side-independent	Inhibitory neuron	8	40.0
Delay-suppressed	Side-dependent	Excitatory neuron	124	64.9
	Side-dependent	Inhibitory neuron	45	71.4
	Side-independent	Excitatory neuron	67	35.1
	Side-independent	Inhibitory neuron	18	28.6

Value-coding in CA1 neurons

We next asked how subjective value influenced the activity of the delay-act and delay-sup neurons in CA1. As mentioned above, delay and reward are common factors that modulate subjective value. To examine whether the changes in delay-period firing patterns produced by delay increments were correlated with those produced by changes in reward size, we first lengthened the delay and subsequently decreased the reward for the delayed option (reward loss conditions), followed by restoration of the reward. In addition, we manipulated the reward size in the opposite direction to avoid order-dependent confounds arising from decreased hunger or motivation of the animals in later trials (reward gain conditions; Figure 7A, Figure 7—figure supplement 1). The majority of delay-act cells decreased their firing rate in response to reward loss, whereas delay-sup neurons had the opposite response, increasing their activity (Figure 7B and C). As a result, the log ratio of the firing rates (large reward/small reward) under both reward loss and gain conditions was significantly different between delay-act and delay-sup cells (Z = −2.6, p=0.007 for reward loss; Z = 2.1, p=0.03 for reward gain, Mann-Whitney U test). In total, the ratio was negatively skewed in the delay-act cells (T = −2.5, p=0.01, one-sample t-test) but positively skewed in delay-sup cells (T = 2.7, p=0.01, one-sample t-test, Figure 7D). These results suggest that firing during the delay independently reflected positive and negative outcomes in these different subpopulations of CA1 neurons. Finally, we examined the relation between firing rate changes ‘by delay extension’ and ‘by reward manipulations’ to explore whether value, a more general concept of information, may be neurally encoded. In both delay-act and delay-sup cells, there was no global trend, but a subset of neurons, plotted around the line of ‘delay effect = reward effect’ (Figure 7E), can be interpreted as value-coding neurons.
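The group comparison of log firing-rate ratios can be sketched with a rank-based test that returns a Z score, as reported above. This is a minimal illustration under assumptions: the normal approximation without tie correction, the small `eps` guard against silent cells, and the example firing rates (labelled only as groups A and B) are hypothetical and are not data from the study.

```python
import math


def log_ratio(rate_a, rate_b, eps=1e-6):
    """Base-10 log of rate_a / rate_b; eps avoids log(0) for silent cells (an assumption)."""
    return math.log10((rate_a + eps) / (rate_b + eps))


def average_ranks(values):
    """1-based ranks, with tied values sharing their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for idx in order[i : j + 1]:
            ranks[idx] = mean_rank
        i = j + 1
    return ranks


def mann_whitney_z(x, y):
    """Z score and two-sided p-value (normal approximation, no tie correction)."""
    n1, n2 = len(x), len(y)
    ranks = average_ranks(list(x) + list(y))
    u1 = sum(ranks[:n1]) - n1 * (n1 + 1) / 2.0
    mean_u = n1 * n2 / 2.0
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    z = (u1 - mean_u) / sd_u
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p


# Hypothetical (rate under condition 1, rate under condition 2) pairs in Hz.
group_a_rates = [(6, 3), (5, 4), (8, 2), (4, 3.5), (7, 5), (3, 2)]
group_b_rates = [(2, 4), (3, 3.5), (1, 2), (2.5, 5), (3, 4), (2, 3)]

group_a = [log_ratio(a, b) for a, b in group_a_rates]
group_b = [log_ratio(a, b) for a, b in group_b_rates]
z, p = mann_whitney_z(group_a, group_b)
```

The sign of `z` depends on which group is passed first and on which condition is placed in the numerator of the ratio, so both conventions need to be fixed before values are compared across analyses.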

Figure 7

The firing of CA1 delay-active and delay-suppressed cells is distinctly changed by reward size manipulations.

(A) Left, experimental reward loss conditions: the reward size was changed from 4 to 1 (or 0) pellets. Right, experimental reward gain conditions: the reward size was changed from 1 (or 0) to 4 pellets. (B) Example CA1 delay-active (top) and delay-suppressed cells (bottom) fired during delay in reward loss conditions. Green lines indicate delay-onset. Red lines indicate expected delay-offset. Red dots indicate the number of sugar pellets. (C) Ratio of firing rates of delay-active and -suppressed cells in reward loss and gain conditions. Dots indicate individual data for delay-active cells (red) and delay-suppressed cells (blue). Central bars indicate the medians. *, p<0.05; **, p<0.01, Mann-Whitney’s U-test. (D) Ratio of firing rates of delay-active and -suppressed cells in mixed population. Error bars indicate SEM. *, p<0.05; **, p<0.01, One-sample t-test. (E) Scatter plots of firing rate ratios between small/large reward conditions and between long delay/short delay conditions. The computed correlation coefficient R and p value are indicated.

Figure 7—source data 1 Source Data File for Figure 7C and D.: https://cdn.elifesciences.org/articles/52466/elife-52466-fig7-data1-v2.xlsx

We next focused on the relationship between the behavioral shift during reward loss sessions and the firing patterns of delay-act cells. If CA1 activity is dependent on the animals’ choice preference, the activity should change dynamically after the elimination of preference. However, across the session, animals avoided the delayed option, making it difficult to observe CA1 activity under this condition. To eliminate the preference for the delayed option, we designed ‘unequal conditions’ (long delay + no pellet vs long delay + four pellets, with the latter being the better option). Animals then quickly reduced their preference for the delayed option with no reward. To record the activity for the less-preferred or adverse choice, we forced mice to choose the less-preferred option by placing an obstacle at the entrance of the opposite arm. When faced with an unrewarded delayed option, CA1 neurons indicating choice preference were silent (Figure 7—figure supplement 2). These results suggest that the firing of delay-act neurons in the CA1 region represents the animal’s subjective value of the chosen options.

NMDAR deficiency in hippocampus disrupted delay-discounting and populational delay coding in CA1

Finally, we took advantage of a mutant mouse, the CaMK2-Cre; NR1-flox/flox mouse, which lacks CA1 pyramidal cell N-methyl-D-aspartate (NMDA) receptors (NMDARs) (CA1-NR1cKO mouse; McHugh et al., 1996; Tsien et al., 1996a; RRID:MGI:3581524), to assess the role of synaptic plasticity in task performance. Consistent with previous reports of hippocampus-dependent learning deficits in these mice (Bannerman et al., 2012; Rondi-Reig et al., 2001; Tsien et al., 1996b), they exhibited impaired delay discounting (Figure 8A), demonstrating a significant bias for the larger reward even when the delay was extended (two-way ANOVA: genotype [CA1-NR1cKO vs NR1 f/f], F₁ = 14.4, p<0.001; delay length, F₄ = 23.0, p<0.001; delay length × genotype interaction, F₁,₄ = 1.07, p=0.37; multiple comparisons at each delay length: p=0.61 at 0 s, p=0.04 at 5 s, p=0.04 at 10 s, p=0.002 at 20 s, p=0.005 at 40 s).

Figure 8

NMDAR-dependent mechanism for delay-discounting.

(A) Impaired delay-discounting in CA1-NR1 cKO mice. *, p<0.05; **, p<0.01; post-hoc Scheffe’s test. Error bars indicate SEM. (B) NMDAR deficiency disrupted the delay tuning in the CA1 activity. Average firing patterns of the CA1 delay-active cells from cKO and control mice for different delay lengths (0, 5, 10, 20, and 40 s). (C) Abnormal delay-active and -suppressed cell proportion in cKO mice. Ratio of delay-act cells to delay-sup cells for cKO and control mice. Error bars indicate 95% Clopper-Pearson confidence intervals. *, p<0.05; Mann–Whitney U test. (D) NMDAR deficiency disrupted the populational activity in CA1. Top, color-coded temporal firing patterns of the CA1 delay-active cells in cKO and control mice. Neurons were ordered by the time of their peak firing rates. Middle, temporal distribution of neurons. Green lines indicate delay-onset. Red lines indicate expected delay-offset. Bottom, correlation matrix of population vectors as a function of time for CA1 delay-act cells in cKO and control mice. (E) NMDAR deficiency disrupted the negative skew in the firing rate ratio of delay-active cells. Ratio of firing rates of delay-active cells in CA1 of cKO and WT mice. Dots indicate individual data for cKO (gray) and control (black) mice. The central bar indicates the median. *, p<0.05; Mann–Whitney U test.

Figure 8—source data 1 Source Data File for Figure 8A and E.: https://cdn.elifesciences.org/articles/52466/elife-52466-fig8-data1-v2.xlsx

We next recorded CA1 neuronal activity in cKO (n = 3, 123 units) and control mice (n = 4, 69 units, Table 4 and Table 5) to look for physiological correlates of the behavioral change. Delay-act cells in the cKO mice showed non-specific activation during the delay period (Figure 8B and Figure 8—figure supplement 1A). Accordingly, the proportion of delay-act neurons was lower and that of delay-sup neurons higher in the cKO, and the ratio of delay-act to delay-sup neurons was significantly lower in the cKO than in the control mice (Figure 8C, p=0.02, Fisher’s exact test, Figure 8—figure supplement 1B). Further, in contrast to the controls, the temporal distribution of all delay-act cells in the cKO was sparse and not specific to delay-onset. As a result, population vector analysis revealed that the activity was not segmented into three periods in the cKO mice (Figure 8D). In addition, the firing rate ratio in CA1 of cKO mice differed significantly from that observed in control mice and lacked the expected negatively skewed distribution (Z = 2.0, p=0.04, Mann-Whitney U test, Figure 8E). We could not detect a significant difference between the genotypes in basic firing properties during the task (mean firing rate, cKO: 3.07 Hz, control: 3.39 Hz, Z = −0.76, p=0.44, Mann-Whitney U test). Subpopulation firing rates were also not significantly different (delay-act, cKO: 2.66 Hz, control: 3.13 Hz, Z = −0.91, p=0.35; delay-sup, cKO: 3.47 Hz, control: 3.47 Hz, Z = −0.68, p=0.14, Mann-Whitney U test). These findings suggest that delay discount behavior and the underlying delay-related activity in CA1 pyramidal cells require NMDAR-dependent mechanisms in the hippocampus.
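A comparison of delay-act and delay-sup cell counts between genotypes with Fisher's exact test can be sketched as below. The counts are the extension-condition rows of Table 4 (cKO: 28 delay-act, 56 delay-sup; control: 25 delay-act, 20 delay-sup); whether exactly these counts entered the reported test is an assumption of this example, so the resulting value should not be read as a reproduction of the published p-value.

```python
import math


def hypergeom_pmf(a, row1, row2, col1):
    """P(top-left cell = a) for a 2x2 table with fixed margins."""
    return math.comb(row1, a) * math.comb(row2, col1 - a) / math.comb(row1 + row2, col1)


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = hypergeom_pmf(a, row1, row2, col1)
    # Sum every table at least as unlikely as the observed one
    # (small relative tolerance guards against floating-point rounding).
    return sum(
        p
        for p in (hypergeom_pmf(x, row1, row2, col1) for x in range(lo, hi + 1))
        if p <= p_obs * (1.0 + 1e-9)
    )


# Rows: cKO, control. Columns: delay-act, delay-sup (extension conditions, Table 4).
p_genotype = fisher_exact_two_sided(28, 56, 25, 20)
```

The two-sided definition used here, summing all tables no more probable than the observed one, is a common convention, but other conventions exist and can give slightly different values.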

Table 4

Full distribution of CA1 excitatory neurons for the NMDAR mutant study.

The numbers in parentheses are cells from the wildtype.

Test conditions	Delay responsiveness	Neurons (cKO)	Neurons (control)
Extension	Delay-active	28	25
	Delay-suppressed	56	20
	Other	22	19
Reward loss and gain	Delay-active	8	3 (30)
	Delay-suppressed	6	0
	Other	3	2

Table 5

Ages of CA1-NMDAR cKO mutant and control mice used for the electrophysiological study.

Genotype	Animal ID	Age at surgery	Age at experiments ended
CA1-NR1 cKO (CaMK2-Cre; NR1-flox/flox)	M18	2 months	3 months
	M28	3 months	3 months
	M30	3 months	4 months
Control (NR1-flox/flox)	M24	5 months	5 months
	M26	3 months	4 months
	M29	3 months	4 months
	M31	4 months	5 months

Discussion

We recorded CA1 neuronal activity in mice during delay-based decision making in an automated T-maze task while independently manipulating delay length and reward size across sessions. We observed distinct populations of neurons that increased or decreased their firing during the delay. Moreover, the firing rates of a subset of the delay-activated CA1 neurons decreased with both delay length increments and reward size declines. Notably, the activated and suppressed neurons showed distinct activity changes following reward size manipulations. These results suggest that dissociable subpopulations of hippocampal neurons represent delay and reward information in opposing ways. These discoveries should help shape models of how the hippocampus supports decision making.

Although the delay-modulated activity was diverse across CA1 neurons, their responsiveness to delay was precisely controlled. A significant fraction of CA1 neurons reflected delay length in their firing rate, suggesting the encoding of delay length in the hippocampus on a single-cell level. Related to this, positive and negative correlations with delay length were observed in both delay-act and delay-sup cells. Currently, it is not clear what roles delay-act or delay-sup cells or those neurons with positive or negative correlation play in the animal’s decision. In a delay-discounting task, delay may be encoded in two different ways: by a discounting factor and by a factor predicting a larger reward. Future work should investigate whether the two directions of correlation are related to the discounting or prediction.

At the population level, the firing rate peaks of delay-act and delay-sup cells were distributed largely around delay onset. As a result, population vector analysis demonstrated segmented and sustained network activity during the delay in the CA1 region, suggesting a role in prospective coding of specific periodic events centered on the delay. The decoding analysis demonstrated that, particularly during short delay blocks, delay length could be decoded from population activity even before delay onset. This may reflect the animal's experience with the task and expectation of an impending reward. In addition, when a fixed delay is presented consistently, the population coding of delay may be more precise.

A significant fraction of both the delay-act and the delay-sup neurons that we recorded also carried spatially tuned delay information. Thus, the activity of most delay-act and delay-sup cells in dorsal CA1 does not appear to represent delay information alone, but rather may represent integrated information about the chosen option, reflecting both location and delay. This result is consistent with the idea that hippocampal cells code not only space and time individually but also their conjunction (Eichenbaum, 2014; Howard and Eichenbaum, 2015; MacDonald et al., 2011).

Changing reward size modulated the firing rates of both the delay-act and the delay-sup cells in CA1. It is widely known that the activity of CA1 neurons can depend on reward (Ambrose et al., 2016; Hölscher et al., 2003; Singer and Frank, 2009). Studies focusing on goal-directed behavior have demonstrated that some CA1 neurons fire when animals approach, wait for, or acquire rewards, but not when animals visit the same location in the absence of the reward (Eichenbaum et al., 1987; Fyhn et al., 2002; Hok et al., 2007; Kobayashi et al., 2003; Rolls and Xiang, 2005), indicating that a subset of CA1 neurons is highly sensitive to reward expectation or motivation. However, in monkeys, omission of a reward activated some CA1 neurons (Watanabe and Niki, 1985). This is consistent with our results demonstrating that during the delay, dissociable subsets of CA1 neurons were positively or negatively correlated with reward size. The scatter plot of the firing rate ratios for small/large reward conditions against those for long/short delay conditions (Figure 7E) shows no global trend, suggesting that CA1 neurons respond largely independently to the delay and reward manipulations. We found, however, that a fraction of neurons reacted in the same way to delay and reward manipulation, suggesting that there may be value-coding neurons in CA1. Further study will be required to isolate specific neurons encoding subjective value, focusing on specific pathways or cell types. Accordingly, a distinct subpopulation of CA1 neurons may encode the delay-reward integration and may support the valuation process in delay-based decision making.

The phenotype of ‘lowered delay discounting’ caused by a loss of the NMDAR may also be interpreted as an abnormal repetition of an unpleasant choice, referred to as ‘compulsive behavior’. Systemic injection of the partial NMDAR agonist D-cycloserine reduces compulsive lever-pressing in a model of obsessive-compulsive disorder (OCD) in rats (Albelda et al., 2010). In addition, polymorphisms in a subunit of NMDAR have been considered as a risk factor in OCD (Arnold et al., 2004). The present study suggests that the hippocampal NMDARs are required for delay discounting and provides additional evidence that hippocampal NMDARs may be associated with compulsive disorders. It is widely believed that synaptic plasticity via NMDAR-dependent machinery contributes to association learning and that, in the hippocampus, this contributes to the formation of long-term, spatial memories (Martin et al., 2000). Studies using several lines of conditional knockout mice have pointed out that NMDAR in the hippocampus is involved in spatial learning (Tsien et al., 1996a), nonspatial learning (Huerta et al., 2000; Rondi-Reig et al., 2001), anxiety (Bannerman et al., 2004; Kjelstrup et al., 2002; McHugh et al., 2004; Richmond et al., 1999), time perception (Huerta et al., 2000), and decision making (Bannerman et al., 2012). In addition, physiological studies have demonstrated that a hippocampus that lacks NMDAR exhibits less specific spatial representation in place cells (McHugh et al., 1996). We found that NMDAR deficiency disrupted the proportion of delay-act and delay-sup cells, and population coding for the delay. These findings suggest that the NMDAR in the hippocampus may be required to maintain or develop time-coding. It should be noted that the NR1 knockout may be extended to other telencephalic regions (CA3, dentate gyrus, deep cortical layers) in the cKO animals in the present study. 
Further research is required to identify more specific mechanistic roles of hippocampal NMDARs in delay-based decision making. In addition, in contrast to previous studies showing that rats with hippocampal lesions exhibit higher discount rates (Cheung and Cardinal, 2005; Mariano et al., 2009; McHugh et al., 2008), the NMDAR cKO mice showed the opposite phenotype. Thus, lesions and NMDAR knockout in the hippocampus may have considerably different effects on the full network engaged during delay-based decision making.

In conclusion, our results show that CA1 neuronal activity during the delay is segregated into two populations, delay-active and delay-suppressed neurons. Further, these groups show opposing responses to changes in motivational background. In addition, NMDAR-dependent plasticity mechanisms appear to be required for the formation of the firing patterns during the delay and for delay discounting. These findings further clarify the role of the hippocampus in decision making, as well as in the control of impulsive or compulsive behaviors.

Materials and methods

Key resources table

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information
Strain, strain background (Mus musculus)	C57BL/6J	RIKEN Bio Resource Center	RRID: IMSR_JAX:000664	Wild-type mouse
Strain, strain background (Mus musculus)	NR1^flox	PMID: 8980237	005246 (Jackson Laboratory)	Targeted mutation line
Strain, strain background (Mus musculus)	Tg(Camk2a-cre)T29-1Stl/J	PMID: 8980237	005359 (Jackson Laboratory)	Cre transgenic line
Strain, strain background (Mus musculus)	Tg(Camk2a-cre)T29-1Stl/J, NR1^flox/flox	PMID: 8980237	RRID: MGI:3581524	Conditional knockout line
Commercial assay or kit	T-maze	O’hara and Co., Ltd.	RRID: SCR_018016	Automatic operant test
Other	Neural probes	NeuroNexus	A4 × 2-tet-5mm-150-200-312	32-ch electrode
Other	nDrive	NeuroNexus	RRID: SCR_018019	Micro driver to control movement of electrode
Other	Amplipex: KJE-1001	Amplipex	RRID: SCR_018017	Recording system for neural signals
Software, algorithm	MATLAB_R2018a	Mathworks	RRID: SCR_001622
Software, algorithm	Klusters	PMID: 16580733	RRID:SCR_015533
Software, algorithm	NDmanager	PMID: 16580733	RRID:SCR_015533
Software, algorithm	Neuroscope	PMID: 16580733	RRID:SCR_015533
Software, algorithm	KlustaKwik2	PMID: 25149694	RRID:SCR_014480
Sequence-based reagent	Cre_F	PMID: 28244984	PCR primers	ACC TGA TGG ACA TGT TCA GGG ATC G
Sequence-based reagent	Cre_R	PMID: 28244984	PCR primers	TCC GGT TAT TCA ACT TGC ACC ATG C
Sequence-based reagent	NR1^flox-F	PMID: 28244984	PCR primers	TGT GCT GGG TGT GAG GGT TG
Sequence-based reagent	NR1^flox-R	PMID: 28244984	PCR primers	GTG AGC TGC ACT TCC AGA AG
Other	DAPI stain	ThermoFisher	Thermo Fisher Scientific Cat# D1306	(1 µg/mL)
Other	DiI stain	ThermoFisher	Thermo Fisher Scientific Cat# D3911	(200 µg/mL)

Animals

All procedures were approved by the RIKEN Animal Care and Use Committee. A total of 29 male C57BL/6J mice were used for this study (wildtype, n = 5; cKO, n = 11 [8 for behavioral study]; control, n = 13 [9 for behavioral study]). Mice lacking NMDAR in the hippocampus (RRID:MGI:3581524) were generated by crossing a line gene-targeted for loxP-tagged Nr1 (Grin1) alleles (Nr1^flox; Tsien et al., 1996a) with a transgenic line carrying Camk2a promoter-driven Cre recombinase (Camk2a-Cre, T29-1Stl; Tsien et al., 1996a). In this mutant, deletion of NR1 is delayed until about 4 weeks after birth and is restricted to CA1 pyramidal cells until about 2 months of age (Fukaya et al., 2003). Most of the behavioral analysis using the mutant was completed by this age. Hence, it is unlikely that the behavioral impairment observed was the result of undetected developmental abnormalities. The animals used for physiological characterization, however, may have harbored a more widespread deletion of the NR1 gene, because the cKO animals were slightly older than 2 months during the recording sessions (Table 5).

Delay-based decision-making task

Adult mice were trained in a delay-based decision-making task in an automated T-maze (O’HARA and Co., Tokyo, Japan, RRID:SCR_018016) before electrophysiological recording. The maze was partitioned into six areas (Start, Junction, Right-Goal, Right-Back, Left-Goal, and Left-Back) by seven sliding doors (S-J, J-R, R-RG, RG-S, J-L, L-LG, and LG-S). The detailed protocol has been described previously (Kobayashi et al., 2013; Zhang et al., 2018). In short, the mice were food-restricted to approximately 80% of their free-feeding weight and habituated for 2 days to the maze, which was baited with scattered pellets (30 min/day). The large-reward arm and the small-reward arm were assigned to the right or left arm randomly for each mouse. Four pellets were available in the large-reward arm, whereas only one pellet was available in the small-reward arm. During the initial training period (5–10 days), mice were allowed to choose either arm freely and without delay until they preferred the large-reward arm (>80%). Then, all animals were trained in the extension delay conditions for at least 5 days. In the first block of trials each day, the large-reward arm was presented with no delay (0 s); in the later blocks, it was associated with a delay of 5 s, 10 s, 20 s, or 40 s. The small-reward arm was always associated with no delay. Each block consisted of 10 or more trials (15 or 30 min). If the trial number was lower than 10, additional blocks were run. Next, the mice, except the cKO and control groups, were trained in the switched and both-side conditions. In the switched condition, the delayed large-reward arm was switched to the other side. In the both-side condition, both arms were delayed (delayed-small and delayed-large arms). The switched condition was run first, followed by the both-side condition. After each change of condition, 10 or more trials were run consecutively to allow the animals' responses to stabilize.
Finally, the mice were trained in the Reward loss and gain conditions. We decreased the reward size to investigate whether the firing rate reflected a positive or negative aspect of the delayed option. Initially, we set a short delay with the normal large reward, and then lengthened the delay without changing the large reward, as in the other conditions. After these two consecutive sessions, we reduced the reward size from four pellets to one. As a control, we also ran the sessions in the opposite order (long delay with one pellet first, then long delay with four pellets). In all experiments, mice were allowed to drink water between blocks. Four to six consecutive daily sessions were performed per week.

Histological identification of the localization of the recorded sites

Because the silicon probe shanks are thin, their tracks were hard to detect. To facilitate track identification under DAPI staining (Thermo Fisher Scientific Cat# D1306), we painted the back of the shanks with DiI (Thermo Fisher Scientific Cat# D3911) and/or created an electrical lesion with a small current (5 mA for 5 s) (Figure 2—figure supplement 1).

Recording and spike sorting

Mice were anesthetized with isoflurane during surgery. Silicon probes or wire tetrodes were implanted in the hippocampal CA1 region (AP = −2.0 to −2.8 mm, ML = 1.2 to 2.0 mm, DV = 1.2 to 1.5 mm). In all experiments, ground and reference screws were fixed in the skull atop the cerebellum. The silicon probes were attached to micromanipulators (nDrive, NeuroNexus, Michigan, USA) and the nichrome wire tetrodes were combined with a micro-drive (Middleton and McHugh, 2016), which enabled us to move the electrodes to the desired depth. Electrophysiological signals were acquired continuously at 20 kHz on a multi-channel recording system (KJE-1001, Amplipex Ltd, Szeged, Hungary, RRID:SCR_018017). The wide-band signal was down-sampled to 1.25 kHz and used as the LFP signal. We detected SWRs (their timing, power, and duration) in the band-pass filtered signal (120–230 Hz) as events exceeding three SD of the log-power in the same frequency band. To track the positions of the animals over time, two color LEDs were mounted on the headstage and recorded with a digital video camera at 30 frames/s. Spikes were extracted from the high-pass filtered signals (median filter, cut-off frequency: 800 Hz). Spike sorting was performed semi-automatically using KlustaKwik2 (RRID:SCR_014480, https://github.com/kwikteam/klustakwik2/; Kadir et al., 2014). The cell types of the units were classified by peak-trough latency and width. In total, we analyzed 831 putative excitatory neurons (n = 639 for wildtype; n = 123 for cKO; n = 69 for NR1f/f mice; Table 1 and Table 4) and 250 inhibitory neurons (n = 169 for wildtype; n = 53 for cKO; n = 28 for NR1f/f mice). The positions of the animals were determined from the positions of the LEDs mounted on the headstage. Rate maps of spike number and occupancy probability were generated from the position and spiking data in 4 cm bins.
The normalized PSTH for individual delay-act and delay-sup cells in CA1 was computed under the 20 s delay condition. The autocorrelation of the population vector was then computed.

Determination of delay-active and delay-suppressed neurons

To examine the effect of delay on neuronal activity, we quantified changes in the firing rate of each neuron during the long delay period. First, for long-delay trials, we calculated the firing rate in the delay zone (R_delay = n_delay/t_delay, where n_delay is the number of spikes in the delay zone and t_delay is the time spent in the delay zone; see Figure 1A) and the firing rate in all zones (R_total = n_total/t_total, where n_total is the number of spikes in all zones and t_total is the time spent in all zones), and then computed their ratio (R_delay/R_total). Second, we performed a permutation test to determine whether the ratio R_delay/R_total differed significantly from chance. To make surrogate data, we resampled the spike trains by permuting the inter-spike intervals and rebuilding the spike times from the permuted intervals. We repeated this process 1000 times to obtain 1000 resampled datasets. The rank of the original firing rate ratio R_delay/R_total among the 1000 resampled ratios was used to assess significance (delay-act cells: significantly higher firing rate [rank <50, top 5%]; delay-sup cells: significantly lower firing rate [rank >950, bottom 5%]).
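
The classification procedure described above can be sketched as follows. This is a minimal Python illustration, not the analysis code used in the study (which was written in MATLAB). It assumes that delay-zone occupancy is available as a list of (start, end) time windows, that surrogate trains are anchored at the first recorded spike, and that ties between the observed and surrogate ratios count against significance. The names `rate_ratio`, `shuffle_isi`, and `classify_neuron` are hypothetical.

```python
import random
from bisect import bisect_left


def rate_ratio(spikes, delay_windows, t_total):
    """Return R_delay / R_total for a sorted list of spike times (s)."""
    n_delay = sum(bisect_left(spikes, end) - bisect_left(spikes, start)
                  for start, end in delay_windows)
    t_delay = sum(end - start for start, end in delay_windows)
    r_delay = n_delay / t_delay
    r_total = len(spikes) / t_total
    return r_delay / r_total


def shuffle_isi(spikes, rng):
    """Permute the inter-spike intervals and rebuild the spike times."""
    isis = [b - a for a, b in zip(spikes, spikes[1:])]
    rng.shuffle(isis)
    surrogate = [spikes[0]]  # assumption: keep the first spike in place
    for isi in isis:
        surrogate.append(surrogate[-1] + isi)
    return surrogate


def classify_neuron(spikes, delay_windows, t_total, n_perm=1000, seed=0):
    """Label a neuron as 'delay-act', 'delay-sup', or 'other'."""
    rng = random.Random(seed)
    observed = rate_ratio(spikes, delay_windows, t_total)
    surrogates = [rate_ratio(shuffle_isi(spikes, rng), delay_windows, t_total)
                  for _ in range(n_perm)]
    cutoff = 0.05 * n_perm  # 50 of 1000, matching the 5% criterion in the text
    if sum(s >= observed for s in surrogates) < cutoff:
        return "delay-act"   # observed ratio lies in the top 5%
    if sum(s <= observed for s in surrogates) < cutoff:
        return "delay-sup"   # observed ratio lies in the bottom 5%
    return "other"
```

Because permuting the intervals leaves the spike count and session length unchanged, R_total is identical for every surrogate, so the test effectively asks whether spikes are concentrated in (or excluded from) the delay zone more than their interval statistics alone would predict.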

Decoding of delay length from population spike activity

To quantify the information about delay length reflected in the population spike activity, we performed a decoding analysis. We used the fitcecoc.m function from the MATLAB Statistics and Machine Learning Toolbox, which trains a multiclass error-correcting output codes (ECOC) model composed of binary linear support vector machines (SVMs) (e.g., Reber et al., 2019; Stavisky et al., 2019). In this model, binary SVMs are trained between all pairs of labels. All parameters were set to their default values. We constructed a feature vector for one or two trials, consisting of the firing activity of each neuron (normalized firing rate [0 to 1]) in 25 bins of 200 ms (over 5 s). The classifier was trained on spike trains from −25 s to 60 s relative to delay onset in all five conditions, labeled by delay length (0, 5, 10, 20, and 40 s), for each animal (between 17 and 43 neurons per animal) at 2 s time steps in each trial (Figure 5—figure supplement 1A). Classification performance was cross-validated using a leave-one-trial-out method and quantified as the probability of correct classification, which we calculated separately for each delay length. Performance is shown together with surrogate decoding performance as the chance level, obtained from artificial test datasets created by shuffling the neuron labels and/or delay lengths (Figure 5—figure supplement 1B and D).
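
The structure of this decoding analysis (binned, normalized feature vectors; pairwise classifiers combined by voting; leave-one-trial-out cross-validation) can be illustrated with a dependency-free Python sketch. It is not a reimplementation of fitcecoc: a nearest-centroid rule is swapped in for each binary linear SVM purely to keep the example short, and normalization by each neuron's peak bin is an assumption, since the text does not specify how firing rates were scaled to [0, 1]. All function names are hypothetical.

```python
from itertools import combinations

N_BINS, BIN_WIDTH = 25, 0.2  # 25 bins of 200 ms (5 s), as in the text


def bin_spikes(spike_times, t_start):
    """Count spikes in 25 consecutive 200 ms bins starting at t_start."""
    counts = [0] * N_BINS
    for t in spike_times:
        k = int((t - t_start) // BIN_WIDTH)
        if 0 <= k < N_BINS:
            counts[k] += 1
    return counts


def normalize(counts):
    """Scale one neuron's binned activity to [0, 1] by its peak (assumed)."""
    peak = max(counts)
    return [c / peak for c in counts] if peak > 0 else [0.0] * len(counts)


def feature_vector(neurons, t_start):
    """Concatenate the normalized binned activity of all neurons in a trial."""
    vec = []
    for spikes in neurons:
        vec.extend(normalize(bin_spikes(spikes, t_start)))
    return vec


def centroid(rows):
    return [sum(col) / len(rows) for col in zip(*rows)]


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def predict_one_vs_one(train_x, train_y, x):
    """Vote over all label pairs; nearest centroid stands in for each SVM."""
    labels = sorted(set(train_y))
    cents = {c: centroid([r for r, y in zip(train_x, train_y) if y == c])
             for c in labels}
    votes = {c: 0 for c in labels}
    for a, b in combinations(labels, 2):
        votes[a if sq_dist(x, cents[a]) <= sq_dist(x, cents[b]) else b] += 1
    return max(labels, key=votes.get)


def leave_one_trial_out_accuracy(xs, ys):
    """Hold out each trial in turn and return the fraction decoded correctly."""
    hits = 0
    for i in range(len(xs)):
        train_x, train_y = xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:]
        hits += predict_one_vs_one(train_x, train_y, xs[i]) == ys[i]
    return hits / len(xs)
```

A chance level in the spirit of the surrogate analysis could be estimated by shuffling `ys` before calling `leave_one_trial_out_accuracy` and repeating the procedure many times.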

Statistical analysis

Correlation coefficients and p values between firing rates and delay length were calculated with the MATLAB function corrcoef. To estimate the statistical significance of the observed percentage of neurons correlated with delay length, we resampled firing rates and delay lengths across all trials 1000 times and then compared the observed percentage with the permuted percentages. To compare firing rates between short and long delay conditions, we performed Wilcoxon’s rank sum test. The Kolmogorov–Smirnov test (kstest) (Salz et al., 2016) was used to test normality. To assess side-dependency in firing rates, a three-way ANOVA (side [right and left] × phase [start, delay, and goal] × delay length [5 and 20 s]) was used. To compare the effect of reward loss and gain on the firing rates of delay-act and delay-sup cells, and the average firing rates of cKO and control mice, Mann-Whitney’s U test was used. To examine the ratio distribution, we performed two-tailed one-sample t tests against 0. The behavioral impact of the NMDAR conditional knockout was evaluated by two-way ANOVA (genotype [cKO and control] × choice probability) followed by a post-hoc Scheffe’s test. Fisher’s exact test was applied to compare the cell-type distributions between cKO and control mice.
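
The correlation between firing rate and delay length can be illustrated for a single neuron with the short Python sketch below. It is a simplified stand-in, not the study's procedure: the parametric p value returned by corrcoef is replaced by a label-shuffle p value, and the population-level resampling of the percentage of correlated neurons is not reproduced. The function names and the example data are hypothetical.

```python
import random
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    denom = sqrt(sxx * syy)
    return sxy / denom if denom > 0 else 0.0


def shuffle_p_value(rates, delays, n_perm=1000, seed=0):
    """Two-sided p value from shuffling delay labels across trials."""
    rng = random.Random(seed)
    observed = abs(pearson_r(rates, delays))
    shuffled = list(delays)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(pearson_r(rates, shuffled)) >= observed:
            exceed += 1
    # Add one to numerator and denominator so the estimate is never zero.
    return (exceed + 1) / (n_perm + 1)


# Made-up example: four trials at each delay length used in the task,
# with a firing rate that falls linearly as the delay grows.
delays = [0, 5, 10, 20, 40] * 4
rates = [10.0 - 0.2 * d for d in delays]
r = pearson_r(rates, delays)
p = shuffle_p_value(rates, delays)
```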

Data availability

All data generated or analysed during this study are included in the manuscript and supporting files.

References

1. Ainslie G
(1975) Specious reward: a behavioral theory of impulsiveness and impulse control
Psychological Bulletin 82:463–496.

https://doi.org/10.1037/h0076860
1. Ainslie G
(1992)
Picoeconomics: The Strategic Interaction of Successive Motivational States Within the Person

Cambridge University Press.
(2010) The role of NMDA receptors in the signal attenuation rat model of obsessive–compulsive disorder
Psychopharmacology 210:13–24.

https://doi.org/10.1007/s00213-010-1808-9
(2016) Reverse replay of hippocampal place cells is uniquely modulated by changing reward
Neuron 91:1124–1136.

https://doi.org/10.1016/j.neuron.2016.07.047
(2004) Association of a glutamate (NMDA) subunit receptor gene (GRIN2B) with obsessive-compulsive disorder: a preliminary study
Psychopharmacology 174:530–538.

https://doi.org/10.1007/s00213-004-1847-1
(2017) Mapping of a non-spatial dimension by the hippocampal–entorhinal circuit
Nature 543:719–722.

https://doi.org/10.1038/nature21692
1. Bannerman DM
2. Rawlins JNP
3. McHugh SB
4. Deacon RMJ
5. Yee BK
6. Bast T
7. Zhang W-N
8. Pothuizen HHJ
9. Feldon J
(2004) Regional dissociations within the hippocampus—memory and anxiety
Neuroscience & Biobehavioral Reviews 28:273–283.

https://doi.org/10.1016/j.neubiorev.2004.03.004
1. Bannerman DM
2. Bus T
3. Taylor A
4. Sanderson DJ
5. Schwarz I
6. Jensen V
7. Hvalby Ø
8. Rawlins JN
9. Seeburg PH
10. Sprengel R
(2012) Dissecting spatial knowledge from spatial choice by hippocampal NMDA receptor deletion
Nature Neuroscience 15:1153–1159.

https://doi.org/10.1038/nn.3166
1. Bickel WK
2. Marsch LA
(2001) Toward a behavioral economic understanding of drug dependence: delay discounting processes
Addiction 96:73–86.

https://doi.org/10.1046/j.1360-0443.2001.961736.x
(2002) The human Hippocampus and spatial and episodic memory
Neuron 35:625–641.

https://doi.org/10.1016/S0896-6273(02)00830-9
(2016) Spatial Gene-Expression gradients underlie prominent heterogeneity of CA1 pyramidal neurons
Neuron 89:351–368.

https://doi.org/10.1016/j.neuron.2015.12.013
(2006) Discount rates and risky sexual behaviors among teenagers and young adults
Journal of Risk and Uncertainty 32:217–230.

https://doi.org/10.1007/s11166-006-9520-1
1. Cheung TH
2. Cardinal RN
(2005) Hippocampal lesions facilitate instrumental learning with delayed reinforcement but induce impulsive choice in rats
BMC Neuroscience 6:36.

https://doi.org/10.1186/1471-2202-6-36
1. Danielson NB
2. Zaremba JD
3. Kaifosh P
4. Bowler J
5. Ladow M
6. Losonczy A
(2016) Sublayer-Specific coding dynamics during spatial navigation and learning in hippocampal area CA1
Neuron 91:652–665.

https://doi.org/10.1016/j.neuron.2016.06.020
(2018) Spatial representations of self and other in the Hippocampus
Science 359:213–218.

https://doi.org/10.1126/science.aao3898
(1987) Cue-sampling and goal-approach correlates of hippocampal unit activity in rats performing an odor-discrimination task
The Journal of Neuroscience 7:716–732.

https://doi.org/10.1523/JNEUROSCI.07-03-00716.1987
1. Eichenbaum H
(2014) Time cells in the Hippocampus: a new dimension for mapping memories
Nature Reviews Neuroscience 15:732–744.

https://doi.org/10.1038/nrn3827
1. Ekstrom AD
2. Kahana MJ
3. Caplan JB
4. Fields TA
5. Isham EA
6. Newman EL
7. Fried I
(2003) Cellular networks underlying human spatial navigation
Nature 425:184–188.

https://doi.org/10.1038/nature01964
(2008) Food reinforcement and impulsivity in overweight children and their parents
Eating Behaviors 9:319–327.

https://doi.org/10.1016/j.eatbeh.2007.10.007
1. Figner B
2. Knoch D
3. Johnson EJ
4. Krosch AR
5. Lisanby SH
6. Fehr E
7. Weber EU
(2010) Lateral prefrontal cortex and self-control in intertemporal choice
Nature Neuroscience 13:538–539.

https://doi.org/10.1038/nn.2516
1. Foster DJ
2. Wilson MA
(2006) Reverse replay of behavioural sequences in hippocampal place cells during the awake state
Nature 440:680–683.

https://doi.org/10.1038/nature04587
1. Fukaya M
2. Kato A
3. Lovett C
4. Tonegawa S
5. Watanabe M
(2003) Retention of NMDA receptor NR2 subunits in the lumen of endoplasmic reticulum in targeted NR1 knockout mice
PNAS 100:4855–4860.

https://doi.org/10.1073/pnas.0830996100
1. Fyhn M
2. Molden S
3. Hollup S
4. Moser M-B
5. Moser EI
(2002) Hippocampal neurons responding to First-Time dislocation of a target object
Neuron 35:555–566.

https://doi.org/10.1016/S0896-6273(02)00784-5
1. Gauthier JL
2. Tank DW
(2018) A dedicated population for reward coding in the Hippocampus
Neuron 99:179–193.

https://doi.org/10.1016/j.neuron.2018.06.008
(2011) Hippocampal episode fields develop with learning
Hippocampus 21:1240–1249.

https://doi.org/10.1002/hipo.20832
1. Hok V
2. Lenck-Santini PP
3. Roux S
4. Save E
5. Muller RU
6. Poucet B
(2007) Goal-related activity in hippocampal place cells
Journal of Neuroscience 27:472–482.

https://doi.org/10.1523/JNEUROSCI.2864-06.2007
(2003) Reward modulates neuronal activity in the Hippocampus of the rat
Behavioural Brain Research 142:181–191.

https://doi.org/10.1016/S0166-4328(02)00422-9
1. Howard MW
2. Eichenbaum H
(2015) Time and space in the Hippocampus
Brain Research 1621:345–354.

https://doi.org/10.1016/j.brainres.2014.10.069
1. Huerta PT
2. Sun LD
3. Wilson MA
4. Tonegawa S
(2000) Formation of temporal memory requires NMDA receptors within CA1 pyramidal neurons
Neuron 25:473–480.

https://doi.org/10.1016/S0896-6273(00)80909-5
(2007) Integrating Hippocampus and striatum in decision-making
Current Opinion in Neurobiology 17:692–697.

https://doi.org/10.1016/j.conb.2008.01.003
1. Johnson A
2. Redish AD
(2007) Neural ensembles in CA3 transiently encode paths forward of the animal at a decision point
Journal of Neuroscience 27:12176–12189.

https://doi.org/10.1523/JNEUROSCI.3761-07.2007
1. Jung MW
2. McNaughton BL
(1993) Spatial selectivity of unit activity in the hippocampal granular layer
Hippocampus 3:165–182.

https://doi.org/10.1002/hipo.450030209
(2014) High-dimensional cluster analysis with the masked EM algorithm
Neural Computation 26:2379–2394.

https://doi.org/10.1162/NECO_a_00661
1. Kalivas PW
2. Volkow ND
(2005) The neural basis of addiction: a pathology of motivation and choice
American Journal of Psychiatry 162:1403–1413.

https://doi.org/10.1176/appi.ajp.162.8.1403
(2002) Reduced fear expression after lesions of the ventral Hippocampus
PNAS 99:10825–10830.

https://doi.org/10.1073/pnas.152112399
1. Kobayashi T
2. Tran AH
3. Nishijo H
4. Ono T
5. Matsumoto G
(2003) Contribution of hippocampal place cell activity to learning and formation of goal-directed navigation in rats
Neuroscience 117:1025–1035.

https://doi.org/10.1016/S0306-4522(02)00700-5
1. Kobayashi Y
2. Sano Y
3. Vannoni E
4. Goto H
5. Suzuki H
6. Oba A
7. Kawasaki H
8. Kanba S
9. Lipp HP
10. Murphy NP
11. Wolfer DP
12. Itohara S
(2013) Genetic dissection of medial habenula-interpeduncular nucleus pathway function in mice
Frontiers in Behavioral Neuroscience 7:17.

https://doi.org/10.3389/fnbeh.2013.00017
(2013) Hippocampal "time cells": time versus path integration
Neuron 78:1090–1101.

https://doi.org/10.1016/j.neuron.2013.04.015
1. Lee H
2. Ghim JW
3. Kim H
4. Lee D
5. Jung M
(2012) Hippocampal neural correlates for values of experienced events
Journal of Neuroscience 32:15053–15065.

https://doi.org/10.1523/JNEUROSCI.2806-12.2012
(2010) Identifying the neurobiology of altered reinforcement sensitivity in ADHD: A review and research agenda
Neuroscience & Biobehavioral Reviews 34:744–754.

https://doi.org/10.1016/j.neubiorev.2009.11.021
(2011) Hippocampal “Time Cells” Bridge the Gap in Memory for Discontiguous Events
Neuron 71:737–749.

https://doi.org/10.1016/j.neuron.2011.07.012
(2007) Gradual Changes in Hippocampal Activity Support Remembering the Order of Events
Neuron 56:530–540.

https://doi.org/10.1016/j.neuron.2007.08.017
(2009) Impulsive choice in hippocampal but not orbitofrontal cortex-lesioned rats on a nonspatial decision-making maze task
European Journal of Neuroscience 30:472–484.

https://doi.org/10.1111/j.1460-9568.2009.06837.x
(2000) Synaptic plasticity and memory: an evaluation of the hypothesis
Annual Review of Neuroscience 23:649–711.

https://doi.org/10.1146/annurev.neuro.23.1.649
1. McHugh TJ
2. Blum KI
3. Tsien JZ
4. Tonegawa S
5. Wilson MA
(1996) Impaired hippocampal representation of space in CA1-specific NMDAR1 knockout mice
Cell 87:1339–1349.

https://doi.org/10.1016/S0092-8674(00)81828-0
(2004) Amygdala and ventral Hippocampus contribute differentially to mechanisms of fear and anxiety
Behavioral Neuroscience 118:63–78.

https://doi.org/10.1037/0735-7044.118.1.63
(2008) A role for dorsal and ventral Hippocampus in inter-temporal choice cost-benefit decision making
Behavioral Neuroscience 122:1–8.

https://doi.org/10.1037/0735-7044.122.1.1
1. Middleton SJ
2. McHugh TJ
(2016) Silencing CA3 disrupts temporal coding in the CA1 ensemble
Nature Neuroscience 19:945–951.

https://doi.org/10.1038/nn.4311
1. Murty VP
2. Adcock RA
(2014) Enriched encoding: reward motivation organizes cortical networks for hippocampal detection of unexpected events
Cerebral Cortex 24:2160–2168.

https://doi.org/10.1093/cercor/bht063
1. O'Keefe J
2. Dostrovsky J
(1971) The hippocampus as a spatial map. Preliminary evidence from unit activity in the freely-moving rat
Brain Research 34:171–175.

https://doi.org/10.1016/0006-8993(71)90358-1
1. Odum AL
2. Madden GJ
3. Badger GJ
4. Bickel WK
(2000) Needle sharing in opioid-dependent outpatients: psychological processes underlying risk
Drug and Alcohol Dependence 60:259–266.

https://doi.org/10.1016/S0376-8716(00)00111-3
(2015) Hippocampal place cells construct reward related sequences through unexplored space
eLife 4:e06063.

https://doi.org/10.7554/eLife.06063
1. Omer DB
2. Maimon SR
3. Las L
4. Ulanovsky N
(2018) Social place-cells in the bat Hippocampus
Science 359:218–224.

https://doi.org/10.1126/science.aao3474
(2008) Internally generated cell assembly sequences in the rat Hippocampus
Science 321:1322–1327.

https://doi.org/10.1126/science.1159775
1. Peters J
2. Büchel C
(2010) Episodic future thinking reduces reward delay discounting through an enhancement of prefrontal-mediotemporal interactions
Neuron 66:138–148.

https://doi.org/10.1016/j.neuron.2010.03.026
1. Peters J
2. Büchel C
(2011) The neural mechanisms of inter-temporal decision-making: understanding variability
Trends in Cognitive Sciences 15:227–239.

https://doi.org/10.1016/j.tics.2011.03.002
1. Pfeiffer BE
2. Foster DJ
(2013) Hippocampal place-cell sequences depict future paths to remembered goals
Nature 497:74–79.

https://doi.org/10.1038/nature12112
1. Reber TP
2. Bausch M
3. Mackay S
4. Boström J
5. Elger CE
6. Mormann F
(2019) Representation of abstract semantic knowledge in populations of human single neurons in the medial temporal lobe
PLOS Biology 17:e3000290.

https://doi.org/10.1371/journal.pbio.3000290
1. Richmond MA
2. Yee BK
3. Pouzet B
4. Veenman L
5. Rawlins JN
6. Feldon J
7. Bannerman DM
(1999) Dissociating context and space within the Hippocampus: effects of complete, dorsal, and ventral excitotoxic hippocampal lesions on conditioned freezing and spatial learning
Behavioral Neuroscience 113:1189–1203.

https://doi.org/10.1037/0735-7044.113.6.1189
(2007) Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards
Nature Neuroscience 10:1615–1624.

https://doi.org/10.1038/nn2013
1. Rolls ET
2. Xiang JZ
(2005) Reward-spatial view representations and learning in the primate hippocampus
Journal of Neuroscience 25:6167–6174.

https://doi.org/10.1523/JNEUROSCI.1481-05.2005
(2001) CA1-specific N-methyl-D-aspartate receptor knockout mice are deficient in solving a nonspatial transverse patterning task
PNAS 98:3543–3548.

https://doi.org/10.1073/pnas.041620798
1. Salz DM
2. Tiganj Z
3. Khasnabish S
4. Kohley A
5. Sheehan D
6. Howard MW
7. Eichenbaum H
(2016) Time cells in hippocampal area CA3
Journal of Neuroscience 36:7476–7484.

https://doi.org/10.1523/JNEUROSCI.0087-16.2016
1. Scoville WB
2. Milner B
(1957) Loss of recent memory after bilateral hippocampal lesions
Journal of Neurology, Neurosurgery & Psychiatry 20:11–21.

https://doi.org/10.1136/jnnp.20.1.11
1. Singer AC
2. Frank LM
(2009) Rewarded outcomes enhance reactivation of experience in the hippocampus
Neuron 64:910–921.

https://doi.org/10.1016/j.neuron.2009.11.016
1. Skaggs WE
2. McNaughton BL
(1996) Replay of neuronal firing sequences in rat hippocampus during sleep following spatial experience
Science 271:1870–1873.

https://doi.org/10.1126/science.271.5257.1870
1. Squire LR
(1992) Memory and the hippocampus: a synthesis from findings with rats, monkeys, and humans
Psychological Review 99:195–231.

https://doi.org/10.1037/0033-295X.99.2.195
1. Stavisky SD
2. Willett FR
3. Wilson GH
4. Murphy BA
5. Rezaii P
6. Avansino DT
7. Memberg WD
8. Miller JP
9. Kirsch RF
10. Hochberg LR
11. Ajiboye AB
12. Druckmann S
13. Shenoy KV
14. Henderson JM
(2019) Neural ensemble dynamics in dorsal motor cortex during speech in people with paralysis
eLife 8:e46015.

https://doi.org/10.7554/eLife.46015
(2017) Temporal and rate coding for discrete event sequences in the hippocampus
Neuron 94:1248–1262.

https://doi.org/10.1016/j.neuron.2017.05.024
1. Tryon VL
2. Penner MR
3. Heide SW
4. King HO
5. Larkin J
6. Mizumori SJY
(2017) Hippocampal neural activity reflects the economy of choices during goal-directed navigation
Hippocampus 27:743–758.

https://doi.org/10.1002/hipo.22720
1. Tsien JZ
2. Chen DF
3. Gerber D
4. Tom C
5. Mercer EH
6. Anderson DJ
7. Mayford M
8. Kandel ER
9. Tonegawa S
(1996a) Subregion- and cell type-restricted gene knockout in mouse brain
Cell 87:1317–1326.

https://doi.org/10.1016/S0092-8674(00)81826-7
(1996b) The essential role of hippocampal CA1 NMDA receptor-dependent synaptic plasticity in spatial memory
Cell 87:1327–1338.

https://doi.org/10.1016/S0092-8674(00)81827-9
1. Watanabe T
2. Niki H
(1985) Hippocampal unit activity and delayed response in the monkey
Brain Research 325:241–254.

https://doi.org/10.1016/0006-8993(85)90320-8
1. Weller RE
2. Cook EW
3. Avsar KB
4. Cox JE
(2008) Obese women show greater delay discounting than healthy-weight women
Appetite 51:563–569.

https://doi.org/10.1016/j.appet.2008.04.010
1. Wilson MA
2. McNaughton BL
(1993) Dynamics of the hippocampal ensemble code for space
Science 261:1055–1058.

https://doi.org/10.1126/science.8351520
1. Zhang Q
2. Kobayashi Y
3. Goto H
4. Itohara S
(2018) An automated T-maze based apparatus and protocol for analyzing delay- and effort-based decision making in free moving rodents
Journal of Visualized Experiments 57895.

https://doi.org/10.3791/57895

Article and author information

Author details

Akira Masuda
1. Laboratory for Behavioral Genetics, Center for Brain Science, RIKEN, Wako, Japan
2. Organization for Research Initiatives and Development, Doshisha University, Kyotanabe, Japan
Contribution
Conceptualization, Software, Formal analysis, Funding acquisition, Investigation, Visualization, Methodology

For correspondence
amasuda@mail.doshisha.ac.jp

Competing interests
No competing interests declared

ORCID iD: 0000-0002-8659-6356
Chie Sano

Laboratory for Behavioral Genetics, Center for Brain Science, RIKEN, Wako, Japan

Contribution
Investigation

Competing interests
No competing interests declared
Qi Zhang
1. Laboratory for Behavioral Genetics, Center for Brain Science, RIKEN, Wako, Japan
2. Faculty of Human Science, University of Tsukuba, Tsukuba, Japan
Contribution
Validation, Methodology

Competing interests
No competing interests declared
Hiromichi Goto

Laboratory for Behavioral Genetics, Center for Brain Science, RIKEN, Wako, Japan

Contribution
Resources, Validation, Methodology

Competing interests
No competing interests declared
Thomas J McHugh

Laboratory for Circuit and Behavioral Physiology, Center for Brain Science, RIKEN, Wako, Japan

Contribution
Conceptualization, Resources, Supervision, Methodology

Competing interests
No competing interests declared
Shigeyoshi Fujisawa

Laboratory for Systems Neurophysiology, Center for Brain Science, RIKEN, Wako, Japan

Contribution
Conceptualization, Resources, Software, Supervision, Funding acquisition

Competing interests
No competing interests declared
Shigeyoshi Itohara

Laboratory for Behavioral Genetics, Center for Brain Science, RIKEN, Wako, Japan

Contribution
Conceptualization, Resources, Data curation, Supervision, Funding acquisition, Validation, Methodology

For correspondence
shigeyoshi.itohara@riken.jp

Competing interests
No competing interests declared

Funding

Japan Society for the Promotion of Science (16K15196)

Akira Masuda

Japan Agency for Medical Research and Development (Brain/MINDS)

Shigeyoshi Fujisawa

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We thank Drs Steven Middleton, Roman Boehringer, and Chinnakkaruppan Adaikkan for help building tetrodes, Dr Charles Yokoyama for valuable comments, and Dr Susumu Tonegawa for providing us with NR1 cKO mice. Funding was provided by a Grant-in-Aid for Exploratory Research (JSPS KAKENHI Grant Number 16K15196) and by the ‘Brain/MINDS’ program of the Japan Agency for Medical Research and Development (AMED).

Ethics

Animal experimentation: This study was performed in strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health. The study was approved by the Institutional Animal Care and Use Committee of the RIKEN Institute in Wako (approval number H27-2-239(6)), in conformity with Article 24 of the RIKEN regulations for animal experiments. All surgery was performed under isoflurane anesthesia, and every effort was made to minimize suffering.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.