Causal contribution and dynamical encoding in the striatum during evidence accumulation

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

A broad range of decision-making processes involve gradual accumulation of evidence over time, but the neural circuits responsible for this computation are not yet established. Recent data indicate that cortical regions that are prominently associated with accumulating evidence, such as the posterior parietal cortex and the frontal orienting fields, may not be directly involved in this computation. Which, then, are the regions involved? Regions that are directly involved in evidence accumulation should directly influence the accumulation-based decision-making behavior, have a graded neural encoding of accumulated evidence and contribute throughout the accumulation process. Here, we investigated the role of the anterior dorsal striatum (ADS) in a rodent auditory evidence accumulation task using a combination of behavioral, pharmacological, optogenetic, electrophysiological and computational approaches. We find that the ADS is the first brain region known to satisfy the three criteria. Thus, the ADS may be the first identified node in the network responsible for evidence accumulation.

https://doi.org/10.7554/eLife.34929.001

Introduction

All behaving animals must interpret sensory information arriving from the environment and use that information to select future actions. How the nervous system solves this problem has been a longstanding question in neuroscience. Multiple studies across a wide range of behavioral tasks and model systems, including humans (Hunt et al., 2012; Krajbich et al., 2012; Ratcliff et al., 2015), non-human primates (Gold and Shadlen, 2007; Huk and Shadlen, 2005; Shadlen and Newsome, 1996) and rodents (Brunton et al., 2013; Carandini and Churchland, 2013; Erlich et al., 2015; Hanks et al., 2015; Raposo et al., 2012; Sanders and Kepecs, 2012) have proposed a framework through which neural circuits gradually accumulate sensory evidence to guide decisions. Yet, despite the observation of neural correlates of evidence accumulation in several brain regions (Ding and Gold, 2010; Gold and Shadlen, 2007; Hanks et al., 2015; Ratcliff et al., 2007; Shadlen and Newsome, 1996), a major challenge in this line of research has been that the neural circuits that are causally responsible for evidence accumulation have not yet been determined. Two of the cortical regions that are most prominently associated with evidence accumulation, namely the posterior parietal cortex (LIP; Huk and Shadlen, 2005; Kira et al., 2015; Roitman and Shadlen, 2002; Shadlen and Newsome, 1996) and the frontal eye fields in primates (FEF; Ding and Gold, 2012a; Gold and Shadlen, 2000; Mante et al., 2013), together with its probable rodent analogue, the frontal orienting fields (FOF; Erlich et al., 2011), have been the focus of studies recently. Surprisingly, these studies have indicated that neither region is central to the computation of gradually accumulating evidence (Erlich et al., 2015; Hanks et al., 2015; Katz et al., 2016).

The anterior dorsal striatum (ADS) serves as an intriguing alternative candidate (Ding and Gold, 2013), due in part to its unique anatomical positioning as a convergence hub for multiple brain regions (Cheatwood et al., 2003; McGeorge and Faull, 1989) where neural signatures of evidence accumulation have been observed (such as the PPC and FEF/FOF; Gold and Shadlen, 2007; Cheatwood et al., 2003; Ding and Gold, 2013; McGeorge and Faull, 1989). The ADS is thus ideally positioned to participate in evidence accumulation as part of its established role in action selection (Bogacz and Gurney, 2007; Graybiel, 2008; Hikosaka et al., 2014; Jin and Costa, 2010; Nelson and Kreitzer, 2014; Redgrave et al., 2010). Modeling work has also suggested that the ADS may participate in post-accumulation decision commitment (Lo and Wang, 2006).

The auditory input to a different striatal subregion, the posterior ‘auditory’ striatum, has been shown to be critical for auditory discriminations, leading to the suggestion that cortical projections into the striatum may provide a general mechanism for the control of motor decisions (Xiong et al., 2015; Znamenskiy and Zador, 2013). Specifically with regard to evidence accumulation, work in primates found neural correlates of evidence accumulation in the ADS (Ding 2015; Ding and Gold, 2010; Ding and Gold, 2012b; Ding and Gold, 2013; Ding, 2015, Lo and Wang, 2006), and revealed that electrical microstimulation of the ADS impacts behavior that is based on accumulation of evidence (Ding and Gold, 2012a). These data led to the proposal that the ADS may contribute to the computations specifically involved in evidence accumulation. Yet three critical questions to test this proposal have been left unanswered.

First, is the ADS required for unimpaired accumulation-based decision making? To date, there have been no recorded inactivations of the dorsal striatum during the accumulation of evidence. Inactivations are important probes of whether a region plays a central causal role for a cognitive variable of interest (Newsome and Paré, 1988; Erlich et al., 2015; Katz et al., 2016).

Second, do neurons in the dorsal striatum encode sensory information in a way that is sufficient to be involved directly in the graded accumulation process? The correlates of evidence accumulation reported to date in striatum have been of two types: either firing rates that, when averaged over trials, ramp upwards with a slope of the ramp that increases as the evidence strengthens (Ding and Gold, 2010), or estimates of the temporal dynamics of firing rate variance across trials (Ding, 2015). However, the trial averages do not distinguish between graded evidence encodings and other encodings that on a single-trial basis do not represent gradually accumulating evidence, such as sharp coordinated steps in firing for which the timing of the step varies across trials (Hanks et al., 2015; Latimer et al., 2015). Furthermore, the variance estimates have not yet produced clearly definitive conclusions, suggesting that the ADS is only partly involved in graded accumulation (Ding, 2015).

We recently developed a complementary approach, distinct from the two earlier methods, to assess evidence accumulation encoding. This most recent approach estimates ‘tuning curves,’ that is, direct descriptions of the relationship between recorded neural firing rates and the graded value of the evidence accumulator, and can discriminate between different encodings that otherwise appear indistinguishable (Hanks et al., 2015). Here, we apply this approach to electrophysiological recordings from the ADS.

Third, does the dorsal striatum play a causal role throughout the period of accumulation? One of the key aspects of interest in gradual evidence accumulation is its relatively long timescale, as it occurs over a period of hundreds of milliseconds or more (thought to be a potential model of mental deliberation [Gold and Shadlen, 2007]). If the striatum is part of the gradual accumulation process that drives behavior, perturbing it at any timepoint during that accumulation process should affect behavior. This feature is thus an essential prerequisite for a component of the accumulator. However, no temporally specific perturbations of the ADS during the accumulation of evidence have yet been carried out to probe for this feature. Indeed, no brain region studied during an accumulation of evidence behavior has yet been reported to possess this feature.

Here, using a combination of behavioral, pharmacological, optogenetic, electrophysiological and computational approaches, we address these three fundamental questions. The results provide evidence supporting a central causal role for the anterior dorsal striatum in evidence accumulation.

Results

We trained rats on a previously developed decision-making task (Brunton et al., 2013) in which subjects accumulate auditory evidence over many hundreds of milliseconds to inform a binary left/right choice (Figure 1a). In each trial, rats kept their nose in a central port during the presentation of two simultaneous trains of randomly timed auditory clicks, one played from a speaker to their left and the other from a speaker to their right. At the end of the auditory stimulus, the rat’s task was to decide which side had played the greater total number of clicks. Consistent with previous studies using this task, analysis of our rats’ behavior indicated that they gradually accumulated auditory evidence over the entire trial, and used that accumulated evidence to drive a categorical choice (Figure 1—figure supplement 1; Supplementary file 1).

Figure 1 with 2 supplements see all

Download asset Open asset

Dorsal anterior striatum is required for unimpaired performance on the Poisson-clicks evidence accumulation task.

(a) Sequence of events in each trial of the rat auditory Poisson-clicks task. From left to right: after light onset above the center port, rats 'fixate' their position by placing their nose inside the center port. During nose-fixation, two different trains of randomly timed auditory clicks are played concurrently from the left and right speakers. Upon termination of the sound trains, the light above the center port turns off and the rat needs to make a choice, poking into the left or right port to indicate if more clicks were played on the left or right sides, respectively. (b) Unilateral infusion of muscimol into the striatum results in a significant ipsilateral bias on accumulation trials. Purple and cyan psychometric curves show data on days of right and left striatal infusions (n left sessions = 29; n right sessions = 29), respectively. Black psychometric curve shows data from control sessions that occurred one day before infusion sessions (n = 58). (c) Bilateral infusion of muscimol into the striatum results in significant impairment on accumulation trials. The blue psychometric curve is from bilateral infusion sessions (n = 26) and the black psychometric curve is from control sessions that occurred one day before bilateral infusion sessions (n = 26). Data are shown as mean ± S.E.M.

https://doi.org/10.7554/eLife.34929.002

We began to assess the role of the anterior striatum in the accumulation task using reversible pharmacological inactivation with muscimol (Materials and methods). The anterior striatial region targeted in this study receives convergent inputs from the PPC and the FOF, brain regions previously reported to contain neural correlates of evidence accumulation but later shown to not be central to the accumulation process itself (Erlich et al., 2015; Hanks et al., 2015; Katz et al., 2016). Unilateral inactivation of the ADS biased rats to make more ipsilateral choices relative to controls (Figure 1b; bias for right side inactivation = 19.2 ± 4.4%, p<0.01; bias for left side inactivation = 18.6 ± 3.3%, p<0.01). This effect was not a gross motor bias, but was instead specific for accumulation trials, because no significant bias was caused on interleaved motor control trials in which the rats had to make a similar left/right motor response, but were cued by a simple visual stimulus (Figure 1—figure supplement 2; p>0.4 for both left- and right-side trials). Bilateral pharmacological inactivation caused a substantial impairment in performance for accumulation trials (Figure 1c; impairment = 12.6 ± 3.2%, p<0.01). This impairment was again specific for accumulation trials, with no significant impairment in motor control trials where the decision was not based on the accumulation of evidence over time (Figure 1—figure supplement 2; p>0.6 for both left- and right-side trials).

Psychometric curves such as those shown in Figure 1b,c group together trials based on the click difference accrued by the end of the stimulus stream and treat all trials within each group as if they were the same. But in our clicks task, we have far more information available because the precise temporal pattern of each individual trial’s click trains is known. We have previously used this information, together with a model that takes into account those known individual click times, to quantify our subjects’ behavior in terms of multiple parameters governing the dynamics of a drift-diffusion decision process (Ratcliff and McKoon, 2008). We use an enhanced model of the drift-diffusion process so that we can obtain trial-by-trial, moment-by-moment estimates of accumulating evidence (Brunton et al., 2013). This model converts the incoming stream of each trial’s discrete left and right click stimuli into a scalar quantity a(t) that represents the gradually accumulating difference between the two click streams; each right click increases the value of a(t), whereas each left click decreases a(t). Eight parameters, quantifing sensory and accumulator noise, the leakiness or instability of the accumulation process, a sticky accumulation bound, sensory depression or facilitation, side bias (þ), and lapse rate, govern the dynamics of how a(t) evolves in response to the sensory evidence pulses, and how they are then turned into a binary decision. At the end of each trial’s stimulus, the accumulator a(t_end), together with the parameter þ, drives choices: if a(t_end) > þ, the model prescribes ‘choose right’, whereas if a(t_end) < þ, the model prescribes ‘choose left’. All of the parameters are estimated by fitting the model to the rat’s behavior (Materials and methods).

The original model of Brunton et al. (2013) was not constructed to explain different types of side biases, so it had only a single parameter (þ) that could account for such lateralized effects. By adding three more parameters that could cause different types of side biases, fitting the extended model to behavioral data following unilateral inactivations, and asking which parameters are most affected relative to control trials, we can better estimate which particular aspect of the behavior was impacted by unilateral inactivations. The three side bias parameters that we consider, in addition to þ, are: asymmetric sensory input gain, asymmetric sensory input noise, and asymmetric lapse rates (Materials and methods). Considering all four of these side bias parameters in the case of unilateral inactivations of the FOF, we previously concluded that FOF inactivations were consistent with perturbing a process that was not part of evidence accumulation directly, but was instead downstream of the accumulation process and therefore followed it (Erlich et al., 2015; Piet et al., 2017).

Here, we improve upon this analysis and apply it to our striatum inactivation data. At the time of the Erlich et al. (2015) study, the complexity of determining the derivative of the model with respect to all 11 of its parameters precluded us from fitting all 11 parameters simultaneously. We instead performed exhaustive scans in the space of two parameters at a time while the other nine parameters were fixed to their control (no inactivation) values (e.g., Figure 4 in Erlich et al. (2015)). Since that time, however, algorithmic differentiation packages, which greatly facilitate computing the derivative of arbitrary differentiable models embodied in computer code, have become widely available (Abadi et al., 2016; Baydin et al., 2015; Revels et al., 2016; Al-Rfou et al., 2016). Using the ForwardDiff package of the language Julia (Revels et al., 2016) to obtain automatically the derivative with respect to all 11 parameters in the model of Erlich et al. (2015), we constructed a package that can efficiently and simultaneously fit all 11 parameters in the model. We are publishing this package in open source form, as part of the contribution of the current manuscript (code available at https://github.com/misun6312/PBupsModel.jl [Yoon and Brody, 2018]; copy archived at https://github.com/elifesciences/PBupsModel.jl). We validated this approach and our previous FOF analysis by fitting all 11 parameters simultaneously to our previous FOF unilateral inactivation data. This new analysis (Figure 2—figure supplement 2, Supplementary file 4) confirmed the conclusions about the FOF found by Erlich et al. (2015). Following this conclusion, we next turned to performing the same analysis on the inactivation data collected in the current study for the anterior striatum.

For simplicity of presentation, below, we illustrate some of the results of the model fits in terms of psychometric plots (i.e., graphing the probability of a decision to one side as a function of total #R – #L clicks, averaged over trials), but we note again that our model and its fits are sensitive to the detailed timing of the click stimuli in each individual trial, which is information that is obscured in the trial-averaged psychometric plot. As a result, the model and its fits can resolve the effects of different parameters that are indistinguishable in a psychometric plot (see also illustrations of this point in Supplementary Figure S4 in Brunton et al. (2013)). For example, a leaky (i.e., forgetful) accumulator and an increased overall lapse rate both predict an overall performance impairment. But the leaky accumulator impairment will be greater for trials that by chance had their clicks earlier rather than later, whereas the lapse impairment will be independent of the timing of each trial’s clicks. A model that is sensitive to the timing of each trial’s clicks can thus distinguish the two. Similarly, an asymmetric sensory input gain and an asymmetric lapse rate both predict a side bias. But the magnitude of the bias due to an asymmetric input gain will scale with the number of clicks presented on each trial. This contrasts with the bias that would be induced by an asymmetric lapse rate, which would be independent of the number of clicks presented. This again allows the effects of the two parameters to be distinguished. In sum, trial-by-trial and detailed click-timing effects, although not visible in the trial-averaged psychometric plot, impact the likelihood of the data under the model, and thus impact the model fits and the likelihood landscapes (such as those shown in Figure 2b and d below). When two parameters trade off in a manner that impairs our ability to distinguish them, this is revealed in the likelihood landscape as a ridge of high likelihood. The shape of the ridge quantifies the extent and scaling of the parameter trade-off (Materials and methods and for example Figure 2D in Brunton et al. (2013)).

Figure 2 with 4 supplements see all

Download asset Open asset

Fits of the model of Brunton et al. (2013) and Erlich et al. (2015) to data from sessions following muscimol inactivation of the striatum.

(a) Psychometric curves for control and unilateral inactivation data. Left and right inactivations were collapsed together. Orange data points are from sessions following unilateral infusions of muscimol. The black and orange lines are the psychometric curves predicted from the model fit to the control and inactivation data, respectively. (b) Normalized likelihood of the data given the model, shown as a function of the parameters for which best-fit values for inactivation data were significantly different from best-fit values for control data. Magenta shows the best-fit values for control datas. The black cross shows the best-fit values for inactivation data. The color scale indicates the percentage of probability mass; the region of probability mass >95 indicates the 95% confidence region. Left: sensory noise for the side contralateral to the infusion versus accumulator noise. Although there is a trade-off between accumulator and sensory noise, the weighted sum of the accumulator and biased sensory noise has a best-fit value following unilateral inactivations that is significantly greater than its control best-fit value. Middle: leak/instability parameter versus accumulator noise. Right: summed sensory plus accumulator noise versus lapse rate, which shows that in fact the lapse rate *κ_C* does not trade off with the summed noise. (**c,d**) As in panels (**a,b**) but for bilateral striatum inactivation data, and for a model where the sensory noise is constrained to be the same for both sides of the brain, so there is only one sensory noise parameter. Here the tradeoff between sensory noise and accumulator noise is large enough that we cannot distinguish whether one or both are significantly different from their control values, but there is nevertheless a significant increase in their sum.

https://doi.org/10.7554/eLife.34929.005

Simultaneously fitting all parameters of the enhanced 11-parameter model to data from sessions with unilateral muscimol inactivation of the anterior dorsal striatum revealed that two parameters differed enough from their control values to produce substantial changes in behavior (Supplementary file 2). First, the side bias in the lapse rates (the contralateral lapse rate parameter κ_C and the ipsilateral lapse rate parameter κ_I, which are unitless parameters in terms of fraction of trials; Materials and methods) significantly increased in favor of ipsilateral choices (κ_I: from 0.29, 95% C.I. = [0.18 0.43] in control sessions to 0.00, 95% C.I. = [0.00 0.14] for inactivation sessions, κ_C: from 0.20, 95% C.I. = [0.04 0.58] in control sessions to 0.60, 95% C.I. = [0.14 0.93] for inactivation sessions). An effect on lapse rates was also seen after unilateral FOF inactivations, where it was interpreted as an effect on processes subsequent to the accumulator, and not part of it (Erlich et al., 2015). Second, the magnitude of the accumulator and sensory noise parameters, which respectively describe diffusion noise intrinsic to the accumulator and noise associated with the addition of each sensory click, also increased significantly (Figure 2a,b and Figure 2—figure supplement 1). The trade-off between these parameters (Brunton et al., 2013) was large enough that it was impossible to distinguish which of the accumulator noise σ²_a or the ipsilateral and contraletral sensory noise parameters σ²_s,I and σ²_s,C was responsible for the increase. We note that we do not mean to imply that the combination of sensory and accumulator noise is a single, biologically interpretable quantity, but simply that our data cannot distinguish between the different trade-offs between these parameters that fit the data equally well. There was a suggestion that the intrinsic accumulator noise σ²_a specifically increased, with this parameter being significantly greater than zero during inactivation trials whereas it was not distinguishable from zero in control trials (Figure 2b and Figure 2—figure supplement 1), but the difference between control and inactivation σ²_a values was not significant (p<0.15). Lapse rate and noise parameters described distinct effects, and did not trade off with each other (Figure 2—figure supplement 1a, third row, middle column).

For data from bilateral inactivation sessions, the combination of sensory and accumulator noise parameters was again significantly greater than for control sessions (Figure 2c,d; w₁σ²_a + w₂σ²_s: from 31.64 clicks²/sec [95% C.I. = [17.40 55.84]] in control sessions to 117.00 clicks²/sec [95% C.I. = [71.12 156.53]] , where w₁ = 0.92, w₂ = 0.39 during inactivations. We note that noise magnitudes cannot be less than zero, implying that confidence intervals for both σ²_a and σ²_s are bounded by zero).

These fits contrast with those following FOF inactivation (Erlich et al., 2015). In particular, we note that the sensory and accumulator noise parameters were minimally altered after FOF inactivation, whereas ADS inactivation significantly impacted them.

This pharmacological demonstration that the striatum is required for unimpaired decisions that are based on the accumulation of evidence, and the model-based suggestion that the striatum affects properties of the accumulator, led us to explore the detailed neural dynamics that may support its potential causal contribution. To do so, we conducted single-unit recordings from freely behaving subjects engaged in the evidence accumulation task. Consistent with previous work (Graybiel, 2008; Jin and Costa, 2010; Kravitz and Kreitzer, 2012), we found that the neural activity of many striatal neurons was modulated by movement initiation (Figure 3—figure supplement 1a–c). However, we also found that over a third of the recorded neurons significantly modulate their activity in a side-selective manner (p<0.05) during the fixation period many hundreds of milliseconds before the movement initiation reporting the decision (64/173 [37%] of the neurons active during the fixation period [Figure 3—figure supplement 1d–f]). This timing suggests that they may have a role in forming the upcoming decision (go-left or go-right). These neurons were termed as ‘side-selective’ and for each we further defined the neuron’s prefered side as that yielding the largest activation, as done previously by Hanks et al. (2015). Consistent with previous work in primate dorsal striatum (Ding and Gold, 2010), we found that the average responses of these rat striatum neurons ramped upwards for stimuli in the preferred direction (Figure 3), and moreover, that after an initial onset latency, the slope of the ramp was proportional to the stimulus strength (Figure 3; Figure 4a). Importantly, however, a gradual ramping profile is not conclusive evidence for the encoding of gradually accumulating evidence, because such a response profile can also be consistent with other encoding schemes (Ditterich, 2006; Hanks et al., 2015; Latimer et al., 2015) such as step changes in firing rate that occur at different times in different trials (Latimer et al., 2015). Thus, we extended our analysis to include a more direct test in which the influence of single quanta of sensory evidence on the responses of the cells is quantitatively assessed.

Figure 3 with 1 supplement see all

Download asset Open asset

Peri-stimulus time histograms (PSTHs) of example neurons.

PSTHs aligned to stimulus onset are shown for three example striatum neurons. Trials were sorted into four stimulus-strength bins for each neuron. Green traces correspond to the preferred-direction stimuli and red traces to anti-preferred-direction stimuli. Darker colors correspond to stronger stimuli (less difficult) and brighter colors correspond to weaker stimuli (more difficult).

https://doi.org/10.7554/eLife.34929.010

Figure 4 with 1 supplement see all

Download asset Open asset

Graded representation of accumulated evidence in the dorsal striatum.

(a) Responses of pre-movement side-selective striatal neurons during evidence accumulation (mean ± S.E.M.). Trials are grouped by the average strength of sensory evidence with greener and redder colors corresponding to stimuli in the preferred and non-preferred direction of the neurons, respectively. Each group of trials is sorted on the basis of the difficulty of the trials from easy to hard, corresponding to darker and lighter colors, respectively. Note the significant dependence of ramping responses on stimulus strength (n = 64 neurons from three rats). (b) Click-triggered average response ± S.E.M. Note the close correspondence of the average click-triggered population response to a theoretical prediction of a fixed-magnitude and sustained increase in the neurons’ firing rate (see Materials and methods). (c) Firing of striatal neurons aligned to trial onset minus the neural response lag (150 ms; see Materials and methods) grouped on the basis of model-derived accumulator value (colors with ± B correspond to sticky accumulation bounds). Note that this accumulator value to firing rate map is graded and fairly stable over time (n = 64 neurons). (d) The population change in firing rate as a function of accumulator value averaged across time exhibits a graded response.

https://doi.org/10.7554/eLife.34929.012

If indeed temporal integration underlies the ramping activity of the striatal cells, then each single quantum of sensory evidence (an auditory click) should result in a fixed-magnitude and a sustained increase in the neuron’s firing rate (Figure 4b, model) (Hanks et al., 2015; Huk and Shadlen, 2005). We thus estimated the effect of each sensory evidence quantum by computing the click-triggered average response of the side-selective striatal neurons. We found that striatal neurons modulated their activity in close agreement with this theoretical prediction (Figure 4b, data), arguing in favor of a role of this anterior striatal subregion in the behavioral accumulation of evidence process.

We also took advantage of a recently developed method, i.e., direct estimates of firing rates as a function of accumulated evidence, to compute neural tuning curves (Hanks et al., 2015). Model-derived estimates of the moment-by-moment value of the accumulating evidence in each trial are collated with simultaneously recorded firing rates to generate tuning curves for accumulated evidence (see Hanks et al. (2015)), Materials and methods, and the illustration of the method in Figure 4—figure supplement 1). When applying this analysis to the striatal data, we found that the side-selective neurons encoded accumulating evidence in a remarkably graded manner throughout the period of evidence accumulation (Figure 4c,d). This graded encoding was consistent across different neurons in the population of recorded striatal cells (Figure 5). Such a graded representation implies that the striatum carries information about the graded value of accumulated evidence, as would be necessary for a brain structure involved in such a process.

Figure 5

Download asset Open asset

Distribution of tuning curve slopes for individual striatal neurons.

(a) Histogram of the slope of individual neurons obtained from a sigmoidal fit of the relationship between firing rate and accumulator value. The black arrow indicates the median value of the distribution (50th percentile). Red and blue arrows indicate points corresponding to the 20th and 80th percentile marks, respectively. (b) Example tuning curves shown for 20th, 50th, and 80th (colored as in [a]) percentile neurons. Graded encodings of accumulated evidence are exhibited for all of these neurons.

https://doi.org/10.7554/eLife.34929.014

Our pharmacological methods address the questions of whether the anterior dorsal striatum is involved in the process of accumulation of evidence, and our electrophysiological and computational methods address how the anterior dorsal striatum represents the accumulation of sensory evidence. However, neither directly addresses the question of when the anterior dorsal striatum is involved. This question is critical and has proven to be pivotal in assessing the involvement of a brain region in the evidence accumulation process. For example, some brain regions can be required for decisions that are based on accumulation of evidence, yet contribute at times suggesting that they are instead required for processes that are subsequent to the gradual accumulation of evidence itself (Erlich et al., 2015; Hanks et al., 2015). No region to date has been reported to be required at points of time that fully coincide with the evidence accumulation period.

To delineate the precise timing of the anterior dorsal striatum’s contribution, we used optogenetic inactivation, mediated by halorhodopsin (eNpHR3.0), to unilaterally and transiently inactivate this region during the Poisson Clicks task. We expressed eNpHR3.0 using viral delivery methods (Figure 6a; Materials and methods). Acute neural recordings in our experimental rats verified that we could indeed transiently silence neural activity in the striatum with fine temporal precision using the delivery of green light (Figure 6b). We began with full-trial unilateral optogenetic inactivation and found, in agreement with the pharmacological inactivation described above, that optogenetic manipulation resulted in more ipsilateral choice biases relative to control trials, which in this case were randomly interleaved with the inactivation trials (Figure 6c; bias = 9.0 ± 2.3%, p<0.01). These effects were consistent across rats (Figure 6d). Control rats whose striatum was injected with the same virus expressing YFP alone did not show a behavioral bias (bias = 0.1 ± 1.8%, p=0.89). Next, to resolve directly when the striatum contributes to the auditory accumulation of evidence task, we transiently inactivated it unilaterally during one of four different 500 ms time periods during the task: (i) the delay period immediately preceding stimulus onset (‘pre-accumulation’), (ii) the first half of a 1 s sensory stimulus (‘first half’), (iii) the second half of a 1 s sensory stimulus (‘second half’), or (iv) the movement period (‘post-choice’). In contrast to similar inactivation assays of the cortical FOF, which have no effect during the early parts of the accumulation period (Hanks et al., 2015), we found that transient optogenetic inactivation of the anterior dorsal striatum during both the first half and second half of the accumulation caused a significant bias for the ipsilateral choices, with a similar magnitude of effect in these two periods (first half bias = 10.4 ± 4.0, p<0.01; second half bias = 12.9 ± 3.7%, p<0.01; difference = 2.5 ± 2.8, p=0.2; Figure 6e; the first-half effect in the striatum is significantly greater than that in the FOF, p<0.01, Figure 7). Remarkably, the effect in striatum was limited to the stimulus presentation period and we found no significant effect of optogenetic inactivation during the pre-accumulation or post-choice periods (pre-accumulation bias = 0.4 ± 5.4%, p=0.42; post-choice bias = 0.9 ± 5.2%, p=0.38; Figure 6e). These results are consistent with the idea that the anterior dorsal striatum plays a direct causal role throughout the entire evidence accumulation process.

Figure 6

Download asset Open asset

Optogenetic inactivation reveals that dorsal striatal activity causally contributes to decision formation throughout the accumulation process but not before nor after.

(a) Coronal section of the left hemisphere showing the expression of eYFP-eNpHR3.0 in the left dorsal striatum. Optical fiber localization and 750 μm estimated inactivation radius are indicated by the red circle. (b) Raster plot (bottom) and peri-stimulus time histogram (top) showing the effectiveness in silencing of local striatal activity in response to delivery of green light (indicated by the green bar at the top). (c) Unilateral full-trial optical inactivation of the striatum results in an ipsilateral bias in accumulation trials. The purple and cyan psychometric curves show data for right and left striatal inactivation, respectively, whereas the black psychometric curve shows data from control trials that occurred on the same days (n = 8 rats). (d) Scatter plot indicated the mean ipsilateral bias for each individual rat. (e) Bottom: behavioral bias caused by 500 ms inactivation during the pre-stimulus delay period (red), the first half of the sensory stimulus (yellow), the second half of the stimulus (green) and upon initiation of movement (blue). Top: task structure. Note the significant effect (indicated by an asterisk) only during evidence accumulation but not prior to the presentation of sensory stimuli nor after.

https://doi.org/10.7554/eLife.34929.015

Figure 7

Download asset Open asset

Comparison of early stimulus period optogenetic inactivation effects in the striatum and frontal orienting field (FOF).

Optogenetic inactivation of the anterior dorsal striatum during the first half of the 1 s stimulus presentation period produced a significantly larger effect than the same manipulation of the FOF (p<0.01), with the latter data coming from a previous report. For this analysis, individual trials were resampled with replacement from both data sets across 1000 iterations, and the difference in inactivation effect was calculated for each iteration to provide a nonparametric statistical comparison. As reported above, the first-half anterior dorsal striatum effect itself is significant, and as reported previously, the first-half FOF effect is not significant, but a direct comparison as described here is still necessary to establish a significant difference.

https://doi.org/10.7554/eLife.34929.016

Discussion

Studies carried out over more than two decades have attempted to elucidate neural circuits that underlie the accumulation of evidence over time (starting with Shadlen and Newsome (1996); see Gold and Shadlen (2007), Carandini and Churchland (2013), Brody and Hanks (2016), and Hanks and Summerfield (2017) for reviews; see also Carandini and Churchland (2013), Gold and Shadlen (2007), Krajbich et al. (2012), Shadlen and Newsome (1996). Ding and colleagues have shown that microstimulation of the ADS perturbs decisions based on the accumulation of evidence (Ding and Gold, 2010 , 2012a; Ding, 2015; but see Histed et al., 2009 and Tehovnik and Slocum, 2013) for discussion as to whether or not microstimulation primarily affects axon terminals, which would add complications for its interpretation when localizing neural function). Despite these many years of studies, no brain region has previously been identified as: first, being required for unimpaired accumulation-based decision-making behavior; second, having the graded neural encoding required for direct involvement in computing the graded, gradually evolving, value of the accumulating evidence; and third, making a causal contribution throughout times that fully coincide with the accumulation process. By demonstrating that the anterior dorsal striatum satisfies all three of these criteria, our work suggests that the anterior dorsal striatum is the first identifiable node in the neural circuit causally responsible for computing evidence accumulation. The anterior dorsal striatum is well positioned anatomically to participate in evidence accumulation as it receives diverse convergent anatomical input from multiple cortical areas (Cheatwood et al., 2003; McGeorge and Faull, 1989) and it is connected via recurrent loops with cortical and subcortical areas that are widely believed to play a role in action selection (Ding and Gold, 2013). Whether the anterior dorsal striatum possesses a unique role in evidence accumulation, or whether it is an important node of a more extended network of brain regions that operate in coordination to mediate evidence accumulation, remains to be resolved. Corticostriatal loops are organized as distinct parallel circuits (Alexander et al., 1986; Kim and Hikosaka, 2015); future studies dissecting the contribution of different loops will be important for resolving this major question.

Our results, together with those of Ding and colleagues (Ding and Gold, 2010; Ding and Gold, 2012a; Ding, 2015), suggest that the striatum may be directly involved in a more expansive set of computations, traditionally considered to be more cognitive in nature, than the already well-established functions of the dorsal striatum in action selection, response initiation, evaluation of reward uncertainty, and habit formation (Ding and Gold, 2010; Graybiel, 2008; Hikosaka et al., 2014; Jin and Costa, 2010; Kravitz and Kreitzer, 2012). It will be important to better understand how the striatal involvement in computing accumulation of evidence, as identified in this study, may contribute to those previously established functions. Extensions to our paradigm (for example, free response protocols or more extended trial durations) are likely to be useful for reconciling the functions indciated by results with the other functions of the striatum. The computations involved in evidence accumulation may perhaps provide an efficient mechanism for extracting important pieces of information from the environment in the service of other roles of the striatum.

By identifying model parameters affected by the inactivations, our model fits suggest specific aspects of the evidence accumulation computation that could be prioritized as potentially particularly strongly related to the ADS’s role in the computation. The model fits to unilateral pharmacological inactivation data found that, similar to unilateral inactivations of the FOF, the side bias in lapse rates was increased by the inactivations. But in contrast to what occurs in the FOF, noise parameters, including the accumulator’s intrinsic noise, were also substantially increased after striatal inactivation (Figure 2a,b). Model fits to bilateral anterior dorsal striatum inactivation data found that the sum of sensory and accumulator noise magnitude parameters was significantly increased by striatal inactivation (Figure 2c,d). A parsimonious account suggests that the main noise parameter that is affected may perhaps be the magnitude of the noise in the evidence accumulator. This would be consistent with the idea, supported by our electrophysiological and optogenetic data, that the striatum plays a role in the accumulation process. The lack of a significant effect on other parameters should be treated with caution: it remains possible that future studies with greater statistical power could discern an effect of striatal inactivation on some of these other parameters. Nevertheless, even while we emphasize that we do not take the modeling results on their own as conclusive, they do suggest that accumulator noise is a principal parameter of interest. It is conceivable that bilateral striatal inactivation increases accumulator noise by destabilizing the accumulator’s representation without biasing it, but a circuit model hypothesis to explain precisely how the striatum might affect the accumulator noise level remains to be developed. Another important direction for future studies will be the development of models with temporally specific parameters that could be used to model the effects of temporally specific optogenetic inactivation appropriately.

Independently of whether the anterior dorsal striatum operates alone or as part of a broader circuit for computing gradual evidence accumulation, and independently of the precise nature of its contribution to the evidence accumulation computation, the data reported here provide a critical foothold towards delineating the relevant causal circuit: for example, the anterior dorsal striatum’s major inputs and outputs become important candidate regions to be examined for a potential role in the process. The possibility that the causal circuit for computing evidence accumulation may be delineated in the near future suggests that we will soon be able to elucidate the circuit and cellular mechanisms that support evidence accumulation, a computation that is crucial for decision-making behavior in a wide range of species, including humans.

Share this article

Cite this article

Dorsal anterior striatum is required for unimpaired performance on the Poisson-clicks evidence accumulation task.

Fits of the model of Brunton et al. (2013) and Erlich et al. (2015) to data from sessions following muscimol inactivation of the striatum.

Peri-stimulus time histograms (PSTHs) of example neurons.

Graded representation of accumulated evidence in the dorsal striatum.

Distribution of tuning curve slopes for individual striatal neurons.

Optogenetic inactivation reveals that dorsal striatal activity causally contributes to decision formation throughout the accumulation process but not before nor after.

Comparison of early stimulus period optogenetic inactivation effects in the striatum and frontal orienting field (FOF).

Author details

Michael M Yartsev

Contribution

Contributed equally with

For correspondence

Competing interests

Timothy D Hanks

Contribution

Contributed equally with

For correspondence

Competing interests

Alice Misun Yoon

Contribution

Competing interests

Carlos D Brody

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism