Mesencephalic representations of recent experience influence decision making

Abstract
eLife digest
Introduction
Results
Discussion
Materials and methods
References
Article and author information
Metrics

Abstract

Decisions are influenced by recent experience, but the neural basis for this phenomenon is not well understood. Here, we address this question in the context of action selection. We focused on activity in the pedunculopontine tegmental nucleus (PPTg), a mesencephalic region that provides input to several nuclei in the action selection network, in well-trained mice selecting actions based on sensory cues and recent trial history. We found that, at the time of action selection, the activity of many PPTg neurons reflected the action on the previous trial and its outcome, and the strength of this activity predicted the upcoming choice. Further, inactivating the PPTg predictably decreased the influence of recent experience on action selection. These findings suggest that PPTg input to downstream motor regions, where it can be integrated with other relevant information, provides a simple mechanism for incorporating recent experience into the computations underlying action selection.

https://doi.org/10.7554/eLife.16572.001

eLife digest

The decisions we make are influenced by recent experience, yet it is not known how this experience is represented in the brain. For decisions about when, where and how to move, researchers have hypothesized that recent experience might influence activity in a region of the brainstem – the central trunk of the brain – that is known to be involved in movement.

When deciding when, where and how to move, several areas of the brain are involved in selecting the optimal action. Recent studies suggest that groups of neurons known as locomotor brainstem nuclei may also contribute to making decisions about movements.

Thompson et al. investigated whether a brainstem locomotor area called the pedunculopontine tegmental (PPTg) nucleus in mice might contribute to decision making rather than just conveying the selected response. The mice were trained to recognize particular odors and move to either the left or right to collect a food reward. While the mice were selecting an action, the activity of neurons in the PPTg nucleus reflected the action they had chosen on a previous experience and the outcome of that choice (i.e. whether they received a reward). These representations of past experiences influenced the upcoming decision the mice were about to take.

The findings of Thompson et al. suggest that the PPTg nucleus might play a critical role in the process of selecting the optimal action. Future work will examine what kinds of information about the environment or recent experience have the biggest effect on the activity of this region.

https://doi.org/10.7554/eLife.16572.002

Introduction

Selecting actions in a dynamic environment should take into account both sensory input and internally-generated estimates of action value. Integrating these sources of information is well-described by a Bayesian framework in which estimates of action value are continually updated by incoming sensory information in order to select the most valuable action (Körding and Wolpert, 2006; Gold and Shadlen, 2007; Kim and Basso, 2010). This updating of action values, based on experiencing the outcomes associated with past actions, is thought to be mediated primarily by striatal circuits (Lau and Glimcher, 2007, 2008; Histed et al., 2009; Tai et al., 2012; Kim et al., 2013). These action values can be maintained in striatal activity and ultimately used to bias activity in downstream motor centers such that the most valuable actions are more likely to be selected (Hikosaka et al., 2006, 2014). This system is capable of flexibly encoding action value estimates in arbitrarily complex and dynamic contexts over a range of time-scales. However, in environments in which only the recent past is relevant to action value, which is often the case in the real world, a simpler complementary mechanism would be to maintain short-term representations of the most recent actions and their outcomes that directly modulate the action selection process.

We studied this possibility by examining behavior and neural activity in well-trained mice performing a task requiring them to select an action based on the dominant component of an odor mixture (Uchida and Mainen, 2003). We have previously shown that the superior colliculus (SC) plays a critical role in selecting the action – a leftward or rightward orienting movement – required by this task (Felsen and Mainen, 2008, 2012; Stubblefield et al., 2013), consistent with its role in selecting orienting movements in other species (Glimcher and Sparks, 1992; Horwitz and Newsome, 2001; Bergeron et al., 2003; Krauzlis et al., 2004; Song et al., 2011; Wolf et al., 2015). In this study, we asked whether the pedunculopontine tegmental nucleus (PPTg), a mesencephalic sensorimotor hub that provides direct input to the SC (Graybiel, 1978; Beninato and Spencer, 1986; Stubblefield et al., 2015), encodes information about recent actions and their outcomes by recording from individual neurons in behaving mice. While numerous regions provide input to the SC (Sparks and Hartwich-Young, 1989), many of which may modulate its processing underlying action selection (Wolf et al., 2015), the PPTg holds particular interest because it is engaged by sensorimotor tasks across species (Matsumura et al., 1997; Dormont et al., 1998; Kobayashi and Isa, 2002; Kobayashi et al., 2002; Okada and Kobayashi, 2009; Norton et al., 2011; Thompson and Felsen, 2013; Lau et al., 2015).

We found that actions in this task are influenced by actions and outcomes in the recent past. Further, at the time of action selection and even throughout much of the trial, most PPTg neurons represented the choice (left or right movement), outcome (rewarded or non-rewarded), or both, on the previous trial. Furthermore, we found that these representations influenced action selection: the probability of particular upcoming choices was predictably related to the firing rates of neurons selective for choice on the previous trial, and pharmacological inactivation of the PPTg causally affected behavior, in part by decreasing the influence of recent choices on upcoming choices. Our results suggest a novel mechanism, subserved by the PPTg, for efficiently modulating action selection based on the recent history of actions and their outcomes.

Results

Influence of previous trials on behavior

Our overall goal was to address the role played by the PPTg within the interconnected network of brain regions responsible for selecting actions (Gold and Shadlen, 2007; Wolf et al., 2015). Given its connectivity with the basal ganglia and input to the SC, we hypothesized that PPTg activity may represent recent information that could be relevant to the selection of upcoming choices. To test this hypothesis, we examined first whether behavior in our task was influenced by the choices (left or right) and outcomes (rewarded or non-rewarded) on previous trials, and next whether PPTg activity similarly reflected these previous choices and outcomes.

Towards this end, we trained seven mice on a well-established odor-guided spatial choice task (Uchida and Mainen, 2003; Thompson and Felsen, 2013). In each trial of the task, the mouse samples a binary odor mixture at a central port, waits for a go signal, and reports which odor is dominant by moving to the left or right port for a water reward (Figure 1A; see Materials and methods). We have previously used this task to show that the activity of a population of PPTg neurons tends to be higher preceding contralateral than ipsilateral orienting movements (Thompson and Felsen, 2013). Although optimal performance on this task requires that choices be based only on the current stimulus, previous studies – in mice, primates and humans – have shown that behavior in such tasks is, nevertheless, often influenced by previous trials (Lau and Glimcher, 2005; Gold et al., 2008; Busse et al., 2011; Akaishi et al., 2014). We therefore examined how, in our task, upcoming choices depend on the choices and outcomes of previous trials by first examining psychometric functions conditional on previous trial history. As illustrated in the example data from one mouse in Figure 1B,C, the mouse tended to choose the left port more often when it had chosen the left port on the previous trial and had been rewarded (Figure 1B, compare solid black line to gray line), and tended to choose the right port more often when it had chosen the right port on the previous trial whether or not it had been rewarded (Figure 1C, compare solid and dashed black lines to gray line). To quantify the influence of previous trials on choices, separately for each mouse (12 sessions per mouse; 451 ± 115 (mean ± STD) trials per session), we employed a logistic regression model with terms for previous choice ( $β_{C h o i c e}^{1}$ ) and outcome ( $β_{O u t c o m e : L}^{1}$ and $β_{O u t c o m e : R}^{1}$ ) (Extended model, 1 trial back; see Materials and methods). The sign of $β_{C h o i c e}^{1}$ reflects the tendency to stay with the same (positive) or switch to the opposite (negative) choice as on the previous trial, while the sign of $β_{O u t c o m e : L}^{1}$ and $β_{O u t c o m e : R}^{1}$ reflects the tendency to choose left (positive) or right (negative) when the previous choice was rewarded at the side indicated in the subscript. Consistent with the example data shown in Figure 1B,C, and with some previous work (Lau and Glimcher, 2005), we found that, while there was some variability across mice, they tended to choose the same side as they had on the previous trial (reflected in positive values of $β_{C h o i c e}^{1}$ ) [although in different tasks, a tendency to choose the opposite side has been observed (Lau and Glimcher, 2005; Kim et al., 2007; Sul et al., 2011)], and that this tendency was enhanced when the mouse was rewarded on the previous trial (reflected in positive values of $β_{O u t c o m e : L}^{1}$ and negative values of $β_{O u t c o m e : R}^{1}$ ; Figure 1D). In order to determine whether including these terms improved our behavioral model, we compared the accuracy of the Extended model, 1 trial back to the accuracy of the Simple model at predicting choices (Materials and methods). We found that including the choice and outcome on the previous trial improved the predictive accuracy of the model for five of seven mice (Figure 1E, p<0.0001, one-tailed paired Student’s t test separately by mouse). However, including the choice and outcome from two trials back did not further improve the predictive accuracy of the model in any mice (Figure 1F, p>0.16, one-tailed paired Student’s t test).

Figure 1

Download asset Open asset

Behavior is influenced by previous trials.

(A) Timing of behavioral events for two consecutive trials (‘previous’ and ‘current’) of the spatial choice task. Gray box shows pre-stimulus epoch, the primary focus of our neurophysiological analyses. (B and C) Behavioral performance conditional on whether the choice on the previous trial was left (B) or right (C), and rewarded or unrewarded, for 1 example mouse (12 sessions). Gray circles and lines include all trials and are identical in B and C, shown for comparison with the conditional data (black). Lines show best-fit logistic functions using the *Simple model*. (D) Influence of previous choice and outcome on choice behavior for 7 mice (M1-M7), estimated with the *Extended model, 1 trial back*. Error bars, 95% confidence intervals. (E and F) Improvement in predicting choices realized by including in the behavioral model choices and outcomes 1 trial back (E) and 2 trials back (F) for all seven mice (corresponding to M1-M7 in D). Each symbol represents 1 session. For each session, simulated choices on a test set of trials (50% of trials in the session) were separately generated by three models (*Simple model; Extended model, 1 trial back; and Extended model, 2 trial back)* in which all regression coefficients were estimated from the remaining trials (in all sessions) for that mouse. The accuracy of each model’s prediction of choices in the test set was calculated as the percentage of trials in which the predicted choice matched the actual choice. This process was repeated, with a new test set, 50 times/session. Each symbol shows the average, across repeats for each session, of the improvement in accuracy of the *Extended model, 1 trial back* over the *Simple model* (E), and of the *Extended model, 2 trials back* over the *Extended model, 1 trial back* (F). *p<0.0001, one-tailed paired Student’s t test across sessions per mouse.

https://doi.org/10.7554/eLife.16572.003

In principle, the influence of the previous choice on behavior (Figure 1E) could be mediated in part by body position. For example, if the orientation at which the mouse enters the odor port depends on its previous choice, its subsequent choice behavior may be biased. If this were the case, we reasoned that would likely, although not necessarily, observe that movement duration (from odor port exit to reward port entry) would depend on the previous choice. However, we found that this was not the case (durations for rightward movements preceded by right choices and preceded by left choices, analyzed separately for each mouse, did not differ in 7/7 mice; durations for leftward movements preceded by right choices and preceded by left choices did not differ in 6/7 mice; p>0.05, two-tailed unpaired Students t-tests separately by mouse). Therefore, although it is not possible to entirely rule out a role for the position of the body or any of its parts (e.g., whiskers) in contributing to our observed effect, as we did not measure all dimensions of body position, these results reduce the likelihood that choices depend on patterns of body position arising from the previous choice. Together, these results indicate that choices in our task are influenced by what happened on the previous trial – primarily by the choice rather than the outcome – and little influenced by the choice or the outcome from two trials in the past.

Influence of previous trials on PPTg activity

Having established the relevance of the previous trial for behavior, we next examined PPTg activity recorded in the same seven mice while they performed the task (see Materials and methods). We first asked whether PPTg activity reflects the choice or outcome of the current trial. Figure 2A shows rasters and peristimulus time histograms (PSTHs) for a representative PPTg neuron aligned to the time of odor valve open and grouped by the choice on the current trial, and Figure 2B shows the same data grouped by outcome on the current trial. Although the firing rate increased around the time of odor valve open, it does not appear to depend on either the choice (Figure 2A) or outcome (Figure 2B) on the current trial. To quantify dependence on choice across the population of 506 PPTg neurons recorded, we used an ROC-based preference analysis that uses the firing rates in a particular epoch to assign values ranging from −1 (ipsilateral preference) to 1 (contralateral preference), where the magnitude reflects preference strength (see Materials and methods). We were initially interested in preference during the pre-stimulus epoch – from odor port entry to 100 ms after odor valve open (Figure 1A, gray shading; Figure 2A, gray bar) – when the mouse is presumably preparing to update its estimates of the value of each option (i.e., to move left or right) with the information provided by the stimulus. We found that few neurons exhibited a preference for choice during this epoch (Figure 2, p<0.05, Monte Carlo permutation test). We then used a similar preference analysis to quantify dependence on outcome during this same epoch, where negative values indicate preference for no-reward and positive values indicate preference for reward (see Materials and methods). We found that few neurons exhibited outcome preference (Figure 2, p<0.05). It is not surprising that few neurons exhibit choice or outcome preference during this epoch, because the mouse presumably does not commit to a choice and cannot predict the outcome of the trial, before receiving the stimulus.

Figure 2

Download asset Open asset

PPTg activity during the pre-stimulus epoch is influenced by choices and outcomes on previous trials.

(A and B) Rasters (top) and PSTHs (bottom) for one example neuron grouped by choice (A) and outcome (B) on the current trial. Fifty pseudorandomly selected ipsilateral and contralateral trials (A) and reward and no-reward trials (B) are shown in the rasters; all trials are included in the PSTHs. Activity is aligned to the time of odor valve open (blue lines) and sorted within each group by the time since odor port entry (orange ticks). PSTHs show average firing rate across all trials in each group, smoothed with a Gaussian filter (σ = 15 ms). Shading, ± s.e.m. Gray bar shows mean pre-stimulus epoch. (C and D) Preference during the pre-stimulus epoch for choice (C) and outcome (D) on the current trial across the population of neurons. Arrowheads indicate preferences for example neuron shown in A and B. (E, F, G and H) As in A, B, C and D with respect to 1 trial back instead of current trial. Data from same example neuron are shown. (I, J, K and L) As in A, B, C and D with respect to two trials back instead of current trial. Data from same example neuron are shown.

https://doi.org/10.7554/eLife.16572.004

We next examined whether PPTg activity depended on the choice and outcome on the previous trial, which have been shown to influence behavior (Figure 1). Figure 2E,F shows data from the same neuron as in Figure 2A,B but with trials grouped by the choice (Figure 2E) or outcome (Figure 2F) on the previous, as opposed to the current, trial. In contrast to the lack of dependence on the choice or outcome on the current trial (Figure 2A,B), there appears to be a noticeable difference in firing rate around the time the odor valve opens depending on the choice or outcome on the previous trial (Figure 2E,F). We quantified this dependence across the population using the same preference analysis previously described, but with trials grouped according to the choice (Figure 2G) or outcome (Figure 2H) on the previous trial. Across the population, a sizeable fraction of neurons exhibited a preference for the choice (Figure 2G; 26% preferred ipsilateral, 49% preferred contralateral, p<0.05; more preferred contralateral than ipsilateral, p=0.001, χ² test) and outcome (Figure 2H; 46% were selective, p<0.05) on the previous trial. We found a modest correlation between preference for the choice on the previous (Figure 2G) and current (Figure 2C) trials (r = 0.25, p=1.42 × 10⁻⁸).

In principle, the dependence of PPTg activity on previous choice could actually reflect a dependence on the direction of movement from the chosen reward port back to the odor port. While either representation would be useful for conveying information about previous choice, to determine whether PPTg activity reflects choice per se or movement direction, we examined how activity during movement to and from the reward port (which are in opposite directions) depended on which reward port was selected. We found that, rather than representing movement direction, two-thirds of neurons maintained their preference for which reward port was chosen (p=9.12 × 10⁻⁶, χ² test), suggesting that the preference of PPTg neurons for previous choice (Figure 2G) cannot be accounted for by a dependence on movement direction.

We then calculated choice and outcome preference with respect to two trials back (Figure 2I–L) and found that, as was the case for the current trial, few neurons exhibited a preference for choice (Figure 2K) or outcome (Figure 2L), consistent with the relatively little influence on behavior of the choice and outcome from two trials in the past (Figure 1F). Thus, a larger proportion of neurons exhibited a preference for choice or outcome one trial back than on either the current trial or two trials back (choice, p=2.0 × 10^–48; outcome, p=3.0 × 10^–5, Kruskal-Wallis ANOVA; post-hoc comparisons: choice, one trial back vs. current trial, p=5.6 × 10^–30; one trial back vs. two trials back, p=1.6 × 10^–30; outcome, one trial back vs. current trial, p=4.0 × 10^–4; one trial back vs. two trials back, p=0.002), and there were no differences between the fractions of neurons exhibiting a preference for choice or outcome on the current trial and two trials back (choice, two trials back vs. current trial, p=0.92; outcome, two trials back vs. current trial, p = 0.99).

These results suggest that PPTg activity during the pre-stimulus epoch is primarily modulated by the choice and outcome on the previous trial (although our analyses again cannot entirely rule out the possibility that body position also modulates neural activity). Previous work using the same task has shown that PPTg activity in other epochs reflects the choice and outcome on the current trial (Thompson and Felsen, 2013), consistent with other studies of PPTg function (Matsumura et al., 1997; Dormont et al., 1998; Kobayashi et al., 2002; Norton et al., 2011; Maclaren et al., 2013). Together, these results raise the question of how neural activity evolves over the course of the trial from representing previous trial information during the pre-stimulus epoch (Figure 2G,H) to current trial information in later epochs. To quantify these temporal dynamics, we performed a linear regression analysis investigating how PPTg activity is influenced, over time, by four factors: previous choice ( $β_{C h o i c e}^{1}$ ), current choice ( $β_{C h o i c e}^{0}$ ), previous outcome ( $β_{O u t c o m e}^{1}$ ) and current outcome ( $β_{O u t c o m e}^{0}$ ) [Materials and methods; we did not include terms for two trials back because this trial had little influence on PPTg activity (Figure 2K,L)]. Figure 3A shows these $β$ s as a function of time, for the same example PPTg neuron shown in Figure 2. Consistent with the PSTHs shown in Figure 2, around the time the mouse enters the odor port (which precedes odor valve open by about 100 ms), the activity of this neuron began to be influenced by previous choice and previous outcome, but not by current choice and current outcome (Figure 3, left). The influence of the previous trial declines by the time the mouse exits the odor port (Figure 3A, middle), and the influence of the current choice increases upon reward port entry (Figure 3A, right), consistent with the fact that many neurons exhibit preference for the current choice during and following entrance to the reward port (Thompson and Felsen, 2013).

Figure 3

Download asset Open asset

Dynamics of the influence of the choice and outcome on the previous and current trial on PPTg activity throughout the trial.

(A) For the example neuron shown in Figure 3, 95% confidence intervals are shown for each regression coefficient calculated in 100 ms bins (shifted by 10 ms), aligned to three trial events. (B) Average regression coefficients across all neurons that exhibited a significant preference for choice on the previous trial. Ribbons reflect mean ± SEM $β_{C h o i c e}^{1}$ (black) $β_{C h o i c e}^{0}$ (red) coefficient values as a function of time. (C) Fraction of neurons with activity in each 100 ms bin significantly influenced by choice on the previous or current trial, aligned as in A. (D) As in B, with respect to outcome instead of choice. (E) As in C, with respect to outcome instead of choice.

https://doi.org/10.7554/eLife.16572.005

We performed this regression analysis on each neuron that exhibited a significant preference for choice on the previous trial (corresponding to the black bars in Figure 2G; 378/506 total neurons). Figure 3B and C show, over the course of the trial, the mean $β_{C h o i c e}^{1}$ and $β_{C h o i c e}^{0}$ , as well as the fraction of neurons with activity influenced by the choice on the previous (solid line) and current (dashed line) trial. Initially – even preceding odor delivery – the activity of many neurons reflects the choice on the previous trial, while the activity of virtually no neurons reflects the choice on the current trial (Figure 3C, left). During stimulus presentation many neurons begin to reflect the choice on the current trial, and by the time the movement is initiated more neurons reflect the choice on the current than the previous trial (Figure 3C, middle). Interestingly, the sign of $β_{C h o i c e}^{1}$ of many neurons briefly inverted immediately preceding movement initiation (Figure 3B), consistent with the dip in the fraction of significant neurons at this time (Figure 3C). The choice on the current trial retains its influence for the remainder of the trial while the influence of the choice on the previous trial declines (Figure 3C, right), although, interestingly, a sizeable fraction of neurons continues to reflect the choice on the previous trial until well-after the mouse has made its choice on the current trial. We also examined, for the same population shown in Figure 3B,C, how the fraction of neurons with activity influenced by the outcome on the previous trial (solid line) and current trial (dashed line) changes over the course of the trial (Figure 3D,E). Similar to the evolving representation of choice, the activity of more neurons is more strongly influenced by previous trial outcome early in the trial (Figure 3D,E, left), and by current trial outcome later in the trial (Figure 3D,E, right). In contrast to the long-lasting representation of previous choice (Figure 3B,C), virtually none of these neurons represent previous outcome by the time the mouse initiates its movement to the reward port (Figure 3D,E, middle; however, note that the population of neurons analyzed was selected based on preference for previous choice, and not previous outcome, during the pre-stimulus epoch). These analyses show how PPTg activity evolves from representing information about the previous trial to representing information about the current trial, which may reflect the transfer of previous-trial information from the PPTg to a downstream region where it can be integrated with sensory evidence in order to form a motor plan.

Influence of PPTg representations of previous trials on behavior

In order to address whether the PPTg activity representing previous trial information is used to guide behavior, we next examined whether this activity influences the choice on the current trial. For simplicity we focused on PPTg activity representing previous choice, and not outcome, because we found that the former representation was more robust (Figure 2G,H). If this activity indeed contributes to behavior – e.g., by providing information about the value of each option – we would predict a trial-by-trial correlation between the activity of individual neurons that exhibit a preference for previous choice and the likelihood of the mouse choosing the corresponding reward port, particularly on trials in which the sensory cue is ambiguous. Specifically, on trials in which a given neuron exhibits higher activity during the pre-stimulus epoch, we would expect the mouse to be more likely to choose the reward port corresponding to the preferred choice (ipsilateral or contralateral) of that neuron. Therefore, for each neuron exhibiting a significant preference for choice on the previous trial (corresponding to the black bars in Figure 2G; 378/506 total neurons), we examined how choice on ambiguous trials (% Left odor = 40, 50, or 60; Figure 1B) depended on firing rate during the pre-stimulus epoch (Figure 4A). Specifically, for each trial of the session we calculated the firing rate (normalized to its maximum across trials, separately for each previous choice condition) and classified it by whether the mouse chose the preferred or antipreferred choice of the neuron, quantified as 1 and 0, respectively. We then calculated the slope of the best-fit line through these points, each of which correspond to a trial. For a given neuron, a positive slope indicates that the mouse is more likely to choose the reward port corresponding to the preferred choice of the neuron on trials in which its activity is high during the pre-stimulus epoch, as we had predicted. Across this population of neurons, many more exhibited a significantly positive slope than negative slope (p<0.05, Monte Carlo permutation test; positive slopes: 136/378; negative slopes: 30/378; positive > negative, p=0.001, χ² test; Figure 4B), indicating a similar relationship between firing rate and current choice as shown in Figure 4A. However, since firing rate and current choice both depend on previous choice (Figuers 1 and 2), it is possible that this relationship is indirect. To determine whether this was the case, we examined the distribution of slopes separately for each mouse and found that each exhibited more neurons with positive than negative slopes (Figure 4C), even though the behavior of some mice did not depend on previous choice (Figure 1E). These results suggest that the representation of the choice on the previous trial (Figure 2) can directly influence the choice made by the mouse on the current trial.

Figure 4

Download asset Open asset

PPTg representation of previous choice affects behavior.

(A) Probability that the mouse chose the reward port corresponding to the previous-trial choice preference of an example neuron, plotted as a function of the firing rate of that neuron during the pre-stimulus epoch. Firing rates were normalized to the maximum for the neuron, separately within each previous choice condition. Only current trials with an ambiguous sensory cue are shown. Dashed line, best linear fit to raw data (antipreferred choice, y = 0; preferred choice, y = 1). Circles represent mean ± s.e.m. fraction preferred choices for binned normalized firing rates and are shown for display only (not used to fit the line). (B) Slope of best-fit line calculated as in A for all neurons exhibiting a significant preference for choice on the previous trial during the pre-stimulus epoch. Slopes were determined to be significantly different from 0 (p<0.05) with a Monte Carlo permutation test with 1000 repeats. Significance of slopes was identical when the data were fit to a logistic function rather than a line as shown in A. Arrowhead indicates slope for example neuron shown in A. (C) Fraction of significantly positive (white) and negative (black) slopes, and non-significant (gray) slopes (calculated as in B), separately for each mouse, numbered as in Figure 2.

https://doi.org/10.7554/eLife.16572.006

To test the causality of this relationship, we next asked whether inactivating the PPTg, with muscimol (a GABA_A agonist) would change the degree to which the previous choice influences behavior. We first examined the direct effect on behavior of unilateral PPTg inactivation (see Materials and methods). We found that choices were biased ipsilateral to the inactivated PPTg, as compared to the preceding and following control sessions in which saline was infused to the same PPTg (Figure 5A). We quantified this effect across all 30 sets of sessions [saline (pre), muscimol, and saline (post)] from three mice by estimating the influence of muscimol on choice (represented by $β_{M u s c i m o l}$ ; see Materials and methods). Positive values of $β_{M u s c i m o l}$ correspond to an ipsilateral influence and negative values correspond to a contralateral influence. We found that unilaterally inactivating the PPTg resulted in a modest ipsilateral influence on choices (Figure 5B; p<0.05, two-tailed unpaired Student’s t test, pooled across 3 animals and 30 sets of sessions (424 ± 105 trials; mean ± STD); $β_{M u s c i m o l}$ > 0 in 14/30 individual sets of sessions, $β_{M u s c i m o l}$ < 0 in 6/30 individual sets of sessions, p<0.05, two-tailed one-sample Student’s t test; black bars), indicating a causal relationship between PPTg activity and contralateral movements in this task. Consistent with the modest but significant effect of unilateral inactivation on choice bias, we found that movement latency and duration – both to and from the reward port – were longer in muscimol than in saline sessions, even when contralateral and ipsilateral choices were pooled (Materials and methods; p=4.55 × 10⁻²⁰, 5.24 × 10⁻³², and 0.0026, respectively, two-tailed unpaired Student’s t tests).

Figure 5

Download asset Open asset

Inactivating the PPTg causally affects behavior.

(A) Behavioral performance during example sessions after which either muscimol (black circles) or 0.9% saline (gray circles) was infused into the left PPTg. Saline was infused during the session before (gray circles, solid gray line) and after (gray triangles, dashed gray line) the muscimol session. One session was performed per day. Lines show best-fit logistic functions using the *Simple model*. Error bars, ± s.d. Behavioral data shown here and in all subsequent figures were collected from well-trained mice (Materials and methods). (B) $β_{M u s c i m o l}$ calculated for all 3 mice and 30 sets of sessions. Positive values represent leftward (ipsilateral) influence of muscimol. Arrowhead indicates $β_{M u s c i m o l}$ for example sessions shown in A. (C) Behavioral influence of the previous choice calculated separately for muscimol and saline sessions (same sessions shown in B). *p=0.021, one-tailed Mann-Whitney U test.

https://doi.org/10.7554/eLife.16572.007

Finally, we wondered whether the effect on behavior of inactivating PPTg could be due to a decreased dependence of current choice on previous choice. We tested this idea by estimating $β_{C h o i c e}^{1}$ separately for the muscimol and saline sessions and found that $β_{C h o i c e}^{1}$ was indeed lower when PPTg activity was inhibited (Figure 5C, p=0.021, one-tailed Mann-Whitney U test), perhaps accounting for the modest effect on choices shown in Figure 5B [with respect to the other terms in the model, we found that $β_{O d o r : L}$ and $β_{O d o r : R}$ did not differ between muscimol and saline sessions (p=0.44 and p=0.28, respectively, two-tailed Mann-Whitney U tests), but that $β_{0}$ was higher in muscimol than saline sessions (p=0.040, two-tailed Mann-Whitney U test), indicating an odor-independent leftward bias consistent with Figure 5B]. These results support our hypothesis that the PPTg activity representing previous experience (Figure 2) influences action selection.

Discussion

This study examined the neural basis by which action selection is influenced by recent experience. We first found that behavior in a stimulus-cued spatial-choice task can be accounted for by a Bayesian framework (among others) in which choices are influenced by both recent trial history and the sensory stimulus (Figure 1), consistent with previous studies (Gold et al., 2008; Busse et al., 2011; Akaishi et al., 2014). We then found that activity of a subpopulation of PPTg neurons reflects choices and outcomes in the recent past (Figures 2 and 3) and correlated with upcoming choice (Figure 4), suggesting a role in mediating the influence of recent experience on behavior. Furthermore, inactivating PPTg decreased the influence of recent experience on behavior (Figure 5C). Together, our results demonstrate that the PPTg encodes representations of recent experience that can contribute to action selection. Below, we consider our findings in light of the evolving view of the role of the PPTg within the interconnected network of brain regions involved in integrating priors with sensory evidence in order to select and control motor output (Mena-Segovia et al., 2004; Gold and Shadlen, 2007).

The motor-related role of the PPTg has traditionally been considered in the context of central pattern generation (Garcia-Rill, 1991). More recent studies in behaving animals have found that PPTg activity encodes specific actions and reflects their outcomes (Matsumura et al., 1997; Dormont et al., 1998; Okada and Kobayashi, 2009; Norton et al., 2011; Maclaren et al., 2013; Thompson and Felsen, 2013) and have suggested that the PPTg may be involved in attention and other cognitive processes (Steckler et al., 1994; Winn, 2008). The results described here are consistent with these findings and suggest that PPTg output may play a role in providing information about recent actions and their outcomes that can be used to guide the selection of actions mediated by downstream motor circuits. However, downstream circuits do not necessarily utilize this information, given that it is represented in PPTg activity even in mice that do not exhibit a behavioral dependence on previous choice (Figure 1D).

To where might PPTg representations of recent experience be conveyed in order to modulate action selection? While the PPTg projects to several motor-related regions including other brainstem nuclei and the striatum (Beninato and Spencer, 1987; Inglis and Winn, 1995), and the current study cannot rule out the involvement of these regions, an attractive candidate is the SC. Cholinergic PPTg neurons project to and excite neurons in the SC responsible for motor output (Beninato and Spencer, 1986; Sooksawate et al., 2008; Stubblefield et al., 2015), and nicotinic signaling in the SC has been shown to modulate orienting behavior (Weldon et al., 1983; Aizawa et al., 1999; Watanabe et al., 2005). The SC is thought to integrate a wide range of inputs in order to select orienting actions (Kobayashi and Isa, 2002; Krauzlis et al., 2004; Grossberg et al., 2015; Wolf et al., 2015) and is thus well-positioned to combine – possibly additively – incoming sensory information with prior representations of action value (provided before, during, or even after the onset of sensory input; Figure 3B,C) in order to select the most valuable action (Trappenberg et al., 2001; Dorris et al., 2007; Kim and Basso, 2010). Our results suggest that the source of these prior representations may be the PPTg, which could complement other SC inputs – such as inhibition from the basal ganglia (Hikosaka and Wurtz, 1983; Chevalier et al., 1985) – in biasing action selection by modulating SC processing according to recently-experienced action-value associations (Kobayashi and Isa, 2002; Hikosaka et al., 2006; Wolf et al., 2015). While we have found it useful to think of the input from the PPTg to the SC as representing priors, in a Bayesian sense, accepting this framework – which may be imperfect given that choices in our task should not depend on previous trials – is not necessary for interpreting our results.

Given the importance of trial history for action selection, it is not surprising that representations of trial history similar to that shown here have been observed in several brain areas, including the striatum (Lau and Glimcher, 2007, 2008; Histed et al., 2009; Kim et al., 2013), prefrontal cortex (Histed et al., 2009) and premotor cortex (Marcos et al., 2013). Activity in these regions typically reflects information from several previous trials, while PPTg activity was primarily associated with the immediately preceding trial (Figure 2). While this difference may be due to different task demands (recall that tracking past trials conferred no behavioral benefit in our task), the representations in the PPTg that we observed suggest a complementary mechanism to the more computationally expensive processing in these other regions for integrating internal representations of experience with sensory evidence: Specifically, that PPTg directly influences the circuits underlying action selection based on recent experience, which may be well-suited to the relevant dynamics of some real-world situations. Future studies can expand upon our findings by recording from specific types of neurons (e.g., cholinergic) in the PPTg (Lima et al., 2009; Cohen et al., 2012; Roseberry et al., 2016), as well as by examining how associations between stimuli and reward location are initially learned, in order to further elucidate the function of the PPTg in selecting actions.

Materials and methods

Animals

All experiments were performed according to protocols approved by the University of Colorado School of Medicine Institutional Animal Care and Use Committee. Mice were bred in the animal facilities of the University of Colorado Anschutz Medical Campus or purchased (Jackson Labs). We used 10 male adult mice, aged 124–186 days at the start of experiments. Pharmacology experiments were performed in three C57BL/6J mice, and electrophysiological experiments were performed in three C57BL/6J and four ChAT-Cre mice (Jackson Labs, strain B6; 129S6-Chat^tm2(cre)Lowl) (we observed no differences across backgrounds and data are combined here). Mice were housed singly in a vivarium with a 12-hr light/dark cycle with lights on at 5:00 am. Food (Teklad Global Rodent Diet No. 2918; Harlan) was available ad libitum. Access to water was restricted to the behavioral session (see below) unless less than ~1 ml was received, in which case free water was provided for 2–5 min following the session (Thompson and Felsen, 2013).

Behavior

Request a detailed protocol

Mice were trained on an odor-guided spatial choice task (Uchida and Mainen, 2003) as described in detail in Thompson and Felsen (2013). Briefly, in each trial of the task, the mouse waited for a central port to be illuminated, entered the port, waited 144 ± 64 ms (mean ± s.d.) for the odor valve to open, sampled a binary odor mixture, waited 434 ± 68 ms (mean ± s.d.) for a go signal (simultaneous port light off and tone presentation; 5kHz, ~85dB), exited the odor port, and moved toward the left or right reward port [Figure 1A; these delays were selected in order to temporally segregate behavioral events for determining how they were correlated with neural activity, minimize training time, and for consistency with our previous work (Thompson and Felsen, 2013)]. In this previous work, we used a photo-ionization-detector to estimate the latency between odor valve open and the odor first arriving at the port to be 75–100 ms (Thompson and Felsen, 2013). Exiting the odor port prior to the go signal resulted in the unavailability of reward on that trial. All training and experimental behavioral sessions were conducted during the light cycle.

Odors were comprised of binary mixtures of (+)-carvone and (−)-carvone (Acros), commonly perceived as caraway and spearmint, respectively. In all sessions – including training on the task, as well as during neural recording and manipulation – mixtures in which (+)-carvone > (−)-carvone indicated reward availability at the left port, and (−)-carvone > (+)-carvone indicated reward availability at the right port. When (+)-carvone = (−)-carvone, the probability of reward at the left and right ports, independently, was 0.5. The full set of (+)-carvone/ (−)-carvone ratios used was 95/5, 80/20, 60/40, 50/50, 40/60, 20/80, 5/95. Mixtures were diluted in mineral oil and carrier air and delivered to the odor port at 800 ml/min. The mixture presented in each trial was selected pseudo-randomly. Reward (5 µL water) was delivered by transiently opening a calibrated water valve 41 ± 32 ms (mean ± s.d.) after reward port entry. Mice completed training in 6–8 weeks and were then implanted with a drug-delivery cannula or a neural recording drive, as described below. Since our neural recording and manipulation experiments were performed in mice that were well-trained on the task, we do not attempt to examine how neural activity underlies task acquisition here (e.g., via reinforcement learning).

Surgery

Two types of implants were used in these experiments: (1) a steel infusion cannula (Plastics One, Minneaspolis, MN) for muscimol delivery and (2) a Versa Drive 4 microdrive (containing four independently-adjustable tetrodes; Neuralynx) for tetrode recordings. All implants were targeted to the PPTg using the same general stereotactic procedure; implant-specific details are described below. Mice were placed in a ventilated chamber and briefly exposed to a volatile anesthesia (isoflurane, 2%; Priamal Healthcare Limited). Immediately following the onset of deep anesthesia (verified by toe-pinch), the mouse was placed in a stereotaxic device with a nose cone that continuously delivered 1%–1.5% isoflurane to maintain anesthesia. When the mouse was fully unresponsive to foot pinch and appeared to maintain a consistent breathing rate, the fur on the surface of the scalp was removed, and topical antiseptic (Betadine; Purdue Products) was applied along with ophthalmic ointment on the eyes. Before exposing the skull, a bolus of topical anesthetic (250 µL 2% lidocaine; Aspen Veterinary Resources) was injected under the surface of the scalp. With the skull exposed by central incision and scalp retraction, we adjusted head angle to align the elevation of bregma and lambda and drilled a 1.5 mm diameter cranial window centered on coordinates for the left PPTg (4.5 mm posterior from bregma, 1.1 mm lateral from the midline [Paxinos and Watson, 2006]). Following craniotomy and durotomy, we implanted one of the following:

Infusion cannula

Request a detailed protocol

The steel guide and removable steel insert assembly were targeted to 0.15 mm dorsal to the surface of the PPTg. The cannula cap was affixed to the skull with two small screws (Plastics One), luting (3 M) and dental acrylic (A-M Systems).

Versa Drive 4 microdrive

Request a detailed protocol

The drive was lowered into the craniotomy such that the tetrodes extending from its base targeted 100–200 µm above the dorsal surface of the PPTg (2.5 mm from brain surface). The drive was then affixed to the skull using two small screws, a luting, and dental acrylic. One screw was soldered to a ground wire connected to the drive.

In all surgeries, after the acrylic hardened, a topical antibiotic was applied to the scalp around the drive implant, the isoflurane was turned off and oxygen alone was delivered to the animal to gradually alleviate anesthesia. Immediately following surgery, animals were administered sterile 0.9% saline for rehydration and an analgesic (5 mg/kg Ketofen; Zoetis). Post-operative care, including analgesic and antibiotic administration, continued for up to 5 days after surgery and animals were closely monitored for signs of distress.

Pharmacology

Request a detailed protocol

Prior to each session, an injection cannula was prepared with either muscimol (test sessions) or saline (control sessions) and inserted into the chronically implanted guide sans anesthesia. An infusion pump (Harvard Apparatus) was used to administer 150 nL of solution at 0.075 μL/min. Muscimol dosages ranged from 22 to 44 pmol. Mice recovered for at least 10 min before beginning the behavioral session.

Electrophysiology

Request a detailed protocol

Recordings were collected using four tetrodes. Each tetrode consisted of four polyimide-coated nichrome wires (12.5 µm diameter; Sandvik) gold plated to 0.2–0.4 MΩ impedance. Electrical signals were amplified and recorded using the Digital Lynx S multichannel acquisition system in conjunction with Cheetah data acquisition software (Neuralynx). Tetrode depths, estimated by calculating the rotation of the screw affixed to the shuttle holding the tetrode (one rotation = ~250 µm), were adjusted ~75 µm between recording sessions to sample independent populations of neurons across sessions. Offline spike sorting and cluster quality analysis was performed using MCLUST software (MClust-4.0, A.D. Redish et al.) in MATLAB. Briefly, single units were isolated by manual clustering based on features of the sampled waveforms (amplitude, energy, and the first principal component normalized by energy). Clusters with L-ratio <0.75 and isolation distance >12 were deemed single units (Schmitzer-Torbert et al., 2005), which resulted in excluding 30% of clusters. Although units were clustered blind to inter-spike interval (ISI), clusters with ISIs <1 ms were excluded.

Histology

Request a detailed protocol

Final tetrode location was confirmed histologically using electrolytic lesions made after the last recording session and tetrode tracks (Thompson and Felsen, 2013). On day-one, post-lesion, mice were overdosed with an intraperitoneal injection of sodium pentobarbital (100 mg/kg; Sigma Life Science) and transcardially perfused with saline and ice-cold 4% paraformaldehyde (PFA) in 0.1 M phosphate buffer (PB). After perfusion, brains were submerged in 4% PFA in 0.1 M PB for 3.5 hr for post-fixation and then cryoprotected overnight by immersion in 20% sucrose in 0.1 M PB. The brain was embedded in optimal cutting temperature compound (ThermoFisher Scientific) and frozen rapidly on dry ice. Serial coronal sections (50 µm) were cut on a cryostat. Alexa 555 fluorescent Nissl (1:500, NeuroTrace; Invitrogen, catalog #N-21480) was used to identify cytoarchitectural features of the PPTg and verify tetrode tracks and lesions.

Behavioral analysis

Request a detailed protocol

We quantified choice behavior with a logistic function of the form $p = \frac{1}{1 + e^{- η}}$ , where p is the choice made on a given trial (right choice, p=0; left choice, p=1; trials in which no reward port was entered within 1.5 s of odor port exit were excluded) and η is the linear predictor, which was adapted to specific analyses as described below. For all analyses, η consisted of at least $η_{0} = β_{0} + β_{O d o r : L} x_{O d o r : L} + β_{O d o r : R} x_{O d o r : R}$ , where $β_{0}$ represents overall choice bias, $x_{O d o r : L}$ and $x_{O d o r : R}$ represent the strength of the odors associated with the left and right reward port, respectively, and $β_{O d o r : L}$ and $β_{O d o r : R}$ represent the influence of the odors on choice. $x_{O d o r : L}$ and $x_{O d o r : R}$ are calculated as (fraction of odor - 0.5) / 0.5 and range from 0 to 1; we used separate terms for left and right odors to allow for the possibility that they asymmetrically influenced choice. In Figure 1E, for the Simple model, $η = η_{0}$ .

To assess the effect on choice behavior of inhibiting PPTg activity with muscimol (Figure 5A,B), we combined each muscimol session with its pre- and post-saline session and set $η = η_{0} + β_{M u s c i m o l} x_{M u s c i m o l}$ , where $x_{M u s c i m o l} = 0$ for saline trials, $x_{M u s c i m o l} = 1$ for muscimol trials, and $β_{M u s c i m o l}$ represents the influence of muscimol on choice [positive values represent leftward (ipsilateral) influence] (Salzman et al., 1992). In addition, we examined the effect of inhibiting PPTg activity on movement latency and duration by examining the duration between the go signal and odor port exit, between odor port exit and reward port entry, and between reward port exit and odor port entry (to initiate the next trial).

To assess the effect on choice behavior of the choices and outcomes on previous trials, we set $η = η_{0} + \sum_{n = 1}^{N} (β_{C h o i c e}^{n} x_{C h o i c e}^{n} + β_{O u t c o m e : L}^{n} x_{O u t c o m e : L}^{n} + β_{O u t c o m e : R}^{n} x_{O u t c o m e : R}^{n}$ ), where

x_{C h o i c e}^{n} = {\begin{cases} - 1 & f o r r i g h t c h o i c e \\ 0 & f o r n o c h o i c e & n t r i a l s b a c k, \\ 1 & f o r l e f t c h o i c e \end{cases}

x_{O u t c o m e : L}^{n} = {\begin{cases} - 1 & f o r u n r e w a r d e d l e f t c h o i c e \\ 0 & f o r n o n l e f t c h o i c e & n t r i a l s b a c k, \\ 1 & f o r r e w a r d e d l e f t c h o i c e \end{cases}

x_{O u t c o m e : R}^{n} = {\begin{cases} - 1 & f o r u n r e w a r d e d r i g h t c h o i c e \\ 0 & f o r n o n r i g h t c h o i c e & n t r i a l s b a c k, \\ 1 & f o r r e w a r d e d r i g h t c h o i c e \end{cases}

$β_{C h o i c e}^{n}$ represents the influence of the choice n trials back (‘no choice’ includes any excluded trials and those in which odor port entry to initiate the next trial did not occur within 1.5 s of water port exit), $β_{O u t c o m e : L}^{n}$ represents the influence of the outcome at the left n trials back, and $β_{O u t c o m e : R}^{n}$ represents the influence of the outcome at the right n trials back. This set of coefficients provided the most intuitive interpretation of our data; our results did not depend on whether we instead included separate coefficients for previous left and right choice or a single coefficient for previous outcome. In Figure 2D–F, for the Extended model, 1 trial back, n = 1; for the Extended model, 2 trials back, n = 2.

To assess how inhibiting PPTg activity modulated the influence of the choice on the previous trial on behavior, we used a reduced form of the Extended model, 1 trial back by setting $η = η_{0} + β_{C h o i c e}^{1} x_{C h o i c e}^{1}$ , and calculated $β_{C h o i c e}^{1}$ separately for inhibited (muscimol) and control (saline) sessions (Figure 5C).

Preference analysis of neural data

Request a detailed protocol

To quantify the selectivity of single neurons for choice and outcome, we used an ROC-based algorithm (Green and Swets, 1966) that calculates the ability of an ideal observer to classify whether a given spike rate was recorded in one of two conditions (e.g., on trials in which the left or right reward port was selected). ‘Preference’ was calculated as 2(ROC_area – 0.5), a measure ranging from −1 to 1, where −1 signifies the strongest possible preference for one alternative, 1 signifies the strongest possible preference for the other alternative, and 0 signifies no preference (Thompson and Felsen, 2013). For choice preference, −1 = left and 1 = right; for outcome preference, −1 = no reward and 1 = reward. Since choice is correlated with movement direction, this measure of choice preference alone formally cannot disambiguate whether activity reflects the chosen side itself or the direction of movement either towards or from the reward port. However, either movement is a perfect proxy for the choice – e.g., leftward movement towards the reward port and rightward movement from the reward port both indicate a left choice – and we report in the Results section an analysis derived from choice preference that can disambiguate whether PPTg activity reflects choice or movement direction. For clarity, we refer to neural activity with respect to the chosen reward port and the direction of movement towards that port (i.e., left and leftward in the above example). Statistical significance of preference was determined with a Monte Carlo permutation test: we recalculated preference after randomly reassigning each firing rate to one of the two groups, repeated this procedure 1000 times to obtain a distribution, and calculated the fraction of randomly generated preferences exceeding the actual preference. We examined preference during the pre-stimulus epoch (from odor port entry to 100 ms after odor valve open, before stimulus information can reach the PPTg; Figure 1A). For all preference analyses, we tested for significance at α = 0.05. Neurons with fewer than four trials for each condition or with a firing rate below 2 spikes/s for both conditions were excluded from analysis (Thompson and Felsen, 2013).

Regression analysis of neural data

Request a detailed protocol

To assess the influence on PPTg activity of the choices and outcomes on the previous and current trials (Figure 3), we fit the electrophysiological data with a multi-variable linear regression model of the form

F R (t) = β_{0} (t) + β_{C h o i c e}^{1} (t) x_{C h o i c e}^{1} + β_{O u t c o m e}^{1} (t) x_{O u t c o m e}^{1} + β_{C h o i c e}^{0} (t) x_{C h o i c e}^{0} + β_{O u t c o m e}^{0} (t) x_{O u t c o m e}^{0},

where $F R (t)$ is the mean firing rate in a given time bin t (100 ms duration, sampled every 10 ms),

x_{C h o i c e}^{1} = {\begin{cases} - 1 & f o r a n i p s i l a t e r a l (l e f t) c h o i c e \\ 0 & f o r n o c h o i c e & o n t h e p r e v i o u s t r i a l, \\ 1 & f o r a c o n t r a l a t e r a l (r i g h t) c h o i c e \end{cases}

x_{O u t c o m e}^{1} = {\begin{cases} - 1 & f o r a n o n r e w a r d e d c h o i c e \\ 0 & f o r n o c h o i c e & o n t h e p r e v i o u s t r i a l, \\ 1 & f o r a r e w a r d e d c h o i c e \end{cases}

x_{C h o i c e}^{0} = {\begin{cases} - 1 & f o r a n i p s i l a t e r a l c h o i c e \\ 0 & f o r n o c h o i c e & o n t h e c u r r e n t t r i a l, \\ 1 & f o r a c o n t r a l a t e r a l c h o i c e \end{cases}

x_{O u t c o m e}^{0} = {\begin{cases} - 1 & f o r a n o n r e w a r d e d c h o i c e \\ 0 & f o r n o c h o i c e & o n t h e c u r r e n t t r i a l, \\ 1 & f o r a r e w a r d e d c h o i c e \end{cases}

$β_{0} (t)$ represents the mean firing rate across trials in time bin t, and $β_{C h o i c e}^{1} (t)$ , $β_{O u t c o m e}^{1} (t)$ , $β_{C h o i c e}^{0} (t)$ , and $β_{O u t c o m e}^{0} (t)$ represent the influence on firing rate of the previous choice, previous outcome, current choice and current outcome, respectively, in time bin t (Rorie et al., 2010). Positive values for $β_{C h o i c e}^{1}$ and $β_{C h o i c e}^{0}$ indicate that firing rate is increased by contralateral choices and negative values indicate that firing rate is increased by ipsilateral choices. Positive values for $β_{O u t c o m e}^{1}$ and $β_{O u t c o m e}^{0}$ indicate that firing rate is increased by rewarded choices and negative values indicate that firing rate is increased by unrewarded choices.

References

1. Aizawa H
2. Kobayashi Y
3. Yamamoto M
4. Isa T
(1999)
Injection of nicotine into the superior colliculus facilitates occurrence of express saccades in monkeys

Journal of Neurophysiology 82:1642–1646.
- Google Scholar
1. Akaishi R
2. Umeda K
3. Nagase A
4. Sakai K
(2014) Autonomous mechanism of internal choice estimate underlies decision inertia
Neuron 81:195–206.

https://doi.org/10.1016/j.neuron.2013.10.018
- Google Scholar
1. Beninato M
2. Spencer RF
(1986) A cholinergic projection to the rat superior colliculus demonstrated by retrograde transport of horseradish peroxidase and choline acetyltransferase immunohistochemistry
The Journal of Comparative Neurology 253:525–538.

https://doi.org/10.1002/cne.902530409
- Google Scholar
1. Beninato M
2. Spencer RF
(1987) A cholinergic projection to the rat substantia nigra from the pedunculopontine tegmental nucleus
Brain Research 412:169–174.

https://doi.org/10.1016/0006-8993(87)91455-7
- Google Scholar
(2003) Superior colliculus encodes distance to target, not saccade amplitude, in multi-step gaze shifts
Nature Neuroscience 6:404–413.

https://doi.org/10.1038/nn1027
- Google Scholar
1. Busse L
2. Ayaz A
3. Dhruv NT
4. Katzner S
5. Saleem AB
6. Schölvinck ML
7. Zaharia AD
8. Carandini M
(2011) The detection of visual contrast in the behaving mouse
Journal of Neuroscience 31:11351–11361.

https://doi.org/10.1523/JNEUROSCI.6689-10.2011
- Google Scholar
(1985) Disinhibition as a basic process in the expression of striatal functions. I. The striato-nigral influence on tecto-spinal/tecto-diencephalic neurons
Brain Research 334:215–226.

https://doi.org/10.1016/0006-8993(85)90213-6
- Google Scholar
1. Cohen JY
2. Haesler S
3. Vong L
4. Lowell BB
5. Uchida N
(2012) Neuron-type-specific signals for reward and punishment in the ventral tegmental area
Nature 482:85–88.

https://doi.org/10.1038/nature10754
- Google Scholar
(1998) The role of the pedunculopontine tegmental nucleus in relation to conditioned motor performance in the cat
Experimental Brain Research 121:401–410.

https://doi.org/10.1007/s002210050474
- Google Scholar
(2007) Competitive integration of visual and preparatory signals in the superior colliculus during saccadic programming
Journal of Neuroscience 27:5053–5062.

https://doi.org/10.1523/JNEUROSCI.4212-06.2007
- Google Scholar
1. Felsen G
2. Mainen ZF
(2008) Neural substrates of sensory-guided locomotor decisions in the rat superior colliculus
Neuron 60:137–148.

https://doi.org/10.1016/j.neuron.2008.09.019
- Google Scholar
1. Felsen G
2. Mainen ZF
(2012) Midbrain contributions to sensorimotor decision making
Journal of Neurophysiology 108:135–147.

https://doi.org/10.1152/jn.01181.2011
- Google Scholar
1. Garcia-Rill E
(1991) The pedunculopontine nucleus
Progress in Neurobiology 36:363–389.

https://doi.org/10.1016/0301-0082(91)90016-T
- Google Scholar
1. Glimcher PW
2. Sparks DL
(1992) Movement selection in advance of action in the superior colliculus
Nature 355:542–545.

https://doi.org/10.1038/355542a0
- Google Scholar
1. Gold JI
2. Law CT
3. Connolly P
4. Bennur S
(2008) The relative influences of priors and sensory evidence on an oculomotor decision variable during perceptual learning
Journal of Neurophysiology 100:2653–2668.

https://doi.org/10.1152/jn.90629.2008
- Google Scholar
1. Gold JI
2. Shadlen MN
(2007) The neural basis of decision making
Annual Review of Neuroscience 30:535–574.

https://doi.org/10.1146/annurev.neuro.29.051605.113038
- Google Scholar
1. Graybiel AM
(1978) A stereometric pattern of distribution of acetylthiocholinesterase in the deep layers of the superior colliculus
Nature 272:539–541.

https://doi.org/10.1038/272539b0
- Google Scholar
Book
1. Green DM
2. Swets JA
(1966)
Signal Detection Theory and Psychophysics

Hoboken, NJ: John Wiley and Sons.
- Google Scholar
(2015) Resonant cholinergic dynamics in cognitive and motor decision-making: attention, category learning, and choice in neocortex, superior colliculus, and optic tectum
Frontiers in Neuroscience 9:.

https://doi.org/10.3389/fnins.2015.00501
- Google Scholar
1. Hikosaka O
2. Kim HF
3. Yasuda M
4. Yamamoto S
(2014) Basal ganglia circuits for reward value–guided behavior
Annual Review of Neuroscience 37:289–306.

https://doi.org/10.1146/annurev-neuro-071013-013924
- Google Scholar
(2006) Basal ganglia orient eyes to reward
Journal of Neurophysiology 95:567–584.

https://doi.org/10.1152/jn.00458.2005
- Google Scholar
1. Hikosaka O
2. Wurtz RH
(1983)
Visual and oculomotor functions of monkey substantia nigra pars reticulata. IV. Relation of substantia nigra to superior colliculus

Journal of Neurophysiology 49:1285–1301.
- Google Scholar
(2009) Learning substrates in the primate prefrontal cortex and striatum: sustained activity related to successful actions
Neuron 63:244–253.

https://doi.org/10.1016/j.neuron.2009.06.019
- Google Scholar
1. Horwitz GD
2. Newsome WT
(2001)
Target selection for saccadic eye movements: direction-selective visual responses in the superior colliculus

Journal of Neurophysiology 86:2527–2542.
- Google Scholar
1. Inglis WL
2. Winn P
(1995) The pedunculopontine tegmental nucleus: where the striatum meets the reticular formation
Progress in Neurobiology 47:1–29.

https://doi.org/10.1016/0301-0082(95)00013-L
- Google Scholar
1. Kim B
2. Basso MA
(2010) A probabilistic strategy for understanding action selection
Journal of Neuroscience 30:2340–2355.

https://doi.org/10.1523/JNEUROSCI.1730-09.2010
- Google Scholar
1. Kim H
2. Lee D
3. Jung MW
(2013) Signals for previous goal choice persist in the dorsomedial, but not dorsolateral striatum of rats
Journal of Neuroscience 33:52–63.

https://doi.org/10.1523/JNEUROSCI.2422-12.2013
- Google Scholar
1. Kim JY
2. Lee WY
3. Chung EJ
4. Dhong HJ
(2007) Analysis of olfactory function and the depth of olfactory sulcus in patients with Parkinson's disease
Movement Disorders 22:1563–1566.

https://doi.org/10.1002/mds.21490
- Google Scholar
1. Kobayashi Y
2. Inoue Y
3. Yamamoto M
4. Isa T
5. Aizawa H
(2002)
Contribution of pedunculopontine tegmental nucleus neurons to performance of visually guided saccade tasks in monkeys

Journal of Neurophysiology 88:715–731.
- Google Scholar
1. Kobayashi Y
2. Isa T
(2002) Sensory-motor gating and cognitive control by the brainstem cholinergic system
Neural Networks 15:731–741.

https://doi.org/10.1016/S0893-6080(02)00059-X
- Google Scholar
(2004) Target selection and the superior colliculus: goals, choices and hypotheses
Vision Research 44:1445–1451.

https://doi.org/10.1016/j.visres.2004.01.005
- Google Scholar
1. Körding KP
2. Wolpert DM
(2006) Bayesian decision theory in sensorimotor control
Trends in Cognitive Sciences 10:319–326.

https://doi.org/10.1016/j.tics.2006.05.003
- Google Scholar
1. Lau B
2. Glimcher PW
(2005) Dynamic response-by-response models of matching behavior in rhesus monkeys
Journal of the Experimental Analysis of Behavior 84:555–579.

https://doi.org/10.1901/jeab.2005.110-04
- Google Scholar
1. Lau B
2. Glimcher PW
(2007) Action and outcome encoding in the primate caudate nucleus
Journal of Neuroscience 27:14502–14514.

https://doi.org/10.1523/JNEUROSCI.3060-07.2007
- Google Scholar
1. Lau B
2. Glimcher PW
(2008) Value representations in the primate striatum during matching behavior
Neuron 58:451–463.

https://doi.org/10.1016/j.neuron.2008.02.021
- Google Scholar
1. Lau B
2. Welter ML
3. Belaid H
4. Fernandez Vidal S
5. Bardinet E
6. Grabli D
7. Karachi C
(2015) The integrative role of the pedunculopontine nucleus in human gait
Brain 138:1284–1296.

https://doi.org/10.1093/brain/awv047
- Google Scholar
(2009) PINP: a new method of tagging neuronal populations for identification during in vivo electrophysiological recording
PLoS One 4:e6099.

https://doi.org/10.1371/journal.pone.0006099
- Google Scholar
(2013) Updating of action-outcome associations is prevented by inactivation of the posterior pedunculopontine tegmental nucleus
Neurobiology of Learning and Memory 102:28–33.

https://doi.org/10.1016/j.nlm.2013.03.002
- Google Scholar
1. Marcos E
2. Pani P
3. Brunamonti E
4. Deco G
5. Ferraina S
6. Verschure P
(2013) Neural variability in premotor cortex is modulated by trial history and predicts behavioral performance
Neuron 78:249–255.

https://doi.org/10.1016/j.neuron.2013.02.006
- Google Scholar
(1997) Single-unit activity in the primate nucleus tegmenti pedunculopontinus related to voluntary arm movement
Neuroscience Research 28:155–165.

https://doi.org/10.1016/S0168-0102(97)00039-4
- Google Scholar
(2004) Pedunculopontine nucleus and basal ganglia: distant relatives or part of the same family?
Trends in Neurosciences 27:585–588.

https://doi.org/10.1016/j.tins.2004.07.009
- Google Scholar
1. Norton AB
2. Jo YS
3. Clark EW
4. Taylor CA
5. Mizumori SJ
(2011) Independent neural coding of reward and movement by pedunculopontine tegmental nucleus neurons in freely navigating rats
European Journal of Neuroscience 33:1885–1896.

https://doi.org/10.1111/j.1460-9568.2011.07649.x
- Google Scholar
1. Okada K
2. Kobayashi Y
(2009) Characterization of oculomotor and visual activities in the primate pedunculopontine tegmental nucleus during visually guided saccade tasks
European Journal of Neuroscience 30:2211–2223.

https://doi.org/10.1111/j.1460-9568.2009.07009.x
- Google Scholar
Book
1. Paxinos G
2. Watson C
(2006)
He Rat Brain in Stereotaxic Coordinates: Hard Cover Edition

San Diego CA: Academic Press.
- Google Scholar
(2010) Integration of sensory and reward information during perceptual decision-making in lateral intraparietal cortex (LIP) of the macaque monkey
PLoS One 5:e9308.

https://doi.org/10.1371/journal.pone.0009308
- Google Scholar
1. Roseberry TK
2. Lee AM
3. Lalive AL
4. Wilbrecht L
5. Bonci A
6. Kreitzer AC
(2016) Cell-type-specific control of brainstem locomotor circuits by basal ganglia
Cell 164:526–537.

https://doi.org/10.1016/j.cell.2015.12.037
- Google Scholar
(1992)
Microstimulation in visual area MT: effects on direction discrimination performance

The Journal of Neuroscience 12:2331–2355.
- Google Scholar
(2005) Quantitative measures of cluster quality for use in extracellular recordings
Neuroscience 131:1–11.

https://doi.org/10.1016/j.neuroscience.2004.09.066
- Google Scholar
(2011) Deficits in reach target selection during inactivation of the midbrain superior colliculus
PNAS 108:E1433–E1440.

https://doi.org/10.1073/pnas.1109656108
- Google Scholar
1. Sooksawate T
2. Isa K
3. Isa T
(2008) Cholinergic responses in crossed tecto-reticular neurons of rat superior colliculus
Journal of Neurophysiology 100:2702–2711.

https://doi.org/10.1152/jn.90723.2008
- Google Scholar
1. Sparks DL
2. Hartwich-Young R
(1989)
The deep layers of the superior colliculus

Reviews of Oculomotor Research 3:213–255.
- Google Scholar
1. Steckler T
2. Inglis W
3. Winn P
4. Sahgal A
(1994) The pedunculopontine tegmental nucleus: a role in cognitive processes?
Brain Research. Brain Research Reviews 19:298–318.

https://doi.org/10.1016/0165-0173(94)90016-7
- Google Scholar
(2013) Optogenetic investigation of the role of the superior colliculus in orienting movements
Behavioural Brain Research 255:55–63.

https://doi.org/10.1016/j.bbr.2013.04.040
- Google Scholar
(2015) Optogenetic cholinergic modulation of the mouse superior colliculus in vivo
Journal of Neurophysiology 114:978–988.

https://doi.org/10.1152/jn.00917.2014
- Google Scholar
1. Sul JH
2. Jo S
3. Lee D
4. Jung MW
(2011) Role of rodent secondary motor cortex in value-based action selection
Nature Neuroscience 14:1202–1208.

https://doi.org/10.1038/nn.2881
- Google Scholar
1. Tai LH
2. Lee AM
3. Benavidez N
4. Bonci A
5. Wilbrecht L
(2012) Transient stimulation of distinct subpopulations of striatal neurons mimics changes in action value
Nature Neuroscience 15:1281–1289.

https://doi.org/10.1038/nn.3188
- Google Scholar
1. Thompson JA
2. Felsen G
(2013) Activity in mouse pedunculopontine tegmental nucleus reflects action and outcome in a decision-making task
Journal of Neurophysiology 110:2817–2829.

https://doi.org/10.1152/jn.00464.2013
- Google Scholar
(2001) A model of saccade initiation based on the competitive integration of exogenous and endogenous signals in the superior colliculus
Journal of Cognitive Neuroscience 13:256–271.

https://doi.org/10.1162/089892901564306
- Google Scholar
1. Uchida N
2. Mainen ZF
(2003) Speed and accuracy of olfactory discrimination in the rat
Nature Neuroscience 6:1224–1229.

https://doi.org/10.1038/nn1142
- Google Scholar
1. Watanabe M
2. Kobayashi Y
3. Inoue Y
4. Isa T
(2005) Effects of local nicotinic activation of the superior colliculus on saccades in monkeys
Journal of Neurophysiology 93:519–534.

https://doi.org/10.1152/jn.00558.2004
- Google Scholar
(1983) Rotational behavior following cholinergic stimulation of the superior colliculus in rats
Pharmacology Biochemistry and Behavior 19:813–820.

https://doi.org/10.1016/0091-3057(83)90086-2
- Google Scholar
1. Winn P
(2008) Experimental studies of pedunculopontine functions: are they motor, sensory or integrative?
Parkinsonism & Related Disorders 14:S194–S198.

https://doi.org/10.1016/j.parkreldis.2008.04.030
- Google Scholar
(2015) An integrative role for the superior colliculus in selecting targets for movements
Journal of Neurophysiology 114:2118–2131.

https://doi.org/10.1152/jn.00262.2015
- Google Scholar

Article and author information

Author details

John A Thompson
1. Department of Neurosurgery, University of Colorado School of Medicine, Aurora, United States
2. Department of Physiology and Biophysics, University of Colorado School of Medicine, Aurora, United States
Contribution
JAT, Designed the experiments, Collected the data, Analyzed the data, Wrote the manuscript

For correspondence
john.a.thompson@ucdenver.edu

Competing interests
The authors declare that no competing interests exist.

"This ORCID iD identifies the author of this article:" 0000-0003-2991-5194
Jamie D Costabile

Department of Physiology and Biophysics, University of Colorado School of Medicine, Aurora, United States

Contribution
JDC, Designed the experiments, Collected the data, Analyzed the data, Wrote the manuscript

Competing interests
The authors declare that no competing interests exist.
Gidon Felsen

Department of Physiology and Biophysics, University of Colorado School of Medicine, Aurora, United States

Contribution
GF, Designed the experiments, Analyzed the data, Wrote the manuscript

Competing interests
The authors declare that no competing interests exist.

"This ORCID iD identifies the author of this article:" 0000-0003-0745-8279

Funding

National Institute of Neurological Disorders and Stroke (R01NS079518)

Gidon Felsen

Boettcher Foundation

Gidon Felsen

National Institute of Neurological Disorders and Stroke (P30NS048154)

Gidon Felsen

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

This work was supported by NINDS/NIH grant R01NS079518, the Boettcher Foundation's Webb-Waring Biomedical Research Award, and the University of Colorado Anschutz Medical Campus Optogenetics and Neural Engineering Core (P30NS048154). We thank Stephen V David, Brian Lau, and members of the Felsen lab for helpful discussions and comments on the manuscript.

Ethics

Animal experimentation: All experiments were performed according to protocols approved by the University of Colorado School of Medicine Institutional Animal Care and Use Committee (protocol #: B-90215(11)1D).

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.