Dynamic modulation of decision biases by brainstem arousal systems

Abstract
eLife digest
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

Decision-makers often arrive at different choices when faced with repeated presentations of the same evidence. Variability of behavior is commonly attributed to noise in the brain’s decision-making machinery. We hypothesized that phasic responses of brainstem arousal systems are a significant source of this variability. We tracked pupil responses (a proxy of phasic arousal) during sensory-motor decisions in humans, across different sensory modalities and task protocols. Large pupil responses generally predicted a reduction in decision bias. Using fMRI, we showed that the pupil-linked bias reduction was (i) accompanied by a modulation of choice-encoding pattern signals in parietal and prefrontal cortex and (ii) predicted by phasic, pupil-linked responses of a number of neuromodulatory brainstem centers involved in the control of cortical arousal state, including the noradrenergic locus coeruleus. We conclude that phasic arousal suppresses decision bias on a trial-by-trial basis, thus accounting for a significant component of the variability of choice behavior.

https://doi.org/10.7554/eLife.23232.001

eLife digest

When asked to make repeated decisions we will often choose differently each time even when we are given the same information to inform our choice. A stock trader, for example, will typically be more inclined to buy on some days and sell on others even if the financial markets remain unchanged. Fluctuations in the brain’s level of alertness or excitability, otherwise known as its arousal, are thought to contribute to this variability in decision-making.

An area at the base of the brain called the brainstem – and in particular one of its subregions, the locus coeruleus – helps shape arousal levels by releasing chemicals called neuromodulators. For reasons that remain unknown, activation of the locus coeruleus also causes the pupil of the eye to suddenly increase in size. Now, de Gee et al. have exploited this link to unravel how changes in brain arousal lead to systematic changes in decision-making.

Volunteers were asked to judge whether a faint pattern was embedded in flickering noise on a computer screen, and to report their judgment by pressing one of two buttons to indicate “yes” or “no”. Although the decision was comparatively simple, it did involve evaluating changing information over time before making a choice – like when considering the stock market. As the volunteers performed the task, de Gee et al. measured their brain activity and the size of their pupils. Most of the volunteers had a tendency to respond “no” even when the pattern was present. However, whenever their locus coeruleus was particularly active, and their pupils increased in size, their decision process was changed so that this unhelpful choice bias decreased.

This suggests that by boosting arousal, the locus coeruleus reduces existing biases in our decision-making. Varying levels of locus coeruleus activity may thus explain why we can reach different conclusions when considering the same information on multiple occasions. The next challenge is to identify what it is about the decision-making process that activates the locus coeruleus on some occasions but not others.

https://doi.org/10.7554/eLife.23232.002

Introduction

Decision-makers often arrive at different choices in the face of repeated presentations of the same evidence (Glimcher, 2005; Gold and Shadlen, 2007; Shadlen et al., 1996; Sugrue et al., 2005; Wyart and Koechlin, 2016). This intrinsic behavioral variability is typically attributed to spontaneous fluctuations of neural activity in the brain regions computing decisions (Glimcher, 2005; Shadlen et al., 1996) (but see [Beck et al., 2012; Brunton et al., 2013]). Indeed, fluctuations of neural activity are ubiquitous in the cerebral cortex (Faisal et al., 2008; Glimcher, 2005; Lin et al., 2015).

One candidate source of these fluctuations in cortical activity is systematic variation in central arousal state. Central arousal state is controlled by the neuromodulatory systems of the brainstem, which have widespread projections to cortex and tune neuronal parameters governing the operating mode of their cortical target circuits (Aston-Jones and Cohen, 2005; Harris and Thiele, 2011; Lee and Dan, 2012). Importantly, these neuromodulatory systems operate at different timescales (Aston-Jones and Cohen, 2005; Parikh et al., 2007). Some, in particular the noradrenergic locus coeruleus (LC), are rapidly recruited, in a time-locked fashion, during elementary decisions (Aston-Jones and Cohen, 2005; Bouret and Sara, 2005; Dayan and Yu, 2006; Parikh et al., 2007). Pupil diameter, a reliable peripheral marker of central (cortical) arousal state (McGinley et al., 2015b), also increases during decisions (Beatty, 1982; de Gee et al., 2014; Gilzenrat et al., 2010; Lempert et al., 2015; Nassar et al., 2012). These observations point to an important role of phasic (i.e., fast) pupil-linked arousal signals in decision-making (Aston-Jones and Cohen, 2005; Dayan and Yu, 2006). Yet, the precise nature of this role has remained unknown.

Here, we investigated how phasic, task-related arousal interacts with decision computations in the human brain. We combined pupillometry, fMRI, and computational modeling to probe into the interplay between task-related arousal and decision computations underlying elementary sensory-motor choice tasks. Sensory-motor decisions entail the gradual accumulation of noisy ‘sensory evidence’ about the state of the world towards categorical decision states governing behavioral choice (Bogacz et al., 2006; Brody and Hanks, 2016; Gold and Shadlen, 2007; Ratcliff and McKoon, 2008). A large-scale network of regions in frontal and parietal cortex seems to accumulate stimulus responses provided by sensory cortices towards choices of motor movements (Gold and Shadlen, 2007; Siegel et al., 2011) (but see [Brody and Hanks, 2016; Katz et al., 2016]). We here aimed to elucidate the interaction between pupil-linked arousal responses, evidence accumulation, and decision processing across several (cortical and subcortical) brain regions.

Large task-evoked pupil responses were consistently accompanied by a reduction in perceptual decision bias in different sensory modalities (visual and auditory) and task protocols (detection and discrimination). Decision bias reflects the degree to which an observer’s choice deviates from the objective sensory evidence. Using fMRI for one of these tasks revealed that the bias reduction was accompanied by a modulation of choice-encoding pattern signals in prefrontal and parietal cortex. Further, the bias reduction was predicted by task-evoked, pupil-linked responses in a network of neuromodulatory brainstem nuclei controlling cortical arousal state. We conclude that phasic neuromodulatory signals reduce biases in the brain’s decision-making machinery. As a consequence, phasic arousal accounts for a significant component of the variability of choice behavior, over and above the objective evidence gathered from the outside world.

Results

We systematically quantified the interaction between pupil-linked arousal responses and decision computations at the algorithmic and neural levels of analysis. We here operationalize ‘phasic arousal’ as task-evoked pupil responses (TPR). This operational definition is based on recent animal work, which established remarkably strong correlations between non-luminance mediated variations in pupil diameter and global cortical arousal state (McGinley et al., 2015b).

The Results section is organized as follows. First, we quantify TPRs during the main behavioral task studied in this paper. The key observation here was the substantial trial-to-trial variability of the TPR amplitude. All subsequent analyses exploited this variability to pinpoint the functional correlates of phasic arousal. We then present results from modeling TPR-dependent changes in choice behavior, identifying precise algorithmic correlates of phasic arousal. These results yielded detailed predictions for the underlying modulations of cortical signals. Third, we present tests of these predictions, focusing on functionally delineated cortical regions of interest. We conclude by establishing that the trial-to-trial fluctuations in TPR amplitude, and the associated bias reduction, were closely linked to task-evoked responses of neuromodulatory brainstem centers involved in regulating cortical arousal state.

Tracking trial-to-trial fluctuations in phasic arousal

The main task used in this study was detection (‘yes-no’, simple forced choice protocol) of a low-contrast grating (Figure 1A). The grating contrast was titrated to the 75% correct level, and subjects did not receive trial-by-trial feedback. As observed previously (de Gee et al., 2014), TPR amplitudes during this task fluctuated widely from trial to trial (Figure 1B,C; see Materials and Methods for quantification of TPR). To illustrate, pooling trials into two bins containing the lowest and highest 40% of TPR amplitudes (Figure 1B) yielded, on average, the commonly observed task-evoked pupil dilations for the high TPR bin, but pupil constrictions for the low TPR bin (Figure 1C). We used a previously established model to estimate the time course of the neural input driving the measured TPRs (GLM; see Materials and methods; Figure 1—figure supplement 1A–C). This revealed that the difference between the low and high TPR bins was primarily due to the difference in a sustained component that spanned the entire interval from cue to behavioral choice (Figure 1D). The difference of the sustained component between low and high TPR was significantly larger than the corresponding difference for two components at cue or choice, respectively (2-way repeated measures ANOVA with factors temporal component and TPR bin; interaction: F_2,26 = 79.00, p<0.001).

Figure 1 with 1 supplement see all

Download asset Open asset

Behavioral task and task-evoked pupil responses.

(A) Yes-no contrast detection task. Top: schematic sequence of events during a signal+noise trial. Subjects reported the presence or absence of a faint grating signal superimposed onto dynamic noise. Bottom left: the signal, if present, was oriented clockwise or counter clockwise on different blocks (known to the subject beforehand). Signal contrast is high for illustration only. Bottom right: trial types. (B) Quantifying task-evoked pupillary response (TPR) amplitude. Top: mean TPR time course of an example subject. Green box, interval for averaging TPR values on single trials. Bottom: trials were pooled into three bins of TPR amplitudes (lowest/highest 40% and intermediate 20%). (C) TPR time course for the three bins. (D) Mean beta weights of transient (cue, choice) and sustained input components under low vs. high TPR, estimated with a general linear model (see Materials and methods; Figure 1—figure supplement 1A,B), separately for low and high TPR trials. Panels C, D: group average (N = 14); shading, s.e.m.; data points, individual subjects; stats, permutation test.

https://doi.org/10.7554/eLife.23232.003

In sum, TPR amplitude exhibited substantial trial-to-trial fluctuations, which were predominantly driven by changing levels in sustained input during decision formation. Given the prolonged nature of the decision (median of subject-median reaction time, RT: 2.11 s), the sustained, intra-decisional arousal boost might have interacted with the decision computation. To test for such an interaction between arousal boost and decision computation, we next modeled subjects’ choice behavior as a function of TPR amplitude.

Phasic arousal is inversely related to decision bias

We found a robust and consistent relationship between TPR and decision bias. This effect was present in two independent data sets using an analogous contrast detection task: the newly collected fMRI data set, and a re-analysis of an existing data set (de Gee et al., 2014)) (Figure 2A,D, middle and right panels). Decision bias was quantified in two ways (for details, see Materials and methods). First, we computed signal detection-theoretic (SDT) criterion (Figure 2A,D, middle panels). Second, we computed the fraction of ‘yes’-choices (right panels), after balancing the number of signal+noise and noise trials within each TPR bin. We did not find a consistent relationship between phasic arousal, as measured by TPR, and perceptual sensitivity, quantified by SDT d’ (Figure 2A,D, left panels).

Figure 2 with 1 supplement see all

Download asset Open asset

Phasic arousal predicts reduction of choice bias.

(A) Perceptual sensitivity SDT d’ (left), decision bias, measured as SDT criterion (middle) or fraction of ‘yes’-choices (right), for low and high TPR. For the fraction of ‘yes’-choices analysis, we ensured that each TPR bin consisted of an equal number of signal+noise and noise trials (see Materials and methods). Data points, individual subjects. (B) Relationship between TPR and d’ or criterion (5 bins). Linear fits are plotted wherever the first-order fit was superior to the constant fit (see Materials and methods). Quadratic fits are plotted wherever the second-order fit was superior to first-order fit. (C) Sliding window linear correlation between TPR and SDT criterion (5 bins), aligned to button press. Dashed line, median decision onset (cue). The group average pupil response time course is plotted for reference in blue. (**D–F**) As panels A-C, for an independent data set (de Gee et al., 2014). All panels: group average (N = 14 and N = 21); shading or error bars, s.e.m.; stats, permutation test.

https://doi.org/10.7554/eLife.23232.005

Figure 2—source data 3 Table with variable identifiers used in Figure 2—source data 1 and 2.: https://doi.org/10.7554/eLife.23232.006
Download elife-23232-fig2-data3-v3.txt
Figure 2—source data 1 This csv table contains the data for Figure 2 panel A.: https://doi.org/10.7554/eLife.23232.007
Download elife-23232-fig2-data1-v3.csv
Figure 2—source data 2 This csv table contains the data for Figure 2 panel D.: https://doi.org/10.7554/eLife.23232.008
Download elife-23232-fig2-data2-v3.csv

The negative association between TPR and decision bias (SDT criterion) was approximately linear across a range of five TPR-defined bins (Figure 2B,E, right panels). In all cases, here and below, we tested whether fits of second-order polynomials, reflecting non-monotonic relationships between TPR and behavior, were superior to the linear fits (via sequential polynomial regression analysis; Materials and methods). We found a non-monotonic relationship between TPR and sensitivity in the behavioral data set from de Gee et al. (2014), but not in the fMRI dataset (Figure 2B,E, left panels). This non-monotonic (inverted U-shape) relationship between pupil diameter and sensitivity is consistent with previous animal work on correlations between baseline arousal and behavior (Aston-Jones and Cohen, 2005; McGinley et al., 2015a). However, it was less consistent across the data sets analyzed in this paper than the negative linear effect of TPR on decision bias. The consistent effect of TPR on decision bias has not been reported before in previous studies of slow fluctuations of baseline pupil diameter. In what follows, we focus on the negative effect of TPR on decision bias.

Most subjects were overall (i.e., without splitting trials by TPR) intrinsically biased to respond ‘no’: 10 out of 14 subjects exhibited a significantly conservative criterion (within-subject permutation tests; p<0.05) in the fMRI data set, and 14 out of 21 subjects in the data set from de Gee et al. (2014). Because signal+noise and noise trials were equally frequent in both experiments, this bias was always maladaptive. Critically, this maladaptive bias was particularly pronounced under low TPR; but under high TPR the bias was nearly neutralized, especially in the fMRI data set (criterion around zero, and fraction of ‘yes’-choices around 0.5 for highest TPR bins, Figure 2A,B).

A robust effect of phasic arousal on the decision computation

A number of control analyses and experiments supported the idea that the negative correlation between TPR amplitude and decision bias reflected a specific effect of phasic arousal on the decision computation that generalized across perceptual choice tasks. First, the effect emerged during, not after, decision formation: a sliding-window correlation between TPR and criterion became negative from decision onset onwards, and reached statistical significance before button press (Figure 2C,F). In the fMRI data set, this correlation was highly significant more than 800 ms before button press (Figure 2C). Given the sluggish nature of the pupil response (see above), the underlying central arousal transients must have occurred even earlier than that, leaving substantial time for shaping the decision outcome.

Second, there was no robust association between baseline pupil diameter and decision bias (Figure 2—figure supplement 1A–D). This ruled out possible concerns that the effect might be due to corresponding (opposite) associations between baseline pupil diameter and behavior, ‘inherited’ by TPR through its negative correlation with baseline pupil diameter (de Gee et al., 2014).

Third, the effect of TPR on decision bias was robust with respect to the details of the analysis approach. For Figure 2, as for all other analyses reported in the main text, we removed (via linear regression) components explained by RT. The rationale was to specifically isolate variations in the amplitudes of the neural responses driving TPR, irrespective of RT, variations of which might also cause variations of TPR amplitude without changes in the underlying neural response amplitudes (for details see Materials and methods). We observed the same linear effect of TPR on bias without removing trial-to-trial variations in TPR that were due to RT (Figure 2—figure supplement 1E–J).

Pupil-linked bias reduction is a general phenomenon

Fourth, the effect of TPR on decision bias shown in Figure 2 generalized to other perceptual choice tasks, which differed on several dimensions from the main contrast detection task used in this paper (Figure 3). In one follow-up experiment, we measured pupil-linked behavior during an auditory yes-no (tone-in-noise) detection task near psychophysical threshold using the same stimuli as in (McGinley et al., 2015a) (see Materials and methods). The only visual stimulus was a stable fixation dot. The decision interval contained only auditory noise (the same as in (McGinley et al., 2015a)) on half the trials, and a pure sine wave superimposed onto the noise on the other half of the trials. Again, TPR predicted a significant (linear) reduction in conservative decision bias, and an increased tendency to respond ‘yes’ (Figure 3A,B). TPR also exhibited a non-monotonic relationship with sensitivity, as observed in rodents for baseline pupil diameter in (McGinley et al., 2015a).

Figure 3

Download asset Open asset

Arousal-linked bias reduction generalizes to other choice tasks.

(A) Perceptual sensitivity (d’; left) and decision bias, measured as criterion (middle) or fraction of ‘yes’-choices (computed as for Figure 2A, right), for low and high TPR. Data points, individual subjects. (B) Relationship between TPR and d’ or criterion (5 bins). Linear fits were plotted wherever the first-order fit was superior to the constant fit (see Materials and methods). Quadratic fits were plotted wherever the second-order fit was superior to first-order fit. (C) Perceptual sensitivity (d’, left) and decision bias, measured as absolute criterion (middle) or fraction of non-preferred choices (right), for low and high TPR. For the fraction of non-preferred choices analysis, we ensured that each TPR bin consisted of an equal number of motion up and down trials (see Materials and methods). (D) Relationship between TPR and d’ or absolute criterion (4 bins instead of 5, because of fewer trials per subject, see Materials and methods). All panels: group average (N = 24 and N = 15); shading or error bars, s.e.m.; stats, permutation test.

https://doi.org/10.7554/eLife.23232.010

Figure 3—source data 3 Table with variable identifiers used in Figure 3—source data 1 and 2.: https://doi.org/10.7554/eLife.23232.011
Download elife-23232-fig3-data3-v3.txt
Figure 3—source data 1 This csv table contains the data for Figure 3 panel A.: https://doi.org/10.7554/eLife.23232.012
Download elife-23232-fig3-data1-v3.csv
Figure 3—source data 2 This csv table contains the data for Figure 3 panel C.: https://doi.org/10.7554/eLife.23232.013
Download elife-23232-fig3-data2-v3.csv

Another follow-up experiment assessed whether the pupil-linked bias reduction observed above may have been due to the asymmetric nature of the detection tasks (i.e., discriminating the presence from the absence of a signal) or due to the absence of single-trial feedback. Symmetric two-alternative forced choice tasks are commonly associated with weaker biases than yes-no detection tasks (Green and Swets, 1966). We used a symmetric visual random dot motion (up vs. down) discrimination task near psychophysical threshold with feedback after each trial (see Materials and methods). Although many subjects exhibited clear biases for reporting one or the other direction, these were more evenly distributed around zero than in the above yes-no tasks, in which the sign of the bias was largely consistent across individuals. Therefore, we here analyzed subjects’ absolute criterion values (i.e., overall bias regardless of sign) and fraction of non-preferred choices (i.e., the choice opposite to their general bias, irrespective of TPR). Again, TPR predicted a reduction in absolute decision bias, and an increase in the fraction of non-preferred choices (Figure 3C,D), analogous to the effects observed for the detection tasks above.

In sum, a number of analyses and experiments showed that pupil-linked, phasic arousal was consistently associated with a monotonic reduction in perceptual decision biases in different sensory modalities and task protocols.

Phasic arousal predicts a reduction of evidence accumulation bias

To further pinpoint the nature of the TPR-induced bias suppression, we fitted the drift diffusion model, an established dynamic model of two-choice decision processes (Figure 4A; [Ratcliff and McKoon, 2008]) to subjects’ RT distributions from the main task (contrast detection). The drift diffusion model posits the perfect accumulation of noisy sensory evidence towards one of two decision bounds, here for ‘yes’ and ‘no’ (Figure 4A).

Figure 4 with 2 supplements see all

Download asset Open asset

Phasic arousal predicts reduction of accumulation bias.

(A) Schematic and simplified equation of drift diffusion model accounting for RT distributions for ‘yes’- and ‘no’-choices (‘stimulus coding’; see Materials and methods). Notation: dy, change in decision variable y per unit time *dt; v•dt*, mean drift (multiplied with 1 for signal+noise trials, and −1 for noise trials); *dc•dt*, drift criterion (an evidence-independent constant added to the drift); and *cdW,* Gaussian white noise (mean = 0, variance = c² dt). (B) RT distributions of one example subject for ‘yes’- and ‘no’-choices, separately for signal+noise and noise trials and separately for low and high TPR. RTs for ‘no’-choices were sign-flipped for illustration purposes. Straight lines, mode (i.e., maximum) of the fitted RT distributions. Please note that TPR predicts an increased fraction of ‘yes’-choices with only a minor change of the mode of the RT distribution, consistent with a drift criterion effect rather than a starting point effect (Figure 4—figure supplement 1). (C) Group-level posterior probability densities for means of parameters. To maximize the robustness of parameter estimates (Wiecki et al., 2013), two data sets were fit jointly (the current fMRI and our previous study (de Gee et al., 2014); N = 35). Starting point (z) is expressed as a proportion of the boundary separation (a). (D) Drift criterion point estimates for low and high TPR trials, separately for both data sets (N = 14 and N = 21, respectively). Data points, individual subjects; stats, permutation test. (E) Change in fraction of ‘yes’-choices for low vs. high TPR trials, plotted against change in drift criterion. Data points, individual subjects.

https://doi.org/10.7554/eLife.23232.014

Figure 4—source data 2 Table with variable identifiers used in Figure 4—source data 1.: https://doi.org/10.7554/eLife.23232.015
Download elife-23232-fig4-data2-v3.txt
Figure 4—source data 1 This csv table contains the data for Figure 4 panel D.: https://doi.org/10.7554/eLife.23232.016
Download elife-23232-fig4-data1-v3.csv

We fitted the model separately for low and high TPR trials (see Figure 4B for an individual example). Within the model, the TPR-induced reduction of conservative bias, evident in Figures 2 and 3, may have been brought about by two distinct mechanistic scenarios: (i) the evidence accumulation process started from a level closer to the ‘yes’-bound (i.e., a change in the ‘starting point’ parameter); or (ii) the accumulation process was driven more towards the ‘yes’-bound (i.e., a change in the ‘drift criterion’ parameter). The drift criterion is equivalent to an evidence-independent constant added to the drift. A non-zero drift criterion results in a bias of the decision variable that grows linearly with time. Although clearly distinct in nature, both mechanisms (starting point and drift criterion) would have resulted in an increase in the fraction of ‘yes’-choices, and thus a reduction of decision bias. Critically, both mechanisms were distinguishable through their distinct effects on the shape of the RT distribution (Figure 4—figure supplement 1). To dissociate between these alternative mechanisms we fitted the model, while allowing several model parameters (boundary separation, non-decision time, mean drift rate, starting point, and drift criterion) to vary with TPR.

The model fits (see Materials and methods and [Wiecki et al., 2013]) supported the second mechanism: a change in drift criterion. An individual example is shown in Figure 4B, and group data are shown in Figure 4C. Drift criterion was generally negative, indicating an overall conservative accumulation bias towards the bound for ‘no’-choices. But drift criterion was pushed closer towards zero under high TPR, indicating an unbiased drift, as optimal for the current task (Figure 4B,C). The other main parameters (including starting point and mean drift rate) were not significantly affected by TPR. The TPR-linked effect on drift criterion was also evident in the individual point estimates from the fMRI sample only (Figure 4D).

Again, we we found no evidence for an effect on any parameter of the drift diffusion model when comparing trials with low and high baseline pupil diameters (Figure 4—figure supplement 2A), and we obtained qualitatively identical results without removing trial-to-trial variations of RT from the TPR amplitudes (Figure 4—figure supplement 2B–D; Materials and methods).

As a control of the significance of the TPR-dependent effect on drift criterion, we re-fitted the model, but now fixing drift criterion with TPR, while still allowing all other of the above parameters to vary with TPR. In this variant of the model, we again found no TPR-dependent change in any of the other parameters (boundary separation: p=0.428; non-decision time: p=0.370; starting point: p=0.117; mean drift rate: p=0.361). Critically, model comparison favored the complete version of the model with TPR-dependent variation in drift criterion (deviance information criterion, 50437 vs. 50528, respectively; see Materials and methods). This implies that the TPR-dependent variability in accumulation bias was essential to account for the TPR-dependent effects on behavior.

The individual changes in drift criterion between low vs. high TPR trials established by means of diffusion modeling accounted for a substantial fraction of the individual differences in TPR-predicted changes in the fraction of ‘yes’-choices (Figure 4E) obtained in the model-free analyses (Figure 2A,D, right panels). TPR-related changes in starting point had a weaker, and statistically not significant, effect on the fraction of ‘yes’-choices (fMRI data set: r = −0.345, p=0.227; de Gee et al. (2014) data set: r = −0.419, p=0.059).

In sum, in the decision task studied here, pupil-linked, phasic arousal predicted a reduction of conservative bias, specifically in the evidence accumulation, and was neither reflected in the baseline level of the decision variable at the start of the accumulation nor its mean drift. In other words, TPR accounted for a portion of the trial-to-trial variability in the drift unrelated to the objective sensory evidence. This correlate of phasic arousal at the algorithmic level was in line with the notion that phasic arousal shapes decision outcome by interacting with the evidence accumulation computation that lies at the heart of the decision process.

Taken together, the behavioral modeling results reported in Figures 2–4 put strong constraints on the expected changes in cortical decision processing due to phasic arousal. Specifically, changes in the encoding of the incoming evidence by sensory cortical areas, as observed in previous work on fluctuations in baseline arousal levels (McGinley et al., 2015a; Reimer et al., 2014; Vinck et al., 2015), would be associated with changes in perceptual sensitivity. However, we found that TPR was not associated with any robust change in sensitivity (measured as d’ or as mean drift rate) in the fMRI dataset, thus, predicting no TPR-linked modulation of sensory responses in visual cortex. Instead, the observed effect of TPR on choice bias (criterion, drift criterion) predicted a directed shift (towards ‘yes’) in neural signals encoding subjects’ choices, in downstream cortical regions. We next tested these predictions by assessing the relationship between TPR and (i) stimulus-specific responses in early visual cortex, and (ii) choice-specific responses in downstream cortical regions.

Phasic arousal does not boost sensory responses in visual cortex

The fMRI response in early visual cortex (areas V1, V2, and V3) during near-threshold visual tasks is made up of distinct components, including a (weak and focal) stimulus-specific component and a (large and global) task-related, but stimulus-independent, component (Cardoso et al., 2012; Donner et al., 2008; Ress et al., 2000). We used an approach based on multi-voxel pattern analysis analogous to previous work (Choe et al., 2014; Pajani et al., 2015) to isolate the stimulus-specific response component. Because the majority of visual cortical neurons encoding stimulus contrast are also tuned to stimulus orientation, orientation-tuning could serve as a ‘filter’ to separate the cortical stimulus response from stimulus-unrelated signals. Specifically, the low contrast signal in our task should have evoked a small response in each visual cortical neuron selective for the orientation of the target signal (45° or 135°, on different experimental runs, Figure 1A) across a substantial part of the retinotopic map. Thus, the presence or absence of the target signal should be reliably encoded in the orientation-specific component of the cortical population response, within the retinotopic sub-region corresponding to the signal. We first individually delineated these retinotopic sub-regions within each of V1-V3 (see Figure 5A for an example subject) and then quantified the orientation-specific response component therein as the spatial correlation of multi-voxel response patterns with an orientation-specific ‘template’ (Materials and methods).

Figure 5 with 1 supplement see all

Download asset Open asset

Phasic arousal does not boost sensory responses in visual cortex.

(A) Map of fMRI responses during stimulus localizer runs (see Materials and methods); example subject. V1-V3 borders were defined based on a separate retinotopic mapping session. ‘Stimulus sub-regions’, regions with positive stimulus-evoked response; ‘surround sub-regions’, regions with negative stimulus-evoked response. (B) Orientation-specific fMRI responses in ‘center’ sub-regions of V1-V3, separately for signal+noise and noise trials, and separately for low and high TPR trials. Statistical tests are reported in main text. Data points, individual subjects (N = 14); stats in main text.

https://doi.org/10.7554/eLife.23232.019

As expected, this orientation-specific response component differed robustly between signal+noise and noise trials (Figure 5B). A 2-way repeated measures ANOVA with factors stimulus and TPR bin yielded a highly significant main effect of stimulus for V1, V2, and V3 (V1: F_1,13 = 303.5, V2: F_1,13 = 646.3, V3: F_1,13 = 316.6; all p<0.001).

The orientation-specific response component also reliably discriminated between signal+noise and noise trials on a single-trial basis (Figure 5—figure supplement 1). Consequently, we henceforth refer to this component as the ‘stimulus-specific response’. However, the stimulus-specific response was not boosted under high TPR (Figure 5B, no significant main effect of TPR, nor stimulus x TPR interaction in any of V1-V3).

No evidence for arousal-dependent boost of sensory responses in any cortical area

The above analysis focused on the stimulus-specific response in early visual cortex. To avoid missing TPR-dependent modulations of sensory responses in higher cortical regions, we also mapped out modulations of fMRI responses by TPR across cortex (see Materials and methods). Various regions including visual, parietal, prefrontal, and motor cortices exhibited robust task-evoked overall fMRI responses (i.e., difference between the decision interval and baseline; Figure 6A), as well as robust modulations by TPR (Figure 6B), whereby TPR-induced boosts only partly overlapped with the task-positive responses.

Figure 6

Download asset Open asset

Cortex-wide fMRI correlates of phasic arousal and stimulus.

(A) Functional map of task-evoked fMRI responses computed as the mean across all trials. (B) As panel A, but for the contrast high vs. low TPR trials. (C) As panel A, but for the contrast signal+noise vs. noise. (D) As panel A, but for the interaction between TPR (2 levels) and stimulus (2 levels). All panels: functional maps are expressed as t-scores computed at the group level (N = 14) and presented with cluster-corrected statistical threshold (see Materials and methods).

https://doi.org/10.7554/eLife.23232.021

However, in no single region did the overall fMRI responses differ between signal+noise and noise trials (Figure 6C). This indicates that our multi-voxel pattern approach described above was, in fact, essential for detecting the weak cortical response to the near-threshold target signals. Critically, in no region did we find a significant interaction between the factors stimulus (signal+noise vs. noise) and TPR (low vs. high TPR; Figure 6D).

Taken together, both complementary analyses showed that phasic, task-evoked arousal signals did not modulate cortical responses encoding the presence of the low-contrast signal. This is in line with the lack of TPR-linked change in perceptual sensitivity in the fMRI dataset (Figure 2A, Figure 4D).

Phasic arousal modulates choice-specific signals in frontal and parietal cortex

We then sought to test for directed shifts in neural signals encoding subjects’ choices under high TPR, which would be in line with the changes in decision biases identified by behavioral modeling. Here, we use the term ‘choice-specific’ to refer to fMRI-signals that reliably discriminated between subjects’ choice (‘yes’ vs. ‘no’). Two complementary approaches delineated several cortical regions that exhibited such choice-specific signals (Figure 7). The first approach (Figure 7A) was based on the lateralization of fMRI responses with respect to the motor effector used to report the choice (i.e., response hand; see (de Lange et al., 2013; Donner et al., 2009) and Materials and methods). In addition to the hand area of primary motor cortex (henceforth referred to as M1), this approach yielded reliable effector-specific lateralization also in two regions of posterior parietal association cortex: the junction of the intraparietal and postcentral sulcus (IPS/PostCeS) and the anterior intraparietal sulcus (aIPS1; Figure 7A and Figure 7—figure supplement 1A,B). The second approach (Figure 7B) was based on multi-voxel pattern classification of choice, using a ‘searchlight’ procedure that scanned the entire cortex for choice information (see (Hebart et al., 2012, 2016) and Materials and methods). The underlying rationale was to identify cortical regions encoding choice in other formats (e.g., in terms of more fine-grained patterns) than the hemispheric lateralization of response amplitudes. The second approach revealed robust (and reproducible) choice-specific response patterns in a number of additional regions in bilateral posterior parietal cortex and (right) prefrontal cortex: superior and inferior parietal lobule (SPL and IPL, respectively), a second region within aIPS (aIPS2), posterior insula (pIns), the junction of precentral sulcus and right inferior frontal gyrus (PreCeS/IFG) and right medial frontal gyrus (MFG; Figure 7B and Figure 7—figure supplement 1C,D). In both approaches, choice specific regions were delineated after factoring out the physical stimulus (see Materials and methods).

Figure 7 with 1 supplement see all

Download asset Open asset

Phasic arousal predicts change of cortical decision signals.

(A) Conjunction of session-wise maps of logistic regression coefficients of choice against fMRI lateralization (see Figure 7—figure supplement 1A for individual sessions). Tested against 0.5 at group level; red outlines, ROIs used for further analyses. (B) Conjunction of session-wise maps of searchlight choice classification precision scores (see Figure 7—figure supplement 1C for individual sessions). Tested against 0.5 at group level; red outlines, ROIs used for further analyses. (C) Choice-predictive indexes for choice-specific responses (‘yes’ vs. ‘no’, irrespective of stimulus; see Materials and methods and Figure 7—figure supplement 1G). Dashed line, index for M1, which can be regarded as a reference given the measurement noise. Data points, individual subjects. (D) Choice-specific responses, obtained through mapping lateralization (M1 and the combined ‘lateralization signal’, i.e., regions from Figure 7A excluding M1; see Materials and methods) and through searchlight classification (combined ‘searchlight signal’, i.e., all regions from Figure 7B), for low and high TPR trials. Data points, individual subjects. (E) Correlation between TPR and M1 (left), or the combined ‘lateralization signal’ (middle), or the combined ‘searchlight signal’ (right) (5 bins). In all cases, the effect of the physical stimulus was removed (see Materials and methods). Shading or error bars, s.e.m. All panels: group average (N = 14); stats, permutation test.

https://doi.org/10.7554/eLife.23232.022

In all the above choice-encoding regions, responses (estimated in a cross-validated fashion, see Materials and methods) reliably differentiated between ‘yes’- and ‘no’-choices – both on average (Figure 7—figure supplement 1E,F) and at the single-trial level (Figure 7C, see also Figure 7—figure supplement 1G). As expected, the single-trial reliability of the choice-specific responses differed between cortical regions (1-way repeated measures ANOVA with factor region of interest (9 levels): F_8,104 = 30.20, p<0.001), with the strongest reliability for M1 (dashed horizontal line in Figure 7C), the region closest to the subjects’ motor output.

For analysis of the association with TPR, we pooled the choice-specific signals of these different regions into three groups (Figure 7—figure supplement 1A): the motor end stage of the decision process M1, the combined ‘lateralization signal’ (i.e., regions from Figure 7A excluding M1), and the combined ‘searchlight signal’ (i.e., all regions from Figure 7B). Critically, as predicted, the combined choice-specific signals, but not the M1 response, were significantly pushed towards the ‘yes’-choice (i.e., more positive in Figure 7D) for high compared to low TPR. The effect of TPR differed by cortical signal (2-way repeated measures ANOVA with factors signal type (3 levels) and TPR bin (2 levels); interaction: F_2,26 = 7.30, p=0.003). Specifically, the difference of the choice-specific signals between low and high TPR was significantly larger for the combined lateralization signal and the combined searchlight signal than for M1 (combined lateralization signal vs. M1: p=0.015; combined searchlight signal vs. M1: p=0.004; permutation tests).

Because subjects’ mean accuracy was about 74% correct, their choices were partially correlated with the physical stimulus (i.e., signal+noise vs. noise trials). Consequently, the choice-specific cortical responses were also (weakly) predictive of the stimulus (Figure 7—figure supplement 1H). To isolate variations in the amplitude of the choice-specific response that were independent of the stimulus, we removed (via linear regression) components explained by the stimulus and quantified the effect of TPR on the residual choice-specific cortical signals. Fitting the linear model to the combined choice-specific responses yielded highly significant TPR coefficients, for both the combined lateralization and combined searchlight signals (Figure 7E, middle and right panel). By contrast, the TPR-linked modulation was absent in the end stage region M1 (Figure 7E, left panel).

In sum, a number of fronto-parietal cortical regions exhibited signals that reliably encoded subjects’ behavioral choice and were robustly modulated by phasic arousal, with a larger tendency towards the ‘yes’-choice under high TPR. This was true even when factoring out the effect of the sensory evidence (i.e. presence of the target signal).

Task-evoked pupil response are predicted by responses in a network of brainstem centers

Finally, we aimed to identify brainstem regions whose task-evoked responses were (i) linked to the trial-to-trial fluctuations of TPR, and (ii) accounted for the trial-to-trial modulation of subjects’ evidence accumulation bias, and the resulting tendency to choose ‘yes’. Previous work from monkey physiology has implicated three brainstem nuclei in particular in the control of TPR: the locus coeruleus (LC), the inferior colliculus (IC), and the superior colliculus (SC), respectively (Joshi et al., 2016; Varazzani et al., 2015; Wang et al., 2012). Here, we exploited the wide coverage of our fMRI measurements to concurrently monitor responses across a wider brainstem network, including a number of other nuclei implicated in central arousal: the dopaminergic substantia nigra (SN) and ventral tegmental area (VTA), as well as the (partly) cholinergic basal forebrain (BF). We further sub-divided the BF region into the part including cell groups within the septum and the horizontal limb of the diagonal band (BF-sept) and the sublenticular part (BF-subl). BF-subl contains cholinergic neurons with widespread ascending projections (Zaborszky et al., 2008), which are involved in the regulation of cortical arousal state (Lee and Dan, 2012; McGinley et al., 2015b). Our analysis approach minimized the effect of physiological noise on the brainstem fMRI responses, including removal of the fourth ventricle signal (see Materials and methods). We also verified that the fourth ventricle signal was unrelated to TPR (Figure 8—figure supplement 1D,E). The LC region of each subject was delineated through independent structural scans (Figure 8A, and Figure 8—figure supplement 1A; for details see Materials and methods).

Figure 8 with 1 supplement see all

Download asset Open asset

Pupil responses reflect responses of a network of brainstem nuclei.

(A) Delineation of LC by structural scan. The LC corresponds to two hyper-intense spots; example subject (see Figure 8—figure supplement 1 for all subjects). Left inset, magnification of yellow box with LC ROI. Right inset, three-dimensional representation of signal intensity levels in yellow box. (B) Task-evoked LC responses for low and high TPR. Red bar, high TPR time course significantly different from zero; green bar, high TPR time course significantly different from low TPR time course (p<0.05; cluster-corrected). Grey box, time window for computing scalar response amplitudes. (C) As panel B, but split by signal+noise and noise trials. (D) As panel B, but for the 2 voxels with highest probability of containing the LC. (E) As panel B, but for SN, VTA, and two BF-ROIs. (F) Map of single-trial correlation between TPR and evoked fMRI responses (tested against 0 at group level). Yellow outlines, brainstem nuclei from probabilistic atlases. (G) Matrix of correlations between evoked brainstem fMRI responses. Stats corrected with false discovery rate (FDR). (H) Partial correlation of evoked fMRI responses and TPR. For each ROI, responses of all other ROIs were first removed via linear regression. (I) Correlation between fMRI responses in ACC and TPR and LC. All panels: group average (N = 14); shading, s.e.m.; data points, individual subjects; stats, permutation test.

https://doi.org/10.7554/eLife.23232.024

The LC region exhibited a robust positive response on high TPR trials and a trend towards deactivation on low TPR trials (Figure 8B–D, and Figure 8—figure supplement 1C). The same pattern was evident for both signal+noise and noise trials separately (Figure 8C). The association to TPR was also highly significant in the most spatially specific definition of the LC region afforded by our measurements: evaluating only the two fMRI voxels with the largest probability of containing the individual LC region (Figure 8D, and see Materials and methods). Fluctuations of task-evoked fMRI responses measured in the LC were also robustly coupled to fluctuations in TPR amplitude at the single trial level (Figure 8F,H).

Similar to the LC region, we found a robust difference between low and high TPR conditions for fMRI responses in the SC and VTA regions (Figure 8E,F, and Figure 8—figure supplement 1B,C). Mapping the trial-to-trial correlations between TPR and brainstem fMRI responses at the single-voxel level yielded robust coupling to TPR in the LC, SC, VTA and as well as in BF-subl regions (Figure 8F).

As expected from the anatomical connectivity between brainstem centers (España and Berridge, 2006; Sara, 2009; Wang and Munoz, 2015), the trial-to-trial fluctuations of the task-evoked responses were significantly correlated among a number of these brainstem nuclei (Figure 8G). Removing components of the trial-to-trial fluctuations in TPR and fMRI responses shared with the other ROIs yielded significant residual (i.e., partial) correlations between TPR and responses in SC, LC region, VTA and BF-subl (Figure 8H). This indicates robust and unique contributions of these four nuclei to TPR.

Phasic brainstem responses during decision tasks might be driven by top-down signals from anterior cingulate cortex (ACC), which sends descending projections to the LC (Aston-Jones and Cohen, 2005) and other brainstem nuclei. In line with this notion, trial-to-trial fluctuations of both LC responses and TPR were robustly correlated to trial-to-trial fluctuations of task-evoked responses of the ACC (Figure 8I).

Task-evoked responses in neuromodulatory centers, but not the colliculi, predict suppression of evidence accumulation bias

The task-evoked responses in the neuromodulatory nuclei, but not the colliculi, were tightly linked to the inferred decision computation and subjects’ overt choice behavior. We computed the combined ‘neuromodulatory brainstem signal’ as the linear combination of responses from LC, VTA, SN, and BF that maximized the correlation to TPR (Materials and methods; correlation coefficient across subjects, 0.146 (±0.014 s.e.m.)). The amplitude of this combined signal predicted a significant reduction in conservative decision bias (Figure 9A), and an increased tendency to choose ‘yes’ (Figure 9B), but no change in sensitivity (Figure 9—figure supplement 1A). This pattern of effects was absent for the combined ‘colliculi signal’ (Figure 9A,B), a linear combination of responses from SC and IC that maximized the correlation to TPR (correlation coefficient across subjects, 0.092 (±0.011 s.e.m.)). Further, the trial-to-trial variations in the strength of the combined neuromodulatory (but not colliculi) response robustly pushed the trial-to-trial drift towards the ‘yes’-boundary, in effect reducing the overall negative drift criterion (Figure 9D, see Materials and methods for details).

Figure 9 with 1 supplement see all

Download asset Open asset

Brainstem neuromodulatory nuclei predict reduction of choice bias.

(A) Correlation between decision bias (criterion) and the combined neuromodulatory brainstem signal (linear combination of responses in LC, SN, VTA, BF-sept, and BF-subl maximizing the correlation to TPR; see Materials and methods; left), and the combined colliculi signal (linear combination of responses in SC and IC maximizing the correlation to TPR; right) (5 bins). Stats, permutation test. (B) As panel A but for the correlation to fraction of ‘yes’-choices. (C) Group-level posterior probability densities for means of parameters in the DDM regression model, through which we assessed the trial-by-trial, linear relationship between single-trial drift and the combined neuromodulatory response or the combined colliculi response (see Materials and methods; see Figure 9—figure supplement 1 for the remaining parameters ‘starting point’, ‘boundary separation’ and ‘non-decision time’). All panels: group average (N = 14); shading or error bars, s.e.m.

https://doi.org/10.7554/eLife.23232.026

In sum, trial-to-trial fluctuations in TPR were predicted by fluctuations in the task-evoked responses of a network of brainstem regions, most notably the LC, VTA and SC. Despite the expected coupling between these and other brainstem regions (Figure 8G), TPR carried robust LC-, SC-, and (less strongly) VTA-specific components (Figure 8H). But only the responses of the neuromodulatory ROIs, not of the colliculi, accounted for the concomitant reduction of the bias in evidence accumulation and the resulting behavioral choice patterns. These results establish a tight link between phasic neuromodulator release and the dynamics of evidence accumulation.

Discussion

Intrinsic variability in the face of uncertain evidence is a pervasive feature of decision-making (Glimcher, 2005; Gold and Shadlen, 2007; Shadlen et al., 1996; Sugrue et al., 2005; Wyart and Koechlin, 2016). Most current models of choice treat this intrinsic behavioral variability as a nuisance to be accounted for by additional ‘noise parameters’ (Bogacz et al., 2006; Ratcliff and McKoon, 2008). Other theories have proposed that the behavioral variability may be due to hidden, but systematic, biases in the decision process (Beck et al., 2012; Wyart and Koechlin, 2016). Here, we present evidence that helps reconcile these ideas. We found that a significant component of choice variability was explained by trial-to-trial variations in the amplitude of task-evoked, pupil-linked arousal responses. Specifically, pupil-linked arousal responses accounted for trial-to-trial variations in the bias of the evidence accumulation process as well as decision-related cortical population signals: under large phasic arousal conservative biases were reduced. The implication is that, without monitoring arousal responses, the associated, systematic variations in accumulation bias would appear as random trial-to-trial variability in the accumulation process (i.e., drift). Going further, we established that the dynamic bias suppression was explained by responses in a network of neuromodulatory brainstem systems controlling cortical arousal state. Taken together, our results are consistent with a scenario in which phasic neuromodulatory activity during decision-making optimizes choice behavior through a suppression of maladaptive biases in the evidence accumulation process.

Challenges and limitations of brainstem fMRI

Imaging the brainstem with fMRI is challenging (Astafiev et al., 2010; Beissner, 2015; Brooks et al., 2013; Forstmann et al., 2017) because this region is prone to physiological noise artifacts (Brooks et al., 2013), and brainstem nuclei tend to be small relative to the spatial resolution of standard fMRI measurements. For example, although the adult human LC is an elongated structure of approximately 15 mm length along the rostro-caudal axis, its diameter is only a few millimeters, as assessed by high-resolution MRI (Figure 8A and Figure 8—figure supplement 1A) (Keren et al., 2009, 2015). Our study addressed these challenges by following the recommendations of Eckert and colleagues (Eckert et al., 2010): We (i) delineated the LC in each brain, based on individual (neuromelanin-sensitive) structural MRI scans; (ii) performed fMRI tailored to the anatomical layout of the LC while maximizing functional signal-to-noise ratio (SNR), by using an in-plane spatial resolution of 2 × 2 mm and 3 mm thick slices that were oriented perpendicular to the longitudinal extent of the LC; (iii) performed no spatial smoothing of these functional data; and (iv) rigorously removed measured cardiac and respiratory signal components, as well as residual fourth ventricle signal, which have been identified as a major source of uncertainty regarding previous fMRI work on the LC (Astafiev et al., 2010). The resulting time-course of task-evoked fMRI responses exhibited the standard features of hemodynamic responses (Figure 8B–E), and correlations to pupil responses that are largely consistent with single-unit physiology in monkeys (see below). Taken together, the brainstem responses in Figures 8 and 9 likely reflect true neural signal from brainstem nuclei, rather than physiological noise. However, there is some inevitable uncertainty regarding the spatial specificity of our measurements. Due to the lower spatial resolution of fMRI images, the co-registration between functional and structural images, and the point spread of the hemodynamic response, each fMRI voxel is likely to sample activity from brain tissue neighboring the nuclei depicted as the regions of interest (e.g., LC). Consequently, we do not conclude that the LC responses in Figure 8B–D reflect the activity of noradrenergic neurons only; such a conclusion would require single-unit measurements. The focus of our conclusions instead lies on the distribution of pupil and behavioral correlations across different brainstem structures, which provides an important complement to targeted single-unit measurements.

Brainstem correlates of pupil dilation and decision bias

Despite the above-mentioned limitations, the overall distribution of pupil-linked brainstem responses shown in Figure 8F meaningfully follows the outlines of key candidate structures, in a fashion that is largely consistent with monkey physiology, and a previous human study on fMRI correlates of fluctuations in baseline pupil diameter (Murphy et al., 2014a). Our approach also identified so-far unknown effects. Previous monkey physiology has established significant coupling of pupil responses to responses of the LC, SC, and IC (Joshi et al., 2016; Varazzani et al., 2015; Wang et al., 2012), but not yet for dopaminergic and cholinergic structures (i.e., SN, VTA, and BF). The ability to monitor all of the above brainstem regions at once enabled quantification of their trial-to-trial correlation structure, and hence isolating the contributions that were unique to each region. This revealed that (i) many brainstem nuclei co-fluctuated during the decision task and that (ii) not only the LC and SC, but also the VTA and sublenticular part of the BF each made robust and specific contributions to task-evoked pupil dilations, over and above those shared with other brainstem centers (Figure 8H). Thus, the noradrenergic, cholinergic and dopaminergic systems are all phasically, and to some extent independently, recruited during challenging decision tasks, and jointly shape the concomitant changes in arousal state. Our findings provide a basis for a more comprehensive neurophysiological interpretation of results from cognitive pupillometry studies in humans.

Most importantly, we also established that only a subset of those brainstem nuclei exhibiting robust correlations with pupil responses were also predictive of the trial-by-trial suppression in decision bias. The latter effect was solely accounted for by responses in the (noradrenergic, dopaminergic, and cholinergic) neuromodulatory nuclei with diffuse projections to cortex, but not by responses in the superior or inferior colliculi. This indicates that the phasic release of neuromodulators in the brain, possibly a combination of different neuromodulators, is key for behavioral correlates of phasic arousal identified here.

Phasic versus tonic arousal effects

A number of recent studies have characterized the relationship between tonic arousal levels (measured through baseline pupil diameter) and cortical state (McGinley et al., 2015a; Reimer et al., 2014; Vinck et al., 2015; Warren et al., 2016). Other studies have characterized the relationship between tonic arousal levels and behavioral performance (McGinley et al., 2015a; Murphy et al., 2014b). The comparison between this previous work and ours points to possible differences between the functional correlates of tonic arousal levels and phasic, task-evoked changes in arousal. We found that phasic, task-evoked arousal responses were primarily linked to decision bias, at both, the algorithmic and cortical levels. By contrast, the above studies of tonic arousal levels have revealed effects on the quality of sensory cortical responses and behavioral sensitivity to sensory evidence (McGinley et al., 2015b). While we also found some evidence for non-monotonic (inverted U-shape) relationships between phasic arousal and sensitivity, the dominant and most consistent link was a monotonic and approximately linear relationship between phasic arousal and decision bias. Candidate factors accounting for these apparent differences between the functional correlates of phasic and tonic arousal might be the dynamics of the underlying neuromodulatory effects on cortical circuits, or the different combination of neuromodulatory systems involved. It will be instrumental to track TPR-linked changes in brainstem and cortical state in real time in future work.

Post-decisional versus intra-decisional drive of phasic arousal

One account holds that the phasic arousal signals (specifically, phasic responses of the noradrenergic LC) are triggered by the bound-crossing in one of the cortical accumulator circuits; the resulting transient and cortex-wide neuromodulator release then facilitates the translation of the choice into a motor act (Aston-Jones and Cohen, 2005). An alternative idea (Dayan and Yu, 2006), supported by indirect evidence (Cheadle et al., 2014; de Gee et al., 2014), is that arousal systems are already recruited before the bound-crossing, throughout the evidence accumulation process. In line with the latter notion, we found that task-evoked pupil responses are driven most strongly by a sustained central input throughout decision formation, not only after commitment to a choice. This finding has potentially important implications for the functional role of phasic arousal in decision processing. The finding indicates that at least one of the brainstem nuclei linked to pupil responses was, likewise, activated in a sustained fashion throughout decision formation. The resulting neuromodulatory transients might alter the state of brain regions involved in decision computations as the decision unfolds, provided that the accumulation operates on timescales of seconds or longer. Because the tasks used in previous animal physiology studies of task-related LC responses involved much faster decision processes than the one studied here (reaction times of about 0.5 s vs. 2 s, respectively), it remains unknown whether the more sustained, task-evoked responses also occur in noradrenergic neurons (but see [Varazzani et al., 2015]). Sustained responses encoding reward uncertainty have been observed in dopaminergic neurons in the VTA (Fiorillo et al., 2003), one of the structures whose task-evoked responses predicted pupil responses. Future electrophysiological studies should determine the time course of task-related activation in the different nuclei of the brain’s arousal network during sensory-motor decisions involving protracted evidence accumulation (Nomoto et al., 2010).

How do diffuse neuromodulatory signals translate into specific effects on choice behavior?

One notable aspect of our findings is that the functional correlates of neuromodulatory responses were specific for a particular choice option (see Figure 9). Also in the context of learning, the interplay between pupil-linked arousal and competitive cortical circuitry has been found to translate into specific effects on cognition and behavior (Eldar et al., 2013).

A scenario consistent with our results is that phasic neuromodulator release alters the relative strength of information flow between cortical processing stages, suppressing ‘top-down’ relative to ‘bottom-up’ signals (Friston, 2010; Gil et al., 1997; Hsieh et al., 2000; Kimura et al., 1999; Kobayashi et al., 2000). In perceptual decisions like the ones studied here, early sensory cortices provide bottom-up sensory likelihood signals, while top-down signals might encode prior beliefs (Friston, 2010; Pouget et al., 2013). Thus, through a relative suppression of ‘top-down’ signal flow, phasic arousal might reduce the weight of the prior (reflecting subjects’ intrinsic bias) relative to the likelihood. Specifically, in our yes-no task, the prior may have been a conservative bias for choosing ‘no’. Reducing its weight would reduce this bias. Such an increase in the relative weight of bottom-up signals might be implemented by synaptic gain modulation through neuromodulators. This gain modulation, in turn, might depend on the precision (inverse of uncertainty) of incoming sensory data (Friston, 2010; Moran et al., 2013).

The above scenarios postulate a uni-directional effect of neuromodulatory transients on cortical decision computations. However, this interaction may also be bi-directional, with trial-to-trial fluctuations of cortical decision signals driving fluctuations of phasic arousal responses (Aston-Jones and Cohen, 2005; Dayan and Yu, 2006). Specifically, phasic LC responses may be driven by specific cortical regions (e.g., the ACC), which compute the ratio of the posterior probability of target presence over the (estimated) prior probability of target occurrence (Dayan and Yu, 2006). The resulting phasic norepinephrine release across cortex might reset cortical networks (Bouret and Sara, 2005) and interrupt the (default) state encoding the prior (Dayan and Yu, 2006). In a yes-no task such as ours, a tendency towards the ‘no’-option may correspond to the default state for conservative subjects, and a phasic arousal signal is generated when decision-related neural activity ramps towards ‘yes’, facilitating the transition of the entire cortical system towards that non-default state.

Conclusion

Our findings establish that phasic task-evoked pupil responses during the formation of sensory-motor decisions reflect responses of a network of neuromodulatory brainstem centers including the noradrenergic LC. Phasic, pupil-linked arousal alters choice-encoding population signals in parietal and prefrontal association cortices. Phasic arousal in general, and neuromodulatory brainstem responses in particular, explain a dynamic reduction in decision-makers’ bias towards one particular choice. The resulting trial-to-trial variability of decision bias accounts for a significant component of the intrinsic behavioral variability: when decisions are made in the face of uncertainty, tracking phasic arousal signals may be just as important for predicting choice behavior as tracking the objective evidence gathered from the outside world.

Materials and methods

Subjects

We report analyses of four independent data sets, from behavioral tasks described in the subsequent section. All subjects had normal or corrected-to-normal vision and gave written informed consent. Subjects received €15 per hour (all visual tasks) or research credit (auditory task) for their participation. The ethics committee of the Psychology Department of the University of Amsterdam approved the experiments.

Fifteen healthy subjects (5 females; age range, 22–35 y) participated in the main experiment of this study, entailing concurrent pupillometry and brainstem as well as cortical fMRI recordings. Here, each subject participated in several fMRI sessions: one to define retinotopically organized visual cortical areas (75 min) and two sessions (three for one subject) for the main experiment (about 2 hr per session). Three subjects were authors, and the remaining 12 subjects were naive to the purpose of the study. The results were unchanged when excluding the three authors (see Author response, online) and the one subject who performed three sessions (and more trials; see section Behavioral tasks) of the main experiment. One (male) subject was excluded from the analyses because the stimulus software did not receive the triggers from the MRI scanner in two sessions (the age range remained the same).

We also re-analyzed the 21 subjects from an existing behavioral data set, for which we had previously published different analyses (de Gee et al., 2014) (Figure 2, Figure 2—figure supplement 1, Figure 4 and Figure 4—figure supplement 1). In that experiment, 23 subjects had performed a yes-no visual contrast detection task with trial structure analogous to that of the fMRI experiment, enabling joint fitting of the drift diffusion model to both data sets using a hierarchical Bayesian procedure (see below). To this end, we excluded the two subjects from the (de Gee et al., 2014) data set who had also participated in the current fMRI experiment, keeping the two samples independent.

Finally, 24 subjects (20 females; age range, 19–23 y) performed an auditory tone-in-noise detection task (Figure 3A,B), and 15 subjects (six females; age range, 23–37 y) performed a visual random dot motion discrimination task (Figure 3C,D).

Share this article

Cite this article

Behavioral task and task-evoked pupil responses.

Phasic arousal predicts reduction of choice bias.

Figure 2—source data 3

Figure 2—source data 1

Figure 2—source data 2

Arousal-linked bias reduction generalizes to other choice tasks.

Figure 3—source data 3

Figure 3—source data 1

Figure 3—source data 2

Phasic arousal predicts reduction of accumulation bias.

Figure 4—source data 2

Figure 4—source data 1

Phasic arousal does not boost sensory responses in visual cortex.

Cortex-wide fMRI correlates of phasic arousal and stimulus.

Phasic arousal predicts change of cortical decision signals.

Pupil responses reflect responses of a network of brainstem nuclei.

Brainstem neuromodulatory nuclei predict reduction of choice bias.

Author details

Jan Willem de Gee

Contribution

For correspondence

Competing interests

Olympia Colizoli

Contribution

Competing interests

Niels A Kloosterman

Contribution

Competing interests

Tomas Knapen

Contribution

Competing interests

Sander Nieuwenhuis

Contribution

Competing interests

Tobias H Donner

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism

Further reading