A process model account of the role of dopamine in intertemporal choice

  1. Alexander Soutschek  Is a corresponding author
  2. Philippe N Tobler
  1. Department of Psychology, Ludwig Maximilian University Munich, Germany
  2. Zurich Center for Neuroeconomics, Department of Economics, University of Zurich, Switzerland
  3. Neuroscience Center Zurich, University of Zurich, Swiss Federal Institute of Technology Zurich, Switzerland
5 figures, 4 tables and 1 additional file

Figures

Task design and experimental procedures.

(A) Participants made choices between alternatives that provided smaller-sooner rewards (e.g., 100 Swiss francs in 0 day) or larger-later rewards (e.g., 250 Swiss francs in 60 days). (B) In a double-blind crossover design, participants performed the intertemporal decision task after administration of the D2 antagonist amisulpride or placebo on two separate days. (C) Model-free Bayesian analyses revealed weaker influences of delay costs on decision making under amisulpride compared with placebo, consistent with previous findings that D2R antagonism strengthens the preference for delayed rewards. (D) Individual coefficients for the impact of delay on choices in the amisulpride and placebo conditions. (E) Illustration of the choice process in the framework of a drift-diffusion model. After a non-decision time τ (not shown here), evidence is accumulated from a starting point ζ with the weighted difference between benefits and action costs determining the speed of the accumulation process (drift rate v) toward the boundaries for the larger-later or smaller-sooner option. (F) Delay discounting under placebo (log-k; dots correspond to individual participants, with more negative values indicating weaker delay discounting) decreased with the difference in weights assigned to rewards and delay costs during evidence accumulation, replicating previous findings (Amasino et al., 2019).

D2R blockade affects multiple components of the intertemporal decision process.

(A) Larger differences in reward magnitude between the larger-later (LL) and smaller-sooner (SS) option increased drift rates, speeding up evidence accumulation toward LL options under placebo. (B) The impact of differences in reward magnitude was significantly stronger under amisulpride than under placebo. (C) Drug-dependent impact of differences in reward magnitude on the drift rate. Because the sensitivity to differences in reward magnitude was stronger under amisulpride than under placebo (steeper slope), D2R blockade sped up evidence accumulation toward the boundary for LL choices if differences in reward magnitude between the LL and SS options were large. In contrast, if the difference in reward magnitude was small, the drift rate was more negative under amisulpride compared with placebo, speeding up evidence accumulation toward the SS option. (D) Larger differences between delay of reward promoted evidence accumulation toward the (negative) boundary for SS choices under placebo, (E, F) but the impact of delay was not significantly altered by amisulpride. (G) The starting point of the accumulation processes was closer to the boundary for LL than SS choices under placebo, (H) and this starting bias toward the LL option was significantly reduced by amisulpride. (I) For larger differences in waiting costs, the starting point of the evidence accumulation process was increasingly shifted toward the SS option under placebo. (J) This impact of delay costs on the starting bias was significantly reduced under amisulpride. As illustrated in (K), reducing dopaminergic action on D2R with amisulpride shifted the starting bias toward the boundary for the SS option predominantly if no option possessed a clear proximity advantage (small difference between delays). In A, B, D, E, G, H, I, and J, yellow bars close to x-axis indicate 95% HDIs.

Model comparison.

(A) The deviance information criterion (DIC; lower numbers correspond to better fit) suggests that DDM-1 explained the data slightly better than DDM-2, DDM-4, and DDM-6 and clearly outperformed DDM-3 and DDM-5. (B, C) Posterior predictive checks on the group level (collapsed across all participants), separately for (B) placebo and (C) amisulpride. Particularly DDM-1, DDM-2, and DDM-3 described the empirically observed data well, whereas decisions simulated based on DDMs 4–6 more strongly deviated from observed behavior.

Posterior predictive checks.

For each individual participant (p1–p56), observed RTs (in black) are plotted against the RTs simulated based on the parameters for drift diffusion model (DDM) 1–6, separately for differences in (A) reward magnitude and (B) delay (quintiles). The plots suggest that the DDMs provide reasonable accounts of the observed RTs.

Illustration of how dopaminergic effects on intertemporal choices depend on differences in both reward magnitude and delay in the proposed framework, separately for (A) placebo, (B) amisulpride, and (C) the difference between amisulpride and placebo.

Plots are based on simulations assuming the group-level parameter estimates we observed under placebo and amisulpride. As dopaminergic effects on decision making affect both reward processing (via the drift rate) and cost processing (via the starting bias), the specific combination of rewards and delays determines whether D2R blockade increases or decreases the probability of larger-later (LL) choices. Low dopamine levels reduce the proximity advantage of smaller-sooner (SS) over LL options particularly if differences in action costs between reward options are large, promoting choices of the LL option. In contrast, if no option possesses a proximity advantage (small differences between delays), dopaminergic effects on evidence accumulation dominate, such that the LL option is perceived as less worth the waiting costs, particularly if its reward magnitude differs only little from that of the alternative SS option.

Tables

Table 1
Results of Bayesian generalized linear model regressing binary choices (larger-later [LL] vs. smaller-sooner [SS] option) in the amisulpride study on predictors for Drug, Magnitudediff, Delaydiff, and the interaction terms.

Standard errors of the mean of the posterior distributions are in brackets.

PredictorMean2.5%97.5%
Intercept3.10 (0.64)1.914.43
Drug–0.10 (0.53)–1.120.96
Delaydiff–1.27 (0.32)–1.87–0.60
Magnitudediff6.32 (0.72)5.037.83
Drug×Delaydiff0.75 (0.41)0.021.67
Drug×Magnitudediff0.25 (0.71)–1.051.77
Delaydiff ×Magnitudediff0.16 (0.43)–0.611.10
Drug×Delaydiff×Magnitudediff0.73 (0.60)–0.302.07
Table 2
Results of hierarchical DDM-1 for the amisulpride data.

SEM: standard errors of the mean of the posterior distributions.

ParameterRegressorMean (SEM)2.5%97.5%
Drift rateDelaydiff–1.13 (0.19)–1.53–0.78
Magnitudediff2.41 (0.26)1.932.95
Drug×Delaydiff–0.30 (0.27)–0.850.20
Drug×Magnitudediff0.81 (0.42)0.041.71
vmax0.80 (0.05)0.700.90
Drug×vmax–0.00 (0.06)–0.120.12
Decision thresholdPlacebo3.96 (0.15)3.674.26
Drug0.17 (0.15)–0.110.46
Starting biasPlacebo0.57 (0.01)0.540.60
Drug–0.04 (0.02)–0.08–0.001
Delaydiff–0.02 (0.01)–0.03–0.002
Drug×Delaydiff0.02 (0.01)0.0010.04
Non-decision timePlacebo1.48 (0.06)1.361.60
Drug–0.10 (0.07)–0.240.03
Table 3
Overview over drift diffusion model (DDM) parameters included in the DDMs.

Note that we modeled drug effects for all parameters included in the DDMs.

ParameterDDM-1DDM-2DDM-3DDM-4DDM-5DDM-6
Drift rateDelaydiff
Magnitudediff
vmax
Decision thresholdIntercept
Starting biasIntercept
Delaydiff
Delaysum
Non-decision timeIntercept
τdiff
Table 4
Results of the best-fitting DDM-1 for the D1 agonist experiment.

Standard errors of the mean of the posterior distributions are in brackets.

ParameterRegressorMean2.5%97.5%
Drift rate:
Delaydiff
Placebo (0 mg)–0.42–0.74–0.20
6 mg vs. 0 mg–0.02–0.350.28
15 mg vs. 0 mg–0.15–0.520.19
30 mg vs. 0 mg–0.12–0.500.14
Drift rate:
Magnitudediff
Placebo (0 mg)0.860.491.31
6 mg vs. 0 mg0.17–0.260.63
15 mg vs. 0 mg0.11–0.530.73
30 mg vs. 0 mg0.23–0.230.83
Drift rate: vmaxPlacebo (0 mg)1.721.152.66
6 mg vs. 0 mg–0.36–1.370.60
15 mg vs. 0 mg0.02–1.181.35
30 mg vs. 0 mg–0.39–1.600.75
Decision thresholdPlacebo (0 mg)2.822.593.04
6 mg vs. 0 mg–0.13–0.460.20
15 mg vs. 0 mg–0.11–0.450.24
30 mg vs. 0 mg–0.06–0.390.25
Starting bias:
Intercept
Placebo (0 mg)0.590.540.64
6 mg vs. 0 mg–0.01–0.070.06
15 mg vs. 0 mg–0.01–0.080.06
30 mg vs. 0 mg–0.03–0.090.04
Starting bias:
Delaydiff
Placebo (0 mg)0.00–0.010.02
6 mg vs. 0 mg–0.01–0.030.02
15 mg vs. 0 mg0.01–0.010.04
30 mg vs. 0 mg–0.01–0.030.01
Non-decision timePlacebo (0 mg)0.850.780.93
6 mg vs. 0 mg0.03–0.090.15
15 mg vs. 0 mg–0.02–0.130.09
30 mg vs. 0 mg0.03–0.060.13

Additional files

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Alexander Soutschek
  2. Philippe N Tobler
(2023)
A process model account of the role of dopamine in intertemporal choice
eLife 12:e83734.
https://doi.org/10.7554/eLife.83734