Catecholaminergic modulation of meta-learning

  1. Jennifer L Cook  Is a corresponding author
  2. Jennifer C Swart
  3. Monja I Froböse
  4. Andreea O Diaconescu
  5. Dirk EM Geurts
  6. Hanneke EM den Ouden
  7. Roshan Cools
  1. University of Birmingham, United Kingdom
  2. Radboud University, Netherlands
  3. University of Zurich and ETH Zurich, Switzerland
  4. University of Basel, Switzerland
  5. University of Toronto, Canada
  6. Radboud University Medical Centre, Netherlands
8 figures, 6 tables and 1 additional file

Figures

Figure 1 with 1 supplement
Task design.

Participants selected between two bandits (blue and green boxes) in order to win points. On each trial, participants saw the direct sources (boxes, 1–4 s), subsequently either the blue or green box was highlighted with a red frame (the indirect source, 1–4 s). Participants were instructed that the frame represented either the most popular choice made by a group of participants who had completed the task previously (Social Group) or the choice from rigged roulette wheels (Non-social Group). After participants had responded, their selected option was framed in grey. After 0.5–2 s participants received feedback in the form of a green or blue box between the two bandits. A new trial began after 1–3 s. Success resulted in the red reward bar progressing towards the silver and gold goals. The probability of reward associated with the blue and green boxes and the probability that the red frame surrounded the correct box varied according to probabilistic schedules which comprised stable and volatile phases.

Figure 1—figure supplement 1
Probabilistic schedules.

Solid blue lines show the probability of blue being the correct choice, dashed red lines show the probability of the red frame information being correct. For the win-stay, lose-shift analysis the interaction between drug, learning type and volatility remained significant when probabilistic schedule group (1,2,3,4) was included as a factor in the ANOVA (F(1,94) = 7.935, p = 0.006). There was no significant main effect of schedule (F(3,94) = 0.906, p = 0.441) and no interactions involving both drug and schedule (all p>0.05). For the learning rate analysis, the interaction between drug, learning type and volatility remained significant when schedule (1,2,3,4) was included as a factor in the ANOVA (F(1,94) = 7.500, p = 0.007). There was no significant main effect of schedule (F(3,94) = 0.850, p = 0.470) and no drug x learning type x volatility x group x schedule interaction (F(3,94) = 0.404, p = 0.751). There was a significant drug x learning type x volatility x probabilistic schedule interaction (F(3,94) = 2.725, p = 0.049). Although the pattern of results was comparable, the interaction between drug x learning type x volatility was only statistically significant for schedule 2 (F(1,23) = 12.231, p = 0.002), and did not reach significance for schedules 1 (F(1,24) = 4.015, p = 0.057), 3 (F(1,22) = 2.881, p = 0.104) and 4 (F(1,25) = 0.501, p = 0.486), likely due to a lack of power with these reduced group sizes.

Figure 2 with 1 supplement
Drug effects on win-stay, lose-shift beta values.

(A) Win-stay, lose-shift betas, for experienced-value learning, in stable and volatile periods, under MPH (purple) and PLA (green). (B) Win-stay, lose-shift betas, for inferred-value learning, in stable and volatile periods, under MPH and PLA. (C) There was a significant interaction between drug and volatility for experienced-value learning, but not for inferred-value learning. (D) Drug effect on win-stay, lose-shift betas for volatile minus stable blocks, for experienced-value (orange) and inferred-value (blue) learning. There was no difference between the social and non-social (roulette) groups. Boxes = standard error of the mean, shaded region = standard deviation, individual datapoints are displayed, MPH = methylphenidate, PLA = placebo, * indicates statistical significance.

Figure 2—figure supplement 1
Illustration of relationship between experienced-value βWSLS_vol-stable and accuracy.

Experienced-value βWSLS_vol-stable was significantly positively correlated with accuracy under both PLA (Top: Pearson’s r = 0.453, p < 0.001) and MPH (Bottom: Pearson’s r = 0.236, p = 0.017).

Drug effects on learning rates.

(A) Learning rate for experienced-value learning, in stable and volatile periods, under MPH (purple) and PLA (green). (B) Learning rate for inferred-value learning, in stable and volatile periods, under MPH and PLA. (C) There was a significant interaction between drug and volatility for experienced-value learning, but not for inferred-value learning. (D) The interaction between drug and volatility was specific to experienced-value learning even for a sub-sample of participants who were keen inferred-value learners. Boxes = standard error of the mean, shaded region = standard deviation, individual datapoints are displayed, MPH = methylphenidate, PLA = placebo, * indicates statistical significance.

Experienced-value learning rates under PLA, but not MPH, were further from the optimal value in the stable compared to volatile phase.
Participant data (left) juxtaposed against model simulations (right).

Top: Accuracy, in stable and volatile phases, under MPH (purple) and PLA (green). Boxes = standard error of the mean, shaded region = standard deviation, individual datapoints are displayed. Middle and Bottom: Running average, across 5 trials, of blue choices for probabilistic randomisation schedules 1 (middle) and 2 (bottom). Shaded region = standard error of the mean. MPH = methylphenidate, PLA = placebo.

Participant data (left) juxtaposed against model simulations (right).

Top: Accuracy, in stable and volatile phases, under MPH (purple) and PLA (green). Boxes = standard error of the mean, shaded region = standard deviation, individual datapoints are displayed. Middle and Bottom: Running average, across 5 trials, of blue choices for probabilistic randomisation schedules 3 (middle) and 4 (bottom). Shaded region = standard error of the mean. MPH = methylphenidate, PLA = placebo.

Appendix 4—figure 1
p(y|m) = posterior model probability, ϕ = exceedance probability, MPH = purple, PLA = green.
Author response image 1

Tables

Table 1
Coefficients from regression model with MPH-PLA accuracy as dependent variable.
ββ(std. Err.)standardised βtp
constant−0.0080.006−1.30.197
βMPH-PLA0.0140.0020.365.763*<0.001
ζMPH-PLA−0.2410.024−0.658−9.943*<0.001
αstable_experienced_MPH-PLA0.0050.0320.0110.1720.864
αvol_experienced_MPH-PLA0.1260.0290.2854.376*<0.001
αstable_inferred_MPH-PLA0.0140.0410.0230.3450.731
αvolatile_inferred_MPH-PLA0.020.0410.030.4830.63
  1. * indicates statistical significance.

Appendix 1—table 1
There were no significant differences between the social and non-social groups in age, gender or socioeconomic status as measured using the Barratt Simplified Measure of Social Status (Barratt, 2012).
SocialNon-social
Age in years (x¯(σ))21.8 (2.6)21.3 (2.0)t(100) = 1.1, p = 0.3
Socioeconomic status (x¯(σ))60.4 (7.8)59.2 (8.4)t(100) = 0.5, p = 0.6
N(Male:Female)50 (26:24)52 (25:27)χ2(1) = 0.16, p = 0.7
Appendix 3—table 1
Win-stay, lose-shift analysis.

Main effects and interactions not involving the factor drug.

Main effect/interactionStatisticsFurther information
Learning typeF(1,100) = 58.595, p < 0.001βWSLS for experienced (x¯(σx¯) = 0.366(0.017)) > inferred (x¯(σx¯) = 0.188(0.012))
VolatilityF(1,100) = 70.671, p < 0.001βWSLS for volatile (x¯(σx¯) = 0.327(0.010)) > stable (x¯(σx¯) = 0.227(0.011))
GroupF(1,100) = 4.001, p = 0.048βWSLS for non-social (x¯(σx¯) = 0.295(0.013)) > social (x¯(σx¯) = 0.259(0.013))
Learning type x groupF(1,100) = 7.729, p = 0.006βWSLS for social (x¯(σx¯) = 0.138(0.017)) < non-social (x¯(σx¯) = 0.239(0.017)) for inferred (F(1,100) = 17.743, p < 0.001).
No difference between groups for experienced (social x¯(σx¯) = 0.381 (0.024), non-social x¯(σx¯) = 0.352 (0.024), F(1,100) = 0.694, p = 0.407)
Appendix 3—table 2
Learning rate analysis.

Main effects and interactions not involving the factor drug. Note that the learning type x volatility interaction is consistent with previous studies of experienced-value learning which have reported higher learning rates in volatile relative to stable environments (Behrens et al., 2007; Behrens et al., 2008; Browning et al., 2015; Krugel et al., 2009; Nassar et al., 2010).

Main effect/interactionStatisticsFurther information
Learning typeF(1,100) = 65.409, p < 0.001αdirect (x¯(σx¯) = 0.328(0.014)) > αinferred (x¯(σx¯) = 0.194(0.010))
Learning type x volatilityF(1,100) = 4.882, p = 0.029αvol_experienced (x¯(σx¯)= 0.341(0.017)) > αstable_experienced (x¯(σx¯) = 0.315(0.015); F(1,100) = 3.916, p = 0.051).
No significant difference between αstable_inferred (x¯(σx¯) = 0.201(0.012)) and αvol_inferred (x¯(σx¯) = 0.187(0.011); F(1,100) = 1.675, p = 0.199)
Learning type x groupF(1,100) = 4.097, p = 0.046αsocial (x¯(σx¯) = 0.161(0.014)) < αnon-social (x¯(σx¯) = 0.227(0.014)) for inferred (F(1,100) = 11.349, p = 0.001).
No difference between groups for experienced (social x¯(σx¯)0.329(0.021), non-social x¯(σx¯) 0.328(0.020); F(1,100) = 0.002, p = 0.966)
Appendix 5—table 1
Coefficients from regression models with MPH and PLA accuracy as dependent variables.
ββ(std. Err.)standardised βtp
PLA
constant0.670.0230.15*<0.001
βPLA−0.250.02−0.66−12.84*<0.001
ζPLA0.030.000.438.53*<0.001
αstable_experienced_PLA0.050.040.091.350.18
αvol_experienced_PLA0.100.040.182.67*0.01
αstable_inferred_PLA−0.030.04−0.03−0.560.58
αvolatile_inferred_PLA0.130.050.162.84*0.01
MPH
constant0.690.0239.88*<0.001
βMPH−0.260.02−0.78−15.92*<0.001
ζMPH0.020.000.449.15*<0.001
αstable_experienced_MPH0.030.030.061.050.30
αvol_experienced_MPH0.100.030.224.05*<0.001
αstable_inferred_MPH0.050.030.081.490.14
αvolatile_inferred_MPH0.030.040.050.920.36
  1. * indicates statistical significance at p < 0.05.

Appendix 6—table 1
Mean(standard deviation) scores for neuropsychological tests and self-report questionnaires for the groups that received PLA and MPH on day 1.

Significance level for the between group differences are reported. Self-report questionnaires include the Barratt Impulsiveness Scale (BIS-11; Patton et al., 1995), the Behavioural Inhibition Scale/Behavioural Activation Scale (BISBAS) (Carver and White, 1994), Need for Cognition Scale (NCS) (Cacioppo et al., 1984), Multidimensional Scale of Perceived Social Support (MSPSS) (Zimet et al., 1988), Barratt Simplified Measure of Social Status (BSMSS) (Barratt, 2012), Sociable and Aggressive Dominance Questionnaire (SADQ) (Kalma et al., 1993), Beck Depression Inventory II (BDI-II) (Beck et al., 1996), Spielberger Trait Anxiety Inventory (STAI) (Spielberger et al., 1983).

Group 1
PLA Day 1
Group 2
MPH Day 1
Neuropsychological testsListening span5.0 (0.9)4.6 (1.2)p = 0.160
NLV94.4 (7.6)92.6 (7.6)p = 0.230
Digit span – forward17.2 (3.7)16.2 (3.6)p = 0.158
Digit Span - backward14.7 (3.4)13.9 (2.7)p = 0.219
Self-report questionnairesImpulsivity (BIS-11)63.5 (8.9)60.2 (7.9)p = 0.052*
Behavioural inhibition (BIS)16.4 (3.7)16.3 (3.5)p = 0.899
Behavioural activation (BAS)22.8 (3.9)23.9 (4.0)p = 0.166
Need for cognition (NCS)64.5 (10.5)62.2 (10.5)p = 0.264
Social support (MDSPSS)71.1 (10.1)69.3 (9.6)p = 0.349
Social status (BSMSS)49.8 (12.1)45.9 (12.7)p = 0.114
Social dominance (SADQ)4.1 (0.9)4.1 (0.8)p = 0.823
Aggressive dominance (SADQ)2.6 (0.6)2.6 (0.6)p = 0.691
Depressive symptoms (BDI)3.5 (3.7)3.6 (3.9)p = 0.969
Anxiety symptoms (STAI)32.4 (6.6)32.4 (7.2)p = 0.995
  1. *One participant had an outlying score on the BIS-11. Without outlier: p 0.090.

Additional files

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Jennifer L Cook
  2. Jennifer C Swart
  3. Monja I Froböse
  4. Andreea O Diaconescu
  5. Dirk EM Geurts
  6. Hanneke EM den Ouden
  7. Roshan Cools
(2019)
Catecholaminergic modulation of meta-learning
eLife 8:e51439.
https://doi.org/10.7554/eLife.51439