Abstract
The key elements for fear extinction learning are unexpected omissions of expected aversive events, which are considered to be rewarding. Given its reception of reward information, we tested the hypothesis that the cerebellum contributes to reward prediction error processing driving extinction learning via its connections with the ventral tegmental area (VTA). Forty-three young and healthy participants performed a three-day fear conditioning paradigm in a 7T MR scanner. The cerebellum and VTA were active during unexpected omissions of aversive unconditioned stimuli, particularly during initial extinction trials. Increased functional connectivity was observed between the cerebellum and VTA, indicating that the cerebellum could positively modulate VTA activity, which in turn might facilitate dopaminergic signaling during fear extinction learning. These results imply that an interaction between the cerebellum and VTA should be incorporated into the existing model of the fear extinction network.
Introduction
Deficits in learning to extinguish previously associated threat responses to cues that no longer signal danger are thought to be one of the main causes in the development of anxiety disorders, including post-traumatic stress disorder and social anxiety disorder1. Exposure therapy addresses this deficit and attempts to compensate for failed or absent extinction learning from the past2. This is modeled effectively with extinction training in classical fear conditioning paradigms where the previously paired conditioned stimulus (CS+) is no longer followed by the aversive unconditioned stimulus (US)3,4. At the beginning of extinction training the omission of the US is unexpected and this unexpected lack of the US is considered to be rewarding5–7. This reward prediction error is thought to drive safety learning. This newly learned safety association should then inhibit the initially learned fear association (which is never fully lost)8.
The reward system in the brain is primarily associated with the mesolimbic dopamine system9, i.e., the ventral tegmental area (VTA) and the ventral striatum. In fact, recent rodent studies provide strong evidence that the VTA is central for extinction learning, and that (reward) prediction errors are encoded in a subset of VTA neurons5–7,10. Importantly, new evidence indicates that the cerebellum receives reward signals11–13, and has direct efferent connections with the mesolimbic dopaminergic system, in particular the VTA14–17.
Functional magnetic resonance imaging (fMRI) findings in healthy human participants suggest that the cerebellum is involved in processing of prediction errors in fear conditioning18,19. Ernst et al.18 studied acquisition of learned fear responses using a partial reinforcement rate, i.e., a subset of CS+ trials were not reinforced during acquisition training. Cerebellar activations were strongest in the unreinforced CS+ trials, when the US was expected but did not occur. An unreinforced CS+ trial can be considered as a very first extinction trial, and cerebellar activations may reflect prediction errors driving extinction learning. Likewise, recordings from the VTA in rodents showed strong activations at the time the US is expected and does not occur during initial extinction trials6,10. The aim of the present fMRI study is therefore to test the hypothesis that the cerebellum contributes to fear extinction learning via its connection to the VTA.
In contrast to the previous one-day study design18, here we used a three-day design with acquisition training on day 1, extinction training on day 2 and a recall test on day 3. This multi-day study design allowed enough time for consolidation of both fear and extinction memory traces. Furthermore, to increase the size of the reward prediction error at the initial US omission in extinction training trials, a full (100%) reinforcement rate was used during acquisition training. Previous studies showed significant spontaneous recovery of learned fear in initial recall test trials (unreinforced CS+) following extinction training in humans, which allowed us to test for cerebellar activations related to reward prediction errors also on day 3. Additionally, after the recall test on day 3, a reacquisition phase with a partial reinforcement rate (i.e., with unexpected US omissions between reinforced trials) and a subsequent reextinction phase were realized. Taken together, we were able to access cerebellar activity related to the unexpected omission of the US at various points during the experiment. We used a deep reinforcement learning model fitted to trial-by-trial skin conductance response (SCR) data to compute individual prediction errors. 7T fMRI allowed us to assess VTA activation. Results showed significant functional interaction between the cerebellum and VTA related to prediction errors, particularly during initial extinction trials. These findings suggest that the cerebellum contributes to extinction learning via the VTA, and thus supports safety learning inhibiting the initial fear association.
Results
Behavioral data
SCR data is reported below, pupil size data is provided in the supplementary information (Supplementary figure 1, Supplementary table 1). Statistical analysis of SCR data is summarized in Supplementary table 2.
Skin conductance responses (SCRs)
During habituation (day 1), SCRs did not differ between CS+ and CS- (Figure 1, top row). SCRs for both stimuli decreased significantly in habituation trials. Non-parametric ANOVA-type statistics revealed a significant effect for Time (early vs. late; F1 = 26.49; p<.001) but not for Stimulus (CS+ vs. CS-; F1 = 1.19; p=0.28) or the Stimulus x Time interaction (F1 = 0.06; p=0.80).
During fear acquisition training (day 1), participants underwent a fully reinforced, semi-instructed differential fear conditioning protocol. Participants quickly learned the associations, and their responses decreased as expected due to habituation effects (Figure 1, top row). SCRs for the CS+ were significantly higher than for the CS- and decreased in late trials. Non-parametric ANOVA-type statistics revealed a significant effect of Stimulus (CS+ vs. CS-; F1 = 16.28; p<.001) and Time (early vs. late; F1 = 24.49; p<.001). No significant Stimulus x Time interaction was observed (F1 = 3.33; p=0.07).
During extinction training (day 2), SCRs were significantly higher for the CS+ than for the CS– only in initial extinction trials (Figure 1, top row). Non-parametric ANOVA-type statistics revealed a significant effect of Stimulus (CS+ vs. CS-; F1 = 6.57; p=0.01) and Time (early vs. late; F1 = 21.10; p<.001). The Stimulus x Time interaction was not significant (F1 = 2.83; p=0.09). For pupil size responses (PSRs), the Stimulus x Time interaction was significant (F1 = 4.28; p=0.039). Post-hoc tests showed that PSR related to the CS+ was significantly lower during late extinction compared to early (least squares means test; p < .001), but not for PSR related to the CS- (least squares means test; p = 0.64; see supplementary materials), indicating extinction learning towards the CS+.
During the recall test (day 3), SCRs for CS+ were significantly higher than for CS- (Figure 1, bottom row). In late recall, no difference between SCRs for CS+ and CS- were observed. Non-parametric ANOVA-type statistics revealed a significant main effect of Stimulus (CS+ vs. CS-; F1 = 4.19; p=0.04), Time (early vs. late; F1 = 32.47, p<0.001) and a Stimulus x Time interaction (F1 = 5.27; p=0.02). Post-hoc tests revealed that SCRs for CS+ were significantly higher than for CS- in early trials (least squares means test; p < .001) but not in late trials (least squares means test; p = 0.98). SCRs for both stimuli significantly decreased in late trials compared to early trials (least squares means test; both: p < .001).
During reacquisition on day 3, SCRs for the CS+ were significantly higher than for the CS- and decreased for both stimuli in late reacquisition (Figure 1, bottom row). Non-parametric ANOVA-type statistics revealed a significant effect of Stimulus (CS+ vs. CS-; F1 = 26.65, p<0.001), Time (early vs. late; F1 = 30.37, p<0.001) and a Stimulus x Time interaction (F1 = 6.34, p=0.01). Post-hoc tests showed that both stimuli were higher in early vs late reacquisition training (CS- early vs. CS- late; p = 0.02; CS+ early vs. CS+ late; p < .001), and a significant difference between stimulus types early and late (all p values < .001).
During reextinction on day 3, SCRs related to the CS+ were significantly higher than for the CS- although the difference was small (Figure 1, bottom row). Non-parametric ANOVA-type statistics revealed a significant main effect of Stimulus (CS+ vs. CS-; F1 = 4.3; p=0.04), but not for Time (early vs. late; F1 = 0.2; p=0.65) nor for the Stimulus x Time interaction (F1 = 1.1; p=0.29). For PSRs, the Stimulus x Time interaction was significant (F1 = 4.43; p=0.035), and PSRs related to the CS+ were significantly higher than to the CS- in early trials, but not in late trials (see supplementary materials).
In the last phase on day 3, three CS+ trials were unexpectedly reinforced (“unexpected US phase”). SCRs for the CS+ were significantly higher than for the CS- (Figure 1, bottom row). Non-parametric ANOVA-type statistics revealed a significant main effect of Stimulus (CS+ vs. CS-; F1 = 5.68; p=0.02), but neither for Time (early vs. late; F1 = 0.26; p=0.61) nor for the Stimulus x Time interaction (F1 = 2.81; p=0.09).
Self-reports
Valence, arousal, fear and US expectancy ratings of the CS+ did not differ significantly from the CS- before acquisition training (Supplementary table 3,4). Following acquisition training, the CS+ was rated as more arousing, fearful and unpleasant compared to the CS-. Participants expected the US to follow the CS+ but not the CS-. Although less pronounced, these CS+/CS- differentiations persisted after extinction training and recall test. The difference between CS+ and CS- increased at the end of day 3, when CS+/US pairings were reintroduced.
Non-parametric ANOVA-type statistics showed a significant main effect of Stimulus, Time and a Stimulus x Time interaction (all p<0.001; Supplementary table 4). Post-hoc tests revealed significant differences between stimulus type (CS+, CS-) following acquisition training, extinction training, recall test and at the end of day 3 but not after habituation.
fMRI results
Analysis of BOLD fMRI responses revealed significant activations related to the prediction, the presentation and the unexpected omission of the US (Figure 2). Our primary focus was on activations related to the unexpected omission of the US, which could be detected across four distinct phases: extinction training, recall test, reacquisition, and reextinction (Figure 3). Statistical analyses are summarized in Supplementary table 5-9.
Activations related to US prediction (CS+ > CS- and CS+ x prediction)
In the event-based analysis, there was no significantly higher activation in the cerebellum during the CS+ compared to the CS- (CS+ > CS-) during acquisition or extinction training. In the same contrast, the lateral VTA exerted significant differential activations during acquisition (Figure 2A; threshold free cluster enhancement (TFCE) and family-wise error (FWE) corrected p<0.05), but not during extinction training. Trend-level results showed differential cerebellar activation in posterolateral regions (lobule VI, Crus I, Crus II, lobule VIIb and VIIa; Figure 2A, D; uncorrected p<0.05) during acquisition training, which match results obtained by Ernst and colleagues (2019).
In the parametric modulation analysis, there were significant activations in the cerebellum (lobule and vermis I-VI), deep cerebellar nuclei (DCN) and VTA negatively associated with CS+ prediction value modulation during acquisition training (CS+ x prediction; Figure 2B), whereas there were significant positively associated activations during extinction training (Figure 2E; lobule VI and Crus I bilaterally, right lobule V and VTA). Parametric modulation results indicate that early CS+ presentations are paired with high cerebellar activations in both acquisition and extinction training, with prediction values being low during early and high during late acquisition, whereas prediction values are high during early and low during late extinction (Figure 5C). In other words, the activations were negative due to high activations of early CS+ compared to late CS+ events with prediction values taking the opposite course (i.e., low during early acquisition and high during late acquisition).
Activations related to US presentation (US post CS+ > no US post CS-)
During acquisition training, significant activations related to the presentation of the US (US post CS+ > no US post CS-) were widespread and included the whole cerebellum and VTA (Figure 2C,F). Cerebellar activations extended to the DCN. Peak activations in the cerebellum were found ipsilateral to the US (electrical stimulation applied to right hand).
Activations related to the unexpected omission of the US (first 3 no US post CS+ > no US post CS- and no US post CS+ x prediction error)
Considering the whole extinction training phase, trend-level results showed activations in posterolateral lobules and the VTA related to the omission of the US (no US post CS+ > no US post CS-; Figure 2F). When focusing specifically on the early trials in extinction training to quantify activations related to the unexpected omission of the US, we found significant activations in both event-based and parametric modulation analyses.
In the event-based analysis, the cerebellar cortex, DCN and VTA showed significant activations related to unexpected omissions of the US in initial trials for all four phases (first 3 no US post CS+ > no US post CS-; extinction training, recall test, reacquisition and reextinction; Figure 3A-D). The location of activations was consistent across phases, with highest activations in left posterolateral lobule VI and Crus I. Activations in the left posterolateral cerebellar cortex were paired with high activations in the left dentate nucleus. The VTA was bilaterally activated.
In the parametric modulation analysis, significant activations were found in the cerebellum for unexpected omissions during extinction training and the recall test (Figure 3E,F), and showed trend-level activations during reacquisition and reextinction (Figure 3G,H). The highest activations were again found in left lobule VI and Crus I. In the DCN, significant activations were found during extinction training with peak activations in the left dentate, with trend-level results in the recall test. In the VTA, results were significant during extinction training (Figure 3E), with trend-level activations during the recall test, reacquisition and reextinction.
PPI: Functional connectivity during unexpected US omissions between the cerebellum and VTA
Next, we aimed to assess functional interactions between the cerebellum and VTA using psychophysiological interaction (PPI), using the VTA as a seed region. In the parametric modulation analysis, we observed trend-level positive functional connectivity in left lobule VI and Crus I for extinction training, recall test, reacquisition and reextinction (Figure 4A; Supplementary table 7,8). When combining the contrasts by summing, the combination of all four phases showed similar positive functional connectivity between the cerebellar cortex and the VTA in lobule VI and Crus I, but only on a trend-level (Figure 4B). No activations were seen in the DCN.
DCM: Functional connectivity during unexpected US omissions between the cerebellum and VTA
In the dynamic causal modeling (DCM) parametric modulation analysis, unexpected omissions significantly positively modulated both the connection from cerebellar cortex (CB) to VTA and VTA to CB during extinction training (Figure 4D; very strong evidence, posterior probability P>0.99, strong evidence: 0.95<P<0.99). In the recall test, the VTA to CB connection was positively modulated. In reacquisition, the CB to VTA connection was positively modulated. Results were less strong for the other unexpected omission modulations, with trend-level moderate evidence for the CB to VTA connection in the recall test and the VTA to CB connections in reacquisition, and weak evidence for both CB to VTA and VTA to CB connections in reextinction (Figure 4D; moderate 0.75<P<0.95; weak P<0.75). In the model incorporating the DCN and VTA, there was a trend-level modulation from the DCN to the VTA during extinction training, as well as a trend-level modulation from the VTA to the DCN during reextinction (Figure 4E; moderate significance, 0.75<P<0.95).
Discussion
The cerebellum and VTA were active during unexpected omissions of aversive unconditioned stimuli, particularly during initial extinction trials. Importantly, increased functional connectivity was observed between the cerebellum and VTA, indicating that during unexpected omissions in extinction training, the cerebellum could positively modulate VTA activity. These results support the view that an interaction between the cerebellum and VTA should be incorporated into the existing model of the fear extinction network. Furthermore, given that the unexpected omission of an aversive stimulus can be considered rewarding, data suggest that the cerebellum contributes to reward processing in humans, and that this may be one way how the cerebellum modulates non-motor functions, in our case the control of emotions. Our findings build on previous research in rodents showing that the cerebellum modulates addictive, social and depression-like behavior via direct projection to VTA in rodents16,21.
Cerebellar activations were most prominent in trials with maximum prediction errors, i.e., during the initial extinction training phase, but were also seen in early recall, reacquisition and early reextinction. The activations in lobule VI and Crus I are consistent with previous findings by Ernst et al.18 during unexpected omissions in a partially reinforced acquisition phase.
Additionally, other studies have reported activations in these regions during the prediction of the US in extinction training22–25 and during unexpected omissions of aversive stimuli26–29. More broadly, activations in lobule VI and Crus I have been observed during US prediction30, particularly in the early phases of acquisition training. Conversely, unexpected rewards lead to activation in Crus I31, and a meta-analysis implicated left lobule VI and Crus I in both the anticipation and processing of rewarding outcomes32. Anticipation-related activations were more hemispheric, whereas rewarding outcome activations were more paravermal.
Activated areas in more lateral parts of lobule VI and Crus I overlap with multi-demand regions33, whereas activations neighboring the vermis overlap with emotional regions34. The main components of the multi-demand (or executive) neural network are working memory, attention and inhibition35. Thus, attention-related processes involved in fear conditioning may also contribute to cerebellar activations. Attention-related processes have been related to the salience of the stimulus36, but are also thought to change when a cue is followed by an unexpected outcome with more attention being paid in the next trial37,38. Attention related processes may therefore contribute to activations in the more lateral cerebellar parts of lobule VI/Crus I. However, unexpected omission of the US should lead to increased attention to the CS in the next trial and therefore increased activation towards the CS (which was not the case) and not at the time the US was presented and did not occur.
Our findings of increased activation in the VTA related to the unexpected omission of the aversive US aligns well with rodent data6,7. In rodents, it has been shown that dopamine neurons in the VTA are active in initial extinction trials, and that lack of this dopamine signal prevents extinction learning5,6,39. In a more recent study, Salinas-Hernandez et al.10 observed that these dopamine neurons project to extinction-related neurons of the nucleus accumbens, and receive input from the dorsal raphe. Our data show that the VTA is also involved in extinction learning in humans, and provide first evidence that the VTA receives input from the cerebellum during extinction learning. Esser et al.40 found that the administration of L-Dopa in healthy human participants enhanced activation of the nucleus accumbens which was functionally coupled with the VTA at the time of unexpected omission of the US in early extinction learning, which is in good agreement with our findings.
The unexpected omission of the aversive US is thought to be rewarding and fMRI signals in the cerebellum and VTA may be related to reward prediction errors. Whereas the VTA has long been known to process rewards, and fMRI data in humans indicate that the VTA is active related to the unexpected presentation of rewards41,42, the cerebellum has only recently been found to process reward signals in rodents11,13,43. FMRI signals are known to be driven by synaptic input44–46. In the cerebellum this is mainly related to mossy fiber input to the cerebellar cortex47. There is evidence, however, that granule cell activity also modulates vasodilatation and likely contributes to fMRI BOLD signals in the cerebellar cortex48,49. Likewise, recent evidence suggests that activation of dopamine neurons in the VTA leads to increased BOLD signals using DREADD-fMRI in rodents50. Human fMRI-PET studies have shown that VTA BOLD activation in response to rewards correlates with increases in dopamine in the striatum, suggesting that the fMRI signal in the SN/VTA is related to dopaminergic neuron activity42,51. FMRI signals in the cerebellar cortex and VTA may therefore reflect incoming reward-related signals, prediction error processing or, maybe most likely, both. The origin of the afferent reward signals to the cerebellum remains unclear; while an early study showed afferent connections from the VTA to the cerebellum52, this finding has not been replicated in more recent research53. However, preliminary evidence from Guarque-Chabrera et al.54 suggests the existence of a connection from the VTA to the cerebellum.
In rodents, a subset of VTA dopamine neurons has been found to be active when the US is expected and does not occur, and are therefore considered to represent prediction error signals driving extinction learning6,10. The VTA, however, also contains neurons which are active during the presentation and prediction of threat55,56, and likewise we observed fMRI activation of the VTA related to threat prediction and presentation. Cerebellar activations and their interactions are likely not limited to reward prediction errors; they may also be related to unexpected presentation of aversive stimuli. Because aversive stimulus presentation results in pronounced cerebellar activations, we were unable to separate cerebellar activation related to the unexpected (initial acquisition trials) and the expected (late acquisition trials) presentation of the US.
Although, the present findings are consistent with the interpretation of prediction error related activations in the VTA (and cerebellum), based on the present fMRI data, we cannot decide whether activations reflect surprise signals (i.e., unsigned prediction error related signals) or signed prediction error signals37,57–59. Rodent data show that cerebellar neurons in the cortex and nuclei increase activations both related to the unexpected presentation and omission of reward signals60, which would indicate that the cerebellum primarily contributes to unsigned (reward) prediction errors. Likewise, a meta-analysis of human prediction error showed that activations in the cerebellum were more consistently related to unsigned prediction errors61.
Results showed significant functional interaction between the cerebellum and VTA related to prediction errors in unexpected omission trials in extinction training, which was further confirmed by similar functional interactions in spontaneous recovery trials on day 3, and unexpected omissions of the US in unreinforced trials in a partially reinforced reacquisition phase. Predictive output of the cerebellum may be linked to the timing of the unconditioned response and therefore its omission and may help to refine prediction error signals in the VTA.
Notably, the unexpected US omission area in the cerebellum was more prominent on the left, while we applied the US to the right. In our previous work it was also the left cerebellum which was mainly activated, however, stimulation was always done on the left18,62. As outlined above, we believe that cerebellar activation is reward related. In a recent meta-analysis of fMRI data in monetary reward tasks, reward outcome was also related to the left cerebellum (i.e., lobule VI in the cerebellar hemisphere and vermis)32. There is good evidence that emotional information is most prominently processed in the right cerebral hemisphere63,64. The present data suggest that emotional processing may also be lateralized in the cerebellum, with the left cerebellar hemisphere projecting to the right cerebral cortex and vice versa. Lateralization of positive emotions to left cerebellar lobule VI has also been reported by Liu et al.65. Furthermore, emotion-related tasks resulted in increased activity of the left cerebellar hemisphere in patients with anorexia nervosa compared to controls66. Meta-analyses of fMRI studies of emotional tasks, however, report mostly bilateral cerebellar activations with no clear lateralization67–69. These studies primarily focused on emotion recognition tasks. Future research should explore the lateralization of emotional processing in the cerebellum using a variety of emotional tasks. In addition to emotions, there is also a preference of the right cerebral hemisphere for the ventral fronto-parietal attention network, which has been related to the detection of potentially harmful events70. Thus, as outlined above, attention-related processes involved in fear conditioning may also contribute to (lateralized) cerebellar activations.
In the following, we discuss potential limitations of our study. Firstly, while we demonstrated functional connectivity in the DCM analysis, the results were weaker with PPI. PPI is known to frequently lack power and to have a high proportion of false negatives71,72, particularly for event-based designs. Additionally, the low number of unexpected events in our experiment, due to the 100% reinforcement/non-reinforcement rates in acquisition training (which was designed to induce maximum prediction error in the initial extinction trials), may have made detecting activation challenging. Furthermore, a lack of specificity of the unexpected omission condition to the cerebellum-VTA interaction may have contributed. As the PPI regressor is demeaned, any persistent activation (i.e., due to an anatomical connection) will decrease detected connectivity. Because cerebellar and VTA activation co-occur not only in response to unexpected US omissions, but also to US presentation and prediction, this lack of specificity could have weakened our results. The same reasons apply for the lack of observation in functional connectivity between the DCN and the VTA.
Secondly, in the DCM analysis, we observed strong modulation related to unexpected US omissions when using the cerebellar cortex and VTA as volumes of interest (VOIs), but observed only trend-level results when substituting the cerebellar cortex with the DCN. This is likely explained due to the high iron content in the DCN causing susceptibility artefacts and reducing signal73. Although we opted for a whole-brain sequence with a conventional echo- time in order to generate a dataset of general value, future studies could opt for lower echo times and optimize for detection of signal in the DCN74.
Thirdly, the extent of the VTA varies depending on the MRI atlas used. The VTA in the atlas from Pauli et al.75 is about 4 times smaller than the VTA in the atlas by Trutti et al.76 due to different VTA definitions used. Our average volumes are in good accordance with Trutti et al.76, which in turn is in good accordance with histological data. The VTA and its boundaries, particularly laterally, are not clearly visible on MRI scans and were identified by using landmarks such as the substantia nigra, red nucleus, interpeduncular fossa and the cerebral aqueduct. We have approximated the VTA region as closely as we could, but cannot rule out imperfect masks, especially at both lateral ends of the VTA.
In conclusion, our study provides evidence that the cerebellum contributes to fear extinction learning via its connections with the VTA, possibly by modulating dopaminergic VTA neurons processing (reward) prediction errors driving fear extinction learning. Previous studies found that the cerebellum modulates addictive and social behavior via direct projection to VTA in rodents. Our findings extent this observation to the control of emotions. Added knowledge to the fear extinction network, on the other hand, may provide new ways of improving exposure therapy by targeting the dopaminergic system or stimulating the cerebellum specifically during unexpected learning events.
Methods
Preregistration
This study was preregistered on OSF on 16/11/202377. The preregistration document can be viewed at https://osf.io/n4df3/?view_only=d541dc3b6a304e31a16eadd3a02f152f.
The preregistration outlines the methods explained here, and includes the main hypotheses. Adjustments were made to specific parts of the methods, which include the optimized localization of the VTA and DCN by manual drawing by an expert annotator, and the inclusion as volumes of interest (VOIs) in the functional connectivity analyses.
Participants
Experiment power estimate and number of participants
Experiment power was estimated based on previous fear conditioning data acquired at 7T fMRI18 using the FMRIPower toolbox (fmripower.org)78. Considering first level unexpected US omission contrasts during a partially reinforced acquisition training and aiming for a power of 80% at p<.005 for a one-sided hypothesis test, group sizes were estimated to 41 participants for Crus I ipsilaterally to US presentation (effect size 0.56 in units of standard deviations). To account for potential dropouts and outliers, 50 participants were initially recruited. However, only 44 successfully completed the three-day study, with reasons for non-completion including Covid-related issues (1 participant), problems with the Digitimer equipment (3 participants), and failure to attend on day 3 (2 participants). Additionally, one participant dropped out due to a shimming failure, resulting in a final sample size of 43 participants.
Exclusion criteria and demographic information
Participants were between 20 and 35 years old (23 men, 20 women, mean age: 24.7 (SD = 3.4) years), fluent in German, non-smokers, reported no intake of medication or illicit drugs affecting the central nervous system, had no history of neurological or mental disorders for themselves or their first-degree relatives, and had no previous participation in similar learning experiments. Additionally, female participants were not pregnant, breastfeeding, or using hormonal contraceptives. Before the experiment, participants completed the Depression Anxiety Stress Scale (DASS21G)79. Participants who scored higher than 20 for depression, 14 for anxiety or 25 for stress components were excluded from the study. All participants were right-handed based on the Edinburgh handedness inventory80. Participants were asked to refrain from alcohol consumption the night before the experiment. Informed consent was obtained from all participants. The study was approved by the local ethics committee and conducted in accordance with the Declaration of Helsinki.
Fear conditioning paradigm
Participants underwent a three-day differential fear conditioning paradigm. The paradigm presentation was controlled by a computer running Presentation (version 21.0, Neurobehavioral System Inc, Berkeley, CA). Participants were shown two visual stimuli (square and diamond) as CS (CS+ and CS-) and informed that electrical stimulations would be applied during the experiment. Three types of trials were used: reinforced CS+ (CS+ paired with the US; CS+/US), unreinforced CS+ (no US) and unreinforced CS-. The CS- was never reinforced.
The paradigm consisted of seven phases separated over three days (Figure 5A). On day 1, the experiment started with a habituation phase consisting of 8 unreinforced CS- and CS+ trials (4 CS+ trials, 4 CS- trials, presented in semi-randomized order). This phase is intended to decrease novelty effects and enable participants to acclimate to the scanner environment and presentation of visual stimuli. The habituation phase was followed by acquisition training consisting of 32 trials with fully reinforced CS+ (16 CS+/US, 16 CS- trials), during which the fear association was learned. On day 2, extinction training was applied which consisted of 32 unreinforced trials (16 CS+, 16 CS-). On day 3, the recall test, reacquisition, reextinction and the unexpected US phase were applied. The recall test consisted of 16 unreinforced trials (8 CS+, 8 CS-). This phase tested the recall of extinguished fear responses and showed spontaneous recovery (see Results). Reacquisition, reextinction, and the unexpected US phases were done in a single continuous fMRI run without pauses between sequences. The partially reinforced reacquisition phases consisted of 36 trials (15 CS+/US trial, 3 CS+, 18 CS-) with 3 unexpected omissions of the US, reextinction consisted of 24 trials (12 CS+, 12 CS-) and the unexpected US phase consisted of 28 trials (3 CS+/US trial, 11 CS+, 14 CS-) with 3 unexpected reinforced CS+ trials (Figure 5A). Learned fear responses were measured by skin conductance responses (SCRs) and pupil size reactions (PSRs) in each trial. The reacquisition, reextinction and unexpected US phases were included to increase the number of trials with high prediction errors (i.e., discrepancy between US expectation and actual US presentation).
We used the model from Batsikadze et al.28 to optimize our experimental design. Values for US prediction and US prediction error produced by the model were recorded during simulation. Simulations were repeated for different paradigm CS sequences, and the sequence which required a short number of trials and still yielded high prediction errors was selected.
The CS+ and CS- were presented in equal numbers during the first and second half of each phase, regardless of whether the CS+ was followed by a US or not. Counterbalancing was implemented for the visual CS (CS+ square and CS- diamond, or CS+ diamond and CS- square) as well as for the order of presentation of the CS during the unreinforced habituation, extinction training, and recall test phases (i.e., CS+ presented first or CS- presented first). The reinforced acquisition training and reacquisition phases always started with a CS+. The two orders of CS presentation were pseudorandomly generated, similar to Ernst et al.18. Participants were instructed to pay attention to any possible connection between the CS and US presentations. They were also informed that the patterns of CS and US presentations would stay the same across all phases. However, they were not told about the CS/US contingencies or the timing and occurrence of the US.
In the analysis, the focus was on trials containing unexpected omissions of the US (CS+/no US) with high prediction error values. These trials occurred during early extinction, early recall, reacquisition and the reextinction phase.
CS visual stimuli
Visual stimuli were displayed on an fMRI monitor (BOLD-screen 32, Cambridge Research Systems Ltd., Rochester, UK) placed at the end of the scanner bore which was projected to the participant using a mirror system (Figure 5B). Two pictures of black geometric figures (a square and a diamond shape) on a gray background were used as CS+ and CS-. Both CS were presented for 6 s. The period of the CS was shortened from 8 s used in our previous study18 to 6 s to decrease the length of the experiment, while still being able to assess SCRs4. In the periods between visual stimuli a fixation cross was shown (time of ITI: varies between 9 s and 13.5 s).
US aversive stimulus
In reinforced CS+ trials, the visual stimulus co-terminated with the presentation of the aversive US, an electric shock (100 ms) applied to the first dorsal interosseous muscle area of the right hand via a 6.5 mm concentric surface electrode (WASP electrode, Specialty Developments, Bexley, UK). The US was produced by a high voltage DC stimulator (DS7A, Digitimer Ltd., London, UK) and consisted of four consecutive 500 µs current pulses (maximum output voltage: 400 V) with an inter pulse interval of 33 ms. The location of the US electrode was kept constant across all days of measurement by marking the electrode position with a permanent marker on day 1 and 2. Before the first experimental phase began, the individual electrical stimulation intensity threshold was calibrated to be perceived as ‘very unpleasant but not painful’. To reach the threshold, the current strength was progressively increased and modulated according to each participant’s feedback. The calibrated US intensity was increased by 20 % to compensate for possible habituation to the US, which could result in a weakening of the conditioned responses as successfully done in previous studies of our group28,81. The US intensity remained the same for the three days of the experiment (mean US intensity: 3.8 (SD = 2.9) mA, range 0.6-19.2 mA).
Physiological data acquisition
During each phase, SCRs and pupil size were recorded. SCRs were acquired with appropriate hardware filters sampling at 2 kHz through an MP160 Data Acquisition Hardware unit (BIOPAC Systems Inc, Goleta, CA). Two SC electrodes were attached to the left-hand hypothenar eminence. Pupil size was measured using an MRI compatible eye tracking system (Avotec Inc, Stuart, FL). Calibration of the eye tracking system was performed prior to each phase to track gaze position on the screen.
Self-reports
After each learning phase, participants completed questionnaires using a 4-button fiberoptic response device (Current Designs, Haverford, PA). The Likert scale questionnaires assessed self-reports such as arousal (rated on a 1-9 scale from 1: “very calm” to 9: “very nervous”), valence (rated on a 1-9 scale from 1: “comfortable” to 9: “uncomfortable”), and fear (rated on a 1-9 scale from 1: “not afraid” to 9: “very afraid”) related to the visual stimuli. The questionnaires also included questions about US perception and CS-US contingency. Additionally, participants completed standardized questionnaires before the start of the experiment to assess handedness, depression, anxiety, and stress levels. All participants were screened for contraindications of 7T MRI.
Skin conductance analysis
To eliminate high frequency artefacts, skin conductance data was first low-pass filtered at 10 Hz (62nd-order Blackman FIR filter) in MATLAB (Release 2022b, RRID:SCR_001622, The MathWorks Inc., Natick, MA). SCRs were defined as the maximum trough-to peak-amplitude within a given time interval after CS onset using semi-automated peak detection implemented in a MATLAB-based EDA-Analysis App82. A minimum amplitude criterion of 0.01 μS was used as the SCR detection threshold. In each trial, SCRs were evaluated for two distinct time windows: the conditioned response within a time window of 1.0 s to 5.9 s after CS onset and the unconditioned response within a time window of 6 s to 10 s after CS onset (irrespective of whether a US was presented). To account for between-subjects variance, the resulting raw SCR amplitudes were increased by 1 μS and normalized through a logarithmic transformation (LN(1+SCR))83,84.
Pupil size analysis
Preprocessing of the raw pupil size data was performed to detect and remove blinks85. Trials with fewer than 50% of their data points remaining after blink removal were excluded from the analysis. For each trial, the baseline was computed as the mean pupil size during the 500 ms period prior to CS onset. The baseline was subtracted from the corresponding pupil size and the result was divided by the baseline to compute the pupil size response86,87. The mean pupil size response for CS+ and CS- trials were calculated in the time windows from 4 to 5.9 s and 6.0 to 8.0 s for the CS and US pupil size response, respectively. Pupil size responses during possible US presentation were excluded (5.9 to 6 s). The time window for the CS was chosen based on Jentsch et al.86 who observed the largest differentiation between CS+ and CS- 2 s prior to the US. The US time window was chosen to match the CS time window. For valid trials, a missing pupillometric data point was considered as ‘not a number value’ to not influence the pupil size mean computation87.
Behavioral statistical analysis
For physiological data, due to non-normal distribution (SCR and PSR: Shapiro-Wilk test, p < 0.001), statistical analysis was performed using the ANOVAF option in the PROC Mixed procedure in SAS Studio 3.8 (SAS Institute Inc., Cary, NC, USA). This method is suggested for addressing skewed distributions, outliers, or small sample sizes. To ensure more reliable results, an ANOVA-type statistic (ATS) was applied, with the denominator degrees of freedom set to infinity88–90 as the use of finite denominator degrees of freedom can result in increased type I errors91. The ATS was calculated for each phase (habituation, acquisition training, extinction training, recall test, reacquisition, reextinction, unexpected US phase), with stimulus type (CS+, CS-) and block (early vs. late) as within-subjects factor and SCRs or PSRs as dependent variables.
For analyses of self-reports, an ATS with repeated measures was calculated with stimulus type and phase as within-subjects factor and ratings (valence, arousal, fear and US expectancy) as dependent variables.
For behavioral results, a p<0.05 criterion was used to determine if results were significantly different from those expected if the null hypotheses were correct. Post-hoc comparisons were calculated using least square means tests and were adjusted for multiple comparisons using the Tukey-Kramer method.
Computational modeling
An artificial agent was trained to predict the likelihood of a shock for a given visual input in a virtual version of the experiment as in Batsikadze et al.28. The model was based on reinforcement learning92 and consisted of a deep neural network (DNN)93. The model hyper- parameters were fit to SCRs recorded in the experiment, which served as a read-out of the participants’ expectation of an US.
The same simplified visual stimuli for trial stimuli st and the same encoding for reinforcement signals rt were used as in Batsikadze et al.28. We employed the same network architecture, which comprised two hidden fully connected layers with 64 units each and an output layer with activation function φ. For each of the two trial sequences experienced by the participants, 25 randomly initialized agents were trained during the fitting of hyper- parameters, and 100 randomly initialized agents were trained to derive trial-by-trial values for predictions vt and prediction errors δtfrom the best fit. On each trial t, the agent stored an experience tuple et = (st, rt, δt) in memory for later replay94. The agents were trained using the backpropagation algorithm95 on batches of experiences of size b, which were sampled randomly from memory with a probability that was proportional to a priority score p:
Priority scores depended on the experiences’ recency λτ, where τ is the time passed since the experience and λ is a decay factor. Optionally, the priority could additionally depend on the magnitude of US prediction error, i.e., p = |δ|λτ 96. The parameter RPE ∊ {Yes, No} indicates whether the replay priorities also depended on the magnitude of the US prediction error. The number of replays i was varied to control the degree of learning on each trial.
While the previous model could account for ABA renewal28, it cannot account for spontaneous recovery in the AAA paradigm, because, in the model, extinction in the same context overwrites the association formed during acquisition. Hence, we extended the model by an additional replay phase, which takes place between habituation, acquisition, extinction, recall and reextinction phases, and serves to recover the initially acquired association. This replay phase prioritized experiences according to the reinforcement rkreceived in a trial and consisted of a total 100 replays of batches of size 128. Reactivation probabilities for these replays were computed as follows:
where β denotes the inverse temperature and controls the relative difference of reactivation probabilities.
Hyper-parameter fitting
We averaged SCRs from CS+ and CS- trials separately for each trial sequence and applied min- max scaling. The averaged SCRs were accordingly defined as Y̅l = (y̅+,1, . . ., y̅+,N, y̅−,1, . . ., y̅−,N), where y̅+,nand y̅−,n are the averaged SCRs for the n-th CS+ and n-th CS- presentations across all participants who completed a given trial sequence l, respectively. Analogously, the averaged US predictions of the model were defined as V̅l(b, λ, i, β, RPE, φ) = (v̅+,1, . . ., v̅+,N, v̅−,1, . . ., v̅−,N), where v̅+,n and v̅−,n are the averaged US predictions for the n-th CS+ and CS- presentations across all model instances who were trained on a given trial sequence l, respectively. The goodness of fit was defined as:
where wl is the number of participants who experienced trial sequence l. To ensure that the overall learning curve in the model resembled that of the participants, we added the following penalty terms, if the model failed to
learn that CS+ is followed by the where v̅Acq,End is the average US prediction over the last 2 CS+ presentations of acquisition training.
show the CR at the start of extinction training: where v̅Ext,start is the average US prediction over the first 2 CS+ presentations of extinction training.
successfully extinguish the CR during extinction training: where v̅Ext,End is the average US prediction over the last 2 CS+ presentations of extinction training.
show renewal of the CR: where v̅ is the average US prediction over the first 2 CS+ presentations of recall.
successfully extinguish the CR during recall: where v̅Rec,Start is the average US prediction over the last 2 CS+ presentations of recall
A grid search was conducted over the hyper-parameter sets shown in Table 1. The model with the best goodness-of-fit was then chosen to derive trial-by-trial values for US predictions and prediction errors. Prediction errors were used for parametric modulation of fMRI data to test hypotheses related to prediction error processing of the cerebellum and VTA.
MRI acquisition
All MRI imaging data was collected while participants were lying supine in the 7T MRI scanner (MAGNETOM Terra, Siemens Healthineers AG, Forchheim, Germany). A 1-channel transmit/32-channel receive head RF coil (Nova Medical Inc., Wilmington MA, USA), was used. Two dielectric pads for signal homogenization were placed on either side of each participant’s upper neck97. Depending on head size, further cushioning was added to prevent participant head movement and discomfort.
Whole brain functional MRI acquisition was performed with a 3-dimensional echo planar image sequence98 with an isotropic voxel size of 1.5 mm. The sequence was run for each phase (habituation, 98 volumes; acquisition training, 354 volumes; extinction training, 350 volumes; recall test, 182 volumes; reacquisition, reextinction and unexpected US phases; 938 volumes).
Imaging parameters were as follows: TR/TE, 1620/20 ms; flip angle, 11°; phase encoding acceleration factor, 2; 3D acceleration factor, 3; slice partial Fourier factor, 7/8; acquisition matrix, 140 × 140; number of slices, 96.
After acquisition training on day 1, a transversal QSM ASPIRE sequence was acquired99,100 with an isotropic voxel size of 0.7 mm. Further imaging parameters were as follows: TR/TE1/TE2/TE3/TE4/TE5, 28/5/10/15/20/25 ms; flip angle 15°; phase encoding acceleration factor, 3; slice partial Fourier factor, 6/8; acquisition matrix, 320 × 280; number of slices, 256; TA, 10:52 min.
After the acquisition of the QSM ASPIRE sequence, a sagittal MP2RAGE sequence including fat navigators101–103 was acquired with an isotropic voxel size of 0.75 mm. Further imaging parameters were as follows: TR/TE, 6000/1.85 ms; TI1/TI2, 800/2750 ms; flip angles 1/2, 4°/5°; phase encoding acceleration factor, 3; acquisition matrix, 340 × 340; number of slices, 256; TA, 14:58 min.
Image processing
Motion correction of MP2RAGE volumes including fat navigators was performed using offline reconstruction with the Retro-MoCo toolbox for MATLAB provided by David Gallichan (version23 0.9.0dev, https://github.com/dgallichan/retroMoCoBox.git). A T1 map was generated from the motion corrected MP2RAGE with the MP2RAGE-utils package implemented in MATLAB (release 1.0, https://github.com/srikash/MP2RAGE-utils). The MP2RAGE was normalized to MNI-space using the CAT12 (CAT12, release 1450)104 toolbox in SPM12 (Wellcome Department of Cognitive Neurology, London, UK).
Functional MRI volumes were brain extracted with BET (Brain Extraction Tool)105 in FSL (Release 6.0.1, RRID:SCR_002823, Centre for Functional MRI of the Brain, Oxford, UK). Motion and distortion correction was performed for each fMRI run with ANTs (Version 2.3.5, RRID:SCR_004757, University of Pennsylvania, Philadelphia, USA)106. A 5-volume fMRI sequence preceding each run with phase encoding in the opposite direction was used for the distortion correction. All volumes were coregistered to a T1 map derived from the acquired MP2RAGE with ANTs. The CAT12 processing of the MP2RAGE was used to normalize fMRI data. A 4.5 mm Gaussian kernel was used for smoothing. Motion nuisance regressors (3 translations, 3 rotations) were derived from ANTS affine transformation matrices output.
fMRI analysis
The 1st level analysis was done in MNI space, defining conditions for both the CS and US window (i.e., CS+, CS-, US post CS+, no US post CS+, no US post CS-) separately for each phase. Regressors were calculated from the conditions in SPM12 by convolving a canonical hemodynamic response function with events, which were modelled as delta functions in an event-based design (i.e., the duration of each event is 0 s). Beta weights were fitted for each regressor by a general linear model for each voxel fMRI time series. Contrasts were computed as linear combinations of beta weights. Contrasts included main effects of conditions, CS differentiation (CS+ > CS-, CS- > CS+), unexpected omission of the US (no US post CS+ > no US post CS-) and US presentation (US post CS+ > no US post CS-). Values in contrast maps were converted to t-statistic values in first level analysis. Contrast maps are combined and used in a one-sample t-test for second level analysis in SPM12. Tests for significance (p<0.05) were done after threshold-free cluster enhancement (TFCE toolbox in SPM12, R174, http://dbm.neuro.uni-jena.de/tfce/) and family-wise error (FWE) corrections. Any trend-level results refer to significance tests without TFCE and FWE corrections (p<0.05). For the cerebellar cortex, results were visualized on cerebellar flatmaps using the SUIT toolbox in SPM12 20. For the cerebellar nuclei and VTA, results were visualized in coronal slices and 3D renderings generated in MRIcroGL107 using custom probabilistic atlases (described in “Volume of interest definition”). To acquire cerebellar anatomical region labels, activation maps were projected onto the SUIT atlas volume (Cerebellum-SUIT.nii, Diedrichsen, 2006).
Event-based analysis
In addition to main effects of conditions (e.g., during acquisition training), events were additionally separated over time for each experimental phase. For fear acquisition training, extinction training, recall test, reacquisition, reextinction and the unexpected US phase, events were grouped in two equal-size blocks representing early and late halves of those phases (e.g., the 8 first CS+ trials of fear acquisition training correspond to early acquisition and the 8 last CS+ trials of fear acquisition training correspond to late acquisition). Trials with unexpected events—such as unexpected US presentations or omissions—were treated as separate single trial regressors. These included the initial three trials of acquisition training, extinction training, recall test, reacquisition and reextinction and specific events such as the three unexpected US omissions in reacquisition. Notably, all single trials comprised CS+ trials, with CS- trials also being modeled as single trials to facilitate paired analysis.
Parametric modulation analysis
Parametric modulation of fMRI data was performed with prediction and prediction error values in all learning phases derived from our computational model. CS+ and CS- events were modulated with prediction values, while US omission and presentation events were modulated with prediction error values (Figure 5C). Events were not further divided within phases (e.g., no early or late halves or separation of the first three trials). Modulations were done separately for each phase, except for reacquisition, reextinction and the unexpected US phase which were modulated together as they were part of the same fMRI run.
Volumes of interest (VOI) definition
A global conjunction between unexpected US omission parametric modulation contrasts during extinction, the recall test and reacquisition was multiplied with a cerebellum mask and showed a region in lobule VI and Crus I. This region was used as a cerebellar cortex (lobule VI and Crus I) volume of interest (VOI). Deep cerebellar nuclei (DCN) and VTA VOIs were drawn for each participant. Drawing of the DCN (i.e., the left and right dentate, globose, emboliform and fastigial nuclei) was done by an expert annotator using both MP2RAGE and QSM information in ITK-SNAP (version 3.8.0)109. The nuclei of the DCN were combined into a single bihemispheric VOI. Drawing of the VTA was done by adjusting estimated masks with both MP2RAGE and QSM information, using the red nuclei, substantia nigra, the cerebral aqueduct and interpeduncular fossa as landmarks. Initial masks were estimated for the VTA by inversely transforming the probabilistic atlas by Trutti et al.76 thresholded at a probability of p>0.4. All drawn masks were transformed into MNI space. After transformation to MNI space, a probabilistic atlas was generated by averaging binary masks for each structure across participants. The probabilistic atlas was thresholded so that the volume of each structure matched the average volume across participants. For drawing of globose, emboliform and fastigial nuclei, the thresholded atlas was inversely transformed from MNI to T1 space in order to add missing nuclei for each participant with lower quality QSM data. The VTA VOI was used as a seed region in the PPI analysis. In addition, the cerebellar cortex (CB), DCN and VTA VOIs were used as nodes in the 2-node dynamic causal modelling (DCM) networks. The DCN and VTA VOIs were used as masks for displaying fMRI activations in coronal slices and 3D renderings.
Functional connectivity: PPI
For the PPI analysis, the VTA VOI was used to extract time series with SPM12 in MNI space. Time series were used as physiological regressors, and psychophysiological regressors were added for each condition using the gPPI toolbox110. Both first- and second-level analyses were performed with the same contrasts as defined in the parametric modulation (e.g., no US x prediction error during extinction) analysis. To summarize connectivity in the parametric modulation analysis, all contrasts related to unexpected US omissions were combined in a single contrast within the gPPI toolbox.
Functional connectivity: DCM
To calculate dynamic causal models (DCMs), the cerebellar cortex (CB) and VTA VOIs were used to extract time series in a concatenated version of the event based and parametric modulation models using SPM12111–113. Each condition was defined as input to the CB and VTA nodes (i.e., CS+, CS-, US post CS+, no US post CS+, no US post CS-), while each prediction or prediction error related event-based contrast (i.e., CS+ > CS-, no US post CS+_first3 > no US post CS-) or parametric modulation (i.e., no US prediction error) could modulate the reciprocal intrinsic connections. Recurrent connections were active but were not modulated. Parametric Empirical Bayes (PEB) was used for analysis on the second level114. All DCM and PEB analyses were done within SPM12.
Supplementary information
Pupil size responses (PSRs)
During habituation on day 1, PSRs related to the CS+ and CS- did not differ significantly (non- parametric ANOVA-type statistic, main effects, all p>0.2).
During fear acquisition training on day 1, PSRs were significantly higher in CS+ trials than in CS- trials, and PSR was more pronounced in early compared to late trials. Non-parametric ANOVA-type statistics revealed a significant main effect of Stimulus (CS+ vs. CS-; F1 = 39.51; p<0.001) and Time (early vs. late; F1 = 24.31; p<0.001). No significant Stimulus x Time interaction was observed (F1 = 0.58; p=0.45).
During extinction training on day 2, PSRs for CS+ significantly decreased early vs late but there were no differences between early and late trials for CS-. Non-parametric ANOVA-type statistics revealed a significant main effect of Time (early vs. late; F1 = 18.21; p<0.001) and a significant Stimulus x Time interaction (F1 = 4.28; p=0.04). No significant main effect of Stimulus was observed (CS+ vs. CS-; F1 = 1.12; p=0.29). Post-hoc tests showed that PSRs related to the CS+ were significantly lower during late extinction compared to early (least squares means test; p<0.001). This result was not observed for PSRs related to the CS- (least squares means test; p=0.64).
During the recall test on day 3, PSRs for the CS+ were significantly higher than for the CS- in both early and late trials, though the difference was less pronounced in late trials. The PSRs for the CS+ were significantly higher in early compared to late trials. Non-parametric ANOVA- type statistics revealed a significant effect of Stimulus (CS+ vs. CS-; F1 = 6.59; p=0.01) and Time (early vs. late; F1 = 6.41; p=0.01) and a significant Stimulus x Time interaction (F1 = 4.87; p=0.03). Post-hoc tests showed that PSRs related to the CS+ were significantly higher than towards the CS- early but not late (least squares means test; early p<0.05; late p=0.86) and that the PSRs related to the late CS+ were significantly lower than early (least squares means test; p<0.05)
During reacquisition on day 3, PSRs for CS+ were significantly higher in early trials, but no significant difference in PSRs for CS- early vs late was observed. PSRs for CS+ and CS- were significantly different between early and late. Non-parametric ANOVA-type statistics reveal a significant effect of Stimulus (CS+ vs. CS-; F1 = 47.87; p<0.001), Time (early vs. late; F1 = 4.54; p=0.03) and Stimulus x Time interaction (F1 = 7.7; p=0.006). Post-hoc tests using the least squares means test showed that PSRs related to the CS+ were higher early in the phase (least square means test; p=0.004) but not the CS- (least square means test; p=0.97). Stimulus types were significantly different both early and late (least square means test; CS+ vs. CS-; all: p<0.008).
During reextinction on day 3, PSRs for CS+ were significantly higher than for CS- early, but the difference became insignificant in late trials. Non-parametric ANOVA-type statistics revealed a significant main effect of Stimulus (CS+ vs. CS-; F1 = 9.1; p=0.003) and a Stimulus x Time interaction (F1 = 4.43; p=0.04), but not a significant main effect of Time (early vs. late; F1 = 0.83, p=0.36). Post-hoc test showed a significant difference between the CS+ and CS- early but not late (least squares means test; early: p=0.01; late: p=0.77).
During the unexpected US phase on day 3, PSRs for the CS+ were higher than for the CS-. Non- parametric ANOVA-type analysis revealed a significant main effect of Stimulus (CS+ vs. CS-; F1 = 17.35; p<0.001), but not of Time (early vs. late; F1 = 0.83; p=0.363) and the Stimulus x Time interaction (F1 = 0.03; p=087)
Acknowledgements
We would like to thank our technician Beate Brol for the drawing and adjustment of DCN and VTA masks. Additionally, we would like to thank Greta Wippich for providing helpful illustrations. The MAGNETOM Terra 7T MRI system used in the study was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation), grant number 432647511. This project has received funding from the DFG (project number 316803389 – SFB1280), the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No 956414 and an individual scholarship for Patrick Pais Pereira from the Hans-Böckler-Stiftung.
References
- 1.Uncertainty and anticipation in anxiety: An integrated neurobiological and psychological perspectiveNat. Rev. Neurosci 14:488–501
- 2.Dopamine: from prediction error to psychotherapyTransl. Psychiatry 10
- 3.Fear Extinction as a Model for Translational Neuroscience: Ten Years of ProgressAnnu. Rev. Psychol 63:129–151
- 4.Don’t fear ‘fear conditioning’: Methodological considerations for the design and analysis of studies on human fear acquisition, extinction, and return of fearNeurosci. Biobehav. Rev 77:247–285
- 5.A dopaminergic switch for fear to safety transitionsNat. Commun 9:1–11
- 6.Dopamine neurons drive fear extinction learning by signaling the omission of expected aversive outcomesElife 7:1–25
- 7.A Dopaminergic Basis for Fear ExtinctionTrends Cogn. Sci 23:274–277
- 8.Role of Inhibition in Exposure TherapyJ. Exp. Psychopathol 3:322–345
- 9.Neuronal Coding of Prediction ErrorsAnnu. Rev. Neurosci 23:473–500
- 10.Functional architecture of dopamine neurons driving fear extinction learningNeuron 111:3854–3870
- 11.Cerebellar granule cells encode the expectation of rewardNature 544:96–100
- 12.Classical conditioning drives learned reward prediction signals in climbing fibers across the lateral cerebellumElife 8:1–21
- 13.Predictive and reactive reward signals conveyed by climbing fiber inputs to cerebellar Purkinje cellsNat. Neurosci 22:950–962
- 14.Cerebellar contributions to the papez circuitJ. Neurosci. Res 2:133–146
- 15.Whole-Brain Mapping of Direct Inputs to Midbrain Dopamine NeuronsNeuron 74:858–873
- 16.Cerebellar modulation of the reward circuitry and social behaviorScience (80-.) 363
- 17.Homologous organization of cerebellar pathways to sensory, motor, and associative forebrainCell Rep 36
- 18.The cerebellum is involved in processing of predictions and prediction errors in a fear conditioning paradigmeLife 8https://doi.org/10.7554/eLife.46831
- 19.Dissociating pain from its anticipation in the human brainScience (80-.) 284:1979–1981
- 20.Surface-based display of volume-averaged cerebellar imaging dataPLoS One 10:1–18
- 21.VTA-projecting cerebellar neurons mediate stress-dependent depression-like behaviorsElife 11:1–38
- 22.Resting Amygdala and Medial Prefrontal Metabolism Predicts Functional Activation of the Fear Extinction CircuitAm. J. Psychiatry 169:415–423
- 23.Cerebellar contributions to different phases of visceral aversive extinction learningCerebellum 13:1–8
- 24.Proximal threats promote enhanced acquisition and persistence of reactive fear-learning circuitsProc. Natl. Acad. Sci. U. S. A 117:16678–16689
- 25.Temporal dynamics of fMRI signal changes during conditioned interoceptive pain-related fear and safety acquisition and extinctionBehav. Brain Res 427
- 26.Learning about pain: The neural substrate of the prediction error for aversive eventsProc. Natl. Acad. Sci. U. S. A 97:9281–9286
- 27.Brain response to visceral aversive conditioning: A functional magnetic resonance imaging studyGastroenterology 128:1819–1829
- 28.The cerebellum contributes to context-effects during fear extinction learning: A 7T fMRI studyNeuroimage 253
- 29.The cerebellum and fear extinction: evidence from rodent and human studiesFront. Syst. Neurosci 17:1–8
- 30.Neural signatures of human fear conditioning: An updated and extended meta-analysis of fMRI studiesMol. Psychiatry 21:500–508
- 31.Prediction error for free monetary reward in the human prefrontal cortexNeuroimage 23:777–786
- 32.The human cerebellum in reward anticipation and outcome processing: An activation likelihood estimation meta- analysisNeurosci. Biobehav. Rev 149
- 33.A hierarchical atlas of the human cerebellum for functional precision mappingNat. Commun 15
- 34.Triple representation of language, working memory, social and emotion processing in the cerebellum: convergent evidence from task and seed-based resting-state fMRI analyses in a single large cohortNeuroimage 172:437–449
- 35.Definition and characterization of an extended multiple-demand networkNeuroimage 165:138–147
- 36.A theory of Pavlovian conditioning: The effectiveness of reinforcement and non-reinforcementClass. Cond. Curr. Res. Theory
- 37.A model for Pavlovian learning: Variations in the effectiveness of conditioned but not of unconditioned stimuliPsychol. Rev 87:532–552
- 38.Attentional bias for uncertain cues of shock in human fear conditioning: Evidence for attentional learning theoryFront. Hum. Neurosci 11:1–13
- 39.Distinct signals in medial and lateral VTA dopamine neurons modulate fear extinction at different timesElife 9:75–103
- 40.L-DOPA modulates activity in the vmPFC, nucleus accumbens, and VTA during threat extinction learning in humansElife 10:1–21
- 41.BOLD responses reflecting dopaminergic signals in the human ventral tegmental areaScience (80-.) 319:1264–1267
- 42.Mesolimbic functional magnetic resonance imaging activations during reward anticipation correlate with reward-related ventral striatal dopamine releaseJ. Neurosci 28:14311–14319
- 43.Reward signals in the cerebellum: Origins, targets, and functional implicationsNeuron 110:1290–1303
- 44.Interpreting the BOLD signalAnnu. Rev. Physiol 66:735–769
- 45.Neurophysiological investigation of the basis of the fMRI signalNature 412:150–157
- 46.What we can do and what we cannot do with fMRINature 453:869–878
- 47.Selective recruitment of the cerebellum evidenced by task-dependent gating of inputseLife 13https://doi.org/10.7554/eLife.96386.3
- 48.Non-Linear Frequency Dependence of Neurovascular Coupling in the Cerebellar Cortex Implies Vasodilation–Vasoconstriction CompetitionCells 11
- 49.Granular layer neurons control cerebellar neurovascular coupling through an NMDA receptor/NO-dependent systemJ. Neurosci 37:1340–1351
- 50.Investigations of brain-wide functional and structural networks of dopaminergic and CamKIIα-positive neurons in VTA with DREADD-fMRI and neurotropic virus tracing technologiesJ. Transl. Med 21:1–14
- 51.Functional imaging of the human dopaminergic midbrainTrends Neurosci 32:321–328
- 52.Dopaminergic and non-dopaminergic neurons in the ventral tegmental area of the rat project, respectively, to the cerebellar cortex and deep cerebellar nucleiNeuroscience 51:719–728
- 53.Purkinje cell dopaminergic inputs to astrocytes regulate cerebellar- dependent behaviorNat. Commun 14
- 54.Exploring monosynaptic midbrain-to-deep cerebellar nuclei projectionsAbstracts, Neuroscience 2024
- 55.Dopamine Neurons Reflect the Uncertainty in Fear GeneralizationNeuron 100:916–925
- 56.Acute Aversive Stimuli Rapidly Increase the Activity of Ventral Tegmental Area Dopamine Neurons in Awake MiceNeuroscience 386:16–23
- 57.Signed and unsigned reward prediction errors dynamically enhance learning and memoryElife 10:1–28
- 58.A framework for mesencephalic dopamine systems based on predictive Hebbian learningJ. Neurosci 16:1936–1947
- 59.Adaptive Critics and the Basal GangliaModels of Information Processing in the Basal Ganglia The MIT Press :215–232https://doi.org/10.7551/mitpress/4708.003.0018
- 60.Prediction signals in the cerebellum: Beyond supervised motor learningeLife 9https://doi.org/10.7554/eLife.54073
- 61.Meta-analysis of human prediction error for incentives, perception, cognition, and actionNeuropsychopharmacology 47:1339–1349
- 62.Mild deficits in fear learning: Evidence from humans and mice with cerebellar cortical degenerationeneuro 29https://doi.org/10.1523/ENEURO.0365-23.2023
- 63.Emotions and the Right Hemisphere: Can New Data Clarify Old Models?Neuroscientist 25:258–270
- 64.Brain lateralization: A comparative perspectivePhysiol. Rev 100:1019–1063
- 65.Praising others differently: Neuroanatomical correlates to individual differences in trait gratitude and elevationSoc. Cogn. Affect. Neurosci 13:1225–1234
- 66.Brain functional alterations in patients with anorexia nervosa: A meta-analysis of task-based functional MRI studiesPsychiatry Res 327
- 67.A meta-analysis of cerebellar contributions to higher cognition from PET and fMRI studiesHum. Brain Mapp 35:593–615
- 68.Functional topography in the human cerebellum: A meta-analysis of neuroimaging studiesNeuroimage 44:489–501
- 69.Explicit and Implicit Emotion Processing in the Cerebellum: A Meta-analysis and Systematic ReviewCerebellum 22:852–864
- 70.Hemispheric lateralization of attention processes in the human brainCurr. Opin. Psychol 29:90–96
- 71.Psychophysiological and Modulatory Interactions in NeuroimagingNeuroimage 6:218–229
- 72.Tools of the trade: Psychophysiological interactions and functional connectivitySoc. Cogn. Affect. Neurosci 7:604–609
- 73.Imaging the deep cerebellar nuclei: A probabilistic atlas and normalization procedureNeuroimage 54:1786–1794
- 74.Comparing functional MRI protocols for small, iron-rich basal ganglia nuclei such as the subthalamic nucleus at 7 T and 3 THum. Brain Mapp 38:3226–3248
- 75.A high-resolution probabilistic in vivo atlas of human subcortical brain nucleiSci. Data 5
- 76.A probabilistic atlas of the human ventral tegmental area (VTA) based on 7 Tesla MRI dataBrain Struct. Funct 226:1155–1167
- 77.7T fMRI study on extinction learning in young and healthy participantsCent. Open Sci https://doi.org/10.17605/OSF.IO/2PXWE
- 78.A power calculation guide for FMRI studiesSoc. Cogn. Affect. Neurosci 7:738–742
- 79.The short-form version of the Depression anxiety stress scales (DASS-21): Construct validity and normative data in a large non-clinical sampleBr. J. Clin. Psychol 44:227–239
- 80.The assessment and analysis of handedness: The Edinburgh inventoryNeuropsychologia 9:97–113
- 81.Interaction of fear conditioning with eyeblink conditioning supports the sensory gating hypothesis of the amygdala in meneNeuro 7:1–15
- 82.EDA-Analysis App (5.11)Zenodo https://doi.org/10.5281/zenodo.7965376
- 83.Electrodermal Activity: Second EditionSpringer Science + Business Media
- 84.Electrodermal activityTech. Psychophysiol 54
- 85.Preprocessing pupil size data: Guidelines and codeBehav. Res. Methods 51:1336–1342
- 86.Temporal dynamics of conditioned skin conductance and pupillary responses during fear acquisition and extinctionInt. J. Psychophysiol 147:93–99
- 87.Safe and sensible baseline correction of pupil-size dataBehav. Res. Methods :94–106
- 88.Nonparametric analysis of longitudinal data in factorial experimentsWiley Series in Probability and Mathematical Statistics http://ci.nii.ac.jp/ncid/BA54899817
- 89.Nonparametric analysis of ordinal dataPhytopathology 94:33–43
- 90.nparLD : An R Software Package for the Nonparametric Analysis of Longitudinal Data in Factorial ExperimentsJ. Stat. Softw 50
- 91.Greenhouse-Geisser adjustment and the ANOVA-type statistic: Cousins or twins?Am. Stat 63:239–246
- 92.Reinforcement Learning: An IntroductionMIT press
- 93.Deep learningNature 521:436–444
- 94.Self-improvement Based On Reinforcement Learning, Planning and TeachingMachine Learning Proceedings 321:323–327
- 95.Learning representations by back- propagating errorsNature 323:533–536
- 96.Prioritized Experience ReplayarXiv https://arxiv.org/abs/1511.05952
- 97.Quantitative assessment of the effects of high-permittivity pads in 7 Tesla MRI of the brainMagn. Reson. Med 67:1285–1293
- 98.Segmented K-space blipped-controlled aliasing in parallel imaging for high spatiotemporal resolution EPIMagn. Reson. Med 85:1540–1551
- 99.Computationally Efficient Combination of Multi-channel Phase Data From Multi-echo Acquisitions (ASPIRE)Magn. Reson. Med 79:2996–3006
- 100.Improved susceptibility weighted imaging at ultra-high field using bipolar multi-echo acquisition and optimized image processing: CLEAR-SWINeuroimage 237
- 101.Optimizing the acceleration and resolution of three- dimensional fat image navigators for high-resolution motion correction at 7TMagn. Reson. Med 77:547–558
- 102.MP2RAGE, a self bias-field corrected sequence for improved segmentation and T1-mapping at high fieldNeuroimage 49:1271–1281
- 103.Retrospective correction of involuntary microscopic head movement using highly accelerated fat image navigators (3D FatNavs) at 7TMagn. Reson. Med 75:1030–1039
- 104.CAT: a computational anatomy toolbox for the analysis of structural MRI dataGigascience 13
- 105.BET2: MR-based estimation of brain, skull and scalp surfacesElev. Annu. Meet. Organ. Hum. Brain Mapping
- 106.Large-scale evaluation of ANTs and FreeSurfer cortical thickness measurementsNeuroimage 99:166–179
- 107.Spatial normalization of brain images with focal lesions using cost function maskingNeuroimage 14:486–500
- 108.A spatially unbiased atlas template of the human cerebellumNeuroimage 33:127–138
- 109.User-guided 3D active contour segmentation of anatomical structures: Significantly improved efficiency and reliabilityNeuroimage 31:1116–1128
- 110.A generalized form of context- dependent psychophysiological interactions (gPPI): A comparison to standard approachesNeuroimage 61:1277–1286
- 111.Dynamic causal modellingNeuroimage 19:1273–1302
- 112.Ten simple rules for dynamic causal modelingNeuroimage 49:3099–3109
- 113.A guide to group effective connectivity analysis, part 1: First level analysis with DCM for fMRINeuroimage 200:174–190
- 114.A guide to group effective connectivity analysis, part 2: Second level analysis with PEBNeuroimage 200:12–25
Article and author information
Author information
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
Copyright
© 2025, Nio et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 22
- downloads
- 0
- citations
- 0
Views, downloads and citations are aggregated across all versions of this paper published by eLife.