Distinct signals in medial and lateral VTA dopamine neurons modulate fear extinction at different times
Abstract
Dopamine (DA) neurons are thought to encode reward prediction error (RPE), in addition to other signals, such as salience. While RPE is known to support learning, the role of salience in learning remains less clear. To address this, we recorded and manipulated VTA DA neurons in mice during fear extinction. We applied deep learning to classify mouse freezing behavior, eliminating the need for human scoring. Our fiber photometry recordings showed DA neurons in medial and lateral VTA have distinct activity profiles during fear extinction: medial VTA activity more closely reflected RPE, while lateral VTA activity more closely reflected a salience-like signal. Optogenetic inhibition of DA neurons in either region slowed fear extinction, with the relevant time period for inhibition differing across regions. Our results indicate salience-like signals can have similar downstream consequences to RPE-like signals, although with different temporal dependencies.
Introduction
A critical function of VTA DA neurons is to signal reward prediction error (RPE), or the difference between experienced and expected reward (Schultz et al., 1997; Cohen et al., 2012). This signal supports reinforcement learning (Steinberg et al., 2013; Chang et al., 2016; Tsai et al., 2009; Witten et al., 2011; Parker et al., 2016; Saunders et al., 2018; Zweifel et al., 2009; Kim et al., 2012; Stopper et al., 2014; Adamantidis et al., 2011). However, rather than uniformly representing a scalar RPE, there is recent appreciation that VTA DA neurons can display heterogeneous and spatially organized signals. For example, we recently demonstrated that during navigation-based decision making, there are spatially segregated representations within VTA DA neurons of a variety of sensory, motor, and cognitive variables (Engelhard et al., 2019). Others have demonstrated neural correlates of salience-like signals in certain DA neurons (de Jong et al., 2019; Saddoris et al., 2015; Wang and Tsien, 2011; Gore et al., 2014; Yuan et al., 2019; Cho et al., 2017; Aitken et al., 2016).
Although most work examining neural correlates of behavior in VTA DA neurons has focused on reward-based tasks, several studies have recorded VTA DA neuron activity during aversive associations (Robinson et al., 2019; Lutas et al., 2019; Wang and Tsien, 2011; Mileykovskiy and Morales, 2011). In particular, VTA DA neurons were shown to represent RPE-like signals during fear extinction, in that they display elevated activity when the shock is omitted at the offset of the cue, signaling better-than-expected outcome (Salinas-Hernández et al., 2018; Badrinarayan et al., 2012; Jo et al., 2018). Manipulation of this activity altered the rate of extinction, suggesting that an RPE-like signal in DA neurons drives reinforcement learning for aversive associations as well as rewarding associations (Salinas-Hernández et al., 2018; Luo et al., 2018).
Here, we first seek to determine if in addition to RPE-like signals, there are also heterogeneous and spatially organized signals in VTA DA neurons during fear extinction. One possibility is that there are neural correlates of salience, and not only RPE, in subregions of the VTA during fear extinction. Salience can be considered an unsigned prediction error - in other words, it is the absolute value of RPE (Bromberg-Martin et al., 2010; Rutledge et al., 2010). During fear extinction, a neural correlate of salience should have elevated activity during the tone that has been paired with a footshock, and elevated activity at the tone offset, when the expected footshock is not presented.
Spatially segregated RPE versus salience signals in VTA DA neurons would provide a scenario to determine whether or not such distinct signals have similar or different effects on learning, an important question that has not been definitively answered in any behavioral paradigm. One hypothesis is that a salience signal may support learning, similar to an RPE signal. Alternatively, salience signals may acutely modify behavior rather than drive learning.
An advantage of utilizing fear extinction to address these questions is that it provides a continuous readout of learning via a mouse’s freezing, allowing us to examine the precise temporal relationship between DA and the expression of learned behavior. However, one limitation of freezing as a readout of learning is that it traditionally requires hand scoring when mice are tethered to neural headgear, as existing software is not designed to dissociate tether movement from mouse movement (Luyten et al., 2014; Shoji et al., 2014). The need for human labeling has often restricted the analysis of freezing to specific epochs, such as the presentation of the conditioned stimuli.
To overcome this limitation, and provide an automated and unbiased measure of freezing, we developed an analysis pipeline that uses deep learning to automatically identify freezing behavior. We combined this approach with fiber photometry to characterize the spatial distribution of RPE and salience correlates across the medial-lateral axis of the VTA during fear extinction. Finally, we performed optogenetic inhibition of DA neurons in each VTA subregion to assess if, when and how these signals affect fear extinction.
Results
Development and application of a convolutional neural network (CNN) to identify freezing behavior
We developed an analysis pipeline based on a convolutional neural network (CNN) to identify freezing behavior in mice. The CNN was initialized on the pre-trained ResNet18 architecture (He et al., 2016), and further trained on ‘difference images,’ the pixel-by-pixel intensity difference between consecutive pairs of frames. The rationale for inputting difference images to the CNN was to capture frame-by-frame motion. Each difference image was human-labeled as 1 or 0 to signify ‘freeze’ or ‘no freeze,’ and the network learned to predict labels for new difference images (Figure 1A–C, Figure 1—figure supplement 1A). We trained a CNN for each of two different experimental chambers (fear conditioning chamber and fear extinction chamber) and two different neural headgears (fiber photometry and optogenetics). Each classifier achieved optimal training within 50 training epochs (Figure 1—figure supplement 1B–E) and yielded 92–96% accuracy, 5–10% false positive rate (FPR) and 4–6% false negative rate (FNR) (Figure 1D). This was comparable to the relative performance between two humans: given one person’s scoring held as ground truth, the other person scored with 95% accuracy, 11.5% FPR and 0.3% FNR (Figure 1—figure supplement 1F).
We compared the CNN performance with the popular proprietary software FreezeFrame (Figure 1E–G) on an additional 33,000 frames from several mice. Since FreezeFrame produces a second-by-second readout of freezing, we calculated the mean freezing for each second from both the CNN and from human-labeled frames to create a comparable second-by-second readout. We found that the CNN better reflected the human observer than FreezeFrame (Figure 1E). The CNN and human observer yielded a correlation of 0.96 (Pearson correlation, Figure 1F), while FreezeFrame and human observer yielded a correlation of 0.85 (Pearson correlation, Figure 1G). In comparison, the correlation between two human observers was 0.98 (Pearson correlation, Figure 1—figure supplement 1F).
Taken together, our pipeline provides an automatic, fast and effective method for scoring freezing. In this paper, we used the CNN to analyze over 500 hrs of behavioral data during fear conditioning and extinction, which would have been prohibitively time consuming without an automated approach.
Neural activity in medial and lateral VTA DA neurons during fear extinction correlates with RPE and salience, respectively
We performed fear conditioning and extinction (Figure 2A) while simultaneously performing fiber photometry to record from VTA DA neurons. On day 1, mice were presented with ten tones of 20 s duration (‘habituation’), followed by ten 20 s tones that coterminated with a 1 s, 0.5 mA foot shock (‘conditioning’). On days 2 to 4, mice were presented with twenty-one 20 s tones alone each day (‘extinction’). Mice froze very little during habituation, quickly increased freezing during conditioning, and slowly decreased freezing to the tone over three days of extinction (Figure 2B).
We sought to determine if in VTA subregions, DA neurons correlated with RPE or salience. During fear extinction, we expect a neural correlate of RPE to have suppressed activity during a tone that has been paired with a foot shock to signal worse than expected outcome, and elevated activity during the tone offset to signal better than expected outcome because the footshock was omitted. A neural correlate of salience, which can be considered an unsigned prediction error, or the absolute value of RPE, should instead have elevated activity during the tone that has been paired with a footshock, and elevated activity at the tone offset, when the shock was unexpectedly omitted (Figure 2C).
To perform fiber photometry recordings from VTA DA neurons, we expressed the calcium indicator GCaMP6f in DA neurons by crossing Dat-Cre mice with Ai148D mice (see Materials and methods; Engelhard et al., 2019). We targeted the recording fiber to either medial or lateral VTA (Figure 3A–C, Figure 3—figure supplement 1A; medial VTA group: n = 10 mice, lateral VTA group: n = 11 mice). We chose these subregions because of the electrophysiological, anatomical and functional evidence that there is a medial/lateral distinction within the VTA (Lammel et al., 2014; Beier et al., 2015; Yang et al., 2018; Engelhard et al., 2019; de Jong et al., 2019). We found that both regions showed increased activity to reward (Figure 3—figure supplement 2).
During habituation and fear conditioning, medial and lateral VTA DA neuron signals were similar, but could not be easily explained as purely RPE or salience (Figure 3—figure supplement 1D,E top two rows, and F-H). GCaMP6f fluorescence in both regions decreased throughout the tone during habituation and conditioning. Both regions also showed increased fluorescence to the shock (Figure 3—figure supplement 1B,C).
During fear extinction, we observed that medial versus lateral VTA cell bodies preferentially reflected RPE versus salience, respectively (Figure 3D–F, Figure 3—figure supplement 1D,E bottom three rows). Medial VTA DA neuron activity was consistent with RPE (Figure 2C): GCAMP6f fluorescence decreased throughout the tone that had been associated with a negative outcome, and increased at the offset of the tone, during the omission of the expected shock (Figure 3E; percentile rank of the mean GCaMP6f fluorescence every second relative to shuffled data: p<0.01 after Bonferroni correction for multiple time point comparison). Both of these signals diminished in magnitude throughout extinction, as expected with RPE.
In contrast, during fear extinction, lateral VTA DA neuron activity was more consistent with salience (Figure 2C). GCaMP6f fluorescence increased at the tone onset, consistent with an unsigned prediction error (Figure 3F; percentile rank of the mean GCaMP6f fluorescence every second relative to shuffled data: p<0.01 after Bonferroni correction for multiple time point comparisons). GCaMP6f fluorescence also increased at the tone offset, consistent with the idea that the omission of the shock is salient. Both these signals also diminished throughout extinction, as the tone loses its salience.
To control for the possibility that the lateral VTA signal could be coming from the adjacent SNc, in a separate group of mice we recorded from SNc and found different activity profiles (Figure 3—figure supplement 3).
To control for possible artifacts in these recordings, we recorded from DAT-Cre mice expressing Cre-dependent GFP (AAV5-DIO-eGFP or AAV5-DIO-eYFP) in medial or lateral VTA. There was little modulation in the signal in the control mice. The exception was the time of the shock, which generated depressed fluorescence, which was the opposite of the increased fluorescence observed with GCaMP6f in medial and lateral VTA (Figure 3—figure supplement 4). This suggests that our conclusions above were not due to a recording artifact.
To determine whether dopamine neuron activity in medial and lateral VTA would be similar to dopamine axon activity in the striatum, we recorded from dopamine axons in the nucleus accumbens core (NAcC) and dorsal medial striatum (DMS) using the same fiber photometry technique during the same fear conditioning and extinction behavior. We found that dopamine axons in the nucleus accumbens and DMS both showed RPE during fear extinction, similar to dopamine cell bodies in medial VTA (Figure 3—figure supplement 5).
Medial and lateral VTA DA neuron activity during extinction correlate distinctly with freezing on a trial-by-trial basis and across animals
We next characterized the correlation between freezing and GCaMP6f fluorescence during fear extinction in medial and lateral VTA, both on a trial-by-trial basis, and across animals (Figure 4).
In the medial VTA DA neurons, we observed a negative correlation between freezing during the tone and GCaMP6f fluorescence during the tone onset of the same trial (Figure 4A; one sample t-test p=0.021, n = 10 mice), but a positive correlation between freezing during the tone and GCaMP6f fluorescence at the tone offset of the same trial (Figure 4A; one sample t-test p<10−4, n = 10 mice). These correlations are consistent with the interpretation that medial VTA DA encodes an RPE-like signal during fear extinction. Specifically, the negative correlation between fluorescence during the tone and freezing is consistent with the idea that the degree of inhibition in DA during the tone reflects the learned negative association with the tone (Mileykovskiy and Morales, 2011). Similarly, the positive correlation between DA after the tone and freezing is consistent with the idea that the fluorescence reflects the degree of ‘relief’ that the animal experiences when the expected shock is omitted. These same correlations that we observed across trials were also evident when correlating trial-averaged medial VTA GCaMP6f fluorescence and freezing across animals (Figure 4B,C). Mice with lower average GCaMP6f fluorescence during the tone onset or higher average GCaMP6f fluorescence at the tone offset froze more to the tone (Pearson’s correlation between GCaMP6f 5 s after tone onset and freezing during tone: r = −0.644, p=0.044; GCaMP6f 5 s after tone offset and freezing during tone: r = 0.768, p=0.010).
In contrast, in the lateral VTA, we observed a positive correlation across trials between freezing throughout the tone and GCaMP6f during the tone onset for each trial (Figure 4D; one sample t-test p<10−4, n = 11 mice), but not the tone offset (Figure 4D; one sample t-test p=0.071). This is consistent with the interpretation that fluorescence during the tone in lateral VTA reflects the salience of the tone, given that more salient stimuli should elicit more freezing. We observed similar trends across mice (Figure 4E,F; Pearson’s correlation between GCaMP6f during tone onset and freezing during tone: r = 0.583, p=0.060. Pearson’s correlation between GCaMP6f after tone offset and freezing during tone: Pearson’s correlation, r = 0.665, p=0.025).
In contrast to these medial and lateral VTA trial-by-trial correlations between GCaMP6f and freezing, any correlation between GCaMP6f and change in freezing across trials was much more subtle (Figure 4—figure supplement 1).
At the tone offset, inhibition of medial but not lateral VTA DA neurons slows fear extinction
To investigate whether medial or lateral VTA DA activity contributes to extinction learning, we used optogenetics to inhibit these subregions at specific time points. We injected Cre-dependent NpHR (AAV2/5 DIO-eNpHR3.0-EYFP) or YFP control virus (AAV2/5 DIO-EYFP) into the VTA of DAT-Cre mice and bilaterally implanted optic fibers above medial or lateral VTA.
We first confirmed the efficacy of photoinhibition using whole-cell recordings in acute slices (Figure 5A-D). We additionally confirmed photoinhibition in vivo in each subregion by performing real-time place aversion (RTPA) (Figure 5—figure supplement 1A-C; medial VTA cohort: NpHR group n = 15, YFP group n = 10. Lateral VTA cohort: NpHR group n = 20, YFP group n = 18), as prior reports show inhibiting VTA DA causes conditioned place aversion (Calcagnetti and Schechter, 1991; Schechter and Meechan, 1994; Tan et al., 2012). Inhibiting either medial or lateral VTA DA caused real time place aversion to the inhibition chamber (Figure 5—figure supplement 1D; 2-factor ANOVA with group and subregion as factors: group effect: F(1,1) = 103.33, p = 10−14; subregion effect: F(1,1) = 0.32, p = 0.576; group × subregion interaction: F(1,1) = 0.01, p = 0.935) and decreased velocity during inhibition (Figure 5—figure supplement 1E; 2-factor ANOVA with group and subregion as factors: group effect: F(1,1) = 14.35, p < 10−3; subregion effect: F(1,1) = 0.1, p = 0.754; group × subregion interaction: F(1,1) = 0, p = 0.9441).
We next examined the effect of VTA DA neuron inhibition during fear extinction in separate cohorts of mice with fibers in medial or lateral VTA; each cohort had separate groups of NpHR- or YFP-virus injected mice (Figure 5E–K, Figure 5—figure supplement 2A–B; medial VTA cohort: NpHR group n = 14, YFP group n = 10. Lateral VTA cohort: NpHR group n = 8, YFP group n = 8). Both cohorts underwent fear conditioning, and during fear extinction, received inhibition lasting for 6 s starting from the last second of the tone to 5 s after the tone off.
Inhibiting medial VTA DA neurons at tone offset yielded an effect consistent with the representation of RPE in this subregion: NpHR mice froze more to the tone compared to the YFP controls (Figure 5F-H; 2-factor ANOVA with group and tone number as factors, tone number being a repeated measure: group effect: F(1,62) = 7.528, p = 0.012; group × tone number interaction: F(1,62) = 1.056 p = 0.361). This was consistent with previous work that showed inhibiting medial VTA during cue offset impairs extinction learning (Luo et al., 2018; Salinas-Hernández et al., 2018). In addition to attenuating extinction of tone-induced freezing, NpHR mice increased freezing during and after the inhibition period compared to YFP controls (2-factor ANOVA with group and tone number as factors, tone number being a repeated measure. During 6 s inhibition period: group effect: F(1,62) = 17.668, p < 10−3; group × tone number interaction: F(1,62) = 1.371 p = 0.032. During 6 s after inhibition period: group effect: F(1,62) = 28.082, p < 10−4; group × tone number interaction: F(1,62) = 0.854 p = 0.782.) Thus, not only do medial VTA DA neurons lead to updating of the value (or freezing behavior) upon subsequent presentations of the tone that preceded inhibition, they also modify behavior subsequent to the inhibition period.
In contrast to the effects observed with medial VTA inhibition, inhibiting the lateral VTA DA neurons at tone offset yielded no change in freezing during the tone (Figure 5I-K; 2-factor ANOVA with group and tone number as factors, tone number being a repeated measure: group effect: F(1,62) = 0.002, p = 0.964; group × tone number interaction: F(1,62) = 1.121 p = 0.249). This manipulation did, however, increase freezing after the inhibition period, an effect that increased with extinction (2-factor ANOVA with group and tone number as factors, tone number being a repeated measure: group effect: F(1,62) = 6.981, p = 0.019; group × tone number interaction: F(1,62) = 1.480 p = 0.011). This suggests that lateral VTA inhibition primarily affects freezing at time points after inhibition, rather than causing learned changes in the value of preceding events.
We performed analogous optogenetic experiments in mice with fibers targeting the SNc (Figure 5—figure supplement 3A-C and Figure 5—figure supplement 2D, NpHR group n = 7, YFP group n = 6). Similar to the lateral VTA cohort, inhibiting SNc DA neurons at tone offset did not have an effect on tone freezing. However, SNc NpHR mice showed decreased freezing during the ITI relative to the YFP group (Figure 5—figure supplement 3D; 2-factor ANOVA with group and tone number as factors, tone number being a repeated measure: group effect: F(1,62) = 4.980, p = 0.047; group × tone interaction: F(1,62) = 1.276, p 0.081), which we did not observe with the lateral VTA cohort. This suggests that lateral VTA and SNc inhibition during the cue offset may not have identical consequences.
Inhibition of lateral VTA at the tone onset slows extinction learning
We next inhibited lateral VTA DA neurons during fear extinction at the tone onset, given that is when the fiber photometry recordings showed a pronounced salience-like signal in that subregion (Figure 6A, B, Figure 5—figure supplement 2C; NpHR group n = 12, YFP group n = 10; 6 s inhibition period). We found that NpHR mice extinguished more slowly to the tone. This is reflected in that there was no main effect of group, but a significant interaction between group and tone number (Figure 6C, D; 2-factor ANOVA with group and tone number as factors, tone number being a repeated measure: group effect: F(1,62) = 0.014, p = 0.908; group × tone number interaction: F(1,62) = 2.217, p < 10−6). Additionally, we observed that NpHR mice showed decreased freezing during the ITI (Figure 6D; 2-factor ANOVA with group and tone number as factors, tone number being a repeated measure: group effect: F(1,62) = 6.477, p = 0.019; group × tone number interaction: F(1,62) = 1.562 p = 0.004).
Together with the results from the previous experiments, this suggests that inhibiting the salience-like signal at tone onset slows extinction learning over time, even though activity of these neurons at tone offset does not update the value of the preceding tone.
Discussion
DA neurons originating in the VTA are known to be modulated by aversive stimuli, and have been implicated in fear conditioning and extinction (El-Ghundi et al., 2001; Young et al., 1993; Inoue et al., 2000; Nader and LeDoux, 1999; Guarraci and Kapp, 1999; Holtzman-Assif et al., 2010; Delgado et al., 2008; Luo et al., 2018; Salinas-Hernández et al., 2018; Zweifel et al., 2011; Pezze and Feldon, 2004; Mueller et al., 2010; Pignatelli et al., 2017; Nasehi et al., 2016; Pezze et al., 2003; Budygin et al., 2012; Robinson et al., 2019; Lammel et al., 2011; Jo et al., 2018; Lutas et al., 2019; Fadok et al., 2009; Groessl et al., 2018; Bouchet et al., 2018; Wenzel et al., 2018; Wang and Tsien, 2011; Mileykovskiy and Morales, 2011). However, it was unclear if and how neural correlates of fear extinction are topographically organized within the VTA. In addition, the causal contribution of spatially localized DA activity within the VTA to fear extinction was unknown. Here, we found that during fear extinction, medial VTA more closely resembled RPE, while lateral VTA more closely resembled salience. While activity in both subregions contributed causally to fear extinction, the temporal relationship between activity and freezing differed. Consistent with the idea that RPE signals update the value of preceding events, inhibition in medial VTA prevented the update of value (or freezing response) to the tone preceding the inhibition, causing freezing to be more likely on subsequent tone presentations. In contrast, inhibition of the salience-like signal in lateral VTA affected freezing in the time period during or immediately following the inhibition, but did not update the value (or freezing response) to subsequent presentations of the tone if it preceded the inhibition.
Development and application of a CNN to identify freezing behavior
Learned fear is typically quantified by measuring a mouse’s freezing. When mice are tethered to neural headgear, existing automated algorithms cannot accurately dissociate movement of the mouse from movement of the tether. To solve this problem, we turned to deep learning, and used an off-the-shelf CNN architecture, ResNet, known for its applications in image classification and pose estimation (Nath et al., 2019; Insafutdinov et al., 2016; He et al., 2016; Mathis et al., 2018). Our CNN allowed accurate and automated classification of freezing behavior throughout the duration of our experiments with minimal labor, and enabled us to determine that the precise temporal relationship between dopamine neuron activity and freezing behavior depended on VTA subregion.
Spatial organization of RPE and salience-like signals in the medial versus lateral VTA during fear extinction
Our finding of RPE-like signals in medial VTA and salience-like signals in lateral VTA during fear extinction may reflect a larger organizational structure across both the VTA and the substantia nigra pars compacta (SNc) dopamine neurons. Specifically, there is previous evidence that the SNc, which is lateral to the VTA, has a greater proportion of DA neurons that signal salience rather than RPE (Matsumoto and Hikosaka, 2009; Menegas et al., 2017). One question is how the presence of salience signals in lateral VTA and SNc may relate to other work showing correlates of kinematics in addition to RPE in those same regions (Engelhard et al., 2019; da Silva et al., 2018; Howe and Dombeck, 2016). One possible relationship between these seemingly disconnected findings of salience versus kinematic tuning is that an animal’s speed is a reflection of the motivational salience of an environment (Zenon et al., 2016; Panigrahi et al., 2015; Wang et al., 2013).
It is worth noting that this medial/lateral distinction between RPE and salience does not hold across all behaviors (Heymann et al., 2020). In fact, we found that during fear conditioning, medial and lateral VTA shared similar activity profiles, which may reflect heterogeneity in single neuron activity that is not captured with fiber photometry. Furthermore, it is not obvious how to integrate our spatial findings with recent DA terminal recordings within the NAc, despite the rough topography between VTA cell bodies and their projections (Yang et al., 2018; de Jong et al., 2019; Beier et al., 2015). In particular, recent studies reported increased activity to aversive stimuli in ventromedial NAc, and decreased activity to aversive stimuli in lateral NAc (de Jong et al., 2019; Yuan et al., 2019). Assuming topography between VTA cell bodies and their terminals in NAc, we may have expected the opposite result. This discrepancy could be due to a number of factors, including imperfect topography between cell bodies and terminals, differences in the behavioral paradigm, or terminal activity that does not reflect cell body activity (Berke, 2018; Threlfell et al., 2012).
Distinct temporal relationships between activity in the medial and lateral VTA and fear extinction
Our manipulations of medial VTA DA neurons produced changes in extinction learning that were consistent with the presence of an RPE-like signal in that subregion. Specifically, the burst of DA at the omission of the expected shock contributed causally to extinction learning, a finding which aligned with previous work (Salinas-Hernández et al., 2018; Luo et al., 2018).
In addition, this manipulation increased freezing in the time period immediately after the inhibition, suggesting that not only do these neurons regulate learning about the value of a preceding event (the tone), they also affect freezing behavior in the time period subsequent to the manipulation.
While our results in medial VTA were consistent with evidence that RPE signals in DA neurons support reinforcement learning in a variety of paradigms (Steinberg et al., 2013; Witten et al., 2011; Chang et al., 2016; Zweifel et al., 2009; Tsai et al., 2009; Kim et al., 2012; Stopper et al., 2014; Adamantidis et al., 2011), the causal contribution of salience signals in DA neurons to behavior is much less clear. By leveraging the spatial organization we uncovered between RPE versus salience signals in medial versus lateral VTA, we could directly examine the causal role of salience-like signals. One possibility is that, similar to RPE, salience signals in lateral VTA may also support learning (Bromberg-Martin et al., 2010).
We found that inhibition of the lateral VTA at the tone offset did not influence extinction learning of the preceding tone. This was consistent with the previous observation that inhibition of a subpopulation of VTA neurons with projections to NAc Core did not affect extinction (Luo et al., 2018). However, similar to medial VTA, this manipulation increased freezing in the timepoints directly after the inhibition, suggesting that despite the fact that these neurons did not regulate learning about the value of a preceding event (the tone), they did affect behavior in the time period subsequent to the manipulation. Consistent with this manipulation affecting subsequent timepoints, inhibition at the tone onset, when these neurons have a burst of activity, slowed the rate of extinction. However, we cannot rule out that this manipulation affected the expression of freezing rather than extinction learning. Additionally, this lateral VTA tone onset inhibition had more subtle effects on behavior relative tomedial VTA tone offset inhibition. .
In summary, during fear extinction, medial and lateral VTA DA neurons provide distinct but complementary signals that contribute to extinction learning, but at different times. Medial VTA encodes an RPE-like signal, which serves to update the value of the preceding tone. Lateral VTA encodes a salience-like signal at the tone onset, which does not update the value of the preceding tone, but affects freezing during the tone. This is consistent with the emerging framework that despite differences in neural correlates, different VTA/SNc DA subpopulations share a common function of mediating learning, even though they may differ in the specific setting in which they contribute to learning, or the specific aspect of learning that they mediate (Ellwood et al., 2017; Saunders et al., 2018; Cox and Witten, 2019; Bromberg-Martin et al., 2010; Menegas et al., 2018).
Materials and methods
Animals
All experiments followed guidelines established by the National Institutes of Health and reviewed by Princeton University Institutional Animals Care and Use Committee (IACUC). Dat-Cre mice with IRES-Cre inserted just 3' of the termination colon of the Slc6a3 gene that encodes the dopamine transporter (B6.SJL-Slc6a3tm1.1(cre)Bkmn/J, Jackson Labs) were bred in house with Ai148D mice (Ai148(TIT2L-GC6f-ICL-tTA2), Jackson Labs), and male progeny expressing GCaMP6f in dopamine neurons (Dat-Cre x Ai148D mice) were used for fiber photometry experiments. Male Dat-Cre mice were used for optogenetic experiments and electrophysiology recordings. Mice were group housed with up to five other mice, allowed ad libitum access to food and water, and kept on a 12 hr light on and 12 hr light-off schedule. Mice between 7–10 weeks of age were used during surgery. We conducted all surgery and behavioral experiments during light off period.
Stereotactic surgeries
Request a detailed protocolMice were induced to a surgical plane of anaesthesia using 4% isoflurane, and maintained at. 0.5–2% isoflurane for the duration of the surgery. Mice were kept warm using a heating pad, and breathing rate was monitored by the surgeon. At surgery onset, mice received i.p. injections of nonsteroidal anti-inflammatory drug meloxicam (2 mg/kg) and the antibiotic baytril (5 mg/kg). Twenty-four hours post surgery, mice received a second equivalent dose of meloxicam.
For fiber photometry experiments, we used transgenic Dat-Cre x Ai148D male mice which expressed GCaMP6f in dopamine neurons. Optic fibers of 400 µm core diameter (Mono Fiberoptic Cannula, Doric Lenses) were implanted unilaterally either in the medial VTA (-3.1 mm posterior, +/-0.5 mm lateral, -4.5 mm ventral, n=10 mice), lateral VTA (-3.1 mm posterior, +/- 0.75 mm lateral, -4.2 mm ventral, n=11 mice), or SNc (-2.9 posterior, +/-1.25 lateral, -4.2 ventral relative to Bregma n = 2 mice), NAcC (+1.2 anterior, +/-1.0 lateral, -4.5 ventral relative to Bregma, n = 7 mice) or DMS (+0.74 anterior, +/-1.4 lateral, -2.6 ventral relative to Bregma, n = 13 mice). Fibers were affixed to the skull using dental cement (C&B-Metabond). To control for motion artifacts in fiber photometry experiments, we used a nanosyringe (World Precision Instruments) to inject Dat-Cre mice with 600 nL of virus containing Cre-dependent green or yellow fluorescent protein (AAV2/5-CAG-FLEX-eGFP-WPRE-bGH (Allen Institute) or AAV2/5-EF1a-DIO-eYFP-hGHpA (UPenn or Princeton Vector Core)) into the VTA (-3.1 mm posterior, +/-. 5 mm lateral, -4.75 mm ventral). Virus was injected bilaterally and needle remained in position post injection for 6 min to allow injection to disperse before removal. We implanted optic fibers in the medial or lateral VTA as mentioned above (medial VTA n = 4 mice, lateral VTA n = 4 mice).
For optogenetic experiments, we used a nanosyringe (World Precision Instruments) to deliver 600 nL of virus containing Cre-dependent halorhodopsin AAV2/5-EF1a-DIO-eNpHR3.0-EYFP-WPRE-hGHpA (UPenn or Princeton vector core, 1013 to 1014 parts/ml) or the control virus AAV2/5-EF1a-DIO-eYFP-WPRE-hGHpA (Upenn and Princeton vector core, 1013 to 1014 parts/ml) into the brain. Virus was injected bilaterally into the medial VTA (−3.1 mm posterior, +/- 0.5 mm lateral, −4.75 mm ventral, n = 26 mice, needle bevel pointed posterior), lateral VTA (−3.1 mm posterior, +/- 0.75 mm lateral, −4.75 mm ventral, n = 28 mice, needle bevel pointed posterior), or SNc (−2.9 mm posterior, +/- 0.75 mm lateral, −4.70 mm ventral, n = 13 mice, needle bevel pointed lateral); each injection was 600 nL in volume and injected at 100 nL/min, and the needle remained in position for 6 min post injection to allow injection to disperse before removal. Mice were then bilaterally implanted respectively with optic fibers (300 um core diameter) in the medial VTA (−3.1 mm posterior, +/- 0.1.09 mm lateral, −4 mm ventral at 10 degree angle; no-angle exact coordinates −3.1 mm posterior, +/- 0.5 mm lateral, −4.5 ventral), lateral VTA (−3.1 mm posterior, +/- 0.1.69 mm lateral, −4 mm ventral at 10 degree angle; no-angle exact coordinates: −3.1 mm posterior, +/- 0.75 mm lateral, −4.2 mm ventral), or SNc (−2.9 mm posterior, −1.9 mm lateral, −3.8 mm ventral at 10 degree angle; no angle coordinates: −2.9 mm posterior, −1.25 mm lateral, −4.0 mm ventral relative to Bregma). Optic fibers were affixed to the skull using dental cement.
Behavioral assays
Auditory fear conditioning and extinction
Request a detailed protocolFollowing a minimum of 6 days to recover from cranial surgery, mice 8–12 weeks of age underwent auditory fear conditioning and extinction. We performed this assay using FreezeFrame (Coulbourn Instruments), and throughout the assay gray-scaled video was recorded at 11.2 Hz while mice were tethered to either fiber photometry or optogenetic cables for neural recording or manipulation, respectively. On day one, mice were placed in a chamber (10 cm x 10 cm) with a conductive metal floor and a smell of ethanol. Mice were habituated to ten 20 s tones (5 kHz, 70 dB), followed by ten 20 s tones that coterminated with a 1 s foot shock administered through the metal floor (1 mA, scrambled). On days two to four, mice underwent extinction in a different but similarly sized experimental chamber with plastic floor and walls, and a different smell (quatricide). During extinction, mice heard 21 tones per day without the shock. For all experiment days, the inter-trial interval (ITI) between tones was jittered with a Poisson distribution with a mean of 80 s; the ITI was randomly selected prior to all experiments and was the same for each mouse.
Real-time place preference (RTPP)
Request a detailed protocolPrior to, or following the fear conditioning and extinction assay, mice underwent real-time place preference assay, counter-balanced between groups. Mice were connected to optical fibers and individually placed in a rectangular enclosure with two chambers (28 cm x 28 cm each), and allowed to freely explore the two chambers for 20 min. Gray-scaled video was recorded at 30 fps and mouse location and velocity was tracked throughout experiment using Ethovision XT 9. For the first 5 min, the mice freely explored both chambers but they did not receive optical stimulation (‘baseline’). For the next 15 min, mice continued to have access to both chambers, but the presence in one chamber resulted in continuous delivery of laser light (594 nm, 6 mW, Cobalt) to the implanted optic fibers. The ‘light on’ chamber was randomly assigned and counterbalanced within each experimental group (NpHR or YFP).
Random reward delivery
Request a detailed protocolMice with regular ad libitum access to food and water were put in a 21 cm ×18 cm modular operant chamber (MED Associates, ENV-307W). 12 uL of sweetened condensed milk (Carnation, diluted 1:10) were delivered 5-10 times at random intervals ranging from 60 to 200 s. Mouse licking was detected if the mouse’s body closed the loop on an open circuit, where one end of the circuit was the metal floor of the operant chamber, and the other end of the circuit was the reward delivery spout; the licking signal was acquired at 25 Hz. Fiber photometry fluorescence signal was also acquired at 25 Hz (instead of 100 Hz in all other fiber photometry recordings).
Fiber photometry experiments
Request a detailed protocolOne to two weeks after surgery, mice individually underwent fear conditioning and extinction during fiber photometry recording of VTA DA neurons expressing GCaMP6f. Mice were connected to a fiber photometry set up described in previous reports (Gunaydin et al., 2014). A 488 nm laser light (Micron Technology) was filtered (FL488, Thor Labs) then passed through a dichroic mirror (MD498, Thor Labs) and traveled through a patch cable (Mono Fiberoptic Patchcord, 400 um core, 0.48 Numerical Aperture, Doric Lenses) coupled via a ceramic split sleeve (2.5 mm diameter, Precision Fiber Products) to the optic fiber implanted in the mouse brain; the light traveled down the optic fiber into the VTA for fluorescence excitation. Laser light delivery was controlled by a lock-in amplifier (Ametek, 7265 Dual Phase DSP Lock-in Amplifier), which delivered light at 210.999 Hz, and the laser intensity at the tip of the patch cable was approximately 5 uW. Fluorescent emission from GCaMP6f at 500–550 nm was then passed through the same patch cable, filtered (MF525-39, Thor Labs), and passed through the same dichroic mirror into a photodetector (Model 2151, New Focus), and the signal was filtered at the same 210.999 Hz using the same lock-in amplifier, and a time constant of 20 ms. AC gain on the lock-in amplifier was set to 0 dB. Signal was digitized at 100 Hz. dF/F was calculated by the following formula:
Where is the second order polynomial fit to the GCaMP6f signal for the duration of the experimental session, in order to account for very slow decline of the signal over time, presumably caused by photobleaching. Z-score of dF/F signal was calculated across experiment days for each mouse; the mean used for Z-score is the mean signal across all 4 experiment days, and standard deviation used for Z-score is the standard deviation across all 4 days. To obtain shuffled data, GCaMP6f fluorescence from each session was circularly shifted across the session in time by a random time bin (n = 1000 shuffles unless otherwise specified). The Bonferroni corrections used in Figure 3E, F, Figure 3—figure supplement 1G, H, Figure 3—figure supplement 3C; Figure 3—figure supplement 4D,E, were corrected by 40 time points to account for the 40 1-second-long bins. The Bonferroni corrections used in Figure 3—figure supplement 2 were corrected with 10 time points to account for the 10 1-second-long bins.
One lateral VTA fiber photometry mouse was excluded from analysis because the fiber location was at the SNc border.
Optogenetic experiments
Request a detailed protocolApproximately 1 month after opsin or control virus injection, mice underwent fear conditioning and extinction. Littermates within a cage were randomly allocated to either NpHR or YFP group, with as equal distribution as possible. (Ie. in a cage with four mice: two mice were in NpHR group, two mice in YFP group. In a cage with five mice, two mice were in NpHR group and three mice in YFP group or vice versa.) Masking was not used during group allocation or data collection. This ensured that NpHR and YFP mice were interleaved during the experimental sessions and counterbalanced in different behavioral chambers. Masking was a central part of the behavioral analysis: the CNN scores freezing the same regardless of the mouse’s experimental group.
Medial or lateral VTA dopamine neurons were inhibited using continuous 594 nm laser illumination (~6 mW, Cobalt) during tone offset or tone onset during extinction (Figures 5E and 6A). Tone onset inhibition lasted for 6 s and started concurrent to the onset of the tone. Tone offset inhibition also lasted 6 s and started 1 s prior to the ending of the tone and lasted for 5 s after the tone offset. Video of the behavior was acquired as normal, with the addition of a 400–650 nm filter (Kentek) in front of the camera lens to prevent laser light from appearing in the video.
Prior to or following fear extinction, the same mice underwent the real time place preference (RTPP) assay with the same laser intensity (Figure 5—figure supplement 1). For medial VTA tone off and lateral VTA tone on cohorts, an equal number of mice received RTPP before versus after fear conditioning and extinction. For lateral VTA tone off cohorts, all mice received RTPP after the fear conditioning and extinction assay.
No explicit power analysis was used to determine sample size for fiber photometry experiments. We decided the sample size (n is approximately 10 for each group) based on what is typically seen in the literature. The sample size of approximately N of 10 per group are biological replicates, where biological replicate means an individual mouse. For two of the three optogenetic cohorts, two different rounds of mice were run on separate occasions and data was combined for the full cohort. For the third cohort, all mice were run at once. All mice had at least one fiber in the lateral VTA (from histology images) and were included in statistical analysis. No outliers were discarded. From all cohorts, 1 NpHR mouse was removed from fear extinction analysis because the optic fiber connection slipped off during extinction day 3.
For the data in Figure 5, Figure 6 and Figure 5—figure supplement 3, a 2-factor ANOVA with group (NpHR or YFP) and tone number (repeated measure: 1 to 63 extinction tones) as factors was used to predict either (1) freezing during the tone (calculated as mean freezing for 1 to 19 s of the tone), (2) freezing during the 6 s laser offset inhibition period (calculated as the mean freezing for 1 s prior to tone offset through 5 s after tone offset), (3) freezing 6 s post inhibition period (calculated as mean freezing 6–11 s after tone offset) or (4) freezing for 30 s during the ITI (calculated as 12–41 s after tone offset). For the data in Figure 5—figure supplement 1B–E, a 2-factor ANOVA with group (NpHR or YFP) and subregion (medial VTA or lateral VTA) was used to determine significance.
Correlation between GCaMP6f and freezing
Request a detailed protocolNo explicit power analysis was used to determine sample size for fiber photometry experiments. We decided the sample size (N is approximately 10 for each group) based on what is typically seen in the literature. No outliers were discarded. For each fiber photometry cohort, 1–4 mice were run on each occasion and all data was aggregated to create the cohorts for the paper. For trial-by-trial analysis, we calculated the Pearson correlation coefficient between GCaMP6f and freezing during extinction for each mouse. Since there were 63 total extinction trials across three extinction days, each mouse had 63 values for mean freezing during the tone, correlated with 63 values for mean GCaMP6f fluorescence for 5 s after tone onset or 5 s after tone offset. We next gathered the correlation coefficients from each region (medial or lateral VTA) and GCaMP6f time point (5 s after tone onset or tone offset) and used a one sample t-test to determine if they significantly differed from 0 (Figure 3A and D). For across animal analysis, we used Pearson’s correlation coefficient and resulting p-value to find the correlation between mean freezing per mouse with mean GCaMP6f (either 5 s after tone onset or after tone offset) per mouse, we drew the best fit line using estimates from a generalized linear regression model (Figure 3B,C,E,F).
Histology
Request a detailed protocolTo confirm the location of viral targeting and optical fiber implant, mice were anesthetized with Euthasol (.1 mL/mouse), perfused with 10 mL of phosphate buffered saline (PBS) followed by 10 mL of 4% paraformaldehyde (PFA) in PBS, their brains extracted, post-fixed with 4% PFA for 24 hrs, then stored in a 30% sucrose in PBS solution for at least 24 hrs before slicing. Brains were sliced coronally at 40 μm, and relevant VTA slices were stained for tyrosine hydroxylase (TH), a marker for dopamine neurons (primary antibody: Chicken Anti-TH, Aves Labs; secondary antibody: Alexa Fluor 647 Donkey Anti-Chicken IgG, Jackson ImmunoResearch) and GFP, to enhance viral expression (primary antibody: GFP Recombinant Rabbit Monoclonal Antibody, Thermo Fisher Scientific; secondary antibody: Alexa Fluor 488 Donkey Anti-Rabbit IgG, Life Technologies). Slices were then mounted with Fluoromount-G with DAPI (Thermo Fisher Scientific) to determine the location of cell nuclei.
Ex vivo electrophysiology recordings to relate optogenetic inhibition with decreased currents and spiking activity
Request a detailed protocolTo confirm optogenetic inhibition of VTA DA cells, we performed ex vivo electrophysiology in Dat-Cre mice. Coronal slices containing the VTA were prepared from 3 to 4 month old male Dat-Cre mice approximately 4 weeks after injecting with AAV5-DIO-NpHR-eYFP virus. Mice were deeply anaesthetized with an intraperitoneal injection of euthasol (0.06 ml per 30 g) and decapitated. After extraction, the brain was immersed in ice-cold carbogenated N-methyl-D-glucamine (NMDG) artificial cerebrospinal fluid (ACSF) (92 mM NMDG, 2.5 mM KCl, 1.25 mM NaH2PO4, 30 mM NaHCO3, 20 mM HEPES, 25 mM glucose, 2 mM thiourea, 5 mM Na-ascorbate, 3 mM Na-pyruvate, 0.5 mM CaCl2·4H2O, 10 mM MgSO4·7H2O and 12 mM N-acetyl-L-cysteine) for 2 min. Afterwards, coronal slices (300 μm) were sectioned using a vibratome (VT1200s, Leica) and then incubated in NMDG ACSF at 34°C for 15 min. Slices were then transferred into a holding solution of HEPES ACSF (92 mM NaCl, 2.5 mM KCl, 1.25 mM NaH2PO4, 30 mM NaHCO3, 20 mM HEPES, 25 mM glucose, 2 mM thiourea, 5 mM Na-ascorbate, 3 mM Na-pyruvate, 2 mM CaCl2·4H2O, 2 mM MgSO4·7H2O and 12 mM N-acetyl-L-cysteine, bubbled at room temperature with 95% O2, 5% CO2) for at least 45 min until recordings were performed. Whole cell recordings were performed using a Multiclamp 700B (Molecular Devices, Sunnyvale, CA) using pipettes with a resistance of 3–5 MΩ filled with a potassium-based internal solution containing 120 mM potassium gluconate, 0.2 mM EGTA, 10 mM HEPES, 5 mM NaCl, 1 mM MgCl2, 2 mM Mg-ATP and 0.3 mM NA-GTP, with the pH adjusted to 7.2 with KOH. DA neurons in the VTA were identified for recordings based on YFP expression. Photostimulation parameters were 560 nm and 5 mW/mm2. Neurons were held at −70 mV during photocurrent measurements. To confirm the ability of photocurrents to eliminate action potentials, action potentials were induced by a positive current injection (120 pA, 100 ms pulse duration, 5 Hz).
CNN
Request a detailed protocolWe introduce an open-source pipeline to automate freezing analysis of behavioral videos. The main motivation and advantage of this pipeline is automation of freezing analysis, even when mice are tethered to neural headgear -- our analysis does not confuse headgear movement for mouse movement. Drawbacks to this technique include necessity for manual scoring to train the network (it took approximately seven hours to score the 33,000 images we used to train each network), and different networks must be trained for different backgrounds and neural headgear. Additionally, having a graphics processing unit (GPU) greatly speeds up network training (1–2 hrs to train 200 epochs on a 320 nVidia P100 GPU vs. 1 week on a 32 GB RAM, Intel Core i7 CPU) and using the network for analysis (10 min to run 25,000 frames on a 320 nVidia P100 GPU vs. 1 hr on a 32 GB RAM CPU).
Training and using the network involved: (1) Human labeling of freezing behavior, (2) Generating images for CNN input, (3) Training the CNN, (4) Assessing CNN accuracy and precision, and (5) Using network for analysis. Custom MATLAB software was developed for parts (1 - 2), and custom Python software was developed for parts (3 - 5). Code for all steps is available on GitHub: https://github.com/neurocaience/deepfreeze/ (Cai et al., 2020; copy archived at https://github.com/elifesciences-publications/deepfreeze).
Human Labeling of Freezing behavior
First, behavioral videos were broken down into individual frames. Next, random pairs of consecutive frames were selected for human labeling as ‘1’ (‘freeze’ - no mouse movement occurred between frames) or ‘0’ (‘no freeze’ - mouse movement occurred between frames). Scored images were saved and processed in step (2). To ensure that all possible mouse movements are represented, we recommend using behavioral videos from random time points of at least 7 to 20 mice for each context. In particular, the fear conditioning context requires 15–20 mice due to low variability of movements within mice resulting from highly prevalent freezing. Mice in the fear extinction context are more apt to moving around, thus 5–7 mice are sufficient to capture this variability. Each network needs 30–40 k labels, or approximately 7 hrs of human labeling. The network trains best with roughly equal amounts of ‘freeze’ and ‘no freeze’ data.
Generating images for CNN input
Pairs of consecutive frames used for human labeling were condensed into a ‘difference image’:
. This difference image reflects the threshold-normalized absolute value of the pixel intensity difference between consecutive images.
Where:
is the difference image
where is the number of pixels along each axis of the image
where is an individual frame and is the frame number in the video
µ is the mean function
σ is the standard deviation function
θ is the threshold value, the same threshold is used for all calculations. In our case, we used set as approximately the top 0.05% percentile of
is a scale factor that ensures the pixel value is at a maximum of 255 (for grayscale images, pixel intensity values range from 0 to 255).
is a function that transforms a floating point number into an 8-bit integer between 0 to 255, so the can be saved into a. png file.
Training the CNN
Next, the difference images, along with their respective human labels, were split across mice into train and test sets, where one mouse’s images were either in the training or test set, but not both. This was analogous to K-fold cross validation, but where the data are partitioned by mouse rather than randomly. This ensured that similarity of behavior within a mouse did not confound accuracy of the test dataset.
The training set was then used to train a CNN (ResNet18 He et al., 2016, pretrained on ImageNet, batch size = 128, learning rate = 0.001, Adam optimization Kingma and Ba, 2014, random rotation and flip) over the course of 200 epochs; the CNN weight matrix of each epoch was saved for analysis and network selection in the next section. The CNN aimed to minimize cross entropy loss (a form of negative log likelihood) using the PyTorch module (Paszke et al., 2019).
Assessing CNN Accuracy and Precision
To ascertain which CNN epoch to use as the final classifier, we examined its test loss, accuracy, false positive rate and false negative rate. We accepted epochs when test loss plateaued, accuracy was above 90%, and false positive rate (FPR) and false negative rate (FNR) were below or around 10%. The exact epoch varried with each network, but ranged from 20–40 epochs. The CNN weight matrix from the appropriate epoch was then chosen as the classifier for behavioral analysis. Below are the methods used for calculating false positives, false negatives, true positives, true negatives, false positive rate and false negative rate:
False positives (FP): Number of frames that were classified as ‘freeze’, but labeled ‘no freeze’
False negative (FN): Number of frames that were classified as ‘no freeze’, but labeled ‘freeze’
True positives (TP): Number of frames that were classified as ‘freeze’ and labeled ‘freeze’
True negatives (TN): Number of frames that were classified as ‘no freeze’ and labeled ‘no freeze’
False positive rate (FPR):
False negative rate (FNR):
Using Network to Classify Freezing
All behavioral videos are broken down into individual frames, which were then formatted into ‘difference images’. These difference images were fed as input into a trained CNN, which then predicted an output of either ‘1’ or ‘0’, representing a ‘freeze’ or ‘no freeze’. Post-processing involved averaging the predicted label for each frame across each second of video, to obtain %freeze/s. Our videos had a frame rate of 11.2 Hz.
We trained separate networks for each of our four experimental contexts (fear conditioning vs fear extinction; fiber photometry vs. optogenetics), and used the above steps (1 - 5) to identify the frames in which mice were freezing during our behavioral videos.
Data availability
All neural recordings, behavioral data and code to reproduce figures is at: https://github.com/neurocaience/DopamineAndFear (copy archived at https://github.com/elifesciences-publications/DopamineAndFear). Code for human labeling and the CNN pipeline is at https://github.com/neurocaience/deepfreeze (copy archived at https://github.com/elifesciences-publications/deepfreeze).
References
-
Optogenetic interrogation of dopaminergic modulation of the multiple phases of reward-seeking behaviorJournal of Neuroscience 31:10829–10835.https://doi.org/10.1523/JNEUROSCI.2246-11.2011
-
Nucleus accumbens core dopamine signaling tracks the need-based motivational value of food-paired cuesJournal of Neurochemistry 136:1026–1036.https://doi.org/10.1111/jnc.13494
-
Activation of nigrostriatal dopamine neurons during fear extinction prevents the renewal of fearNeuropsychopharmacology 43:665–672.https://doi.org/10.1038/npp.2017.235
-
Conditioned place aversion following the central administration of a novel dopamine release inhibitor CGS 10746BPharmacology Biochemistry and Behavior 40:255–259.https://doi.org/10.1016/0091-3057(91)90548-G
-
Striatal circuits for reward learning and decision-makingNature Reviews Neuroscience 20:482–494.https://doi.org/10.1038/s41583-019-0189-2
-
The role of the striatum in aversive learning and aversive prediction errorsPhilosophical Transactions of the Royal Society B: Biological Sciences 363:3787–3800.https://doi.org/10.1098/rstb.2008.0161
-
Dopamine is necessary for cue-dependent fear conditioningJournal of Neuroscience 29:11089–11097.https://doi.org/10.1523/JNEUROSCI.1616-09.2009
-
Dorsal tegmental dopamine neurons gate associative learning of fearNature Neuroscience 21:952–962.https://doi.org/10.1038/s41593-018-0174-5
-
ConferenceDeep residual learning for image recognition2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).https://doi.org/10.1109/cvpr.2016.90
-
Effect of the dopamine D(1/5) antagonist SCH 23390 on the acquisition of conditioned fearPharmacology Biochemistry and Behavior 66:573–578.https://doi.org/10.1016/S0091-3057(00)00254-9
-
Reward and aversion in a heterogeneous midbrain dopamine systemNeuropharmacology 76:351–359.https://doi.org/10.1016/j.neuropharm.2013.03.019
-
A dopaminergic switch for fear to safety transitionsNature Communications 9:2483.https://doi.org/10.1038/s41467-018-04784-7
-
State-specific gating of salient cues by midbrain dopaminergic input to basal amygdalaNature Neuroscience 22:1820–1833.https://doi.org/10.1038/s41593-019-0506-0
-
Parameter optimization for automated behavior assessment: plug-and-play or trial-and-error?Frontiers in Behavioral Neuroscience 8:28.https://doi.org/10.3389/fnbeh.2014.00028
-
DeepLabCut: markerless pose estimation of user-defined body parts with deep learningNature Neuroscience 21:1281–1289.https://doi.org/10.1038/s41593-018-0209-y
-
ConferencePytorch: an imperative style, High-Performance deep learning library33rd Conference on Neural Information Processing Systems.
-
Mesolimbic dopaminergic pathways in fear conditioningProgress in Neurobiology 74:301–320.https://doi.org/10.1016/j.pneurobio.2004.09.004
-
Testing the reward prediction error hypothesis with an axiomatic modelJournal of Neuroscience 30:13525–13536.https://doi.org/10.1523/JNEUROSCI.1747-10.2010
-
Conditioned place aversion produced by dopamine release inhibitionEuropean Journal of Pharmacology 260:133–137.https://doi.org/10.1016/0014-2999(94)90329-8
-
Contextual and cued fear conditioning test using a video analyzing system in miceJournal of Visualized Experiments 8:50871.https://doi.org/10.3791/50871
-
A causal link between prediction errors, dopamine neurons and learningNature Neuroscience 16:966–973.https://doi.org/10.1038/nn.3413
-
Topography of reward and aversion encoding in the mesolimbic dopaminergic systemThe Journal of Neuroscience 39:6472–6481.https://doi.org/10.1523/JNEUROSCI.0271-19.2019
-
Dopamine Manipulation Affects Response Vigor Independently of Opportunity CostJournal of Neuroscience 36:9516–9525.https://doi.org/10.1523/JNEUROSCI.4467-15.2016
Article and author information
Author details
Funding
National Institutes of Health (T32MH065214)
- Lili X Cai
New York Stem Cell Foundation
- Ilana B Witten
Army Research Office (W911NF1710554)
- Ilana B Witten
National Institutes of Health (1R01MH106689-01A1)
- Ilana B Witten
McKnight Foundation
- Ilana B Witten
National Alliance for Research on Schizophrenia and Depression
- Ilana B Witten
National Institutes of Health (U19NS104648-01)
- Ilana B Witten
National Institutes of Health (DP2 DA035149-01)
- Ilana B Witten
National Institutes of Health (5R01MH106689-02)
- Ilana B Witten
Pew Charitable Trusts
- Ilana B Witten
National Institutes of Health (F32MH112320-03)
- Julia M Cox
National Institutes of Health (5R01DA047869)
- Ilana B Witten
New York Stem Cell Foundation (Robertson Investigator)
- Ilana B Witten
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank H Wang for assistance with labeling behavioral videos, R Witten for CNN coding proof of concepts and discussions, M Murugan, V Corbit and R Cho for comments on this paper, Y Niv for advice on this project, as well as other members of the Witten lab for their feedback, advice and support. This research was funded by NYSCF, Pew, McKnight, NARSAD, ARO W911NF1710554, NIH U19 NS104648-01, NIH DP2 DA035149-01, NIH 5R01MH106689-02, NIH 5R01DA047869 to IBW, NIH T32 MH065214 to LXC and NIH F32 MH112320-03 to JMC. IBW is a New York Stem Cell Foundation—Robertson Investigator.
Ethics
Animal experimentation: All experiments followed guidelines established by the National Institutes of Health and reviewed by Princeton University Institutional Animals Care and Use Committee (IACUC protocol 1876-18).
Copyright
© 2020, Cai et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 6,177
- views
-
- 786
- downloads
-
- 77
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Medicine
- Neuroscience
The advent of midazolam holds profound implications for modern clinical practice. The hypnotic and sedative effects of midazolam afford it broad clinical applicability. However, the specific mechanisms underlying the modulation of altered consciousness by midazolam remain elusive. Herein, using pharmacology, optogenetics, chemogenetics, fiber photometry, and gene knockdown, this in vivo research revealed the role of locus coeruleus (LC)-ventrolateral preoptic nucleus noradrenergic neural circuit in regulating midazolam-induced altered consciousness. This effect was mediated by α1 adrenergic receptors. Moreover, gamma-aminobutyric acid receptor type A (GABAA-R) represents a mechanistically crucial binding site in the LC for midazolam. These findings will provide novel insights into the neural circuit mechanisms underlying the recovery of consciousness after midazolam administration and will help guide the timing of clinical dosing and propose effective intervention targets for timely recovery from midazolam-induced loss of consciousness.
-
- Neuroscience
Motivation depends on dopamine, but might be modulated by acetylcholine which influences dopamine release in the striatum, and amplifies motivation in animal studies. A corresponding effect in humans would be important clinically, since anticholinergic drugs are frequently used in Parkinson’s disease, a condition that can also disrupt motivation. Reward and dopamine make us more ready to respond, as indexed by reaction times (RT), and move faster, sometimes termed vigour. These effects may be controlled by preparatory processes that can be tracked using electroencephalography (EEG). We measured vigour in a placebo-controlled, double-blinded study of trihexyphenidyl (THP), a muscarinic antagonist, with an incentivised eye movement task and EEG. Participants responded faster and with greater vigour when incentives were high, but THP blunted these motivational effects, suggesting that muscarinic receptors facilitate invigoration by reward. Preparatory EEG build-up (contingent negative variation [CNV]) was strengthened by high incentives and by muscarinic blockade, although THP reduced the incentive effect. The amplitude of preparatory activity predicted both vigour and RT, although over distinct scalp regions; frontal activity predicted vigour, whereas a larger, earlier, central component predicted RT. The incentivisation of RT was partly mediated by the CNV, though vigour was not. Moreover, the CNV mediated the drug’s effect on dampening incentives, suggesting that muscarinic receptors underlie the motivational influence on this preparatory activity. Taken together, these findings show that a muscarinic blocker impairs motivated action in healthy people, and that medial frontal preparatory neural activity mediates this for RT.