The influence of nucleus accumbens shell D1 and D2 neurons on outcome-specific Pavlovian instrumental transfer

eLife Assessment

This study provides novel and convincing evidence that both dopamine D1 and D2 expressing neurons in the nucleus accumbens shell are crucial for the expression of cue-guided action selection, a core component of decision-making. The research is systematic and rigorous in using optogenetic inhibition of either D1- or D2-expressing medium spiny neurons in the NAc shell to reveal attenuation of sensory-specific Pavlovian-Instrumental transfer, while largely sparing value-based decision on an instrumental task. The important findings in this report build on prior research and resolve some conflicts in the literature regarding decision-making.

https://doi.org/10.7554/eLife.107566.4.sa0

Significance of the findings:

Important: Findings that have theoretical or practical implications beyond a single subfield

Landmark
Fundamental
Important
Valuable
Useful

Strength of evidence:

Convincing: Appropriate and validated methodology in line with current state-of-the-art

Exceptional
Compelling
Convincing
Solid
Incomplete
Inadequate

During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife Assessments

Abstract
Introduction
Results
Discussion
Materials and methods
Appendix 1
Appendix 2
Data availability
References
Article and author information
Metrics

Abstract

The nucleus accumbens shell (NAc-S) and its projections to the ventral pallidum (VP) are thought to be critical for stimulus-based decisions. The NAc-S is predominantly composed of spiny projection neurons (SPNs) that express either the dopamine D1 (D1-SPNs) or the dopamine D2 receptor (D2-SPNs). Yet, the role of these two neuronal subpopulations and their inputs to the VP in stimulus-based decisions remains unknown. Here, we used optogenetics in female and male knock-in rats to selectively silence D1- or D2-SPNs and their projections to the VP at a time when the rats were required to use predictive stimuli to choose between two instrumental actions. Silencing either population of NAc-S SPNs disrupted choice. Silencing NAc-S D1-SPNs terminals in the VP also disrupted choice. However, choice was left intact by silencing NAc-S D2-SPNs terminals in the VP. Together, these findings provide novel insights into the cellular mechanisms and circuitry underlying stimulus-based decisions. We discuss how these insights are consistent with a recent model proposing that these decisions are controlled by an opioid-based memory system residing in the NAc-S.

Introduction

Our choices and decisions are often influenced by predictive signals available in the environment (Hollis, 1984). This influence is studied in the laboratory using the outcome-specific Pavlovian-instrumental transfer (PIT) task (Trapold and Overmier, 1972; Colwill and Rescorla, 1988; Holmes et al., 2010; Cartoni et al., 2016; Laurent and Balleine, 2021; Leung et al., 2024b), which comprises three stages. The first two stages are Pavlovian and instrumental conditioning, which can be administered in any order. In Pavlovian conditioning, two stimuli (S1 and S2; e.g., sounds or lights) are paired with two distinct and motivationally significant outcomes (O1 and O2; e.g., food outcomes). In instrumental conditioning, subjects are trained to perform two actions (A1 and A2), with each earning one of the two outcomes (i.e., A1 → O1 and A2 → O2). The final stage is the PIT test, which evaluates choice between the two actions in the presence (and absence) of the stimuli. Typically, each of the Pavlovian stimuli biases choice toward the action with which it shares an outcome: that is, S1: A1 > A2 and S2: A2 > A1.

At a neural level, PIT has been found to rely on interactions between a number of structures, including cortical (Keistler et al., 2015; Bradfield et al., 2015; Bradfield et al., 2018; Tensaouti et al., 2025; Ostlund and Balleine, 2007; Lichtenberg et al., 2021; Sias et al., 2021; Lichtenberg et al., 2017), amygdala (Corbit and Balleine, 2005; Morse et al., 2020; Sias et al., 2024; Lichtenberg et al., 2017; Shiflett and Balleine, 2010; Derman et al., 2020; Prévost et al., 2012), basal ganglia (Corbit and Balleine, 2011; Corbit et al., 2001; Corbit and Janak, 2010; Morse et al., 2020; Laurent et al., 2012; Bertran-Gonzalez et al., 2013; Laurent et al., 2014; Corbit et al., 2016; Leung et al., 2024a; Leung and Balleine, 2015; Leung and Balleine, 2013; Laurent et al., 2015; Prévost et al., 2012), midbrain (Corbit et al., 2007; Sias et al., 2024; Seitz et al., 2022), and thalamic (Ostlund and Balleine, 2008; Leung et al., 2024a) territories. Among these, the nucleus accumbens shell (NAc-S) stands out because, unlike the others, it does not itself contribute to learning the stimulus–outcome (S–O) or the action–outcome (A–O) associations formed during the Pavlovian and instrumental stages of the PIT task and appears only to mediate their interaction (Corbit and Balleine, 2011; Corbit et al., 2001; Morse et al., 2020; Laurent et al., 2012; Laurent et al., 2014). Therefore, it has been proposed that the NAc-S acts as a central hub integrating S–O and A–O information at the time of PIT to guide choice between actions in an outcome-selective manner (Shiflett and Balleine, 2010; Bertran-Gonzalez and Laurent, 2018; Laurent and Balleine, 2021). Accordingly, PIT is disrupted by manipulations undermining NAc-S function during the PIT test (Morse et al., 2020; Corbit et al., 2016; Laurent et al., 2012; Laurent et al., 2014; Laurent et al., 2015).

The NAc-S is predominantly composed of spiny projection neurons (SPNs), which can be classified into two distinct subpopulations depending on the dopamine receptor they express (Gerfen and Surmeier, 2011). One population harbors the dopamine D1 receptors (D1Rs; D1-SPNs), whereas the other population expresses the dopamine D2 receptors (D2Rs; D2-SPNs). To date, only one study has examined whether activity in both populations is recruited during PIT (Laurent et al., 2014), finding that PIT is associated with an increase in ERK1/2 phosphorylation in D1-SPNs but not D2-SPNs. Further, blockade of D1Rs in NAc-S during choice between actions disrupted PIT whereas D2Rs blockade had no effect. Although these findings indicate a major involvement of NAc-S D1-SPNs in PIT, caution must be exerted when drawing conclusions on the role of D2-SPNs. While D1Rs are exclusively expressed on NAc-S D1-SPNs, D2Rs are present on cholinergic interneurons (CINs) and local presynaptic dopamine terminals in addition to D2-SPNs (Gerfen and Surmeier, 2011). Additionally, research shows that ERK1/2 phosphorylation may not capture all transcriptional events that occur in D2-SPNs (Bertran-Gonzalez et al., 2008; Matamales et al., 2020). Lastly, pharmacological D2Rs blockade has been found to be quite ineffective at influencing D2-SPNs function (Tozzi et al., 2007). Thus, the relative contribution of NAc-S D1- and D2-SPNs in driving choice between actions during PIT remains unclear.

The present experiments, therefore, aimed to provide an unambiguous assessment of the roles played by the two main populations of NAc-S SPNs during choice between actions in an outcome-specific PIT task. This assessment was achieved through optogenetic silencing in two knock-in rat lines that express Cre recombinase in either D1- or D2-SPNs (Pettibone et al., 2019). We first used tract-tracing and ex vivo electrophysiology to confirm our capacity to selectively silence activity in NAc-S D1- or D2-SPNs. We then conducted two experiments that assessed outcome-specific PIT while D1- or D2-SPNs in the NAc-S were silenced during presentations of the predictive stimuli in the choice test. Finally, we assessed the degree to which the influence of the two SPNs populations on outcome-specific PIT depends on their projections to the ventral pallidum (VP). We focused on the VP for two main reasons: First, both NAc-S D1- and D2-SPNs send dense projections to this region (Lu et al., 1997; Kupchik et al., 2015); and, second, communication between the NAc-S and VP has previously been found to be critical for outcome-specific PIT (Leung and Balleine, 2013).

Results

Anterograde tracing and ex vivo cell recordings in D1-Cre and A2a-Cre rats

We first examined the connectivity of striatal D1- and D2-SPNs in transgenic rats expressing Cre recombinase in neurons harboring the dopamine D1 receptor (D1-Cre rats) or the adenosine A2a receptor (A2a-Cre rats) (Pettibone et al., 2019). A2a receptors are predominantly expressed on striatal D2-SPNs (Schiffmann et al., 2007), thereby enabling selective targeting of these SPNs in the A2a-Cre line. D1-Cre (n = 1 female and 1 male) and A2a-Cre rats (n = 1 female and 1 male) were unilaterally infused with a Cre-dependent eYFP virus in the NAc-S (Figure 1A). To validate expression in these rat lines, other D1-Cre (n = 4 males) and A2a-Cre rats (n = 3 males) were given a unilateral infusion in the dorsal striatum (DS) with the same virus. In both transgenic strains, viral expression was largely restricted to local SPNs in the two targeted regions: ~95% of eYFP-positive neurons expressed DARPP-32 (D32 | eYFP; Figure 1C–F). Viral expression was consistent with D1- and D2-SPNs representing two neuronal populations similar in size: in both transgenic strains and targeted regions ~40% of DARPP-32-positive neurons expressed eYFP (eYFP | D32; Figure 1C–F). DS infusion in D1-Cre rats resulted in eYFP-positive terminals (Figure 1B, C) in the substantia nigra pars reticulata (SNr) and the globus pallidus externus (GPe). By contrast, the same infusion in A2a-Cre rats only produced eYFP-positive terminals in the GPe (Figure 1B, D). NAc-S infusion in both D1-Cre and A2a-Cre rats resulted in eYFP-positive terminals in the VP (Figure 1E, F). By contrast, eYFP-positive terminals in the lateral hypothalamus (Figure 1E, F) were only observed in D1-Cre rats. Overall, these results are in line with current knowledge about the cellular composition of the striatum (Gerfen and Surmeier, 2011; Bertran-Gonzalez et al., 2010) and well-established projection patterns of striatal D1- and D2-SPNs (Gerfen and Surmeier, 2011; Lu et al., 1997; Kupchik et al., 2015; O’Connor et al., 2015). We were therefore confident in the capacity of the two rat transgenic lines to reveal the function of NAc-S D1- and D2-SPNs.

Figure 1

Download asset Open asset

Anterograde tracing in D1-Cre and A2a-Cre rats.

(A) D1-Cre or A2a-Cre rats were unilaterally infused in the dorsal striatum (DS; D1-Cre: n = 4 males; A2A-Cre: n = 3 males) or nucleus accumbens shell (NAc-S; D1-Cre: n = 1 female and 1 male; A2A-Cre: n = 1 female and 1 male) with DIO-eYFP. (B) Sagittal micrographs obtained in D1-Cre (top) and A2A-Cre (bottom) rats following viral infusion in the DS. (C) DIO-eYFP infusion in the DS of D1-Cre rats. Micrographs show eYFP expression in the DS, D32 staining, and co-labeling (eYFP + D32) in the DS. They also show that DS D1-SPNs project to the substantia nigra pars reticulata (SNr) and the globus pallidus externus (GPe). Viral expression was restricted to putative SPNs (D32 | eYFP), with ~40% of SPNs expressing eYFP (eYFP | D32). (D) DIO-eYFP infusion in the DS of A2a-Cre rats. Micrographs show eYFP expression in the DS, D32 staining, and co-labeling (eYFP + D32) in the DS. They also show that DS D2-SPNs project to the SNr but not the GPe. Viral expression was restricted to putative SPNs (D32 | eYFP), with ~40% of SPNs expressing eYFP (eYFP | D32). (E) DIO-eYFP infusion in the NAc-S of D1-Cre rats. Micrographs show eYFP expression in the NAc-S, D32 staining, and co-labeling (eYFP + D32) in the DS. They also show that NAc-S D1-SPNs project to the ventral pallidum (VP) and the lateral hypothalamus (LH). Viral expression was restricted to putative SPNs (D32 | eYFP), with ~41% of SPNs expressing eYFP (eYFP | D32). (F) DIO-eYFP infusion in the NAc-S of A2a-Cre rats. Micrographs show eYFP expression in the NAc-S, D32 staining, and co-labeling (eYFP + D32) in the DS. They also show that NAc-S D2-SPNs project to the VP but not the LH. Viral expression was restricted to putative SPNs (D32 | eYFP), with ~40% of SPNs expressing eYFP (eYFP | D32).

Next, we used ex vivo electrophysiological recordings to assess our capacity to silence activity of NAc-S D1- and D2-SPNs. D1-Cre and A2a-Cre rats were bilaterally infused in the NAc-S with either a null Cre-dependent eYFP virus (D1-Cre: 2 females; A2a-Cre: 2 females) or a Cre-dependent inhibitory halorhodospin (eNpHR3.0; D1-Cre: 2 females; A2a-Cre: 2 females) virus (Figure 2A, B). LED light illumination (625 nm, continuous) of either NAc-S D1-SPNs (Figure 2A) or NAc-S D2-SPNs (Figure 2B) transfected with the eYFP virus had no effect on action potential firing (Baseline vs. LED ON; D1-Cre: p = 1, 5 cells; A2A-Cre: p = 0.35, 8 cells). By contrast, the same illumination onto NAc-S D1- (Figure 2A) or D2-SPNs (Figure 2B) transfected with eNpHR3.0 suppressed action potential firing (D1-Cre: F_(1,6) = 16.82; η² = 0.74, p < 0.001, 7 cells; A2A-Cre: F_(1,5) = 135.00; η² = 0.96, p < 0.001, 6 cells). Importantly, there was no difference in firing prior to or following LED light illumination, indicating that the optical manipulation did not alter D1- or D2-SPNs physiology. Together, these results validate the efficacy of eNpHR3.0 to specifically silence D1- and D2-SPNs activity at the time of LED light delivery.

Figure 2

Download asset Open asset

Ex vivo cell recordings in D1-Cre and A2a-Cre rats.

(A) D1-Cre was bilaterally infused in the NAc-S with DIO-eYFP (black; 2 females) or DIO-eNpHR3.0 (blue; eNpHR3.0; 2 females). The representative raw traces of cell-attached recordings are those of transfected neurons that were depolarized to elicit action potentials by injecting a brief positive current step (+150 pA, 200 ms duration, 0.5 Hz). 625 nm LED illumination (orange bar, continuous wave, 2 mW) had no effect in eYFP transfected neurons (black; 5 cells) but it inhibited action potential in eNpHR3.0 transfected neurons (blue; 7 cells). The grouped data for recordings include overlapping data points. (B) A2a-Cre was bilaterally infused in the NAc-S with DIO-eYFP (black; 2 females) or DIO-eNpHR3.0 (red; 2 females). The representative raw traces of cell-attached recordings are those of transfected neurons that were depolarized to elicit action potentials by injecting a brief positive current step (+150 pA, 200ms duration, 0.5 Hz). 625 nm LED illumination (orange bar, continuous wave, 2 mW) had no effect in eYFP transfected neurons (black; 8 cells) but it inhibited action potential in eNpHR3.0 transfected neurons (red; 6 cells). The grouped data for recordings include overlapping data points.

NAc-S D1-SPNs mediate outcome-specific PIT

This experiment examined the effects of silencing NAc-S D1-SPNs on outcome-specific PIT. D1-Cre rats were bilaterally infused in the NAc-S with either the null Cre-dependent eYFP virus (eYFP: 5 females and 5 males) or the Cre-dependent halorhodopsin (eNpHR3.0: 5 females and 5 males) virus and were implanted with fiber-optic cannulas above the NAc-S (Figure 3A, Figure 3—figure supplement 1A, B). Using the outcome-specific PIT protocol depicted in Figure 3B, the rats first received Pavlovian conditioning during which two stimuli (S1 and S2; white noise or clicker) were paired with two food outcomes (O1 and O2; grain pellets or sucrose solution). Next, the rats underwent instrumental conditioning and learned that pressing one lever earned one of the food outcomes whereas pressing a second lever delivered the other outcome. Finally, an outcome-specific PIT test was administered and assessed choice between the two lever press actions in the presence or absence of each stimulus. Critically, half of the trials for each stimulus took place under LED light stimulation (625 nm, continuous; ON trials) whereas the other half of the trials was conducted without the stimulation (OFF trials). Thus, NAc-S D1-SPNs were silenced on half of the trials in rats infused with eNpHR3.0.

Figure 3 with 1 supplement see all

Download asset Open asset

NAc-S D1-SPNs mediate outcome-specific Pavlovian instrumental transfer (PIT).

(A) D1-Cre rats were bilaterally infused in the NAc-S with DIO-eYFP (black; 5 females and 5 males) or DIO-eNpHR3.0 (blue; 5 females and 5 males). Fiber-optic cannulas were implanted above the NAc-S to provide 625 nm LED illumination (continuous). (B) Schematic representation of the behavioral design; S1 and S2: noise and clicker stimuli (counterbalanced); O1 and O2: grain pellets and sucrose solution (counterbalanced); A1 and A2: left and right lever press (counterbalanced). At the test, S1 and S2 were presented four times each, in a pseudorandom order. Half of the trials for each stimulus was conducted under 625 nm LED illumination (ON; continuous wave; ~10 mW) whereas the LED remained inactivated during the other half of the trials (OFF). ON/OFF trials were counterbalanced. (C) Outcome-specific PIT test: net lever presses when the stimuli predicted the same outcome as the action (Same) or when the stimuli predicted the different outcome (Different). Lever presses are shown for each group in trials conducted under 625 nm LED illumination (ON) and in trials without illumination (OFF). Data are shown as mean ± SEM. Panel C includes individual data points for female (filled circle) and male (open circle) rats. Asterisks denote significant effect (*p < 0.05; **p < 0.01; ***p < 0.001; *n.s*., nonsignificant).

Pavlovian and instrumental conditioning proceeded as expected (Figure 3—figure supplement 1C, D). The data of most interest are those from the outcome-specific PIT test and are presented in Figure 3C (see also Figure 3—figure supplement 1E, F and Appendix 1). They are plotted as the mean number of lever presses per minute when the stimulus predicted the same outcome as the action (Same) and when the stimulus predicted a different outcome to the action (Different). Thus, A1 was identified as ‘Same’ and A2 as ‘Different’ in the presence of S1. Conversely, A2 was identified as ‘Same’ and A1 as ‘Different’ in the presence of S2. Further, baseline responding (number of lever presses per minute on the two actions during the 2 min preceding each stimulus presentation) was subtracted from the rates of responding during the stimuli since it did not differ between groups (Group – eYFP vs. eNpHR3.0: p = 0.42). This approach allowed us to focus on the net effect of the stimuli on choice performance.

Silencing NAc-S D1-SPNs eliminated outcome-specific PIT. Overall lever press rates were similar between groups (Group: p = 0.48), LED light condition (LED – ON vs. OFF: p = 0.14), and the two factors did not interact (Group x Light: p = 0.07). However, lever press rates were higher on the action earning the same outcome as the stimuli compared to the action earning the different outcome (Lever – Same vs. Different: F_(1,18) = 50.06; η² = 0.74; p < 0.001), regardless of group (Group x Lever: p = 0.77). There was no Lever by LED light condition interaction (Lever x LED: p = 0.72) but critically, there was an interaction between Group, LED light condition, and Lever during the presentation of the predictive stimuli (Group x LED x Lever: F_(1,18) = 7.19; η² = 0.29; p = 0.02). Follow-up analyses revealed that rats in the eYFP group displayed intact outcome-specific PIT whether the LED light was OFF (F_(1,9) = 6.14; η² = 0.67; p < 0.05) or ON (F_(1,9) = 39.46; η² = 0.81; p < 0.001). By contrast, rats in the eNpHR3.0 group displayed outcome-specific PIT when the light was OFF (F_(1,9) = 12.38; η² = 0.58; p < 0.01) but not when it was ON (p = 0.67). Together, these results indicate that activity in NAc-S D1-SPNs is necessary for the outcome-specific influence exerted by predictive stimuli on choice between actions. This is consistent with previous pharmacological work showing that NAc-S D1Rs blockade removes specific PIT (Laurent et al., 2014). Importantly, the impairment produced by silencing NAc-S D1-SPNs was restricted to the influence of predictive stimuli on choice between actions. In the same rats, we found that this silencing had no effect on value-based choice (Figure 3—figure supplement 1G and Appendix 2) using an outcome devaluation procedure: all rats performed the action that previously earned a valuable outcome more than the action that earned a devalued outcome. This agrees with studies reporting that NAc-S lesion or inactivation spares value-based choice (Corbit and Balleine, 2011; Corbit et al., 2001; Morse et al., 2020; Laurent et al., 2012; Laurent et al., 2014).

NAc-S D2-SPNs mediate outcome-specific PIT

The removal of outcome-specific PIT under silencing of NAc-S D1-SPNs reproduced the impairment previously observed with blockade of NAc-S D1Rs (Laurent et al., 2014). Interestingly, the latter pharmacological study did not find any effect following NAc-S infusion of a D2Rs antagonist. To evaluate this finding, the present experiment examined the effects of silencing NAc-S D2-SPNs on outcome-specific PIT. A2a-Cre rats were bilaterally infused in the NAc-S with either the null Cre-dependent eYFP virus (eYFP: 4 females and 4 males) or the Cre-dependent halorhodospin (eNpHR3.0: 4 females and 2 males) virus and were implanted with fiber-optic cannulas above the NAc-S (Figure 4A, Figure 4—figure supplement 1A, B). The rats then received the outcome-specific PIT protocol previously described (Figure 3B).

Figure 4 with 1 supplement see all

Download asset Open asset

NAc-S D2-SPNs mediate for outcome-specific Pavlovian instrumental transfer (PIT).

(A) A2a-Cre rats were bilaterally infused in the NAc-S with DIO-eYFP (black; 4 females and 4 males) or DIO-eNpHR3.0 (red; 4 females and 2 males). Fiber-optic cannulas were implanted above the NAc-S to provide 625 nm LED illumination (continuous). (B) Outcome-specific PIT test: net lever presses when the stimuli predicted the same outcome as the action (Same) or when the stimuli predicted the different outcome (Different). Lever presses are shown for each group in trials conducted under 625 nm LED illumination (ON) and in trials without illumination (OFF). Data are shown as mean ± SEM. Panel B includes individual data points for female (filled circle) and male (open circle) rats. Asterisks denote significant effect (*p < 0.05; **p < 0.01; *n.s*., nonsignificant).

Pavlovian and instrumental conditioning went as expected (Figure 4—figure supplement 1C, D). The data from the outcome-specific PIT test are presented in Figure 4B, again with baseline subtracted since it did not differ between groups (Group: p = 0.89; see also Figure 4—figure supplement 1E, F). Silencing NAc-S D2-SPNs activity eliminated outcome-specific PIT. Overall lever press rates during the test were similar between groups (Group: p = 0.86), LED light condition (LED: p = 0.21), and the two factors did not interact (Group x LED: p = 0.70). Lever press rates were higher on the action earning the same outcome as the stimuli compared to the action earning the different outcome (Lever: F_(1,12) = 22.77; η² = 0.66; p < 0.001), regardless of group (Group x Lever: p = 0.46). There was no Lever by LED light condition interaction (Lever x LED: p = 0.15) but critically, there was an interaction between group, LED light condition, and Lever during the presentation of the predictive stimuli (Group LED x Lever: F_(1,12) = 8.73; η² = 0.42; p < 0.01). Follow-up analyses revealed that control eYFP rats expressed outcome-specific PIT whether the LED light was OFF (F_(1,7) = 16.25; η² = 0.70; p < 0.01) or ON (F_(1,7) = 20.58; η² = 0.75; p < 0.01). By contrast, rats in the eNpHR3.0 group displayed outcome-specific PIT when the light was OFF (F_(1,5) = 6.94; η² = 0.58; p < 0.05) but not ON (p = 0.34). Thus, these results show for the first time that activity in NAc-S D2-SPNs is necessary for the outcome-specific influence exerted by predictive stimuli on choice between actions.

The present finding contrasts with our previous observation that outcome-specific PIT remains unaffected by NAc-S infusion of a D2Rs antagonist (Laurent et al., 2014). However, the capacity of such pharmacological manipulation to reveal the function of D2-SPNs is unclear. For instance, a D2Rs antagonist could be expected to enhance D2-SPN activity, since D2Rs are Gi coupled and so their activation should reduce adenylyl cyclase activity. Further, D2Rs are not exclusively expressed on this cell population (Gerfen and Surmeier, 2011), as they can be found on local CINs and presynaptic dopamine terminals. Finally, previous work found that pharmacological treatment targeting D2Rs can leave D2-SPNs activity unaffected (Tozzi et al., 2007). These issues are overcome by the present optical manipulation in A2a-Cre rats, and we are therefore confident that NAc-S D2-SPNs activity is in fact critical for outcome-specific PIT. Importantly, we also found that the impairment produced by silencing NAc-S D2-SPNs was restricted to the influence of predictive stimuli on choice between actions. In the same rats, this silencing had no effect on value-based choice (Figure 4—figure supplement 1G).

NAc-S D1-SPNs projections to the VP mediate outcome-specific PIT

Previous work indicates that communication between the NAc-S and VP is critical for outcome-specific PIT (Leung and Balleine, 2013). Consistent with the literature (Lu et al., 1997; Kupchik et al., 2015), we observed that both NAc-S D1- and D2-SPNs densely innervate the VP (Figure 1E, F). Since we found that activity in both neuron populations mediates the outcome-specific PIT effect, we investigated whether projections from each population contribute to the effect. We first focused on those originating from NAc-S D1-SPNs. D1-Cre rats were bilaterally infused in the NAc-S with either the null Cre-dependent eYFP virus (eYFP: 3 females and 5 males) or the Cre-dependent halorhodopsin (eNpHR3.0: 6 females and 6 males) virus and were implanted with fiber-optic cannulas above the VP (Figure 5A, Figure 5—figure supplement 1A, B). The rats then received the behavioral protocol previously described (Figure 3B).

Figure 5 with 1 supplement see all

Download asset Open asset

NAc-S D1-SPNs projections to the ventral pallidum (VP) mediate outcome-specific Pavlovian instrumental transfer (PIT).

(A) D1-Cre rats were bilaterally infused in the NAc-S with DIO-eYFP (black; 3 females and 5 males) or DIO-eNpHR3.0 (blue; 6 females and 6 males). Fiber-optic cannulas were implanted above the VP to provide 625 nm LED illumination (continuous). (B) Outcome-specific PIT test: net lever presses when the stimuli predicted the same outcome as the action (Same) or when the stimuli predicted the different outcome (Different). Lever presses are shown for each group in trials conducted under 625 nm LED illumination (ON) and in trials without illumination (OFF). Data are shown as mean ± SEM. Panel B includes individual data points for female (filled circle) and male (open circle) rats. Asterisks denote significant effect (**p < 0.01; ***p < 0.01; *n.s*., nonsignificant).

Pavlovian and instrumental conditioning went as expected (Figure 5—figure supplement 1C, D). The data from the outcome-specific PIT test are presented in Figure 5B in the manner described previously since baseline responding did not differ between groups (Group: p = 0.34; see also Figure 5—figure supplement 1E, F). Silencing NAc-S D1-SPNs projections to the VP eliminated outcome-specific PIT. Overall lever press rates during the test were similar between groups (Group: p = 0.11). LED activation reduced these rates (LED: F_(1,18) = 7.58; η² = 0.30; p < 0.05) but this reduction depended on the group considered (Group x LED: F_(1,18) = 9.78; η² = 0.35; p < 0.01). Lever press rates were higher on the action earning the same outcome as the stimuli compared to the action earning the different outcome (Lever: F_(1,18) = 35.73; η² = 0.67; p < 0.001), regardless of group (Group x Lever: p = 0.19). There was a Lever by LED light condition interaction (Lever x LED: F_(1,18) = 7.56; η² = 0.29; p < 0.05) and critically, there was an interaction between Group, LED light condition, and Lever during the presentation of the predictive stimuli (Group LED x Lever: F_(1,18) = 5.01; η² = 0.22; p < 0.05). Follow-up analyses revealed that control eYFP rats expressed outcome-specific PIT whether the LED light was OFF (F_(1,7) = 26.63; η² = 0.79; p < 0.001) or ON (F_(1,7) = 12.31; η² = 0.64; p < 0.01). By contrast, rats in the eNpHR3.0 group displayed outcome-specific PIT when the light was OFF (F_(1,11) = 26.53; η² = 0.71; p < 0.001) but not ON (p = 0.48). Thus, these results show for the first time that NAc-S D1-SPNs mediate outcome-specific PIT via their projections to the VP. Importantly, we also found that the impairment produced by silencing NAc-S D1-SPNs terminals in the VP was restricted to the influence of predictive stimuli on choice between actions. In the same rats, this silencing had no effect on value-based choice (Figure 5—figure supplement 1G).

NAc-S D2-SPNs projections to the VP do not mediate outcome-specific PIT

Since activity in NAc-S D2-SPNs is required for outcome-specific PIT (Figure 3) and these neurons innervate the VP (Figure 1F), we assessed whether D2-SPNs projections to the VP are involved in outcome-specific PIT. A2a-Cre rats were bilaterally infused in the NAc-S with either the null Cre-dependent eYFP virus (eYFP: 3 females and 5 males) or the Cre-dependent halorhodopsin (eNpHR3.0: 2 females and 5 males) virus and were implanted with fiber-optic cannulas above the VP (Figure 6A, Figure 6—figure supplement 1A, B). The rats then received the behavioral protocol previously described (Figure 3B).

Figure 6 with 1 supplement see all

Download asset Open asset

NAc-S D2-SPNs projections to the ventral pallidum (VP) do not mediate outcome-specific Pavlovian instrumental transfer (PIT).

(A) A2a-Cre rats were bilaterally infused in the NAc-S with DIO-eYFP (black; 3 females and 5 males) or DIO-eNpHR3.0 (red; 2 females and 5 males). Fiber-optic cannulas were implanted above the VP to provide 625 nm LED illumination (continuous). (B) Outcome-specific PIT test: net lever presses when the stimuli predicted the same outcome as the action (Same) or when the stimuli predicted the different outcome (Different). Lever presses are shown for each group in trials conducted under 625 nm LED illumination (ON) and in trials without illumination (OFF). Data are shown as mean ± SEM. Panel B includes individual data points for female (filled circle) and male (open circle) rats.

Pavlovian and instrumental conditioning went as expected (Figure 6—figure supplement 1C, D). The data from the outcome-specific PIT test are presented in Figure 6B as previously since baseline responding did not differ between groups (Group: P=0.90; see also Figure 6—figure supplement 1E, F). Silencing NAc-S D2-SPNs projections to the VP had no effect on outcome-specific PIT. Overall lever press rates were similar between groups (Group: p = 0.56), LED light condition (LED: p = 0.75), and the two factors did not interact (Group x LED: p = 0.73). The rates were higher on the action earning the same outcome as the stimuli relative to the action earning the different outcome (Lever: F_(1,13) = 22.71; η² = 0.64; p < 0.001), irrespective of group (Group x Lever: p = 0.55). There was no interaction between Group, LED light condition, and Lever (Group x Light x Lever: p = 0.59). Thus, NAc-S D2-SPNs do not appear to mediate outcome-specific PIT via their projections to the VP. Likewise, we found no evidence that these projections influence value-based choice (Figure 6—figure supplement 1G).

Discussion

The present experiments investigated the role of NAc-S D1- and D2-SPNs in choice between actions using an outcome-specific PIT task. First, they combined anatomical tract-tracing and ex vivo electrophysiology to demonstrate that two recently developed knock-in rat lines (Pettibone et al., 2019) enable selective silencing of activity in either population of NAc-S SPNs. Consistent with previous findings (Laurent et al., 2014), they found that NAc-S D1-SPNs are necessary for PIT since their silencing eliminated outcome-specific choice. Additionally, they showed that this choice is also abolished by silencing NAc-S D2-SPNs, providing the first evidence that both SPN populations contribute to PIT expression. Finally, the last two experiments revealed for the first time that outcome-specific choice is likely to involve downstream regulation of VP function by NAc-S D1-SPNs but not NAc-S D2-SPNs. Together, these findings offer novel insights into the cellular mechanisms that govern the outcome-specific influence of predictive stimuli on choice between actions.

Convincing evidence had been provided for a significant role of NAc-S D1-SPNs in PIT (Laurent et al., 2014). The evidence was based on the observations that PIT is associated with an increase in ERK1/2 phosphorylation within these SPNs and that pharmacological blockade of NAc-S D1Rs eliminates outcome-specific choice between actions. These findings align with the present study, showing that PIT is eliminated when NAc-S D1-SPNs are silenced during presentations of the predictive stimuli at the time of choice. Importantly, the effect of NAc-S D1-SPNs silencing was specific to the assessment of choice between actions in the presence of predictive stimuli. For instance, the ability of stimuli to elicit approach behaviors toward the magazine remained intact despite NAc-S D1-SPNs silencing, indicating that the silencing effect was not mediated by modulating potential competition between Pavlovian and instrumental responses (Lovibond, 1981; Holmes et al., 2010). Furthermore, NAc-S D1-SPNs silencing preserved the capacity to select between actions based on the value of their respective outcomes. This selectivity in the impairment produced by NAc-S D1-SPNs is consistent with previous findings demonstrating that the NAc-S does not contribute to learning and retrieving the S–O and A–O associations produced by Pavlovian and instrumental conditioning (Corbit and Balleine, 2011; Corbit et al., 2001; Morse et al., 2020; Laurent et al., 2012; Laurent et al., 2014).

Previous research failed to provide any evidence supporting the involvement of NAc-S D2-SPNs in outcome-specific choice (Laurent et al., 2014). Specifically, PIT was found to produce no change in ERK1/2 phosphorylation in these neurons and was left intact by NAc-S D2Rs blockade. However, these assessments have significant limitations, including the widespread distribution of striatal D2Rs (Gerfen and Surmeier, 2011) and the inability of ERK1/2 phosphorylation or D2Rs antagonism to capture or distort activity in NAc-S D2-SPNs (Bertran-Gonzalez et al., 2008; Tozzi et al., 2007). Further, pharmacological blockade of D2Rs would be expected to enhance D2-SPNs activity, since D2Rs are Gi coupled and so their activation reduces adenylyl cyclase activity. The present study addressed these limitations by implementing optogenetic silencing in a knock-in rat line giving access to D2-SPNs through the targeting of the A2a receptor that is predominantly expressed in these neurons in the NAc-S (Schiffmann et al., 2007). We found that PIT is eliminated when NAc-S D2-SPNs are silenced during presentations of the predictive stimuli at the time of choice. Moreover, the same silencing left intact the ability of the predictive stimuli to elicit magazine approaches and preserved the capacity to choose between actions based on the value of their outcomes. Thus, we conclude that both NAc-S D1- and D2-SPNs are indispensable to outcome-specific PIT. Nevertheless, it will be important for future studies to confirm NAc-S D2-SPNs involvement in PIT using alternative approaches, such as chemogenetic or immunohistochemical assessments employing a marker capable of capturing the function of this neuronal population (Matamales et al., 2020).

Consistent with the literature (Lu et al., 1997; Kupchik et al., 2015), we found that both NAc-S D1- and D2-SPNs send dense projections to the VP, which is often described as the major efferent of the nucleus accumbens (Heimer et al., 1991; Zahm and Heimer, 1990). Importantly, the VP has been shown to play a crucial role in outcome-specific choice (Leung and Balleine, 2013; Leung and Balleine, 2015; Leung et al., 2024a). VP neurons exhibit enhanced cFos activation following an outcome-specific PIT test, and their levels of activation correlate with PIT performance. Moreover, pharmacological inactivation of the VP eliminates outcome-specific choice, and the same elimination is observed when the NAc-S is disconnected from the VP. We therefore tested for a role of NAc-S D1- and D2-SPNs projections to the VP during choice between actions in our outcome-specific PIT task. We found that PIT was abolished by silencing NAc-S D1-SPNs terminals but not by silencing NAc-S D2-SPNs terminals. Neither silencing affected magazine approaches elicited by the predictive stimuli nor choice between actions based on the value of their outcomes. It is important to note that our study does not provide any evidence about the efficacy of NAc-S D2-SPNs terminals silencing in the VP, and future experiments should aim to provide such evidence or adopt other methods to study this pathway. This could involve using opsins with enhanced axonal silencing efficacy (Mahn et al., 2021; Copits et al., 2021), or employing alternative methods known to disrupt neurotransmitter release such as chemogenetics (Rost et al., 2022). Yet, the same silencing for NAc-S D1-SPNs terminals resulted in PIT elimination. Therefore, it seems reasonable to conclude that NAc-S D1-SPNs, but not NAc-S D2-SPNs, projections to the VP are indispensable to observe outcome-specific choice in a PIT task. Since NAc-S D2-SPNs appear to exclusively project to the VP (Humphries and Prescott, 2010; Kupchik et al., 2015), our findings suggest that these SPNs mediate outcome-specific PIT by locally regulating NAc-S function. By contrast, it remains to be determined whether NAc-S D1-SPNs also coordinate outcome-specific PIT via their projections to the lateral hypothalamus (O’Connor et al., 2015) and/or the ventral tegmental area (VTA) (Humphries and Prescott, 2010), in addition to the VP.

The present findings are consistent with a recent model proposing that outcome-specific choice in PIT relies on an opioid-based memory residing in the NAc-S (for full description and illustration of the model see Leung et al., 2024b; Laurent and Balleine, 2021; Morse et al., 2020). In this model, as the basolateral amygdala encodes and stores outcome-specific S–O associations across Pavlovian conditioning, it also drives the formation of the NAc-S memory, which involves the durable accumulation of delta-opioid receptors (DOPRs) on the somatic membrane of local CINs (Bertran-Gonzalez et al., 2013; Morse et al., 2020). Although this memory is not necessary for Pavlovian conditioning per se, its expression is later required to enable the predictive stimuli to guide outcome-specific choice during the PIT test (Morse et al., 2020). The model specifically proposes that memory expression is controlled by fluctuations in glutamatergic release from cortical inputs and dopamine release from projections originating from the VTA, as this brain region has been found to be important for outcome-specific PIT (Corbit et al., 2007; Sias et al., 2024; Seitz et al., 2022; Leung and Balleine, 2015). One consequence is to activate NAc-S D2-SPNs and enkephalin discharge by these neurons. Enkephalin then binds onto DOPRs that had accumulated on local CINs and dampens acetylcholine secretion from the interneurons. The sudden drop in NAc-S acetylcholine frees D1-SPNs from the inhibitory tone imposed by acetylcholine occupancy of muscarinic M4 receptors that are exclusively found on these SPNs (Tayebati et al., 2004; Lobo et al., 2006; Guo et al., 2010; Jeon et al., 2010). The ultimate consequence is to promote NAc-S D1-SPNs function, including the coordination of outcome-specific choice in PIT by regulating activity in downstream brain regions such as the VP. The model predicts all the findings presented here. NAc-S D2-SPNs silencing eliminates PIT at the time of choice by preventing enkephalin release and thereby expression of the DOPR-based memory. The main consequence of this expression is precluded by NAc-S D1-SPNs silencing at the time of choice, while silencing the terminals of these SPNs in the VP squashes their ability to regulate the activity of this brain region to coordinate outcome-specific choice in PIT. Thus, one fundamental implication of the present findings is to strengthen the proposal that an opioid-based memory system in the NAc-S enables outcome-specific choice between actions.

In summary, the present experiments found that the two main populations of SPNs in the NAc-S are indispensable for outcome-specific choice between actions in a PIT task. They also revealed that NAc-S D1-SPNs coordinate this choice by downstream regulation of VP activity. By contrast, NAc-S D2-SPNs function in PIT appears to be restricted to modulating local activity. These findings provide novel insights into the cellular mechanisms and circuitry underlying the outcome-specific influence of predictive stimuli on choice between actions and are consistent with a recent model proposing that this influence is mediated by an opioid-based memory system in the NAc-S. Beyond these mechanistic considerations, the findings offer an opportunity to gain novel insights about various disorders in which the outcome-specific influence of predictive stimuli over our choices and decisions is dysfunctional. These disorders include depression (Geurts et al., 2013; Nord et al., 2018), anxiety disorders (Quail et al., 2017; Krypotos and Engelhard, 2020), substance-use disorders (Heinz et al., 2019; Hogarth et al., 2019; Hogarth et al., 2019; Steins-Loeber et al., 2020; Garbusow et al., 2016; Garbusow et al., 2019; Hogarth et al., 2019), gambling disorders (Genauck et al., 2019), anorexia nervosa (Vogel et al., 2020), and obesity (Watson et al., 2014; Lehner et al., 2017; Meemken and Horstmann, 2019).

Materials and methods

Subjects

135 rats from two genetically modified knock-in lines were used and obtained from the breeding facility at the University of New South Wales (Sydney, Australia). D1-Cre rats (Rat Research & Resource Center, Columbia, MO, USA; LE-Drd1^{em1(iCre)Berke}, RRRC#: 00856) expressed Cre recombinase in neurons expressing the dopamine D1 receptors (D1R). A2a-Cre rats (Rat Research & Resource Center, Columbia, MO, USA; LE-Adora2a^{em1(iCre)Berke}, RRRC#: 00857) expressed Cre recombinase in neurons expressing the adenosine A2A receptors (A2R). All rats were heterozygous and generated by crossing a heterozygous male with a Long-Evans wild-type female obtained from a colony maintained by the University of New South Wales (founding animals were sourced from Envigo/Inotiv, Blue Spruce outbred; HsdBlu:LE). Genotyping was completed by sending ear clippings to Transnetyx (Cordova, TN, USA). The rats were at least 8 weeks old at the beginning of each experiment. Efforts were made to allocate an equal number of female and male rats in each group. However, these efforts were hampered by exclusions following post-mortem assessments of viral spread and fiber-optic cannula placement. The final number of female and male rats did not provide sufficient power to analyze an effect of sex (our previous work found no influence of sex on PIT or value-based choice; Burton et al., 2024). Therefore, all test data present individual performance for female and male rats. Rats were housed in transparent plastic boxes with their littermates (up to four rats per box) throughout, and in a climate-controlled colony room maintained on a 12-hr light–dark cycle (lights on at 7:00 am). Behavioral procedures were conducted during the light phase (8:00 am to 6:00 pm). Water and standard lab chow were available ad libitum prior to the start of each experiment. Rats were food restricted to maintain them at ~90% of their ad libitum body weight during the behavioral protocols. Food restriction was initiated 3–5 days prior to the start of the protocols and was maintained throughout all training and testing phases. Rats received a daily food amount, which was adjusted based on body weight measurements recorded every 2 days. This study was performed in strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the National Health and Medical Research Council in Australia. All of the animals were handled according to approved Animal Care and Ethics Committee (ACEC) protocols of the University of New South Wales. The experimental protocols were approved by the UNSW ACEC (Permit Number: 20/35A). All surgery was performed under isoflurane anesthesia, and every effort was made to minimize suffering.

Share this article

Cite this article

Anterograde tracing in D1-Cre and A2a-Cre rats.

Ex vivo cell recordings in D1-Cre and A2a-Cre rats.

NAc-S D1-SPNs mediate outcome-specific Pavlovian instrumental transfer (PIT).

NAc-S D2-SPNs mediate for outcome-specific Pavlovian instrumental transfer (PIT).

NAc-S D1-SPNs projections to the ventral pallidum (VP) mediate outcome-specific Pavlovian instrumental transfer (PIT).

NAc-S D2-SPNs projections to the ventral pallidum (VP) do not mediate outcome-specific Pavlovian instrumental transfer (PIT).

Author details

Octavia Soegyono

Contribution

Competing interests

Elise Pepin

Contribution

Competing interests

Beatrice K Leung

Contribution

Competing interests

Billy Chieng

Contribution

Competing interests

Bernard W Balleine

Contribution

Competing interests

Vincent Laurent

Contribution

For correspondence

Competing interests

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism