The dopamine (DA) hypothesis posits the increase of mesolimbic dopamine levels as a defining commonality of addictive drugs, initially causing reinforcement, eventually leading to compulsive consumption. While much experimental evidence from psychostimulants supports this hypothesis, it has been challenged for opioid reinforcement. Here, we monitor genetically encoded DA and calcium indicators as well as cFos in mice to reveal that heroin activates DA neurons located in the medial part of the VTA, preferentially projecting to the medial shell of the nucleus accumbens (NAc). Chemogenetic and optogenetic manipulations of VTA DA or GABA neurons establish a causal link to heroin reinforcement. Inhibition of DA neurons blocked heroin self-administration, while heroin inhibited optogenetic self-stimulation of DA neurons. Likewise, heroin occluded the self-inhibition of VTA GABA neurons. Together, these experiments support a model of disinhibition of a subset of VTA DA neurons in opioid reinforcement.https://doi.org/10.7554/eLife.39945.001
The DA hypothesis of drug reinforcement is rooted in the observation that electrical activation of the medial forebrain bundle leads to repetitive action (Olds and Milner, 1954). Rats willingly self-stimulate brain regions populated by DA neurons or receiving inputs from DA neurons. Moreover, pharmacological blockade of DA receptors impairs the reinforcing properties of psychostimulants in both rats (Maldonado et al., 1993; McGregor and Roberts, 1993; Roberts et al., 1977) and primates (Bergman et al., 1989; Johanson and Schuster, 1975). Several microdialysis and voltammetry studies demonstrated the increase of DA in the NAc shell as a common feature of addictive drugs, including opioids (Aragona et al., 2008; Di Chiara and Imperato, 1988; Pontieri et al., 1995; Stuber et al., 2005). Furthermore, electrolytic lesions of the VTA to NAc pathway decreased reinforcement during intravenous self-administration of morphine and cocaine under a progressive ratio schedule (Suto et al., 2011).
The DA hypothesis has also received support from molecular investigations. Indeed, the reinforcing effects of opioids require µ-opioid receptors (Charbogne et al., 2017; Contarino et al., 2002; Matthes et al., 1996), which are enriched in VTA GABA neurons (Cohen et al., 1992; Devine and Wise, 1994; Johnson and North, 1992). Based on in vivo single unit and brain slice recordings, a disinhibition scenario of VTA DA neurons has been proposed (Gysling and Wang, 1983), whereby MOR activation inhibits GABA neurons (Johnson and North, 1992) through somatodendritic hyperpolarization and the reduction of the efferent release probability. The former effect would be mediated by G protein–coupled inwardly rectifying K+ (GIRK) channels, while inhibition of calcium entry underlies the later (Lüscher et al., 1997).
Regardless, it has been repeatedly argued that the initial reinforcing effects of opioids, can escape DA involvement. These results were largely based on pharmacological experiments. For example, the non-selective DA antagonists alpha-flupenthixol and haloperidol decreased cocaine SA but only to a lesser extent heroin SA (Ettenberg et al., 1982; Van Ree and Ramsey, 1987). In addition, lesioning DA terminals in the NAc with 6-OHDA had no effect on the initiation of heroin self-administration (Gerrits and Van Ree, 1996; Pettit et al., 1984) and the D1 antagonist SCH23390, when systemically administered, significantly decreased heroin self-administration, but had no effect when directly infused into the NAc (Gerrits et al., 1994).
The challenge of the DA hypothesis is also supported by genetic manipulations. For example, DA-deficient mice (targeted deletion of TH and DBH: tyrosine hydroxylase and dopamine beta-hydroxylase) still expressed conditioned place preference for morphine (Hnasko et al., 2005) and the downregulation of accumbal D1Rs prevented the acquisition of cocaine but not heroin self-administration (Pisanu et al., 2015).
If not through DA, how would opioids cause reinforcement? A model has been proposed with the pedunculopontine nucleus (PPN, called TPP in the original publication) as the initial target of opioids, which receives a descending GABA projection from the VTA. (Bechara and van der Kooy, 1992; Nader et al., 1994; Nader and van der Kooy, 1997). In this scenario DA-dependent mechanisms would take control only after chronic exposure, once dependence is established.
Not surprisingly, the question whether DA modulation underlies the reinforcing properties of opioid is therefore still hotly debated (Badiani et al., 2011; Blum et al., 2015; Nutt et al., 2015), which is why in the present study we use advanced genetic tools that allow for selective observation and manipulation of neuronal populations to revisit this fundamental question.
Mice were trained to intravenously self-administer heroin under a fixed-ratio one schedule (Figure 1a, see Methods) for 12 daily sessions of 6 hr maximum (Figure 1b). The dose was decreased from 50 to 25 µg/kg/infusion after six days, which led to higher acquisition rates (Figure 1c). The animals quickly learned to discriminate between an active and an inactive lever (after 6 days of training: 144.9 ± 26.0 active lever presses versus 8.3 ± 2.5 inactive ones; after 12 days: 283.4 ± 28 versus 20.9 ± 9.3. Figure 1d–f) and readily reached a robust number of heroin infusions (after 6 days of training with the higher dose: 50.6 ± 6.9 infusions; after 6 days with the lower dose: 138.1 ± 5.1 infusions after 12 days of training) in two to three hours at the end of the acquisition (Figure 1g). After 30d of withdrawal, mice were brought back into the apparatus in the absence of heroin injections and significantly differentiated between active and inactive lever (Figure 1h and i). Taken together this experiment shows that heroin was highly reinforcing and induced seeking behavior, a widely used model for relapse (García Pardo et al., 2017; Shaham et al., 2003).
To test whether heroin, when administered to drug-naïve mice, causes a DA increase in the NAc, we recorded the fluorescence changes of an intensity-based genetically encoded DA sensor (here called dLight1), in freely moving mice with fiber photometry. dLight1 enables optical readout of changes in DA concentration by coupling the agonist binding-induced conformational changes in human DA receptors to changes in the fluorescence intensity of circularly permuted (cp) GFP derived from GCaMP6 (Patriarchi et al., 2018). We started by replicating dopamine-specific responses in dLight-transfected HEK cells (Figure 2a, Patriarchi et al., 2018). Next, to probe DA release in freely moving mice, we delivered an adeno associated virus encoding dLight 1 (AAV9-CAG- dLight1) in the NAc, followed by implantation of an optic fiber for recordings. A group of DAT-cre mice was also injected the red-shifted opsin Chrimson (AAV8-hSyn-DIO-ChrimsonR-tdTo) into the VTA (Figure 2b–c). Brief (5 ms) delivery of 593 nm laser light pulses into the VTA resulted in an increase of fluorescence in the NAc that co-varied with frequency of stimulation (Figure 2d). To then test the effects of heroin on accumbal dopamine levels, animals were habituated to a recording arena and injected with either saline or heroin on subsequent days. Within less than a minute after the intraperitoneal heroin administration, we observed the onset of a fluorescence transient that peaked after three minutes (Figure 2e–f, saline: −0.008 ± 0.007, dF/F heroin: 0.133 ± 0.03 dF/F, p=0.0062, t(6) = 4.117, Paired Student’s T-Test, n = 7). Importantly, the effect size was similar to that of cocaine, but was not seen following administration of the selective-serotonin reuptake inhibitor citalopram nor the norepinephrine reuptake inhibitor reboxetine (Figure 2g saline: −0.01 ± 0.005 dF/F, citalopram: −0.008 ± 0.008 dF/F, cocaine: 0.165 ± 0.025 dF/F, reboxetine −0.009 ± 0.006 dF/F). These experiments demonstrate that dLight fluorescence specifically captures dopamine transients, and a first injection of heroin increases DA concentration in the NAc within minutes.
We then expressed a Cre-dependant GCaMP6m (AAV-DJ-EF1a-DIO-GCaMP6m) in DAT-Cre mice to record VTA DA neuron activity with fiber photometry (Figure 2g–h, Cui et al., 2013; Gunaydin et al., 2014; Lerner et al., 2015) after an i.v. heroin injection in freely moving mice. Repeated infusions of heroin through the jugular vein, every 2 min, increased the calcium signal in VTA-DA neurons within minutes and a plateau is observed after the third infusion (Figure 2i–j, dF/F saline compared to baseline: −0.0063 ± 0.016, dF/F heroin compared to baseline: 0.22 ± 0.085, n = 11, p=0.001, Wilcoxon signed rank test). Interestingly, even though the methods are inherently technically different, the kinetic of this activity is similar to the DA surge we observed in the NAc suggesting a tight correlation between VTA DA neurons activity and DA release. Taken together, these experiments demonstrate that the first exposure to heroin in naive animals increases VTA DA neuron activity and increases DA release in the NAc.
To map the activated neurons in the VTA following heroin administration, mice were perfused after the very first self-administration session and brain slices were stained for the expression of the immediate early gene cFos and tyrosine hydroxylase (TH, Figure 3a). Cells positive for cFos that were also TH-immunoreactive were found most prominently in the medial part of the VTA. In this area of the VTA, 27.8% of TH+ were cFos+ after heroin SA, whereas 1.5% of TH+ were cFos+ after saline SA (Figure 3b–e).
DA neurons located in the medial part of the VTA preferentially project to the NAc core and medial Shell as well as in the medial prefrontal cortex, whereas DA neurons located in more lateral portions of the VTA project to the lateral Shell (Lammel et al., 2008). To reveal the target of the heroin-activated VTA DA neurons, we therefore injected the cholera toxin subunit B (CTB) tracer of two distinct colors (CTB-555 and CTB-488) in the medial and lateral NAc shell respectively (Figure 3f) confirming the topography of medio-lateral VTA-NAc parallel projections. CTB-555 seeded in the medial NAc retrogradly migrated to the medial VTA, whereas CTB-455 injected into the lateral Shell was mostly found in the lateral VTA, with very little co-localization of the two tracers (1.5%, Figure 3g). This result indicates that there are very few collaterals (Yang et al., 2018), allowing for a quantification of the co-localization of the two CTB markers with cFos after a first heroin self-administration session. We thus injected mice with the CTB tracer of two distinct colors (CTB-555 or CTB-647 in the medial or lateral NAc shell respectively, counterbalanced between animals) and submitted the mice to a session of heroin SA. We found that 51.0% of medial Shell projecting VTA neurons were also cFos positive, while this was only the case in 20.5% lateral Shell projecting cells. (Figure 3h–j). Taken together, heroin self-administration activates DA neurons in the medial part of the VTA that project preferentially to the medial NAc shell.
To probe for a causal relationship between enhanced mesolimbic dopamine and heroin reinforcement, we tested whether inhibiting VTA DA neurons during the initial sessions would impact acquisition and maintenance of heroin SA. To this end, we injected DAT-Cre mice expressing hSyn-DIO-hM4D(Gi) in VTA DA neurons with CNO 1 hr prior to heroin self-administration (Figure 4a–b). This chemogenetic intervention has been shown to be efficient to hyperpolarize DA neurons in acute midbrain slices (Bariselli et al., 2018). Silencing VTA DA neurons in animals where self-administration was well established significantly decreased the number of active lever presses and ensuing heroin infusions (Figure 4c–e, 223 ± 60 LP for 111 ± 25 infusions dropped to 23 ± 9 LP for 15 ± 6 infusions after 4 days of treatment condition (DAT-Cre+ versus DAT-Cre-) x CNO (present, absent), two-way repeated measures ANOVA and multiple comparison post-hoc Sidack test, *p<0.05, **p<0.005, ****p<0.0001). We further tested the necessity of VTA DA signaling during the very early heroin SA sessions. Silencing VTA DA neurons from the first to the fourth session significantly prevented the acquisition of heroin SA (Figure 4f–h). After CNO was stopped (from session 5), the mice quickly acquired the task and reached a number of lever presses and infusions similar to the control animals. CNO had no effect on self-administration in DAT-Cre- mice. All together these results suggest that VTA DA activity is required for the initial reinforcing properties of opioids from the very early stage of drug exposure.
We next tested whether heroin would occlude optogenetic VTA DA neuron self-stimulation as shown for cocaine (Pascoli et al., 2015). If heroin employs the same neuronal circuitry as VTA DA neuron self-stimulation, a heroin injection should decrease lever pressing for VTA DA activation. DAT-cre+ mice were infected with the blue-light shifted excitatory opsin AAV5-DIO-hChR2-eYFP (Figure 5a–b). To avoid the development of tolerance to heroin, we injected various doses of heroin in a randomized order intraperitoneally (i.p.) immediately prior to the self-stimulation sessions (Figure 5c) and each session under heroin was followed by two sessions with free access to laser stimulation (LS) (Figure 5c). At baseline, the mice pressed up to 291 ± 39 times to obtain 134 ± 1.15 LS in 60 min under a fixed-ratio 1 (FR1, followed by 20 s time out period) schedule (Figure 5d and e). After heroin injection, the performance decreased significantly in a dose-dependent fashion (Figure 5i), suppressing lever pressing completely at the highest dose (Figure 5d–i). To rule out any sedative effects of heroin at these doses, a separate set of mice were tested over a 30 min free exploration period in an open field (see Methods). We observed that heroin actually increases locomotor activity in open field, even at the highest dose tested (Figure 5j). These experiments indicate that reinforcement induced by optogenetic self-stimulation of VTA DA neurons or heroin SA share underlying neural circuits.
Finally to examine the involvement of VTA GABA neurons (Tan et al., 2012; van Zessen et al., 2012), we tested the reinforcing properties of their self-inhibition and asked whether heroin exposure would also occlude this behavior. We expressed the light-gated inhibitory proton pump eArchT-3.0 in the VTA of GAD-Cre mice (Figure 6a–b; O'Connor et al., 2015) and gave the mice control over the laser switch. The mice quickly learned to discriminate the active and inactive lever by significantly increasing the number of active lever press and laser-stimulation triggered during the acquisition sessions (293.1 ± 40 LP to obtain 119 ± 15 LS in 180 min). Intraperitoneal injection of heroin just prior to the VTA GABA self-inhibition session significantly decreased the operant behavior in a dose-dependent manner (Figure 6d–f) and abolished this behavior at the highest dose (Figure 6d–f). In fact, the IC50 was very similar to the IC50 calculated based on the occlusion with DA neuron self-stimulation (6.9 vs 6.4 mg/kg, Figure 6i and Figure 5i). This experiment indicates that reinforcement by optogenetic VTA GABA self-inhibition and reinforcement by heroin share underlying neural circuits and are compatible with a disinhibitory mechanism where heroin targets GABA neurons leading to an increase of DA neurons activity.
In the present study, we found that heroin increases DA in the NAc through the activation of a subset of VTA DA neurons located in the medial VTA, which preferentially project to the NAc medial shell. Our chemo- and optogenetic manipulations support a disinhibitory motif and establish a link of causality with behavioral reinforcement.
The VTA consists of DA (60–65%), GABA (30–35%), glutamate (2%) and neurons that express more than one marker (Margolis et al., 2006; Nair-Roberts et al., 2008; Roberts and Ribak, 1987; Steffensen et al., 1998; Yamaguchi et al., 2011). Among DA neurons, subpopulations have been proposed based on DAT expression (Blanchard et al., 1994; Li et al., 2013), properties of afferent excitatory and inhibitory synaptic inputs, as well as projection to distinct targets, which have been mapped to specific functions. For example, aversive stimuli potentiate glutamatergic inputs onto DA neurons projecting to the mPFC, while rewarding stimuli potentiate inputs onto medial shell and lateral shell NAc projecting DA neurons (Lammel et al., 2011). The mediolateral topography of accumbens shell neurons is conserved by their dopamine inputs from the VTA, where medial and lateral shell projecting DA neurons segregate along the mediolateral VTA (Beier et al., 2015; Lammel et al., 2012; Lammel et al., 2008). We find that heroin during initial reinforcement preferentially activates neurons of the medial VTA projecting to the medial Shell, but cannot exclude the contribution from other projections.
Most of the afferents to midbrain DA neurons are GABAergic. Back projecting accumbal medium-spiny neurons, while targeting both DA and GABA neurons (Beier et al., 2015; Henny et al., 2012; Yang et al., 2018), form particularly strong synaptic connections to the latter (Bocklisch et al., 2013), which then control the activity of DA neurons (Johnson and North, 1992). This disinhibitory motif is particularly strong for neurons of the lateral VTA (Yang et al., 2018). Removal of tonic inhibition from VTA DA neurons by interneurons and accumbal projections can therefore cause increases in DA neuron activity (Jhou et al., 2009; Johnson and North, 1992) and NAc DA release (van Zessen et al., 2012). Using a genetically encoded DA reporter we confirm that already the very first dose of heroin increases DA in the shell.
In the alternate model of DA-independent reinforcement, the initial effect of opioids would still be on VTA GABA neurons that however mainly project to the PPN (Laviolette et al., 2004), a heterogeneous nucleus containing GABA, glutamate and acetylcholine neurons. The PPN projects back to the midbrain (Oakman et al., 1995; Wang and Morales, 2009; Watabe-Uchida et al., 2012) regulating reinforcement (Floresco et al., 2003; Inglis et al., 2000; Lammel et al., 2012; Pan and Hyland, 2005; Steidl and Veverka, 2015). The nearby LDT which strongly innervates the VTA may also contribute (Omelchenko and Sesack, 2005). There is direct evidence that the activation of the LDT-VTA pathway leads to CPP, reinforces operant responses with natural- (Lammel et al., 2012; Steidl and Veverka, 2015) and drug-rewards (Shinohara et al., 2014; Wise, 2004). However, all these scenarios converge to activate VTA DA neurons, and are thus not DA-independent. Such a ‘non-dopaminergic substrate for reward within the VTA’ (Nader and van der Kooy, 1997) is also at odds with several publications that observe an increase of extracellular DA levels in the shell following acute administration of opioids (Aragona et al., 2008; Di Chiara and Imperato, 1988; Pontieri et al., 1995; Stuber et al., 2005). Moreover our data suggest that the same circuits maintain reinforcement as exposure becomes chronic. After 12 days of heroin self-administration, inhibition of DA neurons still caused a strong, but fully reversible decrease in the responding behavior. Taken together DA-independent heroin reinforcement seems unlikely.
Heroin decreased lever-pressing for VTA DA neuron self-stimulation (Pascoli et al., 2015) in a dose-dependent fashion, which could not be explained by sedative effects as the same dose was able to increase locomotor activity in an open field. Such occlusion strongly suggests that heroin converges on the same cellular mechanism (similar to cocaine occlusion experiments). We also - to the best of our knowledge for the first time in the literature - observed strong reinforcement with GABA neuron self-inhibition, which was sensitive to heroin exposure. Opioids suppress the activity of VTA GABA interneurons by activation of µ-opioid receptors (MORs) (Jalabert et al., 2011; Johnson and North, 1992; Mazei-Robison et al., 2011), which then hyperpolarize the neurons and decrease the release probability at the axon terminal via the activation of GIRK channels and the inhibition of voltage gated calcium channels, respectively (Cohen et al., 1992; Johnson and North, 1992). The most straightforward interpretation for the effect on the behavior is thus again an occlusion scenario. Interestingly the IC50 for this effect was virtually identical to the IC50 observed with the occlusion of self-stimulation of DA neurons.
Our results are in direct contrast with older pharmacological experiments, where DA receptor antagonists had an effect on reinforcement of cocaine but not heroin (Ettenberg et al., 1982; Pettit et al., 1984). Maybe the receptor occupancy of the antagonists was insufficient, particularly when administrated intra-cranially as suggested by the discrepancy between the systemic and intra-cranial results (Neisewander et al., 1998). Moreover, several studies relied on CPP rather than testing for the effect on self-administration. Moreover, validation of the pharmacological effect on neural activity in vivo remains difficult.
On the other hand, there is much support for functional accumbal DA transmission underlying opioid reinforcement. Electrolytic lesions or inactivation of the NAc result in significantly decreased responding for intravenous self-administration of opioids (Alderson et al., 2001; Dworkin et al., 1988; Suto et al., 2011; Zito et al., 1985). Mice also learn to self-administer MORs agonists when infused directly into the NAc (David and Cazala, 2000; Goeders et al., 1984), most likely via the disinhibitory loop connecting the NAc back to the VTA. A recent study using a siRNA to downregulate D1aR in the NAc shell blocked the acquisition of cocaine but not heroin self-administration (Pisanu et al., 2015), raising the question for the role of specific DA receptors in drug-adaptive behavior.
The observation that morphine can induce CPP in DA-deficient mice (Hnasko et al., 2005) is likely explained by developmental adaptations. Moreover, these mice suffer from a severe locomotion deficit, which precluded the testing for reinforcement in a self-administration setting.
Here, by confirming the validity of the DA hypotheses for opioids, we aim to integrate work on these drugs into the emerging circuit model for addiction. Our study thus supports VTA DA neuron disinhibition for the opioid reinforcement. Untangling the circuits underlying opioid reinforcement may not only allow refining addiction treatments, but also draft the roadmap for the development of analgesic compounds without addiction liability.
Wild-type C57BL/6 and transgenic mice were used throughout the study. Weights, ages and genders were homogeneously distributed among the groups. Transgenic mice were backcrossed to the C57BL/6 line for a minimum of four generations. Transgenic DAT-Cre mice (Slc6a3) were heterozygous BAC transgenic mice in which the Cre recombinase expression was under the control of the regulatory elements of the DA transporter gene (DAT-Cre+ mice; (Turiault et al., 2007) DAT-Cre mice were originally provided by Günther Schutz. GAD-Cre+ (Gad65Cre non-inducible;(Kätzel et al., 2011)) mice (Gad2) were also used. All animals were kept in a temperature (21 ± 2°C) and humidity (50 ± 5%) controlled environment with a 12 hr light/12 hr dark cycle (lights on at 7:00). Food and water were available ad libitum, unless otherwise stated. All procedures were approved by the animal welfare committee of the Cantonal of Geneva, in accordance with Swiss law.
Anesthesia was induced at 5% and maintained at 2.5% isoflurane (w/v) (Baxter AG) during the surgery. The animal was placed in a stereotaxic frame (Angle One) and craniotomies were performed using stereotaxic coordinates (for VTA: AP −3.3; ML −0.9 with a 10° angle; DV −4.3. For the lateral NAc shell: AP 0.98; ML +- 1.6; DV −4.5. For the medial NAc shell: AP 1.6; ML +- 0.5; DV −4.3). Injections of virus (0.5 μl) used graduated pipettes (Drummond Scientific Company), broken back to a tip diameter of 10–15 mm, at an infusion rate of 0.05 μl / min. Following the same procedure AAV5-hSyn-DIO-HM4D(Gi)-mCherry, AAV5-EF1a-DIO-ChR2(H134R)-eYFP and AAV5-EF1a-DIO-eArch3.0-eYFP (all from University of North Carolina Vector Core) were also injected bilaterally in the VTA, while AAV-DJ-EF1a-DIO-GCaMP6m (Stanford Vector Core) and dLight1 (AAV9-CAG-dLight1.1, courtesy of Dr. Lin Tian, University of California Davis) were injected unilaterally in the VTA and the NAc, respectively. Finally, cholera toxin subunit B Alexa Fluor 555, 488 and 647 conjugate (CTB 555, CTB 488 and CTB 647, Invitrogen) were injected bilaterally in the lateral or medial NAc shell respectively. When the experimental paradigm required it, during the same surgical procedure, unique chronically indwelling optic fiber cannula (Sparta et al., 2011) were implanted above the VTA using the exact same coordinates as for the injection except for DV coordinate, which was reduced to 4.2. Three screws were fixed into the skull to support the implant, which was further secured with dental cement. First behavioral session typically occurs 10–14 days after surgery to allow sufficient expression of the virus.
Mice were anaesthetized with a mix of ketamine (60 mg/kg, Ketalar) and xylazine (12 mg/kg, Rompun) solution. Catheters (CamCaths, model MIVSA) made of silicone elastomer tubing (outside diameter 0.63 mm, inside diameter 0.30 mm) were inserted 1.0–1.2 cm in the right jugular vein, about 5 mm from the pectoral muscle, to reach the right atrium. The other extremity of the catheter was placed subcutaneously in the mid-scapular region and connected to stainless steel tubing appearing outside the skin. Blood reflux in the tubing was checked to confirm correct placement of the catheter. Mice were allowed to recover for 3–5 days before the start of drug self-administration training and received antibiotics (Baytril 10%, 1 ml in 250 ml of water) in the drinking water during this period. Catheters were flushed daily with a heparin solution (Heparin ‘Bichsel’) in saline (30 IU) during the recovery period and just before and after each self-administration session.
All behavioral experiments were performed during the light phase and took place in mouse operant chambers (ENV-307A-CT, Med Associates) situated in sound-attenuating cubicle (Med Associates). Two retractable levers were present on both sides of one wall of the chamber. A cue-light was located above each lever and a house light was present in each chamber. During intravenous drug self-administration sessions, the stainless steel tubing of the catheter device was connected through a CoEx PE/PVCtubing (BCOEX-T25, Instech Solomon) to a swivel (Instech Solomon) and then an infusion pump (PHM-100, Med-Associates). The apparatus was controlled and data captured using a PC running MED-PC IV (Med-Associates).
To familiarize the mice with the operant self-administration setting, we performed four days of food shaping, whereby the mouse had to press an active lever once to obtain a food reward (FR1, one 60 min session per day, 20 mg sucrose pellet, Test Diet, USA). Following IV-catheter placement surgery mice were deprived of food for 12 hr before the first self-administration session to promote exploratory activity and were given food access ad libitum after the first session. Each session was 360 min in duration and started with the illumination of the house light and the insertion of the two levers into the operant chamber. During the first six sessions, a single press on the active lever (termed fixed-ratio one, or FR1) resulted in an infusion of 0.05 mg/kg of heroin (diacetylmorphine, DiaMo Narcotics GmbH, dissolved in 0.9% saline at 0.05 mg/mL and delivered at 0.0177 ml/s as a unit dose depending on the weight of the mouse) paired with a 5 s continuous illumination of the cue light above the active lever. Completion of the fixed-ratio also initiated a timeout period of 40 s during which heroin was no longer available. For the next six sessions, the dose of heroin was halved to 0.025 mg/kg in order to boost lever pressing (as a measure of motivation) while avoiding overdoses or any eventual sedative effect. Time out period was reduced to 10 s. The active lever (left or right lever) was randomly assigned for each mouse. To avoid an overdose of heroin, a maximum of 75 infusions for the ‘high’ dose and 150 for the ‘low’ dose were allowed per session. Only mice having reached a stable rate of correct lever responses were included in the study. Saline control mice undertook the same procedure as heroin mice except that saline (NaCl 0.9% B. Braun) replaced heroin infusions.
Thirty days after the final self-administration session (that is day 42), mice were tested for cue-associated seeking. The cue-associated drug-seeking test was a 90 min session, identical to the heroin acquisition period (house light on, insertion of the two levers), except that one press on the active lever (FR1 schedule) now triggered illumination of the cue light for 5 s but without a heroin infusion or a timeout period. The infusion pump was also activated during the drug-seeking session, because the pump noise provided an extra drug-associated cue.
Following viral injections (see above), DAT-Cre+ or wildtype mice were chronically implanted with an optic fiber (MFC_400/430–0.48_4 mm_ZF2.5(G)_FLT, Doric Lenses) above the VTA (GCaMP6m experiments) or NAc (dLight experiments). During recordings excitation (470 nm, M470F3, Thorlabs) and control LED light (405 nm, M405FP1, Thorlabs) was passed through excitation filters and focused onto a patch cord (MFP_400/430/1100–0.48_4 m_FC-ZF2.5, Doric Lenses). The fiber patch cord was connected to the chronically implanted fiber, and emission light (500–550 nm) was collected through the same fiber and passed onto a photoreceiver (Newport 2151, Doric Lenses). Excitation light was sinusoidally modulated at 211 and 531 Hz (470 nm and 405 nm light, respectively) and collected raw signal was demodulated by a real-time signal processor (RZ5P, Tucker Davis Systems) to determine contributions from 470 nm and 405 nm excitation sources (see Lerner et al., 2015).
For bulk GCaMP6 imaging of VTA DA calcium activity, animals were also surgically implanted with an intravenous catheter in the right jugular vein (see above). After habituation they were then recorded while freely moving in standard Med-Associates operant chambers. They were recorded for a baseline period (10 min) and then received five saline IV injections, immediately followed by a second baseline period (10 min) and five IV heroin injections. All injections were non-contingent with a two minute inter-injection interval.
For recordings of striatal dopamine dynamics using dLight, after habituation to handling, animals were injected intraperitoneally while freely moving in their homecage. In order to assess the effects of heroin (8 mg/kg), intraperitoneal Injections were performed on two experimental days separated by at least 48 hr (saline or heroin in counterbalanced design). A separate cohort of animals was subsequently treated with saline, citalopram (10 mg/kg, Cayman Chemical), reboxetine (20 mg/kg, Tocris) and cocaine (20 mg/kg) on separated recording days. During each day, fluorescence was recorded for at least a five-minute baseline period before injection, and a twenty-minute period after injection.
A subgroup of dLight animals was DAT-Cre positive (n = 3), and also injected with AAV8-hSyn-FLEX-ChrimsonR-TDTomato in the VTA, while a second optic fiber was placed above the structure. On a separate session, animals were optogenetically stimulated in the VTA while freely moving in their homecage. Using a counterbalanced sequence, bursts of 10–15 mW 593 nm wavelength light were administered once per 30 s. Bursts consisted of 5 pulses of 5 ms duration at 5, 10, 20, and 50 Hz. Laser light originated from a 593 nm DPSS laser that was gated by a shutter (CMSA-SR475_FC, Doric Lenses).
All data analyses were performed offline in Matlab (custom script https://github.com/tjd2002/tjd-shared-code, Davidson, 2016). To calculate dF/F, a linear fit was applied to the 405 nm signal during the baseline period to align it to the 470 nm signal, producing a fitted 405 nm signal that was used as F0 to normalize the 470 nm using standard dF/F normalization: (470 nm signal - fitted 405 nm signal)/fitted 405 nm signal. To quantify signal changes, for GCaMP6 experiments the average signal in the five minutes preceding the first IV injection (saline or heroin) was then compared to the average signal in the five minutes following the fifth IV injection (saline or heroin). For dLight experiments, the average signal in the five minutes preceding IP injection was compared to the average signal between ten and fifteen minutes after IP injection. At the end of all experiments, mice were euthanized and brains fixed in paraformaldehyde to prepare histological slices for verification of virus expression.
HEK293T cells (ATCC, Manassas VA #1573, STR authenticated, mycoplasma negative) were cultured and transfected as previously described (Patriarchi et al., 2018). Briefly, hippocampal neurons were isolated and infected using AAVs (1 x 109 GC/ml) at DIV5. Two weeks later, cells were washed with HBSS (Life Technologies) and imaged using a 40X oil-based objective on an inverted Zeiss Observer LSN710 confocal microscope with 488/513 ex/em wavelengths. For testing dLight1 sensitivity, neurotransmitters were directly applied to the bath during imaging sessions. A dual buffer gravity-driven perfusion system was used to exchange buffers between different drug concentrations. One-photon emission spectrum for the sensors was determined using the lambda-scan function of the confocal microscope. Two-photon emission spectrum was obtained with a 40X water-based objective on a SliceScore (Scientifica) and used to obtain normalized two-photon cross-section using MATLAB ROIs were generated using the threshold function in Fiji. Spatial movies and images of dF/F in response to a ligand was calculated as with the pixel-wise fluorescence value at each time and mean fluorescence in time points prior to ligand application,.
Mice were injected with a lethal dose of pentobarbital (150 mg/kg) and perfused trans-cardially with cold PBS and 4% paraformaldehyde solution. Brains were extracted and submerged in fixative for 24 hr at 4°C. Series of coronal 60 mm thick sections were cut on a vibratome. Immunostaining started by blocking slices in PBS 10% BSA and 0.3% Triton X-100 followed by overnight incubation in PBS 3% BSA and 0.3% Triton X-100 with primary antibody: cFos (dilution 1:5000, rabbit polyclonal, Santa Cruz, RRID: AB_2106783), TH (dilution 1:500, Mouse monoclonal anti-Tyrosine Hydroxylase, Sigma T2928) or GFP (dilution 1:500, rabbit polyclonal, Invitrogen, A11122). After three 15 min washes in PBS at room temperature, slices were incubated with 1:500 Alexa-conjugated secondary antibodies against the corresponding species (Alexa-Fluor 488, 555, Life Technologies). After three more steps of washing in PBS, a Hoechst staining was used to stain all neurons. Slices were then mounted and covered on microscope slides using mounting medium Mowiol (Calbiochem, Cat 475904–100 GM). Images were obtained in a confocal laser-scanning microscopy with a Fluoview 300 system (Olympus) using a 488 nm argon laser and a 537 nm heliumneon laser or in Leica SP5 confocal microscope using additional 350 nm laser with a 20x/0.7 NA oil immersion or objective. A semi-automated method was used to quantify viral infection and cFos expression in confocal images of brain slices containing the VTA or the NAc. Equally thresholded images were subjected to multiparticle analysis (NIH ImageJ). Region of interest (ROI) intensity values were obtained from the z stack of raw images by using Multi Measure tool. Colocalization was determined by overlap of the ROI obtained from the two independent fluorescence signals. Analysis was performed in at least three sections per animal.
For optogenetic studies, fiber optic cannulae of mice were connected via patch cords (Thor Labs, Germany) to a rotary joint (FRJ_1 × 2_FC-2FC; Doric Lenses, Quebec, Canada), suspended above the operant chamber. A second patch cord connected from the rotary joint to a blue or orange DPSS laser (SDL-473–100 mW or SDL-593–100 mW, respectively; Shanghai Dream Lasers; Shanghai, China) positioned outside of the cubicle. Laser power was typically 15–20 mW measured at the end of each patch cord. Thus, allowing for up to 30% power loss in connecting the patch cord to the implanted cannulae, we estimated laser power to be approximately 10–14 mW at the tip of the cannulae. In some cases, a mechanical shutter was used to control laser output (SR474 driver with SR476 shutter head; Stanford Research Systems, aligned using a connectorized mechanical shutter adaptor; Doric Lenses).
Each of the 22 optogenetic stimulation of VTA DA acquisition sessions lasted 60 min with no maximum number of reward. During all the sessions, a single press on the active lever (termed fixed ratio one, or FR1) resulted in a 10 s illumination of a cue light (pulses of 1 s at 1 Hz). After a delay of 5 s, onset of a 15 s blue laser stimulation (473 nm) composed of 30 bursts separated by 250 ms (each burst consisted of 5 laser pulses of 4 ms pulse width at 20 Hz; Brown et al., 2010). A 20 s timeout followed the rewarded lever press, during which lever presses had no consequence but were recorded.
Each of the 19 optogenetic inhibition of VTA GABA acquisition sessions lasted 180 min with no maximum number of reward. During all the sessions, a single press on the active lever (termed fixed ratio one, or FR1) resulted in a 10 s illumination of a cue light (pulses of 1 s at 1 Hz). After a delay of 5 s, onset of a 15 s continuous orange laser stimulation. A 20 s timeout followed the rewarded lever press, during which lever presses had no consequence but were recorded.
To assess locomotor activity, mice were tested over a 30 min free exploration period in an open field. The apparatus consisted of Plexiglas square (40 × 40 cm). Light intensity was respectively 150 lux and 120 lux at the center and walls of the arena. Animals were injected daily with saline or increasing doses of heroin (4, 8, 16, 32 mg/kg, i.p., 10 ml/kg) immediately before being placed in the apparatus. The distance travelled was recorded and analyzed by a video-tracking system (ANY-maze; Stoelting).
One hour before the behavioral test, DAT-Cre+ mice expressing DREADD receptors were intraperitoneally injected with CNO 2 mg/kg in saline solution (10 ml/kg). The mice were then placed in the same settings as in a self-administration 360 min session (as described on page 12). Mice were randomly assigned to one of the two behavioral protocols. DAT-Cre- mice were used in the same settings as negative controls.
Sample sizes were calculated using publicly available sample size calculators; group sizes are in the range use for similar methodology by us and others. Experiments were typically repeated in at least two cohorts. Samples were randomly assigned to experimental groups. Experimenters were not blinded for data collection and analysis, except for cFos quantification (both acquisition and analysis),. Multiple comparisons were first subject to mixed-factor ANOVA defining both between- (for example, DAT-cre +vs DAT-cre-; saline or heroin self-administration groups) and/or within- (for example, active or inactive lever presses) group factors. Where significant main effects or interaction terms were found (p<0.05), further comparisons were made by a two-tailed Student’s t-test with Bonferroni correction. Single comparisons of between- or within-group measures were made by two-tailed non-paired or paired Student’s t-test, respectively.
Opiate versus psychostimulant addiction: the differences do matterNature Reviews Neuroscience 12:685–700.https://doi.org/10.1038/nrn3104
Neuronal signature of social novelty exploration in the VTA: implication for autism spectrum disorderbioRxiv, 280537, 10.1101/280537.
Dopamine in the brain: hypothesizing surfeit or deficit links to reward and addictionJournal of Reward Deficiency Syndrome 1:95–104.https://doi.org/10.17756/jrds.2015-016
Lack of reward and locomotor stimulation induced by heroin in mu-opioid receptor-deficient miceEuropean Journal of Pharmacology 446:103–109.https://doi.org/10.1016/S0014-2999(02)01812-5
Self-administration of morphine, DAMGO, and DPDPE into the ventral tegmental area of ratsThe Journal of Neuroscience 14:1978–1984.https://doi.org/10.1523/JNEUROSCI.14-04-01978.1994
Kainic acid lesions of the nucleus accumbens selectively attenuate morphine self-administrationPharmacology Biochemistry and Behavior 29:175–181.https://doi.org/10.1016/0091-3057(88)90292-4
Structural correlates of heterogeneous in vivo activity of midbrain dopaminergic neuronsNature Neuroscience 15:613–619.https://doi.org/10.1038/nn.3048
Opioids excite dopamine neurons by hyperpolarization of local interneuronsThe Journal of Neuroscience 12:483–488.https://doi.org/10.1523/JNEUROSCI.12-02-00483.1992
Heterogeneous composition of dopamine neurons of the rat A10 region: molecular evidence for diverse signaling propertiesBrain Structure and Function 218:1159–1176.https://doi.org/10.1007/s00429-012-0452-z
D1 dopamine receptors in the nucleus accumbens modulate cocaine self-administration in the ratPharmacology Biochemistry and Behavior 45:239–242.https://doi.org/10.1016/0091-3057(93)90112-7
The ventral tegmental area revisited: is there an electrophysiological marker for dopaminergic neurons?The Journal of Physiology 577:907–924.https://doi.org/10.1113/jphysiol.2006.117069
The dopamine theory of addiction: 40 years of highs and lowsNature Reviews Neuroscience 16:305–312.https://doi.org/10.1038/nrn3939
Positive reinforcement produced by electrical stimulation of septal area and other regions of rat brainJournal of Comparative and Physiological Psychology 47:419–427.https://doi.org/10.1037/h0058775
Laterodorsal tegmental projections to identified cell populations in the rat ventral tegmental areaThe Journal of Comparative Neurology 483:217–235.https://doi.org/10.1002/cne.20417
On the role of ascending catecholaminergic systems in intravenous self-administration of cocainePharmacology Biochemistry and Behavior 6:615–620.https://doi.org/10.1016/0091-3057(77)90084-3
Electrophysiological characterization of GABAergic neurons in the ventral tegmental areaThe Journal of Neuroscience 18:8003–8015.https://doi.org/10.1523/JNEUROSCI.18-19-08003.1998
The dopamine hypothesis of opiate reward challengedEuropean Journal of Pharmacology 134:239–243.https://doi.org/10.1016/0014-2999(87)90172-5
Disruption of cocaine and heroin self-administration following kainic acid lesions of the nucleus accumbensPharmacology Biochemistry and Behavior 23:1029–1036.https://doi.org/10.1016/0091-3057(85)90110-8
Lisa M MonteggiaReviewing Editor; UT Southwestern Medical Center, United States
Catherine DulacSenior Editor; Harvard University, United States
Marina E WolfReviewer
In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.
Thank you for submitting your article "Dopamine neurons projecting to medial shell of the nucleus accumbens drive heroin reinforcement" for consideration by eLife. Your article has been reviewed by three peer reviewers, one of whom is a member of our Board of Reviewing Editors and the evaluation has been overseen by a Senior Editor. The reviewers have opted to remain anonymous.
The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.
The reviewers were overall positive about your study and found the manuscript balanced and scholarly in its effort to address a timely question. However, the reviewers agreed that certain key issues need to be addressed before publication. Importantly, two methods used in the study need some further validation. The authors are strongly encouraged to control for potential non-specific effects of CNO as well as establish the specificity of the dLight1 probe. The authors may already have data to address these issues. Inclusion of further controls will strengthen the arguments presented in the manuscript.
1) D-Light is a novel tool. The authors should validate the specificity of the signal. It would be great to replicate some of the controls shown in the original paper to establish the veracity of D-Light as a reporter for dopamine.
2) Given the recent issues with the selectivity and potential metabolism of clozapine-n-oxide (CNO), the authors should control the DREADD experiments with clozapine to ensure that the effects are specific to DREADD action.https://doi.org/10.7554/eLife.39945.013
1) D-Light is a novel tool. The authors should validate the specificity of the signal. It would be great to replicate some of the controls shown in the original paper to establish the veracity of D-Light as a reporter for dopamine.
We agree with the reviewers that replication of key observations with dLight1.1 constitutes a service to the community. We have therefore carried out fluorescence measurements in HEK cells that express the dLight1.1 in response to the bath application of the major transmitters found in the NAc, confirming the molecular specificity of the sensor. Moreover, we have also looked at three additional conditions in vivo, measuring dLight1.1 fluorescence after application of different monoamine reuptake inhibitors. We find that selective serotonin reuptake inhibitor citalopram and norepinephrine reuptake inhibitor reboxetine yield no increase in the dLight1.1 signal, while cocaine administration drove a transient similar to heroin, again confirming the selectivity of the sensor.
These data have now been added to Figure 2.
2) Given the recent issues with the selectivity and potential metabolism of clozapine-n-oxide (CNO), the authors should control the DREADD experiments with clozapine to ensure that the effects are specific to DREADD action.
We thank the reviewer to bring up this important point, as CNO may be metabolized to clozapine that antagonizes endogenous DRD2 receptor weakly. For these reasons, we chose an experimental design where we compared DAT-Cre+ to DAT-Cre– mice. In the latter the DREADD was not expressed, but they were still injected with CNO. These experiments will demonstrate whether CNO (or its metabolites) has a non-specific effect in the absence of the DREADD, which however was not the case.https://doi.org/10.7554/eLife.39945.014
- Christian Lüscher
- Christian Lüscher
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
The Swiss National Science Foundation and the European Research Council supported the work.
Animal experimentation: This study was performed in accordance with Swiss law (LPA). All of the animals were handled according to approved institutional animal care and use committee of Unige. The protocol was approved by the Committee on the Ethics of Animal Experiments of canton of Geneva (Permit Number: GE-128-16). Every effort was made to minimize suffering.
- Catherine Dulac, Harvard University, United States
- Lisa M Monteggia, UT Southwestern Medical Center, United States
- Marina E Wolf
© 2018, Corre et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.