Research Article

Neuroscience

Functional diversity of dopamine axons in prefrontal cortex during classical conditioning

Department of Neuroscience, Medical University of South Carolina, United States
Department of Pharmacology, Kagoshima University, Japan
Institute for Quantum Life Science, National Institutes for Quantum Science and Technology, Japan
Japan Science and Technology PRESTO, Japan
Department of Physiology, Monash University, Australia
Neuroscience Program, Biomedicine Discovery Institute, Monash University, Australia
Faculty of Data Science, Shiga University, Japan
Department of Biology, College of Charleston, United States
Department of Neuro-Medical Science, Osaka University, Japan
Japan Science and Technology FOREST, Japan

May 15, 2024

https://doi.org/10.7554/eLife.91136.3

Open access
Copyright information

Version of Record

The authors declare this version of their article to be the Version of Record.

About eLife's process

Download
Cite
Share
CommentOpen annotations (there are currently 0 annotations on this page).

Version of Record published: May 15, 2024 (This version)
Reviewed preprint version 2: April 17, 2024 (Go to version)
Reviewed preprint version 1: September 28, 2023 (Go to version)
Preprint posted: August 24, 2023 (Go to version)
Sent for peer review: August 6, 2023

1. Of interest
Emergence of brain-like mirror-symmetric viewpoint tuning in convolutional neural networks

Amirhossein Farzmahdi, Wilbert Zarco ... Tal Golan

Research Article May 31, 2024
Further reading

Abstract
eLife assessment
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

Midbrain dopamine neurons impact neural processing in the prefrontal cortex (PFC) through mesocortical projections. However, the signals conveyed by dopamine projections to the PFC remain unclear, particularly at the single-axon level. Here, we investigated dopaminergic axonal activity in the medial PFC (mPFC) during reward and aversive processing. By optimizing microprism-mediated two-photon calcium imaging of dopamine axon terminals, we found diverse activity in dopamine axons responsive to both reward and aversive stimuli. Some axons exhibited a preference for reward, while others favored aversive stimuli, and there was a strong bias for the latter at the population level. Long-term longitudinal imaging revealed that the preference was maintained in reward- and aversive-preferring axons throughout classical conditioning in which rewarding and aversive stimuli were paired with preceding auditory cues. However, as mice learned to discriminate reward or aversive cues, a cue activity preference gradually developed only in aversive-preferring axons. We inferred the trial-by-trial cue discrimination based on machine learning using anticipatory licking or facial expressions, and found that successful discrimination was accompanied by sharper selectivity for the aversive cue in aversive-preferring axons. Our findings indicate that a group of mesocortical dopamine axons encodes aversive-related signals, which are modulated by both classical conditioning across days and trial-by-trial discrimination within a day.

eLife assessment

This important study shows that distinct midbrain dopaminergic axons in the medial prefrontal cortex respond to aversive and rewarding stimuli and suggest that they are biased toward aversive processing. The use of innovative microprism based two-photon calcium imaging to study single axon heterogeneity is convincing, although the experimental design makes it difficult to definitively distinguish aversive valence from stimulus salience in this dopamine projection. This work will be of interest to neuroscientists working on neuromodulatory systems, cortical function and decision making.

https://doi.org/10.7554/eLife.91136.3.sa0

Introduction

The prefrontal cortex (PFC) contributes to a variety of higher cognitive functions, achieving the flexible control of behaviors that enables animals to adapt to a changing environment (Miller and Cohen, 2001; Fuster, 2015). The PFC is involved, for instance, in stimulus selection, working memory, rule switching, and decision making (Miller and Wallis, 2009). PFC processing and circuits are highly sensitive to neuromodulators, including dopamine (Seamans and Yang, 2004; Arnsten et al., 2012). Indeed, studies using pharmacological or optogenetic manipulation of dopamine signaling have suggested roles of dopamine in gating sensory signals (Popescu et al., 2016; Vander Weele et al., 2018), maintaining working memory (Sawaguchi and Goldman-Rakic, 1994), and relaying decisions to motor structures (Ott et al., 2014). Consistently, dysregulation of dopamine signaling in the PFC has been suggested to underlie a wide array of neuropsychiatric disorders, including schizophrenia, depression, attention-deficit/hyperactivity disorder, and post-traumatic stress disorder (Okubo et al., 1997; Lindström et al., 1999; Granon et al., 2000; Arnsten and Dudley, 2005; Howes and Kapur, 2009; Hoexter et al., 2012; Grace, 2016).

The PFC receives dopaminergic inputs from a subset of dopamine neurons in the midbrain, but the information encoded by this subset in vivo remains unclear. Decades of investigations have revealed that midbrain dopamine neurons in the ventral tegmental area (VTA) generally encode reward prediction errors (Schultz et al., 1997): the neurons increase their firing to unexpected reward delivery and shift their response to cues that precede reward delivery after instrumental learning or classical conditioning (Rescorla and Wagner, 1972; Sutton and Barto, 1981). However, several studies have reported that a subpopulation of dopamine neurons show phasic responses to aversive stimuli as a part of salience signaling (Chiodo et al., 1980; Mantz et al., 1989; Guarraci and Kapp, 1999; Matsumoto and Hikosaka, 2009), implying that midbrain dopamine neurons may not be functionally homogeneous. Indeed, depending on the projection target, dopamine neurons can have distinct molecular, anatomical, and electrophysiological features (Lammel et al., 2008; Poulin et al., 2018). Dopamine neurons that project to the PFC might locate primarily to the medial posterior VTA (Lammel et al., 2008) and show different genetic profiles from other dopamine neurons (Poulin et al., 2018). In addition, optogenetic stimulation of the PFC-projecting DA neurons does not reinforce specific actions (Popescu et al., 2016; Ellwood et al., 2017; Vander Weele et al., 2018). Moreover, these neurons might respond not only to rewarding stimuli but also to aversive stimuli. Microdialysis, amperometry, and voltammetry measurements in the PFC have demonstrated an increase of dopamine in response to appetitive stimuli (Hernandez and Hoebel, 1990; Ahn and Phillips, 1999; St. Onge et al., 2012), aversive stimuli (Thierry et al., 1976; Abercrombie et al., 1989; Finlay et al., 1995; Vander Weele et al., 2018), or both (Bassareo et al., 2002). Similarly, measurements of the bulk calcium activity of mesocortical dopaminergic fibers have shown responses to appetitive (Ellwood et al., 2017) and aversive (Kim et al., 2016) stimuli. This apparent discrepancy is difficult to reconcile because none of these approaches could investigate the activity of individual dopamine neurons. Moreover, most previous studies evaluated the effects of either a rewarding or an aversive stimulus, rather than both. Consequently, it remains unknown whether the same or different mesocortical dopamine neurons respond to behaviorally opposing stimuli. It is also not known how these dopamine neurons change their response during classical conditioning, where rewarding or aversive stimuli are paired with conditioned cues.

To address these knowledge gaps, we developed an approach for imaging individual dopamine axons based on in vivo two-photon imaging with a microprism (Low et al., 2014). We optimized the microprism design and imaged dopamine axon terminals expressing genetically encoded calcium sensors in the mouse medial PFC (mPFC). We then head-fixed the mice to give rewards or aversive stimuli (water drops or electrical shocks) and trained the mice to associate the stimuli with preceding auditory cues (classical conditioning). During classical conditioning, we tracked the activity of dopamine axons over a period of days. We found that the dopamine axons showed diverse preferences for unconditioned (rewarding or aversive) stimuli. Through the classical conditioning, activity preferences for conditioned auditory cues were enhanced only for aversive-preferring axons. Moreover, in aversive-preferring axons, a machine learning-based analysis revealed that cue activity became more selective when the behavior of animals was judged as correct. We conclude that mesocortical dopamine axon activity is involved in aversive-related processing that is modulated by both classical conditioning across days and trial-by-trial judgements of conditioned cues within a day.

Results

Two-photon imaging shows dopaminergic axons in the mPFC of awake mice

To investigate the signal sent by dopamine neurons to the mPFC in mice, we developed an approach based on two-photon imaging using a microprism (Andermann et al., 2013; Low et al., 2014). We first expressed axon-jGCaMP8m, an axon-targeted (Broussard et al., 2018) genetically encoded calcium sensor (Zhang et al., 2023), in dopamine neurons in the VTA. We injected Cre-dependent AAV into the midbrain regions of transgenic mice (DAT-Cre), which express Cre-recombinase in dopamine neurons (Kim et al., 2016; Figure 1A, see Materials and methods). After 2–3 weeks, using sectioned slices, we confirmed that GCaMP expression in cell bodies in the VTA (and substantia nigra pars compacta [SNc]) (Figure 1C) coincides with the expression of tyrosine hydroxylase, an endogenous marker for dopamine neurons (Figure 1D). Dopamine neurons in the VTA are known to project sparsely to the mPFC, including the superficial layers (Vander Weele et al., 2018; Figure 1—figure supplement 1), but the mPFC itself is located deep in the medial bank (Figure 1B), rendering two-photon imaging of GCaMP (which is typically excited at 920–980 nm) infeasible. Therefore, we inserted a microprism into the longitudinal fissure between the two medial banks (two hemispheres) to optically access the mPFC (Figure 1E–F). The right-angle microprism bends the optical axis within the brain, providing optical access to the fissure wall and the mPFC surface (Low et al., 2014). We optimized the microprism assembly (Figure 1—figure supplement 2C) in order to reach up to 2 mm in depth from the dorsal surface (Figure 1F). The assembly incorporated double-layer glass at the top (Komiyama et al., 2010), stabilizing the brain from both the medial and dorsal sides, which significantly reduced the movement of the brain (Figure 1—figure supplement 2). Through the microprism, we could visualize GCaMP-expressing axons in the superficial layers of the mPFC in live animals (at a depth of 30–100 μm, Figure 1G, Figure 1—figure supplement 3). Sparse axons in the superficial layers are advantageous for two-photon imaging, achieving low background noise. In contrast, axons in the deep layers, which are known to be denser in sectioned slices (Figure 1—figure supplement 3), could not be visualized under our experiments. The GCaMP signal can indicate the calcium influx into axons and terminals, which is triggered by axonal action potentials (Petreanu et al., 2012; Howe and Dombeck, 2016; Lutas et al., 2019), thereby providing a measure of the activity of dopamine neurons that send projections to the mPFC. In contrast, when we inserted a gradient refractive index (GRIN) lens into the mPFC (Kamigaki and Dan, 2017), we could not reliably visualize GCaMP-expressing dopamine axons, unlike the case for dopamine axons in the basal amygdala (Lutas et al., 2019). This difference might indicate that the dopamine axons in the mPFC have weaker signals requiring a lens with a larger numerical aperture (GRIN lens: NA 0.5 vs. Nikon objective lens: NA 0.8) or that these axons are less resilient to mechanical damage in close vicinity.

Figure 1 with 3 supplements see all

Download asset Open asset

Two-photon imaging of dopaminergic axons projecting to the mPFC.

(**A, B**) Experimental design. The activity of midbrain dopamine neurons projecting to the mPFC was measured by two-photon calcium imaging of their axons. The axons were accessed through a microprism that bends the optical axis inside the brain (black arrows in B). (C) GCaMP was expressed virally in dopamine neurons in DAT-Cre transgenic mice. A coronal section shows GCaMP expression in the VTA, demonstrating that AAV-axon-DIO-jGCaMP8m was injected into the VTA. (D) jGCaMP8m-expressing neurons were positive for tyrosine hydroxylase (TH), a marker for dopamine neurons. (**E, F**) Dorsal view of a mouse head implanted with a microprism assembly. The microprism was 1x2 mm. (G) An example in vivo image of jGCaMP8m-expressing axons.

Dopaminergic axons in the mPFC have diverse responses to rewarding and aversive stimuli

Using our imaging approach, we first investigated whether individual dopamine axons respond to unexpected rewards (Schultz et al., 1997) and unexpected aversive stimuli (Vander Weele et al., 2018). As a reward, we delivered drops of water through a spout with random timing to water-deprived mice (Figure 2A). In response to the reward delivery, the mice licked the water spout, and we filmed this behavior to quantify the tongue position (Figure 2B and Figure 2—figure supplement 1). Upon the delivery of a water drop, the mice started licking (licking latency: 0.538±0.065 s, n=8 animals; Otis et al., 2017). Two-photon calcium imaging revealed that the water reward evoked brief calcium transients in many dopamine axons (40.1% of axons in eight animals, example in Figure 2I). The brief calcium response to the reward is consistent with increased phasic firing in dopamine neurons at the time of unexpected reward, as previously reported in many studies in primates (Schultz et al., 1997) and rodents (Engelhard et al., 2019; Amo et al., 2022).

Figure 2 with 4 supplements see all

Download asset Open asset

Dopaminergic axonal response to unexpected rewarding or aversive stimuli.

(A) Experimental design. Mice were placed under a two-photon microscope on a linear treadmill and were given unexpected rewarding (water drops) or aversive (electrical shock to the tail) stimuli. The mouse’s face was filmed with an infrared camera to track the tongue position. (B) Example behavioral response to rewarding (top, tongue position) or aversive (bottom, treadmill speed) stimuli. (C) Average behavioral response on a single day for the same animal shown in B. (D) Representative image for dopamine axons labeled with jGCaMP8m. Scale bar: 50 μm. (**E, F**) Heatmaps of rewarding (E) or aversive (F) stimuli for the same imaging plane shown in D. (G–I) Calcium response for rewarding (left) and aversive (right) stimuli of three example axons. (J) Comparison between reward (x-axis) and aversive (y-axis) responses for dopamine axons (n=162). Statistically significant axons were labeled in either cyan (reward-preferring axons, n=25) or magenta (aversive-preferring axons, n=75). Vector averages representing reward-preferring axons and aversive-preferring axons are depicted as cyan and magenta arrows. (K) Histogram of the polar angle of the scatter plot in J (n=162). The solid line indicates probability density, estimated by kernel smoothing. A value of 0° represents axons that solely prefer rewards, whereas 90° represents those that solely prefer aversive stimuli.

In contrast to conventional midbrain dopamine neurons, mPFC dopamine axons are proposed to play a key role in aversive processing (Weele et al., 2019). To investigate the calcium response to an unexpected aversive stimulus, we delivered mild electrical shocks to the tail of the mice (Kim et al., 2016; Patriarchi et al., 2018; Lutas et al., 2019; Figure 2A) that were randomly interleaved with reward delivery (one shock for every seven rewards on average). The mild shock evoked calcium transients in many dopamine projections (Figure 2F, G and H), together with locomotion (Figure 2B). These transients could simply reflect locomotion initiation, similar to dopamine axons in the dorsal striatum (Howe and Dombeck, 2016). To explore this possibility, we investigated whether locomotion without aversive stimuli is accompanied by increased calcium activity. We found no significant calcium increase at the time point of spontaneous locomotion initiation (Figure 2—figure supplement 2). Therefore, unlike the axons projecting from the SNc to the dorsal striatum (Howe and Dombeck, 2016; Ma et al., 2022), mPFC dopamine axons do not encode the initiation of movement; rather, these axons respond to the aversive stimulus.

Some previous studies have demonstrated that the overall dopamine release at the mPFC or the summed activity of mPFC dopamine axons exhibits a strong response to aversive stimuli (e.g. tail shock), but little to rewards (Kim et al., 2016). We evaluated the preference of individual axons for rewarding and aversive signals at a single-axon resolution by computing the polar angle for individual axons on a Cartesian representation of reward and shock activity (Figure 2J and K, Figure 2—figure supplement 3). In the polar representation, an angle of 0° indicates a strong preference for reward information, whereas 90° indicates a preference for aversive information. The polar angle distribution revealed that a significant number of axons preferred aversive stimuli, although some preferred reward. As a result, probability density, estimated by kernel smoothing, showed a bimodal distribution (Figure 2K, solid line) with a trough at around 45–50°. In addition, axons showing significant responses were categorized into two clusters based on k-means clustering (Figure 2J), the separation of which coincided roughly with 45–50° (Figure 2K). Thereafter, we refer to these clusters as aversive- or reward-preferring axons (colored magenta or cyan, respectively). These clusters do not respond exclusively to one hedonic valence (rewarding or aversive stimuli), as evident from the broad angle distributions. We could not find any anatomical patterns for aversive- or reward-preferring axons. These axons were present in either half of the prism view (i.e. anterior or posterior; ventral or dorsal), implying no obvious functional projection patterns within the mPFC. We note that the strength of preference could be quantitatively changed. Indeed, we found that the reward response to 10 μL nearly reached saturation, but the aversive response could be further increased at a stronger current (Figure 2—figure supplement 4). Therefore, the exclusive preference for aversive stimuli observed in some studies might possibly be explained by a smaller reward volume and/or stronger aversive stimulus. It may also be possible that the relative frequency of the aversive stimulus could influence the aversive preference. Moreover, measured signals in these studies may arise from deep layers and be different from the superficial axons that we image. Altogether, our two-photon imaging revealed, for the first time, that individual axons in the superficial layers show diverse preferences for rewarding and aversive stimuli.

Aversive cue processing is enhanced in aversive-preferring axons during classical conditioning

How do the reward and aversive activities of individual axons change while animals are learning that the reward and aversive events are preceded and predicted by sensory cues? This paradigm, known as classical conditioning, is a key framework for capturing learning-related changes in midbrain dopamine neurons (Sutton and Barto, 1981; Schultz et al., 1997) and mPFC neurons (Takehara-Nishiuchi and McNaughton, 2008; Otis et al., 2017). We presented mice with a 2 s pure tone as a conditioned stimulus (CS_reward and CS_aversive: 9 and 13 kHz, or 13 and 9 kHz), and then, after a 1 s delay, we presented either a rewarding or an aversive unconditioned stimulus (Figure 3A). Previous work has shown that mice are able to discriminate between two tones that differ by more than 7% (de Hoz and Nelken, 2014). Indeed the mice learned the contingency between the conditioned stimulus (tone) and the outcome (reward or electrical shock), which was reflected in changes in their behavior throughout this conditioning process (Figure 3B and C). To quantify such behavioral changes during learning, we separated the learning into three phases in addition to the first day (Figure 3D–G, for six phases, Figure 3—figure supplement 1). On the first day, the animals licked the water spout only after the reward was delivered (first day; Figure 3B). However, during the middle and late phases, animals gradually showed licking behavior even before the reward delivery, representing an anticipation of reward (Figure 3D). We observed this anticipatory licking more frequently after CS_reward than CS_aversive (Figure 3D vs. 3E, p=0.031 for the middle phase, p=0.031 for the late phase, Wilcoxon signed-rank test, n=6 animals), indicating that the animals behaviorally learned to discriminate the two conditioned stimulus tones. Similarly, running before the delivery of the unconditioned stimulus was more frequent after CS_aversive than CS_reward at the late phase (Figure 3F vs. 3 G, p=0.031, Wilcoxon signed-rank test, n=6 animals), again indicating that the two conditioned auditory cues were behaviorally discriminated. Therefore, as in previous studies, anticipatory licking (Otis et al., 2017) and anticipatory running (Lutas et al., 2019) can capture whether animals behaviorally discriminate conditioned cues in classical conditioning.

Figure 3 with 4 supplements see all

Download asset Open asset

Classical conditioning induced behavioral and neural changes.

(A) Experimental design. Auditory cues were presented before unconditioned stimuli (rewarding or aversive stimuli). (**B, C**) Behavioral changes in one example animal across 12 days (B: licking, C: running). In the reward condition, the animal gradually developed anticipatory licking (B, left). In the aversive condition, the animal usually ran after the shock delivery, but sometimes even before the delivery (C, right). Licking and running traces were normalized to the instantaneous maximum values for this animal and then averaged over a single day. (**D–G**) Anticipatory behavior during classical learning (n=6 animals). In the late phase of learning, anticipatory licking was primarily observed in the reward condition (D) but not in the aversive condition (E). Anticipatory running was seen more often in the aversive condition (G) than in the reward condition (F). (H) Activity change in one example axon across 12 days. The axon was from the same animal shown in B, C. (**I, J**) Learning induced changes in response to conditioned cues (I) and unconditioned stimuli (J) for aversive-preferring axons (magenta) and reward-preferring axons (cyan) together with non-significant axons (black). Aversive- and reward-preferring axons were defined before the start of classical training. The x-axis represents the reward condition, and the y-axis represents the aversive condition. n=47 for aversive-preferring axons, n=12 for reward-preferring axons. Vector averages representing aversive-preferring axons and reward-preferring axons are depicted as magenta and cyan arrows, overlaid in each panel. (K) Learning induced a change in the amplitude of unconditioned response of aversive-preferring axons for the aversive condition (magenta solid line) and reward condition (magenta dotted line) and that of reward-preferring axons for reward condition (cyan solid line) and aversive condition (cyan dotted line). Number of axons is the same as I, J. (L) The polar angle of the scatter plot in J. The magenta line represents aversive-preferring axons and the cyan line represents reward-preferring axons. (M) Similar to K, but for the conditioned response. (N) Similar to L, but for the conditioned response.

Through the classical conditioning paradigm, our long-term two-photon imaging revealed that aversive-preferring dopamine axons maintained their preference for the unconditioned response but enhanced their selectivity for the aversive cue activity (Figure 3H). We evaluated the activity change at the time of the unconditioned stimuli (US) throughout the learning process for aversive- and reward-preferring axons (Figure 3J, magenta and cyan, respectively). On the first day, aversive-preferring dopamine axons showed stronger activity for the aversive stimuli (Figure 3J, top, Figure 3—figure supplement 2A), similar to the response without classical conditioning (Figure 2J). Across learning, the activity for the rewarding and aversive unconditioned stimuli gradually decreased (Figure 3K, magenta, p=0.002 for rewarding stimuli, p=0.007 for aversive stimuli, n=47; Wilcoxon signed-rank test, comparison between the first day and the last phase), maintaining similar preferences for rewarding and aversive stimuli (Figure 3L, magenta, p=0.24, n=47; circular statistics, comparison between the first day and the last phase). Similarly, reward-preferring axons maintained their preferences over the course of classical conditioning (Figure 3L, cyan, p=0.77, n=12; circular statistics, Figure 2—figure supplement 2B).

Next, we quantified the activity change at the time of the conditioned auditory cues (CS_reward and CS_aversive, Figure 3I) throughout the learning process. On the first day, aversive-preferring axons already showed a transient response to conditioned cues, implying that the conditioned stimulus response was not acquired through learning (first day in Figure 3H and M, and Figure 3—figure supplement 2). In addition, the conditioned stimulus response showed no particular reward/aversive preference (Figure 3N, first day, for aversive-preferring axons, p=0.66, n=47), indicating that aversive-preferring axons did not distinguish the two conditioned cues. However, at the later phases of learning, the conditioned stimulus response was enhanced for CS_aversive in aversive-preferring axons (Figure 3M, magenta, p<0.0001, n=47; Wilcoxon signed-rank test, comparison between the first day and the last phase) and slightly attenuated for CS_reward (p<0.001), resulting in a stronger preference for aversive processing (late phase in Figure 3N, magenta, p<0.001). In contrast, for reward-preferring axons, the conditioned stimulus response increased both for CS_aversive (Figure 3M, dotted cyan line, non-significantly, p=0.09, n=12, Wilcoxon signed-rank test) and for CS_reward (solid cyan line, significantly, p<0.007), resulting in an unchanged preference (Figure 3N, cyan, p=0.77). Consistently, these changes in CS and US responses across the training phases were evident in the population-averaged calcium response (Figure 3—figure supplement 2).

We also tested whether the dopamine axons showed suppressed activity when the predicted reward was omitted, one of the major features of reward prediction error coding (Schultz et al., 1997; Engelhard et al., 2019; Amo et al., 2022). Such activity suppression has been detected with GCaMP6m at cell bodies of dopamine neurons (Engelhard et al., 2019) as reduced signal at 0–4 s after the delivery of reward. Therefore, we included one condition for an unexpected reward omission on the last day of the late phase of the classical conditioning (Figure 3—figure supplement 3). We found that upon the reward omission, the reward-preferring dopamine axons did not show activity suppression, indicating that the mPFC dopamine axons do not respond to reward omission.

Taken together, our two-photon imaging revealed that a minority of mPFC dopamine axons prefer reward activity (reward-preferring axons), and that these axons are not involved in reward prediction error in a classical learning paradigm. In contrast, the majority of dopamine axons are strongly involved in aversive-related processing (aversive-preferring axons), and the preference for the aversive cue is enhanced through classical conditioning.

Dopamine axons show enhanced selectivity of cue activity in trials with correct discrimination

In the classical conditioning paradigm, an enhanced preference of aversive-preferring dopamine axons for aversive cues (Figure 3N) was accompanied by improved behavioral discrimination of the two conditioned cues (Figure 3D–G). Based on this finding, can correct cue discrimination be accompanied by an enhanced neural preference when animals make trial-by-trial judgements in discriminating cues even after conditioning?

To investigate trial-by-trial judgements of conditioned cues, we classified the trials into four groups (Figure 4A) based on correct or incorrect discriminating behavior. First, we focused on the presence or absence of anticipatory licking, as the licking behavior can discriminate the two conditioned stimulus tones, particularly at the late phase of learning (Figure 4—figure supplement 1, based on the random forest classifier). The first group exhibited licking after CS_reward (correct reward discrimination), the second group exhibited no licking after CS_reward (incorrect reward discrimination), the third group displayed no licking after CS_aversive (correct aversive discrimination), and the fourth group displayed licking after CS_aversive (incorrect aversive discrimination). The classification is invalid when animals make random guesses (discrimination of 50%), so we focused on results from the late phase of learning (or the middle phase if there were no errors in anticipatory licking in the late phase).

Figure 4 with 2 supplements see all

Download asset Open asset

Axonal cue response in trials with correct or incorrect cue discrimination.

(A) Classification of trials based on the behavioral response that occurred between the conditioned stimulus onset and unconditioned stimulus onset. Such behaviors include anticipatory licking or facial expressions. (B) Comparison of cue response between correct (x-axis) and incorrect (y-axis) trials based on anticipatory licking (magenta, aversive-preferring axons, n=44; cyan, reward-preferring axons, n=12). The left panel shows the reward cue response, and the right panel shows the aversive cue response. (C) Preference for reward or aversive predictive cues in correct (left panel) and incorrect (right) discrimination. Magenta represents aversive-preferring axons (n=44) and cyan represents reward-preferring axons (n=12). Vector averages representing aversive-preferring axons and reward-preferring axons are depicted as magenta and cyan arrows. Note stronger preference for aversive cue as a population in correct discrimination. (D) Similar to B, but based on facial expressions (magenta, aversive-preferring axons, n=47; cyan, reward-preferring axons, n=12) (E) Similar to C, but based on facial expressions (magenta, aversive-preferring axons, n=47; cyan, reward-preferring axons, n=12). As in C, preference for the aversive cue is stronger in correct discrimination.

Can the axonal response to conditioned cues be impacted by whether animals discriminate the cues correctly or incorrectly? An incorrect discrimination of the aversive cue is accompanied by the presence of anticipatory licking, resulting in error trials in our machine learning-based analysis. Such error trials (Figure 4A, fourth group) occurred in 1.6% of cases, showing a weaker aversive cue response than correct trials (third group; p<0.0001, n=44 magenta axons, Wilcoxon signed-rank test, Figure 4B, right). In contrast, the absence of anticipatory licking despite the reward-predictive cue comprises another type of error (second group, 49.0%). In such error trials, the reward cue response was not significantly different from that in the correct trials (first group; p=0.26 n=44 axons, Wilcoxon signed-rank test, Figure 4B, left). Overall, the reward/aversive preference was stronger in correct discrimination trials than in incorrect trials (left vs. right in Figure 4C, magenta, 82.2°±1.1° vs. 70.0°±7.6°, p=0.049, circular statistics).

In addition to anticipatory licking, the discrimination of predictive cues can be inferred by the facial expressions of mice. Facial expressions of mice can capture emotional states (Dolensek et al., 2020), and have been used to make binary judgements of the presence or absence of pain with the application of a deep neural network (Tuttle et al., 2018). In this study, we combined a pretrained deep neural network (ResNet3D) (Tran et al., 2018) and a machine learning classifier (random forest classifier; Breiman, 2001) to make binary judgements of whether the animals experienced reward or aversive conditions, based on facial expressions during the cue presentation (Figure 4—figure supplement 2). The percentage of errors in discrimination during the 2 s cue presentation was 16.0% ± 1.4% (n=6 animals), comparable to the result for anticipatory licking during the cue plus delay periods (25.3% ± 6.1%). However, discrimination based on facial expressions resulted in a higher number of error trials in aversive conditions than discrimination based on licking (14.1% vs 1.6%), and a lower number in reward conditions (18.0% vs 49.0%). This discrepancy might be explained either by temporal discrepancy between the cue period (facial expression) and the delay period (most cases of anticipatory licking) or by the fact that anticipatory licking might represent reward uncertainty rather than reward expectation (Ogawa et al., 2013).

Correct cue-discrimination based on facial expressions analysis also revealed sharper selectivity for reward or aversive cues. A trial-by-trial error analysis revealed that the axonal activity to CS_aversive was enhanced in the correct trials in the aversive-preferring axons (Figure 4D, right, p=0.023, n=47 axons, Wilcoxon signed-rank test), consistent with the analysis based on anticipatory licking (Figure 4B, right). In addition, the response to CS_reward was significantly weaker in the correct trials (Figure 4D, left, p=0.019, n=47 axons, Wilcoxon signed-rank test). As a result, the reward/aversive preference was stronger in correct discrimination trials than in incorrect trials for the aversive-preferring axons (left vs. right in Figure 4E, magenta, 84.7°±3.0° vs. 81.7°±4.0°, p=0.019, circular statistics).

In contrast to the aversive-preferring axons, correct discrimination had no effect on the CS activity of the reward-preferring axons. We found that the response to CS_reward and CS_aversive was not significantly different between correct and incorrect judgement trials (cyan points in Figure 4B and D, anticipatory licking: CS_aversive p=0.052, facial expressions: CS_rewad p=0.08, CS_aversive p=0.20, n=12, Wilcoxon signed-rank test) except for the CS_reward response based on anticipatory licking (p=0.016, n=12). As a result, selectivity for CS_reward/CS_aversive was not improved in correct trials (anticipatory licking: p=0.15, facial expressions: p=0.39, cyan points in Figure 4C and E). Therefore, correct/incorrect discrimination impacts aversive- and reward-preferring axons differentially.

Altogether, when animals exhibited the correct behavioral response (either anticipatory licking or facial expression), aversive-preferring but not reward-preferring axons showed a higher selectivity for aversive cue processing (Figure 4C and E).

Discussion

Dopamine projections to the mPFC are considered one of the key neuromodulators that enable flexibility in neural processing of the mPFC. However, due to technical difficulties in recording the dopamine neurons of specific projections, little is known about the signals conveyed by mPFC projections, including the basic question of whether individual projections signal reward or aversive information. In this study, we optimized a two-photon imaging approach based on a microprism to image the calcium activity of dopaminergic axons in the mPFC. We uncovered differences in reward/aversive preferences in individual dopamine axons with an overall preference for aversive stimuli. In addition, we demonstrated that aversive-preferring axons responded equally to reward- and aversive-predictive conditioned cues in classical conditioning on the first day; however, this response became strongly biased toward the aversive conditioned cue through the conditioning. Finally, based on a trial-by-trial analysis of the animals’ behavior following reward- or aversive-predictive cues, we found that aversive-preferring axons exhibited higher selectivity for cues when the cues were successfully discriminated behaviorally.

Our study revealed functional diversity in mPFC-projecting dopamine axons by addressing a long-standing question of whether dopamine neurons send reward- or aversion-related signals to the mPFC (Weele et al., 2019; Verharen et al., 2020). The activity of mPFC-projecting dopamine neurons can be investigated extracellularly by incorporating antidromic stimulation (Mantz et al., 1989), but this approach is laborious. Thus, many studies have used more technically feasible but less direct approaches, particularly for awake animals, such as measuring dopamine release with microdialysis (Abercrombie et al., 1989; Bassareo et al., 2002), measuring catecholamine release with fast-scan cyclic voltammetry recently combined with optogenetic and pharmacological identification (Vander Weele et al., 2018), and measuring bulk calcium activity from dopamine axons with fiber photometry (Kim et al., 2016; Ellwood et al., 2017). These studies have led to somewhat inconsistent conclusions: some studies have reported reward signals whereas others have reported aversive signals. Reconciling these findings is challenging, as different studies have used different approaches to assess the effects of either a rewarding or an aversive stimulus, but not both. Our two-photon imaging approach provided a unique opportunity to compare rewarding and aversive signals of individual projection neurons (i.e. individual axon projections). Our comparison revealed diversity in the dopamine axons, and that many dopamine axons responded to both rewarding and aversive stimuli, with a strong bias for aversive stimuli at the population level. However, this population bias was not fixed; rather, the bias depended on both the reward volume and the intensity of the aversive stimulus. In addition, the bias might also rely on of the frequency of rewarding and aversive events, which our study could not address; we presented aversive stimuli less frequently to keep mice engaged. All these parameters may collectively explain why some studies have reported a strong response to aversive stimuli but little response to rewards. Our study revealed functional diversity in dopamine axons in the superficial layers, but did not address whether this diversity could also be found in axons in the deep layers. It is not clear whether single axons have branches in both the superficial and deep layers, even in anatomical studies that used single-cell tracing of dopamine neurons (Matsuda et al., 2009; Aransay et al., 2015). Further investigation into these layers may reveal a richer functional diversity in dopamine axons in the mPFC.

The firing of mPFC-projecting dopamine neurons cannot be simply explained by value coding performed by conventional midbrain dopamine neurons; their firing rates increase in response to opposite hedonic valences, rewarding and aversive stimuli, exhibiting diverse preference. Their firing might be captured by salience coding, which includes motivational salience signals and alerting signals (Bromberg-Martin et al., 2010). Similarly, salience coding may explain the response of aversive-preferring axons to aversive-predictive cues. On the first day of classical conditioning, when the animals had not yet established a link between conditioned and unconditioned stimuli, the aversive-preferring axons showed a transient activity increase to two types of conditioned cues with no bias, implying that activity serves as an alerting signal (Vander Weele et al., 2018). After days of training, the activity became strongly biased toward the aversive cue, indicating that the activity might additionally encode a motivational salience signal (Bromberg-Martin et al., 2010; Lee et al., 2021). In our study, motivational salience might appear to play a significant role in the processing of the aversive predictive cue. Specifically, in aversive-preferring axons, the activity associated with the aversive cue exhibits distinct trends within daily sessions compared to those of the reward cue and unconditioned cue activities (magenta points in Figure 3—figure supplement 4). The response of the aversive-preferring axons may be useful for the recipient mPFC to allocate its resources to the most salient outcomes and their predictors, a proposed role of the mPFC (Desimone and Duncan, 1995; Miller and Cohen, 2001; Ridderinkhof et al., 2004; Isoda and Hikosaka, 2007; Nee et al., 2011; Bissonette et al., 2013; Sharpe et al., 2019). The response of reward-preferring axons also does not follow value-coding, considering the absence of reward omission suppression. It is an open question whether saliency coding plays more important roles in aversive processing than in reward processing. To clarify the detailed nature of the saliency coding, together with the functional diversity of mPFC dopamine axons, further study is necessary. Such a study should include different types of unconditioned stimuli, systematically vary the physical features of conditioned and unconditioned stimuli, and separate the motivational salience and alerting signals with different task designs.

Consistent with salience coding without hedonic valences, phasic optogenetic stimulation of dopamine axons in the mPFC does not reinforce or suppress any behavioral actions (Popescu et al., 2016; Ellwood et al., 2017; Vander Weele et al., 2018) (but see [Gunaydin et al., 2014]). Instead, optogenetic stimulation can increase the signal-to-noise ratio of aversive processing in mPFC neurons for competitive situations in which reward and appetitive cues are simultaneously presented (Vander Weele et al., 2018). Our results are consistent with the recent view that dopamine at the mPFC gates sensory inputs for aversive processing (Ott and Nieder, 2019; Weele et al., 2019).

Using aversive classical conditioning, we revealed that aversive learning can induce activity changes in dopamine axons. The classical conditioning is a form of aversive learning distinct from instrumental aversive learning including punishment and active avoidance (Jean-Richard-Dit-Bressel et al., 2018). Although all types of aversive learning are processed in the mPFC and dopamine systems, each type may be expected to include distinct neural circuits. Further research is necessary to reveal the detailed processing of aversive learning in the mPFC and dopamine projections.

Our study provides new insights into the functional diversity of dopamine neurons that constitute mesocortical pathways. Previous studies employing fiber photometry imaging have identified functional diversity among dopamine neurons with distinct projection pathways; dopamine axons in the ventral nucleus accumbens medial shell (de Jong et al., 2019; Yuan et al., 2019), in the tail of the striatum (Menegas et al., 2017), and in the basal amygdala (Lutas et al., 2019) do not show activity that matches reward prediction error coding, but instead show increased activity for aversive stimuli (Verharen et al., 2020). Our two-photon imaging results demonstrate that even the same projection-defined dopamine neurons can be inhomogeneous, with some preferring aversive signals and others preferring reward signals. The aversive response found in some dopamine pathways, including mesocortical dopamine projections, might be linked to glutamate co-release, as vesicular glutamate transporter 2 (Slc17a6) genes are expressed in dopamine neurons projecting to the ventral nucleus accumbens medial shell, the tail of the striatum, and the mPFC (Poulin et al., 2018), and as AMPA-receptor-mediated excitatory postsynaptic currents have been confirmed upon the stimulation of dopamine axon terminals in the basal amygdala (Lutas et al., 2019). These glutamate co-releasing dopamine axons might even be collaterals of the same dopamine neurons; a single-cell tracing study showed examples of dopamine neurons with their axon collaterals in the PFC and the basal amygdala or those in the PFC and nucleus accumbens shell (Aransay et al., 2015). Meanwhile, dopamine axons projecting to the PFC are not just from Slc17a6-expressing neurons but also from Slc17a6-negative neurons (Poulin et al., 2018). One possible scenario is that Slc17a6 may be expressed in aversive-preferring axons in the mPFC but not in reward-preferring axons. This scenario might be in line with a recent fibermetry study on genetic features of dopamine neurons in the SNc, where Slc17a6⁺dopamine neurons showed a strong aversive and weak reward response as a population, whereas Slc17a6^-/Calb1⁺ dopamine neurons showed a weak aversive and strong reward response (Azcorra et al., 2023). As of now, it is not clear how functional diversity within the same mesocortical pathway is linked to molecular diversity. Clarifying such a link will further advance our understanding of distinct dopamine subsystems and may shed light on how dopamine subsystems are dysregulated in prefrontal psychiatric diseases.

Materials and methods

Key resources table

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information
Antibody	Anti-GFP (Rabbit Polyclonal)	Thermo Fisher Scientific	Cat# A-11122, RRID:AB_221569	IF (1:1000)
Antibody	Anti-Tyrosine Hydroxylase (Sheep Polyclonal)	Abcam	Cat# ab113, RRID:AB_297905	IF (1:200)
Antibody	Anti-Rabbit IgG (H+L) Antibody, Alexa Fluor 488 Conjugated (Donkey Polyclonal)	Thermo Fisher Scientific	Cat# A-21206, RRID:AB_2535792	IF (1:500)
Antibody	Anti-Sheep IgG (H+L) Antibody, Alexa Fluor 568 Conjugated (Donkey Polyclonal)	Thermo Fisher Scientific	Cat# A-21099, RRID:AB_2535753	IF (1:500)
Recombinant DNA reagent	hSynapsin1-FLEx-axon-jGCaMP8m (plasmid)	This paper	Addgene #216533	Described at ‘Headplate implant and virus injection’ of ‘Surgery’ section
Strain, strain background (AAV)	AAV2/1-hSynapsin1-FLEx-axon-jGCaMP8m-WPRE-SV40	University of South Carolina Viral Vectors Core
Genetic reagent (mouse)	Mouse: Slc6a3^{tm1.1(cre)Bkmn}	The Jackson Laboratory	JAX: 006660
Chemical compound	Normal Donkey Serum	Sigma-Aldrich	Cat# D9663, RRID:AB_2810235
Software, algorithm	MATLAB	Mathworks	RRID: SCR_001622 https://www.mathworks.com
Software, algorithm	Python	Mathworks	RRID:SCR_008394 https://www.anaconda.com/
Software, algorithm	Suite2p	Patriarchi et al., 2018; MouseLand, 2024	https://github.com/MouseLand/suite2p
Software, algorithm	DeepLabCut	Mathis et al., 2018	https://github.com/DeepLabCut/DeepLabCut; Mathis et al., 2024
Software, algorithm	B-spline Grid, Image and Point based Registration	Dirk-Jan Kroon	https://jp.mathworks.com/matlabcentral/fileexchange/20057-b-spline-grid-image-and-point-based-registration
Software, algorithm	Pytorch, TorchVision	Meta AI	https://pytorch.org/
Other	Mouse Brain Connectivity Atlas	Allen Brain Map	https://connectivity.brain-map.org/	Further details are provided in the caption of Figure 1—figure supplement 1.

Experimental model details

Animals

All experimental procedures were approved by local institutions supervising animal experiments at the Medical University of South Carolina, Monash University, Kagoshima University. Heterozygous dopamine transporter (DAT)-Cre mice (Slc6a3^{tm1.1(cre)Bkmn}, Jackson Laboratory, #006660, crossed with wild-type C57BL/6) was used in this study, including 12 mice for two-photon imaging and 10 for histology. Previous research utilized the same mouse line to express GCaMP6f in dopamine axon terminals in the mPFC that could be detected by one-photon fiber photometry (Kim et al., 2016). Mice of both sexes, aged >8 weeks were included. The mice were maintained in group housing (up to five mice per cage) and experiments were performed during the dark period of a 12 hr light/12 hr dark cycle.

Share this article

Cite this article

Two-photon imaging of dopaminergic axons projecting to the mPFC.

Dopaminergic axonal response to unexpected rewarding or aversive stimuli.

Classical conditioning induced behavioral and neural changes.

Axonal cue response in trials with correct or incorrect cue discrimination.

Author details

Kenta Abe

Contribution

Contributed equally with

Competing interests

Yuki Kambe

Contribution

Contributed equally with

Competing interests

Kei Majima

Contribution

Contributed equally with

Competing interests

Zijing Hu

Contribution

Contributed equally with

Competing interests

Makoto Ohtake

Contribution

Contributed equally with

Competing interests

Ali Momennezhad

Contribution

Competing interests

Hideki Izumi

Contribution

Competing interests

Takuma Tanaka

Contribution

Competing interests

Ashley Matunis

Contribution

Competing interests

Emma Stacy

Contribution

Competing interests

Takahide Itokazu

Contribution

Competing interests

Takashi R Sato

Contribution

For correspondence

Competing interests

Additional information

Tatsuo Sato

Contribution

For correspondence

Competing interests

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism

Further reading