Acetylcholine is released in the basolateral amygdala in response to predictors of reward and enhances the learning of cue-reward contingency
The basolateral amygdala (BLA) is critical for associating initially neutral cues with appetitive and aversive stimuli and receives dense neuromodulatory acetylcholine (ACh) projections. We measured BLA ACh signaling and activity of neurons expressing CaMKIIα (a marker for glutamatergic principal cells) in mice during cue-reward learning using a fluorescent ACh sensor and calcium indicators. We found that ACh levels and nucleus basalis of Meynert (NBM) cholinergic terminal activity in the BLA (NBM-BLA) increased sharply in response to reward-related events and shifted as mice learned the cue-reward contingency. BLA CaMKIIα neuron activity followed reward retrieval and moved to the reward-predictive cue after task acquisition. Optical stimulation of cholinergic NBM-BLA terminal fibers led to a quicker acquisition of the cue-reward contingency. These results indicate BLA ACh signaling carries important information about salient events in cue-reward learning and provides a framework for understanding how ACh signaling contributes to shaping BLA responses to emotional stimuli.
Learning how environmental stimuli predict the availability of food and other natural rewards is critical for survival. The basolateral amygdala (BLA) is a brain area necessary for associating cues with both positive and negative valence outcomes (Baxter and Murray, 2002; Janak and Tye, 2015; LeDoux et al., 1990). A recent study has shown that genetically distinct subsets of BLA principal neurons encode the appetitive and aversive value of stimuli (Kim et al., 2016b). This encoding involves the interplay between principal neurons, interneurons, and incoming terminal fibers, all of which need to be tightly regulated to function efficiently.
The neuromodulator acetylcholine (ACh) is released throughout the brain and can control neuronal activity via a wide range of mechanisms. ACh signals through two families of receptors (nicotinic, nAChRs and muscarinic, mAChRs) that are differentially expressed on BLA neurons as well as their afferents (Picciotto et al., 2012). ACh signals through these receptors to increase signal-to-noise ratios and modify synaptic transmission and plasticity in circuits involved in learning new contingencies (Picciotto et al., 2012), especially in areas that receive dense cholinergic input, like the BLA (Woolf, 1991; Zaborszky et al., 2012). The effect of ACh signaling can differ depending on the receptor, as metabotropic mAChRs work on a slower timescale than the rapid, ionotropic nAChRs (Gu and Yakel, 2011; Picciotto et al., 2012). The overall impact of ACh signaling on the BLA is likely quite heterogeneous as mAChRs are coupled to both inhibitory and excitatory signaling cascades and nAChRs are found on both glutamatergic and GABAergic BLA neurons (Picciotto et al., 2012).
The basal forebrain complex is a primary source of ACh input to the BLA. In particular, the nucleus basalis of Meynert (NBM) sends dense cholinergic projections to the BLA (Woolf, 1991; Zaborszky et al., 2012). Optical stimulation of BLA-projecting cholinergic terminal fibers (NBM-BLA) during fear conditioning is sufficient to strengthen fear memories (Jiang et al., 2016) and may support appetitive behavior (Aitta-aho et al., 2018). Cholinergic NBM neurons increase their firing in response to both rewarding and aversive unconditioned stimuli (Hangya et al., 2015). Cholinergic signaling in the medial prefrontal cortex and visual cortex has been linked to cue detection (Parikh et al., 2007) and reward timing (Chubykin et al., 2013; Liu et al., 2015), respectively. A recent study has also demonstrated that NBM cells fire in response to a conditioned stimulus during trace fear conditioning, indicating that ACh signaling may be involved in learning about cues that predict salient outcomes (Guo et al., 2019).
We hypothesized that ACh signaling in the BLA is a critical neuromodulatory signal that responds to both unconditioned stimuli and cues that gain salience, thereby coordinating activity in circuits necessary for learning cue-reward contingencies. To test this hypothesis, we measured relative levels of BLA ACh (ACh signaling), cholinergic NBM-BLA terminal fiber activity (BLA ACh signal origin), and the activity of BLA principal neurons (BLA output) across all phases of learning in an appetitive operant learning task to evaluate how BLA output and ACh signaling are related to behavioral performance in this paradigm. We then optically stimulated cholinergic NBM fibers locally in the BLA, while mice learned to nose poke in response to an auditory cue to receive a food reward to determine if accelerating the increase in ACh signaling that occurs as mice learn the task would enhance performance. We also pharmacologically blocked different ACh receptors during the learning task to determine the subtypes involved, and varied the timing of optical stimulation of cholinergic NBM-BLA terminal fibers to determine whether ACh release time-locked with the reward-predictive cue is necessary for the improvement of the task performance. These studies provide a novel framework for understanding how NBM ACh signaling in the BLA is recruited during the perception of novel stimuli and how it contributes to linking previously neutral cues to predictions about future salient outcomes.
Acetylcholine release in the BLA occurs at salient points in the cue-reward learning task and shifts as mice learn the cue-reward contingency
The BLA is critical for learning that previously neutral cues can predict future punishments or rewards and for assigning valence to those cues (Baxter and Murray, 2002; Janak and Tye, 2015). The BLA receives dense cholinergic input (Woolf, 1991; Zaborszky et al., 2012) and we speculated that, since ACh signaling is involved in both attention and several types of learning (Picciotto et al., 2012), it could be essential for learning about cues that predict salient events, such as reward delivery. Based on data showing that ACh neurons fire in response to unexpected or salient events (Hangya et al., 2015), we also hypothesized that ACh release might vary as mice learn a cue-reward contingency. Therefore, we designed a cue-reward learning task in which food-restricted mice were trained to perform a nose poke when signaled by a cue (tone) to receive a palatable reward (Ensure) on a 30 s variable intertrial interval (ITI; Figure 1A–D). We injected adeno-associated virus (AAV) carrying an improved version of the fluorescent ACh sensor GRABACh3.0 (ACh3.0; Jing et al., 2018; Jing et al., 2019) construct into the BLA of mice and implanted an optical fiber above the BLA to record ACh signaling during the cue-reward learning task (Figure 2A + Figure 2—figure supplement 1A).
During the Pre-Training phase of the task, mice received reward and receptacle light presentation for performing a nose poke in the active port during tone presentation (Figure 1C, purple active nose poke coincident with tone) but there was no consequence for an incorrect nose poke (Figure 1C, red active nose poke not coincident with tone). Animals quickly learned to make a high number of responses over the course of each Pre-Training session. In this paradigm, mice obtained most available rewards by day 5 of Pre-Training (Figure 2B, blue shaded region). However, this phase of training did not promote learning of the cue-reward contingency, (i.e. that they should only nose poke during tone presentation) seen by the high number of incorrect nose pokes (Figure 2—figure supplement 2A, blue shaded region). Mice performed roughly eightfold more incorrect nose pokes than correct nose pokes, suggesting that mice were not attending to the task contingency. The Training phase of the task was identical to Pre-Training except incorrect nose pokes resulted in a 5 s timeout, during which the house light was illuminated, that concluded with a restarting of the ITI timer (Figure 1D, red active nose poke not coincident with tone). On day 1 of the Training phase, all animals earned fewer rewards (Figure 2B, pink shading) and, while still high, incorrect nose pokes dropped (Figure 2—figure supplement 2A, pink shading). Animals that did not meet acquisition criterion by day 9 (defined as consistently earning 20 or more rewards per session, Figure 2B, white horizontal line) were moved to a 20 s variable ITI to promote responding (Figure 2B, pink shading day 10). Following the change in ITI, mice acquired the cue-reward behavior at different rates. After acquisition, animals were switched to Extinction training in which correct nose pokes did not result in reward delivery, and all mice decreased nose poke responding (Figure 2B + Figure 2—figure supplement 2A, orange shading).
During Pre-Training, when there were high numbers of both correct and incorrect nose pokes, there was a large increase in ACh release following correct nose pokes, which were followed by reward delivery and receptacle light, but not incorrect nose pokes (Figure 2C + Figure 2—figure supplement 1B–C). We used bootstrapped confidence intervals (bCIs) to determine when transients were statistically significant (bCI did not contain the null of 0 [Jean-Richard-Dit-Bressel et al., 2020]). Correct, but not incorrect, nose poke trials consistently showed a sustained, significant increase in fluorescence close to the time of nose poke onset (Figure 2C). We also observed a significant decrease in fluorescence for most mice around 2–4 s after correct nose poke, which corresponds to the time of reward retrieval.
ACh release occurred in response to different events as mice learned the task (data for individual mice are shown in Figure 2D + Figure 2—figure supplement 1D–G and averaged data across all mice at key time points in the task is shown in Figure 2E + Figure 2—figure supplement 1H). During Pre-Training rewarded trials, the highest levels of ACh release occurred close to the time of correct nose pokes (NP), with a smaller peak at the time of reward retrieval (entry into the reward receptacle, Rec). As Training began, the ACh release during reward trials shifted dramatically toward the time of reward retrieval, likely because the animals were learning that many nose pokes did not result in reward delivery. Incorrect nose pokes that triggered a timeout were also followed by a modest but non-significant increase in BLA ACh levels (Figure 2—figure supplement 2B–H). As mice began to learn the contingency (Figure 2E + Figure 2—figure supplement 1H, 10 rewards), the peak ACh release during rewarded trials shifted back to the time of the correct nose poke response. As animals approached the acquisition criterion (Figure 2E + Figure 2—figure supplement 1H, Acq.), ACh level significantly increased at the time of the tone, suggesting that as animals learned the cue-reward contingency, the tone became a more salient event. At this time point, there was still a peak at the time of reward, but its magnitude was diminished. After task acquisition, the increase in ACh following correct nose pokes remained but was diminished, and incorrect nose pokes did not elicit apparent ACh release (Figure 2—figure supplement 2C–H, Acq.). During Extinction, ACh release to tone onset diminished. We replicated this experiment in an independent cohort of mice and found similar results (Figure 2—figure supplements 3–4). Mice in this replicate cohort learned in a similar fashion (Figure 2—figure supplement 3B + 4A) but met the acquisition criteria faster than initial mice because aspects of the behavioral setup were optimized (3D printed wall extensions) to allow the imaging apparatus to be used inside sound attenuating chambers (see Materials and methods section). One difference observed in this group that learned the task more rapidly, was small magnitude, but significant, increases in BLA ACh release following tone onset late in Pre-Training (Figure 2—figure supplement 3C–I). As behavioral performance during the Training phase increased, ACh release to tone onset became more pronounced, as in the initial cohort.
In order to determine the source of the ACh released in the BLA during cue-reward learning, we recorded calcium dynamics as a measure of cell activity of ChAT+ NBM terminal fibers in the BLA (NBM-BLA), since the NBM is a major source of cholinergic input to the BLA (Jiang et al., 2016; Woolf, 1991; Zaborszky et al., 2012). We injected AAV carrying a Cre-recombinase-dependent, genetically-encoded calcium indicator (DIO-GCaMP7s) into the NBM of ChAT-IRES-Cre mice and implanted an optical fiber above the ipsilateral BLA (Figure 2F + Figure 2—figure supplement 5A–C). As with the ACh3.0 sensor, there was a significant increase in NBM-BLA cholinergic terminal activity following correct, but not incorrect, nose pokes (Figure 2H + Figure 2—figure supplement 5D–E). NBM-BLA cholinergic terminal activity evolved across phases of the reward learning task as was seen for ACh levels in the BLA (data for each mouse shown in Figure 2I + Figure 2—figure supplement 5F–G, averaged across all mice at key time points in the task shown in Figure 2J + Figure 2—figure supplement 7G). Strikingly, NBM-BLA cholinergic terminal activity most closely followed correct nose pokes in Pre-Training and shifted primarily to tone onset as mice learned the contingency during Training. As in the replication cohort for the ACh sensor, small magnitude, but significant, increases in terminal activity were observed following tone onset late in Pre-Training (Figure 2J + Figure 2—figure supplement 7G). Incorrect nose pokes that resulted in a timeout in Training sessions were followed by a modest increase in NBM-BLA cholinergic terminal activity before task acquisition (Figure 2—figure supplement 6B–E). During Extinction, activity of NBM-BLA terminals following tone onset diminished. These findings were replicated in an independent cohort of mice, which we combined for across-mouse statistical analyses (Figure 2—figure supplements 7–8).
In order to record NBM-BLA cholinergic terminal activity and BLA ACh levels simultaneously in the same mouse, we injected AAV carrying a construct for Cre-recombinase dependent red-shifted genetically-encoded calcium indicator (DIO-jRCaMP1b) into the NBM of ChAT-IRES-Cre mice, ACh3.0 sensor into the ipsilateral BLA, and implanted a fiber above the BLA (Figure 2—figure supplement 9A–E, mouse 1). DIO-jRCaMP1b was also injected into the NBM of a wild-type littermate so Cre-mediated recombination would not occur to control for any crosstalk between the ACh3.0 and jRCaMP1b channels. While this was only a single animal and proof of principle for future studies, we found that NBM-BLA cholinergic terminal activity coincided with ACh levels (Figure 2—figure supplement 9F–G). Importantly, this relationship between ACh release and NBM-BLA terminal fiber activity was not explained by signal crosstalk (Figure 2—figure supplement 9H–I), further indicating that the BLA ACh measured comes at least in part from the NBM.
BLA principal neurons respond to reward availability and follows cue-reward learning
Glutamatergic principal cells are the primary output neurons of the BLA (Janak and Tye, 2015), and their firing is modulated by NBM-BLA cholinergic signaling (Jiang et al., 2016; Unal et al., 2015). BLA principal neurons can increase their firing in response to cues as animals learn cue-reward contingencies (Sanghera et al., 1979; Schoenbaum et al., 1998; Tye and Janak, 2007). Calcium/calmodulin-dependent protein kinase (CaMKII) has been shown to be a marker for glutamatergic BLA principal cells (Butler et al., 2011; Felix-Ortiz and Tye, 2014; McDonald, 1992; Tye et al., 2011). To determine whether ACh modulates principal neuron activity during cue-reward learning, we injected AAV carrying a Cre-recombinase dependent genetically encoded calcium indicator (DIO-GCaMP6s) into the BLA of CaMKIIα-Cre mice to record BLA principal cell activity during the learning task (Figure 3A + Figure 3—figure supplement 1A). As was seen for BLA ACh levels, there was a significant increase in BLA CaMKIIα cell activity following correct and a modest decrease in activity following incorrect nose pokes on the last day of Pre-Training (Figure 3B). However, the activity peaked later after the correct nose poke response (~2.5 s) compared to the ACh3.0 signal (~0.5 s) and appeared to align more tightly with reward retrieval (Figure 3—figure supplement 1B). As mice learned the task (Figure 3C + Figure 3—figure supplement 2A), BLA CaMKIIα cell activity increased first in response to reward and, after acquisition of the task, to the reward-predictive cue (individual data for each mouse shown in Figure 3D + Figure 3—figure supplement 1E–F, and averaged data across all mice at key time points in task is shown in Figure 3E + Figure 3—figure supplement 1G–H).
During Pre-Training, the highest levels of BLA CaMKIIα cell activity followed reward retrieval. In addition, during the first few days of Training, BLA CaMKIIα cell activity after reward retrieval was higher than it was during Pre-Training, and the magnitude of response decreased as mice learned the contingency and earned more rewards, ultimately reaching similar intensity to that observed during Pre-Training. Concurrently, as mice approached acquisition of the task (Figure 3C, white horizontal line), BLA CaMKIIα cell activity significantly increased in response to tone onset (Figure 3D-E + Figure 3—figure supplement 1E–H, Acq.), suggesting that the recruitment of BLA CaMKIIα cell activity likely reflects the association of the cue with a salient outcome (Lutas et al., 2019; Sengupta et al., 2018). Incorrect nose pokes that triggered a timeout did not elicit a different response in CaMKIIα cell activity compared to before timeouts were incorporated (Figure 3—figure supplement 2B–G). In an independent cohort of mice, those with more posterior fiber tip placements (mice 4 + 7) replicated the primary findings (Figure 3—figure supplements 3–4).
Stimulation of cholinergic terminals in BLA improves cue-reward learning
Since ACh released by NBM-BLA terminals during Training shifted to tone onset during acquisition of cue-reward learning (Figure 2E,J), we hypothesized that ACh may potentiate learning the cue-reward contingency. We, therefore, tested whether increasing ACh release in BLA during learning could alter cue-reward learning by injecting AAV carrying a Cre-recombinase-dependent channelrhodopsin-EYFP (AAV-DIO-ChR2-EYFP) construct bilaterally into the NBM of ChAT-IRES-Cre transgenic mice and placing fibers over the BLAs to optically stimulate cholinergic terminals originating from the NBM selectively (Figure 4A + Figure 4—figure supplement 1). Optical control over ChAT+ NBM cells was verified by ex vivo slice recordings, depolarizations followed light flashes and clear action potentials were observed ex vivo (Figure 4B + Figure 4—figure supplement 2). After operant familiarization, ChAT+ NBM-BLA terminals were stimulated via bilateral optical fibers (2 s, 20 Hz, 25 ms pulses) triggered by a correct nose poke throughout both Pre-Training (Figure 4C) and Training (Figure 4D). Stimulation usually occurred during at least a portion of all three components of a rewarded trial: tone, correct nose poke, and reward retrieval, since these events were often separated by short latencies.
As seen in previous experiments, during the Pre-Training phase animals made a high number of nose poke responses over the course of each session, obtained most available rewards by the last day (Figure 4E + Figure 4—figure supplement 3A, blue shading), and committed a very high number of incorrect nose pokes (Figure 4F + Figure 4—figure supplement 3B, blue shading). There were no differences in rewards earned (main effect of group (EYFP versus ChR2) in a two-way repeated-measures ANOVA, F (1, 9)=1.733, p=0.2205) or incorrect nose pokes (main effect of group (EYFP versus ChR2) in a two-way repeated-measures ANOVA, F (1, 9)=0.002433, p=0.9617) between the EYFP control (n = 5) and ChR2 (n = 6) groups during the Pre-Training phase (Figure 4E-F + Figure 4—figure supplement 3A–B, blue shading), suggesting that increasing BLA ACh signaling was not sufficient to modify behavior during the Pre-Training phase of the task.
On Day 1 of the Training phase, all animals earned fewer rewards (Figure 4E + Figure 4—figure supplement 3A, pink shading) and incorrect nose pokes remained high (Figure 4F + Figure 4—figure supplement 3B, pink shading). As the animals learned that a nose poke occurring outside of the cued period resulted in a timeout, both control EYFP and ChR2 groups learned the contingency and improved their performance, resulting in acquisition of the cue-reward task (20 rewards earned). However, significant group differences emerged, such that ChR2 mice earned significantly more rewards than EYFP controls (Figure 4E + Figure 4—figure supplement 3A, pink shaded; main effect of group (EYFP versus ChR2) in a two-way repeated-measures ANOVA, F (1, 9)=9.434, p=0.0133), and there was a significant Day × Group (EYFP versus ChR2) interaction (two-way repeated-measures ANOVA, F (11, 99)=3.210, p=0.0009). ChR2 mice also made significantly fewer incorrect nose pokes than control mice (Figure 4F + Figure 4—figure supplement 3B, pink shaded; two-way repeated-measures ANOVA, F (1, 9)=12.67, p=0.0061), suggesting that the ChR2 group learned the tone-reward contingency more quickly than the EYFP group. EYFP mice were able to reach the same peak cue-reward performance as the ChR2 group only after 4–6 additional days of training. Once peak performance was achieved, there was no difference in extinction learning between the groups (main effect of group (EYFP versus ChR2) in a two-way repeated-measures ANOVA, F (1, 9)=2.293, p=0.1643). While sex differences in the behavior were not formally tested side by side, an independent cohort of male mice (EYFP n = 7, ChR2 n = 7, Figure 4—figure supplement 4) was tested to determine whether both male and female mice would respond to ACh stimulation, revealing similar trends during Training for rewards earned (Figure 4—figure supplement 3C,E, pink shaded; two-way repeated-measures ANOVA, Group main effect (EYFP versus ChR2): F (1, 12)=3.636, p=0.0808, Day × Group interaction: F (11, 132)=3.033, p=0.0012) and incorrect nose pokes (Figure 4—figure supplement 3D,F, red shaded; two-way repeated-measures ANOVA, Group main effect (EYFP versus ChR2): F (1, 12)=4.925, p=0.0465).
In order to determine if optical stimulation of NBM-BLA cholinergic terminals improved performance in the task by increasing the rewarding value of the outcome, rather than enhancing cue-reward learning by some other means, we allowed mice to nose poke for optical stimulation rather than for Ensure (Figure 4—figure supplement 5A). There were no differences between the EYFP control and ChR2 groups (two-way repeated-measures ANOVA, F (1, 9)=0.6653, p=0.4357). We also tested whether NBM-BLA cholinergic terminal activation was reinforcing on its own by stimulating these terminals in a real-time place preference test. Mice were allowed to explore two similar compartments to determine baseline preference, and NBM-BLA cholinergic terminals were then stimulated in one of the two chambers to determine whether it increased time spent in the simulation-paired chamber. There was no difference between groups (Figure 4—figure supplement 5B, main effect of group (EYFP versus ChR2) in a two-way repeated-measures ANOVA, F (1, 9)=0.1311, p=0.7257) in place preference, confirming that optical activation of NBM-BLA cholinergic terminals is not innately rewarding. Stimulation of NBM-BLA cholinergic terminals also did not lead to changes in nose poke behavior in an uncued progressive ratio task (Figure 4—figure supplement 5C, main effect of group (EYFP versus ChR2) in a two-way repeated-measures ANOVA, F (1, 12)=0.0009814, p=0.975). Locomotor behavior was also not significantly affected by NBM-BLA cholinergic terminal activation (Figure 4—figure supplement 5D, two-way repeated-measures ANOVA, F (1, 9)=0.05804, p=0.8150.) Finally, to determine whether there was any effect of NBM-BLA cholinergic terminal stimulation on preference for, or avoidance of, a stressful environment, mice were tested for changes in time spent in the dark or light side due to laser stimulation in the Light/Dark Box test, and there were no differences between the groups (Figure 4—figure supplement 5E–F, unpaired t-tests, number of crosses: p=0.3223; time in light side: p=0.1565).
Muscarinic, but not nicotinic, receptors are required for acquisition of the cue-reward contingency
ACh signals through multiple receptor subtypes, with rapid, ionotropic signaling mediated through stimulation of nAChRs, and metabotropic signaling mediated through stimulation of mAChRs (Picciotto et al., 2012). To determine which ACh receptors were involved in this cue-reward learning task, mice were injected intraperitoneally with saline (n = 8), mecamylamine (non-competitive nicotinic antagonist, Mec, n = 9), scopolamine (competitive muscarinic antagonist, Scop, n = 8), or a combination of both antagonists (Mec+Scop, n = 9) 30 min prior to Pre-Training and Training, during the same epochs of the task in which optical stimulation was administered (Figure 5A). Like optical stimulation, blockade of ACh receptors during the Pre-Training phase of the task had no effect on rewards earned (Figure 5B + Figure 5—figure supplement 1A, blue shading, main effect of Group (antagonist) in a two-way repeated-measures ANOVA, F (3, 30)=1.285, p=0.2973) or on the large number of incorrect nose pokes (Figure 5C + Figure 5—figure supplement 1B, blue shading, main effect of Group (antagonist) in a two-way repeated-measures ANOVA, F (3, 30)=1.496, p=0.2356). By contrast, blockade of muscarinic signaling abolished the ability of mice to learn the correct cue-reward contingency during the Training period (Figure 5B + Figure 5—figure supplement 1A, pink shading, two-way repeated-measures ANOVA, Antagonist main effect: F (3, 30)=23.13, p<0.0001, Day x Antagonist interaction: F (33, 330)=10.79, p<0.0001), with these mice maintaining high levels of incorrect nose pokes for the duration of Training compared to Saline and Mec treated mice (Figure 5C + Figure 5—figure supplement 1B, pink shading, main effect of Group (antagonist) in a two-way repeated-measures ANOVA, F (3, 30)=25.64, p<0.0001). Saline and Mec groups were not significantly different in any phase of the task, including across Extinction (Figure 5B-C + Figure 5—figure supplement 1A–B, orange shading, main effect of Group (antagonist) in a two-way repeated-measures ANOVA, F (1, 15)=1.201, p=0.2903). We have shown that this dose of mecamylamine delivered i.p. has significant effects in tests of anxiety-like behavior and responses to inescapable stress. In addition, chronic treatment with mecamylamine at this dose is sufficient to decrease BLA c-fos immunoreactivity (Mineur et al., 2007). Consistent with the inability to acquire the cue-reward contingency, mice treated with Scop or Mec+Scop also obtained very few rewards during Extinction (Figure 5B + Figure 5—figure supplement 1A, orange shading). The antagonists had no effect on locomotion as measured by beam breaks (Figure 5—figure supplement 1C, one-way ANOVA, F (3, 30)=0.5074, p=0.6802).
ACh-mediated accelerated cue-reward learning does not require contingent stimulation of ChAT+ NBM terminals in the BLA
Acetylcholine is often thought of as a neuromodulator (Picciotto et al., 2012), and the window for cholinergic effects on synaptic plasticity varies across ACh receptor subtypes (Gu and Yakel, 2011). It is therefore possible that ACh signaling may result in intracellular signaling changes that outlast the cue presentation window. In order to determine if the effect of NBM-BLA stimulation is dependent upon the timing of correct nose poke and laser stimulation contingency, we repeated the experiment in an independent cohort of mice with an additional non-contingent ChR2 group that received the same number of stimulation trains as the contingent ChR2 group, but in which light stimulation was explicitly unpaired with task events (Figure 6A + Figure 6—figure supplement 1). As in the previous experiment, there were no differences between the EYFP control (n = 6) and stimulation groups (contingent-ChR2 n = 5 and non-contingent ChR2 n = 5) during Pre-Training (Figure 6B-C + Figure 6—figure supplement 2A–B, blue shading; main effect of group (EYFP versus contingent-ChR2 versus non-contingent ChR2) two-way repeated-measures ANOVAs; rewards earned: F (2, 13)=0.7008, p=0.5140; incorrect nose pokes: F (2, 13)=0.3906, p=0.6843). However, the non-contingent ChR2 group was not significantly different from the contingent ChR2 group during the training period with respect to number of rewards earned (two-way repeated-measures ANOVA, F (1, 8)=0.09147, p=0.7700) or incorrect nose pokes (two-way repeated-measures ANOVA, F (1, 8)=0.3681, p=0.5609), but both ChR2 groups were significantly better than the EYFP control group (Figure 6B-C + Figure 6—figure supplement 2A–B, pink shading; two-way repeated-measures ANOVAs; rewards earned: Group (EYFP versus contingent-ChR2 versus non-contingent-ChR2) main effect: F (2, 13)=7.254, p=0.0077; Day × Group interaction: F (22, 143)=1.861, p=0.0164. Incorrect nose pokes: Group main effect: F (2, 13)=4.884, p=0.0262). These results demonstrate that ChR2-mediated ACh release does not have to be time-locked to the cue, nose poke, or reward retrieval to improve performance of the task, suggesting that ACh may alter the threshold for neuronal plasticity for cue-reward pairing over a much longer timescale than might be expected based on results from the ACh3.0 recording and NBM-BLA recordings, which could be consistent with the involvement of mAChR signaling in this effect. As in the previous experiment, once all groups reached criterion for acquisition of the cue-reward contingency, there were no differences between any of the groups during Extinction (Figure 6B-C + Figure 6—figure supplement 2A–B, orange shaded; two-way repeated-measures ANOVA, F (2, 13)=0.04229, p=0.9587).
It is increasingly recognized that the BLA is involved in learning to predict both positive and negative outcomes from previously neutral cues (Cador et al., 1989; Janak and Tye, 2015; LeDoux et al., 1990). Cholinergic cells in the basal forebrain complex fire in response to both positive and negative reinforcement (Hangya et al., 2015). The results shown here indicate that ACh signaling in the BLA is intimately involved in cue-reward learning. Endogenous ACh is released in the BLA in response to salient events in the task, and ACh dynamics evolved as the subject formed associations between stimuli and reward. While the pattern of ACh signaling in the BLA may seem reminiscent of how dopamine neurons encode reward prediction errors as measured in other brain areas (Schultz et al., 1997), the current results suggest that ACh release in the BLA may instead be involved in signaling a combination of salience and novelty. ACh release and NBM-BLA activity increased following correct nose poke and, around the time that animals acquired the cue-reward task, following tone onset. However, earlier in Training, incorrect nose pokes that resulted in a timeout were also followed by ACh release, although this was smaller in magnitude. Further, stimulating NBM-BLA cholinergic terminals during learning enhanced behavioral performance, but was not intrinsically rewarding on its own and did not support responding for the tone alone. Although ACh was released in the BLA at discrete points during the task, the effects of heightened BLA ACh signaling were relatively long lasting, since it was not necessary for stimulation to be time-locked to cue presentation or reward retrieval to enhance behavioral performance. Thus, cholinergic inputs from the basal forebrain complex to the BLA are a key component of the circuitry that links salient events to previously neutral stimuli in the environment and uses those neutral cues to predict future rewarded outcomes.
BLA ACh signaling and CaMKIIα cell activity are related to cue-reward learning
We have shown that ACh release in the BLA is coincident with the stimulus that was most salient to the animal at each phase of the task. Use of the fluorescent ACh sensor was essential in determining these dynamics (Jing et al., 2018; Jing et al., 2019). Previous microdialysis studies have shown that ACh is released in response to positive, negative, or surprising stimuli, but this technique is limited by relatively long timescales (min) and cannot be used to determine when cholinergic transients align to given events in an appetitive learning task and how they evolve over time (Sarter and Lustig, 2020). In this cue-reward learning paradigm, when there was no consequence for incorrect nose-poking (Pre-Training phase), animals learned to perform a very high number of nose pokes and received a large number of rewards, and BLA ACh signaling peaked following correct nose pokes. Both the behavioral response (nose poking that was not contingent with the tone) and the ACh response (linked to the correct nose poke) suggest that the animals were not attending to the tone during most of the Pre-Training phase of the task, but rather were attending to the cues associated with reward delivery, such as the reward light or the sound of the pump that delivered the reward. Consistent with this possibility, in the next phase of the task when mice received a timeout for responding if the tone was not presented, performance of all groups dropped dramatically. Interestingly, in animals that had difficulty learning the cue-reward contingency, during early Training sessions ACh release shifted to reward retrieval, likely because this was the most salient aspect of the task when the majority of nose pokes performed did not result in reward. Finally, as mice acquired the contingency between tone and reward availability, the tone also began to elicit ACh release in the BLA, suggesting that mice learned that the tone is a salient event predicting reward availability. Since there are multiple sources of ACh input to the BLA, it was important to determine whether NBM cholinergic neurons were active during the periods when ACh levels were high (Woolf, 1991). Recordings from cholinergic NBM-BLA terminal fibers showed similar dynamics to ACh measurements, suggesting that the NBM is a primary source of ACh across the phases of cue-reward learning.
Perhaps the most well-known example of dynamic responding related to learning cue-reward contingencies and encoding of reward prediction errors is the firing of dopaminergic neurons of the ventral tegmental area (VTA; Schultz, 1998). After sufficient pairings, dopaminergic neurons will fire in response to the cue that predicts the reward, and no longer to the rewarding outcome, which corresponds with behavioral changes that indicate an association has been formed between conditioned stimuli (CS) and unconditioned stimuli (US). It should be noted that dopamine signaling is not unique in this learning-related dynamic response profile. Serotonergic neuronal responses also evolve during reward learning in a manner distinct from dopaminergic neurons (Zhong et al., 2017). Plasticity related to learning has also been observed in cholinergic neurons in the basal forebrain complex during aversive trace conditioning, such that after several training days, neuronal activity spans the delay between CS and US (Guo et al., 2019). Additionally, a recent study suggested that ACh may signal a valence-free reinforcement prediction error (Sturgill et al., 2020). Future studies on the selective inputs to NBM to BLA cholinergic neurons would be of interest to identify the links between brain areas involved in prediction error coding.
We found that BLA CaMKIIα cells were most reliably activated following reward retrieval before contingency acquisition (both when they were receiving several rewards but no timeouts in Pre-Training and few rewards early in Training). Similar to the recording of ACh levels, after acquisition, the tone began to elicit an increase in BLA CaMKIIα cell population activity. However, activity of CaMKIIα neurons differed from ACh signaling in the BLA in important ways. ACh was released in response to the salient events in the task that were best able to predict reward delivery or availability. By contrast, the activity of BLA CaMKIIα neurons was not tightly time-locked to correct nose poking, and instead followed reward retrieval until acquisition, when activity increased in response to tone onset. The divergent dynamics of ACh release and CaMKIIα neuron activity underscores that ACh’s role in the BLA is to modulate, rather than drive, the activity of CaMKIIα neurons, and therefore may alter dynamics of the network through selective engagement of different populations of GABA interneurons (Unal et al., 2015).
Increasing BLA acetylcholine levels enhances cue-reward learning
Neuronal activity and plasticity in the BLA is required for both acquisition of appetitive learning (conditioned reinforcement) and fear conditioning, however the inputs that increase activity in the structure during salient events likely come from many brain areas (McKernan and Shinnick-Gallagher, 1997; Rogan et al., 1997; Tye et al., 2008). In particular, dopaminergic inputs to the BLA are important for acquisition of conditioned reinforcement and for linking the rewarding properties of addictive drugs to cues that predict their availability (Cador et al., 1989). Our results indicate that ACh is a critical neuromodulator upstream of the BLA that is responsive to salient events, such as reward availability, motor actions that elicit reward, and cues that predict reward. We show here that increasing endogenous ACh signaling in the BLA caused mice to perform significantly better than controls in an appetitive cued-learning task. Heightened ACh release during learning of a cue-action-reward contingency led to fewer incorrect responses and increased acquisition rate in both female and male mice. The optical stimulation was triggered by correct nose poke, thus the cholinergic NBM-BLA terminal fiber stimulation overlapped with all three salient events: tone, nose poke, and reward retrieval, since the tone terminated 2 s after correct nose poke. We chose this stimulation pattern, as opposed to linking optical stimulation to tone onset, to ensure stimulation was dependent on behavioral responses. Therefore, stimulation did not precisely recapitulate the ACh release profile observed in mice that had already acquired the task (when ACh increases following tone onset). This suggests that behaviorally-contingent increases in BLA ACh are sufficient to enhance task acquisition (but see below). It is also possible that optogenetic-mediated ACh release may last longer than endogenous, tone-evoked release. A simultaneous stimulation and recording approach would be required to compare ACh release under both conditions (Pisansky et al., 2019). It is important to note that basal forebrain neurons have been demonstrated to co-release ACh and GABA (Ma et al., 2018; Saunders et al., 2015), and cholinergic basal forebrain neurons that project to the BLA have been shown to co-express a glutamate transporter (Ma et al., 2018; Nickerson Poulin et al., 2006). Thus, it is possible that both fiber photometry and optogenetic results could be influenced, in part, by co-release of other neurotransmitters from ChAT-positive neurons. Future studies employing additional fluorescent neurotransmitter sensors (Marvin et al., 2013; Marvin et al., 2018; Marvin et al., 2019) could help understand the interaction between the different signals employed by basal forebrain neurons.
It is possible that ACh improved learning by increasing the intensity of the reward, potentiating the learned association, improving discrimination, or a combination of these phenomena. However, increasing ACh release in the BLA was not inherently rewarding, because it did not support self-stimulation or real-time place preference. This is at odds with a recent study that found stimulation of NBM-BLA cholinergic terminals could induce a type of place-preference and modest self-stimulation (Aitta-aho et al., 2018). Perhaps slight differences in targeting of ChR2 infusion or differences in the behavioral paradigm could be responsible for the lack of direct rewarding effects of optical ChAT terminal stimulation in the current study. Other recent work (Jiang et al., 2016) has demonstrated that stimulating this NBM-BLA cholinergic pathway is sufficient to strengthen cued aversive memory, suggesting that the effect of ACh in the BLA may not be inherently rewarding or punishing but instead potentiates plasticity in the BLA, allowing learning of cue-outcome contingencies. Similarly, it is possible that ACh alters motor activity. However, there were no effects of optical stimulation on locomotion or responding in the inactive nose poke port. In addition, during the Pre-Training phase when there was no consequence for incorrect nose pokes, all groups earned the same number of rewards, regardless of optical stimulation or pharmacological blockade of ACh receptors, suggesting that ACh is not involved in the motor aspects of the task or the value of the reward. Indeed, differences emerged only during the Training phase, when attention to the tone was critical to earn rewards. Further, incorrect nose poking remained high for mice administered scopolamine. This suggests that scopolamine-treated animals were seeking the reward, as in the operant familiarization and Pre-Training phases of training, but were unable to learn that they should only nose poke in response to the tone.
Cell-type-specific expression of AChRs and activity-dependent effects place cholinergic signaling at a prime position to shape BLA activity during learning. For instance, late-firing interneurons in the BLA exhibit nAChR-dependent EPSP’s when no effect is seen on fast-spiking interneurons, while principal neurons can be either excited or inhibited through mAChRs, depending on activity level of the neuron at the time of cholinergic stimulation (Unal et al., 2015). BLA mAChRs can support persistent firing in principal neurons and can be important for the expression of conditioned place preference behavior, as well as trace fear conditioning (Baysinger et al., 2012; Egorov et al., 2006; McIntyre et al., 1998). Similar to studies of trace fear conditioning, in which activity of the network over a delay period must be maintained, we found that metabotropic (mAChRs) but not ionotropic (nAChRs) ACh receptors were required for learning the contingency of this cue-reward task. The timing of cholinergic signaling can be a critical factor in the induction of synaptic plasticity in other brain regions, so we hypothesized that the enhancement of cue-reward learning observed might be dependent upon when NBM-BLA terminal fibers were stimulated with respect to tone presentation and/or behavioral responses (Gu and Yakel, 2011). However, we found that heightened ACh signaling in the BLA improved behavioral performance even when stimulations were explicitly unpaired with the cue or correct nose poking. This suggests that the effect of increased cholinergic signaling in the BLA is long lasting, and that stimulation during a learning session is sufficient to potentiate synaptic events linking the cue to a salient outcome—even if CS and/or reward delivery are presented tens of seconds later. Given the findings from fiber photometry recordings, which showed endogenous ACh release was time-locked to salient stimuli during the task and evolved with learning, it is surprising that time-locking of exogenous ACh release was not necessary for enhancement of cue-reward learning. Coupled with pharmacological evidence demonstrating that muscarinic signaling is necessary for reward learning in this task, these results suggest the involvement of metabotropic signaling downstream of muscarinic receptors that outlasts the initial cholinergic stimulation.
To conclude, the abundant ACh input to the BLA results in ACh release in response to stimuli that predict reward in a learned cue-reward task. Increasing cholinergic signaling results in accelerated learning of the cue-reward contingency. These findings are consistent with the hypothesis that ACh is a neuromodulator that is released in response to salient stimuli and suggests that ACh signaling may enhance neuronal plasticity in the BLA network, leading to accelerated cue-reward learning.
Materials and methods
All procedures were approved by the Yale University Institutional Animal Care and Use Committee (protocol: 2019–07895) in compliance with the National Institute of Health’s Guide for the Care and Use of Laboratory Animals. Experiments were performed in mice of both sexes, in keeping with the NIH policy of including sex as a biological variable. Sex of mice in behavioral graphs is indicated by circles for females and squares for males.
Female and male heterozygous mice with Cre recombinase knocked into the choline acetyltransferase (ChAT) gene (ChAT-IRES-Cre, B6;129S6-Chattm2(cre)Lowl/J, Jackson Laboratory, Bar Harbor, ME) were bred in house by mating ChAT-IRES-Cre with C57BL6/J mice. CaMKIIα-Cre (Tg(Camk2a-cre)2Gsc) mice obtained from Ronald Duman (Casanova et al., 2001; Wohleb et al., 2016) were bred in house as above. C57BL6/J mice were obtained from The Jackson Laboratory at 6–10 weeks of age, and tested at 5–7 months of age, following at least 1 week of acclimation. All mice were maintained in a temperature-controlled animal facility on a 12 hr light/dark cycle (lights on at 7:00 AM). Mice were group housed 3–5 per cage and provided with ad libitum food and water until undergoing behavioral testing. Mice were single housed 1–3 weeks before surgery to facilitate food restriction and body weight maintenance.
Surgical proceduresRequest a detailed protocol
Surgical procedures for behavior were performed in fully adult mice at 4–6 months of age, age-matched across conditions. For viral infusion and fiber implantation, mice were anesthetized using isoflurane (induced at 4%, maintained at 1.5–2%) and secured in a stereotactic apparatus (David Kopf Instruments, Tujunga, CA). The skull was exposed using a scalpel and Bregma was determined using the syringe needle tip (2 µL Hamilton Neuros syringe, 30 gauge needle, flat tip; Reno, NV).
For fiber photometry surgeries, 0.4 µL of AAV9 hSyn-ACh3.0 (Vigene Biosciences Inc) to measure BLA ACh levels (Figure 2A-E + Figure 2—figure supplements 1–2) was delivered unilaterally to the BLA (A/P; −1.34 mm, M/L + or - 2.65 mm, D/V −4.6 mm, relative to Bregma) of ChAT-IRES-Cre or wild-type C57BL6/J mice at a rate of 0.1 µL/min. The needle was allowed to remain at the infusion site for 5 min before and 5 min after injection. A mono fiber-optic cannula (1.25 mm outer diameter metal ferrule; 6 mm long, 400 µm core diameter/430 µm outer diameter, 0.48 numerical aperture (NA), hard polymer cladding outer layer cannula; Doric Lenses, Quebec City, Quebec, Canada) was implanted above the BLA (A/P; −1.34 mm, M/L + 2.65 mm, D/V −4.25 mm) and affixed to the skull using opaque dental cement (Parkell Inc, Edgewood, NY). For BLA CaMKIIα cell calcium dynamic recordings (Figure 3 + Figure 3—figure supplements 1–2), 0.5 µL of AAV1 Syn-FLEX-GCaMP6s-WPRE-SV40 (Addgene, Watertown, MA) was injected into the left BLA using the same procedure and coordinates but was injected into CaMKIIα-Cre mice. Cholinergic NBM-BLA terminal fiber calcium dynamic recording (Figure 2F-J + Figure 2—figure supplements 5–8) surgeries were performed as above except AAV1-Syn-FLEX-jGCaMP7s-WPRE (Addgene) was infused unilaterally into the NBM (A/P: - 0.7 mm, M/L + or - 1.75 mm, D/V – 4.5 mm) of ChAT-IRES-Cre mice, with the optical fiber being placed above the ipsilateral BLA. The jRCaMP1b + ACh3.0 surgeries to simultaneously measure cholinergic NBM-BLA terminal fiber calcium dynamics and BLA ACh levels (Figure 2—figure supplement 9) consisted of both the NBM and BLA infusions above, except AAV1 Syn-FLEX-NES-jRCaMP1b-WPRE-SV40 (Addgene) was infused the NBM of ChAT-IRES-Cre mice. The RCaMP sham mouse (Figure 2—figure supplement 9E,H) was a wild-type littermate and thus had no jRCaMP1b expression. pAAV.Syn.Flex.GCaMP6s.WPRE.SV40 (Addgene viral prep # 100845-AAV1; http://n2t.net/addgene:100845; RRID:Addgene_100845), pGP-AAV-syn-FLEX-jGCaMP7s-WPRE was a gift from Douglas Kim and GENIE Project (Addgene viral prep # 104491-AAV1; http://n2t.net/addgene:104491; RRID:Addgene_104491), and pAAV.Syn.Flex.NES-jRCaMP1b.WPRE.SV40 (Addgene viral prep # 100850-AAV1; http://n2t.net/addgene:100850; RRID:Addgene_100850) were gifts from Douglas Kim and GENIE Project (Chen et al., 2013; Dana et al., 2016; Dana et al., 2019).
Mice were allowed to recover in a cage without bedding with a microwavable heating pad underneath it until recovery before being returned to home cage. For 2 d following surgery, mice received 5 mg/Kg Rimadyl i.p (Zoetis Inc, Kalamazoo, MI) as postoperative care.
For optical stimulation experiments (Figure 4, Figure 6 + Figure 4—figure supplements 1–5 + Figure 6—figure supplements 1–2), surgeries were performed as above except as follows: 0.5 µL of control vector (AAV2 EF1a-DIO-EYFP) or channelrhodopsin (AAV2 EF1a-DIO-hChR2(H134R)-EYFP; University of North Carolina Gene Therapy Center Vector Core, Chapel Hill, NC) was delivered bilaterally into the NBM (A/P: - 0.7 mm, M/L ± 1.75 mm, D/V – 4.5 mm) of ChAT-IRES-Cre mice. Mono fiber-optic cannulas (1.25 mm outer diameter zirconia ferrule; 5 mm long, 200 µm core diameter/245 µm outer diameter, 0.37 NA, polyimide buffer outer layer cannula; Doric Lenses) were inserted bilaterally above the basolateral amygdala (BLA, A/P; −1.22 mm, M/L ± 2.75 mm, D/V −4.25 mm). Mice were randomly assigned to EYFP or ChR2 groups, controlling for average group age.
For ex vivo local field potential electrophysiology experiments (Figure 4B), the NBM was injected with DIO-ChR2-EYFP as described above, except mice were 8 weeks of age (see Supplemental Methods for current clamp recording methods). The coronal brain slices containing the NBM were prepared after 2–4 weeks of expression. Briefly, mice were anesthetized with 1× Fatal-Plus (Vortech Pharmaceuticals, Dearborn, MI) and were perfused through their circulatory systems to cool down the brain with an ice-cold (4°C) and oxygenated cutting solution containing (mM): sucrose 220, KCl 2.5, NaH2PO4 1.23, NaHCO3 26, CaCl2 1, MgCl2 6 and glucose 10 (pH 7.3 with NaOH). Mice were then decapitated with a guillotine immediately; the brain was removed and immersed in the ice-cold (4°C) and oxygenated cutting solution to trim to a small tissue block containing the NBM. Coronal slices (300 μm thick) were prepared with a Leica vibratome (Leica Biosystems Inc, Buffalo Grove, IL) after the tissue block was glued on the vibratome stage with Loctite 404 instant adhesive (Henkel Adhesive Technologies, Düsseldorf, Germany). After preparation, slices were maintained at room temperature (23–25°C) in the storage chamber in the artificial cerebrospinal fluid (ACSF; bubbled with 5% CO2% and 95% O2) containing (in mM): NaCl 124, KCl 3, CaCl2 2, MgCl2 2, NaH2PO4 1.23, NaHCO3 26, glucose 10 (pH 7.4 with NaOH) for recovery and storage. Slices were transferred to the recording chamber and constantly perfused with ACSF with a perfusion rate of 2 mL/min at a temperature of 33°C for electrophysiological experiments. Cell-attached extracellular recording of action potentials was performed by attaching a glass micropipette filled with ACSF on EYFP-expressing cholinergic neurons with an input resistance of 10–20 MΩ under current clamp. Blue light (488 nm) pulse (60 ms) was applied to the recorded cells through an Olympus BX51WI microscope (Olympus, Waltham, MA) under the control of the Sutter filter wheel shutter controller (Lambda 10–2, Sutter Instrument, Novato, CA). All data were sampled at 3–10 kHz, filtered at 3 kHz and analyzed with an Apple Macintosh computer using Axograph X (AxoGraph). Events of field action potentials were detected and analyzed with an algorithm in Axograph X as reported previously (Rao et al., 2008).
HabituationRequest a detailed protocol
One week after surgery, mice were weighed daily and given sufficient food (2018S standard chow, Envigo, Madison, WI) to maintain 85% free-feeding body weight. All behavioral tests were performed during the light cycle. Mice were allowed to acclimate to the behavioral room for 30 min before testing and were returned to the animal colony after behavioral sessions ended.
Two weeks after surgery, mice were handled 3 min per day for 7 d in the behavioral room. Mice were given free access to the reward (EnsurePlus Vanilla Nutrition Shake solution mixed with equal parts water (Ensure); Abbott Laboratories, Abbott Park, IL) in a 50 mL conical tube cap in their home cages on the last 3 d of handling to familiarize them to the novel solution. Mice were also habituated to patch cord attachment during the last 3 d of handling for optical stimulation and fiber photometry experiments. Immediately before training each day, a patch cord was connected to their optical fiber(s) via zirconia sleeve(s) (1.25 mm, Doric Lenses) before being placed in the behavioral chamber.
Operant trainingRequest a detailed protocol
All operant training was carried out using Med Associates modular test chambers and accessories (ENV-307A; Med Associates Inc, Georgia, VT). For optical stimulation experiments, test chambers were housed in sound attenuating chambers (ENV-022M). Two nose poke ports (ENV-313-M) were placed on the left wall of the chamber and the reward receptacle (ENV-303LPHD-RL3) was placed on the right wall. The receptacle cup spout was connected to a 5 mL syringe filled with Ensure loaded in a single speed syringe pump (PHM-100). Nose pokes and receptacle entries were detected by infrared beam breaks. The tone generator (ENV-230) and speaker (ENV-224BM) were placed outside the test chamber, but within the sound attenuating chamber, to the left. The house light (used for timeout, ENV-315M) was placed on top of the tone generator to avoid snagging patch cords. Each chamber had a fan (ENV-025F28) running throughout the session for ventilation and white noise. Behavior chambers were connected to a computer running MEDPC IV to collect event frequency and timestamps. For optical stimulation experiments, a hole drilled in the top of the sound attenuating chambers allowed the patch cord to pass through. Initial BLA ACh3.0 (Figure 2A–E) and BLA CaMKIIα GCaMP6s (Figure 3) fiber photometry recordings occurred in a darkened behavioral room outside of sound attenuating chambers due to steric constraints with rigid fiber photometry patch cords. Later behavioral chamber optimization (wall height was extended with 3D printed and laser cut acrylic panels to allow removing the test chamber lid while preventing escape) allowed all other fiber photometry cohorts to be tested inside sound attenuating chambers. For fiber photometry experiments, a custom receptacle was 3D printed that extended the cup beyond the chamber wall to allow mice to retrieve the reward with more rigid patch cords. Each mouse was pseudo-randomly assigned to behavioral chamber when multiple chambers were used, counterbalancing for groups across boxes.
Three weeks after surgery, initial operant familiarization consisted of one 35 min session of Free Reward to demonstrate the location of reward delivery; all other sessions were 30 min. During Free Reward, only the reward receptacle was accessible. After 5 min of habituation, Ensure (24 µL over 2 s) was delivered in the receptacle cup and a light was turned on above the receptacle. The receptacle light was turned off upon receptacle entry. The next phase of operant familiarization, mice learned to nose poke to receive reward on a fixed-ratio one (FR1) schedule of reinforcement. Mice in experiments involving manipulations (optical stimulation and antagonist studies) were pseudo-randomly assigned to left or right active (reinforced) nose poke port. Mice in fiber photometry experiments were all assigned to right active port to minimize potential across subject variability. The inactive (unreinforced) port served as a locomotor control. During FR1 operant familiarization, each nose poke response into the active port resulted in receptacle light and reward delivery. After the mice reached criterion on FR1 operant familiarization (group average of 30 rewards for 2 consecutive days, usually 4–5 d), mice were advanced to the Pre-Training phase. This phase incorporated an auditory tone (2.5–5 kHz, ~60 dB) that lasted for at most 10 s and signaled when active nose pokes would be rewarded. Only active nose pokes made during the 10 s auditory tone (correct nose pokes) resulted in reward and receptacle light delivery. The tone co-terminated with Ensure delivery. During Pre-Training, there was no consequence for improper nose pokes, neither in the active port outside the tone (incorrect nose pokes) nor in the inactive port (inactive nose pokes). The number of inactive nose pokes were typically very low after operant familiarization and were not included in analysis. After reward retrieval (receptacle entry following reward delivery) the receptacle light was turned off and the tone was presented again on a variable intertrial interval schedule with an average interval of 30 s (VI 30), ranging from 10 to 50 s (Ambroggi et al., 2008). After 4–5 d of Pre-Training, mice progressed to the Training phase, which had the same contingency as Pre-Training except incorrect nose pokes resulted in a 5 s timeout signaled by house light illumination, followed by a restarting of the previous intertrial interval. Mice were considered to have acquired the task after earning 20 rewards during the Training phase of the task. In order to promote task acquisition, mice that were not increasing number of rewards earned reliably were moved to a VI 20 schedule after 9 d of VI 30 Training for BLA ACh3.0 or 6–7 d for BLA CaMKIIα cell mice. The VI 20 schedule was only needed for the two groups that were trained outside of the sound attenuating chambers. Mice progressed to Extinction after 12 d of Training or, in the case of fiber photometry cohorts, once all mice met the acquisition criteria. Extinction was identical to Training except no Ensure was delivered in response to correct nose pokes. The replicate cohorts of the BLA CaMKIIα GCaMP6s and NBM-BLA terminal fiber recording experiments were advanced to 1 d of Extinction after only 7 d of Training due to the COVID-19 shutdown.
Between mice, excrement was removed from the chambers with a paper towel. At the end of the day chambers were cleaned with Rescue Disinfectant (Virox Animal Health, Oakville, Ontario, Canada) and Ensure syringe lines were flushed with water then air. Mice were excluded from analyses if a behavioral chamber malfunctioned (e.g. syringe pump failed) or they received the improper compound. Fiber photometry mice were excluded from analyses if they did not meet the acquisition criterion by the last day of Training. See Supplementary file 1 for number of mice that acquired, were excluded, and further explanations for behavioral paradigm deviations.
Optical stimulationRequest a detailed protocol
Optical stimulation was generated by a 473 nm diode-pumped solid-state continuous wave laser (Opto Engine LLC, Midvale, UT) controlled by a TTL adapter (SG-231, Med Associates Inc). The laser was connected to a fiber optic rotary joint (Doric Lenses) via a mono fiber optic patch cord (200 µm core, 220 µm cladding, 0.53 NA, FC connectors; Doric Lenses). The rotary joint was suspended above the sound attenuating chamber with a connected branching fiber optic patch cord (200 µm core, 220 µm cladding, 0.53 NA, FC connector with metal ferrule; Doric Lenses) fed into the behavioral box. Laser power was adjusted to yield 10–12 mW of power at each fiber tip. The stimulation pattern was 25 ms pulses at 20 Hz for 2 s modified from parameters in Jiang et al., 2016. Jiang et al. used a 20 Hz pulse frequency, 5 ms pulses, and 10–12 mW power at the fiber tips. In this study, we used a 2 s stimulation duration because it matched the time of syringe pump activation for reward delivery and co-terminated with the pump and auditory tone. A 25 ms pulse width was used because our lasers were not able to generate sufficient power with 5 ms pulses. Optical stimulation was only delivered during the Pre-Training and Training phases of the operant task. Both control (EYFP) and experimental (ChR2) groups received identical light delivery, and stimulation was triggered by a correct nose poke and co-terminated with the auditory tone and Ensure delivery. For the non-contingent experiment, the number of light stimulations was yoked to the concurrently running ChR2 mouse. The timing of the non-contingent stimulation was explicitly unpaired with correct nose pokes or tones and was held in queue until the mouse had not made a response in the last 2 s, a tone was not going to be delivered within the next 2 s, or at least 5 s had passed since the mouse entered the receptacle after earning reward.
AcquisitionRequest a detailed protocol
Fluorescent measurements of ACh and calcium levels were recorded using two Doric Lenses 1-site Fiber Photometry Systems: a standard 405/465 nm system and a 405/470/560 nm system. The standard 405/465 system was configured as follows: the fiber photometry console controlled the two connectorized LEDs (CLEDs, 405 nm modulated at 208.616 Hz and 465 nm modulated at 572.205 Hz) through the LED module driver (Cassidy et al., 2019). Each CLED was connected via attenuating patch cord to the five-port Fluorescence MiniCube (FMC5_AE(405)_AF(420-450)_E1(460–490)_F1(500–550)_S). A pigtailed fiber optic rotary joint was connected to the MiniCube and suspended above the behavioral chamber with a rotary joint holder in order to deliver and receive light through the implanted optical fiber. The other end of the rotary joint was connected to the mono fiber optic patch cord via M3 connector and attached with a zirconia sleeve to the implanted fiber optic as above. The F1 (500–550 nm) port of the MiniCube was connected to the photoreceiver (AC low mode, New Focus 2151 Visible Femtowatt Photoreceiver, New Focus, San Jose, CA) via a fiber optic adapter (Doric Lenses) that was finally connected back to the fiber photometry console through an analog port. The 405/470/560 nm system was set up similarly, except a 560 nm LED was incorporated (modulated at 333.786 Hz), a six-port MiniCube with two integrated photodetector heads was used (iFMC6_IE(400-410)_E1(460–490)_F1(500–540)_E2(555–570)_F2(580–680)_S), and Doric Fluorescence Detector Amplifiers were used (AC 1× or 10× mode, DFD_FOA_FC). A TTL adapter (SG-231, Med Associates Inc) was connected to the digital input/output port to allow for timestamping when events occurred in the behavioral chamber. Signal was recorded using Doric Neuroscience Studio (V 22.214.171.124) via the Lock-In demodulation mode with a sampling rate of 12.0 kS/s. Data were decimated by a factor of 100 and saved as a comma-separated file.
Pre-processingView detailed protocol
Pre-processing of raw data was performed using a modified version of a MATLAB (MathWorks, Natick, MA) script provided by Doric. The baseline fluorescence (F0) was calculated using a first-order least mean squares regression over the ~30 min recording session. Second-order least mean squares regressions were used when photobleaching of the sensor was more pronounced, as in the case of NBM-BLA terminal fiber recordings. The change in fluorescence for a given time point (ΔF) was calculated as the difference between it and F0, divided by F0, which was multiplied by 100 to yield %ΔF/F0. The %ΔF/F0 was calculated independently for both the signal (465 nm) and reference (405 nm) channels to assess the degree of movement artifact. Since little movement artifact was observed in the recordings (Figure 2—figure supplement 1B–C, Figure 2—figure supplement 5D–E, Figure 3—figure supplement 1C–D, tan lines), the signal %ΔF/F0 was analyzed alone (the code provided allows for running the entire analysis pipeline with the reference channel %ΔF/F0 if desired). The %ΔF/F0 was z-scored to give the final Z%ΔF/F0 reported here. For the BLA CaMKIIα cell recordings (Figure 3—figure supplement 1C–D), the reference channel displayed some mirroring (moving in the opposite direction) compared to the signal. This is likely because 405 nm is not the ‘true’ isosbestic point for GCaMP and we were instead measuring some changes in calcium-unbound GCaMP rather than calcium-insensitive GCaMP signal alone (Barnett et al., 2017; Kim et al., 2016a; Sych et al., 2019). Graphs and heatmaps for averaged traces aligned to actions were based on licking bout epoch filtering code from TDT (Alachua, FL; link in code comments).
HeatmapsRequest a detailed protocol
Combined action heatmaps were generated in MATLAB (2020a) by analyzing data 5 s preceding tone onset (rewarded trials only) to 5 s after receptacle entry. Actions were aligned despite variable latencies by evenly splitting a maximum of 4 s post-tone onset/pre-correct nose poke and 1 s post-correct nose poke/pre-receptacle entry for each trial within a day. The resulting aligned trials were averaged to generate daily averages that made up the rows of the individual animal heatmaps. Blanks in the rows of heatmaps (black time bins) indicate time bins added for alignment, meaning that no trials for that day had a latency that stretched the entire window. Only rewarded trials where the mouse entered the receptacle within 5 s after nose poke were analyzed. Full or partial training days were excluded from analysis if there were acquisition issues such as the patch cord losing contact with the fiber or behavioral apparatus malfunction. Lack of trials (fewer than three) for analysis or recording issues led to missing rows of fiber photometry data in the heatmap despite having behavioral data, in which case these rows were skipped rather than adding entire blank rows. Due to individual differences in behavior, across-mouse average data was calculated by using a selection of days in which behavior was roughly similar or milestones such as the first and last day of Pre-Training, first day earning 10 rewards in Training, first day crossing acquisition threshold (and maintaining afterward), last day of Training, last day of Extinction (with three or more rewarded trials that met analysis criteria). Additional days were included in across-mouse average heatmaps when possible. Incorrect nose poke heatmaps were generated by averaging signals for 5 s before and 5 s after incorrect nose pokes that were not preceded by an incorrect nose poke in the last 5 s. The incorrect nose poke heatmaps averaged across mice were generated using the same selection of days as the combined action heatmaps for a given experiment.
Bootstrapped confidence interval analysesRequest a detailed protocol
Bootstrapped confidence intervals (bCIs) of the Z-scored % ΔF/F0 fiber photometry data within and across mice were generated using the methods described in Jean-Richard-Dit-Bressel et al., 2020 for the following events: tone onset, correct nose poke, receptacle entry, and incorrect nose poke. For the within-mouse analysis by day, trials were aligned to event onset, and bCIs were generated for events that had at least 3 trials from 5 s before to 10 s after each event. Each series of data were bootstrapped 10,000 times and a two-sided 99% confidence interval was constructed. Data were considered significantly different from baseline (Z% ΔF/F0 = 0) when their 99% bCIs did not contain zero for an interval of time designated by a consecutive threshold of 0.5 s.
To avoid comparing vastly different numbers of trials, in graphs where correct and incorrect nose pokes were plotted together, incorrect nose pokes were downsampled to match the number of correct nose poke trials. For Incorrect Nose Pokes graphs where last Pre-Training Day and Training Day 1 were plotted together, both days were downsampled to the number of correct nose pokes on the last Pre-Training Day.
For the combined action bCI plots (tone onset, correct nose poke, and receptacle entry), the selection of days for each mouse matched that of the cohort-averaged combined action heatmaps. The three-event plots were combined by cropping to match the maximum latencies used in the combined action heatmaps. For the across-mouse averaged bCI plots, analyses were carried out as above except the bootstrapping used mouse trial averages. The mean lines for across-mouse averaged bCI plots were calculated by taking the mean of all individual trials together. The NBM-BLA cholinergic terminal fiber experiment required combining the two independent cohorts to obtain n ≥ 3. For the incorrect nose poke bCI plots, the number of trials used for each day was downsampled to 20 if a mouse performed more than 20.
PharmacologyRequest a detailed protocol
Male wild-type C57BL/6J mice were injected i.p. 30 min before each Pre-Training and Training session with a volume of 10 mL/kg with the following compounds: 1× DPBS (Thermo Fisher Scientific, Waltham, MA), 1 mg/kg mecamylamine hydrochloride (Millipore Sigma, St. Louis, MO), 0.5 mg/kg (-) scopolamine hydrochloride (Millipore Sigma), or 1 mg/kg mecamylamine + 0.5 mg/kg scopolamine (Figure 5 + Figure 5—figure supplement 1).
HistologyRequest a detailed protocol
After completion of behavioral experiments, animals were anesthetized with 1× Fatal-Plus (Vortech Pharmaceuticals). Once there was no response to toe-pinch, mice were transcardially perfused with 20 mL ice-cold 1× DPBS followed by 20 mL 4% paraformaldehyde (PFA, Electron Microscopy Sciences, Hatfield, PA). Brains were extracted and post-fixed for at least 1 d in 4% PFA at 4°C and transferred to 30% sucrose (Millipore Sigma) for at least 1 dy at 4°C. Brains were sliced 40 µm thick on a self-cooling microtome and stored in a 0.02% sodium azide (Millipore Sigma) PBS solution. Brain slices were washed in PBS, blocked for 2–3 hr (0.3% Triton X-100, American Bioanalytical, Canton, MA; 3% normal donkey serum, Jackson ImmunoResearch, West Grove, PA), then incubated overnight with primary antibodies (1:1000 + 1% normal donkey serum). Slices were then washed in PBS and incubated with secondary antibodies (1:1000) for 2 hr, washed, stained with DAPI for 5 min, washed, mounted, and coverslipped with Fluoromount-G (Electron Microscopy Sciences). All incubations were at room temperature. Microscope slides were imaged using a FLUOVIEW FV10i confocal microscope (Olympus). Injection sites and fiber placements were designated on modified Allen Mouse Brain Atlas figures (Lein et al., 2007). Mice were excluded from analyses if fluorescence was not observed at injection sites or if fiber tips were not identified at the intended site.
Statistical analyses of behaviorRequest a detailed protocol
Operant behavioral data saved by MEDPC IV was transferred to Excel using MPC2XL. Data were organized in MATLAB and analyzed in Prism (V8.3.0, GraphPad Software, San Diego, CA). Differences between groups and interactions across days for Training were evaluated using Two-Way Repeated Measures ANOVAs. We computed the required sample size for a 90% power level with an alpha of 0.05 by estimating the control (EYFP) group mean would be 10 rewards and the mean experimental (ChR2) group would be 20 rewards with a standard deviation of 5. We utilized a power calculator for continuous outcomes of two independent samples, assuming a normal distribution. The result was six samples per group. Each manipulation experiment started with at least six mice included in each group (Sealed Envelope, 2020). In each experiment, each animal within a group served as a biological replicate. These studies did not include technical replicates. Masking was not applied during data acquisition but data analyses were semi-automated in MATLAB and performed blind to condition.
Ex Vivo electrophysiology
Coronal brain slices were prepared from virus injected mice after 3 weeks from surgery. Animals were anesthetized with a mixture of ketamine and xylazine (100 mg ketamine and 6 mg xylazine/kg body weight injected ip). Then the mice were transcardially perfused with a sucrose-based solution (see below). After decapitation, the brain was rapidly transferred into a sucrose-based cutting solution bubbled with 95% O2 and 5% CO2 and maintained at ~3°C. This solution contained (in mM): sucrose 230; KCl 2.5; MgSO4 10; CaCl2 0.5; NaH2PO4 1.25; NaHCO3 26; glucose 10 and pyruvate 1.5. Coronal brain slices (300 µm) were prepared using a Leica VT1000S vibratome (Leica Biosystems Inc). Slices were equilibrated with a mixture of oxygenated artificial cerebrospinal fluid (aCSF) and sucrose-based cutting solution at room temperature (24–26°C) for at least 1 hr before transfer to the recording chamber. Pyruvate (0.15–0.75 mM) was added to reduce oxidative damage and enhance survival. With this protocol, slices are initially incubated in a mixture of 50% cutting solution with pyruvate and 50% aCSF (in mM): sucrose 115; NaCl 63; KCl 2.5; NaH2PO4 1.25; MgSO4 5; CaCl21.25; MgCl2 1; NaHCO3 26; glucose 10; and sodium pyruvate 0.75 at 35°C for 30 min and then transferred to a mixture of 10% cutting solution and 90% aCSF (in mM): sucrose 23; NaCl 113.4; KCl 2.5; NaH2PO4 1.25; MgSO4 1; CaCl2 1.85; MgCl2 1.8; NaHCO3 26; glucose 10; and sodium pyruvate 0.15 at 35°C for 1–4 hr before recording. The slices were continuously superfused with aCSF at a rate of 2 ml/min containing (in mM); NaCl 126, KCl 2.5, NaH2PO4 1.25, NaHCO3 26, CaCl2 2, MgCl2 2 and glucose 10 bubbled with 95% O2 and 5% CO2 at room temperature.
Brain slices were placed on the stage of an upright, infrared-differential interference contrast microscope (Olympus BX51WI, Olympus). NBM neurons were visualized with a 40 × water-immersion objective by infrared microscopy (COHU 4915 camera, COHU, Inc, Poway, CA). Patch electrodes with a resistance of 4–6 MΩ were pulled with a laser-based micropipette puller (P-2000, Sutter Instrument Company). Signals were recorded with a Multi Clamp 700A amplifier and pClamp10 software (Molecular Devices, Inc, San Jose, CA). The pipette solution contained (in mM) 130 K-gluconate, 2 KCl, 2 MgCl2, 10 HEPES, 0.5 EGTA, 1 ATP and 0.2 GTP (pH = 7.3).
To examine action potential firing frequency, NBM neurons were recorded in a current clamp configuration after forming a giga-ohm seal. Membrane potentials were clamped at −60 mV by injecting 0- ~ 50 pA current through the recording electrode as needed. Cells that maintained steady membrane potentials for at least five mins were included in the analysis.
Optogenetic stimulation ex vivo
Channelrhodopsin was activated with a train of light flashes delivered through the 40× microscope objective. The light source was an Olympus x-cite 120Q lamp (Olympus) gated with a TTL controlled shutter (LAMBDA SC, Sutter Instrument). The filter cube contained an HQ480/40x excitation filter, a Q505lp bypass filter and an HQ535/50 m emission filter (Chroma Technology Corp., Bellows Falls, VT). The fluorescence illumination intensity delivered at the brain slices was adjusted to 1–3 mW/mm2, measured with a PM100D optical power and energy meter (Thorlabs Inc, Newton, NJ). In the NBM, cholinergic neurons were identified by EGFP fluorescence and light flashes were delivered at 1 Hz, 5 Hz, 10 Hz, 15 Hz, 20 Hz, 25 Hz, and 30 Hz.
After xtinction, responding was reinstated in Training for 2 d. Then mice underwent a modified Training paradigm where correct nose pokes yielded only laser stimulation, without Ensure delivery.
Real time place preference
An empty, clear mouse cage (29.5 cm × 19 cm × 12.5 cm) had half of its floor covered in printer paper to provide a distinct floor texture. A video camera was placed above the cage and was connected to a computer running EthoVision XT (version 10.1.856, Noldus, Wageningen, Netherlands) to track the position of the mouse and deliver optical stimulation when the mouse was on the laser-paired side (via TTL pulse to OTPG_4 laser controller (Doric Lenses) connected to the laser; 20 Hz, 25 ms pulses). Mice were randomly assigned and counterbalanced to receive laser stimulation only on one side of the cage. Mice were allowed free access to either side for 15 min during a session. Baseline was established in the absence of optical stimulation on day 1. Mice then received optical stimulation on Day two only when on the laser-paired side. Data are presented as percent time spent on the laser-paired side.
Progressive ratio testing
In the progressive ratio test, mice were given 60 min to nose poke for ensure and 2 s of optical stimulation on a progressive ratio schedule (escalations given below). Training day escalation: 1, 2, 2, 2, 2, 3, 3, 3, 3, 3, 5, 5, 5, 5, 5, 8, 8, 8, 8, 8, 8, 11, 11, 11, 11, 11, 11, 15, 15, 15, 15, 15, 15, 22, 22, 22, 22, 22, 33, 33, 33, 33, 33, 44, 44, 44, 44, 44, 55, 66, 77, 88, 99, 133, 166, 199, 255, 313, 399, 499, 599, 777, 900,1222. Test Day escalation: 1, 2, 4, 6, 9, 12, 15, 20, 25, 32, 40, 50, 62, 77, 95, 118, 145, 178, 219, 268, 328, 402, 492, 603, 777, 900, 1222.
Mice were placed in a square box (47 cm × 47 cm × 21 cm) for 20 min with a floor of filter paper that was changed between mice. During the 3rd 5 min bin of the session, mice received optical stimulation (20 s on/off, 20 Hz, 25 ms pulses). Locomotor activity was recorded via overhead camera and analyzed in 5 min bins with EthoVision.
Locomotor data was collected using an Accuscan Instruments (Columbus, Ohio) behavior monitoring system and software. Mice were individually tested in empty cages, with bedding and nesting material removed to prevent obstruction of infrared beams. Mice were injected (i.p.) with saline, mecamylamine (1 mg/kg, Sigma), scopolamine (0.5 mg/kg, Sigma), or mecamylamine+scopolamine (1 mg/kg and 0.5 mg/kg, respectively) 30 min before locomotor testing. Locomotion was monitored for 20 min using 13 photocells placed 4 cm apart to obtain an ambulatory activity count, consisting of the number of beam breaks recorded during a period of ambulatory activity (linear motion rather than quick, repetitive beam breaks associated with behaviors such as scratching and grooming).
Light/dark box exploration
A rectangular box was divided evenly into a light (clear top, illuminated by an 8W tube light) and dark (black walls, black top) side with a black walled divider in the middle with a small door. The lid and divider were modified to allow the optical fiber and patch cord to pass through freely. Mice were placed facing the corner on the light side furthest from the divider and the latency to crossing to the dark side was measured. The number of crosses and time spent on each side were measured for 6 min following the initial cross.
All data generated or analysed during this study are included in the manuscript and supporting files. Source data files have been provided for all experiments on Dryad Digital Repository: https://doi.org/10.5061/dryad.3xsj3txcf.
Dryad Digital RepositoryAcetylcholine is released in the basolateral amygdala in response to predictors of reward and enhances learning of cue-reward contingency.https://doi.org/10.5061/dryad.3xsj3txcf
The amygdala and rewardNature Reviews Neuroscience 3:563–573.https://doi.org/10.1038/nrn875
Muscarinic control of graded persistent activity in lateral amygdala neuronsEuropean Journal of Neuroscience 24:3183–3194.https://doi.org/10.1111/j.1460-9568.2006.05200.x
Amygdala inputs to the ventral Hippocampus bidirectionally modulate social behaviorJournal of Neuroscience 34:586–595.https://doi.org/10.1523/JNEUROSCI.4257-13.2014
Analyzing Event-Related transients: confidence intervals, permutation tests, and consecutive thresholdsFrontiers in Molecular Neuroscience 13:14.https://doi.org/10.3389/fnmol.2020.00014
A genetically encoded fluorescent acetylcholine Indicator for in vitro and in vivo studiesNature Biotechnology 36:726–737.https://doi.org/10.1038/nbt.4184
Antagonistic negative and positive neurons of the basolateral amygdalaNature Neuroscience 19:1636–1646.https://doi.org/10.1038/nn.4414
The lateral amygdaloid nucleus: sensory interface of the amygdala in fear conditioningThe Journal of Neuroscience 10:1062–1069.https://doi.org/10.1523/JNEUROSCI.10-04-01062.1990
State-specific gating of salient cues by midbrain dopaminergic input to basal amygdalaNature Neuroscience 22:1820–1833.https://doi.org/10.1038/s41593-019-0506-0
Dual-transmitter systems regulating arousal, attention, learning and memoryNeuroscience & Biobehavioral Reviews 85:21–33.https://doi.org/10.1016/j.neubiorev.2017.07.009
Vesicular glutamate transporter 3 immunoreactivity is present in cholinergic basal forebrain neurons projecting to the basolateral amygdala in ratThe Journal of Comparative Neurology 498:690–711.https://doi.org/10.1002/cne.21081
Nucleus accumbens Fast-Spiking interneurons constrain impulsive actionBiological Psychiatry 86:836–847.https://doi.org/10.1016/j.biopsych.2019.07.002
Visual responses of neurons in the dorsolateral amygdala of the alert monkeyExperimental Neurology 63:610–626.https://doi.org/10.1016/0014-4886(79)90175-4
Forebrain cholinergic signaling: wired and Phasic, not tonic, and causing behaviorThe Journal of Neuroscience 40:712–719.https://doi.org/10.1523/JNEUROSCI.1305-19.2019
Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learningNature Neuroscience 1:155–159.https://doi.org/10.1038/407
Predictive reward signal of dopamine neuronsJournal of Neurophysiology 80:1–27.https://doi.org/10.1152/jn.19126.96.36.199
WebsitePower calculator for continuous outcome superiority trialAccessed October 10, 2016.
Basolateral amygdala neurons maintain aversive emotional salienceThe Journal of Neuroscience 38:3001–3012.https://doi.org/10.1523/JNEUROSCI.2460-17.2017
Amygdala neurons differentially encode motivation and reinforcementJournal of Neuroscience 27:3937–3945.https://doi.org/10.1523/JNEUROSCI.5281-06.2007
Impact of basal forebrain cholinergic inputs on basolateral amygdala neuronsThe Journal of Neuroscience 35:853–863.https://doi.org/10.1523/JNEUROSCI.2706-14.2015
GABA interneurons mediate the rapid antidepressant-like effects of scopolamineJournal of Clinical Investigation 126:2482–2494.https://doi.org/10.1172/JCI85033
Cholinergic systems in mammalian brain and spinal cordProgress in Neurobiology 37:475–524.https://doi.org/10.1016/0301-0082(91)90006-M
The basal forebrain cholinergic projection system in miceThe Mouse Nervous System 2012:684–718.https://doi.org/10.1016/B978-0-12-369497-3.10028-7
Learning and stress shape the reward response patterns of serotonin neuronsThe Journal of Neuroscience 37:8863–8875.https://doi.org/10.1523/JNEUROSCI.1181-17.2017
Matthew N HillReviewing Editor; University of Calgary, Canada
Laura L ColginSenior Editor; University of Texas at Austin, United States
David M LovingerReviewer; NIH/NIAAA, United States
Gavan P McNallyReviewer; University of New South Wales, Australia
In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.
The reviewers agree that this research is novel, intriguing and important, as well as technically accomplished, set of findings and that the authors have sufficiently addressed all the comments they raised.
Decision letter after peer review:
Thank you for submitting your article "Acetylcholine is released in the BLA in response to predictors of reward and enhances learning of cue-reward contingency" for consideration by eLife. Your article has been reviewed by two peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Laura Colgin as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: David M Lovinger (Reviewer #1); Gavan P McNally (Reviewer #2).
The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.
We would like to draw your attention to changes in our revision policy that we have made in response to COVID-19 (https://elifesciences.org/articles/57162). Specifically, we are asking editors to accept without delay manuscripts, like yours, that they judge can stand as eLife papers without additional data, even if they feel that they would make the manuscript stronger. Thus the revisions requested below only address clarity and presentation.
The manuscript by Crouse et al. uses a variety of approaches to provide important new information about the role of acetylcholine release (ACh) in the basolateral amygdala (BLA) in appetitive learning. The authors use a variety of powerful techniques to examine and manipulate ACh levels and neuronal activity in this brain region. The findings support the conclusion that ACh facilitates conditioning in a cue-driven nosepoke task through actions on a muscarinic receptor. The quality of the data are generally high, especially the fiber photometry data that exhibits little-to-no artifactual signal in most of the experiments. Important controls are included for the most crucial experiments. There are some important and real design strengths here. These include inclusion of data from a pretraining phase for each experiment (showing neatly that the ACh system is neither non-specifically recruited nor non-specifically contributing to general behaviours in this task), concurrent measurement of release via ACh3.0 and terminal activity via jRCaMP1b (showing strong temporal agreement between the measures), pharmacological identification of the critical receptor family, augmentation of learning in the task via ChR2 excitation, and inclusion of useful controls (non-contingent stimulation) that constrain interpretation. The manuscript is well-written with clear findings. Overall, the findings reported here are an important advance and are interesting for their knowledge gains about both ACh and amygdala function in a fundamentally important learning process. There are a few aspects of the study that require clarification, and additional analysis of existing data and other information should be included. No new experiments are required.
1) The first revision relates to the unpaired stimulation experiment. There are three points to address here:
– The duration of the ACh effect suggested by the explicitly unpaired stimulation procedure appears to be on the order or 10s of seconds. This is consistent with mAChR involvement as the authors suggest. It is possible that ACh release induced by optogenetic stimulation is longer-lasting than that induced by CS delivery. This possibility should be considered, and the authors might want to discuss the known durations of ACh modulatory actions. The possibility of co-release of another neurotransmitter or modulator with this optical stimulation should also be considered (Saunders et al., 2015). It may be that strong ACh release activates long-term synaptic plasticity independent of neural responses to CS or reward delivery, and the authors might also mention this possibility.
– How did the authors determine the pattern of light stimulation used in the optogenetic experiments? Were effects of this stimulation pattern examined in slices (Figure 4B only shows responses to lower frequency stimulation)? Please state the frequency and duration of ChR2 stimulation in the main text (subsection “Stimulation of cholinergic terminals in BLA improves cue-reward learning”, first paragraph).
– One of the surprising findings here was that yoked ChR2 stimulation was as effective in enhancing performance. This was surprising given how well timed the endogenous signals are in this task. This finding was not really explored in the Discussion and it appears to contradict interpretation at some key places in the Discussion (e.g., –subsection “Increasing BLA acetylcholine levels enhances cue-reward learning”). Also, the yoked stimulation paradigm may actually be something of a "semi-yoking" procedure in that stimulation was not time-locked to stimulation in the original training group. Have other authors used the term yoked for this exact type of procedure?
2) It is surprising that such a complicated conditioning paradigm (i.e. presentation of a time-out period after incorrect responses in addition to reward for correct responses) is required for learning to respond specifically to the reward-predicting CS. Perhaps this is related to the several days of shaping as this may reinforce a strategy of near-automatized nosepoking that is then harder to modify in subsequent conditioning phases. Was it necessary to shape mice for this long? Also, the task appears to use a Pavlovian-to-instrumental transfer (PIT) conditioning design. If this is the case, perhaps the authors should use that term to describe the paradigm at some point in the manuscript, as that is a generally-accepted term that will be instantly recognized by many readers.
Following the comment above, it isn't clear if all mice learned the task, both in the training period and at the end (i.e. were levels of nose poking greater for correct versus incorrect?). The separation of these data between main and supplementary figures, and the required use of different scales, makes it difficult for the reader to understand the comparative frequencies of the behaviours. The authors should indicate what proportion of mice met the criteria for learning and a clearer statement of the criteria for successful completion of each training phase should also be included in the Materials and methods.
3) There are a few points with the photometry data that need to be addressed.
– In the photometry experiments, the group sizes are relatively small (4) and on some important control experiments (cross talk between the sensors) only single subject data are shown, and this might be addressed in the manuscript. The absence of any quantitative analysis of photometry is a also potential issue, but may well be related to sample sizes. The authors do provide a thoughtful and careful descriptive, qualitative analysis of photometry data, but statistical analyses (either using trial or subjects) appear absent. For the reader, this is especially relevant when trying to understand the relevance of signals around the incorrect nosepoke. There are approaches (within-subject bootstrapped confidence intervals to ask if CI includes 0% df/f or permutation tests) that could be applied. Again, the most straightforward and transparent solution could be to explain this choice to readers.
– For the fiber photometry data, how were data handled if there was mirroring in the 405 channel? If the data in this channel do not reflect calcium changes, presumably the data in the other channels are most likely artifacts. Were these periods then eliminated from analysis as others have done?
– Figure 3—figure supplement 1C is the only instance where the 405 channel shows a noticeable change. Do the authors have any idea why the large decrease was observed in this experiment?https://doi.org/10.7554/eLife.57335.sa1
1) The first revision relates to the unpaired stimulation experiment. There are three points to address here:
– The duration of the ACh effect suggested by the explicitly unpaired stimulation procedure appears to be on the order or 10s of seconds. This is consistent with mAChR involvement as the authors suggest. It is possible that ACh release induced by optogenetic stimulation is longer-lasting than that induced by CS delivery. This possibility should be considered, and the authors might want to discuss the known durations of ACh modulatory actions.
We agree with the reviewers that optogenetic-mediated release of ACh could be longer lasting than that induced by tone onset (CS delivery, or in early phases of training, correct nosepoke). We have added an explicit statement of this:
“It is also possible that optogenetic-mediated ACh release may last longer than endogenous, tone-evoked release. A simultaneous stimulation and recording approach would be required to compare ACh release under both conditions (Pisansky et al., 2019).”
We also added text to the Introduction that explicitly mentions the differing timescales of nAChR and mAChR action:
“The effect of ACh signaling can differ depending on the receptor, as metabotropic mAChRs work on a slower timescale than the rapid, ionotropic nAChRs (Gu and Yakel, 2011; Picciotto et al., 2012).”
The possibility of co-release of another neurotransmitter or modulator with this optical stimulation should also be considered (Saunders et al., 2015).
We agree that co-release of other neurotransmitters along with acetylcholine should be considered and unintentionally neglected to discuss this in the previous version of the manuscript. We have added the following text to the Discussion:
“It is important to note that basal forebrain neurons have been demonstrated to co-release ACh and GABA (Ma et al., 2018; Saunders et al., 2015), and cholinergic basal forebrain neurons that project to the BLA have been shown to co-express a glutamate transporter (Ma et al., 2018; Poulin et al., 2006). […] Future studies employing additional fluorescent neurotransmitter sensors (Marvin et al., 2013, 2018, 2019) could help understand the interaction between the different signals employed by basal forebrain neurons.”
It may be that strong ACh release activates long-term synaptic plasticity independent of neural responses to CS or reward delivery, and the authors might also mention this possibility.
We agree this is possible and indeed, is likely. We believe that the neural signals induced by CS or reward are likely necessary to induce firing of basal forebrain cholinergic neurons, but once ACh is released, it likely influences the circuit for tens of seconds and induces plasticity that alters BLA responses on a much longer timescale. This is addressed in the Discussion:
“This suggests that the effect of increased cholinergic signaling in the BLA is long lasting, and that stimulation during a learning session is sufficient to potentiate synaptic events linking the cue to a salient outcome – even if CS and/or reward delivery are presented tens of seconds later. […] Coupled with pharmacological evidence demonstrating that muscarinic signaling is necessary for reward learning in this task, these results suggest the involvement of metabotropic signaling downstream of muscarinic receptors that outlasts the initial cholinergic stimulation.”
– How did the authors determine the pattern of light stimulation used in the optogenetic experiments?
We have elaborated on how we determined the pattern of light stimulation in the Materials and methods:
“The stimulation pattern was 25 ms pulses at 20 Hz for 2 sec modified from parameters in (Jiang et al., 2016). […] A 25 ms pulse width was used because our lasers were not able to generate sufficient power with 5 ms pulses.”
Were effects of this stimulation pattern examined in slices (Figure 4B only shows responses to lower frequency stimulation)?
We have now recorded from NBM ChAT+ cells expressing ChR2 in slices at different stimulation frequencies. It appears that all frequencies elicited time locked depolarization in ChAT neurons and action potentials were apparent in slices up to ~10 Hz stimulation. These data are now included in Figure 4—figure supplement 2.
Please state the frequency and duration of ChR2 stimulation in the main text (subsection “Stimulation of cholinergic terminals in BLA improves cue-reward learning”, first paragraph).
We have added the frequency and duration of ChR2 stimulation to these lines:
“After operant familiarization, ChAT+ NBM-BLA terminals were stimulated via bilateral optical fibers (2 sec, 20 Hz, 25 ms pulses) triggered by a correct nose poke throughout both Pre-Training (Figure 4C) and Training (Figure 4D).”
– One of the surprising findings here was that yoked ChR2 stimulation was as effective in enhancing performance. This was surprising given how well timed the endogenous signals are in this task. This finding was not really explored in the Discussion and it appears to contradict interpretation at some key places in the Discussion (e.g., –subsection “Increasing BLA acetylcholine levels enhances cue-reward learning”).
We were also quite surprised by this finding precisely because of the tight time-locking seen in the fiber photometry experiments. We have now expanded the Discussion of the outcome of the non-contingent experiment and more explicitly discuss this contradiction:
“The timing of cholinergic signaling can be a critical factor in the induction of synaptic plasticity in other brain regions, so we hypothesized that the enhancement of cue-reward learning observed might be dependent upon when NBM-BLA terminal fibers were stimulated with respect to tone presentation and/or behavioral responses (Gu and Yakel, 2011). […] Coupled with pharmacological evidence demonstrating that muscarinic signaling is necessary for reward learning in this task, these results suggest the involvement of metabotropic signaling downstream of muscarinic receptors that outlasts the initial cholinergic stimulation.”
Also, the yoked stimulation paradigm may actually be something of a "semi-yoking" procedure in that stimulation was not time-locked to stimulation in the original training group. Have other authors used the term yoked for this exact type of procedure?
We agree that our paradigm is actually closer to “semi-yoking” with the number of stimulations matched to the concurrently running ChR2 counterpart. To avoid confusion, we changed the name of the “Yoked” group to “Non-contingent ChR2” throughout the manuscript and modified the description of the group:
“…we repeated the experiment in an independent cohort of mice with an additional non-contingent ChR2 group that received the same number of stimulation trains as the contingent ChR2 group, but in which light stimulation was explicitly unpaired with task events (Figure 6A + Figure 6—figure supplement 1).”
2) It is surprising that such a complicated conditioning paradigm (i.e. presentation of a time-out period after incorrect responses in addition to reward for correct responses) is required for learning to respond specifically to the reward-predicting CS. Perhaps this is related to the several days of shaping as this may reinforce a strategy of near-automatized nosepoking that is then harder to modify in subsequent conditioning phases. Was it necessary to shape mice for this long?
We were also surprised that a timeout period was required for mice to acquire the contingency. However, in our initial piloting for the behavioral paradigm, mice did not decrease incorrect nose poking after 7 days of Pre-Training, and only decreased incorrect nose poking after including the timeout period. We agree that this could be related to the several days of behavioral shaping before training. The one day of reward receptacle “Free Reward” and 4-5 days of nose poke “FR1 Shaping” were required to ensure all mice in a cohort earned at least 20 rewards for two consecutive days before advancing to Pre-Training. We preferred to advance all mice together after 4-5 days of shaping in order to avoid excluding mice from the experiment. This also ensured that each mouse underwent the same number of training sessions within and across experiments. This was not always possible, though, because in the initial BLA ACh3.0 and BLA CaMKIIα fiber photometry experiments we trained mice outside of the sound attenuating chambers (Figure 2A-E + Figure 3) and required one extra day of FR1 shaping to learn the task. We think it could be interesting to investigate if a timeout period is necessary to learn the cue-reward contingency if mice are advanced individually based on their behavior during shaping, however, for the purposes of these experiments, we chose to prioritize consistency in training across cohorts. (Note: in the revised manuscript we have removed the term “shaping” and changed it to “operant familiarization” to avoid confusion. See point below.)
Also, the task appears to use a Pavlovian-to-instrumental transfer (PIT) conditioning design. If this is the case, perhaps the authors should use that term to describe the paradigm at some point in the manuscript, as that is a generally-accepted term that will be instantly recognized by many readers.
We understand the reviewer’s concern, but we believe there may be a misunderstanding here because of our use of the term “shaping”, which could be confused with the autoshaping commonly seen in PIT. However, the “shaping” performed here was simply familiarization with the operant chamber and the nosepoke response to encourage mice to learn to perform the operant task. To avoid confusion, we now refer to this phase as “operant familiarization”. Mice are not exposed to the auditory tone signaling that an active nose poke can be made to receive a reward until the “Pre-Training” phase of the task. More directly, the mice do not form a Pavlovian association between the tone and reward before being required to nose poke for reward (Cartoni et al., 2016).
Following the comment above, it isn't clear if all mice learned the task, both in the training period and at the end (i.e. were levels of nose poking greater for correct versus incorrect?).
All mice included in fiber photometry figures (Figures 1-3) acquired the task. Not all mice in the optogenetic or antagonist studies reached the criterion for acquisition, however, and the proportion of animals that reached the 20-reward threshold or were excluded is now listed in Supplementary file 1.
The separation of these data between main and supplementary figures, and the required use of different scales, makes it difficult for the reader to understand the comparative frequencies of the behaviours.
We agree that separation of rewards earned and incorrect nose pokes between main and supplementary figures (in the case of the fiber photometry experiments) makes it difficult to compare the frequencies of correct and incorrect responding. We did not include the rewards earned and incorrect nose pokes overlaid in the same graph because the far greater number of incorrect nose pokes during the “Pre-Training” period dwarfs the rewards earned and prevents the overlay from being useful. However, we provide the data used in the figures in a readily plottable format (Excel sheets in Dryad).
The authors should indicate what proportion of mice met the criteria for learning and a clearer statement of the criteria for successful completion of each training phase should also be included in the Materials and methods.
In general, mice progressed from FR1 Nosepoke Training to Pre-Training after 30 rewards were earned by the group for at least two consecutive days (which was when the majority of mice in the group passed the individual threshold for acquisition of 20 rewards earned), typically 4 days. Mice progressed from Pre-Training typically after 4 days or when all mice had earned at least 20 rewards for two consecutive days, whichever came first. Task acquisition for individual animals was defined as earning 20 rewards during the Training phase of the task. Training lasted for 12 days when possible, but was extended for fiber photometry experiments carried outside of the sound attenuating chambers.
For the optogenetic stimulation and pharmacological antagonist studies, some groups included mice that never acquired the task by the end of Training. All of these mice underwent the standard 4 days of Pre-Training and 12 days of Training (as determined by initial task piloting). All mice shown in the fiber photometry figures acquired the task, but some mice in these cohorts did not acquire the task by the end of Training and were excluded from analyses We now include Supplementary file 1 to demonstrate more clearly the proportion of mice that met acquisition criteria, any excluded mice, and any deviations from the standard paradigm.
We have defined task acquisition and criteria more clearly for successful completion of each training phase in the Materials and methods section as follows:
“After the mice reached criterion on FR1 operant familiarization (group average of 30 rewards for 2 consecutive days, usually 4-5 days), mice were advanced to the Pre-Training phase. […] Mice were considered to have acquired the task after earning 20 rewards during the Training phase of the task.”
3) There are a few points with the photometry data that need to be addressed.
– In the photometry experiments, the group sizes are relatively small (4) and on some important control experiments (cross talk between the sensors) only single subject data are shown, and this might be addressed in the manuscript.
We were also concerned about the group sizes of the photometry experiments, so we have added replication groups for the BLA ACh3.0 (4 mice), NBM-BLA ChAT+ terminal fiber GCaMP7s (2 mice), and BLA CaMKII+ GCaMP6s (4 mice) experiments. The BLA ACh3.0 and BLA CaMKII+ GCaMP6s replication cohorts cannot be easily combined with the original cohorts due to differences in the number of behavioral sessions performed, but importantly, the replication studies show comparable outcomes as the original experiments and now all groups have at least one cohort that was trained inside the sound attenuating chambers in the optimized behavioral setup. Since the original and NBM-BLA ChAT+ terminal fiber GCaMP7s cohorts acquired at similar rates, they were combined to allow for cohort bootstrapping. The new BLA ACh3.0 cohort underwent the standard 4 days of Pre-Training, 12 days of Training, and 3 days of Extinction training, while the new NBM-BLA ChAT+ terminal fiber GCaMP7s and BLA CaMKII+ GCaMP6s cohorts ended Training prematurely at 7 days followed by 1 day of Extinction due to the COVID-19 shutdown. Data for all replication cohorts are shown in new supplementary figures, and mouse numbering is continued from original cohorts to avoid confusion.
The sensor cross talk control experiment was a proof of principle with only a single animal so we do not feel justified in drawing strong conclusions from the experiment. We did not refer to it in the Discussion, but wanted to include it as a supplementary figure, since it was a striking example of convergence that we consider as one of many control experiments. We have modified our description of the results from this one animal to further caution readers from drawing overly strong conclusions from the data as follows:
“While this was only a single animal and proof of principle for future studies, we found that NBM-BLA cholinergic terminal activity coincided with ACh levels (Figure 2—figure supplement 9F-G). Importantly, this relationship between ACh release and NBM-BLA terminal fiber activity was not explained by signal crosstalk (Figure 2—figure supplement 9H-I), further indicating that the BLA ACh measured comes at least in part from the NBM.”
The absence of any quantitative analysis of photometry is a also potential issue, but may well be related to sample sizes. The authors do provide a thoughtful and careful descriptive, qualitative analysis of photometry data, but statistical analyses (either using trial or subjects) appear absent. For the reader, this is especially relevant when trying to understand the relevance of signals around the incorrect nosepoke. There are approaches (within-subject bootstrapped confidence intervals to ask if CI includes 0% df/f or permutation tests) that could be applied. Again, the most straightforward and transparent solution could be to explain this choice to readers.
We were hesitant to perform quantitative analyses because of the low sample sizes, as the reviewers inferred, and initially decided to limit ourselves to qualitative descriptions. We thank the reviewers for the suggestion of bootstrapped confidence intervals and have incorporated both within- and across- subject bootstrapped confidence intervals (bCI) to assist readers in interpreting our results (Jean-Richard-dit-Bressel et al., 2020). We have modified our line plots with FP data to incorporate significance bars to denote time bins when bCIs do not contain Z%ΔF/F0 = 0 for at least 0.5 sec. We have also included new bCI plots that show similar data to the tone-poke-reward retrieval heatmaps with indicators of significance. The same subset of days used for the across-mouse averaged data heatmaps was used for an example mouse (within mouse bCI) and averaged (when possible) bCI plots.
– For the fiber photometry data, how were data handled if there was mirroring in the 405 channel? If the data in this channel do not reflect calcium changes, presumably the data in the other channels are most likely artifacts. Were these periods then eliminated from analysis as others have done?
We did not observe gross systematic artefacts or instances when the 405 channel mirrored the 465 channel’s transients. In order to minimize the processing of our data we opted to run the analyses for the reference and signal channels separately instead of using a “reference-corrected” ΔF/F0. In addition to the non-Z-scored % ΔF/F0 comparison of reference and signal channels for correct and incorrect nose pokes, we ran the entire analysis pipeline for the reference channels for each experiment and did not see anywhere near the same magnitude of signal. For the case of the BLA CaMKIIα GCaMP6s signal, mirroring was observed due to a known phenomenon as the sensor binds to calcium (see response to the point below). We did not include a duplication of all graphs to avoid doubling the number of figures, but our Dryad upload includes an option for readers to easily rerun the entire analyses for the reference channel and compare the outputs themselves. We include one example comparison as Author response image 1.
In addition, to avoid confusion, we now explicitly define mirroring in the Materials and methods section to refer only to the situation when the reference channel moves in the opposite direction to the signal channel (see Point below).
– Figure 3—figure supplement 1C is the only instance where the 405 channel shows a noticeable change. Do the authors have any idea why the large decrease was observed in this experiment?
Based on published studies (see citations below), we believe that the mirroring in the 405 channel observed in the BLA CaMKIIα GCaMP experiment is due to a decrease in calcium-unbound GCaMP (as calcium-bound GCaMP increases). This is due to a reference LED that is not at the “true” isosbestic point for GCaMP. We have explained this more clearly:
“For the BLA CaMKIIα cell recordings (Figure 3—figure supplement 1C-D), the reference channel displayed some mirroring (moving in the opposite direction) compared to the signal. This is likely because 405 nm is not the “true” isosbestic point for GCaMP and we were instead measuring some changes in calcium-unbound GCaMP rather than calcium-insensitive GCaMP signal alone (Barnett et al., 2017; Kim et al., 2016; Sych et al., 2019).”
Cartoni, E., Balleine, B., & Baldassarre, G. (2016). Appetitive Pavlovian-instrumental Transfer: A review. Neuroscience and Biobehavioral Reviews, 71, 829–848. https://doi.org/10.1016/j.neubiorev.2016.09.020https://doi.org/10.7554/eLife.57335.sa2
Article and author information
National Institute on Drug Abuse (DA14241)
- Richard B Crouse
- Kristen Kim
- Hannah M Batchelor
- Rufina Kamaletdinova
- Justin Chan
- Steven T Pittenger
- Yann S Mineur
- Marina R Picciotto
National Institute on Drug Abuse (DA037566)
- Richard B Crouse
- Kristen Kim
- Hannah M Batchelor
- Rufina Kamaletdinova
- Justin Chan
- Steven T Pittenger
- Yann S Mineur
- Marina R Picciotto
National Institute of Mental Health (MH077681)
- Richard B Crouse
- Kristen Kim
- Hannah M Batchelor
- Rufina Kamaletdinova
- Justin Chan
- Steven T Pittenger
- Yann S Mineur
- Marina R Picciotto
National Institute of Neurological Disorders and Stroke (NS022061)
- Prithviraj Rajebhosale
- Lorna W Role
- David A Talmage
National Institute of Mental Health (MH109104)
- Prithviraj Rajebhosale
- Lorna W Role
- David A Talmage
National Institute on Drug Abuse (DA046160)
- Xiao-Bing Gao
National Institute of Neurological Disorders and Stroke (Intramural)
- Prithviraj Rajebhosale
- Lorna W Role
National Institute of Mental Health (Intramural)
- Prithviraj Rajebhosale
- David A Talmage
National Institute of Neurological Disorders and Stroke (T32-NS007224)
- Richard B Crouse
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
These studies were supported by grants DA14241, DA037566, MH077681. LW, DT, and PR were supported by NS022061, MH109104 from the National Institutes of Health, and by the intramural programs of NINDS and NIMH. X-BG was supported by DA046160. RBC was supported by T32-NS007224. This work was funded in part by the State of Connecticut, Department of Mental Health and Addiction Services but this publication does not express the views of the Department of Mental Health and Addiction Services or the State of Connecticut. The views and opinions expressed are those of the authors. We thank Samantha Sheppard for the use of her mouse illustration and animal care assistance and Nadia Jordan-Spasov for genotyping and laboratory help. Li Jiang performed the ex vivo current-clamp recordings. Angela Lee and Wenliang Zhou provided helpful input into experimental planning. Colin Bond, Marcelo Dietrich, Usman Farooq, Onur Iyilikci, Sharif Kronemer, Matthew Pettus, and Zach Saltzman provided insightful discussion and assistance with analysis and figure design. Ralph DiLeone, Stephanie Groman, Hyojung Seo, and Jane Taylor offered helpful discussion about experimental design and analysis. The support teams at Doric Lenses (Alex Côté and Olivier Dupont-Therrien) and Tucker-Davis Technologies provided discussion, analysis support, and MATLAB code assistance.
Animal experimentation: All procedures were approved by the Yale University Institutional Animal Care & Use Committee in compliance with the National Institute of Health's Guide for the Care and Use of Laboratory Animals. (protocol: 2019-07895).
- Laura L Colgin, University of Texas at Austin, United States
- Matthew N Hill, University of Calgary, Canada
- David M Lovinger, NIH/NIAAA, United States
- Gavan P McNally, University of New South Wales, Australia
- Received: March 27, 2020
- Accepted: September 17, 2020
- Accepted Manuscript published: September 18, 2020 (version 1)
- Accepted Manuscript updated: September 22, 2020 (version 2)
- Version of Record published: October 1, 2020 (version 3)
This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
- Page views
Article citation count generated by polling the highest count across the following sources: Crossref, Scopus, PubMed Central.
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
In value-based decision making, options are selected according to subjective values assigned by the individual to available goods and actions. Despite the importance of this faculty of the mind, the neural mechanisms of value assignments, and how choices are directed by them, remain obscure. To investigate this problem, we used a classic measure of utility maximization, the Generalized Axiom of Revealed Preference, to quantify internal consistency of food preferences in Caenorhabditis elegans, a nematode worm with a nervous system of only 302 neurons. Using a novel combination of microfluidics and electrophysiology, we found that C. elegans food choices fulfill the necessary and sufficient conditions for utility maximization, indicating that nematodes behave as if they maintain, and attempt to maximize, an underlying representation of subjective value. Food choices are well-fit by a utility function widely used to model human consumers. Moreover, as in many other animals, subjective values in C. elegans are learned, a process we find requires intact dopamine signaling. Differential responses of identified chemosensory neurons to foods with distinct growth potentials are amplified by prior consumption of these foods, suggesting that these neurons may be part of a value-assignment system. The demonstration of utility maximization in an organism with a very small nervous system sets a new lower bound on the computational requirements for utility maximization and offers the prospect of an essentially complete explanation of value-based decision making at single neuron resolution in this organism.
While there is a wealth of knowledge about core object recognition—our ability to recognize clear, high-contrast object images—how the brain accomplishes object recognition tasks under increased uncertainty remains poorly understood. We investigated the spatiotemporal neural dynamics underlying object recognition under increased uncertainty by combining MEG and 7 Tesla (7T) fMRI in humans during a threshold-level object recognition task. We observed an early, parallel rise of recognition-related signals across ventral visual and frontoparietal regions that preceded the emergence of category-related information. Recognition-related signals in ventral visual regions were best explained by a two-state representational format whereby brain activity bifurcated for recognized and unrecognized images. By contrast, recognition-related signals in frontoparietal regions exhibited a reduced representational space for recognized images, yet with sharper category information. These results provide a spatiotemporally resolved view of neural activity supporting object recognition under uncertainty, revealing a pattern distinct from that underlying core object recognition.