Dopaminergic neurons write and update memories with cell-type-specific rules
Abstract
Associative learning is thought to involve parallel and distributed mechanisms of memory formation and storage. In Drosophila, the mushroom body (MB) is the major site of associative odor memory formation. Previously we described the anatomy of the adult MB and defined 20 types of dopaminergic neurons (DANs) that each innervate distinct MB compartments (Aso et al., 2014a, 2014b). Here we compare the properties of memories formed by optogenetic activation of individual DAN cell types. We found extensive differences in training requirements for memory formation, decay dynamics, storage capacity and flexibility to learn new associations. Even a single DAN cell type can either write or reduce an aversive memory, or write an appetitive memory, depending on when it is activated relative to odor delivery. Our results show that different learning rules are executed in seemingly parallel memory systems, providing multiple distinct circuit-based strategies to predict future events from past experiences.
https://doi.org/10.7554/eLife.16135.001Introduction
Animals use memories of past events to predict the future. In some cases, an animal is best served by making a prediction based solely on their most recent experience. In others, a series of experiences is integrated to make a probabilistic prediction, discounting an event experienced only once. How are such different strategies implemented in the brain? One idea is that individual components of a memory—often called engrams—are simultaneously stored in distinct sub-circuits whose outputs can then be combined upon recall to affect behavior. These sub-circuits would vary in their rules for writing, updating and retaining these engrams, having differences in synaptic plasticity and circuit properties (Hikosaka et al., 2014). Experiments aimed at uncovering the mechanisms by which different forms of memory are established and maintained, and then coherently coordinated to drive behavior, are facilitated by using a model system in which the relevant cells and circuits can be identified and manipulated either individually or in specific combinations. In this report, we describe experiments performed in such a model system, the olfactory circuitry of Drosophila melanogaster.
Neuronal circuits for learning associations often share a common architecture: a large array of anatomically similar neurons that represent the sensory environment converge onto a much smaller number of output neurons (Luo, 2015, Dean et al., 2010) (Figure 1A). Punishment or reward activates modulatory neurons that in turn cause changes in the synaptic weight matrix between the neurons representing the sensory cues and the output neurons, resulting in memory formation. The mushroom body (MB) shares this architecture and is the major site of associative learning in Drosophila (Heisenberg et al., 1985; de Belle and Heisenberg, 1994; Dubnau et al., 2001; McGuire et al., 2003; Heisenberg, 2003). Odor identity is represented by a pattern of sparse activity in the ~2000 Kenyon cells (KCs) (Laurent and Naraghi, 1994; Perez-Orive et al., 2002), whose parallel axons form the lobes of the MB. The dendrites of MB output neurons (MBONs) and terminals of dopaminergic input neurons (DANs) tile the KC axons; the extent of the arbors of individual MBONs and DANs overlap precisely, defining 15 distinct compartmental units that tile the MB lobes of adult Drosophila (Tanaka et al., 2008; Aso et al., 2014a).

Learning rules in one MB compartment.
(A) Conceptual diagram of memory circuit organization. The parallel axonal fibers of the Kenyon cells represent odor stimuli, modulatory dopaminergic inputs induce plasticity at KC to MBON synapses in distinct compartments (represented by the shaded rectangles) along the length of these axons and MB output neurons from the compartments read out memory. See Aso et al. (2014a) for more details. (B) Left: Design of the optogenetic olfactory arena. See Figure 1—figure supplement 1 and Supplementary file 1 for details. Right: Diagram illustrating odor paths in the arena. Flies are confined in the circular arena at the center (3-mm high and 10-cm diameter). Video 1 illustrates the pattern of airflow. (C) Diagram of the expression pattern of the split-GAL4 line MB320C driving CsChrimson-mVenus (blue) in PPL1-γ1pedc; see Aso et al. (2014a) and www.janelia.org/split-gal4 for primary image data for this and other split-GAL4 lines. (D) Top: Training protocols. For odor delivery, valves were open for 60 s. For training, thirty 1 s pulses of red light (627 nm peak and 34.9 µW/mm2 at the position of the flies) were applied over 60 s starting 5 s after valve opening. Experiments were done reciprocally: In one group of flies, odor A and B were 3-octanol and 4-methylcyclohexanol, respectively, while in a second group of flies, the odors were reversed. All flies went through the same initial training and test protocol, and then the flies, without removal from the arena, went through one of the four diagramed training and test protocols. Bottom: Time course of the performance index (PI) during first test period (from 4–6 min of the experiment; left) and second test period (from 10–12 min of the experiment; right). The PI is defined as [(number of flies in the odor A quadrants) - (number of flies in odor B quadrants)]/(total number of flies). The average PI of reciprocal experiments is shown. The overall PI, which is reported in Figure 1F and Figures 2 and 3 was calculated by averaging the PIs from the final 30 s of each test period (indicated by the black horizontal line on the time axis). In the left panel, thin lines show individual reciprocal experiments and the thick line the mean of all experiments. In the right panel the mean with error bars representing the SEM are shown. Control genotypes did not show any significant memory in the same training protocols: (1) no driver control, pBDPGAL4 in attP2/20xUAS-CsChrimson in attP18 (PI = 0.07, SEM = 0.037, N = 14); and (2) no effector control, MB320C/w1118 (PI = −0.01, SEM = 0.031, N = 10). (E) A single frame of Video 2 showing the position of odor-conditioned flies at the end of the 1-min test period. Lines show trajectory of four flies. Video 2 shows the behavior of flies in the area for the full 1 min test period. (F) Inter-stimulus-interval (ISI) curve. A single training was done for each experiment. The relative timing of a 10 s delivery of odor A and a 10 s period in which three 1 s light pulse were delivered was varied. The diagram on top illustrates the cases of ISI = −10 s, 0 s and +10 s corresponding to the light pulses starting −10 s, 0 s, or +10 s after the initiation of odor delivery, respectively. The data points and error bars show the mean and SEM for MB320C/CsChrimson-mVenus. N = 10–14. Asterisk indicates significance from 0: *p<0.05; **p<0.01; ***p<0.001; n.s., not significant.
While these compartmental units share a similar general structure, there are important anatomical and functional differences between them. Each compartment contains only one of the three major classes of KCs: γ, α′/β′ and α/β. Anatomical data suggest that each MBON samples from ~90 to ~2000 KCs in one or two compartments, while each KC forms en passant synapses on 5–6 types of MBONs along its axon in the lobes (Aso et al., 2009, 2014a). During associative learning, each DAN is thought to modulate KC-MBON synapses only in its target compartment(s) (Hige et al., 2015; Cohn et al., 2015). Thus, the information encoded by KC activity might contribute to multiple distinct engrams by compartmental specific modulation of KC-MBON en passant synapses. A large body of previous work has established that the KCs, MBONs and DANs innervating individual compartments are differentially involved in forming memories with different valence—that is, appetitive and aversive memories—and with different stabilities (Schwaerzel et al., 2003; Zars et al., 2000; Isabel et al., 2004; Blum et al., 2009; Krashes et al., 2009; Claridge-Chang et al., 2009; Aso et al., 2010; Sejourne et al., 2011; Trannoy et al., 2011; Liu et al., 2012; Burke et al., 2012; Aso et al., 2012; Placais et al., 2013; Pai et al., 2013, Aso et al., 2014b; Lin et al., 2014; Bouzaiane et al., 2015; Ichinose et al., 2015; Yamagata et al., 2015; Owald et al., 2015; Huetteroth et al., 2015). However little is known about the rules for writing and updating memory in each compartment. By using the anatomical map of the MB and cell type specific drivers we reported previously (Aso et al., 2014a, 2014b) in conjunction with newly developed behavioral assays, we have been able to establish that different compartments can employ vastly different learning rules.
Separation of airflow in the four quadrants of arena.
Ammonium chloride smoke was introduced in two opposing quadrants allowing the borders of airflow in the circular arena to be seen. Valves opened at the beginning of the movie. After a ~2 s delay, smoke reached the peripheral of arena, a further ~3 s was required to fill the arena. At a flow rate of 400 mL/min, replacing the 23.5 mL of air in the arena takes ~3.5 s. Elapsed time after valve opening is shown in the upper right.
Conditioned odor avoidance after training with PPL1-γ1pedc.
An example of conditioned odor avoidance behaviors after training using optogenetic activation of PPL1-γ1pedc. The movie shows the 1-min test period (corresponding to the 5 to 6 min time range in Figure 1D left). Colored lines follow the trajectories of four flies. Notice that flies approaching the borders between compartments readily cross into the B quadrants with the control odor, but avoid crossing into the A quadrants with the trained odor.
Results and discussion
Experimental design
Punitive or rewarding stimuli such as electric shock, heat, cold, bitter taste and sugar generally activate a complex pattern of DANs (Riemensperger et al., 2005; Mao and Davis, 2009; ; Liu et al., 2012; Tomchik, 2013; Galili et al., 2014; Das et al., 2014; Kirkhart and Scott, 2015). Believing that a reduction in complexity will be essential to understand the roles played by DAN inputs to different MB compartments, we utilized intersectional split-GAL4 drivers to express CsChrimson, a red-shifted channelrhodopsin, in specific cell types (see Materials and methods). While activation of CsChrimson in these driver lines provides a stimulus that is unlikely to occur naturally, it allowed us to separately examine the memory components induced by individual DAN cell types.
We developed an olfactory arena that allows fine temporal control of both odor delivery and DAN activation using optogenetics (Figure 1B, Video 1 and Figure 1—figure supplement 1; detailed construction documents are included as a Supplementary file 1). In this arena, freely moving flies can be repeatedly trained and tested without the manual handling or temperature changes required in previous assays, thereby minimizing variability that might obscure subtle behavioral effects. These methods allowed us to systematically examine the properties of memories induced in different MB compartments, including: (1) the temporal pairing requirements of odor presentation and DAN activation; (2) the amount of training required for memory formation; (3) retention time; (4) weakening of the conditioned response induced by either DAN activation in the absence of odor presentation or by odor presentation in the absence of DAN activation; (5) the ability to learn new associations; and (6) the capacity to store multiple memories.
Assessing the learning rules in one compartment
Dopamine signaling to KCs has been implicated in both learning and forgetting (Schwaerzel et al., 2003; Schroll et al., 2006, Kim et al., 2007; Tomchik and Davis, 2009; Gervasi et al., 2010; Qin et al., 2012; Placais et al., 2012; Berry et al., 2012; Boto et al., 2014; Shuai et al., 2015). However, it has not been determined if a single DAN cell type can drive both processes within the same compartment. Pairing one of two odors with activation of PPL1-γ1pedc, results in robust aversive memory to the paired odor (Aso et al., 2010; Figure 1C–E and Video 2). That memory is fully retained after 10 min (blue line in Figure 1D) but has largely decayed by 24 hr (Figure 3B). Presentation of the odors alone a few minutes after training resulted in a modest reduction in the conditioned response (orange line in Figure 1D). A second activation of the same DAN a few minutes after training in the absence of odor can almost completely abolish the conditioned response (red line in Figure 1D). Recent imaging data of the γ4 MBON (Cohn et al., 2015) suggest that this reduction most likely results from restoration of the response of the MBON to the odor; that is, erasure of the memory. If, after the first training, contingencies are reversed such that the other odor is presented paired with DAN activation, the first memory is reduced and a memory of the new association is formed (purple line in Figure 1D). Taken together, these data indicate that the same DANs can write a new memory or reduce an existing conditioned response, enabling the flexibility to rapidly change the associations formed between a conditioned stimulus (CS), the odor, and an unconditioned stimulus (US) represented by dopamine release.
In classical conditioning, both the rate of learning and the valence of the resultant memory depend on the relative timing between the CS and the US (Christian and Thompson, 2003; Gerber et al., 2014). Our ability to precisely control DAN stimulation and odor presentation enabled us to examine the CS-US timing relationship (Figure 1F). We found that when PPL1-γ1pedc stimulation fell within a 30-s time window following the onset of a 10-s odor presentation, an aversive memory was formed (negative PI, Figure 1F). Interestingly, we observed that DAN stimulation that precedes odor presentation by 20 to 60 s induced an appetitive memory (positive PI, Figure 1F). These observations are consistent with the notion that it is the predictive aspect of CS-US timing that matters. When the timing is such that the CS predicts a subsequent aversive US, animals learn to avoid the CS. However, animals can also learn that the CS predicts the end of an aversive US, and are consequently attracted to the CS (Tanimoto et al., 2004). It has been suggested that this timing dependency could result from the dynamics of the biochemical signaling cascade acting downstream of dopamine receptors (Yarali et al., 2012).
Pairing activation of PPL1-γ1pedc with odor has been shown to depresses the subsequent spiking rate of the MBON from the γ1pedc compartments in response to the trained odor (Hige et al., 2015). In behavioral assays, optogenetic activation of this MBON was shown to attract flies (Aso et al., 2014b). Taken together, these observations suggest that DAN activation paired with an odor produces an aversive behavioral response to that odor by decreasing the MBON’s attractive output. Thus our data can be most easily explained if this single DAN can bi-directionally alter the strength of KC-MBON synapses depending on the presence and relative timing of odor-driven KC activity; a full testing of this model awaits additional physiological measurements.
Comparing rules for writing memory across compartments
In order to compare the parameters of learning in different MB compartments, we selected a set of additional split-GAL4 drivers that express at similar and high levels in different DAN cell types (diagrammed in Figure 2A; primary imaging data documenting expression patterns can be found at http://www.janelia.org/split-gal4). In addition, the three DAN cell types, which express CsChrimson using the MB320C and MB099C drivers, have been shown to have similar spiking responses to CsChrimson activation (Hige et al., 2015). In three of six cases, we chose drivers expressing in a combination of two cell types because we found that activation of only a single DAN cell type did not produce a sufficiently robust memory (Figure 2—figure supplements 1,2). We confirmed that the lines used in our optogenetic experiments (Figure 2A) showed comparable memory formation when trained with electric shock or sugar reward (Figure 2—figure supplement 3). Together, the selected drivers innervate 11 of the 15 MB compartments. Below we describe the results obtained in a number of different learning assays by activating these split-GAL4 drivers.

Rules for writing memory.
(A) Diagram of the DANs contained in split-GAL4 driver lines, which have been color-coded to facilitate comparison with the plots shown in the subsequent panels. Expression patterns of these drivers, including full confocal stacks, can be found at www.janelia.org/split-gal4. In all experiments, the drivers were crossed with 20xUAS-CsChrimson-mVenus in attP18. (B) Differential effect of training length among DANs. Left: Diagram of the experimental design. Immediate memories formed after paring a 60-s odor presentation with thirty 1-s CsChrimson-activating light pulses (Test 1) were compared with those obtained with a 10-s odor presentation paired with three 1-s light pulses (Test 2). Center: A 60-s training period resulted in significantly better memory performance compared with a 10-s training for MB043C, MB213B and MB315C+MB109B (data from ISI = 0 s for MB099C and ISI =+ 10 s for others drivers were used to provide maximum memory formation; see ISI curves in Figure 2—figure supplement 4). We also observed increased learning with 60-s versus 10-s training (PI of 0.72 versus 0.15) when using R58E02-GAL4 (Liu et al., 2012), a strong GAL4 driver expressed in ~90 PAM cluster DANs that includes all of the ~50 DANs that have expression in MB043C, MB213B and MB315C+MB109B. To facilitate comparison of PI magnitudes, the sign of the PI in this and subsequent panels was reversed for DANs that induced aversive memory (MB320C, MB099C and MB630B). The bottom and top of each box represents the first and third quartile, and the horizontal line dividing the box is the median. The whiskers represent the 10th and 90th percentiles. N = 8–16. Right: Comparison of the effect of training time on memory formation induced by activation of different DANs. Ratios of the mean PI obtained with short training and individual PIs obtained with long training are shown for each driver. Asterisk indicates significance of depicted pairs after comparing all pairs. (C) Comparison of learning after single and repetitive training using the three drivers MB320C, MB099C and MB630B. Either a single training with memory test after 1 min (immediate memory; left) or 10 trainings separated by 15 min resting intervals and then memory tests after 1 (middle) or 4 (right) days were used. Significant aversive 1-day memory was seen with all drivers, while 4-day memory was observed only with MB099C and MB630B. MB320C failed to show 4-day memory despite displaying the most robust immediate memory, while MB630B did not induce significant immediate memory. N = 8–12. Asterisk indicates significance of comparison of indicated pairs in B and from 0 in C: *p<0.05; **p<0.01; ***p<0.001.
Longer and repetitive training has been shown to induce stronger and more persistent memory across animal phyla (Frost et al., 1985; Tully et al., 1994). Consistent with those observations, we found that a 10-s training generally induced memory less effectively than a 60-s training in our immediate memory assay (Figure 2B). We also found that the optimal temporal relationship of DAN activation and odor presentation for memory formation was similar, but not identical, for DANs innervating different MB compartments (Figure 2—figure supplement 4).
Long-term aversive memory in flies requires repetitive electric shock conditioning with resting intervals, so-called spaced training (Tully et al., 1994). We found that two sets of DANs, PPL1-α3 alone (MB630B) or the combination of PPL1-γ2α′1 and PPL1-α′2α2 (MB099C) can induce 1-day and 4-day aversive memory after spaced training (Figure 2C), suggesting that the effects of spaced training can be implemented in individual compartments. Making memory formation dependent on repetitive training might be beneficial by allowing an animal to ignore spurious one-time events. Recent work has shown that the γ2α′1 compartments play key roles in both sleep regulation and long-term memory (Sitaraman et al., 2015; Haynes et al., 2015). The observation that co-activation of PPL1-γ2α′1 and other DANs synergistically prolongs memory retention (Aso et al., 2012) raises the possibility that PPL1-γ2α′1 might act broadly to facilitate memory consolidation by promoting sleep after learning.
We found that a particular DAN’s ability to induce the formation of immediate, 1-day and 4-day memories is not correlated. For example, immediate memory after a single pairing with activation of PPL1-α3 (MB630B) was barely detectable, although multiple activations resulted in 4-day memory (Figure 2C). In contrast, PPL1-γ1pedc (MB320C) activation resulted in robust immediate memory acquisition after a single round of training, but its activation failed to induce 4-day memory even after extensive spaced training (Figure 2C). These results imply the stability of memory is an intrinsic property of the MB compartment, rather than a consequence of the training protocol. In this view, repetitive training with naturalistic stimuli that activate many DAN cell types would recruit additional compartments with slower acquisition rates and the behaviorally assayed retention of memory would reflect the combined memories formed in different compartments (Figure 2—figure supplement 5). It remains an open question whether short-term memories are converted into long-term memories as biochemical changes in the same synapses or whether these memories are formed separately and in parallel. For olfactory learning in Drosophila, our data are consistent with a model in which memory formation and consolidation can occur independently and in parallel in individual MB compartments; this view does not exclude the possibility that network activity facilitates memory consolidation.
Comparing rules for updating memory
We found that the memories induced in different compartments have different stabilities, displaying different dynamics of spontaneous memory decay over a 1-day period (Figure 3A–B). Memories in each compartment also differed in the extent to which they were reduced by a second presentation of the trained odor without reinforcement (Figure 3C).

Rules for updating memory.
(A) Diagram of the DANs contained in split-GAL4 driver lines, which have been color-coded to facilitate comparison with the plots shown in the subsequent panels. In all experiments, the drivers were crossed with 20xUAS-CsChrimson-mVenus in attP18. (B) Memory decay after 1 d. Left: Flies were trained with a 60-s odor delivery during which thirty 1-s pulses of red light were given, and then tested either immediately (Test 1) or after 1 d (Test 2). Center: To facilitate comparison of PI magnitudes, the sign of the PI in this and subsequent panels was reversed for DANs that induced aversive memory (MB320C, MB099C and MB630B). The bottom and top of each box represents the first and third quartile, and the horizontal line dividing the box is the median. The whiskers represent the 10th and 90th percentiles. N = 8–18. Right: Comparison of memory retention times induced by activation of different DANs. Ratios of the mean PI measured at 1d and individual PIs immediately after training are shown for each driver. Asterisk indicates significance of depicted pairs after comparing all pairs. (C) Decrease in the conditioned response by unpaired odor exposure. After the first training and test (Test 1), flies were exposed to both odors without optogenetic DAN activation and then retested (Test 2). In panels C–E, Test 1 and Test 2 were performed on the same group of flies and thus the ratios of individual data points are plotted in the rightmost graphs. N = 10–18. (D) Decrease in the conditioned response by unpaired DAN activation. After first training and test (Test 1), flies were exposed to light to activate DANs (thirty 1-s pulses) without odor delivery, and then retested (Test 2). N = 10–17. (E) Flexibility to learn a new association. After first training and test (Test 1), flies were trained with the opposite pairing of odor and DAN activation and tested for their ability to learn the new pairing (Test 2). N = 10–16. For most DANs, the ability to learn the second association was not significantly impaired by the first training. The exception was MB043C, which expresses in the DAN innervating the α1 compartment. (F) Memory capacity. Flies were trained and tested with two pairs of odors. Odor A and B were pentyl acetate and 3-octanol, respectively. Odor C and D were ethyl lactate and 4-methylcyclohexanol, respectively. N = 8–12. For MB320C (compartments γ1 and pedc) only the most recently trained odor was retained, an effective memory capacity of one. For MB043C (α1 compartment), flies were able to remember both comparisons, demonstrating a memory capacity of at least two. Asterisk indicates significance: n.s. not significant, *p<0.05; **p<0.01; ***p<0.001.
Likewise, DAN activation without odor presentation significantly reduced immediate memory (Figure 3D) for four of the five sets of DANs tested. These two effects might be mechanistically linked as odor presentation alone can result in activation of a subset of dopaminergic neurons (Riemensperger et al., 2005; Mao and Davis, 2009).
In both the case of presentation of odor without dopamine and of dopamine without odor, the association the fly had previously learned is not confirmed. It would make sense for a memory to be diminished when the contingency upon which it is based is found to be unreliable. Consistent with this idea, repetitive spaced training with these same DANs can induce 1-day memory that is resistant to DAN activation (Figure 3—figure supplement 1). The differences we observed between compartments suggest that they weigh the importance of the reliability of the correlation between CS and US differently.
The α1 compartment differed from the other compartments we tested in that it was resistant to memory reduction by DAN activation (Figure 3D). This compartment plays a key role in long-term appetitive memory of nutritious foods (Yamagata et al., 2015; Huetteroth et al., 2015) and has an unusual circuit structure: its MBON (MBON-α1) appears to form synapses on the dendrites of the DAN that innervates the α1 compartment (PAM-α1) forming a recurrent circuit necessary for long-term memory formation (Ichinose et al., 2015). The α1 compartment also showed the least ability to replace an older association with a new one (Figure 3E). This observation suggests that the initial memory may not be affected by the second training, resulting in co-existing appetitive memories for both odors. Indeed, flies were able to retain associations between each of two odors and PAM-α1 (MB043C) activation, while only the most recently learned association was remembered with PPL1-γ1pedc (MB320C) activation (3F). The higher memory capacity of the α1 compartment is not due to generalization, since training with one odor pair did not affect the innate odor preference observed with a different, untrained odor pair. Thus two distinct strategies for updating memories appear to be used in different MB compartments: (1) writing a new memory, while diminishing the old memory; or (2) writing a new memory, while retaining the old memory.
Processing of conflicting memories
Our results suggest that memory formation in each compartment is largely parallel and independent, with compartmental specific rules for updating memories. Such a model of independent memory storage should allow appetitive and aversive memories to be simultaneously formed for the same odor in different compartments. We tested this idea by simultaneously activating DANs to α1 and γ1pedc while exposing flies to an odor (Figure 4A). When flies were tested immediately after training, the odor was strongly aversive, but the same odor became appetitive after 1 day. These results are most easily explained by simultaneous formation of an aversive memory in γ1pedc and an appetitive memory in α1, with rapid decay of the memory in γ1pedc and slow decay in α1 resulting in a shift in valence of the conditioned response over time. However, the fact that we observed strongly aversive immediate memory, rather than an intermediate response, suggests that the MB network non-linearly integrates these conflicting signals. The known feedforward connection between γ1pedc and α1 provides a possible circuit mechanism (Figure 5B; Aso et al. 2014a). Recent studies (Kaun et al., 2011; Das et al., 2014; Aso et al., 2014b) provide further examples most easily explained by parallel induction of conflicting memories of different decay rates. We also found that wild type flies are capable of efficiently switching odor preference when they had conflicting sequential experiences of sugar reward followed by shock punishment with the same odor (Figure 4B).

Processing of conflicting memories.
(A) Flies expressing CsChrimson in PPL1-γ1pedc (VT045661-LexA in JK22C x 13xLexAop2-CsChrimson-tdTomato in attP18), or in PAM-α1 (MB043C x 20xUAS-CsChrimson-mVenus in attP18), or in both PPL1-γ1pedc and PAM-α1 (by combining all four transgenes) were starved for 48 hr and then trained with a 60-s odor delivery during which thirty 1-s pulses of red light were given, and then tested either immediately or after 1 d. N = 8–12. (B) Wild type flies were starved for 48 hr and then trained using 2-min exposures to odor A with sucrose and then to odor B without sucrose and tested for odor preference after 1d (Test 1). Following the first memory test, flies were trained with a 1-min exposure of odor A (the odor previously paired with sugar) and electric shock (twelve 1.25 s pulses of 60V) in the olfactory arena and then with odor B without shock. Odor preference was measured immediately after the second training (Test 2). For comparison, wild type flies starved for same period were conditioned with electric shock and tested immediately (Test 3). N = 8.

Summary of distinct rules for learning and updating memory.
(A) Summary table of distinct learning rules induced by different DAN cell types. Criteria were qualitatively judged to be +, ++, +++ or n.d. (not determined) based on the data presented in Figures 2,3. (B) Diagram summarizing the feed forward network within the MB lobes (compartments shown as boxes) and the convergence of MBONs in common target zones in other brain areas (shown as ovals). These circuit motifs might provide a path through which memories distributed in different MB compartments might be integrated (Aso et al., 2014a). Not shown in this diagram are cases where MBONs target the dendrites of DANs, a circuit motif that is known to occur (Aso et al., 2014a; Ichinose et al., 2015) and could also promote communication between compartments.
Concluding remarks
Our results demonstrate that different MB compartments use distinct rules for writing and updating memories of odors (Figure 5A). By analyzing individual memory components–or engrams–induced by local dopamine release, we found that the interpretation of a common odor representation carried by sparse KC activity to multiple compartments could be modified differently in each of those compartments. We do not know the mechanisms that generate these distinct learning rules. They could arise from differences in the dopamine release properties of different DAN cell types or from local differences in the biochemical response to dopamine signaling in each MB compartment. For example, KCs express four distinct dopamine receptors (Crocker et al., 2016), which might be deployed differently in each compartment. Or they could originate from circuit properties: we know from anatomical (Tanaka et al., 2008; Aso et al., 2014a), behavioral (Ichinose et al., 2015) and functional imaging (Boto et al., 2014; Cohn et al., 2015; Owald et al., 2015) studies that MB compartments can communicate through connections between their extrinsic neurons, the DANs and MBONs, as well as by a layered network within the MB (Figure 5B). In the mammalian brain, associative memories are also stored as distributed and parallel changes with partially overlapping functions (Herry and Johansen, 2014; Hikosaka et al., 2014; Tonegawa et al., 2015); for example, different populations of dopaminergic neurons develop representations of a visual objects’ value with distinct learning rules (Kim et al., 2015). We expect many of the underlying strategies and mechanisms may be shared between flies and other species. Our work provides a foundation for experiments aimed at understanding the molecular and circuit mechanisms by which distributed memory components are written with distinct rules and later integrated to guide memory-based behaviors.
Materials and methods
Fly strains
Request a detailed protocolCrosses of split-GAL4 lines for DANs (Aso et al., 2014a) and 20xUAS-CsChrimson-mVenus in attP18 (Klapoetke et al., 2014) were kept on standard cornmeal food supplemented with retinal (0.2 mM all-trans-retinal prior to eclosion and then 0.4 mM) at 22°C at 60% relative humidity in the dark. Female flies were sorted on cold plates at least 1 d prior to the experiments and 4–10 d old flies were used for experiments. The new split-GAL4 driver, MB630B was designed based on confocal image databases (http://flweb.janelia.org) (Jenett et al., 2012), BrainBase (http://brainbase.imp.ac.at), and constructed by inserting VT026773-p65ADZp in attP40 and R72B05-ZpGAL4DBD in attP2 as described previously (Pfeiffer et al., 2010). VT045661-LexA was constructed as described previously (Pfeiffer et al., 2010) and injected into JK22C (Knapp et al., 2015). The confocal images of expression patterns are available online (http://www.janelia.org/split-gal4). For driving CsChrimson by both MB109B and MB315C, 20xUAS-CsChrimson-mVenus in attP18 was first combined with MB315C, and then crossed with MB109B.
Optogenetic olfactory arena
Request a detailed protocolThe olfactory arena for optogenetics experiments was designed based on the four-field olfactometer (Pettersson, 1970; Vet et al., 1983) and was briefly described in previous reports (Aso et al., 2014b; Hige et al., 2015). The overview of the assay is described in Figure 1—figure supplement 1 and a detailed description of the apparatus is provided in Supplementary file 1; the Janelia TechTransfer office (techtransfer@janelia.hhmi.org) will provide complete construction documentation and CAD files upon request.
Behavioral assay
Request a detailed protocolGroups of approximately 20 females of 4–10 d post-eclosion were trained and tested at 25°C at 50% relative humidity in a dark chamber. The flow rate of input air from each of the four arms was maintained at 100 mL/min throughout the experiments by mass-flow controllers, and air was extracted from the central hole at 400 mL/min. Odors were delivered to the arena by switching the direction of airflow to the tubes containing diluted odors using solenoid valves. The odors were diluted in paraffin oil (Sigma–Aldrich): 3-octanol (OCT; 1:1000; Merck) and 4-methylcyclohexanol (MCH; 1:750; Sigma–Aldrich), Pentyl acetate (PA: 1:5000; Sigma–Aldrich) and ethyl lactate (EL: 1:5000; Sigma–Aldrich). Shock and sugar conditioning was performed as previously described by using tubes with sucrose absorbed Whatman 3 MM paper or copper grids (Figure 2—figure supplement 3) (Aso et al., 2012; Liu et al 2012). For the experiments in Figure 4B, a sheet of copper grid was placed at the bottom of arena. For appetitive memory assays, flies were starved for 24–48 hr on 1% agar. Videography was performed at 30 frames per second and analyzed using Fiji (Schindelin et al., 2012). Statistical comparisons were performed (Prism; Graphpad Inc, La Jolla, CA 92037) using the Kruskal Wallis test followed by Dunn's post-test for multiple comparison, except those in Figure 1F, Figure 2C and Figure 2—figure supplement 4 which used Wilcoxon signed-rank test with Bonferroni correction to compare from zero.
References
-
The mushroom body of adult Drosophila characterized by GAL4 driversJournal of Neurogenetics 23:156–172.https://doi.org/10.1080/01677060802471718
-
Specific dopaminergic neurons for the formation of labile aversive memoryCurrent Biology 20:1445–1451.https://doi.org/10.1016/j.cub.2010.06.048
-
Neural substrates of eyeblink conditioning: acquisition and retentionLearning & Memory 10:427–455.https://doi.org/10.1101/lm.59603
-
Drosophila learn opposing components of a compound food stimulusCurrent Biology 24:1723–1730.https://doi.org/10.1016/j.cub.2014.05.078
-
The cerebellar microcircuit as an adaptive filter: experimental and computational evidenceNature Reviews Neuroscience 11:30–43.https://doi.org/10.1038/nrn2756
-
Drosophila mushroom body mutants are deficient in olfactory learningJournal of Neurogenetics 2:1–30.https://doi.org/10.3109/01677068509100140
-
Mushroom body memoir: from maps to modelsNature Reviews Neuroscience 4:266–275.https://doi.org/10.1038/nrn1074
-
Encoding of fear learning and memory in distributed neuronal circuitsNature Neuroscience 17:1644–1654.https://doi.org/10.1038/nn.3869
-
Basal ganglia circuits for reward value-guided behaviorAnnual Review of Neuroscience 37:289–306.https://doi.org/10.1146/annurev-neuro-071013-013924
-
A Drosophila model for alcohol rewardNature Neuroscience 14:612–619.https://doi.org/10.1038/nn.2805
-
Gustatory learning and processing in the Drosophila mushroom bodiesJournal of Neuroscience 35:5950–5958.https://doi.org/10.1523/JNEUROSCI.3930-14.2015
-
Independent optical excitation of distinct neural populationsNature Methods 11:338–346.https://doi.org/10.1038/nmeth.2836
-
Odorant-induced oscillations in the mushroom bodies of the locustJournal of Neuroscience 14:2993–3004.
-
Neural correlates of water reward in thirsty DrosophilaNature Neuroscience 17:1536–1542.https://doi.org/10.1038/nn.3827
-
An aphid sex attractant. I. Biological studiesInsect Systematics & Evolution 1:63–73.https://doi.org/10.1163/187631270X00357
-
Punishment prediction by dopaminergic neurons in DrosophilaCurrent Biology 15:1953–1960.https://doi.org/10.1016/j.cub.2005.09.042
-
Fiji: an open-source platform for biological-image analysisNature Methods 9:676–682.https://doi.org/10.1038/nmeth.2019
-
Dopamine and octopamine differentiate between aversive and appetitive olfactory memories in DrosophilaJournal of Neuroscience 23:10495–10502.
-
Neuronal assemblies of the Drosophila mushroom bodyThe Journal of Comparative Neurology 508:711–755.https://doi.org/10.1002/cne.21692
-
Dopaminergic neurons encode a distributed, asymmetric representation of temperature in DrosophilaJournal of Neuroscience 33:2166–2176.https://doi.org/10.1523/JNEUROSCI.3933-12.2013
Decision letter
-
Liqun LuoReviewing Editor; Howard Hughes Medical Institute, Stanford University, United States
In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.
Thank you for submitting your article "Dopaminergic neurons write and update memories with cell-type-specific rules" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by a Reviewing Editor, Liqun Luo, and K VijayRaghavan as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: Josh Dubnau (Reviewer #1); Yi Zhong (Reviewer #2); Leslie C Griffith (Reviewer #3).
The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.
All three reviewers, as well as the Reviewing Editor, are highly enthusiastic about the significance, and most of the critiques can be addressed by textual changes. However, the first critique of Reviewer 1 regarding back-crossing represents substantial amount of work if the current data were collected from flies that had not been back-crossed to a common genetic background. We hope that these flies were indeed back-crossed and it was an omission in the methods. If they had not been, after much discussion among the Reviewers and Reviewing Editor, we think that it is necessary to perform additional controls to exclude the possibility that the differences reported in Figures 3,2 are due to different backgrounds of flies harboring individual transgenes.
[We note that in the Aso et al., 2014 paper, you used many conditional manipulations (e.g. shibire). So even though each Gal4 line or split may be on a unique background, you had a within genotype temperature shift manipulation so we could see the acute impact of that cell type. Here this is not the case, and there are direct comparisons between lines and conclusions drawn in a comparative sense between the cell types.]
Specifically, we suggest that you perform standard electric shock/sugar training paradigms with each of the key strains (Chrimson included), and examine short term and long term memory. If each of these strains you use has comparable conditioned responses to electric shock and sugar, we can be more certain that the differences with light-induced training are due to cell-type specific Chr expression, not something in the strain background.
Reviewer #1:
The MB are a highly studied neural center of associative olfactory memory and learning in flies. A large body of prior work establishes that odor driven CS inputs are represented as sparse patterns of activity within the large ensemble of MB Kenyon cslls. US driven dopaminergic neuron inputs are thought to converge onto MB Kenyon cells in the MB lobes, which is the site of Kenyon cell synapses with output neurons. Recent work has established that the different sub-types of dopaminergic neurons (DANs) and output neurons (MBONs) are each restricted within zones of the MB lobes. The anatomical organization of the MB Kenyon cell sub-types also implies that the DAN/MBON zones are restricted to sub-types of Kenyon cells. Layered onto this anatomical model is a large body of work that demonstrates that memory is processed through a series of genetically dissectible phases, that rely on different sub-sets of the MB Kenyon cells. This manuscript investigates very interesting questions about how the anatomical organization of the DANs and MBONs into sub-zones provide functional modules upon which each of the memory phases after appetitive vs aversive memories are built. The questions being asked are important and the world view that the authors propose is beautiful. So on a deep level, I am highly enthusiastic. Unfortunately, however, I have several foundational critiques that undermine the major conclusions of this study. This is really unfortunate, because I think that if properly done, this could be a really important piece of work. Below I outline in roughly descending order of importance, the issues that I have with this study in its current form.
1) If my read of the Methods section is accurate, the authors have made no attempt to control the genetic backgrounds in any of their experiments. This is a fundamental and in my view fatal flaw. There is a long and rich history of this, and it is well established that for all quantitative behavioral traits, genetic background needs to be controlled between all groups for which differences in phenotype are used to draw conclusions. In this case, the authors have used Gal4, and Gal4 reagents that were generated on an outbred background. So every single line has a founder effect, which means that the genetic background of every single strain used is completely unique. As a result, all bets are off in terms of how we interpret differences in magnitude of performance, kinetics of memory decay, and even olfactory acuity and locomotion behaviors that underlie the memory performance. Any and all of the 'rules' that are discovered here that govern how activation of a given DAN might contribute to short term or long-term memory, or to its extinction, etc., could just as easily be due to differences in the kinetics and processing of memory between genetic backgrounds. If I had to guess from my own personal experience, I would wager that a decent fraction of observed quantitative differences between different DANs will fail to reproduce when the genetic backgrounds are equilibrated. We simply cannot fill the literature with this sort of doubt.
2) The authors state up front that strong optogenetic activation of a single class of neuron may not reflect real physiological US inputs. Although some reviewers may have an issue with that, I actually have no conceptual problem with it: I buy the argument that this is a good framework by which to pick apart the rules that govern plasticity at each DAN:KC:MBON zone. However, it is essential to do this sort of analysis in parallel with 'real' learning in which a real US is used (sugar reward, electric shock or some other aversive stimulus). And then within the context of a real US, it is important to query the contribution of that DAN to performance. Otherwise, the insight that we gain from activating a DAN in place of the US is severely reduced.
Critiques that need to be addressed with text revisions:
3) The major dopamine receptor that is established to mediate the US inputs is the type 1 receptor, DopR. Mutations in DopR can be rescued by expression only in MB (Kim et al) and in the case of aversive memory, expression of this receptor only in MB γ neurons is sufficient to fully restore all stages of memory. Given this, how do the authors interpret the fact that activation of DANs that are restricted to zones outside γ lobes are able to induce aversive memories? Is there a different Dopamine receptor at play? Or should we conclude that DANs also co-release other transmitters that are not acting through a dopamine receptor? I am not arguing that the authors need to resolve this issue experimentally, but they cannot ignore this.
4) Long term memory in this field has an operational meaning that is mis-applied here. When this field uses this term, we mean memory that requires new protein synthesis. In general, it has been required for any new behavioral assay that one use inhibitors of protein synthesis and/or establish a requirement for the CREB gene and/or show a difference between massed versus spaced training. Here, the authors use a spaced training protocol, and then refer to a long lived trace as long-term memory. But we really don't know whether this is the same consolidated form of memory that others have discussed. Moreover, the text confounds the impact of multiple training sessions on acquisition versus consolidation.
5) The accepted historical nomenclature for Pavlovian literature is flaunted badly. Extinction has a specific meaning in the literature, and it does not involve presentation of the US absent the CS. So activation of DANs after training should not be called extinction, and it shouldn't be said to extinguish memory. And even in the cases where CS+ is presented post training, it is not clear that the decrement in performance is due to extinction. If I read the methods correctly, this is a degradation of avoidance under continuous odor exposure. So this easily could be due to (e.g.) sensory adaptation. Extinction has specific definitions, and there are behavioral tests for whether extinction has taken place. Similarly, the term 'relieve learning' is not really correct. I realize that this term has been used by Tanimoto et al. But the larger literature on Pavlovian learning has already established terms: conditioned excitation and conditioned inhibition. Relief learning is a rediscovery of CI and CE.
6) The discussion of parallel vs. sequential memory formation is superficial. Isabel et al. and Blum et al. do not impact this issue in the way the authors have implied, and I don't think the findings in this paper bear on this discussion in a compelling way.
In sum, I very much like the outlook and the over-all approach. But this manuscript does not provide compelling data to support the main conclusions. I realize that the bar I am setting would require (e.g.) back-crossing all lines to the same strain and then repeating most or all of the experiments. But this bar has been set, appropriately, by a consensus in this field that conclusions that are drawn from behavioral comparisons with uncontrolled genetic backgrounds do not withstand the test of time. We all live by this standard.
Reviewer #2:
In this manuscript, the authors systematically analyzed the impacts of artificial activation of several DAN subsets on olfactory memory formation, retention, extinction, extinguishment and reverse. The majority of their discoveries are new while a few observations are confirmative of previously published reports. All observations presented in this manuscript together help to clarify the roles of DANs in different memory processes and memory components and build a systematic understanding of involvement of the dopaminergic system in memory. This manuscript is suitable for publication in eLife if the following concerns are addressed:
1) The statement concerning "extinguish" needs to be clarified. To show whether the first odor memory is extinguished, the authors may want to perform a third odor test (test the preference between the first odor and a third odor).
2) The manuscript included 1d and 4d odor memories. It would be of interests to determine whether these memories are long-term one or not through feeding of flies with CXM.
3) The grammatical mistakes should be corrected, such as "each innervate distinct MB compartments" and "a series of experiences is integrated to make a probabilistic prediction".
4) Some technical terms, such as "Drosophila" and Gal4 names, should be written in the right format.
Reviewer #3:
This paper is an interesting and important contribution to our understanding of how dopamine can influence behavior. The last few years have seen a number of high profile papers that suggest that particular subsets of DAergic neurons can have specialized roles in either temporal or functional domains. This paper steps back and takes a very broad view of this issue looking at most of the DAergic neurons and testing their roles in multiple temporal scales in the encoding and extinguishing of appetitive and aversive memories.
The results are important for the field in two ways. First, they demonstrate that essentially all DAergic neurons have some specialist qualities. The previously described "forgetting neurons" and "STM" and "LTM" neurons are not outliers, but rather reflect a basic feature of the system. Second, they provide very strong evidence to buttress the idea that there are multiple parallel streams of memory formation. Short-term memories are not simply converted to long-term memories, but rather they are independently encoded. This paper is well-suited for this eLife format since these data are directly tied to the previous Aso papers and the tools they generated.
My only comments regard putting these data in context of older findings:
1) The figure legends have way too much text. Descriptions of data not presented in a figure should be in the Results section.
2) A subset of PPL1 neurons are activated by odors (Mao & Davis 2009). How does this affect the interpretation of Figure 3C (odor extinction) and Figure 3D (US extinction)?
3) Yarali & Gerber (2010) found that TH-gal4 neurons are not necessary for relief learning. Does this study conflict with these prior findings (i.e., in finding that gamm1-pedc is sufficient for relief learning), or do you propose that there are both DAN and non-DAN mediated relief learning pathways?
https://doi.org/10.7554/eLife.16135.018Author response
All three reviewers, as well as the Reviewing Editor, are highly enthusiastic about the significance, and most of the critiques can be addressed by textual changes. However, the first critique of Reviewer 1 regarding back-crossing represents substantial amount of work if the current data were collected from flies that had not been back-crossed to a common genetic background. We hope that these flies were indeed back-crossed and it was an omission in the methods. If they had not been, after much discussion among the Reviewers and Reviewing Editor, we think that it is necessary to perform additional controls to exclude the possibility that the differences reported in Figures 3,2 are due to different backgrounds of flies harboring individual transgenes.
[We note that in the Aso et al., 2014 paper, you used many conditional manipulations (e.g. shibire). So even though each Gal4 line or split may be on a unique background, you had a within genotype temperature shift manipulation so we could see the acute impact of that cell type. Here this is not the case, and there are direct comparisons between lines and conclusions drawn in a comparative sense between the cell types.]
Specifically, we suggest that you perform standard electric shock/sugar training paradigms with each of the key strains (Chrimson included), and examine short term and long term memory. If each of these strains you use has comparable conditioned responses to electric shock and sugar, we can be more certain that the differences with light-induced training are due to cell-type specific Chr expression, not something in the strain background.
We performed the suggested experiments. These data, which confirm that the strains have comparable conditioned responses, have been added as Figure 2—figure supplement 3.
Reviewer #1:
The MB are a highly studied neural center of associative olfactory memory and learning in flies. A large body of prior work establishes that odor driven CS inputs are represented as sparse patterns of activity within the large ensemble of MB Kenyon cslls. US driven dopaminergic neuron inputs are thought to converge onto MB Kenyon cells in the MB lobes, which is the site of Kenyon cell synapses with output neurons. Recent work has established that the different sub-types of dopaminergic neurons (DANs) and output neurons (MBONs) are each restricted within zones of the MB lobes. The anatomical organization of the MB Kenyon cell sub-types also implies that the DAN/MBON zones are restricted to sub-types of Kenyon cells. Layered onto this anatomical model is a large body of work that demonstrates that memory is processed through a series of genetically dissectible phases, that rely on different sub-sets of the MB Kenyon cells. This manuscript investigates very interesting questions about how the anatomical organization of the DANs and MBONs into sub-zones provide functional modules upon which each of the memory phases after appetitive vs aversive memories are built. The questions being asked are important and the world view that the authors propose is beautiful. So on a deep level, I am highly enthusiastic. Unfortunately, however, I have several foundational critiques that undermine the major conclusions of this study. This is really unfortunate, because I think that if properly done, this could be a really important piece of work. Below I outline in roughly descending order of importance, the issues that I have with this study in its current form.
1) If my read of the Methods section is accurate, the authors have made no attempt to control the genetic backgrounds in any of their experiments. This is a fundamental and in my view fatal flaw. There is a long and rich history of this, and it is well established that for all quantitative behavioral traits, genetic background needs to be controlled between all groups for which differences in phenotype are used to draw conclusions. In this case, the authors have used Gal4, and Gal4 reagents that were generated on an outbred background. So every single line has a founder effect, which means that the genetic background of every single strain used is completely unique. As a result, all bets are off in terms of how we interpret differences in magnitude of performance, kinetics of memory decay, and even olfactory acuity and locomotion behaviors that underlie the memory performance. Any and all of the 'rules' that are discovered here that govern how activation of a given DAN might contribute to short term or long-term memory, or to its extinction, etc., could just as easily be due to differences in the kinetics and processing of memory between genetic backgrounds. If I had to guess from my own personal experience, I would wager that a decent fraction of observed quantitative differences between different DANs will fail to reproduce when the genetic backgrounds are equilibrated. We simply cannot fill the literature with this sort of doubt.
We agree that difference in genetic background can influence behavioral phenotypes, especially for long-term memory. However, we note that the split-GAL4 stocks that we used are much more similar in background than the stocks that the reviewer correctly notes have proved problematic in the past. Those stocks were often genetic mutations or enhancer trap GAL4 lines in widely different genetic backgrounds. While the split-GAL4 lines we used were not back-crossed, these lines are in similar genetic backgrounds: the AD and DBD DNA constructs used in the split-GAL4 lines were injected into the same set of recipient stocks and inserted into the same landing sites, and the X chromosome in all split-GAL4 lines was exchanged with that of a common stock. Also, unlike in many previous experiments that studied the behavioral effects of mutations, we do not test these split-GAL4 lines as homozygotes, but as a population of the heterozygous progeny of a cross between a specific split-GAL4 line and a common 20xUAS-CsChrimson-mVenus stock, thus masking the effect of any recessive genetic differences between the split-GAL4 stocks.
2) The authors state up front that strong optogenetic activation of a single class of neuron may not reflect real physiological US inputs. Although some reviewers may have an issue with that, I actually have no conceptual problem with it: I buy the argument that this is a good framework by which to pick apart the rules that govern plasticity at each DAN:KC:MBON zone. However, it is essential to do this sort of analysis in parallel with 'real' learning in which a real US is used (sugar reward, electric shock or some other aversive stimulus). And then within the context of a real US, it is important to query the contribution of that DAN to performance. Otherwise, the insight that we gain from activating a DAN in place of the US is severely reduced.
We agree that studying the effects of blocking specific DANs after presentation of a “real” US (shock or sugar) will be needed to fully understand associative learning. However, interpreting the results of such experiments is complicated by the fact that shock or sugar activate multiple DANs (Kirkhart, C. & Scott, K. 2015; Mao, Z. & Davis, R. L. 2009; Liu, C. et al. 2012) and almost certainly, given previous studies (Aso et al., 2012) and the results we present in this paper, induce partially redundant memories in multiple compartments in parallel. Thus making conclusions about the roles of individual DANs by studying the behavioral effects of their inactivation is not straightforward and would, at a minimum, require construction of strains that would allow different, specific combinations of DANs to be inactivated together. These experiments are technically challenging and beyond the scope of the current study. Indeed, avoiding the complications of redundancy was a major motivation of our current study’s experimental design: inducing memory in defined compartment(s) by optogenetic stimulation and then assaying memory dynamics using behavioral assays.
Critiques that need to be addressed with text revisions:
3) The major dopamine receptor that is established to mediate the US inputs is the type 1 receptor, DopR. Mutations in DopR can be rescued by expression only in MB (Kim et al) and in the case of aversive memory, expression of this receptor only in MB γ neurons is sufficient to fully restore all stages of memory. Given this, how do the authors interpret the fact that activation of DANs that are restricted to zones outside γ lobes are able to induce aversive memories? Is there a different Dopamine receptor at play? Or should we conclude that DANs also co-release other transmitters that are not acting through a dopamine receptor? I am not arguing that the authors need to resolve this issue experimentally, but they cannot ignore this.
In Figure 2D, we showed that PPL1-α3 can induce 1-day and 4-day memory after 10x spaced training. These data fit well with previous reports that rutabaga rescue in α/β KCs can restore LTM (Blum 2010), but appear at first glance to be inconsistent with the results reported in Qin et al. 2012 that showed full restoration of both immediate and 1-day memory in DopR1 mutants by driving DopR1 in γ KCs but not in α/β KCs. There are many possible explanations for such lack of agreement, including the use of other dopamine receptors in α/β Kenyon cells. Indeed, α/β and γ Kenyon cells have been reported to express high levels of four different dopamine receptors (Crocker et al., 2016). We added a statement about the presence of multiple dopamine receptors to the text.
4) Long term memory in this field has an operational meaning that is mis-applied here. When this field uses this term, we mean memory that requires new protein synthesis. In general, it has been required for any new behavioral assay that one use inhibitors of protein synthesis and/or establish a requirement for the CREB gene and/or show a difference between massed versus spaced training. Here, the authors use a spaced training protocol, and then refer to a long lived trace as long-term memory. But we really don't know whether this is the same consolidated form of memory that others have discussed. Moreover, the text confounds the impact of multiple training sessions on acquisition versus consolidation.
We have modified the text to use the simple operational description of “4-day” memory throughout, rather than “long-term” memory, which we agree implies specific attributes that were not assessed by our experiments.
5) The accepted historical nomenclature for Pavlovian literature is flaunted badly. Extinction has a specific meaning in the literature, and it does not involve presentation of the US absent the CS. So activation of DANs after training should not be called extinction, and it shouldn't be said to extinguish memory. And even in the cases where CS+ is presented post training, it is not clear that the decrement in performance is due to extinction. If I read the methods correctly, this is a degradation of avoidance under continuous odor exposure. So this easily could be due to (e.g.) sensory adaptation. Extinction has specific definitions, and there are behavioral tests for whether extinction has taken place. Similarly, the term 'relieve learning' is not really correct. I realize that this term has been used by Tanimoto et al. But the larger literature on Pavlovian learning has already established terms: conditioned excitation and conditioned inhibition. Relief learning is a rediscovery of CI and CE.
We removed the terms “extinction” and “relief learning” from the text. Instead we simply describe our experiments in operational terms.
6) The discussion of parallel vs. sequential memory formation is superficial. Isabel et al. and Blum et al. do not impact this issue in the way the authors have implied, and I don't think the findings in this paper bear on this discussion in a compelling way.
We have modified this section of the text and removed these two citations. We agree that our data to not resolve the issue of parallel vs. sequential memory formation and now simply say that our results “provide support for” parallel learning.
We also added an additional experiment that we feel bears directly on the issue of parallel memory (new Figure 4).
Reviewer #2:
In this manuscript, the authors systematically analyzed the impacts of artificial activation of several DAN subsets on olfactory memory formation, retention, extinction, extinguishment and reverse. The majority of their discoveries are new while a few observations are confirmative of previously published reports. All observations presented in this manuscript together help to clarify the roles of DANs in different memory processes and memory components and build a systematic understanding of involvement of the dopaminergic system in memory. This manuscript is suitable for publication in eLife if the following concerns are addressed:
1) The statement concerning "extinguish" needs to be clarified. To show whether the first odor memory is extinguished, the authors may want to perform a third odor test (test the preference between the first odor and a third odor).
We removed the terms “extinction” and “relieve learning” from the text. Instead we simply describe our experiments in operational terms.
2) The manuscript included 1d and 4d odor memories. It would be of interests to determine whether these memories are long-term one or not through feeding of flies with CXM.
We have modified the text to use the simple operational description of “4-day” memory throughout, rather than “long-term” memory, which we agree implies specific attributes that were not assessed by our experiments.
3) The grammatical mistakes should be corrected, such as "each innervate distinct MB compartments" and "a series of experiences is integrated to make a probabilistic prediction".
Done. (“Series” is a singular noun.)
4) Some technical terms, such as "Drosophila" and Gal4 names, should be written in the right format.
Done.
Reviewer #3:
This paper is an interesting and important contribution to our understanding of how dopamine can influence behavior. The last few years have seen a number of high profile papers that suggest that particular subsets of DAergic neurons can have specialized roles in either temporal or functional domains. This paper steps back and takes a very broad view of this issue looking at most of the DAergic neurons and testing their roles in multiple temporal scales in the encoding and extinguishing of appetitive and aversive memories.
The results are important for the field in two ways. First, they demonstrate that essentially all DAergic neurons have some specialist qualities. The previously described "forgetting neurons" and "STM" and "LTM" neurons are not outliers, but rather reflect a basic feature of the system. Second, they provide very strong evidence to buttress the idea that there are multiple parallel streams of memory formation. Short-term memories are not simply converted to long-term memories, but rather they are independently encoded. This paper is well-suited for this eLife format since these data are directly tied to the previous Aso papers and the tools they generated.
My only comments regard putting these data in context of older findings:
1) The figure legends have way too much text. Descriptions of data not presented in a figure should be in the Results section.
We moved some text from figure legends into Results as suggested.
2) A subset of PPL1 neurons are activated by odors (Mao & Davis 2009). How does this affect the interpretation of Figure 3C (odor extinction) and Figure 3D (US extinction)?
We agree that the results of Mao & Davis (2009) showing that odor presentation alone can result in activation of a subset of dopaminergic neurons (different to those that we studied here) raises the possibility that activated dopaminergic neurons might play a role in the phenotype we report in Figure 3C. We have added a comment to that effect in the text.
3) Yarali & Gerber (2010) found that TH-gal4 neurons are not necessary for relief learning. Does this study conflict with these prior findings (i.e., in finding that gamm1-pedc is sufficient for relief learning), or do you propose that there are both DAN and non-DAN mediated relief learning pathways?
This apparent conflict could be explained by redundancy among DANs, since neither TH-GAL4 nor DDC-GAL4 include all the DANs to MB.
https://doi.org/10.7554/eLife.16135.019Article and author information
Author details
Funding
Howard Hughes Medical Institute
- Yoshinori Aso
- Gerald M Rubin
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank Igor Negrashov, Steven Sawtelle, Peter Polidoro, William Rowell, Jinyang Liu, Alice Robie, Kristin Branson, Chuntao Dan, Roman Huber and Hiromu Tanimoto for help in establishing the olfactory arena. Brandi Sharp, James McMahon, Lori Laughrey, Teri Ngo and the Janelia Fly Facility helped in fly husbandry. Rebecca Vorimo, Allison Sowell and the FlyLight Project Team performed brain dissections and histological preparations. VT026773-p65ADZp was a gift from Barry J Dickson. Heather Dionne made new molecular constructs. We thank Gowan Tervo, Joshua Dudman, Ulrike Heberlein, TJ Florence, Yichun Shuai, Kit Longden, Adam Hantman, Daisuke Hattori, Larry Abbott, Richard Axel, Chuntao Dan, Krystyna Keleman, Toshihide Hige, Glenn Turner, Toshiharu Ichinose, Nobuhiro Yamagata and Hiromu Tanimoto for stimulating discussions and for comments on earlier drafts of the manuscript.
Reviewing Editor
- Liqun Luo, Howard Hughes Medical Institute, Stanford University, United States
Version history
- Received: March 21, 2016
- Accepted: July 18, 2016
- Accepted Manuscript published: July 21, 2016 (version 1)
- Version of Record published: August 16, 2016 (version 2)
Copyright
© 2016, Aso et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 10,867
- Page views
-
- 2,150
- Downloads
-
- 140
- Citations
Article citation count generated by polling the highest count across the following sources: Scopus, Crossref, PubMed Central.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Neuroscience
Animals discriminate stimuli, learn their predictive value and use this knowledge to modify their behavior. In Drosophila, the mushroom body (MB) plays a key role in these processes. Sensory stimuli are sparsely represented by ∼2000 Kenyon cells, which converge onto 34 output neurons (MBONs) of 21 types. We studied the role of MBONs in several associative learning tasks and in sleep regulation, revealing the extent to which information flow is segregated into distinct channels and suggesting possible roles for the multi-layered MBON network. We also show that optogenetic activation of MBONs can, depending on cell type, induce repulsion or attraction in flies. The behavioral effects of MBON perturbation are combinatorial, suggesting that the MBON ensemble collectively represents valence. We propose that local, stimulus-specific dopaminergic modulation selectively alters the balance within the MBON network for those stimuli. Our results suggest that valence encoded by the MBON ensemble biases memory-based action selection.
-
- Neuroscience
Medial frontal cortical areas are thought to play a critical role in the brain’s ability to flexibly deploy strategies that are effective in complex settings, yet the underlying circuit computations remain unclear. Here, by examining neural ensemble activity in male rats that sample different strategies in a self-guided search for latent task structure, we observe robust tracking during strategy execution of a summary statistic for that strategy in recent behavioral history by the anterior cingulate cortex (ACC), especially by an area homologous to primate area 32D. Using the simplest summary statistic – strategy prevalence in the last 20 choices – we find that its encoding in the ACC during strategy execution is wide-scale, independent of reward delivery, and persists through a substantial ensemble reorganization that accompanies changes in global context. We further demonstrate that the tracking of reward by the ACC ensemble is also strategy-specific, but that reward prevalence is insufficient to explain the observed activity modulation during strategy execution. Our findings argue that ACC ensemble dynamics is structured by a summary statistic of recent behavioral choices, raising the possibility that ACC plays a role in estimating – through statistical learning – which actions promote the occurrence of events in the environment.