Generalization of learned responses in the mormyrid electrosensory lobe

  1. Conor Dempsey
  2. LF Abbott
  3. Nathaniel B Sawtell  Is a corresponding author
  1. Columbia University, United States

Abstract

Appropriate generalization of learned responses to new situations is vital for adaptive behavior. We provide a circuit-level account of generalization in the electrosensory lobe (ELL) of weakly electric mormyrid fish. Much is already known in this system about a form of learning in which motor corollary discharge signals cancel responses to the uninformative input evoked by the fish’s own electric pulses. However, for this cancellation to be useful under natural circumstances, it must generalize accurately across behavioral regimes, specifically different electric pulse rates. We show that such generalization indeed occurs in ELL neurons, and develop a circuit-level model explaining how this may be achieved. The mechanism involves regularized synaptic plasticity and an approximate matching of the temporal dynamics of motor corollary discharge and electrosensory inputs. Recordings of motor corollary discharge signals in mossy fibers and granule cells provide direct evidence for such matching.

https://doi.org/10.7554/eLife.44032.001

Introduction

A learned response that is adaptive only in the precise context in which it was learned is of limited value in the real world. Though cellular and synaptic underpinnings of learning have been elucidated in many systems, less is known about the mechanisms that allow learning to generalize appropriately to conditions different from those in which the learning originally took place (Censor, 2013; Fahle, 2005; Poggio and Bizzi, 2004). We address the question of generalization of learned responses in the passive electrosensory system of weakly electric mormyrid fish. These fish, like a number of other aquatic animals, possess a specialized class of electroreceptors on their skin that are sensitive to minute, low-frequency electrical fields, such as those emitted by other animals in the water (Engelmann et al., 2010; Enikolopov et al., 2018; von der Emde and Bleckmann, 1998). However, the detection and processing of such signals is complicated by the fact that mormyrid fish also emit their own pulsed electric fields, known as electric organ discharges (EODs). Though EODs are used for sensing nearby objects through active electrolocation as well as for communication with conspecifics (processes mediated by separate classes of electroreceptors), they also strongly activate the receptors subserving passive electrolocation, inducing a ringing pattern of activation that persists for ~200 ms (Bell and Russell, 1978). If left uncancelled, these responses to the fish’s own EOD could impede the detection and processing of behaviorally-relevant signals such as prey (Enikolopov et al., 2018).

Past work has suggested that this problem is solved in ELL neurons through the integration of electrosensory input and corollary signals (CD) related to the motor command to discharge the electric organ (Bell et al., 1981). CD signals are conveyed to ELL neurons by granule cells, similar to the granule cells of the cerebellum (Bell et al., 2008). Anti-Hebbian plasticity at synapses between granule cells and ELL neurons generates negative images that serve to cancel the effects of the EOD on ELL output (Bell, 1981; Bell et al., 1993; Bell et al., 1997a) (Figure 1A). However, all past studies of negative image formation and sensory cancellation were restricted to periods when fish emitted EOD commands at low, regular rates (~5 Hz). Although this pattern is typical of paralyzed preparations, the fish’s actual electromotor behavior is far more dynamic. For example, in freely behaving fish it is common to observe prolonged periods of discharge at low rates (1–5 Hz), while resting or hiding, followed by abrupt transitions to much higher rates (up to 60 Hz) when foraging for prey or exploring a novel object (Figure 1B); (Hofmann et al., 2014; Moller et al., 1989; Schwarz and von der Emde, 2001; Toerring and Moller, 1984).

Cancelling the effects of the EOD under natural conditions requires generalization.

(A) Schematic of ELL circuit elements responsible for cancellation of self-generated electrosensory responses. Granule cell corollary discharge responses form a temporal basis (blue trace at left) that is shaped by an anti-Hebbian spike-timing dependent synaptic plasticity rule into a negative image of the predictable sensory response to an EOD (blue trace at right). Signals related to the EOD (orange traces, left and right), along with behaviorally relevant stimuli that the system is meant to detect (not shown), are conveyed by afferent fibers (orange) originating from electroreceptors on the skin. Question mark indicates the process of sensory cancellation being studied. (B) A sequence of inter-EOD intervals recorded in a freely swimming mormyrid fish, adapted with permission from Toerring and Moller (1984). Note the wide range of discharge rates and abrupt transition from lower, irregular rates to a high regular rate (arrow). Such transitions highlight the need for negative images to generalize across different EOD rate regimes.

https://doi.org/10.7554/eLife.44032.002

During such transitions, negative images learned during low-frequency resting periods should generalize to higher EOD rates. If they do not, passive electrolocation would be degraded at precisely the moment when it would seemingly be most needed. Furthermore, this generalization must be accurate because, at high frequencies, the ringing sensory receptor responses to EODs overlap and, if uncancelled, would continuously interfere with the detection of external stimuli such as prey. Using microstimulation of the EOD motor command pathway to control EOD rate, we show that, indeed, sensory cancellation in ELL output neurons generalizes across EOD rates. In theory, such generalization is expected if electrosensory and corollary discharge responses at high rates were simply the linear sum of the responses at low rates. We show that this is not the case and, instead, identify two key features that, when added to existing models of sensory cancellation in ELL, account for generalization. The first is regularization of synaptic plasticity between granule cells and ELL neurons to prevent overfitting, which is closely related to machine learning approaches to generalization. The second feature, which we support directly by recordings from granule cells and their mossy fiber inputs, involves an approximate matching between the EOD rate-dependence of corollary discharge and electrosensory inputs to ELL neurons.

Results

Sensory cancellation in ELL output cells generalizes from low to high EOD rates

We first tested whether sensory cancellation in ELL output cells generalizes across different EOD rates. As in past studies, we used a preparation in which the EOD is blocked by a paralytic, but in which fish are alert and continue to generate the motor commands that normally evoke EODs. The electric field normally generated by the EOD is mimicked experimentally. This preparation permits study of the responses to motor corollary discharge inputs triggered by the EOD command (by turning off the mimic), the sensory response to the artificially produced EOD mimic (by generating the mimic in the absence of an EOD command), and the response to EOD mimics paired with the EOD command. The paired condition replicates the natural situation in which the EOD command evokes an EOD pulse and both electrosensory and corollary discharge pathways are engaged together.

Past studies have shown that the response to locally delivered EOD mimics triggered by the EOD command are cancelled if mimics are paired with commands in this way over ~15 min. For this reason, we will use the term ‘learning’ to refer to extended periods when EOD mimics are triggered by, and hence paired with, commands. Turning the mimic off after learning reveals that the response to the command alone resembles a negative image of the response to the mimic in the absence of a command (Bell, 1981, Bell, 1982). As discussed in the Introduction, a limitation of past studies is that cancellation and negative images were only studied at the low EOD command rates (~5 Hz) typical of the paralyzed preparation. We overcame this limitation by using microstimulation of the electromotor command pathway (see Materials and methods) to control the timing and rate of EOD commands (von der Emde et al., 2000). Using this approach, we could achieve almost perfect control over the timing of EOD commands at rates up to 50 or 60 Hz.

Extracellular single-unit recordings were made from output cells in the region of the ELL dedicated to passive electrosensory processing—the ventrolateral zone (VLZ). These output neurons are classified into two types, known as E and I cells, according to the polarity of their response to electrosensory stimuli (Bell, 1981, Bell, 1982). To avoid firing-rate rectification, which complicates quantitative measurements of sensory cancellation, we adjusted the polarity of the EOD mimic to evoke excitatory responses in both E and I cells (see Materials and methods). Consistent with previous findings (Bell, 1982; Enikolopov et al., 2018), no obvious differences in plasticity were observed between E and I cells and responses were pooled.

To test generalization, we paired evoked commands with EOD mimics at a single constant rate (10 Hz) for a 10–20 min learning period (by which time significant cancellation had occurred; Figure 2A, top row) and then probed responses to EOD mimics paired across a range of rates (10, 40, and 60 Hz or 10, 30, and 50 Hz). Responses after learning are reduced across rates even though learning occurred at only the lowest rate, consistent with generalization of cancellation (Figure 2A, bottom row, solid lines). An additional set of experiments were performed to provide a benchmark for evaluating the quality of generalization. In this case, the EOD mimic was paired for the same duration but this time learning took place at all the different frequencies that were subsequently tested for cancellation (10, 40, and 60 Hz or 10, 30, and 50 Hz; Figure 2B). In this scenario, for which no generalization is required, we expect the system to achieve the best level of cancellation across all rates that can be achieved on the timescale of these experiments. The degree of cancellation, measured as the residual power in the response after learning divided by the power before learning, was comparable in the two sets of experiments (Figure 2C,D), indicating excellent generalization.

Figure 2 with 1 supplement see all
Sensory cancellation in ELL output cells generalizes from low to high EOD rates.

(A) Top, pre-learning response of an ELL output cell to a sequence of mimics triggered by EOD commands at 10 Hz. Shaded box indicates the learning condition. Empty dashed box indicates that no learning was performed at 60 Hz in this series of experiments. Red ticks show the times of EOD commands and black ticks show the times of EOD mimics. Bottom, response of the same cell after learning at 10 Hz. Dashed line is the response of the cell to the EOD mimic presented independent of the command after learning. Note, the response to the mimic is largely cancelled at both 10 and 60 Hz even though learning occurred only at 10 Hz. Responses were also probed at 40 Hz in this cell with similar results (not shown). Scale bar is 1 s. (B) Top, pre-learning responses of an ELL output cell to paired EOD command and mimic sequences at 10 and 60 Hz. Shaded boxes indicate that learning took place at both 10 Hz and 60 Hz. Bottom, the response of the same cell after learning. Learning was also conducted at 40 Hz in this cell with similar results (not shown). (C) Degree of cancellation at each rate for learning only at 10 Hz, expressed as the ratio of the power of the residual response after learning to the power of the pre-learning response (n = 17, median residual power ratios are 0.34, 0.48, 0.63 at 10, 40, and 60 Hz respectively). (D) Degree of cancellation at each rate when learning and testing were at the same frequencies of 10, 40, and 60 Hz, expressed as in C (n = 12, median residual power ratios are 0.36, 0.49, 0.61 at 10, 40, and 60 Hz respectively).

https://doi.org/10.7554/eLife.44032.003

Past studies have shown that cancellation of predictable electrosensory responses is due to the generation and subtraction of negative images (Bell, 1981, Bell, 1982). Several observations suggest that the cancellation observed in Figure 2 is likewise due to the formation of negative images. First, cancellation is unlikely to be due to adaptation of peripheral receptors or neuronal fatigue as we routinely probed responses to the EOD mimic delivered independently of the command both before and after learning (Figure 2A, bottom, dashed lines). Reductions in the response to the mimic alone were never observed. Second, in a subset of experiments we probed responses to the command alone across EOD rates after learning only at a low rate. Changes in the response to the command alone resembled a negative image of the response to the mimic sequence (Figure 2—figure supplement 1).

Regularized synaptic plasticity partially explains generalization

To gain insights into the mechanisms that support generalization, we adapted a previously developed model of negative image formation and sensory cancellation in the ELL (Kennedy et al., 2014). The model ELL neuron receives two classes of inputs. The first is a non-plastic electrosensory input that we simulated by using the recorded response of an ELL output cell to an EOD mimic sequence. This corresponds anatomically to the input onto the basilar dendrites of ELL neurons from interneurons in the deep layers of ELL receiving somatotopic input from ampullary electroreceptor afferents (Meek et al., 1999). The second class of inputs consists of a set of ~20,000 model granule cell responses conveying corollary discharge signals related to the EOD command. This corresponds anatomically to excitatory granule cell-parallel fiber synapses onto the apical dendrites of ELL neurons. The model is simplified in that it does not differentiate between two distinct classes of ELL neurons: output cells and medium ganglion (MG) cells (see Discussion). Granule cells are modeled as integrate-and-fire units receiving inputs generated from recorded responses of mossy fibers and unipolar brush cells (the main excitatory inputs to granule cells) to isolated EOD commands (>200 ms intervals between commands (Kennedy et al., 2014). This granule cell model is one component of the full model; the other is a mathematical description of the plasticity of synapses from granule cells to ELL neurons (Bell et al., 1997a; Han et al., 2000). The anti-Hebbian spike timing-dependent plasticity rule used in the model includes a regularization mechanism to prevent excessively large synaptic weights. Regularization consists of having the synaptic weights decay exponentially toward a baseline value with a time constant of 1000 s, in addition to their modification due to anti-Hebbian plasticity. We refer to this version of the plasticity rule as minimally regularized (see Materials and methods).

To explore mechanisms of generalization using this model, we first needed to extend its granule cell component to simulate high EOD command rates. To begin, we made simple assumptions about how the previously recorded mossy fibers and unipolar brush cells would respond at higher command rates (see Materials and methods). For example, the most common class of mossy fiber inputs, known as early, fire a precisely-timed burst of spikes (duration ~12 ms) at a short delay after each EOD command. To create early mossy fibers responses to command sequences at different EOD rates, we simply repeated the same burst pattern and timing for each command in the sequence (see Materials and methods for assumptions used for other response types; Figure 3—figure supplement 1). Later, we will replace these simple assumptions with results derived from experimental measurements of the true EOD-rate dependence of mossy fiber and other inputs. We refer to the granule cell model without these later modifications as the original model.

Using the original model with minimal regularization, we first simulated the generalization experiment in which the system is repeatedly exposed to 10 Hz sequences of EODs for learning and cancellation and then tested at various rates. In agreement with the experimental results, plasticity in the model gradually reduces ELL neuron responses to the EOD, and cancellation is accurate when it is subsequently tested at 10 Hz (Figure 3A, lower left). However, in contrast to the experimental results, the model exhibits a dramatic over-cancellation when tested at higher EOD rates (Figure 3A, lower right). To determine whether this resulted from a failure of learning or generalization, we simulated the experiments in which the system was trained at all the rates at which it is tested. Under these conditions, the model ELL neuron learns to cancel sensory responses at all the rates tested (Figure 3B, lower panels). This indicates that the model can learn to cancel at different EOD rates but fails to generalize low-frequency learning to high EOD rates.

Figure 3 with 1 supplement see all
Regularization of synaptic plasticity improves but does not fully account for generalization.

(A) Top, pre-learning response of a model ELL neuron to paired EOD command and mimic sequences delivered at 10 Hz. Shaded box indicates the learning condition. Red ticks show the times of EOD commands and black ticks show the times of EOD mimics. Bottom, response of the model cell after learning. The response to the mimic is largely cancelled at 10 Hz but is dramatically over-cancelled at 60 Hz. (B) Top, pre-learning response of a model ELL neuron to paired EOD command and mimic sequences at 10 and 60 Hz. Shaded boxes indicate the learning conditions. Bottom, response after learning. The response is largely cancelled at both 10 and 60 Hz. (C) Degree of cancellation at 10 Hz and 60 Hz for model and real cells across rates when training occurred at both rates. (D) Degree of cancellation at 10 Hz for real cells and model cells, with full and minimal regularization, when learning was only at 10 Hz. (E) Degree of cancellation at 60 Hz for real and model cells, with full and minimal regularization, when learning was only at 10 Hz.

https://doi.org/10.7554/eLife.44032.005

Cancellation performance is comparable between model and data when generalization is not required because training is at both 10 Hz and 60 Hz (Figure 3C, data and minimal regularization). Interestingly, when learning is only at 10 Hz, cancelation at 10 Hz is actually better in the minimally regularized model than in the data (Figure 3D, data and minimal regularization). This is consistent with overfitting, a feature that is expected to limit generalization. Indeed, when generalization is required, real neurons outperform the minimally regularized model by a large margin (Figure 3E, data and minimal regularization). These results show that: (1) our current understanding of ELL circuitry cannot explain the ability of the system to cancel the sensory consequences of EOD sequences in a manner that generalizes from low to high rates and (2) this is not due to an inability of the model system to cancel across rates but is specifically a failure of generalization.

One strategy for improving generalization that is commonly used in machine learning is regularization (Bishop, 2006). To enhance regularization, we decreased the decay time constant for the synaptic weights from 1000 s to 10 s. We also changed the value toward which the weights decay from zero to a non-zero baseline (see Materials and methods). The utility of this latter change will be discussed in a later section. We refer to this modified plasticity rule as fully regularized.

When training is performed at both 10 Hz and 60 Hz, cancellation in the fully regularized model is similar to the data and to the minimally regularized model (Figure 3C). When trained only at 10 Hz, the fully regularized model matches the data better than the minimally regularized model, presumably by avoiding overfitting (Figure 3D). Consistent with this, the fully regularized model exhibits substantially improved generalization across rates compared to the minimally regularized model (Figure 3E, minimal and full regularization). However, the fully regularized model still fails to match the generalization performance seen in the data (Figure 3E, data and full regularization). These results suggest that the original model is subject to overfitting and that regularization provides a partial solution. However, additional mechanisms are required to match the data. We reasoned that this failure likely reflects the inadequacy of the assumptions we made about how granule cells respond to high-rate EOD commands.

Rate dependence of granule cell corollary discharge responses in vivo

The corollary discharge responses of granule cells provide the ‘raw material’ from which negative images are sculpted via synaptic plasticity, and hence they are critical for sensory cancellation. Although the granule cell corollary discharge responses used in our model are based on an extensive set of recordings, all of the data was collected in the context of isolated EOD commands (Kennedy et al., 2014). As mentioned above, we modeled cancellation at high rates based on assumptions about how mossy fiber inputs to granule cells respond at high command rates. The failure of our original model to match the generalization performance seen in the data, even with full regularization, may indicate that these assumptions are incorrect. To test this, we used whole-cell recordings to characterize corollary discharge responses across EOD rates for 28 granule cells (see Materials and methods).

Most recorded granule cells (21 of 28) exhibited a prominent (~8 mV), short-latency (~2.5 ms) depolarization in response to spontaneously emitted EOD commands. Previous studies have shown that this response type, known as early, is due to mossy fiber input originating from a specific midbrain nucleus that relays electric corollary discharge information (Bell et al., 1983). Command-locked hyperpolarizations, indicative of inhibition, were rarely observed, also consistent with past studies. After characterizing responses to spontaneous commands, microstimulation of the electromotor command pathway was used to evoke trains of 25 commands at rates of 10–60 Hz (for clarity, only responses to low and high rates are shown in the figures). As can be seen in the example traces in Figure 4A, command-evoked depolarizations show little or no temporal summation at high command rates, with some cells even exhibiting a relatively hyperpolarized membrane potential at high versus low rates (not shown). Additional examples are shown in Figure 4—figure supplement 1. The responses of recorded granule cells contrast with those of the original model, which show pronounced summation and membrane potential depolarization at high rates (Figure 4—figure supplement 2, Figure 5B).

Figure 4 with 5 supplements see all
Command rate-dependence of granule cells and their mossy fiber inputs.

(A) Membrane potential of a granule cell in response to a 60 Hz sequence of 25 EOD commands, with stimulus artifacts removed. Red ticks show the times of EOD commands. (B) The response to a single command at 10 Hz along with the time at which a subsequent command would occur at a rate of 60 Hz (red arrow). Bottom trace is the electromotoneuron volley recorded near the electric organ. Same cell as in (A). (C) Distribution of median percentage increase in maximum membrane voltage from 10 Hz to 60 Hz command rates across n = 28 model granule cells. Dashed line shows the experimental value of 0.006 (p=0.03). (D) Distribution of median slope of membrane voltage in response to a 60 Hz sequence for model granule cells, dashed line shows value from the data, −0.43 mV/s (p<0.002). (E) Initial portion of the response of the granule cell shown in panel (A). Black lines are data with stimulus artifacts removed, blue dashed line shows a fit using a model granule cell with input spike times inferred from the recorded membrane voltage. (F) Example traces from an early mossy fiber recorded extracellularly in the granular layer. Responses to 25 commands in a 10 Hz (top), 40 Hz (middle), or 60 Hz (bottom) sequence are overlaid. Note the ‘dropping’ of spikes in the burst at high rates. Bottom trace is the electromotoneuron volley recorded near the electric organ. (G) Average number of spikes fired per EOD command by early mossy fibers (gray, n = 9). Symbols show the mean ±S.D. (H) Average firing rate across all inferred tonic mossy fiber inputs to granule cells across EOD command frequencies (mean ±SEM, n = 19).

https://doi.org/10.7554/eLife.44032.007
Model granule cells with rate-dependent command inputs match recorded granule cells.

(A–D) Dark versus light blue indicates model cell response with and without rate-dependent inputs matched to the data. (A) Response of a model granule cell to two EODs in a 10 Hz command sequence. Note that the cell responds very similarly at this rate with either set of inputs. (B) Response of the same model cell as in (A), but for a sequence of EOD commands at 60 Hz. Note the qualitatively distinct responses with and without input rate-dependencies at this higher rate. (C–D) Distributions of two response statistics for new and old models, dashed lines show the value found in real granule cells. (C) Median percentage increase in membrane voltage from 10 to 60 Hz (for old model p=0.03, for new model p=0.72). (D) Median membrane potential slope across a 60 Hz train of EOD commands (for old model p<0.002, for new model p=0.81).

https://doi.org/10.7554/eLife.44032.013

To quantify the differences between recorded versus model granule cells, we computed the average percentage increase in membrane voltage from 10 Hz to 60 Hz (Figure 4C) and the average slope of the line best fit to the membrane voltage of each cell across a 60 Hz train of EOD commands (Figure 4D). For the model cells, we generated a distribution by drawing 1000 sets of 28 cells from our model population (matching the 28 recorded cells) and used this to compute both a distribution and a p-value. In each histogram the vertical dashed line shows the value calculated for the set of recorded granule cells. Whereas recorded granule cells showed very little change in their average membrane potential at high command rates, model cells increased their membrane potential substantially (Figure 4C). Recorded granule cells have, on average, a negative slope in their membrane potential across 60 Hz trains, whereas model cells have positive sloping membrane potentials (Figure 4D), consistent with greater summation in the model versus the recorded granule cells. Clearly the original granule cell model provides a poor description of actual granule cell responses at high EOD command rates.

The shortcomings of the granule cell model we have been using could arise from a mismatch of the model to the biophysical properties of real granule cells, or it could be the result of poorly describing their mossy-fiber and unipolar-brush-cell inputs. To differentiate between these possibilities, we modeled granule cell responses using the same integrate-and-fire description we have been using, but we replaced the computed input to the model cells with experimentally measured inputs. For each granule cell, we fit integrate-and-fire model parameters and, at the same time, inferred its excitatory inputs from the recorded membrane potential. This process was relatively straightforward given that granule cells exhibit large EPSPs, low noise, and receive just a few inputs (Kennedy et al., 2014; Requarth et al., 2014; Sawtell, 2010) (see Materials and methods). We found that the original integrate-and-fire model did a good job of fitting the data provided that we used inputs inferred from data, not the inputs computed in the original model (Figure 4E shows the data and model fit for an example cell, also see Figure 4—figure supplement 1). We tried a number of augmented models, including features such as synaptic depression, inhibition and conductance-based soma and synapses, but these did not substantially improve the fit compared to the basic current-based integrate-and-fire model with purely excitatory input. This analysis suggested that the failure of the original model (Figure 3) to generalize may indeed lie in its failure to accurately represent the EOD-rate dependence of mossy fiber and unipolar brush cell inputs.

To test this more directly, we recorded from mossy fiber axons, unipolar brush cells, and Golgi cells. Criteria for distinguishing between these different elements in neural recordings have been established previously (Bell et al., 1992; Kennedy et al., 2014; Sawtell, 2010). Prominent effects of EOD command rate were observed in early mossy fibers, the most common class of mossy fiber input to granule cells. Consistent with past observations, early mossy fibers fire extremely precise bursts of spikes following after EOD command. After the successive commands in a 10 Hz train they fire exactly the same burst of spikes (Figure 4F, top), but on the second command of a 40 Hz or 60 Hz train they drop one or more spikes (Figure 4F, middle and bottom). The result of this dropping out is that the average number of mossy fiber spikes fired per command decreases with increasing command rate (Figure 4G).

An additional effect observed at high command rates was a decrease in the rate of tonic input to granule cells. Tonic inputs (EPSPs not time-locked to the EOD command) were observed in 19 of 28 granule cells (Figure 4H). Similar command-rate dependent decreases in tonic firing were observed in recordings from putative mossy fibers and unipolar brush cells (Figure 4—figure supplement 3). Command rate-dependent responses were also found in another previously characterized functional class of granule cell input, known as pause inputs. Pause inputs exhibit a sharply-timed cessation of tonic firing following each EOD command and are believed to correspond to unipolar brush cells (Kennedy et al., 2014). Recordings from granule cells and putative UBCs indicate that such pause inputs decrease their firing significantly at higher command rates, often ceasing to fire completely (Figure 4—figure supplement 4). Finally, we found that Golgi cells, inhibitory interneurons that synapse onto granule cells, markedly increase their firing with increasing EOD command rate (Figure 4—figure supplement 5). Hence, command rate-dependent Golgi inhibition could also contribute to reducing the effect of temporal summation of excitatory inputs in granule cells.

Model granule cells with rate-dependent command inputs match recorded granule cells

As described above, excitatory inputs to granule cells exhibit EOD command-rate dependencies that are more complex than those assumed in our original model. To determine whether such effects could help to explain generalization in ELL output neurons, we incorporated features of the recorded mossy fibers into a revised model. Specifically, we introduced the rate-dependent dropping out of early mossy fiber spikes and the reduction in tonic mossy fiber firing into the model. The measured rate-dependence of pause mossy fibers was similar to what was assumed in the original model, so no modification was necessary for them. Golgi cells were not considered further because we felt that too little is currently known about the details of Golgi inhibition onto granule cells to incorporate them into the model.

We characterized the effects of these changes by simulating populations of granule cells with and without command rate-dependent inputs. At low command rates model granule cells from the two populations show similar responses (Figure 5A). However, at high command rates the two populations differ. Granule cells in the revised model no longer exhibit the increased depolarization at high versus low rates that was observed in the original model granule cells (Figure 5B). Examining the statistics we used previously to characterize EOD-rate dependencies in granule cell responses reveals that the inclusion of realistic assumptions regarding mossy fiber inputs dramatically changes the overall character of the granule cell responses in the revised model. The result is model granule cell responses that are clearly more consistent with the subthreshold responses recorded in actual granule cells across EOD rates (Figure 5C,D).

The revised model with full regularization matches the generalization performance of ELL output neurons

Finally, we sought to determine whether the revised granule cell model, combined with fully regularized synaptic plasticity, can explain generalization in ELL output cells. We again simulated the generalization experiment where the system learns with 10 Hz sequences of EODs and cancellation performance is subsequently probed at different EOD rates (Figure 6A). The revised model with full regularization shows cancellation across rates that generalizes at a level comparable to the recorded ELL neurons (Figure 6A,C,D). To understand the roles of both regularization and EOD rate dependencies, we compared results obtained using the revised model granule cell population (with rate-dependent input) but with the minimally regularized synaptic plasticity rule. In this case, model ELL neurons trained only at 10 Hz exhibited over-cancellation at high EOD rates (Figure 6B). Hence the more realistic mossy fiber-granule cell model, on its own, is also insufficient to explain generalization (Figure 6B,C,D).

Figure 6 with 3 supplements see all
A revised ELL model accounts for generalization in ELL neurons.

In all panels dashed blue traces are the sensory response to be cancelled and solid blue traces are the response to the paired EOD command plus mimic sequences. Learning occurs only at 10 Hz as indicated by grey boxes. Red ticks show the times of EOD commands and black ticks show the times of EOD mimics. (A) Top, pre-learning response of a revised model output cell with full regularization to paired EOD command and mimic sequences delivered at 10 Hz. Bottom, response of the same cell after learning. Note, the response to the mimic is largely cancelled at both 10 and 60 Hz. (B) Top, pre-learning response of a revised model output cell with minimal regularization to paired EOD command and mimic sequences delivered at 10 Hz. Bottom, response of the same cell after learning. Note, the response to the mimic is largely cancelled at 10 Hz but is now over-cancelled at 60 Hz. (C) Level of cancellation achieved at 10 Hz across different model and real ELL cells is similar (p=0.72 for minimally regularized model versus data; p=0.62 for fully-regularized model versus data, Wilcoxon signed rank test). (D) Similar to C but showing the level of cancellation achieved at 60 Hz, (p<0.001 for minimally regularized model versus data; p=0.38 for fully-regularized model versus data, Wilcoxon signed rank test). (E) Dark blue, mean spiking response of model granule cells with input rate dependencies; grey, mean response of electroreceptor afferents, both at 60 Hz EOD rate. Note the similarity in shape.

https://doi.org/10.7554/eLife.44032.014

Further examination of the model suggests a hypothesis regarding how regularized synaptic plasticity and rate-dependent mossy fiber inputs work together to support generalization. The form of regularized synaptic plasticity we have used involves a decay of each synaptic weight toward a constant non-zero value. Increasing the strength of this regularization decreases the variance of the learned weights because synaptic weights from different granule cells are constrained to be similar to this value and hence to one another (Figure 6—figure supplement 1). This means that, with strong regularization, the learned negative image is constrained to be approximately proportional to the mean response of the granule cell population. This average shape is, in turn, affected strongly by the rate-dependence of inputs to granule cells. As we have shown, in the absence of realistic mossy fiber rate-dependencies the mean model granule cell response has an increasing profile across a 60 Hz train, whereas, with the actual mossy fiber rate dependence, the mean granule cell response has a decreasing profile.

Notably, the sensory responses of ELL neurons to high rate trains of EOD mimics (prior to cancellation) also exhibit a decreasing profile (Figure 2A, Figure 6—figure supplement 2). To determine the origin of such responses, we performed a separate set of extracellular recordings from ampullary electroreceptor afferents (the source of electrosensory input to ELL neurons). Ampullary afferent firing rate also exhibited a decreasing profile at high EOD rates (Figure 6E, Figure 6—figure supplement 3). Responses of ampullary afferents to isolated EOD pulses consist of a firing rate increase followed by a reduction below baseline and in some cases additional smaller waves of increased and decreased firing resembling a damped oscillation (Figure 6—figure supplement 3D)(Bell and Russell, 1978). Estimating the impulse response of an ampullary afferent from its average response to a single EOD mimic and then convolving this impulse response with a sequence of EOD mimics yielded a reasonable approximation to the observed responses (Figure 6—figure supplement 3A, red lines). Hence the decaying profile of the sensory response to high-rate sequences of EODs as well as the inhibitory rebound at the end of such sequences are expected features of a linear system with an impulse response resembling a damped oscillation.

In summary, these results suggest that generalization across EOD discharge rates may be achieved in the ELL by combining two features: (1) a form of plasticity that encourages low variance in learned weights, forcing the negative image to be close to the mean granule cell response and (2) mossy fiber rate dependencies that ensure that the mean granule cell corollary discharge response has a shape that approximates the sensory signal to be cancelled. Together, these two features may allow accurate negative images to be generated across a wide range of EOD rates for which no previous learning has taken place.

Discussion

Functional significance of generalization in the ELL

Past work on negative image formation and sensory cancellation in mormyrid fish has been restricted to one particular behavioral regime, namely, periods when EOD rates are low and regular. However, the rate and timing of EODs are under voluntary control and vary widely during both electrocommunication and active electrolocation (Hofmann et al., 2014; Moller et al., 1989; Schwarz and von der Emde, 2001; Toerring and Moller, 1984). This suggests that negative images learned over periods of minutes or hours at low EOD rates (e.g. while the fish is inactive) must generalize when the fish transitions to a high EOD rate (e.g. during foraging, fleeing, exploring a novel object, or interacting with a conspecific). If generalization did not occur in such instances, the passive electrosensory system would be vulnerable to self-generated interference during the periods when it would be needed the most. Behavioral studies suggest that multiple senses (including both the passive and active electrosensory systems) are used in concert to detect prey (von der Emde and Bleckmann, 1998). In light of these considerations, our observation that the cancellation performance of ELL neurons generalizes accurately supports and substantially extends the ethological relevance of negative images for passive electrolocation in mormyrid fish. A caveat is that our study focused on generalization in only one specific set of circumstances, that is an abrupt transition from low to high rates. Although this allowed us to focus on in-depth analysis of the mechanisms of generalization, important questions remain about how such mechanisms operate under more natural circumstances, for example in the context of the more complex and variable EOD interval patterns characteristic of behaving fish.

Our results are likely relevant to a number of other behavioral contexts and brain structures in which the cancellation of self-generated sensory inputs is known to occur. Negative images have been described in the active electrosensory system of mormyrid as well as gymnotid fish where they serve to cancel the effects of movements of the fish’s body as well as spatially redundant electrosensory signals resulting from interactions with conspecifics (Bastian, 1996; Requarth et al., 2014; Requarth and Sawtell, 2014). Movements of the tail, for example, generate reafference by changing the position of the electric organ (located in the tail) relative to electroreceptors on the head and body. In the passive electrosensory system of elasmobranchs (the group that includes sharks and skates), negative images cancel the effects of swimming movements and respiration (Bodznick et al., 1999). Cancellation of self-generated inputs has also been described in related cerebellum-like structures associated with the mechanosensory lateral line system in fish and the auditory system in mice (Montgomery and Bodznick, 1994; Singla et al., 2017). In all of these cases, generalization is expected to be vital in assuring that negative images remain accurate across different behavioral and/or environmental contexts.

Mechanisms of generalization

Using a combined experimental and theoretical approach we identified two features that, when added to existing models of ELL, were sufficient to explain how negative images learned at one rate generalize to another. The first element was that synaptic plasticity from granule cells to ELL neurons be appropriately regularized. Regularization of learned parameters is ubiquitous in machine learning as a technique to prevent overfitting, or the learning of parameters that fit the idiosyncrasies and noise present in training data and therefore do not generalize well to new data (Bishop, 2006). Consistent with this, in an ELL model lacking adequate regularization we found that ELL neurons could learn negative images at low EOD rates, however, cancellation at high (untrained) rates was poor. Although we do not have direct evidence for such regularization of synaptic plasticity in ELL, we note that there are a number of candidate mechanisms described in other systems. For example bounded synaptic strengths (Amit and Fusi, 1992), discrete synaptic weights (O'Connor et al., 2005; Petersen et al., 1998), synaptic scaling (Turrigiano, 2008), coupling of synaptic changes between nearby synapses (Engert and Bonhoeffer, 1997), synaptic competition (Miller, 1996), and various sources of noise (Basalyga and Salinas, 2006), could all act as forms of regularization even if they are simply due to constraints on the system or have additional purposes. In our model we found that a constant decay of the strength of each synapse towards a baseline value worked best. This rule has the appealing property of being implementable locally at each synapse. However, our rule does require an explicit setting for the regularization decay rate. This parameter could itself be learned over a longer timescale, which would be a form of meta-plasticity (Abraham and Bear, 1996) or meta-learning (Doya, 2002). To our knowledge, little is currently known in any biological system regarding whether and how synaptic plasticity is regularized or about whether such regularization plays a role in generalization. Addressing these questions is an interesting challenge for future research that may be aided by emerging methods for directly visualizing morphology, activity, and synaptic proteins at the level of dendrites and spines (Roth et al., 2017).

The second feature we identified as important for generalization is an approximate matching between the EOD rate dependence of electrosensory inputs to ELL output neurons and the rate dependence of the summed corollary discharge input that an output cell receives via the granule cells. In vivo recordings from ampullary electroreceptor afferents, ELL output neurons, mossy fibers, and granule cells provided direct evidence for such matching. The temporal dynamics of granule cell corollary discharge responses across EOD rates are, on average, much more similar to those of electroreceptor afferents than expected based on past recordings and modeling of granule cell responses to isolated EOD commands. This matching appears to be achieved via a variety of previously unknown EOD command rate dependencies in the inputs to granule cells. So-called early mossy fibers are known from previous studies to fire a highly-stereotyped burst of action potentials following each EOD command (Bell et al., 1992). We found that the number of spikes in such bursts declines progressively with increases in the command rate. The multiple spikes in the burst seem redundant in the context of isolated EODs. Why would multiple spikes be needed to signal the time of occurrence of an EOD command? The present work suggests that rate-dependent grading of such bursts conveys information that is important for generalization.

We mainly focused on the command rate-dependence of so-called early mossy fiber inputs because these inputs are by far the most frequently encountered in our blind recordings. However, the command-rate dependence of less common elements such as unipolar brush cells and Golgi cells was also qualitatively consistent with the proposed matching. Determining the relative importance for generalization of these different sources of command-rate dependence (i.e. mossy fibers, unipolar brush cells, and Golgi cells) is difficult given that we lack methods for selectively targeting them for recordings or manipulations. We also cannot rule out the importance for generalization of other circuit elements not studied here and for which we lack sufficient physiological data under conditions of different EOD rates. Our model (like all past models of the mormyrid ELL) does not distinguish between two distinct classes of ELL neurons: glutamatergic output cells versus GABAergic MG cells which inhibit output cells. MG cells occupy an analogous position in the circuitry of the mormryid ELL as Purkinje cells in the teleost cerebellum and cartwheel cells in the dorsal cochlear nucleus (Bell, 2002; Bell et al., 2008). Importantly, both MG and output cells integrate electrosensory and corollary discharge input and both exhibit anti-Hebbian plasticity (Bell et al., 1997a; Bell et al., 1993; Meek et al., 1996; Mohr et al., 2003). However, it is presently unknown, even in the context of low EOD rates, how MG cells contribute to sensory cancellation and negative image formation. Our model also omits molecular layer interneurons and does not distinguish between E- and I-type output cells. Constructing a more complete and realistic model that includes these additional features is a focus of ongoing experimental and theoretical studies of the mormyrid ELL.

Generalization of negative images could be accomplished quite simply if both electrosensory and corollary discharge signals had a linear dependence on EOD rate. Responses of ampullary electroreceptor afferents, indeed, appear to exhibit a roughly linear dependence on EOD rate (Figure 6—figure supplement 3). However, recordings from granule cells in mormyrid fish (Kennedy et al., 2014; Requarth and Sawtell, 2014; Sawtell, 2010), as well as studies of cerebellar granule cells in mammals (Barmack and Yakhnitsa, 2008; Chabrol et al., 2015; Chadderton et al., 2004; Ruigrok et al., 2011), suggest that granule cells exhibit markedly nonlinear properties, including prominent rectification and burst firing. In our initial modeling we found that even when electrosensory and mossy fiber inputs both varied approximately linearly with EOD rate and inputs were summed linearly by model granule cells, the model failed to match the generalization performance seen in real ELL neurons. One reason for its failure is the nonlinearity introduced by the firing rate threshold of the granule cells. Whenever a threshold is applied, portions of a signal that are subthreshold at low repetition rates can become supra-threshold at higher rates due to temporal summation, resulting in nonlinear responses (Figure 4—figure supplement 2). Rather than linearizing the granule cell population response, the EOD rate dependencies we found in mossy fiber inputs to granule cells actually introduce additional nonlinearities on top of the threshold linearity. It is the summed effect of these nonlinearities across the granule cell population that guarantees that an approximate negative image is always available in the scaled mean of the population activity. This may be a useful principle employed by other neural systems - encoding of approximate solutions across contexts in a robust manner, in this case through a simple average population activity, allowing flexible learning while maintaining information that supports generalization.

Connections to generalization in other systems

The issue of generalization has been explored in the gymnotid ELL in the context of cancellation of spatially redundant electrosensory signals, such as those generated by tail movements or conspecifics (Bol et al., 2011; Mejias et al., 2013). Such cancellation is similar to that in the mormyrid ELL in that it is mediated by anti-Hebbian plasticity at synapses between granule cells and ELL neurons (Harvey-Girard et al., 2010). However, cancellation in the gymnotid ELL is driven by proprioception or electrosensory feedback to granule cells rather than by corollary discharge (Bastian et al., 2004; Chacron et al., 2005). In vivo recordings from ELL neurons in gymnotids demonstrated that cancellation remains accurate over a wide range of stimulus contrasts (as might be produced by conspecifics at different distances) (Mejias et al., 2013). Modeling was used to show how learning at one contrast could generalize to higher or lower contrasts, despite numerous nonlinearities in the system. Interestingly, features of the model identified to be important for such generalization are related to those described here for the mormyrid ELL, including granule cell response properties and a slow decay in parallel fiber synaptic strength (Mejias et al., 2013; Lewis and Maler, 2004), which can be considered a form of regularization. Although responses of granule cells have not yet been measured in vivo in gymnotids, several lines of evidence suggest that they are important in relation to the specificity and generalization of learning in the gymnotid ELL (Bol et al., 2011; Mejias et al., 2013).

A role for cerebellar granule cells in generalization has been suggested based on studies of motor learning. Adaptation of the vestibulo-ocular reflex (VOR) shows various patterns of generalization and specificity when training and testing are carried out at different head rotation frequencies or static head tilts (Boyden et al., 2004). Under some experimental conditions, VOR learning has been shown to be quite specific to the training context (Baker et al., 1987; Yakushin et al., 2000). Such specificity can be explained by models in which learning is mediated by changes in granule cell inputs conveying highly-specific representations of the training context--for example, granule cells that fire for specific combination of head rotation and head tilt. Such hypotheses have not been directly tested in the context of the VOR or other forms of motor learning, however, numerous lines of evidence support the existence of highly-selective granule cell representations of this sort (Chabrol et al., 2015; Huang et al., 2013; Ishikawa et al., 2015; Sawtell, 2010). Generalization of VOR learning is also observed under some circumstances, for example when training at a high head rotation frequency and testing on a lower frequency (Boyden et al., 2004). Broader tuning in granule cells could underlie generalization in such cases. Studies of generalization of VOR learning may be informed by recent characterizations of the statistics of vestibular input during natural behavior in primates and rodents (Carriot et al., 2014, Carriot et al., 2017).

It has been suggested that patterns of generalization in human motor learning, such as adaptation to force fields in reaching, can be explained by the tuning of a set of basis elements (Donchin et al., 2003; Ghahramani et al., 1996; Shadmehr and Mussa-Ivaldi, 1994). Given their large numbers and the well-characterized plasticity of their parallel fiber synapses, granule cells are a natural candidate for such elements. To our knowledge, the present study is the first to directly relate responses of granule cells recorded during a learning task to generalization.

Materials and methods

Experimental preparation

Request a detailed protocol

All experiments performed in this study adhere to the American Physiological Society’s Guiding Principles in the Care and Use of Animals and were approved by the Columbia University Institutional Animal Care and Use Committee, protocol AAAW4462. Mormyrid fish (7-12 cm in length) of the species Gnathonemus petersii were used in these experiments. Surgical procedures to expose the brain for recording were identical to those described previously (Bell, 1982; Enikolopov et al., 2018; Sawtell, 2010). Gallamine triethiodide (Flaxedil) was given at the end of the surgery (∼20 μg / cm of body length) and the anesthetic (MS:222, 1:25,000) was removed. Aerated water was passed over the fish’s gills for respiration. Paralysis blocks the effect of electromotoneurons on the electric organ, preventing the EOD, but the motor command signal that would normally elicit an EOD continues to be emitted spontaneously at rates of 2-5 Hz. The timing of the EOD motor command can be measured precisely allowing the central effects of corollary discharge inputs to be observed in isolation from the electrosensory input that would normally result from the EOD. In a few experiments, recordings from electroreceptor afferents were performed in unparalyzed fish anesthetized with metomidate which leaves the fish’s EOD intact (Engelmann et al., 2006).

EOD command stimulation

Request a detailed protocol

We controlled the EOD motor command rate by targeting a concentric bipolar stimulating electrode (FHC, Bowdoin, ME) to the axon tract connecting the precommand nucleus to the EOD command nucleus, located near the ventral surface of the brainstem. The electrode was inserted at the midline through the corpus cerebellum just anterior to ELL (angled 22 degrees caudally in the sagittal plane) and lowered into the brain using a hydraulic manipulator until commands could be evoked by a strong stimulus (0.2 ms duration; 50 μA). The depth of the electrode was then fine-tuned until commands could be reliably evoked at short latencies by single pulses using minimal current (typically 5-15 μA). In most cases, such stimulation gave near perfect control over the timing of the EOD command. Occasionally, stimulation failed to evoke a command during high rate trains or the fish discharged spontaneously during a low rate train. However, these errors were easy to detect and were sufficiently infrequent that they were deemed negligible. Finally, microstimulation-evoked corollary discharge responses were indistinguishable from those evoked by the fish’s spontaneous commands at the level of field potentials, mossy fibers, and granule cells.

Electrophysiology

Request a detailed protocol

The EOD motor command signal was recorded with an electrode placed over the electric organ. The command signal is the synchronized volley of electromotoneurons that would normally elicit an EOD in the absence of neuromuscular blockade. The command signal lasts about 3 ms and consists of a small negative wave followed by three larger biphasic waves. The latencies of central corollary discharge or command-evoked responses were measured with respect to the negative peak of the first large biphasic wave in the command signal.

Extracellular recordings from the VLZ were made with glass microelectrodes filled with 2M NaCl (8-30 MΩ). Consistent with previous studies, ampullary afferents were encountered in the deeper layers of ELL (medial in our penetrations) and were characterized by highly regular spontaneous firing at ~50 Hz, the absence of any response to the EOD motor command, an excitatory responses to a stomach negative EOD mimic pulse (0.2-2 ms duration), and strong responses to small (<1 μA), long duration (10-100 ms) electrosensory stimuli. Output cells were encountered in more superficial layers (on penetrations slightly lateral to those in which afferents were encountered) and were characterized by much lower and more irregular firing rates than ampullary afferents. E-cells showed increased firing in responses to a stomach negative EOD mimic pulse, while I-cells showed decreased firing. None of the E- or I-cells included in our analysis exhibited two distinct action potential waveforms, the hallmark of the other major cell type of in VLZ, the medium ganglion cells (Bell et al., 1997a; Bell et al., 1993). Hence these recordings are presumed to be from the efferent neurons of ELL.

Extracellular recordings from mossy fibers in EGp and in the paratrigeminal command-associated nucleus were made using glass microelectrodes filled with 2M NaCl (40-100 MΩ). For in vivo whole-cell recordings from EGp neurons patch electrodes (9-15 MΩ) were filled with an internal solution containing, in mM: K-gluconate (122); KCl (7); HEPES (10); Na2GTP (0.4); MgATP (4); EGTA (0.5), and 0.5% biocytin (pH 7.2, 280-290 mOsm). No correction was made for liquid junction potentials. Only cells with stable membrane potentials more hyperpolarized than -50 mV and access resistance < 100 MΩ were analyzed. Membrane potentials were filtered at 3-10 kHz and digitized at 20 kHz (CED power1401 hardware and Spike2 software; Cambridge Electronics Design, Cambridge, UK).

Pairing experiments

Request a detailed protocol

Cancellation and negative image formation was tested in E- and I-cells by pairing EOD commands with an EOD mimic pulse (0.2-2 ms wide square pulses; 1-5 μA) delivered at the delay at which the EOD would normally occur (4.5 ms after the EOD command). This delay was fixed and independent of command rate in our experiments. This is assumed to be the case under natural conditions as well, although to our knowledge, this has never been directly shown. EOD mimics were delivered using a dipole electrode positioned < 2 mm from the skin within the unit’s receptive field. These methods are the same as those used previously to characterize negative images and sensory cancellation in the context of low command rates. In I cells, the EOD mimic often drove the firing rate to zero, making it difficult to quantify cancellation. To avoid this firing rate rectification, we reversed the EOD mimic polarity when recording from I cells, such that they responded with excitation instead of inhibition. This response reversal is due to known properties of ampullary electroreceptor afferents, which increase (or decrease) firing above (or below) their baseline rate for stimuli that make the pore of the receptor positive (or negative) with respect to the basal face within the body. Past studies have commonly used this approach to demonstrate the specificity of negative image formation in the VLZ by performing multiple pairing in the same neuron using opposite stimulus polarities (Bell, 1981, Bell, 1982; Enikolopov et al., 2018). Negative images are invariably observed for both stimulus polarities in such experiments, that is responses to the corollary discharge alone after pairing are opposite in sign to the response to the stimulus during pairing. Systematic differences between negative images and cancellation in E versus I cells or for mimics of opposite polarities have never been noted, justifying this approach to avoid rectification.

Two types of pairing experiments were conducted. For the first type, pairing was performed across a range of rates (10, 40, and 60 Hz or 10, 30, and 50 Hz). Cancellation was assessed by comparing responses early and late during pairing which lasted 10-20 minutes. Negative images were assessed in a subset of cells by comparing responses to the command alone across rates before versus after pairing. Responses to identical trains of electrosensory stimuli presented independent of the command were also tested for each cell. In some cases, multiple pairings were conducted in the same cell after allowing 10-15 minutes for recovery from the effects of prior pairing. The second type of experiment was the same as described above except that pairing was only conducted at 10 Hz. Cancellation was assessed in these experiments by briefly (60-100 sec) probing responses at all three rates before and immediately after pairing at 10 Hz.

Linear model of electroreceptor sequence responses

Request a detailed protocol

To test whether electroreceptor afferent responses to EOD sequences could be approximated as linear, we estimated the impulse response kernel, K(t), of each recorded unit from its response to an isolated EOD mimic. We first computed the average firing rate evoked by isolated EOD mimics (those separated by at least 150 ms). We treated this as an estimate of the impulse response of the recorded unit. To compute the predicted linear response, L(t), we convolved this kernel with a series of delta functions centered on the times of the EOD mimics:

L(t)=ΣiK(t)*δ(ti-t)

where ti are the times of the EOD commands in the sequence.

Quantification of cancellation and generalization

Request a detailed protocol

To quantify cancellation and generalization, the degree of cancellation, C, was measured as the ratio of the total variance of the response to a sequence starting at time tstart and ending at time tend, to an EOD command plus mimic sequence post pairing, rpost(t), to that pre pairing,rpre(t):

C=tstarttendrpost(t)-rpost2dttstarttendrpre(t)-rpre2dt

Modeling granule cells

Request a detailed protocol

Our general approach to modeling granule cells follows that used previously (Kennedy et al., 2014). We generate model granule cell populations by random mixing of mossy fiber inputs, as described below. To extend this model to the case of different EOD command rates we also directly fit integrate-and-fire models to recordings of real granule cell responses, inferring the mossy fiber inputs at the same time. We then use information about rate dependencies in these mossy fiber inputs gleaned from this fitting procedure as well as from direct recordings of mossy fibers to show that rate-dependent changes in the inputs to granule cells can account for their responses to EOD command sequences. This information was then used to update the granule cell population model. The following sections describe these different modeling steps in more detail.

Fitting granule cell voltage responses to EOD command sequences

Request a detailed protocol

When fitting models to real granule cell data we first removed stimulus artifacts caused by electromotor command nucleus stimulation as well as any spikes using a simple threshold on the gradient of the membrane voltage. We found that a gradient threshold of 1.8 mV/ms worked well. We used multiple methods and models to fit a set of 28 intracellularly recorded granule cell responses to EOD command sequences from 10 to 60 Hz. The basic model was an integrate-and-fire model with current based synapses. The parameters of the model were the membrane time constant, the leak potential, and synaptic parameters. Each cell could receive two inputs. Each input had the following parameters, a fast and a slow time constant, and a fast and a slow weight. This accounts for the fact that granule cell EPSPs often show a combination of fast and slow components. We allowed two inputs to permit both command-associated inputs and command-independent tonic inputs. The response of a model granule cell was given by

τmdVdt=El-V+Σi,jEi(t)δ(t-tij)

where tij is the time of the j-th spike of the i-th input and the synaptic kernels for each input are given by

(1) Ei(t)=1τifastwifastetτifast+1τislowwislowetτislow.

To fit granule cell models, we further needed to estimate the times of input spikes for each cell. We used multiple methods to make this input inference, all of which gave similar results. The first method was a wavelet-based detection method. We computed the continuous wavelet transform of the membrane voltage at 16 scales from 800 to 2000 Hz, using the MATLAB Wavelet Toolbox. We searched for times where the wavelet transform exhibited peaks at multiple scales and considered these times putative input spike times. We then combined peak locations into single putative input times if they were closer than 0.5 ms together. We then visually validated all of the data by checking if the input times detected by this procedure corresponded to clear upticks in the membrane voltage. We made corrections when it appeared an EPSP had occurred and then used both the corrected and uncorrected input times when fitting models and compared results. The qualitative results described in the main text did not depend on the input details at this level of accuracy. We used two different methods for estimating the parameters of granule cell models. Our results did not depend on which method was used. The first method was to use the putative input times we found, assume these were the actual input times to the granule cells, and then use least squares minimization to find the optimal parameters of the granule cell model, given these input times. The second method was to use these putative input times as an initialization for an MCMC method which then generated joint samples from the posterior distribution of granule cell model parameters and input times. We initialized the input times based on those found by the wavelet method. We then used Gibbs sampling to sample model parameters after a burn-in of 500 sweeps. Approximate sampling of input times was achieved by allowing the following moves: an input spike could be jittered around its current location, an input spike could be removed, and an input spike could be added. We placed priors on the total number of spikes based on the estimated number detected by the wavelet method to prevent the addition of many extra spikes. We also placed hard bounds on the parameters so that synaptic weights were always positive.

Basic granule cell model

Request a detailed protocol

As in previous work (Kennedy et al., 2014), we generated populations of model granule cells from a random mixing procedure based on the following assumptions. Each cell receives input from classes early (E), medium (M), late (L), pause (P) or tonic (T). (i) Each granule cell has three sites for mossy fiber synaptic inputs. (ii) The probabilities of a given input being of E, M, L, P and T type are given by Pe, Pm, Pl, Pp and Pt, with Pe+Pm+Pl+Pp+Pt1. (iii) The type of input received at one mossy fiber-granule cell synapse is independent of that received at any other synapse. We used input type probabilities as calculated previously based on fits to individual granule cells (Kennedy et al., 2014).

We introduced two sources of variability. We included trial to trial variability in the peak height of recorded single EPSPs from a normal distribution with σ=0.224 mV; during simulation of model granule cells, we sampled this distribution for each mossy fiber spike. Some granule cells further receive tonic mossy fiber inputs in addition to corollary discharge inputs. These inputs fire at high rates, independent of the EOD command. We included tonic input as previously, based on 72 tonic mossy fiber recordings.

For each model granule cell we randomly determined whether each potential connection to that model cell received early (Pe=0.425), medium (Pm=0.075), late (Pl=0.05), pause (Pp=0.05), tonic (Pt=0.157) or no input (Pn=0.243), as in previous work. We then chose a particular mossy fiber response of the previously-determined class as the source of that input; we assumed that a connection is equally likely to be from any of the mossy fibers within a given class. These steps constitute the basic procedure for modeling populations of granule cells. The mossy fiber recordings we use to generate granule cells were based on responses to single EOD commands. To model the responses of these cells to EOD command sequences we needed to choose a method for predicting the responses of each mossy fiber to sequences of EOD commands. The following sections describe how this was achieved.

In the model granule cells we used synapses with fast and slow components. We used the same synapse model described above when fitting responses of real granule cells. The synaptic dynamics were described by Equation 1. When generating model granule cell populations we had to choose values for the four parameters τfast, wifast, τslow, and wislow. These were chosen by fitting Gamma distributions to the values of these parameters obtained by fitting the granule cell model to granule cell data as described above. We then drew parameters randomly from these distributions for each model granule cell we generated. Parameters were the same for each input to a given granule cell, and the values of the four parameters were assumed to be independent.

Model granule cell responses to EOD command sequences

Request a detailed protocol

To generate responses of model granule cells to EOD command sequences we needed to model the responses of each mossy fiber to that same sequence. We did not have a sufficiently large set of mossy fiber recordings from each class in response to EOD sequences at different rates to simply use these recordings directly as inputs to model granule cells. Instead we made simple models of how each of our previously recorded mossy fibers (whose responses only to isolated EOD commands we had recorded) would respond to EOD command sequences, based on actual responses to command sequences recorded from mossy fibers and UBCs in the present study. We considered two different models, referred to in the main text as the original model and the revised model. Medium, late and pause inputs were treated identically in the two models. Early and tonic inputs differed. For medium inputs we simply assumed that the set of spikes fired after each EOD command was the same, no matter where that command came in a sequence. This meant that spikes due to one command could overlap with spikes from subsequent commands, which we allowed, although we checked that this did not result in unrealistic firing rates of medium mossy fibers. Late inputs are characterized by a delay in firing after a command followed by a period of spiking. To model the response of a late mossy fiber to EOD command sequences we assumed that the firing delays accumulated if they overlapped. This amounts to computing the spiking response of a late mossy fiber to an EOD command sequence by starting at the first command in the sequence and proceeding through the sequence, allowing the delay in firing following a command to prevent spikes that would otherwise have been caused by the previous command. For pause mossy fibers we estimated the length of the pause in tonic firing induced by each command. To create the response of the fiber to an EOD command sequence we drew randomly from the empirical inter-spike interval of the fiber and populated the period of the sequence with spikes. We then deleted spikes occurring within the estimated pause period after any EOD command in the sequence. This naturally gave rise to cessation of firing at high EOD command frequencies, due to accumulation of pausing. Similar responses at high command rates were observed in recorded pause mossy fibers.

Early and tonic mossy fiber are treated differently in the two models

Request a detailed protocol

The key differences between the original and revised models were in the way we treated early mossy fiber inputs and tonic mossy fiber inputs. Recordings from early mossy fibers as well as mossy fiber inputs inferred from granule cell recordings showed that early mossy fibers tend to fire progressively fewer spikes per EOD command during high-frequency command sequences and that tonic mossy fibers also tend to fire at a progressively lower rate during high frequency EOD sequences. The original model does not take these new findings into account, whereas the revised model does. In the original model we assume that early mossy fibers fire the exact same burst of spikes (known from recorded responses to single EOD commands) after each command in a sequence and we create tonic mossy fiber spike trains in response to EOD command sequences by sampling from estimated inter-spike interval distributions for each recorded tonic mossy fiber. In the revised model, the fraction of spikes fired by each early mossy fiber following each EOD command, compared to the number fired after an isolated EOD command, was a function of recent EOD command history. The fraction f relaxed to 1 with a characteristic timescale (80 ms) and is reduced by a factor α= 0.72 following each EOD command:

τfdfdt=1-f

and fαf after each EOD command. These parameters were chosen to approximately match the dropping observed in recorded responses of early mossy fibers to EOD command sequences. In the revised model we modified the responses of tonic mossy fibers by removing a number of spikes from the spike train based on the recent EOD command rate (computed over the last 100 ms). The decrease in tonic firing was again based on recorded tonic mossy fiber responses to EOD sequences. Tonic firing rates were decreased linearly from their maximum rate at an EOD command frequency of 10 Hz to 0.6 times their maximum rate at an EOD command frequency of 60 Hz.

Additional granule-cell responses types

Request a detailed protocol

Not all granule cells from the revised model population behaved in the same way. For example, a minority of cells, specifically those receiving previously described medium mossy fiber inputs active at intermediate delays, integrate and fire more spikes at high EOD command rates. Only 2 of the 28 recorded granule cells received a medium input, consistent with the small proportion of medium inputs found previously (Kennedy et al., 2014). One of these cells, indeed, exhibited prominent summation and increased spiking at high rates. However, given the small proportion of medium inputs, a much larger number of actual granule cells would have to be recorded to determine whether such response types are a consistent feature of real granule cells.

Synaptic plasticity

Request a detailed protocol

As in previous work we modeled the membrane potential of ELL neurons, V(t), as a passive, current-based leaky unit receiving excitatory input from 20,000 model granule cells ri(t) and sensory input s(t), with anti-Hebbian spike-timing dependent plasticity at granule cell-ELL neuron synapses with weights wi, and EPSP kernel E fit to recorded granule cell-evoked EPSPs (Kennedy et al., 2014). As discussed above, we adjusted the polarity of the sensory stimulus such that excitation was evoked in both E and I cells. Hence, no distinction was made in the model between E and I cells. The granule cell-ELL neuron learning rule has the form Δ+-Δ-L0(t) where t=tpostspike-tprespike and Lo(t) determines the time dependence of associative depression. Theoretical analysis has shown that the negative images are guaranteed to be stable when L0=E, where E is the EPSP from granule cells to the ELL neuron (Roberts and Bell 2000). The timescale of E agrees with learning rules fit to experimental data, thus we set L0=E. We further included a regularization term as mentioned in the main text. This regularization is equivalent to a constant decay of each synaptic weight toward a baseline value that is the same for all synapses. Using this approach, the rate of change of wi is equal to Δ+ri(t)dt-Δ-V(t)(E*ri)(t)dt-λ(wi-wc), where the integral is over the period of the EOD command sequence being paired. The regularization constant λ sets the time constant, 1λ, for the decay of synaptic weights to the baseline value wc. The values of λ and wc chosen here were selected by hand in order to match the experimental data. The model introduces these two parameters as a minimal extension of our previous model which can account for the experimental results. See the Discussion for thoughts about how these parameters might be set in the biological system.

We used 1λ=10s in the case of full regularization and 1λ=1000s for minimal regularization. The value used with full regularization was chosen to bring the overall performance of the model when generalizing as close to that found in the data without compromising cancellation at 10 Hz to the point where the model could not cancel as well as the data. The value used with minimal regularization was chosen to prevent unrealistically large weights from being learned. We chose the value of wc depending on the ELL neuron being modeled so that the mean model granule cell response scaled by wc was approximately equal to the negative of the sensory input to the ELL neuron, that is such that wcrit-st. Δ+ and Δ- were taken from previous work, where they were fit to negative images recorded experimentally (Kennedy et al., 2014).

Data availability

Data and model code are available via Zenodo (doi: 10.5281/zenodo.2590782).

The following data sets were generated
    1. Nathaniel Sawtell
    2. Conor Dempsey
    3. Larry F Abbott
    (2019) Zenodo
    Data and model associated with "Generalization of learned responses in the mormyrid electrosensory lobe" published in eLife.
    https://doi.org/10.5281/zenodo.2590782

References

  1. Book
    1. Bishop CM
    (2006)
    Pattern Recognition and Machine Learning (Information Science and Statistics)
    Springer-Verlag.
    1. Bodznick D
    2. Montgomery JC
    3. Carey M
    (1999)
    Adaptive mechanisms in the elasmobranch hindbrain
    The Journal of Experimental Biology 202:1357–1364.
    1. Meek J
    2. Grant K
    3. Bell C
    (1999)
    Structural organization of the mormyrid electrosensory lateral line lobe
    The Journal of Experimental Biology 202:1291–1300.
    1. von der Emde G
    2. Bleckmann H
    (1998)
    Finding food: senses involved in foraging for insect larvae in the electric fish Gnathonemus petersii
    The Journal of Experimental Biology 201:969–980.

Decision letter

  1. Catherine Emily Carr
    Reviewing Editor; University of Maryland, United States
  2. Eve Marder
    Senior Editor; Brandeis University, United States
  3. Catherine Emily Carr
    Reviewer; University of Maryland, United States
  4. Maurice J Chacron
    Reviewer; McGill University, Canada
  5. Len Maler
    Reviewer; University of Ottawa, Canada

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your article "Generalization of learned responses in the mormyrid electrosensory lobe" for consideration by eLife. Your article has been reviewed by four peer reviewers, including Catherine Emily Carr as the Reviewing Editor and Reviewer #1, and the evaluation has been overseen by Eve Marder as the Senior Editor. The following individuals involved in review of your submission have also agreed to reveal their identity: Maurice J Chacron (Reviewer #2); Len Maler (Reviewer #3).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

This manuscript examines circuit-level account of generalization, using the electrosensory lobe of weakly electric mormyrid fish. Electrosensory systems show adaptive cancellation of self-generated inputs, and this paper develops a circuit-level model, with physiological recordings, to show how such systems could generalize across behavioral regimes. The mechanism involves changes in synaptic plasticity and matching of the temporal dynamics.

Essential revisions:

The reviewers do not think that extra experiments are required, given the many novel protocols and experimental results this submission already contains. The main points to be addressed are:

1) With respect to the negative image model, reviewers raised questions about a potential role for medium ganglion cells. Would a model based on the response of MGs to varying EOD rates do as well ? This could be addressed by a summary of MG cell circuitry and physiology, followed by an argument for why a first order model of negative image formation and cancellation can ignore the MGs, then future directions to bring the MGs into a more complete model.

2) There is strong spike frequency adaptation of both ampullary afferents and their ELL targets. The cited Engelmann et al., 2010, paper quantifies the receptor SFA in detail, and the authors could discuss this with respect to how EOD rate and SFA of the ampullary receptors/ELL targets might interact.

3) With respect to the model, there are ambiguities in the model description noted in the full reviews (appended). The authors should check their model against the issues raised by the reviewers.

A further point is that 'generalization' also occurs in wave-type gymnotiform fish (Bol et al., and Mejias et al.). These papers could be cited, with discussion of how cerebellar descending input could be adapted to generalizing over different EOD frequencies (mormyrid ampullary system) vs different 'noise' amplitudes (gymnotiform wave species and maybe mormyrid tuberous system).

Useful details will be found in the full reviews (appended).

Reviewer #1:

With respect to the negative image model and a potential role for medium ganglion cells, would a model based on the response of MGs to varying EOD rates do just as well? A serious discussion of the MGs and explanation of why they think these are not important for negative image formation and cancellation and what other role(s) they may play would be helpful.

Reviewer #2:

This manuscript investigates whether the cancellation signal that is internally generated by mormyrid weakly electric fish (i.e., the negative image) will cancel out EOD pulses that occur in rapid succession. The authors find that this is indeed the case and use modelling to make predictions as to the nature of the underlying mechanisms. Overall, I think that this is a very nice study. However, there are a few issues that should be dealt with:

– It is unclear how behaviorally relevant the situation considered here is to the animal. The authors do cite previous studies in the Introduction, but these appear to be dealing with object localization rather than prey capture. It should also be mentioned that the animal most likely does not exclusively use its ampullary system to locate prey but that information coming from both the passive and the active electrosense are likely combined. Finally, in looking at the authors' data, it appears that ampullary afferents and central ELL cells display strong spike frequency adaptation when EOD pulses are 15 msec apart, which might hamper their ability to detect low frequency prey stimuli. These issues need to be discussed properly.

– The model assumed by the authors uses both the "unlearned" input as well as the "learned" one. What is the physiological justification for this as I am not aware of ELL cells receiving the "unlearned" input?

– Although I like the study, it took me a few reads to fully understand the story and its implications. I suggest rewriting it such that the experimental data is presented first, then the "full" model, and the other "candidates" relegated to supplementary information. This would greatly help the reader focus on what is relevant rather than being taken through several models that do not work for various reasons.

– Finally, although the modeling is compelling, it only makes a prediction as to why negative images can "generalize" to higher frequencies, the actual nature of the mechanisms remains to be determined experimentally.

Reviewer #3:

This is potentially a strong manuscript but has what appears to be a serious flaw in reasoning (or in my understanding of the literature – see below). I have followed work in this system for many years but had never thought about the underlying assumptions. Mormyrids (like gymnotiform fish) have electroreceptors (ampullary) tuned to exogenous low frequency signals (e.g., respiration movements of prey). They also have tuberous receptors that are tuned to the fish's own EOD and detect changes in local EOD amplitude caused by, e.g., conductors or non-conductors. The ampullary receptors project to one map of the ELL (VLZ). This map contains E (fusiform I think) and I (large ganglion cells I think) projection neurons. These cells also have apical dendrites that are in receipt of the EOD command signal (corollary discharge) conveyed via granule cells of the overlying cerebellum (EGp).

The problem for mormyrids is that the EOD also stimulates the ampullary receptors. Their EOD triggered discharge (demonstrated decades ago by Curt Bell and colleagues) would prevent the fish from detecting prey. Curt Bell and colleagues, over many years and papers, demonstrated that a corollary discharge from EGp was able to cancel out the EOD evoked response of of VLZ neurons.

The plasticity associated with the corollary discharge input to ELL was always assayed at the convenient low EOD rates of an inactive fish. I knew that mormyrid fish can increase their EOD rates to much high levels when actively sampling their environment. I had never make the connection and question raised in this paper: do the cancellation models developed for low frequency EOD rates carry over to the high frequency EOD rate condition?

The key experiments are very nicely designed with independent control of the EOD rate and stimulus presentation. This is very nice in that pairing or not is completely under experimental control: (1) corollary discharge alone (curare: no EOD and therefore no sensory input) and experimenter stimulates corollary discharge pathway (one control); (2) sense input alone (EOD pulse mimic) in the absence of a central EOD command and therefore no corollary discharge (second control); (3) paired artificial sensory input + experimenter evoked corollary discharge (natural condition at low frequency EOD rates). Recordings are from the principle E and I cells (output cells) of VLZ.

The results, as presented in Figure 2 are striking – cancellation learned with 10 Hz paired stimulation generalizes to 40 and 60 Hz EOD rates. The careful control experiments proved very convincingly that it cancellation at the novel EOD frequencies as due to a negative image; this is a very strong extension of Curt Bell's original negative image hypothesis (for the same EOD rate for pairing and test stimulation). The nicest control is the demonstration that the response to the command stimulation alone (only corollary discharge input to E or I cells) produced a negative image. These are novel and very interesting results.

The next step was to apply a model previously developed to account for negative image formation and cancellation; this model was developed for learning and testing at the same low frequency. The response of model elements to low frequency EOD rates was extended in a simple and natural manner to the higher EOD rates used in the experiments. This model worked as expected when the test stimuli were at 10 Hz (a trivial control for the model changes). The model failed in that it over-cancelled at the higher test EOD rates. The model required training at all test frequencies and was not able to generalize thus failing to replicate the experimental data.

The original model was then changed in what appears to be minor ways (full regularization) and this did appear to improve the ability of the model to account for generalization. At this point, the authors decided that the problem lay in that they had falsely assumed the responses of the cerebellar granule cells conveying the corollary discharge signal was the same for a 10 Hz and, e.g,, a 60 Hz signal. Now, with their new protocol, they were able to record the granule cell response to the corollary discharge alone and at many different 'EOD rates'. Here a very interesting difference emerged between the data and the original model predictions: the model granule cell, but not the real granule cell, showed a pronounced temporal summation in response to the higher frequencies. A series of very nice experiments then suggested that the problem lay not in the granule cell model itself, but rather in the assumption that the mossy fibers conveying the corollary discharge responded in the same manner for 10 Hz and, e.g., 60 Hz discharge. These mossy fibers could not follow the high frequency EOD command signal – i.e., spikes dropped out. In addition, tonic mossy cell firing also was reduced.

When the authors modified the mossy fiber response component of their model, it now very nicely predicted the effect of EOD rate increases on negative image formation and cancellation. The authors explanation of what is going on is quite involved and I won't try to summarize it. It is also quite convincing.

I really could not find any flaw in the reasoning. Nonetheless, I do have one serious problem with the negative image model. To explain this, I will first summarize my possibly outdated memories of the VLZ circuitry and physiology. Any potential confusion is because there has been extensive work on both the VLZ and the mormyromast ELL zones; it is often difficult for me to determine whether conclusions are limited to ampullary vs mormyromast regions. In addition to the principle E and I cells, VLZ (and the mormyromast zones) contains medium ganglion cells (MGs). These are GABAergic interneurons with extensive apical dendrites that receive corollary discharge input from EGp granule cells. As I recall, there are two types of such MGs and they may connect differentially with the principle cell types. These MGs have narrow and broad (dendritic) spikes and the broad spikes are associated with plasticity of their corollary discharge input; in fact, I believe that Curt Bell first demonstrated parallel fiber plasticity in the VLZ MGs. The MGs provide a strong (I suspect cause of the anatomy) and plastic (Bell's data I believe) inhibitory input to principle cells. And they will be activated for both the low and high EOD rates. The authors have chosen not to incorporate these cells into their model. This seems counterintuitive to me.

I am worried that the model, though it is based on mossy fiber recordings and does a reasonable job explaining generalization, is nevertheless seriously flawed in that it omits a major source of corollary discharge input to the principle cells. Perhaps a model based on the response of MGs to varying EOD rates and including the MG to principle cell inhibitory input would do just as well as the author's model in explaining their results. The authors are certainly aware of the MGs: "…an additional prominent class of GABAergic ELL cells known as medium

ganglion cells.", but do not further discuss their potential contribution to cancellation by the principle cells.

I think the authors treatment of cerebellar Golgi cells and unipolar brush cells is OK (for a first order model). What I would like to see is a serious discussion of the MGs and explanation of why they think these are not important for negative image formation and cancellation and what other role(s) they may play. Perhaps I have gotten mixed up re VLZ and mormyromast zone data (at least some of my knowledge re comparing ampullary and morrmyromast zone cancellation comes from direct discussions with Curt Bell, Kirsty Grant and Hans Meek, and I am not even sure that all their results were even published). Basically, I want the authors to take the MGs seriously because they are a very prominent component of VLZ circuitry and must convey corollary discharge associated input to the principle cells. And then convince me that their limited model is still a reasonable first order model that will not be completely overturned once the contribution(s) of MGs are taken into account.

Reviewer #4:

This is a very nice study that presents novel experiments and associated modeling of negative image cancellation in the electrosensory system of mormyrid fish. The work addresses the important question of the generalizability of this cancellation from the standard low frequency sensory sampling rates used in lab settings – and by the animal at rest – to higher rates known to occur during movement. This cancellation is necessary for disentangling true external inputs from those caused by active sensing, a process that relies on a cerebellar-like corollary discharge. The authors present data that show how the animal can generalize from 10 Hz rates (training) to higher test rates.

The beauty of this study lies in particular in the intricate measurements of the granule cells and their inputs (various types of mossy fibers and unipolar brush cells), and the construction of a state-of-the-art model of a heterogeneous population of granule cells. The improved model, constructed using machine learning techniques to estimate parameters as well as the timing of spike input, reproduces the main features of the data. With their previous biophysical model thus further constrained by these new data, the improved model is shown to generalize properly, as the fish does, from the learned 10Hz sampling rate to the novel 60Hz rate. This is possible provided two ingredients are present: 1) regularization, which means that synaptic weights decay down to a value (chosen to be different for each cell), which is quicker (10s) then in their previous study (1000s) – and such an ingredient has been used for generalization in neural net studies – and there is a matching between time courses of corollary discharge and electrosensory input – although this precise statement differs in parts of the paper -see below.

The statement about the matching is confusing. In the Abstract, it is said that an approximate matching of the temporal dynamics (including EOD rate dependence) of corollary discharge and electrosensory inputs. This statement actually relies on Figure 6E and Figure 6—figure supplement 3. But in the Discussion, second paragraph of subsection “Mechanisms of generalization”, the matching is between the EOD rate dependence of electrosensory responses in ELL output cells and the rate dependence of the summed corollary discharge input that an ELL output cell receives from the granule cells. The difference is that the first case looks at the firing rate of sensory cells, and the second at the firing rate of their target cells. At the very least the statement needs to be clarified. But also I want to understand the nature of this "matching", in the sense of whether it is required to make things work or not. It seems to me that this hinges a lot on the method used to choose wc, i.e. the constant driving force in the regularization dynamics (subsection “Synaptic plasticity”). In other words, the ELL output cell is a function of inputs from both the electroreceptors and the granule cells, and I cannot see how to simply swap ELL and electroreceptors in the "matching" statement.

Relatedly, the choice of making wc proportional to the ELL output s(t) (subsection “Synaptic plasticity”) seems to "direct" this matching, since the ELL output must depend on receptor input. The parameter wc is important because it is part of both your key ingredients (regularization and matching). A clarification is also needed with regard to to precisely how wc is determined: if I understand correctly, the weights wi are said to evolve according to the associative learning rule (plus the regularization) which depends on the choice of wc; in turn wc is said to depend on s(t). Yet this ELL output must depend on the wi since these weights scale the 20000 granule cell inputs. So wc depends on wc: is it self-consistently determined numerically? It would also be good to know the typical size and distribution of wc values you used.

The second concern is about the regularization: your associative rule has Δ+ which drives the weights up proportionally to spike count. Yet you need wc to drive them back down exponentially. It seems that one can be traded off for the other to some extent: have you checked whether Δ+ can be changed to fit the data? Was it a free parameter, or was it fixed from prior experiments?

The Discussion mentions cancellation of self-generated inputs, and thus the need for generalization in other systems, including gymnotiform weakly electric fish which have no corollary discharge. Recent experimental and modeling work (Bol et al., 2011) has shown how that system can learn to cancel many frequencies at the same time using a depressing correlation rule (not exactly the anti-hebbian rule used here) – these different frequencies are analogous in some sense to the different self-generated EOD rates the mormyrids must cancel. An elaboration of that work also showed how the experimentally observed contrast invariance of the cancellation can be achieved across frequencies (i.e. generalization to contrasts not learned, and the optimal contrast to learn at – Mejias et al., 2013). These studies require regularization, yet in that case to prevent weights from decaying to zero. An optimal rate of decay of weights was numerically predicted for the best cancellation. This sweet spot might be of a similar nature as the one of 1 𝜆λ=10 s used by trial and error by the authors (subsection “Synaptic plasticity”).

It would be important for the authors to at least make a comparative statement to this system given its close functional proximity to theirs, briefly highlighting commonalities/contrasts between corollary discharge-driven and online driven cancellation strategies. The authors' results makes interesting predictions that may apply to this system.

Finally, I would like to see a few more words about how the authors dealt with avoiding (Results paragraph three) firing-rate rectification in the ELL output cells, which they say complicated quantitative measurements of sensory cancellation. How was the polarity of the EOD mimic adjusted "to evoke excitatory responses in both E and I cells"? Which parameter does this polarity correspond to in their model? Was this parameter fitted? Heterogeneously? I can't quite wrap my head around how big or systematic a correction this was, but such a constant may influence a lot of other things, given e.g. the choice of wc above.

https://doi.org/10.7554/eLife.44032.022

Author response

Essential revisions:

The reviewers do not think that extra experiments are required, given the many novel protocols and experimental results this submission already contains. The main points to be addressed are:

1) With respect to the negative image model, reviewers raised questions about a potential role for medium ganglion cells. Would a model based on the response of MGs to varying EOD rates do as well ? This could be addressed by a summary of MG cell circuitry and physiology, followed by an argument for why a first order model of negative image formation and cancellation can ignore the MGs, then future directions to bring the MGs into a more complete model.

We are glad that the reviewers are interested in the function of MG cells. Though MG cells have been well-studied in some respects, e.g. in terms of their synaptic plasticity, it is important to recognize that their function is unknown, even in the context of low EOD rates studied in the past. By virtue of their anti-Hebbian plasticity, the MG cells should cancel out the effects of the EOD. However, this raises a paradox. How can MG cells aid cancellation in output cells when plasticity stops them from communicating with the rest of the ELL network? We have spent the past several years working on this problem and will shortly submit for publication a major study that offers a solution. Note, this new work focuses exclusively on the case of low rate EODs and does not address the problem of generalization. Without getting too deeply into the details, the solution involves compartmentalization of communication and learning functions within MG cells and a novel wiring scheme for which we provide experimental evidence. Essentially, this scheme allows MG cells to pass on an appropriate negative image to output cells.

Importantly, such a function is entirely consistent with the scheme for generalization proposed in the present manuscript. In light of our new work on MG cells, the question would be how can MG cells generate an appropriate negative image (to be passed onto output cells) that generalizes across rates? This brings us right back to the conclusions of the present manuscript. Namely, that regularized synaptic plasticity and an approximate matching of granule cell corollary discharge and electrosensory responses is sufficient. We would simply posit that this occurs at the level of MG cells as well as at the level of output cells. Note, there is nothing in the simplified singleneuron model of the ELL presented in the present manuscript that is inconsistent with this (i.e. the model could just as well be an MG cell).

For the purposes of the present manuscript, we do not think it is feasible or appropriate to attempt to communicate all this, given that the reasoning is based on unpublished work. However, we have added text that amplifies the important points that MG cells are likely important and that the present model is a simplification (see below). We have also removed the potentially confusing term “principal cell”.

“We also cannot rule out the importance for generalization of other circuit elements not studied here and for which we lack sufficient physiological data under conditions of different EOD rates. Our model (like all past models of the mormyrid ELL) does not distinguish between two distinct classes of ELL neurons: glutamatergic output cells versus GABAergic MG cells which inhibit output cells. MG cells occupy an analogous position in the circuitry of the mormryid ELL as Purkinje cells in the teleost cerebellum and cartwheel cells in the dorsal cochlear nucleus (Bell, 2002; Bell, Han, and Sawtell, 2008). Importantly, both MG and output cells integrate electrosensory and corollary discharge input and both exhibit anti-Hebbian plasticity (Bell, Caputi, and Grant, 1997; Bell et al., 1993; Meek et al., 1996; Mohr, Roberts, and Bell, 2003). However, it is presently unknown, even in the context of low EOD rates, how MG cells contribute to sensory cancellation and negative image formation. Our model also omits molecular layer interneurons, similar to those found in the cerebellar cortex, and does not distinguish between E- and I-type output cells. Constructing a more complete and realistic model that includes these additional features is a major focus of ongoing experimental and theoretical studies of the mormyrid ELL.”

“The model is simplified in that it does not differentiate between two distinct classes of ELL neurons: output cells and medium ganglion (MG) cells (see Discussion).”

2) There is strong spike frequency adaptation of both ampullary afferents and their ELL targets. The cited Engelmann et al., 2010, paper quantifies the receptor SFA in detail, and the authors could discuss this with respect to how EOD rate and SFA of the ampullary receptors/ELL targets might interact.

Thanks for this comment. We failed to provide a clear explanation in the main text of why responses of ampullary afferents decrease at high EOD rates. The response of ampullary afferents to a single EOD pulse consists of an increase in firing rate followed by a longer-lasting decrease below baseline. We show in Figure 6—figure supplement 3 that the response of afferent at high rates is well approximated by a linear sum of their responses to isolated pulses. This effect could indeed be related to spike frequency adaptation. This point is now discussed in the main text, see below:

“Interestingly, the sensory responses of ELL neurons to high rate trains of EOD mimics (prior to cancellation) also exhibit a decreasing profile (Figure 2A, Figure 6—figure supplement 2). A separate set of recordings from ampullary electroreceptor afferents also revealed a decreasing profile at high EOD rates (Figure 6E, Figure 6—figure supplement 3). Responses of ampullary afferents to isolated EOD pulses consist of a firing rate increase followed by a reduction below baseline and in some cases additional smaller waves of increased and decreased firing resembling a damped oscillation (Figure 6—figure supplement 3D)(Bell and Russell, 1978). Estimating the impulse response of an ampullary afferent from its average response to a single EOD mimic and then convolving this impulse response with a sequence of EOD mimics indeed yielded a reasonable approximation to the observed responses (Figure 6—figure supplement 3A, red lines). Hence the decaying profile of the sensory response to high-rate sequences of EODs as well as the inhibitory rebound at the end of such sequences are expected features of a linear system with an impulse response resembling a damped oscillation.”

Incidentally, the Engelmann paper focuses exclusively on responses to low-frequency stimuli characteristic of prey and does not characterize responses to EOD-like stimuli (extremely brief pulses ~0.2 ms in duration).

3) With respect to the model, there are ambiguities in the model description noted in the full reviews (appended). The authors should check their model against the issues raised by the reviewers.

See below.

A further point is that 'generalization' also occurs in wave-type gymnotiform fish (Bol et al., and Mejias et al.). These papers could be cited, with discussion of how cerebellar descending input could be adapted to generalizing over different EOD frequencies (mormyrid ampullary system) vs different 'noise' amplitudes (gymnotiform wave species and maybe mormyrid tuberous system).

Thanks for this, these papers are indeed relevant and are now discussed in a new paragraph in the discussion.

“The issue of generalization has been explored in the gymnotid ELL in the context of cancellation of spatially redundant electrosensory signals, such as those generated by tail movements or conspecifics (Bol et al., 2011; Mejias et al., 2013). Such cancellation is similar to that in the mormyrid ELL in that it is mediated by anti-Hebbian plasticity at synapses between granule cells and ELL neurons (Harvey-Girard, Lewis, and Maler, 2010). However, cancellation in the gymnotid ELL is driven by electrosensory feedback to granule cells rather than by corollary discharge (Bastian, Chacron, and Maler, 2004; Chacron, Maler, and Bastian, 2005). in vivo recordings from ELL neurons in gymnotids demonstrated that cancellation remains accurate over a wide range of stimulus contrasts (as might be produced by conspecifics at different distances) (Mejias et al., 2013). Modeling was used to show how learning at one contrast could generalize to higher or lower contrasts, despite numerous nonlinearities in the system. Interestingly, features of the model identified to be important for such generalization are related to those described here for the mormyrid ELL, including granule cell response properties and a slow decay in parallel fiber synaptic strength (Mejias et al., 2013), which can be considered a form of regularization. Although responses of granule cells have not yet been measured in vivo in gymnotids, several lines of evidence suggest that they are important in relation to the specificity and generalization of learning in the gymnotid ELL (Bol et al., 2011; Mejias et al., 2013).”

Useful details will be found in the full reviews (appended).

Reviewer #1:

With respect to the negative image model and a potential role for medium ganglion cells, would a model based on the response of MGs to varying EOD rates do just as well? A serious discussion of the MGs and explanation of why they think these are not important for negative image formation and cancellation and what other role(s) they may play would be helpful.

See above.

Reviewer #2:

This manuscript investigates whether the cancellation signal that is internally generated by mormyrid weakly electric fish (i.e., the negative image) will cancel out EOD pulses that occur in rapid succession. The authors find that this is indeed the case and use modelling to make predictions as to the nature of the underlying mechanisms. Overall, I think that this is a very nice study. However, there are a few issues that should be dealt with:

– It is unclear how behaviorally relevant the situation considered here is to the animal. The authors do cite previous studies in the Introduction, but these appear to be dealing with object localization rather than prey capture. It should also be mentioned that the animal most likely does not exclusively use its ampullary system to locate prey but that information coming from both the passive and the active electrosense are likely combined.

Added sentence and reference to discussion:

“Behavioral studies suggest that multiple senses (including both the passive and active electrosensory systems) are used in concert to detect prey (von der Emde and Bleckmann, 1998).”

Finally, in looking at the authors' data, it appears that ampullary afferents and central ELL cells display strong spike frequency adaptation when EOD pulses are 15 msec apart, which might hamper their ability to detect low frequency prey stimuli. These issues need to be discussed properly.

See above.

– The model assumed by the authors uses both the "unlearned" input as well as the "learned" one. What is the physiological justification for this as I am not aware of ELL cells receiving the "unlearned" input?

We are referring to the peripheral electrosensory input to ELL neurons. Text has been added for clarification. This input is known anatomically and is central to ELL function.

“This corresponds anatomically to the input onto the basilar dendrites of ELL neurons from interneurons in the deep layers of ELL receiving somatotopic input from ampullary electroreceptor afferents (Meek, Grant, and Bell, 1999)”

– Although I like the study, it took me a few reads to fully understand the story and its implications. I suggest rewriting it such that the experimental data is presented first, then the "full" model, and the other "candidates" relegated to supplementary information. This would greatly help the reader focus on what is relevant rather than being taken through several models that do not work for various reasons.

We appreciate the reviewer’s patience and agree that model failures should not be the focus. We tried numerous different versions of the manuscript, including some in which we separated the experimental and theoretical results. In the end, we felt that integrating the experiments and the models, as we have tried to do, was best. In the revised manuscript we have added some material to the Introduction to better set up the problem and preview our main findings (per the suggestion of reviewer 2 below). We think this will help readability.

– Finally, although the modeling is compelling, it only makes a prediction as to why negative images can "generalize" to higher frequencies, the actual nature of the mechanisms remains to be determined experimentally.

We agree that the model has not been proven. On the other hand, the model does provide a functional interpretation for previously unknown and unexpected features of mossy fiber and granule cell corollary discharge responses reported here for the first time. For example, the observation from past work that mossy fibers fire identical bursts of ~8 precisely timed spike in response to each command seems odd. Why are 8 spikes needed to signal the timing of the EOD when a single spike would seemingly suffice? Here we show experimentally that the mossy fiber bursts change with EOD rate and the model provides a non-trivial, mechanistic explanation for how information encoded in the EOD command rate-dependence of the mossy fibers could be used to support generalization of learning.

“So-called early mossy fibers are known from previous studies to fire a highly-stereotyped burst of action potentials following each EOD command (Bell, Grant, and Serrier, 1992). We found that the number of spikes in such bursts declines progressively with increases in the command rate. The multiple spikes in the burst seem redundant in the context of isolated EODs. Why would multiple spikes be needed to signal the time of occurrence of an EOD command? The present work suggests that rate-dependent grading of such bursts conveys information that is important for generalization. “

Reviewer #3:

This is potentially a strong manuscript but has what appears to be a serious flaw in reasoning (or in my understanding of the literature – see below). I have followed work in this system for many years but had never thought about the underlying assumptions. Mormyrids (like gymnotiform fish) have electroreceptors (ampullary) tuned to exogenous low frequency signals (e.g., respiration movements of prey). They also have tuberous receptors that are tuned to the fish's own EOD and detect changes in local EOD amplitude caused by, e.g., conductors or non-conductors. The ampullary receptors project to one map of the ELL (VLZ). This map contains E (fusiform I think) and I (large ganglion cells I think) projection neurons. These cells also have apical dendrites that are in receipt of the EOD command signal (corollary discharge) conveyed via granule cells of the overlying cerebellum (EGp).

[…]

I think the authors treatment of cerebellar Golgi cells and unipolar brush cells is OK (for a first order model). What I would like to see is a serious discussion of the MGs and explanation of why they think these are not important for negative image formation and cancellation and what other role(s) they may play. Perhaps I have gotten mixed up re VLZ and mormyromast zone data (at least some of my knowledge re comparing ampullary and morrmyromast zone cancellation comes from direct discussions with Curt Bell, Kirsty Grant and Hans Meek, and I am not even sure that all their results were even published). Basically, I want the authors to take the MGs seriously because they are a very prominent component of VLZ circuitry and must convey corollary discharge associated input to the principle cells. And then convince me that their limited model is still a reasonable first order model that will not be completely overturned once the contribution(s) of MGs are taken into account.

Agreed. See above.

Reviewer #4:

This is a very nice study that presents novel experiments and associated modeling of negative image cancellation in the electrosensory system of mormyrid fish. The work addresses the important question of the generalizability of this cancellation from the standard low frequency sensory sampling rates used in lab settings – and by the animal at rest – to higher rates known to occur during movement. This cancellation is necessary for disentangling true external inputs from those caused by active sensing, a process that relies on a cerebellar-like corollary discharge. The authors present data that show how the animal can generalize from 10 Hz rates (training) to higher test rates.

[…]

The statement about the matching is confusing. In the Abstract, it is said that an approximate matching of the temporal dynamics (including EOD rate dependence) of corollary discharge and electrosensory inputs. This statement actually relies on Figure 6E and Figure 6—figure supplement 3. But in the Discussion, second paragraph of subsection “Mechanisms of generalization”, the matching is between the EOD rate dependence of electrosensory responses in ELL output cells and the rate dependence of the summed corollary discharge input that an ELL output cell receives from the granule cells. The difference is that the first case looks at the firing rate of sensory cells, and the second at the firing rate of their target cells. At the very least the statement needs to be clarified. But also I want to understand the nature of this "matching", in the sense of whether it is required to make things work or not. It seems to me that this hinges a lot on the method used to choose wc, i.e. the constant driving force in the regularization dynamics (subsection “Synaptic plasticity”). In other words, the ELL output cell is a function of inputs from both the electroreceptors and the granule cells, and I cannot see how to simply swap ELL and electroreceptors in the "matching" statement.

The matching is between the average granule cell response and the electrosensory inputs to ELL output cells. The electrosensory response of the output cells is very similar to the electrosensory response of the afferent fibers providing input to those cells prior to the learning of negative images, which is the reason for the confusing statements where the electrosensory response of output cells is said to match the average granule cell response. We have changed the statement in the Discussion to clarify this:

“The second feature we identified as important for generalization is an approximate matching between the EOD rate dependence of electrosensory inputs to ELL output cells and the rate dependence of the average corollary discharge response in granule cells.”

Relatedly, the choice of making wc proportional to the ELL output s(t) (subsection “Synaptic plasticity”) seems to "direct" this matching, since the ELL output must depend on receptor input. The parameter wc is important because it is part of both your key ingredients (regularization and matching). A clarification is also needed with regard to to precisely how wc is determined: if I understand correctly, the weights wi are said to evolve according to the associative learning rule (plus the regularization) which depends on the choice of wc; in turn wc is said to depend on s(t). Yet this ELL output must depend on the wi since these weights scale the 20000 granule cell inputs. So wc depends on wc: is it self-consistently determined numerically? It would also be good to know the typical size and distribution of wc values you used.

This is a simple confusion: s(t) is the “sensory only response”, not the summed response to corollary discharge and sensory input. We have added the following clarifying statement.

“We chose the value of 𝑤𝑐 depending on the ELL neuron being modeled so that the mean model granule cell response scaled by 𝑤𝑐 was approximately equal to the negative of the sensory input to the ELL neuron, that is such that 𝑤𝑐⟨𝑟𝑖(𝑡)⟩ ≈ −𝑠(𝑡). “

The second concern is about the regularization: your associative rule has Δ+ which drives the weights up proportionally to spike count. Yet you need wc to drive them back down exponentially. It seems that one can be traded off for the other to some extent: have you checked whether Δ+ can be changed to fit the data? Was it a free parameter, or was it fixed from prior experiments?

Δ+ was estimated from previous in vitro experiments, Bell, 1997 and Han, 2000. Regularization and Δ+/Δ- cannot be traded off against one another. The ratio of Δ+/Δ- sets the steady-state firing rate of the cell, and the scale of the two parameters sets the rate of learning. Regularization is a decay toward wc in this model, which can involve both potentiation and depression toward wc.

The Discussion mentions cancellation of self-generated inputs, and thus the need for generalization in other systems, including gymnotiform weakly electric fish which have no corollary discharge. Recent experimental and modeling work (Bol et al., 2011) has shown how that system can learn to cancel many frequencies at the same time using a depressing correlation rule (not exactly the anti-hebbian rule used here) – these different frequencies are analogous in some sense to the different self-generated EOD rates the mormyrids must cancel. An elaboration of that work also showed how the experimentally observed contrast invariance of the cancellation can be achieved across frequencies (i.e. generalization to contrasts not learned, and the optimal contrast to learn at – Mejias et al., 2013). These studies require regularization, yet in that case to prevent weights from decaying to zero. An optimal rate of decay of weights was numerically predicted for the best cancellation. This sweet spot might be of a similar nature as the one of 1 𝜆=10 s used by trial and error by the authors (subsection “Synaptic plasticity”).

It would be important for the authors to at least make a comparative statement to this system given its close functional proximity to theirs, briefly highlighting commonalities/contrasts between corollary discharge-driven and online driven cancellation strategies. The authors' results makes interesting predictions that may apply to this system.

See above. Thanks for this, these papers are indeed relevant and are now discussed in a new paragraph in the discussion.

Finally, I would like to see a few more words about how the authors dealt with avoiding (Results paragraph three) firing-rate rectification in the ELL output cells, which they say complicated quantitative measurements of sensory cancellation. How was the polarity of the EOD mimic adjusted "to evoke excitatory responses in both E and I cells"? Which parameter does this polarity correspond to in their model? Was this parameter fitted? Heterogeneously? I can't quite wrap my head around how big or systematic a correction this was, but such a constant may influence a lot of other things, given e.g. the choice of wc above.

Text has been added to the experimental and modeling sections of the Materials and methods section:

“In I cells, the EOD mimic often drove the firing rate to zero, making it difficult to quantify cancellation. To avoid this firing rate rectification, we reversed the EOD mimic polarity when recording from I cells, such that they responded with excitation instead of inhibition. This response reversal is due to known properties of ampullary electroreceptor afferents, which increase (or decrease) firing above (or below) their baseline rate for stimuli that make the pore of the receptor positive (or negative) with respect to the basal face within the body. Past studies have commonly used this approach to demonstrate the specificity of negative image formation in the VLZ by performing multiple pairing in the same neuron using opposite stimulus polarities (Bell, 1981, 1982; Enikolopov, Abbott, and Sawtell, 2018). Negative images are invariably observed for both stimulus polarities in such experiments, i.e. responses to the corollary discharge alone after pairing are opposite in sign to the response to the stimulus during pairing. Systematic differences between negative images and cancellation in E versus I cells or for mimics of opposite polarities have never been noted, justifying this approach to avoid rectification.”

“As discussed above, we adjusted the polarity of the sensory stimulus such that excitation was evoked in both E and I cells. Hence, no distinction was made in the model between E and I cells.”

https://doi.org/10.7554/eLife.44032.023

Article and author information

Author details

  1. Conor Dempsey

    Department of Neuroscience, Zuckerman Mind Brain Behavior Institute, Columbia University, New York, United States
    Contribution
    Conceptualization, Formal analysis, Investigation, Methodology, Writing—original draft
    Competing interests
    No competing interests declared
  2. LF Abbott

    1. Department of Neuroscience, Zuckerman Mind Brain Behavior Institute, Columbia University, New York, United States
    2. Department of Physiology and Cellular Biophysics, Columbia University, New York, United States
    Contribution
    Conceptualization, Supervision, Writing—review and editing
    Competing interests
    No competing interests declared
  3. Nathaniel B Sawtell

    Department of Neuroscience, Zuckerman Mind Brain Behavior Institute, Columbia University, New York, United States
    Contribution
    Conceptualization, Funding acquisition, Investigation, Methodology, Writing—original draft, Project administration, Writing—review and editing
    For correspondence
    ns2635@columbia.edu
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-1859-8026

Funding

National Science Foundation (1025849)

  • LF Abbott
  • Nathaniel B Sawtell

National Institute of Neurological Disorders and Stroke (NS075023)

  • Nathaniel B Sawtell

Irma T. Hirschl Trust

  • Nathaniel B Sawtell

Simons Foundation

  • LF Abbott

Gatsby Charitable Foundation

  • LF Abbott

National Science Foundation (1707398)

  • Larry F Abbott

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

This work was supported by grants from the NSF (1025849) to NBS and LFA and NIH (NS075023) to NBS and by the Irma T Hirschl Trust. LFA was further supported by the Simons and Gatsby Charitable Foundations and by NSF NeuroNex Award DBI-1707398.

Ethics

Animal experimentation: All experiments performed in this study adhere to the American Physiological Society's Guiding Principles in the Care and Use of Animals and were approved by the Columbia University Institutional Animal Care and Use Committee, protocol AAAW4462.

Senior Editor

  1. Eve Marder, Brandeis University, United States

Reviewing Editor

  1. Catherine Emily Carr, University of Maryland, United States

Reviewers

  1. Catherine Emily Carr, University of Maryland, United States
  2. Maurice J Chacron, McGill University, Canada
  3. Len Maler, University of Ottawa, Canada

Version history

  1. Received: November 29, 2018
  2. Accepted: February 21, 2019
  3. Accepted Manuscript published: March 12, 2019 (version 1)
  4. Accepted Manuscript updated: March 14, 2019 (version 2)
  5. Version of Record published: April 10, 2019 (version 3)

Copyright

© 2019, Dempsey et al.

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 1,188
    Page views
  • 201
    Downloads
  • 8
    Citations

Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Conor Dempsey
  2. LF Abbott
  3. Nathaniel B Sawtell
(2019)
Generalization of learned responses in the mormyrid electrosensory lobe
eLife 8:e44032.
https://doi.org/10.7554/eLife.44032

Further reading

    1. Neuroscience
    Mikhail Proskurin, Maxim Manakov, Alla Karpova
    Research Article Updated

    Medial frontal cortical areas are thought to play a critical role in the brain’s ability to flexibly deploy strategies that are effective in complex settings, yet the underlying circuit computations remain unclear. Here, by examining neural ensemble activity in male rats that sample different strategies in a self-guided search for latent task structure, we observe robust tracking during strategy execution of a summary statistic for that strategy in recent behavioral history by the anterior cingulate cortex (ACC), especially by an area homologous to primate area 32D. Using the simplest summary statistic – strategy prevalence in the last 20 choices – we find that its encoding in the ACC during strategy execution is wide-scale, independent of reward delivery, and persists through a substantial ensemble reorganization that accompanies changes in global context. We further demonstrate that the tracking of reward by the ACC ensemble is also strategy-specific, but that reward prevalence is insufficient to explain the observed activity modulation during strategy execution. Our findings argue that ACC ensemble dynamics is structured by a summary statistic of recent behavioral choices, raising the possibility that ACC plays a role in estimating – through statistical learning – which actions promote the occurrence of events in the environment.

    1. Neuroscience
    Connon I Thomas, Melissa A Ryan ... Benjamin Scholl
    Research Article

    Postsynaptic mitochondria are critical for the development, plasticity, and maintenance of synaptic inputs. However, their relationship to synaptic structure and functional activity is unknown. We examined a correlative dataset from ferret visual cortex with in vivo two-photon calcium imaging of dendritic spines during visual stimulation and electron microscopy reconstructions of spine ultrastructure, investigating mitochondrial abundance near functionally and structurally characterized spines. Surprisingly, we found no correlation to structural measures of synaptic strength. Instead, we found that mitochondria are positioned near spines with orientation preferences that are dissimilar to the somatic preference. Additionally, we found that mitochondria are positioned near groups of spines with heterogeneous orientation preferences. For a subset of spines with a mitochondrion in the head or neck, synapses were larger and exhibited greater selectivity to visual stimuli than those without a mitochondrion. Our data suggest mitochondria are not necessarily positioned to support the energy needs of strong spines, but rather support the structurally and functionally diverse inputs innervating the basal dendrites of cortical neurons.