Dynamic control of sequential retrieval speed in networks with heterogeneous learning rules
Abstract
Temporal rescaling of sequential neural activity has been observed in multiple brain areas during behaviors involving time estimation and motor execution at variable speeds. Temporally asymmetric Hebbian rules have been used in network models to learn and retrieve sequential activity, with characteristics that are qualitatively consistent with experimental observations. However, in these models sequential activity is retrieved at a fixed speed. Here, we investigate the effects of a heterogeneity of plasticity rules on network dynamics. In a model in which neurons differ by the degree of temporal symmetry of their plasticity rule, we find that retrieval speed can be controlled by varying external inputs to the network. Neurons with temporally symmetric plasticity rules act as brakes and tend to slow down the dynamics, while neurons with temporally asymmetric rules act as accelerators of the dynamics. We also find that such networks can naturally generate separate ‘preparatory’ and ‘execution’ activity patterns with appropriate external inputs.
eLife assessment
The authors provide a valuable analysis of what neural circuit mechanisms enable varying the speed of retrieval of sequences, which is needed in situations such as reproducing motor patterns. Their use of heterogeneous plasticity rules to allow external currents to control speed of sequence recall is a novel alternative to other mechanisms proposed in the literature. They perform a convincing characterization of relevant properties of recall via simulations and theory, though a better mapping to biologically plausible mechanisms is left for future work.
https://doi.org/10.7554/eLife.88805.3.sa0Introduction
Timing is a critical component in the proper planning and execution of temporally extended motor behaviors. In behaviors consisting of a single motor action, it may be desirable to control the duration of its execution. In behaviors composed of multiple actions, the precise time interval between actions can be a key determinant in the success of the behavior. How can the duration of these intervals be flexibly controlled in a network of neurons?
A simple mechanistic hypothesis for the regulation of motor related timing intervals posits a specialized neural circuit with network dynamics that vary in speed as a consequence of differing levels of constant external input. Several network models utilizing external input as a means of speed control have been proposed to account for cortical and striatal dynamics observed during motor execution (Murray and Escola, 2017; Wang et al., 2018). To account for speed control in cortex, a recurrent neural network model has been trained to achieve temporal rescaling of network activity as a function of external input (Wang et al., 2018). However, this model relies on supervised learning rules that may not be biologically plausible, and cannot generalize to other speeds from training on just one example timing interval. To explain speed control of sequential activity in striatum, a recurrent inhibitory network model has been proposed with a feedforward structure learned through anti-Hebbian plasticity (Murray and Escola, 2017). This model demonstrates transient winner-take-all dynamics, with short-term synaptic depression facilitating transitions in activity from one group of neurons to the next, and external input controlling the duration of each group’s transient activation. While experimental evidence for the necessary type of depressive adaptation mechanism exists in the striatum, it may not be present in all cortical areas where rescaling of sequential activity is observed. Whether speed can be controlled in network models constructed using Hebbian learning without this mechanism remains unknown.
Network models with a connectivity generated by temporally asymmetric synaptic plasticity provide a potential framework for explaining how sequential activity can arise from local biologically plausible learning rules (Sompolinsky and Kanter, 1986; Kleinfeld, 1986). In both rate and spiking networks, the temporal statistics of sequential activity in networks using this type of rule qualitatively match experimental findings made over both short and long timescales of observation in multiple tasks with timing components (Gillett et al., 2020). However, the speed of sequential dynamics in these models is constrained by the choice of temporal offset in the learning rule and neuronal time constant, and cannot be modulated with external input.
The Hebbian rules explored in this work and previous studies are approximations of various forms of spike-timing dependent plasticity (STDP). The effects of STDP can be quantified through kernels that measure the change in excitatory postsynaptic potential size at a synapse (as a proxy for synaptic strength), as a function of the timing difference between pre and postsynaptic spikes. Experimentally, a large diversity of STDP kernels have been characterized across cortical, subcortical, and cerebellar structures (Abbott and Nelson, 2000; Suvrathan et al., 2016). Kernels measured in cortex and hippocampus typically, but not always, exhibit a temporal asymmetry, in which presynaptic activity must precede postsynaptic activity to elicit a positive change in synaptic strength (Bi and Poo, 1998; Egger et al., 1999). Theoretical studies have shown that this temporal asymmetry can be used to store and retrieve sequences of activity (Jun and Jin, 2007; Liu and Buonomano, 2009; Fiete et al., 2010; Waddington et al., 2012; Zheng and Triesch, 2014; Okubo et al., 2015; Ravid Tannenbaum and Burak, 2016; Murray and Escola, 2017; Weissenberger et al., 2017; Theodoni et al., 2018; Pereira and Brunel, 2019; Tupikov and Jin, 2020; Gillett et al., 2020). However, symmetric kernels, in which coincident activity leads to strengthening regardless of the order of pre and post-synaptic spikes, have also been observed in multiple contexts - with high frequency plasticity induction protocols in cortex (Sjöström et al., 2001), in hippocampal cultures in the presence of dopamine (Zhang et al., 2009), and at excitatory-to-excitatory synapses in hippocampal CA3 (Mishra et al., 2016). Hebbian learning rules that are temporally symmetric lead instead to the creation of fixed point attractors (Hopfield, 1982; Amit and Brunel, 1997; Wang, 2001; Brunel, 2005; Pereira and Brunel, 2018). It is not known to what degree temporal asymmetry varies across synapses at the scale of local networks, but analysis of a calcium-based plasticity model demonstrates that the degree of asymmetry can be controlled via adjustment of biophysical parameters (Graupner and Brunel, 2012). We hypothesize that variability in the temporal offset expressed at a synapse may be a key ingredient in permitting the control of retrieval speed, suggesting a potential new role for the observed heterogeneity in STDP kernels.
In this work, we explore a generalization of previously investigated temporally asymmetric learning to multiple temporal offsets that captures this heterogeneity. Specifically, we find that varying the temporal asymmetry of the learning rule across synapses gives rise to network mechanisms that allow for the control of speed as a function of external inputs to the network. We start by considering a network with a bimodal distribution of heterogeneity in the learning rule, resulting in two distinct populations: one with a symmetric learning rule, and one with an asymmetric rule. We characterize the effect of input strength on retrieval speed and quality in these networks with connectivity generated using linear and nonlinear synaptic plasticity rules. We also find that transitions between fixed-point attractor-like ‘preparatory’ periods and sequential ‘execution’ phases can be realized in this model by rescaling the magnitude of external input. Finally, we demonstrate that networks with a uniform distribution of heterogeneity lead to qualitatively similar findings.
Results
Degree of symmetry in learning rule determines retrieval speed
We explore a network model in which the firing rate dynamics of each neuron ri in a population of size is described by the equation
where is the time constant of firing rate dynamics, is the connectivity matrix, is a sigmoidal neuronal transfer function (see Methods), and describes the external input provided to each neuron at time .
We follow a similar learning procedure as in Gillett et al., 2020. A sequence of random i.i.d standard Gaussian patterns is presented to the network and stored in network connectivity. This sequence of patterns modifies the strength of synaptic connections from neuron to according to a Hebbian learning rule that transforms pre and post synaptic inputs into synaptic weight changes. The resulting connectivity matrix is a generalization of previously studied rules which combines both temporally symmetric and asymmetric learning (Pereira and Brunel, 2018; Gillett et al., 2020),
where is a matrix describing the structural connectivity, whose entries are given by i.i.d. Bernoulli random variables, , , where is the connection probability; The functions and describe how the synaptic plasticity rule depends on pre and postsynaptic input patterns during learning, respectively; The parameter controls the overall strength of the recurrent connections; And describes the degree of temporal symmetry at synapses of neuron i. A neuron with fully temporally symmetric plasticity is described by , while indicates a neuron with fully temporally asymmetric plasticity. Note that we focus here to the case of a single sequence stored in synaptic connectivity, but such networks can also store multiple sequences (Gillett et al., 2020).
We first explore the bilinear learning rule scenario ( with homogeneous synaptic plasticity, i.e. for all . At the two extremes of this variable we can recover previously studied learning rules. When , only the second term in Equation 2 is present, resulting in a purely temporally asymmetric rule. Networks with connectivity constructed using such a rule can recall a sequence of stored patterns, and their sequential retrieval dynamics have been extensively characterized (Gillett et al., 2020). When , synaptic plasticity is temporally symmetric, potentially leading to fixed point attractor dynamics (Pereira and Brunel, 2018). If is instead fixed to a value between 0 and 1, then the asymmetric component in the plasticity rule leads to the retrieval of the whole sequence, but the speed at which the sequence is retrieved strongly depends on . For instance, in Figure 1b we demonstrate retrieval for an intermediate value of . Retrieval is quantified by plotting the Pearson correlation of the instantaneous firing rate with each stored pattern as a function of time (see Methods). During sequence retrieval, correlations with individual patterns in the sequence increase, peak and decrease one after the other, indicating the network transiently visit states close to each of the patterns in succession. We find that in such a network, retrieval speed strongly depends on . For the parameters in Figure 1b, retrieval proceeds nearly twice as slowly as compared to a network with connectivity arising from a purely asymmetric learning rule, where retrieval speed is fixed by the time constant of the firing rate dynamics (Gillett et al., 2020). However, retrieval speed is fixed by the choice of (see Figure 1c showing a linear dependence of speed on ), and cannot be dynamically modulated in response to changes in the external input .
Heterogeneity in synaptic plasticity temporal asymmetry gives rise to a speed control mechanism
We next explored whether adding heterogeneity to this learning rule, allowing zi to differ across synapses, can produce networks capable of both recalling stored sequences of patterns and modulating the speed of recall. We initially consider a bimodal distribution of degrees of temporal symmetry across the network. For each neuron, zi was drawn randomly and independently as a Bernoulli random variable with probability , . As a result, the network of neurons can be divided into two subpopulations of approximately equal sizes neurons, according to the learning rule present at their synapses:
where the connectivity matrix is given by
and where denotes the presynaptic population. Note that the external input now depends on the population. To reduce the space of possible learning rules, we have assumed that the type of learning at a synapse depends only on the identity of the postsynaptic neuron. The bimodal distribution of zi restricts synapses to only one of the two types of plasticity, but in the final section entitled ‘Retrieval with a broad distribution of learning rules’ we relax this constraint.
In Figure 1e, we show an example of how the stored sequence can be retrieved under different input conditions. In both the top and bottom panels of 1e, network activity is initialized to the first pattern in the sequence, and a constant external input is provided to each subpopulation (‘asymmetric’ input , and ‘symmetric’ input ). In the top panel, the symmetric population is effectively silenced with strongly negative input, resulting in retrieval that lasts approximately , consistent with the dynamics being driven purely by the asymmetric component in the learning rule (Gillett et al., 2020). In the bottom panel, this input is no longer strongly negative, causing retrieval time to more than double, due to the effect of the symmetric population that tends to slow down the dynamics. Retrieval in both conditions is robust to noise, as shown in Figure 1E, in which noisy inputs to neurons strongly perturb single neuron firing rates but leave sequence retrieval intact at both speeds (see Methods).
To characterize how retrieval time depends on these two sources of external input, we explored the space of the parameters defining the inputs to the network, and . In Figure 2, we show the dependence of retrieval quality and speed on these variables. Retrieval quality is quantified by measuring the maximal correlation of the final pattern in the retrieved sequence. Retrieval speed is measured in units of the inverse of the neural time constant, . It is computed by measuring the average inverse time between the peaks of consecutive correlations of the network state with consecutive patterns in the sequence. For example, a speed of 0.5 corresponds to an average time difference of between the peaks of the correlations of two consecutive retrieved patterns with network state. In the upper left quadrant of Figure 2b, speed depends primarily on the strength of input to the symmetric population. Moving away from this region in the direction of increasing symmetric input, retrieval speed slows down to approximately 0.5. In the lower right quadrant, retrieval speed instead depends primarily on the strength of external input provided to the asymmetric population. As this negative input grows, retrieval speed becomes approximately four times slower than the speed of the purely asymmetric network. In Figure 2, we have focused on the region in which external inputs are negative. This is because in our model external inputs are expressed relative to the threshold, and this region leads to biologically plausible average firing rates that are much smaller than the maximal firing rates (see Methods). While we have focused on negative input in Figure 2, retrieval speed is also modulated by positive input. Interestingly, it is the magnitude, not sign, of the input that determines retrieval speed. Expanding the phase diagram in panel (b) to positive input shows that the same dependence holds: values for retrieval speed are approximately symmetric about the and axes (not shown).
Flexible retrieval with a non-linear plasticity rule
We next considered the consequences of a nonlinear learning rule implemented by the following presynaptic and postsynaptic functions in Equation 2:
where is the Heaviside function. This rule binarizes the activity patterns according to a threshold, and its effects on persistent and sequential network activity have been studied extensively (Lim et al., 2015; Pereira and Brunel, 2018; Gillett et al., 2020). The parameter qg is chosen such that , which keeps the mean connection strength at zero. The general dependency of retrieval speed on asymmetric and symmetric inputs in a network utilizing this rule is similar to that of the bilinear rule (see Figure 2). One key difference is that a much wider range of speeds can be achieved using a nonlinear rule within the same retrieval quality bounds (see Methods). In fact, retrieval speed can now be arbitrarily slowed down, and even completely stopped when the input to the asymmetric population is sufficiently negative (see white dots in Figure 3b). In this region, persistent activity is stable, and there exists a fixed point attractor correlated with any of the patterns in any stored sequence. There also exists a region in which sequential activity stops in the middle of retrieval and switches to stable persistent activity (see hatched diagonal lines in Figure 3b). Note that retrieval is not considered to be successful in this region (as the sequence is not fully retrieved), and so it is plotted in black.
Temporally varying external inputs can lead to transitions between persistent and sequential activity
We next explored how this heterogeneity might be used not only to control the speed of dynamics, but also to trigger transitions between qualitatively different dynamics. In Figure 4, we use the same nonlinear model as in the previous section, and present discrete, time-dependent inputs intended to achieve persistent retrieval of a single pattern, followed by sequential retrieval of the remaining patterns at a specified time. To initiate persistent activity, we briefly present the first pattern as an input to the symmetric population. This elicits persistent activity in this population, as reflected by the sustained positive correlation of the symmetric population with the first pattern during the first 200ms (Figure 4b). This activity does not recruit sequential activity in either population, however, as the asymmetric population responsible for that transition is presented with sufficiently strong negative input during this period. To initiate sequential activity, inhibition to the asymmetric population is released after s, prompting the network to retrieve the stored sequence in both populations.
Note that in this scenario also, a sequence can be retrieved at various speeds, using the same inputs during the persistent period, but changing the level of constant stimulation provided during retrieval (compare left and right panels in Figure 4b). As in a network with only a single asymmetric population, single neuron activity in this network is temporally sparse, with many neurons being active only at specific time intervals (Figure 4c).
In our network, stability of persistent activity requires the dependence of the plasticity rule on pre and/or post synaptic firing rates to be non-linear. With a bilinear learning rule and Gaussian patterns, the network dynamics does not converge to fixed-point attractors that are correlated with a single pattern, but rather to mixed states correlated with multiple patterns (Amit et al., 1985).
The dynamics shown in Figure 4 reproduces some of the landmark features observed in electrophysiological recordings during delayed motor tasks. In such tasks, a preparatory period follows presentation of a cue (e.g. instructing a target direction or a desired response speed), during which the animal can prepare the motor response, but not execute it (Churchland et al., 2012). This period is typically characterized by persistent activity of specific groups of neurons, whereas during motor execution those same neurons instead display transient activity (Svoboda and Li, 2018).
Flexible sequence retrieval in networks with a continuous distribution of degrees of temporal symmetry
Up to this point, we have analyzed a network model in which neurons are separated in two discrete classes distinguished by their plasticity rule (symmetric or asymmetric). For a given postsynaptic neuron, the learning rule present at all presynaptic synapses was chosen to be either temporally symmetric or asymmetric with equal probability, defining two distinct subpopulations of neurons. Can retrieval speed still be modulated by external input when synapses do not fall into such a binary classification, but have more heterogeneous properties? To model this heterogeneity, we chose to embed a continuum of learning rules. Instead of a bimodal distribution for zi in Equation 2, we choose a uniform distribution on the interval . The input provided to each neuron i in Equation 1 is a linear combination of symmetric and asymmetric input components: . We also choose to investigate a network with the previously described non-linear plasticity rule. Figure 5 shows that a network with these modifications also exhibits flexible sequence retrieval, and that speed decreases as the asymmetric input component becomes more negative. However, as shown in Figure 5c, to reach slower speeds a positive is now required. Note that a region of stable persistent activity is no longer present in this scenario, as stable persistent activity requires that a finite fraction of neurons in the network have a symmetric plasticity rule.
Learning external input strengths using a reward-based plasticity rule
The low-dimensional external inputs used to regulate speed are unrelated to the stored sequential input patterns. This suggests that a mapping from external inputs to retrieval speed can be learned independently from a particular set of sequential patterns. We demonstrated that a reinforcement learning rule can be used to converge to external input values implementing a desired speed (Figure 6). By using a reward signal measuring how similar retrieval is to the desired speed, the rule adjusts initially random external inputs to the appropriate values over the course of multiple trial repetitions (see Methods for details). Critically, once these external input values are learned, they can be used to modulate the retrieval speed of other stored sequences without having to relearn this mapping.
Flexible retrieval of sequences in a spiking network
We have until now focused exclusively on rate networks that do not obey Dale’s law. We now turn to networks composed of excitatory and inhibitory spiking neurons, as a more realistic model of neurobiological networks. We implemented learning in excitatory to excitatory synaptic connectivity, generalizing the procedure described in Gillett et al., 2020 to two excitatory subpopulations. We found that successful speed control can be obtained in such networks using biases in external inputs to symmetric and asymmetric populations, as in the simpler rate model described above. Figure 7 shows network simulations using two different external input configurations, leading to sequence retrieval at two different speeds. Interestingly, small external input biases () relative to the difference in spiking threshold and resting potential () are sufficient to generate a temporal rescaling of as large as ∼ 2.
Discussion
In this paper, we have introduced a new mechanism for flexible control of retrieval speed in networks storing sequences. This mechanism relies on heterogeneity of synaptic plasticity rules across neurons in the network, with different degrees of temporal asymmetry. Neurons with temporally symmetric plasticity act as brakes of the dynamics, as they stabilize network activity in its current state, while neurons with temporally asymmetric plasticity act instead as accelerators, as they push the network toward the next pattern in the sequence. The speed of retrieval can then be modified in a flexible way by changing external inputs driving these two types of neurons. Furthermore, we found that this mechanism can be used to gate transitions between persistent and sequential activity. We showed that appropriate inputs can be learned using a reinforcement learning scheme. Finally, we also showed that networks of spiking neurons can generate the same behavior, provided the excitatory network is subdivided in asymmetric and symmetric neurons.
Heterogeneity of synaptic plasticity
Our findings suggest a potential functional role for the experimentally observed diversity in synaptic plasticity rules (Bi and Poo, 1998; Abbott and Nelson, 2000; Sjöström et al., 2001; Mishra et al., 2016; Suvrathan et al., 2016). In particular, a wide diversity of spike-timing dependent plasticity (STDP) curves have been reported in various brain structures, and sometimes in the same structure. In the hippocampus, temporally asymmetric STDP is typically observed in cultures (Bi and Poo, 1998) or in CA3 to CA1 connections in slices in some conditions, but temporally symmetric STDP is observed in area CA3 (Mishra et al., 2016). Interestingly, the degree of temporal symmetry at CA3 to CA1 connections can be modulated by extracellular calcium concentration (Inglebert et al., 2020) and post-synaptic bursting (Wittenberg and Wang, 2006; Inglebert et al., 2020). In the cerebellum, synaptic plasticity rules with diverse temporal requirements on the time difference between parallel fiber and climbing fiber inputs have been found in Purkinje cells in different zones of this structure suvrathan16. While this heterogeneity has been found so far across structures or across different regions in the same structure, this heterogeneity could also be present within local networks, as current experimental methods for probing plasticity only have access to a single delay between pre and post-synaptic spikes in each recorded neuron, and would therefore miss this heterogeneity.
For simplicity, the degree of temporal asymmetry was chosen in our model to depend only on the identity of the postsynaptic neuron. This is consistent with the observation that a model of synaptic plasticity that depends only on the postsynaptic concentration of calcium can account for a range of experimentally observed STDP curves (Graupner and Brunel, 2012). This suggests that heterogeneities in temporal asymmetry could arise due to heterogeneities in biophysical parameters that control calcium dynamics in post-synaptic spines.
Comparison with other mechanisms of speed control
The mechanism investigated here is distinct from previously described models of input-driven speed control. It does not require adaptation mechanisms or delays to slow down retrieval of subsequent patterns (Sompolinsky and Kanter, 1986; Murray and Escola, 2017). It also does not require presentation of multiple exemplars spanning the desired range of retrieval speeds in order to find the appropriate network structure (Wang et al., 2018). However, the mapping between external input strength and retrieval speed must be learned in order for the network to be able to perform retrieval at desired speeds. Unlike the model explored in Wang et al., 2018, however, once this mapping is learned, it can be used to control the speed of other stored sequences.
Another recent study (Beiran et al., 2023) has investigated how a recurrent network could flexibly control its temporal dynamics using a different approach. They trained a low-rank recurrent network using back-propagation through time to produce specific dynamics with flexible timing, and showed that the resulting network can then be flexibly controlled by a one-dimensional input. It would be interesting to investigate whether the low-rank structure found in such a manner exhibits similarities with the synaptic connectivity structure in our model.
Future experimental work could analyze the evolution of neural activity across the training of interval timing tasks, and evaluate whether it is consistent with such a reinforcement-based rule.
Experimental predictions
This mechanism presented here makes several predictions regarding the relationship between plasticity rules, external input, and the speed of network dynamics. One prediction is that retrieval speed could be modified by providing different external inputs to each population (asymmetric and symmetric). In vivo, these populations could be identified using the dependence of mean firing rates on speed of retrieval - neurons who increase their rates with slower/faster retrieval speeds would be predicted to be the symmetric/asymmetric neurons, respectively. Targeting one class of neurons or the other, using holographic techniques (see e.g. Marshel et al., 2019) would then be expected to increase or decrease the speed of retrieval. Another prediction is that these cells have distinct profiles of temporal asymmetry in their synaptic plasticity. The model presented here also predicts the existence of ‘null’ input directions, for which no change in retrieval speed is expected as external input is changed. When moving along these ‘null’ directions, single neurons would only be expected to change their temporal firing patterns, but without affecting the speed of retrieval.
Transitions between persistent and sequential activity
Heterogeneity in the learning rule also provides a mechanism that enables input changes to drive transitions in activity states. An example of such a transition is frequently reported in primary motor cortex (M1) during delayed reaching tasks, where a preparatory period with persistent activity or ramping dynamics is followed by an execution period with transient, sequential dynamics (Riehle and Requin, 1989; Li et al., 2016). We demonstrated how an input change can gate such a transition in a simple network model composed of neurons with two distinct plasticity rules, the first temporally symmetric, and the second temporally asymmetric. At the start of the preparatory period, asymmetric neurons are inhibited, and a transient specific input elicits persistent activity in symmetric neurons. When inhibition is removed, asymmetric neurons become activated and drive a transition to sequential activity in both types of neurons.
Inhibitory gating has been previously hypothesized as a mechanism to control the initiation of execution period activity. Analysis of M1 activity suggests that local inhibitory interneurons do not engage in this gating, as putative inhibitory neurons do not appear to be preferentially active during the preparatory period compared to the execution period (Kaufman et al., 2013). However, this does not rule out the possibility that the necessary inhibition could arise from other external inputs to M1. It is also possible that inhibition may not be required at all. Effective silencing of the asymmetric neurons could occur by a reduction of excitatory input during the preparatory period. Recent work in mice suggests that thalamocortical interactions may be a potential candidate for driving the required transition. Recorded activity in motor thalamus during a reaching task shows that at movement onset, thalamus activity is negatively correlated with premotor activity, but positively correlated with activity in M1 (Nashef et al., 2021). In a separate directional licking task, thalamus projections were shown to be required for initiating cued movement, and mimicked presentation of the cue when optogenetically stimulated (Inagaki et al., 2022). An alternative model for transitions between preparatory and execution activity has recently been proposed (Bachschmid-Romano et al., 2023), in which external inputs trigger a switch between a preparatory state and a nearly orthogonal execution state. However, in the model of Bachschmid-Romano et al., 2023, the execution epoch is described by a single pattern, and any temporal dynamics within this epoch is inherited from external inputs, while in the present paper the temporal dynamics during the execution phase is generated by the recurrent connectivity structure.
Limitations and future directions
We have focused here on a simple learning scenario in which a temporally asymmetric plasticity rule imprints a sequence of external input patterns into the recurrent synaptic connectivity. In real neuronal networks, one expects recurrent synaptic inputs to shape the response of a network to external inputs, and therefore how such inputs sculpt recurrent connectivity. Studying such a learning process is outside the scope of this paper, but is an important topic for future work.
In this paper, we have focused on Hebbian specific synaptic plasticity rules to store a sequence of input patterns. Another fruitful approach to investigate learning and memory in neural circuits was introduced by Gardner, 1988. In Gardner’s approach, the idea is to consider the space of all possible connectivity matrices that store a given set of memories as fixed point attractor states. It was later shown that the statistics of the connectivity matrix in attractor networks with sign-constrained synapses optimizing the storage capacity and/or robustness of learning is in striking agreement with cortical data - in particular, the resulting connectivity is sparse, with an overrepresentation of bidirectional motifs in pairs of neurons, compared to random directed Erdos-Renyi networks (Brunel, 2016). However, in networks storing sequences, no such overrepresentation exists (Brunel, 2016). It will be interesting to investigate the statistics of connectivity in networks with flexibility constraints, such that sequences can be retrieved at different speeds, or with a coexistence of fixed point attractor dynamics with sequential retrieval.
Methods
Neuronal transfer function
The neuronal transfer function is given by the sigmoidal function
where determines the input at which the neuron fires at half the maximal value , and is inversely proportional to the gain. This function was chosen for continuity with previous work (Gillett et al., 2020). We expect that using qualitatively similar functions should not alter the results of this paper.
Noisy inputs
We introduce noisy inputs to each neuron in Figures 1e and 2c through independent realizations of an Ornstein-Uhlenbeck process with a mean equal to either or , respectively, with standard deviation of 0.3, and a correlation time constant of 4 ms. This noise leads to fluctuations of firing rate that are comparable to rate fluctuations induced by sequence retrieval (Figure 2c), while leaving sequence retrieval intact (Figure 1e).
Measuring pattern correlations
To compute the Pearson pattern correlation , we compute the overlap of each of the stored patterns with the instantaneous firing rates for the entire population and divide by the standard deviation of firing rate activity: . In Figures 3 and 4, we compute the correlations separately for each subpopulation.
Measuring retrieval speed
To measure retrieval speed in Figures 2, 3 and 5, we recorded the times at which each pattern correlation attained its peak value, and computed the average time difference between the peaks of successive patterns in a sequence. We then divided the time constant of the rate dynamics by this averaged value in order to convert speed into units of :
To account for simulations with dynamics that did not have well-defined correlation peaks (typically observed at extreme storage loads or with persistent activity), we excluded peak time differences that exceeded two standard deviations of the average difference value. If no peak time difference passed this criteria, the sequence was considered not retrieved (black regions in Figures 2, 3 and 5).
Mean-field theory of single-population network with variable degree of temporal asymmetry
In this section we derive a mean-field theory for the single population network with homogeneous synaptic plasticity. This is a generalization of the theory derived for a purely temporally asymmetric network Gillett et al., 2020. We define order parameters and , describing the average overlap of network activity with pattern and the average squared firing rate, respectively.
Using Equations 1 and 2, we derive equations describing the temporal evolution of the overlaps (for ),
where is a ‘noise’ term due to patterns in the sequence (see Gillett et al., 2020 for details) By making the following change of variables:
in which we have defined , we obtain
where
Assuming that , which is the case during successful retrieval (see also Gillett et al., 2020), then we can simplify to:
This equation makes it clear that retrieval speed depends linearly on , that is on the balance between the symmetric and asymmetric components of synaptic plasticity.
Mean-field theory of heterogeneous network and conditions for retrieval
Mean-field theory can be used to further analyze retrieval speed dynamics, along the lines of Gillett et al., 2020. We define order parameters and , describing the average overlap of network activity in subpopulation with pattern and the average squared firing rate in subpopulation , respectively. The equations for the overlaps are given by:
where Gis given, for arbitrary transfer functions by:
For the transfer function used in this paper, Equation 9, the expression simplifies,
As in the previous section, and are ‘noise’ terms due to patterns in the sequence, which also depends on the average squared firing rates and . Using Equations 17 and 18, we can derive the dynamics of the combined population overlap :
To compute the boundary for successful retrieval given by the white line in Figure 2, we analyze this equation when the gains are constant: . Plugging in and rearranging, we find:
This equation shows that the sequence can only be retrieved if , otherwise the peak of the overlaps decay to zero with increasing . Thus retrieval of an asymptotically long sequence is successful if the gain converges to a value greater or equal to one during retrieval. This condition can only be satisfied if
To test for successful sequence retrieval in Figure 2, we computed the maximal correlation value of the final pattern , and compared this value to a threshold . If the value fell below this threshold, then retrieval was considered unsuccessful, and was denoted by a black square. This threshold criterion was also used in Figures 3 and 5.
Reward-driven learning
A simple perturbation-based reinforcement learning rule is used to demonstrate that external inputs can be generated that produce network dynamics at a desired target speed over the course of multiple trial repetitions. We simulate a series of trials with stochastically varying external inputs. At each trial , the external inputs used in the previous trial are perturbed randomly,
where λ is the strength of the perturbation, and are uniformly distributed random variables over the interval , drawn independently for each population at each trial . If these external inputs lead to an improvement in speed compared to previous trials, then
else,
In Figure 6, the correlation threshold , the target speed , and . On the first trial , the external inputs are taken to be and (open circle in Figure 6).
Network of excitatory and inhibitory spiking neurons
We simulated a network of excitatory and inhibitory leaky integrate-and-fire (LIF) neurons similar to the one described described in the Appendix of Gillett et al., 2020 (sections 3 and 4) with a few differences described below.
In this network, the dynamics of the membrane potential of neuron i () in population () are governed by the following equations:
where , controls the synaptic delay, (t) controls the time-dependent external input drive, controls the refractory period, is the Heaviside function, and is a white noise input with zero mean and unit variance density.
Excitatory neurons are divided into two (asymmetric and symmetric) populations of equal size (), with connectivity matrices given by the following, where is the rectified synaptic transfer function defined in the procedure and :
The excitatory populations receive external input that depends on their identity, and on the retrieval configuration. For slow retrieval, we set the input equal to and for asymmetric and symmetric neurons, respectively. For fast retrieval, we use and . In inhibitory neurons, we use .
The learning strength () is set to .25, which result in changes to the following parameters: , , . All other parameter values are identical to those documented in Table 7c of the referenced Appendix (Gillett et al., 2020).
Data availability
The current manuscript is a computational study, so no data have been generated for this manuscript. Code to run the simulations and build the figures have been uploaded to Github at https://github.com/maxgillett/dynamic_speed_control (copy archived at Gillett, 2024).
References
-
Synaptic plasticity: taming the beastNature Neuroscience 3:1178–1183.https://doi.org/10.1038/81453
-
Spin-glass models of neural networksPhysical Review A 32:1007–1018.https://doi.org/10.1103/PhysRevA.32.1007
-
BookCourse 10 - network models of memoryIn: Chow CC, Gutkin B, Hansel D, Meunier C, editors. Les Houches. Methods and Models in Neurophysics. Elsevier. pp. 407–476.https://doi.org/10.1016/S0924-8099(05)80016-2
-
Is cortical connectivity optimized for storing information?Nature Neuroscience 19:749–755.https://doi.org/10.1038/nn.4286
-
The space of interactions in neural network modelsJournal of Physics A 21:257–270.https://doi.org/10.1088/0305-4470/21/1/030
-
SoftwareDynamic_speed_control, version swh:1:rev:462af73bfeb54c0a8b88522988f8b83c002a49d4Software Heritage.
-
The roles of monkey M1 neuron classes in movement preparation and executionJournal of Neurophysiology 110:817–825.https://doi.org/10.1152/jn.00892.2011
-
Inferring learning rules from distributions of firing rates in cortical neuronsNature Neuroscience 18:1804–1810.https://doi.org/10.1038/nn.4158
-
Embedding multiple trajectories in simulated recurrent neural networks in a self-organizing mannerThe Journal of Neuroscience 29:13172–13181.https://doi.org/10.1523/JNEUROSCI.2358-09.2009
-
Unsupervised learning of persistent and sequential activityFrontiers in Computational Neuroscience 13:97.https://doi.org/10.3389/fncom.2019.00097
-
Shaping neural circuits by high order synaptic interactionsPLOS Computational Biology 12:e1005056.https://doi.org/10.1371/journal.pcbi.1005056
-
Temporal association in asymmetric neural networksPhysical Review Letters 57:2861–2864.https://doi.org/10.1103/PhysRevLett.57.2861
-
Neural mechanisms of movement planning: motor cortex and beyondCurrent Opinion in Neurobiology 49:33–41.https://doi.org/10.1016/j.conb.2017.10.023
-
Triphasic spike-timing-dependent plasticity organizes networks to produce robust sequences of neural activityFrontiers in Computational Neuroscience 6:88.https://doi.org/10.3389/fncom.2012.00088
-
Synaptic reverberation underlying mnemonic persistent activityTrends in Neurosciences 24:455–463.https://doi.org/10.1016/s0166-2236(00)01868-3
-
Flexible timing by temporal scaling of cortical responsesNature Neuroscience 21:102–110.https://doi.org/10.1038/s41593-017-0028-6
-
Long synfire chains emerge by spike-timing dependent plasticity modulated by population activityInternational Journal of Neural Systems 27:1750044.https://doi.org/10.1142/S0129065717500447
-
Malleability of spike-timing-dependent plasticity at the CA3-CA1 synapseThe Journal of Neuroscience 26:6610–6617.https://doi.org/10.1523/JNEUROSCI.5388-05.2006
-
Robust development of synfire chains from multiple plasticity mechanismsFrontiers in Computational Neuroscience 8:66.https://doi.org/10.3389/fncom.2014.00066
Article and author information
Author details
Funding
National Institutes of Health (R01 EB022891)
- Maxwell Gillett
- Nicolas Brunel
Office of Naval Research (N00014-16-1-2327)
- Maxwell Gillett
- Nicolas Brunel
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
- Version of Record published:
Cite all versions
You can cite all versions using the DOI https://doi.org/10.7554/eLife.88805. This DOI represents all versions, and will always resolve to the latest one.
Copyright
© 2023, Gillett and Brunel
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 322
- views
-
- 19
- downloads
-
- 2
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Neuroscience
Female sexual receptivity is essential for reproduction of a species. Neuropeptides play the main role in regulating female receptivity. However, whether neuropeptides regulate female sexual receptivity during the neurodevelopment is unknown. Here, we found the peptide hormone prothoracicotropic hormone (PTTH), which belongs to the insect PG (prothoracic gland) axis, negatively regulated virgin female receptivity through ecdysone during neurodevelopment in Drosophila melanogaster. We identified PTTH neurons as doublesex-positive neurons, they regulated virgin female receptivity before the metamorphosis during the third-instar larval stage. PTTH deletion resulted in the increased EcR-A expression in the whole newly formed prepupae. Furthermore, the ecdysone receptor EcR-A in pC1 neurons positively regulated virgin female receptivity during metamorphosis. The decreased EcR-A in pC1 neurons induced abnormal morphological development of pC1 neurons without changing neural activity. Among all subtypes of pC1 neurons, the function of EcR-A in pC1b neurons was necessary for virgin female copulation rate. These suggested that the changes of synaptic connections between pC1b and other neurons decreased female copulation rate. Moreover, female receptivity significantly decreased when the expression of PTTH receptor Torso was reduced in pC1 neurons. This suggested that PTTH not only regulates female receptivity through ecdysone but also through affecting female receptivity associated neurons directly. The PG axis has similar functional strategy as the hypothalamic–pituitary–gonadal axis in mammals to trigger the juvenile–adult transition. Our work suggests a general mechanism underlying which the neurodevelopment during maturation regulates female sexual receptivity.
-
- Neuroscience
Theoretical computational models are widely used to describe latent cognitive processes. However, these models do not equally explain data across participants, with some individuals showing a bigger predictive gap than others. In the current study, we examined the use of theory-independent models, specifically recurrent neural networks (RNNs), to classify the source of a predictive gap in the observed data of a single individual. This approach aims to identify whether the low predictability of behavioral data is mainly due to noisy decision-making or misspecification of the theoretical model. First, we used computer simulation in the context of reinforcement learning to demonstrate that RNNs can be used to identify model misspecification in simulated agents with varying degrees of behavioral noise. Specifically, both prediction performance and the number of RNN training epochs (i.e., the point of early stopping) can be used to estimate the amount of stochasticity in the data. Second, we applied our approach to an empirical dataset where the actions of low IQ participants, compared with high IQ participants, showed lower predictability by a well-known theoretical model (i.e., Daw’s hybrid model for the two-step task). Both the predictive gap and the point of early stopping of the RNN suggested that model misspecification is similar across individuals. This led us to a provisional conclusion that low IQ subjects are mostly noisier compared to their high IQ peers, rather than being more misspecified by the theoretical model. We discuss the implications and limitations of this approach, considering the growing literature in both theoretical and data-driven computational modeling in decision-making science.