Abstract
Temporal rescaling of sequential neural activity has been observed in multiple brain areas during behaviors involving time estimation and motor execution at variable speeds. Temporally asymmetric Hebbian rules have been used in network models to learn and retrieve sequential activity, with characteristics that are qualitatively consistent with experimental observations. However, in these models sequential activity is retrieved at a fixed speed. Here, we investigate the effects of a heterogeneity of plasticity rules on network dynamics. In a model in which neurons differ by the degree of temporal symmetry of their plasticity rule, we find that retrieval speed can be controlled by varying external inputs to the network. Neurons with temporally symmetric plasticity rules act as brakes and tend to slow down the dynamics, while neurons with temporally asymmetric rules act as accelerators of the dynamics. We also find that such networks can naturally generate separate ‘preparatory’ and ‘execution’ activity patterns with appropriate external inputs.
Introduction
Timing is a critical component in the proper planning and execution of temporally extended motor behaviors. In behaviors consisting of a single motor action, it may be desirable to control the duration of its execution. In behaviors composed of multiple actions, the precise time interval between actions can be a key determinant in the success of the behavior. How can the duration of these intervals be flexibly controlled in a network of neurons?
A simple mechanistic hypothesis for the regulation of motor related timing intervals posits a specialized neural circuit with network dynamics that vary in speed as a consequence of differing levels of constant external input. Several network models utilizing external input as a means of speed control have been proposed to account for cortical and striatal dynamics observed during motor execution [1, 2]. To account for speed control in cortex, a recurrent neural network model has been trained to achieve temporal rescaling of network activity as a function of external input [2]. However, this model relies on supervised learning rules that may not be biologically plausible, and cannot generalize to other speeds from training on just one example timing interval. To explain speed control of sequential activity in striatum, a recurrent inhibitory network model has been proposed with a feedforward structure learned through anti-Hebbian plasticity [1]. This model demonstrates transient winner-take-all dynamics, with short term synaptic depression facilitating transitions in activity from one group of neurons to the next, and external input controlling the duration of each group’s transient activation. While experimental evidence for the necessary type of depressive adaptation mechanism exists in the striatum, it may not be present in all cortical areas where rescaling of sequential activity is observed. Whether speed can be controlled in network models constructed using Hebbian learning without this mechanism remains unknown.
Network models with a connectivity generated by temporally asymmetric synaptic plasticity provide a potential framework for explaining how sequential activity can arise from local biologically plausible learning rules [3, 4]. In both rate and spiking networks, the temporal statistics of sequential activity in networks using this type of rule qualitatively match experimental findings made over both short and long timescales of observation in multiple tasks with timing components [5]. However, the speed of sequential dynamics in these models is constrained by the choice of temporal offset in the learning rule and neuronal time constant, and cannot be modulated with external input.
The Hebbian rules explored in this work and previous studies are approximations of various forms of spike-timing dependent plasticity (STDP). The effects of STDP can be quantified through kernels that measure the change in excitatory postsynaptic potential size at a synapse (as a proxy for synaptic strength), as a function of the timing difference between pre and postsynaptic spikes. Experimentally, a large diversity of STDP kernels have been characterized across cortical, subcortical, and cerebellar structures [6, 7]. Kernels measured in cortex and hippocampus typically, but not always, exhibit a temporal asymmetry, in which presynaptic activity must precede postsynaptic activity to elicit a positive change in synaptic strength [8, 9]. Theoretical studies have shown that this temporal asymmetry can be used to store and retrieve sequences of activity [1, 5, 10–20]. However, symmetric kernels, in which coincident activity leads to strengthening regardless of the order of pre and post-synaptic spikes, have also been observed in multiple contexts - with high frequency plasticity induction protocols in cortex [21], in hippocampal cultures in the presence of dopamine [22], and at excitatory-to-excitatory synapses in hippocampal CA3 [23]. Hebbian learning rules that are temporally symmetric lead instead to the creation of fixed point attractors [24–28].
It is not known to what degree temporal asymmetry varies across synapses at the scale of local networks, but analysis of a calcium-based plasticity model demonstrates that the degree of asymmetry can be controlled via adjustment of biophysical parameters [29]. We hypothesize that variability in the temporal offset expressed at a synapse may be a key ingredient in permitting the control of retrieval speed, suggesting a potential new role for the observed heterogeneity in STDP kernels.
In this work, we explore a generalization of previously investigated temporally asymmetric learning to multiple temporal offsets that captures this heterogeneity. Specifically, we find that varying the temporal asymmetry of the learning rule across synapses gives rise to network mechanisms that allow for the control of speed as a function of external inputs to the network. We start by considering a network with a bimodal distribution of heterogeneity in the learning rule, resulting in two distinct populations: one with a symmetric learning rule, and one with an asymmetric rule. We characterize the effect of input strength on retrieval speed and quality in these networks with connectivity generated using linear and nonlinear synaptic plasticity rules. We also find that transitions between fixed-point attractor-like “preparatory” periods and sequential “execution” phases can be realized in this model by rescaling the magnitude of external input. Finally, we demonstrate that networks with a uniform distribution of heterogeneity lead to qualitatively similar findings.
Degree of symmetry in learning rule determines retrieval speed
We explore a network model in which the firing rate dynamics of each neuron ri in a population of size N is described by the equation
where τ is the time constant of firing rate dynamics, Jij is the connectivity matrix, ϕ (x) is a sigmoidal neuronal transfer function (see Methods), and describes the external input provided to each neuron at time t.
We follow a similar learning procedure as in [5]. A sequence of P random i.i.d standard Gaussian patterns is presented to the network and stored in network connectivity. This sequence of patterns modifies the strength of synaptic connections Jij from neuron j to i according to a Hebbian learning rule that transforms pre and post synaptic inputs into synaptic weight changes. The resulting connectivity matrix Jij is a generalization of previously studied rules which combines both temporally symmetric and asymmetric learning [5, 28],
where cij is a matrix describing the structural connectivity, whose entries are given by i.i.d. Bernoulli random variables, p(cij = 1) = c, p(cij = 0) = 1 − c, where c is the connection probability; The functions g(x) and f (x) describe how the synaptic plasticity rule depends on pre and postsynaptic input patterns during learning, respectively; The parameter A controls the overall strength of the recurrent connections. And zi ∈ [0, 1] describes the degree of temporal symmetry at synapses of neuron i. A neuron with fully temporally symmetric plasticity is described by zi = 1, while zi = 0 indicates a neuron with fully temporally asymmetric plasticity. Note that we focus here to the case of a single sequence stored in synaptic connectivity, but such networks can also store multiple sequences [5].
We first explore the bilinear learning rule scenario (f (x) = g(x) = x) with homogeneous synaptic plasticity, i.e. zi = z for all i = 1, …, N . At the two extremes of this variable we can recover previously studied learning rules. When z = 0, only the second term in Eq. 2 is present, resulting in a purely temporally asymmetric rule. Networks with connectivity constructed using such a rule can recall a sequence of stored patterns, and their sequential retrieval dynamics have been extensively characterized [5]. When z = 1, synaptic plasticity is temporally symmetric, potentially leading to fixed point attractor dynamics [28]. If z is instead fixed to a value between 0 and 1, then the asymmetric component in the plasticity rule leads to the retrieval of the whole sequence, but the speed at which the sequence is retrieved strongly depends on z. For instance, in Fig. 1b we demonstrate retrieval for an intermediate value of z = 0.5. Retrieval is quantified by plotting the Pearson correlation of the instantaneous firing rate r(t) with each stored pattern ξµ as a function of time (see Methods). During sequence retrieval, correlations with individual patterns in the sequence increase, peak and decrease one after the other, indicating the network transiently visit states close to each of the patterns in succession. We find that in such a network, retrieval speed strongly depends on z. For the parameters in Fig. 1b, retrieval proceeds nearly twice as slowly as compared to a network with connectivity arising from a purely asymmetric learning rule, where retrieval speed is fixed by the time constant of the firing rate dynamics [5]. However, retrieval speed is fixed by the choice of z (see Fig. 1c showing a linear dependence of speed on z), and cannot be dynamically modulated in response to changes in the external input.
Heterogeneity in synaptic plasticity temporal asymmetry gives rise to a speed control mechanism
We next explored whether adding heterogeneity to this learning rule, allowing zi to differ across synapses, can produce networks capable of both recalling stored sequences of patterns and modulating the speed of recall. We initially consider a bimodal distribution of degrees of temporal symmetry across the network. For each neuron, zi was drawn randomly and independently as a Bernoulli random variable with probability p(zi = 1) = 0.5, p(zi = 0) = 0.5. As a result, the network of N neurons can be divided into two subpopulations of approximately equal sizes neurons, according to the learning rule present at their synapses:
where the connectivity matrix is given by
and where X = a, s denotes the presynaptic population. Note that the external input now depends on the population. To reduce the space of possible learning rules, we have assumed that the type of learning at a synapse depends only on the identity of the postsynaptic neuron. The bimodal distribution of zi restricts synapses to only one of the two types of plasticity, but in the final section entitled “Retrieval with a broad distribution of learning rules” we relax this constraint.
In Fig. 1e, we show an example of how the stored sequence can be retrieved under different input conditions. In both the top and bottom panels of 1e, network activity is initialized to the first pattern in the sequence, and a constant external input is provided to each subpopulation (“asymmetric” input , and “symmetric” input). In the top panel, the symmetric population is effectively silenced with strongly negative input, resulting in retrieval that lasts approximately τ P, consistent with the dynamics being driven purely by the asymmetric component in the learning rule [5]. In the bottom panel, this input is no longer strongly negative, causing retrieval time to more than double, due to the effect of the symmetric population that tends to slow down the dynamics. Retrieval in both conditions is robust to noise, as shown in Fig. 1E, in which noisy inputs to neurons strongly perturb single neuron firing rates but leave sequence retrieval intact at both speeds (see Methods).
To characterize how retrieval time depends on these two sources of external input, we explored the space of the parameters defining the inputs to the network, Ia and Is. In Fig. 2, we show the dependence of retrieval quality and speed on these variables. Retrieval quality is quantified by measuring the maximal correlation of the final pattern in the retrieved sequence. Retrieval speed is measured in units of the inverse of the neural time constant, τ −1. It is computed by measuring the average inverse time between the peaks of consecutive correlations of the network state with consecutive patterns in the sequence. For example, a speed of 0.5 corresponds to an average time difference of 2τ between the peaks of the correlations of two consecutive retrieved patterns with network state. In the upper left quadrant of Fig. 2b, speed depends primarily on the strength of input to the symmetric population. Moving away from this region in the direction of increasing symmetric input, retrieval speed slows down to approximately 0.5. In the lower right quadrant, retrieval speed instead depends primarily on the strength of external input provided to the asymmetric population. As this negative input grows, retrieval speed becomes approximately four times slower than the speed of the purely asymmetric network. In Fig. 2 we have focused on the region in which external inputs are negative. This is because in our model external inputs are expressed relative to the threshold, and this region leads to biologically plausible average firing rates that are much smaller than the maximal firing rates (see Methods). While we have focused on negative input in Fig. 2, retrieval speed is also modulated by positive input. Interestingly, it is the magnitude, not sign, of the input that determines retrieval speed. Expanding the phase diagram in panel (b) to positive input shows that the same dependence holds: values for retrieval speed are approximately symmetric about the and axes (not shown).
Flexible retrieval with a non-linear plasticity rule
We next considered the consequences of a nonlinear learning rule implemented by the following presynaptic and postsynaptic functions in Eq. 2:
where Θ(x) is the Heaviside function. This rule binarizes the activity patterns ξ according to a threshold, and its effects on persistent and sequential network activity have been studied extensively [5, 28, 30]. The parameter qg is chosen such that ∫ Dzg(z) = 0, which keeps the mean connection strength at zero. The general dependency of retrieval speed on asymmetric and symmetric inputs in a network utilizing this rule is similar to that of the bilinear rule (see Fig. 2). One key difference is that a much wider range of speeds can be achieved using a nonlinear rule within the same retrieval quality bounds (see Methods). In fact, retrieval speed can now be arbitrarily slowed down, and even completely stopped when the input to the asymmetric population is sufficiently negative (see white dots in Fig. 3b). In this region, persistent activity is stable, and there exists a fixed point attractor correlated with any of the patterns in any stored sequence. There also exists a region in which sequential activity stops in the middle of retrieval and switches to stable persistent activity (see hatched diagonal lines in Fig. 3b). Note that retrieval is not considered to be successful in this region (as the sequence is not fully retrieved), and so it is plotted in black.
Temporally varying external inputs can lead to transitions between persistent and sequential activity
We next explored how this heterogeneity might be used not only to control the speed of dynamics, but also to trigger transitions between qualitatively different dynamics. In Fig. 4 we use the same nonlinear model as in the previous section, and present discrete, time-dependent inputs intended to achieve persistent retrieval of a single pattern, followed by sequential retrieval of the remaining patterns at a specified time. To initiate persistent activity, we briefly present the first pattern as an input to the symmetric population. This elicits persistent activity in this population, as reflected by the sustained positive correlation of the symmetric population with the first pattern during the first 200ms (Fig. 4b). This activity does not recruit sequential activity in either population, however, as the asymmetric population responsible for that transition is presented with sufficiently strong negative input during this period. To initiate sequential activity, inhibition to the asymmetric population is released after t = 0.2s, prompting the network to retrieve the stored sequence in both populations.
Note that in this scenario also, a sequence can be retrieved at various speeds, using the same inputs during the persistent period, but changing the level of constant stimulation provided during retrieval (compare left and right panels in Fig. 4b). As in a network with only a single asymmetric population, single neuron activity in this network is temporally sparse, with many neurons being active only at specific time intervals (Fig. 4c).
In our network, stability of persistent activity requires the dependence of the plasticity rule on pre and/or post synaptic firing rates to be non-linear. With a bilinear learning rule and Gaussian patterns, the network dynamics does not converge to fixed-point attractors that are correlated with a single pattern, but rather to mixed states correlated with multiple patterns [31].
The dynamics shown in Fig. 4 reproduces some of the landmark features observed in electrophysiological recordings during delayed motor tasks. In such tasks, a preparatory period follows presentation of a cue (e.g. instructing a target direction or a desired response speed), during which the animal can prepare the motor response, but not execute it [32]. This period is typically characterized by persistent activity of specific groups of neurons, whereas during motor execution those same neurons instead display transient activity [33].
Flexible sequence retrieval in networks with a continuous distribution of degrees of temporal symmetry
Up to this point, we have analyzed a network model in which neurons are separated in two discrete classes distinguished by their plasticity rule (symmetric or asymmetric). For a given postsynaptic neuron, the learning rule present at all presynaptic synapses was chosen to be either temporally symmetric or asymmetric with equal probability, defining two distinct subpopulations of neurons. Can retrieval speed still be modulated by external input when synapses do not fall into such a binary classification, but have more heterogeneous properties? To model this heterogeneity, we chose to embed a continuum of learning rules. Instead of a bimodal distribution for zi in Eq. 2, we choose a uniform distribution on the interval [0, 1]. The input provided to each neuron i in Eq. 1 is a linear combination of symmetric and asymmetric input compo-nents:. We also choose to investigate a network with the previously described non-linear plasticity rule. Fig. 5 shows that a network with these modifications also exhibits flexible sequence retrieval, and that speed decreases as the asymmetric input component becomes more negative. However, as shown in Fig. 5c, to reach slower speeds a positive is now required. Note that a region of stable persistent activity is no longer present in this scenario, as stable persistent activity requires that a finite fraction of neurons in the network have a symmetric plasticity rule.
Learning external input strengths using a reward-based plasticity rule
The low-dimensional external inputs used to regulate speed are unrelated to the stored sequential input patterns. This suggests that a mapping from external inputs to retrieval speed can be learned independently from a particular set of sequential patterns. We demonstrated that a reinforcement learning rule can be used to converge to external input values implementing a desired speed (Fig. 6). By using a reward signal measuring how similar retrieval is to the desired speed, the rule adjusts initially random external inputs to the appropriate values over the course of multiple trial repetitions (see Methods for details). Critically, once these external input values are learned, they can be used to modulate the retrieval speed of other stored sequences without having to relearn this mapping.
Flexible retrieval of sequences in a spiking network
We have until now focused exclusively on rate networks that do not obey Dale’s law. We now turn to networks composed of excitatory and inhibitory spiking neurons, as a more realistic model of neurobiological networks. We implemented learning in excitatory to excitatory synaptic connectivity, generalizing the procedure described in ref. [5] to two excitatory subpopulations.
We found that successful speed control can be obtained in such networks using biases in external inputs to symmetric and asymmetric populations, as in the simpler rate model described above. Figure 7 shows network simulations using two different external input configurations, leading to sequence retrieval at two different speeds. Interestingly, small external input biases (∼ 1mV) relative to the difference in spiking threshold and resting potential (20mV) are sufficient to generate a temporal rescaling of as large as ∼ 2.
Discussion
In this paper, we have introduced a new mechanism for flexible control of retrieval speed in networks storing sequences. This mechanism relies on heterogeneity of synaptic plasticity rules across neurons in the network, with different degrees of temporal asymmetry. Neurons with temporally symmetric plasticity act as brakes of the dynamics, as they stabilize network activity in its current state, while neurons with temporally asymmetric plasticity act instead as accelerators, as they push the network towards the next pattern in the sequence. The speed of retrieval can then be modified in a flexible way by changing external inputs driving these two types of neurons. Furthermore, we found that this mechanism can be used to gate transitions between persistent and sequential activity. We showed that appropriate inputs can be learned using a reinforcement learning scheme. Finally, we also showed that networks of spiking neurons can generate the same behavior, provided the excitatory network is subdivided in asymmetric and symmetric neurons.
Heterogeneity of synaptic plasticity
Our findings suggest a potential functional role for the experimentally observed diversity in synaptic plasticity rules [6–8, 21, 23]. In particular, a wide diversity of spike-timing dependent plasticity (STDP) curves have been reported in various brain structures, and sometimes in the same structure. In the hippocampus, temporally asymmetric STDP is typically observed in cultures [8] or in CA3 to CA1 connections in slices in some conditions, but temporally symmetric STDP is observed in area CA3 [23]. Interestingly, the degree of temporal symmetry at CA3 to CA1 connections can be modulated by extracellular calcium concentration [34] and post-synaptic bursting [34, 35]. In the cerebellum, synaptic plasticity rules with diverse temporal requirements on the time difference between parallel fiber and climbing fiber inputs have been found in Purkinje cells in different zones of this structure [7]. While this heterogeneity has been found so far across structures or across different regions in the same structure, this heterogeneity could also be present within local networks, as current experimental methods for probing plasticity only have access to a single delay between pre and post-synaptic spikes in each recorded neuron, and would therefore miss this heterogeneity.
For simplicity, the degree of temporal asymmetry was chosen in our model to depend only on the identity of the postsynaptic neuron. This is consistent with the observation that a model of synaptic plasticity that depends only on the postsynaptic concentration of calcium can account for a range of experimentally observed STDP curves [29]. This suggests that heterogeneities in temporal asymmetry could arise due to heterogeneities in biophysical parameters that control calcium dynamics in post-synaptic spines.
Comparison with other mechanisms of speed control
The mechanism investigated here is distinct from previously described models of input-driven speed control. It does not require adaptation mechanisms or delays to slow down retrieval of subsequent patterns [1, 3]. It also does not require presentation of multiple exemplars spanning the desired range of retrieval speeds in order to find the appropriate network structure [2]. However, the mapping between external input strength and retrieval speed must be learned in order for the network to be able to perform retrieval at desired speeds. Unlike the model explored in [2], however, once this mapping is learned, it can be used to control the speed of other stored sequences.
Another recent study [36] has investigated how a recurrent network could flexibly control its temporal dynamics using a different approach. They trained a low-rank recurrent network using back-propagation through time to produce specific dynamics with flexible timing, and showed that the resulting network can then be flexibly controlled by a one-dimensional input. It would be interesting to investigate whether the low-rank structure found in such a manner exhibits similarities with the synaptic connectivity structure in our model.
Future experimental work could analyze the evolution of neural activity across the training of interval timing tasks, and evaluate whether it is consistent with such a reinforcement-based rule.
Experimental predictions
This mechanism presented here makes several predictions regarding the relationship between plasticity rules, external input, and the speed of network dynamics. One prediction is that retrieval speed could be modified by providing different external inputs to each population (asymmetric and symmetric). In vivo, these populations could be identified using the dependence of mean firing rates on speed of retrieval neurons who increase their rates with slower/faster retrieval speeds would be predicted to be the symmetric/asymmetric neurons, respectively. Targeting one class of neurons or the other, using holographic techniques (see e.g. [37]) would then be expected to increase or decrease the speed of retrieval. Another prediction is that these cells have distinct profiles of temporal asymmetry in their synaptic plasticity. The model presented here also predicts the existence of “null” input directions, for which no change in retrieval speed is expected as external input is changed. When moving along these “null” directions, single neurons would only be expected to change their temporal firing patterns, but without affecting the speed of retrieval.
Transitions between persistent and sequential activity
Heterogeneity in the learning rule also provides a mechanism that enables input changes to drive transitions in activity states. An example of such a transition is frequently reported in primary motor cortex (M1) during delayed reaching tasks, where a preparatory period with persistent activity or ramping dynamics is followed by an execution period with transient, sequential dynamics [38, 39]. We demonstrated how an input change can gate such a transition in a simple network model composed of neurons with two distinct plasticity rules, the first temporally symmetric, and the second temporally asymmetric. At the start of the preparatory period, asymmetric neurons are inhibited, and a transient specific input elicits persistent activity in symmetric neurons. When inhibition is removed, asymmetric neurons become activated and drive a transition to sequential activity in both types of neurons.
Inhibitory gating has been previously hypothesized as a mechanism to control the initiation of execution period activity. Analysis of M1 activity suggests that local inhibitory interneurons do not engage in this gating, as putative inhibitory neurons do not appear to be preferentially active during the preparatory period compared to the execution period [40]. However, this does not rule out the possibility that the necessary inhibition could arise from other external inputs to M1. It is also possible that inhibition may not be required at all. Effective silencing of the asymmetric neurons could occur by a reduction of excitatory input during the preparatory period. Recent work in mice suggests that thalamocortical interactions may be a potential candidate for driving the required transition. Recorded activity in motor thalamus during a reaching task shows that at movement onset, thalamus activity is negatively correlated with premotor activity, but positively correlated with activity in M1 [41]. In a separate directional licking task, thalamus projections were shown to be required for initiating cued movement, and mimicked presentation of the cue when optogenetically stimulated [42]. An alternative model for transitions between preparatory and execution activity has recently been proposed [43], in which external inputs trigger a switch between a preparatory state and a nearly orthogonal execution state. However, in the model of ref. [43], the execution epoch is described by a single pattern, and any temporal dynamics within this epoch is inherited from external inputs, while in the present paper the temporal dynamics during the execution phase is generated by the recurrent connectivity structure.
Limitations and future directions
We have focused here on a simple learning scenario in which a temporally asymmetric plasticity rule imprints a sequence of external input patterns into the recurrent synaptic connectivity. In real neuronal networks, one expects recurrent synaptic inputs to shape the response of a network to external inputs, and therefore how such inputs sculpt recurrent connectivity. Studying such a learning process is outside the scope of this paper, but is an important topic for future work.
In this paper, we have focused on Hebbian specific synaptic plasticity rules to store a sequence of input patterns. Another fruitful approach to investigate learning and memory in neural circuits was introduced by Elizabeth Gardner[44]. In Gardner’s approach, the idea is to consider the space of all possible connectivity matrices that store a given set of memories as fixed point attractor states. It was later shown that the statistics of the connectivity matrix in attractor networks with sign-constrained synapses optimizing the storage capacity and/or robustness of learning is in striking agreement with cortical data - in particular, the resulting connectivity is sparse, with an over-representation of bidirectional motifs in pairs of neurons, compared to random directed Erdos-Renyi networks [45]. However, in networks storing sequences, no such overrepresentation exists [45]. It will be interesting to investigate the statistics of connectivity in networks with flexibility constraints, such that sequences can be retrieved at different speeds, or with a coexistence of fixed point attractor dynamics with sequential retrieval.
Methods
Neuronal transfer function
The neuronal transfer function is given by the sigmoidal function
where θ determines the input at which the neuron fires at half the maximal value rmax, and σ is inversely proportional to the gain. This function was chosen for continuity with previous work [5]. We expect that using qualitatively similar functions should not alter the results of this paper.
Noisy inputs
We introduce noisy inputs to each neuron in Figs. 1e, 2c through independent realizations of an Ornstein-Uhlenbeck process with a mean equal to either or, respectively, with standard deviation of 0.3, and a correlation time constant of 4ms. This noise leads to fluctuations of firing rate that are comparable to rate fluctuations induced by sequence retrieval (Fig. 2c), while leaving sequence retrieval intact (Fig. 1e).
Measuring pattern correlations
To compute the Pearson pattern correlation mµ(t), we compute the overlap of each of the stored patterns ξµ with the instantaneous firing rates for the entire population and divide by the standard deviation of firing rate activity: . In Figs. 3 and 4, we compute the correlations separately for each subpopulation.
Measuring retrieval speed
To measure retrieval speed v in Figs. 2, 3, and 5, we recorded the times at which each pattern correlation attained its peak value, and computed the average time difference between the peaks of successive patterns in a sequence. We then divided the time constant of the rate dynamics by this averaged value in order to convert speed into units of τ −1:
To account for simulations with dynamics that did not have well-defined correlation peaks (typically observed at extreme storage loads or with persistent activity), we excluded peak time differences that exceeded two standard deviations of the average difference value. If no peak time difference passed this criteria, the sequence was considered not retrieved (black regions in Figs. 2, 3, and 5).
Mean-field theory of single-population network with variable degree of temporal asymmetry
In this section we derive a mean-field theory for the single population network with homogeneous synaptic plasticity. This is a generalization of the theory derived for a purely temporally asymmetric network [5]. We define order parameters qµ(t) = 𝔼 (r(t)ξµ) and M = 𝔼 ((r)2(t)), describing the average overlap of network activity with pattern ξµ and the average squared firing rate, respectively.
Using equations (1,2), we derive equations describing the temporal evolution of the overlaps (for l ∈ {2, .., P }),
where Rl is a ‘noise’ term due to patterns µ ≠ l in the sequence (see [5] for details) By making the following change of variables:
in which we have defined, we obtain
where
Assuming that G(x) ≈ 1, which is the case during successful retrieval (see also [5]), then we can simplify to:
This equation makes it clear that retrieval speed depends linearly on z, i.e. on the balance between the symmetric and asymmetric components of synaptic plasticity.
Mean-field theory of heterogeneous network and conditions for retrieval
Mean-field theory can be used to further analyze retrieval speed dynamics, along the (lines of [5]. We define order parameters and MX = 𝔼 ((rX)2(t)), describing the average overlap of network activity in subpopulation X with pattern ξX,µ and the average squared firing rate in subpopulation X, respectively. The equations for the overlaps are given by:
where G is given, for arbitrary transfer functions ϕ by:
For the transfer function used in this paper, Eq. (9), the expression simplifies,
As in the previous section, and are ‘noise’ terms due to patterns µ ≠ l in the sequence, which also depends on the average squared firing rates Ma and Ms. Using Eqs. 17-18, we can derive the dynamics of the combined population overlap:
To compute the boundary for successful retrieval given by the white line in Fig. 2, we analyze this equation when the gains are constant: Gx(t) = Gx. Plugging in and rearranging, we find:
This equation shows that the sequence can only be retrieved if Ga/(1 − Gs) > 1, otherwise the peak of the overlaps decay to zero with increasing l. Thus retrieval of an asymptotically long sequence is successful if the gain converges to a value greater or equal to one during retrieval. This condition can only be satisfied if
To test for successful sequence retrieval in Fig. 2, we computed the maximal correlation value of the final pattern mP (t), and compared this value to a threshold θP = 0.05. If the value fell below this threshold, then retrieval was considered unsuccessful, and was denoted by a black square. This threshold criterion was also used in Figures 3 and 5.
Reward-driven learning
A simple perturbation-based reinforcement learning rule is used to demonstrate that external inputs can be generated that produce network dynamics at a desired target speed over the course of multiple trial repetitions. We simulate a series of trials with stochastically varying external inputs. At each trial n, the external inputs used in the previous trial are perturbed randomly,
where λ is the strength of the perturbation, and are uniformly distributed random variables over the interval [− 1, 1], drawn independently for each population p ∈ {a, s} at each trial n. If these external inputs lead to an improvement in speed compared to previous trials, then
else,
In Fig. 6, the correlation threshold θm = 0.05, the target speed vtarget = {0.3, 0.8}, and λ = 0.1. On the first trial (n = 0), the external inputs are taken to be and (open circle in Fig. 6).
Network of excitatory and inhibitory spiking neurons
We simulated a network of excitatory and inhibitory leaky integrate-and-fire (LIF) neurons similar to the one described described in the Appendix of ref. [5] (sections 3 and 4) with a few differences described below.
In this network, the dynamics of the membrane potential of neuron i (i = 1, …, Nα) in population α (α = E, I) are governed by the following equations:
where α, β ∈ {E, I}, D controls the synaptic delay, Iα(t) controls the time-dependent external input drive, τrp controls the refractory period, Θ(x) is the Heaviside function, and Wα(t) is a white noise input with zero mean and unit variance density.
Excitatory neurons are divided into two (asymmetric and symmetric) populations of equal size (NE = Na + Ns), with connectivity matrices given by the following, where ω is the rectified synaptic transfer function defined in the procedure and X ∈ {a, s}:
The excitatory populations receive external input that depends on their identity, and on the retrieval configuration. For slow retrieval, we set the input equal to and for asymmetric and sym-metric neurons, respectively. For fast retrieval, we use and . In inhibitory neurons, we use.
The learning strength (AEE) is set to 4.25, which result in changes to the following parameters: JIE/KIE = 0.134,. All other parameter values are identical to those documented in Table 7c of the referenced Appendix [5].
Data Availability
Code to generate the figures is available at GitHub, https://www.github.com/maxgillett/dynamic speed control.
References
- [1]Learning multiple variable-speed sequences in striatum via cortical tutoringeLife 6https://doi.org/10.7554/eLife.26084
- [2]Flexible timing by temporal scaling of cortical responsesNature Neuroscience 21:102–110https://doi.org/10.1038/s41593-017-0028-6
- [3]Temporal Association in Asymmetric Neural NetworksPhysical Review Letters 57:2861–2864https://doi.org/10.1103/PhysRevLett.57.2861
- [4]Sequential state generation by model neural networksProceedings of the National Academy of Sciences 83:9469–9473https://doi.org/10.1073/pnas.83.24.9469
- [5]Characteristics of sequential activity in networks with temporally asymmetric Hebbian learningProceedings of the National Academy of Sciences https://doi.org/10.1073/pnas.1918674117
- [6]Synaptic plasticity: Taming the beastNature Neuroscience 3:1178–1183https://doi.org/10.1038/81453
- [7]Timing Rules for Synaptic Plasticity Matched to Behavioral FunctionNeuron 92:959–967https://doi.org/10.1016/j.neuron.2016.10.022
- [8]Synaptic Modifications in Cultured Hippocampal Neurons: Dependence on Spike Timing, Synaptic Strength, and Postsynaptic Cell TypeJournal of Neuroscience 18:10464–10472https://doi.org/10.1523/JNEUROSCI.18-24-10464.1998
- [9]Coincidence detection and changes of synaptic efficacy in spiny stellate neurons in rat barrel cortexNature Neuroscience 2:1098–1105https://doi.org/10.1038/16026
- [10]Development of Neural Circuitry for Precise Temporal Sequences through Spontaneous Activity, Axon Remodeling, and Synaptic PlasticityPLoS ONE 2https://doi.org/10.1371/journal.pone.0000723
- [11]Embedding Multiple Trajectories in Simulated Recurrent Neural Networks in a Self-Organizing MannerJournal of Neuroscience 29:13172–13181https://doi.org/10.1523/JNEUROSCI.2358-09.2009
- [12]Spike-Time-Dependent Plasticity and Heterosynaptic Competition Organize Networks to Produce Long Scale-Free Sequences of Neural ActivityNeuron 65:563–576https://doi.org/10.1016/j.neuron.2010.02.003
- [13]Triphasic spike-timing-dependent plasticity organizes networks to produce robust sequences of neural activityFrontiers in Computational Neuroscience 6https://doi.org/10.3389/fncom.2012.00088
- [14]Robust development of synfire chains from multiple plasticity mechanismsFrontiers in Computational Neuroscience 8https://doi.org/10.3389/fncom.2014.00066
- [15]Growth and splitting of neural sequences in songbird vocal developmentNature 528:352–357https://doi.org/10.1038/nature15741
- [16]Shaping neural circuits by high order synaptic interactionsPLOS Computational Biology 12https://doi.org/10.1371/journal.pcbi.1005056
- [17]Long Synfire Chains Emerge by Spike-Timing Dependent Plasticity Modulated by Population ActivityInternational Journal of Neural Systems 27https://doi.org/10.1142/S0129065717500447
- [18]Theta-modulation drives the emergence of connectivity patterns underlying replay in a network model of place cellseLife 7https://doi.org/10.7554/eLife.37388
- [19]Unsupervised Learning of Persistent and Sequential ActivityFrontiers in Computational Neuroscience 13https://doi.org/10.3389/fncom.2019.00097
- [20]Addition of new neurons and the emergence of a local neural circuit for precise timingPreprint, Neuroscience
- [21]Rate, Timing, and Cooperativity Jointly Determine Cortical Synaptic PlasticityNeuron 32:1149–1164https://doi.org/10.1016/S0896-6273(01)00542-6
- [22]Gain in sensitivity and loss in temporal contrast of STDP by dopaminergic modulation at hippocampal synapsesProc. Natl. Acad. Sci. U.S.A 106:13028–13033
- [23]Symmetric spike timing-dependent plasticity at CA3–CA3 synapses optimizes storage and recall in autoassociative networksNature Communications 7https://doi.org/10.1038/ncomms11552
- [24]Neural networks and physical systems with emergent collective computational abilitiesProceedings of the National Academy of Sciences 79:2554–2558https://doi.org/10.1073/pnas.79.8.2554
- [25]Model of global spontaneous activity and local structured activity during delay periods in the cerebral cortexCerebral Cortex 7:237–252https://doi.org/10.1093/cercor/7.3.237
- [26]Synaptic reverberation underlying mnemonic persistent activityTrends in Neurosciences 24:455–463https://doi.org/10.1016/S0166-2236(00)01868-3
- [27]Course 10 - Network Models of MemoryLes Houches. Methods and Models in Neurophysics Elsevier :407–476
- [28]Attractor Dynamics in Networks with Learning Rules Inferred from In Vivo DataNeuron 99:227–2384https://doi.org/10.1016/j.neuron.2018.05.038
- [29]Calcium-based plasticity model explains sensitivity of synaptic changes to spike pattern, rate, and dendritic locationProceedings of the National Academy of Sciences 109:3991–3996https://doi.org/10.1073/pnas.1109359109
- [30]Inferring learning rules from distributions of firing rates in cortical neuronsNature Neuroscience 18:1804–1810https://doi.org/10.1038/nn.4158
- [31]Spin-glass models of neural networksPhysical Review A 32:1007–1018
- [32]Neural population dynamics during reachingNature 487:51–56https://doi.org/10.1038/nature11129
- [33]Neural mechanisms of movement planning: Motor cortex and beyondCurrent Opinion in Neurobiology 49:33–41https://doi.org/10.1016/j.conb.2017.10.023
- [34]Synaptic plasticity rules with physiological calcium levelsProc Natl Acad Sci U S A 117:33639–33648
- [35]Malleability of spike-timing-dependent plasticity at the CA3-CA1 synapseJ. Neurosci 26:6610–6617
- [36]Parametric control of flexible timing through low-dimensional neural manifoldsNeuron 111:739–753
- [37]Cortical layer-specific critical dynamics triggering perceptionScience 365
- [38]Monkey primary motor and premotor cortex: Single-cell activity related to prior information about direction and extent of an intended movementJournal of Neurophysiology 61:534–549https://doi.org/10.1152/jn.1989.61.3.534
- [39]Robust neuronal dynamics in premotor cortex during motor planningNature 532:459–464https://doi.org/10.1038/nature17643
- [40]The roles of monkey M1 neuron classes in movement preparation and executionJournal of Neurophysiology 110:817–825https://doi.org/10.1152/jn.00892.2011
- [41]Area-specific thalamocortical synchronization underlies the transition from motor planning to executionProceedings of the National Academy of Sciences of the United States of America 118https://doi.org/10.1073/pnas.2012658118
- [42]A midbrain-thalamus-cortex circuit reorganizes cortical dynamics to initiate movementCell 185:1065–108123https://doi.org/10.1016/j.cell.2022.02.006
- [43]Interplay between external inputs and recurrent dynamics during movement preparation and execution in a network model of motor cortexeLife 12https://doi.org/10.7554/eLife.77690
- [44]The phase space of interactions in neural network modelsJ. Phys. A: Math. Gen 21:257–270
- [45]Is cortical connectivity optimized for storing information? NatNeurosci 19:749–755
Article and author information
Author information
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
- Version of Record published:
Copyright
© 2023, Maxwell Gillett & Nicolas Brunel
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 457
- downloads
- 45
- citations
- 2
Views, downloads and citations are aggregated across all versions of this paper published by eLife.