Abstract
The principle of efficient coding posits that sensory cortical networks are designed to encode maximal sensory information with minimal metabolic cost. Despite the major influence of efficient coding in neuro-science, it has remained unclear whether fundamental empirical properties of neural network activity can be explained solely based on this normative principle. Here, we rigorously derive the structural, coding, biophysical and dynamical properties of excitatory-inhibitory recurrent networks of spiking neurons that emerge directly from imposing that the network minimizes an instantaneous loss function and a time-averaged performance measure enacting efficient coding. The optimal network has biologically-plausible biophysical features, including realistic integrate-and-fire spiking dynamics, spike-triggered adaptation, and a non-stimulus-specific excitatory external input regulating metabolic cost. The efficient network has excitatory-inhibitory recurrent connectivity between neurons with similar stimulus tuning implementing feature-specific competition, similar to that recently found in visual cortex. Networks with unstructured connectivity cannot reach comparable levels of coding efficiency. The optimal biophysical parameters include 4 to 1 ratio of excitatory vs inhibitory neurons and 3 to 1 ratio of mean inhibitory-to-inhibitory vs. excitatory-to-inhibitory connectivity that closely match those of cortical sensory networks. The efficient network has biologically-plausible spiking dynamics, with a tight instantaneous E-I balance that makes them capable to achieve efficient coding of external stimuli varying over multiple time scales. Together, these results explain how efficient coding may be implemented in cortical networks and suggests that key properties of biological neural networks may be accounted for by efficient coding.
Introduction
Information about the sensory world is represented in the brain through the dynamics of neural population activity1,2. One prominent theory about the principles that may guide the design of neural computations for sensory function is efficient coding3,4,5. This theory posits that neural computations are optimized to maximize the information that neural systems encode about features of sensory stimuli while at the same time limiting the metabolic cost. Efficient coding has been highly influential, especially in visual neuroscience and computational vision6,7,8,9, and has been developed to become a normative theory of how networks are organized and designed to optimally process natural sensory stimuli in visual 10,11, auditory12 and olfactory sensory pathways13.
The first normative neural network models4,11 designed with efficient coding principles had at least two major levels of abstractions. First, information was assumed to be processed in a purely feedforward manner, whereas information processing in real neural circuits often involves recurrent or feedback computations. Second, neural dynamics was greatly simplified, ignoring the spiking nature of neural activity. Instead, in biological networks considerable amount of information are encoded or transmitted only through the millisecond-precise timing of spikes14,15,16,17,18,19,20. Also, these earlier works mostly considered encoding of static sensory stimuli, whereas the sensory environment changes continuously at multiple timescales and the dynamics of neural networks encodes these temporal variations of the environment 21,22,23,24.
Recent years have witnessed a considerable effort and success in laying down the mathematical tools and methodology to understand how to formulate efficient coding theories of neural networks with much more biological realism25. This work has established the incorporation of recurrent connectivity26,27, of spiking neurons, and of time-varying stimulus inputs28,29,30,31,32,33,34,35. In these models, the efficient coding principle has been implemented by designing networks whose activity minimizes the encoding accuracy (the error between a desired representation and a linear readout of network’s activity) subject to a constraint on the metabolic cost of processing (proportional to the total number of spikes fired by a population of neurons). This double objective is captured by a loss function that trades-off encoding accuracy and metabolic cost. The minimization of the loss function is performed through a greedy approach, by assuming that a neuron will emit a spike only if this will decrease the loss. This, in turn, yields a set of leaky integrate-and-fire (LIF) neural equations which govern the network dynamics28, which can also include biologically plausible non-instantaneous synaptic delays36,35,34. These previous implementations, however, had neurons that did not respect Dale’s law. In recent work37, we further extended the biological plausibility of these models by analytically deriving how to implement efficient coding in networks of spiking neurons that respect Dale’s law. These networks take the form of generalized leaky integrate-and-fire (gLIF) models of excitatory (E) and inhibitory (I) neurons endowed with spike-triggered adaptation38,39,40, which can provide highly accurate predictions of spike times in biological networks41. Efficient spiking models have thus the potential to provide a unifying theory of neural coding through spiking dynamics of E-I circuits 42,37 with elements that are fully biologically plausible and potentially interpretable as biophysical variables.
However, despite the major progress described above, as well the progress provided by other studies of efficient coding with spikes 31,43,44,33,29, we still lack a thorough characterization of which structural, coding, biophysical and dynamical properties of excitatory-inhibitory recurrent networks of spiking neurons are directly related to efficient coding principles. Previous studies only rarely made predictions that could be quantitatively compared against experimentally measurable properties of biological neural networks. As a consequence, we still do not know which, if any, fundamental properties of cortical networks emerge directly from imposing efficient coding.
To address the above questions, we analyze systematically our biologically plausible efficient coding model of E and I neurons that respect Dale’s law37 to make concrete predictions about experimentally measurable structural, coding and dynamical features of neural activity that arise from efficient coding. We systematically investigated how experimentally measurable emergent dynamical properties, such as firing rates, trial-to-trial spiking variability of single neurons, E-I balance45 and noise correlations, relate to optimally-efficient coding. We further analyze how the organization of the connectivity arising by imposing efficient coding relates to the anatomical and effective connectivity recently reported in visual cortex, which suggests competition between excitatory neurons with similar stimulus tuning. We found that several key and robustly found empirical properties of cortical circuits match the predictions of our efficient coding network, lending support to the notion that efficient coding may be a design principle that has shaped the evolution of cortical circuits and that may be used to conceptually understand and interpret them.
Results
Assumptions and emergent properties of the efficient E-I network derived from first principles
We study the properties of a spiking neural network in which the dynamics and structure of the network are analytically derived starting from first principles of efficient coding of sensory stimuli. The model relies on a number of assumptions, described next.
The network responds to M time-varying features of a sensory stimulus, sk(t) (e.g., for a visual stimulus, contrast, orientation, etc) received as inputs from an earlier sensory area (e.g., retina). We model features as independent Ornstein–Uhlenbeck (OU) processes (see Methods). The network’s objective is to compute a leaky integration of sensory features, a relevant computation of cortical sensory areas46. The target representations of the network, xk(t), are defined as
with τ a characteristic integration time-scale (Fig. 1A).

Structural and dynamical properties of the efficient E-I spiking network.
(A) Encoding of a target signal representing the evolution of a stimulus feature (top) with one E (middle) and one I spiking neuron (bottom). The target signal x(t) integrates the input signal s(t). The readout of the E neuron tracks the target signal and the readout of the I neuron tracks the readout of the E neuron. Neurons spike to bring the readout of their activity closer to their respective target. Each spike causes a jump of the readout, with the sign and the amplitude of the jump being determined by neuron’s tuning parameters.
(B) Schematic of the matrix of tuning parameters. Every neuron is selective to all stimulus features (columns of the matrix), and all neurons participate in encoding of every feature (rows).
(C) Schematic of the network with E (red) and I (blue) cell type. E neurons are driven by the stimulus features while I neurons are driven by the activity of E neurons. E and I neurons are connected through recurrent connectivity matrices.
(D) Schematic of E (red) and I (blue) synaptic interactions. Arrows represent the direction of the tuning vector of each neuron. Only neurons with similar tuning are connected.
(E) Schematic of similarity of tuning vectors (tuning similarity) in a 2-dimensional space of stimulus features.
(F) Synaptic strength as a function of tuning similarity.
(G) Coding and dynamics in a simulation trial. Top three rows show the signal (black), the E estimate (red) and the I estimate (blue) in each of the three stimulus dimensions. Below are the spike trains. In the bottom row, we show the average instantaneous firing rate (in Hz).
(H) Top: Example of the target signal (black) and the E estimate in 3 simulation trials (colors) in one signal dimension. Bottom: Distribution (across time) of the time-dependent bias of estimates in E and I cell type.
(I) Left: Distribution of time-averaged firing rates in E (top) and I neurons (bottom). Black traces are fits with log-normal distribution. Right: Distribution of coefficients of variation of interspike intervals for E and I neurons.
(J) Distribution (across neurons) of time-averaged synaptic inputs to E (left) and I neurons (right). In E neurons, the distribution of inhibitory and of net synaptic inputs overlap.
(K) Sum of synaptic inputs over time in a single E (top) and I neuron (bottom) in a simulation trial.
(L) Distribution (across neurons) of Pearson’s correlation coefficients measuring the correlation of synaptic inputs in single E (red) and I (blue) neurons. For model parameters, see Table 1.
The network is composed of two neural populations of excitatory (E) and inhibitory (I) neurons, defined by their postsynaptic action which respects Dale’s law. For each population, y ∈ {E, I}, we define a population readout of each feature,

Table of default model parameters for the efficient E-I network
Parameters above the double horizontal line are the minimal set of parameters needed to simulate model equations (Eqs. 30a-30h in Methods). Parameters below the double horizontal line are biophysical parameters, derived from the same model equations and from model parameters listed above the horizontal line. Parameters NE, M, τ and
where
Unlike previous approaches28,48, we hypothesize that E and I neurons have distinct normative objectives and define cell-type specific loss functions relative to the activity of the E and I neuron types. To implement at the same time, as requested by efficient coding, the constraints of faithful stimulus representation with limited computational resources49, we define the loss functions of E and I population as a weighted sum of a time-dependent encoding error and time-dependent metabolic cost:
We refer to β, the parameter controlling the relative importance of the metabolic cost over the encoding error, as the metabolic constant of the network. We hypothesize that population readouts of E neurons,
where
By assuming that each neuron emits a spike at time t only if this decreases the loss function of its population (Eq. 3), we derived the dynamics and network structure of a spiking network that instantiates efficient coding (Fig. 1C, see Methods). The derived dynamics of the subthreshold membrane potential
where
The synaptic currents in E neurons,
The optimization of the loss function also yields structured recurrent connectivity (Fig. 1D). The synaptic strength between two neurons is proportional to their tuning similarity if the tuning similarity is positive; otherwise the synaptic weight is set to zero (Fig. 1E,F) to ensure that Dale’s law is respected. This also sets the overall connection probability to 0.5. (For a study of how efficient coding would be implemented if the above Dale’s law constraint was removed and each neuron is free to have either an inhibitory or excitatory effect depending on the postsynaptic target, see Supplementary Text 1). Neurons with opposite tuning have low connection probability, consistent with experimental results51,52,53 (Fig. 1D). Note that the structured recurrent connectivity leads to both E and I cells being stimulus-tuned, even though I cells do not receive feedforward inputs (Fig. 1C). The spike-triggered adaptation current of neuron i in population y,
To summarize, the analytical derivation of an optimally efficient network includes gLIF neurons54,41,40,55,56, a distributed code with mixed selectivity to the input stimuli, spike-triggered adaptation current, structured synaptic connectivity, and an operating regime controlled by the metabolic constant β.
The equations for the E-I network of gLIF neurons in Eq. (5) optimize the loss functions at any given time and for any set of parameters. In particular, the network equations have the same analytical form for any positive value of the metabolic constant β. To find a set of parameters that optimizes the overall performance, we defined a performance measure as the average over time and trials of the loss function. We then optimized the parameters by setting the metabolic constant β such that the encoding error weights 70 % and the metabolic error weights 30 % of the total performance, and by choosing all other parameters such as to minimize numerically our network performance measure (see Methods). The numerical optimization was performed by simulating a model of 400 E and 100 I units, a network size relevant for computations within one layer of a cortical microcolumn57. The set of model parameters that optimized network efficiency is detailed in Table 1. Unless otherwise stated, in all simulations we will use the optimal parameters of Table 1 and only vary those parameters detailed in the figure axes.
With optimally efficient parameters, population readouts closely tracked the target signals (Fig. 1G, M=3, R2 = [0.95, 0.97] for E and I neurons, respectively). When stimulated by our 3-dimensional time-varying feedforward input, the optimal E-I network provided a precise and unbiased estimator of the multi-dimensional and time-dependent target signal (Fig. 1H).
Next, we examined the emergent dynamical properties of an optimally efficient E-I network. The distribution of firing rates was well described by a log-normal distribution (Fig. 1I, left). Neurons fired irregularly, with mean coefficient of variation (CV) slightly smaller than 1 (Fig. 1I, right; CV= [0.97, 0.95] for E and I neurons, respectively). We assessed E-I balance in single neurons through two complementary measures. First, we calculated the average (global) balance of E-I currents by taking the time-average of the net sum of currents58. Second, we evaluated the instantaneous59 (also termed detailed45) E-I balance using the Pearson correlation (ρ) of E and I currents received by a single neuron over time (see Methods).
We observed a strong average E-I balance (indicated by a net sum of synaptic inputs close to zero, with only a weak residual of inhibition in both E and I cells (Fig. 1J). Furthermore, we found a moderate instantaneous balance, stronger in I compared to E cell type (Fig. 1K-L, ρ = [0.44, 0.25], for I and E neurons, respectively). The presence of instantaneous balance between E and I synaptic currents within single neurons has been reported in cortical data59,60.
Competition across neurons with similar stimulus tuning emerging in efficient spiking networks
We next explored coding properties emerging from recurrent synaptic interactions between E and I populations in the optimally efficient networks.
An approach that has recently provided empirical insight into local recurrent interactions between neurons is measuring the effective connectivity with cellular resolution, by photostimulating individual neurons and measuring the effect of such perturbation on other neurons in the network. Recent effective connectivity experiments photostimulated single E neurons in primary visual cortex and measured its effect on neighbouring neurons, finding that the photostimulation of an E neuron led to a decrease in firing rate of similarly tuned close-by neurons61. This effective lateral inhibition26 between E neurons with similar tuning to the stimulus implements competition between neurons for the representation of stimulus features (termed feature-specific competition61).
To assess how E-I interactions shape coding in efficient networks, we simulated photostimulation experiments in these networks. We performed such experiments in the absence of the feedforward input to insure all effects are only due to the recurrent processing and not to feedforward processing. We stimulated a randomly selected single “target” E neuron and measured the change in the instantaneous firing rate from the baseline firing rate, Δzi(t), in all the other I and E neurons (Fig. 2A, left). The photo-stimulation was modeled as an application of a constant depolarising current with a strength parameter, ap, proportional to the distance between the resting potential and the firing threshold (ap = 0 means no stimulation, while ap = 1 indicates photostimulation at the firing threshold). We quantified the effect of the simulated photostimulation of a target E neuron on other E and I neurons, distinguishing neurons with either similar or different tuning with respect to the target neuron (Fig. 2A, right; Supplementary Fig. S2).

Mechanism of lateral excitation/inhibition in the efficient spiking network.
(A) Left: Schematic of the E-I network and of the stimulation and measurement in a perturbation experiment. Right: Schematic of the propagation of the neural activity between E and I neurons with similar tuning.
(B) Trial and neuron-averaged deviation of the firing rate from the baseline, for the population of I (top) and E (bottom) neurons with similar (magenta) and different tuning (gray) to the target neuron. The stimulation strength corresponded to an increase in the firing rate of the stimulated neuron by 28.0 Hz.
(C) Scatter plot of the tuning similarity vs. effective connectivity to the target neuron. Red line marks zero effective connectivity and magenta line is the least-squares line. Stimulation strength was ap = 1.
(D) Top: Firing rate of the photostimulated neuron as a function of the photostimulation strength. Middle: Effective connectivity with I neurons with similar and different tuning to the target neuron. Bottom: Effective connectivity with E neurons.
(E) Effective connectivity with I (top) and E neurons (bottom) while varying the length of the stimulation window. The window for measuring the effective connectivity was always 50 ms longer than the stimulation window.
(F) Correlation of membrane potentials vs. the tuning similarity in E (top) and I cell type (bottom), for the efficient E-I network (left), for the network where each E neuron receives independent instead of shared stimulus features (middle), and for the network with unstructured connectivity (right). In the model with unstructured connectivity, elements of each connectivity matrix were randomly shuffled. We quantified voltage correlation using the (zero-lag) Pearson’s correlation coefficient, denoted as
(G) Average cross-correlogram (CCG) of spike timing with strongly similar (orange), weakly similar (green) and different tuning (black).
(H) Distribution of noise correlations across neuronal pairs. The correlation coefficient was measured in bins of 30 ms.
The photostimulation of the target E neuron increased the instantaneous firing rate of similarly-tuned I neurons and reduced that of other similarly-tuned E neurons (Fig. 2B, Supplementary Fig. S2). We quantified the effective connectivity as the difference between the time-averaged firing rate of the recorded cell in presence or absence of the photostimulation of the targeted cell, measured during perturbation and up to 50 ms after. We found positive effective connectivity on I and negative effective connectivity on E neurons with similar tuning to the stimulated neuron, with a positive correlation between tuning similarity and effective connectivity on I neurons and a negative correlation on E neurons (Fig. 2C). As we varied the strength of the photostimulation, the firing rate of the target neuron increased proportionally to the photostimulation strength, as did the effect of the perturbation on I and E neurons with similar tuning to the target neuron (Fig. 2D, Supplementary Fig. S2). As we varied the time window of photostimulation, we found that the effective connectivity converges within a time window of about 300 ms (Fig. 2E). We confirmed these effects of photostimulation in presence of a weak feedforward input (Supplementary Fig. S2), similar to the experiments of Ref61 in which photostimulation was applied during the presentation of visual stimuli with weak contrast.
In summary, lateral excitation of I neurons and lateral inhibition of E neurons with similar tuning is an emerging coding property of the efficient E-I network. Lateral excitation and inhibition leads to competition between neurons with similar tuning to stimulus features, comparable to that found in the visual cortex61,62. An intuitive summary of how this mechanism is implemented is that the E neuron that fires first activates I neurons with similar tuning. In turn, these I neurons inhibit all similarly tuned E neurons (Fig. 2A, right), preventing them to generate redundant spikes and encoding the sensory information that has already been encoded by the first spike. Suppression of redundant spiking allows efficient coding because it reduces the metabolic cost without compromising on encoded information36.
To explore further the consequences of E-I interactions for stimulus encoding, we next investigated the dynamics of lateral inhibition in a network driven by the feed-forward sensory input but without perturbing neurons. In this case, shared feedforward inputs sk(t) create a particular pattern of voltage correlations in E-E neuronal pairs, where voltage correlations linearly depend on the tuning similarity (Fig. 2F, left). The feedforward inputs are shared across neurons and weighted by the tuning parameters of E neurons. For this reason, they cause strong positive voltage correlations between E-E neuronal pairs with very similar tuning and strong negative correlations between pairs with very different (opposite) tuning (Fig. 2F, top-left). Voltage correlations between E-E pairs vanished regardless of tuning similarity when we made the inputs independent across neurons (Fig. 2F, top-middle), showing the relation between tuning similarity and voltage correlation occurs because of shared feedforward inputs. In contrast to E neurons, I neurons do not receive feedforward inputs and are driven only by similarly tuned E neurons (Fig. 2A, right). This causes positive voltage correlations in I-I neuronal pairs with similar tuning and vanishing correlations in neurons with different tuning (Fig. 2F, bottom-left). Such dependence of voltage correlations on tuning similarity disappears when removing the structure from the E-I synaptic connectivity (Fig. 2F, bottom-right).
Although membrane potentials could be strongly correlated or anti-correlated depending on tuning similarity (Fig. 2F, left), the coordination of spike timing of pairs of E neurons (measured with cross-correlograms or CCGs) was very weak (Fig. 2G-H). For I-I neuronal pairs, the peaks of CCGs were stronger than those observed in E-E pairs, but they were present only at very short lags (lags < 1 ms), and the same was true for E-I pairs. Additionally, noise correlations measured as Pearson correlation on spike counts in trials with the same stimulus (rSC) had values distributed around zero (Fig. 2H). These findings lead to two conclusions. First, recurrent interactions of the efficient E-I network wipe away the effect of membrane potential correlations to produce largely uncorrelated spiking output, consistently with the efficient coding hypothesis of reducing redundancy in cases of low noise3,6. Second, such precise cancelling of correlations between voltages and the spiking output reflects the millisecond precision of information processing in efficient E-I networks.
The effect of structured connectivity on coding efficiency and neural dynamics
The analytical solution of the optimally efficient E-I network predicts that recurrent synaptic weights are proportional to the tuning similarity between neurons. We here investigated the role of such efficient connectivity structure by comparing the behavior of an efficiently structured network with a similar but randomly structured E-I network of the type studied in previous works63,64,23. We removed the connectivity structure by randomly permuting synaptic weights across neuronal pairs. We either randomized connections within a single connectivity type (E-I, I-I or I-E) or within all these three connectivity types at once (“all”). Such procedure destroys the relationship between tuning similarity and synaptic strength as in Fig. 1F while it preserves Dale’s law and the overall distribution of connectivity weights. We found that randomizing the connectivity structure significantly altered neural dynamics and coding (Fig. 3A-H). The structure in E-I and in I-E connectivity has a major effect on efficient coding. Randomizing E-I and I-E connectivity led to several-fold increases in the encoding error as well as to significant increases in the metabolic cost (Fig. 3A-B). In particular, with unstructured E-I connectivity the network failed completely to encode the target with I population (Fig. 3C).

Effects of connectivity structure on coding efficiency, neural dynamics and lateral inhibition.
(A) Relative error of networks with unstructured (shuffled) recurrent connectivity. The relative error is the RMSE of the unstructured network, relative to the RMSE of the structured network (dashed line). From left to right, we show the relative error for the unstructured E-I, I-I, I-E and all connectivities. (B Same as in A, showing the metabolic cost (MC) of unstructured networks relative to the metabolic cost of the structured network.
(C) Target signal (black), E estimate (red) and I estimate (blue) in one particular input dimension, for networks with unstructured connectivity.
(D) Standard deviation of the membrane potential (in mV) for networks with unstructured connectivity. Distributions are across neurons. The black vertical line marks the average SD of the structured network.
(E) Average firing rate of E neurons (top) and I neurons (bottom), for different cases of unstructured networks. Dashed lines show the same measures for the structured case.
(F) Same as in E, showing the average net synaptic input.
(G) Same as in E, showing the time-dependent correlation of synaptic inputs.
(H) Voltage correlation in E-E (top) and I-I neuronal pairs (bottom) for the four cases of unstructured connectivity (colored dots) and the equivalent result in the structured network (grey dots). We show the results for pairs with similar tuning.
(I) Scatter plot of effective connectivity in I (top) and E neurons (bottom) versus tuning similarity to the stimulated (“target”) E neuron, for networks with unstructured connectivity. The magenta line is the least-squares regression line. The strength of the photostimulation is at threshold (ap = 1.0). Other parameters for all plots are in Table 1.
Unstructured E-I and I-E connectivity also yielded an increase of the variance in the membrane potentials (Fig. 3D) and firing rate in E neurons (Fig. 3E), while pulling the average net synaptic inputs towards inhibition (Fig. 3F) and removing the instantaneous balance (Fig. 3G). Together, these findings suggest a shift from mean-driven to fluctuation-driven spiking activity as the connectivity structure is removed. The structure of E-I connectivity was also found to be crucial for the linear relation between voltage correlations and tuning similarity in pairs of I neurons (Fig. 3H, magenta). Interestingly, we found no effect of connectivity structure on the variability of spiking of single neurons, with both structured and unstructured networks showing strong variability (Supplementary Fig. S3), suggesting that the variability of spiking is independent of the connectivity structure.
Randomizing I-I connectivity was less detrimental to the coding efficiency as it led to a slightly higher encoding error, but to a lower metabolic cost, and still allowed for a relatively good tracking of target signals in both cell types (Fig. 3C, “permuted I to I”). Contrary to randomization of the E-I and I-E connectivity, shuffling I-I connectivity decreased the variance of the membrane potential, decreased the firing rate in E neurons and increased instantaneous balance in E neurons. Thus it had opposite effects compared to shuffling of E-I and I-E connectivity. To understand if there was a minimal connectivity structure necessary for efficient coding, we also removed the connectivity structure only partially, keeping like-to-like connectivity structure and removing all structure beyond like-to-like. This manipulation only had very modest effects on network’s coding and almost no effect on neural dynamics (Supplementary Fig. S3), thus showing that like-to-like structure of connectivity is largely sufficient to achieve efficient coding.
Finally, we analyzed how the structure in recurrent connectivity influences lateral inhibition that we observed in efficient (structured) networks (see Fig. 2A-E). We found that the dependence of lateral inhibition on tuning similarity vanish when the connectivity structure is fully removed (Fig. 3I, right), thus showing that connectivity structure is necessary for lateral inhibition. While networks with unstructured E-I and I-E connectivity still show inhibition in E neurons upon single neuron optostimulation (because of the net inhibitory effect of recurrent connectivity; Supplementary Fig. S4), this inhibition was largely unspecific to tuning similarity. Unstructured connectivity decreased the correlation between tuning similarity and effective connectivity from r = [0.31, − 0.54] in E and I neurons in a structured network to r = [0.02, − 0.13] and r = [0.57, 0.11] in networks with unstructured E-I and I-E connectivity, respectively (Fig. 3I, first and third from the left). Removing the structure in I-I connectivity, in contrast, increased the correlation between effective connectivity and tuning similarity in E neurons (r = [0.30, − 0.65], Fig. 3I, second from the left), showing that lateral inhibition takes place irrespective of the I-I connectivity structure. Furthermore, a partial removal of connectivity structure where we only removed the connectivity structure beyond like-to-like had smaller effects on lateral inhibition (Supplementary Fig. S4), thus confirming that like-to-like connectivity pattern is sufficient for lateral excitation/inhibition in I and E neurons.
While optimally structured connectivity predicted by efficient coding is biologically plausible, it may be difficult to realise it exactly on a synapse-by-synapse basis in biological networks. We verified the robustness of the model to small deviations from the optimal synaptic weights by adding a random jitter, proportional to the synaptic strength, to all synaptic connections (see Methods). The encoding performance and neural dynamics were barely affected by such perturbation, demonstrating that the network is robust against random perturbations of the optimal synaptic weights (Supplementary Fig. S3).
In summary, we found that some aspects of recurrent connectivity structure, such as the like-to-like organization of E-I and I-E connectivity, are crucial to achieve efficient coding. Instead, for other aspects there is considerable flexibility; the organization of I-I connectivity is less crucial, as is the connectivity structure beyond like-to-like, and adding small perturbations to optimal weights has only minor effects. Structured E-I and I-E, but not I-I connectivity, is necessary for a robust dependence of lateral inhibition on tuning similarity.
Weak spike-triggered adaptation optimizes network efficiency
We next investigated the role of spike-triggered adaptation current,
Depending on the sign of the difference of time constants, this spike-triggered current is negative, giving spike-triggered adaptation39, if the single-neuron readout has longer time constant than the population readout

Relation of time constants of single-neuron and population readout set an adaptation or a facilitation current.
The population readout that evolves on a faster (slower) time scale than the single neuron readout determines a spike-triggered adaptation (facilitation) in its own cell type.

Adaptation, network coding efficiency and excitation-inhibition balance.
(A) The encoding error (left), metabolic cost (middle) and average loss (right) as a function of single neuron time constants
(B) Top: Log-log plot of the RMSE of the E (red) and the I (blue) estimates as a function of the time constant of the single neuron readout of E neurons,
(C) Firing rate in E (left) and I neurons (right), as a function of
(D) Same as in (C), showing the coefficient of variation.
(E) Average net synaptic input in E neurons (left) and in I neurons (right) as a function of
(F) Correlation coefficient of synaptic inputs to E (left) and I neurons (right) as a function of
To gain further insights on how adaptation influences network performance, we set the adaptation in one cell type to 0 and vary the strength of adaptation in the other cell type by varying the time constant of the single neuron readout. In the absence of adaptation in I neurons
Firing rates and variability of spiking were sensitive to the strength of adaptation. As expected, adaptation in E neurons caused a decrease in the firing levels in both cell types (Fig. 4C). In contrast, adaptation in I neurons decreased the firing rate in I neurons, but increased the firing rate in E neurons, due to a decrease in the level of inhibition. Furthermore, adaptation decreased the variability of spiking, in particular in the cell type with strong adaptation (Fig. 4D), a well-known effect of spike-triggered adaptation in single neurons67.
Instantaneous balance of synaptic currents predicts network efficiency better than the average E-I balance
Next, we tested the capability of instantaneous and average E-I balance to predict the efficiency of the network. Measuring average balance and instantaneous balance of synaptic inputs from electrophysiology recordings is possible59,60,58, while measuring efficiency from empirical data is challenging. The estimation of network efficiency requires the comparison between typically unknown network’s target representations and the population readouts. The estimation of the population readout, in turn, requires an estimation of decoding weights and the knowledge of spiking dynamics from a complete neural network.
We focused the analysis on regimes with adaptation, because these regimes gave better performance. In regimes with adaptation, time constants of single neuron readout influenced the average imbalance (Fig. 4E) as well as the instantaneous balance (Fig. 4F) in E and I cell type. The average balance was precise (with the net synaptic current close to 0) with strong adaptation in E neurons, and it got weaker when increasing the adaptation in I neurons (Fig. 4E). However, regimes with precise average balance in both cell types coincided with suboptimal efficiency (compare Fig. 4A, right and E).
To test how well the average imbalance and the instantaneous balance of synaptic inputs predict network efficiency, we concatenated the column-vectors of the measured average loss and of the average imbalance in each cell type and computed the Pearson correlation between these quantities. The correlation between the average imbalance and the average loss was weak in the E cell type (R = 0.16) and close to zero in the I cell type (R = 0.02), suggesting almost no relation between efficiency and average imbalance in the E cell type. In contrast, the average loss was negatively correlated with the instantaneous balance in both E (R = − 0.35) and in I cell type (R = − 0.45), showing that instantaneous balance of synaptic inputs is positively correlated with network efficiency.
When measured for varying levels of spike-triggered adaptation, unlike the average balance of synaptic inputs, the instantaneous balance is therefore a reliable predictor of network efficiency.
State-dependent coding and dynamics are controlled by the metabolic cost on spiking
In our derivation of efficiency objectives, we obtained non-specific external current (in the following, non-specific current), described by the term
We found the metabolic constant β to significantly influence the spiking dynamics (Fig. 5A). The optimal efficiency was achieved for non-zero levels of the metabolic constant (Fig. 5B). The metabolic constant modulated the firing rate as expected, with the firing rate decreasing with the increasing of the metabolic constant (Fig. 5C, top). It also modulated the variability of spiking, as increasing the metabolic constant decreased the variability of spiking in single neurons (Fig. 5C, bottom). Furthermore, it modulated the average imbalance and the instantaneous balance in opposite ways: larger values of β led to regimes that had stronger average balance, but weaker instantaneous balance (Fig. 5D). We note that, even with suboptimal values of the metabolic constant, the neural dynamics remained within biologically relevant ranges.

State-dependent coding and dynamics are controlled by non-specific currents.
(A) Spike trains of the efficient E-I network in one simulation trial, with different values of the metabolic constant β. The network received identical stimulus across trials.
(B) Top: RMSE of E (red) and I (blue) estimates as a function of the metabolic constant. Bottom: Normalized average metabolic cost and average loss as a function of the metabolic constant. Black arrow indicates the minimum loss and therefore the optimal metabolic constant.
(C) Average firing rate (top) and the coefficient of variation of the spiking activity (bottom), as a function of the metabolic constant. Black arrow marks the metabolic constant leading to optimal network efficiency in B.
(D) Average imbalance (top) and instantaneous balance (bottom) balance as a function of the metabolic constant.
(E) Same as in A, but for different values of the noise intensity σ.
(F) Same as in B, as a function of the noise intensity. The noise is a Gaussian random process, independent over time and across neurons.
(G) Same as C, as a function of the noise intensity.
(H) Top: Same as in D, as a function of the noise intensity. For plots in B-D and F-H, we computed and averaged results over 100 simulation trials with 1 second of simulation time. For other parameters, see Table 1.
The fluctuation part of the non-specific current, modulated by the noise intensity σ, that we added in the definition of spiking rule for biological plausibility (see Methods), strongly affected the neural dynamics as well (Fig. 5E). The optimal performance was achieved with non-vanishing noise levels (Fig. 5F) and the beneficial effect of the noise in the non-specific current arose from its impact on the instantaneous E-I balance. While the average firing rate of both cell types, as well as the variability of spiking in E neurons, increased with noise variance (Fig. 5G), the average and instantaneous balance of synaptic currents exhibited a non-linear behavior as a function of noise variance (Fig. 5H). Due to decorrelation of membrane potentials by the noise, instantaneous balance decreased with increasing noise variance (Fig. 5H, bottom). Some level of noise in the non-specific inputs is therefore necessary to establish the optimal level of instantaneous E-I balance. Interestingly, single neurons manifest significant levels of spiking variability already in the absence of noise in the non-specific inputs (Fig. 5H, bottom), indicating that the recurrent network dynamics generates substantial variability even in absence of variability in the external current. Variability in absence of noise demonstrates the intrinsic chaotic behavior of the network72.
In summary, non-specific external currents derived in our optimal solution have a major effect on coding efficiency and on neural dynamics. The noise in the external current is particularly important to obtain optimal levels of the instantaneous E-I balance in I neurons.
Optimal ratio of E-I neuron numbers and of the mean I-I to E-I synaptic efficacy coincide with biophysical measurements
Next, we investigated how coding efficiency and neural dynamics depend on the ratio of the number of E and I neurons (NE : NI or E-I ratio) and on the relative synaptic strengths between E-I and I-I connections.
Efficiency objectives (Eq. 3) are based on population, rather than single-neuron activity. Our efficient E-I network thus realizes a computation of the target representation that is distributed across multiple neurons (Fig. 6A). We predict that, if number of neurons within the population decreases, neurons have to fire more spikes to achieve an optimal population readout because the task of tracking the target signal is distributed among fewer neurons. To test this prediction, we varied the number of I neurons while keeping the number of E neurons constant. As predicted, a decrease of the number of I neurons (and thus an increase in the ratio of the number of E to I neurons) caused a linear increase in the firing rate of I neurons, while the firing rate of E neurons stayed constant (Fig. 6B, top). However, the variability of spiking and the average synaptic inputs remained relatively constant in both cell types as we varied these ratios (Fig. 6B, bottom, C), indicating a compensation for the change in the ratio of E-I neuron numbers through adjustment in the firing rates. These results are consistent with the observation in neuronal cultures of a linear change in the rate of postsynaptic events but unchanged postsynaptic current in either E and I neurons for variations in the E-I neuron number ratio73.

Optimal ratios of E-I neuron numbers and of mean I-I to E-I efficacy.
(A) Schematic of the effect of changing the number of I neurons on firing rates of I neurons. As encoding of the stimulus is distributed among more I neurons, the number of spikes per I neuron decreases.
(B) Average firing rate as a function of the ratio of the number of E to I neurons. Black arrow marks the optimal ratio.
(C) Average net synaptic currents in E neurons (top) and in I neurons (bottom).
(D) Top: Encoding error (RMSE) of the E (red) and I (blue) estimates, as a function of the ratio of E-I neuron numbers. Bottom: Same as on top, showing the cost and the average loss. Black arrow shows the minimum of the loss, indicating the optimal parameter.
(E) Top: Optimal ratio of the number of E to I neurons as a function of the weighting of the average loss of E and I cell type (using the weighting of the error and cost of 0.7 and 0.3, respectively). Bottom: Same as on top, measured as a function of the weighting of the error and the cost when computing the loss. (The weighting of the losses of E and I neurons is 0.5.) Black triangles mark weightings that we typically used.
(F) Schematic of the readout of the spiking activity of an E neuron (red) and an I neuron (blue) with equal amplitude of decoding weight (left) and with stronger decoding weight in the I neuron (right). Stronger decoding weight in the I neuron results in a stronger effect of spikes of the I neuron on the readout, leading to less spikes by the I neuron.
(G) Same as in (D), as a function of the ratio of mean I-I to E-I efficacy.
(H) Same as in B, as a function of the ratio of mean I-I to E-I efficacy.
(I) Average imbalance (top) and instantaneous balance (bottom) balance, as a function of the ratio of mean I-I to E-I efficacy. For other parameters, see Table 1.
The ratio of the number of E to I neurons had a significant influence on coding efficiency. We found a unique minimum of the encoding error of each cell type, while the metabolic cost increased linearly with the ratio of the number of E and I neurons (Fig. 6D). We found the optimal ratio of E to I neuron numbers to be in range observed experimentally in cortical circuits (Fig. 6D, bottom, black arrow, NE : NI = 3.75 : 1;74). Due to the linear increase of the cost with the ratio of the number of E and I neurons (Fig. 6D, bottom, green), strong weighting of the error predicted higher ratios (Fig. 6E, bottom). Also the encoding error (RMSE) alone, without considering the metabolic cost, predicted optimal ratio of the number of E to I neurons within a plausible physiological range, NE : NI = [3.75 : 1, 5.25 : 1], with stronger weightings of the encoding error by I neurons predicting higher ratios (Fig. 6E, top).
Next, we investigated the impact of the strength of excitatory and inhibitory synaptic efficacy (EPSPs and IPSPs). In our model, the mean synaptic efficacy is fully determined by the distribution of tuning parameters (see Methods). As evident from the expression for the population readouts (Eq. 2), the amplitude of tuning parameters (which are also decoding weights) determines the amplitude of jumps of the population readout caused by spikes (Fig. 6F). The stronger the amplitude of these weights, the larger is the average impact of spikes on the population signals.
We parametrized the distribution of decoding weights as a normal distributions centered at zero, but allowed the standard deviation (SD) of distributions relative to E and I neurons (
We next searched for the optimal ratio of the mean I-I to E-I efficacy as the parameter setting that maximizes network efficiency. Network efficiency was maximized when such ratio was about 3 to 1 (Fig. 6G). Our results predict the maximum E-I and I-E synaptic efficacy, averaged across neuronal pairs, of 0.75 mV, and the maximal I-I efficacy of 2.25 mV, values that are consistent with empirical measurements in the primary sensory cortex75,52,53.
Similarly to the ratio of E-I neuron numbers, a change in the ratio of mean E-I to I-E synaptic efficacy was compensated for by a change in firing rates, with stronger I-I synapses leading to a decrease in the firing rate of I neurons (Fig. 6H). Conversely, weakening the E-I and I-E synapses resulted in an increase in the firing rate in E neurons (Supplementary Fig. S5). This is easily understood by considering that weakening the E-I and I-E synapses activates less strongly the lateral inhibition in E neurons (Fig. 2) and thus leads to an increase in the firing rate of E neurons. We also found that single neuron variability remained almost unchanged when varying the ratio of mean I-I to E-I efficacy (Fig. 6H, bottom) and the optimal ratio corresponded with previously found optimal levels of average and instantaneous balance of synaptic inputs (Fig. 6I). The instantaneous E-I balance monotonically decreased with increasing ratio of I-I to E-I efficacy (Fig. 6I, bottom, Supplementary Fig. S5).
In summary, our analysis suggests that optimal coding efficiency is achieved with four times more E neurons than I neurons and with mean I-I synaptic efficacy about 3 times stronger than the E-I and I-E efficacy. The optimal network has less I than E neurons, but the impact of spikes of I neurons on the population readout is stronger, also suggesting that spikes of I neurons convey more information.
Dependence of efficient coding and neural dynamics on the timescales and dimensionality of the stimulus
We finally investigated how the network’s behavior depends on the timescales and dimensionality of the input stimulus features. We manipulated the stimulus timescales by changing the time constant of the Ornstein-Uhlenbeck (O-U) process. The network efficiently encoded stimulus features when their time constants varied between 1 and 200 ms, with stable encoding error, metabolic cost (Fig. 7A) and neural dynamics (Supplementary Fig. S6).

Dependence of efficient coding and neural dynamics on stimulus parameters and advantages of E-I versus one cell type model architecture.
(A) Top: Root mean squared error (RMSE) of E estimates (red) and I estimates (blue), as a function of the time constant of stimulus features. Bottom: Same as on top, showing the metabolic cost (MC) of E and I cell type. The time constant τs is the same for all stimulus features.
(B) Top: Same as in A top, measured as a function of the number of stimulus features M. Bottom: Normalized cost and the average loss as a function of the number of input features. Black arrow marks the minimum loss and the optimal parameter M.
(C) Root mean squared error (top) and metabolic cost (bottom) in E and I populations in the E-I model and in the 1CT model. The distribution is across simulation trials.
(D) Average loss in the E-I and 1CT models with weighting gL = 0.7 for the error (and 0.3 for the cost).
(E) Firing rate in the 1CT model as a function of the metabolic constant. For other parameters of the E-I model see Table 1, and for the 1CT model see Supplementary Table S1.
Finally, we tested how the network’s behavior changed when we varied the number of stimulus features M processed by the network. The encoding error of E (RMSEE) and I neurons (RMSEI) had a minimum at 3 and 4 stimulus features, respectively (Fig.7B, top), while the metabolic cost increased monotonically with the number of features (Fig.7B, bottom). The number of features that optimized network efficiency (the average loss) ranged between M = [1, 4]. With strong weighting of the error (gL ≥ 0.89), the optimal number of features was M = 4, and with strong weighting of the cost, (gL < 0.27), the optimal number of features was M = 1. It is intriguing that the optimal encoding performance, when assuming the weighting for the error is stronger than for the cost, is achieved not for a single stimulus feature, but for 3 or 4 independent features. Increasing the number of features beyond the optimal number resulted in a monotonic increase in firing rates for both cell types and in a contrasting effect on average and instantaneous balance, as it increased the average E-I balance and weakened the instantaneous balance (Supplementary Fig. S6).
In sum, we found the optimal network efficiency in presence of several (3 or 4) stimulus features, and a surprising ability of the network to accurately encode stimuli on a wide range of timescales.
Advantages of E-I versus one cell type model architecture for coding efficiency and robustness to parameter variations
Neurons in the brain are either excitatory or inhibitory. To understand how differentiating E and I neurons benefits efficient coding, we compared the properties of our efficient E-I network with an efficient network with a single cell type (1CT). The 1CT model is a simplification of the E-I model (see Supplementary Text 1) and has been derived and analyzed in previous studies29,28,36,33,44,42. We compared the average encoding error (RMSE), the average metabolic cost (MC), and the average loss (see Supplementary Text 2) of the E-I model against the one cell type (1CT) model. Compared to the 1CT model, the E-I model exhibited a higher encoding error and metabolic cost in the E population, but a lower encoding error and metabolic cost in the I population (Fig. 7C). The average loss of the E-I model was significantly smaller than that of the 1CT model when using the typical weighting of the error and the cost of gL = 0.7 (Fig. 7D), as well as for the vast majority of other weightings (gL ≤ 0.95; Supplementary Fig. S1).
We further compared the 1CT and E-I models in terms of the robustness of firing rates to changes in the metabolic constant. Consistently with previous studies36,35, firing rates in the 1CT model were highly sensitive to variations in the metabolic constant (Fig. 7E, note the logarithmic scale on the y-axis), with a superexponential growth of the firing rate with the inverse of the metabolic constant in regimes with metabolic cost lower than optimal. This is in contrast to the E-I model, whose firing rates exhibited lower sensitivity to the metabolic constant, and never exceeded physiological limits (Fig. 5C). Because our E-I model does not incorporate a saturating input-output function as in34 that would constrain the range of firing rates, the ability of the E-I model to maintain firing rates within biologically plausible limits emerges as a highly desirable dynamic property.
In summary, we found that the optimal E-I model is more efficient than the 1CT model. Beyond the performance of optimal models, the E-I model is advantageous with respect to the 1CT model also because it does not enter into states of physiologically unrealistic firing rates.
Discussion
We analyzed comprehensively the structural, dynamical and coding properties that emerge in networks of spiking neurons that implement optimally the principle of efficient coding. We demonstrated that efficient recurrent E-I networks form highly accurate and unbiased representations of stimulus features with biologically plausible parameters, biologically plausible neural dynamics, instantaneous E-I balance and like-to-like lateral inhibition. The network can implement efficient coding with stimulus features varying over a wide range of timescales and when encoding even multiple such features. Here we discussed the implications of these findings.
By a systematic study of the model, we determined the model parameters that optimize network efficiency. Strikingly, the optimal parameters (including the ratio between the number of E and I neurons, the ratio of I-I to E-I synaptic efficacy and parameters of non-specific currents) were consistent with parameters measured empirically in cortical circuits, and generated plausible spiking dynamics. This result lends credibility to the hypothesis that cortical networks might be designed for efficient coding and may operate close to optimal efficiency, as well as provides a solid intuition about what specific parameter ranges (e.g. higher numbers of E and than I neurons) may be good for. Efficient networks still exhibited realistic dynamics and reasonably efficient coding in the presence of moderate deviations from the optimal parameters, suggesting that the optimal operational point of such networks is relatively robust. We also found that optimally efficient analytical solution derives generalized LIF (gLIF) equations for neuron models37. While gLIF67,40 and LIF63,64 models are reasonably biologically plausible and are widely used to model and study spiking neural network dynamics, it was unclear how their parameters affect network-level information coding. Our study provides a principled way to determine uniquely the parameter values of gLIF networks that are optimal for efficient information encoding. Studying the dynamics of gLIF networks with such optimal parameters thus provides a direct link between optimal coding and neural dynamics. Moreover, our formalism provides a framework for the optimization of neural parameters that can in principles be used not only for neural network models that study brain function but also for the design of artificial neuromorphic circuits that perform information coding computations76,77.
Unlike in previous randomly-connected recurrent networks of LIF and gLIF spiking neurons,63,64 in our efficient-coding solution, a highly structured E-I, I-I and I-E synaptic connectivity emerges as an optimal structural solution to support efficient coding. Our model generates a number of insights about the role of structured connectivity in efficient information processing. A first insight is that I neurons develop stimulus feature selectivity because of the structured recurrent connectivity. This is in line with recent reports of stimulus feature selectivity of inhibitory neurons, including in primary visual cortex78,79,80. A second insight is that a network with structured connectivity shows stronger average and instantaneous E-I balance, as well as significantly lower variance in membrane potentials compared to an equivalent network with the same connections organized randomly. This implies that the connectivity structure determines the operating regime of the network. In particular, a network structured as in our efficient coding solution operates in a dynamical regime that is more stimulus-driven, compared to an unstructured network that is more fluctuation driven. A third insight is that the structured network exhibits a several-fold lower encoding error compared to unstructured networks and achieves this precision with lower firing rates. Network with structured recurrent connectivity creates more precise representations with less spikes and is therefore significantly more efficient compared to unstructured networks. Our analysis of the effective connectivity created by the efficient connectivity structure shows that this structure sharpens stimulus representations, reduces redundancy and increases metabolic efficiency by implementing feature-specific competition, that is a negative effective connectivity between E neurons with similar stimulus tuning, as proposed by recent theories30 and experiments 61,62 of computations in visual cortex.
Our perturbation experiments on single E neurons predict a negative like-to-like effective connectivity between E neurons with similar tuning, as found experimentally in the mouse primary visual cortex with 2-photon optogenetic perturbations of E neurons61,62. This suggests that the effective connectivity found in mouse visual cortex could reflect efficient coding in visual cortex. Comparing effective connectivity in models and experiments is also useful for ruling in and out different theories of how efficient coding may be implemented in primary visual cortex. Earlier theories4,11 found evidence for efficient coding in visual cortex and proposed that such efficient computations relied only on feedforward connectivity; thus they predicted null effective connectivity between visual neurons and were ruled out by the empirical effective connectivity measures61,62. Our model, instead, implements efficient coding with recurrent interactions, suggesting a mechanism that is compatible with these empirical measures. Importantly, we made predictions for further optogenetics experiments that could better constraints models of visual cortical efficient coding. Previous studies61 optogenetically stimulated E neurons but did not determine whether the recorded neurons where excitatory or inhibitory. Our model predicts that stimulation of E neurons would increase firing in similarly tuned I neurons and decrease firing in similarly tuned E neurons. Our analysis confirms earlier model predictions81 that like-to-like connectivity between E and I neurons is necessary for lateral inhibition and competition between E neurons. Beyond like-to-like connectivity, our model predicts an optimally efficient connectivity where synaptic strength positively correlates with pair-wise tuning similarity, a connectivity pattern that was recently observed experimentally 82.
Our study determines how structured E-I connectivity affects the dynamics of E-I balancing and how this relates to information coding. Previous work32 proposed that the E-I balance in efficient spiking networks operates on a finer time scale than in classical balanced E-I networks with random connectivity64. However, a theory to determine the exact levels of instantaneous E-I balance that is optimal for coding was lacking. Consistent with the general idea put forth in32,31,48, we here showed that moderate levels of E-I balance are optimal for coding, and that too strong levels of instantaneous E-I balance are detrimental to coding efficiency. Our results predict that like-to-like structured E-I-E connectivity is necessary for optimal levels of temporal E-I balance. Finally, the E-I-E structured connectivity that we derived supports optimal levels of instantaneous E-I balance and causes desynchronization of the spiking output. Such intrinsically generated desynchronization is a desirable network property that in previously proposed models could only be achieved by the less plausible addition of strong noise to each neuron31,35.
We found that our efficient network, optimizing the representation of a leaky integration of stimulus features, does not require recurrent E-E connections. Supporting this prediction, recurrent E-E connections were reported to be sparse in primary visual cortex83), and the majority of E-E synapses in the visual cortex were suggested to be long-range84. However, future studies could address the role of recurrent excitatory synapses, that were shown to emerge in efficient coding networks implementing computations beyond leaky integration such as linear mixing of features37. Efficient networks with E-E connectivity show neural dynamics that goes well beyond the canonical case analyzed here and can potentially describe persistent network dynamics44. Such networks would also allow to address whether biologically plausible efficient networks exhibit criticality, as suggested by85. Finally, we note that efficient encoding might be the primary normative objective in sensory areas, while areas supporting high-level cognitive tasks such as decision-making might include other computational objectives such as efficient transmission of information downstream to generate reliable behavioral outputs86,87,88,25.
Acknowledgements
V.K. and T.S. thank Tatiana Engel for her contribution to the discussion of results and for her comments on an earlier version of the manuscript. This project was supported by funding from Technische Universität Berlin (“Equal Opportunity Program” to VK), by Internal Research Funding of Technische Universität Berlin (to TS), by NIH Brain Initiative (grants U19 NS107464, R01 NS109961, R01 NS108410 to SP), and the Simons Foundation for Autism Research Initiative (SFARI; grant 982347 to SP).
Code availability
The complete computer code for reproducing the results is available as a Github repository [will be shared upon acceptance].
Methods
Overview of the current approach and of differences with previous approaches
In the following, we present a detailed derivation of the E-I spiking network implementing the efficient coding principle. The analytical derivation is based on previous works on efficient coding with spikes28,36, and in particular on our recent work37. While these previous works analytically derived feedforward and recurrent transmembrane currents in leaky integrate-and fire neuron models, these models did not contain any synaptic current unrelated to feedforward and recurrent processing. Non-specific synaptic current was suggested to be important for an accurate description of coding and dynamics in cortical networks71. In the model derivation that follows, we also derived non-specific external current from efficiency objectives.
As we mapped the efficient coding objective on biologically plausible neural implementations, we found that such implementations (with plausible biophysical parameters) requires a transmembrane current that is independent of feedforward and recurrent processing. We interpreted this current as non-specific external current (shortly, non-specific current), collating the ensemble of synaptic projections from other brain areas that are not directly involved in processing of feedforward stimulus features70, as well as synaptic inputs from the local network from neurons that are not tuned to feedforward stimulus features69. The mechanistic effect of the non-specific current is to regulate the distance to firing threshold, a role that is close to the notion of “background” synaptic activity in cortical neurons71.
Moreover, previous models on efficient coding did not thoroughly consider physical units of variables that were interpreted as biophysical quantities (such as membrane potentials, firing thresholds, etc.). As these biophysical variables were derived from computational variables (such as target signals and population readouts), it remained unclear how biophysical variables might acquire their physical units. Here, we assigned physical units to the computational variables and thus naturally endowed the model with physical units. The network developed here allows for a better compatibility of efficient spiking models with neurobiology compared to previous works on efficient coding with spikes. With this model, we aim to describe neural dynamics and computation in early sensory cortices such as the primary visual cortex in rodents, even though many principles of the model developed here could be relevant throughout the brain.
Introducing variables of the model
We consider two types of neurons, excitatory neurons E and inhibitory neurons I. We denote as NE and NI the number of E-cells and I-cells, respectively. The spike train of neuron i of type y ∈ {E, I}, i = 1, 2, …, Ny, is defined as a sum of Dirac delta functions,
where
We define the readout of the spiking activity of neuron i of type y (in the following, “single neuron readout”) as a leaky integration of its spike train,
with λr denoting the inverse time constant. This way, the quantity
We denote as sk(t), k = 1, 2, …, M the set of M dynamical features of the external stimulus (in the following, stimulus features) which are transmitted to the network through a feedforward sensory pathway. The stimulus features have the unit of the square root of millivolt,
Furthermore, we define a linear population readout of the spiking activity of E and I neurons
with
Loss functions
We assume that the activity of a population y ∈ {E, I} is set so as to minimize a time-dependent encoding error and a time-dependent metabolic cost:
with βy > 0 in units of mV the Lagrange multiplier which controls the weight of the metabolic cost relative to the encoding error. The time-dependent encoding error is defined as the squared distance between the targets and their estimates, and the role of estimates is assigned to the population readouts
We use a quadratic metabolic cost because it promotes the distribution of spiking across neurons28. In particular, the loss function of I neurons, LI (t) implies the relevance of the approximation:
When shall a neuron spike?
We minimize the loss function by positing that neuron i of type y ∈ {E, I} emits a spike as soon as its spike decreases the loss function of its population y in the immediate future37. We also define t− and t+ as the left- and right-sided limits of a spike time
with
where
By applying the condition for spiking in Eq. (12) using y = E and y = I, respectively, we get
According to the definitions in Eqs. (7) and (9), if neuron i fires a spike at time
By inserting Eq. (15a)-(15b) in Eq. (12), we find that neuron i of type y should fire a spike if the following condition holds:
These equations tell us when the neuron i of type E and I, respectively, emits a spike, and are similar to the ones derived in previous works37,28. In addition to what has been found in these previous works, we here also find that each term on the left- and right-hand side in the Eq 16a has the physical units of millivolts.
We note that the expression derived from the minimization of the loss function of E neurons in the top row of Eq. (16a) is independent of the activity of I neurons, and would thus lead to the E population being unconnected with the I population. In order to derive a recurrently connected E-I network, the activity of E neurons must depend on the activity of I neurons. We impose this property by using the approximation of estimates that holds under the assumption of efficient coding in I neurons (see ϵI in the Eq. 11),
We now define new variables
The variables
Dynamic equations for the membrane potentials
In this section we develop the exact dynamic equations of the membrane potentials
with
with x(t) := [x1(t), …, xM (t)]⊤ the vector of M target signals,
In the case of E neurons, the time-derivative of the membrane potential
By inserting the dynamic equations of the target signal
where in the last line we used the definition of
In the case of I neurons, the time derivative of the membrane potential
By inserting the dynamic equations of the population readouts of E neurons
where in the last line we used the definition of
Leaky integrate-and-fire neurons
The terms on the right-hand-side in Eqs. (21) and (23) can be interpreted as transmembrane currents. The last term in these equations,
In the Eq. 24 we wrote explicitly the terms
Imposing Dale’s principle on synaptic connectivity
We now examine the synaptic terms in Eq. (24). As a first remark, we see that synaptic weights depend on tuning parameters
Another consequence of synaptic connectivity in the Eq. (24) is that the synaptic weight between a presynaptic neuron j of type x and a postsynaptic neuron i of type y is symmetric and depends on the similarity of tuning vectors of the presynaptic and the postsynaptic neuron:
with x, y ∈{E, I} and [a]+ ≡ max(0, a) a rectified linear function. This manipulation is also plausible from a biological point of view, because in the cortex, the connection probability of neurons with very different (e.g. opposite) tuning is typically close to 051. Since the elements of the matrix Jyx are all non-negative, it is the sign in front of the synaptic term in the Eq. (24) that determines the sign of the synaptic current between neurons i and j. The synaptic current is excitatory if the sign is positive, and inhibitory if the sign is negative.
It is also interesting to note that rectification affects the rank of connectivity matrices. Without rectification, the product in Eq. (25) yields a connectivity matrix with rank smaller or equal to the number of input features to the network, M, similarly as in previous works29,43,44. Since typically the number of input features is much smaller than the number of neurons, i.e., M << Ny, this would give a low-rank connectivity matrix. However, rectification in Eq. (25), necessary to ensure Dale’s principle in presence of positive and negative tuning parameters, typically results in a substantial increase of the rank of the connectivity matrix.
Using the synaptic connectivity defined in Eq. (25), we rewrite the network dynamics from Eq. (24) as:
These equations express the neural dynamics which minimizes the loss functions (Eq. (10)) in terms of a generalized leaky integrate-and-fire model with E and I cell types, and are consistent with Dale’s principle.
In principle, it is possible to use the same strategy as for the E-I network to enforce Dale’s principle in model with one cell type (introduced by28). To do so, we constrained the recurrent connectivity of the model with a single cell type from36 by keeping only connections between neurons with similar tuning vectors and setting other connections to 0 (see Supplementary text). This led to a network of only inhibitory neurons, a type of network model which is less relevant for the description of biological networks.
Model with resting potential and an external current
In the model given by the Eq. (26) the resting potential is equal to zero. In order to account for biophysical values of the resting potential and to introduce an implementation of the metabolic constant that is consistent with neurobiology, we add a constant value to the dynamical equations of the membrane potentials
Furthermore, in the same equations, the role of the metabolic constant βy as a biophysical quantity is questionable. The metabolic constant βy is an important parameter that weights the metabolic cost over the encoding error in the objective functions (Eq. 10). On the level of computational objectives, the metabolic constant naturally controls firing rates, as it allows the network to fire more or less spikes to correct for a certain encoding error. A flexible control of the firing rates is a desirable property, as gives the possibility to potentially capture different operating regimes of efficient spiking networks36. In the spiking model we developed thus far (Eq. 26), similarly to previous efficient spiking models36,33, the metabolic constant βy controls the firing threshold. In neurobiology, however, strong changes to the firing threshold that would reflect metabolic constraints of the network are not plausible. We thus searched for an implementation of the metabolic constant βy that is consistent with neurobiology.
The condition for threshold crossing of the neuron i can be written by Eq. (26) as
with c an arbitrary constant in units of millivolts. In Eq. (27) we added a constant c/2 and a resting potential
We now define new variables for y ∈ {E, I}:
and rewrite the model in Eq. 26 in these new variables
where
Efficient generalized leaky integrate-and-fire neuron model
Finally, we rewrite the model from Eq. (29) in a compact form in terms of transmembrane currents, and discuss their biological interpretation. The efficient coding with spikes is realized by the following model for the neuron i of type y ∈ {E, I}:
with Rm the current resistance. The leak current,
with τ = RmCm and Cm the capacitance of the neural membrane54, arose by assuming the same time constant for the target signals xk and estimates
where we note the presence of a feedforward current to E neurons,
which consist in a linear combination of the stimulus features s(t) weighted by the readout weights
The current providing within-neuron feedback triggered by each spike,
was recently recovered37. This current has the kinetics of the single neuron readout
Finally, we here derived the non-specific external current:
that captures the ensemble of non-specific synaptic currents received by each single neuron. The non-specific current has a homogeneous mean across all neurons of the same cell type, and a neuron-specific fluctuation. The mean of the non-specific current can be traced back to the weighting of the metabolic cost over the encoding error in model objectives (Eq. 10), while the fluctuation can be traced back to the noise intensity that we assumed in the condition for spiking (Eq. 12). The non-specific external current might arise because of synaptic inputs from other brain areas than the brain area that delivers feedforward projections to the E-I network we consider here, or it might result from synaptic activity of neurons that are part of the local network, but are not tuned to the feedforward input69.
We also recall the fast and slower time scales of single neuron activity:
and the connectivity matrices
The structure of synaptic connectivity is fully determined by the similarity of tuning vectors of the presynaptic and the postsynaptic neurons (
Stimulus features
We define stimulus features as a set of k = 1, …, M independent Ornstein-Uhlenbeck processes with vanishing mean, standard deviation σs and the correlation time τs,
If not mentioned otherwise, we use the following parameters: σs = 2 (mV)1/2 and τs = 10 ms. Variables ηk(t) are independent Gaussian white noise processes with zero mean and covariance function ⟨ηk(t)ηl(t′)⟩ = δklδ(t − t′). These variables should not be confused with the Gaussian white noises
Parametrization of synaptic connectivity
In the efficient E-I model, synaptic weights
We can achieve a further substantial decrease in the number of free parameters by using a parametric distribution of tuning parameters
Given M features, we sample tuning parameters,
This ensures that the length of tuning vectors
By combining Eq. (25) and Eq. (32), we obtain the synaptic weights,
In the M = 3 dimensional case, we have that the distribution of the angle between two vectors is
Thus, the upper bound for the synaptic weight between cell types x and y is simply
From the Eq. (34), we have that the mean E-I connectivity is equal to the mean I-E connectivity
Performance measures
Average encoding error and average metabolic cost
The definition of the time-dependent loss functions (Eq. 10) induces a natural choice for the performance measure: the mean squared error (MSE) between the targets and their estimators for each cell type. In the case of the E population, the time-dependent encoding error is captured by the variable ϵE(t) in the Eq. (11) and in case of I population it is captured by ϵI (t) defined in the same equation. We used the root MSE (RMSE), a standard measure for the performance of an estimator40. For the cell type y ∈ {E, I} in trial q, the RMSE is measured as
with ⟨zq(t)⟩t,q denoting the time- and trial-average.
Following the definition of the time-dependent metabolic cost in the loss functions (Eq. 10), we measured the average metabolic cost in a trial q for the cell type y ∈ {E, I} as
with time-dependent metabolic cost κy(t) as in model’s objectives (Eq. 11) and ⟨zq(t)⟩ t,q the time- and trial-average. The square root was taken to have the same scale as for the RMSE (see Eq. 37).
The bias of the estimator
The MSE can be decomposed into the bias and the variance of the estimator. The time-dependent bias of estimates
with ⟨zq(t)⟩ t,q the trial-averaged realization at time t. To have an average measure of the encoding bias, we averaged the bias of estimators over time and over input dimensions:
The averaging over time and input dimensions is justified because sk(t) are independent realizations of the Ornstein-Uhlenbeck process (see Eq.31) with vanishing mean and with the same time constant, and variance across input dimensions.
Criterion for determining optimal model parameters
The equations of the E-I spiking network in Eqs. 30a-30h (Methods), derived from the instantaneous loss functions, give efficient coding solutions valid for any set of parameter values. However, to choose parameters values in simulated data in a principled way, we performed a numerical optimization of the performance function detailed below. Numerical optimization gave the set of optimal parameters listed in Table 1. When testing the efficient E-I model with simulations, we used the optimal parameters in Table 1 and changed only the parameters plotted in the figure axes on a figure-by-figure basis.
To estimate the optimal set of parameters θ = θ*, we performed a grid search on each parameter θi while keeping all other parameters fixed as specified in Table 1. While varying the parameters, we measured a weighted sum of the time- and trial-averaged encoding error and metabolic cost. For each cell type y ∈ {E, I}, we computed
with ⟨zq(t)⟩ t,q the average over time and over trials and with ϵy(t) and κy(t) as in model’s objectives (Eq. 11).
To optimize the performance measure, we used a value of gL = 0.7. The parameter gL in the Eq. (40a) regulates the relative importance of the average encoding error over the average metabolic cost. Since the performance measure in Eq. (40a) is closely related to the average over time and trials of the instantaneous loss function (Eq. 10) where the parameter β regulates the relative weight of instantaneous encoding error over the metabolic cost, setting gL is effectively achieved by setting β.
The optimal parameter set θ = θ* reported in Table 1 is the parameter set that minimizes the sum of losses across E and I cell type
For visualization of the behavior of the average metabolic cost (Eq. 38) and average loss (Eq. 40a) across a range of a specific parameter θi, we summed these measures across the E and I cell type and normalized them across the range of tested parameters.
The exact dynamic and performance of our model depends on the realizations of random variables which describe the the tuning parameters
Functional activity measures
Tuning similarity
The pair-wise tuning similarity was measured as the cosine similarity91, defined as:
with
Cross-correlograms of spike timing
The time-dependent coordination of spike timing was measured with the cross-correlogram (CCG) of spike trains, corrected for stimulus-driven coincident spiking. The raw cross-correlogram (CCG) for neuron i of cell type y and neuron j of cell type x was measured as follows:
with q = 1, …, Q simulation trials with identical stimulus and T the duration of the trial. We subtracted from the raw CCG the CCG of trial-invariant activity. To evaluate the trial-invariant cross-correlogram, we first computed the peri-stimulus time histogram (PSTH) for each neuron as follows:
The trial-invariant CCG was then evaluated as the cross-correlation function of PSTHs between neurons i and j,
Finally, the temporal coordination of spike timing was computed by subtracting the correction term from the raw CCG:
Average imbalance of synaptic inputs
We considered time and trial-averaged synaptic inputs to each E and I neuron i in trial q, evaluated as:
with synaptic currents to E neurons
Instantaneous balance of synaptic inputs
We measured the instantaneous balance of synaptic inputs as the Pearson correlation of time-dependent synaptic inputs incoming to the neuron i. For those synaptic inputs that are defined as weighted delta-spikes (for which the Pearson correlation is not well defined; see Eq. 30c), we convolved spikes with a synaptic filter
where we used the expression for the feedforward synaptic current from the Eq. (30d). Note that the feedforward synaptic current is already already low-pass filtered (see Eq. 31). Using synaptic inputs from the Eq. 44, we computed the Pearson correlation of synaptic inputs incoming to single E neurons,
Perturbation of connectivity
To test the robustness of the model to random perturbations of synaptic weights, we applied a random jitter to optimally efficient recurrent synaptic connectivity weights. The random jitter was proportional to the synaptic weight,
Computer simulations
We run computer simulations with Matlab R2023b (Mathworks). The membrane equation for each neuron was integrated with Euler integration scheme with the time step of dt = 0.02 ms.
The simulation of the E-I network with 400 E units and 100 I units for an equivalent of 1 second of neural activity lasted approximately 1.65 seconds on a laptop.
Supplementary material
Supplementary text 1: Derivation of the one cell type model
An efficient spiking model network with one cell type (1CT) has been developed previously28, and properties of the 1CT model where the computation is assumed to be the leaky integration of inputs has been addressed in a number of previous studies29,43,36,33,42. Compared to the efficient E-I model, the 1CT model can be seen as a simplification, and can be treated similarly to the E-I model, which is what we demonstrate in this section.
As the name of the model suggests, all neurons in the 1CT model are of the same cell type, and we have i = 1, …, N such neurons. We can then use the definitions in Eqs. (6) - (9) (now without the index y) and a loss function similar to the one in36, but with only one (quadratic) regularizer
with β1 > 0. The encoding error of the one cell type model minimizes the squared distance between the target signal xk(t) and the estimate
with ξi(t) the noise at the condition for spiking. Same as in the E-I model, we define the noise as an Ornstein-Uhlenbeck process with zero mean, obeying
where ηi is a Gaussian white noise and λ = τ−1 is the inverse time constant of the process. We now define proxies of the membrane potential and the firing threshold as
Differentiating the proxy of the membrane potential ui(t) and rewriting the model as an integrate-and-fire neuron, we get
We now proceed in the same way as with the E-I model and define new variables
In these new variables, we can rewrite the membrane equation of the 1CT model as follows:
Finally, we rewrite the model with a more compact notation of a leaky integrate-and-fire neuron model with transmembrane currents,
with currents
Note that the model with one cell type does not obey Dale’s law, since the same neuron sends to its postsynaptic targets excitatory and inhibitory currents, depending on the tuning similarity of the presynaptic and the postsynaptic neuron wi and wj (Eq. S.8b). In particular, if the pre- and postsynaptic neurons have similar selectivity
Dale’s law can be imposed to the 1CT model the same way as in the E-I model. To do so, we removed synaptic interactions between neurons with different selectivity by rectifying the connectivity matrix,
However, this manipulation results in a network with only inhibitory recurrent synaptic interactions, and thus a network of only inhibitory neurons. Network with only inhibitory interactions is less relevant for the description of recurrently connected biological networks.
Supplementary text 2: Analysis of the one cell type model and comparison with the E-I model
We re-derived the 1CT model as a simplification of the E-I network (Supplementary Text 1, Supplementary Fig. S1A-B), with objective function of the same form as LE and by allowing a single type of neurons sending both excitatory and inhibitory synaptic currents to their post-synaptic targets (Supplementary Fig. S1C). Similarly to the E-I model, also the 1CT model exhibits structured connectivity, with synaptic strength depending on the tuning similarity between the presynaptic and the postsynaptic neuron. Pairs of neurons with stronger tuning similarity (dissimilarity) have stronger mutual inhibition (excitation); see Supplementary Fig. S1D.
We compared the coding performance of the E-I model with that of a fully connected 1CT model. Both models received the same set of stimulus features and performed the same computation. In the 1CT model, tuning parameters were drawn from the same distribution as used for the E neurons in the E-I model. We used the same membrane time constant τ in both models, while the metabolic constants (β of the E-I model and β1 of the 1CT model) and the noise intensity (σ of the E-I model and σ1 of the 1CT model) were chosen such as to optimize the average loss for each model (Fig. 5B for E-I model, Supplementary Fig. S1F-G for 1CT model). Parameters of the 1CT model are listed in the Supplementary Table S1. A qualitative comparison of the E-I and the 1CT model showed that with optimal parameters, both models accurately tracked multiple target signals (Fig. 1G and Supplementary Fig. S1E).
To compare the performance of the E-I and the 1CT models also quantitatively, we measured the average encoding error (RMSE), metabolic cost (MC) and loss of each model. The RMSE and the MC in the 1CT model were measured as in Eq. 37 and 38, while the average loss of each model was evaluated as follows:
Unless mentioned otherwise, we weighted stronger the encoding error compared to the metabolic cost and used gL = 0.7. Note that our comparison of the losses is conservative, because the metabolic cost is defined as a sum of activities across neurons (Eq. 38) and the total number of neurons in the E-I model (NE + NI) is larger than the number of neurons in the 1CT model (N1CT = NE).

Table of default model parameters for the efficient network with one cell type.
The parameters N, M, τ and σw are chosen identical to the E-I network (see Table 1 in the main text). Parameters σ1 and β1 are determined as values that maximize network efficiency (see section “Performance measures” in the main text).
Supplementary Figures

Efficient spiking model with one cell type.
(A) Schematic of efficient coding with a single spiking neuron with positive weight. The target signal (bottom, black) integrates the input signal (top). The neuron spikes to keep the readout of its activity (magenta) close to the target signal.
(B) Schematic of the efficient 1CT model. Target signal x(t) is computed from stimulus features s(t). The network generates the estimate of the target signal with the population readouts of the spiking activity.
(C) Schematic of excitatory (red) and inhibitory (blue) synaptic interactions in 1CT model. Neurons with similar selectivity inhibit each other (blue), while neurons with different selectivity excite each other (red). The same neuron is sending excitatory and inhibitory synaptic outputs.
(D) Strength of recurrent synapses as a function of the tuning similarity.
(E) Simulation of the network with 1CT. Top three rows show the signal (black), and the estimate (magenta) in each of the 3 input dimensions.
(F) Left: Root mean squared error (RMSE) as a function of the metabolic constant β1. Right: Normalized metabolic cost (green) and normalized average loss (black) as a function of the metabolic constant β1. The black arrow denotes the minimum of the loss and thus the optimal parameter β1.
(G) Same as in F, measured as a function of the noise intensity σ1.
(H) Average loss as a function of the weighting of the encoding error and the metabolic cost, gL, in the E-I model (black) and in the 1CT model (magenta). For plots F-H, results were computed in 100 simulation trials of duration of 1 second of simulated time. For other parameters, see Table 1 (E-I model) and Table S1 (1CT model).

Tuning similarity and its relation to lateral excitation/inhibition.
(A) Pair-wise tuning similarity for all pairs of E neurons. Tuning similarity between pairs of neurons is measured as the similarity of normalized tuning vectors.
(B) Histogram of tuning similarity across all E-E pairs shown in A.
(C) Tuning similarity to a single, randomly selected target neuron. Tuning similarity to a single neuron corresponds to a vector from the tuning similarity matrix in A. We sorted the tuning similarity to a single neuron from smallest to biggest value. Neurons with negative similarity are grouped as neurons with different tuning, while neurons with positive tuning similarity are grouped as neurons with similar tuning.
(D) Histogram of tuning similarity of E neurons to the target neuron shown in C. With distribution of tuning parameters symmetric around zero as used in our study, any choice of target neuron gives approximately the same number of neurons with similar and different selectivity.
(E) Top: Trial and neuron-averaged deviation of the instantaneous firing rate from the baseline firing rate, for the population of I (top) and E (bottom) neurons with similar tuning (magenta) and different tuning (gray). The baseline firing rates were 6.8 Hz and 12.7 Hz in the E and I cell types, respectively. The stimulation intensity is ap = 0.4. Figure shows the mean ± standard error of the mean (SEM), with SEM capturing the variance across neurons and across trials. Bottom: Scatter plot of the tuning similarity versus effective connectivity in I (top) and E neurons (bottom). Tuning similarity and effective connectivity are measured with respect to the (same) target neuron. Red line marks zero effective connectivity and magenta line marks the least-squares line.
(F) Same as in E, for stimulation intensity of ap = 0.8.
(G) Same as in E, in presence of weak feedforward stimulus, showing the activity of neurons with similar tuning (orange) and different tuning (gray) to the stimulated neuron. We used the stimulation intensity at threshold (ap = 1.0). The feedforward stimulus was received by all E neurons and it induced, together with the external current, the mean firing rates of 7.3 Hz and 13.5 Hz in E and I neurons, respectively. For model parameters, see Table 1. This figure is related to the Fig. 2 in the main paper.

Effect of complete and partial removal of connectivity structure and of minimal perturbation of synaptic weights.
(A) Average coefficient of variation in networks with fully unstructured connectivity. The dashed line marks the same measure in a structured network.
(B) Mean firing rate in E (top) and I neurons (bottom) in networks with partial removal of connectivity structure in recurrent connectivity. Partial removal of connectivity structure is achieved by limiting the permutation of synaptic connectivity to neuronal pairs with similar tuning.
(C) Same as in B, showing the coefficient of variation of spiking activity.
(D) Same as in B, showing the average net synaptic current, neural correlate of the average E-I balance.
(E) Same as in B, showing the correlation coefficient of synaptic currents, neural correlate of the instantaneous E-I balance.
(F) Encoding error in networks with partially unstructured recurrent connectivity, relative to the encoding error of the structured network (dashed line). From left to right: we perturb synaptic weights in E-I, I-I, I-E and in all three recurrent connectivities at once.
(G) Same as in F, showing the metabolic cost on spiking in E and I populations, relative to the metabolic cost in the structured network (dashed line).
(H) The RMSE (top) and the normalized metabolic cost (green) and average loss (black) average firing rate (bottom) in E and I cell type, as a function of the strength of perturbation of the synaptic connectivity.
(I) Average firing rate (top) and the coefficient of variation (bottom) as a function of the strength of random perturbation of all recurrent connectivities.
(J) Target signals, E estimates and I estimates in three input dimensions (three top rows), spike trains (fourth row) and the instantaneous estimate of the firing rate of E and I populations (bottom) in a single simulation trial, with significant perturbation of recurrent connectivity (perturbation strength of 0.5, see Methods). In spite of a relatively strong perturbation, the network shows excellent encoding of the target signal. Other parameters are in Table 1. This figure is related to the Fig. 3 in the main paper.

Lateral excitation/inhibition in models with full and partial removal of connectivity structure.
(A) Average deviation of the instantaneous firing rate from the baseline for the population of I (top) and E (bottom) neurons in networks with fully removed structure in E-I (left), I-E (middle) and in all connectivity matrices (right). We show the mean ± SEM for neurons with similar (ochre) and different (green) tuning to the stimulated neuron. The mean traces of the network with structured connectivity is shown for comparison, magenta and gray for similar and different tuning, respectively.
(B) Same as in A, for partial (fine-grained) removal of connectivity structure. Partial removal of connectivity structure is achieved by limiting the permutation of synaptic weights among neurons with similar tuning. Such manipulation maintains the like-like connectivity structure, but removes any structure beyond the like-like.
(C) Scatter plot of tuning similarity versus effective connectivity for networks with partial removal of connectivity structure. In such networks, the specificity of effective connectivity with respect to tuning similarity is largely preserved, in particular in E neurons. For all results, we iterated simulations in 200 trials, where we varied randomly the membrane potential noise and initial conditions of the membrane potentials in each trial, while tuning and synaptic parameters were kept fixed. In all cases, we used stimulation intensity at threshold (ap = 1.0). For model parameters, see Table 1. This figure is related to the Fig. 3 in the main paper.

Dependence of coding efficiency and neural dynamics on the ratio of mean I-I to E-I connectivity, computed by changing the mean E-I connectivity.
(A) Top: Encoding error (RMSE) of the E (red) and I (blue) estimates. Bottom: Normalized metabolic cost and average loss.
(B) Average firing rate (top), and average coefficient of variation (bottom) in E and I cell type.
(C) Average imbalance and instantaneous balance of synaptic currents in E and I neurons.
(D) Top: Optimal ratio of mean I-I to E-I connectivity as a function of the weighting of the average loss of E and I cell type. Bottom: Same as on top, as a function of the weighting between the error and the cost. Black triangles mark weightings that are typically used to estimate optimal efficiency. For other parameters, see Table 1. This figure is related to the Fig. 6 in the main paper.

Effect of stimulus properties on efficient neural coding and dynamics.
(A) Average firing rate (top), and average coefficient of variation (bottom) in E and I cell type, as a function of the time constant of the stimulus τs.
(B) Average imbalance (top) and instantaneous balance (bottom) as a function of the time constant of the stimulus τs.
(C-D) Same as in A-B, as a function of the number of encoded variables. For parameters, see Table 1. This figure is related to the Fig. 7 in the main paper.
References
- 1.Building functional networks of spiking model neuronsNature neuroscience 19:350–355
- 2.Learning universal computations with spikesPLoS computational biology 12:e1004895
- 3.Possible principles underlying the transformation of sensory messagesSensory communication 1:217–233
- 4.Emergence of simple-cell receptive field properties by learning a sparse code for natural imagesNature 381:607–609
- 5.Efficiency turns the table on neural encoding, decoding and noiseCurrent Opinion in Neurobiology 37:141–148
- 6.Natural image statistics and neural representationAnnual review of neuroscience 24:1193–1216
- 7.Sparse coding with an overcomplete basis set: A strategy employed by v1?Vision research 37:3311–3325
- 8.Sparse coding and decorrelation in primary visual cortex during natural visionScience 287:1273–1276
- 9.Understanding vision: theory, models, and dataUSA: Oxford University Press
- 10.Could information theory provide an ecological theory of sensory processing?Network: Computation in neural systems 3:213–251
- 11.Sparse coding of sensory inputsCurrent opinion in neurobiology 14:481–487
- 12.Efficient coding of natural soundsNature neuroscience 5:356–363
- 13.Sparse incomplete representations: A potential role of olfactory granule cellsNeuron 72:124–136
- 14.Reading a neural codeScience 252:1854–1857
- 15.Reliability and information transmission in spiking neuronsTrends in neurosciences 15:428–434
- 16.The role of spike timing in the coding of stimulus location in rat somatosensory cortexNeuron 29:769–777
- 17.Neural coding of natural stimuli: information at sub-millisecond resolutionPLoS computational biology 4:e1000025
- 18.Millisecond encoding precision of auditory cortex neuronsProceedings of the National Academy of Sciences 107:16976–16981
- 19.Neural codes formed by small and temporally precise populations in auditory cortexJournal of Neuroscience 33:18277–18287
- 20.Sensory neural codes using multiplexed temporal scalesTrends in neurosciences 33:111–120
- 21.Efficiency and ambiguity in an adaptive neural codeNature 412:787–792
- 22.Timescales of inference in visual adaptationNeuron 61:750–761
- 23.Encoding of naturalistic stimuli by local field potential spectra in networks of excitatory and inhibitory neuronsPLoS computational biology 4:e1000239
- 24.Efficient andadaptive sensory codesNature Neuroscience 24:998–1009
- 25.Computational methods to study information processing in neural circuitsComputational and Structural Biotechnology Journal 21:910–922
- 26.Perceptual inference predicts contextual modulations of sensory responsesJournal of Neuroscience 32:4179–4195
- 27.Visual nonclassical receptive field effects emerge from sparse coding in a dynamical systemPLoS computational biology 9:e1003191
- 28.Predictive coding of dynamical variables in balanced spiking networksPLoS Comput Biol 9:e1003258
- 29.Learning optimal spike-based representationsAdvances in neural information processing systems 25:2285–2293
- 30.Causal inference and explaining away in a spiking networkScientific Reports 5:17531
- 31.Neural oscillations as a signature of efficient coding in the presence of synaptic delaysElife 5:e13824
- 32.Efficient codes and balanced networksNature neuroscience 19:375–382
- 33.Population adaptation in efficient balanced networksElife 8:e46926
- 34.Predictive coding in balanced neural networks with noise, chaos and delaysAdvances in Neural Information Processing Systems Curran Associates, Inc :16677–16688
- 35.Poisson balanced spiking networksPLoS computational biology 16:e1008261
- 36.Computational account of spontaneous activity as a signature of predictive codingPLoS computational biology 13:e1005355
- 37.Biologically plausible solutions for spiking networks with efficient codingAdvances in Neural Information Processing Systems Curran Associates, Inc :20607–20620
- 38.Adaptive exponential integrate-and-fire model as an effective description of neuronal activityJournal of neurophysiology 94:3637–3642
- 39.Parameter extraction and classification of three cortical neuron types reveals two distinct adaptation mechanismsJournal of neurophysiology 107:1756–1775
- 40.Neuronal dynamics: From single neurons to networks and models of cognitionCambridge University Press
- 41.The quantitative single-neuron modeling competitionBiological cybernetics 99:417
- 42.Learning to represent signals spike by spikePLoS computational biology 16:e1007692
- 43.Optimal compensation for neuron lossElife 5:e12454
- 44.Learning nonlinear dynamics in efficient, balanced spiking networks using local plasticity rulesProceedings of the AAAI Conference on Artificial Intelligence https://doi.org/10.1609/aaai.v32i1.11320
- 45.Inhibitory plasticity balances excitation and inhibition in sensory pathways and memory networksScience 334:1569–1573
- 46.Influence of highly distinctive structural properties on the excitability of pyramidal neurons in monkey visual and prefrontal corticesJournal of Neuroscience 32:13644–13660
- 47.The importance of mixed selectivity in complex cognitive tasksNature 497:585–590
- 48.The brain as an efficient and robust adaptive learnerNeuron 94:969–977
- 49.What is optimal in optimal inference?Current Opinion in Behavioral Sciences 29:117–126
- 50.Noise in the nervous systemNature reviews neuroscience 9:292–303
- 51.Functional specificity of local synaptic connections in neocortical networksNature 473:87–91
- 52.In-vivo measurement of cell-type-specific synaptic connectivity and synaptic transmission in layer 2/3 mouse barrel cortexNeuron 85:68–75
- 53.Local connectivity and synaptic dynamics in mouse and human neocortexScience 375:eabj5861
- 54.A review of the integrate-and-fire neuron model: I. homogeneous synaptic inputBiological cybernetics 95:1–19
- 55.Towards a theory of cortical columns: From spiking neurons to interacting neural populations of finite sizePLoS Comput. Biol 13:e1005507
- 56.A user’s guide to generalized integrate-and-fire modelsComputational Modelling of the Brain: Modelling Approaches to Cells, Circuits and Networks Springer :69–86
- 57.The excitatory neuronal network of the C2 barrel column in mouse primary somatosensory cortexNeuron 61:301–316
- 58.What is the dynamical regime of cerebral cortex?Neuron 109:3373–3391
- 59.Instantaneous correlation of excitation and inhibition during ongoing and sensory-evoked activitiesNature neuroscience 11:535–537
- 60.Equalizing excitation–inhibition ratios across visual cortical neuronsNature 511:596–600
- 61.Single-neuron perturbations reveal feature-specific competition in V1Nature 567:334–340
- 62.The logic of recurrent circuits in the primary visual cortexNature Neuroscience 27:1–11
- 63.Dynamics of sparsely connected networks of excitatory and inhibitory spiking neuronsJournal of computational neuroscience 8:183–208
- 64.The asynchronous state in cortical circuitsScience 327:587–590
- 65.Synaptic plasticity: taming the beastNature neuroscience 3:1178–1183
- 66.Homeostatic plasticity in the developing nervous systemNature reviews neuroscience 5:97–107
- 67.Patterns of interval correlations in neural oscillators with adaptationFront. Comput. Neurosci 7:164
- 68.Network analysis of murine cortical dynamics implicates untuned neurons in visual stimulus codingCell Reports 31:107483
- 69.The role of untuned neurons in sensory information codingBioRxiv 134379
- 70.Impact of network activity on the integrative properties of neocortical pyramidal neurons in vivoJournal of neurophysiology 81:1531–1547
- 71.The high-conductance state of neocortical neurons in vivoNature reviews neuroscience 4:739–751
- 72.Chaos in neuronal networks with balanced excitatory and inhibitory activityScience 274:1724–1726
- 73.Neuronal circuits overcome imbalance in excitation and inhibition by adjusting connection numbersProceedings of the National Academy of Sciences 118:e2018459118
- 74.Interneurons of the neocortical inhibitory systemNature reviews neuroscience 5:793–807
- 75.Functional organization of excitatory synaptic strength in primary visual cortexNature 518:399–403
- 76.Towards spike-based machine intelligence with neuromorphic computingNature 575:607–617
- 77.Opportunities for neuromorphic computing algorithms and applicationsNature Computational Science 2:10–19
- 78.Excitatory and inhibitory subnetworks are equally selective during decision-making and emerge simultaneously during learningNeuron 105:165–179
- 79.Response features of parvalbumin-expressing interneurons suggest precise roles for subtypes of inhibition in visual cortexNeuron 67:847–857
- 80.Synaptic wiring motifs in posterior parietal cortex support decision-makingNature 627:367–373
- 81.Theory of neuronal perturbome in cortical networksProceedings of the National Academy of Sciences 117:26966–26976
- 82.Functional specificity of recurrent inhibition in visual cortexNeuron 112:991–1000
- 83.Sparse recurrent excitatory connectivity in the microcircuit of the adult mouse and human cortexElife 7:e37349
- 84.The fractions of short-and long-range connections in the visual cortexProceedings of the National Academy of Sciences 106:3555–3560
- 85.Signatures of criticality in efficient coding networksbioRxiv https://www.biorxiv.org/content/early/2023/02/14/2023.02.14.528465
- 86.Correlations enhance the behavioral readout of neural population activity in association cortexNature neuroscience 24:975–986
- 87.The structures and functions of correlations in neural population codesNature Reviews Neuroscience 23:551–567
- 88.Transformations of sensory information in the brain suggest changing criteria for optimalityPLOS Computational Biology 20:e1011783
- 89.What is Dale’s principleDale’s Principle and Communication Between Neurones :1–5
- 90.A note on a method for generating points uniformly on n-dimensional spheresCommunications of the ACM 2:19–20
- 91.Cosine normalization: Using cosine similarity instead of dot product in neural networksArtificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks Springer :382–391
Article and author information
Author information
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
- Reviewed Preprint version 3:
Copyright
© 2024, Koren et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 384
- downloads
- 20
- citation
- 1
Views, downloads and citations are aggregated across all versions of this paper published by eLife.