Abstract
Descriptive statistical models of neural responses generally aim to characterize the mapping from stimuli to spike responses while ignoring biophysical details of the encoding process. Here, we introduce an alternative approach, the conductancebased encoding model (CBEM), which describes a mapping from stimuli to excitatory and inhibitory synaptic conductances governing the dynamics of subthreshold membrane potential. Remarkably, we show that the CBEM can be fit to extracellular spike train data and then used to predict excitatory and inhibitory synaptic currents. We validate these predictions with intracellular recordings from macaque retinal ganglion cells. Moreover, we offer a novel quasibiophysical interpretation of the Poisson generalized linear model (GLM) as a special case of the CBEM in which excitation and inhibition are perfectly balanced. This work forges a new link between statistical and biophysical models of neural encoding and sheds new light on the biophysical variables that underlie spiking in the early visual pathway.
Introduction
Studies of neural coding seek to reveal how sensory information is encoded in neural spike responses. A complete understanding this code requires knowledge of the statistical relationship between stimuli and spike trains, as well as the biophysical mechanisms by which this transformation is carried out. A popular approach to the neural coding problem involves ‘cascade’ models, such as the linearnonlinearPoisson (LNP) or generalized linear model (GLM), to characterize how external stimuli are converted to spike trains. These descriptive statistical models describe the encoding process in terms of a series of stages: linear filtering, nonlinear transformation, and ending with noisy or conditionally Poisson spiking (Chichilnisky, 2001; Paninski, 2004; Vintch et al., 2012; Park et al., 2013; Theis et al., 2013; Vintch et al., 2015). These models have found broad application to neural data, and the Poisson GLM in particular has provided a powerful tool for characterizing neural encoding in a variety of sensory, cognitive, and motor brain areas (Harris et al., 2003; Truccolo et al., 2005; Pillow et al., 2008; Gerwinn, 2010; Stevenson et al., 2012; Weber et al., 2012; Park et al., 2014; Hardcastle et al., 2015; Yates et al., 2017).
However, there is a substantial gap between cascadestyle descriptive statistical models and mechanistic or biophysically interpretable models. In real neurons, stimulus integration is nonlinear, arising from an interplay between excitatory and inhibitory synaptic inputs that depend nonlinearly on the stimulus; these inputs in turn drive conductance changes that alter the nonlinear dynamics governing membrane potential. In retina and other sensory areas, the tuning of excitatory and inhibitory inputs can differ substantially (Roska et al., 2006; Trong and Rieke, 2008; Poo and Isaacson, 2009; Cafaro and Rieke, 2013), meaning that a single linear filter is not sufficient to describe stimulus integration in single neurons. Determining how stimuli influence neural conductance changes, and thus the computations that neurons perform, therefore remains an important challenge. This challenge is exacerbated by the fact that most studies of neural coding rely on extracellular recordings, which detect only spikes and not synaptic conductance changes that drive them.
Here, we aim to narrow the gap between descriptive statistical models and biophysically interpretable models, while remaining within the domain of models that can be estimated from extracellular spike train data (Meng et al., 2011; Meng et al., 2014; Volgushev et al., 2015; Lankarany, 2017). We first introduce a quasibiophysical interpretation of the standard Poisson GLM, which reveals its equivalence to a constrained conductancebased model with equal and opposite excitatory and inhibitory tuning. We then relax these constraints in order to obtain a more flexible and more realistic conductancebased model with independent tuning of excitatory and inhibitory inputs. The resulting model, which we refer to as the conductancebased encoding model (CBEM), can capture key features of real neurons such as shunting inhibition and timevarying changes in gain and membrane time constant. We show that the CBEM can predict excitatory and inhibitory synaptic conductances from stimuli and extracellular spike trains alone, which we validate by comparing model predictions to conductances measured with intracellular recordings in macaque parasol and midget retinal ganglion cells (RGCs). This work differs from previous cascade modeling approaches for separating excitatory and inhibitory inputs (e.g., Butts et al., 2011; Ozuysal et al., 2018; McFarland et al., 2013; Maheswaranathan et al., 2018) by explicitly defining the model components in a biophysical framework and directly comparing model predictions to measured excitation and inhibition tuning in individual cells. We also show that the CBEM outperforms the standard GLM at predicting retinal spike responses to novel stimuli. These differences highlight the CBEM’s ability to shed light on the computations performed by sensory neurons in naturalistic settings.
Results
Background: Poisson GLM with spike history
The Poisson GLM provides a simple yet powerful description of the encoding relationship between stimuli and neural responses (Truccolo et al., 2005). A recurrent Poisson GLM, often referred to in the neuroscience literature simply as ‘the GLM’, describes neural encoding in terms of a cascade of linear, nonlinear, and probabilistic spiking stages (Figure 1a). The GLM parameters consist of a stimulus filter $\mathbf{\mathbf{k}}$, which describes how the neuron integrates an external stimulus, a postspike filter $\mathbf{\mathbf{h}}$, which captures dependencies on spike history, and a baseline $b$ that determines baseline firing rate in the absence of input. The outputs of these filters are summed and passed through a nonlinear function ${f}_{r}$ to obtain the conditional intensity for an inhomogeneous Poisson spiking process. The model can be written concisely in discrete time as:
where ${\lambda}_{t}\ge 0$ is the spike rate (or conditional intensity) at time $t$, ${\mathbf{\mathbf{x}}}_{t}$ is the spatiotemporal stimulus vector at time $t$, ${\mathbf{\mathbf{y}}}_{t}^{hist}$ is a vector of relevant spike history at time $t$, and ${y}_{t}$ is the spike count in bin of size $\mathrm{\Delta}$. Although spike generation is conditionally Poisson, the model can capture complex historydependent response properties such as refractoriness, bursting, bistability, and adaptation (Weber and Pillow, 2016). Additional filters can be added to the model in order to incorporate dependencies on covariates of the response such as spiking in other neurons or local field potential recorded on nearby electrodes (Truccolo et al., 2005; Pillow et al., 2008; Kelly et al., 2010). A common choice for the nonlinearity is exponential, $f(z)=\mathrm{exp}(z)$, which corresponds to the ‘canonical’ inverse link function for Poisson GLMs.
Previous literature has offered a quasibiological interpretation of the GLM known as ‘soft threshold’ integrateandfire (IF) model (Plesser and Gerstner, 2000; Gerstner, 2001; Paninski et al., 2007; Mensi et al., 2011). This interpretation views the summed filter outputs as the neuron’s membrane potential. This is similar to the standard IF model in which membrane potential is a linearly filtered version of input current (as opposed to conductancebased input). The nonlinear function ${f}_{r}$ can be interpreted as a ‘soft threshold’ function that governs a smooth increase in the instantaneous spike probability as a function of membrane depolarization. Lastly, the postspike current $\mathbf{\mathbf{h}}$ determines how membrane potential is reset following a spike.
We can rewrite the standard GLM to emphasize this biological interpretation explicitly:
Note that in this ‘soft’ version of the IF model, the only noise source is the conditionally Poisson spiking mechanism; this differs from other noisy extensions of the IF model with linear currentbased input and ‘hard’ spike thresholds, which require more elaborate methods for computing likelihoods (Paninski, 2004; Pillow et al., 2005; Paninski et al., 2008). To convert this model to a classic leaky integrateandfire model, we could replace ${f}_{r}$ with a ‘hard’ threshold function that jumps from zero to infinity at some threshold value of the membrane potential, set the stimulus filter $\mathbf{\mathbf{k}}$ to an exponential decay filter, and set the postspike filter $\mathbf{\mathbf{h}}$ to a delta function that causes instantaneous reset of the membrane potential following a spike. The GLM membrane potential is a linear function of the input, just as in the classic leaky IF model, and thus both models fail to capture the nonlinearities apparent in the synaptic inputs to most real neurons (Schwartz and Rieke, 2011).
Interpreting the GLM as a conductancebased model
Here, we propose a novel biophysically realistic interpretation of the classic Poisson GLM as a dynamical model with conductancebased input. In brief, this involves writing the GLM as a conductancebased model with excitatory and inhibitory conductances governed by affine functions of the stimulus, but constrained so that total conductance is fixed. This removes voltagedependence of the membrane currents, making the membrane potential itself an affine function of the stimulus. The remainder of this section lays out the mathematical details of this interpretation explicitly.
Consider a neuron with membrane potential ${V}_{t}$ governed by the ordinary differential equation:
where ${g}_{l}$ is leak conductance, ${g}_{e}(t)$ and ${g}_{i}(t)$ are timevarying excitatory and inhibitory synaptic conductances, and ${E}_{l}$, ${E}_{e}$ and ${E}_{i}$ are the leak, excitatory and inhibitory reversal potentials.
A natural question to ask is: under what conditions, if any, is this model a GLM? Answering this question aims to reveal what biophysical assumptions the GLM implicitly enforces when modeling spike trains. Here, we provide a set of sufficient conditions for an equivalence between the two. The definition of a GLM requires the membrane potential ${V}_{t}$ to be an affine (linear plus constant) function of the stimulus, which holds if the two following conditions are met:
Total conductance ${g}_{tot}(t)$ is constant, so the membrane equation is a linear ODE.
The input current ${I}_{tot}(t)$ is an affine function of the stimulus ${\mathbf{\mathbf{x}}}_{t}$.
The first condition implies ${g}_{e}(t)+{g}_{i}(t)=c$, for some constant $c$, and the second implies that ${g}_{e}(t){E}_{e}+{g}_{i}(t){E}_{i}$ is a linear function of the stimulus.
We can satisfy these two conditions simultaneously by modeling the excitatory and inhibitory conductances as affine functions of the stimulus, driven by linear filters of opposite sign:
where ${\mathbf{\mathbf{k}}}_{c}$ denotes the linear ‘conductance’ filter, and ${b}_{e}$ and ${b}_{i}$ are arbitrary constants. Under this setting, excitatory and inhibitory conductances are driven by equal and opposite linear projections of the stimulus, with total conductance fixed at ${g}_{tot}={g}_{l}+{b}_{e}+{b}_{i}$.
We can therefore rewrite the membrane equation (Equation 6) as:
where ${b}_{tot}={b}_{e}{E}_{e}+{b}_{i}{E}_{i}$. Setting the initial voltage to the steadystate value ${V}_{0}={b}_{tot}/{g}_{tot}$, the instantaneous membrane potential is then given by
where the equivalent standard GLM filter $\mathbf{\mathbf{k}}$ is equal to the linear convolution of ${\mathbf{\mathbf{k}}}_{c}$ with an exponential decay filter, that is: $\mathbf{\mathbf{k}}={\int}_{0}^{t}({E}_{e}{E}_{i}){\mathbf{\mathbf{k}}}_{c}(t){e}^{{g}_{tot}\cdot (t{t}^{\prime})}\mathit{d}{t}^{\prime}.$ This shows that membrane potential ${V}_{t}$ is an affine function of the stimulus, so by adding a monotonic nonlinearity and conditionally Poisson spiking, the model is clearly a GLM.
Thus, to summarize, the GLM can be interpreted as a conductancebased model in which a linear filter drives equal and opposite fluctuations in excitatory and inhibitory synaptic conductances. The GLM filter $\mathbf{\mathbf{k}}$ is equal to the convolution of this conductance filter with an exponential decay filter whose time constant is the inverse of the (constant) total conductance.
The conductancebased encoding model (CBEM)
From this novel interpretation of the GLM, it is straightforward to formulate a more realistic conductancebased statistical spike train model. Namely, we can remove the constraint needed to construct a GLM: that excitatory and inhibitory conductance sum to a constant. Relaxing this constraint, so that total conductance can vary, results in a new model that we refer to as the conductancebased encoding model (CBEM). The CBEM represents an extension of GLM to allow for differential tuning of excitation and inhibition and adds rectifying nonlinearities governing the relationship between the stimulus and synaptic conductances. (See model diagram, Figure 1b). The CBEM model is no longer a GLM because the filtering it performs on the stimulus is nonlinear.
Formally, the CBEM is driven by excitatory and inhibitory synaptic conductances that are each linearnonlinear functions of the stimulus:
where ${\mathbf{\mathbf{k}}}_{e}$ and ${\mathbf{\mathbf{k}}}_{i}$ are linear filters driving excitatory and inhibitory conductance, respectively, ${f}_{g}$ is a softrectifying nonlinearity that ensures that conductances are nonnegative (see Materials and methods, Equation 14), and ${b}_{e}$ and ${b}_{i}$ determine the baseline excitatory and inhibitory conductances in the absence of input. The CBEM membrane potential ${V}_{t}$ then evolves according to the ordinary differential equation (Equation 6) under the influence of the two timevarying conductances ${g}_{e}(t)$ and ${g}_{i}(t)$.
To incorporate spikehistory effects, we add a linear autoregressive term to the membrane potential. This results in an ‘effective’ membrane potential ${\stackrel{~}{V}}_{t}$ given by:
where ${\mathbf{\mathbf{y}}}_{t}^{hist}$ is a vector of binned spike history at time t. We convert membrane potential to spike rate using a biophysically motivated output nonlinearity proposed by Mensi et al. (2011):
where $\mu $ is a ‘soft’ spike threshold, and $\alpha $ and $\beta $ jointly determine slope and sharpness of the nonlinearity, respectively (see Materials and methods). Spiking is then a conditionally Poisson process given the rate, as in the Poisson GLM (Equation 5).
The CBEM is similar to the Poisson GLM in that the only source of stochasticity is the conditionally Poisson spiking mechanism: we assume no additional noise in the conductances or the voltage. This simplifying assumption, although not biophysically accurate, makes loglikelihood simple to compute, allowing for efficient maximum likelihood inference using standard ascent methods (see Materials and methods).
Validating the CBEM modeling assumptions with intracellular data
To validate the modeling assumptions of the CBEM, we use intracellular recordings from RGCs. First, we establish that an LN model can capture the relationship between stimuli and synaptic conductances measured intracellularly (Figure 2). An LN model for RGC conductances is plausible because the bipolar cells that drive RGCs are known to be wellcharacterized by LN models (Rieke, 2001; Demb et al., 2001; Beaudoin et al., 2008; Gollisch and Meister, 2010; Liu et al., 2017; Real et al., 2017). To test the assumption in detail, we analyzed voltage clamp recordings from ON parasol RGCs in response to a fullfield noise stimulus (Trong and Rieke, 2008). We fit the measured conductances with a linearnonlinear model with a softrectified nonlinearity to account for synaptic thresholding at the bipolartoganglion cell synapse (and at the amacrine cell synapses for the inhibitory inputs): ${f}_{g}(\cdot )=\mathrm{log}(1+\mathrm{exp}(\cdot ))$. The model accurately captured the relationship between projected stimuli and observed conductances on test data, accounting for 79 ± 4% (mean ± SEM) and 63 ± 3% of the variance of mean excitatory and inhibitory conductances, respectively.
Second, we establish that the output nonlinearity ${f}_{r}$, which maps membrane potential to instantaneous firing rate (Equation 12), provides an accurate description of the empirical relationship between membrane potential and spiking (Figure 3). To validate this model component, we examined dynamic current clamp recordings from two ON parasol RGCs. The dynamic clamp recordings drove RGCs with currents determined by previously measured excitatory and inhibitory conductances. To reduce noise, we computed average membrane potential over repeated presentations of the same measured conductance traces. We then computed nonparametric estimates of the nonlinearity (see Materials and methods). We found that the parametric function we assumed (Equation 12) closely approximated a nonparametric estimate of the nonlinearity (Figure 3c black; see Materials and methods for details).
Note that although previous analyses of RGC responses using Poisson GLMs have shown that an exponential nonlinearity captures the mapping from stimuli to spike rates more accurately than a rectifiedlinear nonlinearity (Pillow et al., 2008), we found the opposite here: the nonlinearity was better described with a softrectification function. This discrepancy may result from the fact that the GLM has a single nonlinearity, whereas the CBEM has a cascade of two nonlinearities: one mapping filter output to conductance, and a second mapping membrane potential to spike rate.
Predicting conductances from spikes with CBEM
We now turn to a key application of the CBEM: the inferring of excitatory and inhibitory synaptic conductances from extracellular spike train data. To test the model’s ability to make such predictions, we fit the model parameters to a dataset consisting of stimuli and observed spike times. We then used the inferred filters to predict the excitatory and inhibitory conductances elicited in response to novel stimuli recorded in the same cells.
The training data consisted of spike trains from six macaque ONparasol RGCs obtained in cellattached recordings with fullfield white noise stimuli. Each cell was stimulated with ten unique 6 s stimulus segments, repeated three or four times each, resulting in a total of thirty to forty 6 s trials per neuron (Trong and Rieke, 2008). We fit the CBEM parameters (conductance filters and spike history filter) to a single cell’s responses to 9 of the stimulus segments and evaluated performance using the remaining heldout segment (10fold cross validation). Thus, the model was fit using spike trains elicited by three or four repeats of a 54 s fullfield noise stimulus (see Materials and methods). For comparison, we also fit the conductance filters directly to measured excitatory and inhibitory conductances from intracellular recordings using the same stimuli and the same crossvalidation procedure.
Figure 4 shows the conductance filters estimated from intracellular data (fit to conductances) and extracellular data (fit to spike trains only) for two example cells, along with the predicted excitatory and inhibitory conductances elicited by a novel test stimulus. The filters fit to spikes were similar to those fit to conductances, and the conductance predictions from both models were highly correlated with the measured traces. Figure 5 shows a summary statistics comparing the two models’ performance for all six neurons for which we had both spike train and conductance recordings. For both models, predicted conductances traces were highly correlated with the measured conductances for all six cells. Using only a few minutes of spiking data, the conductances predicted by the extracellular model had an average correlation of r = 0.73 ± 0.01 (mean ± SEM) for the excitatory conductance and r = 0.69 ± 0.03 for the inhibitory conductance, compared to averages of r = 0.89 ± 0.02 (excitation) and r = 0.82 ± 0.01 (inhibition) for the LN model fit directly to conductances (Figure 5a–b).
Although the extracellular model predicted the basic timecourse of the observed conductances with high fidelity, there were small systematic discrepancies between modelpredicted and measured conductances. For example, measured conductances had nearly zero lag in their crosscorrelation (0.0 ± 2.4 ms; see also Cafaro and Rieke, 2013), whereas the predicted excitatory conductance slightly preceded the inferred inhibition for all six cells (r = 12.6 ± 1.0 ms, Student’s ttest p < 0.0001; Figure 5e–f). The predicted excitation preceded the average measured excitation by 5.6 ± 0.7 ms (p = 0.0005), while the predicted inhibition showed only a slight and statistically insignificant delay compared to the measured inhibition (r = 2.5 ± 1.3 ms, p = 0.11; Figure 5g–h).
Positively correlated excitation and inhibition in ONmidget cells
We also applied the CBEM to spike trains recorded from 5 ONmidget cells in response to the same type of fullfield noise used for the parasol cells. In contrast to the parasol cells, ONmidget cells have positively correlated excitation and inhibition with excitation preceding inhibition (Cafaro and Rieke, 2013). This breaks the GLM assumption of equal and opposite tuning of the two conductances. A set of unique 6 s stimuli were used to the the model (33–35 trials for spike recordings and 5–20 trials for the LN conductance model). The models were compared to the average conductances recorded in response to a repeated novel 6 s stimulus (5–10 repeats).
The CBEM captured the tuning of the synaptic conductances received by the midget cells. An example cell is shown in Figure 6—figure supplement 1. The CBEM predicted the excitatory conductance with an average correlation of r = 0.85 ± 0.03 compared to the intracellular LN model with a correlation of r = 0.95 ± 0.003 (Figure 6a). The inhibitory conductance showed more nonlinear behavior than can be captured by a single LN unit: the CBEM predicted inhibition with r = 0.33 ± 0.06 and the LN fit to the conductance had a correlation coefficient of only r = 0.54 ± 0.07 (Figure 6b). The CBEM captured the fact that the inhibitory input had ON tuning, but delayed compared to excitation (Figure 6c–d). This was also seen in the crosscorrelation between excitation and inhibition (Figure 6e). The data showed a crosscorrelation peak with excitation preceding inhibition by 10.1 ± 0.52 ms, and the CBEM showed a similar timing difference of 8.2 ± 0.8 ms (paired Student’s ttest, p = 0.06; Figure 6f). However, midget cells receive OFF inhibitory input in addition to the larger ON inhibitory input (Cafaro and Rieke, 2013), and therefore a single LN unit could not completely capture inhibition in these cells. The true excitation was faster than the predicted excitation by r = 3.8 ± 0.8 ms (Student’s ttest p = 0.008), and the measured inhibition was similarly timed with the model estimate (1.56 ± 0.6 ms, p = 0.06; Figure 6g–h). In summary, the CBEM can discover positive correlations between excitation and inhibition despite being initialized using a GLM with oppositely tuned excitation and inhibition (see Materials and methods).
Characterizing spike responses with CBEM
Given the CBEM’s ability to infer intracellular conductances from spike train data, we sought to examine how well it predicts spike responses to novel stimuli. Most encoding models are only tested with data from extracellular recordings, which are far easier to obtain and to sustain over longer periods. It therefore seems natural to ask: does the CBEM’s increased degree of biophysical realism confer advantages for predicting spikes?
To answer this question, we fit the CBEM and classic Poisson GLM to a population of 9 extracellularly recorded macaque RGCs stimulated with fullfield binary white noise (Uzzell and Chichilnisky, 2004; Pillow et al., 2005). We evaluated spike rate prediction by comparing the peristimulus time histogram (PSTH) of the simulated models to the PSTH of real neurons using a 5 s test stimulus (Figure 7). The CBEM had higher prediction accuracy than the GLM for all nine cells, 86% of the variance of the PSTH on average vs. 77% for the GLM. We then evaluated spike train prediction by comparing loglikelihood on a 5 min test dataset. The CBEM again outperformed the GLM on all cells, offering an improvement of 0.34 ± 0.11 bits/spike on average over the GLM.
To gain insight into the CBEM’s superior performance, we examined the average firing rate predictions of the GLM along with the average conductance predictions of the CBEM (Figure 7c). We found that GLM rate prediction errors (relative to the PSTH of the real neuron) were anticorrelated with the magnitude of the CBEM inhibitory conductance; the CBEM inhibitory conductance at times when the GLM spike rate exceeded the true spike rate was significantly higher than the CBEM inhibitory conductance at times when the GLM spike rate underestimated the true spike rate (ttest, p < 0.0001; Figure 7—figure supplement 1b). This suggests that the CBEM inhibitory conductance helped CBEM predictions by reducing the firing at times when the GLM overpredicted the firing rate. In contrast, the distribution of excitatory conductances did not depend on the sign of the rate prediction error (ttest, p = 0.19; Figure 7—figure supplement 1a), and the predicted excitatory conductance was positively correlated with the magnitude of the error (r = 0.33, p < 0.0001).
Previous experiments have indicated that inhibition only weakly modulates parasol cell responses to fullfield Gaussian noise stimuli (Cafaro and Rieke, 2013). To test the effect of inhibition in the model, we also refit the CBEM without any inhibitory synaptic input (CBEM_{exc}). We compared the excitatory filters estimated by the CBEM_{exc} with the GLM filters and found that the filters are nearly identical (Figure 8e). This indicates that the GLM stimulus filter accounts only for the excitatory input received by the cell. The CBEM_{exc} still provided a superior prediction of the PSTH than the GLM (81% of the variance explained) and an increased crossvalidated loglikelihood (mean improvement of 0.14 ± 0.10 bits/sp over the GLM; Figure 8). The CBEM_{exc} can exhibit changes in total conductance through a second, spike history independent nonlinearity (so it is not technically a GLM, as discussed in Section 3), and it predicts RGC responses better than the GLM, but not as well as the full CBEM. Thus, the full CBEM achieves superior model performance over the GLM both by including an inhibitory input, and by treating the excitatory input as a conductancebased input in a simple biophysical model.
Capturing spike responses across contrasts
Retinal ganglion cells adapt to stimulus statistics such as contrast or variance; increases in stimulus contrast lead to decreases in gain of the neural response, allowing the dynamic range of the response to adapt to the range of contrast values present in the stimulus (Chander and Chichilnisky, 2001; Fairhall et al., 2001; Baccus and Meister, 2002; Beaudoin et al., 2008; Mante et al., 2005; Garvert and Gollisch, 2013; Marava, 2013; Demb and Singer, 2015). Understanding this phenomenon is critical for understanding how the retina codes natural stimuli, because natural scenes vary widely over contrast in both space and time. However, classic linearnonlinear models with a single linear component fail to capture such effects. This motivates the need for a biophysically plausible modeling framework that can explain RGC responses across stimulus conditions (Ozuysal and Baccus, 2012; Clark et al., 2013; Cui et al., 2016b).
Previous work has shown that changes in the balance of excitatory and inhibitory input can give rise to multiplicative gain changes in neural responses (Chance et al., 2002; Murphy and Miller, 2003). This raises the possibility that the CBEM may be able account for contrastdependent changes in RGC responses with a single set of parameters. To test this hypothesis, we fit both the CBEM and GLM to eight RGCs stimulated with fullfield binary stimuli of 24%, 48%, and 96% contrast. We compared models fit simultaneously to all contrasts with models fit separately to data from each contrast. Although the CBEM does not account for many aspects of adaptation, this modeling framework allows us to test how well the LN conductance tuning alone can account for gain changes across contrasts (Ozuysal et al., 2018; Latimer et al., 2019).
To quantify the CBEM’s ability to capture contrastdependent gain changes in RGC responses, we compared GLM filters fit to RGC responses at each contrast with GLM filters fit to data simulated from the allcontrasts CBEM. (Figure 9a) shows GLM filters obtained at each contrast for an example RGC, while Figure 9b shows comparable filters fit to spikes simulated from the CBEM fit to this neuron. Both sets of filters exhibit large reductions in amplitude with increasing contrast, the key signature of contrast gain adaptation. Across all eight RGCs, we found high correlation in the filter amplitude scaling for real RGC and simulated CBEM responses (r = 0.61, p < 0.05; Figure 9—figure supplement 1).
We found that the CBEM maintained predictive performance across contrast levels more accurately than the GLM (Figure 9c–d). At 24% contrast, the GLM fit to all contrasts lost an average 0.36 ± 0.41 bits/sp (normalized test loglikelihood) compared to GLM fit specifically to the 24% contrast stimulus, while the CBEM lost only 0.16 ± 0.2 bits/sp. At 48% contrast, the GLM lost 0.20 bits/sp while CBEM only lost 0.07 ± 0.14 bits/sp. Finally, both models only lost 0.05 ± 0.08 bits/sp in the 96% contrast probe. The GLM’s partial ability to generalize across these particular conditions despite having only one stimulus filter can be viewed as a consequence of our biophysical interpretation of the GLM; the GLM is equivalent to a biophysical model in which synaptic excitation and inhibition are governed by equal filters of opposite sign; Figure 4 left shows that this assumption is approximately correct for ON parasol RGCs. However, the flexibility conferred by the slight differences in these filters with separate nonlinearities gave the CBEM greater accuracy in predicting RGC responses across a range of contrasts. We find that the correlation between excitation and inhibition in the CBEM is not constant: the CBEM predicts that the magnitude of the correlation depends on contrast Figure 9—figure supplement 2ab. The CBEM predicted that, on average, excitation and inhibition were most anticorrelated at 22% contrast for the ON cells and 34% contrast for the OFF cells Figure 9—figure supplement 2c. Additionally, the CBEM predicts that the mean and variance of the total synaptic conductance increases with contrast Figure 9—figure supplement 2df.
Capturing spike responses to spatially varying stimuli
To analyze the CBEM’s ability to capture responses to spatially varying stimuli, we examined a dataset of 27 parasol RGCs stimulated with spatiotemporal binary white noise stimuli (Pillow et al., 2008). We fit spatiotemporal filters consisting of a 5 × 5 pixel field over the same temporal extent as the models fit to fullfield stimuli. The temporal profiles of excitatory and inhibitory CBEM filters were qualitatively similar to those that we observed in the fullfield stimulus condition (Figure 10a,c). The filters were not constrained to be spatiotemporally separable (the filters were constrained to be rank 2; Figure 10b), which allowed the synaptic inputs to have different temporal interactions compared to the fullfield stimulus.
We found that the CBEM predicted PSTHs more accurately than a Poisson GLM (83% vs. 79% average R^{2}; Figure 10e). The CBEM also predicted the singletrial responses with higher accuracy than the standard Poisson GLM (average improvement of 0.07 ± 0.04 bits/sp; Figure 10f). Even the CBEM with excitatory input only yielded more accurate PSTH prediction (81% R^{2}) than the GLM, but the singletrial spike train prediction fell to an average of 0.02 ± 0.04 bits/sp higher than the GLM (Figure 10g–h). Thus, the GLM predicted RGC responses to fullfield noise with similar accuracy to the more complex CBEM, suggesting that the predictive performance given the training data was nearing a ceiling. Therefore, we turned to simulations to explore what type of stimuli differentiate the two models.
To gain insight into how the model’s excitatory and inhibitory inputs shape the CBEM’s responses to spatiotemporal stimuli, we simulated the model with uncorrelated spatiotemporal noise and with spatially correlated stimuli. The uncorrelated spatiotemporal noise was the same independent binary pixel noise used in the RGC recordings, and we used a fullfield and a binary centersurround stimuli for the spatially correlated noise (Figure 11a). Each frame of the spatially correlated centersurround stimulus was constructed by setting the center pixel to the opposite sign of the pixels in the surround, and the center pixel had equal probability of being black or white. We examined the crosscorrelation of the CBEM’s excitatory and inhibitory conductances in each stimulus regime and found that they were similar for the fullfield and uncorrelated spatiotemporal noise stimuli (Figure 11b gray and black traces). In response to these two stimuli, the excitatory and inhibitory conductances showed a strong negative correlation with excitation preceding inhibition (as we saw in Figure 5e). The centersurround stimulus, however, produced a distinct crosscorrelation pattern with a larger positive peak at the positive lags (red traces).
Finally, we simulated GLM and CBEM responses to centersurround contrast steps. The stimulus sequence started as a gray field stepping to a black center pixel with white surround for 500 ms, stepping to a gray field for 500 ms, then stepping to a white center and black surround, finally returning to a gray field (Figure 11c bottom). The CBEM and GLM showed similar onset responses, but the sustained responses of the CBEM simulations showed inhibitiondependent suppression for both ON and OFF cells (Figure 11c top and middle). The shape and sustained response of the CBEM_{exc} fit to the OFF cells to centersurround steps qualitatively differed to the full CBEM: the CBEM_{exc} response decayed and then rebounded slightly instead of showing only a decaying response. Thus, fullfield and independent spatiotemporal noise resulted in excitatory and inhibitory correlations that fit closely with the assumptions contained in the GLM. Spatial correlations, and in particular negative correlations, in the stimulus break these assumptions by coactivating excitatory and inhibitory inputs (Cafaro and Rieke, 2013) and therefore spatially correlated stimuli differentiate the CBEM’s predictions from the GLM. These results indicate that, although the GLM captures much of the RGC responses to fullfield noise, capturing the inhibitory and excitatory synaptic inputs will aid in understanding processing of naturalistic stimuli which contain spatial structure.
Discussion
The point process GLM has found widespread use for modeling the statistical relationship between stimuli and spike trains. Here, we have offered a new biophysical interpretation of this model, showing that it can written as a conductancebased model with oppositely tuned linear excitatory and inhibitory conductances. This motivated us to introduce a more flexible and more biophysically plausible model with independent excitatory and inhibitory conductances, each given by a rectifiedlinear function of the sensory stimulus. This conductancebased encoding model (CBEM) is no longer technically a generalized linear model because the membrane potential is a nonlinear function of the stimulus; however, the CBEM has a well behaved pointprocess likelihood, making it tractable for fitting to extracellular data.
In contrast to purely statistical approaches to designing encoding models, we used intracellular measurements to motivate the choice of the nonlinear functions in the CBEM. We demonstrated that the CBEM accurately recovers the tuning of the excitatory and inhibitory synaptic inputs to RGCs purely from measured spike times. As an added bonus, it achieves improved prediction performance compared to the GLM, The interaction between excitatory and inhibitory conductances allows the CBEM to change its gain and integration time constant as a function of stimulus statistics (e.g. contrast), an effect that cannot be captured by a standard GLM. Thus, the CBEM can help reveal circuitlevel computations that support perception under naturalistic conditions.
The CBEM belongs to an extended family of neural encoding models that are not technically GLMs because they do not depend on a single linear projection of the stimulus. These include multifilter LNP models with quadratic terms (Schwartz et al., 2002; Rust et al., 2005; Park and Pillow, 2011; Fitzgerald et al., 2011; Park et al., 2013; Rajan et al., 2013) or general nonparametric nonlinearities (Sharpee et al., 2004; Williamson et al., 2015); models with input nonlinearities (Ahrens et al., 2008) and multilinear context effects (Williamson et al., 2016); models inspired by deep learning methods (McIntosh et al., 2016; Maheswaranathan et al., 2018); and models with biophysically inspired forms of nonlinear response modulation (Butts et al., 2011; Ozuysal and Baccus, 2012; McFarland et al., 2013; Cui et al., 2016b; Real et al., 2017). The CBEM has most in common with this last group of models. Although more flexible LNLN models may predict spike trains with higher accuracy, the CBEM stands as the only model so far to have directly linked model components fit to spikes alone to experimentally measured conductances.
Although the CBEM represents a step toward biophysical realism, it still lacks many properties of real neurons. For instance, the CBEM’s linearrectified conductance does not capture the nonmonotonic portions of the stimulusconductance nonlinearities observed in the data (Figure 2c,e); this nonmonotonicity likely arises from the fact that amacrine cells can receive inputs from both ON and OFF channels (Manookin et al., 2008; Cafaro and Rieke, 2013). Further developments to the CBEM can include additional sets of nonlinear inputs (McFarland et al., 2013; Maheswaranathan et al., 2018; Real et al., 2017). Such extensions could include multiple spatially distinct inputs to account for input from different bipolar cells (Schwartz et al., 2012; Freeman et al., 2015; Vintch et al., 2015; Turner and Rieke, 2016; Liu et al., 2017), and spatially selective rectification of inhibitory inputs that helps determine RGC responses to spatial stimuli (Brown and Masland, 2001; Cafaro and Rieke, 2013; Schwartz and Rieke, 2013). The model could also be extended to study presynaptic inhibition of the excitatory conductance, which can shape the spatial tuning of excitation (Turner et al., 2018) and contrast adaptation (Cui et al., 2016b). Adaptation can occur in localized regions of a ganglion cell’s RF (Garvert and Gollisch, 2013), suggesting that the linearnonlinear synaptic input functions in the CBEM should be allowed to vary over time. Additionally, future work could apply the CBEM to study the role of active conductances that depend spike history, such as an after hyperpolarization current (Johnston et al., 1995; Badel et al., 2008; Lundstrom et al., 2008), and recent work has shown that the parameters of HodgkinHuxley style biophysical models can in some instances be recovered from spike trains alone (Meng et al., 2011). Spikedependent conductances could also be examined in multineuron recordings; although the analyses presented here focused on the coding properties of single neurons, many of the RGCs analyzed were recorded simultaneously (Pillow et al., 2008; Volgushev et al., 2015).
Another aspect of the CBEM that departs from biophysical realism is that all stochasticity is confined to the spike generation mechanism. The CBEM models conductances and membrane potential as deterministic functions of the stimulus, which makes the likelihood tractable and allows for efficient fitting with standard conjugategradient methods (Real et al., 2017). However, the reliability of RGC spike trains depends on the stochasticity of synaptic conductances (Murphy and Rieke, 2006), and noise correlations between excitatory and inhibitory conductances may also affect encoding in RGCs (Cafaro and Rieke, 2010). A latent variable approach could be used to to incorporate stochasticity in conductances and membrane potential (Meng et al., 2011; Paninski et al., 2012; Lankarany et al., 2016).
We expect that the CBEM may also be useful for regions beyond the retina. Previous work on the prediction of membrane potential in primary visual cortex suggests that the CBEM could apply to neurons further along in the visual stream (Mohanty et al., 2012; Tan et al., 2011). The model could also be applied to nonvisual areas such as primary auditory cortex, where different tuning motifs of excitation and inhibition are of interest (Scholl et al., 2010). Future work might extend the CBEM to use deeper, nonlinear cascade models to predict conductances, as opposed to the simple LN cascade we have assumed here. For example, one might use the LNLN models of the lateral geniculate nucleus (e.g. Butts et al., 2011; McFarland et al., 2013) as providing the drive to synaptic conductances in V1 neurons. This principle can extend to higher sensory regions, such as the middle temporal cortex where cascade models can approximate the inputs received from V1 (Rust et al., 2006). Applications in cortex may also incorporate additional inputs to the model such as local field potential, which is thought to reflect the total synaptic drive to a region (Einevoll et al., 2013; Haider et al., 2016; Cui et al., 2016a).
Applications of the CBEM to new brain areas could involve testing the accuracy of conductance predictions with a small number of intracellular recordings, and then applying the model to larger set of extracellular recordings with a wider range of stimuli. Although the model’s simplifying assumptions limit the ability to make strong conclusions about the conductances estimated from spikes alone, the model may nevertheless guide experimental design and theories of sensory processing when intracellular recordings are unavailable.
Future work will require modeling the neural code using naturalistic stimuli, where the GLM has been shown to fail (Carandini et al., 2005; van Hateren et al., 2002; Butts et al., 2007; Heitman et al., 2016; Turner and Rieke, 2016). Modeling tools must also provide a link between the neural code and computations performed by the neural circuit. As we move toward stimuli with complex spatiotemporal statistics, the ability to connect distinct synaptic conductances to spiking will provide an essential tool for deciphering the complex, nonlinear neural code in sensory systems.
Materials and methods
Electrophysiology
Request a detailed protocolWe analyzed four sets of parasol RGCs. All data were obtained from isolated, peripheral macaque monkey, Macaca mulatta, retina.
Synaptic current recordings
Request a detailed protocolWe analyzed the responses of 6 ON parasol cells previously described in Trong and Rieke (2008). Cellattached and voltage clamp recordings were performed to measure spike trains and excitatory and inhibitory currents in the same cells. The stimulus, delivered with an LED, consisted of a one dimensional, fullfield white noise signal, filtered with a low pass filter with a 60 Hz cutoff frequency, and sampled at a 0.1ms resolution. Spike trains were recorded using 10 unique 6 s stimuli, and each stimulus was repeated three or four times. After the spike trains were recorded, the excitatory and inhibitory synaptic currents to the same stimuli were measured using voltage clamp recordings. Active conductances intrinsic to the RGC were blocked during these recordings and the holding potential was set to isolate either the excitatory or inhibitory inputs received by the cell. For four of the cells, two to four trials were recorded for each of the 10 stimuli for the excitatory and inhibitory currents. For the two remaining cells, three to four excitatory current trials were recorded for all 10 stimuli and one to two trials for the inhibitory current were obtained for 8 of the stimuli. Conductances were estimated by dividing the current by the approximate driving force (−70 mV for the excitatory currents, and 70 mV for the inhibitory).
The 5 ONmidget cells were recorded as described previously (Dunn et al., 2006; Trong and Rieke, 2008; Cafaro and Rieke, 2013). Retinas were obtained through the Tissue Distribution Program of the Regional Primate Research Center at the University of Washington and procedures were approved by the Institutional Animal Care and Use Committee. The same type of fullfield noise stimuli were used for the midget cells as with the parasol cells, and the recordings were again divided into 6 s trials. Spike trains were obtained with cell attached recordings. For each cell, 33–35 trials of unique stimuli were recorded, and 10 (for 4 cells) or 20 (for 1 cell) trials were recorded in response to a repeated stimulus. Excitatory and inhibitory currents were recorded for 5–20 trials each for nonrepeated stimuli, and 5–10 trials were recorded in response to the repeated validation stimulus. Conductances were again estimated by dividing the current by the approximate driving force (−70 mV for the excitatory currents, and 70 mV for the inhibitory).
Dynamic clamp recordings
Request a detailed protocolThe membrane potentials of 2 ON parasol retinal ganglion cells were recorded during dynamic clamp experiments previously reported in Cafaro and Rieke (2013). The cells were current clamped and current was injected into the cells according to the equation
where ${g}_{e}$ and ${g}_{i}$ were conductances recorded in RGCs in response to a light stimulus. The injected current at time $t$ was computed using the previous measured voltage with offset Δt = 100 μs. The reversal potentials were ${E}_{e}=0$ mV and ${E}_{i}=90$ mV.
For the first cell, 18 repeat trials were recorded for a 19 s stimulation, and 24 repeat trials were obtained from the second cell.
RGC population recordings: fullfield stimulus
Request a detailed protocolWe analyzed data from two experiments previously reported in Uzzell and Chichilnisky (2004) and Pillow et al. (2005). The first experiment included nine simultaneously recorded parasol RGCs (5 ON and 4 OFF). The stimulus consisted of a fullfield binary noise stimulus (independent black and white frames) with a rootmeansquare contrast of 96%. The stimulus was displayed on a CRT monitor at a 120 Hz refresh rate and the contrast of each frame was drawn independently. A 10 min stimulus was obtained for characterizing the cell responses, and a 5min segment was used to obtain a crossvalidated loglikelihood. Spike rates were obtained by recording 167 repeats of a 7.5 s stimulus.
In a second experiment, eight cells (3 ON and 5 OFF parasol) were recorded in response to a fullfield binary noise stimulus (120 Hz) at 24%, 48%, and 96% contrast. An 8 min stimulus segment at each contrast level was used for model fitting, and crossvalidated loglikelihoods were obtained using a novel 4 min segment at each contrast level.
RGC population recordings: spatiotemporal stimulus
Request a detailed protocolWe analyzed 11 ON and 16 OFF parasol RGCs which were previously reported in Pillow et al. (2005). The stimulus consisted of a spatiotemporal binary white noise pattern (i.e. a field of independent white and black pixels). The stimulus was 10 pixels by 10 pixels (pixel size of 120 μm × 120 μm on the retina), and the contrasts of each pixel was drawn independently on each frame (120 Hz refresh rate). The rootmeansquare contrast of the stimulus was 96%.
A 10min stimulus was obtained for characterizing the cell responses, and a 5min segment was used to obtain a crossvalidated loglikelihood. Spike rates were obtained by recording 600 repeats of a 10 s stimulus.
Modeling methods
The conductancebased encoding model
Request a detailed protocolThe CBEM introduced above models the spike train response of a RGC to a visual stimulus as a Poisson process where the spike rate is a function of the membrane potential (Figure 1b). The membrane potential is approximated by considering a singlecompartment neuron with linear membrane dynamics and conductancebased input (Equation 6). Note that we have ignored capacitance, which would provide an (unobserved) scaling factor on $dV/dt$, but will not affect our results. The synaptic inputs (Equation 10) take the form of linearnonlinear functions of the stimulus, $\mathbf{\mathbf{x}}$, where ${f}_{g}$ is a nonlinear function ensuring positivity of the conductances. We will assume a ‘softrectification’ nonlinearity given by
which behaves like a smooth version of a linear halfrectification function.
Given the conductances, we could then obtained the membrane voltage. We use a firstorder exponential integrator method to solve this equation, which is exact under the assumption that ${\mathrm{g}}_{\mathrm{e}}(\mathrm{t})$ and ${\mathrm{g}}_{\mathrm{i}}(\mathrm{t})$
where
for ${g}_{tot}(t)$ and ${I}_{tot}(t)$, and assuming $V(0)={E}_{l}$ at the start of each experiment.
For a set of spike times ${s}_{1:{n}_{sp}}$ in the interval $[0,S]$ and parameters $\mathrm{\Theta}$, the loglikelihood in continuous time is
where the spike rate, $\lambda (t)$, is a function of the voltage plus spike history (Equation 12). This likelihood can be discretely approximated as the product of $T$ Bernoulli trials in bins of width $\mathrm{\Delta}$ (Citi et al., 2014)
where ${y}_{i}=1$ if a spike occurred in the $i$th bin and 0 otherwise.
The membrane voltage (and firing rate) is computed by integrating the membrane dynamics equation (Equation 6). In practice, we evaluate $V$ along the same discrete lattice of points of width $\mathrm{\Delta}$ ($t=1,2,3,\dots T$) that we use to discretize the loglikelihood function. Assuming ${g}_{e}$ and ${g}_{i}$ remain constant within each bin, the voltage equation becomes a simple linear differential equation which we solve according to Equation 15.
The model parameters we fit were ${\mathbf{\mathbf{k}}}_{e}$, ${\mathbf{\mathbf{k}}}_{i}$, ${b}_{e}$, ${b}_{i}$, and $\mathbf{\mathbf{h}}$, which were selected using conjugategradient methods to maximize the loglikelihood.
The reversal potential and leak conductance parameters were kept fixed at ${E}_{e}=0mV$, ${g}_{l}=200$, ${E}_{l}=60mV$, and ${E}_{i}=80mV$. For modeling the cells in which we had access to intracellular recordings, we set the time bin width to $\mathrm{\Delta}=0.1ms$ to match the sampling frequency of the synaptic current recordings. For the remaining cells, which were recorded in separate experiments, we set $\mathrm{\Delta}=0.083ms$, 100 times the frame rate of the visual stimulus.
The stimulus filters spanned over 100 ms, or over 1000 time bins. Therefore, we restricted the excitation and inhibitory filters to a low dimensional basis to limit the total number of free parameters in the model. The basis consisted of 10 raised cosine ‘bumps’ (Pillow et al., 2005; Pillow et al., 2008) of the form
where $t$ is in seconds. We set c = 0.02 and $a=2({\varphi}_{2}{\varphi}_{1})/\pi $. The ${\varphi}_{j}$ were evenly spaced from ${\varphi}_{1}=\mathrm{log}(0.0+c)$, ${\varphi}_{10}=\mathrm{log}(0.150+c)$ so that the peaks of the filters spanned 0 ms to 150 ms. The spike history filter was also represented in a lowdimensional basis. The refractory period was accounted for with five square basis functions of width 0.4 ms, spanning the period $02$ ms after a spike. The remaining spike history filter consisted of 7 raised cosine basis functions (c = 0.0001) with filter peaks spaced from 2 ms to 90 ms.
The loglikelihood function for this model is not concave in the model parameters, which increases the importance of selecting a good initialization point compared to the GLM. We initialized the parameters by fitting a simplified model which had only one conductance with a linear stimulus dependence, ${g}_{lin}(t)={\mathbf{\mathbf{k}}}_{lin}{}^{\top}\mathbf{\mathbf{x}}_{t}$ (note that this allowed for negative conductance values). We initialized this filter at 0, and then numerically maximized the loglikelihood for ${\mathbf{\mathbf{k}}}_{lin}$. We then initialized the parameters for the complete model using ${\mathbf{\mathbf{k}}}_{e}=c{\mathbf{\mathbf{k}}}_{lin}$ and ${\mathbf{\mathbf{k}}}_{i}={\mathbf{\mathbf{k}}}_{lin}$, thereby exploiting a mapping between the GLM and the CBEM (see Results).
When fitting the model to real spike trains, one conductance filter would occasionally become dominant early in the optimization process. This was likely due to the limited amount of data available for fitting, especially for the cells that were recorded intracellularly. The intracellular recordings clearly indicated that the cells received similarly scaled excitatory and inhibitory inputs. To alleviate this problem, we added a penalty term, $\varphi $, to the loglikelihood to the ${L}_{2}$ norms of ${\mathbf{\mathbf{k}}}_{e}$ and ${\mathbf{\mathbf{k}}}_{i}$:
Thus, we maximized
All cells were fit using the same penalty weights: ${\lambda}_{e}=1$ and ${\lambda}_{i}=0.2$. We note that unlike the typical situation with cascade models that contain multiple filters, intracellular recordings can directly measure synaptic currents. Future work with this model could include more informative, datadriven priors on ${\mathbf{\mathbf{k}}}_{e}$ and ${\mathbf{\mathbf{k}}}_{i}$.
In several analyses, we fit the CBEM without the inhibitory conductance, labeled as the CBEM_{exc}. All the fixed parameters used in the full CBEM were held at the same values in the CBEM_{exc}.
Code for fitting the CBEM has been made available at https://github.com/pillowlab/CBEM (Latimer, 2018; copy archived at https://github.com/elifesciencespublications/CBEM).
Fitting the CBEM to simulated spike trains
Request a detailed protocolTo examine the performance of our numerical maximum likelihood estimation of the CBEM, we fit the parameters to simulated spike trains from the model with known parameters (Figure 1—figure supplement 1). Our first simulated cell qualitatively mimicked experimental RGC datasets, with input filters selected to reproduce the stimulus tuning of macaque ON parasol RGCs (excitation oppositely tuned and delayed compared to excitation, or ‘crossover’ inhibition). The second simulated cell had similar excitatory tuning, but the inhibitory input had the same tuning as excitation with a short delay. The stimulus consisted of a one dimensional white noise signal, binned at a 0.1 ms resolution, and filtered with a lowpass filter with a 60 Hz cutoff frequency. We validated our maximum likelihood fitting procedure by examining error in the fitted filters, and evaluating the loglikelihood on a 5min test set. With increasing amounts of training data, the parameter estimates converged to the true parameters for both simulated cells. Therefore, standard fast and nonglobal optimization algorithms can reliably fit the CBEM to spiking data, despite the fact that the model does not have the concavity guarantees of the standard GLM.
Fitting the conductance nonlinearity
Request a detailed protocolWe selected the nonlinear function ${f}_{g}$ governing the synaptic conductances by fitting a linearnonlinear cascade model to intracellularly measured conductances evoked during visual stimulation (Hunter and Korenberg, 1986; Paninski et al., 2012; Park et al., 2013; Barreiro et al., 2014). We modeled the mean conductance $\overline{{g}_{e}}(t)$ as
where $\mathbf{\mathbf{x}}$ is a fullfield temporal stimulus, and ${a}_{e}$ and ${b}_{e}$ are constants. We selected a fixed function for the nonlinearity ${f}_{g}$. Thus, we chose the ${\mathbf{\mathbf{k}}}_{e}$, ${a}_{e}$, and ${b}_{e}$ that minimized the squared error between the LN prediction and the measured excitatory conductance.
The softrectifying function was selected to model the conductance nonlinearity;
We chose to fix these nonlinearities to known functions rather than fitting with a more flexible empirical form (e.g., Ahrens et al., 2008; McFarland et al., 2013). Fixing these nonlinearities to a simple, closedform function allowed for fast and robust maximum likelihood parameter estimates while still providing an excellent description of the data.
Fitting the spikerate nonlinearity
Request a detailed protocolWe used a spiketriggered analysis (de Boer and Kuyper, 1968) on membrane voltage recordings to determine the spike rate nonlinearity, ${f}_{r}$, as a function of voltage for the CBEM. This is the same procedure for estimating the LNmodel nonlinearity proposed in Chichilnisky (2001); Mease et al. (2013), but substituting the filtered stimulus with the average voltage measured across trials. The membrane potential and spikes were recorded in dynamicclamp experiments over several repeats of simulated conductances for two cells. We computed the mean voltage recorded over all runs of the dynamicclamp condition, which largely eliminated the action potential shapes from the voltage trace. Using the spike times from all the repeats, we computed the probability of a spike occurring in one time bin given the mean voltage, $\overline{V}$:
where $p(\overline{V}Sp)$ is the spiketriggered distribution of the membrane potential. The distribution over voltage in all times bins is $p(\overline{V})$. The spike rate (in terms of spikes per bin) is $p(Sp)$. We combined the spike times and voltage distributions for the two cells to compute a common spike rate function.
We then obtained a leastsquares fit to approximate the nonlinearity with a softrectification function of the the form
The parameters fit to the empirical spikerate nonlinearity was $\alpha =90sp/s$, $\mu =53mV$ and $\beta =1.67mV$.
We chose to fit the spikerate nonlinearity with the average voltage recorded over repeat data, instead of looking at the voltage in bins preceding spikes (Jolivet et al., 2006). The average voltage is closer in spirit to the voltage in our model than the singletrial voltage, because the voltage dynamics we considered (Equation 6) did not include noise nor postspike currents.
Generalized linear models
Request a detailed protocolFor a baseline comparison to the CBEM, we also fit spike trains with a GLM. We used the same Bernoulli discretization of the pointprocess loglikelihood function for the GLM as we did with the CBEM:
where the firing rate is
The stimulus filter is $\mathbf{\mathbf{k}}$ and the spike history filter is ${\mathbf{\mathbf{y}}}^{hist}$. We used conjugategradient methods to find the maximum likelihood estimates for the parameters. We set ${f}_{r}(\cdot )=\mathrm{exp}(\cdot )$, which is the canonical inverselink function for Poisson GLMs. We confirmed previous results that the softrectifying nonlinearity, ${f}_{r}(\cdot )=\mathrm{log}(1+\mathrm{exp}(\cdot ))$, did not capture RGC responses as well as the exponential function (Pillow et al., 2008).
Modeling respones to spatiotemporal stimuli
Request a detailed protocolFor spatiotemporal stimuli, the filters for the CBEM and GLM ($\mathbf{\mathbf{k}},{\mathbf{\mathbf{k}}}_{e},$ and ${\mathbf{\mathbf{k}}}_{i}$) spanned both space and time. Although the stimulus we used was a 10 × 10 grid of pixels, the receptive field (RF) of the neurons did not cover the entire grid. We therefore limited the spatial extent of the linear filters to a 5 × 5 grid of pixels, where the center pixel was the strongest point in the GLM stimulus filter.
The filters were represented as a matrix where the columns span the pixel space and the rows span the temporal dimension. The number of parameters was reduced by decomposing the spatiotemporal filters into a lowrank representation (Pillow et al., 2008). The filter at pixel $x$ and time $\tau $ became
where ${\mathbf{\mathbf{k}}}_{s,j}$ was a vector containing the spatial portion of the filter of length 25 (the number of pixels in the RF) and ${\mathbf{\mathbf{k}}}_{t,j}$ represented the temporal portion of the filter. The temporal filters were projected into the same 10dimensional basis as the temporal filters used to model the fullfield stimuli and the spatial filters were represented in the natural pixel basis. For identifiability, we normalized the spatial filters and forced the sign of the center pixel of the spatial filters to be positive. We used rank two filters ($J=2$) for the CBEM and GLM. Therefore, each filter contained 2 × 25 spatial and 2 × 10 temporal parameters for a total of 70 parameters. In the GLM, we found no significant improvement using rank three filters. To fit these lowrank filters, we alternated between optimizing over the spatial and temporal components of the filters.
Evaluating model performance
Request a detailed protocolWe evaluated singletrial spike train predictive performance by computing the loglikelihood on a test spike train. We computed the difference between the log (base2) likelihood under the model and the loglikelihood under a homogeneous rate model ($L{L}_{h}$) that captured only the mean spike rate:
where the test stimulus is of length $T$ (in discrete bins) and contains ${n}_{sp}$ spikes. We then divided by the number of spikes to obtain the predictive performance in units of bits per spike (bits/sp) (Panzeri et al., 1996; Brenner et al., 2000; Paninski et al., 2004)
We evaluated model predictions of spike rate by simulating 2500 trials from the model for a repeated stimulus. We computed the firing rate, or PSTH, by averaging the number of spikes observed in 1 ms bins and smoothing with a Gaussian filter with a standard deviation of 2 ms. The percent of variance in the PSTH explained by the model is
where $\overline{PST{H}_{data}}$ denotes the average value of the PSTH.
Data availability
All modeling tools have been made publicly available at https://github.com/pillowlab/CBEM (copy archived at https://github.com/elifesciencespublications/CBEM). The datasets analyzed in this paper have been previously published as the following: 1. Conductance and cellattached spike recordings: Philipp Khuc Trong and Fred Rieke (2008). "Origin of correlated activity between parasol retinal ganglion cells." https://doi.org/10.1038/nn.2199. Dataset available via figshare https://figshare.com/articles/ONParasol_RGCs_for_the_conductancebased_encoding_model/9636854. 2. Fullfield extracellular recordings (including multiple contrasts): VJ Uzzell and EJ Chichilnisky (2004). "Precision of Spike Trains in Primate Retinal Ganglion Cells." https://doi.org/10.1152/jn.01171.2003. Dataset can be accessed through a response to the corresponding author. 3. Spatiotemporal stimuli: Jonathan W Pillow, Jonathon Shlens, Liam Paninski, Alexander Sher, Alan M Litke, EJ Chichilnisky and Eero P Simoncelli (2008). "Spatiotemporal correlations and visual signalling in a complete neuronal population." https://doi.org/10.1038/nature07140. Dataset can be accessed through a response to the corresponding author.

figshareONParasol RGCs for the conductancebased encoding model.https://doi.org/10.6084/m9.figshare.9636854
References

Inferring input nonlinearities in neural encoding modelsNetwork: Computation in Neural Systems 19:35–67.https://doi.org/10.1080/09548980701813936

When do microcircuits produce beyondpairwise correlations?Frontiers in Computational Neuroscience 8:10.https://doi.org/10.3389/fncom.2014.00010

Synergy in a neural codeNeural Computation 12:1531–1552.https://doi.org/10.1162/089976600300015259

Regulation of spatial selectivity by crossover inhibitionJournal of Neuroscience 33:6310–6320.https://doi.org/10.1523/JNEUROSCI.496412.2013

Do we know what the early visual system does?Journal of Neuroscience 25:10577–10597.https://doi.org/10.1523/JNEUROSCI.372605.2005

Adaptation to temporal contrast in primate and salamander retinaThe Journal of Neuroscience 21:9904–9916.https://doi.org/10.1523/JNEUROSCI.212409904.2001

A simple white noise analysis of neuronal light responsesNetwork: Computation in Neural Systems 12:199–213.https://doi.org/10.1080/713663221

Likelihood methods for point processes with refractorinessNeural Computation 26:237–263.https://doi.org/10.1162/NECO_a_00548

Dynamical adaptation in photoreceptorsPLOS Computational Biology 9:e1003289.https://doi.org/10.1371/journal.pcbi.1003289

Inferring cortical variability from local field potentialsThe Journal of Neuroscience 36:4121–4135.https://doi.org/10.1523/JNEUROSCI.250215.2016

Triggered correlationIEEE Transactions on Biomedical Engineering 15:169–179.https://doi.org/10.1109/TBME.1968.4502561

Functional circuitry of the retinaAnnual Review of Vision Science 1:263–289.https://doi.org/10.1146/annurevvision082114035334

Controlling the gain of rodmediated signals in the mammalian retinaJournal of Neuroscience 26:3959–3970.https://doi.org/10.1523/JNEUROSCI.514805.2006

Modelling and analysis of local field potentials for studying the function of cortical circuitsNature Reviews Neuroscience 14:770–785.https://doi.org/10.1038/nrn3599

Second order dimensionality reduction using minimum and maximum mutual information modelsPLOS Computational Biology 7:e1002249.https://doi.org/10.1371/journal.pcbi.1002249

Handbook of Biological Physics469–516, A framework for spiking neuron models: the spike response model, Handbook of Biological Physics, Elsevier, 10.1016/S13838121(01)800154.

Bayesian inference for generalized linear models for spiking neuronsFrontiers in Computational Neuroscience 4:e00012.https://doi.org/10.3389/fncom.2010.00012

The identification of nonlinear biological systems: wiener and Hammerstein cascade modelsBiological Cybernetics 55:135–144.https://doi.org/10.1007/bf00341929

Predicting spike timing of neocortical pyramidal neurons by simple threshold modelsJournal of Computational Neuroscience 21:35–49.https://doi.org/10.1007/s1082700670745

Local field potentials indicate network state and account for neuronal response variabilityJournal of Computational Neuroscience 29:567–579.https://doi.org/10.1007/s1082700902089

Simultaneous Bayesian estimation of excitatory and inhibitory synaptic conductances by exploiting multiple recorded trialsFrontiers in Computational Neuroscience 10:110.https://doi.org/10.3389/fncom.2016.00110

Multiple timescales account for adaptive responses across sensory corticesThe Journal of Neuroscience 39:10019–10033.https://doi.org/10.1523/JNEUROSCI.164219.2019

Fractional differentiation by neocortical pyramidal neuronsNature Neuroscience 11:1335–1342.https://doi.org/10.1038/nn.2212

Inferring hidden structure in multilayered neural circuitsPLOS Computational Biology 14:e1006291.https://doi.org/10.1371/journal.pcbi.1006291

Independence of luminance and contrast in natural scenes and in the early visual systemNature Neuroscience 8:1690–1697.https://doi.org/10.1038/nn1556

Principles of Neural Coding357–378, Adaptation and sensory coding, Principles of Neural Coding, CRC Press.

Inferring nonlinear neuronal computation based on physiologically plausible inputsPLOS Computational Biology 9:e1003143.https://doi.org/10.1371/journal.pcbi.1003143

ConferenceDeep learning models of the retinal response to natural scenesAdvances in Neural Information Processing Systems. pp. 1369–1377.

Emergence of adaptive computation by single neurons in the developing cortexJournal of Neuroscience 33:12154–12170.https://doi.org/10.1523/JNEUROSCI.326312.2013

A sequential monte carlo approach to estimate biophysical neural models from spikesJournal of Neural Engineering 8:065006.https://doi.org/10.1088/17412560/8/6/065006

ConferenceFrom stochastic nonlinear integrateandfire to generalized linear modelsAdvances in Neural Information Processing Systems 24. pp. 1377–1385.

The accuracy of membrane potential reconstruction based on spiking receptive fieldsJournal of Neurophysiology 107:2143–2153.https://doi.org/10.1152/jn.01176.2011

Multiplicative gain changes are induced by excitation or inhibition aloneThe Journal of Neuroscience 23:10040–10051.https://doi.org/10.1523/JNEUROSCI.233110040.2003

Adaptive feature detection from differential processing in parallel retinal pathwaysPLOS Computational Biology 14:e1006560.https://doi.org/10.1371/journal.pcbi.1006560

Maximum likelihood estimation of cascade pointprocess neural encoding modelsNetwork: Computation in Neural Systems 15:243–262.https://doi.org/10.1088/0954898X_15_4_002

Superlinear population encoding of dynamic hand trajectory in primary motor cortexJournal of Neuroscience 24:8551–8561.https://doi.org/10.1523/JNEUROSCI.091904.2004

Statistical models for neural encoding, decoding, and optimal stimulus designProgress in Brain Research 165:493–507.https://doi.org/10.1016/S00796123(06)650310

Integral equation methods for computing likelihoods and their derivatives in the stochastic integrateandfire modelJournal of Computational Neuroscience 24:69–79.https://doi.org/10.1007/s108270070042x

Inferring synaptic inputs given a noisy voltage trace via sequential monte carlo methodsJournal of Computational Neuroscience 33:1–19.https://doi.org/10.1007/s1082701103717

Speed, noise, information and the graded nature of neuronal responsesNetwork: Computation in Neural Systems 7:365–370.https://doi.org/10.1088/0954898X_7_2_018

Advances in Neural Information Processing Systems2454–2462, Spectral methods for neural characterization using generalized quadratic models, Advances in Neural Information Processing Systems, 26, Neural Information Processing Systems Foundation, Inc.

Encoding and decoding in parietal cortex during sensorimotor decisionmakingNature Neuroscience 17:1395–1403.https://doi.org/10.1038/nn.3800

Advances in Neural Information Processing System1692–1700, Bayesian spiketriggered covariance analysis, Advances in Neural Information Processing System, MIT Press.

Prediction and decoding of retinal ganglion cell responses with a probabilistic spiking modelJournal of Neuroscience 25:11003–11013.https://doi.org/10.1523/JNEUROSCI.330505.2005

Noise in integrateandfire neurons: from stochastic input to escape ratesNeural Computation 12:367–384.https://doi.org/10.1162/089976600300015835

Learning quadratic receptive fields from neural responses to natural stimuliNeural Computation 25:1661–1692.https://doi.org/10.1162/NECO_a_00463

Neural circuit inference from function to structureCurrent Biology 27:189–198.https://doi.org/10.1016/j.cub.2016.11.040

Temporal contrast adaptation in salamander bipolar cellsThe Journal of Neuroscience 21:9445–9454.https://doi.org/10.1523/JNEUROSCI.212309445.2001

How MT cells analyze the motion of visual patternsNature Neuroscience 9:1421–1431.https://doi.org/10.1038/nn1786

ConferenceCharacterizing neural gain control using SpikeTriggered covarianceAdvances in Neural Information Processing Systems 14. pp. 269–276.

The spatial structure of a nonlinear receptive fieldNature Neuroscience 15:1572–1580.https://doi.org/10.1038/nn.3225

Nonlinear spatial encoding by retinal ganglion cells: when 1 + 1 ≠ 2The Journal of General Physiology 138:283–290.https://doi.org/10.1085/jgp.201110629

Functional connectivity and tuning curves in populations of simultaneously recorded neuronsPLOS Computational Biology 8:e1002775.https://doi.org/10.1371/journal.pcbi.1002775

Orientation selectivity of synaptic input to neurons in mouse and cat primary visual cortexThe Journal of Neuroscience 31:12339–12350.https://doi.org/10.1523/JNEUROSCI.203911.2011

Beyond GLMs: a generative mixture modeling approach to neural system identificationPLOS Computational Biology 9:e1003356.https://doi.org/10.1371/journal.pcbi.1003356

Origin of correlated activity between parasol retinal ganglion cellsNature Neuroscience 11:1343–1351.https://doi.org/10.1038/nn.2199

Precision of spike trains in primate retinal ganglion cellsJournal of Neurophysiology 92:780–789.https://doi.org/10.1152/jn.01171.2003

Processing of natural temporal stimuli by macaque retinal ganglion cellsThe Journal of Neuroscience 22:9945–9960.https://doi.org/10.1523/JNEUROSCI.222209945.2002

Advances in Neural Information Processing Systems3113–3121, Efficient and direct estimation of a neural subunit model for sensory coding, Advances in Neural Information Processing Systems, MIT Press.

A convolutional subunit model for neuronal responses in macaque V1Journal of Neuroscience 35:14829–14841.https://doi.org/10.1523/JNEUROSCI.281513.2015

Identifying and tracking simulated synaptic inputs from neuronal firing: insights from in vitro experimentsPLOS Computational Biology 11:e1004167.https://doi.org/10.1371/journal.pcbi.1004167

The equivalence of informationtheoretic and likelihoodbased methods for neural dimensionality reductionPLOS Computational Biology 11:e1004141.https://doi.org/10.1371/journal.pcbi.1004141

Functional dissection of signal and noise in MT and LIP during decisionmakingNature Neuroscience 20:1285–1292.https://doi.org/10.1038/nn.4611
Decision letter

Stephanie PalmerReviewing Editor; University of Chicago, United States

Michael J FrankSenior Editor; Brown University, United States
In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.
Acceptance summary:
This paper provides a bridge between traditional phenomenological models of neuronal spiking and more biophysically realistic models, by supplying a method to infer excitatory and inhibitory conductances directly from spiking data. This manuscript describes this new model, a conductancebased encoding model (CBEM), and adds a crucial stage in testing this type of biophysical modeling by direct comparison with intracellular data. The CBEM model is validated with spiking and conductance data measured in retinal ganglion cells. Excitatory and inhibitory conductances in both midget and parasol cells from the primate retina can be inferred reliably. The method has potential applications to other cell types beyond the retina and can provide more mechanistic insights into what drives spiking in the brain.
Decision letter after peer review:
Thank you for submitting your work entitled "Inferring synaptic inputs from spikes with a conductancebased neural encoding model" for consideration by eLife. Your article has been reviewed by three peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by a Senior Editor. The reviewers have opted to remain anonymous.
All reviewers found this to be an interesting new model, and thought it is a useful tool for the computational neuroscience community to have. The manuscript was wellwritten and clear, but some major concerns about the broader impacts and applicability of the model were raised, as detailed below in the three individual reviews.
Reviewer #1:
This paper provides a bridge between traditional phenomenological LNP models and more biophysically realistic models, by supplying a method to infer excitatory and inhibitory conductances from spiking data. It is a wellreasoned and wellwritten account of this new model (CBEM) for computational neuroscience. The CBEM model is validated with a comparison to spiking and conductance data measured in retinal ganglion cells, and compares reasonably well to an LN model fit directly to the conductance data.
1) The biggest concern the current manuscript raises is: what important features of RGC encoding does the current model capture that a GLM cannot? Put another way, the new model explains more of the response variance of RGC cells, but what does that extra variance encode about the stimulus? What does this extra fitting power allow one to explain in terms of RGC computation? Can simulations be added that explore what aspects of stimulus encoding the full CBEM captures better than a GLM?
2) For the checkerboard stimuli (Figure 9), the GLM and CBEM models seem to have fairly comparable fits to the data. If these are the more interesting data to fit, how should one interpret the weaker additional explanatory power of the CBEM?
3) Some more discussion of the results of the model simulations of centersurround stimuli is needed. How do the observed spike rates compare to known RGC responses? The full CBEM model clearly makes different predictions about the spiking response to spatially correlated stimuli (particularly the inhibition of the sustained response). What aspects of RGC computation are captured by the full CBEM model compared to the excitation only CBEM or GLM for these kind of stimuli?
4) Some variations (dynamic gain adaptation, cascade nonlinearities) on LN models for RGC data can explain more of the observed spiking response to natural image sequences, but still fail to predict the majority of response variance. Several groups have shown that deep or recurrent neural nets can be fit to retinal data and explain about 30% more of this variance. How does a CBEM compare? It could potentially have a serious advantage over these NN models, both in the number of parameters and in interpretability. The absence of a fit to more complex stimuli, where LN models are known to fail spectacularly, stands out as a large hole in the current manuscript. Of course the points brought up in comment 1 apply here as well.
Reviewer #2:
This paper introduces a new pointprocess model applied to retinal ganglion cell data, which is based on model components that infer excitatory and inhibitory conductances to predict spikes. It uses a smorgasbord of data to demonstrate difference successes of this model (better performance over linear models, ability to explain some aspects of contrast adaptation), and has a number of mathematical derivations and some simulations to demonstrate how it works.
Overall, it is a mixture of what appears to be solid methods with some interesting results from combining them. However, I found this paper scattered, flipping between mathematical derivations, modeling results, and findings specific to the retina. This has the effect of not clearly tying down any element convincingly, nor clearly demonstrating novelty – or at least the need for the advances suggested in this paper.
1) If main purpose is to demonstrate a better way to model neurons (or perhaps just retinal ganglion cells), one would want to understand its generality, as well as how it compares to models of similar ilk, particularly phenomenological models of excitation/inhibition such as the Butts, 2011, 2016. There is a lot of overhead in computing the integrationproperties of this model, and not clear what is gained over a more phenomenological LNLN cascade model. I see that it is possible there could be a lot to be gained from the CBEM, but the current manuscript does not make this clear.
Furthermore, what is the scope of this model. Do the authors expect this to be a more generally applicable method, or simply a means to model retinal ganglion cell responses? Under what conditions?
2) Relatedly, one wonders how important it is to model the conductances explicitly, versus membrane potential. Comparisons to membrane potential has been explored in several previous modeling approaches requiring only LN modeling (since there will not be two separate terms), including work of Priebe (Mohanty, 2012) in V1 and Demb (Zaghloul, 2005) in retina. More could be said about motivation for wanting to infer conductances past fitting data better. If the motivation is simply to fit data better, see #1 above.
3) The ability to explain adaptation to contrast as totalconductance changes seems interesting, but is only presented to the level of validating the model, and not explored. It is thus difficult to evaluate based on the extensive literature studying this in the retina.
4) Relatedly, the Cui et al., 2016 paper that was cited in this manuscript also posits excitation and inhibition, models recorded synaptic currents and spikes, and offers an explanation for contrast adaptation, but with a different model form. It seems worth more of a discussion here commenting on it. Could their proposed model explain these results? (and/or vice versa). Can one distinguish between their proposed circuit (presynaptic inhibition being dominant) and normal inhibition proposed here? Likewise, although not matching in content as much, Ozuysal and Baccus also model intracellular recordings and contrast adaptation, using yet another model form.
5) The derivations in the subsection “Background: Poisson GLM with spike history”, motivating the modeling, do not make a clear argument that the model goes beyond the phenomenological, given its overly restrictive assumptions (e.g., that E=I). As soon as the GLMbased biophysical model is derived making explicit assumptions, it seems to say that these assumptions do not hold and defines the CBEM without them. Later data in the paper (and in previous papers) seem to invalidate these assumptions as well. If these assumptions do not hold, what is the purpose of deriving the model in this context?
6) It was very unclear the components of the various models and how they are fit to data, and I could not make sense until poring through the Materials and methods section. This was in part complicated by the use of different models in different figures (fit to different experiments), without a clear overarching structure. It might also be useful to compare and contrast the derived model forms to simpler phenomenological models of excitation/inhibition (see #1 above).
7) The subsection “Capturing spike responses to spatially varying stimuli” based on simulation had very confusing motivation and conclusions, and could be much better fit into the logical structure of the rest of the paper.
Reviewer #3:
The manuscript by Latimer et al., proposes a new model for analyzing single neurons under sensory stimulation. The model framework, here termed conductancebase encoding model (CBEM) is similar in spirit to the widely used generalized linear model (GLM), applying filters to integrate a sensory stimulus and a stochastic process to generate spikes. Also similar to the GLM approach, model parameters can be obtained from experimentally recorded spike trains by a maximumlikelihood approach. A conceptual advance of the CBEM, however, is that the model incorporates separate filters for excitatory and inhibitory inputs and that these inputs are treated as conductance changes of the neuron, providing an additional level of biological realism. The authors validate their approach by analyzing previously published data from parasol retinal ganglion cells recorded in primate retina. They show that the inferred conductances match intracellularly recorded conductances, and they show that their model improves predictions of spike trains to new data as compared to a GLM.
The manuscript presents an interesting and thoughtprovoking approach. The possibility of inferring features of excitatory and inhibitory input from recorded spike trains may be a great tool for investigating sensory processing. Also, the mathematical connections to the GLM, which are nicely drawn out in the manuscript, provide a good background for understanding how this model framework functions. In the present form, however, one concern is that it remains a bit unclear how directly applicable the model is to other systems or what sort of insights it may provide. I would imagine that a more general discussion of the applicability, limitations, and interpretation of the model, potentially supported by some additional example from data or simulations, would considerably strengthen the manuscript.
Essential revisions:
The successful inference of excitatory and inhibitory inputs is only shown for parasol cells under fullfield stimulation. Here, the excitatory and inhibitory filters are nearly inverted with respect to each other, similar to the specific case discussed in the text where the conductance model can be mapped onto a GLM. Therefore, one wonders how the model performs when excitation and inhibition are correlated instead of anticorrelated or when their filters are (nearly) orthogonal to each other. Are correlated components of the inhibition as well recovered as uncorrelated (orthogonal) components, or will there be some bias?
The match of the inferred and measured conductance is impressive and suggests that this should work as a general technique to assess inhibitory input. However, it is not clear to me how the model would treat inhibitory signals that are not direct conductance inputs into the analyzed neuron. In the retina, for example, many circuits are also influenced by presynaptic inhibition, which acts on bipolar cell terminals. Should one expect that such inhibition is captures by the inhibition filter (because it is likely filtered differently than the direct excitatory pathway) or by the excitation filter (because it affects the excitatory conductance)?
Regarding the improvement of response predictions over the GLM, I'm wondering whether the differences in the applied nonlinearities play any role here. While the softthreshold nonlinearities of the CBEM are backed up nicely by analyses, the exponential nonlinearity of the GLM could create problems in the quantitative predictions, especially for strong activation as is likely the case under fullfield stimulation.
To emphasize the significance of the work, maybe the authors could point out more clearly how the method could be used for providing new insight into the investigated neurons. Is there a specific finding regarding the inputs into parasol cells derived from the present analysis that could serve as an example? Or maybe the authors could point out questions where the model framework might provide answers.
[Editors’ note: what now follows is the decision letter after the authors submitted for further consideration.]
Thank you for submitting your article "Inferring synaptic inputs from spikes with a conductancebased neural encoding model" for consideration by eLife. Your article has been reviewed by three peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by Michael Frank as the Senior Editor. The reviewers have opted to remain anonymous.
Essential revisions:
The reviews on this version of the manuscript were mixed, with some definite enthusiasm for the inclusion of the new midget cell data, but some remaining skepticism and questions about the presentation and generality of the current results. Several points need to be fully addressed for this manuscript to be accepted:
1) For the section on interpreting the GLM as a conductancebased model, and the biophysical motivation for the paper:
The model is set up to make some mathematical manipulations in order to remove the voltage dependence of the currents. That wasn't super clear on a first or second pass and should be made more accessible to the reader.
Other points to address for clarity and scholarship:a) There exists a common quasibiophysical interpretation of the GLM, wherein the output of the linear stage is thought of as an approximation of the intracellular input or voltage. This interpretation requires (1) that E and I inputs are thought of as currents and sum linearly. Near threshold, this approximation might be decent for modeling spiking, as the voltage is ~constant. (2) the integration time of the neuron must be short enough that the response is mostly a function of the inputs, not on its own voltage history. With these constraints presumed satisfied, many previous studies over the years have assumed that the generating function of the GLM has a quasibiophysical basis. Can this more standard derivation/interpretation be addressed in this manuscript? Does the explicit integration in the "biophysical" GLM detailed in this work (which addresses assumption 2 above) help in fitting neuronal spiking as compared to a standard GLM? No explicit comparisons are made in the manuscript, and should be added.
b) The derivations seem overly long to simply notice that ge(VEe) – gi(VEi) will have no voltage dependence if ge and gi cancel. This derivation might be difficult to follow for the broad readership of this journal and should be explained more clearly.
c) This should be reorganized so that it doesn't distract from subsequent results. It could be made more clear that the E = I assumption is made only to connect this to the GLMlike model class, to motivate the model setup, and is relaxed in the subsequent CBEM inference scheme.
2) For the CBEM model setup, justification, and background:a) The GLM derivation is what is used to motivate the CBEM, and excitation and inhibition are rectified (Equation 10 compared with Equation 9). While the motivation for such rectification is that conductances must be nonnegative, this seems somewhat misleading. In the retina, rectification of excitation and inhibition (where it exists) often comes from other sources, most notably nonlinear synaptic release from bipolar cells (see Schwartz et al., 2012, Turner and Rieke, 2016, and Freeman, 2015), as well as the effect of amacrine cell processing, which often preserve or extend nonlinear effects. While conductances cannot be negative (as the authors assert), the rectification in Equation 10 probably has other sources. This issue raises concerns about the motivation for the CBEM and should be addressed and clarified.
b) The model in its current form cannot capture any cell for which either excitation or inhibition is a nonmonotonic function of contrast (i.e. any cell with ONOFF excitation or ONOFF inhibition). This applies to many other RGC types, and likely most visual neurons downstream in the brain. Please discuss this limitation more fully and argue for the generality of the model. How does this limit the general utility of this approach outside the retina? Can this model be used to infer a larger class of different contrastresponse function shapes? That would certainly be impressive, but doesn't appear to be within the reach of the current model.
c) The advances of the CBEM over previous work on models with separate LN components for excitatory and inhibitory inputs needs to be more thoroughly reviewed, placing the CBEM in the context of this work and arguing for its particular advances.
d) Rectification of the inputs (Equation 10) is the basis of the LNLN cascade model used in many papers over the years, starting with the work of Shapley and Victor and Korenberg and Sakai (in 1970s and 80s). It has been explicitly incorporated in likelihoodbased models of the retina and LGN in more recent years (Butts et al., 2011) and most recently in (Maheswaranathan et al., 2018). This includes models that explicitly model excitation and inhibition in the retina using nearly equivalent mathematical forms as used in the current manuscript.
The fact that previous models use spike trains to infer excitation and inhibition does seem to detract a bit from the novelty of the CBEM, if the manuscript does not demonstrate why the CBEM's particular form leads to better inference (certainly the validation with intracellular data here is key to drawing these conclusions and is understood as the main innovation in the paper). Answering the following questions would also provide more biological insight into the success of the CBEM: Is the CBEM's ability to match measured excitatory and inhibitory conductances a result of the integration of currents? The rectification of inputs? The difference between conductance and current?
e) What does the restriction that the conductances are nonnoisy do to the CBEM in terms of the types of cellular computations it can and cannot reproduce? Are there particular biophysical effects (e.g. stimdependent spiking noise) that will be missed via this constraint?
3) For a reader interested in applying the method, it will likely be important to get a better feeling for the applicability and interpretability of the data, in particular when no intracellular data are available for comparison.
a) Important questions are, for example, whether the method also works for nonwhitenoise stimuli and what may be limitations for the applicability of the method, or the interpretation of the obtained filters as corresponding to actual excitatory or inhibitory conductances. If no data are available, simulations and/or thoughtful discussion could help address these concerns.
b) As a specific example, Figure 10 shows that the inhibitory component can capture surround effects, at least for OFF parasol cells. But is it clear that this actually corresponds to inhibition received by the ganglion cell and not a representation of presynaptic inhibition that nonlinearly interacts with excitatory signals? (Presynaptic effects appear to form a major part of the surround in parasol cells, see, e.g., discussion in Turner, Schwartz and Rieke, 2018.)
c) The method currently uses an explicit model and parameters of the output nonlinearity that are obtained from intracellular data. For pure extracellular data, these parameters (or the shape of the nonlinearity) will not be known a priori. How does that affect the model fitting? Can the parameters of the output nonlinearity be included in the fitting procedure?
4) The results concerning contrast adaptation could be shortened or omitted. The CBEM does a bit better than a GLM, but really, both fail at contrast adaptation because one needs to model it explicitly, as has been done in many previous models, including papers cited in this manuscript from the Baccus group.
5) Existing approaches for inferring excitation and inhibition from spike trains are incorrectly labeled in the Introduction as LN modeling, and only briefly mentioned as alternative to the CBEM in the third paragraph of the discussion. This should be corrected, and the CBEM should be presented in this fuller context.
[Editors’ note: further revisions were suggested before acceptance.]
Thank you for resubmitting your work entitled "Inferring synaptic inputs from spikes with a conductancebased neural encoding model" for further consideration at eLife. Your revised article has been favorably evaluated by Michael Frank (Senior Editor) and a Reviewing Editor.
The manuscript has been improved but there are some remaining issues that need to be addressed before acceptance, as outlined below:
1) To clarify a bit first: The main model comparison that was sought after by the reviewers had the following aim: Explain how the inference of conductances here (i.e. the biophysical basis of the CBEM) is the key ingredient in successfully predicting the E/I values, as opposed to LNLN models, which have more general nonlinearities (as compared to the GLM's). The reviewers agree with you that the goal here is not to outperform other models per se, in terms of fitting performance or whatnot, but to add interpretable knowledge about the underlying biophysics. The prompt, then, from the reviewers is this: show more clearly and directly how the biophysical assumptions in the CBEM are crucial for getting this E/I estimation right, which allows for the proper interpretation of the model results; show that it's not just the fact that the CBEM (like LNLN models) has a more generalized form of nonlinearity built into it. Essentially, that one needs to model the biophysics in this more correct way to get the right interpretation out.
Here's the concern spelled out more explicitly:
If the CBEM is no better (at predicting the E/I inputs to a cell) than other LNLN models that simply have voltagelike, LN approximations of excitatory and inhibitory inputs, this means that the biophysics presented here is somewhat misleading (at least in the sense that it has to do with considering conductances), which is the current basis of the paper.
Additionally, any moreflexible LNLN models (with many subunits)  now several in the literature – would actually outperform the CBEM because they can include multiple subunits, and model more general nonlinearities.
It is very possible that the more explicit model of the biophysics in the CBEM would do better and its particular structure is therefore an advance – but the manuscript does not show or address this directly.
There were certainly some things said about this in the response to reviewers, and more of that should be entered into the paper as well.
2) You had a question in the response to reviewers about references for the "quasibiophysical interpretation of the GLM". The Gerstner references are certainly great here, but please also cite (and if appropriate, discuss) Pillow et al., 2004 and 2005 (where the output of the linear term is explicitly treated as a voltage) and perhaps one of the more recent papers comparing GLM fits to intracellular data.
3) Please add a few more lines to the Discussion section outlining why you expect this model to be of broad utility beyond the retina. Specifically: what sorts of heuristics can a future user of the method employ to decide if the model's assumptions are appropriate for their data? Here, just saying a bit more to justify the breadth of the expected applicability of the model would be sufficient.
https://doi.org/10.7554/eLife.47012.sa1Author response
[Editors’ note: the author responses to the first round of peer review follow.]
Reviewer #1:
This paper provides a bridge between traditional phenomenological LNP models and more biophysically realistic models, by supplying a method to infer excitatory and inhibitory conductances from spiking data. It is a wellreasoned and wellwritten account of this new model (CBEM) for computational neuroscience. The CBEM model is validated with a comparison to spiking and conductance data measured in retinal ganglion cells, and compares reasonably well to an LN model fit directly to the conductance data.
1) The biggest concern the current manuscript raises is: what important features of RGC encoding does the current model capture that a GLM cannot? Put another way, the new model explains more of the response variance of RGC cells, but what does that extra variance encode about the stimulus? What does this extra fitting power allow one to explain in terms of RGC computation? Can simulations be added that explore what aspects of stimulus encoding the full CBEM captures better than a GLM?
The CBEM can capture nonlinear response properties that arise from the interaction of excitatory and inhibitory conductance based inputs (which can give rise, for example to gain changes and changes in time constant), whereas the GLM is limited to a single filter, which assumes linear summation of excitatory and inhibitory inputs. In Figure 8 and Figure 10, we presented simulations showing that E‑I dependence leads to different predictions for contrast changes and center surround integration. We will extend these analyses to highlight how the CBEM’s conductances lead to these effects. More broadly, we would like to emphasize that we do not view the paper’s primary contribution as the fact that the CBEM “captures more response variance of RGC cells”. Rather, we think that the fact that it reveals intracellular conductances (verified with intracellular recordings) purely on the basis of spike times is the most important finding, and future application of our model. We will emphasize this primary motivation more clearly in our revision.
2) For the checkerboard stimuli (Figure 9), the GLM and CBEM models seem to have fairly comparable fits to the data. If these are the more interesting data to fit, how should one interpret the weaker additional explanatory power of the CBEM?
For these stimuli, the GLM does a fairly good job of fitting the data, and thus there isn't that much room for improvement. We note that even on natural scene data, the GLM can predict a large amount of the variance (for example, Batty et al., 2017). Despite the similar predictions to white noise, the CBEM predicts significant differences in responses to correlated stimuli.
3) Some more discussion of the results of the model simulations of centersurround stimuli is needed. How do the observed spike rates compare to known RGC responses? The full CBEM model clearly makes different predictions about the spiking response to spatially correlated stimuli (particularly the inhibition of the sustained response). What aspects of RGC computation are captured by the full CBEM model compared to the excitation only CBEM or GLM for these kind of stimuli?
The CBEM captures nonlinear center‑surround modulation that cannot be explained by a single linear filter. Additionally, in our simulations the interactions between inhibitory and excitatory inputs could produce more precise responses than predicted by the GLM or CBEM with only excitation.
4) Some variations (dynamic gain adaptation, cascade nonlinearities) on LN models for RGC data can explain more of the observed spiking response to natural image sequences, but still fail to predict the majority of response variance. Several groups have shown that deep or recurrent neural nets can be fit to retinal data and explain about 30% more of this variance. How does a CBEM compare? It could potentially have a serious advantage over these NN models, both in the number of parameters and in interpretability. The absence of a fit to more complex stimuli, where LN models are known to fail spectacularly, stands out as a large hole in the current manuscript. Of course the points brought up in comment 1 apply here as well.
Again, we apologize that we were not clearer about the primary contributions of our paper. As noted above, our goal is not simply to explain more variance, but to offer biological insights into what drives spiking in retinal ganglion cells. We offered the comparison to LN models and standard GLM as an “added bonus” that the CBEM also can provide a more accurate model of spiking. (Presumably this is because of the higher level of biological realism it achieves). However, the main point of our paper was not to produce a better R^2 value (although a higher R^2 is a consequence of better capturing of the synaptic input to the RGC), but to link purely phenomenological models to biology. We agree of course that applications to natural scenes are an important avenue for future research, and we hope that the CBEM will be provide a tool for studying neural responses obtained from such datasets.
Reviewer #2:
This paper introduces a new pointprocess model applied to retinal ganglion cell data, which is based on model components that infer excitatory and inhibitory conductances to predict spikes. It uses a smorgasbord of data to demonstrate difference successes of this model (better performance over linear models, ability to explain some aspects of contrast adaptation), and has a number of mathematical derivations and some simulations to demonstrate how it works.
Overall, it is a mixture of what appears to be solid methods with some interesting results from combining them. However, I found this paper scattered, flipping between mathematical derivations, modeling results, and findings specific to the retina. This has the effect of not clearly tying down any element convincingly, nor clearly demonstrating novelty – or at least the need for the advances suggested in this paper.
1) If main purpose is to demonstrate a better way to model neurons (or perhaps just retinal ganglion cells), one would want to understand its generality, as well as how it compares to models of similar ilk, particularly phenomenological models of excitation/inhibition such as the Butts, 2011, 2016. There is a lot of overhead in computing the integrationproperties of this model, and not clear what is gained over a more phenomenological LNLN cascade model. I see that it is possible there could be a lot to be gained from the CBEM, but the current manuscript does not make this clear.
Furthermore, what is the scope of this model. Do the authors expect this to be a more generally applicable method, or simply a means to model retinal ganglion cell responses? Under what conditions?
We apologize — the main goal of our model is not a better model of RGCs in terms of spike rate prediction, but to tie phenomenological to biophysical models which include dynamics. The overhead of computing the integration is not actually that big: this is solved with a tri‑diagonal matrix operation which is computed in linear time. What we gain with this is a far more solid connection between our model and biophysical components of the neurons, than is achieved by a phenomenological model. We do however see the merit in including a comparison to the Nonlinear Interaction Model proposed in McFarland, Cui and Butts, (2013).
As with the GLM, we expect that this model may be useful in many sensory regions. The conditions that we expect this model to be useful under conditions that any cascade model might be useful.
2) Relatedly, one wonders how important it is to model the conductances explicitly, versus membrane potential. Comparisons to membrane potential has been explored in several previous modeling approaches requiring only LN modeling (since there will not be two separate terms), including work of Priebe (Mohanty, 2012) in V1 and Demb (Zaghloul, 2005) in retina. More could be said about motivation for wanting to infer conductances past fitting data better. If the motivation is simply to fit data better, see #1 above.
We apologize again for the confusion: we feel the problem of identifying the factors that govern spiking at an intracellular level are worthwhile in their own right (e.g., does an increase in spiking result from a decrease in inhibition or an increase in excitation). So the viewpoint that we take is not that “it is important to model the conductances explicitly in order to obtain a state‑of‑the‑art descriptive model of retinal spike responses”, but rather that identifying the separate excitatory and inhibitory inputs to neurons is an interesting scientific problem in its own right. We did in fact use an LN model, the GLM, as a basis of comparison throughout the paper; this would be practically equivalent to the reviewer’s suggestion of modeling the voltage as LN because our spike rate was a nonlinear function of voltage. There are situations for which the voltage can be modeled as a simple LN model, but such models do not capture cells' responses to all stimuli. We note that of course there are known features of RGC responses that cannot be captured with LN models, as the reviewer has pointed out in comments 1, 3, and 4.
3) The ability to explain adaptation to contrast as totalconductance changes seems interesting, but is only presented to the level of validating the model, and not explored. It is thus difficult to evaluate based on the extensive literature studying this in the retina.
Sorry, we are not sure we understand the reviewer's objection here. Because we are introducing the CBEM and presenting it broadly, we did not aim to explore the entire gamut of contrast adaptation experiments within one subsection of the results. However, we would like to extend the presentation in this section to include information on how the prediction distributions of excitation and inhibition depend on contrast.
4) Relatedly, the Cui et al., 2016 paper that was cited in this manuscript also posits excitation and inhibition, models recorded synaptic currents and spikes, and offers an explanation for contrast adaptation, but with a different model form. It seems worth more of a discussion here commenting on it. Could their proposed model explain these results? (and/or vice versa). Can one distinguish between their proposed circuit (presynaptic inhibition being dominant) and normal inhibition proposed here? Likewise, although not matching in content as much, Ozuysal and Baccus also model intracellular recordings and contrast adaptation, using yet another model form.
Cui et al., model did not define inhibitory and excitatory synaptic conductances in a biophysical model. Comparisons between our two methods is difficult because their work focused on capturing nonlinear effects of presynaptic inhibition in the mouse retina. We feel that exploring these specific nonlinear presynaptic inhibition is beyond the scope of our paper, but it would be interesting to explore in future work. We are familiar with the work of Ozuysal and Baccus, but would like to emphasize that work required intracellular recordings for model fitting — the main achievement of our work is that the model can be used to identify intracellular conductances despite being fit only spike trains; thus it can shed light on underlying biological mechanisms even when only extracellular recordings are available.
5) The derivations in the subsection “Background: Poisson GLM with spike history”, motivating the modeling, do not make a clear argument that the model goes beyond the phenomenological, given its overly restrictive assumptions (e.g., that E=I). As soon as the GLMbased biophysical model is derived making explicit assumptions, it seems to say that these assumptions do not hold and defines the CBEM without them. Later data in the paper (and in previous papers) seem to invalidate these assumptions as well. If these assumptions do not hold, what is the purpose of deriving the model in this context?
Sorry, but we feel we did not explain this clearly enough. Our point was not to suggest that E=‑I is a good model for real neurons, but to offer a new interpretation for the classic GLM. (This connection to a biophysical model — albeit an unrealistic one — has not to our knowledge been proposed before). The idea of relaxing this overly restrictive constraint is the step that leads to our model, the CBEM.
The main point of our paper is to link phenomenological models to biophysical models. These derivations were only intended show a link between a commonly used statistical model of spiking (GLM) and a biophysical model to demonstrate the assumptions on biophysics placed by the GLM that are not obvious from the purely statistical presentation. We will clarify this motivation in the revision.
6) It was very unclear the components of the various models and how they are fit to data, and I could not make sense until poring through the Materials and methods section. This was in part complicated by the use of different models in different figures (fit to different experiments), without a clear overarching structure. It might also be useful to compare and contrast the derived model forms to simpler phenomenological models of excitation/inhibition (see #1 above).
We apologize for the difficulty in understanding our model. Our motivation for the paper’s organization was to isolate each biophysically interpretable component of the model and relate the component to quantities measures in the real retina. Our Materials and methods section is structured in accordance to each Results section and figure. Secondly, as we mentioned in our response to comment 1, we are willing to compare the CBEM with NIM, a phenomenological model with excitatory and suppressive units.
7) The subsection “Capturing spike responses to spatially varying stimuli” based on simulation had very confusing motivation and conclusions, and could be much better fit into the logical structure of the rest of the paper.
We are sorry this was not clear. The simulation attempts to demonstrate how the models differ in their predictions of center‑surround tuning (with a stimulus that wasn’t used in fitting) despite giving similar fits to noise data. We can provide additional information on the simulation work.
Reviewer #3:
[…] The manuscript presents an interesting and thoughtprovoking approach. The possibility of inferring features of excitatory and inhibitory input from recorded spike trains may be a great tool for investigating sensory processing. Also, the mathematical connections to the GLM, which are nicely drawn out in the manuscript, provide a good background for understanding how this model framework functions. In the present form, however, one concern is that it remains a bit unclear how directly applicable the model is to other systems or what sort of insights it may provide. I would imagine that a more general discussion of the applicability, limitations, and interpretation of the model, potentially supported by some additional example from data or simulations, would considerably strengthen the manuscript.
Essential revisions:
The successful inference of excitatory and inhibitory inputs is only shown for parasol cells under fullfield stimulation. Here, the excitatory and inhibitory filters are nearly inverted with respect to each other, similar to the specific case discussed in the text where the conductance model can be mapped onto a GLM. Therefore, one wonders how the model performs when excitation and inhibition are correlated instead of anticorrelated or when their filters are (nearly) orthogonal to each other. Are correlated components of the inhibition as well recovered as uncorrelated (orthogonal) components, or will there be some bias?
We expect that the specific configuration of excitation and inhibition will matter. However, we have found in simulations that we can recover the conductance filters when inhibition is delayed, but not opposite, to excitation.
The match of the inferred and measured conductance is impressive and suggests that this should work as a general technique to assess inhibitory input. However, it is not clear to me how the model would treat inhibitory signals that are not direct conductance inputs into the analyzed neuron. In the retina, for example, many circuits are also influenced by presynaptic inhibition, which acts on bipolar cell terminals. Should one expect that such inhibition is captures by the inhibition filter (because it is likely filtered differently than the direct excitatory pathway) or by the excitation filter (because it affects the excitatory conductance)?
Presynaptic inhibition is an interesting effect (as noted by reviewer 2 in reference to Cui et al., 2016 which looked at such computations in the mouse retina). However, we believe these effects are beyond the scope of the current paper, but we would like to explore more complicated nonlinear effects in the future.
Regarding the improvement of response predictions over the GLM, I'm wondering whether the differences in the applied nonlinearities play any role here. While the softthreshold nonlinearities of the CBEM are backed up nicely by analyses, the exponential nonlinearity of the GLM could create problems in the quantitative predictions, especially for strong activation as is likely the case under fullfield stimulation.
We found that the GLM performed very poorly with the soft‑rectified nonlinearity for these cells (as was also reported in Pillow et al., 2008). We have made sure to mention this. We also emphasize that the choice of the CBEM’s nonlinearity was by matching voltage to spike rate, and not to maximize the performance of spike rate predictions in response to a stimulus.
To emphasize the significance of the work, maybe the authors could point out more clearly how the method could be used for providing new insight into the investigated neurons. Is there a specific finding regarding the inputs into parasol cells derived from the present analysis that could serve as an example? Or maybe the authors could point out questions where the model framework might provide answers.
Our method provides a quantitative framework to combine different measures of neural activity that cannot be simultaneously measured. In the array recordings we analyzed, we had access to far more data than with intracellular recordings, but we could only observe spikes.
The gain control section of our paper showed that independent excitation and inhibition can capture some of the contrast adaptation observed in RGCs. Because multiple mechanisms contribute to contrast adaptation, this affect could not easily be quantified without a model.
[Editors' note: the author responses to the rereview follow.]
Essential revisions:
The reviews on this version of the manuscript were mixed, with some definite enthusiasm for the inclusion of the new midget cell data, but some remaining skepticism and questions about the presentation and generality of the current results. Several points need to be fully addressed for this manuscript to be accepted:
1) For the section on interpreting the GLM as a conductancebased model, and the biophysical motivation for the paper:
The model is set up to make some mathematical manipulations in order to remove the voltage dependence of the currents. That wasn't super clear on a first or second pass and should be made more accessible to the reader.
We apologize for the confusion. We have revised this section to make clearer that the goal here was to achieve voltage independence. The revised text reads:
“Here we propose a novel biophysically realistic interpretation of the classic Poisson GLM as a dynamical model with conductancebased input. In brief, this involves writing the GLM as a conductancebased model with excitatory and inhibitory conductances governed by affine functions of the stimulus, but constrained so that total conductance is fixed. This removes voltagedependence of the membrane currents, making the membrane potential itself an affine function of the stimulus.
The remainder of this section lays out the mathematical details of this interpretation explicitly.”
Other points to address for clarity and scholarship:
a) There exists a common quasibiophysical interpretation of the GLM, wherein the output of the linear stage is thought of as an approximation of the intracellular input or voltage. This interpretation requires (1) that E and I inputs are thought of as currents and sum linearly. Near threshold, this approximation might be decent for modeling spiking, as the voltage is ~constant. (2) the integration time of the neuron must be short enough that the response is mostly a function of the inputs, not on its own voltage history. With these constraints presumed satisfied, many previous studies over the years have assumed that the generating function of the GLM has a quasibiophysical basis. Can this more standard derivation/interpretation be addressed in this manuscript? Does the explicit integration in the "biophysical" GLM detailed in this work (which addresses assumption 2 above) help in fitting neuronal spiking as compared to a standard GLM? No explicit comparisons are made in the manuscript, and should be added.
We thank the reviewers for this comment, and we apologize if we have failed to cite relevant literature about interpretations of the Poisson GLM. We have cited Plesser and Gerstner, 2000 (which first described the “escape rate” approximation to noisy integrateandfire as an LN neuron), as well as work from Gerstner on the spikeresponse model. We are not familiar however with other previous work interpreting the GLM as having a biophysical basis (e.g. in terms of E and I current inputs that sum linearly). If the reviewers wouldn’t mind clarifying which references they are referring to, we would be grateful for the opportunity to add citations and discuss the relationship to previous work more clearly.
We did not fully understand the comment about a lack of explicit comparisons, as we did indeed compare the “conductance based model” (CBEM) to the standard Poisson GLM in Figure7, Figure 8, Figure 10 and Figure 11. Was there another comparison the reviewers would have liked to see? We apologize if we have misunderstood.
We have updated the discussion of these previous interpretations that was included in subsection “Background: Poisson GLM with spike history”. We now state that this interpretation assumes a currentbased input rather than conductance based. The biophysical interpretation of the GLM offered in subsection “Interpreting the GLM as a conductancebased model”, unfortunately does not affect the fits data, as it is mathematically equivalent. (We have sought to emphasize that it is a new interpretation, not a new model).
b) The derivations seem overly long to simply notice that ge(VEe) – gi(VEi) will have no voltage dependence if ge and gi cancel. This derivation might be difficult to follow for the broad readership of this journal and should be explained more clearly.
We thank the reviewers for this comment. We have now added a summary(“In brief,…”, subsection “Interpreting the GLM as a conductancebased model”), which attempts to summarize the idea so that reviewers who are not interested in the detailed derivation can skip the remainder of the section. We have also shortened and simplified the derivation itself, as we agree with the reviewer that the section was overly long; although we still wish to make clear exactly how to relate the GLM filter k_glm defined in the previous section to the linear conductance filter “k” that we introduced in our interpretation. We hope the revised derivation will be simpler and easier to understand.
c) This should be reorganized so that it doesn't distract from subsequent results. It could be made more clear that the E = I assumption is made only to connect this to the GLMlike model class, to motivate the model setup, and is relaxed in the subsequent CBEM inference scheme.
Thanks for the suggestion. We have reiterated that this assumption is only to construct a GLM at the start of both the subsection “Interpreting the GLM as a conductancebased model” and subsection “The conductancebased encoding model (CBEM)”.
2) For the CBEM model setup, justification, and background:a) The GLM derivation is what is used to motivate the CBEM, and excitation and inhibition are rectified (eq 10 compared with eq 9). While the motivation for such rectification is that conductances must be nonnegative, this seems somewhat misleading. In the retina, rectification of excitation and inhibition (where it exists) often comes from other sources, most notably nonlinear synaptic release from bipolar cells (see Schwartz et al., 2012, Turner and Rieke, 2016, and Freeman, 2015), as well as the effect of amacrine cell processing, which often preserve or extend nonlinear effects. While conductances cannot be negative (as the authors assert), the rectification in Equation 10 probably has other sources. This issue raises concerns about the motivation for the CBEM and should be addressed and clarified.
Thanks for this suggestion. We have clarified that our nonlinearity is primarily aimed at accounting for synaptic thresholding/nonlinearities, instead of just enforcing positive conductances. This is included where we introduce the softrectified nonlinearity for RGC data (subsection: Validating the CBEM modeling assumptions with intracellular data”).
b) The model in its current form cannot capture any cell for which either excitation or inhibition is a nonmonotonic function of contrast (i.e. any cell with ONOFF excitation or ONOFF inhibition). This applies to many other RGC types, and likely most visual neurons downstream in the brain. Please discuss this limitation more fully and argue for the generality of the model. How does this limit the general utility of this approach outside the retina? Can this model be used to infer a larger class of different contrastresponse function shapes? That would certainly be impressive, but doesn't appear to be within the reach of the current model.
Our code allows for an arbitrary set of conductances with different nonlinear functions. However, adding more conductances with more nonlinearities makes fitting more challenging. Future work with additional experiments (which are beyond the scope of this current manuscript) would be needed to test how the model can estimate ONOFF inhibition or excitation. The discussion now reads “For instance, the CBEM's linearrectified conductance does not capture the nonmonotonic portions of the stimulusconductance nonlinearities observed in the data”
c) The advances of the CBEM over previous work on models with separate LN components for excitatory and inhibitory inputs needs to be more thoroughly reviewed, placing the CBEM in the context of this work and arguing for its particular advances.
d) Rectification of the inputs (Equation 10) is the basis of the LNLN cascade model used in many papers over the years, starting with the work of Shapley and Victor and Korenberg and Sakai (in 1970s and 80s). It has been explicitly incorporated in likelihoodbased models of the retina and LGN in more recent years (Butts et al., 2011) and most recently in (Maheswaranathan et al., 2018). This includes models that explicitly model excitation and inhibition in the retina using nearly equivalent mathematical forms as used in the current manuscript.
The fact that previous models use spike trains to infer excitation and inhibition does seem to detract a bit from the novelty of the CBEM, if the manuscript does not demonstrate why the CBEM's particular form leads to better inference (certainly the validation with intracellular data here is key to drawing these conclusions and is understood as the main innovation in the paper). Answering the following questions would also provide more biological insight into the success of the CBEM: Is the CBEM's ability to match measured excitatory and inhibitory conductances a result of the integration of currents? The rectification of inputs? The difference between conductance and current?
We thank the reviewers for this comment and suggestions which will help make our specific contributions much more clear within the context of existing literature in the revised manuscript. We have provided additional discussion of the relationship between CBEM and previous work with LNLN cascade models, some of which include functionally suppressive units in the Introduction and Discussion section. We have added clarifications to explain that our approach adds on these previous approaches by comparing the model components identified from spike trains to conductances measured in the same cells (Introduction and Discussion section), which to our knowledge has not been performed in previous modeling exercises. Comparing the model’s predictions specifically to excitatory and inhibitory inputs is a primary goal of our work because phenomenological models may combine inputs from amacrine and bipolar cells into common input channels (as described by Maheswaranathan et al., mentioned by the reviewer) and functionally suppressive units may not correspond to the actual inhibition received by the cell. Moreover, our approach takes the additional step of using a biophysical framework, but instead of being defined purely as an LNLN phenomenological model. Thus, our modeling approach may provide a superior tool to combine quantitative biophysical measurements of cells and circuits with measurement of stimulus encoding.
e) What does the restriction that the conductances are nonnoisy do to the CBEM in terms of the types of cellular computations it can and cannot reproduce? Are there particular biophysical effects (e.g. stimdependent spiking noise) that will be missed via this constraint?
This is an excellent point. Without correlations in the two channels, we cannot predict the coding benefits of correlations observed in Cafaro and Rieke, (2010). We have included this in the Discussion section as a possible direction for future research.
3) For a reader interested in applying the method, it will likely be important to get a better feeling for the applicability and interpretability of the data, in particular when no intracellular data are available for comparison.
We thank the reviewers for this suggestion. We have added text discussing the applicability of the CBEM to the Discussion section. Primarily, we believe that the model can be applied when the inputs to a cell are assumed to be approximately linear (or a linearized approximation is assumed through a model; for example, a model V1 as an input to MT).
a) Important questions are, for example, whether the method also works for nonwhitenoise stimuli and what may be limitations for the applicability of the method, or the interpretation of the obtained filters as corresponding to actual excitatory or inhibitory conductances. If no data are available, simulations and/or thoughtful discussion could help address these concerns.
Thanks for this comment, yes — the method makes no assumptions about Gaussiannity or whiteness of the stimulus. The applicability of our model to nonwhite noise stimuli is similar to other statistical methods fit by maximum likelihood MAP, because these tools do not rely on STA or STC features. GLMs and neural network models have been successfully fit using naturalistic stimuli.
b) As a specific example, Figure 10 shows that the inhibitory component can capture surround effects, at least for OFF parasol cells. But is it clear that this actually corresponds to inhibition received by the ganglion cell and not a representation of presynaptic inhibition that nonlinearly interacts with excitatory signals? (Presynaptic effects appear to form a major part of the surround in parasol cells, see, e.g., discussion in Turner, Schwartz and Rieke, 2018.)
We appreciate the reviewer’s comment, but we have considered the treatment of presynaptic inhibition beyond the scope of this manuscript. However, we are hoping to explore this more thoroughly in future work. Currently, our model only considers independent excitatory and inhibitory signals. We have made sure to better address this point in the Discussion section.
c) The method currently uses an explicit model and parameters of the output nonlinearity that are obtained from intracellular data. For pure extracellular data, these parameters (or the shape of the nonlinearity) will not be known a priori. How does that affect the model fitting? Can the parameters of the output nonlinearity be included in the fitting procedure?
We could fit these parameters in our optimization. We have run some model fits using a wider range of output nonlinearity parameters and found that this did not strongly impact our ability to predict excitatory and inhibitory tuning. Thus, we would caution against making conclusions about the spike nonlinearity that rest on a single setting of those output parameters in the extracellular setting. Instead, we would argue in favor of considering the range output nonlinearity parameters that are consistent with the data (similar to the work of Prinz and Marder), perhaps within a Bayesian framework.
4) The results concerning contrast adaptation could be shortened or omitted. The CBEM does a bit better than a GLM, but really, both fail at contrast adaptation because one needs to model it explicitly, as has been done in many previous models, including papers cited in this manuscript from the Baccus group.
We apologize but we do not fully understand what the reviewer means by the claim that the models fail at contrast adaptation. We appreciate the reviewers’ concerns and we have edited this section to make the limitations of our work more clear. We have better emphasized that we are not attempting to model all aspects of adaptation, but that our test is aimed only to determine to what extent adaptation can be predicted from the fixed LN conductances alone. Additionally, our model does a fair job at predicting gain change across contrast levels. We have added a citation to the recently published modeling work by Ozuysal, Kastner and Baccus, (2018) who concluded that “differences in synaptic threshold in the two pathways” was the primary source of adaptation in their study.
5) Existing approaches for inferring excitation and inhibition from spike trains are incorrectly labeled in the Introduction as LN modeling, and only briefly mentioned as alternative to the CBEM in the third paragraph of the discussion. This should be corrected, and the CBEM should be presented in this fuller context.
We apologize for this oversight, and have rephrased this paragraph to emphasize that we are referring more generally to the cascade model class (to which all these models belong), not just LN. Additionally, we have also elaborated in the final paragraph of the Introduction how the CBEM compares to the cascade models (LNLN models) the reviewer mentions in terms of excitatory and inhibitory conductance estimation.
[Editors' note: further revisions were suggested before acceptance.]
The manuscript has been improved but there are some remaining issues that need to be addressed before acceptance, as outlined below:
1) To clarify a bit first: The main model comparison that was sought after by the reviewers had the following aim: Explain how the inference of conductances here (i.e. the biophysical basis of the CBEM) is the key ingredient in successfully predicting the E/I values, as opposed to LNLN models, which have more general nonlinearities (as compared to the GLM's). The reviewers agree with you that the goal here is not to outperform other models per se, in terms of fitting performance or whatnot, but to add interpretable knowledge about the underlying biophysics. The prompt, then, from the reviewers is this: show more clearly and directly how the biophysical assumptions in the CBEM are crucial for getting this E/I estimation right, which allows for the proper interpretation of the model results; show that it's not just the fact that the CBEM (like LNLN models) has a more generalized form of nonlinearity built into it. Essentially, that one needs to model the biophysics in this more correct way to get the right interpretation out.
Here's the concern spelled out more explicitly:
If the CBEM is no better (at predicting the E/I inputs to a cell) than other LNLN models that simply have voltagelike, LN approximations of excitatory and inhibitory inputs, this means that the biophysics presented here is somewhat misleading (at least in the sense that it has to do with considering conductances), which is the current basis of the paper.
Additionally, any moreflexible LNLN models (with many subunits)  now several in the literature – would actually outperform the CBEM because they can include multiple subunits, and model more general nonlinearities.
It is very possible that the more explicit model of the biophysics in the CBEM would do better and its particular structure is therefore an advance – but the manuscript does not show or address this directly.
There were certainly some things said about this in the response to reviewers, and more of that should be entered into the paper as well.
We have made clarifications to the Discussion section to clarify that we believe more flexible LNLN models could help in extending the CBEM (see comment 3). The specific aims of the CBEM are primarily that simple biophysical assumptions leads to an accurate prediction of RGC synaptic input tuning with the CBEM (points which have not been directly testing with LNLN models), and we made sure that the paper did not make claims that one must model biophysics.
However, we admit that we are a bit confused by the request to show that the CBEM that our biophysical assumptions are crucial, or that CBEM is better at predicting conductances than other LNLN models. There simply are no models that we are aware of that have sought to predict conductances before!
Note that our paper does not represent a classic machine learning approach, where you take a bunch of stimuli and a bunch of voltage data and train a network to approximate the mapping from one to the other. In our paper, we take a stimuli and recorded spiketrains and learn a model to predict the membrane potential that underlay those spikes. So to ask, “does it achieve better voltage prediction than an LNLN model that accurately predicts firing rate?” seems like a non nonsequitur to us. It would be like asking how well a network that predicts MNIST digits can predict the labels of ImageNet images. (Such a model can’t, because it doesn’t even have the right output capability to do so!) Similarly, for our model: there are two conductances at any time point (exc and inh) whereas an LNLN model predicting spike rate has only one output.
But please forgive us if we have misunderstood something. Our view is that — while other models might reasonably be adapted to the prediction of conductances from spike train recordings, using either ours or different assumptions about the underlying biology — that is very much hypothetical “future work” that has not yet been carried out, and so is not a benchmark we aim for in this manuscript.
We are grateful for the feedback, nonetheless, as it clarifies some of the confusion the reviewers may have had with our paper. We have tried to revise the Discussion section to clarify that, while more flexible ML approaches may give better spike prediction, our model seeks an explicit link to biological processes which we confirmed using intracellular data (Discussion section). We also suggest some future avenues along which the two approaches (the CBEM and LNLN or deep learning models) might be fruitfully combined to achieve even better models for conductance prediction. (See Discussion section).
2) You had a question in the response to reviewers about references for the "quasibiophysical interpretation of the GLM". The Gerstner references are certainly great here, but please also cite (and if appropriate, discuss) Pillow et al., 2004 and 2005 (where the output of the linear term is explicitly treated as a voltage) and perhaps one of the more recent papers comparing GLM fits to intracellular data.
Thanks for the suggestion. We have made sure to discuss these references to the stochastic LIF models from Pillow et al., 2004 and 2005 to mention that those references assume the input is a currentbased, linear function of the stimulus (subsection “Background: Poisson GLM with spike history”).
3) Please add a few more lines to the Discussion section outlining why you expect this model to be of broad utility beyond the retina. Specifically: what sorts of heuristics can a future user of the method employ to decide if the model's assumptions are appropriate for their data? Here, just saying a bit more to justify the breadth of the expected applicability of the model would be sufficient.
We have included a much more detailed discussion on the utility of the CBEM and provided recommendations for future applications (Discussion section).
https://doi.org/10.7554/eLife.47012.sa2Article and author information
Author details
Funding
McKnight Foundation
 Jonathan W Pillow
Simons Foundation (SCGB AWD1004351)
 Jonathan W Pillow
National Science Foundation (IIS1150186)
 Jonathan W Pillow
National Institute of Mental Health (MH099611)
 Jonathan W Pillow
Howard Hughes Medical Institute
 Fred Rieke
National Institutes of Health (EY011850)
 Fred Rieke
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank EJ Chichilnisky for generously providing data and valuable discussion. We also thank Il Memming Park and Jacob Yates for helpful comments. This work was supported by the McKnight Foundation (JWP), the Simons Foundation (SCGB AWD1004351, JWP), an NSF CAREER Award IIS1150186 (JWP), a grant from the NIMH (MH099611, JWP), the Howard Hughes Medical Institute (FR), and a grant from the NIH (EY011850, FR).
Ethics
Animal experimentation: Tissue was obtained via the tissue distribution program at the Washington National Primate Research Center. All animal procedures were performed in accordance with IACUC protocols at the University of Washington (IACUC protocol number 427701).
Senior Editor
 Michael J Frank, Brown University, United States
Reviewing Editor
 Stephanie Palmer, University of Chicago, United States
Publication history
 Received: March 22, 2019
 Accepted: December 17, 2019
 Accepted Manuscript published: December 18, 2019 (version 1)
 Version of Record published: January 29, 2020 (version 2)
Copyright
© 2019, Latimer et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics

 1,417
 Page views

 236
 Downloads

 5
 Citations
Article citation count generated by polling the highest count across the following sources: PubMed Central, Crossref, Scopus.