Ultrafast simulation of large-scale neocortical microcircuitry with biophysically realistic neurons

Abstract
Editor's evaluation
Introduction
Results
Discussion
Methods
Data availability
References
Article and author information
Metrics

Abstract

Understanding the activity of the mammalian brain requires an integrative knowledge of circuits at distinct scales, ranging from ion channel gating to circuit connectomics. Computational models are regularly employed to understand how multiple parameters contribute synergistically to circuit behavior. However, traditional models of anatomically and biophysically realistic neurons are computationally demanding, especially when scaled to model local circuits. To overcome this limitation, we trained several artificial neural network (ANN) architectures to model the activity of realistic multicompartmental cortical neurons. We identified an ANN architecture that accurately predicted subthreshold activity and action potential firing. The ANN could correctly generalize to previously unobserved synaptic input, including in models containing nonlinear dendritic properties. When scaled, processing times were orders of magnitude faster compared with traditional approaches, allowing for rapid parameter-space mapping in a circuit model of Rett syndrome. Thus, we present a novel ANN approach allowing for rapid, detailed network experiments using inexpensive and commonly available computational resources.

Editor's evaluation

This study describes the use of artificial neural network (ANN) methods to accurately replicate the biophysical behavior of detailed single-neuron models. The method has the potential to greatly increase the speed of neuronal modeling compared to conventional differential equation-based modeling, and scales particularly well for large network models. The authors demonstrate the fidelity of their ANN model cells over a wide range of stimulus and recording conditions including electrical and optical readouts.

https://doi.org/10.7554/eLife.79535.sa0

Introduction

Understanding the behavior of complex neural circuits like the human brain is one of the fundamental challenges of this century. Predicting mammalian circuit behavior is difficult due to several underlying mechanisms at distinct organizational levels, ranging from molecular-level interactions to large-scale connectomics. Computational modeling has become a cornerstone technique for deriving and testing new hypotheses about brain organization and function (Sejnowski et al., 1988; Wolpert and Ghahramani, 2000; Dayan and Abbott, 2001; Kriegeskorte and Douglas, 2018). In little more than 60 years, our mechanistic understanding of neural function has evolved from describing action potential (AP)-related ion channel gating (Hodgkin and Huxley, 1952) to constructing models that can simulate the activity of whole-brain regions (Traub et al., 2005; Yu et al., 2013; Neymotin et al., 2016b; Chavlis et al., 2017; Turi et al., 2019). Although tremendous advancements have been made in the development of computational resources, the lack of available or affordable hardware for neural simulations currently represents a significant barrier to entry for most neuroscientists and renders many questions intractable. This is particularly well illustrated by large-scale neural circuit simulations. In contrast to detailed single-cell models, which have been a regular occurrence in publications since the 1990s (De Schutter and Bower, 1994; Mainen et al., 1995; Migliore et al., 1995; Mainen and Sejnowski, 1996; Destexhe et al., 1998; Stuart and Spruston, 1998; Aradi and Holmes, 1999; Migliore et al., 1999), parallel simulation of thousands, or even hundreds of thousands of detailed neurons have only become a possibility with the advent of supercomputers (Markram et al., 2015; Bezaire et al., 2016; Arkhipov et al., 2018; Joglekar et al., 2018; Schmidt et al., 2018; Antolík et al., 2019; Schwalger and Chizhov, 2019; Billeh et al., 2020). As these resources are still not widely accessible, several attempts have been made to mitigate the immense computational load of large-scale neural simulations by judicious simplification (Wang and Buzsáki, 1996; Bartos et al., 2002; Santhakumar et al., 2005; Eppler, 2008, Cutsuridis et al., 2010; Nowotny et al., 2014; Bezaire et al., 2016; Yavuz et al., 2016; Teeter et al., 2018; Amsalem et al., 2020; Knight et al., 2021; Knight and Nowotny, 2021, Wybo et al., 2021). However, simplification inevitably results in feature or information loss, such as sacrificing multicompartmental information for simulation speed (Wang and Buzsáki, 1996; Bartos et al., 2002; Santhakumar et al., 2005; Bezaire et al., 2016). Thus, there is a critical need for new approaches to enable efficient large-scale neural circuit simulations on widely available computational resources without surrendering biologically relevant information.

To counteract the increasing computational burden of ever-growing datasets on more traditional models, many fields have recently adopted various machine learning algorithms (Sharma et al., 2011; Montavon et al., 2013; Meredig et al., 2014; Merembayev et al., 2018; Schütt et al., 2020). Specifically, artificial neural networks (ANNs) are superior to conventional model systems both in terms of speed and accuracy when dealing with complex systems such as those governing global financial markets or weather patterns (Holmstrom, 2016; Ghoddusi et al., 2019). Due to their accelerated processing speed, ANNs are ideal candidates for modeling large-scale biological systems. The idea that individual neural cells could be represented by ANNs was proposed almost two decades ago (Poirazi et al., 2003); however, current ANN solutions are still unfit to replace traditional modeling systems as they cannot generate gradational neuronal dynamics needed for network simulations. Therefore, we aimed to develop an ANN that can (1) accurately replicate various features of biophysically detailed neuron models, (2) efficiently generalize for previously unobserved input conditions, and (3) significantly accelerate large-scale network simulations.

Here, we investigated the ability of several ANN architectures to represent membrane potential dynamics, in both simplified point neurons and multicompartment neurons. Among the selected ANNs, we found that a convolutional recurrent architecture can accurately simulate both subthreshold and suprathreshold voltage dynamics. Furthermore, this ANN could generalize to a wide range of input conditions and reproduce neuronal features following different input patterns beyond membrane potential responses, such as ionic current waveforms. Next, we demonstrated that this ANN could also accurately predict multicompartmental information by fitting this architecture to a biophysically detailed layer 5 (L5) pyramidal cell (PC; Hallermann et al., 2012) model. Importantly, we found that ANN representations could drastically accelerate large network simulations, as demonstrated by network parameter space mapping of a cortical L5 recurrent microcircuit model of Rett syndrome, a neurodegenerative disorder associated with cortical dysfunction and seizures (Hagberg et al., 1985; Armstrong, 2005, Glaze, 2005; Chahrour and Zoghbi, 2007). Thus, we provide a detailed description of an ANN architecture suitable for large-scale simulations of anatomically and biophysically complex neurons, applicable to human disease modeling. Most importantly, our ANN simulations are accelerated to the point where detailed network experiments can now be carried out using inexpensive, readily available computational resources.

Results

To create a deep learning platform capable of accurately representing the full dynamic membrane potential range of neuronal cells, we focused on model systems proven to be suitable for multivariate time-series forecasting (MTSF). To compare the ability of different ANNs to reproduce the activity of an excitable cell, we designed five distinct architectures (Figure 1). The first two models were a simple linear model with one hidden layer (linear model, Figure 1A, blue) and a similar model equipped with nonlinear processing (nonlinear model, Figure 1A, cyan), as even relatively simple model architectures can explain the majority of subthreshold membrane potential variance (Ujfalussy et al., 2018). The third and fourth models consist of recently constructed time-series forecasting architectures, including a recurrent ANN (convolutional neural network-long short-term memory [CNN-LSTM], Figure 1A, magenta) consisting of convolutional layers (Collobert and Weston, 2008), long short-term memory (LSTM; Hochreiter and Schmidhuber, 1997; Donahue et al., 2015) layers, and fully connected layers, termed the CNN-LSTM network (Figure 1—figure supplement 1, Shi, 2015), and a more recently developed architecture relying on dilated temporal convolutions (convolutional net, Figure 1A, orange) (based on the WaveNet architecture; Oord, 2016; Beniaguev et al., 2021), which is superior to the CNN-LSTM in several MTSF tasks. The CNN-LSTM has the distinct advantage of having almost two orders of magnitude more adjustable parameters compared to the aforementioned ANNs. Finally, we selected a fifth architecture (deep neural net, Figure 1A, green) with a comparable number of free parameters to the CNN-LSTM, composed of 10 hidden layers, which operates solely on linear and nonlinear transformations. Before moving to neural cell data, each of the five selected architectures were evaluated using a well-curated weather time-series dataset (see ‘Methods’). Each model performed similarly (0.070/0.069, 0.059/0.06, 0.089/0.094, 0.07/0.069, 0.092/0.095, mean absolute error on the validation/testing datasets for linear, nonlinear, convolutional net and CNN-LSTM, deep neural net architectures, respectively), demonstrating their suitability for MTSF problems.

Figure 1 with 1 supplement see all

Download asset Open asset

Single-compartmental neuronal simulations using artificial neural networks (ANNs).

(A) Representative diagrams of the tested architectures, outlining the ordering of the specific functional blocks of the ANNs. (B) Continuous representative trace of a point-by-point fit of passive membrane potential. (C) Point-by-point fit plotted against ground truth data (n = 45,000). (D) Mean squared error of ANN fits corresponds to the entire training dataset (n = 2.64 * 10⁶ datapoints). Single quantal inputs arrive stochastically with a fixed quantal size: 2.5 nS for excitatory, 8 nS for inhibitory inputs, sampling is 1 kHz. Red and green bars below membrane potential traces denote the arrival of inhibitory and excitatory events, respectively. (E) Representative trace of a continuous passive membrane potential prediction (left) created by relying on past model predictions. Explained variance (right) was calculated from 500-ms-long continuous predictions (n = 50). (F) Representative active membrane potential prediction by ANNs. (G) Explained variance (box chart) and Pearson’s r (circles) of model predictions and ground truth data for the five ANNs from 50 continuous predictions, 500 ms long each. (H) Spike timing of the convolutional neural network-long short-term memory (CNN-LSTM) model calculated from the same dataset as panel (G). Color coding is the same as in panel (A). (I) Representative continuous, 25-s-long simulation of subthreshold and spiking activity. (J) Explained variance as a function of time during the 25-s-long simulation depicted in panel (I). Red line and r-value correspond to the best linear fit. (K) Difference between voltage traces produced by NEURON and ANN simulations. Red line and r-value correspond to the best linear fit.

Prediction of point neuron membrane potential dynamics by ANNs

To test the ability of the five ANNs to represent input–output transformations of a neural cell, we next fitted these architectures with data from passive responses of a single-compartmental point-neuron model (NEURON simulation environment; Hines and Carnevale, 1997) using the standard backpropagation learning algorithm for ANNs (Rumelhart et al., 1986). Each model was tasked with predicting a single-membrane potential value based on 64 ms (a time window that yielded the best results both in terms of speed and accuracy) of preceding membrane potentials and synaptic inputs (Figure 1A). ANN fitting and query were run on a single-core central processing unit (CPU). We found that both the linear and nonlinear models predicted subsequent membrane potential values with low error rates (Figure 1B) with similar behavior in both the CNN-LSTM and convolutional architectures (2.16 * 10^–4 ± 1.18 * 10^–3, 2.07 * 10^–4 ± 1.11 * 10^–3, 1.43 * 10^–4 ± 9.31 * 10^–4, 1.29 * 10^–4 ± 9.42 * 10^–4 mean error for linear, nonlinear, CNN-LSTM, and convolutional models, respectively). However, the deep neural network performed considerably worse than all other tested models (3.94 * 10^–4 ± 1.56 * 10^–3 mean error), potentially due to the nonlinear correspondence of its predicted values to the ground truth data (Figure 1C and D).

Next, we tested ANNs in simulation conditions similar to the traditional models. To this end, we initialized ANNs with ground truth data followed by a continuous query period in which forecasted membrane potential values were fed back to the ANNs to observe continuous unconstrained predictions. As expected from the fit error rates of single-membrane potential forecasting (Figure 1D), continuous predictions of the linear, convolutional, and CNN-LSTM models could explain the ground truth signal variance at high accuracy. At the same time, the deep neural net performed slightly worse (Figure 1E, 97.1 ± 1.2, 99.3 ± 1.4, 97.2 ± 2.2, 84.0 ± 3.2 variance explained for linear, convolutional, CNN-LSTM, and deep neural net architectures, respectively, n = 50). Surprisingly, the nonlinear model produced the worst prediction for passive membrane potential traces (0.82 ± 0.03 variance explained, n = 50) despite performing the best on the benchmark dataset. Together, these results indicate that even simple linear ANNs can capture subthreshold membrane potential behavior accurately (Ujfalussy et al., 2018).

Next, we tested how these models perform on the full dynamic range of neural cells, which due to AP firing (which can also be viewed as highly relevant outlier data points) constitutes a non-normally distributed and thus demanding dataset for ANNs. Interestingly, we found that only the CNN-LSTM architecture could precisely reproduce both subthreshold membrane potential dynamics and spiking activity, while all other tested ANNs converged to the mean of the training dataset (Figure 1F and G, 4.4 ± 7.2%, 4.1 ± 6.9%, 0.5 ± 3.9%, 78.9 ± 6.7%, 4.4 ± 2.8% variance explained for linear, nonlinear, convolutional net and CNN-LSTM, deep neural net architectures, respectively, n = 50). We found that although the CNN-LSTM model explained substantially less variance for the active membrane potential traces (Figure 1G) than for subthreshold voltages alone (Figure 1E), the predictions showed high linear correlation with the ground truth signals (Pearson’s r = 0.76793 ± 0.10003, n = 50). For the four remaining ANN architectures, it is unlikely that convergence to the mean is caused by settling in local minima on the fitting error surface as ANNs have a large number of free parameters (2.07 * 10⁴, 2.07 * 10⁴, 2.47 * 10⁶, 3.64 * 10⁵, 1.95 * 10⁶ free parameters for linear, nonlinear, deep, convolutional ANNs, and CNN-LSTM, respectively). Therefore, the chance of having a zero derivative for each parameter at the same point is extremely low (Kawaguchi, 2016), suggesting that erroneous fitting is the consequence of the limitations of these ANN architectures. Consequently, of the tested ANN architectures, the CNN-LSTM is the only model that could depict the full dynamic range of a biophysical neural model.

Closer inspection of the timing of the predicted APs revealed that the CNN-LSTM models correctly learned thresholding as the occurrence of the APs matched the timing of the testing dataset (Figure 1H; 83.94 ± 16.89% precision and 90.94 ± 12.13% recall, 0.24 ± 0.79 ms temporal shift for true-positive spikes compared to ground truth, n = 283), thus CNN-LSTM predictions yielded voltage traces with good initial agreement to NEURON signals. To test the long-term stability of these predictions, we next performed a longer (25 s) ANN simulation (Figure 1I). During this extended simulation, we did not observe significant deviation from the ground truth signal in terms of explained variance (Figure 1J) or absolute difference (Figure 1K) and these metrics even improved slightly. Taken together, we developed an ANN architecture that is ideally suited for predicting both subthreshold membrane potential fluctuations and the precise timing of APs on a millisecond timescale.

Generalization of the CNN-LSTM architecture

To test the applicability of the CNN-LSTM for predicting physiological cellular behavior, we assessed the generalization capability of the architecture built for active behavior prediction (Figure 1F). Generalization is the ability of an ANN to respond accurately to novel data (Hassoun, 1995; Graupe, 2013). According to our hypothesis, if the CNN-LSTM correctly learned the mechanistic operations of a neural cell, then the architecture should behave appropriately when tasked with responding to novel quantal amplitudes and input patterns.

We first challenged the CNN-LSTM by administering excitatory inputs with variable quantal sizes (0.1–3.5 nS, 0.1 nS increment). Similar to the control NEURON model, the CNN-LSTM responded linearly in subthreshold voltage regimes (Figure 2A, Pearson’s r = 0.99, n = 35) and elicited an AP after reaching threshold. Independent evaluation of the NEURON model control revealed a surprisingly similar I/V relationship for the same quantal inputs (intercept, –0.003 ± 8.53 and –0.003 ± 0.001; slope for subthreshold linear I/V, 22.2 ± 0.41 and 23.31 ± 0.62; CNN-LSTM and NEURON model, respectively) and similar AP threshold (–58.03 mV and –56.64 mV for CNN-LSTM and NEURON model, respectively). Next, we tested temporal summation of excitatory inputs (Figure 2B). We found that the independently simulated NEURON model displayed similar temporal summation patterns to the CNN-LSTM for both sub- and suprathreshold events (Figure 2B). Finally, we combined the previous two tests and delivered unique temporal patterns of synaptic inputs with variable synaptic conductances randomly chosen from a normal distribution (mean: 2.5 nS; variance: 0.001 nS, Figure 2C). Again, the predictions of the CNN-LSTM architecture closely matched traces obtained from the NEURON model (Figure 2D, Pearson’s r = 0.81, n = 5000 ms) and the timing of the majority of the APs agreed with the ground truth data (91.02 ± 16.03% recall and 69.38 ± 22.43% precision, n = 50).

Figure 2

Download asset Open asset

Ideal generalization of the convolutional neural network-long short-term memory (CNN-LSTM).

(A) CNN-LSTM models predict similar subthreshold event amplitudes and action potential threshold (break in y-axis) for increasing input weight, compared to NEURON models. (B) CNN-LSTM models correctly represent temporal summation of synaptic events. Representative traces for different inter-event intervals (range: 2–10 ms, 1 ms increment) on the left, comparison of individual events in a stimulus train, relative to the amplitude of unitary events on the right. (C) Single-simulated active membrane potential trace in CNN-LSTM (purple) and NEURON (black) with variable synaptic input weights (left). The inset shows the distribution of synaptic weights used for testing generalization, with the original trained synaptic weight in purple. CNN-LSTM predicted membrane potential values plotted against NEURON model ground truth (right). Plotted values correspond to continuously predicted CNN-LSTM traces. (D) CNN-LSTM model predictions are accurate in various synaptic environments. Firing frequency was quantified upon two different excitation–inhibition ratios (2:1 – representative top trace on the left and bright magenta circles on the right, 1:2 – representative bottom trace on the left and dark magenta circles on the right). (E) Subthreshold effects of potassium conductance biophysical alterations are correctly depicted by the CNN-LSTM. Voltage dependence of the delayed rectifier conductances is illustrated on the left and their effect on subthreshold membrane potential is shown on the right (control conditions are shown in blue, 10 mV left-shifted delayed rectifier conditions in navy blue and 10 mV right-shifted conditions are shown in teal). (F) CNN-LSTM membrane potential predictions for left- (navy) or right-shifted potassium conditions are compared to control conditions., Membrane potential responses below and above –67 mV are quantified for the two altered potassium conductances in NEURON simulation and CNN-LSTM predictions. The effects of biophysical changes of potassium channels were only apparent at membrane potentials above their activation threshold (–67 mV). (G) Artificial neural networks (ANNs) fitting NEURON models with left-shifted (dark blue) and right-shifted (light blue) KDR conductances are plotted against membrane potential responses of ANNs with control KDR conductances. The separation of the two responses shows voltage response modulation of KDR at subthreshold membrane potentials. (H) Membrane potential responses of NEURON and ANN models below and above resting membrane potential (–67 mV).

In the initial training dataset for the CNN-LSTM, the ratio of excitatory and inhibitory events (8:3) was preserved while the total number of synaptic inputs was varied. We noticed that the firing rate of this model did not scale linearly with the number of synapses as initially expected in the presence of inhibitory inputs (Enoki et al., 2001). Thus, we systematically mapped AP firing of model cells in two different excitatory-inhibitory ratios with at a wide range of synaptic input frequencies (Figure 2E). We noted that varying excitation and inhibition could interact with each other in various ways, creating arithmetic operations like subtraction, division, or normalization (Carandini and Heeger, 2011). We approximated the resulting firing rates with two different models (see ‘Methods’). We found that the logistic function representing divisive normalization best fit our results (Bhatia et al., 2019) (Akaike information criterion [AIC] for linear models representing subtractive and divisive inhibition versus AIC for logistic function: 983.3 ± 231.66 and 905.87 ± 200.92, respectively, n = 700 each). Notably, the CNN-LSTM model was able to replicate firing responses to these variable synaptic conditions (R² values when comparing logistic fits for NEURON and CNN-LSTM models in 2:1 excitation–inhibition ratio: 0.996, for 1:2 excitation–inhibition ratio: 0.9, n = 700), further demonstrating the ability of the neuronal net to reproduce key features of neuronal excitability without prior entrainment.

Due to the opaque nature of neural net operations (Castelvecchi, 2016), it is reasonable to assume that instant modification of the trained architecture to account for specific biophysical alterations may not be feasible, highlighting a potentially significant shortcoming of our approach. However, the complexity of encoded features is correlated with the depth of the encoding layer in hierarchically constructed neural networks (Egmont-Petersen et al., 2002), which can be exploited through partial retraining. To test whether the ANN could accurately handle a specific biophysical change, we constructed a simple NEURON model equipped with a delayed rectifier K⁺ conductance with variable voltage dependences (Oláh et al., 2021; Figure 2F). Nonlinear signal summation at different subthreshold voltages was noted after shifting the steady-state activation and inactivation of the K⁺ conductance (Figure 2F). From this model, a single CNN-LSTM model was fitted to the control K⁺ condition. Subsequently, the CNN-LSTM model layers were frozen, with the exception of the (upper) fully connected layers, which were trained for 10 min on NEURON traces with either a 10 mV leftward or rightward shift introduced to the voltage dependence of the potassium conductance. All three models were in good agreement with the NEURON simulation results and provided similar deviations in subthreshold membrane potential regimes compared to control conditions (Figure 2G and H, below resting membrane potential: –0.13 ± 0.36, –0.11 ± 0.03, –0.01 ± 0.23, 0.01 ± 0.06; above resting membrane potential: –0.4 ± 0.43, –0.22 ± 0.55, 0.35 ± 0.27, 0.2 ± 0.1 for CNN-LSTM right-shift, NEURON right-shift, CNN-LSTM left-shift, and NEURON left-shift, respectively, n = 270), indicating that CNN-LSTM can be rapidly adapted to account for biophysical alterations.

NEURON models can calculate and display several features of neuronal behavior in addition to membrane potential, including ionic current flux. To test how our CNN-LSTMs perform in predicting ionic current changes, we supplemented ANN inputs with sodium (I_Na) and potassium currents (I_K) and tasked the models to predict these values as well. The accuracy of the CNN-LSTM prediction for these ionic currents was similar to membrane potential predictions (Figure 3, Pearson’s r = 0.999 and 0.99 for fitting, n = 5000, variance explained: 15.1 ± 11.6% and 82 ± 6.1%; prediction correlation coefficient: 0.85 ± 0.08 and 0.81 ± 0.1, n = 5, for I_K and I_Na, respectively) while the other ANNs again regressed to the mean.

Figure 3

Download asset Open asset

Convolutional neural network-long short-term memory (CNN-LSTM) prediction of neuronal mechanisms beyond somatic membrane potential.

(A) Representative membrane potential (V_m, top) and ionic current (I_K, potassium current; I_na, sodium current, bottom) dynamics prediction upon arriving excitatory (green, middle) and inhibitory (red, middle) events. Enlarged trace shows subthreshold voltage and current predictions. Color coding is same as for Figure 1. (black, NEURON model traces; magenta, CNN-LSMT; blue, linear model; teal, nonlinear model; green, deep neural net; orange, convolutional net). Notice the smooth vertical line corresponding to predictions by artificial neural networks (ANNs), with the exception of CNN-LSTM. On bottom left, magnified view illustrates the subthreshold correspondence of membrane potential and ionic current traces. (B) CNN-LSTM models accurately predict ionic current dynamics. Normalized ANN predictions are plotted against normalized neuron signals for sodium (dark gray, left) and potassium currents (light gray). (C) Variance of suprathreshold traces is largely explained by CNN-LSTM predictions (right, color coding is same as in panel [B], left). Correlation coefficients are superimposed in black.

Finally, we explored whether ANNs could represent nonlinear synaptic responses. Thus, we constructed single-compartmental models with two-component AMPA-NMDA containing synapses and inhibitory synapses. AMPA-NMDA model responses were voltage-dependent and produced nonlinear response curves with respect to AMPA alone (Figure 4A). Importantly, the CNN-LSTM architecture recreated the nonlinear response amplitude and time-course characteristic of AMPA-NMDA synapse activation (Schiller et al., 2000; Major et al., 2008; Branco and Häusser, 2011; Kumar et al., 2018).

Figure 4

Download asset Open asset

Accurate representation of nonlinear synaptic activation by convolutional neural network-long short-term memory (CNN-LSTM).

(A) Representative synaptic responses with variable synaptic activation, CNN-simulated AMPA receptors (light magenta) and AMPA + NMDA receptors (magenta), on the left. AMPA + NMDA response amplitudes nonlinearly depend on the activated synaptic conductance (magenta, CNN-LSTM; black, NEURON), compared to AMPA responses (light magenta, CNN-LSTM; gray, NEURON), on the right. (B) NMDA response nonlinearity enables coincidence detection in a narrow time window, resulting in action potential (AP) generation at short stimulus intervals. (C). Neuronal output modulation is dependent on synaptic NMDA receptor content in a naturalistic network condition. Representative traces on the left (CNN-LSTM, magenta; NEURON, black). Summary depiction of firing frequencies with varying amounts of NMDA receptor activation (percentages denominate the synaptic NMDA-AMPA fraction).

A well-defined functional role of NMDA receptors is coincidence detection, which allows boosting of consecutive subthreshold signals well beyond passive integration (Takahashi and Magee, 2009; Shai et al., 2015). To test whether our ANN could reliably perform coincidence detection, we simulated two excitatory inputs in NMDA-AMPA or AMPA-alone models. Closely spaced stimuli could generate significantly boosted EPSPs in models with NMDA-AMPA (Figure 4B). We found that both NEURON and ANN models exhibited strongly boosted excitatory signals within a well-defined ISI time window (±12 ms) when NMDA-AMPA receptors were activated, which could produce APs. Under physiological conditions, NMDA receptors have been reported to critically influence the AP output of neuronal cells (Smith et al., 2013). Thus, we subjected NEURON models to a barrage of excitatory and inhibitory inputs, such that AP generation was limited in the absence of NMDA (Figure 4C). Adding NMDA resulted in increased spike output (Figure 4C). Across several NMDA conditions, output in the NEURON and ANN models was indistinguishable (Figure 4C, 12.42 ± 1.36 and 12.39 ± 2.3 Hz firing, respectively, in condition where 100% synapses contained NMDA receptors, n = 50). Together, these results demonstrate that the CNN-LSTM correctly learned several highly specialized aspects of neuronal behavior.

Predicting the activity of morphologically realistic neurons using ANNs

Neurons multiply their adaptive properties by segregating different conductances into separate subcellular compartments (Magee and Cook, 2000; Kole et al., 2008; Losonczy et al., 2008; Kim et al., 2012; Rowan et al., 2014; Stuart and Spruston, 2015; Brunner and Szabadics, 2016; Stuart et al., 2016). Thus, in addition to simplified input integrating point neurons, a substantial portion of neuronal models developed in recent decades intended to address subcellular signal processing via detailed multicompartmental biophysical cellular representations (Major et al., 1994; Mainen and Sejnowski, 1996; Vetter et al., 2001; Hallermann et al., 2012; Brunner and Szabadics, 2016; Oláh et al., 2020). Therefore, our next aim was to examine how well ANNs describe multicompartmental information. To this end, a training dataset of synaptic inputs and corresponding somatic voltage responses was generated in NEURON from a morphologically and biophysically detailed in vivo-labeled neocortical L5 PC (Hallermann et al., 2012). The NEURON model included synapses placed at 200 synaptic locations along the dendritic tree. Although this number of synaptic sites is significantly lower compared to what has been established in biological neurons (Megías et al., 2001), this amount of discretization has proven to yield low errors compared to nondiscretized synaptic placements, with fast simulation runtimes and negligible memory consumption (Figure 5—figure supplement 1). It is noted that each synaptic location can be contacted by multiple presynaptic cells; therefore, the number of the synaptic locations does not constrain the connectivity. As the computational resource requirements for modeling such complex cells are much higher than in single-compartmental neurons, all NEURON models, data preprocessing, and ANN fitting and query were carried out on single graphical processing units (GPUs) and tensor processing units (TPUs) (‘Methods,’ Figure 5—figure supplement 2). We found that the trained CNN-LSTM performed in near-perfect accordance with the NEURON simulation (Figure 5A, Pearson’s r = 0.999, n = 45,000 ms). The continuous self-reliant prediction yielded lower yet adequate AP fidelity (Figure 5G, 68.28 ± 18.97% and 66.52 ± 25.37% precision and recall, 0.439 ± 4.181 ms temporal shift for true-positive spikes compared to ground truth, n = 205) compared to the point neuron, and the accuracy of subthreshold membrane potential fluctuations remained high (Pearson’s r = 0.83, n = 37).

Figure 5 with 3 supplements see all

Download asset Open asset

Multicompartmental simulation representation by convolutional neural network-long short-term memory (CNN-LSTM).

(A) CNN-LSTM can accurately predict membrane potential of a multicompartmental neuron upon distributed synaptic stimulation. Representative figure depicts the placement of synaptic inputs (150 excitatory inputs: 100 inputs on apical, oblique, and tuft dendrites and 50 inputs on the basal dendrite, randomly distributed; and 50 inhibitory inputs: 30 inputs on apical, oblique, and tuft dendrites and 20 inputs on the basal dendrite, randomly distributed) of a reconstructed level 5 (L5) pyramidal cell (PC) (left). Point-by-point forecasting of L5 PC membrane potential by a CNN-LSTM superimposed on biophysically detailed NEURON simulation (left). CNN-LSTM prediction accuracy of multicompartmental membrane dynamics is comparable to single-compartmental simulations (right, L5 PC in black, single-compartmental simulation of Figure 1D in gray, n = 45,000 and 50,000, respectively). (B) Convolutional filter information was gathered from the first convolutional layer (middle, color scale depicts the different weights of the filter), which directly processes the input (membrane potential in magenta, excitatory and inhibitory synapse onsets in green and red, respectively), providing convolved inputs to upper layers (gray bars, showing the transformed 1D outputs). (C) Distribution of filter weights from 512 convolutional units (n = 102,400) with double Gaussian fit (red). (D) Filter weight is independent of the somatic amplitude of the input (circles are averages from 512 filters, n = 200, linear fit in red). (E) Each synapse has a dedicated convolutional unit, shown by plotting the filter weights of the 200 most specific units against 200 synapses. Notice the dark diagonal illustrating high filter weights. (F) Excitatory and inhibitory synapse information is convolved by filters with opposing weights (n = 51,200, 25,600, 15,360, and 10,240 for apical excitatory, basal excitatory, apical inhibitory, and basal inhibitory synapses, respectively). (G) Representative continuous prediction of L5 PC membrane dynamics by CNN-LSTM (magenta) compared to NEURON simulation (black) upon synaptic stimulation (left, excitatory input in green, inhibitory input in red). Spike timing is measured on subthreshold traces (right, n = 50 for variance explained, precision and recall). (H) Artificial neural networks (ANNs) constrained on cortical layer 2/3 (top), layer 4 (middle), and layer 6 (bottom) PCs selected from the Allen Institute model database.

We previously demonstrated that CNN-LSTMs could accurately predict various neuronal mechanisms beyond somatic voltage fluctuations in single-compartmental cells (Figure 3). To investigate whether this architecture is sufficient to describe complex features of neuronal behavior in morphologically and biophysically realistic neurons as well, we tasked the ANN with simultaneously predicting membrane potentials from the soma and two dendritic locations (one apical and one basal) together with calcium current dynamics in the same locations (Figure 5—figure supplement 3). We found that CNN-LSTMs can accurately describe the selected aspects of neuronal activity, further demonstrating the versatility of this ANN architecture.

Establishing a proper multicompartmental representation of a neural system by relying solely on the somatic membrane potential is a nontrivial task due to complex signal processing mechanisms taking place in distal subcellular compartments (Schiller et al., 1997; Häusser and Mel, 2003; Jarsky et al., 2005; Harnett et al., 2015; Takahashi et al., 2016). This is especially true for signals arising from more distal synapses (Sjöström and Häusser, 2006; Larkum et al., 2009; Takahashi and Magee, 2009). To examine whether the CNN-LSTM considered distal inputs or neglected these in favor of more robust proximal inputs, we inspected the weights of the first layer of the neural network architecture (Figure 5B). This convolutional layer consists of 512 filters, which directly processes the input matrix (64 ms of 201 input vectors corresponding to the somatic membrane potential and vectorized timing information of 200 synapses). Despite the random initialization of these filters from a uniform distribution (He et al., 2015), only a small fraction of optimized filter weights were selected for robust information representation (13.83% of all weights were larger than 0.85), while the majority of them were closer to zero (Figure 5C), suggesting relevant feature selection. In order to demonstrate that this feature selection was not biased against distal inputs, the 512 convolutional filters were ranked by their selectivity for distinct synapses. We found that each synaptic input was assigned an independent selectivity filter (Figure 5D). Next, we compared the mean weights of each synapse with the somatic amplitude of the elicited voltage response as a proxy for input distance from the soma (Figure 5E). This comparison revealed a flat linear correspondence (Pearson’s r = 0.06), which combined with the filter specificity (Figure 5D) confirmed that distal and proximal synaptic inputs carry equally relevant information for the CNN-LSTM.

When comparing the weights of excitatory and inhibitory inputs, we found that even at the first layer the CNN-LSTM could determine that these inputs have opposing effects on subsequent membrane potential (5.91 * 10^–6, 2.66 * 10^–5, –6.22 * 10^–6, and –1.34 * 10^–5 mean weights for apical excitatory, basal excitatory, apical inhibitory, and basal inhibitory synapses, respectively, n = 51,200, 25,600, 15,360, and 10,240) even though these vectors only contain synaptic conductance information (comparable positive values for both excitatory and inhibitory synapses, Figure 5F). Taken together, the feature selectivity and prediction accuracy confirm that the CNN-LSTM architecture is well suited for representing multicompartmental information.

The recent surge in readily available cellular model datasets has significantly reduced the entry barrier for neuronal simulations as researchers no longer need to gather ground truth data individually. Therefore, we aimed to establish a pipeline to constrain ANNs on neuronal models from a publicly available, well-curated database (Gouwens et al., 2018) without developer involvement. Using this pipeline, we constrained ANNs on the remaining major cortical PC types; layer 2/3, layer 4, and layer 6 PCs (Figure 5H). We found that the resulting ANNs were fit adequately to the NEURON simulations (Figure 5I, 94.2 ± 14.2%, 74.5 ± 23.5%, and 67 ± 14.5% variance explained, 86.6 ± 23.1%, 70.1 ± 25.8%, and 63.2 ± 33.2% precision, 90.7 ± 18%, 74.5 ± 25.8%, and 63.5 ± 32.7% recall for layer 2/3, layer 4, and layer 6 PCs, respectively, n = 50), and the fitting procedure was devoid of ambiguities. Together, we developed an ANN architecture appropriate for multicompartmental neuronal simulations of diverse cell types and a user-friendly methodology for their construction.

Current injection-induced firing responses

The neuronal firing pattern upon direct current injection is one of the most prevalent means of establishing neuronal class and describing the cell’s potential in vivo behavior (Ascoli et al., 2008). Therefore, these recordings often serve as ground truth data during single-neuronal model constraining (Izhikevich, 2003; Naud et al., 2008; Druckmann et al., 2011; Teeter et al., 2018; Gouwens et al., 2020). Firing patterns are modulated by several ionic mechanisms in concert, several of which operate on much longer timescales than what the dimensions of our ANN input matrices allow us to observe. However, even complex firing patterns can be approximated by much simpler, biologically plausible, and computationally efficient single-cell models (Destexhe, 1997; Izhikevich, 2003; Brette and Gerstner, 2005; Sacerdote and Giraudo, 2013). Therefore, we created a custom ANN layer that can be inserted on top of CNN-LSTMs (for either single- and multicompartmental models) with its internal logic hard-coded based on the governing equations of the eloquent simple spiking model (Figure 6A) described by Izhikevich, 2003. In addition to the original variables of this model, we set the ‘time step’ parameter as a variable to account for differences in membrane time constant across cell types.

Figure 6 with 1 supplement see all

Download asset Open asset

Firing pattern representation with custom artificial neural network (ANN) layer.

(A) Representative figure depicting the custom ANN layer (termed custom Izhikevich layer) placed on the output of the fully connected layers of the convolutional neural network-long short-term memory (CNN-LSTM). This layer represents the final signal integration step, analogous to the soma of biological neurons. (B) Four firing patterns with different activity dynamics, produced by the custom ANN layer. (C) Firing pattern of a NEURON model (black, top) and the constrained ANN counterpart (magenta, bottom). The ANN model accurately reproduced the input–output relationship of the NEURON model. (D) Continuous subthreshold membrane potential fluctuations of the NEURON model (black trace) and faithfully captured by the custom ANN layer (magenta trace). (E) Relationship of membrane potential values predicted step-by-step by the ANN layer compared to the ground truth NEURON model. (F) The custom ANN layer continuous predictions explain the majority of the variance occurring in voltage signals produces by the NEURON simulation.

The custom ANN layer (Figure 6A) could reproduce a wide range of naturally occurring firing patterns (Figure 6B). In contrast to the millions of free parameters in CNN-LSTMs, this custom layer has only five trainable parameters and thus can be constrained using conventional optimization algorithms (Singer and Nelder, 2009). We created a single-compartmental NEURON model, equipped with Hodgkin–Huxley conductances based on a fast-spiking phenotype (Figure 6C) to generate a ground truth dataset of firing activity and subthreshold membrane potential fluctuations. We found that the custom ANN layer could reliably capture the input–output characteristics of the NEURON model (Pearson’s r: 0.982). We next fitted the ANN layer on randomly distributed synaptic inputs (Figure 6D). The custom ANN layer produced voltage responses in good agreement with the NEURON model (Figure 6E and F, Pearson’s r: 0.999, 96.9 ± 0.4% variance explained, n = 17). Together, this custom ANN layer approach imbues CNN-LSTMs with the ability to reproduce firing responses faithfully and also provides added flexibility allowing for the instantaneous alteration of firing behavior while preserving synaptic representations.

Generating diverse custom top layers operating on the output of CNN-LSTMs (Figure 6—figure supplement 1A) also creates opportunities to predict convoluted signals used to report neuronal activity in vivo, such as fluorescently reported calcium and voltage signals. To illustrate this possibility, we created custom ANN layers fitted to the dynamics of the GCamp6f fluorescent calcium indicator (Chen et al., 2013) and a recently developed fluorescent voltage indicator (Villette et al., 2019). Although these indicators severely distorted the underlying neuronal signals (i.e., membrane potential), we found that a custom recurrent encoder can accurately predict these characteristic waveforms (Figure 6—figure supplement 1), and importantly, stand-alone use of these layers can deconvolve even severely distorted ground truth signals.

Ultra-rapid simulation of multiple cells using CNN-LSTM

One of the main benefits of this machine learning approach as a substitute for traditional modeling environments is the potential for markedly reduced simulation runtimes. Simulation environments such as NEURON rely on compartment-specific mathematical abstractions of active and passive biophysical mechanisms (Hines and Carnevale, 1997), which results in high computational load in increasingly complex circuit models. In the case of small-sized (Nikolic, 2006; Migliore and Shepherd, 2008; Cutsuridis and Wennekers, 2009; Chadderdon et al., 2014; Hay and Segev, 2015) and mid-sized networks (Markram et al., 2015; Bezaire et al., 2016; Shimoura et al., 2018; Billeh et al., 2020) this hinders the possibility of running these models on nonspecialized computational resources. Although several attempts have been made to reduce the demanding computational load of neuronal simulations (Bush and Sejnowski, 1993; Destexhe and Sejnowski, 2001; Hendrickson et al., 2011; Marasco et al., 2012; Rössert, 2016; Amsalem et al., 2020; Wybo et al., 2021), the most commonly used approach is parallelization, both at the level of single cells (Hines et al., 2008) and network models (Hines and Carnevale, 2008; Lytton et al., 2016). However, ANNs offer a unique solution to this problem. Contrary to traditional modeling environments, graph-based ANNs are designed explicitly for parallel information processing. This means that ANN simulation runtimes on hardware that enables parallel computing, such as modern GPUs, do not increase linearly after additional cells are integrated into the simulated circuit (Figure 7A), resulting in better scaling for large networks where an immense number of similar cells are simulated.

Figure 7

Download asset Open asset

Orders of magnitude faster simulation times with convolutional neural network-long short-term memory (CNN-LSTM).

(A) An illustration demonstrating that CNN-LSTMs (top, magenta) handle both single-cell (left) and multiple-cell (right) simulations with a single graph, while the set of equations to solve increases linearly for NEURON simulations (bottom, black). (B) 100 ms simulation runtimes of 1-, 50-, and 5000-point neurons on four different resources. Bar graphs represent the average of five simulations. (C) Same as in panel (B), but for level 5 (L5) pyramidal cell(PC) simulations. Teal borders represent extrapolated datapoints.

To verify the efficiency of our CNN-LSTM, we compared single cells and small- to mid-sized simulation runtimes against NEURON models used in Figures 1 and 5. NEURON simulations were performed on a single CPU as this is the preferred and most widely used method (but see ; Ben-Shalom et al., 2022), while neural nets were run on both CPU and GPU because these calculations are optimized for GPUs. Although GPUs are inherently faster in numerical calculations, NEURON simulations are currently not suitable for this resource; therefore, simulation runtimes were compared using CPUs as well. NEURON simulations were repeated with custom initialization, during which simulations were pre-run to allow time-dependent processes, such as conductance inactivation, to reach steady-state values. Simulation of multiple cells was carried out without the implementation of synaptic connections to establish baseline runtimes, without additional runtime impeding factors. For point neurons, single-cell simulations ran significantly faster in NEURON than their CNN-LSTM counterparts when the optional initialization step was omitted (Figure 7B, 3.68 ± 0.24 s, 0.65 ± 0.03 s, 2.19 ± 0.69 ms, and 0.72 ± 0.04 s, 100 ms cellular activity by NEURON with initialization, NEURON without initialization, CNN-LSTM on CPU, and CNN-LSTM on GPU, respectively, n = 5). However, when increasing the number of cells, the predicted optimal scaling of CNN-LSTM models resulted in faster runtimes compared to NEURON models (e.g., for 50 cells, 24.23 ± 1.12 s, 7.45 ± 0.37 s, 4.42 ± 0.77 s, and 0.71 ± 0.05 s for a 100 ms simulation by NEURON with initialization, NEURON without initialization, CNN-LSTM on CPU, and CNN-LSTM on GPU, respectively, n = 5). These results show that while in NEURON the runtimes increased by approximately 6.6 times, CNN-LSTM runtimes on a GPU did not increase.

To demonstrate the practicality of ANNs for typical large-scale network simulations, we repeated these experiments with 5000 cells (representing the number of cells in a large-scale network belonging to the same cell type; Billeh et al., 2020). In these conditions, the NEURON simulation was ~148 times slower than a single-cell simulation. Notably, this large-scale CNN-LSTM simulation was only four times slower than that of a single cell (Figure 7B, 546.85 ± 4.61 ms, 407.2 ± 9 ms, 222.15458 ± 19.02 ms, and 2.97 ± 0.02ms for simulating 100 ms activity by NEURON with initialization, NEURON without initialization, CNN-LSTM on CPU, and CNN-LSTM on GPU, respectively, n = 5).

We next compared runtime disparities for NEURON and CNN-LSTM simulations of detailed biophysical models (Figure 7C). We found that the single-cell simulation of the L5 PC model ran significantly slower than the CNN-LSTM abstraction (2.08 * 10³ ± 84.66 s, 185.5 ± 3.7 s, 4.73 ± 0.13 s, and 1.02 ± 0.05 s for simulating 100 ms activity by NEURON with initialization, NEURON without initialization, CNN-LSTM on CPU, and CNN-LSTM on GPU, respectively, n = 5). This runtime disparity was markedly amplified in simulations with multiple cells (50 cells: 6.3 * 10⁴ s, 5.8 * 10³ s, 14.3 ± 0.24 s, and 1.19 ± 0.08 s, 5000 cells: 6.53 * 10⁶ s, 6.28 * 10⁵ s, 901.15 s, and 11.99 s for simulating 100 ms activity by NEURON with initialization, NEURON without initialization, CNN-LSTM on CPU, and CNN-LSTM on GPU respectively, n = 5), resulting in a four to five orders of magnitude faster runtime (depending on initialization) for the CNN-LSTM in case of mid-sized simulations. These results demonstrate that our machine learning approach yields far superior runtimes compared to traditional simulating environments. Furthermore, this acceleration is comparable to that afforded by increased parallel CPU cores used for several network simulations (Markram et al., 2015; Bezaire et al., 2016; Billeh et al., 2020), introducing the possibility of running large or full-scale network simulations on what are now widely available computational resources.

Efficient parameter space mapping using ANNs

Due to slow simulation runtimes, network simulations are typically carried out only a few times (but see Barros-Zulaica et al., 2019), hindering crucial network construction steps, such as parameter space optimization. Therefore, we sought to investigate whether our ANN approach was suitable for exploring parameter space in a pathophysiological system characterized by multidimensional circuit alterations, such as Rett syndrome. Rett syndrome is a neurodevelopmental disorder caused by loss-of-function mutations in the X-linked methyl-CpG binding protein (MeCP2) (Chahrour and Zoghbi, 2007). Rett syndrome occurs in ~1:10,000 births worldwide, resulting in intellectual disability, dysmorphisms, declining cortical and motor function, stereotypies, and frequent myoclonic seizures, mostly in girls (Belichenko et al., 1994; Armstrong, 1997; Steffenburg et al., 2001; Armstrong, 2002; Kishi and Macklis, 2004; Fukuda et al., 2005; Belichenko et al., 2009). Although the underlying cellular and network mechanisms are largely unknown, changes in synaptic transmission (Dani et al., 2005; Medrihan et al., 2008; Zhang et al., 2010), morphological alterations in neurons (Akbarian et al., 2001; Kishi and Macklis, 2004), and altered network connectivity (Dani and Nelson, 2009) have been reported in Rett syndrome.

We aimed to investigate the contribution of the distinct alterations on cortical circuit activity in Rett syndrome using a recurrent L5 PC network (Hay and Segev, 2015) composed entirely of CNN-LSTM-L5-PCs (Figure 8A). Simulations were run uninterrupted for 100 ms when a brief (1 ms) perisomatic excitation was delivered to mimic thalamocortical input onto thick tufted PCs (de Kock et al., 2007; Meyer et al., 2010; Constantinople and Bruno, 2013). In control conditions, cells fired well-timed APs rapidly after the initial stimuli followed by an extended AP firing as a consequence of the circuit recurrent connectivity (Figure 8B; Lien and Scanziani, 2013; Sun et al., 2013). First, we compared the runtime of the simulated L5 microcircuit of CNN-LSTM models and the run time of 150 unconnected L5 PCs in NEURON. We found that for a single simulation, CNN-LSTM models were more than 9300 times faster compared to NEURON models (Figure 8C, 21.153 ± 0.26 s vs. 54.69 hr for CNN-LSTM and NEURON models, respectively).

Figure 8

Download asset Open asset

Efficient parameter-space mapping with convolutional neural network-long short-term memory (CNN-LSTM) reveals a joint effect of recurrent connectivity and E/I balance on network stability and efficacy in Rett syndrome.

(A) 150 CNN-LSTM models of level 5 (L5) pyramidal cells (PCs) were simulated in a recurrent microcircuit. (B) The experimental setup consisted of a stable baseline condition for 100 ms, a thalamocortical input at t = 100 ms, and network response, monitored for 150 ms. Example trace from the first simulated CNN-LSTM L5 PC on top, raster plot of 150 L5 PCs in the middle, number of firing cells with 5 ms binning for the same raster plot in the bottom. Time is aligned to the stimulus onset (t = 0, black arrowhead). (C) Simulation runtime for single simulation (left, network of 150 cells simulated for 250 ms) and parameter space mapping (right, 150 cells simulated for 250 ms, 2500 times, for generating B). Teal border represents data extrapolation.

Rett cortical network alterations counteract circuit hyperexcitability

Cortical networks endowed with frequent recurrent connections between excitatory principal cells are prone to exhibit oscillatory behavior, which is often the mechanistic basis of pathophysiological network activities (McCormick and Contreras, 2001; Figure 9A). We quantified oscillatory activity (D’Cruz et al., 2010; McLeod et al., 2013; Roche et al., 2019) and the immediate response to thalamocortical stimuli independently (Figure 8C). By systematically changing excitatory quantal size (Dani et al., 2005) and the ratio of recurrent L5 PC innervation to mimic reduced recurrent connectivity and synaptic drive in Rett syndrome, we found that both alterations had considerable influence over network instability (Figure 9B, left panel; excitatory drive: 17.85 ± 61.61 vs. 388.92 ± 170.03 pre-stimulus APs for excitatory drive scaled by 0.75 and 1.25, respectively, n = 100 each, p<0.001; recurrent connectivity: 321.96 ± 200.42 vs. 157.66 ± 192.5 pre-stimulus APs for 10 and 5.2% recurrent connectivity, similar to reported values for adult wild-type and Mecp2-null mutant mice [Dani and Nelson, 2009], n = 50 each, p<0.001) and response to stimuli (excitatory drive: 147.58 ± 17.2 vs. 119.23 ± 18.1 APs upon stimulus for excitatory drive scaled by 0.75 and 1.25, respectively, n = 100 each, p=2.3 * 10^–22, t(198) = 11.03, two-sample t-test; recurrent connectivity: 134.76 ± 21.37 vs. 112.74 ± 34.99 APs upon stimulus for 10 and 5.2% recurrent connectivity, n = 50 each, p=2.54 * 10^–4, t(98) = 3.8, two-sample t-test). Contrary to disruption of the excitatory drive, when inhibitory quantal size (Chao et al., 2010) was altered, we found that inhibition had a negligible effect on network instability, as connectivity below 9% never resulted in oscillatory activity (Figure 9—figure supplement 1; inhibition corresponds to random inhibitory drive, as the network did not contain ANNs representing feed-forward inhibitory cells). Interestingly, we found no measurable relationship between the inhibitory quantal size and the network response to thalamocortical stimuli either. These results suggest that lowered recurrent connectivity reduces network instability. Specifically, recurrent connectivity observed in young Mecp2-null mice (7.8%; Dani and Nelson, 2009) yielded more stable microcircuits (54% of networks were stable, n = 100) than wild-type conditions (34% of networks were stable, n = 50). Recurrent connection probability of older animals (5.3%) further stabilized this network (64% of networks were stable). Taken together, our model suggests that reduced recurrent connectivity between L5 PCs is not causal to seizure generation and abnormal network activity (Steffenburg et al., 2001; Roche et al., 2019), which are crucial symptoms of Rett syndrome at a young age, but instead normal PC activation is disrupted. This may correspond to the early stages of Rett syndrome where cortical dysfunction emerges before the appearance of seizures (Chahrour and Zoghbi, 2007).

Figure 9 with 1 supplement see all

Download asset Open asset

Recurrent connectivity and excitatory drive jointly define network stability in a reduced level 5 (L5) cortical network.

(A) Two independent parameters were quantified: network instability (number of cells firing before the stimulus) and immediate response (number of cells firing within 10 ms of the stimulus onset). The example simulation depicts highly unstable network conditions. (B) Network instability (left) and immediate response (right) as a function of altered L5 pyramidal cell (PC) connectivity and excitatory drive. *a indicates network parameters used for generating panel (A). The white arrow in the right panel denotes circuit alterations observed in Rett syndrome. Namely, 5% recurrent connectivity between L5 PCs instead of 10% in control conditions and reduced excitatory drive.

Using the ANN approach, we successfully implemented multidimensional parameter space mapping in a cortical circuit exhibiting pathophysiological changes and could identify the isolated outcome of distinct circuit alterations. Furthermore, our accelerated multicompartmental neural circuit model demonstrated that parameter space mapping is not only attainable by CNN-LSTM models on commercially available computational resources, but it is almost fourfold faster than completing a single NEURON simulation.

Discussion

In this study, we present an ANN architecture (CNN-LSTM) capable of accurately capturing neuronal membrane dynamics. Most of the investigated ANN architectures predicted subthreshold voltage fluctuations of point neurons; however, only the CNN-LSTM was able to generate APs. This model could generalize well to novel input and also predict various other features of neuronal cells, such as voltage-dependent ionic current dynamics. Furthermore, the CNN-LSTM accounted for the majority of the variance of subthreshold voltage fluctuations of biophysically realistic L5 PC models with excitatory and inhibitory synapses distributed along the entirety of the dendritic tree. The timing of the predicted APs closely matched the ground truth data. Importantly, we found that the CNN-LSTM has superior scaling for large network simulations. Specifically, in the case of mid-sized biophysically detailed networks (50 cells), ANNs were more than three orders of magnitude faster, while for large-scale networks (5000 cells) ANNs are predicted to be five orders of magnitude faster than traditional modeling systems. These accelerated simulation runtimes allowed us to quickly investigate an L5 PC network in distinct conditions, for example, to uncover network effects of altered connectivity and synaptic signaling observed in Rett syndrome. In our Rett cortical circuit model, recurrent connectivity and excitatory drive jointly shape network stability and responses to sensory stimuli, showing the power of this approach in generating testable hypotheses for further empirical work. Together, the described model architecture provides a suitable alternative to traditional modeling environments with superior simulation speed for biophysically detailed cellular network simulations.

Advantages and limitations of the CNN-LSTM architecture

As our familiarity with neuronal circuits grows, so does the complexity of models tasked with describing their activity. Consequently, supercomputers are a regular occurrence in research articles that describe large-scale network dynamics built upon morphologically and biophysically detailed neuronal models (Markram et al., 2015; Bezaire et al., 2016; Billeh et al., 2020). Here, we developed an alternative to these traditional models, which can accurately represent the full dynamic range of neuronal membrane voltages in multicompartmental cells, but with substantially accelerated simulation runtimes.

ANNs are ideal substitutes for traditional model systems for several reasons. First, ANNs do not require hard-coding of the governing rules for neuronal signal processing. Upon creation, ANNs serve as a blank canvas that can derive the main principles of input–output processing and neglect otherwise unimpactful processes (Benitez et al., 1997; Dayhoff and DeLeo, 2001; Castelvecchi, 2016). The degree of simplification depends only on the ANN itself, not the developer, thereby reducing human errors. However, architecture construction and training dataset availability represent limiting steps in ANN development (Alwosheel et al., 2018). Fortunately, the latter issue is void as virtually infinite neuronal activity training datasets are now available for deep learning. Conversely, as we have demonstrated, the former concern can significantly impede ANN construction. Although we have shown that markedly divergent ANN architectures can accurately depict subthreshold signal processing, we found only one suitable for both subthreshold active membrane potential prediction. The presented architecture is unlikely to be the only suitable ANN model for neural simulations as machine learning is a rapidly progressing field that frequently generates highly divergent ANN constructs (da Silva et al., 2017). The importance of the network architecture is further emphasized by our findings demonstrating that ANNs with comparable or even greater numbers of freely adjustable parameters could not handle suprathreshold information.

The prevailing CNN-LSTM architecture was proven suitable for depicting membrane potential and ionic current dynamics of both simplified and biophysically detailed neuronal models and generalized well for previously unobserved simulation conditions. These results indicate that ANNs are ideal substitutes for traditional model systems for representing various features of neuronal information processing with significantly accelerated simulations. Future architecture alterations should focus on the continued improvement of AP timing and prediction, as well as the integration of additional dendritic and axonal properties.

A recent publication presented an excellent implementation of an ANN architecture for predicting neuronal membrane potentials (Beniaguev et al., 2021) of complex cortical neurons. The featured architecture was composed of nested convolutional layers, and membrane potential dynamics was represented with a combination of two output vectors (subthreshold membrane potential and a binarized vector for AP timing). Building on this idea, we aimed to design an architecture that could (1) produce sequential output with smaller temporal increments, (2) generalize to previously unobserved temporal patterns and discrepant synaptic weights as well, and lastly, (3) produce APs with plausible waveforms in addition to subthreshold signals. Fulfillment of these three criteria is imperative for modeling these cells in a network environment. Our ANN architecture fulfills these requirements, thus representing the first ANN implementation that can serve as a viable alternate for biophysically and morphologically realistic neurons in a network model environment.

The CNN-LSTM architecture has several advantages over traditional modeling environments beyond the runtime acceleration. For example, connectivity has no influence over simulation speed as connection implementation is a basic matrix transformation carried out on the entire population simultaneously. However, this approach is not without limitations. First, although ANN training can be carried out on affordable and widely available resources, training times can last up to 24 hr to achieve accurate fits (‘Methods’). Furthermore, judicious restrictions are needed in the amount of synaptic contact sites, to preserve realistic responses and at the same time mitigate computational requirements, as the number of contact sites directly correlates with simulation runtimes and memory consumption. Additionally, the 1 ms temporal discretization hinders the implementation of certain biological phenomena that operate on much faster timescales, such as gap junctions. Depending on the degree of justifiable simplification, several other modeling environments exist, which are faster and computationally less demanding than our ANN approach. These environments mostly rely on simplified point neurons, such as the Izhikevich formulation (Figure 6), often developed specifically to leverage accelerated GPU computations (Ros et al., 2006; Fidjeland et al., 2009; Nageswaran et al., 2009; Mutch, 2010; Thibeault, 2011; Nowotny et al., 2014; Vitay et al., 2015; Yavuz et al., 2016; Knight et al., 2021). Therefore, depending on the required biophysical resolution and the available computational resources, the ANN approach presented here has an advantage over other environments in certain situations, while traditional modeling environments such as NEURON and GPU accelerated network simulators have a distinct edge in other use cases.

Simulation runtime acceleration

Accelerated simulation runtimes are particularly advantageous for large-scale biological network simulations, which have seen an unprecedented surge in recent years. These network simulations not only provide support for experimentally gathered information but also as testing benchmarks in the future for several network-related queries such as pharmaceutical target testing and for systemic interrogation of cellular-level abnormalities in pathophysiological conditions (Gambazzi et al., 2010; Kerr et al., 2013; Neymotin et al., 2016a, Sanjay, 2017; Domanski et al., 2019; Zhang and Santaniello, 2019; Liou et al., 2020). However, widespread adaptation of large-scale network simulations is hindered by the computational demand of these models that can only be satisfied by the employment of supercomputer clusters (Markram et al., 2015; Bezaire et al., 2016; Billeh et al., 2020). Because these resources are expensive, they do not constitute a justifiable option for general practice. Importantly, we have shown that ANNs can provide a suitable alternative to traditional modeling systems, and that their simulation runtimes are also superior due to the structure of the machine learning platform (i.e., Tensorflow).

Traditional model systems linearly increase the number of equations to be solved for parallelly simulated cells, while ANNs can handle cells belonging to the same cell type on the same ANN graph (Dillon, 2017). In our network models (150 cells; Figure 8), NEURON simulations yield 150 times more linear equations for every time step, while ANNs used the same graph for all simulated cells. This property of ANNs in particular suits biological networks consisting of many cells. For example, the Allen Institute reported a computational model of the mouse V1 cortical area (Billeh et al., 2020), consisting of 114 models corresponding to 17 different cell types (with the number of cells corresponding to these cell types ranging from hundreds to more than 10,000), which means that simulation of a complete cortical area is feasible using only 114 ANNs. We have demonstrated that even for small networks consisting of only 150 cells of the same type ANNs are more than four orders of magnitude faster compared to model environments used in the aforementioned V1 simulations. As large-scale network simulations are typically run using several thousand CPU cores in parallel, the predicted runtime acceleration suggests that network simulations relying on ANNs could negate the need for supercomputers. Instead, ANN-equivalent models could be run on commercially available computational resources such as personal computers with reasonable time frames.

Another advantage of our approach is the utilization of GPU processing, which provides a substantially larger number of processing cores (Asano et al., 2009; Memon et al., 2017). The runtime differences are observable by comparing CNN-LSTM simulations on CPU and GPU (Figure 7B and C), which yields more than an order of magnitude faster simulations on GPU in the case of small-size networks (50 cells) and approximately two orders of magnitude difference for mid-sized networks. Our results demonstrate that cortical PC network simulations are at least four orders of magnitude faster than traditional modeling environments, confirming that disparities in the number of cores can only partially account for the observed ANN runtime acceleration. Furthermore, the NEURON simulation environment does not benefit as much from GPU processing as for ANN simulations (Vooturi et al., 2017; Kumbhar et al., 2019). These results confirm that the drastic runtime acceleration is the direct consequence of the parallelized graph-based ANN approach.

Efficient mapping of network parameter involvement in complex pathophysiological conditions

To demonstrate the superiority of ANNs in a biologically relevant network simulation, we mapped the effects of variable network parameters observed in Rett syndrome. Rett syndrome is a neurodevelopmental disorder leading to a loss of cognitive and motor functions, impaired social interactions, and seizures in young females due to loss-of-function mutations in the X-linked MeCP2 gene (Chahrour and Zoghbi, 2007). Like many brain diseases, these behavioral alterations are likely due to changes in several different synaptic and circuit parameters. MeCP2-deficient mice exhibit multiple changes in synaptic communication, affecting both excitatory and inhibitory neurotransmission and circuit-level connectivity. Excitatory transmission is bidirectionally modulated by MeCP2 knockout (Nelson et al., 2006; Chao et al., 2007) and overexpression (Na et al., 2012), and long-term synaptic plasticity is also impaired in MeCP2-deficient mice (Asaka et al., 2006; Guy et al., 2007). Inhibitory signaling is also altered in several different brain areas (Dani et al., 2005; Medrihan et al., 2008). Importantly, synaptic transmission is affected not only at the level of quantal parameters but also regarding synaptic connections as MeCP2 directly regulates the number of glutamatergic synapses (Chao et al., 2007). This regulation amounts to a 39% reduction of putative excitatory synapses in the hippocampus (Chao et al., 2007) and a 50% reduction in recurrent excitatory connections between L5 PCs (Dani and Nelson, 2009). Here, we investigated how these diverse underlying mechanisms contribute to overall circuit pathology using our ANN network model approach.

We found that the ability of the network to respond to external stimuli is affected by both alterations in synaptic excitation and changes in the recurrent connectivity of L5 PCs. Our results suggest that disruption of inhibitory transmission is not necessary to elicit network instability in Rett as changes in synaptic excitation and recurrent connectivity alone were sufficient in destabilizing the network. These results are supported by previous findings showing that both constitutive (Calfa et al., 2011) and excitatory-cell-targeted (Zhang et al., 2014) MeCP2 mutations lead to network seizure generation as opposed to inhibitory-cell-targeted MeCP2 mutation, which causes frequent hyperexcitability discharges but never seizures (Chao et al., 2010). Furthermore, our results suggest that excitatory synaptic alterations in Rett affect both general network responses and network stability, which may serve as substrates to cognitive dysfunction and seizures, respectively. Taken together, our results reveal how cellular-synaptic mechanisms may relate to symptoms at the behavioral level. Importantly, investigation of the multidimensional parameter space was made possible by the significantly reduced simulation times of our ANN as identical simulations with traditional modeling systems are proposed to be four orders of magnitude slower.

Methods

Single-compartmental NEURON simulation

Passive and active membrane responses to synaptic inputs were simulated in NEURON (Hines and Carnevale, 1997, version 7.7, available at http://www.neuron.yale.edu/neuron/). Morphology (single compartment with length and diameter of 25 µm) and passive cellular parameters (R_m: 1 kΩ/cm²; C_m: 1 µF/cm²; R_i: 35.4 Ω/cm) were the same for both cases and resting membrane potential was set to –70 mV. Additionally, the built-in mixed sodium, potassium and leak channel (Jaslove, 1992, based on the original Hodgkin–Huxley descriptions) was included in the active model (g_Na: 0.12 pS/µm²; g_K: 0.036 pS/µm²; g_leak: 0.3 nS/µm²). Reversal potentials were set to 50 mV for sodium, –77 mV for potassium, and –54.3 mV for leak conductance. Simulations were run with a custom steady-state initialization procedure (Carnevale and Hines, 2006) for 2 s, after which the temporal integration step size was set to 25 µs.

In order to simulate membrane responses to excitatory and inhibitory inputs, the built-in AlphaSynapse class of NEURON was used (excitatory synapse: τ: 2 ms; g_pas: 2.5 nS; E_rev: 0 mV; inhibitory synapse: τ: 1 ms; g_pas: 8 nS; E_rev: –90 mV). The number of synapses was determined by a pseudo-random uniform number generator (ratio of excitatory to inhibitory synapses: 8:3). Timing of individual synapses was also randomly picked from a uniform distribution. During the 10-s-long simulations, the membrane potential, I_Na, and I_K currents were recorded along with the input timings and weights and were subsequently saved to text files. Simulations were carried out in three different conditions. First, resting membrane potential was recorded without synaptic activity. Second, passive membrane potential was recorded. Third, active membrane potential responses were recorded with fixed synaptic weights.

The amount of training each ANN received varied widely, based on the complexity of the modeled system. We used model checkpoints to stop the training if the prediction error on the validation dataset did not improve within 20 training epochs. This checkpoint was reached between 12 and 24 hr, training on a single GPU.

Multicompartmental NEURON simulation

Active multicompartmental simulations were carried out using an in vivo-labeled and fully reconstructed thick tufted cortical L5 PC (Hallermann et al., 2012). The biophysical properties were unchanged, and a class representation was created for network simulations. Excitatory and inhibitory synapses were handled similarly to single-compartmental simulations. A total of 100 excitatory (τ: 1 ms; g_pas: 3.6 nS; E_rev: 0 mV) and 30 inhibitory synapses (τ: 1 ms; g_pas: 3 nS; E_rev: –90 mV) were placed on the apical, oblique, or tuft dendrites, and 50 excitatory and 20 inhibitory synapses were placed on basal dendrites. The placement of the synapses was governed by two uniform pseudo-random number generators, which selected dendritic segments weighed by their respective lengths and the location along the segment (ratio 2:1:1:1 for apical excitatory, apical inhibitory, basal excitatory, and basal inhibitory synapses). Simulations were carried out with varied synaptic weights and a wide range of synapse numbers.

ANN benchmarking

MTSF models are ideal candidates for modeling neuronal behavior in a stepwise manner as they can be designed to receive information about past synaptic inputs and membrane potentials in order to predict subsequent voltage responses. These ANNs have recently been demonstrated to be superior to other algorithms in handling multivariate temporal data such as audio signals (Kons and Toledo-Ronen, 2013), natural language (Collobert and Weston, 2008), and various other types of fluctuating time-series datasets (Zheng et al., 2014; Che et al., 2018; Zhang et al., 2019). To validate the overall suitability of different ANN architectures tested in this article for MTSF, we used a weather time-series dataset recorded by the Max Planck Institute for Biogeochemistry. The dataset contains 14 different features, including humidity, temperature, and atmospheric pressure collected every 10 min. The dataset was prepared by François Chollet for his book Deep Learning with Python (dataset preparation steps can be found on the Tensorflow website: https://www.tensorflow.org/tutorials/structured_data/time_series). All ANN architectures were implemented using the Keras deep-learning API (https://keras.io/) of the Tensorflow open-source library (version 2.3, Abadi, 2015;, https://www.tensorflow.org/), with Python 3.7.

The first architecture we implemented was a simple linear model consisting of three layers without activation functions; a Flatten layer, a Dense (fully connected) layer with 64 units, and a Dense layer with 3 units. The second architecture was a linear model with added nonlinear processing. The model contained three layers identical to the linear model, but the second layer had a sigmoid activation function. The third model was a deep neural net with mixed linear and nonlinear layers. Similar to the first two models, this architecture had a Flatten layer and a Dense layer with 64 units as the first two layers, followed by nine Dense layers (units 128, 256, 512, 1024, 1024, 512, 256, 128, and 64 for the nine Dense layers) with hyperbolic tangent (tanh) activation function and Dropout layers with 0.15 dropout rate. The last layer was the same Dense layer with three units as in case of the linear and nonlinear models. The fourth model was a modified version of the WaveNet architecture introduced in 2016 (Oord, 2016), implemented based on a previous publication (Beniaguev et al., 2021). The fifth and final architecture was a convolutional LSTM model (Donahue et al., 2015) that consists of three distinct functional layer segments. The lowest layers (close to the input layer) were three, one-dimensional convolutional layers (Conv1D) with 128, 100, and 50 units, and causal padding for temporal data processing. The first and third layers had a kernel size of 1, and the second layer had a kernel size of 5. The first two layers had ‘rectified linear unit’ (relu) activation functions, and the third layer had tanh activation; therefore, the first two layers were initialized by He-uniform variance scaling initializers (He et al., 2015), while the third layer was initialized by Glorot-uniform initialization (also known as Xavier uniform initialization) (Glorot, 2011). After flattening and repeating the output of this functional unit, a single LSTM layer (Hochreiter and Schmidhuber, 1997) handled the arriving input, providing recurrent information processing. This layer had 128 units, tanh activation function, Glorot-uniform initialization, and was tasked to return sequences instead of the last output. The final functional unit was composed of four Dense layers with 100 units, scaled exponential linear unit (selu) activations, and accordingly, LeCun-uniform initializations (Montavon et al., 2012). The dropout rate between Dense layers was set to 0.15.

All benchmarked architectures were compiled and fitted with the same protocol. During compiling, the loss function was set to calculate mean squared error and the Adam algorithm (Kingma and Ba, 2014) was chosen as the optimizer. The maximum number of epochs was set to 20; however, an early stopping protocol was defined to have a patience of 10, which was reached in all cases.

Single-compartmental simulation representation with ANNs

As neural nets favor processed data scaled between –1 and 1 or 0 and 1, we normalized the recorded membrane potentials and ionic currents. Due to the 1 Hz recording frequency, AP amplitudes were variable beyond physiologically plausible ranges; therefore, peak amplitudes were standardized. The trainable time-series data was consisting of 64-ms-long input matrices with three or five columns (corresponding to membrane potential, excitatory input, inhibitory input, and optionally I_Na and I_K current recordings) and target sequences were vectors with one or three elements (membrane potential and optional ionic currents). Training, testing, and validation datasets were created by splitting time-series samples 80-10–10%.

Benchmarking the five different ANN architectures proved that these models can handle time-series data predicting with similar accuracy; however, in order to obtain the best results, several optimization steps of the hyperparameter space were undertaken. Unless stated otherwise, layer and optimization parameters were unchanged compared to benchmarking procedures. First, linear models were created without a Flatten layer, instead of which a TimeDistributed wrapper was applied to the first Dense layer. The same changes were employed in case of the nonlinear model and the deep neural net. The fourth, convolutional model had 12 Conv1D layers with 128 filters, kernel size of 2, causal padding tanh activation function and dilatation rates constantly increasing by 2ⁿ. We found that the best optimization algorithm for passive and active membrane potential prediction is the Adam optimizer accelerated with Nesterov momentum (Dozat, 2015), with gradient clipping set to 1. Although mean absolute error and mean absolute percentage error were sufficient for passive membrane potential prediction, the active version warranted the usage of mean squared error in order to put emphasis on APs. We found out that the mechanistic inference of the full dynamic range of simulated neurons was a hard task for ANNs; therefore, we sequentially trained these models in a specific order. First, we taught the resting membrane potential by supplying voltage recordings with only a few or no synaptic inputs. This step was also useful to learn the isolated shapes of certain inputs. Second, we supplied highly active subthreshold membrane traces to the models and finally inputted suprathreshold membrane potential recordings. During the subsequent training steps, previous learning phases were mixed into the new training dataset in order to avoid the catastrophic forgetting of gradient-based neural networks (Goodfellow, 2015).

During altered excitation–inhibition ratios, the previously constructed single-compartmental model was used without modifications in layer weights and biases. Firing responses were fitted with different curves, a linear model,

y = a + b x

which could account for either subtractive or divisive inhibition (Bhatia et al., 2019), and a logistic curve,

y = \frac{A 1 - A 2}{1 + e^{(x - x 0) / d x}} + A 2

representing divisive normalization. Although the latter arithmetic operation is often approximated by an exponential curve, we felt the necessity to account for datapoints without spiking.

In experiments aimed at quantifying the effect of biophysical modifications of delayed rectifier potassium conductances, left- and right-shifted models were compared to control conditions point-by-point upon identical synaptic input streams, and the deviation from control conditions was expressed as absolute difference, measured in millivolts.

NMDA point-process model was constructed as a compound model consisting of an AMPA and an NMDA segment, both of which were designed based on NEURON’s built-in AlphaSynapse class. The logic of the model was based on a previous publication (Kim et al., 2013), where the AMPA model was only dependent on local membrane potential, while the NMDA model had an additional constraining Boltzmann function for gating voltage-dependent activation. The ANN was trained on several datasets having consistently higher randomly distributed synaptic inputs. The training dataset did not contain activity patterns tested in Figure 4. The training dataset consisted of an nX4 matrix, where the columns were membrane voltage, AMPA conductance, NMDA conductance, and inhibitory conductance. In the training dataset, AMPA and NMDA synapses were applied independently, and the Boltzmann function of NMDA was omitted. After the model learned the correct representation of NDMA activations, a hand-crafted layer was inserted into the ANN, which recalculated the conductance maximum of NMDA in accordance with the instantaneous membrane potential. Specifically, the function was expressed as

g N D M A = \frac{A 1 - A 2}{1 + e^{(v - x 0) / d x}} + A 2

where A1 is 1, A2 is –1, v denotes membrane potential, x0 is set to –63.32 in NEURON and 1.44 in the ANN, while dx is 0.013 in NEURON and 0.12 in the ANN.

CCN-LSTM for multicompartmental simulation representation

Data preprocessing was done as described for single-compartmental representations. Time-series data for CNN-LSTM input was prepared as matrices having 201 rows for membrane potential and 200 synapse vectors, and 64 rows (64-ms-long input). The CNN-LSTM architecture consisted of three Conv1d layers (512, 256, and 128 units), a Flatten layer, a RepeatVector, three LSTM layers (128 units each), and six Dense layers (128, 100, 100, 100, 100, 1 units). Activation functions and initializations were similar to the CNN-LSTM described above, with the exception of the first Dense layer, which included the relu activation function and He-uniform initialization. Additionally, Lasso regularization (Santosa and Symes, 1986) was applied to the first Conv1D layer. We found that the best optimizer for our purposes was a variant of the Adam optimizer based on the infinity norm, called Adamax (Kingma and Ba, 2014). Due to the non-normal distribution of the predicted membrane potentials, an inherent bias was present in our results, which was scaled by either an additional bias term, or a nonlinear function transformation.

Network construction was based on a previous publication (Hay and Segev, 2015). Briefly, 150 L5 PC were simulated in a network with varying unidirectional connectivity, and bidirectional connectivity proportional to the unidirectional connectivity (P_{bidirectional} = 0.5 * P_{unidirecional}). Reciprocal connections were 1.5 times stronger than unidirectional connections. In order to implement connectivity, a connection matrix was created, where presynaptic cells corresponded to the rows, and postsynaptic cells corresponded to the columns of the matrix. If there was a connection between two cells, the appropriate element of the matrix was set to 1, otherwise the matrix contained zeros. Next, cells were initialized with random input matrices. After a prediction was made for the subsequent membrane potential values, every cell was tested for suprathreshold activity. Upon spiking, rows of the connectivity matrix corresponding to the firing cells were selected, and the input matrices of the postsynaptic cells were supplemented with $x_{i j} * g_{c o n n}$ , where $x_{i j}$ corresponds to the element of the connectivity matrix for presynaptic cell i, and postsynaptic cells j, and $g_{c o n n}$ refers to the conductance of the synapses between two connected cells. As this step is carried out upon presynaptic spiking, regardless of whether two cells are connected or not ( $x_{i j}$ can be 0 or 1), the degree of connectivity does not influence simulation runtimes.

The delay between presynaptic AP at the soma and the onset of the postsynaptic response was 1 ms measured from the AP peak as the network simulations represent local circuit activity. If the simulated network is made to include spatially circuit components with more variability in their synaptic delays, to account for their spatial segregation, a buffer matrix must be created. The aim of this buffer matrix is to contain synaptic conductance values upon AP detection from the presynaptic cells, without immediately posting it on the input matrices of postsynaptic cells. Each connection consisted of five proximal contact sites. Compared to the original publication, we modified the parameters of the Tsodyks–Markram model (Tsodyks and Markram, 1997) used to govern synaptic transmission and plasticity. Based on a recent publication (Barros-Zulaica et al., 2019), we set U (fraction on synaptic resources used by a single spike) to 0.38, D (time constant for recovery from depression) to 365.6, and F (time constant for recovery from facilitation) to 25.71. The simulation was run for 250 or 300 ms, which consisted of a pre-stimuli period (to observe the occurrence of structured activity patterns) for 100 ms, and a post-stimuli period (to quantify network amplification). The stimulus itself consisted of a strong excitatory input (can be translated to 50 nS) delivered to a proximal dendritic segment, calibrated to elicit APs from all 150 cells in a 10-ms-long time window. Scaling of inhibitory inputs was carried out by changing inhibitory quantal size of background inputs, while scaling of excitatory drive affected quantal size of recurrent synaptic connections as well.

Custom top layers

We created custom top layers operating on the output layer of the CNN-LSTM in two different configurations, First, the ‘custom Izhikevich layer’ was implemented using the ‘CustomLayer’ class of Tensorflow. The internal variables and governing functions were implemented based on the original description of this model (Izhikevich, 2003). Briefly, the layer calculates the values of v and u dimensionless variables (v represents membrane potential, and u represents a membrane recovery variable), based on a, b, c, and d dimensionless parameters (a corresponds to the timescale of u, b sets the sensitivity of u, c describes the after-spike reset value of v, and d sets the after-spike reset value of u). Additionally, we set dt (time step) parameter free as it was necessary for accounting for the membrane time constant. Due to the low number of trainable parameters, this layer can be fitted with conventional fitting algorithms, such as the Nelder–Mead minimalization (Singer and Nelder, 2009), available in the ‘scipy’ package of Python. As the Izhikevich equations require information about the state of both u and v variable, yet the CNN-LSTM only predicts v, this layer requires inputs from two sources, v coming from the CNN-LSTM and u coming from previous predictions of the custom layer, directly bypassing the CNN-LSTM. Therefore, the previously used Sequential Application Programming Interface (API) of Tensorflow was discarded in favor of the Functional API. As the equations governing v and u require current as input, not voltage, the CNN-LSTM in this case needs to be tasked with solving for synaptic (and subsequent membrane) current. Consequently, to gauge the upper limits of this method, we administered a synaptic current waveform as input during layer evaluation.

The second approach we took for custom top layer creation involved a more conventional route, where recurrent encoder (stacked LSTM layers having first decreasing and then increasing number of units) were constructed, operating on a longer batch of CNN-LSTM predictions. Specifically, the encoder responsible for fluorescent calcium signal generation took 3 s of voltage input, while the voltage reporter encoder and decoder operated on 1024 ms of signal input.

Computational resources

We used several different commercially available and free-to-use computational resources to demonstrate the attainableness of large network simulations using neural networks. Single-compartmental NEURON simulations were carried out on a single CPU (Intel Core i7-5557U CPU @3.1 GHz), equipped with four logical processors and two cores. Python had access to the entirety of the CPU; however, no explicit attempts were made to enable code parallelization. To test runtimes on a CPU, only a single core was used. For multicompartmental NEURON simulations, we used the publicly available National Science Foundation-funded High Performance Computing resource via the Neuroscience Gateway (Sivagnanam et al., 2013). This resource was only used to generate training datasets. Speed comparison using CPUs was always carried out on the aforementioned single CPU. In contrast to NEURON models, ANN calculations are designed to run on GPUs rather than CPUs. Therefore, ANN models were run on the freely accessible Google Collaboratory GPUs (NVIDIA Tesla K80), Google Collaboratory TPUs (designed for handling tensor calculations typically created by Tensorflow) or a single high-performance GPU (GeForce GTX 1080 Ti). For speed comparisons, we ran these models on a single Google Collaboratory CPU (Intel Xeon, not specified, @2.2 GHz) and the previously mentioned single CPU as well. During NEURON and ANN simulations, parallelization was only employed for Neuroscience Gateway simulations and ANN fitting.

Statistics

Averages of multiple measurements are presented as mean ± SD. Data were statistically analyzed by ANOVA test using Origin software and custom-written Python scripts. Normality of the data was analyzed with Shapiro–Wilk test. Explained variance was quantified as 1 minus the fitting error normalized by the variance of the signal (Ujfalussy et al., 2018). For accuracy measurements, APs were counted within a 10 ms time window as true-positive APs. Precision and recall were calculated based on the following equations:

p r e c i s i o n = \frac{T P}{T P + F P}

r e c a l l = \frac{T P}{T P + F N}

where FP in the false-positive rate and FN is the false-negative rate.

Data and software availability

All codes used for simulating single- and multicompartmental NEURON models for training dataset creation, ANN benchmarking, ANN representations, and the L5 microcircuit are available on GitHub (https://github.com/ViktorJOlah/Neuro_ANN, copy archived at swh:1:rev:52616946edd6489a967a645bbab805577b15ad7f; Oláh, 2022) and Dryad.

Data availability

All code used for simulating single and multicompartmental NEURON models, ANN benchmarking, ANN representations, and the layer 5 microcircuit are available on GitHub (https://github.com/ViktorJOlah/Neuro_ANN, copy archived at swh:1:rev:52616946edd6489a967a645bbab805577b15ad7f) and Dryad (doi: https://doi.org/10.5061/dryad.0cfxpnw60). To adhere with eLife data availability policies, we also uploaded all data points displayed in the text and figures, on Dryad (doi: https://doi.org/10.5061/dryad.0cfxpnw60) in compliance with FAIR (Findable, Accessible, Interoperable, Reusable) principles.

The following data sets were generated

(2022) Dryad Digital Repository
Ultrafast simulation of large-scale neocortical microcircuitry with biophysically realistic neurons.

https://doi.org/10.5061/dryad.0cfxpnw60

References

Preprint
1. Abadi M
(2015) TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems
arXiv.

https://arxiv.org/abs/1603.04467
- Google Scholar
1. Akbarian S
2. Chen RZ
3. Gribnau J
4. Rasmussen TP
5. Fong H
6. Jaenisch R
7. Jones EG
(2001) Expression pattern of the rett syndrome gene mecp2 in primate prefrontal cortex
Neurobiology of Disease 8:784–791.

https://doi.org/10.1006/nbdi.2001.0420
- PubMed
- Google Scholar
(2018) Is your dataset big enough? sample size requirements when using artificial neural networks for discrete choice analysis
Journal of Choice Modelling 28:167–182.

https://doi.org/10.1016/j.jocm.2018.07.002
- Google Scholar
1. Amsalem O
2. Eyal G
3. Rogozinski N
4. Gevaert M
5. Kumbhar P
6. Schürmann F
7. Segev I
(2020) An efficient analytical reduction of detailed nonlinear neuron models
Nature Communications 11:1–13.

https://doi.org/10.1038/s41467-019-13932-6
- PubMed
- Google Scholar
Preprint
(2019) A Comprehensive Data-Driven Model of Cat Primary Visual Cortex
bioRxiv.

https://doi.org/10.1101/416156
- Google Scholar
1. Aradi I
2. Holmes WR
(1999) Role of multiple calcium and calcium-dependent conductances in regulation of hippocampal dentate granule cell excitability
Journal of Computational Neuroscience 6:215–235.

https://doi.org/10.1023/a:1008801821784
- PubMed
- Google Scholar
1. Arkhipov A
2. Gouwens NW
3. Billeh YN
4. Gratiy S
5. Iyer R
6. Wei Z
7. Xu Z
8. Abbasi-Asl R
9. Berg J
10. Buice M
11. Cain N
12. da Costa N
13. de Vries S
14. Denman D
15. Durand S
16. Feng D
17. Jarsky T
18. Lecoq J
19. Lee B
20. Li L
21. Mihalas S
22. Ocker GK
23. Olsen SR
24. Reid RC
25. Soler-Llavina G
26. Sorensen SA
27. Wang Q
28. Waters J
29. Scanziani M
30. Koch C
(2018) Visual physiology of the layer 4 cortical circuit in silico
PLOS Computational Biology 14:e1006535.

https://doi.org/10.1371/journal.pcbi.1006535
- PubMed
- Google Scholar
1. Armstrong DD
(1997) Review of rett syndrome
Journal of Neuropathology and Experimental Neurology 56:843–849.

https://doi.org/10.1097/00005072-199708000-00001
- PubMed
- Google Scholar
1. Armstrong DD
(2002) Neuropathology of Rett syndrome
Mental Retardation and Developmental Disabilities Research Reviews 8:72–76.

https://doi.org/10.1002/mrdd.10027
- PubMed
- Google Scholar
1. Armstrong DD
(2005) Neuropathology of Rett syndrome
Journal of Child Neurology 20:747–753.

https://doi.org/10.1177/08830738050200090901
- PubMed
- Google Scholar
(2006) Hippocampal synaptic plasticity is impaired in the MECP2-null mouse model of Rett syndrome
Neurobiology of Disease 21:217–227.

https://doi.org/10.1016/j.nbd.2005.07.005
- PubMed
- Google Scholar
Conference
(2009) 2009 International Conference on Field Programmable Logic and Applications (FPL)
Performance comparison of FPGA, GPU and CPU in image processing.

https://doi.org/10.1109/FPL.2009.5272532
- Google Scholar
1. Ascoli GA
2. L. AN
3. Anderson SA
4. Barrionuevo G
5. Benavides-Piccione R
6. Burkhalter A
7. Buzsáki G
8. Cauli B
9. DeFelipe J
10. Fairén A
11. Feldmeyer D
12. Fishell G
13. Fregnac Y
14. Freund TF
15. Gardner D
16. Gardner EP
17. Goldberg JH
18. Helmstaedter M
19. Hestrin S
20. Karube F
21. Kisvárday ZF
22. Lambolez B
23. Lewis DA
24. Marin O
25. Markram H
26. Muñoz A
27. Packer A
28. Petersen CCH
29. Rockland KS
(2008) Petilla terminology: nomenclature of features of gabaergic interneurons of the cerebral cortex
Nature Reviews Neuroscience 9:557–568.

https://doi.org/10.1038/nrn2402
- Google Scholar
(2019) Estimating the readily-releasable vesicle pool size at synaptic connections in the neocortex
Frontiers in Synaptic Neuroscience 11:29.

https://doi.org/10.3389/fnsyn.2019.00029
- PubMed
- Google Scholar
1. Bartos M
2. Vida I
3. Frotscher M
4. Meyer A
5. Monyer H
6. Geiger JRP
7. Jonas P
(2002) Fast synaptic inhibition promotes synchronized gamma oscillations in hippocampal interneuron networks
PNAS 99:13222–13227.

https://doi.org/10.1073/pnas.192233099
- PubMed
- Google Scholar
(1994) Rett syndrome: 3-D confocal microscopy of cortical pyramidal dendrites and afferents
Neuroreport 5:1509–1513.

https://doi.org/10.1097/00001756-199407000-00025
- PubMed
- Google Scholar
1. Belichenko P
2. Wright EE
3. Belichenko NP
4. Masliah E
5. Li HH
6. Mobley WC
7. Francke U
(2009) Widespread changes in dendritic and axonal morphology in mecp2-mutant mouse models of rett syndrome: evidence for disruption of neuronal networks
The Journal of Comparative Neurology 514:240–258.

https://doi.org/10.1002/cne.22009
- PubMed
- Google Scholar
1. Ben-Shalom R
2. Ladd A
3. Artherya NS
4. Cross C
5. Kim KG
6. Sanghevi H
7. Korngreen A
8. Bouchard KE
9. Bender KJ
(2022) NeuroGPU: accelerating multi-compartment, biophysically detailed neuron simulations on gpus
Journal of Neuroscience Methods 366:109400.

https://doi.org/10.1016/j.jneumeth.2021.109400
- PubMed
- Google Scholar
(2021) Single cortical neurons as deep artificial neural networks
Neuron 109:2727–2739.

https://doi.org/10.1016/j.neuron.2021.07.002
- PubMed
- Google Scholar
(1997) Are artificial neural networks black boxes?
IEEE Transactions on Neural Networks 8:1156–1164.

https://doi.org/10.1109/72.623216
- PubMed
- Google Scholar
1. Bezaire MJ
2. Raikov I
3. Burk K
4. Vyas D
5. Soltesz I
(2016) Interneuronal mechanisms of hippocampal theta oscillations in a full-scale model of the rodent CA1 circuit
eLife 5:e18566.

https://doi.org/10.7554/eLife.18566
- PubMed
- Google Scholar
(2019) Precise excitation-inhibition balance controls gain and timing in the hippocampus
eLife 8:e43415.

https://doi.org/10.7554/eLife.43415
- PubMed
- Google Scholar
1. Billeh YN
2. Cai B
3. Gratiy SL
4. Dai K
5. Iyer R
6. Gouwens NW
7. Abbasi-Asl R
8. Jia X
9. Siegle JH
10. Olsen SR
11. Koch C
12. Mihalas S
13. Arkhipov A
(2020) Systematic integration of structural and functional data into multi-scale models of mouse primary visual cortex
Neuron 106:388–403.

https://doi.org/10.1016/j.neuron.2020.01.040
- PubMed
- Google Scholar
1. Branco T
2. Häusser M
(2011) Synaptic integration gradients in single cortical pyramidal cell dendrites
Neuron 69:885–892.

https://doi.org/10.1016/j.neuron.2011.02.006
- PubMed
- Google Scholar
1. Brette R
2. Gerstner W
(2005) Adaptive exponential integrate-and-fire model as an effective description of neuronal activity
Journal of Neurophysiology 94:3637–3642.

https://doi.org/10.1152/jn.00686.2005
- PubMed
- Google Scholar
1. Brunner J
2. Szabadics J
(2016) Analogue modulation of back-propagating action potentials enables dendritic hybrid signalling
Nature Communications 7:1–13.

https://doi.org/10.1038/ncomms13033
- PubMed
- Google Scholar
1. Bush PC
2. Sejnowski TJ
(1993) Reduced compartmental models of neocortical pyramidal cells
Journal of Neuroscience Methods 46:159–166.

https://doi.org/10.1016/0165-0270(93)90151-g
- PubMed
- Google Scholar
(2011) Network hyperexcitability in hippocampal slices from MeCP2 mutant mice revealed by voltage-sensitive dye imaging
Journal of Neurophysiology 105:1768–1784.

https://doi.org/10.1152/jn.00800.2010
- PubMed
- Google Scholar
1. Carandini M
2. Heeger DJ
(2011) Normalization as a canonical neural computation
Nature Reviews. Neuroscience 13:51–62.

https://doi.org/10.1038/nrn3136
- PubMed
- Google Scholar
Book
1. Carnevale NT
2. Hines ML
(2006) The NEURON Book
Cambridge University Press.

https://doi.org/10.1017/CBO9780511541612
- Google Scholar
1. Castelvecchi D
(2016) Can we open the black box of AI?
Nature 538:20–23.

https://doi.org/10.1038/538020a
- PubMed
- Google Scholar
1. Chadderdon GL
2. Mohan A
3. Suter BA
4. Neymotin SA
5. Kerr CC
6. Francis JT
7. Shepherd GMG
8. Lytton WW
(2014) Motor cortex microcircuit simulation based on brain activity mapping
Neural Computation 26:1239–1262.

https://doi.org/10.1162/NECO_a_00602
- PubMed
- Google Scholar
1. Chahrour M
2. Zoghbi HY
(2007) The story of Rett syndrome: from clinic to neurobiology
Neuron 56:422–437.

https://doi.org/10.1016/j.neuron.2007.10.001
- PubMed
- Google Scholar
(2007) Mecp2 controls excitatory synaptic strength by regulating glutamatergic synapse number
Neuron 56:58–65.

https://doi.org/10.1016/j.neuron.2007.08.018
- PubMed
- Google Scholar
1. Chao H-T
2. Chen H
3. Samaco RC
4. Xue M
5. Chahrour M
6. Yoo J
7. Neul JL
8. Gong S
9. Lu H-C
10. Heintz N
11. Ekker M
12. Rubenstein JLR
13. Noebels JL
14. Rosenmund C
15. Zoghbi HY
(2010) Dysfunction in GABA signalling mediates autism-like stereotypies and Rett syndrome phenotypes
Nature 468:263–269.

https://doi.org/10.1038/nature09582
- PubMed
- Google Scholar
(2017) Dendrites of dentate gyrus granule cells contribute to pattern separation by controlling sparsity
Hippocampus 27:89–110.

https://doi.org/10.1002/hipo.22675
- PubMed
- Google Scholar
1. Che Z
2. Purushotham S
3. Cho K
4. Sontag D
5. Liu Y
(2018) Recurrent neural networks for multivariate time series with missing values
Scientific Reports 8:1–12.

https://doi.org/10.1038/s41598-018-24271-9
- PubMed
- Google Scholar
1. Chen T-W
2. Wardill TJ
3. Sun Y
4. Pulver SR
5. Renninger SL
6. Baohan A
7. Schreiter ER
8. Kerr RA
9. Orger MB
10. Jayaraman V
11. Looger LL
12. Svoboda K
13. Kim DS
(2013) Ultrasensitive fluorescent proteins for imaging neuronal activity
Nature 499:295–300.

https://doi.org/10.1038/nature12354
- PubMed
- Google Scholar
Conference
1. Collobert R
2. Weston J
(2008) A unified architecture for natural language processing: Deep neural networks with multitask learning
Proceedings of the 25th international conference on Machine learning.

https://doi.org/10.1145/1390156.1390177
- Google Scholar
1. Constantinople CM
2. Bruno RM
(2013) Deep cortical layers are activated directly by thalamus
Science 340:1591–1594.

https://doi.org/10.1126/science.1236425
- PubMed
- Google Scholar
1. Cutsuridis V.
2. Wennekers T
(2009) Hippocampus, microcircuits and associative memory
Neural Networks 22:1120–1128.

https://doi.org/10.1016/j.neunet.2009.07.009
- PubMed
- Google Scholar
(2010) Encoding and retrieval in a model of the hippocampal CA1 microcircuit
Hippocampus 20:423–446.

https://doi.org/10.1002/hipo.20661
- PubMed
- Google Scholar
1. Dani V
2. Chang Q
3. Maffei A
4. Turrigiano GG
5. Jaenisch R
6. Nelson SB
(2005) Reduced cortical activity due to a shift in the balance between excitation and inhibition in a mouse model of rett syndrome
PNAS 102:12560–12565.

https://doi.org/10.1073/pnas.0506071102
- PubMed
- Google Scholar
1. Dani VS
2. Nelson SB
(2009) Intact long-term potentiation but reduced connectivity between neocortical layer 5 pyramidal neurons in a mouse model of rett syndrome
The Journal of Neuroscience 29:11263–11270.

https://doi.org/10.1523/JNEUROSCI.1019-09.2009
- PubMed
- Google Scholar
Book
(2017) Artificial Neural Network Architectures and Training Processes
Springer.

https://doi.org/10.1007/978-3-319-43162-8
- Google Scholar
Book
1. Dayan P
2. Abbott LF
(2001)
Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems-Computational Neuroscience Series

MIT Press.
- Google Scholar
1. Dayhoff JE
2. DeLeo JM
(2001) Artificial neural networks: opening the black box
Cancer 91:1615–1635.

https://doi.org/10.1002/1097-0142(20010415)91:8+3.0.co;2-l
- Google Scholar
1. D’Cruz JA
2. Wu C
3. Zahid T
4. El-Hayek Y
5. Zhang L
6. Eubanks JH
(2010) Alterations of cortical and hippocampal EEG activity in mecp2-deficient mice
Neurobiology of Disease 38:8–16.

https://doi.org/10.1016/j.nbd.2009.12.018
- PubMed
- Google Scholar
(2007) Layer- and cell-type-specific suprathreshold stimulus representation in rat primary somatosensory cortex
The Journal of Physiology 581:139–154.

https://doi.org/10.1113/jphysiol.2006.124321
- PubMed
- Google Scholar
1. De Schutter E
2. Bower JM
(1994) An active membrane model of the cerebellar Purkinje cell. I. simulation of current clamps in slice
Journal of Neurophysiology 71:375–400.

https://doi.org/10.1152/jn.1994.71.1.375
- PubMed
- Google Scholar
1. Destexhe A
(1997) Conductance-based integrate-and-fire models
Neural Computation 9:503–514.

https://doi.org/10.1162/neco.1997.9.3.503
- PubMed
- Google Scholar
(1998)
Dendritic low-threshold calcium currents in thalamic relay cells

The Journal of Neuroscience 18:3574–3588.
- PubMed
- Google Scholar
Book
1. Destexhe A
2. Sejnowski TJ
(2001)
Thalamocortical Assemblies: How Ion Channels, Single Neurons and Large-Scale Networks Organize Sleep Oscillations

Oxford University Press.
- Google Scholar
Preprint
1. Dillon JV
(2017) Tensorflow Distributions
arXiv.

https://arxiv.org/abs/1711.10604
- Google Scholar
(2019) Cellular and synaptic phenotypes lead to disrupted information processing in fmr1-KO mouse layer 4 barrel cortex
Nature Communications 10:1–18.

https://doi.org/10.1038/s41467-019-12736-y
- PubMed
- Google Scholar
Conference
(2015) Long-term recurrent convolutional networks for visual recognition and description
2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR.

https://doi.org/10.1109/CVPR.2015.7298878
- Google Scholar
Report
1. Dozat T
(2015)
Incorporating Nesterov Momentum into Adam Technical Report

Stanford University.
- Google Scholar
1. Druckmann S
2. Berger TK
3. Schürmann F
4. Hill S
5. Markram H
6. Segev I
(2011) Effective stimuli for constructing reliable neuron models
PLOS Computational Biology 7:e1002133.

https://doi.org/10.1371/journal.pcbi.1002133
- PubMed
- Google Scholar
(2002) Image processing with neural networks—a review
Pattern Recognition 35:2279–2301.

https://doi.org/10.1016/S0031-3203(01)00178-9
- Google Scholar
1. Enoki R
2. Inoue M
3. Hashimoto Y
4. Kudo Y
5. Miyakawa H
(2001) Gabaergic control of synaptic summation in hippocampal CA1 pyramidal neurons
Hippocampus 11:683–689.

https://doi.org/10.1002/hipo.1083
- PubMed
- Google Scholar
1. Eppler JM
(2008) PyNEST: a convenient interface to the nest simulator
Frontiers in Neuroinformatics 2:2008.

https://doi.org/10.3389/neuro.11.012.2008
- Google Scholar
Conference
(2009) NeMo: a platform for neural modelling of spiking neurons using GPUs
2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP).

https://doi.org/10.1109/ASAP.2009.24
- Google Scholar
1. Fukuda T
2. Yamashita Y
3. Nagamitsu S
4. Miyamoto K
5. Jin J-J
6. Ohmori I
7. Ohtsuka Y
8. Kuwajima K
9. Endo S
10. Iwai T
11. Yamagata H
12. Tabara Y
13. Miki T
14. Matsuishi T
15. Kondo I
(2005) Methyl-Cpg binding protein 2 gene (MeCP2) variations in Japanese patients with Rett syndrome: pathological mutations and polymorphisms
Brain & Development 27:211–217.

https://doi.org/10.1016/j.braindev.2004.06.003
- PubMed
- Google Scholar
(2010) Diminished activity-dependent brain-derived neurotrophic factor expression underlies cortical neuron microcircuit hypoconnectivity resulting from exposure to mutant huntingtin fragments
The Journal of Pharmacology and Experimental Therapeutics 335:13–22.

https://doi.org/10.1124/jpet.110.167551
- PubMed
- Google Scholar
(2019) Machine learning in energy economics and finance: A review
Energy Economics 81:709–727.

https://doi.org/10.1016/j.eneco.2019.05.006
- Google Scholar
1. Glaze DG
(2005) Neurophysiology of rett syndrome
Journal of Child Neurology 20:740–746.

https://doi.org/10.1177/08830738050200090801
- PubMed
- Google Scholar
Conference
1. Glorot X
(2011)
Deep sparse rectifier neural networks

Proceedings of the fourteenth international conference on artificial intelligence and statistics, JMLR Workshop and Conference Proceedings.
- Google Scholar
Preprint
1. Goodfellow IJ
(2015) An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks
arXiv.

https://arxiv.org/abs/1312.6211
- Google Scholar
1. Gouwens NW
2. Berg J
3. Feng D
4. Sorensen SA
5. Zeng H
6. Hawrylycz MJ
7. Koch C
8. Arkhipov A
(2018) Systematic generation of biophysically detailed models for diverse cortical neuron types
Nature Communications 9:1–13.

https://doi.org/10.1038/s41467-017-02718-3
- PubMed
- Google Scholar
1. Gouwens NW
2. Sorensen SA
3. Baftizadeh F
4. Budzillo A
5. Lee BR
6. Jarsky T
7. Alfiler L
8. Baker K
9. Barkan E
10. Berry K
11. Bertagnolli D
12. Bickley K
13. Bomben J
14. Braun T
15. Brouner K
16. Casper T
17. Crichton K
18. Daigle TL
19. Dalley R
20. de Frates RA
21. Dee N
22. Desta T
23. Lee SD
24. Dotson N
25. Egdorf T
26. Ellingwood L
27. Enstrom R
28. Esposito L
29. Farrell C
30. Feng D
31. Fong O
32. Gala R
33. Gamlin C
34. Gary A
35. Glandon A
36. Goldy J
37. Gorham M
38. Graybuck L
39. Gu H
40. Hadley K
41. Hawrylycz MJ
42. Henry AM
43. Hill D
44. Hupp M
45. Kebede S
46. Kim TK
47. Kim L
48. Kroll M
49. Lee C
50. Link KE
51. Mallory M
52. Mann R
53. Maxwell M
54. McGraw M
55. McMillen D
56. Mukora A
57. Ng L
58. Ng L
59. Ngo K
60. Nicovich PR
61. Oldre A
62. Park D
63. Peng H
64. Penn O
65. Pham T
66. Pom A
67. Popović Z
68. Potekhina L
69. Rajanbabu R
70. Ransford S
71. Reid D
72. Rimorin C
73. Robertson M
74. Ronellenfitch K
75. Ruiz A
76. Sandman D
77. Smith K
78. Sulc J
79. Sunkin SM
80. Szafer A
81. Tieu M
82. Torkelson A
83. Trinh J
84. Tung H
85. Wakeman W
86. Ward K
87. Williams G
88. Zhou Z
89. Ting JT
90. Arkhipov A
91. Sümbül U
92. Lein ES
93. Koch C
94. Yao Z
95. Tasic B
96. Berg J
97. Murphy GJ
98. Zeng H
(2020) Integrated morphoelectric and transcriptomic classification of cortical gabaergic cells
Cell 183:935–953.

https://doi.org/10.1016/j.cell.2020.09.057
- PubMed
- Google Scholar
Book
1. Graupe D
(2013) Principles of Artificial Neural Networks
World Scientific.

https://doi.org/10.1142/8868
- Google Scholar
1. Guy J
2. Gan J
3. Selfridge J
4. Cobb S
5. Bird A
(2007) Reversal of neurological defects in a mouse model of Rett syndrome
Science 315:1143–1147.

https://doi.org/10.1126/science.1138389
- PubMed
- Google Scholar
1. Hagberg B
2. Goutières F
3. Hanefeld F
4. Rett A
5. Wilson J
(1985) Rett syndrome: criteria for inclusion and exclusion
Brain & Development 7:372–373.

https://doi.org/10.1016/s0387-7604(85)80048-6
- PubMed
- Google Scholar
(2012) State and location dependence of action potential metabolic cost in cortical pyramidal neurons
Nature Neuroscience 15:1007–1014.

https://doi.org/10.1038/nn.3132
- PubMed
- Google Scholar
(2015) Distribution and function of HCN channels in the apical dendritic tuft of neocortical pyramidal neurons
The Journal of Neuroscience 35:1024–1037.

https://doi.org/10.1523/JNEUROSCI.2813-14.2015
- PubMed
- Google Scholar
Book
1. Hassoun MH
(1995) Fundamentals of Artificial Neural Networks
MIT press.

https://doi.org/10.1109/JPROC.1996.503146
- Google Scholar
1. Häusser M
2. Mel B
(2003) Dendrites: bug or feature?
Current Opinion in Neurobiology 13:372–383.

https://doi.org/10.1016/s0959-4388(03)00075-8
- PubMed
- Google Scholar
1. Hay E
2. Segev I
(2015) Dendritic excitability and gain control in recurrent cortical microcircuits
Cerebral Cortex 25:3561–3571.

https://doi.org/10.1093/cercor/bhu200
- PubMed
- Google Scholar
Conference
1. He K
2. Zhang X
3. Ren S
4. Sun J
(2015) Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
2015 IEEE International Conference on Computer Vision (ICCV).

https://doi.org/10.1109/ICCV.2015.123
- Google Scholar
(2011) The capabilities and limitations of conductance-based compartmental neuron models with reduced branched or unbranched morphologies and active dendrites
Journal of Computational Neuroscience 30:301–321.

https://doi.org/10.1007/s10827-010-0258-z
- PubMed
- Google Scholar
1. Hines ML
2. Carnevale NT
(1997) The neuron simulation environment
Neural Computation 9:1179–1209.

https://doi.org/10.1162/neco.1997.9.6.1179
- PubMed
- Google Scholar
1. Hines ML
2. Carnevale NT
(2008) Translating network models to parallel hardware in neuron
Journal of Neuroscience Methods 169:425–455.

https://doi.org/10.1016/j.jneumeth.2007.09.010
- PubMed
- Google Scholar
(2008) Fully implicit parallel simulation of single neurons
Journal of Computational Neuroscience 25:439–448.

https://doi.org/10.1007/s10827-008-0087-5
- PubMed
- Google Scholar
1. Hochreiter S
2. Schmidhuber J
(1997) Long short-term memory
Neural Computation 9:1735–1780.

https://doi.org/10.1162/neco.1997.9.8.1735
- PubMed
- Google Scholar
1. Hodgkin AL
2. Huxley AF
(1952) A quantitative description of membrane current and its application to conduction and excitation in nerve
The Journal of Physiology 117:500–544.

https://doi.org/10.1113/jphysiol.1952.sp004764
- PubMed
- Google Scholar
Book
1. Holmstrom M
(2016)
Machine Learning Applied to Weather Forecasting

Stanford University.
- Google Scholar
1. Izhikevich EM
(2003) Simple model of spiking neurons
IEEE Transactions on Neural Networks 14:1569–1572.

https://doi.org/10.1109/TNN.2003.820440
- PubMed
- Google Scholar
1. Jarsky T
2. Roxin A
3. Kath WL
4. Spruston N
(2005) Conditional dendritic spike propagation following distal synaptic activation of hippocampal CA1 pyramidal neurons
Nature Neuroscience 8:1667–1676.

https://doi.org/10.1038/nn1599
- PubMed
- Google Scholar
1. Jaslove SW
(1992) The integrative properties of spiny distal dendrites
Neuroscience 47:495–519.

https://doi.org/10.1016/0306-4522(92)90161-t
- PubMed
- Google Scholar
(2018) Inter-areal balanced amplification enhances signal propagation in a large-scale circuit model of the primate cortex
Neuron 98:222–234.

https://doi.org/10.1016/j.neuron.2018.02.031
- PubMed
- Google Scholar
Preprint
1. Kawaguchi K
(2016) Deep Learning without Poor Local Minima
arXiv.

https://arxiv.org/abs/1605.07110
- Google Scholar
(2013) Cortical information flow in Parkinson’s disease: a composite network/field model
Frontiers in Computational Neuroscience 7:39.

https://doi.org/10.3389/fncom.2013.00039
- PubMed
- Google Scholar
1. Kim S
2. Guzman SJ
3. Hu H
4. Jonas P
(2012) Active dendrites support efficient initiation of dendritic spikes in hippocampal CA3 pyramidal neurons
Nature Neuroscience 15:600–606.

https://doi.org/10.1038/nn.3060
- PubMed
- Google Scholar
1. Kim D
2. Paré D
3. Nair SS
(2013) Mechanisms contributing to the induction and storage of pavlovian fear memories in the lateral amygdala
Learning & Memory 20:421–430.

https://doi.org/10.1101/lm.030262.113
- PubMed
- Google Scholar
Preprint
1. Kingma DP
2. Ba J
(2014) Adam: A Method for Stochastic Optimization
arXiv.

https://arxiv.org/abs/1412.6980
- Google Scholar
1. Kishi N
2. Macklis JD
(2004) Mecp2 is progressively expressed in post-migratory neurons and is involved in neuronal maturation rather than cell fate decisions
Molecular and Cellular Neurosciences 27:306–321.

https://doi.org/10.1016/j.mcn.2004.07.006
- PubMed
- Google Scholar
(2021) PyGeNN: a python library for GPU-enhanced neural networks
Frontiers in Neuroinformatics 15:659005.

https://doi.org/10.3389/fninf.2021.659005
- PubMed
- Google Scholar
1. Knight J
2. Nowotny T
(2021) Larger GPU-accelerated brain simulations with procedural connectivity
Nature Computational Science 1:136–142.

https://doi.org/10.1038/s43588-020-00022-7
- Google Scholar
1. Kole MHP
2. Ilschner SU
3. Kampa BM
4. Williams SR
5. Ruben PC
6. Stuart GJ
(2008) Action potential generation requires a high sodium channel density in the axon initial segment
Nature Neuroscience 11:178–186.

https://doi.org/10.1038/nn2040
- PubMed
- Google Scholar
Conference
1. Kons Z
2. Toledo-Ronen O
(2013) Audio event classification using deep neural networks
Interspeech 2013.

https://doi.org/10.21437/Interspeech.2013-384
- Google Scholar
1. Kriegeskorte N
2. Douglas PK
(2018) Cognitive computational neuroscience
Nature Neuroscience 21:1148–1160.

https://doi.org/10.1038/s41593-018-0210-5
- PubMed
- Google Scholar
1. Kumar A
2. Schiff O
3. Barkai E
4. Mel BW
5. Poleg-Polsky A
6. Schiller J
(2018) Nmda spikes mediate amplification of inputs in the rat piriform cortex
eLife 7:e38446.

https://doi.org/10.7554/eLife.38446
- PubMed
- Google Scholar
1. Kumbhar P
2. Hines M
3. Fouriaux J
4. Ovcharenko A
5. King J
6. Delalondre F
7. Schürmann F
(2019) CoreNEURON: an optimized compute engine for the neuron simulator
Frontiers in Neuroinformatics 13:63.

https://doi.org/10.3389/fninf.2019.00063
- PubMed
- Google Scholar
1. Larkum ME
2. Nevian T
3. Sandler M
4. Polsky A
5. Schiller J
(2009) Synaptic integration in tuft dendrites of layer 5 pyramidal neurons: a new unifying principle
Science 325:756–760.

https://doi.org/10.1126/science.1171958
- PubMed
- Google Scholar
1. Lien AD
2. Scanziani M
(2013) Tuned thalamic excitation is amplified by visual cortical circuits
Nature Neuroscience 16:1315–1323.

https://doi.org/10.1038/nn.3488
- PubMed
- Google Scholar
1. Liou J-Y
2. Smith EH
3. Bateman LM
4. Bruce SL
5. McKhann GM
6. Goodman RR
7. Emerson RG
8. Schevon CA
9. Abbott LF
(2020) A model for focal seizure onset, propagation, evolution, and progression
eLife 9:e50927.

https://doi.org/10.7554/eLife.50927
- PubMed
- Google Scholar
(2008) Compartmentalized dendritic plasticity and input feature storage in neurons
Nature 452:436–441.

https://doi.org/10.1038/nature06725
- PubMed
- Google Scholar
(2016) Simulation neurotechnologies for advancing brain research: parallelizing large networks in neuron
Neural Computation 28:2063–2090.

https://doi.org/10.1162/NECO_a_00876
- PubMed
- Google Scholar
1. Magee JC
2. Cook EP
(2000) Somatic EPSP amplitude is independent of synapse location in hippocampal pyramidal neurons
Nature Neuroscience 3:895–903.

https://doi.org/10.1038/78800
- PubMed
- Google Scholar
(1995) A model of spike initiation in neocortical pyramidal neurons
Neuron 15:1427–1439.

https://doi.org/10.1016/0896-6273(95)90020-9
- PubMed
- Google Scholar
1. Mainen ZF
2. Sejnowski TJ
(1996) Influence of dendritic structure on firing pattern in model neocortical neurons
Nature 382:363–366.

https://doi.org/10.1038/382363a0
- PubMed
- Google Scholar
1. Major G
2. Larkman AU
3. Jonas P
4. Sakmann B
5. Jack JJ
(1994)
Detailed passive cable models of whole-cell recorded CA3 pyramidal neurons in rat hippocampal slices

The Journal of Neuroscience 14:4613–4638.
- PubMed
- Google Scholar
1. Major Guy
2. Polsky A
3. Denk W
4. Schiller J
5. Tank DW
(2008) Spatiotemporally graded NMDA spike/plateau potentials in basal dendrites of neocortical pyramidal neurons
Journal of Neurophysiology 99:2584–2601.

https://doi.org/10.1152/jn.00011.2008
- PubMed
- Google Scholar
(2012) Fast and accurate low-dimensional reduction of biophysically detailed neuron models
Scientific Reports 2:1–7.

https://doi.org/10.1038/srep00928
- PubMed
- Google Scholar
1. Markram H
2. Muller E
3. Ramaswamy S
4. Reimann MW
5. Abdellah M
6. Sanchez CA
7. Ailamaki A
8. Alonso-Nanclares L
9. Antille N
10. Arsever S
11. Kahou GAA
12. Berger TK
13. Bilgili A
14. Buncic N
15. Chalimourda A
16. Chindemi G
17. Courcol J-D
18. Delalondre F
19. Delattre V
20. Druckmann S
21. Dumusc R
22. Dynes J
23. Eilemann S
24. Gal E
25. Gevaert ME
26. Ghobril J-P
27. Gidon A
28. Graham JW
29. Gupta A
30. Haenel V
31. Hay E
32. Heinis T
33. Hernando JB
34. Hines M
35. Kanari L
36. Keller D
37. Kenyon J
38. Khazen G
39. Kim Y
40. King JG
41. Kisvarday Z
42. Kumbhar P
43. Lasserre S
44. Le Bé J-V
45. Magalhães BRC
46. Merchán-Pérez A
47. Meystre J
48. Morrice BR
49. Muller J
50. Muñoz-Céspedes A
51. Muralidhar S
52. Muthurasa K
53. Nachbaur D
54. Newton TH
55. Nolte M
56. Ovcharenko A
57. Palacios J
58. Pastor L
59. Perin R
60. Ranjan R
61. Riachi I
62. Rodríguez J-R
63. Riquelme JL
64. Rössert C
65. Sfyrakis K
66. Shi Y
67. Shillcock JC
68. Silberberg G
69. Silva R
70. Tauheed F
71. Telefont M
72. Toledo-Rodriguez M
73. Tränkler T
74. Van Geit W
75. Díaz JV
76. Walker R
77. Wang Y
78. Zaninetta SM
79. DeFelipe J
80. Hill SL
81. Segev I
82. Schürmann F
(2015) Reconstruction and simulation of neocortical microcircuitry
Cell 163:456–492.

https://doi.org/10.1016/j.cell.2015.09.029
- PubMed
- Google Scholar
1. McCormick DA
2. Contreras D
(2001) On the cellular and network bases of epileptic seizures
Annual Review of Physiology 63:815–846.

https://doi.org/10.1146/annurev.physiol.63.1.815
- PubMed
- Google Scholar
1. McLeod F
2. Ganley R
3. Williams L
4. Selfridge J
5. Bird A
6. Cobb SR
(2013) Reduced seizure threshold and altered network oscillatory properties in a mouse model of Rett syndrome
Neuroscience 231:195–205.

https://doi.org/10.1016/j.neuroscience.2012.11.058
- PubMed
- Google Scholar
1. Medrihan L
2. Tantalaki E
3. Aramuni G
4. Sargsyan V
5. Dudanova I
6. Missler M
7. Zhang W
(2008) Early defects of GABAergic synapses in the brain stem of a MeCP2 mouse model of Rett syndrome
Journal of Neurophysiology 99:112–121.

https://doi.org/10.1152/jn.00826.2007
- PubMed
- Google Scholar
1. Megías M
2. Emri Z
3. Freund TF
4. Gulyás AI
(2001) Total number and distribution of inhibitory and excitatory synapses on hippocampal CA1 pyramidal cells
Neuroscience 102:527–540.

https://doi.org/10.1016/s0306-4522(00)00496-6
- PubMed
- Google Scholar
1. Memon ZA
2. Samad F
3. Awan ZR
4. Aziz A
5. Siddiqi SS
(2017)
Cpu-gpu processing

International Journal of Computer Science and Network Security 17:188–193.
- Google Scholar
1. Meredig B
2. Agrawal A
3. Kirklin S
4. Saal JE
5. Doak JW
6. Thompson A
7. Zhang K
8. Choudhary A
9. Wolverton C
(2014) Combinatorial screening for new materials in unconstrained composition space with machine learning
Physical Review B 89:094104.

https://doi.org/10.1103/PhysRevB.89.094104
- Google Scholar
Conference
(2018) Machine Learning Algorithms for Classification Geology Data from Well Logging
2018 14th International Conference on Electronics Computer and Computation (ICECCO).

https://doi.org/10.1109/ICECCO.2018.8634775
- Google Scholar
1. Meyer HS
2. Wimmer VC
3. Hemberger M
4. Bruno RM
5. de Kock CPJ
6. Frick A
7. Sakmann B
8. Helmstaedter M
(2010) Cell type-specific thalamic innervation in a column of rat vibrissal cortex
Cerebral Cortex 20:2287–2303.

https://doi.org/10.1093/cercor/bhq069
- PubMed
- Google Scholar
1. Migliore M
2. Cook EP
3. Jaffe DB
4. Turner DA
5. Johnston D
(1995) Computer simulations of morphologically reconstructed CA3 hippocampal neurons
Journal of Neurophysiology 73:1157–1168.

https://doi.org/10.1152/jn.1995.73.3.1157
- PubMed
- Google Scholar
(1999) Role of an A-type K+ conductance in the back-propagation of action potentials in the dendrites of hippocampal pyramidal neurons
Journal of Computational Neuroscience 7:5–15.

https://doi.org/10.1023/a:1008906225285
- PubMed
- Google Scholar
1. Migliore M
2. Shepherd GM
(2008) Dendritic action potentials connect distributed dendrodendritic microcircuits
Journal of Computational Neuroscience 24:207–221.

https://doi.org/10.1007/s10827-007-0051-9
- PubMed
- Google Scholar
Book
(2012) Neural networks: tricks of the trade
In: Montavon G, editors. Efficient Backprop. Berlin, Heidelberg: Springer. pp. 9–48.

https://doi.org/10.1007/978-3-642-35289-8
- Google Scholar
(2013) Machine learning of molecular electronic properties in chemical compound space
New Journal of Physics 15:095003.

https://doi.org/10.1088/1367-2630/15/9/095003
- Google Scholar
Website
1. Mutch J
(2010) CNS: a GPU-based framework for simulating cortically-organized networks MIT CSAIL
Accessed May 17, 2013.

http://128.30.100.62:8080/media/fb/ps/mit-csail-tr-2010-013.pdf
1. Na ES
2. Nelson ED
3. Adachi M
4. Autry AE
5. Mahgoub MA
6. Kavalali ET
7. Monteggia LM
(2012) A mouse model for MeCP2 duplication syndrome: MeCP2 overexpression impairs learning and memory and synaptic transmission
The Journal of Neuroscience 32:3109–3117.

https://doi.org/10.1523/JNEUROSCI.6000-11.2012
- PubMed
- Google Scholar
(2009) A configurable simulation environment for the efficient simulation of large-scale spiking neural networks on graphics processors
Neural Networks 22:791–800.

https://doi.org/10.1016/j.neunet.2009.06.028
- PubMed
- Google Scholar
(2008) Firing patterns in the adaptive exponential integrate-and-fire model
Biol Cybern 99:335–347.

https://doi.org/10.1007/s00422-008-0264-7
- PubMed
- Google Scholar
(2006) Mecp2-Dependent transcriptional repression regulates excitatory neurotransmission
Current Biology 16:710–716.

https://doi.org/10.1016/j.cub.2006.02.062
- PubMed
- Google Scholar
(2016a) Multitarget multiscale simulation for pharmacological treatment of dystonia in motor cortex
Frontiers in Pharmacology 7:157.

https://doi.org/10.3389/fphar.2016.00157
- PubMed
- Google Scholar
1. Neymotin SA
2. McDougal RA
3. Bulanova AS
4. Zeki M
5. Lakatos P
6. Terman D
7. Hines ML
8. Lytton WW
(2016b) Calcium regulation of HCN channels supports persistent activity in a multiscale model of neocortex
Neuroscience 316:344–366.

https://doi.org/10.1016/j.neuroscience.2015.12.043
- PubMed
- Google Scholar
Book
1. Nikolic D
(2006)
Temporal Dynamics of Information Content Carried by Neurons in the Primary Visual Cortex

NIPS.
- Google Scholar
1. Nowotny T
2. Cope AJ
3. Yavuz E
4. Stimberg M
5. Goodman DF
6. Marshall J
7. Gurney K
(2014) SpineML and brian 2.0 interfaces for using GPU enhanced neuronal networks (genn)
BMC Neuroscience 15:1–2.

https://doi.org/10.1186/1471-2202-15-S1-P148
- Google Scholar
(2020) Functional specification of CCK+ interneurons by alternative isoforms of kv4.3 auxiliary subunits
eLife 9:e58515.

https://doi.org/10.7554/eLife.58515
- PubMed
- Google Scholar
Preprint
(2021) Biophysical Kv Channel Alterations Dampen Excitability of Cortical PV Interneurons and Contribute to Network Hyperexcitability in Early Alzheimer’s
bioRxiv.

https://doi.org/10.1101/2021.10.25.465789
- Google Scholar
Software
1. Oláh VJ
(2022) Neuro_ANN, version swh:1:rev:52616946edd6489a967a645bbab805577b15ad7f
Software Heritage.

https://archive.softwareheritage.org/swh:1:dir:9a4bb33ea65af35384c87764f456648c12f08d78;origin=https://github.com/ViktorJOlah/Neuro_ANN;visit=swh:1:snp:dfa0fca6e35143608ab67fcea4ae9431cd7f7cf5;anchor=swh:1:rev:52616946edd6489a967a645bbab805577b15ad7f
Preprint
1. Oord A
(2016) Wavenet: A Generative Model for Raw Audio
arXiv.

https://arxiv.org/abs/1609.03499
- Google Scholar
(2003) Pyramidal neuron as two-layer neural network
Neuron 37:989–999.

https://doi.org/10.1016/s0896-6273(03)00149-1
- PubMed
- Google Scholar
(2019) Electroencephalographic spectral power as a marker of cortical function and disease severity in girls with Rett syndrome
Journal of Neurodevelopmental Disorders 11:1–14.

https://doi.org/10.1186/s11689-019-9275-z
- PubMed
- Google Scholar
1. Ros E
2. Carrillo R
3. Ortigosa EM
4. Barbour B
5. Agís R
(2006) Event-driven simulation scheme for spiking neural networks using lookup tables to characterize neuronal dynamics
Neural Computation 18:2959–2993.

https://doi.org/10.1162/neco.2006.18.12.2959
- PubMed
- Google Scholar
Preprint
1. Rössert C
(2016) Automated Point-Neuron Simplification of Data-Driven Microcircuit Models
arXiv.

https://arxiv.org/abs/1604.00087
- Google Scholar
(2014) Distinct Kv channel subtypes contribute to differences in spike signaling properties in the axon initial segment and presynaptic boutons of cerebellar interneurons
The Journal of Neuroscience 34:6611–6623.

https://doi.org/10.1523/JNEUROSCI.4208-13.2014
- PubMed
- Google Scholar
(1986) Learning representations by back-propagating errors
Nature 323:533–536.

https://doi.org/10.1038/323533a0
- Google Scholar
1. Sacerdote L
2. Giraudo MT
(2013) Stochastic integrate and fire models: a review on mathematical methods and their applications
Stochastic Biomathematical Models 1:99–148.

https://doi.org/10.1007/978-3-642-32157-3_5
- Google Scholar
Conference
1. Sanjay M
(2017)
Multiscale computer modeling of epilepsy

Computational Models of Brain Behavior.
- Google Scholar
(2005) Role of mossy fiber sprouting and mossy cell loss in hyperexcitability: a network model of the dentate gyrus incorporating cell types and axonal topography
Journal of Neurophysiology 93:437–453.

https://doi.org/10.1152/jn.00777.2004
- PubMed
- Google Scholar
1. Santosa F
2. Symes WW
(1986) Linear inversion of band-limited reflection seismograms
SIAM Journal on Scientific and Statistical Computing 7:1307–1330.

https://doi.org/10.1137/0907087
- Google Scholar
(1997) Calcium action potentials restricted to distal apical dendrites of rat neocortical pyramidal neurons
The Journal of Physiology 505 (Pt 3):605–616.

https://doi.org/10.1111/j.1469-7793.1997.605ba.x
- PubMed
- Google Scholar
(2000) NMDA spikes in basal dendrites of cortical pyramidal neurons
Nature 404:285–289.

https://doi.org/10.1038/35005094
- PubMed
- Google Scholar
(2018) Multi-scale account of the network structure of macaque visual cortex
Brain Structure & Function 223:1409–1435.

https://doi.org/10.1007/s00429-017-1554-4
- PubMed
- Google Scholar
Book
(2020) Machine Learning Meets Quantum Physics
Springer.

https://doi.org/10.1007/978-3-030-40245-7
- Google Scholar
1. Schwalger T
2. Chizhov AV
(2019) Mind the last spike - firing rate models for mesoscopic populations of spiking neurons
Current Opinion in Neurobiology 58:155–166.

https://doi.org/10.1016/j.conb.2019.08.003
- PubMed
- Google Scholar
(1988) Computational neuroscience
Science 241:1299–1306.

https://doi.org/10.1126/science.3045969
- PubMed
- Google Scholar
(2015) Physiology of layer 5 pyramidal neurons in mouse primary visual cortex: coincidence detection through bursting
PLOS Computational Biology 11:e1004090.

https://doi.org/10.1371/journal.pcbi.1004090
- PubMed
- Google Scholar
Conference
1. Sharma N
2. Sharma P
3. Irwin D
4. Shenoy P
(2011) Predicting solar generation from weather forecasts using machine learning
2011 IEEE international conference on smart grid communications (SmartGridComm.

https://doi.org/10.1109/SmartGridComm.2011.6102379
- Google Scholar
Preprint
1. Shi X
(2015) Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
arXiv.

https://dl.acm.org/doi/10.5555/2969239.2969329
- Google Scholar
Preprint
(2018) Reimplementation of the Potjans-Diesmann Cortical Microcircuit Model: From NEST to Brian
bioRxiv.

https://doi.org/10.1101/248401v1.full.pdf
- Google Scholar
1. Singer S
2. Nelder J
(2009) Nelder-mead algorithm
Scholarpedia 4:2928.

https://doi.org/10.4249/scholarpedia.2928
- Google Scholar
Conference
(2013) A Neuroscience Gateway XSEDE ’13
XSEDE ’13.

https://doi.org/10.1145/2484762.2484816
- Google Scholar
1. Sjöström PJ
2. Häusser M
(2006) A cooperative switch determines the sign of synaptic plasticity in distal dendrites of neocortical pyramidal neurons
Neuron 51:227–238.

https://doi.org/10.1016/j.neuron.2006.06.017
- PubMed
- Google Scholar
1. Smith SL
2. Smith IT
3. Branco T
4. Häusser M
(2013) Dendritic spikes enhance stimulus selectivity in cortical neurons in vivo
Nature 503:115–120.

https://doi.org/10.1038/nature12600
- PubMed
- Google Scholar
(2001) Epilepsy in a representative series of rett syndrome
Acta Paediatrica 90:34–39.

https://doi.org/10.1080/080352501750064842
- PubMed
- Google Scholar
1. Stuart G.
2. Spruston N
(1998)
Determinants of voltage attenuation in neocortical pyramidal neuron dendrites

The Journal of Neuroscience 18:3501–3510.
- PubMed
- Google Scholar
1. Stuart GJ
2. Spruston N
(2015) Dendritic integration: 60 years of progress
Nature Neuroscience 18:1713–1721.

https://doi.org/10.1038/nn.4157
- PubMed
- Google Scholar
Book
(2016) Dendrites
Oxford University Press.

https://doi.org/10.1093/acprof:oso/9780198745273.001.0001
- Google Scholar
1. Sun YJ
2. Kim Y-J
3. Ibrahim LA
4. Tao HW
5. Zhang LI
(2013) Synaptic mechanisms underlying functional dichotomy between intrinsic-bursting and regular-spiking neurons in auditory cortical layer 5
The Journal of Neuroscience 33:5326–5339.

https://doi.org/10.1523/JNEUROSCI.4810-12.2013
- PubMed
- Google Scholar
1. Takahashi H
2. Magee JC
(2009) Pathway interactions and synaptic plasticity in the dendritic tuft regions of CA1 pyramidal neurons
Neuron 62:102–111.

https://doi.org/10.1016/j.neuron.2009.03.007
- PubMed
- Google Scholar
(2016) Active cortical dendrites modulate perception
Science 354:1587–1590.

https://doi.org/10.1126/science.aah6066
- PubMed
- Google Scholar
1. Teeter C
2. Iyer R
3. Menon V
4. Gouwens N
5. Feng D
6. Berg J
7. Szafer A
8. Cain N
9. Zeng H
10. Hawrylycz M
11. Koch C
12. Mihalas S
(2018) Generalized leaky integrate-and-fire models classify multiple neuron types
Nature Communications 9:1–15.

https://doi.org/10.1038/s41467-017-02717-4
- PubMed
- Google Scholar
Conference
1. Thibeault CM
(2011)
A novel multi-GPU neural simulator

BICoB.
- Google Scholar
1. Traub RD
2. Contreras D
3. Cunningham MO
4. Murray H
5. LeBeau FEN
6. Roopun A
7. Bibbig A
8. Wilent WB
9. Higley MJ
10. Whittington MA
(2005) Single-column thalamocortical network model exhibiting gamma oscillations, sleep spindles, and epileptogenic bursts
Journal of Neurophysiology 93:2194–2232.

https://doi.org/10.1152/jn.00983.2004
- PubMed
- Google Scholar
1. Tsodyks MV
2. Markram H
(1997) The neural code between neocortical pyramidal neurons depends on neurotransmitter release probability
PNAS 94:719–723.

https://doi.org/10.1073/pnas.94.2.719
- PubMed
- Google Scholar
1. Turi GF
2. Li WK
3. Chavlis S
4. Pandi I
5. O’Hare J
6. Priestley JB
7. Grosmark AD
8. Liao Z
9. Ladow M
10. Zhang JF
11. Zemelman BV
12. Poirazi P
13. Losonczy A
(2019) Vasoactive intestinal polypeptide-expressing interneurons in the hippocampus support goal-oriented spatial learning
Neuron 101:1150–1165.

https://doi.org/10.1016/j.neuron.2019.01.009
- PubMed
- Google Scholar
(2018) Global and multiplexed dendritic computations under in vivo-like conditions
Neuron 100:579–592.

https://doi.org/10.1016/j.neuron.2018.08.032
- PubMed
- Google Scholar
(2001) Propagation of action potentials in dendrites depends on dendritic morphology
Journal of Neurophysiology 85:926–937.

https://doi.org/10.1152/jn.2001.85.2.926
- PubMed
- Google Scholar
1. Villette V
2. Chavarha M
3. Dimov IK
4. Bradley J
5. Pradhan L
6. Mathieu B
7. Evans SW
8. Chamberland S
9. Shi D
10. Yang R
11. Kim BB
12. Ayon A
13. Jalil A
14. St-Pierre F
15. Schnitzer MJ
16. Bi G
17. Toth K
18. Ding J
19. Dieudonné S
20. Lin MZ
(2019) Ultrafast two-photon imaging of a high-gain voltage indicator in awake behaving mice
Cell 179:1590–1608.

https://doi.org/10.1016/j.cell.2019.11.004
- PubMed
- Google Scholar
(2015) ANNarchy: a code generation approach to neural simulations on parallel hardware
Frontiers in Neuroinformatics 9:19.

https://doi.org/10.3389/fninf.2015.00019
- PubMed
- Google Scholar
Conference
(2017) Parallelizing Hines Matrix Solver in Neuron Simulations on GPU
2017 IEEE 24th International Conference on High Performance Computing (HiPC.

https://doi.org/10.1109/HiPC.2017.00051
- Google Scholar
1. Wang XJ
2. Buzsáki G
(1996)
Gamma oscillation by synaptic inhibition in a hippocampal interneuronal network model

The Journal of Neuroscience 16:6402–6413.
- PubMed
- Google Scholar
1. Wolpert DM
2. Ghahramani Z
(2000) Computational principles of movement neuroscience
Nature Neuroscience 3 Suppl:1212–1217.

https://doi.org/10.1038/81497
- PubMed
- Google Scholar
(2021) Data-driven reduction of dendritic morphologies with preserved dendro-somatic responses
eLife 10:e60936.

https://doi.org/10.7554/eLife.60936
- PubMed
- Google Scholar
(2016) GeNN: a code generation framework for accelerated brain simulations
Scientific Reports 6:1–14.

https://doi.org/10.1038/srep18854
- PubMed
- Google Scholar
1. Yu Y
2. McTavish TS
3. Hines ML
4. Shepherd GM
5. Valenti C
6. Migliore M
(2013) Sparse distributed representation of odors in a large-scale olfactory bulb circuit
PLOS Computational Biology 9:e1003014.

https://doi.org/10.1371/journal.pcbi.1003014
- PubMed
- Google Scholar
1. Zhang Z-W
2. Zak JD
3. Liu H
(2010) MeCP2 is required for normal development of gabaergic circuits in the thalamus
Journal of Neurophysiology 103:2470–2481.

https://doi.org/10.1152/jn.00601.2009
- PubMed
- Google Scholar
1. Zhang W
2. Peterson M
3. Beyer B
4. Frankel WN
5. Zhang Z
(2014) Loss of mecp2 from forebrain excitatory neurons leads to cortical hyperexcitation and seizures
The Journal of Neuroscience 34:2754–2763.

https://doi.org/10.1523/JNEUROSCI.4900-12.2014
- PubMed
- Google Scholar
1. Zhang X
2. Santaniello S
(2019) Role of cerebellar gabaergic dysfunctions in the origins of essential tremor
PNAS 116:13592–13601.

https://doi.org/10.1073/pnas.1817689116
- PubMed
- Google Scholar
1. Zhang C
2. Song D
3. Chen Y
4. Feng X
5. Lumezanu C
6. Cheng W
7. Ni J
8. Zong B
9. Chen H
10. Chawla NV
(2019) A deep neural network for unsupervised anomaly detection and diagnosis in multivariate time series data
Proceedings of the AAAI Conference on Artificial Intelligence 33:1409–1416.

https://doi.org/10.1609/aaai.v33i01.33011409
- Google Scholar
Conference
1. Zheng Y
2. Liu Q
3. Chen E
4. Ge Y
5. Zhao JL
(2014) Time series classification using multi-channels deep convolutional neural networks
International conference on web-age information management.

https://doi.org/10.1007/978-3-319-08010-9
- Google Scholar

Article and author information

Author details

Viktor J Oláh

Department of Cell Biology, Emory University School of Medicine, Atlanta, United States

Contribution
Conceptualization, Resources, Software, Formal analysis, Funding acquisition, Investigation, Methodology, Writing – original draft, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-2069-7525
Nigel P Pedersen

Department of Neurology, Emory University School of Medicine, Atlanta, United States

Contribution
Conceptualization, Resources, Software, Formal analysis, Funding acquisition, Investigation, Methodology, Writing – original draft, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-8494-0635
Matthew JM Rowan

Department of Cell Biology, Emory University School of Medicine, Atlanta, United States

Contribution
Resources, Software, Funding acquisition, Writing – original draft, Writing – review and editing

For correspondence
mjrowan@emory.edu

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0003-0955-0706

Funding

National Institutes of Health (R56-AG072473)

Matthew JM Rowan

Emory Alzheimer's Disease Research Center (00100569)

Matthew JM Rowan

CURE Epilepsy and the NIH (K08NS105929)

Nigel P Pedersen

National Institutes of Health (RF1-AG079269)

Matthew JM Rowan

Emory/Georgia Tech I3 Computational and Data analysis to Advance Single Cell Biology Research Award

Matthew JM Rowan

The funders had no role in study design, data collection, and interpretation, or the decision to submit the work for publication.

Acknowledgements

This work was supported by NIH grants R56-AG072473 (MJMR) and the Emory Alzheimer’s Disease Research Center Grant 00100569 (MJMR) with partial support (NPP) provided by CURE Epilepsy and the National Institutes of Health K08NS105929.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

2,658

views
271

downloads
9

citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Citations by DOI

9

citations for umbrella DOI https://doi.org/10.7554/eLife.79535

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Mendeley

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Viktor J Oláh
Nigel P Pedersen
Matthew JM Rowan

(2022)

Ultrafast simulation of large-scale neocortical microcircuitry with biophysically realistic neurons

eLife 11:e79535.

https://doi.org/10.7554/eLife.79535

Share this article

Cite this article

Single-compartmental neuronal simulations using artificial neural networks (ANNs).

Ideal generalization of the convolutional neural network-long short-term memory (CNN-LSTM).

Convolutional neural network-long short-term memory (CNN-LSTM) prediction of neuronal mechanisms beyond somatic membrane potential.

Accurate representation of nonlinear synaptic activation by convolutional neural network-long short-term memory (CNN-LSTM).

Multicompartmental simulation representation by convolutional neural network-long short-term memory (CNN-LSTM).

Firing pattern representation with custom artificial neural network (ANN) layer.

Orders of magnitude faster simulation times with convolutional neural network-long short-term memory (CNN-LSTM).

Efficient parameter-space mapping with convolutional neural network-long short-term memory (CNN-LSTM) reveals a joint effect of recurrent connectivity and E/I balance on network stability and efficacy in Rett syndrome.

Recurrent connectivity and excitatory drive jointly define network stability in a reduced level 5 (L5) cortical network.

Author details

Viktor J Oláh

Contribution

Competing interests

Nigel P Pedersen

Contribution

Competing interests

Matthew JM Rowan

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Further reading