Demixed principal component analysis of neural population data

  1. Dmitry Kobak
  2. Wieland Brendel
  3. Christos Constantinidis
  4. Claudia E Feierstein
  5. Adam Kepecs
  6. Zachary F Mainen
  7. Xue-Lian Qi
  8. Ranulfo Romo
  9. Naoshige Uchida
  10. Christian K Machens  Is a corresponding author
  1. Champalimaud Centre for the Unknown, Portugal
  2. École Normale Supérieure, France
  3. Centre for Integrative Neuroscience, University of Tübingen, Germany
  4. Wake Forest University School of Medicine, United States
  5. Cold Spring Harbor Laboratory, United States
  6. Universidad Nacional Autónoma de México, Mexico
  7. El Colegio Nacional, Mexico
  8. Harvard University, United States
15 figures, 2 videos and 1 table

Figures

Existing approaches to population analysis, illustrated with recordings from monkey PFC during a somatosensory working memory task (Romo et al., 1999).

(a) Cartoon of the paradigm, adapted from Romo and Salinas (2003). Legend shows 12 experimental conditions. (b) Average per-condition firing rates (PSTHs) for four exemplary neurons out of N=832. …

https://doi.org/10.7554/eLife.10989.003
Linear dimensionality reduction.

(a) Linear discriminant analysis maps the firing rates of individual neurons onto a latent component that allows us to decode a task parameter of interest. Shades of grey inside each neuron show the …

https://doi.org/10.7554/eLife.10989.004
Demixed PCA applied to recordings from monkey PFC during a somatosensory working memory task (Romo et al., 1999).

(a) Cartoon of the paradigm, adapted from Romo and Salinas (2003). (b) Demixed principal components. Top row: first three condition-independent components; second row: first three stimulus …

https://doi.org/10.7554/eLife.10989.005
Demixed PCA applied to recordings from monkey PFC during a visuospatial working memory task (Qi et al., 2011).

Same format as Figure 3. (a) Cartoon of the paradigm, adapted from Romo and Salinas (2003). (b) Demixed principal components. In each subplot there are ten lines corresponding to ten conditions (see …

https://doi.org/10.7554/eLife.10989.007
Demixed PCA applied to recordings from rat OFC during an olfactory discrimination task (Feierstein et al., 2006).

Same format as Figure 3. (a) Cartoon of the paradigm, adapted from Wang et al. (2013). (b) Each subplot shows one demixed principal component. In each subplot there are four lines corresponding to …

https://doi.org/10.7554/eLife.10989.008
Demixed PCA applied to recordings from rat OFC during an olfactory categorization task (Kepecs et al., 2008).

Same format as Figure 3 (a) Cartoon of the paradigm, adapted from Wang et al. (2013). (b) Each subplot shows one demixed principal component. In each subplot there are ten lines corresponding to ten …

https://doi.org/10.7554/eLife.10989.009
Encoder weights for the leading dPCA components across the neural population.

(a) Distributions of encoder weights for the 15 leading dPCA components across the neural population, in each of the four datasets. Each subplot shows 15 probability density curves, one curve per …

https://doi.org/10.7554/eLife.10989.010
Marginalization procedure.

PSTHs of three exemplary neurons from the somatosensory working memory task decomposed into marginalizations.

https://doi.org/10.7554/eLife.10989.012
Balanced and unbalanced data.

(a) In this toy example there are two task parameters (factors), with two possible values each. Parameter A (left) is represented by the size of the dot, parameter B (middle) is represented by the …

https://doi.org/10.7554/eLife.10989.014
Re-stretching (time warping) procedure.

We defined several alignment events (such as odour poke in, odour poke out, etc.) and for each trial found the times ti of these events. After aligning all trials on t1=0 (left) we computed median …

https://doi.org/10.7554/eLife.10989.015
Cross-validation errors depending on the regularization parameter λ.

Each subplot corresponds to one dataset and shows mean (solid lines) and min/max (boundaries of shaded regions) of the relative cross-validation errors for ten repetitions. Different colors refer to …

https://doi.org/10.7554/eLife.10989.016
Cross-validated time-dependent classification accuracies of linear classifiers (black lines) given by the first three stimulus/decision/interaction dPCs (columns) in each dataset (rows).

Shaded gray regions show distribution of classification accuracies expected by chance as estimated by 100 iterations of shuffling procedure.

https://doi.org/10.7554/eLife.10989.017
Toy example illustrating the pseudo-inverse intuition.

(a) Firing rate trajectories of two neurons for three different stimuli. (b) Same data with dPCA decoding and encoding axes. The encoding axes are approximately equivalent to the axes of the …

https://doi.org/10.7554/eLife.10989.018
Some demixed components as given by three different demixing methods (rows) in various datasets and marginalizations (columns).

Empty subplots mean that the corresponding method did not find any components. All projections were z-scored to make them of the same scale. Barplots on the right show fractions of variance in each …

https://doi.org/10.7554/eLife.10989.019
Fourier-like artifacts in PCA.

(Left) In this toy example, single neuron responses are generated from the same underlying Gaussian but are randomly shifted in time. (Right) First three PCA components of the population data. While …

https://doi.org/10.7554/eLife.10989.020

Videos

Video 1
Stimulus representation in the somatosensory working memory task

Two leading stimulus dPCs in the somatosensory working memory task (components #5 and #10 as horizontal and vertical axis correspondingly). Each frame of this movie corresponds to one time point t. …

https://doi.org/10.7554/eLife.10989.006
Video 2
Illustration of the dPCA algorithm.

Illustration of the dPCA algorithm using the somatosensory working memory task.

https://doi.org/10.7554/eLife.10989.013

Tables

Table 1

Demixed PCA in comparison with existing methods. Columns: 'Signif.' refers to the method of counting significantly tuned cells, as shown in Figure 1c–e. TDR refers to the 'targeted dimensionality …

https://doi.org/10.7554/eLife.10989.011
Signif.TDRLDAPCAFAGPFAjPCALDSdPCA
Takes task parameters into account & provides summary statistics of population tuning
Allows to reconstruct neural firing (captures variance)
Based on dynamical model
Based on probabilistic model
Takes spike trains as input

Download links