Whole-brain comparison of rodent and human brains using spatial transcriptomics

  1. Antoine Beauchamp  Is a corresponding author
  2. Yohan Yee
  3. Ben C Darwin
  4. Armin Raznahan
  5. Rogier B Mars  Is a corresponding author
  6. Jason P Lerch  Is a corresponding author
  1. The Hospital for Sick Children, Canada
  2. Mouse Imaging Centre, Canada
  3. Department of Medical Biophysics, University of Toronto, Canada
  4. Section on Developmental Neurogenomics, Human Genetics Branch, National Institute of Mental Health Intramural Research Program, United States
  5. Wellcome Centre for Integrative Neuroimaging, Centre for Functional MRI of the Brain (FMRIB), Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, United Kingdom
  6. Donders Institute for Brain, Cognition and Behaviour, Radboud University Nijmegen, Netherlands
8 figures and 2 additional files

Figures

Transcriptomic similarity in the mouse and human brains.

(A) Similarity matrix displaying the correlation between 67 mouse regions and 88 human regions based on the expression of 2835 homologous genes. Columns are annotated with 11 broad mouse regions: cortical subplate (CTXsp), olfactory areas (OLF), hippocampal formation (HPF), isocortex, cerebral nuclei (CNU), interbrain (IB), midbrain (MB), pons (P), medulla (MY), cerebellar cortex (CBX), and cerebellar nuclei (CBN). Rows are annotated with 16 broad human regions: claustrum (Cl), limbic lobe (LL), frontal lobe (FL), insula (Ins), occipital lobe (OL), parietal lobe (PL), temporal lobe (TL), amygdala (Amg), basal ganglia (BG), basal forebrain (BF), diencephalon (DIE), mesencephalon (MES), pons, myelencephalon (MY), cerebellar cortex (CbCx), and cerebellar nuclei (CbN). Broad patterns of similarity are evident between coarsely defined brain regions, while correlation patterns are mostly homogeneous within these regions. (B) Mouse brain coronal slices showing similarity profiles for the human precentral gyrus, cuneus, and crus I. Correlation patterns for the precentral gyrus and cuneus are highly similar to one another and broadly similar to most isocortical regions. The crus I is homogeneously similar to the mouse cerebellum. (C) Anatomically ordered line charts displaying the similarity profiles for the seed regions in (B). Dashed vertical lines indicate the canonical mouse homologue for each human seed. Annotation colors correspond to atlas colors from the Allen Mouse Brain Atlas and Allen Human Brain Atlas for mouse and human regions, respectively.

Figure 1—source data 1

Mouse-human similarity matrix using homologous genes.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig1-data1-v2.csv
Figure 2 with 1 supplement
Creating a new common space.

(A) Voxel-wise expression maps from 2835 homologous genes in the Allen Mouse Brain Atlas were used to train the neural network to classify each mouse voxel into one of the 67 atlas regions. (B) Once the network is trained, the output layer is removed. The mouse and human regional gene expression matrices are passed through the network, resulting in lower-dimensional latent space representations of the data. The training and transformation process were repeated 500 times. (C) A similarity matrix displaying the gene expression latent space correlation between mouse and human regions, averaged over 500 neural network training runs. Similar brain regions exhibit very high correlation values. Column and row annotations as described in Figure 1.

Figure 2—source data 1

Correlations between mouse and human brain regions in all latent spaces (1 of 3), related to Figure 2C.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig2-data1-v2.csv
Figure 2—source data 2

Correlations between mouse and human brain regions in all latent spaces (2 of 3), related to Figure 2C.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig2-data2-v2.csv
Figure 2—source data 3

Correlations between mouse and human brain regions in all latent spaces (3 of 3), related to Figure 2C.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig2-data3-v2.csv
Figure 2—figure supplement 1
Multi-layer perceptron feature importance for the classification of the caudoputamen (A), the primary motor area (B), and the infralimbic area (C).

Top row: Rank-ordered distributions of feature importance for the 3 example regions, averaged over 200 training runs. While the perceptron relies on information from all input genes, a reduced subset of genes is often more informative for the classification of a given label. Bottom row: Coronal slice series displaying the normalized expression patterns for the two genes most informative in the classification of example regions. The spatial expression patterns can be specific to the region of interest, but this is not necessarily the case.

Quantifying improvement in locality in gene expression latent space.

(A) The amount of local signal within a broadly similar region of the brain for a finer seed region’s (e.g. primary motor area) similarity profile can be quantified by the decay rate of the head of the rank-ordered profile. Decay rate was quantified by computing the rank at a similarity of 0.75. This metric was compared between the initial gene expression space (orange line) and every gene expression latent space resulting from repeated training of the neural network (every blue line is a training outcome, heavy blue line serves as an example). A negative difference between these rank metrics indicates an improvement in locality in the latent space. (B) Structure-wise distributions of differences in rank at a similarity of 0.75 between the initial gene expression space and the gene expression latent spaces. Points and error bars represent mean and 95% CI with n = 500. Dashed black line at 0 indicates the threshold for improvement in one space over the other. Colors correspond to Allen Mouse Brain Atlas annotations as in Figures 1 and 2. Binomial likelihood (logistic regression) estimate of pB=0.78 with 95% CI [0.66, 0.86]. The probability of obtaining at least these many successes under the null binomial distribution, B67,0.5 , is p=8.64 · 10−7 . (C) Proportion of perceptron training runs resulting in an improvement or null difference in the gene expression latent space compared with the initial space, estimated using region-wise logistic regressions. Cortical and cerebellar regions exhibit high proportions of improvement, while subcortical regions are less likely to be improved by the classification process.

Figure 3—source data 1

Scaled similarity profiles of the mouse primary motor area, related to Figure 3A.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig3-data1-v2.csv
Figure 3—source data 2

Ranks at a similarity of 0.75 for mouse regions in the homologous gene space and all latent spaces, related to Figure 3B.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig3-data2-v2.csv
Figure 3—source data 3

Logistic regression model estimates for mouse regions, related to Figure 3C.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig3-data3-v2.csv
Recovering canonical neuroanatomical pairs in gene expression space.

(A) Comparison between the ranks of canonical human matches for mouse seed regions between the initial gene expression space and gene expression latent spaces. Points and error bars represent mean and 95% CI with n = 500. Mouse region names are colored according to the Allen Mouse Brain Atlas palette. Binomial likelihood estimate of p=0.64 with 95% CI [0.47, 0.78]. The probability of obtaining at least thse many successes under the null binomial distribution, B36,0.5 , is p=0.033. (B) Proportion of latent spaces resulting in an improvement or null difference compared with the initial gene space, estimated using region-wise logistic regressions. Uncolored voxels correspond to regions with no established canonical human match.

Figure 4—source data 1

Ranks of canonical neuroanatomical pairs for mouse regions in the homologous gene space and all latent spaces, related to Figure 4A.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig4-data1-v2.csv
Figure 4—source data 2

Logistic regression model estimates for mouse regions, related to Figure 4B.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig4-data2-v2.csv
Figure 5 with 2 supplements
Similarity of mouse-human isocortical regions.

(A) Maximal correlation distributions of mouse isocortical regions. Points and error bars represent mean and 95% CI over n = 500 latent space samples. Linear regression using average maximal correlation values: β=-0.042, 95% CI -0.087,0.003 , t17=-1.854, p=0.0812. (B) Distributions of average maximal correlation for sensorimotor and supramodal isocortical areas in each gene expression latent space. Gray lines correspond to individual latent spaces. Linear mixed-effects regression: β=-0.042, 95% CI -0.044,-0.040 , t499=-49.9, p<2 · 10−16 . (C) Hierarchical clustering of mouse and human isocortical regions based on average latent space correlation values. Mouse regions are annotated as sensorimotor or supramodal. Four clusters were chosen for visualization using the elbow method. (D) Within-cluster sum of squared distances for different numbers of mouse and human isocortical clusters in the average latent space and initial homologous gene space.

Figure 5—source data 1

Maximal correlations of mouse isocortical regions in all latent spaces, related to Figure 5A and B.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig5-data1-v2.csv
Figure 5—source data 2

Correlations between mouse and human isocortical regions in all latent spaces, related to Figure 5C.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig5-data2-v2.csv
Figure 5—source data 3

Scree plot data, related to Figure 5D.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig5-data3-v2.csv
Figure 5—figure supplement 1
Comparison between the ranks of canonical human matches for mouse cortical seed regions in various gene expression spaces.

Points and error bars represent mean and 95% CI with n = 500. Intervals are truncated at a minimal rank of 1. The original latent spaces are the latent spaces obtained from classifying all 67 mouse brain regions. The cortex latent spaces are the latent spaces obtained from classifying 19 mouse cortical regions only.

Figure 5—figure supplement 2
Similarity of mouse-human isocortical regions in latent spaces obtained using only cortical labels.

(A) Maximal correlation distributions of mouse isocortical regions. Points and error bars represent mean and 95% CI over n = 500 latent space samples. (B) Distributions of average maximal correlation for sensorimotor and supramodal isocortical areas in each cortical latent space. Gray lines correspond to individual latent spaces. (C) Hierarchical clustering of mouse and human isocortical regions based on average latent space correlation values. Mouse regions are annotated as sensorimotor or supramodal. (D) Within-cluster sum of squared distances for different numbers of mouse and human isocortical clusters in the average cortical latent space and initial homologous gene space.

Similarity among mouse and human striatal regions.

(A) Distributions over gene expression latent spaces of region-wise average correlation values for mouse and human striatal pairs. Human regions were chosen based on the Allen Human Brain Atlas ontology. Mouse target regions were chosen to be those with the highest average correlation values. (B) Latent space averaged correlations between voxels in the mouse striatum and human target regions. Target regions were selected based on the highest mean correlation across all striatal voxels. (C) Proportions of latent spaces in which mouse striatal voxels are maximally similar to human target regions.

Figure 6—source data 1

Correlations between human striatal samples and mouse striatal targets in all latent spaces, related to Figure 6A.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig6-data1-v2.csv
Figure 6—source data 2

Average latent space correlations of mouse striatal voxels with human regions, related to Figure 6B.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig6-data2-v2.csv
Figure 6—source data 3

Maximal correlations of mouse striatal voxels in all latent spaces (1 of 3), related to Figure 6C.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig6-data3-v2.csv
Figure 6—source data 4

Maximal correlations of mouse striatal voxels in all latent spaces (2 of 3), related to Figure 6C.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig6-data4-v2.csv
Figure 6—source data 5

Maximal correlations of mouse striatal voxels in all latent spaces (3 of 3), related to Figure 6C.

https://cdn.elifesciences.org/articles/79418/elife-79418-fig6-data5-v2.csv
Author response image 1
Author response image 2

Additional files

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Antoine Beauchamp
  2. Yohan Yee
  3. Ben C Darwin
  4. Armin Raznahan
  5. Rogier B Mars
  6. Jason P Lerch
(2022)
Whole-brain comparison of rodent and human brains using spatial transcriptomics
eLife 11:e79418.
https://doi.org/10.7554/eLife.79418