ANTIPODE Provides a Global View of Cell Type Homology and Transcriptomic Divergence in the Developing Mammalian Brain

Matthew T Schmitz; Jingwen W Ding; Sara Nolbrant; Reed McMullen; Chang N Kim; Bryan J Pavlovic; Tomasz J Nowakowski; Trygve E Bakken; Chun Jimmie Ye; Alex A Pollen

doi:10.7554/eLife.109659.1

eLife Assessment

This valuable study is an approach to integrating and comparing single-cell genomics data across species. The evidence supporting the conclusions of this work is solid, and ANTIPODE presents an updated methodological approach to determining how gene expression at the cell-type level has evolved. Thus, ANTIPODE should provide broad utility to studies of comparative neurogenomics and be of use to neuroscientists and evolutionary biologists.

https://doi.org/10.7554/eLife.109659.1.sa3

Significance of findings

valuable: Findings that have theoretical or practical implications for a subfield

landmark
fundamental
important
valuable
useful

Strength of evidence

solid: Methods, data and analyses broadly support the claims with only minor weaknesses

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

Diverse neurons and glia are generated in conserved spatial and temporal sequences during mammalian brain development. Divergence in gene regulatory networks can alter brain composition, scaling, timing, and function. However, resolving the identity, extent, and principles of gene regulatory divergence requires cellular-resolution surveys spanning brain regions and species and improved methods for defining cell type homologies. Here, we present ANTIPODE, a deep-learning variational inference framework that simultaneously integrates single-cell datasets, identifies homologous cell types, and parcellates differential expression across cell types, modules, and covariance. Applying ANTIPODE to a census of the whole developing macaque brain and a meta-atlas of human, macaque, and mouse brain development, we find broad conservation of initial neuron classes but widespread regulatory divergence within homologous types, shaped by genomic context, cell lineage, and developmental timing. Together, ANTIPODE provides a formalized and interpretable framework for cross-species single-cell analysis and reveals principles of gene regulatory divergence in mammalian brain evolution.

Introduction

The structure of the mammalian brain follows a conserved bauplan more than 150 million years old.(Glenn Northcutt and Kaas, 1995; Striedter, 2004) Across the rostrocaudal extent of the neural tube, progenitor expansion is followed by neurogenesis and gliogenesis, however, this process proceeds at vastly different rates across species and among different regions of the developing brain.(Charvet et al., 2011; Qian et al., 2000; Workman et al., 2013) The orchestration of progenitor division and progeny differentiation yields the construction of brains that vary widely in shape, size, and function. Understanding the diversity and evolution of cell types in the developing brain is further complicated by the cyclical nature of progenitor renewal, spatiotemporal state gradients, and the cascade of quasi-parallel differentiation trajectories of cellular maturation originating from myriad distinct germinal zones.(Cadwell et al., 2019; Javed et al., 2025; Nowakowski et al., 2017; Puelles et al., 2013)

Interspecies comparisons of gene expression across homologous cell types could provide insights into the conserved and divergent properties of mammalian brain development. Recent analytical strategies now enable predicting homologous cell types from single-cell gene expression.(Haghverdi et al., 2018; Hie et al., 2024; Johansen and Quon, 2019; Korsunsky et al., 2019; Tarashansky et al., 2021; Welch et al., 2019; Xu et al., 2021) However, homology determination often relies on heuristics such as shared nearest neighbors and nonlinear data transformations and is particularly challenging across continuous developmental trajectories. (Colonna et al., 2024) Similarly, determination of cell type-specific gene expression divergence relies on homology and is sensitive to altered distributions in gene expression.(Zhou et al., 2019) Moreover, gene expression divergence can be partitioned into distinct modes, including alterations in cell type-specific gene expression, gene covariance, and gene module expression. These modes, however, are obscured by analytical strategies for differential expression that only consider a few cell types or brain regions in isolation.(Bakken et al., 2021; Harris et al., 2021; Ovens et al., 2021) We reasoned that a model explicitly accounting for categories of gene expression divergence and developmental trajectories applied to a multi-species whole brain dataset could simultaneously reveal cell type homology and putative mechanisms driving cell type evolution.

We introduce ANTIPODE, Ancestral Node Taxonomy Inference by Parcellation Of Differential Expression, a model that considers inferring cell type homology via integration and modeling cross-species differential expression as two sides of the same coin. Applying ANTIPODE to a census of rhesus macaque whole brain development and a meta-analysis of human and mouse brain development, we uncover profound conservation of the developmental Euarchontoglire transcriptomic bauplan and elucidate the spatiotemporal progression of the progenitors which build the structures of these divergent brains. In this process, we reveal the gene expression architecture underlying cross-species and cross-regional heterochrony and find transcriptomic correlates for the classic "later is larger" principle of vertebrate brain development.(Finlay and Darlington, 1995)

Results

Construction of a cross-species developmental brain meta-atlas

To systematically explore the evolution of developmental cell types across species, we constructed a meta-atlas by integrating data that we generated from the developing rhesus macaque whole brain (Figure 1a) and 13 datasets (see Methods, Supplementary table 1) totaling 1,854,767 cells from 420 10X genomics scRNA-seq ports. These samples span periods of neurogenesis across regions and species: post-conception day (PCD) 10-21 in mouse, PCD 40-110 in macaque and PCD 21-175 in human (Figure 1b). We uniformly reprocessed these datasets by obtaining raw reads and quantifying genes using Kallisto to mitigate alignment and reference-based artefacts. After processing steps, including ambient RNA, doublet, and non-brain cell type removal (Methods), we developed a unified cross-species cellular atlas.

To identify homologous cell types, we introduce ANTIPODE, a novel variational inference model built using the pyro probabilistic programming language and inspired by scANVI, but designed specifically for simultaneous cross-species integration, de novo clustering and differential gene expression analysis (see full specification in Supplement 1).(Bingham et al., 2019; Xu et al., 2021) We modeled the expression of each gene in a cell type for a given species as a function of cell type-specific divergence, differential covariation, and differential module expression from a shared ancestral manifold of development (Figure 1c). To solve for these parameters while simultaneously clustering, we employed a maximum a posteriori variational autoencoder. This model applies the structural constraint that the discrete clusters and latent space generated by the encoder/classifier are decoded bilinearly using only the latent space, a single layer of weights and the differential expression parcels which are regularized by Laplace prior distributions (Figure 1b). This is in contrast to the typical nonlinear multi-layer perceptron decoder employed by most variational autoencoder models.(Lopez et al., 2018; Xu et al., 2021) In addition, the bilinear decoding of this model still allows removal of the linear effects of differentiation trajectories and secondary covariates like cell cycle phase, “regressing out” these effects in order to reconstruct observed expression values, which are modeled by a negative binomial distribution (Figure 1c).

We applied ANTIPODE to the developing brain meta-atlas to build a consensus taxonomy. Starting from 600 initialized clusters, the model identified 380 discrete state clusters. We next grouped these clusters into 98 initial classes, comprising the distinct progenitors and immediately post-mitotic transcriptional states in the developing brain (Figure 1e, Figure 2). (Schmitz et al., 2022) Clustering based on this method improved mixing between species and revealed normal distributions differential expression and covariance (Figure S2).

Taxonomy and ANTIPODE model architecture.
Heat-map summary of the 98 initial classes. For each class (rows) we display (left-to-right): regional abundance across broad areas, total cell count, Spearman correlation of gene expression across pairs of species, mean absolute inter-species log fold change (“ANTIPODE divergence”). Row dendrogram shows Kendall correlation hierarchical clustering based on ANTIPODE latent space.

We next interpreted the cell identities according to enriched genes and anatomy (see Supplementary Table 2 for clusters, initial classes, and explanations). Briefly, non-neuronal cells were grouped into divisions: neuroepithelial/neural progenitor cells (NPCs); various types of HES6+ intermediate progenitor cells/neurons (IPC); glial progenitor cells (GPCs); vascular cells including angioblasts, endothelial cells, pericytes, smooth muscle cells (SMC); mesenchyme including arachnoid barrier cells (ABC), vascular/leptomeningeal cells, (VLMC); choroid plexus cells; oligodendrocytes; astrocytes including radial glia-like telencephalic astrocytes and non-telencephalic astrocytes; ependymal cells; microglia (MG); and other immune cells.(Rowitch, 2004) Newborn neuronal classes (TUBB3+) were similarly organized into divisions: ventral and dorsal telencephalic neurons (TEv, TEd); dorsal thalamic and mesencephalic neurons (DEd, MEd); secondary prosencephalic neurons (PEv) including the DLX+ hypothalamic fields; hypothalamic glutamatergic neurons (Hypo_Glut); diencephalic/mesencephalic GABAergic-like neurons (DE-ME_GABA); rhombencephalic GABAergic and/or glycinergic neurons; brainstem motor and sensory neuron classes; and heterogeneous groups termed diencephalic-mesencephalic-rhombencephalic (DMR) neurons (Figure 1c,2).(Bulfone et al., 1993; Moreno and González, 2011; Nieuwenhuys and Puelles, 2015) More mature neurons of arising from these classes were not captured, likely due depletion of larger neurons during whole cell dissociation. We provide supervised markers in the initial class name and conserved markers based on the minimum natural log fold change (LFC) in an initial class compared to the 99th percentile of expression in all other initial classes.(Figure 2,S3). Together, these initial classes, though likely not exhaustive, provide a foundation of homologous cell types for comparative analysis of Euarchontogliran development.

To link developing initial classes to putative terminal class derivatives in the adult brain, we used the list of identity-defining transcription factors (TFs) proposed in Yao et al. and performed pairwise Pearson correlation of cell type means to find them most correlated developmental and adult correspondences (Figure S4).(Yao et al., 2023) The supervised adult list qualitatively outperformed unsupervised lists of TFs filtered by variance and bimodality (data not shown), and we note that TF expression appeared more binary in adult cell types compared to developmental initial classes (Figure S5). In absence of whole-brain fate mapping data, we used developmental vs adult transcription factor correlations to assess the coverage of initial classes corresponding to adult subclasses. Fewer than 20 of 337 adult subclasses (mostly from medulla) have a developing initial class correlation less than 0.33 (greater than the 0.95 quantile of non-max correlations), supporting candidate linkages showing coverage of the precursors of most adult types in our atlas (Figure S4). Across the entire brain, among the initial classes we define, we do not find evidence of the absence of any initial class in any species, consistent with recent studies of telencephalic inhibitory neurons.(Corrigan et al., 2024) We do note, however, that these initial classes represent only a fraction of the adult diversity of mature cell types. For example, caudal ganglionic eminence-derived (CGE) cortical GABAergic interneuron initial classes appear uniform in prenatal development but diverge substantially into multiple subclasses (e.g., SNCG, PAX6, LAMP5, LAMP5/LHX6, VIP) and many more types in the adult brain. Similarly, undifferentiated developmental classes likely mask further developmental diversity. This is especially true for classes like DMR_Glut or DMR_GABA, which contain cells from multiple regions of the ventral brainstem, and appear to correspond roughly to the diverse neuron types previously described as forming “splatter” clusters in the adult human brain.(Siletti et al., 2023)

ANTIPODE as a cross-species integration method and differential expression model

To provide additional validation of the ANTIPODE model as an integration method, we tested its capacity for batch correction and preservation of biological variation on two additional cross-species test datasets. One dataset included snRNAseq data from many human cortical areas along with many primates from a single area, and the other included adult retina snRNAseq from 13 highly diverged mammals.(Hahn et al., 2023; Jorstad et al., 2023b, 2023a) ANTIPODE demonstrated superior performance in cross-species integration tasks compared to established methods such as Harmony and SCANORAMA, and performed comparably to SCVI, despite these other methods utilizing much stronger nonlinear integration approaches (Figure 3b-h, S6). (Hie et al., 2024; Korsunsky et al., 2019; Lopez et al., 2018) Crucially, ANTIPODE also simultaneously provides clustering and differential gene expression analysis during its fitting procedure, offering practical advantages over other models requiring sequential analytical steps. As such, ANTIPODE was effective at the reconstruction of log expression values, with an R² value of 0.90 for log expression means across all species, types and genes (Figure S2). On the other hand, because ANTIPODE models real differential expression in log fold change space, it is likely not suitable for nonlinear integration tasks such as integration of single-cell RNA data with single nucleus or spatial transcriptomics data.

Benchmarking ANTIPODE against existing cross-species integration methods.
a, Schematic detailing the benchmarking strategy for evaluating ANTIPODE as an integration method. **b–e,** Bubble plots summarizing integration metrics calculated by scib for: (b) mammalian retinal ganglion cells from Hahn et al.,, (*c,d*) cortical cross-areal (XA) and cross-species (XS) clusters from combined Jorstad et al. and Jorstad et al.. (f) whole developing brain initial classes from the current study **f–h,** Aggregated integration space k-nearest neighbor classification entropy for each method’s latent space calculated for (f) mammalian retinal ganglion cells from Hahn et al.,(g) combined cortical cross-areal (XA) and cross-species (XS) clusters from combined Jorstad et al. and Jorstad et al., (h) whole developing brain initial classes from the current study. Boxplots show median and inter-quartile range.

Principles of evolutionary divergence in developmental gene expression

Approximately 71% of genes exhibited at least a two-fold expression change between at least one pair of species, violating typical assumptions underlying null models for differential expression (DE) analysis (Figure 4a). Moreover, continuous cellular identity changes during development make instantaneous gene expression dispersion estimates challenging. Therefore, instead of focusing on specific differentially expressed genes, we examined broad evolutionary trends across developmental gene expression patterns by jointly modeling gene expression across developmental states and species-specific divergence by differential expression categories (Figure 4b). We used ANTIPODE’s learned parameters to separate 4 categories of differential expression (DE), where differential-by-all (DA) is a species intercept representing DE across all cell types, differential-by-identity (DI) representing DE in a single-cell type, differential-by-module (DM) representing DE of an entire module of genes in a single type and differential-by-coexpression (DC) representing differential module membership of genes across species (Figure 4b). We found that in our model, the effects of DM and DC are correlated, as these are multiplied together, with Spearman correlations of effects within taxons around 0.7, while the correlation of either to DI is much lower (Figure S7d).

Modes and landscape of evolutionary differential expression.
a, Cumulative distribution showing proportion of genes by their per species log fold change. 71.6% exceed |log2FC > 1 in at least one pairwise species comparison (based on raw log expression, not ANTIPODE fit). b, Conceptual schematic of four categories of differential expression learned by ANTIPODE: DA (differential-by-all states intercept), DM (by-module), DC (by-co-expression) and DI (by-identity intercept). c, Boxplot showing median and quartile divergence (overall DE) for each ANTIPODE cluster, grouped by division. d, ANTIPODE UMAP colored by each cluster’s summary divergence (mean |LFC for all categories combined).

We observed that microglia showed the highest divergence among brain cell types across species, with ependymal cells also having higher overall DE divergence, aligning with prior reports, while other endogenous glia like astrocytes and oligodendrocytes diverge similarly to most neurons despite having higher divergence in adult cortex (Figure 4c,d).(Jorstad et al., 2023a) Intriguingly, the small number of brain-exogenous immune cells captured also displayed similarly high divergence, suggesting rapid microglial evolution may be attributed more to their immune/myeloid lineage rather than to brain-specific selective pressures. We also found that as expected, gene expression divergence tends to increase in neurons relative to progenitors (Figure 4c,d).

We next looked for associations between genomic context and the strength of divergence across categories. The strongest association was between gene length and DA (slope LFC/log length), with large genes showing a strong reduction in DA compared to smaller genes (Figure 5a,b). Non-TF genes and TFs not expressed in both developing initial and mature classes had similar divergences across categories, and were the most DA. We also found that shared TFs for non-neuronal cells displayed greater divergence in general than neuronal TFs in DC and DM, while neurons tended to have higher DI, possibly due to their diversity and relatively fewer clusters per initial class (Figure 5c). Additionally, genes located in conserved syntenic genomic regions exhibited coordinated, modular differential expression, whereas genes in non-syntenic regions tended to be broadly and diffusely differentially expressed across many/all cell types (Figure 5d).

Gene expression evolution in context.
a, Gene size vs DA for all genes. Line of best fit calculated by scipy linregress. b, Best fit line slopes between divergence categories vs 0-1 scaled log10 gene size and 0-1 scaled log10 intergenic distance. c, Divergence of DA, DI, DM and DC sets split by gene class (Shared Neu TF refers to TFs with shared expression in development and adult in neurons, Shared NN TF refers to TFs with shared expression in development and adult in nonneurons, TFs refers to TFs not shared between development and adult, Non-TF refers to all other genes not in one of the other categories). P values are Bonferroni corrected two sided Mann-Whitney test, from python’s statannotations package * p<0.05, ** p< 0.01, *** p<0.001, **** p<0.0001. d, Divergence of DA, DI, DM and DC sets split by conserved synteny. P value annotations are the same as **c. e,** Summary divergence versus specificity (tau, 0-1 score where 1 represents gene is expressed only by one type) colored by log gene mean expression value. f, Schematic depicting evolutionary implications of paired neuropeptide and receptor gene expression divergence in sender and recipient initial classes, respectively. Neuropeptides are represented by keys and receptors are represented by locks. **g–h,** Histogram showing observed ligand–receptor correlations (blue) are more concordant in direction (g) and magnitude (h) than 10 000 shuffled pairs (orange); vertical lines indicate mean values.

Given the simplified pairing of neuropeptides and their receptors (often one-to-one gene pairs), we hypothesized these pairings could serve as a test case for the extent of coordinated gene expression during evolution. Indeed, differential expression analysis revealed ligand and receptor genes frequently evolving cooperatively, showing directional changes that were significantly more often in the same direction (p < 0.000099), and coincidentally stronger (p = 0.0023) than expected by chance (Figure 5f-h, S8).

Spatiotemporal dynamics of progenitor states and neurogenic timing

The timing of mammalian brain development varies globally across species and sequentially across regions. For example in the pallium of the mouse the vast majority of neurogenesis occurs between PCD 10.5 to 18.5 while birth is at day 21, in the macaque occurs between PCD 40 and 100 while birth is around day 166, and in the human occurs between PCD 42 and 161 while birth is around day 280.(Di Bella et al., 2021; Rakic, 2002; Vanderhaeghen and Polleux, 2023) Meanwhile the neurogenic window of different brain structures shows dramatic heterochrony, where in the mouse: the ventral midbrain spans PCD10.5-14.5; the dorsal midbrain spans PCD12.5-16.5; the ventral-derived pontine neurons and Purkinje cells spans PCD9.5-11.5; rhombic lip-derived pons, cochlear nuclei and cerebellar granule cells span PCD12.5 to postnatal stages, and strikingly, developmental neurogenesis of olfactory bulb granule neurons continues well after birth.(Gritti et al., 2002; Hirata et al., 2021) Thus, we next sought to examine the relationship between gene expression and global and stereotypical aspects of developmental timing.

We developed a Bayesian model to examine the temporal dynamics of progenitor states, modeling their abundances across developmental time with asymmetric split Gaussian distributions (Figure 6a). From these models, we calculated the "lateness", defined as the temporal center-of-mass (COM) of progenitor abundance. As expected, the temporal COM of astrocyte and OPC progenitor clusters followed that of neural progenitors across regions.

A Bayesian model of progenitor-state progression across species and regions.
a, Progenitor timing model schematic. Progenitor abundances are fitted with asymmetric split-Gaussian curves with parameters: μ, σₐ, σᵦ, Δt, Δh; right inset is a cartoon depicting that larger structures tend to be later-developing. b, UMAP of progenitor classes used for timing model, colored by initial class. Abbreviations used throughout: (HB/RE: hindbrain/rhombencephalon, MB/ME: Midbrain/Mesencephalon, DE: Diencephalon, FB: Forebrain i.e. non-pallial, non-diencephalic prosencephalon, Ctx: Cortex i.e. pallium) c, Estimated centre-of-mass (COM) lateness for distinct progenitor state across regions (points colored by species) with NPC classes on the left and Astro-OPC progenitors on the right (these are the progenitor groupings that are analogous across all regions). Boxplots show median and quartile values. Boxplot including all values can be found in Supplementary Figure 9. d, Scatter of overall mean |logFC DE versus stereotypical COM for each progenitor state; color encodes the initial class. e, Schematic summarizing the known acceleration rostral-caudal spatiotemporal neurogenic gradient. f, Relative acceleration of cortical progenitor progression fit by progenitor state model using only cortical regions along the rostral-caudal axis for human and macaque.

Among neural progenitors, we found regional differences consistent with the "later-is-larger" developmental principle: cortex, ganglionic eminence, and cerebellum had late COMs, whereas neuroepithelial states, ventral midbrain and myelencephalon showed early COMs (Figure 6c). (Finlay and Darlington, 1995) Using the ordering of progenitor states across time, we hypothesized that gene expression would become increasingly divergent later in development. We however found the contrary, a weak negative relationship between a state’s lateness and overall expression divergence (DE) (Figure 6d). Finally, in a second model which included an overall regional shift term, fitting with only cortical progenitors from prefrontal, motor, somatosensory, parietal and occipital cortex, progenitor state progression was accelerated in macaque relative to human and rostral cortex relative to caudal cortex (Figure 6e-f). This aligns with known neurogenic gradients and previous observations of accelerated primate visual cortex progenitor progression relative to the frontal lobe.(Nowakowski et al., 2017)

Genes associated with developmental timing in progenitors

We then analyzed gene expression relationships with progenitor timing across species and regions. We considered three categories of timing differences: first "global lateness" captures the absolute timing of development across regions and species (e.g. the ordering mouse medulla, mouse cortex, macaque medulla, human medulla, macaque cortex, human cortex), second "stereotypical lateness" captures the conserved progression of states (e.g. the ordering medulla, diencephalon, cortex which is stereotypical to all species), and third, "differential lateness" which estimates shifts in the lateness of a state relative to that state’s stereotypical lateness. We scored genes against these metrics by calculating the correlation of a gene’s expression in states with those states’ lateness metrics (Figure 7a, S9f). Stereotypical lateness had the most genes with strong signal, followed by global lateness, while differential lateness’ signal was much more limited (Figure 7b). We also found that the magnitude of stereotypical lateness/earliness was most related to overall DI amongst progenitor types (Figure 7c).

Gene programs linked to early versus late neurogenesis.
a, Illustration of three lateness metrics: global, stereotypical and differential. b, Histogram of gene expression correlation scores with each progenitor lateness fit values c, Heatmap showing the correlation between the magnitude of each score category with the magnitude of each differential expression parcel’s overall divergence. d, Dot-plot of top genes correlated with each lateness metric (dot size = fraction expressing; color = log counts/count). e, Enrichment networks for genes associated with stereotypical lateness (left) and global lateness (right). Terms represent the top 15 positive and negative terms with resampling pvalue less than FDR<0.05. Edge width represents proportion of genes overlapping in gene sets, node color represents the mean lateness score for the genes in associated with that term. f, Gene set enrichment analysis (using gseapy prerank) of stereotypical lateness genes shows enrichment for neuromorphological disorder gene sets; curves plot directional enrichment score against ranked gene list.

Genes related to global lateness often displayed constitutive differences between species, many explained by pan-state DA in ANTIPODE, as expected, with the highest scored genes also showing temporal progression. Among these top global lateness genes, upregulation of transcription factors TEAD2, YBX1, TP53 was associated with mouse and global earliness while SOX2 was associated with global lateness and thus human (Figure 7d). There were no significant brain size malformation-related associations with global or differential lateness, but gene ontology clearly shows dynamics in mitochondrial respiration, splicing, translation and differentiation identity genes (Figure 7e-f).

We next examined the properties of genes related to stereotypical timing across species. Genes associated with smaller, early-developing brain regions (stereotypical earliness) included known pluripotency, proliferative factors, and oncogenes (e.g., LIN28A/B, IGF2BP1/2, HMGA1/2, SNRPE/F/G/A1, SALL4), whereas late-developing regions (stereotypical lateness-associated) expressed genes previously linked to growth restriction (e.g., NFIA/B/C/X, CEBPD, NDRG2, EGR1, PURA) (Figure 7d).(Nowakowski et al., 2017; Pollen et al., 2014) Many of these genes have been linked to timing of cortical development in previous studies, and our findings extend this timing relationship to developmental sequences across regions that are shared across species.(Nishino et al., 2013; Pollen et al., 2019) To add to this point, all 15 genes associated with macroencephaly by DisGeNET including AKT1/3, PTEN, PIK3CA/B/D/G and MTOR were positively correlated with stereotypical lateness. Genes associated with microcephaly, lissencephaly and agyria were all significantly associated with lateness as well (Figure 7f). We found a similar set of genes in the cortex area-only model, with notable additions of FZD3 as a top 5 gene related to cortical earliness, while NR2F1 and noted outer radial glia marker FGFR3 are the top genes related to cortical lateness (Supplementary Table 3) (Schaberg et al., 2022).

Finally, we examined the properties of genes with shifts in timing between species across regions, relative to stereotypical timing. Perhaps consistent with continuous, mosaic evolutionary modifications of brain structures, there was minimal coherence in the functional annotations of genes associated with "differential lateness", and most of these genes are associated with regional identity (e.g. HOX genes). While we cannot exclude that an increased temporal resolution of sampling would resolve convergent properties of altered regional timing, the limited coherence in this analysis suggests that diverse genetic mechanisms drive shifts across regions and species.

Discussion

Here we built a model of transcriptomic shifts across evolution, ANTIPODE, to create a unified taxonomy of the initial classes across the developing Euarchontoglire brain. The model performed at least as well as other widely-used integration methods showing that cross-species gene expression divergence, despite presumed complexity, can be captured by an interpretable model. ANTIPODE achieved this integration as a bilinear model with interpretable parameters that correspond to intuitive categories of gene expression divergence, while performing simultaneous de novo clustering. From the three species developing brain meta atlas we constructed, ANTIPODE detected clusters that we grouped into 98 initial classes, which appear to be the precursors of most of the subclasses seen in the adult brain. In our data, which is enriched for progenitors and newborn neurons and depleted for mature neurons due to whole cell dissociation, we did not observe comparable neuronal diversity in any developing region to the adult brain, strongly suggesting that the outstanding diversity of neuronal types arises post-mitotically across all brain regions as neurons settle in terminal niches and crystallize into stable adult types.(Cheng et al., 2022; Fishell and Kepecs, 2020; Kim et al., 2020)

Using ANTIPODE as a holistic differential expression method, despite the profound conservation of cell states across species, we uncovered several patterns of gene expression evolution across species. We found that the largest share of gene expression divergence could be attributed to DA, changes across all/many types, while the other categories of differential expression were complex and entangled. Additionally because of the prevalence of DA, DM and DC changes which represent gene expression shifts across multiple types, we stress the importance of holistic differential expression accounting across many cell types and regions, lest one assign shifts in expression across many cell types as a cell type-specific change.

The prevalence of differential expression, with more than two-thirds of genes having a fold change of 2 or greater for a given type between any two of our three species, also suggests that the hunt for consequential transcriptomic changes across species underlying particular physiological traits may be “searching for a needle in a needlestack” where many changes push a quantitative trait in opposing directions to reach a new equilibrium. We acknowledge that post-transcriptional regulation may buffer many of these changes such that they have small effects on the proteome. Amidst such a haystack of gene expression divergence, however, there must also be cases of oppositional changes in expression by genes in the same pathway to compensate, and still other changes which are incompletely opposed and contribute to the observed morphological and functional differences across species’ brains.(True and Haag, 2001) Indeed, in the limited case of neuropeptide and receptor evolution, even though we could not filter our sender and receiver types by peptide-connectomics, we saw a significant coincidence of both ligands and receptors shifting together and in the same direction.

Finally, with a vantage point of development in three species with highly different developmental rates, we also examined the evolution of progenitor state composition through neurogenesis. Using a simplified model of progenitor state composition over time, we estimated the stereotypical ordering of progenitor states across regions and species, and how the different species differ in the ordering and occupancy of states through development. As it has been shown that in general gene expression becomes more divergent across species as cells in the developing embryo mature (Cardoso-Moreira et al., 2019), it has also been assumed that stem cells with higher potency are more constrained due to the potential of pleiotropic effects propagating in all descendant lineages. Counter to this expectation, while expression divergence increased in many post-mitotic states, it was essentially constant (slightly decreasing) as progenitors progressed from higher (neuroepithelium and NPC) to lower potency states (glial progenitors). This suggests that the constraint on gene expression may not be strictly due to the pleiotropy of multiple descendant cell types, and future perturbation studies will be able to answer how gene expression changes in progenitors of various states and potency affect the overall structure and function of the brain.

We identified genes with expression that correlates with progenitor lateness. Because developmental lateness and structure size are closely linked, these genes likely also include regulators of mosaic regional expansion.(Finlay and Darlington, 1995; Herculano-Houzel, 2012) We document a surprising, but not unprecedented mix of genes associated with lateness, with many genes expected to drive both expansion and attenuation of growth in NPCs. Here we see the power of a global view of brain development, where we saw the upregulation of PI3K and mTOR pathways reported in the context of cortical expansion as key levers for progenitor proliferation across many regions.(Andrews et al., 2020; Nowakowski et al., 2017; Pollen et al., 2019)

While the present dataset captures the majority of canonical neurogenesis across the three species, it has relatively low temporal resolution in primates and lacks coverage of late-stage gliogenesis, which extends significantly beyond birth. Thus, estimates of late glial progenitor dynamics are likely incomplete, and could be greatly improved as larger single nuclei datasets spanning more species, timepoints and including molecular recording technologies like neuronal birthdating or lineage barcoding are created and collated.(Bandler et al., 2022) Study of diverse vertebrates with dramatic regional expansions like the primate cortex, the cyprinid vagal lobe, the mormyrid cerebellar valvula, or various other instances of mosaic regional expansion in vertebrate lineages will give better opportunity to elucidate the degree to which independent expansions converge on similar pathways or whether it is idiosyncratic.(Striedter and Northcutt, 2019) Soon, it will likely be possible to look with much higher resolution to determine subtle shifts in developmental timing and to conclusively determine genes expression signatures that might be convergently responsible for shifts toward earliness or lateness across species outside of those already associated with global/stereotypical lateness. ANTIPODE gives us a window into this point, as genes more strongly related to stereotypical earliness/lateness tended to be much more DI, lending support to the idea that species may control allometric shifts by mosaic changes in gene regulation.

ANTIPODE represents a framework for modeling gene-expression divergence across species. The current implementation returns maximum a posteriori parameter estimates rather than full posteriors, so we cannot yet provide credible intervals or formal significance for individual parameters. While the architecture admits phylogenetic structure, our attempts to include ancestral nodes led to clustering collapse, indicating the need for more stable variational families or priors. In addition, the structure of the model makes it difficult to directly control clustering resolution and rare cell types in the adult brain. Finally, the bilinear decoding that makes parameter interpretation straightforward limits applicability to strongly nonlinear integration tasks (e.g., single-cell with single-nucleus or spatial); constrained nonlinear corrections are a plausible extension.

ANTIPODE brings integration, taxonomy, and differential expression into a single framework, converting discrete steps that often undermine one another into a coherent whole in which the latent space is anchored in cell-state structure and integration is accomplished by modeling differential expression. Applied to human, macaque, and mouse development, it reveals broad conservation of initial classes alongside pervasive, context-dependent divergence shaped by genomic architecture, lineage, and developmental timing. We identified coordinated evolution in ligand–receptor pairs and link progenitor state to developmental rates. In this work we have sought to zoom out and view species- or region-specific transcriptomic landscapes as parts of a larger global developing brain, and we foresee that biologically inspired models like ANTIPODE can shift analytical focus to a more universal view of the developing vertebrate brain.

Methods

Generation of scRNA-seq data

Samples

The Primate Center at the University of California, Davis, provided nine specimens of cortical tissue from PCD40, PCD50, PCD65 (n = 3), PCD80 (n = 2), PCD90 and PCD100 macaques. All animal procedures conformed to the requirements of the Animal Welfare Act, and protocols were approved before implementation by the Institutional Animal Care and Use Committee at the University of California, Davis. In total, we analysed single-cell transcriptomes from 654637 cells from developing macaque (this study plus (Zhu et al., 2018)), 473517 cells from developing mouse(Di Bella et al., 2021; Kim et al., 2020; La Manno et al., 2020; Loo et al., 2019; Mayer et al., 2018) and 726613 cells from developing human (this study plus (Bhaduri et al., 2021; Eze et al., 2021; Jessa et al., 2019; Zhong et al., 2023, 2020; Zhou et al., 2022)). De-identified human tissue samples were collected with previous patient consent in strict observance of legal and institutional ethical regulations in accordance with the Declaration of Helsinki. Protocols were approved by the Human Gamete, Embryo, and Stem Cell Research Committee and the Committee on Human Research (institutional review board) at the University of California, San Francisco.

Single-cell RNA sequencing tissue processing

For the PCD40 to PCD100 macaques, dissections were performed in PBS under a stereo dissection microscope (Olympus SZ61). Tissue was dissociated and prepared for droplet partitioning as in (Schmitz et al., 2022). Single-cell RNA sequencing was completed using the 10x Genomics Chromium controller and version 2 or 3 3-prime RNA capture kits. Most samples were loaded at approximately 10,000 cells per well; up to 25,000 cells were loaded per lane for a small number of multiplexed samples. Transcriptome library preparation was completed using the associated 10x Genomics RNA library preparation kit. Multiseq barcode library preparation was completed as described in McGinnis et al.46. Following library preparation, libraries were sequenced on Illumina HiSeq and NovaSeq platforms.

Alignments and gene models

Fastq files were generated from Illumina BCL files using bcl2fastq2. Genes were quantified using Kallisto release 0.46 using each species respective reference: the human hg38 genome assembly and gencode version 33 transcript annotation, the RheMac10 genome assembly, annotated using the comparative annotation toolkit based on the human annotation48, and the transcript annotations of Mus musculus ENSEMBL release 100.(Bray et al., 2016) A custom Kallisto reference for each species was created for the quantification of exons and introns together, in which introns were defined as the complement of exonic and intergenic space. The Kallisto index used k-mers of length 31. Public data were downloaded as raw fastq files or as BAM files that were converted back to fastq files. All data were processed from raw reads using the same Kallisto pipeline to minimize annotation and alignment artefacts. 16738 1:1:1 ortholog genes were identified using the MGI human-mouse ortholog table, and this geneset was used for analysis.

Quality control

Kallisto–Bus output matrix files (including both introns and exons) were input to Cellbender (release 0.2.0; https://github.com/broadinstitute/CellBender), which was used to remove probable ambient RNA only.(Fleming et al., 2023) Only droplets with a greater than 0.99 probability of being cells (not empty droplets), as calculated by the Cellbender model, were included in further analysis. Only spliced read UMIs were used thenceforth. Droplets with fewer than 800 genes detected, or greater than 40% ribosomal or 15% mitochondrial reads, were filtered from the dataset. Finally doublets were detected using solo and cells with a doublet probability greater than 0.4 were excluded. Non-brain cell types (neural crest, placode, etc.), undetected doublet clusters and low quality cells were removed from the dataset by author judgement.

Processing of data

Raw fastq files were obtained from the NCBI Sequence Read Archive or directly from authors. Transcriptomic profiles were quantified using kallisto 0.46.0, with references built using gencode v33, comparative annotation toolkit rheMac10 (based on human gencode v33) and mm10 for human, macaque and mouse respectively. Exonic and intronic counts were then passed to cellbender using only the remove ambient model with parameters.

The ANTIPODE model is described in supplement 1. The data analysis herein uses the model trained with 600 starting clusters, 100 latent dimensions, a Laplace dispersion prior on posterior parameter estimates of 500 an encoder with layers of size [#genes, 6000,3000,100], and a classifier with layers of size [100,3000,3000,600].

Calculation of gene expression estimates

Direct calculation of gene expression means in clusters was formulated as the sum of counts for a gene across the cells assigned to a taxon divided by the total UMI counts, where a Laplace pseudocount is defined as [inline] is added. The division by 2 stems from the naive estimate of the number of additional counts required to observe a count for a gene for which no counts have been observed following the memorylessness property of the geometric distribution. ANTIPODE parameter-based calculation of gene expression of means [inline]. Marginal gene expression differences attributed to DA, DM, DI and DC are calculated as [inline], where [inline] is the mean calculation without the respective parameter. L represents cluster gaussian samples in latent space, DM_c,s is the differential-by-module with respect to cluster and species, W is the matrix of latent space components to genes weights, DC_s represents the differential-by-coexpression term with respect to species, DI_c,s represents the differential-by-identity intercept with respect to species and cluster and DA_s represents the differential-by-all intercept with respect to species only. In the model fitting, terms for batch effects by module, batch effects by identity, and secondary covariate by module effects are added as well (see code and Supplement 1).

Lateness

Regional progenitor taxa were defined as antipode clusters split by the simplified region (pallium, ventral forebrain (FB), diencephalon (DE), mesencephalon (ME) and rhombencephalon) for the neuroepithelial NPC initial class or simply antipode clusters for all other initial classes, as the NPC states tended to be similar enough to be undifferentiated by the model across regions. Regional progenitor taxon proportions (RPTP) were calculated as the number of cells from a droplet based sequencing batch divided by the total number of progenitors and endogenous glia (progenitors and glia excluding microglia). Raw lateness values for comparison were calculated as the center of mass of RPTP of the curve RPTP = f(Timepoint), calculated by the trapezoid sum. The lateness Bayesian models for lateness COM estimates in the whole brain and cortical areas are pyro models, the code for which can be found in the project github repository under analysis 5-2.

Neuropeptide ligand-receptor interactions

We calculated the interaction score following the method used by NATMI (Hou et al., 2020), where the score for each ligand is equal to the mean of the ligand expression multiplied by each potential receptor’s expression. Expression values used were the true counts/count value in each initial class normalized by the maximum expression of that gene in any initial class. The concordance of receptor and ligand expression divergence was calculated by the, compared to the null distribution of this value calculated from permuted receptors and ligand values across initial classes.

Other bioinformatic analysis

Disease enrichments were performed using the gseapy package’s preranked gene list test with FDR q values calculated with at least 1000 permutations. Gene ontology enrichments vs lateness score means were computed using permutation tests. The whole dataset UMAP layout was chosen from 100 random seeds of the cuML accelerated UMAP algorithm, with the displayed layout chosen based on the amount of space surrounding the DMR neuron classes for labeling. Mapping of adult subclasses to developing initial classes was based on the maximum Pearson correlation of counts per count values scaled from 0-1 per gene, using the list of cell type identity TFs provided in Yao et al.(Yao et al., 2023).

Large language model statement

LLMs were used to proofread and point out areas for textual clarity improvement, however none of the manuscript text was generated by an LLM. The ANTIPODE model was written without aid from an LLM, however later iterations and code organization and documentation was aided by ChatGPT. Some code to perform analysis for figures was written or reorganized by LLM.

Supplementary Figures

Dataset composition, batch structure and basic QC.
**a–c,** ANTIPODE UMAP colored by post-conception day (PCD) for human (a), macaque (b) and mouse (c) reveal continuous developmental trajectories that align across species after integration. d, UMAP colored by the 15 10X scRNA-seq datasets showing that biological structure, not dataset of origin, dominates the embedding. e, The same UMAP colored by individual library-preparation batches. f, Stacked-bar plot of the 1,854,767 high-quality cells grouped by species, structure (colors) and PCD demonstrates broad temporal and anatomical coverage. **g–i,** Violin plots of log10 UMI counts per cell across datasets for human **(g)**, macaque **(h)** and mouse **(i)**.

Goodness-of-fit for ANTIPODE gene-specific parameters.
a, Reconstructed (posterior) mean log-expression (log counts/count) for every gene-cluster pair versus its empirical mean (black points; red density contours). b, Empirical zero probability (p(count = 0)) versus empirical log-mean expression suggest a small degree of zero-inflation (left-right shift of the curve). c, Full heat-map summary of the 98 initial classes from figure 2. For each class (rows) we display (left-to-right): regional abundance across broad areas, total cell count, Spearman correlation of gene expression across pairs of species, mean absolute inter-species log fold change (“ANTIPODE divergence”), membership in 200 ANTIPODE gene modules. The bottom section shows divergence by DC and DM. Gene set enrichments are colored by gseapy prerank normalized enrichment scores of each ANTIPODE component, with color alpha equal to 1 – significance q value. Row and column dendrogram shows Kendall correlation hierarchical clustering by ANTIPODE latent space.

Conserved marker expression across initial classes.
Dot plots show mean normalised expression (dot size) and mean inter-species log fold change (color scale; blue = human-up, orange = mouse-up, green = macaque-up) for representative markers of a brain-endogenous non-neuronal divisions, b brain-adjacent initial classes a.k.a neighbors c neuroepithelial / neural progenitor cells (NPC), d brain-exogenous non-neurons a.k.a Exo NN, e telencephalic (TE) neurons and f diencephalic/mesencephalic/rhombencephalic (DMR) neurons.

Mapping developing clusters to adult mouse subclasses.
a, UMAP of developmental clusters colored by their most-correlated adult subclass from Yao et al. 2023. b, Heat-map of Pearson correlations (using Yao identity TF set) between 380 developmental clusters (rows) and 337 adult subclasses (columns). c, Distribution of correlation coefficients for maximal matches (orange) versus all other comparisons (blue). d, Jaccard index of gene-marker overlap for the same developing cluster–adult subclass pairs (because matches are called by cluster all jaccard indices are 0 or 1).

Shared transcription-factor programs between development and adulthood.
a, Heat-map of binary transcription-factor (TF) expression (presence > 0.1 fraction of cells) across 337 adult cortical subclasses (rows); clustered blocks highlight subclass-specific TF sets. b, Equivalent heat-map to a for the 380 developmental clusters. c, Histogram of TF “bimodality” scores (1 = all-or-none expression) shows higher discreteness in adult (orange) than in development (blue) cell types (dashed lines: medians). d, Four example subclass/cluster pairs illustrating TF sharing: each scatter plots mean TF expression in the developmental state (x-axis) versus its matched adult subclass (y-axis); dashed lines mark the 0.33 threshold used to call shared TFs.

Qualitative comparison of integration methods on three benchmark datasets.
UMAPs colored by species (left of each pair) or cell identity (right) for methods used for analysis: ANTIPODE, scVI, Harmony, Scanorama and PCA. a, Adult cross-areal (XA) + cross-species (XS) primate cortical dataset integrations. b, Adult multi-species mammalian retina. c, Developmental whole-brain atlas used in this study.

Additional evolutionary gene expression divergence.
a, Spearman correlation across all species pairs for 380 developmental clusters (named by initial class + cluster). b, Mean correlation across species pairs shown in (a) by each cluster, plotted in UMAP space c, ECDF plots of raw divergence category parameters. d, Scatterplot of the correlation of DM, DC, and DI divergence categories pairwise. Note the consistently high correlation of DC and DM as these are multiplied together in the model (point color). e, Boxplot showing the number of genes with mean expression in clusters greater than 20 counts per million (cpm), grouped by division. f, Boxplot showing the effect of each divergence category for each cluster, grouped by division. Note that divisions that contain many clusters are biased towards divergence being categorized as DM+DC, while those consisting of fewer clusters seem somewhat biased towards DI.

Neuropeptide ligand–receptor expression landscape across progenitor and neuronal states.
a, Tile-map of neuropeptide-receptor interaction scores (multiplied 0-1 scaled expression). Ligands in neurons (rows) vs receptors in non-neurons (columns). Colored by ligand in main (central heatmap). Along left and bottom shows raw 0-1 expression of ligands/receptors in each species, and tricolor species divergence tile-map. b, Tile-map showing species divergence, with 0-1 scaled tri-species means colored according triangular legend.

Additional analyses of the progenitor timing model.
a, Model-free trapezoid sum-based COM calculations based only on raw data for species and regions (colors). b, Heat-map of annotated finer regional dissection vs de novo inferred regions. c, Heat-map of annotated broad regional dissection vs de novo inferred regions. d, Species-level time-warp parameters (scale α, shift Δ) estimated from the model; mouse shows a pronounced left-shift (earlier development). e, Global comparison of σₐ versus σᵦ for all progenitor states. **f, g,** Heat-map of ANTIPODE cluster (f) or batch_name (g) vs de novo inferred regions **h, i,** Box plot of fit COM values for each progenitor region + cluster, colored by species in arbitrary warped time (i) or real time (j) and ordered by mean time across species j, Stacked barplots showing proportion of total cells included of each progenitor state at each timepoint, for each species, grouped by simplified regions.

Data availability

The sequencing data have been deposited in the Gene Expression Omnibus under accession number GSE306257; the data are browsable at dev-whole-brain-hqm.cells.ucsc.edu. Scripts and annotation files for the study have been deposited on github at https://github.com/mtvector/antipode_manuscript and the ANTIPODE package can be found at https://github.com/mtvector/scANTIPODE.

Acknowledgements

We thank Alice Tarantal for providing samples, Min Cheol Kim for discussions related to the ANTIPODE model, and Anna Wright for editing assistance.

We acknowledge the following funding sources: Ruth L. Kirschstein National Research Service Predoctoral Fellowship Award F31 F31NS124333 (M.T.S.), DP2MH122400-01 (A.A.P.), U01MH114825 and UM1MH130981 (A.A.P., T.J.N.), R01AI136972 (C.J.Y.), R01MH134981 (A.A.P.) the Chan Zuckerberg Biohub (C.J.Y., T.J.N., A.A.P.), National Institutes of Health DP2MH122400-01, Schmidt Futures Foundation, Shurl and Kay Curci Foundation Innovative, W.M. Keck Foundation, and William K. Bowes Jr. Foundation. A.A.P. and T.J.N. are New York Stem Cell Foundation Robertson Investigators and members of the UCSF Kavli Institute for Fundamental Neuroscience.

Additional files

Supplementary Table 1. Metadata for each sample in the meta-atlas, including the species, closest region annotation, developmental time point, individual and sequencing quality-control metrics.

Supplementary Table 2. Dictionary of initial classes. Qualitative definitions of classes explored in the atlas with extended explanations for inferences about initial–terminal class relationships.

Supplementary Table 3. Gene scores for the various lateness metrics.

Supplementary Document 1. Specification and further discussion of the ANTIPODE generative model.

Additional information

Funding

HHS | National Institutes of Health (NIH) (F31NS124333)

Matthew T Schmitz

HHS | National Institutes of Health (NIH) (DP2MH122400)

Alex Pollen

HHS | National Institutes of Health (NIH) (U01MH114825)

Alex Pollen
Tomasz Nowakowski

HHS | National Institutes of Health (NIH) (UM1MH130981)

Alex Pollen
Tomasz Nowakowski

HHS | National Institutes of Health (NIH) (R01AI136972)

Chun J Ye

HHS | National Institutes of Health (NIH) (R01MH134981)

Alex Pollen

References

1. Andrews M.G.
2. Subramanian L.
3. Kriegstein A.R
2020mTOR signaling regulates the morphology and migration of outer radial glia in developing human cortexeLife 9:e58737https://doi.org/10.7554/eLife.58737 Google Scholar
1. Bakken T.E.
2. Jorstad N.L.
3. Hu Q.
4. Lake B.B.
5. Tian W.
6. Kalmbach B.E.
7. Crow M.
8. Hodge R.D.
9. Krienen F.M.
10. Sorensen S.A.
11. Eggermont J.
12. Yao Z.
13. Aevermann B.D.
14. Aldridge A.I.
15. Bartlett A.
16. Bertagnolli D.
17. Casper T.
18. Castanon R.G.
19. Crichton K.
20. Daigle T.L.
21. Dalley R.
22. Dee N.
23. Dembrow N.
24. Diep D.
25. Ding S.-L.
26. Dong W.
27. Fang R.
28. Fischer S.
29. Goldman M.
30. Goldy J.
31. Graybuck L.T.
32. Herb B.R.
33. Hou X.
34. Kancherla J.
35. Kroll M.
36. Lathia K.
37. van Lew B.
38. Li Y.E.
39. Liu C.S.
40. Liu H.
41. Lucero J.D.
42. Mahurkar A.
43. McMillen D.
44. Miller J.A.
45. Moussa M.
46. Nery J.R.
47. Nicovich P.R.
48. Niu S.-Y.
49. Orvis J.
50. Osteen J.K.
51. Owen S.
52. Palmer C.R.
53. Pham T.
54. Plongthongkum N.
55. Poirion O.
56. Reed N.M.
57. Rimorin C.
58. Rivkin A.
59. Romanow W.J.
60. Sedeño-Cortés A.E.
61. Siletti K.
62. Somasundaram S.
63. Sulc J.
64. Tieu M.
65. Torkelson A.
66. Tung H.
67. Wang X.
68. Xie F.
69. Yanny A.M.
70. Zhang R.
71. Ament S.A.
72. Behrens M.M.
73. Bravo H.C.
74. Chun J.
75. Dobin A.
76. Gillis J.
77. Hertzano R.
78. Hof P.R.
79. Höllt T.
80. Horwitz G.D.
81. Keene C.D.
82. Kharchenko P.V.
83. Ko A.L.
84. Lelieveldt B.P.
85. Luo C.
86. Mukamel E.A.
87. Pinto-Duarte A.
88. Preissl S.
89. Regev A.
90. Ren B.
91. Scheuermann R.H.
92. Smith K.
93. Spain W.J.
94. White O.R.
95. Koch C.
96. Hawrylycz M.
97. Tasic B.
98. Macosko E.Z.
99. McCarroll S.A.
100. Ting J.T.
101. Zeng H.
102. Zhang K.
103. Feng G.
104. Ecker J.R.
105. Linnarsson S.
106. Lein E.S
2021Comparative cellular analysis of motor cortex in human, marmoset and mouseNature 598:111–119https://doi.org/10.1038/s41586-021-03465-8 Google Scholar
1. Bandler R.C.
2. Vitali I.
3. Delgado R.N.
4. Ho M.C.
5. Dvoretskova E.
6. Ibarra Molinas J.S.
7. Frazel P.W.
8. Mohammadkhani M.
9. Machold R.
10. Maedler S.
11. Liddelow S.A.
12. Nowakowski T.J.
13. Fishell G.
14. Mayer C
2022Single-cell delineation of lineage and genetic identity in the mouse brainNature 601:404–409https://doi.org/10.1038/s41586-021-04237-0 Google Scholar
1. Bhaduri A.
2. Sandoval-Espinosa C.
3. Otero-Garcia M.
4. Oh I.
5. Yin R.
6. Eze U.C.
7. Nowakowski T.J.
8. Kriegstein A.R
2021An atlas of cortical arealization identifies dynamic molecular signaturesNature 598:200–204https://doi.org/10.1038/s41586-021-03910-8 Google Scholar
1. Bingham E.
2. Chen J.P.
3. Jankowiak M.
4. Obermeyer F.
5. Pradhan N.
6. Karaletsos T.
7. Singh R.
8. Szerlip P.
9. Horsfall P.
10. Goodman N.D
2019Pyro: Deep Universal Probabilistic ProgrammingJ. Mach. Learn. Res 20:1–6Google Scholar
1. Bray N.L.
2. Pimentel H.
3. Melsted P.
4. Pachter L
2016Near-optimal probabilistic RNA-seq quantificationNat. Biotechnol 34:525–527https://doi.org/10.1038/nbt.3519 Google Scholar
1. Bulfone A.
2. Puelles L.
3. Porteus M.
4. Frohman M.
5. Martin G.
6. Rubenstein J
1993Spatially restricted expression of Dlx-1, Dlx-2 (Tes-1), Gbx-2, and Wnt-3 in the embryonic day 12.5 mouse forebrain defines potential transverse and longitudinal segmental boundariesJ. Neurosci 13:3155–3172https://doi.org/10.1523/JNEUROSCI.13-07-03155.1993 Google Scholar
1. Cadwell C.R.
2. Bhaduri A.
3. Mostajo-Radji M.A.
4. Keefe M.G.
5. Nowakowski T.J
2019Development and Arealization of the Cerebral CortexNeuron 103:980–1004https://doi.org/10.1016/j.neuron.2019.07.009 Google Scholar
1. Cardoso-Moreira M.
2. Halbert J.
3. Valloton D.
4. Velten B.
5. Chen C.
6. Shao Y.
7. Liechti A.
8. Ascenção K.
9. Rummel C.
10. Ovchinnikova S.
11. Mazin P.V.
12. Xenarios I.
13. Harshman K.
14. Mort M.
15. Cooper D.N.
16. Sandi C.
17. Soares M.J.
18. Ferreira P.G.
19. Afonso S.
20. Carneiro M.
21. Turner J.M.A.
22. VandeBerg J.L.
23. Fallahshahroudi A.
24. Jensen P.
25. Behr R.
26. Lisgo S.
27. Lindsay S.
28. Khaitovich P.
29. Huber W.
30. Baker J.
31. Anders S.
32. Zhang Y.E.
33. Kaessmann H
2019Gene expression across mammalian organ developmentNature 1https://doi.org/10.1038/s41586-019-1338-5 Google Scholar
1. Charvet C.J.
2. Striedter G.F.
3. Finlay B.L
2011Evo-Devo and Brain Scaling: Candidate Developmental Mechanisms for Variation and Constancy in Vertebrate Brain EvolutionBrain. Behav. Evol 78:248–257https://doi.org/10.1159/000329851 Google Scholar
1. Cheng S.
2. Butrus S.
3. Tan L.
4. Xu R.
5. Sagireddy S.
6. Trachtenberg J.T.
7. Shekhar K.
8. Zipursky S.L
2022Vision-dependent specification of cell types and function in the developing cortexCell 185:311–327https://doi.org/10.1016/j.cell.2021.12.022 Google Scholar
1. Colonna M.
2. Konopka G.
3. Liddelow S.A.
4. Nowakowski T.
5. Awatramani R.
6. Bateup H.S.
7. Cadwell C.R.
8. Caglayan E.
9. Chen J.L.
10. Gillis J.
11. Kampmann M.
12. Krienen F.
13. Marsh S.E.
14. Monje M.
15. O’Dea M.R.
16. Patani R.
17. Pollen A.A.
18. Quintana F.J.
19. Scavuzzo M.
20. Schmitz M.
21. Sloan S.A.
22. Tesar P.J.
23. Tollkuhn J.
24. Tosches M.A.
25. Urbanek M.E.
26. Werner J.M.
27. Bayraktar O.A.
28. Gokce O.
29. Habib N
2024Implementation and validation of single-cell genomics experiments in neuroscienceNat. Neurosci 27:2310–2325https://doi.org/10.1038/s41593-024-01814-0 Google Scholar
1. Corrigan E.K.
2. DeBerardine M.
3. Poddar A.
4. Turrero García M.
5. Schmitz M.T.
6. Harwell C.C.
7. Paredes M.F.
8. Krienen F.M.
9. Pollen A.A
2024Conservation, alteration, and redistribution of mammalian striatal interneuronsbioRxiv https://doi.org/10.1101/2024.07.29.605664 Google Scholar
1. Di Bella D.J.
2. Habibi E.
3. Stickels R.R.
4. Scalia G.
5. Brown J.
6. Yadollahpour P.
7. Yang S.M.
8. Abbate C.
9. Biancalani T.
10. Macosko E.Z.
11. Chen F.
12. Regev A.
13. Arlotta P.
2021Molecular logic of cellular diversification in the mouse cerebral cortexNature 595:554–559https://doi.org/10.1038/s41586-021-03670-5 Google Scholar
1. Eze U.C.
2. Bhaduri A.
3. Haeussler M.
4. Nowakowski T.J.
5. Kriegstein A.R
2021Single-cell atlas of early human brain development highlights heterogeneity of human neuroepithelial cells and early radial gliaNat. Neurosci 24:584–594https://doi.org/10.1038/s41593-020-00794-1 Google Scholar
1. Finlay B.L.
2. Darlington R.B
1995Linked regularities in the development and evolution of mammalian brainsScience 268:1578–1584https://doi.org/10.1126/science.7777856 Google Scholar
1. Fishell G.
2. Kepecs A.
2020Interneuron Types as Attractors and ControllersAnnu. Rev. Neurosci 43https://doi.org/10.1146/annurev-neuro-070918-050421 Google Scholar
1. Fleming S.J.
2. Chaffin M.D.
3. Arduini A.
4. Akkad A.-D.
5. Banks E.
6. Marioni J.C.
7. Philippakis A.A.
8. Ellinor P.T.
9. Babadi M
2023Unsupervised removal of systematic background noise from droplet-based single-cell experiments using CellBenderNat. Methods 20:1323–1335https://doi.org/10.1038/s41592-023-01943-7 Google Scholar
1. Glenn Northcutt R.
2. Kaas J.H.
1995The emergence and evolution of mammalian neocortexTrends Neurosci 18:373–379https://doi.org/10.1016/0166-2236(95)93932-N Google Scholar
1. Gritti A.
2. Bonfanti L.
3. Doetsch F.
4. Caille I.
5. Alvarez-Buylla A.
6. Lim D.A.
7. Galli R.
8. Verdugo J.M.G.
9. Herrera D.G.
10. Vescovi A.L
2002Multipotent Neural Stem Cells Reside into the Rostral Extension and Olfactory Bulb of Adult RodentsJ. Neurosci 22:437–445https://doi.org/10.1523/JNEUROSCI.22-02-00437.2002 Google Scholar
1. Haghverdi L.
2. Lun A.T.L.
3. Morgan M.D.
4. Marioni J.C
2018Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighborsNat. Biotechnol 36:421–427https://doi.org/10.1038/nbt.4091 Google Scholar
1. Hahn J.
2. Monavarfeshani A.
3. Qiao M.
4. Kao A.H.
5. Kölsch Y.
6. Kumar A.
7. Kunze V.P.
8. Rasys A.M.
9. Richardson R.
10. Wekselblatt J.B.
11. Baier H.
12. Lucas R.J.
13. Li W.
14. Meister M.
15. Trachtenberg J.T.
16. Yan W.
17. Peng Y.-R.
18. Sanes J.R.
19. Shekhar K
2023Evolution of neuronal cell classes and types in the vertebrate retinaNature 624:415–424https://doi.org/10.1038/s41586-023-06638-9 Google Scholar
1. Harris B.D.
2. Crow M.
3. Fischer S.
4. Gillis J
2021Single-cell co-expression analysis reveals that transcriptional modules are shared across cell types in the brainCell Syst 12:748–756https://doi.org/10.1016/j.cels.2021.04.010 Google Scholar
1. Herculano-Houzel S
2012The remarkable, yet not extraordinary, human brain as a scaled-up primate brain and its associated costProc. Natl. Acad. Sci 109:10661–10668https://doi.org/10.1073/pnas.1201895109 Google Scholar
1. Hie B.L.
2. Kim S.
3. Rando T.A.
4. Bryson B.
5. Berger B
2024Scanorama: integrating large and diverse single-cell transcriptomic datasetsNat. Protoc 19:2283–2297https://doi.org/10.1038/s41596-024-00991-3 Google Scholar
1. Hirata T.
2. Tohsato Y.
3. Itoga H.
4. Shioi G.
5. Kiyonari H.
6. Oka S.
7. Fujimori T.
8. Onami S
2021NeuroGT: A brain atlas of neurogenic tagging CreER drivers for birthdate-based classification and manipulation of mouse neuronsCell Rep Methods 1:100012https://doi.org/10.1016/j.crmeth.2021.100012 Google Scholar
1. Hou R.
2. Denisenko E.
3. Ong H.T.
4. Ramilowski J.A.
5. Forrest A.R.R
2020Predicting cell-to-cell communication networks using NATMINat. Commun 11:5011https://doi.org/10.1038/s41467-020-18873-z Google Scholar
1. Javed A.
2. Gomez L.
3. Pravata V.
4. Lo Giudice Q.
5. Sarhadi M.
6. Cappello S.
7. Klingler E.
8. Jabaudon D
2025Developmental gene expression patterns driving species-specific cortical featuresbioRxiv https://doi.org/10.1101/2025.02.18.638637 Google Scholar
1. Jessa S.
2. Blanchet-Cohen A.
3. Krug B.
4. Vladoiu M.
5. Coutelier M.
6. Faury D.
7. Poreau B.
8. De Jay N.
9. Hébert S.
10. Monlong J.
11. Farmer W.T.
12. Donovan L.K.
13. Hu Y.
14. McConechy M.K.
15. Cavalli F.M.G.
16. Mikael L.G.
17. Ellezam B.
18. Richer M.
19. Allaire A.
20. Weil A.G.
21. Atkinson J.
22. Farmer J.-P.
23. Dudley R.W.R.
24. Larouche V.
25. Crevier L.
26. Albrecht S.
27. Filbin M.G.
28. Sartelet H.
29. Lutz P.-E.
30. Nagy C.
31. Turecki G.
32. Costantino S.
33. Dirks P.B.
34. Murai K.K.
35. Bourque G.
36. Ragoussis J.
37. Garzia L.
38. Taylor M.D.
39. Jabado N.
40. Kleinman C.L.
2019Stalled developmental programs at the root of pediatric brain tumorsNat. Genet 51:1702–1713https://doi.org/10.1038/s41588-019-0531-7 Google Scholar
1. Johansen N.
2. Quon G
2019scAlign: a tool for alignment, integration, and rare cell identification from scRNA-seq dataGenome Biol 20:166https://doi.org/10.1186/s13059-019-1766-4 Google Scholar
1. Jorstad N.L.
2. Close J.
3. Johansen N.
4. Yanny A.M.
5. Barkan E.R.
6. Travaglini K.J.
7. Bertagnolli D.
8. Campos J.
9. Casper T.
10. Crichton K.
11. Dee N.
12. Ding S.-L.
13. Gelfand E.
14. Goldy J.
15. Hirschstein D.
16. Kiick K.
17. Kroll M.
18. Kunst M.
19. Lathia K.
20. Long B.
21. Martin N.
22. McMillen D.
23. Pham T.
24. Rimorin C.
25. Ruiz A.
26. Shapovalova N.
27. Shehata S.
28. Siletti K.
29. Somasundaram S.
30. Sulc J.
31. Tieu M.
32. Torkelson A.
33. Tung H.
34. Callaway E.M.
35. Hof P.R.
36. Keene C.D.
37. Levi B.P.
38. Linnarsson S.
39. Mitra P.P.
40. Smith K.
41. Hodge R.D.
42. Bakken T.E.
43. Lein E.S
2023aTranscriptomic cytoarchitecture reveals principles of human neocortex organizationScience 382:eadf6812https://doi.org/10.1126/science.adf6812 Google Scholar
1. Jorstad N.L.
2. Song J.H.T.
3. Exposito-Alonso D.
4. Suresh H.
5. Castro-Pacheco N.
6. Krienen F.M.
7. Yanny A.M.
8. Close J.
9. Gelfand E.
10. Long B.
11. Seeman S.C.
12. Travaglini K.J.
13. Basu S.
14. Beaudin M.
15. Bertagnolli D.
16. Crow M.
17. Ding S.-L.
18. Eggermont J.
19. Glandon A.
20. Goldy J.
21. Kiick K.
22. Kroes T.
23. McMillen D.
24. Pham T.
25. Rimorin C.
26. Siletti K.
27. Somasundaram S.
28. Tieu M.
29. Torkelson A.
30. Feng G.
31. Hopkins W.D.
32. Höllt T.
33. Keene C.D.
34. Linnarsson S.
35. McCarroll S.A.
36. Lelieveldt B.P.
37. Sherwood C.C.
38. Smith K.
39. Walsh C.A.
40. Dobin A.
41. Gillis J.
42. Lein E.S.
43. Hodge R.D.
44. Bakken T.E
2023bComparative transcriptomics reveals human-specific cortical featuresScience 382:eade9516https://doi.org/10.1126/science.ade9516 Google Scholar
1. Kim D.W.
2. Washington P.W.
3. Wang Z.Q.
4. Lin S.H.
5. Sun C.
6. Ismail B.T.
7. Wang H.
8. Jiang L.
9. Blackshaw S
2020The cellular and molecular landscape of hypothalamic patterning and differentiation from embryonic to late postnatal developmentNat. Commun 11:4360https://doi.org/10.1038/s41467-020-18231-z Google Scholar
1. Korsunsky I.
2. Millard N.
3. Fan J.
4. Slowikowski K.
5. Zhang F.
6. Wei K.
7. Baglaenko Y.
8. Brenner M.
9. Loh P.
10. Raychaudhuri S
2019Fast, sensitive and accurate integration of single-cell data with HarmonyNat. Methods 16:1289–1296https://doi.org/10.1038/s41592-019-0619-0 Google Scholar
1. La Manno G.
2. Siletti K.
3. Furlan A.
4. Gyllborg D.
5. Vinsland E.
6. Langseth C.M.
7. Khven I.
8. Johnsson A.
9. Nilsson M.
10. Lönnerberg P.
11. Linnarsson S.
2020Molecular architecture of the developing mouse brainbioRxiv https://doi.org/10.1101/2020.07.02.184051 Google Scholar
1. Loo L.
2. Simon J.M.
3. Xing L.
4. McCoy E.S.
5. Niehaus J.K.
6. Guo J.
7. Anton E.S.
8. Zylka M.J
2019Single-cell transcriptomic analysis of mouse neocortical developmentNat. Commun 10:134https://doi.org/10.1038/s41467-018-08079-9 Google Scholar
1. Lopez R.
2. Regier J.
3. Cole M.B.
4. Jordan M.I.
5. Yosef N
2018Deep generative modeling for single-cell transcriptomicsNat. Methods 15:1053–1058https://doi.org/10.1038/s41592-018-0229-2 Google Scholar
1. Mayer C.
2. Hafemeister C.
3. Bandler R.C.
4. Machold R.
5. Brito R.B.
6. Jaglin X.
7. Allaway K.
8. Butler A.
9. Fishell G.
10. Satija R
2018Developmental diversification of cortical inhibitory interneuronsNature 555:457–462https://doi.org/10.1038/nature25999 Google Scholar
1. Moreno N.
2. González A
2011The Non-Evaginated Secondary Prosencephalon of VertebratesFront. Neuroanat 5https://doi.org/10.3389/fnana.2011.00012 Google Scholar
1. Nieuwenhuys R.
2. Puelles L.
2015Towards a New NeuromorphologySpringer Google Scholar
1. Nishino J.
2. Kim S.
3. Zhu Y.
4. Zhu H.
5. Morrison S.J
2013A network of heterochronic genes including Imp1 regulates temporal changes in stem cell propertieseLife 2:e00924https://doi.org/10.7554/eLife.00924 Google Scholar
1. Nowakowski T.J.
2. Bhaduri A.
3. Pollen A.A.
4. Alvarado B.
5. Mostajo-Radji M.A.
6. Lullo E.D.
7. Haeussler M.
8. Sandoval-Espinosa C.
9. Liu S.J.
10. Velmeshev D.
11. Ounadjela J.R.
12. Shuga J.
13. Wang X.
14. Lim D.A.
15. West J.A.
16. Leyrat A.A.
17. Kent W.J.
18. Kriegstein A.R
2017Spatiotemporal gene expression trajectories reveal developmental hierarchies of the human cortexScience 358:1318–1323https://doi.org/10.1126/science.aap8809 Google Scholar
1. Ovens K.
2. Eames B.F.
3. McQuillan I
2021Comparative Analyses of Gene Co-expression Networks: Implementations and Applications in the Study of EvolutionFront. Genet 12:695399https://doi.org/10.3389/fgene.2021.695399 Google Scholar
1. Pollen A.A.
2. Bhaduri A.
3. Andrews M.G.
4. Nowakowski T.J.
5. Meyerson O.S.
6. Mostajo-Radji M.A.
7. Di Lullo E.
8. Alvarado B.
9. Bedolli M.
10. Dougherty M.L.
11. Fiddes I.T.
12. Kronenberg Z.N.
13. Shuga J.
14. Leyrat A.A.
15. West J.A.
16. Bershteyn M.
17. Lowe C.B.
18. Pavlovic B.J.
19. Salama S.R.
20. Haussler D.
21. Eichler E.E.
22. Kriegstein A.R.
2019Establishing Cerebral Organoids as Models of Human-Specific Brain EvolutionCell 176:743–756https://doi.org/10.1016/j.cell.2019.01.017 Google Scholar
1. Pollen A.A.
2. Nowakowski T.J.
3. Shuga J.
4. Wang X.
5. Leyrat A.A.
6. Lui J.H.
7. Li N.
8. Szpankowski L.
9. Fowler B.
10. Chen P.
11. Ramalingam N.
12. Sun G.
13. Thu M.
14. Norris M.
15. Lebofsky R.
16. Toppani D.
17. Kemp D.W.
18. Wong M.
19. Clerkson B.
20. Jones B.N.
21. Wu S.
22. Knutsson L.
23. Alvarado B.
24. Wang J.
25. Weaver L.S.
26. May A.P.
27. Jones R.C.
28. Unger M.A.
29. Kriegstein A.R.
30. West J.A.A
2014Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortexNat. Biotechnol 32:1053–1058https://doi.org/10.1038/nbt.2967 Google Scholar
1. Puelles L.
2. Harrison M.
3. Paxinos G.
4. Watson C
2013A developmental ontology for the mammalian brain based on the prosomeric modelTrends Neurosci 36:570–578https://doi.org/10.1016/j.tins.2013.06.004 Google Scholar
1. Qian X.
2. Shen Q.
3. Goderie S.K.
4. He W.
5. Capela A.
6. Davis A.A.
7. Temple S
2000Timing of CNS Cell Generation: A Programmed Sequence of Neuron and Glial Cell Production from Isolated Murine Cortical Stem CellsNeuron 28:69–80https://doi.org/10.1016/S0896-6273(00)00086-6 Google Scholar
1. Rakic P
2002Neurogenesis in adult primate neocortex: an evaluation of the evidenceNat. Rev. Neurosci 3:65–71https://doi.org/10.1038/nrn700 Google Scholar
1. Rowitch D.H
2004Glial specification in the vertebrate neural tubeNat. Rev. Neurosci 5:409–419https://doi.org/10.1038/nrn1389 Google Scholar
1. Schaberg E.
2. Götz M.
3. Faissner A
2022The extracellular matrix molecule tenascin-C modulates cell cycle progression and motility of adult neural stem/progenitor cells from the subependymal zoneCell. Mol. Life Sci 79:244https://doi.org/10.1007/s00018-022-04259-5 Google Scholar
1. Schmitz M.T.
2. Sandoval K.
3. Chen C.P.
4. Mostajo-Radji M.A.
5. Seeley W.W.
6. Nowakowski T.J.
7. Ye C.J.
8. Paredes M.F.
9. Pollen A.A
2022The development and evolution of inhibitory neurons in primate cerebrumNature 603:871–877https://doi.org/10.1038/s41586-022-04510-w Google Scholar
1. Siletti K.
2. Hodge R.
3. Mossi Albiach A.
4. Lee K.W.
5. Ding S.-L.
6. Hu L.
7. Lönnerberg P.
8. Bakken T.
9. Casper T.
10. Clark M.
11. Dee N.
12. Gloe J.
13. Hirschstein D.
14. Shapovalova N.V.
15. Keene C.D.
16. Nyhus J.
17. Tung H.
18. Yanny A.M.
19. Arenas E.
20. Lein E.S.
21. Linnarsson S
2023Transcriptomic diversity of cell types across the adult human brainScience 382:eadd7046https://doi.org/10.1126/science.add7046 Google Scholar
1. Striedter G.F
2004Principles of Brain EvolutionOxford University Press Google Scholar
1. Striedter G.F.
2. Northcutt R.G.
2019Brains Through Time: A Natural History of VertebratesOxford University Press https://doi.org/10.1093/oso/9780195125689.001.0001 Google Scholar
1. Tarashansky A.J.
2. Musser J.M.
3. Khariton M.
4. Li P.
5. Arendt D.
6. Quake S.R.
7. Wang B.
2021Mapping single-cell atlases throughout Metazoa unravels cell type evolutioneLife 10:e66747https://doi.org/10.7554/eLife.66747 Google Scholar
1. True J.R.
2. Haag E.S
2001Developmental system drift and flexibility in evolutionary trajectoriesEvol Dev 3:109–119https://doi.org/10.1046/j.1525-142x.2001.003002109.x Google Scholar
1. Vanderhaeghen P.
2. Polleux F
2023Developmental mechanisms underlying the evolution of human cortical circuitsNat. Rev. Neurosci 24:213–232https://doi.org/10.1038/s41583-023-00675-z Google Scholar
1. Welch J.D.
2. Kozareva V.
3. Ferreira A.
4. Vanderburg C.
5. Martin C.
6. Macosko E.Z
2019Single-Cell Multi-omic Integration Compares and Contrasts Features of Brain Cell IdentityCell 177:1873–1887https://doi.org/10.1016/j.cell.2019.05.006 Google Scholar
1. Workman A.D.
2. Charvet C.J.
3. Clancy B.
4. Darlington R.B.
5. Finlay B.L
2013Modeling Transformations of Neurodevelopmental Sequences across Mammalian SpeciesJ. Neurosci 33:7368–7383https://doi.org/10.1523/JNEUROSCI.5746-12.2013 Google Scholar
1. Xu C.
2. Lopez R.
3. Mehlman E.
4. Regier J.
5. Jordan M.I.
6. Yosef N
2021Probabilistic harmonization and annotation of single-cell transcriptomics data with deep generative modelsMol. Syst. Biol 17:e9620https://doi.org/10.15252/msb.20209620 Google Scholar
1. Yao Z.
2. Velthoven C.T.J.
3. Kunst M.
4. Zhang M.
5. McMillen D.
6. Lee C.
7. Jung W.
8. Goldy J.
9. Abdelhak A.
10. Baker P.
11. Barkan E.
12. Bertagnolli D.
13. Campos J.
14. Carey D.
15. Casper T.
16. Chakka A.B.
17. Chakrabarty R.
18. Chavan S.
19. Chen M.
20. Clark M.
21. Close J.
22. Crichton K.
23. Daniel S.
24. Dolbeare T.
25. Ellingwood L.
26. Gee J.
27. Glandon A.
28. Gloe J.
29. Gould J.
30. Gray J.
31. Guilford N.
32. Guzman J.
33. Hirschstein D.
34. Ho W.
35. Jin K.
36. Kroll M.
37. Lathia K.
38. Leon A.
39. Long B.
40. Maltzer Z.
41. Martin N.
42. McCue R.
43. Meyerdierks E.
44. Nguyen T.N.
45. Pham T.
46. Rimorin C.
47. Ruiz A.
48. Shapovalova N.
49. Slaughterbeck C.
50. Sulc J.
51. Tieu M.
52. Torkelson A.
53. Tung H.
54. Cuevas N.V.
55. Wadhwani K.
56. Ward K.
57. Levi B.
58. Farrell C.
59. Thompson C.L.
60. Mufti S.
61. Pagan C.
62. Kruse L.
63. Dee N.
64. Sunkin S.M.
65. Esposito L.
66. Hawrylycz M.J.
67. Waters J.
68. Ng L.
69. Smith K.A.
70. Tasic B.
71. Zhuang X.
72. Zeng H.
2023A high-resolution transcriptomic and spatial atlas of cell types in the whole mouse brainbioRxiv https://doi.org/10.1101/2023.03.06.531121 Google Scholar
1. Zhong S.
2. Ding W.
3. Sun L.
4. Lu Y.
5. Dong H.
6. Fan X.
7. Liu Z.
8. Chen R.
9. Zhang S.
10. Ma Q.
11. Tang F.
12. Wu Q.
13. Wang X
2020Decoding the development of the human hippocampusNature 577:531–536https://doi.org/10.1038/s41586-019-1917-5 Google Scholar
1. Zhong S.
2. Wang M.
3. Huang L.
4. Chen Y.
5. Ge Y.
6. Zhang Jiyao
7. Shi Y.
8. Dong H.
9. Zhou X.
10. Wang B.
11. Lu T.
12. Jing X.
13. Lu Y.
14. Zhang Junjing
15. Wang X.
16. Wu Q
2023Single-cell epigenomics and spatiotemporal transcriptomics reveal human cerebellar developmentNat. Commun 14:7613https://doi.org/10.1038/s41467-023-43568-6 Google Scholar
1. Zhou X.
2. Lu Y.
3. Zhao F.
4. Dong J.
5. Ma W.
6. Zhong S.
7. Wang M.
8. Wang B.
9. Zhao Y.
10. Shi Y.
11. Ma Q.
12. Lu T.
13. Zhang J.
14. Wang X.
15. Wu Q
2022Deciphering the spatial-temporal transcriptional landscape of human hypothalamus developmentCell Stem Cell 29:328–343https://doi.org/10.1016/j.stem.2021.11.009 Google Scholar
1. Zhou Y.
2. Zhu J.
3. Tong T.
4. Wang J.
5. Lin B.
6. Zhang J
2019A statistical normalization method and differential expression analysis for RNA-seq data between different speciesBMC Bioinformatics 20:1–10https://doi.org/10.1186/s12859-019-2745-1 Google Scholar
1. Zhu Y.
2. Sousa A.M.M.
3. Gao T.
4. Skarica M.
5. Li M.
6. Santpere G.
7. Esteller-Cucala P.
8. Juan D.
9. Ferrández-Peral L.
10. Gulden F.O.
11. Yang M.
12. Miller D.J.
13. Marques-Bonet T.
14. Imamura Kawasawa Y.
15. Zhao H.
16. Sestan N
2018Spatiotemporal transcriptomic divergence across human and macaque brain developmentScience 362:eaat8077https://doi.org/10.1126/science.aat8077 Google Scholar
1. Schmitz M
2. Pollen A
2025A Global View of Cell Type Homology and Transcriptomic Divergence in the Mammalian BrainNCBI Gene Expression Omnibus ID GSE306257https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE306257

Article and author information

Author information

Matthew T Schmitz
Allen Institute for Brain Science, Seattle, United States
ORCID iD: 0000-0002-6177-8161
- For correspondence: matthew.schmitz@alleninstitute.org
Jingwen W Ding
Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, University of California, San Francisco, San Francisco, United States, Department of Neurology, University of California, San Francisco, San Francisco, United States
Sara Nolbrant
Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, University of California, San Francisco, San Francisco, United States
Reed McMullen
Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, University of California, San Francisco, San Francisco, United States, Department of Neurology, University of California, San Francisco, San Francisco, United States
Chang N Kim
Department of Neurological Surgery, University of California, San Francisco, San Francisco, United States, Department of Anatomy, University of California, San Francisco, San Francisco, United States, Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, United States
Bryan J Pavlovic
Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, University of California, San Francisco, San Francisco, United States, Department of Neurology, University of California, San Francisco, San Francisco, United States
Tomasz J Nowakowski
Department of Neurological Surgery, University of California, San Francisco, San Francisco, United States, Department of Anatomy, University of California, San Francisco, San Francisco, United States, Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, United States, Department of Psychiatry and Behavioral Sciences, University of California, San Francisco, San Francisco, United States, Kavli Institute for Fundamental Neuroscience, University of California, San Francisco, San Francisco, United States
Trygve E Bakken
Allen Institute for Brain Science, Seattle, United States
ORCID iD: 0000-0003-3373-7386
Chun Jimmie Ye
Institute for Human Genetics, University of California, San Francisco, San Francisco, United States, Division of Rheumatology, Department of Medicine, University of California, San Francisco, San Francisco, United States
ORCID iD: 0000-0001-6560-3783
Alex A Pollen
Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, University of California, San Francisco, San Francisco, United States, Department of Neurology, University of California, San Francisco, San Francisco, United States, Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, United States
ORCID iD: 0000-0003-3263-8634
- For correspondence: alex.pollen@ucsf.edu

Author Notes

Competing interests: No competing interests declared

Version history

Sent for peer review: October 30, 2025
Preprint posted: November 3, 2025
Reviewed Preprint version 1: March 16, 2026

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.109659. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

views: 237
downloads: 9
citations: 0

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Significance of findings

Strength of evidence

Abstract

Introduction

Results

Construction of a cross-species developmental brain meta-atlas

Construction of a cross-species developmental brain meta-atlas.

Taxonomy and ANTIPODE model architecture.

ANTIPODE as a cross-species integration method and differential expression model

Benchmarking ANTIPODE against existing cross-species integration methods.

Principles of evolutionary divergence in developmental gene expression

Modes and landscape of evolutionary differential expression.

Gene expression evolution in context.

Spatiotemporal dynamics of progenitor states and neurogenic timing

A Bayesian model of progenitor-state progression across species and regions.

Genes associated with developmental timing in progenitors

Gene programs linked to early versus late neurogenesis.

Discussion

Methods

Generation of scRNA-seq data

Samples

Single-cell RNA sequencing tissue processing

Alignments and gene models

Quality control

Processing of data

Calculation of gene expression estimates

Lateness

Neuropeptide ligand-receptor interactions

Other bioinformatic analysis

Large language model statement

Supplementary Figures

Dataset composition, batch structure and basic QC.

Goodness-of-fit for ANTIPODE gene-specific parameters.

Conserved marker expression across initial classes.

Mapping developing clusters to adult mouse subclasses.

Shared transcription-factor programs between development and adulthood.

Qualitative comparison of integration methods on three benchmark datasets.

Additional evolutionary gene expression divergence.

Neuropeptide ligand–receptor expression landscape across progenitor and neuronal states.

Additional analyses of the progenitor timing model.

Data availability

Acknowledgements

Additional files

Additional information

Funding

References

Article and author information

Author information

Matthew T Schmitz

Jingwen W Ding

Sara Nolbrant

Reed McMullen

Chang N Kim

Bryan J Pavlovic

Tomasz J Nowakowski

Trygve E Bakken

Chun Jimmie Ye

Alex A Pollen

Author Notes

Version history

Cite all versions

Copyright

Metrics