Research Article

Transcriptional cartography integrates multiscale biology of the human cortex

Wellcome Centre for Human Neuroimaging, University College London, United Kingdom
UCL Great Ormond Street Institute for Child Health, United Kingdom
Department of Psychiatry, University of Pennsylvania, United States
Department of Child and Adolescent Psychiatry and Behavioral Science, The Children's Hospital of Philadelphia, United States
Department of Biostatistics, Vanderbilt University, United States
Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, United States
Department of Psychiatry, Harvard Medical School, United States
Department of Psychiatry, University of Cambridge, United Kingdom
Section on Developmental Neurogenomics, Human Genetics Branch, National Institute of Mental Health, United States
Lifespan Informatics and Neuroimaging Center, University of Pennsylvania School of Medicine, United States
Penn Statistics in Imaging and Visualization Center, Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, United States
Center for Autism Research and Treatment, Semel Institute, Program in Neurogenetics, Department of Neurology and Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, United States

Feb 7, 2024

https://doi.org/10.7554/eLife.86933.3

Open access
Copyright information

eLife assessment

This study provides continuous maps of human brain gene expression and explores their relationship with a large variety of microscopic and macroscopic aspects of brain organisation. The authors provide convincing evidence for a relationship between gene expression maps with various aspects of the anatomy of adult brains, during development, and in the case of mental disorders. The data and methods introduced can be an important tool for neuroimaging research.

https://doi.org/10.7554/eLife.86933.3.sa0

Significance of the findings:

Important: Findings that have theoretical or practical implications beyond a single subfield

Landmark
Fundamental
Important
Valuable
Useful

Strength of evidence:

Convincing: Appropriate and validated methodology in line with current state-of-the-art

Exceptional
Compelling
Convincing
Solid
Incomplete
Inadequate

During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife Assessments

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

The cerebral cortex underlies many of our unique strengths and vulnerabilities, but efforts to understand human cortical organization are challenged by reliance on incompatible measurement methods at different spatial scales. Macroscale features such as cortical folding and functional activation are accessed through spatially dense neuroimaging maps, whereas microscale cellular and molecular features are typically measured with sparse postmortem sampling. Here, we integrate these distinct windows on brain organization by building upon existing postmortem data to impute, validate, and analyze a library of spatially dense neuroimaging-like maps of human cortical gene expression. These maps allow spatially unbiased discovery of cortical zones with extreme transcriptional profiles or unusually rapid transcriptional change which index distinct microstructure and predict neuroimaging measures of cortical folding and functional activation. Modules of spatially coexpressed genes define a family of canonical expression maps that integrate diverse spatial scales and temporal epochs of human brain organization – ranging from protein–protein interactions to large-scale systems for cognitive processing. These module maps also parse neuropsychiatric risk genes into subsets which tag distinct cyto-laminar features and differentially predict the location of altered cortical anatomy and gene expression in patients. Taken together, the methods, resources, and findings described here advance our understanding of human cortical organization and offer flexible bridges to connect scientific fields operating at different spatial scales of human brain research.

Introduction

The human cerebral cortex is an astoundingly complex structure that underpins many of our distinctive facilities and vulnerabilities (Geschwind and Rakic, 2013). Achieving a mechanistic understanding of cortical organization in health and disease requires integrating information across its many spatial scales: from macroscale cortical folds and functional networks (Glasser et al., 2016) to the gene expression programs that reflect microscale cellular and laminar features (Hawrylycz et al., 2012; Kelley et al., 2018). However, a hard obstacle to this goal is that our measures of the human cortex at macro- and microscales are fundamentally mismatched in their spatial sampling. Macroscale measures from in vivo neuroimaging provide spatially dense estimates of structure and function, but microscale measures of gene expression are gathered from spatial discontinuous postmortem samples that have so far only been linked to macroscale features using methodologically imposed cortical parcellations (Hansen et al., 2021; Larivière et al., 2021; Seidlitz et al., 2020). Consequently, local transitions in human cortical gene expression remain uncharacterized and unintegrated with the spatially fine-grained topographies of human cortical structure and function that are revealed by in vivo neuroimaging (Gryglewski et al., 2018; Markello et al., 2021). Finding a way to bridge this gap would not only enrich both our micro- and macroscale models of human cortical organization, but also provide an essential framework for translation across traditionally siloed scales of neuroscientific research.

Here, we use spatially sparse postmortem data from the Allen Human Brain Atlas (AHBA; Hawrylycz et al., 2012) to generate spatially dense cortical expression maps (DEMs) for 20,781 genes in the adult brain, with accompanying DEM reproducibility scores to facilitate wider usage. These maps allow a fine-grained transcriptional cartography of the human cortex, which we integrate with diverse genomic, histological, and neuroimaging resources to shed new light on several fundamental aspects of human cortical organization in health and disease. First, we show that DEMs can recover canonical gene expression boundaries from in situ hybridization (ISH) data, predict previously unknown expression boundaries, and align with regional differences in cortical organization from several independent data modalities. Second, by focusing on the local transitions in gene expression which are captured by DEMs, we reveal a close spatial coordination between molecular and functional specializations of the cortex and establish that the spatial orientation of cortical folding and function at macroscale is aligned with local tangential transitions in cortical gene expression. Third, by defining and annotating gene co-expression modules across the cortex at multiple scales we systematically link macroscale measures of cortical structure and function in vivo to postmortem markers of cortical lamination, cellular composition, and development from early fetal to late adult life. Finally, as a proof of principle, we use this novel framework to secure a newly integrated multiscale understanding of atypical brain development in autism spectrum disorder (ASD).

The tools and results from this analysis of the human cortex, which we collectively call Multiscale Atlas of Gene expression for Integrative Cortical Cartography (MAGICC), open up an empirical bridge that can now be used to connect cortical models (and scientists) that have so far operated at segregated spatial scales. To this end, we share (i) all gene-level DEMs and derived transcriptional landscapes in neuroimaging-compatible files for easy integration with in vivo macroscale measures of human cortical structure and function; and (ii) all gene sets defining spatial subcomponents of cortical transcription for easy integration with any desired genomic annotation (https://github.com/kwagstyl/magicc).

Results

Creating and benchmarking spatially dense maps of human cortical gene expression

To create a dense transcriptomic atlas of the cortex, we used AHBA microarray measures of gene expression for 20,781 genes in each of 1304 cortical samples from six donor left cortical hemispheres (‘Materials and methods,’Table S1Supplementary file 1). We extracted a model of each donor’s cortical sheet by processing their brain MRI scan and identified the surface location (henceforth ‘vertex’) of each postmortem cortical sample in this sheet (‘Materials and methods,’ Figure 1a). For each gene, we then propagated measured expression values into neighboring vertices using nearest-neighbor interpolation followed by smoothing (‘Materials and methods,’ Figure 1b and c). Expression values were scaled across vertices and these vertex-level expression maps were averaged across donors to yield a single DEM for each gene, which provided estimates of expression at ~30,000 vertices across the cortical sheet (e.g., DEM for PVALB, upper panel of Figure 1d). These fine-grained vertex-level expression measures also enabled us to estimate the orientation and magnitude of expression change for each gene at every vertex (e.g., dense expression change map for PVALB, lower panel of Figure 1d).

Figure 1 with 2 supplements see all

Download asset Open asset

Creating and benchmarking spatial dense gene expression maps in the human cortex.

(a) Spatially discontinuous Allen Human Brain Atlas (AHBA) microarray samples (red points) were aligned with MRI-derived cortical surface mesh reconstructions. (b) AHBA vertex expression values were propagated using nearest-neighbor interpolation and subsequently smoothed (c). (d) Subject-level maps were z-normalized and averaged to generate a single reference dense expression map (DEM) for each gene, as well as the associated expression gradient map (shown here for PVALB: top and bottom, respectively). (e) DEMs can recover known expression boundaries in in situ hybridization (ISH) data. Four canonical V1 area markers (Zeng et al., 2012) show a significantly sharp DEM expression gradient at the V1/V2 boundary (inset cortical map and Figure 1—figure supplement 2a, b), which is also evident in all four individual gene DEMs and DEM gradients (SYT6, PENK, and Figure 1—figure supplement 2c). (f) DEMs can discover previously unknown expression boundaries. Genes with high DEM gradients across the PeEc (parahippocampal) and TF (fusiform) gyri (inset cortical map) were validated in ISH data, showing sharp expression changes in both directions at this boundary (CHRNA3, NGB, and Figure 1—figure supplement 2d-f). (g) Illustrative comparisons of selected DEMs against regional variation in microscale measures of cellular composition: scatterplot showing the global correlation of regional cellular proportions from single nucleus RNAseq (snRNAseq) across 16 cells and 6 regions (Lake et al., 2016) with DEM values for corresponding cell-type marker genes (R = 0.48, p_spin<0.001, excluding Ex3-V1 and In8-BA10 outlier samples). (h) DEMs for markers of six neuronal subtypes (three excitatory: FEZF2, RORB, THEMIS; three inhibitory: PVALB, SST, VIP) based on recently validated subtype marker genes (Bakken et al., 2021; Hodge et al., 2019). (i) Illustrative comparison of layer IV marker DEMs with corresponding mesoscale cortical measure of layer IV thickness from a 20 μm 3D histological atlas of cortical layers. (j) Illustrative comparisons of selected DEMs with corresponding macroscale cortical measures from independent neuroimaging markers.

We assessed the reproducibility of DEMs by repeating the above process (Figure 1) after repeatedly splitting the donors into non-overlapping groups of varying size and using learning curve analyses to estimate the DEM reproducibility achieved by our full set of six donors. For cortically expressed genes (‘Materials and methods,’ Supplementary file 2), the average reproducibility of gene expression maps was r_gene = 0.58 (correlation of expression values for a gene across vertices), and the average reproducibility of ranked gene expression at each vertex was r_vertex = 0.63 (correlation of expression values at a vertex across genes) (Figure 1—figure supplement 1c-d). These estimates were both substantially lower for genes not reported to be cortically expressed in the independent Human Protein Atlas (r_gene = 0.34, t = 37.6, p<0.001 and r_vertex = 0.39, t = 273.6, p<0.001, respectively, ‘Materials and methods,’ Supplementary file 2). Genes without recorded cortical expression were threefold enriched (p=0) among the 9647 genes with estimated DEM reproducibility values of r < 0.5. Regional differences in the density of postmortem sampling in the AHBA did not influence DEM reproducibility or the magnitude of local expression change captured by DEMs (‘Materials and methods,’ Figure 1—figure supplement 1h). Thus, remedying the current lack of any spatially dense gene expression maps in the human cortex, we provide DEMs (and accompanying dense expression change maps) for 20,781 genes and establish that >11k of these DEMs show a spatial reproducibility score of r_gene > 0.5 between sets of unrelated individuals. Gene-level DEM reproducibility scores allow future users to filter on this feature as desired, and we establish that key analytic outputs from DEMs (see below) show good reproducibility between unrelated individuals and can be recovered at different DEM reproducibility filters.

Given that DEMs were generated by interpolating expression values between sampled regions, we assessed whether DEMs could recover sharp local microscale transitions in gene expression that could theoretically be obscured by interpolation. Of the very few such transitions that have been verified by ISH in humans, the best established occurs between occipital areas V1 and V2 (Zeng et al., 2012). All four genes known to show a sharp V1/V2 expression boundary across layers by ISH – SYT6, TLE4, PCP4, PENK – exhibited qualitatively and quantitatively sharp expression transitions at the V1/V2 boundary in their DEMs (Figure 1e, Figure 1—figure supplement 2a-d). Motivated by this validation, we next asked whether DEMs could identify previously unknown expression boundary markers in the human cortex. To achieve this, we took advantage of extensive existing ISH data between parahippocampal (area PeEc) and fusiform gyri (area TF). We ranked genes by the magnitude of their expression gradient between these cortical regions in DEMs (‘Materials and methods’) and identified four genes with sharp expression transitions predicted by DEMs – NGB,HTR2A (TF > PeEc) and NTS, CHRNA3 (PeEc > TF) – for which independent ISH data were available. Expression profiling in ISH slabs verified the existence of sharp expression transition for all four genes (Figure 1f, Figure 1—figure supplement 2e-g). As the V1/V2 and the PeEc/TF boundaries both involve transitions between classical laminar types in cortical regions with highly conserved anatomical patterning (von Economo and Koskinas, 1925), we also tested whether DEMs could recover expression boundaries in more variable and uniformly laminated association cortex (Ronan and Fletcher, 2015). No such expression boundaries have been described in humans by ISH, but there are reports of sharp expression boundaries between frontal areas 44 and 45b for several genes in non-human primates: SCN1B, KCNS1, TRIM55 (Chen et al., 2022). These genes also exhibited high DEM gradients at the boundary between human frontal areas 44 and 45 (Figure 1—figure supplement 2h-g). Taken together, these observations demonstrate the capacity of DEMs to resolve sharp expression transitions and indicate that DEMs can be used to help target prospective postmortem validation of new expression boundaries in humans.

To benchmark and illustrate the use of DEMs to capture cortical features across contrasting spatial scales, we drew on selected micro- and macroscale cortical measures that DEMs should align with based on known biological processes (Figure 1g–j, ‘Materials and methods’). To assess whether DEMs could recover microscale differences in cellular patterning across the cortical sheet, we considered the ground truth of neuronal cell-type proportions as measured by single-nucleus RNAseq (snRNAseq) across six different cortical regions (Lake et al., 2016). We observed a strong spatial correlation (r = 0.6, p_spin<0.001) between regional marker gene expression in DEMs and regional proportions of their corresponding neuronal subtypes from snRNAseq (Figure 1g, ‘Materials and methods’). Figure 1h shows example marker gene DEMs for six canonical neuronal subtypes: three excitatory (FEZF2, RORB, THEMIS) and three inhibitory (PVAL, SST, VIP) (Bakken et al., 2021; Hodge et al., 2019). Next, to assess whether DEMs could recover regional variation in the mesoscale feature of cortical layering, we tested and verified that regional variation in the average DEM for layer IV marker genes (He et al., 2017; Maynard et al., 2021; Zeng et al., 2012) was highly correlated with regional variation in layer IV thickness as determined from a 3D histological atlas of cortical layers (Wagstyl et al., 2020; Figure 1i). Finally, we asked whether DEMs could recover spatially dense measures of regional variation across the cortical sheet as provided by neuroimaging data and found that maps from diverse measurement modalities showed strong and statistically significant spatial correlations with their corresponding DEM(s) relative to a null distribution based on random ‘spinning’ of maps (Alexander-Bloch et al., 2018; Figure 1j, ‘Materials and methods,’ all p_spin<0.01): (i) areas of cortex activated during motor fMRI tasks in humans (Glasser et al., 2016) vs. the average DEM for canonical cell markers of large pyramidal neurons (Betz cells) found in layer V of the motor cortex that are the outflow for motor movements (Bakken et al., 2021), (ii) an in vivo neuroimaging marker of cortical myelination (T1/T2 ratio [Glasser and Van Essen, 2011]) vs. the Myelin Basic Protein DEM, which marks myelin, and (iii) the degree of in vivo regional cortical thinning by MRI in Alzheimer’s disease (AD) patients who have at least one APOE E4 variant (Gutiérrez-Galve et al., 2009; LaMontagne et al., 2019) vs. the APOE DEM (thinning map generated from 119 APOE E4 patients and 633 controls structural MRI [sMRI] scans as detailed in ‘Materials and methods’), testing the hypothesis that higher regional APOE expression will result in greater cortical atrophy in individuals with the APOE E4 risk allele. Collectively, the above tests of reproducibility (Figure 1—figure supplement 1) and convergent validity (Figure 1e–j) supported the use of DEMs for downstream analyses.

Defining and surveying the human cortex as a continuous transcriptional terrain

As an initial summary view of transcriptional patterning in the human cortex, we first averaged all 20,781 DEMs to represent the cortex as a single continuous transcriptional terrain, where altitude encodes the transcriptional distinctiveness (TD) of each cortical point (vertex) relative to all others (TD = mean(abs(z_exp)), Figure 2a, Video 1). This terrain view revealed six statistically significant TD peaks (‘Materials and methods,’ Figure 2a and b) which recover all major archetypal classes of the mammalian cortex as defined by classical studies of laminar and myelo-architecture, connectivity, and functional specialization (Mesulam, 1998) encompassing primary visual (V1), somatosensory (Brodmann area [BA] [Brodmann, 1909] 2), and motor cortex (BA 4), as well as limbic (temporal pole centered on dorsal temporal area G [TGd]; von Economo and Koskinas, 1925, ventral frontal centered in orbitofrontal cortex [OFC]) and heteromodal association cortex (BA 9-46d). Of note, our agnostic parcellation of all TD peak vertices by their ranked gene lists (‘Materials and methods’) perfectly cleaved BA2 and BA4 along the central sulcus, despite there being no representation of this macroanatomical landmark in DEMs. The TD map observed from the full DEMs library was highly stable between all disjoint triplets of donors (‘Materials and methods,’ Figure 2—figure supplement 1a, median cross-vertex correlation in TD scores between triplets r = 0.77) and across library subsets at all deciles of DEM reproducibility (‘Materials and methods,’ Figure 2—figure supplement 1b, cross-vertex correlation in TD scores r > 0.8 for the 3rd to 10th deciles), but was not recapitulated in spun null datasets (Figure 2—figure supplement 1c).

Figure 2 with 1 supplement see all

Download asset Open asset

Mapping transcriptional distinctiveness (TD) in the human cortex and its alignment with macroscale structure and function.

(a) Regional TD can be quantified as the mean absolute z-score of dense expression map (DEM) values at each vertex (top) and visualized as a continuous cortical map (middle, TD encoded by color) or in a relief map of the flattened cortical sheet (bottom, TD encoded by color and elevation, Video 1). Black lines on the inflated view identify cuts for the flattening procedure. The cortical relief map is annotated to show the central sulcus (CS), and peaks of TD overlying dorsal sensory and motor cortices (Brodmann areas, BA2, BA4), the primary visual cortex (V1), temporal pole (TGd), insula (Ins), and ventromedial prefrontal cortex (OFC). (b) Thresholding the TD map through spatial permutation of DEMs (t_spin ; ‘Materials and methods’) and clustering significant vertices by their expression profile defined six TD peaks in the adult human cortex (depicted as colored regions on terrain and inflated cortical surfaces). (c) Cortical vertices projected into a 3D coordinate system defined by the first three principal components (PCs) of gene expression, colored by the continuous TD metric (left) and TD peaks (right). TD peaks are focal anchors of cortex-wide expression PCs. (d) TD peaks show statistically significant functional specializations in a meta-analysis of in vivo functional MRI data. (e) The average magnitude of local expression transitions across genes (color) and principal orientation of these transitions (white bars) varies across the cortex. (f) Cortical folds in Allen Human Brain Atlas (AHBA) donors (top surface maps and middle flat map) tend to be aligned with the principal orientation of TD change across cortical vertices (p<0.01, middle histogram, sulci running perpendicular to TD change), and the strength of this alignment varies between cortical regions. (g) Putative cortical areas defined by a multimodal in vivo MRI parcellation of the human cortex (Glasser et al., 2016) (top surface maps and middle flat map) also tend to be aligned with the principal direction of gene expression change across cortical vertices (p<0.01, middle histogram, sulci running perpendicular to long axis of area boundaries), and the strength of this alignment varies between cortical areas.

Video 1

Download asset

posterframe for video — Visualisation of Transcriptional Distinctiveness (TD) in the human cortex, encoded by both color and elevation.

Integration with principal component analysis (PCA) of DEMs across vertices (‘Materials and methods,’ Figure 2—figure supplement 1d and e) showed that TD peaks constitute sharp poles of more recently recognized cortical expression gradients (Burt et al., 2018; Figure 2c). The ‘area-like’ nature of these TD peaks is reflected by the steep slopes of transcriptional change surrounding them (Figure 2a and e) and could be quantified as TD peaks being transcriptomically more distinctive than their physical distance from other cortical regions would predict (Figure 2—figure supplement 1f and g). In contrast, transitions in gene expression are more gradual and lack such sharp transitions in the cortical regions between TD peaks (Figure 2a, c and e, Figure 2—figure supplement 1j). Thus, because DEMs provide spatially fine-grained estimates of cortical expression and expression change, they offer an objective framework for arbitrating between area-based and gradient-based views of cortical organization in a regionally specific manner.

The TD peaks defined above exist as both discrete patches of cortex and the distinctive profile of gene expression which defines each peak, and this duality offers an initial bridge between macro- and microscale views of cortical organization. Specifically, we found that each TD peak overlapped with a functionally specialized cortical region based on meta-analysis of in vivo functional neuroimaging data (Yarkoni et al., 2011; ‘Materials and methods,’ Figure 2d, Supplementary file 3), and featured a gene expression signature that was preferentially enriched for a distinct set of biological processes, cell-type signatures, and cellular compartments (‘Materials and methods,’ Supplementary file 2). For example, the peaks overlapping area TGd and OFC were enriched for synapse-related terms, while BA2 and BA4 TD peaks were predominantly enriched for metabolic and mitochondrial terms. At a cellular level, V1 closely overlapped with DEMs for marker genes of the Ex3 neuronal subtype known to be localized to V1 (Lake et al., 2016), while BA4 closely overlapped Betz cell markers (Bakken et al., 2021; Figure 2—figure supplement 1h).

The expression profile of each TD peak was achieved through surrounding zones of rapid transcriptional change (Figure 2a and e, Figure 2—figure supplement 1i and j). We noted that these transition zones tended to overlap with cortical folds, suggesting an alignment between spatial orientations of gene expression and folding. To formally test this idea, we defined the dominant orientation of gene expression change at each vertex (‘Materials and methods,’ Figure 2e) and computed the angle between this and the orientation of folding (‘Materials and methods’). The observed distribution of these angles across vertices was significantly skewed relative to a null based on random alignment between angles (p_spin<0.01, Figure 2f, ‘Materials and methods’), indicating that there is indeed a tendency for cortical sulci and the direction of fastest transcriptional change to run perpendicular to each other (p_spin<0.01, Figure 2f). A similar alignment was seen when comparing gradients of transcriptional change with the spatial orientation of putative cortical areas defined by multimodal functional and structural in vivo neuroimaging (Glasser et al., 2016) (expression change running perpendicular to area long axis, p_spin<0.01, Figure 2g, ‘Materials and methods’). Visualizing these expression-folding and expression-areal alignments revealed greatest concordance over sensorimotor, medial occipital, cingulate, and posterior perisylvian cortices (with notable exceptions of transcription change running parallel to sulci and the long axis of putative cortical areas in lateral temporoparietal and temporopolar regions). As a preliminary probe for causality, we examined the developmental ordering of regional folding and regional transcriptional identity. Mapping the expression of high-ranking TD genes in fetal cortical laser dissection microarray data (Miller et al., 2014) from 21 PCW (post conception weeks) (‘Materials and methods’) showed that the localized transcriptional identity of V1 and TGd regions in adulthood is apparent during the fetal periods that folding topology begins to emerge (Chi et al., 1977; Xu et al., 2022; Figure 2—figure supplement 1k). Thus, the unique capacity of DEMs to resolve local orientations of expression change reveals a close spatial alignment between regional transitions of cortical gene expression at microscale and regional transitions of cortical folding, structure, and function at macroscale.

Cortical gene co-expression integrates diverse spatial scales of human brain organization

To complement the TD analyses above (Figure 2), we next used weighted gene co-expression network analysis (WGCNA; Langfelder and Horvath, 2008, ‘Materials and methods’, Figure 3a) to achieve a more systematic integration of macro- and macroscale cortical features. Briefly, WGCNA constructs a connectivity matrix by quantifying pairwise co-expression between genes, raising the correlations to a power (here 6) to emphasize strong correlations while penalizing weaker ones, and creating a topological overlap matrix (TOM) to capture both pairwise similarities expression and connectivity. Modules of highly interconnected genes are identified through hierarchical clustering. The resultant WGCNA modules enable topographic and genetic integration because they each exist as both (i) a single expression map (eigenmap) for spatial comparison with neuroimaging data (Figure 3a and b, ‘Materials and methods’) and (ii) a unique gene set for enrichment analysis against marker genes systematically capturing multiple scales of cortical organization, namely cortical layers, cell types, cell compartments, protein–protein interactions (PPI), and GO terms (‘Materials and methods,’ Supplementary files 2 and 4). Furthermore, whereas prior applications of WGCNA to AHBA data have revealed gene sets that covary in expression across many different compartments of the brain (Hartl et al., 2021; Hawrylycz et al., 2015; Kelley et al., 2018), using DEMs as input to WGCNA generates modules that are purely based on the fine-scale coordination of gene expression across the cortex. Using WGCNA, we identified 16 gene modules (M1–M16), which we then deeply annotated against independent measures of cortical organization at diverse spatial scales and developmental epochs (Figure 3c, ‘Materials and methods’). Module eigenmaps were primarily driven by highly reproducible genes (Figure 3—figure supplement 1a) as were enrichments for annotational gene sets (median reproducibility of enriching genes = 0.59, p<0.001 elevated vs. background).

Figure 3 with 1 supplement see all

Download asset Open asset

Cortex-wide gene co-expression patterns reflect multiple spatial scales and developmental epochs of brain organization.

(a) Overview of weighted gene co-expression network analysis (WGCNA) pipeline applied to the full dense expression map (DEM) dataset. Starting top left: the pairwise DEM spatial correlation matrix is used to generate a topological overlap matrix between genes (middle top), which is then clustered. Of the 23 WGCNA-defined modules, 7 were significantly enriched for non-cortical genes and removed, leaving 16 modules. Each module is defined by a set of spatially co-expressed genes, for which the principal component of expression can be computed and mapped at each cortical point (eigenmap). M6 is shown as an example projected onto an inflated left hemisphere (M6 z-scored expression and M6 expression change), and the bulk transcriptional distinctiveness (TD) terrain view from Figure 2 (M6 expression). (b) The extremes of WGCNA eigenmaps highlight different peaks in the cortical terrain: the main TD terrain colored by TD value (center, from Figure 2), surrounded by TD terrain projections of selected WGCNA eigenmaps. (c) WGCNA modules (eigenmaps and gradient maps, rows) are enriched for multiscale aspects of cortical organization (columns). Cell color intensity indicates pairwise statistical significance (p<0.05), while black outlines show significance after correction for multiple comparisons across modules. Columns capture key levels of cortical organization at different spatial scales (arranged from macro- to microscale) and developmental epochs: spatial alignment between module eigenmaps and in vivo MRI maps of cortical folding orientation, cortical thickness and T1/T2 ratio, fMRI resting-state functional networks; enrichment for module gene sets for independent annotations (Supplementary file 2) marking: cortical layers (He et al., 2017; Maynard et al., 2021); cell types (Darmanis et al., 2015; Habib et al., 2017; Hodge et al., 2019; Lake et al., 2018; Lake et al., 2016; Li et al., 2018; Ruzicka et al., 2021; Velmeshev et al., 2019; Zhang et al., 2016); subcellular compartments (Binder et al., 2014); synapse-related genes (Koopmans et al., 2019); protein–protein interactions between gene products (Szklarczyk et al., 2019); temporal epochs of peak expression (Werling et al., 2020) (‘fetal’: 8–24 21 post conception weeks [PCW]/’‘perinatal’' 24 PCW–6 mo/‘postnatal’ > 6 mo); transient layers of the mid-fetal human cortex at 21 PCW (Miller et al., 2014) (subpial granular zone [SG], marginal zone [MZ], cortical plate [CP], subplate [SP], intermediate zone [IZ], subventricular zone [SZ], and ventricular zone [VZ]); and fetal cell types at 17–18 PCW (Polioudakis et al., 2019). (d) Independent validation of multiscale enrichments for selected modules M2 and M12. M2 significantly overlaps the Neurosynth topic associated with the terms motor, cortex, and hand. Two high-ranking M2 genes, MOG and TF, exhibit clear layer VI peaks on in situ hybridization (ISH) and GO enrichment analysis myelin-related annotations. M12, overlapping the limbic network most closely overlapped the Neurosynth topic associated with social reasoning. Two high-ranking M22 genes GABRA2 and GRIN2B showed layer II ISH peaks and GO enrichment analysis revealed synaptic annotations. (e) Network visualization of pairwise overlaps between annotational gene sets used in (c), including WGCNA module gene sets (inset expression eigenmaps).

Several WGCNA modules showed statistically significant alignments with structural and functional features of the adult cerebral cortex from in vivo imaging (‘Materials and methods,’ Figure 3c; Glasser and Van Essen, 2011; Yeo et al., 2011). For example, (i) the M6 eigenmap was significantly positively correlated with in vivo measures of cortical thickness from sMRI and enriched within a limbic functional connectivity network defined by resting-state functional connectivity MRI, and (ii) the M8, M9, and M14 eigenmaps showed gradients of expression change that were significantly aligned with the orientation of cortical folding (especially around the central sulcus, medial prefrontal, and temporo-parietal cortices, Figure 3—figure supplement 1b). At microscale, several WGCNA module gene sets showed statistically significant enrichments for genes marking specific cortical layers (He et al., 2017; Maynard et al., 2021) and cell types (Darmanis et al., 2015; Habib et al., 2017; Hodge et al., 2019; Lake et al., 2018; Lake et al., 2016; Li et al., 2018; Ruzicka et al., 2021; Velmeshev et al., 2019; Zhang et al., 2016; ‘Materials and methods,’ Figure 3c, Supplementary file 4). These microscale enrichments were often congruent between cortical layers and cell classes annotations, and in keeping with the linked eigenmap (Figure 3c, Supplementary file 4). For example, M4, which was uniquely co-enriched for markers of endothelial cells and middle cortical layers, showed peak expression over dorsal motor cortices which are known to show expanded middle layers (Bakken et al., 2021; Wagstyl et al., 2020) with rich vascularization (Pfeifer, 1940) relative to other cortical regions. Similarly, M6, which was enriched for markers of astrocytes, microglia, and excitatory neurons, as well as layers 1/2, showed peak expression over rostral frontal and temporal cortices which are known to possess relatively expanded supragranular layers (Wagstyl et al., 2020) that predominantly contain the apical dendrites of excitatory neurons and supporting glial cells (von Economo and Koskinas, 1925). We also observed that modules with similar eigenmaps (Figure 3—figure supplement 1c), (including overlaps of multiple modules with the same TD peak) could show contrasting gene set enrichments. For example, M2 and M4 both showed peak expression of dorsal sensorimotor cortex (i.e., TD areas BA2 and BA4), but M2 captures a distinct architectonic signature of sensorimotor cortex from the mid-layer vascular signal of M4: expanded and heavily myelinated layer 6 (Bakken et al., 2021; Palomero-Gallagher and Zilles, 2019; Wagstyl et al., 2020; Figure 3c). The spatially co-expressed gene modules detected by WGCNA were not only congruently co-enriched for cortical layer and cell markers, but also for nanoscale features such as subcellular compartments (Binder et al., 2014; Supplementary files 2 and 4) (often aligning with the cellular enrichments) and PPIs (Szklarczyk et al., 2019; ‘Materials and methods,’ Figure 3c, Supplementary file 4). This demonstrates the capacity of our resource to tease apart subtle subcomponents of neurobiology based on cortex-wide expression patterns.

To further assess the robustness of these multiscale relationships, we focused on two modules with contrasting multiscale signatures – M2 and M12 – and tested for reproducibility of our primary findings (Figure 3c) using orthogonal methods. Our primary analyses indicated that M2 has an expression eigenmap which overlaps with the canonical somatomotor network from resting-state functional neuroimaging (Yeo et al., 2011) and contains genes that are preferentially expressed in cortical layer 6 from layer-resolved transcriptomics (He et al., 2017; Maynard et al., 2021), and in oligodendrocytes from snRNAseq (Darmanis et al., 2015; Habib et al., 2017; Hodge et al., 2019; Lake et al., 2018; Lake et al., 2016; Li et al., 2018; Ruzicka et al., 2021; Velmeshev et al., 2019; Zhang et al., 2016; Figure 3c). We were able to verify each of these observations through independent validations including spatial overlap of M2 expression with meta-analytic functional activations relating to motor tasks (Yarkoni et al., 2011); immunohistochemistry localization of high-ranking M2 genes to deep cortical layers (Zeng et al., 2012; ‘Materials and methods’); and significant enrichment of M2 genes for myelin-related GO terms (Figure 3d, Supplementary file 4). By contrast, our primary analyses indicated that M12, which had peak expression over ventral frontal and temporal limbic cortices, was enriched for marker genes for layer 2, neurons and the synapse (Figure 3c). These multiscale enrichments were all supported by independent validation analyses, which showed that the M12 eigenmaps is enriched in a limbic network that is activated during social reasoning (Yarkoni et al., 2011) high-ranking M12 marker genes show elevated expression in upper cortical layers by immunohistochemistry (Zeng et al., 2012; ‘Materials and methods’); and there is a statistically significant over-representation of synapse compartment GO terms in the M12 gene set (Figure 3d, Supplementary file 4).

Linking spatial and developmental aspects of cortical organization

Given that adult cortical organization is a product of development, we next asked whether eigenmaps of adult cortical gene expression (Figure 3a and b) are related to the patterning of gene expression between fetal stages and adulthood. To achieve this, we tested WGCNA module gene sets for enrichment of developmental marker genes from three independent postmortem studies (rightmost columns, Figure 3c) capturing genes with differential expression between (i) three developmental epochs between 8 PCWs and adulthood (BrainVar dataset from prefrontal cortex [Werling et al., 2020]);(ii) seven histologically defined zones of mid-fetal (21 PCW) cortex (Miller et al., 2014; ‘Materials and methods,’ 2Supplementary files 1 and 2); and (iii) 16 mid-fetal (17–18 PCW) cell types (Polioudakis et al., 2019; ‘Materials and methods,’ Supplementary file 2).

Comparison with the BrainVar dataset revealed that most module eigenmaps (13 of all 16 cortical modules) were enriched for genes with dynamic, developmentally coordinated expression levels between early fetal and late adult stages (Figure 3c, Supplementary file 4). This finding was reinforced by supplementary analyses modeling developmental trajectories of eigenmap gene set expression between 12 PCW and 40 y in the BrainSpan dataset (Li et al., 2018; ‘Materials and methods,’ Figure 3—figure supplement 1d), and further qualified by the observation that several WGCNA modules were also differentially enriched for markers of mid-fetal cortical layers and cell types (Miller et al., 2014; Polioudakis et al., 2019; Figure 3c, Supplementary file 4). As observed for multiscale spatial enrichments (Figure 3c and d), the developmental enrichments of modules were often closely coordinated with one another, and eigenmaps with similar patterns of regional expression could possess different signatures of developmental enrichment. For example, the M6 and M12 eigenmaps shared a similar spatial expression pattern in the adult cortex (peak expression in medial prefrontal, anterior insula, and medioventral temporal pole), but captured different aspects of human brain development that aligned with the cyto-laminar enrichments of M6 and M12 in adulthood. The M6 gene set, which was enriched for predominantly glial elements of layers 1 and 2 in adult cortex, was also enriched for markers of mid-fetal microglia (Polioudakis et al., 2019), the transient fetal layers that are known to be particularly rich in mid-fetal microglia (subpial granular, subplate, and ventricular zone [Monier et al., 2007]), and the mid-late fetal epoch when most microglial colonization of the cortex is thought to be achieved (Menassa and Gomez-Nicola, 2018; Figure 3c). In contrast, the M12 gene set, which was enriched for predominantly neuronal elements of layer 2 in adult cortex, also showed enrichment for marker genes of developing fetal excitatory neurons, the fetal cortical subplate, and windows of mid-late fetal development when developing neurons are known to be migrating into a maximally expanded subplate (Molnár et al., 2019).

The striking co-enrichment of WGCNA modules for features of both the fetal and adult cortex (Figure 3c) implied a patterned sharing of marker genes between cyto-laminar features of the adult and fetal cortex. To more directly test this idea and characterize potential biological themes reflected by these shared marker genes, we carried out pairwise enrichment analyses between all annotational gene sets from Figure 3c. These gene sets collectively draw from a diverse array of study designs encompassing bulk, laminar, and single-cell transcriptomics of the human cortex between 10 PCW and 60 y of life (‘Materials and methods’; Darmanis et al., 2015; Habib et al., 2017; He et al., 2017; Li et al., 2018; Maynard et al., 2021; Miller et al., 2014; Polioudakis et al., 2019; Ruzicka et al., 2021; Velmeshev et al., 2019; Werling et al., 2020; Zhang et al., 2016). Network visualization and clustering of the resulting adjacency matrix (Figure 3—figure supplement 1e) revealed an integrated annotational space defined by five coherent clusters (Figure 3e). A mature neuron cluster encompassed markers of postmitotic neurons and the compartments that house them in both fetal and adult cortex (red, Figure 3e, Supplementary file 2, example core genes: NRXN1, SYT1, CACNG8). This cluster also included genes with peak expression between late fetal and early postnatal life, and those localizing to the plasma membrane and synapse. A small neighboring fetal ganglionic eminence cluster (fetal GE, yellow, Figure 3e, Supplementary file 2, example core genes: NPAS3, DSX, DCLK2) contained marker sets for migrating inhibitory neurons from the medial and caudal ganglionic eminence in mid-fetal life. These two neuronal clusters – mature neuron and fetal GE – were most strongly connected to the M12 gene set (‘Materials and methods’), which highlights medial prefrontal, and temporal cortices possessing a high ratio of neuropil:neuronal cell bodies (Collins et al., 2010; Spocter et al., 2012). A mitotic annotational cluster (blue, Figure 3e, Supplementary file 2, example core genes: CCND2, MEIS2, PHLDA1) was most distant from these two neuronal clusters and included genes showing highest expression in early development as well as markers of cycling progenitor cells, radial glia, oligodendrocyte precursors, germinal zones of the fetal cortex, and the nucleus. This cluster was most strongly connected to the M15 gene set, which shows high expression over occipito-parietal cortices distinguished by a high cellular density and notably low expression in lateral prefrontal cortices, which possess low cellular density (Collins et al., 2016). The mature neuron and mitotic clusters were separated by two remaining annotational clusters for non-neuronal cell types and associated cortical layers. A myelin cluster (orange, Figure 3e, Supplementary file 2, example core genes: MOBP, CNP, ACER3) – which contained gene sets marking adult layer 6, oligodendrocytes, and organelles supporting the distinctive biochemistry and morphology of oligodendrocytes (the golgi apparatus, endoplasmic reticulum, and cytoskeleton) – was most connected to the M2 gene set highlighting heavily myelinated motor cortex (Nieuwenhuys and Broere, 2017). A non-neuronal cluster (yellow, Figure 3e, Supplementary file 2, example core genes: TGFBR2, GMFG, A2M) – which encompassed marker sets for microglia, astrocytes, endothelial cells, pericytes, and markers of superficial adult and fetal cortical layers that are relatively depleted of neurons – was most connected to the M6 gene set highlighting medial temporal and anterior cingulate cortices with notably high non-neuronal content (Collins et al., 2010).

These analyses show that the regional patterning of bulk gene expression captures the organization of the human cortex across multiple spatial scales and developmental stages such that (i) the summary expression maps of spatially co-expressed gene sets align with independent in vivo maps of macroscale structure and function from neuroimaging, while (ii) the spatially co-expressed gene sets defining these maps show congruent enrichments for specific adult cortical layers and cell types as well as developmental precursors of these features spanning back to mid-fetal life.

ASD risk genes follow two different spatial patterns of cortical expression, which capture distinct aspects of cortical organization and differentially predict cortical changes in ASD

The findings above establish that gene co-expression modules in the human cortex capture multiple levels of biological organization ranging from subcellular organelles to cell types, cortical layers, and macroscale patterns of brain structure and function. Given that genetic risks for atypical brain development presumably play out through such levels of biological organization, we hypothesized that disease-associated risk genes would be enriched within WGCNA module gene sets. Testing this hypothesis simultaneously offers a means of further validating our analytic framework, while also potentially advancing understanding of disease biology. To test for disease gene enrichment in WGCNA modules, we compiled lists of genes enriched for deleterious rare variants in ASD (Ruzzo et al., 2019; Satterstrom et al., 2020), schizophrenia (Scz; Singh et al., 2020), severe developmental disorders (DDD; Deciphering Developmental Disorders Study, 2017), and epilepsy (Heyne et al., 2018; Supplementary file 2). We considered rare (as opposed to common) genetic variants to focus on high effect-size genetic associations and avoid ongoing uncertainties regarding the mapping of common variants to genes (Tam et al., 2019). We observed that disease-associated gene sets were significantly enriched in several WGCNA modules (Figure 4a), with two modules showing enrichments for more than one disease: M15 (ASD, Scz, and DDD) and M12 (ASD and epilepsy). ASD was the only disorder to show a statistically significant enrichment of risk genes within both M12 and M15 (Figure 4a), providing an ideal setting to ask if and how this partitioning of ASD risk genes maps onto (i) multiscale brain organization in health and (ii) altered brain organization in ASD.

Figure 4 with 1 supplement see all

Download asset Open asset

Autism spectrum disorder (ASD) risk genes follow two different spatial patterns of cortical gene expression which differentially predict cortical changes in ASD.

(a) Enrichment of weighted gene co-expression network analysis (WGCNA module gene sets for risk genes associated with atypical brain development through enrichment of rare deleterious variants in studies of ASD, schizophrenia (Scz), severe developmental disorders (DDD, deciphering developmental disorders study), and epilepsy. Cell color intensity indicates pairwise statistical significance (p<0.05)), while outlined matrix cells survived correction for multiple comparisons across modules. (b) Summary of multiscale and developmental annotations from Figure 3c for M12 and M15: the only two WGCNA modules enriched for risk genes of more than one neurodevelopmental disorder. (c) M12 and M15 genes clustered by the strength of their membership to each module. Color encodes module membership. Shape encodes annotations for two GO Biological Process annotations that differ between the module gene sets: neuronal communication and regulation of gene expression. Text denotes specific ASD risk genes. (d) Contrasting GO enrichment of M12 and M15 for neuronal communication and regulation of gene expression GO Biological Process annotations. (e) M12 and M15 differ in the developmental trajectory of their average cortical expression between early fetal and mid-adult life (Li et al., 2018). (f) Regional differences in intrinsic expression of the M15 module (but not the M12 module) in adult cortex is correlated with regional variation in the severity of altered cortical gene expression (number of differentially expressed genes) in ASD (Haney et al., 2020). (g) Statistically significant regional alterations of cortical thickness (CT) in ASD compared to typically developing controls from in vivo neuroimaging (Di Martino et al., 2017) (top). Areas of cortical thickening show a statistically significant spatial overlap (Dice overlap = 0.68, p_spin<0.01) with regions of peak intrinsic expression for M15 in adult cortex (bottom). (h) M15 eigenmap expression (but not M12 eigenmap) shows significant spatial correlation with relative CT change in ASD.

The eigenmaps and gene set enrichments of M12 vs. M15 implicated two contrasting multiscale motifs in the biology of ASD (Figure 4b). ASD risk genes, including SCN2A, SYNGAP1, and SHANK2, resided within the M12 module (Figure 4c), which is most highly expressed within a distributed cortical system that is activated during social reasoning tasks (p_spin<0.01, Figure 3c and d, Figure 4b). The M12 gene set is also enriched for: genes with peak cortical expression in late-fetal and early postnatal life; marker genes for the fetal subplate and developing excitatory neurons; markers of layer 2 and mature neurons in adult cortex; and synaptic genes involved in neuronal communication (Figures 3c and d and 4b–e, Supplementary file 4). In contrast, ASD risk genes, including ADNP, KMT5B, and MED13L, resided within the M15 module (Figure 4c), which is most highly expressed in primary visual cortex and associated ventral temporal pathways for object recognition/interpretation (Kravitz et al., 2013) (p_spin<0.05, Figures 3c and d and 4b, Supplementary file 4). The M15 module is also enriched for genes showing peak cortical expression in early fetal development, marker genes for cycling progenitor cells in the fetal cortex; markers of layer 2, inhibitory neurons and oligodendrocyte precursors in the adult cortex (Figures 3c and d and 4b–e, Supplementary file 4). The alignment of ASD risk genes with M12 and M15 was reinforced when considering all 135 ASD risk genes: spatial co-expression analyses split these genes into two clear subsets with mean expression maps that most closely resembled M12 and M15 (Figure 4—figure supplement 1a, b). Thus, using only spatial patterns of cortical gene expression in adulthood, our analytic framework was able to recover the previous PPI and GO-based partitioning of ASD risk genes into synaptic vs. nuclear chromatin remodeling pathways (Parikshak et al., 2013; Satterstrom et al., 2020), and then place these pathways into a richer biological context based on the known multiscale associations of M12 and M15 (Figures 3c and 4a).

We next sought to address whether regional differences in M12 and M15 expression were related to regional cortical changes observed in ASD. To test this idea, we used two orthogonal indices of cortical change in ASD that capture different levels of biological analysis – the number of differentially expressed genes (DEGs) postmortem (Haney et al., 2020), and the magnitude of changes in cortical thickness (CT) as measured by in vivo sMRI (Di Martino et al., 2017). Regional DEG counts were derived from a recent postmortem study of 725 cortical samples from 11 cortical regions in 112 ASD cases and controls (Haney et al., 2020), and compared with mean M12 and M15 expression within matching areas of a multimodal MRI cortical parcellation (Glasser et al., 2016). The magnitude of regional transcriptomic disruption in ASD was statistically significantly positively correlated with region expression of the M15 module (r = 0.6, p_spin<0.05), but not the M12 module (r = −0.3, p_spin>0.05) (Figure 4f). This dissociation is notable because M15 (but not M12) is enriched for genes involved in the regulation of gene expression (Figure 4d). Thus the enrichment of regulatory ASD risk genes within M15, and the intrinsically high expression of M15 in occipital cortex may explain why the occipital cortex is a hotspot of altered gene expression in ASD.

To compare M12 and M15 expression with regional variation in cortical anatomy changes in ASD, we harnessed the multicenter ABIDE datasets containing brain sMRI scans from 751 participants with idiopathic ASD and 773 controls (Di Martino et al., 2017; Di Martino et al., 2013). We preprocessed all scans using well-validated tools for harmonized estimation of cortical thickness (CT) (Fischl, 2012) from multicenter data (‘Materials and methods’), and then modeled CT differences between ASD and control cohorts at 150,000 points (vertices) across the cortex (‘Materials and methods’). This procedure revealed two clusters of statistically significant CT change in ASD (‘Materials and methods,’ Figure 4g, upper panel) encompassing visual and parietal cortices (relative cortical thickening vs. controls) as well as superior frontal vertices (relative cortical thinning). The occipital cluster of cortical thickening in ASD showed a statistically significant spatial overlap with the cluster of peak M15 expression (Figure 4g, upper panel, ‘Materials and methods,’ Dice coefficient = 0.7, p_spin<0.01), and relative cortical thickness change correlated with the M15 eigenmap (Figure 4h). In contrast, M12 expression was not significantly aligned with CT change in ASD (Figure 4g and h). Testing these relationships in the opposite direction, that is, asking whether regions of peak M12 and M15 expression are enriched for directional CT change in ASD relative to other cortical regions, recovered the M15-specific association with regional cortical thickening in ASD (Figure 4—figure supplement 1c).

Taken together, the above findings reveal that an occipital hotspot of altered gene expression and cortical thickening in ASD overlaps with an occipital hotspot of high expression for a subset of ASD risk genes. These ASD risk genes are spatially co-expressed in a module enriched for several connected layers of biological organization (Figures 3c and 4b–d) spanning: nuclear pathways for chromatin modeling and regulation of gene expression; G2/M phase cycling progenitors and excitatory neurons in the mid-fetal cortex; oligodendrocytes and layer 2 cortical neurons in adult cortex; and occipital functional networks involved in visual processing. These multiscale aspects of cortical organization can now be prioritized as potential targets for a subset of genetic risk factors in ASD, and the logic of this analysis in ASD can now be generalized to any disease genes of interest.

Discussion

We build on the most anatomically comprehensive dataset of human cortex gene expression available to date (Hawrylycz et al., 2012), to generate, validate, characterize, apply, and share spatially dense measures of gene expression that capture the topographically continuous nature of the cortical mantle. By representing patterns of human cortical gene expression without the imposition of a priori boundaries (Burt et al., 2018; Hawrylycz et al., 2015), our library of DEMs allows anatomically unbiased analyses of local gene expression levels as well as the magnitudes and directions of local gene expression change. This core spatial property of DEMs unlocks several methodological and biological advances. First, the unparcellated nature of DEMs allows us to agnostically define cortical zones with extreme transcriptional profiles or unusually rapid transcriptional change, which we show to capture microstructural cortical properties and align with folding and functional specializations at the macroscale (Figure 2). By establishing that some of these cortical zones are evident at the time of cortical folding, we lend support to a ‘protomap’ (O’Leary, 1989; O’Leary et al., 2007; Rakic, 1988; Rakic et al., 2009)-like model where the placement of some cortical folds is setup by rapid tangential changes in cyto-laminar composition of the developing cortex (Ronan et al., 2014; Toro and Burnod, 2005; Van Essen, 2020). The DEMs are derived from fully folded adult donors, and therefore some of the measured genetic-folding alignment might also be induced by mechanical distortion of the tissue during folding (Heuer and Toro, 2019; Llinares-Benadero and Borrell, 2019). However, no data currently exist to conclusively assess the directionality of this gene-folding relationship.

We show that DEMs can recover sharp boundaries in gene expression despite being generated by interpolation algorithms that do not explicitly encode step changes in expression between cortical regions. This property of DEMs will help to target future studies of human cortical patterning (e.g., directing single-cell and spatial omics resources), and we illustrate this utility by applying DEMs to discover two new expression boundaries in the human cortex. Second, we use spatial correlations between DEMs to decompose the complex topography of cortical gene expression into a smaller set of cortex-wide transcriptional programs that capture distinct aspects of cortical biology – at multiple spatial scales and multiple developmental epochs (Figure 3). This effort provides an integrative model that links expression signatures of cell types and layers in prenatal life to the large-scale patterning of regional gene expression in the adult cortex, which can in turn, through DEMs, be compared to the full panoply of in vivo brain phenotypes provided by modern neuroimaging. Indeed, future work might find direct links between these module eigenvectors and similar low-frequency eigenvectors of cortical geometry have been used as basis functions to segment the cortex (Lefèvre et al., 2018) and explain complex functional activation patterns (Pang et al., 2023). Third, we find that some of these cortex-wide expression programs in adulthood are enriched for disease risk genes, which offers a new path to nominating candidate disease mechanisms across different levels of biological organization (Figure 4). For example, the M15 module defines a normative spatial pattern of cortical gene co-expression which not only captures a functionally enriched subset of ASD genes (Satterstrom et al., 2020), but also shows multiscale enrichments and regionally specific expression patterns that tie together several independently reported aspects of ASD neurobiology. Specifically, M15 newly integrates (i) the concentration of ASD risk genes and dysregulated gene expression in upper-layer excitatory neurons (Velmeshev et al., 2019), (ii) the accentuation of altered gene expression and thickness in occipital cortical regions, and (iii) the early emergence among children at heightened genetic risk for ASD of behaviorally relevant changes in cortical structure and function (Girault et al., 2022) within occipital systems important for the processing of visual information. Crucially, the strategy applied in our analysis of ASD risk genes can be generalized to risk genes for any brain disorder of interest to place known risk factors for disease into the rich context of multiscale cortical biology.

Finally, the collection of DEMs, annotational gene sets, and statistical tools used in this work is shared as a new resource to accelerate multiscale neuroscience by allowing flexible and spatially unbiased translation between genomic and neuroanatomical spaces. Of note, this resource can easily incorporate any future expansions of brain data in either neuroanatomical or genomic space. We anticipate that it will be particularly valuable to incorporate new data from the nascent, but rapidly expanding fields of high-throughput histology (Wagstyl et al., 2020), single-cell omics (Bakken et al., 2021), and large-scale imaging-genetics studies (Smith et al., 2021). Taken together, MAGICC enables a new integrative capacity in the way we study the brain, and hopefully serves to spark new connections between previously distant datasets, ideas, and researchers.

Share this article

Cite this article

Creating and benchmarking spatial dense gene expression maps in the human cortex.

Mapping transcriptional distinctiveness (TD) in the human cortex and its alignment with macroscale structure and function.

Visualisation of Transcriptional Distinctiveness (TD) in the human cortex, encoded by both color and elevation.

Cortex-wide gene co-expression patterns reflect multiple spatial scales and developmental epochs of brain organization.

Autism spectrum disorder (ASD) risk genes follow two different spatial patterns of cortical gene expression which differentially predict cortical changes in ASD.

Statistical tests used to compare spatial maps and gene sets derived from the Allen Human Brain Atlas with independent multiscale neuroscientific resources.

Author details

Konrad Wagstyl

Contribution

For correspondence

Competing interests

Sophie Adler

Contribution

Competing interests

Jakob Seidlitz

Contribution

Competing interests

Simon Vandekar

Contribution

Competing interests

Travis T Mallard

Contribution

Competing interests

Richard Dear

Contribution

Competing interests

Alex R DeCasien

Contribution

Competing interests

Theodore D Satterthwaite

Contribution

Competing interests

Siyuan Liu

Contribution

Competing interests

Petra E Vértes

Contribution

Competing interests

Russell T Shinohara

Contribution

Competing interests

Aaron Alexander-Bloch

Contribution

Competing interests

Daniel H Geschwind

Contribution

Competing interests

Armin Raznahan

Contribution

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism