Transcriptional Cartography Integrates Multiscale Biology of the Human Cortex

Konrad Wagstyl; Sophie Adler; Jakob Seidlitz; Simon Vandekar; Travis T. Mallard; Richard Dear; Alex R. DeCasien; Theodore D. Satterthwaite; Siyuan Liu; Petra E. Vértes; Russell T. Shinohara; Aaron Alexander-Bloch; Daniel H. Geschwind; Armin Raznahan

doi:10.7554/eLife.86933.2

eLife assessment

This study provides continuous maps of human brain gene expression and explores their relationship with a large variety of microscopic and macroscopic aspects of brain organisation. The authors provide convincing evidence for a relationship between gene expression maps with various aspects of the anatomy of adult brains, during development, and in the case of mental disorders. The data and methods introduced can be an important tool for neuroimaging research.

https://doi.org/10.7554/eLife.86933.2.sa2

Significance of findings

important: Findings that have theoretical or practical implications beyond a single subfield

landmark
fundamental
important
valuable
useful

Strength of evidence

convincing: Appropriate and validated methodology in line with current state-of-the-art

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

The cerebral cortex underlies many of our unique strengths and vulnerabilities - but efforts to understand human cortical organization are challenged by reliance on incompatible measurement methods at different spatial scales. Macroscale features such as cortical folding and functional activation are accessed through spatially dense neuroimaging maps, whereas microscale cellular and molecular features are typically measured with sparse postmortem sampling. Here, we integrate these distinct windows on brain organization by building upon existing postmortem data to impute, validate and analyze a library of spatially dense neuroimaging-like maps of human cortical gene expression. These maps allow spatially unbiased discovery of cortical zones with extreme transcriptional profiles or unusually rapid transcriptional change which index distinct microstructure and predict neuroimaging measures of cortical folding and functional activation. Modules of spatially coexpressed genes define a family of canonical expression maps that integrate diverse spatial scales and temporal epochs of human brain organization - ranging from protein-protein interactions to large-scale systems for cognitive processing. These module maps also parse neuropsychiatric risk genes into subsets which tag distinct cyto-laminar features and differentially predict the location of altered cortical anatomy and gene expression in patients. Taken together, the methods, resources and findings described here advance our understanding of human cortical organization and offer flexible bridges to connect scientific fields operating at different spatial scales of human brain research.

Introduction

The human cerebral cortex is an astoundingly complex structure that underpins many of our distinctive facilities and vulnerabilities(Geschwind and Rakic, 2013). Achieving a mechanistic understanding of cortical organization in health and disease requires integrating information across its many spatial scales: from macroscale cortical folds and functional networks(Glasser et al., 2016) to the gene expression programs that reflect microscale cellular and laminar features(Hawrylycz et al., 2012; Kelley et al., 2018). However, a hard obstacle to this goal is that our measures of the human cortex at macro- and microscales are fundamentally mismatched in their spatial sampling. Macroscale measures from in vivo neuroimaging provide spatially dense estimates of structure and function, but microscale measures of gene expression are gathered from spatial discontinuous postmortem samples that have so far only been linked to macroscale features using methodologically-imposed cortical parcellations(Hansen et al., 2021; Larivière et al., 2021; Seidlitz et al., 2020). Consequently, local transitions in human cortical gene expression remain uncharacterized and unintegrated with the spatially fine-grained topographies of human cortical structure and function that are revealed by in vivo neuroimaging(Gryglewski et al., 2018; Markello et al., 2021). Finding a way to bridge this gap would not only enrich both our micro- and macro-scale models of human cortical organization, but also provide an essential framework for translation across traditionally siloed scales of neuroscientific research.

Here, we use spatially sparse postmortem data from the Allen Human Brain Atlas [AHBA(Hawrylycz et al., 2012)] to generate spatially dense cortical expression maps (DEMs) for 20,781 genes in the adult brain, with accompanying DEM reproducibility scores to facilitate wider usage. These maps allow a fine-grained transcriptional cartography of the human cortex, which we integrate with diverse genomic, histological and neuroimaging resources to shed new light on several fundamental aspects of human cortical organization in health and disease. First, we show that DEMs can recover canonical gene expression boundaries from in situ hybridization (ISH) data, predict previously unknown expression boundaries and align with regional differences in cortical organization from several independent data modalities. Second, by focusing on the local transitions in gene expression which are captured by DEMs, we reveal a close spatial coordination between molecular and functional specializations of the cortex, and establish that the spatial orientation of cortical folding and function at macroscale is aligned with local tangential transitions in cortical gene expression. Third, by defining and annotating gene co-expression modules across the cortex at multiple scales we systematically link macroscale measures of cortical structure and function in vivo, to postmortem markers of cortical lamination, cellular composition and development from early fetal to late adult life. Finally, as a proof-of-principle, we use this novel framework to secure a newly-integrated multiscale understanding of atypical brain development in autism spectrum disorder (ASD).

The tools and results from this analysis of the human cortex - which we collectively call Multiscale Atlas of Gene expression for Integrative Cortical Cartography (MAGICC) - open up an empirical bridge that can now be used to connect cortical models (and scientists) that have so far operated at segregated spatial scales. To this end, we share: (i) all gene-level DEMs and derived transcriptional landscapes in neuroimaging-compatible files for easy integration with in vivo macroscale measures of human cortical structure and function; and (ii) all gene sets defining spatial subcomponents of cortical transcription for easy integration with any desired genomic annotation (https://github.com/kwagstyl/magicc).

Results

Creating and benchmarking spatially dense maps of human cortical gene expression

To create a dense transcriptomic atlas of the cortex, we used AHBA microarray measures of gene expression for 20,781 genes in each of 1304 cortical samples from six donor left cortical hemispheres (Methods, Table S1). We extracted a model of each donor’s cortical sheet by processing their brain MRI scan, and identified the surface location (henceforth “vertex”) of each postmortem cortical sample in this sheet (Methods, Fig 1a). For each gene, we then propagated measured expression values into neighboring vertices using nearest-neighbor interpolation followed by smoothing (Methods, Fig 1b,c). Expression values were scaled across vertices and these vertex-level expression maps were averaged across donors to yield a single dense expression map (DEM) for each gene - which provided estimates of expression at ∼ 30,000 vertices across the cortical sheet (e.g. DEM for PVALB upper panel Fig 1d). These fine-grained vertex-level expression measures also enabled us to estimate the orientation and magnitude of expression change for each gene at every vertex (e.g. dense expression change map for PVALB, lower panel Fig 1d)

Creating and Benchmarking Spatial Dense Gene Expression Maps in the Human Cortex.
a, Spatially discontinuous Allen Human Brain Atlas (AHBA) microarray samples (red points) were aligned with MRI-derived cortical surface mesh reconstructions. b, AHBA vertex expression values were propagated using nearest-neighbor interpolation and subsequently smoothed **(c)**. d, Subject-level maps were z-normalized and averaged to generate a single reference dense expression map (DEM) for each gene, as well as the associated expression gradient map (shown here for PVALB: top and bottom, respectively). e, DEMs can recover known expression boundaries in ISH data. Four canonical V1 area markers (Zeng et al., 2012 Cell) show a significantly sharp DEM expression gradient at the V1/V2 boundary (insert cortical map and **Fig S2a,b**), which is also evident in all four individual gene DEMs and DEM gradients (SYT6, PENK and **Fig S2c**). f, DEMs can discover previously unknown expression boundaries. Genes with high DEM gradients across the PeEc (parahippocampal) and TF (fusiform) gyri (inset cortical map) were validated in ISH data - showing sharp expression changes in both directions at this boundary (CHRNA3, NGB and **Fig Sd-f**).g, Illustrative comparisons of selected DEMs against regional variation in microscale measures of cellular composition: scatterplot showing the global correlation of regional cellular proportions from single nucleus RNAseq (snRNAseq) across 16 cells and 6 regions(Lake et al., 2016) with DEM values for corresponding cell-type marker genes (R=0.48, p_spin<0.001, excluding Ex3-V1 and In8-BA10 outlier samples). h, DEMs for markers of 6 neuronal subtypes (3 excitatory: FEZF2, RORB, THEMIS, 3 inhibitory: PVALB, SST, VIP) based on recently validated subtype marker genes(Bakken et al., 2021; Hodge et al., 2019)i, Illustrative comparison of layer IV marker DEMs with corresponding mesoscale cortical measure of layer IV thickness from a 20μm 3D histological atlas of cortical layers. j, Illustrative comparisons of selected DEMs with corresponding macroscale cortical measures from independent neuroimaging markers.

We assessed the reproducibility of DEMs by repeating the above process (Fig 1) after repeatedly splitting the donors into non-overlapping groups of varying size, and using learning curve analyses to estimate the DEM reproducibility achieved by our full set of 6 donors. For cortically expressed genes (Methods, Table S2), the average reproducibility of gene expression maps was r_gene=0.58 (correlation of expression values for a gene across vertices), and the average reproducibility of ranked gene expression at each vertex was r_vertex=0.63 (correlation of expression values at a vertex across genes) (Fig S1c-d). These estimates were both substantially lower for genes not reported to be cortically expressed in the independent Human Protein Atlas (r_gene=0.34, t=37.6, p<0.001 and r_vertex=0.39, t=273.6, p<0.001, respectively, Methods, Table S2). Genes without recorded cortical expression were 3-fold enriched (p=0) amongst the 9,647 genes with estimated DEM reproducibility values of r <0.5). Regional differences in the density of postmortem sampling in the AHBA did not influence DEM reproducibility or the magnitude of local expression change captured by DEMs (Methods, Fig S1h). Thus, remedying the current lack of any spatially dense gene expression maps in the human cortex, we provide DEMs (and accompanying dense expression change maps) for 20,781 genes, and establish that >11k of these DEMs show a spatial reproducibility score of r_gene>0.5 between sets of unrelated individuals. Gene-level DEM reproducibility scores allow future users to filter on this feature as desired, and we establish that key analytic outputs from DEMs (see below) show good reproducibility between unrelated individuals and can be recovered at different DEM reproducibility filters.

Given that DEMs were generated by interpolating expression values between sampled regions, we assessed if DEMs could recover sharp local microscale transitions in gene expression that could theoretically be obscured by interpolation. Of the very few such transitions that have been verified by ISH in humans, the best-established occurs between occipital areas V1 and V2(Zeng et al., 2012). All four genes known to show a sharp V1/V2 expression boundary across layers by ISH - SYT6, TLE4, PCP4, PENK - exhibited qualitatively and quantitatively sharp expression transitions at the V1/V2 boundary in their DEMs (Fig 1e, Fig S2a-d). Motivated by this validation, we next asked if DEMs could identify previously unknown expression boundary markers in the human cortex. To achieve this, we took advantage of extensive existing ISH data between parahippocampal (area PeEc) and fusiform gyri (area TF). We ranked genes by the magnitude of their expression gradient between these cortical regions in DEMs (Methods), and identified 4 genes with sharp expression transitions predicted by DEMs - NGB,HTR2A, (TF>PeEc) and NTS, CHRNA3 (PeEc>TF) - for which independent ISH data were available. Expression profiling in ISH slabs verified the existence of sharp expression transition for all four genes (Fig 1f, Fig S2e-g). As the V1/V2 and the PeEc/TF boundaries both involve transitions between classical laminar types in cortical regions with highly conserved anatomical patterning(von Economo and Koskinas, 1925), we also tested if DEMs could recover expression boundaries in more variable and uniformly laminated association cortex(Ronan and Fletcher, 2015). No such expression boundaries have been described in humans by ISH, but there are reports of sharp expression boundaries between frontal areas 44 and 45b for several genes in non-human primates: SCN1B, KCNS1, TRIM55(Chen et al., 2022). These genes also exhibited high DEM gradients at the boundary between human frontal areas 44 and 45 (Fig S2h-j). Taken together, these observations demonstrate the capacity of DEMs to resolve sharp expression transitions and indicate that DEMs can be used to help target prospective post mortem validation of new expression boundaries in humans.

To benchmark and illustrate the use of DEMs to capture cortical features across contrasting spatial scales, we drew on selected micro- and macro- and macroscale cortical measures that DEMs should align with based on known biological processes (Fig 1g-j, Methods). To assess if DEMs could recover microscale differences in cellular patterning across the cortical sheet, we considered the ground truth of neuronal cell-type proportions as measured by single nucleus RNAseq (snRNAseq) across 6 different cortical regions(Lake et al., 2016). We observed a strong spatial correlation (r=0.6, p_spin<0.001) between regional marker gene expression in DEMs and regional proportions of their corresponding neuronal subtypes from snRNAseq (Fig 1g, Methods). Fig 1h shows example marker gene DEMs for 6 canonical neuronal subtypes: 3 excitatory (FEZF2, RORB, THEMIS) and 3 inhibitory (PVAL, SST, VIP)(Bakken et al., 2021; Hodge et al., 2019). Next, to assess if DEMs could recover regional variation in the mesoscale feature of cortical layering, we tested and verified that regional variation in the average DEM for layer IV marker genes(He et al., 2017; Maynard et al., 2021; Zeng et al., 2012) was highly correlated with regional variation in layer IV thickness as determined from a 3D histological atlas of cortical layers(Wagstyl et al., 2020) (Fig 1i). Finally, we asked if DEMs could recover spatially-dense measures of regional variation across the cortical sheet as provided by neuroimaging data, and found that maps from diverse measurement modalities showed strong and statistically-significant spatial correlations with their corresponding DEM(s) relative to a null distribution based on random “spinning” of maps(Alexander-Bloch et al., 2018) (Fig 1j, Methods, all p_spin<0.01): (i) areas of cortex activated during motor fMRI tasks in humans(Glasser et al., 2016) vs. the average DEM for canonical cell markers of large pyramidal neurons (Betz cells) found in layer V of the motor cortex that are the outflow for motor movements(Bakken et al., 2021), (ii) an in vivo neuroimaging marker of cortical myelination (T1/T2 ratio(Glasser and Van Essen, 2011)) vs. the Myelin Basic Protein DEM, which marks myelin, and (iii) the degree of in vivo regional cortical thinning by MRI in Alzheimer disease patients who have at least one APOE E4 variant(Gutiérrez-Galve et al., 2009; LaMontagne et al., 2019) vs. the APOE DEM (thinning map generated from 119 APOE E4 patients and 633 controls structural MRI (sMRI) scans as detailed in Methods), testing the hypothesis that higher regional APOE expression will result in greater cortical atrophy in individuals with the APOE E4 risk allele. Collectively, the above tests of reproducibility (Fig S1) and convergent validity (Fig 1e-j) supported use of DEMs for downstream analyses.

Defining and surveying the human cortex as a continuous transcriptional terrain

As an initial summary view of transcriptional patterning in the human cortex, we first averaged all 20,781 DEMs to represent the cortex as a single continuous transcriptional terrain, where altitude encodes the transcriptional distinctiveness (TD) of each cortical point (vertex) relative to all others (TD = mean(abs(z_exp)), Figure 2a, Sup Movie 1). This terrain view revealed 6 statistically-significant TD peaks (Methods, Fig. 2a,b) which recover all major archetypal classes of the mammalian cortex as defined by classical studies of laminar and myelo-architecture, connectivity, and functional specialization(Mesulam, 1998) encompassing: primary visual (V1), somatosensory [Brodmann area (BA(Brodmann, 1909)) 2], and motor cortex (BA 4), as well limbic [temporal pole centered on dorsal temporal area G (TGd(von Economo and Koskinas, 1925)), ventral frontal centered in orbitofrontal cortex (OFC)] and heteromodal association cortex (BA 9-46d). Of note, our agnostic parcellation of all TD peak vertices by their ranked gene lists (Methods) perfectly cleaved BA2 and BA4 along the central sulcus - despite there being no representation of this macroanatomical landmark in DEMs. The TD map observed from the full DEMs library was highly stable between all disjoint triplets of donors (Methods, Fig S3a, median cross-vertex correlation in TD scores between triplets r=0.77) and across library subsets at all deciles of DEM reproducibility (Methods, Fig S3b, cross-vertex correlation in TD scores r>0.8 for the 3rd-10th deciles), but was not recapitulated in spun null datasets (Fig S3c).

Mapping transcriptional distinctiveness in the human cortex and its alignment with macroscale structure and function.
a, Regional transcriptomic distinctiveness (TD) can be quantified as the mean absolute z-score of dense expression map (DEM) values at each vertex (top), and visualized as a continuous cortical map (middle, TD encoded by color) or in a relief map of the flattened cortical sheet (bottom, TD encoded by color and elevation, **Sup Movie 1**). Black lines on the inflated view identify cuts for the flattening procedure. The cortical relief map is annotated to show the central sulcus (CS), and peaks of TD overlying dorsal sensory and motor cortices (Brodmann Areas, BA2, BA4), the primary visual cortex (V1), temporal pole (TGd), insula (Ins) and ventromedial prefrontal cortex (OFC). b, Thresholding the TD map through spatial permutation of DEMs (t_spin **Methods**) and clustering significant vertices by their expression profile defined six TD peaks in the adult human cortex (depicted as coloured regions on terrain and inflated cortical surfaces). c, Cortical vertices projected into a 3D coordinate system defined by the first 3 principal components (PCs) of gene expression, coloured by the continuous TD metric (left) and TD peaks (right). TD peaks are focal anchors of cortex-wide expression PCs d, TD peaks show statistically-significant functional specializations in a meta-analysis of in vivo functional MRI data. e, The average magnitude of local expression transitions across genes (color) and principal orientation of these transitions (white bars) varies across the cortex. f, Cortical folds in AHBA donors (top surface maps and middle flat-map) tend to be aligned with the principal orientation of TD change across cortical vertices (p<0.01, middle histogram, sulci running perpendicular to TD change), and the strength of this alignment varies between cortical regions. g, Putative cortical areas defined by a multimodal in vivo MRI parcellation of the human cortex(Glasser et al., 2016) (top surface maps and middle flat-map) also tend to be aligned with the principal direction of gene expression change across cortical vertices (p<0.01, middle histogram, sulci running perpendicular to long axis of area boundaries), and the strength of this alignment varies between cortical areas.

Integration with principal component analysis of DEMs across vertices (Methods, Fig S3d,e) showed that TD peaks constitute sharp poles of more recently-recognized cortical expression gradients(Burt et al., 2018) (Fig. 2c). The “area-like” nature of these TD peaks is reflected by the steep slopes of transcriptional change surrounding them (Figure 2a,e), and could be quantified as TD peaks being transcriptomically more distinctive than their physical distance from other cortical regions would predict (Fig. S3f,g). In contrast, transitions in gene expression are more gradual and lack such sharp transitions in the cortical regions between TD peaks (Fig 2a,c,e, Fig S3j). Thus, because DEMs provide spatially fine-grained estimates of cortical expression and expression change, they offer an objective framework for arbitrating between area-based and gradient-based views of cortical organization in a regionally-specific manner.

The TD peaks defined above exist as both discrete patches of cortex and the distinctive profile of gene expression which defines each peak, and this duality offers an initial bridge between macro- and microscale views of cortical organization. Specifically, we found that each TD peak overlapped with a functionally-specialized cortical region based on meta-analysis of in vivo functional neuroimaging data(Yarkoni et al., 2011) (Methods, Fig. 2d, Table S3), and featured a gene expression signature that was preferentially enriched for a distinct set of biological processes, cell type signatures and cellular compartments (Methods, Table S2). For example, the peaks overlapping area TGd and OFC were enriched for synapse-related terms, while BA2 and BA4 TD peaks were predominantly enriched for metabolic and mitochondrial terms. At a cellular level, V1 closely overlapped with DEMs for marker genes of the Ex3 neuronal subtype known to be localized to V1(Lake et al., 2016), while BA4 closely overlapped Betz cell markers(Bakken et al., 2021) (Fig S3h).

The expression profile of each TD peak was achieved through surrounding zones of rapid transcriptional change (Fig 2a,e, Fig S3i,j). We noted that these transition zones tended to overlap with cortical folds - suggesting an alignment between spatial orientations of gene expression and folding. To formally test this idea we defined the dominant orientation of gene expression change at each vertex (Methods, Fig 2e) and computed the angle between this and the orientation of folding (Methods). The observed distribution of these angles across vertices was significantly skewed relative to a null based on random alignment between angles (p_spin<0.01, Fig 2f, Methods) - indicating that there is indeed a tendency for cortical sulci and the direction of fastest transcriptional change to run perpendicular to each other (p_spin<0.01, Fig 2f). A similar alignment was seen when comparing gradients of transcriptional change with the spatial orientation of putative cortical areas defined by multimodal functional and structural in vivo neuroimaging(Glasser et al., 2016) (expression change running perpendicular to area long-axis, p_spin<0.01, Fig 2g, Methods). Visualizing these expression-folding and expression-areal alignments revealed greatest concordance over sensorimotor, medial occipital, cingulate, and posterior perisylvian cortices (with notable exceptions of transcription change running parallel to sulci and the long-axis of putative cortical areas in lateral temporoparietal and temporopolar regions). As a preliminary probe for causality, we examined the developmental ordering of regional folding and regional transcriptional identity. Mapping the expression of high-ranking TD genes in fetal cortical laser dissection microarray data(Miller et al., 2014) from 21 PCW (Post Conception Weeks) (Methods) showed that the localized transcriptional identity of V1 and TGd regions in adulthood is apparent during the fetal periods that folding topology begins to emerge(Chi et al., 1977; Xu et al., 2022) (Fig S3k). Thus, the unique capacity of DEMs to resolve local orientations of expression change reveals a close spatial alignment between regional transitions of cortical gene expression at microscale and regional transitions of cortical folding, structure and function at macroscale.

Cortical gene coexpression integrates diverse spatial scales of human brain organization

To complement the TD analyses above (Fig 2), we next used weighted gene co-expression network analysis (WGCNA(Langfelder and Horvath, 2008), Methods, Fig 3a) to achieve a more systematic integration of macro- and macroscale cortical features. Briefly, WGCNA constructs a constructs a connectivity matrix by quantifying pairwise co-expression between genes, raising the correlations to a power (here 6) to emphasize strong correlations while penalizing weaker ones, and creating a Topological Overlap Matrix (TOM) to capture both pairwise similarities expression and connectivity. Modules of highly interconnected genes are identified through hierarchical clustering. The resultant WGCNA modules enable topographic and genetic integration because they each exist as both (i) a single expression map (eigenmap) for spatial comparison with neuroimaging data (Fig 3a,b, Methods) and, (ii) a unique gene set for enrichment analysis against marker genes systematically capturing multiple scales of cortical organization, namely: cortical layers, cell types, cell compartments, protein-protein interactions (PPI) and GO terms (Methods, Table S2 and S4). Furthermore, whereas prior applications of WGCNA to AHBA data have revealed gene sets that covary in expression across many different compartments of the brain(Hartl et al., 2021; Hawrylycz et al., 2015; Kelley et al., 2018), using DEMs as input to WGCNA generates modules that are purely based on the fine-scale coordination of gene expression across the cortex. Using WGCNA, we identified 16 gene modules (M1-M16), which we then deeply annotated against independent measures of cortical organization at diverse spatial scales and developmental epochs (Fig 3c, Methods). Module eigenmaps were primarily driven by highly reproducible genes (Fig S4a) as were enrichments for annotational gene sets (median reproducibility of enriching genes=0.59, p<0.001 elevated vs. background).

Cortex-wide Gene Coexpression Patterns Reflect Multiple Spatial Scales and Developmental Epochs of Brain Organization.
a, Overview of Weighted Gene Co-expression Network Analysis (WGCNA) pipeline applied to the full DEM dataset. Starting top left: the pairwise DEM spatial correlation matrix is used to generate a topological overlap matrix between genes (middle top) which is then clustered. Of the 23 WGCNA-defined modules, 7 were significantly enriched for non-cortical genes and removed, leaving 16 modules. Each module is defined by a set of spatially co-expressed genes, for which the principal component of expression can be computed and mapped at each cortical point (eigenmap). M6 is shown as an example projected onto an inflated left hemisphere (M6 z-scored expression and M6 expression change), and the bulk transcriptional distinctiveness (TD) terrain view from Fig 2 (M6 expression). b, The extremes of WGCNA eigenmaps highlight different peaks in the cortical terrain: the main TD terrain colored by TD value (center, from Fig 2), surrounded by TD terrain projections of selected WGCNA eigenmaps. c, WGCNA modules (eigenmaps and gradient maps, rows) are enriched for multiscale aspects of cortical organization (columns). Cell color intensity indicates pairwise statistical significance (p<0.05), while black outlines show significance after correction for multiple comparisons across modules. Columns capture key levels of cortical organization at different spatial scales (arranged from macro-to microscale) and developmental epochs: spatial alignment between module eigenmaps and in vivo MRI maps of cortical folding orientation, cortical thickness and T1/T2 ratio, fMRI resting-state functional networks; enrichment for module gene sets for independent annotations (**Table S2**) marking: cortical layers(He et al., 2017; Maynard et al., 2021); cell types(Darmanis et al., 2015; Habib et al., 2017; Hodge et al., 2019; Lake et al., 2018, 2016; Li et al., 2018; Ruzicka et al., 2021; Velmeshev et al., 2019; Zhang et al., 2016); subcellular compartments(Binder et al., 2014); synapse-related genes(Koopmans et al., 2019); protein-protein interactions between gene products (Szklarczyk et al., 2019); temporal epochs of peak expression(Werling et al., 2020) [“fetal”: 8-24 21 post conception weeks (PCW) / “perinatal’’ 24 PCW-6 months / “postnatal” >6 months]; transient layers of the mid-fetal human cortex at 21 post conception weeks (PCW)(Miller et al., 2014)[subpial granular zone (SG), marginal zone (MZ), cortical plate (CP), subplate (SP), intermediate zone (IZ), subventricular zone (SZ) and ventricular zone (VZ)]; and fetal cell types at 17-18 PCW(Polioudakis et al., 2019). d, Independent validation of multiscale enrichments for selected modules M2 & M12. M2 significantly overlaps the Neurosynth topic associated with the terms motor, cortex and hand. Two high-ranking M2 genes, MOG & TF exhibit clear layer VI peaks on ISH and GO enrichment analysis myelin-related annotations. M12, overlapping the limbic network most closely overlapped the Neurosynth topic associated with social reasoning. Two high-ranking M22 genes GABRA2 and GRIN2B showed layer II ISH peaks and GO enrichment analysis revealed synaptic annotations. e, Network visualization of pairwise overlaps between annotational gene sets used in Fig 3c, including WGCNA module gene sets (inset expression eigenmaps).

Several WGCNA modules showed statistically significant alignments with structural and functional features of the adult cerebral cortex from in vivo imaging (Methods, Fig 3c(Glasser and Van Essen, 2011; Yeo et al., 2011)). For example, (i) the M6 eigenmap was significantly positively correlated with in vivo measures of cortical thickness from sMRI and enriched within a limbic functional connectivity network defined by resting state functional connectivity MRI, and (ii) the M8, M9 and M14 eigenmaps showed gradients of expression change that were significantly aligned with the orientation of cortical folding (especially around the central sulcus, medial prefrontal and temporo-parietal cortices, Fig S4b). At microscale, several WGCNA module gene sets showed statistically significant enrichments for genes marking specific cortical layers(He et al., 2017; Maynard et al., 2021) and cell types(Darmanis et al., 2015; Habib et al., 2017; Hodge et al., 2019; Lake et al., 2018, 2016; Li et al., 2018; Ruzicka et al., 2021; Velmeshev et al., 2019; Zhang et al., 2016) (Methods, Fig 3c, Table S4). These microscale enrichments were often congruent between cortical layers and cell classes annotations, and in keeping with the linked eigenmap (Fig 3c, Table S4). For example, M4 - which was uniquely co-enriched for markers of endothelial cells and middle cortical layers - showed peak expression over dorsal motor cortices which are known to show expanded middle layers(Bakken et al., 2021; Wagstyl et al., 2020) with rich vascularization(Pfeifer, 1940) relative to other cortical regions. Similarly, M6 - which was enriched for markers of astrocytes, microglia and excitatory neurons as well as layers 1/2 - showed peak expression over rostral frontal and temporal cortices which are known to possess relatively expanded supragranular layers(Wagstyl et al., 2020) that predominantly contain the apical dendrites of excitatory neurons and supporting glial cells(von Economo and Koskinas, 1925). We also observed that modules with similar eigenmaps (Fig S4c), (including overlaps of multiple modules with the same TD peak) could show contrasting gene set enrichments. For example M2 and M4 both showed peak expression of dorsal sensorimotor cortex (i.e. TD areas BA2 and BA4), but M2 captures a distinct architectonic signature of sensorimotor cortex from the mid-layer vascular signal of M4: expanded and heavily myelinated layer 6(Bakken et al., 2021; Palomero-Gallagher and Zilles, 2019; Wagstyl et al., 2020) (Fig 3c). The spatially co-expressed gene modules detected by WGCNA were not only congruently co-enriched for cortical layer and cell markers, but also for nanoscale features such as sub-cellular compartments(Binder et al., 2014) (Table S2 and S4) (often aligning with the cellular enrichments) and protein-protein interactions(Szklarczyk et al., 2019) (PPI) (Methods, Fig 3c, Table S4). This demonstrates the capacity of our resource to tease apart subtle subcomponents of neurobiology based on cortex-wide expression patterns.

To further assess the robustness of these multiscale relationships, we focused on two modules with contrasting multiscale signatures - M2 and M12 - and tested for reproducibility of our primary findings (Fig 3c) using orthogonal methods. Our primary analyses indicated that M2 has an expression eigenmap which overlaps with the canonical somato-motor network from resting-state functional neuroimaging(Yeo et al., 2011), and contains genes that are preferentially expressed in cortical layer 6 from layer-resolved transcriptomics(He et al., 2017; Maynard et al., 2021), and in oligodendrocytes from snRNAseq(Darmanis et al., 2015; Habib et al., 2017; Hodge et al., 2019; Lake et al., 2018, 2016; Li et al., 2018; Ruzicka et al., 2021; Velmeshev et al., 2019; Zhang et al., 2016) (Fig 3c). We were able to verify each of these observations through independent validations including: spatial overlap of M2 expression with meta analytic functional activations relating to motor tasks(Yarkoni et al., 2011); immunohistochemistry localization of high-ranking M2 genes to deep cortical layers(Zeng et al., 2012) (Methods); and significant enrichment of M2 genes for myelin-related GO terms (Fig 3d, Table S4). By contrast, our primary analyses indicated that M12 - which had peak expression over ventral frontal and temporal limbic cortices - was enriched for marker genes for layer 2, neurons and the synapse (Fig 3c). These multiscale enrichments were all supported by independent validation analyses, which showed that: the M12 eigenmaps is enriched in a limbic network that is activated during social reasoning(Yarkoni et al., 2011); high-ranking M12 marker genes show elevated expression in upper cortical layers by immunohistochemistry(Zeng et al., 2012) (Methods); and, there is a statistically-significant over representation of synapse compartment GO terms in the M12 gene set (Fig 3d, Table S4).

Linking spatial and developmental aspects of cortical organization

Given that adult cortical organization is a product of development, we next asked if eigenmaps of adult cortical gene expression (Fig 3a,b) are related to the patterning of gene expression between fetal stages and adulthood. To achieve this, we tested WGCNA module gene sets for enrichment of developmental marker genes from 3 independent postmortem studies (rightmost columns, Fig 3c) capturing genes with differential expression between (i) 3 developmental epochs between 8 post-conception weeks (PCWs) and adulthood (BrainVar dataset from prefrontal cortex(Werling et al., 2020)) (ii) 7 histologically-defined zones of mid-fetal (21 PCW) cortex(Miller et al., 2014) (Methods, Table S1 and S2), and (iii) 16 mid-fetal (17-18 PCW) cell-types(Polioudakis et al., 2019) (Methods, Table S2).

Comparison with the BrainVar dataset revealed that most module eigenmaps (13 of all 16 cortical modules) were enriched for genes with dynamic, developmentally-coordinated expression levels between early fetal and late adult stages (Figure 3c, Table S4). This finding was reinforced by supplementary analyses modeling developmental trajectories of eigenmap gene set expression between 12 PCW and 40 years in the BrainSpan dataset(Li et al., 2018) (Methods, Fig S4d), and further qualified by the observation that several WGCNA modules were also differentially enriched for markers of mid-fetal cortical layers and cell-types(Miller et al., 2014; Polioudakis et al., 2019) (Figure 3c, Table S4). As observed for multiscale spatial enrichments (Fig 3c,d); the developmental enrichments of modules were often closely coordinated with one another, and eigenmaps with similar patterns of regional expression could possess different signatures of developmental enrichment. For example, the M6 and M12 eigenmaps shared a similar spatial expression pattern in the adult cortex (peak expression in medial prefrontal, anterior insula and medio-ventral temporal pole), but captured different aspects of human brain development that aligned with the cyto-laminar enrichments of M6 and M12 in adulthood. The M6 gene set - which was enriched for predominantly glial elements of layers 1 and 2 in adult cortex - was also enriched for markers of mid-fetal microglia(Polioudakis et al., 2019), the transient fetal layers that are known to be particularly rich in mid-fetal microglia (subpial granular, subplate, and ventricular zone(Monier et al., 2007)), and the mid-late fetal epoch when most microglial colonization of the cortex is thought to be achieved(Menassa and Gomez-Nicola, 2018) (Fig 3c). In contrast, the M12 gene set - which was enriched for predominantly neuronal elements of layer 2 in adult cortex - also showed enrichment for marker genes of developing fetal excitatory neurons, the fetal cortical subplate, and windows of mid-late fetal development when developing neurons are known to be migrating into a maximally expanded subplate(Molnár et al., 2019).

The striking co-enrichment of WGCNA modules for features of both the fetal and adult cortex (Fig 3c) implied a patterned sharing of marker genes between cyto-laminar features of the adult and fetal cortex. To more directly test this idea, and characterize potential biological themes reflected by these shared marker genes, we carried out pairwise enrichment analyses between all annotational gene sets from Fig 3c. These gene sets collectively draw from a diverse array of study designs encompassing bulk, laminar, and single cell transcriptomics of the human cortex between 10 PCW and 60 years of life (Methods(Darmanis et al., 2015; Habib et al., 2017; He et al., 2017; Li et al., 2018; Maynard et al., 2021; Miller et al., 2014; Polioudakis et al., 2019; Ruzicka et al., 2021; Velmeshev et al., 2019; Werling et al., 2020; Zhang et al., 2016)). Network visualization and clustering of the resulting adjacency matrix (Fig S4e) revealed an integrated annotational space defined by five coherent clusters (Fig 3e). A mature neuron cluster encompassed markers of post-mitotic neurons and the compartments that house them in both fetal and adult cortex (red, Fig 3e, Table S2, example core genes: NRXN1, SYT1, CACNG8). This cluster also included genes with peak expression between late fetal and early postnatal life, and those localizing to the plasma membrane and synapse. A small neighboring fetal ganglionic eminence cluster (Fetal GE, yellow, Fig 3e, Table S2, example core genes: NPAS3, DSX, DCLK2) contained marker sets for migrating inhibitory neurons from the medial and caudal ganglionic eminence in mid-fetal life. These two neuronal clusters - mature neuron and Fetal GE - were most strongly connected to the M12 gene set (Methods), which highlights medial prefrontal, and temporal cortices possessing a high ratio of neuropil:neuronal cell bodies(Collins et al., 2010; Spocter et al., 2012). A mitotic annotational cluster (blue, Fig 3e, Table S2, example core genes: CCND2, MEIS2, PHLDA1) was most distant from these two neuronal clusters, and included genes showing highest expression in early development as well as markers of cycling progenitor cells, radial glia, oligodendrocyte precursors, germinal zones of the fetal cortex, and the nucleus. This cluster was most strongly connected to the M15 gene set, which shows high expression over occipito-parietal cortices distinguished by a high cellular density and notably low expression in lateral prefrontal cortices, which possess low cellular density(Collins et al., 2016). The mature neuron and mitotic clusters were separated by two remaining annotational clusters for non-neuronal cell types and associated cortical layers. A myelin cluster (orange, Fig 3e, Table S2, example core genes: MOBP, CNP, ACER3) - which contained gene sets marking adult layer 6, oligodendrocytes, and organelles supporting the distinctive biochemistry and morphology of oligodendrocytes (the golgi apparatus, endoplasmic reticulum and cytoskeleton) - was most connected to the M2 gene set highlighting heavily myelinated motor cortex(Nieuwenhuys and Broere, 2017). A non-neuronal cluster (yellow, Fig 3e, Table S2, example core genes: TGFBR2, GMFG, A2M) - which encompassed marker sets for microglia, astrocytes, endothelial cells, pericytes, and markers of superficial adult and fetal cortical layers that are relatively depleted of neurons - was most connected to the M6 gene set highlighting medial temporal and anterior cingulate cortices with notably high non-neuronal content(Collins et al., 2010).

These analyses show that the regional patterning of bulk gene expression captures the organization of the human cortex across multiple spatial scales and developmental stages such that (i) the summary expression maps of spatially co-expressed gene sets align with independent in vivo maps of macroscale structure and function from neuroimaging, while (ii) the spatially co-expressed gene sets defining these maps show congruent enrichments for specific adult cortical layers and cell-types as well as developmental precursors of these features spanning back to mid-fetal life.

ASD risk genes follow two different spatial patterns of cortical expression, which capture distinct aspects of cortical organization and differentially predict cortical changes in ASD

The findings above establish that gene co-expression modules in the human cortex capture multiple levels of biological organization ranging from subcellular organelles, to cell types, cortical layers and macroscale patterns of brain structure and function. Given that genetic risks for atypical brain development presumably play out through such levels of biological organization, we hypothesized that disease associated risk genes would be enriched within WGCNA module gene sets. Testing this hypothesis simultaneously offers a means of further validating our analytic framework, while also potentially advancing understanding of disease biology. To test for disease gene enrichment in WGCNA modules, we compiled lists of genes enriched for deleterious rare variants in autism spectrum disorder(Ruzzo et al., 2019; Satterstrom et al., 2020) (ASD), schizophrenia(Singh et al., 2020) (Scz), severe developmental disorders (DDD)(Deciphering Developmental Disorders Study, 2017) and epilepsy(Heyne et al., 2018) (Table S2). We considered rare (as opposed to common) genetic variants to focus on high effect-size genetic associations and avoid ongoing uncertainties regarding the mapping of common variants to genes(Tam et al., 2019). We observed that disease associated gene sets were significantly enriched in several WGCNA modules (Fig 4a), with two modules showing enrichments for more than one disease: M15 (ASD, Scz and DDD) and M12 (ASD and Epilepsy).

ASD risk genes follow two different spatial patterns of cortical gene expression which differentially predict cortical changes in ASD.
a, Enrichment of WGCNA module gene sets for risk genes associated with atypical brain development through enrichment of rare deleterious variants in studies of Autism Spectrum Disorder (ASD), Schizophrenia (Scz), severe developmental disorders (DDD, Deciphering Developmental Disorders study) and Epilepsy. Cell color intensity indicates pairwise statistical significance (p<0.05), while outlined matrix cells survived correction for multiple comparisons across modules. b, Summary of multiscale and developmental annotations from Fig 3c for M12 and M15: the only two WGCNA modules enriched for risk genes of more than one neurodevelopmental disorder. c, M12 and M15 genes clustered by the strength of their membership to each module. Color encodes module membership. Shape encodes annotations for two GO Biological Process annotations that differ between the module gene sets: neuronal communication and regulation of gene expression. Text denotes specific ASD risk genes. d, contrasting GO enrichment of M12 and M15 for neuronal communication and regulation of gene expression GO Biological Process annotations. e, M12 and M15 differ in the developmental trajectory of their average cortical expression between early fetal and mid-adult life(Li et al., 2018). f, Regional differences in intrinsic expression of the M15 module (but not the M12 module) in adult cortex is correlated with regional variation in the severity of altered cortical gene expression (number of differentially expressed genes) in ASD(Haney et al., 2020). g, Statistically-significant regional alterations of cortical thickness (CT) in ASD compared to typically developing controls from in vivo neuroimaging(Di Martino et al., 2017, 2013) (top). Areas of cortical thickening show a statistically-significant spatial overlap (Dice overlap = 0.68, p_spin<0.01) with regions of peak intrinsic expression for M15 in adult cortex (bottom). h, M15 eigenmap expression (but not M12 eigenmap) shows significant spatial correlation with relative cortical thickness change in ASD.

ASD was the only disorder to show a statistically-significant enrichment of risk genes within both M12 and M15 (Fig 4a) - providing an ideal setting to ask if and how this partitioning of ASD risk genes maps onto (i) multiscale brain organization in health, and (ii) altered brain organization in ASD. The eigenmaps and gene set enrichments of M12 vs. M15 implicated two contrasting multiscale motifs in the biology of ASD (Fig 4b). ASD risk genes including SCN2A, SYNGAP1, and SHANK2 resided within the M12 module (Fig 4c) which is most highly expressed within a distributed cortical system that is activated during social reasoning tasks (p_spin<0.01, Fig 3c,d, Fig 5b). The M12 gene set is also enriched for: genes with peak cortical expression in late-fetal and early postnatal life; marker genes for the fetal subplate and developing excitatory neurons; markers of layer 2 and mature neurons in adult cortex; and synaptic genes involved in neuronal communication (Fig 3c,d, Fig 4b,c,d,e, Table S4). In contrast, ASD risk genes including ADNP, KMT5B, and MED13L resided within the M15 module (Fig 4c), which is most highly expressed in primary visual cortex and associated ventral temporal pathways for object recognition/interpretation(Kravitz et al., 2013) (p_spin<0.05, Fig 3c,d, Fig 4b, Table S4). The M15 module is also enriched for: genes showing peak cortical expression in early fetal development, marker genes for cycling progenitor cells in the fetal cortex; markers of layer 2, inhibitory neurons and oligodendrocyte precursors in the adult cortex (Fig 3c,d, Fig 4b,c,d,e, Table S4). The alignment of ASD risk genes with M12 and M15 was reinforced when considering all 135 ASD risk genes: spatial co-expression analyses split these genes into two clear subsets with mean expression maps that most closely resembled M12 & M15 (Fig S5a,b). Thus - using only spatial patterns of cortical gene expression in adulthood, our analytic framework was able to recover the previous PPI and GO-based partitioning of ASD risk genes into synaptic vs. nuclear chromatin remodeling pathways(Parikshak et al., 2013; Satterstrom et al., 2020), and then place these pathways into a richer biological context based on the known multiscale associations of M12 and M15 (Figs 3c, 4a).

We next sought to address whether regional differences in M12 and M15 expression were related to regional cortical changes observed in ASD. To test this idea, we used two orthogonal indices of cortical change in ASD that capture different levels of biological analysis - the number of differentially expressed genes (DEGs) postmortem(Haney et al., 2020), and the magnitude of changes in cortical thickness (CT) as measured by in vivo sMRI(Di Martino et al., 2017). Regional DEG counts were derived from a recent postmortem study of 725 cortical samples from 11 cortical regions in 112 ASD cases and controls(Haney et al., 2020), and compared with mean M12 and M15 expression within matching areas of a multimodal MRI cortical parcellation(Glasser et al., 2016). The magnitude of regional transcriptomic disruption in ASD was statistically-significantly positively correlated with region expression of the M15 module (r=0.6, p_spin<0.05), but not the M12 module (r=-0.3, p_spin>0.05) (Fig 4f). This dissociation is notable because M15 (but not M12) is enriched for genes involved in the regulation of gene expression (Fig 4d). Thus the enrichment of regulatory ASD risk genes within M15, and the intrinsically high expression of M15 in occipital cortex may explain why the occipital cortex is a hotspot of altered gene expression in ASD.

To compare M12 and M15 expression with regional variation in cortical anatomy changes in ASD, we harnessed the multicenter ABIDE datasets containing brain sMRI scans from 751 participants with idiopathic ASD and 773 controls(Di Martino et al., 2017, 2013). We preprocessed all scans using well-validated tools for harmonized estimation of cortical thickness (CT)(Fischl, 2012) from multicenter data (Methods), and then modeled CT differences between ASD and control cohorts at 150,000 points (vertices) across the cortex (Methods). This procedure revealed two clusters of statistically-significant CT change in ASD (Methods, Fig 4g, upper panel) encompassing visual and parietal cortices (relative cortical thickening vs. controls) as well as superior frontal vertices (relative cortical thinning). The occipital cluster of cortical thickening in ASD showed a statistically-significant spatial overlap with the cluster of peak M15 expression (Fig 4g, upper panel, Methods, Dice coefficient = 0.7, p_spin<0.01), and relative cortical thickness change correlated with the M15 eigenmap (Fig 4h). In contrast, M12 expression was not significantly aligned with CT change in ASD (Fig 4g,h). Testing these relationships in the opposite direction - i.e. asking if regions of peak M12 and M15 expression are enriched for directional CT change in ASD relative to other cortical regions - recovered the M15-specific association with regional cortical thickening in ASD (Fig S5c).

Taken together, the above findings reveal that an occipital hotspot of altered gene expression and cortical thickening in ASD overlaps with an occipital hotspot of high expression for a subset of ASD risk genes. These ASD risk genes are spatially co-expressed in a module enriched for several connected layers of biological organization (Fig 3c, 4b,c,d) spanning: nuclear pathways for chromatin modeling and regulation of gene expression; G2/M phase cycling progenitors and excitatory neurons in the mid-fetal cortex; oligodendrocytes and layer 2 cortical neurons in adult cortex; and occipital functional networks involved in visual processing. These multiscale aspects of cortical organization can now be prioritized as potential targets for a subset of genetic risk factors in ASD, and the logic of this analysis in ASD can now be generalized to any disease genes of interest.

Discussion

We build on the most anatomically comprehensive dataset of human cortex gene expression available to date(Hawrylycz et al., 2012), to generate, validate, characterize, apply and share spatially dense measures of gene expression that capture the topographically continuous nature of the cortical mantle. By representing patterns of human cortical gene expression without the imposition of a priori boundaries(Burt et al., 2018; Hawrylycz et al., 2015) our library of dense gene expression maps (DEMs) allows anatomically-unbiased analyses of local gene expression levels as well as the magnitudes and directions of local gene expression change. This core spatial property of DEMs unlocks several methodological and biological advances. First, the unparcellated nature of DEMs allows us to agnostically define cortical zones with extreme transcriptional profiles or unusually rapid transcriptional change - which we show to capture microstructural cortical properties and align with folding and functional specializations at the macroscale (Fig 2). By establishing that some of these cortical zones are evident at the time of cortical folding, we lend support to a “protomap”(O’Leary, 1989; O’Leary et al., 2007; Rakic, 1988; Rakic et al., 2009) like model where the placement of some cortical folds is set-up by rapid tangential changes in cyto-laminar composition of the developing cortex(Ronan et al., 2014; Toro and Burnod, 2005; Van Essen, 2020). The DEMs are derived from fully folded adult donors, and therefore some of the measured genetic-folding alignment might also be induced by mechanical distortion of the tissue during folding(Heuer and Toro, 2019; Llinares-Benadero and Borrell, 2019). However, no data currently exist to conclusively assess the directionality of this gene-folding relationship.

We show that DEMs can recover sharp boundaries in gene expression despite being generated by interpolation algorithms that do not explicitly encode step-changes in expression between cortical regions. This property of DEMs will help to target future studies of human cortical patterning (for example directing single cell and spatial omics resources), and we illustrate this utility by applying DEMs to discover two new expression boundaries in the human cortex. Second, we use spatial correlations between DEMs to decompose the complex topography of cortical gene expression into a smaller set of cortex-wide transcriptional programs that capture distinct aspects of cortical biology - at multiple spatial scales and multiple developmental epochs (Fig 3). This effort provides an integrative model that links expression signatures of cell-types and layers in prenatal life to the large-scale patterning of regional gene expression in the adult cortex, which can in turn - through DEMs - be compared to the full panoply of in vivo brain phenotypes provided by modern neuroimaging. Indeed, future work might find direct links between these module eigenvectors and similar low-frequency eigenvectors of cortical geometry have been used as basis functions to segment the cortex (Lefèvre et al., 2018) and explain complex functional activation patterns(Pang et al., 2023). Third, we find that some of these cortex-wide expression programs in adulthood are enriched for disease risk genes - which offers a new path to nominating candidate disease mechanisms across different levels of biological organization (Fig 4). For example the M15 module defines a normative spatial pattern of cortical gene co-expression which not only captures a functionally-enriched subset of ASD genes(Satterstrom et al., 2020), but also shows multiscale enrichments and regionally-specific expression patterns that tie together several independently-reported aspects of ASD neurobiology. Specifically, M15 newly integrates (i) the concentration of ASD risk genes and dysregulated gene expression in upper-layer excitatory neurons(Velmeshev et al., 2019), (ii) the accentuation of altered gene expression and thickness in occipital cortical regions, and (iii) the early emergence amongst children at heightened genetic risk for ASD of behaviorally-relevant changes in cortical structure and function(Girault et al., 2022) within occipital systems important for the processing of visual information. Crucially, the strategy applied in our analysis of ASD risk genes can be generalized to risk genes for any brain disorder of interest to place known risk factors for disease into the rich context of multiscale cortical biology.

Finally, the collection of DEMs, annotational gene sets and statistical tools used in this work is shared as a new resource to accelerate multiscale neuroscience by allowing flexible and spatially unbiased translation between genomic and neuroanatomical spaces. Of note, this resource can easily incorporate any future expansions of brain data in either neuroanatomical or genomic space. We anticipate that it will be particularly valuable to incorporate new data from the nascent, but rapidly expanding fields of high throughput histology(Wagstyl et al., 2020), single cell-omics(Bakken et al., 2021), and large-scale imaging-genetics studies(Smith et al., 2021). Taken together, MAGICC enables a new integrative capacity in the way we study the brain, and hopefully serves to spark new connections between previously distant datasets, ideas and researchers.

Materials and Methods

Materials and Methods overview

Creating spatially dense maps of human cortical gene expression (Fig 1a-d)
Benchmarking dense expression maps (DEMs)
1. Spin tests for comparing two spatial maps
2. Replicability and independence from cortical sampling density (Fig S1).
3. Alignment with reference measures of cortical organization (Fig 1 e-g)
Characterizing the topography of DEMs
1. Transcriptomic distinctiveness (TD) and principal component analysis (Fig 2a-c)
2. Relating adult TD peaks to fetal gene expression (Fig S3j)
3. Local gradient analysis (Fig 2e-g)
4. Weighted Gene Co-expression Network Analysis (WGCNA) (Fig 3a-c)
Multiscale annotation of WGCNA modules (Fig 3c,d)
1. Map-based annotations
2. Gene-set based annotations
Combining gene-set based annotations of the cortical sheet (Fig 3e, Fig S3d)
Disease enrichment and ASD-based analysis of WGCNA modules
1. Characterizing ASD gene enrichments in M12 and M15
2. Comparing M12 and M15 expression to regional changes of cortical gene expression in ASD (Fig 4f)
3. Comparing M12 and M15 expression to regional changes of cortical thickness in ASD (Fig 4g, h, Fig S5c)
Preprocessing and analysis of structural MRI data
1. AHBA donors
2. OASIS (Fig 1e)
3. ABIDE

1. Creating spatially dense maps of human cortical gene expression (Fig 1a-d)

Cortical surfaces were reconstructed for each AHBA donor MRI using FreeSurfer(Fischl, 2012), and coregistered between donors using surface matching of individuals’ folding morphology (MSMSulc) (Robinson et al., 2018). An average donor cortical mesh was also created for analyses of cortical morphology, by averaging the vertex coordinates of volumetrically aligned meshes for the 6 donors.

Probe-level data measures of gene expression for all samples in the AHBA adult brain microarray dataset were downloaded from (https://human.brain-map.org/static/download) - providing log2-transformed measures of gene expression for 58,692 probes in each of 3,702 brain tissue samples from six donors (Table S1). Within- and across-brain normalization of these probe level gene expression values was implemented as detailed by the Allen Institute for Brain Science White Paper (http://help.brain-map.org/download/attachments/2818165/WholeBrainMicroarray_WhitePaper.pdf). Probes were reannotated using the updated manifest from Arnautkevic et al(Arnatkeviciute et al., 2019), excluding genes lacking an Entrez, and probe-level expression values were averaged for each gene to yield a single gene*sample expression matrix for each donor. As only 2 donors had measurements from right hemispheres, samples were filtered by region to retain those originating from the cerebral cortex left hemisphere only. This decision was made given evidence for potential asymmetries in gene expression within the human cortex(de Kovel et al., 2018), and known differences in cortical shape between the hemispheres that complicate the reflection of sample locations from left to right cortical sheets(Jo et al., 2012). The above steps resulted in a final set of 6 donor-level gene*sample matrices from the left cerebral cortex for downstream analyses. These matrices collectively contained scaled expression values for 20,781 genes in each of 1304 cortical samples.

Native subject MRI coordinates were extracted for every cortical sample in each donor (Fig 1a). Nearest mid-surface cortical vertices were identified for each sample, excluding samples further than 20mm from a cortical coordinate. For cortical vertices with no directly sampled expression, expression values were interpolated from their nearest sampled neighbor vertex on the spherical surface (Moresi and Mather, 2019) (Fig 1b). Sampling density ρ in each subject was calculated as the number of samples per mm², from which average inter-sample distance, d, was estimated using the formula: , giving a mean intersample distance of 17.7mm ± 1.2mm. Surface expression maps were smoothed using the Connectome Workbench toolbox (Glasser et al., 2013) with a 20mm full-width at half maximum Gaussian kernel, selected to be consistent with this sampling density (Fig 1c). To align subjects’ expression, expression values were z-scored by the mean and standard deviation across vertices (given the known criticality of within-subject scaling of AHBA expression values (Markello et al., 2021)) and then averaged across the 6 subjects (Fig 1d) - yielding spatially dense estimates of expression at 29696 vertices across the left cerebral cortex per gene. Vertex-wise, rather than sample-level, estimation of mean and standard deviation mitigates potential biases introduced by intersubject variability in the regional distribution and density of cortical samples. For Y-linked genes, DEMs were calculated from male donors only. For each of the resulting 20,781 gene-level expression maps, the orientation and magnitude of gene expression change at each vertex (i.e. the gradient) was calculated for folded, inflated, spherical and flattened mesh representations of the cortical sheet using Connectome Workbench’s metric gradient command (Glasser et al., 2013).

2. Benchmarking dense expression maps (DEMs)

a. Spin tests for comparing two spatial maps

Cortical maps exhibit spatial autocorrelation that can inflate the False Positive Rate, for which a number of methods have been proposed(Alexander-Bloch et al., 2018; Burt et al., 2020; Vos de Wael et al., 2020). At higher degrees of spatial smoothness, this high False Positive Rate is most effectively mitigated using the spin test(Alexander-Bloch et al., 2018; Markello and Misic, 2021; Vos de Wael et al., 2020). In the following analyses when generating a test statistic comparing two spatial maps, to generate a null distribution, we computed 1000 independent spins of the cortical surface using https://netneurotools.readthedocs.io, and applied it to the first map whilst keeping the second map unchanged. The test statistic was then recomputed 1000 times to generate a null distribution for values one might observe by chance if the maps shared no common organizational features. This is referred to throughout as the “spin test” and the derived p-values as p_spin.

An additional null dataset was generated to test whether intrinsic geometry of the cortical mesh and its impact on interpolation for benchmarking analyses of DEMs and gradients (Fig S1d, Fig S2d, Fig S3c). In these analyses, the original samples were rotated on the spherical surface prior to subsequent interpolation, smoothing and gradient calculation. Due to computational constraints the full dataset was recreated only for 10 independent spins. These are referred to as the “spun+interpolated null”.

b. Replicability and independence from cortical sampling density (Fig S1)

We assessed the replicability of DEMs by applying the above steps for DEM generation to non-overlapping donor subsets and comparing DEMs between the resulting sub-atlases. We quantified DEM agreement between sub-atlases at both the gene-level (correlation in expression across vertices for each gene, Fig S1c) and the vertex-level (correlation in ranking of genes by their scaled expression values at each vertex, Fig S1d,e). These sub-atlas comparisons were done between all possible pairs of individuals, donor duos and donor triplets to give distributions and point estimates of reproducibility for atlases formed of 1, 2 and 3 donors. Learning curves were fitted to these data to estimate the projected gene-level and vertex-level DEM reproducibility of our full 6-subject sample atlas(Figueroa et al., 2012)(Fig S1c).

To assess the effect of data interpolation in DEM generation we compared gene-level and vertex-level reproducibility of DEMs against a “ground truth” estimate of these reproducibility metrics based on uninterpolated expression data. To achieve a strict comparison of gene expression values between different individuals at identical spatial locations we focused these analyses on the subset of AHBA samples where a sample from one subject was within 3 mm geodesic distance of another. This resulted in 1097 instances (spatial locations) with measures of raw gene expression of one donor, and predicted values from the second donor’s un-interpolated AHBA expression data and interpolated DEM. We computed gene-level and vertex-level reproducibility of expression using the paired donor data at each of these sample points - for both DEM and uninterpolated AHBA expression values. By comparing DEM reproducibility estimates with those for uninterpolated AHBA expression data, we were able to quantify the combined effect of interpolation and smoothing steps in DEM generation. We used gene-level reproducibility values from DEMs and uninterpolated AHBA expression data to compute a gene-level difference in reproducibility, and we then visualized the distribution of these difference values across genes (Fig S1a). We used gene-rank correlation to compare vertex-level reproducibility values between DEMs and uninterpolated AHBA expression data (Fig S1b).

Theoretically, regional gradients of expression change in DEMs could be biased by regional variations in the density of AHBA cortical sampling. To test for this, in each individual subject, we calculated the spatial relationship between the sampling density and mean gene gradient magnitude (Fig S1g). We additionally tested whether the regional variability of gene rank predictability in the atlas (shown in Fig S1f) was linked to the sampling density within the atlas.

c. Alignment with reference measures of cortical organization (Fig 1 e-g)

We first determined if our DEM library was able to differentiate between genes that are known to show cortical expression (CExp) and those without any prior evidence of cortical expression (NCExp) - motivated by the strong expectation that NCExp genes should lack a consistent spatial gradient in expression. For this test, we defined a set of 16573 CExp genes by concatenating the genes coding for proteins found in the “cortex” tissue class of the human protein atlas(Sjöstedt et al., 2020) genes identified as markers for cortical layers or cortical cells (see below,(Darmanis et al., 2015; Habib et al., 2017; He et al., 2017; Hodge et al., 2019; Lake et al., 2018, 2016; Li et al., 2018; Maynard et al., 2021; Ruzicka et al., 2021; Velmeshev et al., 2019; Zhang et al., 2016)). The remaining 4,208 genes in our DEM library were classified as NCExp. Fisher’s exact test was used to assess whether genes with lower gene reproducibility (r<0.5) were enriched for NCExp genes. We projected vertex-level reproducibility values for CExp and NCExp genes onto the cortical surface for visual comparison, and also computed the mean cross-vertex reproducibility for each of these maps (Fig S1f).

We next compiled data from independent studies for a range of macroscale and microscale cortical features that would be expected to align with specific DEM maps, and asked if the spatial patterns of cortical gene expression from DEMs showed the expected alignment with these independent data. These independent comparison studies were selected to span diverse measurement methods and data modalities representing a range of spatial scales.

We first sought to establish whether local changes in DEMs, i.e. the gradient maps of gene expression, could be used to validate existing areal border genes and identify novel candidates. Using a parcellation of the cortex based on multimodal structural and functional neuroimaging (Glasser et al., 2016), we identified the vertices along the boundary between a pair of regions (e.g. V1 & V2). The mean DEM gradient at these vertices was quantified for each gene, enabling us to rank genes by their exhibited border-like features at this cortical location. We then assessed the ranking of known lists of areal marker genes for a given border against a randomly sampled null distribution. To validate known areal marker genes derived from previous ISH studies, we took examples from the human visual cortex (Zeng et al., 2012), macaque visual cortex and macaque frontal regions 44 and 45 (Chen et al., 2022). To test the capacity of our resource to identify novel putative areal border genes, we calculated average gradients of all genes across the boundary between mesial temporal parahippocampal gyrus (Perirhinal Ectorhinal cortex, PeEc) and the fusiform gyrus (area TF) for which there is openly available ISH data (https://human.brain-map.org/ish/search). Limiting analyses to those genes for which ISH was available, the two genes exhibiting the largest gradient in either direction (four in total) were selected. The ISH was visually inspected for the presence area-like features in gene expression. For quantitative support, the cortex in each ISH image was manually segmented over the area of interest. The pixel-wise transverse distance along the cortical segmentation from left to right was calculated and subdivided into 200 equally spaced columns, spanning from pial to white matter surface. Staining intensity was averaged across each column. For each column, we computed the t-statistic between columns to the right and left, and identified the column with the largest t-statistic as the location of the putative interareal boundary.

We used the spun+interpolated null to test whether peaks in gene gradient could be driven primarily by local folding morphology impacting non-uniform interpolation. We quantified the average gradient for all genes along the V1-V2 border in the atlas, as well as for 10 iterations of the atlas where the samples were spun prior to interpolation. We computed the median gradient magnitude for the 20 top-ranked genes for each (Fig S2d).

We benchmarked DEMs against regional differences in cellular measures of cortical organization from single nucleus RNA-sequencing studies (snRNA-seq). Specifically, we correlated regional differences in the estimated proportion of 16 neuronal subtypes across 6 cortical regions(Lake et al., 2016) with regional DEM estimates for the mean expression of provided markers for these cell types(Lake et al., 2016). The test statistic was tested against a null distribution generated through spinning and resampling the cell marker DEM estimates (Table 1). Given the observed correspondence between regional cellular proportions and regional expression of cell marker sets, we used more recently-generated reference cell-markers from the Allen Institute for Brain Sciences(Bakken et al., 2021; Hodge et al., 2019; Tasic et al., 2016) to generate DEMs for 11 of 14 major cell subclasses in the mammalian cortex (6 neuronal types shown in Fig 1h, all 11 used for TD peak enrichment analysis Fig S3h). Three markers were excluded due to absence in the original dataset or low gene-predictability (r<0.2, Fig S1c).

We benchmarked DEMs against orthogonal spatially dense measures of cortical through the following comparisons: (i) Layer IV thickness values from the 3D BigBrain atlas of cortical layers(Wagstyl et al., 2020) vs. the average DEM for later IV marker genes(He et al., 2017; Maynard et al., 2021) (Table S2); (ii) motor-associated areas of the cortex from multimodal in vivo MRI(Glasser et al., 2016), vs. the average DEM for two marker genes (ASGR2, CSN1S1) of Betz cells, which are giant pyramidal neurons that output from layer V of the human motor cortex(Bakken et al., 2021); (iii) an in vivo neuroimaging map of the T1/T2 ratio measuring of intracortical myelination(Glasser and Van Essen, 2011) vs. the DEM for Myelin Basic Protein; and, (iv) regional cortical thinning from in vivo sMRI data in Alzheimer disease patients with the APOE E4 (OASIS-3 dataset(LaMontagne et al., 2019), see MRI Data Processing below) vs. the APOE4 DEM. For all four of these comparisons, alignment between maps was quantified and test for statistical significance using a strict spin-based spatial permutation method that controls for spatial autocorrelation in cortical data ((Alexander-Bloch et al., 2018)methods on statistical testing of pairwise cortical maps can be found in Table 1).

3. Characterizing the topography of DEMs

a. Transcriptomic distinctiveness (TD) and principal component analysis (Fig 2a-c)

Transcriptomic distinctiveness (TD) of each cortical vertex was calculated as the mean of the absolute DEM value for all genes (Fig 2a). Statistically significant peaks in TD, driven by convergence of extreme values across multiple genes, were identified as follows. The DEM for each gene was independently spun and TD was recalculated at each vertex over 1000 sets of gene-level DEM permutations (Alexander-Bloch et al., 2018). The maximum vertex TD value for each permuted TD map was recorded and the 95th percentile value across the 1000 permutations was taken as a threshold value. This threshold represents the maximum TD one would expect in the absence of concentrated colocalisations of extreme expression signatures, and areas above this threshold were annotated as TD peaks. To disambiguate TD peaks that are spatially coalescent but potentially driven by extreme values of heterogeneous gene sets within different regions, we concatenated all suprathreshold TD vertices into a single vertex*gene matrix and vertices in this matrix were clustered based on their expression signatures.

Intervertex correlation of gene rankings were calculated and the matrix was clustered using a gaussian mixture model. Bayesian information criterion was used to identify the optimum number of clusters (k=6) from a range of 2-18. Automated labels to localize TD peaks were generated based on their intersection with a reference multimodal neuroimaging parcellation of the human cortex(Glasser et al., 2016). Each TD was given the label of the multimodal parcel that showed greatest overlap (Fig 2b).

The TD map was assessed for reproducibility through three approaches. First the 6-subject cohort was subdivided into pairs of triplets, for which there are 10 unique combinations. For each combination, independent TD maps were computed for each triplet and compared between triplets (Fig S3a). Second, for the full 6-subject cohort genes were grouped into deciles according to the reproducibility of their spatial patterns in independent sub-cohorts (Fig S1c). For each decile of genes a TD map was computed and compared to the TD map from the remaining 90% of genes (Fig S3b). Third, to assess whether the covariance in spatial patterning across genes could be a result of mesh-associated structure introduced through interpolation and smoothing, TD maps were recomputed for the spun+interpolated null datasets and compared to the original TD map (Fig S3c).

The cortical regions defined by TD peaks were annotated according to their spatial overlap with the 24 cortical cell marker expression DEMs used in Fig 1g,h (Bakken et al., 2021; Hodge et al., 2019; Lake et al., 2016). To establish that cell maps were aligned with TD peaks, we first tested whether the vertex with the highest DEM value for each cell map overlapped with a TD peak and compared the number of overlapping cells to a null distribution created through spinning the TD peaks independently 1000 times. We then identified the cell types whose expression most closely aligned with each TD peak, comparing mean TD expression with a null distribution generated through spinning the peaks 1000 times (Fig S3h). TD peaks were also annotated for their functional activations using the meta-analytic Neurosynth database (see Map annotations below).

Gene sets characterizing TD peaks were identified as follows. At the vertex with the highest TD value within a peak region, the 95th centile TD value across genes was selected as a threshold. Genes with z-scored expression values above this threshold or below its inverse were selected, allowing TD peaks to have asymmetric length gene lists for high and low-expressed genes (Table S3). These TD gene lists were submitted to a Gene Ontology (GO) enrichment analysis pipeline (see Gene-set based annotations below).

To contextualize the newly-described TD peaks using previously-reported principal components (PCs) of human cortical gene expression, we computed the first 5 PC of gene expression in our full DEM library. The percentage of variance explained by each PC was calculated and compared to a null threshold derived through fitting PCs to a permuted null given by 1000 random spatial rotations of gene-level DEMs (Fig S3d). Taking the gene-level loadings from the first 3 PCs (Fig S3e), each vertex could be positioned in a 3D PC space based on its expression signature and also be colored based on its membership of a TD peak - thereby visualizing the position of TD peaks relative to the dominant spatial gradients of transcriptomic variation across the cortex (Fig 2c).

The assignment of TD regions as “peaks” implies a rapid emergence of the TD signature surrounding the peak boundaries, which we formally assessed by cortex-wide analysis of local tangential changes in gene expression (see “Local Gradient Analysis” below), and a spatially fine-grained comparisons of the physical vs. transcriptional distance between cortical regions. In the latter of these two analytic approaches, a rapid “border-like” onset of TD features would appear as (i) TD regions showing a greater transcriptional distance from other cortical regions than would be expected from their physical distance from other cortical regions, and (ii) this disparity emerging sharply surrounding the peak. To achieve this test, we first quantified the geodesic physical distance and Euclidean transcriptomic distance between pairs of vertices. For computational tractability, we limited this analysis to a subsample of vertices, choosing central vertices from ROIs in a parcellation with 500 approximately evenly sized parcels(Schaefer et al., 2018). We fit a linear generalized additive model to the data - predicting transcriptomic distance from geodesic distance - and calculated the residuals for each inter-vertex edge (Fig S3f). For each sampled vertex we averaged these residuals and mapped them back to the surface to visualize cortical areas that were transcriptomically more distinctive than their physical distance to other areas would predict (Fig S3g).

b. Relating adult TD peaks to fetal gene expression (Fig S3k)

We sought to establish whether the regional expression signatures characterizing TD peaks were present early in fetal development. This goal required measures of gene expression from multiple regions across the fetal cortical sheet, which are provided by the Allen Institute from Brain Sciences fetal laser micro-dissection microarray dataset(Miller et al., 2014). In each samples’ fetal brain, this dataset represents approximately 25 cortical brain regions tangentially, and radially 7 transient fetal layers/compartments radially: Subpial granular zone (SG), marginal zone (MZ), outer and inner cortical plate (grouped together as CP), subplate zone (SP), intermediate zone (IZ), outer and inner subventricular zone (grouped together as SZ), and ventricular zone (VZ).

Probe-level data measures of gene expression for the two PCW21 donors in the AHBA fetal LMD microarray dataset were downloaded from (https://www.brainspan.org/static/download.html) - providing log2-transformed measures of gene expression for 58,692 probes in each of 536 tissue samples across both donors (Table S1). Preprocessing and normalization of these probe level gene expression values was implemented as detailed by the Allen Institute for Brain Science White Paper (https://help.brain-map.org/download/attachments/3506181/Prenatal_LMD_Microarray.pdf). Probe-level expression values were averaged for each gene to yield a single gene*sample expression matrix for each donor, which was filtered to include only cortical samples. Gene expression values were scaled across samples within each donor, and scaled gene expression values were compiled for the set of 235 cortical regions that was common to both donor datasets. We averaged scaled regional gene expression values between donors per gene, and filtered for genes in the fetal LDM dataset that were also represented in the adult DEM dataset - yielding a single final 20,476*235 gene-by-sample matrix of expression values for the human cortex at 21 PCW. Each TD peak region was then paired with the closest matching cortical label within the fetal regions. This matrix was then used to test if each TD expression signature discovered in the adult DEM dataset (Fig 2, Table 3) was already present in similar cortical regions at 21 PCW.

The analysis of fetal regional patterning of TD peak gene sets was carried out as follows (Fig S3k). For a given TD peak, the significantly enriched genes for that peak (see above for definition of these gene sets) were identified in the fetal dataset and averaged at each fetal sample - capturing how highly expressed the TD signature was in each fetal sample. Next, we identified all samples in the fetal expression dataset that originated from regions underlying the TD peak, and defined these as the “fetal target region set” for that TD region (i.e. occipital samples in the fetal brain were the fetal target region set for analysis of gene enriched in the adult occipital TD region). We ranked all fetal samples by their mean expression of the TD marker set, and normalized these ranks to between 0 (TD markers most highly expressed) and 1 (TD markers most lowly expressed). Normalization was done to adjust for varying numbers of areas recorded per compartment. This ranking enabled us to compute the median rank of the fetal target region set, and test if this was significantly lower compared to a null distribution of ranks from random reassignment of the fetal target region set labels across all fetal samples. Within this analytic framework, a statistically significant test means that the adult TD signature is significantly localized to homologous cortical regions at 21 PCW fetal life (Fig S3k). We repeated this procedure for each adult TD.

c. Local gradient analysis (Fig 2e-g)

Spatially dense expression maps enabled the calculation of a vector describing the first spatial derivative - i.e. the local gradient - of each gene’s expression at each vertex. These vectors describe both the orientation and the magnitude of gene expression change.

Averaging these gene-level magnitude estimates across genes provided a vertex-level summary map of the magnitude of local expression changes in our full DEM library (Fig 2e). Regions with a significantly high average expression gradient were identified using a similar spatial permutation procedure as described for the identification of TD peaks. Briefly, the DEM gradient map for each gene was independently spun and an average expression gradient magnitude was recalculated at each vertex over 1000 sets of these spatial permutations(Alexander-Bloch et al., 2018). For each permutation we recorded the maximum vertex-level average expression gradient value, and the 95th percentile value of these maximums across the 1000 permutations was taken as a threshold value. Vertices with observed average expression gradient values above this threshold represented cortical regions of significantly rapid transcriptional change (Fig S3j).

The principal orientation of gene expression change at each vertex was calculated considering the vectors describing gene expression gradients - thereby providing a single summary of local gene expression gradients that considers both direction and magnitude. Principal component analysis (PCA) of gene gradient vectors was used to calculate the primary orientation of gene expression change at each vertex (Fig 2e) and the percentage of orientation variance accounted for by this principal component (Fig 2e, Fig S3i). Gene-level PC weights for each vertex were stored for subsequent analyses, including alignment with folds and functional ROIs (Fig 2f & g, see annotational analyses below).

The rich DEM expression gradient information described above was applied in three downstream analyses. First, we used these resources to detail the emergence of TD expression signatures within the cortical sheet - focusing on all vertices that had been identified to show a significantly elevated mean expression gradient. Specifically, we ranked genes at these vertices by their loadings onto the 1st PC of gene expression gradients at each vertex, and correlated these rankings with the rankings of genes by the expression at each TD peak vertex. This vertex-level correlation score - which quantifies how closely the gene expression gradient at a given vertex resembles that expression signature of a given TD peak - was regenerated for each of the 6 TD peaks (colors, Fig S3j). In each of these 6 maps, we were also able to plot the principal orientations of expression change at the vertex-level (red lines, Fig S3i) to ask if gradients of expression change for a given TD signature were spatially oriented towards the TD in question.

Second, we used the principal orientation of expression change at each vertex to assess whether local transcriptomic gradients were aligned with the orientation of cortical folding patterns. Orientation of cortical folds was calculated using sulcal depth and cortical curvature (Xia et al., 2018). Gradient vectors for sulcal depth describe the primary orientation of cortical folds on the walls of sulci, while gradient vectors of cortical curvature better describe the orientation at sulcal fundi and gyral crowns. These two gradient vector-fields were combined and smoothed with a 10mm FWHM gaussian kernel to propagate the vector field into plateaus e.g. at large gyral crowns where neither sulcal depth nor curvature exhibit reliable gradients. The folding orientation vectors were calculated with reference to a 2D flattened cortical representation for statistical comparison with the gradient vectors derived from gene expression maps (Fig 2f). At each vertex, the minimum angle was calculated between the folding orientation vector and gene expression gradient vector. Aligned vector maps exhibit positive skew, with angles tending towards zero. Therefore the skewness of the distribution of angles across all vertices was calculated, and to test for significance, folding and expression vector maps were spun relative to one another 1000 times, generating a null distribution of skewness values against which the test-statistic was compared (Table 1). A similar analysis was applied to test the association between module eigenmap gradient vectors and cortical folding (see WGCNA section below).

Statistical tests used to compare spatial maps and gene sets derived from the Allen Human Brain Atlas with independent multiscale neuroscientific resources.

Third we sought to quantify the alignment between cortical expression gradients and cortical areas as defined by multimodal imaging. Orientation of each MRI multimodal parcel ROI from Glasser et al(Glasser et al., 2016), was calculated taking the coordinates for all vertices within a given ROI. Principal Component Analysis of coordinates was used to identify the short and long axis of the ROI object. The vector describing the short axis was taken for comparison with mean of expression gradient vectors for vertices in the same ROI. For each ROI, the minimum angle was calculated and the skewness of the angles across all ROIs was calculated and compared to a null distribution created through spinning maps independently 1000 times, recalculating angles and their skewness (Fig 2g).

d. Weighted Gene Co-expression Network Analysis (WGCNA) (Fig 3a-c)

Genes were clustered into modules for further analysis using WGCNA(Langfelder and Horvath, 2008). Briefly, gene-gene cortical spatial correlations were calculated across all vertices to generate a single square 20,781*20,781 signed co-expression matrix. This co-expression matrix underwent “soft-thresholding”, raising the values to a soft power of 6, chosen as the smallest power where the resultant network satisfied the scale-free topology model fit of r²>0.8(Zhang and Horvath, 2005). Next, a similarity matrix was created through calculating pairwise topological overlap, assessing the extent to which genes share neighbors in the network(Yip and Horvath, 2007). The inverse of the topological overlap matrix was then clustered using average linkage hierarchical clustering, with a minimum cluster size of 30 genes. The eigengene for each module is the first principal component of gene expression across vertices, and provides a single measure of module expression at each vertex (hence, “eigenmap”). As per past implementation of WGCNA, pairs of modules with eigengene correlations above 0.9 were merged. These procedures defined a total of 23 gene co-expression modules ranging in size from 77-3725 genes, and a single set of unconnected genes (gray module 265 genes). We filtered the gray module from further analysis, as well as all 6 other modules that were also statistically significantly enriched for NCExp genes (Table S4, Fisher’s test, all p<0.0001) - leaving a total of 16 modules for downstream analysis (Table S4). To assess the extent to which eigenmaps captured highly reproducible features of cortical organization, for each decile of genes, DEMs were correlated with their module eignmaps recomputed from the remaining 90% of genes. (Fig S4a).

Each WGCNA module could be visualized as a cortical eigenmap, and eigenmap gradient - on the TD terrain, or inflated cortical (Fig 3a). The eigenmap gradient for each module provides a vertex-level measure for the magnitude of change in module expression at each vertex, as well as a vertex-level orientation of module expression change - calculated as described in Local Gradient Analysis above. These anatomical representations of each WGCNA module are amenable to spatial comparison with any other cortical map through spatial permutations(Alexander-Bloch et al., 2018) (see Annotational analyses below). Each WGCNA module is also defined as a gene set, which is amenable to standard gene-set based enrichment analysis (see Annotational analyses below). WGCNA modules can each also be represented as a ranked list of all genes - based on gene-level kME scores for each module, which are the cross-vertex correlation between a gene’s DEM map and a module’s eigenmap.

4. Multiscale annotation of WGCNA modules (Fig 3c,d)

We used multiple open neuroimaging and genomic datasets to systematically sample diverse levels of cortical organization and achieve a multiscale annotation of WGCNA modules. All gene sets used in enrichment analysis are detailed in Table S2.

a. Map-based annotations

MRI-derived maps of cortical function

Functional annotations of the cortex were carried out using two independent functional MRI (fMRI) resources - one based on resting state fMRI (rs-FMRI)(Yeo et al., 2011), and one using task-based fMRI(Rubin et al., 2017; Yarkoni et al., 2011). Resting state functional connectivity networks were taken from(Yeo et al., 2011), which divides the cortex into seven coherent functional networks through surface-based clustering of resting state fMRI into: visual, somatomotor, dorsal attention, ventral attention, frontoparietal control, limbic and default networks. We used spin-based spatial permutation testing to test for networks in which WGCNA eigenmap expression was significantly elevated (Fig 3c, see Table 1).

For task fMRI-driven functional annotation of the cortex, we drew on meta-analytic maps of cortical activation from Neurosynth(Rubin et al., 2017; Yarkoni et al., 2011). Briefly, over 11,000 functional neuroimaging studies were text-mined for papers containing specific terms and associated activation coordinates(Yarkoni et al., 2011). Secondary analyses generated activation maps for 30 topics spanning a range of cognitive domains (Rubin et al., 2017). Topic activation maps were intersected with cortical surface meshes and thresholded to identify vertices with an activation value above 0. Example topics included “motor, cortex, hand” and “social, reasoning, medial prefrontal cortex” (Fig 3d). Topics were excluded if intersected cortical maps indicated activation in fewer than 1% of cortical vertices. Topic maps were used to annotate TD peaks (Fig 2d) - identifying for each ROI, the 2 topics with the highest Dice overlap. Topic maps also served as an independent validation of selected WGCNA eigenmaps (Fig 2d, Table 1). Topic maps from Neurosynth were also used to provide an orthogonal validation of observed resting state network enrichments from Yeo et al (Fig 3c) for M2 and M12: mean eigenmap expression for module M2 and M12 was calculated for Neurosynth topic maps and assessed for statistical significance using spin-based permutations (Fig 3d, Table 1).

MRI-derived maps of cortical structure

Cortical thickness and T1/T2 “myelin” maps were taken from the Human Connectome Project average(Glasser et al., 2016). Spatial correlations were calculated across all vertices with each WGCNA module eigenmap, and assessed for statistical significance using spin-based permutations (Fig 3c, see Table 1).

Orientation of cortical folds

We used the orientation of expression change at each vertex to assess whether local eigenmap gradients were aligned with the orientation of cortical folding patterns, mirroring the analysis described above (Fig S4b, see Local Gradient Analysis).

Inter-eigenmap correlations

We tested the pairwise spatial correlation between pairs of module eigenmaps. Statistical significance was assessed using a null distribution of correlation matrices through independently spinning eigenmaps and recalculating correlations, and correcting for multiple comparisons (Fig S4c, see Table 1).

b. Gene-set based annotations

GO enrichment

Gene Ontology Enrichment Analysis (see Table 1 below) were carried out on gene sets of interest, testing for enrichment of Biological Processes and Cellular Compartment, using the GOATOOLS python package(Klopfenstein et al., 2018). Where multiple gene lists were assessed simultaneously (e.g. for TD peak gene lists or WGCNA gene sets), correction for multiple comparisons was carried out by dividing the p<0.05 threshold for statistical significance by the number of tests (i.e. for 16 module p<0.05/16). To facilitate summary descriptions of multiple significant GO terms, terms were hierarchically clustered based on semantic similarity (Resnik, 1995) and representative terms were selected based on biological specificity (i.e. depth within the gene ontology tree) and magnitude of the enrichment statistic (Fig 3d, Table S2).

Layer marker gene sets and in situ hybridisation validation

We sought to assess the extent to which convergent spatial patterns of gene expression indicate convergent laminar and cellular features. Marker genes for each cortical layer were defined as the union of layer-specific marker genes from two comprehensive transcriptomic studies of layer-dependent gene expression sampling prefrontal cortical regions(He et al., 2017; Maynard et al., 2021). He et al., took human cortical samples from the prefrontal cortex, corresponding to areas BA 9, 10 & 46. Samples were sectioned into cortical depths and underwent RNAseq to identify 4131 genes exhibiting layer-dependent expression. Maynard et al., took samples from the dorsolateral prefrontal cortex and carried out spatial snRNAseq to identify 3785 genes enriched in specific cortical layers. These independent resources were combined for laminar enrichment analyses (i.e. we took each layer’s marker genes to be the union of layer genes defined in Maynard et al and He at al). WGCNA module genes were tested for laminar enrichment using Fisher’s exact test, correcting for multiple comparisons (Fig 3c, see Table 1). Independent validation of laminar associations of candidate genes identified through the above marker lists were carried out using in situ hybridisation (ISH) data from the Allen Institute(Zeng et al., 2012). For selected modules, we identified the highest kME genes represented within the ISH dataset. For each of these genes, the highest quality sections were downloaded, and the cortical ribbon was manually segmented. Equivolumetric estimates of cortical depth were generated and profiles of depth-dependent staining intensity were generated(Huber et al., 2021). Accompanying approximate cytoarchitectonic layer thickness estimations were derived from BigBrain and used to describe the laminar location of ISH peaks(Wagstyl et al., 2020) (Fig 3d).

Adult cortical cell type marker gene sets

Cell marker gene sets were compiled from multiple snRNAseq datasets, sampling a wide variety of cortical areas covering occipital, temporal, frontal, cingulate and parietal lobes(Darmanis et al., 2015; Habib et al., 2017; Hodge et al., 2019; Lake et al., 2018, 2016; Li et al., 2018; Ruzicka et al., 2021; Velmeshev et al., 2019; Zhang et al., 2016). To integrate across differing subcategories, cell subtype marker lists were grouped into the following cell classes according to their designated names: excitatory neurons, inhibitory neurons, oligodendrocytes, astrocyte, oligodendrocyte precursor cells, microglia and endothelial cells. Marker lists for each of these cell classes represented the union of all subtypes assigned to the category. Cells not fitting into these categorisations were excluded. WGCNA module genes were tested for cell class marker enrichment using Fisher’s exact test, correcting for multiple comparisons (Fig 3c, see Table 1).

Fetal cortical cell type marker gene sets

Fetal cell marker gene lists were taken from Polioudakis et al(Polioudakis et al., 2019). WGCNA module genes were tested for cell class marker enrichment using Fisher’s exact test, correcting for multiple comparisons (Fig 3c, see Table 1).

Compartments and SynGO

Cellular compartment gene lists were taken from the COMPARTMENTS database(Binder et al., 2014), which identifies subcellular localisation of marker genes based on integrated information from the Human Protein Atlas, literature mining and GO annotations. Examples of cellular compartments include nucleus, plasma membrane and cytosol. An additional compartment list for neuronal synapse was generated by collapsing all genes in the manually curated SynGO dataset(Koopmans et al., 2019). WGCNA module genes were tested for cell compartment gene set enrichment using Fisher’s exact test, correcting for multiple comparisons (Fig 3c, see Table 1).

PPI network

Protein-protein interactions were derived from the STRING database(Szklarczyk et al., 2019). Physical direct and indirect protein-protein interactions were considered. We tested for enrichment of protein-protein interactions for proteins coded by genes within WGCNA modules. The median number of intramodular connections was compared to a null distribution of median modular connectivity derived from 10000 randomly resampled modules with the same number of genes. Gene resampling was restricted within deciles defined by the degree of protein-protein connectivity.

Developmental peak epoch

Peak developmental epochs for genes were extracted from(Werling et al., 2020). Briefly, bulk transcriptomic expression values were measured from DLPFC samples across development (6 PCW to 20 years), fitting developmental trajectories to each gene. Genes were categorized according to developmental epoch in which their expression peaked. For descriptive purposes, epochs were renamed as 1: “early fetal” [“fetal”, 8 postconception weeks (PCW) - 24 PCW], 2: late fetal transition (“perinatal”, 24 PCW - 6 months postnatal) and 3: “postnatal” (>6 months). Genes associated with WGCNA modules were tested for enrichment correcting for multiple comparisons across 16 modules.

Developmental trajectories

Gene-specific developmental trajectories were generated for the cortical samples from(Li et al., 2018). Briefly, in this study bulk transcriptomic expression values were measured from brain tissue samples taken from individuals aged between 5 PCW and 64 years old. In our analysis, samples were filtered for cortical ROIs and restricted to post 10 PCW due to lack of samples before this time-point. Ages were log transformed and Generalized Additive Models were fit to each gene to generate an estimated developmental trajectory. To compute trajectory correlations between genes, we first resampled expression trajectories at 20 equally spaced time points (in log time), and then z-normalized these values per gene (using the mean and standard deviation of each trajectory). We then calculated expression trajectory Pearson correlations between each pair of genes in this dataset, and used these to determine if the spatially co-expressed genes defining each WGCNA module also showed significant temporal co-expression. To achieve this test, we calculated the median temporal co-expression (correlation in expression trajectories) for each WGCNA module gene set, and compared this to null median co-expression values for 1000 randomly resampled gene sets matching module size. Mean trajectories of genes in each module were calculated to visualize the developmental expression pattern of each module (Fig S4d).

Fetal compartmental analysis

We used the 21 PCW fetal microarray data processed for analysis of TD peaks (see Relating adult TD peaks to fetal gene expression above, Fig S3k)(Miller et al., 2014), to generated marker gene sets for each of 7 transient fetal cortical compartments: subpial granular zone (SG), marginal zone (MZ), outer and inner cortical plate (grouped together as CP), subplate zone (SP), intermediate zone (IZ), outer and inner subventricular zone (grouped together as SZ), and ventricular zone (VZ). We collapsed 21 PCW cortical expression data into compartments by averaging expression values across cortical regions for each compartment because compartment differences are known to explain the bulk of variation in cortical expression within this dataset (24%(Miller et al., 2014)). The top 5% expressed genes for each of the 7 fetal compartments was taken as the compartment marker set and used for enrichment analysis of WGCNA modules with Fisher’s exact test, correcting for multiple comparisons (see Table 1, Fig 3c).

Reproducibility of genes driving enrichment analyses

We calculated gene-level spatial reproducibilities for the union of all genes contributing to significant neurobiological enrichments of WGCNA modules. This was compared to a null distribution, randomly resampling the same number of genes from all those considered in the enrichment analyses.

5. Combining gene-set based annotations of the cortical sheet (Fig 3e, Fig S3d)

Our observation that many WGCNA modules showed statistically-significant enrichment for diverse gene sets that could span different spatial scales (e.g. layers and organelles) or temporal epochs (e.g. fetal and adult cortical features) (Fig 3c) suggested a potential sharing of marker gene across these diverse sets. To test this idea, and characterize potential biological themes reflected by these shared marker genes, we carried out pairwise enrichment analyses between all annotational gene lists (Fig 3e). Gene lists used for enrichment analysis of WGCNA modules for cortical layers, adult cells, cellular compartments, fetal cells, developmental peak epochs and fetal compartments, were taken for further analysis. A genelist-genelist pairwise enrichment matrix was generated. p-values above 0.1 were set to 1, to limit their contribution and p-values were converted to -log10(p). To remove isolated gene lists, all lists were ranked by their degree (edges defined as p<0.05) and the bottom 10% were excluded from further analysis. The matrix, excluding WGCNA modules, underwent Louvain clustering(Blondel et al., 2008), grouping together gene lists with similar properties. Clusters were assigned descriptive names according to their salient common features (e.g. Non-neuronal, Mature neuron, Mitotic, Myelin, Fetal GE) (Fig S4e). For visualization, the full matrix underwent UMAP embedding(McInnes et al., 2018), a non-linear dimensionality reduction technique assigning 2D coordinates to each gene list (Fig 3e), coloring gene lists by their assigned cluster along with the top 20% of edges.

6. Disease enrichment and ASD-based analysis of WGCNA modules

The proposed analyses above link regionally patterned cortical gene expression with macroscale imaging maps of structure and function, and microscale gene sets exhibiting laminar, cellular, subcellular and developmental transcriptomic specificity. We sought to assess whether WGCNA module gene lists capturing shared spatial and temporal features were also enriched for genes implicated in atypical brain development. We included genes identified in exome sequencing studies in neurodevelopmental disorders: autism spectrum disorder(Ruzzo et al., 2019; Satterstrom et al., 2020) (ASD), schizophrenia(Singh et al., 2020) (SCZ), severe developmental disorders(Deciphering Developmental Disorders Study, 2017) (Deciphering Developmental Disorders study, DDD) and epilepsy(Heyne et al., 2018). WGCNA module gene sets were tested for enrichment of these genes using Fisher’s test and corrected for multiple comparisons (Table 1, Fig 4a). Two modules - M12 and M15 - showed enrichment for multiple disease sets, with the ASD gene set being unique for showing enrichment in both modules. We therefore focused downstream analysis on further characterizing the enrichment of ASD genes in M12 and M15, and testing if these enrichments could predict regional cortical changes in ASD.

a. Characterizing ASD gene enrichments in M12 and M15

kME analysis

To better characterize the spatially distinctive properties of genes within M12 and M15, we defined the union of genes in both modules and collated the WGCNA-defined kME scores for each gene to both M12 and M15. This provided a basis for plotting all genes by their relative membership to both modules to: quantify the proximity of each gene to each module; assess the discreteness of gene assignment to modules; and - for any provide a common space within which to project gene functions and associations with ASD (Fig 4c)

Enrichment of ASD-linked GO terms

Genes linked to two specific GO terms, “Neuronal communication” and “Gene expression regulation”, enriched amongst risk genes for Autism Spectrum Disorder in(Satterstrom et al., 2020), were separately tested for enrichment within M12 and M15 (Fig 4d), using a Fisher’s exact test.

Developmental trajectories of disease-linked modules

To characterize the distinctive temporal trajectories of M12 & M15 (see Fig 3c), we took gene-level trajectories (see Developmental trajectories above) and calculated the mean gene-expression trajectory of genes in each module (Fig 4e).

Independent characterisation of ASD risk genes

To assess the extent to which modules M12 & M15 captured the underlying axes of spatial patterning across all 135 ASD risk genes, we took DEMs for all 135 risk genes and independently clustered them. Pairwise co-expression was calculated for all risk gene DEMs and the resultant matrix was clustered using Gaussian mixture modeling into two clusters, C1 and C2 (Fig S5a). kME values were calculated for each risk gene with all WGCNA modules and averaged within each cluster. For each cluster, we then identified the WGCNA module with the highest mean kME (Fig S5b)

b. Comparing M12 and M15 expression to regional changes of cortical gene expression in ASD (Fig 4f)

We mapped regional transcriptomic disruption in ASD measured from multiple cortical regions using RNA-seq data(Haney et al., 2020). This study compared bulk transcriptomic expression in ASD and control samples across 11 cortical areas, quantifying the extent of transcriptomic disruption by identifying the number of significantly differentially expressed genes in each region. Cortical areas sampled in this study were mapped to their closest corresponding area in a multimodal MRI parcellation(Glasser et al., 2016). The mean expression of M12 & M15 eigenmaps was quantified in the same cortical areas (Fig 4f). The test statistic, correlating eigenmap expression with the number of differentially expressed genes, was tested against a null distribution generated through spinning and resampling the eigenmaps (see Table 1).

c. Comparing M12 and M15 expression to regional changes of cortical thickness in ASD (Fig 4g, h, Fig S5c)

To assess the extent to which WGCNA module eigenmaps pattern macroscale in vivo anatomical differences in ASD, we took the map of relative cortical thickness change in autism (see Preprocessing and analysis of structural MRI data below) and compared this to eigenmap expression patterns. M12 and M15 eigenmaps were thresholded, identifying the 5% of vertices with the highest expression. Areas of high significant thickness change were tested for overlap with areas of significant cortical thickness change using the Dice overlap compared to a null distribution of Dice scores generated through spinning the thresholded eigenmaps (see Table 1)

7. Preprocessing and analysis of structural MRI data

a. AHBA donors

Pial and white matter cortical T1 MRI scans of the 6 AHBA donor brains were reconstructed using Freesurfer (v5.3)(Romero-Garcia et al., 2018)(see Table S1). Briefly, scans undergo tissue segmentation, cortical white and pial surface extraction. A mid-thickness surface, between pial and white surfaces was also created. The locations of tissue samples taken for bulk transcriptomic profiling, provided in the coordinates of the subject’s MRI were mapped to the mid-thickness surface as outlined above (see Creating spatially dense maps of human cortical gene expression from the AHBA). Individual subject cortical surfaces were co-registered to the fs_LR32k template surface brain using MSMSulc(Robinson et al., 2018) as part of the ciftify pipeline(Dickie et al., 2019), which warps subject meshes by non-linear alignment their folding patterns to the MRI-derived template surface. A donor-specific template surface was created through averaging the coordinates of the aligned meshes and used for analysis of cortical folding patterns used in Alignment with reference measures of cortical organization. Pial, Inflated and flattened representations of the fs_LR32k surface were used for the visualization of cortical maps throughout.

b. OASIS (Fig 1e)

To estimate relative cortical thickness change in AD patients with the APOE E4 variant, we utilized the openly available OASIS database(LaMontagne et al., 2019). T1w MRI data collected using a Siemens Tim Trio 3T scanner and underwent cortical surface reconstruction using Freesurfer v5.3 as above. Reconstructions underwent manual quality control and correction, with poor quality data being removed. Output cortical thickness maps, smoothed at 20mm fwhm and aligned to the fsaverage template surface were downloaded via https://www.oasis-brains.org/, along with age, sex, APOE genotype and cognitive status. Subjects were included in the analysis if they had been diagnosed with AD and had at least one APOE E4 allele (n=119), or were a healthy control (n=633) (see Table S1). Per-vertex coefficients for disease-associated cortical thinning and significance were calculated, adjusting for age, sex and mean cortical thickness. We controlled for mean CT to identify local anatomical changes given our finding of generalized cortical thickening in AD as compared to controls in OASIS. The map of cortical thickness coefficients was then registered from fsaverage to fs_LR32k for comparison with the DEM of APOE (Fig 1e)(Robinson et al., 2018).

c. ABIDE

To estimate relative cortical thickness change in ASD, MRI cortical thickness maps, generated through Freesurfer processing of 3T T1 structural MRI scans were downloaded from ABIDE, along with age, sex, site information(Di Martino et al., 2017, 2013)(Table S1). Multiple sites and scanners were used to acquire these data, which is known to introduce systematic biases in morphological measurements like cortical thickness. To mitigate this, we used neuroCombat which estimates and removes unwanted scanner-effects while retaining biological effects on variables such as age, sex and diagnosis(Fortin et al., 2018). Subjects with poor quality freesurfer segmentations were excluded using a threshold Euler count of 100 (ref). Cortical thickness change in ASD relative to controls was calculated adjusting for age, sex and mean cortical thickness. Neighbor-connected vertices exhibiting significant cortical thickness change (p<0.05) were grouped into clusters. A null distribution of cluster sizes was generated using 1000 random permutations of the cohort, storing the maximum significant cluster size for each permutation. The 95th percentile cluster size was used as a threshold for removing test clusters that could have arisen by chance(Hagler et al., 2006). Output coefficient and cluster maps were registered from fsaverage to fs_LR32k and compared with the M12 and M15 eigenmaps as described above.

Supporting information

Supplementary figures

Supplemental tables

Supplemental Table 1

Supplemental Table 2

Supplemental Table 3

Supplemental Table 4

Acknowledgements

The authors would like to thank all the participants and their families for their generous involvement in this study.

Funding

National Institute of Mental Health Intramural Research Program NIH Annual Report Number, 1ZIAMH002949-04 (AR)

Wellcome Trust 215901/Z/19/Z (KSW). NIH grant R01MH112847 (RTS, TDS) NIH grant R01MH123550 (RTS)

NIH grant R01MH120482 (RTS, TDS) NIH grant R37MH125829 (TDS)

NIH grant R01MH123563 (SNV) NIH grant R01MH120482 (TDS) NIH grant R01EB022573 (TDS)

Gates Cambridge Trust (RD) NIH grant T32HG010464 (TTM)

MQ: Transforming Mental Health MQF17_24 (PEV) NIH grant T32MH019112 (JS)

NIH grant K08MH120564 (AAB,JS)

Rosetrees Trust project grant A2665 (SA)

Competing interests

R.T.S. receives consulting income from Octave Bioscience and compensation for reviewership duties from the American Medical Association. All other authors declare no competing interests.

Data availability

The cortical dense expression and gradient maps of 20,781 genes and ∼30k vertices that support the findings of this study are available at https://figshare.com/s/82c8f6ebda38af670cd1. Scripts to download, visualize and analyze MAGICC are available at https://github.com/kwagstyl/MAGICC.

References

1. Alexander-Bloch AF
2. Shou H
3. Liu S
4. Satterthwaite TD
5. Glahn DC
6. Shinohara RT
7. Vandekar SN
8. Raznahan A
2018On testing for spatial correspondence between maps of human brain structure and functionNeuroimage 178:540–551Google Scholar
1. Arnatkeviciute A
2. Fulcher BD
3. Fornito A
2019A practical guide to linking brain-wide gene expression and neuroimaging dataNeuroimage 189:353–367Google Scholar
1. Bakken TE
2. Jorstad NL
3. Hu Q
4. Lake BB
5. Tian W
6. Kalmbach BE
7. Crow M
8. Hodge RD
9. Krienen FM
10. Sorensen SA
11. Eggermont J
12. Yao Z
13. Aevermann BD
14. Aldridge AI
15. Bartlett A
16. Bertagnolli D
17. Casper T
18. Castanon RG
19. Crichton K
20. Daigle TL
21. Dalley R
22. Dee N
23. Dembrow N
24. Diep D
25. Ding S-L
26. Dong W
27. Fang R
28. Fischer S
29. Goldman M
30. Goldy J
31. Graybuck LT
32. Herb BR
33. Hou X
34. Kancherla J
35. Kroll M
36. Lathia K
37. van Lew B
38. Li YE
39. Liu CS
40. Liu H
41. Lucero JD
42. Mahurkar A
43. McMillen D
44. Miller JA
45. Moussa M
46. Nery JR
47. Nicovich PR
48. Niu S-Y
49. Orvis J
50. Osteen JK
51. Owen S
52. Palmer CR
53. Pham T
54. Plongthongkum N
55. Poirion O
56. Reed NM
57. Rimorin C
58. Rivkin A
59. Romanow WJ
60. Sedeño-Cortés AE
61. Siletti K
62. Somasundaram S
63. Sulc J
64. Tieu M
65. Torkelson A
66. Tung H
67. Wang X
68. Xie F
69. Yanny AM
70. Zhang R
71. Ament SA
72. Behrens MM
73. Bravo HC
74. Chun J
75. Dobin A
76. Gillis J
77. Hertzano R
78. Hof PR
79. Höllt T
80. Horwitz GD
81. Keene CD
82. Kharchenko PV
83. Ko AL
84. Lelieveldt BP
85. Luo C
86. Mukamel EA
87. Pinto-Duarte A
88. Preissl S
89. Regev A
90. Ren B
91. Scheuermann RH
92. Smith K
93. Spain WJ
94. White OR
95. Koch C
96. Hawrylycz M
97. Tasic B
98. Macosko EZ
99. McCarroll SA
100. Ting JT
101. Zeng H
102. Zhang K
103. Feng G
104. Ecker JR
105. Linnarsson S
106. Lein ES
2021Comparative cellular analysis of motor cortex in human, marmoset and mouseNature 598:111–119Google Scholar
1. Binder JX
2. Pletscher-Frankild S
3. Tsafou K
4. Stolte C
5. O’Donoghue SI
6. Schneider R
7. Jensen LJ
2014COMPARTMENTS: unification and visualization of protein subcellular localization evidenceDatabase 2014:bau012Google Scholar
1. Blondel VD
2. Guillaume J-L
3. Lambiotte R
4. Lefebvre E.
2008Fast unfolding of communities in large networksarXiv [physics.soc-ph] Google Scholar
1. Brodmann K.
1909Vergleichende Lokalisationslehre der Grosshirnrinde in ihren Prinzipien dargestellt auf Grund des ZellenbauesBarth Google Scholar
1. Burt JB
2. Demirtaş M
3. Eckner WJ
4. Navejar NM
5. Ji JL
6. Martin WJ
7. Bernacchia A
8. Anticevic A
9. Murray JD
2018Hierarchy of transcriptomic specialization across human cortex captured by structural neuroimaging topographyNat Neurosci 21:1251–1259Google Scholar
1. Burt JB
2. Helmer M
3. Shinn M
4. Anticevic A
5. Murray JD
2020Generative modeling of brain maps with spatial autocorrelationNeuroimage 220:117038Google Scholar
1. Chen A
2. Sun Y
3. Lei Y
4. Li C
5. Liao S
6. Liang Z
7. Lin F
8. Yuan N
9. Li M
10. Wang K
11. Yang M
12. Zhang S
13. Zhuang Z
14. Meng J
15. Song Q
16. Zhang Y
17. Xu Y
18. Cui L
19. Han L
20. Yang H
21. Sun X
22. Fei T
23. Chen B
24. Li W
25. Huangfu B
26. Ma K
27. Li Z
28. Lin Y
29. Liu Z
30. Wang H
31. Zhong Y
32. Zhang H
33. Yu Q
34. Wang Y
35. Zhu Z
36. Liu X
37. Peng J
38. Liu C
39. Chen W
40. An Y
41. Xia S
42. Lu Y
43. Wang M
44. Song X
45. Liu S
46. Wang Z
47. Gong C
48. Huang X
49. Yuan Y
50. Zhao Y
51. Luo Z
52. Tan X
53. Liu J
54. Zheng M
55. Li S
56. Huang Y
57. Hong Y
58. Huang Z
59. Li M
60. Zhang R
61. Jin M
62. Li Y
63. Zhang H
64. Sun S
65. Bai Y
66. Cheng M
67. Hu G
68. Liu S
69. Wang B
70. Xiang B
71. Li S
72. Li H
73. Chen M
74. Wang S
75. Zhang Q
76. Liu W
77. Liu X
78. Zhao Q
79. Lisby M
80. Wang J
81. Fang J
82. Lu Z
83. Lin Y
84. Xie Q
85. He J
86. Xu H
87. Huang W
88. Wei W
89. Yang H
90. Sun Y
91. Poo M
92. Wang J
93. Li Y
94. Shen Z
95. Liu L
96. Liu Z
97. Xu X
98. Li C
2022Global Spatial Transcriptome of Macaque Brain at Single-Cell ResolutionbioRxiv https://doi.org/10.1101/2022.03.23.485448 Google Scholar
1. Chi JG
2. Dooling EC
3. Gilles FH
1977Gyral development of the human brainAnn Neurol 1:86–93Google Scholar
1. Collins CE
2. Airey DC
3. Young NA
4. Leitch DB
5. Kaas JH
2010Neuron densities vary across and within cortical areas in primatesProceedings of the National Academy of Sciences 107:15927–15932Google Scholar
1. Collins CE
2. Turner EC
3. Sawyer EK
4. Reed JL
5. Young NA
6. Flaherty DK
7. Kaas JH
2016Cortical cell and neuron density estimates in one chimpanzee hemisphereProc Natl Acad Sci U S A 113:740–745Google Scholar
1. Darmanis S
2. Sloan SA
3. Zhang Y
4. Enge M
5. Caneda C
6. Shuer LM
7. Hayden Gephart MG
8. Barres BA
9. Quake SR
2015A survey of human brain transcriptome diversity at the single cell levelProc Natl Acad Sci U S A 112:7285–7290Google Scholar
1. Deciphering Developmental Disorders Study
2017Prevalence and architecture of de novo mutations in developmental disordersNature 542:433–438Google Scholar
1. de Kovel CGF
2. Lisgo SN
3. Fisher SE
4. Francks C.
2018Subtle left-right asymmetry of gene expression profiles in embryonic and foetal human brainsSci Rep 8:12606Google Scholar
1. Dickie EW
2. Anticevic A
3. Smith DE
4. Coalson TS
5. Manogaran M
6. Calarco N
7. Viviano JD
8. Glasser MF
9. Van Essen DC
10. Voineskos AN.
2019Ciftify: A framework for surface-based analysis of legacy MR acquisitionsNeuroimage 197:818–826Google Scholar
1. Di Martino A
2. O’Connor D
3. Chen B
4. Alaerts K
5. Anderson JS
6. Assaf M
7. Balsters JH
8. Baxter L
9. Beggiato A
10. Bernaerts S
11. Blanken LME
12. Bookheimer SY
13. Braden BB
14. Byrge L
15. Castellanos FX
16. Dapretto M
17. Delorme R
18. Fair DA
19. Fishman I
20. Fitzgerald J
21. Gallagher L
22. Keehn RJJ
23. Kennedy DP
24. Lainhart JE
25. Luna B
26. Mostofsky SH
27. Müller R-A
28. Nebel MB
29. Nigg JT
30. O’Hearn K
31. Solomon M
32. Toro R
33. Vaidya CJ
34. Wenderoth N
35. White T
36. Craddock RC
37. Lord C
38. Leventhal B
39. Milham MP.
2017Enhancing studies of the connectome in autism using the autism brain imaging data exchange IISci Data 4:170010Google Scholar
1. Di Martino A
2. Yan C-G
3. Li Q
4. Denio E
5. Castellanos FX
6. Alaerts K
7. Anderson JS
8. Assaf M
9. Bookheimer SY
10. Dapretto M
11. Deen B
12. Delmonte S
13. Dinstein I
14. Ertl-Wagner B
15. Fair DA
16. Gallagher L
17. Kennedy DP
18. Keown CL
19. Keysers C
20. Lainhart JE
21. Lord C
22. Luna B
23. Menon V
24. Minshew NJ
25. Monk CS
26. Mueller S
27. Müller R-A
28. Nebel MB
29. Nigg JT
30. O’Hearn K
31. Pelphrey KA
32. Peltier SJ
33. Rudie JD
34. Sunaert S
35. Thioux M
36. Tyszka JM
37. Uddin LQ
38. Verhoeven JS
39. Wenderoth N
40. Wiggins JL
41. Mostofsky SH
42. Milham MP.
2013The autism brain imaging data exchange: towards a large-scale evaluation of the intrinsic brain architecture in autismMol Psychiatry 19:659–667Google Scholar
1. Figueroa RL
2. Zeng-Treitler Q
3. Kandula S
4. Ngo LH
2012Predicting sample size required for classification performance. BMC Med Inform Decis Mak 12:8. Fischl B. 2012. FreeSurferNeuroimage 62:774–781Google Scholar
1. Fortin J-P
2. Cullen N
3. Sheline YI
4. Taylor WD
5. Aselcioglu I
6. Cook PA
7. Adams P
8. Cooper C
9. Fava M
10. McGrath PJ
11. McInnis M
12. Phillips ML
13. Trivedi MH
14. Weissman MM
15. Shinohara RT
2018Harmonization of cortical thickness measurements across scanners and sitesNeuroimage 167:104–120Google Scholar
1. Geschwind DH
2. Rakic P
2013Cortical evolution: judge the brain by its coverNeuron 80:633–647Google Scholar
1. Girault JB
2. Donovan K
3. Hawks Z
4. Talovic M
5. Forsen E
6. Elison JT
7. Shen MD
8. Swanson MR
9. Wolff JJ
10. Kim SH
11. Nishino T
12. Davis S
13. Snyder AZ
14. Botteron KN
15. Estes AM
16. Dager SR
17. Hazlett HC
18. Gerig G
19. McKinstry R
20. Pandey J
21. Schultz RT
22. St John T
23. Zwaigenbaum L
24. Todorov A
25. Truong Y
26. Styner M
27. Jr Pruett JR
28. Constantino JN
29. Piven J
30. Network IBIS
2022Infant Visual Brain Development and Inherited Genetic Liability in AutismAm J Psychiatry 179:573–585Google Scholar
1. Glasser MF
2. Coalson TS
3. Robinson EC
4. Hacker CD
5. Harwell J
6. Yacoub E
7. Ugurbil K
8. Andersson J
9. Beckmann CF
10. Jenkinson M
11. Smith SM
12. Van Essen DC.
2016A multi-modal parcellation of human cerebral cortexNature 536:171–178Google Scholar
1. Glasser MF
2. Sotiropoulos SN
3. Wilson JA
4. Coalson TS
5. Fischl B
6. Andersson JL
7. Xu J
8. Jbabdi S
9. Webster M
10. Polimeni JR
11. Van Essen DC
12. Jenkinson M
13. WU-Minn HCP Consortium
2013The minimal preprocessing pipelines for the Human Connectome ProjectNeuroimage 80:105–124Google Scholar
1. Glasser MF
2. Van Essen DC.
2011Mapping human cortical areas in vivo based on myelin content as revealed by T1- and T2-weighted MRIJ Neurosci 31:11597–11616Google Scholar
1. Gryglewski G
2. Seiger R
3. James GM
4. Godbersen GM
5. Komorowski A
6. Unterholzner J
7. Michenthaler P
8. Hahn A
9. Wadsak W
10. Mitterhauser M
11. Kasper S
12. Lanzenberger R
2018Spatial analysis and high resolution mapping of the human whole-brain transcriptome for integrative analysis in neuroimagingNeuroimage 176:259–267Google Scholar
1. Gutiérrez-Galve L
2. Lehmann M
3. Hobbs NZ
4. Clarkson MJ
5. Ridgway GR
6. Crutch S
7. Ourselin S
8. Schott JM
9. Fox NC
10. Barnes J
2009Patterns of cortical thickness according to APOE genotype in Alzheimer’s diseaseDement Geriatr Cogn Disord 28:476–485Google Scholar
1. Habib N
2. Avraham-Davidi I
3. Basu A
4. Burks T
5. Shekhar K
6. Hofree M
7. Choudhury SR
8. Aguet F
9. Gelfand E
10. Ardlie K
11. Weitz DA
12. Rozenblatt-Rosen O
13. Zhang F
14. Regev A
2017Massively parallel single-nucleus RNA-seq with DroNc-seqNat Methods 14:955–958Google Scholar
1. Jr Hagler DJ
2. Saygin AP
3. Sereno MI
2006Smoothing and cluster thresholding for cortical surface-based group analysis of fMRI dataNeuroimage 33:1093–1103Google Scholar
1. Haney JR
2. Wamsley B
3. Chen GT
4. Parhami S
5. Emani PS
6. Chang N
7. Hoftman GD
8. Alba D
9. Kale G
10. Ramaswami G
11. Hartl CL
12. Jin T
13. Wang D
14. Ou J
15. Wu YE
16. Parikshak NN
17. Swarup V
18. Belgard T
19. Gerstein M
20. Pasaniuc B
21. Gandal MJ
22. Geschwind DH.
2020Broad transcriptomic dysregulation across the cerebral cortex in ASDbioRxiv https://doi.org/10.1101/2020.12.17.423129 Google Scholar
1. Hansen JY
2. Markello RD
3. Vogel JW
4. Seidlitz J
5. Bzdok D
6. Misic B
2021Mapping gene transcription and neurocognition across human neocortexNature Human Behaviour https://doi.org/10.1038/s41562-021-01082-z Google Scholar
1. Hartl CL
2. Ramaswami G
3. Pembroke WG
4. Muller S
5. Pintacuda G
6. Saha A
7. Parsana P
8. Battle A
9. Lage K
10. Geschwind DH
2021Coexpression network architecture reveals the brain-wide and multiregional basis of disease susceptibilityNat Neurosci 24:1313–1323Google Scholar
1. Hawrylycz MJ
2. Lein ES
3. Guillozet-Bongaarts AL
4. Shen EH
5. Ng L
6. Miller JA
7. van de Lagemaat LN
8. Smith KA
9. Ebbert A
10. Riley ZL
11. Abajian C
12. Beckmann CF
13. Bernard A
14. Bertagnolli D
15. Boe AF
16. Cartagena PM
17. Chakravarty MM
18. Chapin M
19. Chong J
20. Dalley RA
21. Daly BD
22. Dang C
23. Datta S
24. Dee N
25. Dolbeare TA
26. Faber V
27. Feng D
28. Fowler DR
29. Goldy J
30. Gregor BW
31. Haradon Z
32. Haynor DR
33. Hohmann JG
34. Horvath S
35. Howard RE
36. Jeromin A
37. Jochim JM
38. Kinnunen M
39. Lau C
40. Lazarz ET
41. Lee C
42. Lemon TA
43. Li L
44. Li Y
45. Morris JA
46. Overly CC
47. Parker PD
48. Parry SE
49. Reding M
50. Royall JJ
51. Schulkin J
52. Sequeira PA
53. Slaughterbeck CR
54. Smith SC
55. Sodt AJ
56. Sunkin SM
57. Swanson BE
58. Vawter MP
59. Williams D
60. Wohnoutka P
61. Zielke HR
62. Geschwind DH
63. Hof PR
64. Smith SM
65. Koch C
66. Grant SG
67. Jones AR.
2012An anatomically comprehensive atlas of the adult human brain transcriptomeNature 489:391–399Google Scholar
1. Hawrylycz M
2. Miller JA
3. Menon V
4. Feng D
5. Dolbeare T
6. Guillozet-Bongaarts AL
7. Jegga AG
8. Aronow BJ
9. Lee C-K
10. Bernard A
11. Glasser MF
12. Dierker DL
13. Menche J
14. Szafer A
15. Collman F
16. Grange P
17. Berman KA
18. Mihalas S
19. Yao Z
20. Stewart L
21. Barabási A-L
22. Schulkin J
23. Phillips J
24. Ng L
25. Dang C
26. Haynor DR
27. Jones A
28. Van Essen DC
29. Koch C
30. Lein E
2015Canonical genetic signatures of the adult human brainNat Neurosci 18:1832–1844Google Scholar
1. Heuer K
2. Toro R
2019Role of mechanical morphogenesis in the development and evolution of the neocortexPhys Life Rev 31:233–239Google Scholar
1. Heyne HO
2. Singh T
3. Stamberger H
4. Jamra R
5. Caglayan H
6. Craiu D
7. Jonghe P
8. Guerrini R
9. Helbig KL
10. Koeleman BPC
11. Kosmicki JA
12. Linnankivi T
13. May P
14. Muhle H
15. Møller RS
16. Neubauer BA
17. Palotie A
18. Pendziwiat M
19. Striano P
20. Tang S
21. Wu S
22. EuroEPINOMICS RES Consortium
23. Poduri A
24. Weber YG
25. Weckhuysen S
26. Sisodiya SM
27. Daly MJ
28. Helbig I
29. Lal D
30. Lemke JR.
2018De novo variants in neurodevelopmental disorders with epilepsyNat Genet 50:1048–1053Google Scholar
1. He Z
2. Han D
3. Efimova O
4. Guijarro P
5. Yu Q
6. Oleksiak A
7. Jiang S
8. Anokhin K
9. Velichkovsky B
10. Grünewald S
11. Khaitovich P
2017Comprehensive transcriptome analysis of neocortical layers in humans, chimpanzees and macaquesNat Neurosci 20:886–895Google Scholar
1. Hodge RD
2. Bakken TE
3. Miller JA
4. Smith KA
5. Barkan ER
6. Graybuck LT
7. Close JL
8. Long B
9. Johansen N
10. Penn O
11. Yao Z
12. Eggermont J
13. Höllt T
14. Levi BP
15. Shehata SI
16. Aevermann B
17. Beller A
18. Bertagnolli D
19. Brouner K
20. Casper T
21. Cobbs C
22. Dalley R
23. Dee N
24. Ding S-L
25. Ellenbogen RG
26. Fong O
27. Garren E
28. Goldy J
29. Gwinn RP
30. Hirschstein D
31. Keene CD
32. Keshk M
33. Ko AL
34. Lathia K
35. Mahfouz A
36. Maltzer Z
37. McGraw M
38. Nguyen TN
39. Nyhus J
40. Ojemann JG
41. Oldre A
42. Parry S
43. Reynolds S
44. Rimorin C
45. Shapovalova NV
46. Somasundaram S
47. Szafer A
48. Thomsen ER
49. Tieu M
50. Quon G
51. Scheuermann RH
52. Yuste R
53. Sunkin SM
54. Lelieveldt B
55. Feng D
56. Ng L
57. Bernard A
58. Hawrylycz M
59. Phillips JW
60. Tasic B
61. Zeng H
62. Jones AR
63. Koch C
64. Lein ES
2019Conserved cell types with divergent features in human versus mouse cortexNature 573:61–68Google Scholar
1. Holm S
1979A Simple Sequentially Rejective Multiple Test ProcedureScand Stat Theory Appl 6:65–70Google Scholar
1. Huber L
2. (renzo)
3. Poser BA
4. Bandettini PA
5. Arora K
6. Wagstyl K
7. Cho S
8. Goense J
9. Nothnagel N
10. Morgan AT
11. van den Hurk J
12. Müller AK
13. Reynolds RC
14. Glen DR
15. Goebel R
16. Gulban OF.
2021LayNii: A software suite for layer-fMRINeuroimage 237:118091Google Scholar
1. Jo HJ
2. Saad ZS
3. Gotts SJ
4. Martin A
5. Cox RW
2012Quantifying agreement between anatomical and functional interhemispheric correspondences in the resting brainPLoS One 7:e48847Google Scholar
1. Kelley KW
2. Nakao-Inoue H
3. Molofsky AV
4. Oldham MC
2018Variation among intact tissue samples reveals the core transcriptional features of human CNS cell classesNat Neurosci 21:1171–1184Google Scholar
1. Klopfenstein DV
2. Zhang L
3. Pedersen BS
4. Ramírez F
5. Warwick Vesztrocy A
6. Naldi A
7. Mungall CJ
8. Yunes JM
9. Botvinnik O
10. Weigel M
11. Dampier W
12. Dessimoz C
13. Flick P
14. Tang H
2018GOATOOLS: A Python library for Gene Ontology analysesSci Rep 8:10872Google Scholar
1. Koopmans F
2. Van Nierop P
3. Andres-Alonso M
4. Byrnes A
5. Cijsouw T
6. Coba MP
7. Cornelisse LN
8. Farrell RJ
9. Goldschmidt HL
10. Howrigan DP
11. Hussain NK
12. Imig C
13. Jong APH
14. Jung H
15. Kohansalnodehi M
16. Kramarz B
17. Lipstein N
18. Lovering RC
19. MacGillavry H
20. Mariano V
21. Mi H
22. Ninov M
23. Osumi-Sutherland D
24. Pielot R
25. Smalla K-H
26. Tang H
27. Tashman K
28. Toonen RFG
29. Verpelli C
30. Reig-Viader R
31. Watanabe K
32. Van Weering J
33. Achsel T
34. Ashrafi G
35. Asi N
36. Brown TC
37. Camilli P
38. Feuermann M
39. Foulger RE
40. Gaudet P
41. Joglekar A
42. Kanellopoulos A
43. Malenka R
44. Nicoll RA
45. Pulido C
46. de Juan-Sanz J
47. Sheng M
48. Südhof TC
49. Tilgner HU
50. Bagni C
51. Bayés À
52. Biederer T
53. Brose N
54. Chua JJE
55. Dieterich DC
56. Gundelfinger ED
57. Hoogenraad C
58. Huganir RL
59. Jahn R
60. Kaeser PS
61. Kim E
62. Kreutz MR
63. McPherson PS
64. Neale BM
65. O’Connor V
66. Posthuma D
67. Ryan TA
68. Sala C
69. Feng G
70. Hyman SE
71. Thomas PD
72. Smit AB
73. Verhage M.
2019SynGO: An Evidence-Based, Expert-Curated Knowledge Base for the SynapseNeuron 103:217–234Google Scholar
1. Kravitz DJ
2. Saleem KS
3. Baker CI
4. Ungerleider LG
5. Mishkin M
2013The ventral visual pathway: an expanded neural framework for the processing of object qualityTrends in Cognitive Sciences https://doi.org/10.1016/j.tics.2012.10.011 Google Scholar
1. Lake BB
2. Ai R
3. Kaeser GE
4. Salathia NS
5. Yung YC
6. Liu R
7. Wildberg A
8. Gao D
9. Fung H-L
10. Chen S
11. Vijayaraghavan R
12. Wong J
13. Chen A
14. Sheng X
15. Kaper F
16. Shen R
17. Ronaghi M
18. Fan J-B
19. Wang W
20. Chun J
21. Zhang K
2016Neuronal subtypes and diversity revealed by single-nucleus RNA sequencing of the human brainScience 352:1586–1590Google Scholar
1. Lake BB
2. Chen S
3. Sos BC
4. Fan J
5. Kaeser GE
6. Yung YC
7. Duong TE
8. Gao D
9. Chun J
10. Kharchenko PV
11. Zhang K
2018Integrative single-cell analysis of transcriptional and epigenetic states in the human adult brainNat Biotechnol 36:70–80Google Scholar
1. LaMontagne PJ
2. Benzinger TLS
3. Morris JC
4. Keefe S
5. Hornbeck R
6. Xiong C
7. Grant E
8. Hassenstab J
9. Moulder K
10. Vlassenko AG
11. Raichle ME
12. Cruchaga C
13. Marcus D.
2019OASIS-3: Longitudinal neuroimaging, clinical, and cognitive dataset for normal aging and Alzheimer diseasebioRxiv https://doi.org/10.1101/2019.12.13.19014902 Google Scholar
1. Langfelder P
2. Horvath S
2008WGCNA: an R package for weighted correlation network analysisBMC Bioinformatics 9:559Google Scholar
1. Larivière S
2. Paquola C
3. Park B-Y
4. Royer J
5. Wang Y
6. Benkarim O
7. de Wael RV
8. Valk SL
9. Thomopoulos SI
10. Kirschner M
11. Lewis LB
12. Evans AC
13. Sisodiya SM
14. McDonald CR
15. Thompson PM
16. Bernhardt BC.
2021The ENIGMA Toolbox: multiscale neural contextualization of multisite neuroimaging datasetsNature Methods https://doi.org/10.1038/s41592-021-01186-4 Google Scholar
1. Lefèvre J
2. Pepe A
3. Muscato J
4. De Guio F
5. Girard N
6. Auzias G
7. Germanaud D.
2018SPANOL (SPectral ANalysis of Lobes): A Spectral Clustering Framework for Individual and Group Parcellation of Cortical Surfaces in LobesFront Neurosci 12:354Google Scholar
1. Li M
2. Santpere G
3. Imamura Kawasawa Y
4. Evgrafov OV
5. Gulden FO
6. Pochareddy S
7. Sunkin SM
8. Li Z
9. Shin Y
10. Zhu Y
11. Sousa AMM
12. Werling DM
13. Kitchen RR
14. Kang HJ
15. Pletikos M
16. Choi J
17. Muchnik S
18. Xu X
19. Wang D
20. Lorente-Galdos B
21. Liu S
22. Giusti-Rodríguez P
23. Won H
24. de Leeuw CA
25. Pardiñas AF
26. BrainSpan Consortium, PsychENCODE Consortium
27. Subgroup PsychENCODE Developmental
28. Hu M
29. Jin F
30. Li Y
31. Owen MJ
32. O’Donovan MC
33. Walters JTR
34. Posthuma D
35. Reimers MA
36. Levitt P
37. Weinberger DR
38. Hyde TM
39. Kleinman JE
40. Geschwind DH
41. Hawrylycz MJ
42. State MW
43. Sanders SJ
44. Sullivan PF
45. Gerstein MB
46. Lein ES
47. Knowles JA
48. Sestan N
2018Integrative functional genomic analysis of human brain development and neuropsychiatric risksScience 362https://doi.org/10.1126/science.aat7615 Google Scholar
1. Llinares-Benadero C
2. Borrell V
2019Deconstructing cortical folding: genetic, cellular and mechanical determinantsNat Rev Neurosci https://doi.org/10.1038/s41583-018-0112-2 Google Scholar
1. Markello RD
2. Arnatkeviciute A
3. Poline J-B
4. Fulcher BD
5. Fornito A
6. Misic B
2021Standardizing workflows in imaging transcriptomics with the abagen toolboxElife 10https://doi.org/10.7554/eLife.72129 Google Scholar
1. Markello RD
2. Misic B
2021Comparing spatial null models for brain mapsNeuroimage 236:118052Google Scholar
1. Maynard KR
2. Collado-Torres L
3. Weber LM
4. Uytingco C
5. Barry BK
6. Williams SR
7. Catallini JL
8. Tran MN
9. Besich Z
10. Tippani M
11. Chew J
12. Yin Y
13. Kleinman JE
14. Hyde TM
15. Rao N
16. Hicks SC
17. Martinowich K
18. Jaffe AE
2021Transcriptome-scale spatial gene expression in the human dorsolateral prefrontal cortexNat Neurosci 24:425–436Google Scholar
1. McInnes L
2. Healy J
3. Melville J.
2018UMAP: Uniform Manifold Approximation and Projection for Dimension ReductionarXiv [statML] Google Scholar
1. Menassa DA
2. Gomez-Nicola D
2018Microglial Dynamics During Human Brain DevelopmentFront Immunol 9:1014Google Scholar
1. Mesulam MM
1998From sensation to cognitionBrain 121:1013–1052Google Scholar
1. Miller JA
2. Ding S-L
3. Sunkin SM
4. Smith KA
5. Ng L
6. Szafer A
7. Ebbert A
8. Riley ZL
9. Royall JJ
10. Aiona K
11. Arnold JM
12. Bennet C
13. Bertagnolli D
14. Brouner K
15. Butler S
16. Caldejon S
17. Carey A
18. Cuhaciyan C
19. Dalley RA
20. Dee N
21. Dolbeare TA
22. Facer BAC
23. Feng D
24. Fliss TP
25. Gee G
26. Goldy J
27. Gourley L
28. Gregor BW
29. Gu G
30. Howard RE
31. Jochim JM
32. Kuan CL
33. Lau C
34. Lee C-K
35. Lee F
36. Lemon TA
37. Lesnar P
38. McMurray B
39. Mastan N
40. Mosqueda N
41. Naluai-Cecchini T
42. Ngo N-K
43. Nyhus J
44. Oldre A
45. Olson E
46. Parente J
47. Parker PD
48. Parry SE
49. Stevens A
50. Pletikos M
51. Reding M
52. Roll K
53. Sandman D
54. Sarreal M
55. Shapouri S
56. Shapovalova NV
57. Shen EH
58. Sjoquist N
59. Slaughterbeck CR
60. Smith M
61. Sodt AJ
62. Williams D
63. Zöllei L
64. Fischl B
65. Gerstein MB
66. Geschwind DH
67. Glass IA
68. Hawrylycz MJ
69. Hevner RF
70. Huang H
71. Jones AR
72. Knowles JA
73. Levitt P
74. Phillips JW
75. Sestan N
76. Wohnoutka P
77. Dang C
78. Bernard A
79. Hohmann JG
80. Lein ES
2014Transcriptional landscape of the prenatal human brainNature 508:199–206Google Scholar
1. Molnár Z
2. Clowry GJ
3. Šestan N
4. Alzu’bi A
5. Bakken T
6. Hevner RF
7. Hüppi PS
8. Kostović I
9. Rakic P
10. Anton ES
11. Edwards D
12. Garcez P
13. Hoerder-Suabedissen A
14. Kriegstein A
2019New insights into the development of the human cerebral cortexJ Anat 235:432–451Google Scholar
1. Monier A
2. Adle-Biassette H
3. Delezoide A-L
4. Evrard P
5. Gressens P
6. Verney C
2007Entry and distribution of microglial cells in human embryonic and fetal cerebral cortexJ Neuropathol Exp Neurol 66:372–382Google Scholar
1. Moresi L
2. Mather B
2019Stripy: A Python module for (constrained) triangulation in Cartesian coordinates and on a sphereJ Open Source Softw 4:1410Google Scholar
1. Nieuwenhuys R
2. Broere CAJ
2017A map of the human neocortex showing the estimated overall myelin content of the individual architectonic areas based on the studies of Adolf HopfBrain Struct Funct 222:465–480Google Scholar
1. O’Leary DD
1989Do cortical areas emerge from a protocortex?Trends Neurosci 12:400–406Google Scholar
1. O’Leary DDM
2. Chou S-J
3. Sahara S
2007Area patterning of the mammalian cortexNeuron 56:252–269Google Scholar
1. Palomero-Gallagher N
2. Zilles K
2019Cortical layers: Cyto-, myelo-, receptor-and synaptic architecture in human cortical areasNeuroimage 197:716–741Google Scholar
1. Pang JC
2. Aquino KM
3. Oldehinkel M
4. Robinson PA
5. Fulcher BD
6. Breakspear M
7. Fornito A
2023Geometric constraints on human brain functionNature 618:566–574Google Scholar
1. Parikshak NN
2. Luo R
3. Zhang A
4. Won H
5. Lowe JK
6. Chandran V
7. Horvath S
8. Geschwind DH
2013Integrative functional genomic analyses implicate specific molecular pathways and circuits in autismCell 155:1008–1021Google Scholar
1. Pfeifer RA
1940Die angioarchitektonische areale gliederung der grosshirnrinde: auf grund vollkommener gefässinjektionspräparate vom gehirn des macacus rhesusG. Thieme Google Scholar
1. Polioudakis D
2. de la Torre-Ubieta L
3. Langerman J
4. Elkins AG
5. Shi X
6. Stein JL
7. Vuong CK
8. Nichterwitz S
9. Gevorgian M
10. Opland CK
11. Lu D
12. Connell W
13. Ruzzo EK
14. Lowe JK
15. Hadzic T
16. Hinz FI
17. Sabri S
18. Lowry WE
19. Gerstein MB
20. Plath K
21. Geschwind DH.
2019A Single-Cell Transcriptomic Atlas of Human Neocortical Development during Mid-gestationNeuron 103:785–801Google Scholar
1. Rakic P
1988Specification of cerebral cortical areasScience 241:170–176Google Scholar
1. Rakic P
2. Ayoub AE
3. Breunig JJ
4. Dominguez MH
2009Decision by division: making cortical mapsTrends Neurosci 32:291–301Google Scholar
1. Resnik P.
1995Using Information Content to Evaluate Semantic Similarity in a TaxonomyarXiv [cmp-lg] Google Scholar
1. Robinson EC
2. Garcia K
3. Glasser MF
4. Chen Z
5. Coalson TS
6. Makropoulos A
7. Bozek J
8. Wright R
9. Schuh A
10. Webster M
11. Hutter J
12. Price A
13. Cordero Grande L
14. Hughes E
15. Tusor N
16. Bayly PV
17. Van Essen DC
18. Smith SM
19. Edwards AD
20. Hajnal J
21. Jenkinson M
22. Glocker B
23. Rueckert D.
2018Multimodal surface matching with higher-order smoothness constraintsNeuroimage 167:453–465Google Scholar
1. Romero-Garcia R
2. Whitaker KJ
3. Váša F
4. Seidlitz J
5. Shinn M
6. Fonagy P
7. Dolan RJ
8. Jones PB
9. Goodyer IM
10. Consortium NSPN
11. Bullmore ET
12. Vértes PE
2018Structural covariance networks are coupled to expression of genes enriched in supragranular layers of the human cortexNeuroimage 171:256–267Google Scholar
1. Ronan L
2. Fletcher PC
2015From genes to folds: a review of cortical gyrification theoryBrain Struct Funct 220:2475–2483Google Scholar
1. Ronan L
2. Voets N
3. Rua C
4. Alexander-Bloch A
5. Hough M
6. Mackay C
7. Crow TJ
8. James A
9. Giedd JN
10. Fletcher PC
2014Differential tangential expansion as a mechanism for cortical gyrificationCereb Cortex 24:2219–2228Google Scholar
1. Rubin TN
2. Koyejo O
3. Gorgolewski KJ
4. Jones MN
5. Poldrack RA
6. Yarkoni T
2017Decoding brain activity using a large-scale probabilistic functional-anatomical atlas of human cognitionPLoS Comput Biol 13:e1005649Google Scholar
1. Ruzicka B
2. Mohammadi S
3. Davila-Velderrain J
4. Subburaju S
5. Tso R
6. Hourihan M
7. Kellis M
2021Single-Cell Dissection of Schizophrenia Reveals Neurodevelopmental-Synaptic Link and Transcriptional Resilience Associated Cellular StateBiol Psychiatry 89:S106Google Scholar
1. Ruzzo EK
2. Pérez-Cano L
3. Jung J-Y
4. Wang L-K
5. Kashef-Haghighi D
6. Hartl C
7. Singh C
8. Xu J
9. Hoekstra JN
10. Leventhal O
11. Leppä VM
12. Gandal MJ
13. Paskov K
14. Stockham N
15. Polioudakis D
16. Lowe JK
17. Prober DA
18. Geschwind DH
19. Wall DP
2019Inherited and De Novo Genetic Risk for Autism Impacts Shared NetworksCell 178:850–866Google Scholar
1. Satterstrom FK
2. Kosmicki JA
3. Wang J
4. Breen MS
5. De Rubeis S
6. An J-Y
7. Peng M
8. Collins R
9. Grove J
10. Klei L
11. Stevens C
12. Reichert J
13. Mulhern MS
14. Artomov M
15. Gerges S
16. Sheppard B
17. Xu X
18. Bhaduri A
19. Norman U
20. Brand H
21. Schwartz G
22. Nguyen R
23. Guerrero EE
24. Dias C
25. Autism Sequencing Consortium, iPSYCH-Broad Consortium
26. Betancur C
27. Cook EH
28. Gallagher L
29. Gill M
30. Sutcliffe JS
31. Thurm A
32. Zwick ME
33. Børglum AD
34. State MW
35. Cicek AE
36. Talkowski ME
37. Cutler DJ
38. Devlin B
39. Sanders SJ
40. Roeder K
41. Daly MJ
42. Buxbaum JD.
2020Large-Scale Exome Sequencing Study Implicates Both Developmental and Functional Changes in the Neurobiology of AutismCell 180:568–584Google Scholar
1. Schaefer A
2. Kong R
3. Gordon EM
4. Laumann TO
5. Zuo X-N
6. Holmes AJ
7. Eickhoff SB
8. Yeo BTT
2018Local-Global Parcellation of the Human Cerebral Cortex from Intrinsic Functional Connectivity MRICereb Cortex 28:3095–3114Google Scholar
1. Seidlitz J
2. Nadig A
3. Liu S
4. Bethlehem RAI
5. Vértes PE
6. Morgan SE
7. Váša F
8. Romero-Garcia R
9. Lalonde FM
10. Clasen LS
11. Blumenthal JD
12. Paquola C
13. Bernhardt B
14. Wagstyl K
15. Polioudakis D
16. de la Torre-Ubieta L
17. Geschwind DH
18. Han JC
19. Lee NR
20. Murphy DG
21. Bullmore ET
22. Raznahan A.
2020Transcriptomic and cellular decoding of regional brain vulnerability to neurogenetic disordersNat Commun 11:3358Google Scholar
1. Singh T
2. Poterba T
3. Curtis D
4. Akil H
5. Eissa M
6. Barchas JD
7. Bass N
8. Bigdeli TB
9. Breen G
10. Bromet EJ
11. Buckley PF
12. Bunney WE
13. Bybjerg-Grauholm J
14. Byerley WF
15. Chapman SB
16. Chen WJ
17. Churchhouse C
18. Craddock N
19. Curtis C
20. Cusick CM
21. DeLisi L
22. Dodge S
23. Escamilla MA
24. Eskelinen S
25. Fanous AH
26. Faraone SV
27. Fiorentino A
28. Francioli L
29. Gabriel SB
30. Gage D
31. Taliun SA
32. Ganna A
33. Genovese G
34. Glahn DC
35. Grove J
36. Hall M-H
37. Hamalainen E
38. Heyne HO
39. Holi M
40. Hougaard DM
41. Howrigan DP
42. Huang H
43. Hwu H-G
44. Kahn RS
45. Kang HM
46. Karczewski K
47. Kirov G
48. Knowles JA
49. Lee FS
50. Lehrer DS
51. Lescai F
52. Malaspina D
53. Marder SR
54. McCarroll SA
55. Medeiros H
56. Milani L
57. Morley CP
58. Morris DW
59. Mortensen PB
60. Myers RM
61. Nordentoft M
62. Olivares AM
63. Ongur D
64. Ouwehand WH
65. Palmer DS
66. Paunio T
67. Quested D
68. Rapaport MH
69. Rees E
70. Rollins B
71. Kyle Satterstrom F
72. Schatzberg A
73. Scolnick E
74. Scott L
75. Sharp SI
76. Sklar P
77. Smoller JW
78. Sobell J l.
79. Solomonson M
80. Stevens CR
81. Suvisaari J
82. Tiao G
83. Watson SJ
84. Watts NA
85. Blackwood DH
86. Borglum A
87. Cohen BM
88. Corvin AP
89. Esko T
90. Freimer NB
91. Glatt SJ
92. Hultman CM
93. McQuillin A
94. Palotie A
95. Pato CN
96. Pato MT
97. Pulver AE
98. St. Clair D
99. Tsuang MT
100. Vawter MP
101. Walters JT
102. Werge T
103. Ophoff RA
104. Sullivan PF
105. Owen MJ
106. Boehnke M
107. Neale BM
108. Daly MJ.
2020Exome sequencing identifies rare coding variants in 10 genes which confer substantial risk for schizophreniamedRxiv 2020:09.18.20192815Google Scholar
1. Sjöstedt E
2. Zhong W
3. Fagerberg L
4. Karlsson M
5. Mitsios N
6. Adori C
7. Oksvold P
8. Edfors F
9. Limiszewska A
10. Hikmet F
11. Huang J
12. Du Y
13. Lin L
14. Dong Z
15. Yang L
16. Liu X
17. Jiang H
18. Xu X
19. Wang J
20. Yang H
21. Bolund L
22. Mardinoglu A
23. Zhang C
24. Feilitzen K
25. Lindskog C
26. Pontén F
27. Luo Y
28. Hökfelt T
29. Uhlén M
30. Mulder J.
2020An atlas of the protein-coding genes in the human, pig, and mouse brainScience 367https://doi.org/10.1126/science.aay5947 Google Scholar
1. Smith SM
2. Douaud G
3. Chen W
4. Hanayik T
5. Alfaro-Almagro F
6. Sharp K
7. Elliott LT
2021An expanded set of genome-wide association studies of brain imaging phenotypes in UK BiobankNat Neurosci 24:737–745Google Scholar
1. Spocter MA
2. Hopkins WD
3. Barks SK
4. Bianchi S
5. Hehmeyer AE
6. Anderson SM
7. Stimpson CD
8. Fobbs AJ
9. Hof PR
10. Sherwood CC
2012Neuropil distribution in the cerebral cortex differs between humans and chimpanzeesJ Comp Neurol 520:2917–2929Google Scholar
1. Szklarczyk D
2. Gable AL
3. Lyon D
4. Junge A
5. Wyder S
6. Huerta-Cepas J
7. Simonovic M
8. Doncheva NT
9. Morris JH
10. Bork P
11. Jensen LJ
12. Mering C von.
2019STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasetsNucleic Acids Res 47:D607–D613Google Scholar
1. Tam V
2. Patel N
3. Turcotte M
4. Bossé Y
5. Paré G
6. Meyre D
2019Benefits and limitations of genome-wide association studiesNature Reviews Genetics https://doi.org/10.1038/s41576-019-0127-1 Google Scholar
1. Tasic B
2. Menon V
3. Nguyen TN
4. Kim TK
5. Jarsky T
6. Yao Z
7. Levi B
8. Gray LT
9. Sorensen SA
10. Dolbeare T
11. Bertagnolli D
12. Goldy J
13. Shapovalova N
14. Parry S
15. Lee C
16. Smith K
17. Bernard A
18. Madisen L
19. Sunkin SM
20. Hawrylycz M
21. Koch C
22. Zeng H
2016Adult mouse cortical cell taxonomy revealed by single cell transcriptomicsNat Neurosci 19:335–346Google Scholar
1. Toro R
2. Burnod Y
2005A Morphogenetic Model for the Development of Cortical ConvolutionsCereb Cortex 15:1900–1913Google Scholar
1. Van Essen DC.
2020A 2020 view of tension-based cortical morphogenesisProc Natl Acad Sci U S A https://doi.org/10.1073/pnas.2016830117 Google Scholar
1. Velmeshev D
2. Schirmer L
3. Jung D
4. Haeussler M
5. Perez Y
6. Mayer S
7. Bhaduri A
8. Goyal N
9. Rowitch DH
10. Kriegstein AR
2019Single-cell genomics identifies cell type-specific molecular changes in autismScience 364:685–689Google Scholar
1. Economo CF
2. Koskinas GN.
1925Die cytoarchitektonik der hirnrinde des erwachsenen menschenJ. Springer Google Scholar
1. Wael R
2. Benkarim O
3. Paquola C
4. Lariviere S
5. Royer J
6. Tavakol S
7. Xu T
8. Hong S-J
9. Langs G
10. Valk S
11. Misic B
12. Milham M
13. Margulies D
14. Smallwood J
15. Bernhardt BC.
2020BrainSpace: a toolbox for the analysis of macroscale gradients in neuroimaging and connectomics datasetsCommun Biol 3:103Google Scholar
1. Wagstyl K
2. Larocque S
3. Cucurull G
4. Lepage C
5. Cohen JP
6. Bludau S
7. Palomero-Gallagher N
8. Lewis LB
9. Funck T
10. Spitzer H
11. Dickscheid T
12. Fletcher PC
13. Romero A
14. Zilles K
15. Amunts K
16. Bengio Y
17. Evans AC
2020BigBrain 3D atlas of cortical layers: Cortical and laminar thickness gradients diverge in sensory and motor corticesPLoS Biol 18:e3000678Google Scholar
1. Weinstein SM
2. Vandekar SN
3. Adebimpe A
4. Tapera TM
5. Robert-Fitzgerald T
6. Gur RC
7. Gur RE
8. Raznahan A
9. Satterthwaite TD
10. Alexander-Bloch AF
11. Shinohara RT
2021A simple permutation-based test of intermodal correspondenceHum Brain Mapp 42:5175–5187Google Scholar
1. Werling DM
2. Pochareddy S
3. Choi J
4. An J-Y
5. Sheppard B
6. Peng M
7. Li Z
8. Dastmalchi C
9. Santpere G
10. Sousa AMM
11. Tebbenkamp ATN
12. Kaur N
13. Gulden FO
14. Breen MS
15. Liang L
16. Gilson MC
17. Zhao X
18. Dong S
19. Klei L
20. Cicek AE
21. Buxbaum JD
22. Adle-Biassette H
23. Thomas J-L
24. Aldinger KA
25. O’Day DR
26. Glass IA
27. Zaitlen NA
28. Talkowski ME
29. Roeder K
30. State MW
31. Devlin B
32. Sanders SJ
33. Sestan N
2020Whole-Genome and RNA Sequencing Reveal Variation and Transcriptomic Coordination in the Developing Human Prefrontal CortexCell Rep 31:107489Google Scholar
1. Xia J
2. Zhang C
3. Wang F
4. Meng Y
5. Wu Z
6. Wang L
7. Lin W
8. Shen D
9. Li G
2018A COMPUTATIONAL METHOD FOR LONGITUDINAL MAPPING OF ORIENTATION-SPECIFIC EXPANSION OF CORTICAL SURFACE AREA IN INFANTSProc IEEE Int Symp Biomed Imaging 2018:683–686Google Scholar
1. Xu X
2. Sun C
3. Sun J
4. Shi W
5. Shen Y
6. Zhao R
7. Luo W
8. Li M
9. Wang G
10. Wu D
2022Spatiotemporal Atlas of the Fetal Brain Depicts Cortical Developmental GradientJ Neurosci 42:9435–9449Google Scholar
1. Yarkoni T
2. Poldrack RA
3. Nichols TE
4. Van Essen DC
5. Wager TD.
2011Large-scale automated synthesis of human functional neuroimaging dataNat Methods 8:665–670Google Scholar
1. Yeo BTT
2. Krienen FM
3. Sepulcre J
4. Sabuncu MR
5. Lashkari D
6. Hollinshead M
7. Roffman JL
8. Smoller JW
9. Zöllei L
10. Polimeni JR
11. Fischl B
12. Liu H
13. Buckner RL
2011The organization of the human cerebral cortex estimated by intrinsic functional connectivityJ Neurophysiol 106:1125–1165Google Scholar
1. Yip AM
2. Horvath S
2007Gene network interconnectedness and the generalized topological overlap measureBMC Bioinformatics 8:22Google Scholar
1. Zeng H
2. Shen EH
3. Hohmann JG
4. Oh SW
5. Bernard A
6. Royall JJ
7. Glattfelder KJ
8. Sunkin SM
9. Morris JA
10. Guillozet-Bongaarts AL
11. Smith KA
12. Ebbert AJ
13. Swanson B
14. Kuan L
15. Page DT
16. Overly CC
17. Lein ES
18. Hawrylycz MJ
19. Hof PR
20. Hyde TM
21. Kleinman JE
22. Jones AR
2012Large-scale cellular-resolution gene profiling in human neocortex reveals species-specific molecular signaturesCell 149:483–496Google Scholar
1. Zhang B
2. Horvath S
2005A general framework for weighted gene co-expression network analysisStat Appl Genet Mol Biol 4Google Scholar
1. Zhang Y
2. Sloan SA
3. Clarke LE
4. Caneda C
5. Plaza CA
6. Blumenthal PD
7. Vogel H
8. Steinberg GK
9. Edwards MSB
10. Li G
11. Duncan JA
12. Cheshier SH
13. Shuer LM
14. Chang EF
15. Grant GA
16. Gephart MGH
17. Barres BA
2016Purification and Characterization of Progenitor and Mature Human Astrocytes Reveals Transcriptional and Functional Differences with MouseNeuron 89:37–53Google Scholar

Article and author information

Author information

Konrad Wagstyl
Wellcome Centre for Human Neuroimaging, University College London, London, UK
ORCID iD: 0000-0003-3439-5808
- Corresponding author. Email: k.wagstyl@ucl.ac.uk
Sophie Adler
UCL Great Ormond Street Institute for Child Health, 30 Guilford St, Holborn, London WC1N 1EH
Jakob Seidlitz
Department of Psychiatry, University of Pennsylvania, Philadelphia, PA 19104, Department of Child and Adolescent Psychiatry and Behavioral Science, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104
Simon Vandekar
Department of Biostatistics, Vanderbilt University, Nashville, Tennessee, USA
Travis T. Mallard
Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA, Department of Psychiatry, Harvard Medical School, Boston, MA, USA
Richard Dear
Department of Psychiatry, University of Cambridge, Cambridge, CB2 0SZ, UK
Alex R. DeCasien
Section on Developmental Neurogenomics, Human Genetics Branch, National Institute of Mental Health, Bethesda, MD, USA
Theodore D. Satterthwaite
Department of Psychiatry, University of Pennsylvania, Philadelphia, PA 19104, Lifespan Informatics and Neuroimaging Center, University of Pennsylvania School of Medicine, Philadelphia, PA, 19104
ORCID iD: 0000-0001-7072-9399
Siyuan Liu
Section on Developmental Neurogenomics, Human Genetics Branch, National Institute of Mental Health, Bethesda, MD, USA
Petra E. Vértes
Department of Psychiatry, University of Cambridge, Cambridge, CB2 0SZ, UK
Russell T. Shinohara
Penn Statistics in Imaging and Visualization Center, Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Aaron Alexander-Bloch
Department of Psychiatry, University of Pennsylvania, Philadelphia, PA 19104, Department of Child and Adolescent Psychiatry and Behavioral Science, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104
Daniel H. Geschwind
Center for Autism Research and Treatment, Semel Institute, Program in Neurogenetics, Department of Neurology, and Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, US
Armin Raznahan
Section on Developmental Neurogenomics, Human Genetics Branch, National Institute of Mental Health, Bethesda, MD, USA

Version history

Preprint posted: February 11, 2023
Sent for peer review: March 7, 2023
Reviewed Preprint version 1: May 24, 2023
Reviewed Preprint version 2: January 11, 2024
Version of Record published: February 7, 2024

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.86933. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.

Revised: This Reviewed Preprint has been revised by the authors in response to the previous round of peer review; the eLife assessment and the public reviews have been updated where necessary by the editors and peer reviewers.

Reviewing Editor
Saad Jbabdi
University of Oxford, Oxford, United Kingdom
Senior Editor
Floris de Lange
Donders Institute for Brain, Cognition and Behaviour, Nijmegen, Netherlands

Reviewer #1 (Public Review):

The manuscript by Wagstyl et al. describes an extensive analysis of gene expression in the human cerebral cortex and the association with a large variety of maps capturing many of its microscopic and macroscopic properties. The core methodological contribution is the computation of continuous maps of gene expression for >20k genes, which are being shared with the community. The manuscript is a demonstration of several ways in which these maps can be used to relate gene expression with histological features of the human cortex, cytoarchitecture, folding, function, development and disease risk. The main scientific contribution is to provide data and tools to help substantiate the idea of the genetic regulation of multi-scale aspects of the organisation of the human brain. The manuscript is dense, but clearly written and beautifully illustrated.

https://doi.org/10.7554/eLife.86933.2.sa1

Reviewer #2 (Public Review):

This is a valuable contribution that will facilitate brain transcriptomic analyses and the joint analyses of gene expression and structural and functional imaging. The methods used are solid, and the authors conducted a wide range of analyses to demonstrate the value of the dense gene expression data.

https://doi.org/10.7554/eLife.86933.2.sa0

Author Response

The following is the authors’ response to the original reviews.

Reviewer #1 (Public Review):

The manuscript by Wagstyl et al. describes an extensive analysis of gene expression in the human cerebral cortex and the association with a large variety of maps capturing many of its microscopic and macroscopic properties. The core methodological contribution is the computation of continuous maps of gene expression for >20k genes, which are being shared with the community. The manuscript is a demonstration of several ways in which these maps can be used to relate gene expression with histological features of the human cortex, cytoarchitecture, folding, function, development and disease risk. The main scientific contribution is to provide data and tools to help substantiate the idea of the genetic regulation of multi-scale aspects of the organisation of the human brain. The manuscript is dense, but clearly written and beautifully illustrated.

Main comments

The starting point for the manuscript is the construction of continuous maps of gene expression for most human genes. These maps are based on the microarray data from 6 left human brain hemispheres made available by the Allen Brain Institute. By technological necessity, the microarray data is very sparse: only 1304 samples to map all the cortex after all subjects were combined (a single individual's hemisphere has ~400 samples). Sampling is also inhomogeneous due to the coronal slicing of the tissue. To obtain continuous maps on a mesh, the authors filled the gaps using nearest-neighbour interpolation followed by strong smoothing. This may have two potentially important consequences that the authors may want to discuss further: (a) the intrinsic geometry of the mesh used for smoothing will introduce structure in the expression map, and (b) strong smoothing will produce substantial, spatially heterogeneous, autocorrelations in the signal, which are known to lead to a significant increase in the false positive rate (FPR) in the spin tests they used.

Many thanks to the reviewer for their considered feedback. We have addressed these primary concerns into point-by-point responses below. The key conclusions from our new analyses are: (i) while the intrinsic geometry of the mesh had not originally been accounted for in sufficient detail, the findings presented in this manuscript paper are not driven by mesh-induced structure, (ii) that the spin test null models used in this manuscript [(including a modified version introduced in response to (i)] are currently the most appropriate way to mitigate against inflated false positive rates when making statistical inferences on smooth, surface-based data.

a. Structured smoothing

A brain surface has intrinsic curvature (Gaussian curvature, which cannot be flattened away without tearing). The size of the neighbourhood around each surface vertex will be determined by this curvature. During surface smoothing, this will make that the weight of each vertex will be also modulated by the local curvature, i.e., by large geometric structures such as poles, fissures and folds. The article by Ciantar et al (2022, https://doi.org/10.1007/s00429-022-02536-4) provides a clear illustration of this effect: even the mapping of a volume of pure noise into a brain mesh will produce a pattern over the surface strikingly similar to that obtained by mapping resting state functional data or functional data related to a motor task.

Comment 1

It may be important to make the readers aware of this possible limitation, which is in large part a consequence of the sparsity of the microarray sampling and the necessity to map that to a mesh. This may confound the assessments of reproducibility (results, p4). Reproducibility was assessed by comparing pairs of subgroups split from the total 6. But if the mesh is introducing structure into the data, and if the same mesh was used for both groups, then what's being reproduced could be a combination of signal from the expression data and signal induced by the mesh structure.

Response 1

The reviewer raises an important question regarding the potential for interpolation and smoothing on a cortical mesh to induce a common/correlated signal due to the intrinsic mesh structure. We have now generated a new null model to test this idea which indicates that intrinsic mesh structure is not inflating reproducibility in interpolated expression maps. This new null model spins the original samples prior to interpolation, smoothing and comparison between triplet splits of the six donors, with independent spins shared across the triplet. For computational tractability we took one pair of triplets and regenerated the dataset for each triplet using 10 independent spins. We used these to estimate gene-gene null reproducibility for 90 independent pairwise combinations of these 10 spins. Across these 90 permutations, the average median gene-gene correlation was R=0.03, whereas in the unspun triplet comparisons this was R=0.36. These results indicate that the primary source of the gene-level triplet reproducibility is the underlying shared gene expression pattern rather than interpolation-induced structure.

In Methods 2a: "An additional null dataset was generated to test whether intrinsic geometry of the cortical mesh and its impact on interpolation for benchmarking analyses of DEMs and gradients (Fig S1d, Fig S2d, Fig S3c). In these analyses, the original samples were rotated on the spherical surface prior to subsequent interpolation, smoothing and gradient calculation. Due to computational constraints the full dataset was recreated only for 10 independent spins. These are referred to as the “spun+interpolated null”.

Author response image 1.

Figure S1d, Gene predictability was higher across all triplet-triplet pairs than when compared to spun+interpolated null.

Comment 2

It's also possible that mesh-induced structure is responsible in part for the "signal boost" observed when comparing raw expression data and interpolated data (fig S1a). How do you explain the signal boost of the smooth data compared with the raw data otherwise?

Response 2

We thank the reviewer for highlighting this issue of mesh-induced structure. We first sought to quantify the impact of mesh-induced structure through the new null model, in which the data are spun prior to interpolation. New figure S1d, S2d and S3c all show that the main findings are not driven by interpolation over a common mesh structure, but rather originate in the underlying expression data.

Specifically, for the original Figure S1a, the reviewer highlights a limitation that we compared intersubject predictability of raw-sample to raw-sample and interpolated-to-interpolated. In this original formulation improved prediction scores for interpolated-to-interpolated (the “signal boost”) could be driven by mesh-induced structure being applied to both the input and predicted maps. We have updated this so that we are now comparing raw-to-raw and interpolated-to-raw, i.e. whether interpolated values are better estimations of the measured expression values. The new Fig S1a&b (see below) shows a signal boost in gene-level and vertex level prediction scores (delta R = +0.05) and we attribute this to the minimisation of location and measurement noise in the raw data, improving the intersubject predictability of expression levels.

In Methods 2b: "To assess the effect of data interpolation in DEM generation we compared gene-level and vertex-level reproducibility of DEMs against a “ground truth” estimate of these reproducibility metrics based on uninterpolated expression data. To achieve a strict comparison of gene expression values between different individuals at identical spatial locations we focused these analyses on the subset of AHBA samples where a sample from one subject was within 3 mm geodesic distance of another. This resulted in 1097 instances (spatial locations) with measures of raw gene expression of one donor, and predicted values from the second donor’s un-interpolated AHBA expression data and interpolated DEM. We computed gene-level and vertex-level reproducibility of expression using the paired donor data at each of these sample points for both DEM and uninterpolated AHBA expression values. By comparing DEM reproducibility estimates with those for uninterpolated AHBA expression data, we were able to quantify the combined effect of interpolation and smoothing steps in DEM generation. We used gene-level reproducibility values from DEMs and uninterpolated AHBA expression data to compute a gene-level difference in reproducibility, and we then visualized the distribution of these difference values across genes (Fig S1a). We used gene-rank correlation to compare vertex-level reproducibility values between DEMs and uninterpolated AHBA expression data (Fig S1b)."

Author response image 2.

Figure S1. Reproducibility of Dense Expression Maps (DEMs) interpolated from spatially sparse postmortem measures of cortical gene expression. a, Signal boost in the interpolated DEM dataset vs. spatially sparse expression data. Restricting to samples taken from approximately the same cortical location in pairs of individuals (within 3mm geodesic distance), there was an overall improvement in intersubject spatial predictability in the interpolated maps. Furthermore, genes with lower predictability in the interpolated maps were less predictable in the raw dataset, suggesting these regions exhibit higher underlying biological variability rather than methodologically introduced bias. b, Similarly at the paired sample locations, gene-rank predictability was generally improved in DEMs vs. sparse expression data (median change in R from sparse samples to interpolated for each pair of subjects, +0.5).

How do you explain that despite the difference in absolute value the combined expression maps of genes with and without cortical expression look similar? (fig S1e: in both cases there's high values in the dorsal part of the central sulcus, in the occipital pole, in the temporal pole, and low values in the precuneus and close to the angular gyrus). Could this also reflect mesh-smoothing-induced structure?

Response 3

As with comment 1, this is an interesting perspective that we had not fully considered. We would first like to clarify that non-cortical expression is defined from the independent datasets including the “cortex” tissue class of the human protein atlas and genes identified as markers for cortical layers or cortical cells in previous studies. This is still likely an underestimate of true cortically expressed genes as some of these “non-cortical genes” had high intersubject reproducibility scores. Nevertheless we think it appropriate to use a measure of brain expression independent of anything included in other analyses for this paper. These considerations are part of the reason we provide all gene maps with accompanying uncertainty scores for user discretion rather than simply filtering them out.

In terms of the spatially consistent pattern of the gene ranks of Fig S1f, this consistent spatial pattern mirrors Transcriptomic Distinctiveness (r=0.52 for non-cortical genes, r=0.75 for cortical genes), so we think that as the differences in expression signatures become more extreme, the relative ranks of genes in that region are more reproducible/easier to predict.

To assess whether mesh-smoothing-induced structure is playing a role, we carried out an additional the new null model introduced in response to comment 1, and asked if the per-vertex gene rank reproducibility of independently spun subgroup triplets showed a similar structure to that in our original analyses. Across the 90 permutations, the median correlation between vertex reproducibility and TD was R=0.10. We also recalculated the TD maps for the 10 spun datasets and the mean correlation with the original TD did not significantly differ from zero (mean R = 0.01, p=0.2, nspins =10). These results indicate that folding morphology is not the major driver of local or large scale patterning in the dataset. We have included this as a new Figure S3c.

We have updated the text as follows:

In Methods 3a: "Third, to assess whether the covariance in spatial patterning across genes could be a result of mesh-associated structure introduced through interpolation and smoothing, TD maps were recomputed for the spun+interpolated null datasets and compared to the original TD map (Fig S3c)."

In Results: "The TD map observed from the full DEMs library was highly stable between all disjoint triplets of donors (Methods, Fig S3a, median cross-vertex correlation in TD scores between triplets r=0.77) and across library subsets at all deciles of DEM reproducibility (Methods, Fig S3b, cross-vertex correlation in TD scores r>0.8 for the 3rd-10th deciles), but was not recapitulated in spun null datasets (Fig S3c)."

Author response image 3.

Figure S3c, Correlations between TD and TD maps regenerated on datasets spun using two independent nulls, one where the rotation is applied prior to interpolation and smoothing (spun+interpolated) and one where it is applied to the already-created DEMs. In each null, the same rotation matrix is applied to all genes.

Comment 4

Could you provide more information about the way in which the nearest-neighbours were identified (results p4). Were they nearest in Euclidean space? Geodesic? If geodesic, geodesic over the native brain surface? over the spherically deformed brain? (Methods cite Moresi & Mather's Stripy toolbox, which seems to be meant to be used on spheres). If the distance was geodesic over the sphere, could the distortions introduced by mapping (due to brain anatomy) influence the geometry of the expression maps?

Response 4

We have clarified in the Methods that the mapping is to nearest neighbors on the spherically-inflated surface.

The new null model we have introduced in response to comments 1 & 3 preserves any mesh-induced structure alongside any smoothing-induced spatial autocorrelations, and the additional analyses above indicate that main results are not induced by systematic mesh-related interpolation signal. In response to an additional suggestion from the reviewer (Comment 13), we also assessed whether local distortions due to the mesh could be creating apparent border effects in the data, for instance at the V1-V2 boundary. At the V1-V2 border, which coincides anatomically with the calcarine sulcus, we computed the 10 genes with the highest expression gradient along this boundary in the actual dataset and the spun-interpolated null. The median test expression gradients along this border was higher than in any of the spun datasets, indicating that these boundary effects are not explained by the interpolation and cortical geometry effects on the data (new Fig S2d). The text has been updated as follows:

In Methods 1: "For cortical vertices with no directly sampled expression, expression values were interpolated from their nearest sampled neighbor vertex on the spherical surface (Moresi and Mather, 2019) (Fig 1b)."

In Methods 2: "We used the spun+interpolated null to test whether high gene gradients could be driven by non-uniform interpolation across cortical folds. We quantified the average gradient for all genes along the V1-V2 border in the atlas, as well as for 10 iterations of the atlas where the samples were spun prior to interpolation. We computed the median gradient magnitude for the 20 top-ranked genes for each (Fig S2d)."

Author response image 4.

Figure S2d Mean of gradient magnitudes for 20 genes with largest gradients along V1-V2 border, compared to values along the same boundary on the spun+interpolated null atlas. Gradients were higher in the actual dataset than in all spun version indicating this high gradient feature is not primarily due to the effects of calcarine sulcus morphology on interpolation

Comment 5

Could you provide more information about the smoothing algorithm? Volumetric, geodesic over the native mesh, geodesic over the sphere, averaging of values in neighbouring vertices, cotangent-weighted laplacian smoothing, something else?

Response 5

We are using surface-based geodesic over the white surface smoothing described in Glasser et al., 2013 and used in the HCP workbench toolbox (https://www.humanconnectome.org/software/connectome-workbench). We have updated the methods to clarify this.

In Methods 1: "Surface expression maps were smoothed using the Connectome Workbench toolbox (Glasser et al. 2013) with a 20mm full-width at half maximum Gaussian kernel , selected to be consistent with this sampling density (Fig 1c)."

Comment 6

Could you provide more information about the method used for computing the gradient of the expression maps (p6)? The gradient and the laplacian operator are related (the laplacian is the divergence of the gradient), which could also be responsible in part for the relationships observed between expression transitions and brain geometry.

Response 6

We are using Connectome Workbench’s metric gradient command for this Glasser et al., 2013 and used in the HCP workbench pipeline. The source code for gradient calculation can be found here: https://github.com/Washington-University/workbench/blob/131e84f7b885d82af76e be21adf2fa97795e2484/src/Algorithms/AlgorithmMetricGradient.cxx

In Methods 2: >For each of the resulting 20,781 gene-level expression maps, the orientation and magnitude of gene expression change at each vertex (i.e. the gradient) was calculated for folded, inflated, spherical and flattened mesh representations of the cortical sheet using Connectome Workbench’s metric gradient command (Glasser et al. 2013).

b. Potentially inflated FPR for spin tests on autocorrelated data."

Spin tests are extensively used in this work and it would be useful to make the readers aware of their limitations, which may confound some of the results presented. Spin tests aim at establishing if two brain maps are similar by comparing a measure of their similarity over a spherical deformation of the brains against a distribution of similarities obtained by randomly spinning one of the spheres. It is not clear which specific variety of spin test was used, but the original spin test has well known limitations, such as the violation of the assumption of spatial stationarity of the covariance structure (not all positions of the spinning sphere are equivalent, some are contracted, some are expanded), or the treatment of the medial wall (a big hole with no data is introduced when hemispheres are isolated).

Another important limitation results from the comparison of maps showing autocorrelation. This problem has been extensively described by Markello & Misic (2021). The strong smoothing used to make a continuous map out of just ~1300 samples introduces large, geometry dependent autocorrelations. Indeed, the expression maps presented in the manuscript look similar to those with the highest degree of autocorrelation studied by Markello & Misic (alpha=3). In this case, naive permutations should lead to a false positive rate ~46% when comparing pairs of random maps, and even most sophisticated methods have FPR>10%.

Comment 7 There's currently several researchers working on testing spatial similarity, and the readers would benefit from being made aware of the problem of the spin test and potential solutions. There's also packages providing alternative implementations of spin tests, such as BrainSMASH and BrainSpace, which could be mentioned.

Response 7

We thank the reviewer for raising the issue of null models. First, with reference to the false positive rate of 46% when maps exhibit spatial autocorrelation, we absolutely agree that this is an issue that must be accounted for and we address this using the spin test. We acknowledge there has been other work on nulls such as BrainSMASH and BrainSpace. Nevertheless in the Markello and Misic paper to which the reviewer refers, the BrainSmash null models perform worse with smoother maps (with false positive rates approaching 30% in panel e below), whereas the spin test maintains false positives rates below 10%.

Author response image 5.

We have added a brief description of the challenge and our use of the spin test.

In Methods 2a: "Cortical maps exhibit spatial autocorrelation that can inflate the False Positive Rate, for which a number of methods have been proposed(Alexander-Bloch et al. 2018; Burt et al. 2020; Vos de Wael et al. 2020). At higher degrees of spatial smoothness, this high False Positive Rate is most effectively mitigated using the spin test(Alexander-Bloch et al. 2018; Markello and Misic 2021; Vos de Wael et al. 2020). In the following analyses when generating a test statistic comparing two spatial maps, to generate a null distribution, we computed 1000 independent spins of the cortical surface using https://netneurotools.readthedocs.io, and applied it to the first map whilst keeping the second map unchanged. The test statistic was then recomputed 1000 times to generate a null distribution for values one might observe by chance if the maps shared no common organizational features. This is referred to throughout as the “spin test” and the derived p-values as pspin."

Comment 8

Could it be possible to measure the degree of spatial autocorrelation?

Response 8

We agree this could be a useful metric to generate for spatial cortical maps. However, there are multiple potential metrics to choose from and each of the DEMs would have their own value. To address this properly would require the creation of a set of validated tools and it is not clear how we could summarize this variety of potential metrics for 20k genes. Moreover, as discussed above the spin method is an adequate null across a range of spatial autocorrelation degrees, thus while we agree that in general estimation of spatial smoothness could be a useful imaging metric to report, we consider that it is beyond the scope of the current manuscript.

Comment 9

Could you clarify which version of the spin test was used? Does the implementation come from a package or was it coded from scratch?

Response 9

As Markello & Misic note, at the vertex level, the various implementations of the spin test become roughly equivalent to the ‘original’ Alexander-Bloch et al., implementation. We used took the code for the ‘original’ version implemented in python here: https://netneurotools.readthedocs.io/en/latest/_modules/netneurotools/stats.html# gen_spinsamples.

This has been updated in the methods (see Response 7).

Comment 10

Cortex and non-cortex vertex-level gene rank predictability maps (fig S1e) are strikingly similar. Would the spin test come up statistically significant? What would be the meaning of that, if the cortical map of genes not expressed in the cortex appeared to be statistically significantly similar to that of genes expressed in the cortex?

Response 10

Please see response to comment 3, which also addresses this observation.

Reviewer #2 (Public Review):

The authors convert the AHBA dataset into a dense cortical map and conduct an impressively large number of analyses demonstrating the value of having such data.

I only have comments on the methodology.

Comment 1

First, the authors create dense maps by simply using nearest neighbour interpolation followed by smoothing. Since one of the main points of the paper is the use of a dense map, I find it quite light in assessing the validity of this dense map. The reproducibility values they calculate by taking subsets of subjects are hugely under-powered, given that there are only 6 brains, and they don't inform on local, vertex-wise uncertainties). I wonder if the authors would consider using Gaussian process interpolation. It is really tailored to this kind of problem and can give local estimates of uncertainty in the interpolated values. For hyperparameter tuning, they could use leave-one-brain-out for that.

I know it is a lot to ask to change the base method, as that means re-doing all the analyses. But I think it would strengthen the paper if the authors put as much effort in the dense mapping as they did in their downstream analyses of the data.

Response 1

We thank the reviewer for the suggestion to explore Gaussian process interpolation. We have implemented this for our dataset and attempted to compare this with our original method with the 3 following tests: i) intertriplet reproducibility of individual gene maps, ii) microscale validations: area markers, iii) macroscale validations: bio patterns.

Overall, compared to our original nearest-neighbor interpolation method, GP regression (i) did not substantially improve gene-level reproducibility of expression maps (median correlation increase of R=0.07 which was greater for genes without documented protein expression in cortex): ii) substantially worsened performance in predicting areal marker genes and iii) showed similar but slightly worse performance at predicting macroscale patterns from Figure 1.

Given the significantly poorer performance on one of our key tests (ii) we have opted not to replace our original database, but we do now include code for the alternative GP regression methodology in the github repository so others can reproduce/further develop these methods.

Author response image 6.

ii) Genes ranked by mean expression gradient from current DEMs (left) and Gaussian process-derived interpolation maps (right). Established Human and macaque markers are consistently higher-ranked in DEM maps. iii) Figure 1 Interpolated vs GP regression

Author response table 1.

Comment 2

It is nice that the authors share some code and a notebook, but I think it is rather light. It would be good if the code was better documented, and if the user could have access to the non-smoothed data, in case they was to produce their own dense maps. I was only wondering why the authors didn't share the code that reproduces the many analyses/results in the paper.

Response 2

We thank the reviewer for this suggestion. In response we have updated the shared github repository (https://github.com/kwagstyl/magicc). This now includes code and notebooks to reproduce the main analyses and figures.

Reviewer #1 (Recommendations For The Authors):

Minor comments

Comment 11

p4 mentions Fig S1h, but the supp figures only goes from S1a to S1g

Response 11

We thank the reviewer for capturing this error. It was in fact referring to what is now Fig S1h and has been updated.

Comment 12

It would be important that the authors share all the code used to produce the results in the paper in addition to the maps. The core methodological contribution of the work is a series of continuous maps of gene expression, which could become an important tool for annotation in neuroimaging research. Many arbitrary (reasonable) decisions were made, it would be important to enable users to evaluate their influence on the results.

Response 12

We thank both reviewers for this suggestion. We have updated the github to be able to reproduce the dense maps and key figures with our methods.

Comment 13

p5: Could the sharp border reflect the effect of the geometry of the calcarine sulcus on map smoothing? More generally, could there be an effect of folds on TD?

Response 13

Please see our response to Reviewer 1, Comment 1 above, where we introduce the new null models now analyzed to test for effects of mesh geometry on our findings. These new null models - where original source data were spun prior to interpolation suggest that neither the sharp V1/2 border or the TD map are effects of mesh geometry. Specifically: (i) , the magnitudes of gradients along the V1/2 boundary from null models were notably smaller than those in our original analyses (see new figure S2d), and (ii) TD maps computed from the new null models showed no correlation with TD maps from ur original analyses (new Figure S3c, mean R = 0.01, p=0.2, nspins =10).

Comment 14

p5: Similar for the matching with the areas in Glasser's parcellation: the definition of these areas involves alignment through folds (based on freesurfer 'sulc' map, see Glasser et al 2016). If folds influence the geometry of TDs, could that influence the match?

Response 14

We note that Fig S3c provided evidence that folding was not the primary driver of the TD patterning. However, it is true that Glasser et al. use both neuroanatomy (folding, thickness and myelin) and fMRI-derived maps to delineate their cortical areas. As such Figure 2 f & g aren’t fully independent assessments. Nevertheless the reason that these features are used is that many of the sulci in question have been shown to reliably delineate cytoarchitectonic boundaries (Fischl et al., 2008).

In Results: "A similar alignment was seen when comparing gradients of transcriptional change with the spatial orientation of putative cortical areas defined by multimodal functional and structural in vivo neuroimaging(Glasser et al., 2016) (expression change running perpendicular to area long-axis, pspin<0.01, Fig 2g, Methods)."

Comment 15

p6: TD peaks are said to overlap with functionally-specialised regions. A comment on why audition is not there, nor language, but ba 9-46d is? Would that suggest a lesser genetic regulation of those functions?

Response 15

The reviewer raises a valid point and this was a result that we were also surprised by. The finding that the auditory cortex is not as microstructurally distinctive as, say V1, is consistent with other studies applying dimensionality-reduction techniques to multimodal microstructural receptor data (e.g. Zilles et al., 2017, Goulas et al., 2020). These studies found that the auditory microstructure is not as extreme as either visual and somatomotor areas. From a methodological view point, the primary auditory cortex is significantly smaller than both visual and somatomotor areas, and therefore is captured by fewer independent samples, which could reduce the detail in which its structure is being mapped in our dataset.

For the frontal areas, we would note that i) the frontal peak is the smallest of all peaks found and was more strongly characterised by low z-score genes than high z-score. ii) the anatomical areas in the frontal cortex are much more highly variable with respect to folding morphology (e.g. Rajkowska 1995). The anatomical label of ba9-46d (and indeed all other labels) were automatically generated as localisers rather than strict area labels. We have clarified this in the text as follows:

In Methods 3a: "Automated labels to localize TD peaks were generated based on their intersection with a reference multimodal neuroimaging parcellation of the human cortex(Glasser et al., 2016). Each TD was given the label of the multimodal parcel that showed greatest overlap (Fig 2b)."

Comment 16.

p7: The proposition that "there is a tendency for cortical sulci to run perpendicular to the direction of fastest transcriptional change", could also be "there is a tendency for the direction of fastest transcriptional change to run perpendicular to cortical sulci"? More pragmatically, this result from the geometry of transcriptional maps being influenced by sulcal geometry in their construction.

Response 16

Please see our response to Reviewer 1, Comment 1 above, where we introduce the new null models now analyzed to test for effects of mesh geometry on our findings. These models indicate that the topography of interpolated gene expression maps do not reflect influences of sulcal geometry on their construction.

Comment 17

p7: TD transitions are indicated to precede folding. This is based on a consideration of folding development based on the article by Chi et al 1977, which is quite an old reference. In that paper, the authors estimated the tempo of human folding development based on the inspection of photographs, which may not be sufficient for detecting the first changes in curvature leading to folds. The work of the Developing Human Connectome consortium may provide a more recent indication for timing. In their data, by PCW 21 there's already central sulcus, pre-central, post-central, intra-parietal, superior temporal, superior frontal which can be detected by computing the mean curvature of the pial surface (I can only provide a tweet for reference: https://twitter.com/R3RT0/status/1617119196617261056). Even by PCW 9-13 the callosal sulcus, sylvian fissure, parieto-occipital fissure, olfactory sulcus, cingulate sulcus and calcarine fissure have been reported to be present (Kostovic & Vasung 2009).

Response 17

Our field lacks the data necessary to provide a comprehensive empirical test for the temporal ordering of regional transcriptional profiles and emergence of folding. Our results show that transcriptional identities of V1 and TGd are - at least - present at the very earliest stages of sulcation in these regions. In response to the reviewers comment we have updated with a similar fetal mapping project which similarly shows evidence of the folds between weeks 17-21 and made the language around directionality more cautious.

In Results: "The observed distribution of these angles across vertices was significantly skewed relative to a null based on random alignment between angles (pspin<0.01, Fig 2f, Methods) - indicating that there is indeed a tendency for cortical sulci and the direction of fastest transcriptional change to run perpendicular to each other (pspin<0.01, Fig 2f).

As a preliminary probe for causality, we examined the developmental ordering of regional folding and regional transcriptional identity. Mapping the expression of high-ranking TD genes in fetal cortical laser dissection microarray data(Miller et al., 2014) from 21 PCW (Post Conception Weeks) (Methods) showed that the localized transcriptional identity of V1 and TGd regions in adulthood is apparent during the fetal periods when folding topology begins to emerge (Chi et al. 1977; Xu et al. 2022) (Fig " S2d).

In Discussion: "By establishing that some of these cortical zones are evident at the time of cortical folding, we lend support to a “protomap”(Rakic 1988; O'Leary 1989; O'Leary et al. 2007; Rakic et al. 2009) like model where the placement of some cortical folds is set-up by rapid tangential changes in cyto-laminar composition of the developing cortex(Ronan et al., 2014; Toro and Burnod, 2005; Van Essen, 2020). The DEMs are derived from fully folded adult donors, and therefore some of the measured genetic-folding alignment might also be induced by mechanical distortion of the tissue during folding(Llinares-Benadero and Borrell 2019; Heuer and Toro 2019). However, no data currently exist to conclusively assess the directionality of this gene-folding relationship."

Comment 18

p7: In my supplemental figures (obtained from biorxiv, because I didn't find them among the files submitted to eLife) there's no S2j (only S2a-S2i).

Response 18

We apologize, this figure refers to S3k (formerly S3j), rather than S2j. We have updated the main text.

Comment 19 p7: It is not clear from the methods (section 3b) how the adult and fetal brains were compared. Maybe using MSM (Robinson et al 2014)?

Response 19

We have now clarified this in Methods text as reproduced below.

In Methods 3b: "We averaged scaled regional gene expression values between donors per gene, and filtered for genes in the fetal LDM dataset that were also represented in the adult DEM dataset - yielding a single final 20,476*235 gene-by-sample matrix of expression values for the human cortex at 21 PCW. Each TD peak region was then paired with the closest matching cortical label within the fetal regions. This matrix was then used to test if each TD expression signature discovered in the adult DEM dataset (Fig 2, Table 3) was already present in similar cortical regions at 21 PCW."

Comment 20

p7: WGCNA is used prominently, could you provide a brief introduction to its objectives? The gene coexpression networks are produced after adjusting the weight of the network edges to follow a scale-free topology, which is meant to reflect the nature of protein-protein interactions. Soft thresholding increases contrast, but doesn't this decrease a potential role of infinitesimal regulatory signals?

Response 20

We agree with the reviewer that the introduction to WGCNA needed additional details and have amended the Results (see below). One limitation of WGCNA-derived associations is that it will downweigh the role of smaller relationships including potentially important regulatory signals. WGCNA methods have been titrated to capture strong relationships. This is an inherent limitation of all co-expression driven methods which lead to an incomplete characterisation of the molecular biology. Nevertheless we feel these stronger relationships are still worth capturing and interrogating. We have updated the text to introduce WGCNA and acknowledge this potential weakness in the approach.

In Results: "Briefly, WGCNA constructs a constructs a connectivity matrix by quantifying pairwise co-expression between genes, raising the correlations to a power (here 6) to emphasize strong correlations while penalizing weaker ones, and creating a Topological Overlap Matrix (TOM) to capture both pairwise similarities expression and connectivity. Modules of highly interconnected genes are identified through hierarchical clustering. The resultant WGCNA modules enable topographic and genetic integration because they each exist as both (i) a single expression map (eigenmap) for spatial comparison with neuroimaging data (Fig 3a,b, Methods) and, (ii) a unique gene set for enrichment analysis against marker genes systematically capturing multiple scales of cortical organization, namely: cortical layers, cell types, cell compartments, protein-protein interactions (PPI) and GO terms (Methods, Table S2 and S4)."

Comment 21

WGCNA modules look even more smooth than the gene expression maps. Are these maps comparable to low frequency eigenvectors? Autocorrelation in that case should be very strong?

Response 21

These modules are smooth as they are indeed eigenvectors which likely smooth out some of the more detailed but less common features seen in individual gene maps. These do exhibit high degrees of autocorrelation, nevertheless we are applying the spin test which is currently the appropriate null model for spatially autocorrelated cortical maps (Response 7).

Comment 22

If the WGCNA modules provide an orthogonal basis for surface data, is it completely unexpected that some of them will correlate with low-frequency patterns? What would happen if random low frequency patterns were generated? Would they also show correlations with some of the 16 WGCNA modules?

Response 22

We agree with the reviewer that if we used a generative model like BrainSMASH, we would likely see similar low frequency patterns. However, the inserted figure in Response 7 from Makello & Misic provide evidence that is not as conservative a null as the spin test when data exhibit high spatial autocorrelation. The spatial enrichment tests carried out on the WGCNA modules are all carried out using the spin test.

Comment 23

In part (a) I commented on the possibility that brain anatomy may introduce artifactual structure into the data that's being mapped. But what if the relationship between brain geometry and brain organisation were deeper than just the introduction of artefacts? The work of Lefebre et al (2014, https://doi.org/10.1109/ICPR.2014.107; 2018, https://doi.org/10.3389/fnins.2018.00354) shows that clustering based on the 3 lowest frequency eigenvectors of the Laplacian of a brain hemisphere mesh produce an almost perfect parcellation into lobes, with remarkable coincidences between parcel boundaries and primary folds and fissures. The work of Pang et al (https://doi.org/10.1101/2022.10.04.510897) suggests that the geometry of the brain plays a critical role in constraining its dynamics: they analyse >10k task-evoked brain maps and show that the eigenvectors of the brain laplacian parsimoniously explain the activity patterns. Could brain anatomy have a downward effect on brain organisation?

Response 23

The reviewer raises a fascinating extension of our work identifying spatial modes of gene expression. We agree that these are low frequency in nature, but would first like to note that the newly introduced null model indicates that the overlaps with salient neuroanatomical features are inherent in the expression data and not purely driven by anatomy in a methodological sense.

Nevertheless we absolutely agree there is likely to be a complex multidirectional interplay between genetic expression patterns through development, developing morphology and the “final” adult topography of expression, neuroanatomical and functional patterns.

We think that the current manuscript currently contains a lot of in depth analyses of these expression data, but agree that a more extensive modeling analysis of how expression might pattern or explain functional activation would be a fascinating follow on, especially in light of these studies from Pang and Lefebre. Nevertheless we think that this must be left for a future modeling paper integrating these modes of microscale, macroscale and functional anatomy.

In Discussion: "Indeed, future work might find direct links between these module eigenvectors and similar low-frequency eigenvectors of cortical geometry have been used as basis functions to segment the cortex (Lefèvre et al. 2018) and explain complex functional activation patterns(Pang et al. 2023)."

Comment 24

On p11: ASD related to rare, deleterious mutations of strong effect is often associated with intellectual disability (where the social interaction component of ASD is more challenging to assess). Was there some indication of a relationship with that type of cognitive phenotype?

Response 24

Across the two ABIDE cohorts, the total number of those with ASD and IQ <70, which is the clinical threshold for intellectual disability was n=10, which unfortunately did not allow us to conduct a meaningful test of whether ID impacts the relationship between imaging changes in ASD and the expression maps of genes implicated in ASD by rare variants.

Comment 25

Could you clarify if the 6 donors were aligned using the folding-based method in freesurfer?

Response 25

The 6 donors were aligned using MSMsulc (Robinson et al., 2014), which is a folding based method from the HCP group. This is now clarified in the methods.

In Methods 1: "Cortical surfaces were reconstructed for each AHBA donor MRI using FreeSurfer(Fischl, 2012), and coregistered between donors using surface matching of individuals’ folding morphology (MSMSulc) (Robinson et al., 2018)."

Comment 26

The authors make available a rich resource and a series of tools to facilitate their use. They have paid attention to encode their data in standard formats, and their code was made in Python using freely accessible packages instead of proprietary alternatives such as matlab. All this should greatly facilitate the adoption of the approach. I think it would be important to state more explicitly the conceptual assumptions that the methodology brings. In the same way that a GWAS approach relies on a Mendelian idea that individual alleles encode for phenotypes, what is the idea about the organisation of the brain implied by the orthogonal gene expression modules? Is it that phenotypes - micro and macro - are encoded by linear combinations of a reduced number of gene expression patterns? What would be the role of the environment? The role of non-genic regulatory regions? Some modalities of functional organisation do not seem to be encoded by the expression of any module. Is it just for lack of data or should this be seen as the sign for a different organisational principle? Likewise, what about the aspects of disorders that are not captured by expression modules? Would that hint, for example, to stronger environmental effects? What about linear combinations of modules? Nonlinear? Overall, the authors adopt implicitly, en passant, a gene-centric conceptual standpoint, which would benefit from being more clearly identified and articulated. There are citations to Rakic's protomap idea (I would also cite the original 1988 paper, and O'Leary's 1989 "protocortex" paper stressing the role of plasticity), which proposes that a basic version of brain cytoarchitecture is genetically determined and transposed from the proliferative ventricular zone regions to the cortical plate through radial migration. In p13 the authors indicate that their results support Rakic's protomap. Additionally, in p7 the authors suggest that their results support a causal arrow going from gene expression to sulcal anatomy. The reviews by O'leary et al (2007), Ronan & Fletcher (2014, already cited), Llinares-Benadero & Borrell (2019) could be considered, which also advocate for a similar perspective. For nuances on the idea that molecular signals provide positional information for brain development, the article by Sharpe (2019, DOI: 10.1242/dev.185967) is interesting. For nuances on the gene-centric approach of the paper the articles by Rockmann (2012, DOI: 10.1111/j.1558-5646.2011.01486.x) but also from the ENCODE consortium showing the importance of non-genic regions of the genome ("Perspectives on ENCODE" 2020 DOI: 10.1038/s41586-021-04213-8) could be considered. I wouldn't ask to cite ideas from the extended evolutionary synthesis about different inheritance systems (as reviewed by Jablonka & Lamb, DOI: 10.1017/9781108685412) or the idea of inherency (Newman 2017, DOI: 10.1007/978-3-319-33038-9_78-1), but the authors may find them interesting. Same goes for our own work on mechanical morphogenesis which expands on the idea of a downward causality (Heuer and Toro 2019, DOI: 10.1016/j.plrev.2019.01.012)

Response 26

We thank the reviewer for recommending these papers, which we enjoyed reading and have deepened our thinking on the topic. In addition to toning down some of the language with respect to causality that our data cannot directly address, we have included additional discussion and references as follows:

Overall, the manuscript is very interesting and a great contribution. The amount of work involved is impressive, and the presentation of the results very clear. My comments indicate some aspects that could be made more clear, for example, providing additional methodological information in the supplemental material. Also, making aware the readers and future users of MAGICC of the methodological and conceptual challenges that remain to be addressed in the future for this field of research.

Reviewer #2 (Recommendations For The Authors):

Comment 1

The supplementary figures seem to be missing from the eLife submission (although I was able to find them on europepmc)

Response 1

We apologize that these were not included in the documents sent to reviewers. The up-to-date supplementary figures are included in this resubmission and again on biorxiv.

https://doi.org/10.7554/eLife.86933.2.sa3

Significance of findings

Strength of evidence

Abstract

Introduction

Results

Creating and benchmarking spatially dense maps of human cortical gene expression

Creating and Benchmarking Spatial Dense Gene Expression Maps in the Human Cortex.

Defining and surveying the human cortex as a continuous transcriptional terrain

Mapping transcriptional distinctiveness in the human cortex and its alignment with macroscale structure and function.

Cortical gene coexpression integrates diverse spatial scales of human brain organization

Cortex-wide Gene Coexpression Patterns Reflect Multiple Spatial Scales and Developmental Epochs of Brain Organization.

Linking spatial and developmental aspects of cortical organization

ASD risk genes follow two different spatial patterns of cortical expression, which capture distinct aspects of cortical organization and differentially predict cortical changes in ASD

ASD risk genes follow two different spatial patterns of cortical gene expression which differentially predict cortical changes in ASD.

Discussion

Materials and Methods

Materials and Methods overview

1. Creating spatially dense maps of human cortical gene expression (Fig 1a-d)

2. Benchmarking dense expression maps (DEMs)

a. Spin tests for comparing two spatial maps

b. Replicability and independence from cortical sampling density (Fig S1)

c. Alignment with reference measures of cortical organization (Fig 1 e-g)

3. Characterizing the topography of DEMs

a. Transcriptomic distinctiveness (TD) and principal component analysis (Fig 2a-c)

b. Relating adult TD peaks to fetal gene expression (Fig S3k)

c. Local gradient analysis (Fig 2e-g)

Statistical tests used to compare spatial maps and gene sets derived from the Allen Human Brain Atlas with independent multiscale neuroscientific resources.

d. Weighted Gene Co-expression Network Analysis (WGCNA) (Fig 3a-c)

4. Multiscale annotation of WGCNA modules (Fig 3c,d)

a. Map-based annotations

MRI-derived maps of cortical function

MRI-derived maps of cortical structure

Orientation of cortical folds

Inter-eigenmap correlations

b. Gene-set based annotations

GO enrichment

Layer marker gene sets and in situ hybridisation validation

Adult cortical cell type marker gene sets

Fetal cortical cell type marker gene sets

Compartments and SynGO

PPI network

Developmental peak epoch

Developmental trajectories

Fetal compartmental analysis

Reproducibility of genes driving enrichment analyses

5. Combining gene-set based annotations of the cortical sheet (Fig 3e, Fig S3d)

6. Disease enrichment and ASD-based analysis of WGCNA modules

a. Characterizing ASD gene enrichments in M12 and M15

kME analysis

Enrichment of ASD-linked GO terms

Developmental trajectories of disease-linked modules

Independent characterisation of ASD risk genes

b. Comparing M12 and M15 expression to regional changes of cortical gene expression in ASD (Fig 4f)

c. Comparing M12 and M15 expression to regional changes of cortical thickness in ASD (Fig 4g, h, Fig S5c)

7. Preprocessing and analysis of structural MRI data

a. AHBA donors

b. OASIS (Fig 1e)

c. ABIDE

Supporting information

Acknowledgements

Funding

Competing interests

Data availability

References

Article and author information

Author information

Konrad Wagstyl

Sophie Adler

Jakob Seidlitz

Simon Vandekar

Travis T. Mallard

Richard Dear

Alex R. DeCasien

Theodore D. Satterthwaite

Siyuan Liu

Petra E. Vértes

Russell T. Shinohara

Aaron Alexander-Bloch

Daniel H. Geschwind

Armin Raznahan