Brain connectivity is often considered in terms of the communication between functionally distinct brain regions. Many studies have investigated the extent to which patterns of coupling strength between multiple neural populations relates to behaviour. For example, studies have used ‘functional connectivity fingerprints’ to characterise individuals' brain activity. Here, we investigate the extent to which the exact spatial arrangement of cortical regions interacts with measures of brain connectivity. We find that the shape and exact location of brain regions interact strongly with the modelling of brain connectivity, and present evidence that the spatial arrangement of functional regions is strongly predictive of non-imaging measures of behaviour and lifestyle. We believe that, in many cases, cross-subject variations in the spatial configuration of functional brain regions are being interpreted as changes in functional connectivity. Therefore, a better understanding of these effects is important when interpreting the relationship between functional imaging data and cognitive traits.https://doi.org/10.7554/eLife.32992.001
People differ a lot from one another in terms of their personality, behaviour and lifestyle. This individuality is attributed to the different regions in the brain, and the strength of communication between them. The connectivity pattern between these areas is thought to be as unique as a fingerprint. If the connections are weak or disrupted it can play a role in conditions such as schizophrenia, depression or Alzheimer’s disease. It is thought that the strength of the connection depends on how strongly the nerve cells in these regions communicate. But are these individual differences solely caused by different strengths of connection, or could other factors contribute to them?
Now, Bijsterbosch et al. found that the size, shape and exact position of the brain regions was also strongly linked to the different behaviours of individuals. The study used brain scans, behavioural tests and questionnaires from a large database about lifestyle choices and demographics, to analyse the relationship between the different brain features of healthy individuals. The results showed that the variations in the brain regions were linked to many behavioural factors including intelligence, life satisfaction, drug use and aggression problems. Moreover, Bijsterbosch et al. showed that the existing methods for estimating the strength of connection between brain regions could reveal more about the spatial layout of these regions than the actual connection strength between them. This suggests that new approaches are needed to properly evaluate the strength of the connections.
Some psychiatric and neurological diseases may be associated with changes in size and position of the different regions in the brain. In future, the findings of this study could be applied to individuals affected by such conditions, to see if the location of a region could be used as a diagnostic indicator.https://doi.org/10.7554/eLife.32992.002
The organisation of the human brain into large-scale functional networks has been investigated extensively over the past two decades using resting state functional magnetic resonance imaging (rfMRI). Spontaneous fluctuations in distinct brain regions (as measured with rfMRI) show temporal correlations with each other, revealing complex patterns of functional connectivity (FC) (Biswal et al., 1995; Friston, 1994, 2011). Extensive connectivity between cortical areas and with subcortical brain regions has long been considered a core feature of brain anatomy and function (Crick and Jones, 1993), and dysfunctional coupling is associated with a variety of neurological and psychiatric disorders including schizophrenia, depression, and Alzheimer’s disease (Castellanos et al., 2013). Given the great potential neuroscientific and clinical value of rfMRI, it is important to determine which aspects of rfMRI data most sensitively and interpretably reflect trait variability across subjects. At a neural level, potential sources of meaningful cross-subject variability include: (i) the strength of the functional coupling (i.e. interactions) between two different neural populations (‘coupling’), and (ii) the spatial configuration and organisation of functional regions (‘topography’). In this study, we aim to identify how these key aspects of rfMRI data influence derived measures of functional connectivity and how they relate to interesting trait variability in behaviour and lifestyle across individuals. Our findings reveal variability in the spatial topography of functional regions across subjects, and suggest that this variability is the primary driver of cross-subject trait variability in correlation-based FC measures obtained via group-level rfMRI parcellation approaches. These results have important implications for future rfMRI research, and for the interpretation of FC findings.
A commonly applied approach used to derive FC measures from rfMRI data is to parcellate the brain into a set of functional regions (‘nodes’), and estimate the temporal correlations between pairs of node timeseries (‘edges’) to build a network matrix (Smith et al., 2013b). This approach has previously been likened to a fingerprint, enabling the unique identification of individuals, and the prediction of behavioural traits such as intelligence (Finn et al., 2015; Passingham et al., 2002). Of particular interest is the ability of network matrices to explain cross-subject variability in behaviour and performance on psychometric tests. To this end, Cross Correlation Analysis (CCA) was previously adopted to link a ‘positive-negative’ axis of behaviour to network matrices in data from the Human Connectome Project (Smith et al., 2015). CCA allows the comparison of a set of variables obtained from rfMRI (such as network matrices of edges) to a set of behavioural variables by estimating independent linear transformations for the two sets of variables such that they are maximally correlated. Here, we replicated this previous work in a larger subject sample (almost double the number of individuals), and adopt CCA to determine which key aspect of rfMRI data is uniquely associated with behaviour.
Parcellation methods that can be used to estimate network matrices include the use of anatomical, functional, and multi-modal atlases (Glasser et al., 2016; Tzourio-Mazoyer et al., 2002; Yeo et al., 2011), with functional parcellations often being data driven via techniques such as clustering and independent component analysis (ICA) (Beckmann et al., 2005; Craddock et al., 2012). Data-driven approaches such as ICA have been used to identify consistent large-scale resting state networks (Damoiseaux et al., 2006) and to characterise FC abnormalities in a variety of mental disorders (Littow et al., 2015; Pannekoek et al., 2015). Any given parcellation is typically defined at the group level, and hence additional steps are required to map a group-level parcellation onto individual subjects’ data (that has undergone registration to a common space), in order to obtain subject-specific parcel timeseries and associated connectivity edge estimates. Timeseries derived from hard (binary, non-overlapping) parcellations are often obtained using a simple masking approach (i.e. extracting the averaged BOLD timeseries across all voxels or vertices in a node), whereas ICA parcellations (partially overlapping, soft parcellations that contain continuous weights) are mapped onto single-subject data using dual regression analysis or back projection (Calhoun et al., 2001; Filippini et al., 2009). The first stage of a dual regression approach involves multiple spatial regression of group ICA maps into each preprocessed individual dataset to obtain subject-specific timeseries; the second stage is a multiple temporal regression of these stage one timeseries into the same preprocessed dataset to obtain subject-specific spatial maps. Note, dual regression is, to some extent, expected to underestimate subject-specific spatial variability because it involves post-hoc regressions of a group-level set of spatial maps, which are unlikely to be an accurate model for the data of individual subjects. Indeed, previous work has shown that, in the presence of spatial variability or inaccurate intersubject alignment, these common methods for mapping group parcellations onto individuals do not recover accurate subject-specific functional regions, and this can severely impact the accuracy of estimated timecourses and derived FC edges (Allen et al., 2012; Smith et al., 2011).
More recently, several studies have developed more thorough characterisations of the patterns of spatial variability in network topography across subjects (i.e. spatial shape, size and position of functional regions) (Glasser et al., 2016; Gordon et al., 2017a, 2017b; Laumann et al., 2015; Swaroop Guntupalli and Haxby, 2017; Wang et al., 2015). For example, Glasser et al. showed that the subject-specific spatial topology of area 55b in relation to the frontal and premotor eye fields substantially diverged from the group average in 11% of subjects (Glasser et al., 2016). In addition, the size of all cortical areas, including large ones like V1, varies by twofold or more across individuals (Amunts et al., 2000; Glasser et al., 2016). This extensive presence of spatial variability across individuals highlights the need for analysis methods that are adaptive and better able to accurately capture functional regions in individual subjects. Another approach that aims to achieve a more accurate subject-specific description of this spatial variability is PROFUMO, which simultaneously estimates subject and group probabilistic functional mode (PFM) maps and network matrices (instead of separate parcellation and mapping steps). Specifically, PROFUMO is a matrix factorisation model that decomposes data into estimates of subject-specific spatial maps, time courses, and amplitudes using a variational Bayesian approach with both spatial and temporal priors that seek to optimise for both spatial map sparsity and temporal dynamics consistent with haemodynamically regularised neural activity (Harrison et al., 2015). PROFUMO adopts a hierarchical approach by iteratively optimising subject and group estimates (instead of first estimating group components using group ICA and separately mapping these onto subjects using dual regression), and is therefore expected to more accurately capture subject-specific spatial variability than does dual regression. Other approaches are available to obtain group and subject parcellations in one step, for example using a groupwise normalised cut spectral clustering approach (Shen et al., 2013). In the present study, we show that the spatial variability across subjects captured in PFMs is strongly associated with behaviour.
Conceptually, network edges are commonly thought of as reflecting coupling strength between spatially separated neuronal populations. However, as discussed above, edge estimates are highly sensitive to spatial misalignments across individuals. Additionally, correlation-based edge estimates are influenced by the amplitudes of localised spontaneous rfMRI fluctuations (Duff et al., 2018), which have been shown to capture trait variability across subjects, and state variability within an individual over time (Bijsterbosch et al., 2017). These findings demonstrate the sensitivity of edge-strength estimates to many different types of subject variability, and highlight the need to identify which aspects of FC tap most directly into behaviourally relevant population-level variability. Here, we investigate the complex relationships between different features of an rfMRI dataset and also the associations with variability across individuals in terms of their performance on behavioural tests, their lifestyle choices, and demographic information. Using data from the Human Connectome Project (HCP), we provide evidence for systematic differences in the spatial organisation of functional regions. We then use simulations that manipulate aspects of the data such that, for example, only cross-subject spatial variability is present in the data (i.e. by fixing edge strength to be the group average for each individual) to investigate whether these differences reflect meaningful cross-subject information and drive edge estimates for several common FC approaches.
To determine whether a given rfMRI-derived FC measure contains meaningful cross-subject information rather than random variability, we adopted an approach that makes use of the extensive set of behavioural, demographic, and lifestyle data acquired in the HCP. Our first analysis aims to determine which measures obtained from rfMRI and task data most strongly relate to interesting behavioural variability across individuals. Using Canonical Correlation Analysis (CCA), we extracted population modes of cross-subject covariation that represent maximum correlations between combinations of variables in the subject behavioural measures and in the fMRI-derived measures, uncovering multivariate relationships between brain and behaviour. For example, previous work has used CCA on HCP data to identify a mode of population covariation that linked a positive-negative axis of behavioural variables to patterns of FC edge strength (Smith et al., 2015). A specific pattern of connectivity, primarily between ‘task-negative’ (default mode) regions (Raichle et al., 2001), was found to be linked to scores on positive factors such as life satisfaction and intelligence, and inversely associated with scores on negative factors such as drug use.
CCA works by finding a linear combination of behavioural measures (V) that is maximally correlated with a linear combination of rfMRI-derived measures (U). CCA scores for each subject are obtained for the behavioural and fMRI-derived measures (V and U), which represent the subject’s position along the population continuum for the latent CCA variable(s). The key result of a CCA analysis for each mode of covariation is the correlation between U and V, denoted rUV, which describes the strength of the multivariate brain-behaviour relationship. Given that CCA explicitly optimises rUV, it is essential to perform permutation testing in order to test the significance of the CCA result. To determine which behavioural measures contribute strongly to the CCA result, V is subsequently regressed into original non-imaging variables (Figure 1B; although interpretation of these results is complicated by behaviour-behaviour correlations). Additionally, U is used to visualise variation at both the population extremes (see Figure 2 below and Figure 2—figure supplements 2–7), and across the full population continuum (Supplementary video files).
We applied a separate CCA analysis for each of the various fMRI-derived measures (including spatial, network matrix, and amplitude measures). The results (Figure 1 and Supplementary file 1a and b) reveal that highly similar associations with behaviour and life factors occur across a wide range of different fMRI-derived measures. Correlating the behavioural subject weights (V) across the different CCA instances in Figure 1 shows that a similar behavioural mode is obtained from the independent instances of CCA (particularly for those CCAs that have a high rU-V and low PU-V; Figure 1—figure supplement 1). Mapping these subject weights onto behaviour through correlation reveals consistent positive associations with, for example, fluid intelligence, life satisfaction, and delayed discounting, and consistent negative correlations with use of tobacco, alcohol and cannabis. All behavioural correlations with mean correlation r>|0.25| (chosen for visualisation purposes) are shown in Figure 1B. The results show that spatial features such as PFM subject spatial maps and subject task contrast maps are strongly associated with behaviour. Overall, these findings reveal that a large variety of fMRI measures have similarly strong associations with behaviour.
Direct comparison between the results in Figure 1—figure supplement 1a) and the HCP_MMP1.0 parcellation (e.g. the 360-region ‘Glasser parcellation’ [Glasser et al., 2016]) and against associated fractional surface area (in native space as a ratio to total surface area, for each of the 360 parcels in the HCP_MMP1.0 parcellation) is challenging due to the large difference in the number of subjects (n = 819 for Figure 1 and n = 441 for HCP_MMP1.0). Therefore, we have included an analysis on all PFM metrics in a reduced number of subjects (the same n = 441 subjects) in order to facilitate direct comparison between these two recent parcellation approaches that both aim to achieve accurate detection of subject-specific spatial boundaries (Supplementary file 1b). These results show that spatial features from a variety of sources (surface area, multimodal parcellation and PFMs) are strongly associated with measures of behaviour and lifestyle. Also note that network matrices obtained by the HCP_MMP1.0 parcellation are more predictive of behaviour than are PFM network matrices.
For correlation-based parcellated FC estimates (network edges), a common assumption is that functional coupling is primarily reflected in the edges. In theory, true network coupling information can be manifested along a continuum ranging from spatial maps to network matrices. On one extreme, coupling information is purely contained in spatial maps, as is the case when performing temporal ICA (where the temporal correlation matrix is by definition the identity matrix [Smith et al., 2012]). On the other extreme, coupling information can be fully contained in network matrices as is often assumed to be the case when using an individualised hard parcellation (however, coupling can only be represented fully in edge estimates if all subjects are perfectly functionally aligned to the parcellation, and if the node timeseries amplitudes do not contain useful cross-subject information). It is likely that the dimensionality of the decomposition may influence this; for example, for a low-dimensional decomposition (into a small number of large-scale networks), much cross-subject variation in functional coupling is likely to occur between sub-nodes of the networks, which is therefore more likely to be represented in the spatial maps, whereas in a higher dimensionality decomposition this information is more likely to be represented in the network matrix. However, the results in Figure 1 show that this CCA mode of population covariation is significantly present in both spatial maps and network matrices for both low- and high-dimensional decompositions (ICA 25 and 200). Therefore, the potential role of dimensionality is not sufficient to explain the common information present in spatial maps, timeseries amplitudes, and network matrices.
The presence of this behaviourally meaningful spatial variability is somewhat surprising, because these data were aligned using a Multimodal Surface Matching (MSM) approach (Robinson et al., 2014; 2018), driven by both structural and functional cortical features (including myelin maps and resting state network maps). MSM has been shown to achieve very good functional alignment compared with other methods, and particularly compared with volumetric alignment approaches or surface-based approaches that use cortical folding patterns rather than areal features (Coalson et al., 2018). However, residual cross-subject spatial variability is still present in the HCP data after the registration to a common surface atlas space (in part due to the constrained parameterisation of MSM and in part because weighted regression subject maps used to drive MSM may not fully capture all spatial variability). In line with this, approaches which are expected to better identify residual subject spatial variability (specifically, PFM spatial maps and subject task contrast maps in Figure 1) show strong correspondence between spatial variability and behaviour/life-factor measures.
To better understand what spatial features represent behaviourally relevant cross-subject information, we visually explored what aspects of the PFM spatial maps contributed to the CCA result in Figure 1 by calculating representative maps at extremes of the CCA mode of population covariation (based on CCA subject scores). While the PFM maps are estimated using the full set of cortical and subcortical grayordinates, we focus on cortical findings because these contribute most strongly to the CCA results. The results reveal complex changes in spatial topography (Figure 2, Figure 2—figure supplements 2–7, and Videos 1–9. For example, comparing left versus right panels shows the right inferior parietal node of the DMN extending farther into the intraparietal sulcus (in the vicinity of area IP1 [Choi et al., 2006; Glasser et al., 2016]) in subjects who score higher on the behavioural positive-negative mode of covariation. Qualitative inspection of Figure 2—figure supplements 2–7 suggests that many of the difference maps show notable bilateral symmetry.
Figure 1 showed that functionally-relevant cross-subject variability is represented in a variety of different measures derived from both resting state and task fMRI. These widespread similarities in correlations with behaviour across a range of measures invite the question of whether the same type of trait variability is meaningfully and interpretably reflected in a wide range of rfMRI measures, or whether (for example) estimates of network matrices may instead primarily reflect trait variability in spatial topography or amplitude (and not coupling strength). Therefore, we wanted to determine to what extent correlation-based FC measures derived from rfMRI can be influenced by specific aspects of the rfMRI data such as true topography and true coupling. To this end, we generated simulated datasets based on the original PFM subjects and/or group spatial maps and timeseries. By holding either the individual (simulated) subjects’ spatial maps or the network matrices fixed to the group average we eliminated specific forms of underlying subject variability from the simulated data (Figure 3). Note, we used PFMs in order to generate simulated data because the PROFUMO model separately estimates spatial maps, network matrices and amplitudes, thereby allowing each aspect to be fixed to the group average prior to generating simulated data using the outer product (as described in detail in Equation (1), and in the section on ‘Creating simulated data’ in the Material and methods). Previous simulation results have shown that PROFUMO is able to accurately estimate spatial maps and network matrices in the presence of cross-subject variability in spatial topography, relative strength of subregions, and between-mode connectivity (Harrison et al., 2015). The aim of the simulation analyses was to determine which features in the rfMRI data are likely to be most strongly reflected in network matrices estimated from rfMRI data. We assess this in terms of the amount of variability across subjects that can be explained, as this is the most relevant application in biomarker studies and in neuroimaging research more generally.
Timeseries were extracted from both the simulated and original datasets, and network matrices were estimated. Each simulated dataset was assessed using three metrics: (i) comparing subject-specific simulated and original network matrices (Znetwork matrix in Table 1), (ii) comparing cross-subject variability in the simulated and original network matrices (Rcorrelation in Table 1), and (iii) determining how much of the cross-subject variability in simulated and original network matrices is behaviourally informative using CCA (see Table 1 legend).
The results (Table 1 and Supplementary file 1c and d) show that, when the subject-varying aspects of the simulations were exclusively driven by spatial changes across subjects (with the predefined network matrix and amplitudes being identical for all subjects), up to 62% (i.e. square of Rcorrelation = 0.79 from Supplementary file 1d ‘maps only’) of the cross-subject variance present in the network matrices obtained from the original data was regenerated. Hence, this finding reveals that very similar network matrices can be obtained for any individual subject even if the only aspect of the rfMRI that is varying across subjects is the topographic information in PFM spatial maps. In addition, the variance that can be explained by spatial maps is behaviourally relevant; the CCA results were similarly strong (typically having the same permutation-based p-values) from simulated network matrices driven purely by spatial changes, compared with those obtained from the original dataset.
The influence of amplitudes on FC estimates was relatively minor (less than 2.5% of variance was explained by amplitude in all our simulations; i.e. square of Rcorrelation = 0.15 from Table 1 ‘amplitudes only’), although, when amplitudes were combined with spatial maps feeding into the simulations, the amplitudes did in most cases result in an increase in original network matrix regeneration.
Given the complex information present in PFM spatial maps, the effect of spatial information on network matrices can result from cross-subject variability in: (i) network size, (ii) relative strength of regions within a given network, or (iii) size and spatial location of functional regions. We performed two further tests to distinguish these influences by thresholding and binarising the subject-specific spatial maps used to create the simulated data. Maps were either thresholded using a fixed threshold (removing the influence of relative strength), or (separately) using a percentile threshold (removing the influence of relative strength and size, as the total number of grayordinates in binarised PFM maps is fixed across subjects and PFMs). The role of subject-varying spatial maps in driving the resulting estimated network matrices remains strong when highly simplified binarised maps are used to drive the simulations (Supplementary file 1e), further supporting our interpretation that the results are largely driven by the shape of the functional regions (i.e. variability in the location and shape of functional regions across subjects), rather than by size or local strength.
The results presented above show that a large proportion of the variance in estimated network matrices is also represented in spatial topography. This suggests either that cross-subject information is represented in both the coupling strength between neural populations and in the ‘true’ underlying spatial topography, or that edge estimates obtained from rfMRI data primarily reflect cross-subject spatial variability (which indirectly drives edge estimates through the influence of spatial misalignment on timeseries extraction, particularly when group parcellations are mapped onto individual subjects in the case of imperfect alignment). To test these hypotheses further, we investigated the unique information contained in spatial maps and network matrices using a set of 15 ICA basis maps derived from HCP task contrast maps (Figure 4A). These basis maps can be thought of as the spatial building blocks that can be linearly combined to create activation patterns for any specific HCP task contrast, and can be considered here to be another functional parcellation.
The advantage of using basis maps derived from task data is that the tasks essentially act as functional localisers that allow for the precise localisation of task-related functional regions within an individual; results at a single-subject level are not influenced in any way, including spatially, by the group results, as they are derived via the standard task-paradigm analysis (i.e. which relies solely on temporal information, and is not influenced by the group-level maps). The equivalence between group- and subject-level contrasts (i.e. the inherent assumption in any group-level analysis, namely that the group ‘2BK-0BK’ contrast map directly relates to any subject-level ‘2BK-0BK’ contrast) means that any combination of group-level contrasts is equally valid as a combination at the subject-level, but with the advantage that the resulting subject maps will be faithful to the precise location of functional regions that the subject-specific contrast maps capture. Hence, subject-based task basis maps are the most accurate description of subject-specific locations of functional regions, at least with respect to those regions identifiable from the range of tasks used.
To investigate the implications of these task-localised maps on typical rfMRI analyses, either group-based task basis maps or subject-based task basis maps were entered into a dual regression analysis against subjects’ resting-state fMRI data to obtain network matrices (from dual regression stage one timeseries) and rfMRI-based spatial maps (from dual regression stage 2) for each subject (Figure 4B). Subsequently, CCA was performed to determine how well each of the group-based and subject-task-based rfMRI maps and network matrices was able to predict behavioural variability. Furthermore, a ‘partial CCA’ was performed to characterise the unique variance that task rfMRI maps carry over and above network matrices, and vice versa. Here, we regressed any variance explained by network matrices out of the spatial maps prior to running the ‘partial CCA’ to determine the unique information contained in spatial maps (and vice versa, i.e., regressed any variance explained by spatial maps out of network matrices before running the ‘partial CCA’).
The results from the CCAs against behavioural measures show that subject-specific spatial maps (derived from either subject- or group-based task-fMRI maps) capture more behavioural information than network matrices (and continue to reach significance in the partial CCA), consistent with the PFM spatial results presented in Figure 1. The full CCA result is marginally stronger (∆ruv=0.005, p=0.46) for group-task-based rfMRI spatial maps compared with subject-task-based rfMRI spatial maps. None of the partial CCA results for network matrices reach significance, suggesting that network matrices do not contain any unique trait-level information that is not also captured by spatial variability. Note that dual regression maps derived from both group-task-based and subject-task-based templates capture unique subject-specific spatial variation in the partial CCA results, consistent with significant CCA results for ICA25 and ICA200 in Figure 1. Importantly, subject-task-based rfMRI network matrices explain the behavioural data considerably less well than group-based task-rfMRI network matrices (difference: p=0.0005 for full network matrices), confirming that spatial information is a significant factor in estimated network matrices. Hence, subject spatial variability is more uniquely represented in the spatial information for subject-task-based estimates, and therefore does not filter through into the network matrices (marked “§”).
Taken together, these results show that, while network matrices obtained from dual regression against group-level maps do contain behaviourally relevant cross-subject information, this can be almost completely explained by variability in spatial topographical features across subjects (to the extent that we can detect it). Hence, dual regression network matrices (obtained from multiple regression against group spatial maps) apparently contain little unique cross-subject information regarding coupling strength that is not also reflected in spatial topographical organisation. However, it is possible that network matrices obtained using parcellation methods and timeseries extraction approaches that are better able to capture subject-specific spatial variability (such as the HCP_MMP1.0 parcellation) do contain unique cross-subject information; further research is needed to test this possibility. Additionally, network matrices may contain unique state-level information relevant to ongoing behaviour (e.g. in a task paradigm).
Here, we have identified a key aspect of rfMRI data that directly reflects interesting variability in behaviour and lifestyle across individuals. Our results indicate that spatial variation in the topography of functional regions across individuals is strongly associated with behaviour (Figure 1). In addition, network matrices (as estimated with masking or dual regression against group-level hard or soft parcellations) reflect little or no unique cross-subject information that is not also captured by spatial topographical variability (Figure 4 and Figure 4—figure supplement 1). This unexpected finding implies that the common interpretation of FC as representing cross-subject (trait) variability in the coupling strength of interactions between neural populations may not be a valid inference (although within-subject state-dependent changes in coupling may still be reflected in FC measures). Specifically, we show that up to 62% of the variance in rfMRI-derived network matrices (a measure commonly taken as a proxy for coupling) can be explained purely by spatial variability. These findings have important implications for the interpretation of FC and may contribute to a deeper mechanistic understanding of the role of intrinsic FC in cognition and disease (Mill et al., 2017).
Our findings are consistent with previous research that has highlighted the presence of structured cross-subject spatial variance in both functional and anatomical networks (Glasser et al., 2016; Gordon et al., 2017a; Noble et al., 2015; Sabuncu et al., 2016; Tong et al., 2017; Xu et al., 2016). Furthermore, recent work has shown that resting state spatial maps can be used to predict task activation maps from individual subjects very accurately (Tavor et al., 2016), and that interdigitated and highly variable subnetworks can be identified within individuals (Braga and Buckner, 2017). Therefore, the presence of behaviourally relevant cross-subject variance in maps of functional (co-) activation in itself is not surprising. However, the fact that these variations in spatial topographical features capture a more direct and unique representation of subject variability than temporal correlations between regions defined by group parcellation approaches (coupling), was unexpected. The implication of this finding is that the cross-subject information represented in commonly adopted ‘connectivity fingerprints’ largely reflects spatial variability in the location of functional regions across individuals, rather than variability in coupling strength (at least for methods that directly map group-level parcellations onto individual data). Specifically, our partial CCA results (Figure 4) show that network matrices (as often estimated) contain little unique trait-level cross-subject information that is not also reflected in the spatial topographical organisation of functional regions.
How the functional organisation of the brain is conceptualised and operationally defined is of direct relevance to the interpretation of these findings. Some hard parcellation models of the human cortex (such as the Gordon and Yeo parcellations [Gordon et al., 2016; Yeo et al., 2011]) aim to fully represent connectivity information in the edges (i.e. correlations between node timeseries). Thus, hard parcellations of this type assume piecewise constant connectivity within any one parcel (i.e. each parcel is assumed to be homogeneous in function, with no state- or trait-dependent within-parcel variability in functional organisation). In contrast, the HCP_MMP1.0 multimodal parcellation presumes within-area uniformity of one or more major features, but overtly recognises within-area heterogeneity in other features, including connectivity, most notably for distinct body part representations (‘sub-areas’) of the somatomotor complex. Soft parcellation models (such as PROFUMO [Harrison et al., 2015]) allow for the presence of multiple modes of (potentially overlapping) functional organisation. Therefore, PFMs represent connectivity information through complex interactions between amplitude and shape in the spatial maps, and through network matrices. Our findings show that both the PROFUMO and the multimodal parcellation models successfully capture behaviourally relevant cross-subject spatial variability (Supplementary file 1b), but that the precise location of where this spatial variability is represented overlaps only modestly between the two approaches (Figure 2—figure supplement 7). Given the differences in the key assumptions made by the two models (i.e. binary parcellation versus multiple modes of functional organisation), this is not unexpected. However, it does highlight the need for further research into the optimal representation of (subject-specific) functional organization in the brain.
For most of the results presented in this work, we estimated spatial information using functional data (either resting or task fMRI data). While a comprehensive investigation of related anatomical features is beyond the scope of this work, we did identify significant correlations between fractional surface area size and subject CCA weights (Figure 2—figure supplement 7). This result suggests that anatomical variability in the cortical extent of a number of higher level sensory and cognitive brain regions may contribute to the overall findings presented here. Further research into the relationship between structural features and functional connectivity measures, and their contribution to trait-level subject variability is needed to test this hypothesis.
Our findings are relevant to a wide variety of approaches used to study connectivity. For example, our simulation results (Table 1 and Supplementary file 1c and d) reveal similar results regardless of whether we adopt a dual-regression or a masking approach to obtain timeseries, and the findings also do not differ qualitatively according to whether full or partial correlation is used to estimate network matrices. Therefore, our findings are relevant to any approach that is based on timeseries extracted from functional regions defined at the group-level (including graph theory methods and spectral analyses). The implications of this work may also extend beyond resting-state fMRI. For example, generative models such as dynamic causal modelling (DCM) are increasingly used to stratify patient populations (Brodersen et al., 2014), and to achieve predictions for individual patients (Stephan et al., 2017). Previous work has shown that including parameters for the position and shape of functional regions in individual subjects into the model improves DCM results and better differentiates between competing models (Woolrich et al., 2009). It is currently unknown to what extent cross-subject variability observed with these timeseries-based fMRI metrics reflects true coupling between neural populations, rather than being indirectly driven by spatial variability and misalignment, but given that many of these studies are conducted using alignment methods that perform substantially worse than the MSMAll surface-based alignment used in this study (Coalson et al., 2018), this is likely a significant confound for such studies. Going forward, it is important to disambiguate the influence of spatial topography to enable the estimation of fMRI measures that uniquely reflect coupling strength between neural populations.
Significant advances have already been made in recent years in order to tackle the issue of spatial misalignment across individuals. For example, the HCP data used in this work were spatially aligned using the multimodal surface mapping (MSM) technique, which achieves very good functional alignment by using features that are more closely tied to cortical areas (although note that, since the time of the HCP release, refinements to the MSM algorithm and regularisation have resulted in further improvements in the observed functional alignment of HCP data [Robinson et al., 2014, 2018]). Therefore, gross misalignment is unlikely to play a role in our results. In fact, some of the behaviourally relevant variability may have been ‘corrected’ in the MSM pipeline prior to our analyses (indeed, the same positive-negative mode of population covariation is identified when running the CCA on MSM warp fields; and the fractional surface area results in Supplementary file 1b and Figure 2—figure supplement 7 reflect the full variability from native space, and are not affected by the alignment accuracy). Therefore, it is possible that the degree to which spatial information may influence FC estimates varies considerably across studies, depending on the spatial alignment algorithm that was used, and the amount of subject spatial variability this has removed. It is encouraging that significant efforts have recently gone into the methods for more accurately estimating the spatial location of functional parcels in individual subjects in recent years (Chong et al., 2017; Glasser et al., 2016; Gordon et al., 2016; Hacker et al., 2013; Harrison et al., 2015; Varoquaux et al., 2011; Wang et al., 2015), and into advanced hyperalignment approaches (Chen et al., 2015; Guntupalli et al., 2016; Guntupalli and Haxby, 2017). The present results highlight the importance of such advances, and call for the continued development, comparison, and validation of such approaches.
In conclusion, we have demonstrated that spatial topography of functional regions are strongly predictive of variation in behaviour and lifestyle factors across individuals, and that timeseries-based methods (as often estimated based on group-level parcellations) contain little unique trait-level information that is not also explained by spatial variability.
For this study, we used data from the Human Connectome Project S900 release (820 subjects with fully complete resting-state fMRI data, 452 male, mean age 28.8 ± 3.7 years old) (Van Essen et al., 2013). Data were acquired across four runs using multiband echo-planar imaging (MB factor 8, TR = 0.72 s, 2 mm isotropic voxels) (Moeller et al., 2010; Uğurbil et al., 2013). Data were preprocessed according to the previously published pipeline that includes tools from FSL, Freesurfer, HCP’s Connectome Workbench, multimodal spatial alignment driven by myelin maps, resting state network maps, and resting state visuotopic maps (‘MSMAll’), resulting in data in the grayordinate coordinate system (Fischl et al., 1999; Glasser et al., 2013, 2016; Jenkinson et al., 2012; Marcus et al., 2013; Robinson et al., 2014; Smith et al., 2013a). ICA-FIX-cleanup was performed on individual runs to reduce structured noise (Griffanti et al., 2014; Salimi-Khorshidi et al., 2014). ICA-FIX achieves 99% sensitivity and 99% specificity on HCP data when compared to manual classification by trained raters (Smith et al., 2013a). Only subjects with the full 4800 resting state timepoints (4 scans of 1200 TRs each) were included for the analyses performed in this work. A detailed overview of quality assessment in the Human Connectome Project was previously published (Marcus et al., 2013).
HCP data are freely available from https://db.humanconnectome.org. The version of MSMAll that is compatible with the approach implemented for the alignment of HCP data can be found here: http://www.doc.ic.ac.uk/~ecr05/MSM_HOCR_v2/ (Robinson et al., 2017). Matlab code used in this work can be found here: https://github.com/JanineBijsterbosch/Spatial_netmat (Bijsterbosch, 2017; copy archived at https://github.com/elifesciences-publications/Spatial_netmat). Data from many figures in this study is freely available at https://balsa.wustl.edu/study/show/kKM0.
In order to obtain estimates of the spatial shape and size of functional networks for every subject, we decompose the HCP data into a set of probabilistic functional modes (PFMs) via the PROFUMO algorithm (Harrison et al., 2015). A set of PFMs describe each subject’s data (grayordinates; time points; ) in terms of a set of subject-specific spatial maps (), amplitudes () and timecourses (), all of which are linked via the outer product model:
These subject-specific decompositions are linked by a set of hierarchical priors. In the spatial domain, the group-level parameters encode the grayordinate-wise means, variances and sparsity of the subject maps, while in the temporal domain, the group-level priors constrain the subject-level network matrices (note that the component amplitudes and hierarchical priors are recent extensions to the PFMs model and were not included in the original PROFUMO paper [Harrison et al., 2015]). The PROFUMO framework gives us sensitive estimates of key subject-level parameters, while ensuring that there is direct correspondence between PFMs across subjects.
PROFUMO was run on the rfMRI data from all 820 subjects with a dimensionality of 50 PFMs. Importantly, the signal-subspace of any given subject’s dataset can be straightforwardly reconstructed from a set of modes via equation , and this can be used to generate the simulated data as described below.
For the ICA decompositions, amplitudes were estimated for each subject and component as the temporal standard deviation of the timeseries obtained from stage 1 of a dual regression analysis. Full and regularised partial correlation matrices were also calculated from these timeseries. The Tikhonov regularisation rho used during estimation of the partial correlation matrices was set to 0.01 for the ICA 25, 200 and PFM data (according to previous optimisation results). For high-dimensional parcellations (Yeo and HCP_MMP1.0), the rho was optimised by finding the maximum correlation between subject and group-average (using rho = 0.01) network matrices across a range of rho (0.01:0.5), leading to rho = 0.03 for Yeo and rho = 0.23 for HCP_MMP1.0 results. Lastly, the subject spatial maps obtained from stage 2 of a dual regression analysis were used. Similarly, for the PROFUMO decomposition, the PFM amplitudes, subject spatial maps and timeseries were used. For the HCP_MMP1.0 spatial results, either group-level or subject-specific node parcellations were used (Hacker et al., 2013). The subject-specific parcellations contain missing nodes (parcels) in some subjects (Glasser et al., 2016). Hence, for partial network matrices, the rows and columns in the covariance matrix were set to the scaled group average prior to inverting the covariance matrix. In the resulting network matrices, the rows and columns relating to missing nodes were set to the group average (for both partial and full network matrices). Before performing CCA, missing nodes were accounted for by estimating the subject-by-subject covariance matrix one element at a time, ignoring any missing nodes for any pair of subjects. The nearest valid positive-definite covariance matrix was subsequently obtained using nearest SPD in Matlab (http://uk.mathworks.com/matlabcentral/fileexchange/42885-nearestspd), prior to performing singular value decomposition as described below.
Each CCA analysis finds a linear combination of behavioural and life-factor measures (V) that is maximally correlated with a linear combination of rfMRI-derived measures (U) (Hotelling, 1936): . Y is the set behavioural measures, and X are the rfMRI-derived measures (i.e. spatial maps, or network matrices, or signal amplitudes), ~indicates that U and V are approximately equal. A and B are optimised such that the correlation between U and V is maximal. Summary measures from CCA include the correlation between (paired columns of) U and V, and the associated p-values (derived from permutation testing over n = 100,000 permutations) for the first one or more CCA modes.
To create the inputs to the CCA, a set of nuisance variables were regressed out of both the behavioural measures and the amplitudes, network matrices and spatial maps, as done in (Smith et al., 2015). Subject covariance matrices were subsequently estimated for the amplitudes, network matrices and for all spatial maps (by summing the covariance matrices of individual spatial maps). Then, a singular value decomposition was performed on the subject covariance matrices and the first 100 eigenvectors were entered into the CCA (against 100 eigenvectors obtained from behavioural variables as explained in Smith et al., 2015).
In addition to reporting the CCA results for the strength of the canonical correlation between imaging and non-imaging measures and the associated p-value (rU-V and PU-V), we also report the correlation between the CCA subject weights and the weights for the ICA 200 partial network matrices (rU-Uica). The reason for including this correlation is to facilitate direct comparison to previously published CCA results from HCP data (Smith et al., 2015). However, this earlier finding should not be taken as the gold standard CCA result. The rU-Uica correlation we report is the maximum correlation found between the first CCA mode from the ICA 200 partial network matrices, and any of the 100 modes of population covariation obtained for the comparison CCA result (i.e. the maximum correlation may not be with the strongest CCA mode).
Confidence intervals for CCA results in Table 1 were obtained using surrogate data for both the brain-based CCA input matrix and the behaviour CCA input matrix. To generate the surrogate data, row and column wise correlations of the original CCA input matrices were maintained using a multivariate normal random number generator (mvnrnd.m in Matlab). A total of 1000 instances of surrogate data were used to obtain 2.5–97.5% confidence intervals around rU-V.
For visualisation and interpretation purposes, we created videos of the spatial variability along the axis of the behavioural CCA mode of population covariation. For this, we took the U resulting from the CCA between PFM spatial maps and behaviour, and created a linearly spaced vector that spans just over the full range of U (extending beyond the lowest and highest measured subject score by 10% of the full range). As the CCA is linear, it is straightforward to project a set of U values back to form a rank-one reconstruction of the original space, which in this case is a set of spatial maps. This sequence of spatial maps is an approximation to the spatial variability that is encoded along the previously reported positive-negative axis. These are used as the frames for Videos 1–9 , and for the illustrative examples shown in Figure 2 and Figure 2—figure supplements 2–7.
The two rfMRI parcellation methods included in Supplementary file 1b (HCP_MMP1.0 and PFM) explicitly aim to capture cross-subject variability in the spatial location of functional regions. The subject spatial maps estimated by both methods are strongly associated with cross-subject behavioural variability (when matching the sample size rU-V did not significantly differ, and subject weights of the strongest CCA results were moderately correlated rU-U = 0.55). Therefore, it is of interest to compare these results in more detail, to determine whether cross-subject variability is represented similarly for the two approaches. Furthermore, given that fractional surface area (the fraction of cortex occupied by each area in the multimodal HCP_MMP1.0 parcellation) was also strongly predictive of behaviour (Supplementary file 1b), we investigated the potential relationship between rfMRI-based PFM weights, multimodally defined cortical areal boundaries (HCP_MMP1.0 parcellation), and structural variation in fractional surface area. To this end, we averaged CCA subject weights obtained from two separate CCA results (PFM spatial maps - behaviour, and HCP_MMP1.0 spatial maps - behaviour). These averaged subject weights were subsequently correlated against fractional surface area, and against subject-specific PFM and HCP_MMP1.0 spatial maps (grayordinate-wise), to investigate which brain regions contribute strongly to the association with behaviour, and to compare these localised effects across methods/modalities.
In order to create simulated datasets for each subject, we took the outer product between PFM spatial maps and timeseries. Compared with data that is completely simulated, this approach has the advantage of keeping many features in the data (such as the types of structured noise that are present, the signal-to-noise ratio, and the autocorrelation structure), while still achieving investigator control of specific aspects of interest. Data from each run (1200 time points) was processed separately through the simulation pipeline, including the following steps:
Each original PFM subject timecourse was set to unit variance, and the variances were retained.
The ZCA whitening transform (Bell and Sejnowski, 1997) was used to remove any correlations between timeseries:
Timeseries were modified such that the induced correlation matched a pre-specified structure.: . In the simulations that use a fixed group network matrix, this pre-specified correlation structure was estimated by projecting the S900 group average HCP dense connectome (following Wishart Rolloff) onto the group PFM spatial maps.
At this stage, the variances of the original timeseries are restored . This gives a set of simulated timeseries which have all the same properties as the reference timeseries (), except for their correlation structure.
We modify the inferred PFMs by selectively setting some of the parameters to their group averages. For example, if we set , where is the mean over all 820 subject maps, then we can eliminate any spatial variability across subjects. Similarly, we can set the temporal correlations to a fixed group mean using the procedure described above to remove any variability in FC across subjects. In order to remove amplitude variability across subjects, we add in group averaged variances instead of the subject variances. These simulated PFMs are then described by the simulated maps, amplitudes and timeseries, namely , and .
Finally, the full data can be reconstructed as per : . Spatio-temporally white-noise (with variance matched to the original data) is added to the activity described by the simulated modes to give a dataset that preserves the properties of the original data, but, crucially, one where we have direct control over where in the model subject variability can appear.
Once the simulated data is generated for each run, we extracted timeseries from both the simulated and original data using two different approaches that are commonly adopted in the literature. Dual regression analysis was performed using the group ICA maps that were estimated using the (original) HCP group data, and that are freely available with the S900 data release (www.humanconnectome.org). Two dimensionalities were tested, so for each simulated dataset dual regression was performed against 25 and against 200 group ICA components. The timecourses estimated in stage 1 of the dual regression analysis were used to compute network matrices (Filippini et al., 2009; Nickerson et al., 2017). Mean timeseries were also extracted from a set of 108 binary regions of interest (ROIs) based on the Yeo parcellation, and from the HCP_MMP1.0 group parcellations and individual subject parcellations (Glasser et al., 2016). The 108 Yeo ROIs were obtained from the 17-network parcellation (Yeo et al., 2011), by separating each of the 17 networks into individual contiguous regions that had a surface cluster area of at least 20 mm2. Timecourses were used to estimate full and regularised partial correlation network matrices using FSLnets (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/FSLNets). Z-transformation was applied to the network matrices before further comparisons. The network matrices derived from simulated data are compared against network matrices calculated from the original data as described below.
Firstly, we compare the simulated network matrix to the original network matrix for each subject, to determine how similar the measured FC is. For each subject, the node-by-node full or regularised partial network matrix estimated from the simulated data is reshaped into a single column after removing the diagonal and is correlated against the reshaped original estimated network matrix. Prior to reshaping the simulated and original network matrices, the respective group average network matrix (simulated or original) is subtracted from the subject network matrix, so that the subsequent correlation is sensitive to the unique subject variability instead of being driven by the group connectivity patterns. As such, a correlation coefficient between demeaned simulated and original network matrices is estimated for each subject. The Fisher r-to-z transform was applied to these correlations before averaging across subjects. This first test assesses how different a subject is from the group (and the similarity of this difference between original and simulated network matrices), and therefore does not test for cross-subject variability.
Secondly, the subject-by-subject correlation matrix was estimated from the subject-wise simulated network matrices. Again, this matrix was reshaped into a vector after discarding the diagonal and was correlated against the reshaped subject-by-subject correlation matrix obtained from the original network matrices. The aim of this test was to directly compare the cross-subject variability present in the simulated and original data, which is very important given that variability across subjects is typically of primary interest in FC research. Hence, this analysis aims to compare the cross-subject variability in original or simulated network matrices, as opposed to comparing the similarity of original and simulated network matrices within an individual subject (as is the case for the preceding approach).
The last test of the simulated network matrices was to perform a CCA against the set of behavioural and life-factor measures (Smith et al., 2015). A CCA was performed on the simulated network matrices against the subject behavioural measures as described below. To assess the CCA results, we report the correlation between U and V (for the first, strongest mode of population covariation), the associated permuted p-value (n = 100,000 permutations, respecting family structure), and the maximum correlation between any of the simulated U and the first U obtained when using the original ICA 200 dimensionality partial network matrices describing the positive-negative mode of covariation (Smith et al., 2015).
The PFM subject spatial maps contain a relatively complex set of information. This may include relative differences in amplitude in different brain regions that are part of the same mode, which effectively reflect connectivity rather than spatial shape and size. In order to exclude these potential connectivity-related aspects of the spatial maps and isolate the role of spatial shape, we simplified the spatial maps for some of the simulations presented. For this, the spatial maps were thresholded at a very liberal threshold of 1 (arbitrary units specific to the PFM algorithm) and binarised. The sign was retained such that grayordinates in the subject PFM maps with values > 1 were set to one and grayordinates with values <-1 were set to −1 and all others to zero. A liberal threshold was purposefully used as we wanted to retain extended (broad, low) shape information, and just remove any information encoded in the (relative) grayordinate amplitudes. Using a fixed threshold across subjects retains cross-subject variability in the size of networks. To further remove this source of information and focus purely on the shape of networks, we applied a percentile threshold such that the size of networks is fixed across subjects (grayordinates >95th percentile set to one and grayordinates <5th percentile set to −1, leading to each individual PFM map having the same size of 4564 1s and 4564 –1s across all subjects). The results of simulations where the maps were modulated in this way prior to calculating the simulation’s space-time outer product are presented in Supplementary file 1e, including results for which the maps were both thresholded and binarised, percentile thresholded and binarised, and also results for maps that were thresholded (at 1) but not binarised.
Given that variability between subjects is of primary interest in rfMRI research, this analysis aimed to directly compare the cross-subject variability present in a range of measures obtained from the original data. Between-subject correlation matrices were calculated from network matrices (ICA25, ICA200 and PFM50), from PFM amplitudes and from spatial maps (ICA25 and ICA200 dual regression stage two spatial maps, and PFM50 spatial maps). These subject by subject correlation matrices were reshaped after discarding the diagonal, and full and partial correlations were calculated between the subject correlation matrices (Figure 4—figure supplement 1).
To obtain a basis set of spatial maps based on task contrast data, we performed a spatial ICA (with a dimensionality of 15) on the concatenated group-averaged task contrast maps (a total of 86 maps, 47 of which are unique). The ICA dimensionality was determined based on the proportion variance explained in the PCA data reduction step (99.0% for d = 15). Spatial ICA was performed on the group-average task contrasts maps to avoid the correspondence problem that would arise if ICA were applied separately to individual subject task contrast maps. This resulted in a set of ICA weights (15*86), which describe the contribution of each task contrast map to each extracted ICA component. The outer product of these weights with either the group-averaged contrast maps or the corresponding subject-specific contrast maps was used to obtain maps to drive subsequent dual regression analysis. Dual regression analysis (driven by either group-averaged or subject-specific task basis maps after normalising the maximum of each subject and component map to 1) was run against subject resting state data to obtain timeseries and maps. CCA against behaviour was performed separately on the resulting network matrices and spatial maps as described above. Additionally, partial CCA was performed to determine the unique information contained in network matrices and in spatial maps. For this, any variance explained by network matrices was regressed out of the spatial maps and vice versa (i.e. was ‘partialled out’), before running the ‘partial CCA’. Specifically, the 100 eigenvectors used as the input matrix to the CCA (as explained above and following [Smith et al., 2015]) for partial network matrices were regressed out of the 100 eigenvectors for the spatial maps before running CCA, or conversely the 100 eigenvectors for spatial maps were regressed out of the 100 eigenvectors for the network matrices before running CCA.
Investigations into resting-state connectivity using independent component analysisPhilosophical Transactions of the Royal Society B: Biological Sciences 360:1001–1013.https://doi.org/10.1098/rstb.2005.1634
The "independent components" of natural scenes are edge filtersVision Research 37:3327–3338.https://doi.org/10.1016/S0042-6989(97)00121-1
Functional connectivity in the motor cortex of resting human brain using echo-planar MRIMagnetic Resonance in Medicine 34:537–541.https://doi.org/10.1002/mrm.1910340409
Dissecting psychiatric spectrum disorders by generative embeddingNeuroImage: Clinical 4:98–111.https://doi.org/10.1016/j.nicl.2013.11.002
A Reduced-Dimension fMRI Shared Response ModelIn: C Cortes, N. D Lawrence, D. D Lee, M Sugiyama, R Garnett, editors. Advances in Neural Information Processing Systems, 28. Curran Associates, Inc. pp. 460–468.
Cytoarchitectonic identification and probabilistic mapping of two distinct areas within the anterior ventral bank of the human intraparietal sulcusThe Journal of Comparative Neurology 495:53–69.https://doi.org/10.1002/cne.20849
A whole brain fMRI atlas generated via spatially constrained spectral clusteringHuman Brain Mapping 33:1914–1928.https://doi.org/10.1002/hbm.21333
Disambiguating brain functional connectivityNeuroImage, 10.1016/j.neuroimage.2018.01.053, 29476911.
Functional and effective connectivity in neuroimaging: A synthesisHuman Brain Mapping 2:56–78.https://doi.org/10.1002/hbm.460020107
A model of representational spaces in human cortexCerebral Cortex 26:2919–2934.https://doi.org/10.1093/cercor/bhw068
Family income, parental education and brain structure in children and adolescentsNature Neuroscience 18:773–778.https://doi.org/10.1038/nn.3983
The anatomical basis of functional localization in the cortexNature Reviews Neuroscience 3:606–616.https://doi.org/10.1038/nrn893
Parcellating cortical functional networks in individualsNature Neuroscience 18:1853–1860.https://doi.org/10.1038/nn.4164
FMRI Dynamic Causal Modelling with Inferred Regions of InterestS168, FMRI Dynamic Causal Modelling with Inferred Regions of Interest, 47.
The organization of the human cerebral cortex estimated by intrinsic functional connectivityJournal of Neurophysiology 106:1125–1165.https://doi.org/10.1152/jn.00338.2011
Chris HoneyReviewing Editor
In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.
Thank you for submitting your article "The relationship between spatial configuration and functional connectivity of brain regions" for consideration by eLife. Your article has been reviewed by four peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by Sabine Kastner as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: Jakob Seidlitz (Reviewer #2); Daniel S Margulies (Reviewer #3).
The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.
You can see the specific comments from each reviewer below. During our discussion, the reviewers agreed that this was potentially an important result. We felt that the manuscript would be substantially improved if your team could:
i) Provide more information about the location (regions and subnetworks), scale and direction of the spatial variation that is most consistently observed across participants;
ii) Clarify the logic behind the central claim [that spatial variation accounts for the behaviorally-relevant variation in edge-strength] and show that the analytics are not biased to translate non-spatial sources of variability into spatial sources of variability;
iii) Justify the use of low-dimensional decompositions in some analyses and show whether fine-grained parcellations could ameliorate (or exacerbate) the misattribution effects that are reported;
iv) Provide more information about which specific behavioral metrics covary with the spatial/morphological variation.
We also recommend trying to make the manuscript (especially the Introduction and Results) more readable for those who are familiar with fMRI and functional connectivity methods, but who are not experts in the individual differences sub-field and who have not yet read the Materials and methods in detail.
This manuscript presents a potentially interesting study on the spatial arrangement of functional regions and how they interact, and the relationship of this to non-imaging measures. Overall I think I agree with the premise of the paper (anatomy and function are not independent nor completely separable) and find the result of interest to the brain-behaviour community. However, the paper is really hard to read/follow and its unclear to what extent the authors have proven their point.
The difficulty in following this also makes it very difficult to evaluate the significance of the findings. For example in the first simulation topographic information is added to the HCP data and then it is shown that topographic information accounts for some of the variance. It's not clear how this wouldn't be the case since that is what was added. While the simulation may indeed show that topology accounts for a significant fraction of the variance, this doesn't actually show that this is what is happening in real data. In general, the idea that spatial topology matters, while likely true, is poorly supported by an ad-hoc combination of ICA, parcellation, and PFMs.
Besides the intractable writing there are at least two major concerns:
1) The idea that individualized parcellations can reveal information has been shown previously. Comparing networks derived from group-parcellations with individualized spatial maps is not really a fair comparison. While it makes a nice argument for performing individualized functional parcellations the comparison as set up is not that interesting nor informative. To show that the correlation matrices (networks) do not add any unique information, one would need to show that the differences in parcellations (ICA-component definitions) across subjects can predict behavior better than, or comparable to, networks derived from the individualized functional maps. This is not what was done however.
2) It also appears to be the case that the authors derive the truest subject-specific map (Figure 4) from a group-level ICA. This is mentioned briefly in the Materials and methods section and is necessary to preserve the correspondences in ICA components across subjects. It appears that to maintain correspondence ICA was first performed at the group level and then brought to the individual level. It is not clear that this is a fair approach.
The CCA's in Figure 1 are interesting but somewhat unclear. Independent of the concerns with the simulation expressed above, the interpretation of Figure 1 seems to be that the functional data is confounded by the topographic information but this sort of summary does not show that. I think the authors are implying that the high CCA for topography means that this is the source of the functional CCA but these do not have to be from the same source. The summary data needs to be broken down further to prove that argument.
The only functional (nonICA atlas used) is the Yeo atlas, which was somehow extended from 17 regions up to 109, but this is really a low resolution atlas. It has been shown by many groups that functional atlases of this order (such as AAL) are very poor and there are numerous publicaly available atlases with many more nodes such as the Shen atlas (Shen, 2013, which should be mentioned in the Introduction) 268 nodes, the Glasser atlas (360 nodes), which is discussed but not apparently used. This is particularly important given the comment in subsection “Cross-subject information in fMRI-derived measures” that the dimensionality of the decomposition may influence the results.
While ICA 200, is high relative to the low 25 ICA it is still not high relative to many functional parcellation schemes.
There are many other questions and concerns about various steps in the analysis but these are too numerous to catalog with the manuscript in its current form.
This manuscript directly assesses the relationship between underlying spatial topography and functional connectivity. In general, from my knowledge of the literature, this is the first time that such a direct comparison has been performed in the context of brain-behavior comparisons. Overall, I thought the manuscript was well-written and the analyses well-defined and well-executed, and I believe that this topic (and these findings) are of particular relevance to the wider neuroimaging community, especially given the widespread use of this dataset from the Human Connectome Project. Some notable remarks include the use of high-quality publicly available data from a large cohort, and the reproducibility of the findings across parcellations (and thresholding/binarization). Please find below comments for the authors to consider.
1) What was the justification for the use of a dimensionality of 15 for the creation of the ICA basis maps? It would be good to add this to the Materials and methods section "Unique contribution of topography versus coupling" or within the Results (say in subsection “Unique contribution of topography versus coupling”).
2) How consistent were the loadings of the behavior CCA outputs from each of the comparisons with brain (i.e., Figure 1 in this manuscript)? Is it possible to quantify these relationships across the comparisons in this work? It would be good to evaluate this empirically, with reference to the CCA loadings in Smith et al., 2015 as reference in this manuscript. In addition to this quantitative analysis, the manuscript would benefit from more of a discussion (and potentially a supplemental figure) about the resultant behavior axes derived from the CCA (i.e., similar to Figure 1 from Smith et al., 2015).
3) Given the focus/topic of this manuscript, and as briefly touched on in the discussion (and Figure 1—figure supplement 1) about the relationship between fractional surface area and subject CCA weights, it would be of great interest to the audience of this manuscript to quantify and report the relationship between basic morphological features (for example, cortical thickness, surface area, volume, myelin maps, and/or curvature) and the spatial rfMRI maps/networks. Clearly spatial variability explains a great deal of variance in rfMRI-derived network matrices, but it would be good to compare these maps to classical morphological metrics, which are derivable from these data.
4) I would highly recommend the authors include a section (in the Materials and methods) about the data quality assessment and quality control that was performed, as well as elaborate on the preprocessing procedures. This would be of particular interest to those that would like to try to reproduce these findings or replicate them in an independent dataset.
The study by Bijsterbosch and colleagues addresses the important question of what aspect of functional connectivity varies across individuals. Prior work has largely investigated variance in the strength or amplitude of functional connectivity, however, the current findings demonstrate that much of that variance is better explained by differences in spatial topography. The analyses are conducted on data made available by the Human Connectome Project, and include validation using several network decomposition and parcellation approaches.
The topic of this study is of high relevance to the field, and my primary recommendation to the authors would be to provide more extensive illustration of the topographic variance in the main manuscript. Specifically, Figure 2 is an excellent example of the type of spatial variance underlying individual differences in functional connectivity measures. Further figures depicting the spectrum of spatial variance (based on the maps shown in the supplementary videos, for example) would further help the reader understand the nature of these patterns of interindividual difference. In addition, an additional figure that summarizes the topographic variance across the cortex, along the lines of Figure 1—figure supplement 1, would be a helpful addition to the current manuscript.
It is important to show that the PROFUMO decomposition is not biased to ascribe variance to the spatial mode when that variance arises elsewhere (e.g. from changes in inter-regional correlation). Please include a simulation which explicitly tests this idea (or explicitly describe for the reader the simulations from the prior PROFUMO papers that have demonstrated this fact). It is particularly important to demonstrate this for the case of overlapping generator ROIs. For, example, one could run a simulation on a 40 by 40 voxel grid, with Gaussian sources A, B, C, D located with centers as (10,10), (10,30), (30,10) and (30,30) respectively. The sources are expressed in space as Gaussians with s.d.s of (say) 10 units so that the signals from different sources do overlap. Now, we suppose that the four source signals are generated as a coupled linear system with some coupling matrix. We then ask whether changing the coupling matrix (without changing the spatial profile of the sources) leads PROFUMO to change its inferred spatial modes.For example, in condition 1, the B source is correlated with the C source, and the A source is correlated with the D source. In condition 2, the B source is additionally correlated with the A source. Are the _shapes_ of the sources recovered by Profumo different in Condition 1 and Condition 2? This is important to show, because otherwise the spatial variation inferred by PROFUMO may actually arise from changes in the true coupling matrix. Are the spatial maps the same even under conditions where the true coupling strengths fluctuate around their means over time?
The prediction of behavioral properties using fMRI is a central aspect of this manuscript, and yet there is almost no description of what aspects of human behaviors are being predicted. Which aspects of task performance are being well-captured? In subsection “Cross-subject information in fMRI-derived measures” there is reference to the in "positive-negative mode of covariation" but there is no description of what this means concretely.
Some of the writing in the Results section seems to assume that the readers have already read the Materials and methods, and yet the Materials and methods appear after the results. I suggest providing the readers with a bit more methodological context in the following subsections, to help them understand what was done:
In the Introduction section, when introducing the PROFUMO method, please elaborate a little more on what it does and how it works;
Relatedly, when comparing PROFUMO with dual regression approach, please provide readers with a brief description of the dual regression method, so that they can understand why it may not succeed in accurately recovering subject-specific networks;
Introduction section: "simulations that manipulate aspects of the data" – please be more precise about what is manipulated in the simulations;
Figure 1 – this Figure could be more effective if there was a graphical depiction of how the predictions are made from each of these fMRI-derived measures; as it is now, the reader arrives at this point in the manuscript with little idea of what is meant by any of the row-labels in this Figure;
Results section, paragraph three – once again the reader will have difficulty understanding what is being described because the relevant methods have not yet been laid;
Subsection “Spatiotemporal simulations demonstrating potential sources of variability in edges”; "thereby allowing each aspect to be fixed to the group average prior to generating simulated data using the outer product" – at this point in the manuscripts readers do not have any understanding why an outer product is relevant, or which matrix equation is being referenced;
The authors have quite convincingly shown that spatial variation in fMRI sources across subjects is an important challenge and widespread confound. Please provide some elaboration on your comments in the penultimate paragraph of the Discussion and provide the reader with some specific suggestions on how this problem might be overcome, and in what contexts it is more or less perilous. Is PROFUMO the answer? Is functional co-registration using localizers the answer? Could brain-wide functional hyper-alignment / shared response modeling provide another solution to the spatial variation problem? e.g. Chen et al., 2015.https://doi.org/10.7554/eLife.32992.036
- David C Van Essen
- Stephen M Smith
- Christian F Beckmann
- Stephen M Smith
- Stephen M Smith
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Data were provided by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University. CFB acknowledges support from The Netherlands Organization for Scientific Research (NWO, grant no 864.12.003). We are grateful for funding from the Wellcome Trust (grants 098369/Z/12/Z and 091509/Z/10/Z). The Wellcome Centre for Integrative Neuroimaging is supported by core funding from the Wellcome Trust (203139/Z/16/Z).
Human subjects: HCP data were acquired using protocols approved by the Washington University institutional review board. Informed consent was obtained from subjects. Anonymised data are publicly available from ConnectomeDB (db.humanconnectome.org; Hodge et al., 2016). Certain parts of the dataset used in this study, such as the age of the subjects, are available subject to restricted data usage terms, requiring researchers to ensure that the anonymity of subjects is protected (Van Essen et al., 2013).
- Chris Honey, Reviewing Editor
© 2018, Bijsterbosch et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.