Peer review process
Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, and public reviews.
Read more about eLife’s peer review process.Editors
- Reviewing EditorAndre MarquandRadboud University Nijmegen, Nijmegen, Netherlands
- Senior EditorAndre MarquandRadboud University Nijmegen, Nijmegen, Netherlands
Reviewer #1 (Public review):
Summary:
In this study, the authors advance our understanding of neurodevelopmental changes in the brain's structural and functional connectivity, as well as their coupling. The paper presents evidence of alterations in and stability of the principal organizational gradients of structure and function across development (age) and contrasts them between neurotypical and neurodivergent individuals. The authors further extend their findings by exploring links with graph theory measures of brain connectivity and indices of nodal structure-function coupling. Finally, the developmental shifts in structural and functional brain organization are examined for potential associations with cognitive and psychopathological markers. The results suggest that structure-function coupling, both brain-wide and within specific functional networks, is associated with certain cognitive dimensions but not with measures of psychopathology.
Strengths:
This manuscript makes a significant contribution to the field by synthesizing previous research while offering novel insights into the developmental trajectories of brain organization. A key strength of this study lies in its integration of both structural and functional connectivity data, providing a comprehensive view of brain changes throughout development. The authors present findings that challenge earlier reports of shifts in principal gradients during late childhood and early adolescence (e.g., Dong et al., 2021; Xia et al., 2022), underscoring an important inconsistency that could have broader implications for our understanding of developmental brain reorganization. The introduction and discussion sections are well-crafted, offering a thorough review of relevant prior studies and effectively situating the current findings within the broader context of the literature. Additionally, the study design and methodology are detailed and adhere to recommended best practices, demonstrating a commendable level of rigor in the formulation of the study and its various assessments.
Weaknesses:
Despite these strengths, I think there are aspects of the manuscript that would benefit from further refinement. Below is detailed feedback and suggestions provided point-by-point.
Lack of Sensitivity Analyses for some Key Methodological Decisions:
Certain methodological choices in this manuscript diverge from approaches used in previous works. In these cases, I recommend the following: (i) The authors could provide a clear and detailed justification for these deviations from established methods, and (ii) supplementary sensitivity analyses could be included to ensure the robustness of the findings, demonstrating that the results are not driven primarily by these methodological changes. Below, I outline the main areas where such evaluations are needed:
- Use of Communicability Matrices for Structural Connectivity Gradients: The authors chose to construct structural connectivity gradients using communicability matrices, arguing that diffusion map embedding "requires a smooth, fully connected matrix." However, by definition, the creation of the affinity matrix already involves smoothing and ensures full connectedness. I recommend that the authors include an analysis of what happens when the communicability matrix step is omitted. This sensitivity test is crucial, as it would help determine whether the main findings hold under a simpler construction of the affinity matrix. If the results significantly change, it could indicate that the observations are sensitive to this design choice, thereby raising concerns about the robustness of the conclusions. Additionally, if the concern is related to the large range of weights in the raw structural connectivity (SC) matrix, a more conventional approach is to apply a log-transformation to the SC weights (e.g., log(1+𝑆𝐶𝑖𝑗)), which may yield a more reliable affinity matrix without the need for communicability measures.
- Individual-Level Gradients vs. Group-Level Gradients: Unlike previous studies that examined alterations in principal gradients (e.g., Xia et al., 2022; Dong et al., 2021), this manuscript focuses on gradients derived directly from individual-level data. In contrast, earlier works have typically computed gradients based on grouped data, such as using a moving window of individuals based on age (Xia et al.) or evaluating two distinct age groups (Dong et al.). I believe it is essential to assess the sensitivity of the findings to this methodological choice. Such an evaluation could clarify whether the observed discrepancies with previous reports are due to true biological differences or simply a result of different analytical strategies.
- Procrustes Transformation: It is unclear why the authors opted to include a Procrustes transformation in this analysis, especially given that previous related studies (e.g., Dong et al.) did not apply this step. I believe it is crucial to evaluate whether this methodological choice influences the results, particularly in the context of developmental changes in organizational gradients. Specifically, the Procrustes transformation may maximize alignment to the group-level gradients, potentially masking individual-level differences. This could result in a reordering of the gradients (e.g., swapping the first and second gradients), which might obscure true developmental alterations. It would be informative to include an analysis showing the impact of performing vs. omitting the Procrustes transformation, as this could help clarify whether the observed effects are robust or an artifact of the alignment procedure. (Please also refer to my comment on adding a subplot to Figure 1)
- SC-FC Coupling Metric: The approach used to quantify nodal SC-FC coupling in this study appears to deviate from previously established methods in the field. The manuscript describes coupling as the "Spearman-rank correlation between Euclidean distances between each node and all others within structural and functional manifolds," but this description is unclear and lacks sufficient detail. Furthermore, this differs from what is typically referred to as SC-FC coupling in the literature. For instance, the cited study by Park et al. (2022) utilizes a multiple linear regression framework, where communicability, Euclidean distance, and shortest path length are independent variables predicting functional connectivity (FC), with the adjusted R-squared score serving as the coupling index for each node. On the other hand, the Baum et al. (2020) study, also cited, uses Spearman correlation, but between raw structural connectivity (SC) and FC values. If the authors opt to introduce a novel coupling metric, it is essential to demonstrate its similarity to these previous indices. I recommend providing an analysis (supplementary) showing the correlation between their chosen metric and those used in previous studies (e.g., the adjusted R-squared scores from Park et al. or the SC-FC correlation from Baum et al.). Furthermore, if the metrics are not similar and results are sensitive to this alternative metric, it raises concerns about the robustness of the findings. A sensitivity analysis would therefore be helpful (in case the novel coupling metric is not similar to previous ones) to determine whether the reported effects hold true across different coupling indices.
Methodological ambiguity/lack of clarity in the description of certain evaluation steps:
Some aspects of the manuscript's methodological descriptions are ambiguous, making it challenging for future readers to fully reproduce the analyses based on the information provided. I believe the following sections would benefit from additional detail and clarification:
- Computation of Manifold Eccentricity: The description of how eccentricity was computed (both in the results and methods sections) is unclear and may be problematic. The main ambiguity lies in how the group manifold origin was defined or computed. Specifically:
(1) In the results section, it appears that separate manifold origins were calculated for the NKI and CALM groups, suggesting a dataset-specific approach.
(2) Conversely, the methods section implies that a single manifold origin was obtained by somehow combining the group origins across the three datasets, which seems contradictory.
Moreover, including neurodivergent individuals in defining the central group manifold origin is conceptually problematic. Given that neurodivergent participants might exhibit atypical brain organization (as suggested by Fig. 1), this inclusion could skew the definition of what should represent a typical or normative brain manifold. A more appropriate approach might involve constructing the group manifold origin using only the neurotypical participants from both the NKI and CALM datasets. Given the reported similarity between group-level manifolds of neurotypical individuals in CALM and NKI, it would be reasonable to expect that this combined origin should be close to the origin computed within neurotypical samples of either NKI or CALM. As a sanity check, I recommend reporting the distance of the combined neurotypical manifold origin to the centers of the neurotypical manifolds in each dataset. Moreover, if the manifold origin was constructed while utilizing all samples (including neurodivergent samples) I think this needs to be reconsidered.
- Computation of SC-FC coupling: As noted in a previous comment, the explanation of this procedure is vague. The description lacks detail on the specific steps taken and differs from previous standard approaches in the field. I suggest clarifying the methodology and comparing with previous SC-FC coupling metrics.
- Performing Procrustes transformation: The brief explanation in the first paragraph of page 30 does not provide enough information about the procedure or its justification. Since the Procrustes transformation alters the shape of individual gradients, it could artificially inflate consistency across development. I recommend including a rationale for using the Procrustes transformation and conducting a sensitivity analysis to assess its impact on the findings. Additionally, clarifying how exactly the transformation was applied to align gradients across hemispheres, individuals, and or datasets would help resolve ambiguity.
Insufficient Supporting Evaluations for Certain Claims:
There are instances where additional analyses are necessary to substantiate the claims made in the manuscript. Without these evaluations, some conclusions may be premature or potentially misleading. I believe the following points need further analysis or, alternatively, adjustments to the claims:
- Evaluating the Consistency of Gradients Across Development: The results shown in Fig. 1.e are used as evidence suggesting that gradients are consistent across ages. However, I believe additional analyses are required to identify potential sources of the observed inconsistency compared to previous works. The claim that the principal gradient explains a similar degree of variance across ages does not necessarily imply that the spatial structure of the gradient remains stable. The observed variance explanation is hence not enough to ascertain inconsistency with findings from Dong et al., as the spatial configuration of gradients may still change over time. Moreover, the introduction of the Procrustes transformation (not used by Dong et al.) further ambiguates the cause of this inconsistency. I suggest the following additional analyses to strengthen this claim: (1) Alignment to Group-Level Gradients: Assess how much of the variance in individual FC matrices is explained by each of the group-level gradients (G1, G2, and G3, for both FC and SC). This analysis could be visualized similarly to Fig. 1.e, with age on the x-axis and variance explained on the y-axis. If the explained variance varies as a function of age, it may indicate that the gradients are not as consistent as currently suggested. (2) For each individual's gradients (G1, G2, and G3, separately for FC and SC, without Procrustes transformation), evaluate their spatial similarity to the corresponding group-level gradients using a similarity metric (e.g., correlation coefficient). High spatial similarity, without a Procrustes transformation, would support the claim of stable gradient structures across development. On the other hand, if the similarities alter during development (e.g. such that at a certain age, individual G1 is less similar to group G1) this would contradict the stability of gradients during development. These additional analyses could potentially be included as additional panels in Fig. 1. In case significant deviations are observed, it might help refine the interpretation of the results and provide a more nuanced understanding of developmental changes in gradient organization.
- Prediction vs. Association Analysis: The term "prediction" is used throughout the manuscript to describe what appear to be in-sample association tests. This terminology may be misleading, as prediction generally implies an out-of-sample evaluation where models trained on a subset of data are tested on a separate, unseen dataset. If the goal of the analyses is to assess associations rather than make true predictions, I recommend refraining from using the term "prediction" and instead clarifying the nature of the analysis. Alternatively, if prediction is indeed the intended aim (which would be more compelling), I suggest conducting the evaluations using a k-fold cross-validation framework. This would involve training the Generalized Additive Mixed Models (GAMMs) on a portion of the data and testing their predictive accuracy on a held-out sample (i.e., different individuals). Additionally, the current design appears to focus on predicting SC-FC coupling using cognitive or pathological dimensions. This is contrary to the more conventional approach of predicting behavioral or pathological outcomes from brain markers like coupling. Could the authors clarify why this reverse direction of analysis was chosen? Understanding this choice is crucial, as it impacts the interpretation and potential implications of the findings.
Methodological considerations
- In typical applications of diffusion map embedding, sparsification (e.g., retaining only the top 10% of the strongest connections) is often employed at the vertex-level resolution to ensure computational feasibility. However, since the present study performs the embedding at the level of 200 brain regions (a considerably coarser resolution), this step may not be necessary or justifiable. Specifically, for FC, it might be more appropriate to retain all positive connections rather than applying sparsification, which could inadvertently eliminate valuable information about lower-strength connections. Whereas for SC, as the values are strictly non-negative, retaining all connections should be feasible and would provide a more complete representation of the structural connectivity patterns. Given this, it would be helpful if the authors could clarify why they chose to include sparsification despite the coarser regional resolution, and whether they considered this alternative approach (using all available positive connections for FC and all non-zero values for SC). It would be interesting if the authors could provide their thoughts on whether the decision to run evaluations at the resolution of brain regions could itself impact the functional and structural manifolds, their alteration with age, and or their stability (in contrast to Dong et al. which tested alterations in high-resolution gradients).
The Issue of Abstraction and Benefits of the Gradient-Based View:
- The manuscript interprets the eccentricity findings as reflecting changes along the segregation-integration spectrum. Given this, it is unclear why a more straightforward analysis using established graph-theory measures of segregation-integration was not pursued instead. Mapping gradients and computing eccentricity adds layers of abstraction and complexity. If similar interpretations can be derived directly from simpler graph metrics, what additional insights does the gradient-based framework offer? While the manuscript argues that this approach provides "a more unifying account of cortical reorganization," it is not evident why this abstraction is necessary or advantageous over traditional graph metrics. Clarifying these benefits would strengthen the rationale for using this method.
Reviewer #2 (Public review):
Summary:
This study aims to show how structural and functional brain organization develops during childhood and adolescence using two large neuroimaging datasets. It addresses whether core principles of brain organization are stable across development, how they change over time, and how these changes relate to cognition and psychopathology. The study finds that brain organization is established early and remains stable but undergoes gradual refinement, particularly in higher-order networks. Structural-functional coupling is linked to better working memory but shows no clear relationship with psychopathology.
Strengths:
This study effectively integrates two different modalities (structural and functional) to identify shared patterns. It is supported by a relatively large dataset, which enhances its value and robustness.
Weaknesses:
General Comments:
- The introduction is overly long and includes numerous examples that can distract readers unfamiliar with the topic from the main research questions.
- While the methods are thorough, it is not always clear whether the optimal approaches were chosen for each step, considering the available data.
Detailed Comments:
- The use of COMBAT may have excluded extreme participants from both datasets, which could explain the lack of correlations found with psychopathology.
- Some differences in developmental trajectories between CALM and NKI (e.g., Figure 4d) are not explained. Are these differences expected, or do they suggest underlying factors that require further investigation?
- There is no discussion of whether the stable patterns of brain organization could result from preprocessing choices or summarizing data to the mean. This should be addressed to rule out methodological artifacts.