White Matter Stratification in Depression Predicts Multidimensional Antidepressant Responses

Jiaolong Qin; Xinyi Wang; Huangjing Ni; Ye Wu; Haiyan Liu; Lingling Hua; Rui Yan; Hao Tang; Peng Zhao; Zhijian Yao; Qing Lu

doi:10.7554/eLife.110078.1

eLife Assessment

This study presents valuable findings for identifying biotypes of depression patients using white matter measures, which are under-utilised and under-appreciated in current biological and computational psychiatry work. The evidence supporting the claims is solid, although enhanced interpretability of the identified biotypes across both white matter and symptom levels, and better justification of the choice of models would strengthen the paper. Overall, this study will be of interest to the broad community of neuroimagers, clinicians, and biological and computational psychiatry researchers.

https://doi.org/10.7554/eLife.110078.1.sa3

Significance of findings

valuable: Findings that have theoretical or practical implications for a subfield

Strength of evidence

solid: Methods, data and analyses broadly support the claims with only minor weaknesses

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate).

Abstract

Background

Major depressive disorder (MDD) is clinically heterogeneous, posing a persistent challenge for personalized treatment. While neuroimaging offers a promising path, existing symptom-based stratification schemes have proven inadequate in predicting antidepressant response. Crucially, studies focusing on white matter (WM) heterogeneity, a potential source of neurobiological subtypes, have failed to address this critical gap. Here, we bridge this divide by investigating WM-based MDD subtypes and their predictive value for treatment outcomes.

Methods

We used non-negative matrix factorization biclustering of diffusion MRI data from 311 MDD patients (discovery: n=209; validation: n=102) to identify neuroanatomical subgroups with distinct WM microstructural signatures. Subgroups were characterized via neuroanatomical profiling, clinical phenotyping (symptom domains/treatment responses), and WM-symptom associations. Baseline WM features were used to predict 4-week treatment outcomes (overall/dimension-specific symptom reduction) across five antidepressant therapies using support vector regression.

Results

Three robust MDD subgroups emerged: (1) frontoparietal-corticospinal alterations linked to anxiety/hopelessness; (2) cerebellar-visual circuit disruptions tied to cognitive-psychomotor deficits; (3) fornix-centered abnormalities associated with attenuated symptom severity. Subgroup-specific WM networks predicted treatment outcomes with high cross-cohort consistency (discovery: r=0.24–0.58; validation: r=0.27–0.67; all p<0.05), notably for cognitive symptoms (max r=0.59). Importantly, baseline WM patterns—converging on limbic/default mode networks—reflected neuroplasticity reserve, enabling generalizable prediction across mechanistically distinct therapies.

Conclusions

Our findings establish WM-derived biotypes as robust, pathophysiologically distinct subtypes of MDD and validate baseline WM topology as a biomarker capable of predicting antidepressant treatment response, potentially by reflecting an individual’s neuroplasticity reserve.

Introduction

Major depressive disorder (MDD) is a highly heterogeneous condition (American Psychiatric Association, 2013; Fried and Nesse, 2015) with poorly understood pathophysiology (Fried and Nesse, 2015), which sustains a pervasive trial-and-error approach to treatment (Fried, 2017; James et al., 2018). Hence, reliable pretreatment predictors are critically needed to enable personalized therapy, minimize unnecessary medication exposure, and improve resource allocation (Kraus et al., 2019; Szegedi et al., 2009). Initial attempts to forecast antidepressant outcomes have primarily depended on symptom-based stratification, using either self-report or clinician-rated scales to categorize patients and predict treatment response (Arnow et al., 2015; Kato et al., 2020). Although these clinical measures offer practical utility and are widely implemented in routine practice, they provide minimal insight into MDD’s underlying neurobiological mechanisms. This fundamental limitation stems from the non-specific nature of symptoms: identical clinical presentations may arise from disparate biological origins, while shared pathophysiology can manifest as divergent symptom profiles. Consequently, symptom-driven prediction models typically prove inadequate for guiding treatment selection. For example, Arnow et al. (2015) categorized MDD patients into melancholic, anxious, and atypical subtypes, yet observed significant subgroup overlap and no differential treatment response, ultimately concluding that such classifications possess minimal clinical value for personalized antidepressant selection.

In response to the limitations of symptom-based approaches, research has increasingly turned to neuroimaging as a promising alternative, as it allows for biologically anchored stratification by mapping symptoms onto discrete neural substrates. The central challenge involves identifying MDD biotypes characterized by distinctive functional or structural brain patterns, thereby facilitating pathophysiology-specific treatments. Crucially, as Feczko et al. (2019) emphasize, meaningful subtype definitions must correlate with specific clinical or mechanistic outcomes. Pioneering neuroimaging-based work has demonstrated potential for prediction; notably, Drysdale et al. (2017) delineated four fMRI-based biotypes predictive of transcranial magnetic stimulation response. However, many existing studies still fail to demonstrate such authentic predictive value, instead predominantly reporting post-hoc correlations (Kennis et al., 2020; Whelan and Garavan, 2014). Furthermore, these problems have been compounded by methodological limitations such as inadequate sample sizes and a lack of external validation (Cohen et al., 2021; Kraus et al., 2019).

Beyond these general methodological issues, this emerging field faces two distinct yet critical limitations that our study aims to address. On the one hand, diffusion MRI (dMRI) remains markedly underutilized for MDD subtyping. Despite its direct relevance to neural circuitry, only one study has employed white matter (WM) microstructural abnormalities to characterize heterogeneity (Liang et al., 2019). On the other hand, conventional clustering methodologies dominate current stratification approaches: the brain patterns of the derived subtypes are obtained only afterwards through group-level comparative analyses, potentially obscuring intrinsic feature-sample relationships crucial for subtype definition. A noteworthy exception is the fMRI biclustering investigation by Tokuda et al. (2018), which identified three MDD subtypes with unique functional connectivity patterns. Unlike traditional clustering, biclustering reveals localized associations between specific sample and feature subsets, which may be attenuated in global clustering models using all features.

To overcome these limitations and to translate an improved neurobiological taxonomy into clinically actionable predictors, we examine three key questions: (1) How to stratify MDD subtypes while identifying their unique WM patterns during clustering; (2) How these subtypes differ in their WM profiles and clinical presentations; (3) Whether baseline WM characteristics can predict both overall symptom reduction and domain-specific treatment responses (e.g., anxiety/somatization, cognitive impairment) across diverse therapeutic interventions. The neurobiological rationale for cross-treatment outcome prediction rests on WM pathology serving as a common pathway through which various interventions (e.g., pharmacotherapy, neuromodulation) restore neural circuit function (Long et al., 2025; Mao et al., 2025). Baseline WM architecture reflects neuroplasticity reserve—the inherent capacity for circuit reorganization that underlies treatment response (Castrén and Hen, 2013; Gazerani, 2025; Marzola et al., 2023)—thereby enabling prediction across therapeutic modalities. Our analytical framework addresses these questions through: (1) non-negative matrix factorization (NMF) based biclustering to identify subtype-specific WM-patient relationships; (2) comprehensive characterization of subtypes in discovery and validation cohorts via neuroanatomical profiling and clinical phenotyping; and (3) systematic evaluation of baseline symptom-WM associations, development of baseline WM-based predictive treatment effect models, and assessment of their generalization ability in an external validation dataset.

Results

Data-driven MDD subgroup identification

Three clinically distinct and robust subgroups emerged from the discovery cohort analysis, corresponding to the optimal rank k=3 (cophenetic correlation = 0.96; Supplementary Figure S1). Subgroup 1 (n=32) exhibited WM alterations across 5,730 voxels, Subgroup 2 (n=129) involved 5,711 voxels, and Subgroup 3 (n=48) comprised 5,566 voxels. Demographic characteristics (e.g., age, sex) showed no significant between-subgroup differences (p>0.05, Table 1).
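The subgroup identification step can be illustrated with a minimal NMF biclustering sketch. It assumes a non-negative matrix X with voxels in rows and patients in columns, plain multiplicative updates, and assignment of each patient and each voxel to its dominant factor; the function names, the restart count, and the toy data are hypothetical, and the actual pipeline may differ in all of these respects.

```python
import numpy as np

def nmf(X, k, n_iter=300, seed=0, eps=1e-9):
    """Approximate X (features x samples) as W @ H using multiplicative updates."""
    rng = np.random.default_rng(seed)
    W = rng.random((X.shape[0], k)) + eps
    H = rng.random((k, X.shape[1])) + eps
    for _ in range(n_iter):
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

def nmf_bicluster(X, k, n_restarts=5, eps=1e-9):
    """Keep the best of several random restarts, then label samples and features."""
    best = None
    for seed in range(n_restarts):
        W, H = nmf(X, k, seed=seed)
        err = np.linalg.norm(X - W @ H)
        if best is None or err < best[0]:
            best = (err, W, H)
    _, W, H = best
    # Remove the scale ambiguity between W and H before taking the argmax.
    scale = np.linalg.norm(W, axis=0) + eps
    W, H = W / scale, H * scale[:, None]
    sample_labels = H.argmax(axis=0)   # each patient -> dominant factor
    feature_labels = W.argmax(axis=1)  # each voxel   -> dominant factor
    return sample_labels, feature_labels

# Toy data (hypothetical): 60 "voxels" x 30 "patients" with three planted blocks.
rng = np.random.default_rng(42)
X = 0.05 * rng.random((60, 30))
for b in range(3):
    X[b * 20:(b + 1) * 20, b * 10:(b + 1) * 10] += 1.0

sample_labels, feature_labels = nmf_bicluster(X, k=3)
```

Because samples and features are labeled from the same factorization, each subgroup comes with its own feature subset, which is the property that distinguishes biclustering from clustering patients on all features at once.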

Clinical characteristics of each subgroup in the discovery dataset.

In the independent validation cohort, the subgroup stratification demonstrated high robustness, as evidenced by a cophenetic correlation of 0.96. This high value, with <5% variation in individual assignments across 100 NMF iterations, indicates that the derived taxonomy yields consistent and stable clusters. Subgroup distribution mirrored patterns from the discovery phase: Subgroup 1 (n=19), Subgroup 2 (n=64), and Subgroup 3 (n=19), with preserved subgroup-specific WM spatial signatures.
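As a rough illustration of how stability across repeated NMF runs can be summarized, the sketch below builds a consensus matrix from several label vectors and computes its cophenetic correlation with SciPy. The average-linkage choice and the toy label runs are assumptions for illustration, not details taken from the study.

```python
import numpy as np
from scipy.cluster.hierarchy import cophenet, linkage
from scipy.spatial.distance import squareform

def consensus_matrix(label_runs):
    """Fraction of runs in which each pair of samples shares a cluster."""
    runs = np.asarray(label_runs)                    # (n_runs, n_samples)
    same = runs[:, :, None] == runs[:, None, :]      # pairwise co-clustering per run
    return same.mean(axis=0)

def cophenetic_correlation(consensus):
    """Correlation between consensus distances and their dendrogram distances."""
    dist = 1.0 - consensus
    np.fill_diagonal(dist, 0.0)
    condensed = squareform(dist, checks=False)
    tree = linkage(condensed, method="average")      # assumed linkage method
    corr, _ = cophenet(tree, condensed)
    return float(corr)

# Hypothetical runs: cluster names are permuted between runs, memberships are not.
stable_runs = [[0, 0, 1, 1, 2, 2], [1, 1, 0, 0, 2, 2], [2, 2, 1, 1, 0, 0]]
# Hypothetical runs in which one sample switches cluster in one of four runs.
noisy_runs = [[0, 0, 1, 1, 2, 2]] * 3 + [[0, 1, 1, 1, 2, 2]]

stable_score = cophenetic_correlation(consensus_matrix(stable_runs))
noisy_score = cophenetic_correlation(consensus_matrix(noisy_runs))
```

A value near 1 means the consensus matrix is close to a clean block structure, that is, patients are almost always grouped with the same peers regardless of the random initialization.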

Their demographic characteristics are presented in Table 2. Notably, Subgroup 3 consistently exhibited the lowest mean 24-item Hamilton Depression Rating Scale (24-HAMD) total scores across discovery and validation cohorts, potentially indicating attenuated symptom severity in this neuroanatomical subtype.

Clinical characteristics of each subgroup in the external independent dataset.

Multidimensional subgroup characterization

Neuroanatomical characterization of MDD subgroups

The excellent clustering stability facilitated precise identification of subgroup-specific WM signatures (Figure 1). Anatomical location assignment using the Natbrain and LNAO_SWM79 atlases (Catani and Thiebaut De Schotten, 2008; Guevara et al., 2017) revealed distinct microstructural profiles (the detailed anatomical distribution is provided in Supplementary Table S2):

Subgroup 1: Predominantly superficial fibers concentrated in frontoparietal regions (pars opercularis, superior/precentral gyri) with long-association involvement in corticospinal tracts, corpus callosum, and right cingulum/inferior longitudinal fasciculus (ILF).

Subgroup 2: Cerebellar-focused pathology (cortico-ponto-cerebellar tracts, cerebellar peduncles) combined with distributed long-association alterations (bilateral cingulum/ILF) and diffuse superficial fibers across multiple lobes.

Subgroup 3: Fornix-centered abnormalities with complementary long-association changes (corpus callosum, cingulum) and superficial fibers in parieto-frontotemporal regions.

The results of WM signature extraction of the three subgroups.
Panels a), b), and c) show coronal views of the WM signatures of Subgroups 1, 2, and 3, respectively.

Analysis of mean fractional anisotropy (FA) values within the subgroup-specific WM signatures demonstrated consistent cross-cohort differences (Figures 2h and 3h). Subgroups 1 and 3 showed elevated FA relative to healthy controls (HC) (p≤0.001), whereas Subgroup 2 exhibited reduced FA (p<0.004).

Clinical phenotypic profiles

(1) Clinical symptomatology across MDD subgroups

Analyses of 24-HAMD profiles revealed distinct symptom patterns across subgroups, which were partly reproduced between the discovery and validation cohorts (Figure 2a-g, 3a-g). In the discovery cohort, significant between-subgroup differences emerged for 24-HAMD total scores (p=0.02) and three symptom dimensions: psychomotor retardation (p=0.022), sleep disturbance (p=0.002), and hopelessness (p=0.043). Radar plot visualization showed that Subgroup 1 was characterized by elevated anxiety/somatization, cognitive impairment, and hopelessness; Subgroup 2, by pronounced cognitive impairment, retardation, and hopelessness; and Subgroup 3, by selective anxiety/somatization elevation.

Validation analyses confirmed core phenotype stability: Retardation (p=0.046) and hopelessness (p=0.015) remained significantly differentiated. Subgroup-specific severity patterns persisted, with Subgroup 1 remaining characterized by anxiety/somatization and hopelessness; Subgroup 2 by cognitive impairment, retardation, and hopelessness; and Subgroup 3 by anxiety/somatization.

Notably, sleep disturbance differentiation did not replicate in validation, suggesting state-dependent variability. The cross-cohort correlation for symptom profiles was r = 0.958 (p < 1.951e-08), supporting subgroup consistency.
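The text does not state how the cross-cohort correlation was computed. One plausible reading, sketched below, is a Pearson correlation between the subgroup-by-dimension mean symptom scores of the two cohorts (3 subgroups × 5 dimensions = 15 paired values). All numbers in the arrays are hypothetical placeholders, not values from the study.

```python
import numpy as np

# Hypothetical mean scores: rows = Subgroups 1-3, columns = the five symptom
# dimensions (anxiety/somatization, cognitive impairment, retardation,
# sleep disturbance, hopelessness).
discovery_means = np.array([
    [6.1, 4.0, 5.2, 3.1, 4.4],
    [5.0, 4.6, 5.9, 3.0, 4.5],
    [5.8, 3.2, 4.1, 2.9, 3.0],
])
validation_means = np.array([
    [6.0, 4.2, 5.0, 3.3, 4.5],
    [5.1, 4.5, 6.0, 2.8, 4.4],
    [5.9, 3.1, 4.3, 3.0, 3.2],
])

# Flatten both profile matrices and correlate the 15 paired entries.
profile_r = float(np.corrcoef(discovery_means.ravel(),
                              validation_means.ravel())[0, 1])
```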

The results of clinical symptomatology and mean FA values across MDD subgroups in the discovery dataset.
a) Five symptom dimensions visualized using a radar chart. b-f) Between-group differences in scores for the five symptom dimensions (anxiety/somatization, cognitive impairment, retardation, sleep disturbance, and feeling of hopelessness) across the three subgroups. h) Comparison of the mean FA values of the three WM signature patterns between each subgroup and healthy controls.

The results of clinical symptomatology and mean FA values across MDD subgroups in the external validation dataset.
a) Five symptom dimensions visualized using a radar chart. b-f) Between-group differences in scores for the five symptom dimensions (anxiety/somatization, cognitive impairment, retardation, sleep disturbance, and feeling of hopelessness) across the three subgroups. h) Comparison of the mean FA values of the three WM signature patterns between each subgroup and healthy controls.

(2) Differential treatment responses across MDD subgroups

Analysis of treatment outcomes revealed distinct response patterns among the three MDD subgroups in both discovery and validation cohorts. In the discovery cohort (Table 1), response rates showed a trend-level difference (p=0.09), with Subgroup 2 demonstrating the highest proportion of improvers (115/121, 95.0%) compared to Subgroup 1 (28/32, 87.5%) and Subgroup 3 (41/48, 85.4%). Treatment-stratified analysis revealed significant variation in response patterns (p=0.04). Selective Serotonin Reuptake Inhibitor (SSRI) monotherapy responses were most frequent in Subgroup 2 (49 cases). Serotonin-Norepinephrine Reuptake Inhibitor (SNRI) monotherapy showed limited efficacy across all subgroups. Repetitive Transcranial Magnetic Stimulation (rTMS) augmentation demonstrated subgroup-specific effects, particularly when combined with SSRIs in Subgroup 2 (28 cases). Electroconvulsive Therapy (ECT) showed relatively uniform responses across subgroups. Baseline 24-HAMD scores differed significantly between subgroups (p=0.02), suggesting varying baseline severity levels. While recurrent episodes and illness duration measures showed no significant differences, effective illness duration was significantly different among subgroups (p=0.02).
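The discovery-cohort response rates quoted above follow directly from the reported counts, as the short check below shows. The counts are taken as reported; note that the denominator for Subgroup 2 (121) is smaller than the subgroup size given earlier (n=129), a difference the text does not explain.

```python
# (improvers, patients with outcome data) as reported for the discovery cohort
reported_counts = {
    "Subgroup 1": (28, 32),
    "Subgroup 2": (115, 121),
    "Subgroup 3": (41, 48),
}

# Response rate in percent, rounded to one decimal place as in the text.
response_rates = {
    name: round(100 * improved / total, 1)
    for name, (improved, total) in reported_counts.items()
}
```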

In the validation cohort (Table 2), response rates were consistently high across all subgroups (89.5–98.4%) without significant differences (p=0.20). None of the baseline clinical characteristics showed significant inter-subgroup differences in this cohort. Treatment-specific responses were more uniform in validation, with no significant subgroup differences (p=0.53), though similar numerical patterns emerged: SSRI monotherapy remained most effective in Subgroup 2 (9 cases); rTMS combinations showed broader efficacy; ECT maintained relatively consistent response rates.

Treatment outcome prediction

Distinct Symptom-WM associative networks across MDD subgroups

The 10 sparse canonical correlation analysis (sCCA) results robustly identified distinct symptom correlates for each subgroup’s WM signature. Subgroup 1 specifically correlated with anxiety/somatization, Subgroup 2 with cognitive impairment and retardation, and Subgroup 3 with all three of these symptom domains (detailed association results are presented in Supplementary Table S1). Based on these symptom-related WM signatures, affected WM networks were constructed for each subgroup. Figure 4 illustrates the affected WM connectivity patterns observed in each MDD subgroup. Notably, the three subgroups exhibited distinct distributions of altered WM networks: Subgroup 1 primarily demonstrated disruptions in default mode network B, subcortical network, dorsal attention A, limbic networks A/B, salience/ventral attention A, somato-motor A, and central visual network; Subgroup 2 showed predominant alterations in the cerebellar network, default mode networks B/C, limbic A, salience/ventral attention A, dorsal attention A/B, and both peripheral and central visual networks; Subgroup 3 was characterized by WM connectivity changes predominantly involving default mode network B, limbic A, salience/ventral attention A, somato-motor A/B, and visual networks (peripheral and central).

Brain WM networks associated with major clinical symptoms at the average group-level.
The thickness of each connection reflected the proportion of edge presence across subgroups. TempPar - temporal parietal, DefaultC - default C, DefaultB - default B, DefaultA - default A, ContC - control C, ContB - control B, ContA - control A, LimbicA - limbic A, LimbicB - limbic B, SalVentAttnB - salience/ventral attention B, SalVentAttnA - salience/ventral attention A, DorsAttnB - dorsal attention B, DorsAttnA - dorsal attention A, SomMotB - somatomotor B, SomMotA - somatomotor A, VisPeri - peripheral visual, VisCent - central visual, Subcor - subcortical network.

Predictive model performance

In the discovery dataset, degree centrality measures derived from the affected WM networks demonstrated significant predictive capacity for treatment outcomes across all three subgroups (Figure 5). Specifically, these network-based features predicted percentage reductions in 24-HAMD total scores (all subgroups, r = 0.28–0.46, p < 0.05), anxiety/somatization (all subgroups, r = 0.38–0.58, p < 0.007), cognitive impairment (all subgroups, r = 0.24–0.55, p < 0.03), retardation (all subgroups, r = 0.33–0.52, p < 0.009), hopelessness (all subgroups, r = 0.23–0.43, p < 0.05), and sleep disturbance (Subgroups 2 and 3, r = 0.27–0.43, p < 0.009).
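For readers unfamiliar with the feature used here, the following sketch computes degree centrality from a small connectivity matrix. The text does not specify whether a binary or a weighted definition was used, so both are shown; the matrix values (imagined streamline counts between four regions) are hypothetical.

```python
import numpy as np

def degree_centrality(adjacency, weighted=False):
    """Degree centrality per node of an undirected network.

    Binary version: fraction of the other nodes each node is connected to.
    Weighted version: sum of the edge weights attached to each node.
    """
    A = np.asarray(adjacency, dtype=float)
    A = np.maximum(A, A.T)          # enforce symmetry (undirected network)
    np.fill_diagonal(A, 0.0)        # ignore self-connections
    if weighted:
        return A.sum(axis=1)
    return (A > 0).sum(axis=1) / (A.shape[0] - 1)

# Hypothetical 4-region network: edges 0-1 (5), 0-2 (3), and 2-3 (1).
toy_network = np.array([
    [0, 5, 3, 0],
    [5, 0, 0, 0],
    [3, 0, 0, 1],
    [0, 0, 1, 0],
])

binary_degree = degree_centrality(toy_network)
weighted_degree = degree_centrality(toy_network, weighted=True)
```

Each patient would contribute one such vector of nodal values, which then serves as the feature vector for outcome prediction.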

Prediction of antidepressant treatment outcomes using degree centrality of affected WM network in the discovery dataset.
Columns represent six antidepressant treatment outcome measures: percentage reduction in 24-HAMD total score and in five core symptom domains (anxiety/somatization, cognitive impairment, retardation, sleep disturbance, and feeling of hopelessness). Rows indicate subgroups (i.e., 1, 2, and 3). * p≤0.05, ** p≤0.01, *** p≤0.001

External validation in an independent cohort (Figure 6) affirmed the generalizability of these findings. Specifically, successful predictions were maintained for percentage reductions in: 24-HAMD total score (all subgroups, r = 0.28 - 0.60, p < 0.042), cognitive impairment (all subgroups, r = 0.29 - 0.59, p < 0.03), anxiety/somatization (Subgroup 1 and 2, r = 0.27 - 0.53, p < 0.04), retardation (Subgroup 2 and 3, r = 0.30 - 0.67, p < 0.03), and sleep disturbance (Subgroup 2, r = 0.27, p = 0.043), as well as feeling of hopelessness (Subgroup 1, r = 0.62, p = 0.021).

Prediction of antidepressant treatment outcomes using degree centrality of affected WM network in the external validation dataset.
Columns represent six antidepressant treatment outcome measures: percentage reduction in 24-HAMD total score and in five core symptom domains (anxiety/somatization, cognitive impairment, retardation, sleep disturbance, and feeling of hopelessness). Rows indicate subgroups (i.e., 1, 2, and 3). N/A indicates the absence of a valid SVR model applicable to the external dataset. * p≤0.05, ** p≤0.01, *** p≤0.001
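The discovery-to-validation prediction scheme can be sketched with scikit-learn as follows. This is a simplified illustration under stated assumptions: the outcome is taken to be (baseline − week 4) / baseline × 100, the kernel and hyperparameters are arbitrary, the features are standardized, and both cohorts are simulated; none of these choices is confirmed by the text.

```python
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR

def percent_reduction(baseline, week4):
    """Percentage symptom reduction (assumed definition)."""
    baseline = np.asarray(baseline, dtype=float)
    week4 = np.asarray(week4, dtype=float)
    return 100.0 * (baseline - week4) / baseline

def fit_and_validate(X_disc, y_disc, X_val, y_val):
    """Fit SVR on the discovery cohort and score it on the validation cohort."""
    model = make_pipeline(StandardScaler(),
                          SVR(kernel="linear", C=1.0, epsilon=0.1))  # assumed settings
    model.fit(X_disc, y_disc)
    y_pred = model.predict(X_val)
    r = float(np.corrcoef(y_val, y_pred)[0, 1])  # predicted vs. observed outcome
    return model, r

# Simulated cohorts: 5 network features with a planted linear signal.
rng = np.random.default_rng(0)

def make_cohort(n):
    X = rng.normal(size=(n, 5))
    y = 50 + 10 * X[:, 0] - 5 * X[:, 1] + rng.normal(scale=3, size=n)
    return X, y

X_disc, y_disc = make_cohort(80)
X_val, y_val = make_cohort(40)
model, r_val = fit_and_validate(X_disc, y_disc, X_val, y_val)
```

Keeping the scaler inside the pipeline means the validation cohort is standardized with statistics learned from the discovery cohort only, which avoids leaking information from the external data into the model.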

Discussion

This study advances the neurobiological parsing of MDD heterogeneity by identifying three robust biotypes with differential WM and clinical profiles through NMF biclustering (cophenetic value = 0.96). We demonstrate that pre-treatment WM topology contains predictive information that is generalizable across independent cohorts and, crucially, across mechanistically distinct treatments. The consistent predictive accuracy (discovery: r = 0.24–0.58; validation: r = 0.27–0.67; all p < 0.05) suggests that these baseline WM patterns reflect a stable neurobiological trait, most plausibly an individual’s neuroplasticity reserve, which underlies and constrains the brain’s capacity for functional recovery regardless of the specific therapeutic intervention.

Neuroanatomical subgroups and pathophysiological implications

Our identification of three neuroanatomical MDD subgroups, characterized by unique WM disruption patterns, provides novel insights into the heterogeneous pathophysiology of depression. Subgroup 1 exhibited frontoparietal-corticospinal alterations, predominantly involving superficial WM fibers—a finding contrasting with traditional deep WM-focused MDD studies. These disruptions align with this subgroup’s prominent anxiety and hopelessness, implicating dysregulated default mode and salience networks in emotional dysregulation (Menon, 2011). Subgroup 2 demonstrated cerebellar-visual circuit abnormalities, including the ILF, suggesting disrupted cerebro-cerebellar loops may underlie its cognitive-psychomotor deficits (Schmahmann et al., 2019). Subgroup 3 showed fornix-centered alterations, a structure integral to memory and reward processing (Benear et al., 2020; Godsil et al., 2013), potentially reflecting impaired hippocampal-prefrontal connectivity associated with attenuated symptom severity.

Notably, all subgroups shared disruptions in major WM tracts (corpus callosum, cingulum, and ILF) with distinct spatial patterns. The corpus callosum, the largest interhemispheric commissure, may be related to the pathogenesis of depressive symptoms in psychiatric disorders including MDD (Cole et al., 2012). The cingulum forms the outer ring of the limbic WM tracts, which are involved in key brain functions such as emotion, executive function, and episodic memory (Bubb et al., 2018; Pascalau et al., 2018). The ILF connects the occipital cortex to the anterior temporal region. As the WM backbone of the ventral visual pathway, it is considered crucial for maintaining a variety of cognitive and emotional processes in the visual modality, particularly in object and face recognition, visual emotion recognition, language, and semantics (Herbet et al., 2018; Zemmoura et al., 2021). Haghshomar et al. (2018) summarized the associations between the ILF and the symptoms of Parkinson’s disease and suggested that this bundle is involved in the recognition of negative facial emotions in the context of its mood-related symptoms. Prior studies have reported disruptions to these three WM bundles in MDD (Liao et al., 2013; Luttenbacher et al., 2022; Pasternak et al., 2018). Our results are consistent with previous findings, but we further reveal that damage to these bundles varies in specific locations and that FA changes differ across subgroups. Specifically, the mean FA value is decreased in Subgroup 2, while it is increased in Subgroups 1 and 3. These findings suggest that distinct sub-bundles exist within these bundles and play different roles in the pathological mechanisms of MDD.

WM stratification in MDD predicts symptom-specific treatment outcomes

Our study demonstrates that WM-based stratification directly predicts overall antidepressant outcomes and domain-specific improvements (especially in cognitive symptoms) across subgroups. This neuroanatomical approach moves beyond the limitations of conventional, symptom-based subtyping, which often fails to inform treatment selection due to its inability to capture underlying pathophysiology (Arnow et al., 2015; Kato et al., 2020; Kautzky et al., 2021). In contrast, our WM-derived subgroups exhibit distinct neurostructural homogeneity, enabling a direct link between neuroanatomical pathology, clinical symptomatology, and treatment outcomes. By establishing this brain-symptom-outcome relationship, our work provides a pathophysiology-grounded framework for predicting treatment effects, thereby delivering actionable biomarkers to guide personalized therapy selection.

Our findings demonstrate symptom-specific neurobiological substrates underlying antidepressant treatment prediction across three depression subgroups. In Subgroup 2, cerebellar-visual circuit impairment predicted outcomes in five dimensions (overall efficacy, anxiety-somatization, cognitive impairment, psychomotor retardation, sleep disturbance), excluding hopelessness. This finding provides further support for the concept that cerebellar-visual abnormalities disrupt emotional-cognitive integration via two pathways: (1) amplifying negative visual biases via dysregulated limbic modulation (Ciapponi et al., 2023), and (2) impairing attention/visuospatial processing through compromised cerebellar-prefrontal interactions (Yildiz et al., 2010). Subgroup 1 exhibited frontoparietal-corticospinal alterations uniquely predicting hopelessness alongside anxiety-somatization and cognitive deficits. These associations may stem from disrupted executive control (frontoparietal network dysfunction) and failed emotional-somatic integration, where aberrant prefrontal-limbic connectivity biases reward processing and interoception, sustaining negative future expectations (Kaiser et al., 2015). In contrast, Subgroup 3 showed multi-tract WM pathology involving fornix, corpus callosum, cingulum, and ILF, specifically linked to severe cognitive impairment and psychomotor retardation. These structural disruptions form a disconnection syndrome that manifests as cognitive deficits through impaired information integration and psychomotor symptoms via disrupted motivation-motor coordination.

The baseline imaging data of the three subgroups can predict the treatment outcomes at 4 weeks in a mixed cohort of MDD patients receiving at least one of five antidepressant therapies. Prior studies reported that pretreatment baseline neuroimaging features could predict treatment response across diverse interventions (e.g., pharmacotherapy, neuromodulation therapy, and psychotherapy) (Long et al., 2025; Mao et al., 2025). Nevertheless, whether the imaging features predict outcomes generally or differ depending on specific treatments remains unclear. Our results support the generalizability of baseline imaging biomarkers. This cross-treatment validity may stem from shared neural mechanisms in MDD, as predictive features consistently involved the limbic and default mode networks, suggesting that different therapies modulate common brain circuits. Ultimately, our findings imply that the pretreatment degree metrics of symptom-related networks reflect robust network-level disturbances in MDD, capturing core pathophysiological states that influence treatment response beyond specific therapeutic modalities. These pretreatment features could serve as an initial screening tool to identify patients with a high risk of treatment resistance, avoiding unnecessary trial-and-error in clinical practice.

Data-driven identification of MDD subgroups through NMF-biclustering

We employed an NMF-based biclustering approach to identify localized WM alterations in MDD, yielding three robust WM patterns. These data-driven patterns demonstrated: (1) significant regional FA differences compared to HC; (2) strong associations with core depressive symptoms (r = 0.52–0.92; Supplementary Table S1); (3) predictive value for treatment outcomes.

This biclustering approach offers key advantages over conventional monoclustering methods for subtyping MDD: (1) it captures localized WM alterations and preserves spatial heterogeneity through simultaneous sample-feature clustering; and (2) it identifies subgroup-specific features without requiring HC comparisons, avoiding the “averaging problem” inherent in classical subgroup stratification by monoclustering. This methodological advance enables more precise mapping of the neurobiological heterogeneity in MDD by identifying localized co-variation patterns that would be obscured in conventional case-control analyses.

Limitations

The limitations of our study include its reliance solely on WM features, which overlooks potential interactions with other modalities like cortical thickness or functional connectivity that could refine subtype characterization (Drysdale et al., 2017). Additionally, the coarse resolution of current WM atlases limits precise localization of alterations within fiber bundles, necessitating finer parcellation schemes. Future studies should employ multimodal imaging, larger multi-site cohorts, and standardized protocols to dissect confounding factors (e.g., scanner variability, sociogeographic influences) and validate the predictive models’ robustness.

Conclusion

Our study demonstrates that data-driven analysis of WM architecture robustly parses the heterogeneity of MDD into neurobiologically distinct biotypes. These subtypes are characterized by specific neuroanatomical signatures and clinical symptom profiles. The predictive power of baseline WM topology for treatment outcomes underscores its potential as an actionable biomarker, and their WM patterns may serve as a proxy for an individual’s neuroplasticity reserve, offering a mechanistic explanation for treatment efficacy.

Materials and methods

Participants

From March 2012 to July 2018, 311 MDD patients were recruited from Nanjing Brain Hospital (n=209, discovery dataset) and Nanjing Drum Tower Hospital (n=102, validation dataset). All patients met DSM-IV criteria (American Psychiatric Association, 1994) as confirmed by the Chinese version of the MINI, and their depressive symptoms were assessed using the 24-HAMD (Hamilton, 1960). Medication-treated patients completed a 2-week washout period prior to baseline. Two trained psychiatrists conducted the 24-HAMD assessments at baseline and after 4 weeks of treatment. We also enrolled 100 age-, gender-, and handedness-matched HC from local communities. The demographic and clinical characteristics are presented in Table 3.

Demographic characteristics and clinical information of MDD patients and healthy controls in the current study.

The treatment strategies, determined by two experienced clinicians based on patients’ history and symptoms, included: (1) SSRIs monotherapy, (2) SNRIs monotherapy, (3) dual antidepressant therapy (SSRI+SNRI), (4) antidepressant combined with ECT therapy, or (5) antidepressant complemented by rTMS.

Exclusion criteria included: (1) substance abuse/dependence or major medical disorders, (2) history of neurological conditions or brain injury, and (3) MRI contraindications. All participants provided written informed consent after receiving detailed study information. The study protocol was approved by the Ethics Review Boards of both Nanjing Brain Hospital and Nanjing Drum Tower Hospital in accordance with the Declaration of Helsinki.

Imaging acquisitions and preprocessing

All MRI data were acquired using 3.0 T Siemens Verio scanners equipped with 12-channel head coils at both study sites. Consistent imaging protocols were implemented across sites for both dMRI and T1-weighted acquisitions. dMRI parameters included: 30 diffusion directions (b = 1,000 s/mm²) with one b0 image, TR/TE = 6600/93 ms, flip angle = 90°, FOV = 240 × 240 mm², matrix = 128 × 128, 3 mm slice thickness with no gap, and voxel size = 1.9 × 1.9 × 3 mm³. T1-weighted imaging parameters were: TR/TE = 1900/2.48 ms, 1 mm slice thickness, flip angle = 9°, inversion time = 900 ms, matrix = 256 × 256, FOV = 250 × 250 mm², and isotropic 1 mm³ voxels.

All images underwent visual inspection for artifacts before standardized preprocessing. For both T1w and dMRI data, we performed: (1) axial alignment; (2) Gibbs ringing correction using local subvoxel-shifts (Kellner et al., 2016); and (3) N4ITK intensity inhomogeneity correction (Tustison et al., 2010). dMRI-specific processing included: (1) MP-PCA denoising (Veraart et al., 2016); (2) eddy current correction with FSL’s eddy_correct tool (Jenkinson et al., 2012); (3) EPI correction; (4) CNN-based brain masking (pnlNipype); (5) T1w-dMRI alignment using FSL rigid registration (Jenkinson et al., 2012) while preserving native diffusion space; and (6) diffusion tensor estimation using the Stejskal-Tanner equation, followed by eigenvalue decomposition to yield fractional anisotropy (FA) maps.
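
As an illustration of the last preprocessing step, the sketch below computes FA from the three eigenvalues of a diffusion tensor using the standard FA definition. It is a minimal example rather than the pipeline used in this study; the function name `fractional_anisotropy` and the tensor values are assumptions chosen for demonstration.

```python
import numpy as np

def fractional_anisotropy(eigenvalues):
    """FA from the three eigenvalues of a diffusion tensor (last axis has length 3)."""
    lam = np.asarray(eigenvalues, dtype=float)
    mean_diffusivity = lam.mean(axis=-1, keepdims=True)
    numerator = np.sqrt(((lam - mean_diffusivity) ** 2).sum(axis=-1))
    denominator = np.sqrt((lam ** 2).sum(axis=-1))
    # Guard against division by zero in empty (background) voxels.
    safe = np.where(denominator > 0, denominator, 1.0)
    fa = np.sqrt(1.5) * numerator / safe
    return np.where(denominator > 0, fa, 0.0)

# Hypothetical single-voxel tensor (mm^2/s) with one dominant diffusion direction.
tensor = np.array([[1.7e-3, 0.0, 0.0],
                   [0.0, 0.3e-3, 0.0],
                   [0.0, 0.0, 0.3e-3]])
eigvals = np.linalg.eigvalsh(tensor)   # eigenvalue decomposition of the symmetric tensor
fa_value = fractional_anisotropy(eigvals)
```

An isotropic tensor (three equal eigenvalues) gives FA = 0, and FA approaches 1 as diffusion becomes confined to a single direction.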

Whole-brain probabilistic tractography was performed in MRtrix3 using the iFOD2 algorithm, with fiber orientation distributions first estimated using constrained spherical deconvolution. A total of 3 million seeds were employed to initiate streamline propagation, with all other tracking parameters set to their default values (Tournier et al., 2019). Subsequently, the resulting tractograms were processed using the Spherical-deconvolution Informed Filtering of Tractograms (SIFT) method (Smith et al., 2015, 2013) to obtain a refined and biologically meaningful set of 1 million streamlines.

For Tract-Based Spatial Statistics (TBSS) analysis in FSL 6.0 (Smith et al., 2006), individual FA maps were nonlinearly registered to the FMRIB58_FA template. A mean FA skeleton representing the core WM tracts was generated and thresholded at FA > 0.2 to exclude partial volume effects. Each subject’s FA data were projected onto this skeleton by taking the maximal FA value along the direction perpendicular to the skeleton. This standardized approach ensured comparable WM analysis across all participants.

Data-driven identification of MDD subgroups

Biclustering identification via NMF in discovery dataset

Each patient’s FA-TBSS map was flattened into a feature vector, retaining only skeleton voxels. These vectors were then combined to form the matrix X (109,848 voxels × 209 subjects). Prior to NMF-based biclustering, we regressed age and sex out of X to control for potential confounding variables. We employed the sparse NMF algorithm from the toolbox of Li and Ngom (2013) to decompose the TBSS-derived FA matrix $X \in \mathbb{R}^{p \times n}$ (p = voxels, n = subjects). X was factorized into a basis matrix $A \in \mathbb{R}^{p \times k}$ (k latent components, with k ranging from 2 to 8) and a coefficient matrix $Y \in \mathbb{R}^{k \times n}$, such that $X \approx AY$. Each column of A represents a latent WM microstructural pattern formed by voxel combinations, while Y quantifies subject-specific expression levels of these patterns. Subject clustering was determined by identifying the dominant component (maximum value) in each column of Y, with basis vectors serving as cluster centroids.
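
The factorization and the subject assignment rule can be sketched as follows. This is a simplified illustration on toy data: it uses plain multiplicative-update NMF as a stand-in for the sparse NMF algorithm of the toolbox, and the function names `nmf_multiplicative` and `assign_subjects` are assumptions introduced here.

```python
import numpy as np

def nmf_multiplicative(X, k, n_iter=200, seed=0, eps=1e-9):
    """Plain Frobenius-norm NMF: X (p x n) is approximated by A (p x k) @ Y (k x n)."""
    rng = np.random.default_rng(seed)
    p, n = X.shape
    A = rng.random((p, k)) + eps
    Y = rng.random((k, n)) + eps
    for _ in range(n_iter):
        Y *= (A.T @ X) / (A.T @ A @ Y + eps)
        A *= (X @ Y.T) / (A @ Y @ Y.T + eps)
    # Put all basis vectors on a common scale so the coefficients are comparable.
    scale = np.linalg.norm(A, axis=0)
    return A / scale, Y * scale[:, None]

def assign_subjects(Y):
    """Each subject (column of Y) joins the component with the largest coefficient."""
    return np.argmax(Y, axis=0)

# Toy non-negative data: 60 "voxels" x 12 "subjects" with two planted groups.
rng = np.random.default_rng(42)
X = rng.random((60, 12)) * 0.05
X[:30, :6] += 1.0    # pattern 1 is expressed by subjects 0-5
X[30:, 6:] += 1.0    # pattern 2 is expressed by subjects 6-11

A, Y = nmf_multiplicative(X, k=2)
labels = assign_subjects(Y)
```

NMF requires non-negative input, so the toy matrix is built to be non-negative; how residualized FA values are kept non-negative in the actual analysis is not covered by this sketch.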

To enhance feature interpretability, we reduced feature redundancy through sparsity constraints during decomposition and performed entropy-based feature selection on matrix A, retaining the top 20% of voxels with the highest discriminative power for subsequent biclustering. This dual optimization process simultaneously identifies: (a) cohesive WM patterns through column-wise clustering of A, and (b) patient subgroups via row-wise clustering of Y. Detailed implementation parameters are provided in the Supplementary Methods (section: identification of biclusters based on NMF method).

To determine the optimal rank k that yields stable and meaningful clusters, we adopted Brunet’s consensus clustering approach (Brunet et al., 2004). For each rank k, we performed 100 iterations of matrix factorization to account for random initialization effects. Each iteration generated a connectivity matrix C (n × n), where $C_{ij} = 1$ if samples i and j clustered together and 0 otherwise. The consensus matrix was computed by averaging these 100 connectivity matrices, with entries representing the probability of pairwise sample co-clustering. We then performed hierarchical clustering (average linkage) on the consensus matrix and evaluated clustering stability using the cophenetic correlation coefficient, which quantifies the agreement between the consensus matrix distances and hierarchical clustering linkage distances. The optimal k was selected as the value maximizing this coefficient (Figure 7).
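
A minimal sketch of the consensus matrix and the cophenetic correlation coefficient is given below, using SciPy's hierarchical clustering routines. The helper names and the toy label runs are assumptions for illustration; taking 1 minus the consensus value as the distance follows the usual consensus clustering convention and is assumed here.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, cophenet
from scipy.spatial.distance import squareform

def connectivity_matrix(labels):
    """C[i, j] = 1 if samples i and j share a cluster in one run, else 0."""
    labels = np.asarray(labels)
    return (labels[:, None] == labels[None, :]).astype(float)

def consensus_matrix(label_runs):
    """Average connectivity over runs: the probability of pairwise co-clustering."""
    return np.mean([connectivity_matrix(run) for run in label_runs], axis=0)

def cophenetic_coefficient(consensus):
    """Correlation between consensus distances and average-linkage tree distances."""
    distance = 1.0 - consensus
    np.fill_diagonal(distance, 0.0)
    condensed = squareform(distance, checks=False)
    tree = linkage(condensed, method="average")
    coefficient, _ = cophenet(tree, condensed)
    return coefficient

# Toy example: 100 identical runs versus 100 runs with random labels.
stable_runs = [[0, 0, 0, 1, 1, 1]] * 100
rng = np.random.default_rng(0)
unstable_runs = rng.integers(0, 2, size=(100, 6))

stable_coef = cophenetic_coefficient(consensus_matrix(stable_runs))
unstable_coef = cophenetic_coefficient(consensus_matrix(unstable_runs))
```

In a rank search, this coefficient would be computed once per candidate k from the 100 label vectors obtained at that rank, and the k with the largest value retained.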

Pipeline for stratifying subgroups in the discovery dataset.
First, the TBSS method was used to extract FA values within the WM skeleton. NMF biclustering was then performed across a range of predefined ranks k (2 to 8) to generate the corresponding clustering results (2 to 8 subgroups). Finally, the cophenetic correlation coefficient was employed to determine the optimal number of subgroups.

Bicluster assignment in independent validation dataset

To classify new patients from the validation cohort, we projected each individual’s data onto the pre-established basis matrix A to compute coefficient matrix Y, where maximal values in each column determine cluster membership. This assignment process was repeated 100 times to generate a test consensus matrix, mirroring the protocol from the discovery phase. Hierarchical clustering with average linkage was then applied, followed by computation of the corresponding cophenetic correlation coefficient. This was done to assess the stability of the clustering using the predetermined optimal rank (k) when applied to the validation cohort.
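
The projection step can be sketched as a non-negative least-squares problem in which A is held fixed and only Y is updated. The example below uses multiplicative updates on toy data; the function name `project_onto_basis` and all values are assumptions, and the 100 repetitions with consensus clustering described above are omitted for brevity.

```python
import numpy as np

def project_onto_basis(X_new, A, n_iter=300, seed=0, eps=1e-9):
    """Estimate a non-negative Y with A held fixed, so that X_new is close to A @ Y."""
    rng = np.random.default_rng(seed)
    Y = rng.random((A.shape[1], X_new.shape[1])) + eps
    AtX = A.T @ X_new
    AtA = A.T @ A
    for _ in range(n_iter):
        Y *= AtX / (AtA @ Y + eps)   # only Y changes; the basis is never refitted
    return Y

# Hypothetical basis with two non-overlapping patterns over 40 "voxels".
A = np.zeros((40, 2))
A[:20, 0] = 1.0
A[20:, 1] = 1.0

# Five hypothetical validation subjects with known expression levels plus noise.
true_Y = np.array([[1.0, 0.9, 1.1, 0.1, 0.2],
                   [0.1, 0.2, 0.1, 1.0, 0.8]])
rng = np.random.default_rng(1)
X_new = A @ true_Y + 0.01 * rng.random((40, 5))

Y_new = project_onto_basis(X_new, A)
labels_new = np.argmax(Y_new, axis=0)   # dominant component per subject
```

Repeating the projection with different seeds would yield the label vectors needed to build a test consensus matrix.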

Subgroup characterization analysis

WM signature extraction

To identify subgroup-specific WM microstructural signatures, we developed a frequency-based feature selection protocol following NMF biclustering. Given the stochastic nature of biclustering assignments across 100 iterations (performed at the predetermined optimal rank k), we quantified voxel-wise feature consistency through the following steps (Figure 8):

Feature occurrence mapping: We created a frequency matrix F (sample pairs × features), in which each row represented a unique sample pair and each column corresponded to a voxel feature. For each biclustering run, if a feature (column) appeared in the same cluster as a sample pair (row), its entry in F was incremented by 1. The accumulated counts were then normalized to probability values between 0 and 1.
Consensus-driven partitioning: We segmented F into k submatrices by grouping rows (sample pairs) according to their subgroup labels derived from the hierarchical clustering.
Signature thresholding: For each subgroup-specific submatrix, features (columns) exceeding the probability threshold (p > 0.9) were retained and defined as characteristic WM signatures, ensuring > 90% consistency across biclustering realizations.

This approach transformed the biclustering solutions obtained across runs into spatially stable WM profiles, mitigating the initialization-dependent variability inherent to NMF decomposition.
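
The three steps above might be sketched as follows on a toy problem. Several details are assumptions made for illustration only: each run is represented by a pair of label vectors (one for samples, one for features) that share cluster indices, and the per-feature probability of a subgroup is taken as the mean over that subgroup's sample-pair rows.

```python
import numpy as np
from itertools import combinations

def frequency_matrix(runs, n_samples, n_features):
    """F[pair, feature]: fraction of runs in which the feature shares the pair's cluster."""
    pairs = list(combinations(range(n_samples), 2))
    F = np.zeros((len(pairs), n_features))
    for sample_labels, feature_labels in runs:
        sample_labels = np.asarray(sample_labels)
        feature_labels = np.asarray(feature_labels)
        for row, (i, j) in enumerate(pairs):
            if sample_labels[i] == sample_labels[j]:
                F[row] += feature_labels == sample_labels[i]
    return F / len(runs), pairs

def subgroup_signatures(F, pairs, final_labels, threshold=0.9):
    """Features whose mean frequency within a subgroup's pair rows exceeds the threshold."""
    final_labels = np.asarray(final_labels)
    signatures = {}
    for g in np.unique(final_labels):
        rows = [r for r, (i, j) in enumerate(pairs)
                if final_labels[i] == g and final_labels[j] == g]
        mean_frequency = F[rows].mean(axis=0)
        signatures[int(g)] = np.flatnonzero(mean_frequency > threshold)
    return signatures

# Toy runs: 4 samples, 6 features; feature 2 switches cluster in 2 of 10 runs.
runs = [([0, 0, 1, 1], [0, 0, 0, 1, 1, 1])] * 8 + [([0, 0, 1, 1], [0, 0, 1, 1, 1, 1])] * 2
F, pairs = frequency_matrix(runs, n_samples=4, n_features=6)
signatures = subgroup_signatures(F, pairs, final_labels=[0, 0, 1, 1])
```

In this toy case the unstable feature 2 is dropped from both signatures because it never reaches the 0.9 threshold.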

Flowchart of WM signature extraction.
a) Feature occurrence mapping illustrates how the frequency matrix F is generated. b) The split of matrix F involves two parts: consensus-driven partitioning and signature thresholding.

FA-based case-control comparisons

To statistically validate the identified WM patterns, we conducted comparative analysis of mean FA values between each MDD subgroup and HCs within the signature WM profiles. These comparisons were performed independently in both discovery and validation cohorts using Wilcoxon rank-sum tests. Prior to analysis, we accounted for potential confounding effects by regressing out age and sex covariates.
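
A minimal sketch of this comparison is shown below with simulated values. It assumes that covariates are removed by ordinary least squares fitted on the pooled sample; the function name `regress_out`, the simulated effect sizes, and the sample sizes are illustrative only.

```python
import numpy as np
from scipy.stats import ranksums

def regress_out(values, covariates):
    """Residuals after an ordinary least-squares fit on the covariates plus an intercept."""
    values = np.asarray(values, dtype=float)
    design = np.column_stack([np.ones(len(values)), covariates])
    beta, *_ = np.linalg.lstsq(design, values, rcond=None)
    return values - design @ beta

# Simulated mean FA within a signature region for 30 patients and 30 controls.
rng = np.random.default_rng(0)
n = 30
age = rng.uniform(20, 60, size=2 * n)
sex = rng.integers(0, 2, size=2 * n)
is_patient = np.repeat([True, False], n)
mean_fa = 0.50 - 0.001 * age + 0.005 * sex + rng.normal(0, 0.01, size=2 * n)
mean_fa[is_patient] -= 0.05   # planted FA reduction in the patient subgroup

residual_fa = regress_out(mean_fa, np.column_stack([age, sex]))
result = ranksums(residual_fa[is_patient], residual_fa[~is_patient])
```

A negative `result.statistic` indicates lower residual FA in the patient subgroup than in controls.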

Clinical phenotype differentiation

We conducted comprehensive clinical characterization of the MDD subgroups across the discovery and validation datasets. First, we evaluated differences in depression severity (total 24-HAMD score) and symptom domain profiles (anxiety/somatization, cognitive impairment, psychomotor retardation, sleep disturbance, and hopelessness factor scores; see Supplementary Materials for detailed definitions of symptom factors) using Kruskal-Wallis tests. Second, we examined illness course characteristics including recurrence frequency, current episode duration, total illness duration, and effective illness duration. Significant findings underwent post-hoc analysis using Wilcoxon rank-sum tests in both datasets. Before conducting the above comparisons, we regressed out age and sex effects. Third, we assessed treatment response heterogeneity by comparing responder rates (≥ 20% 24-HAMD reduction after 4 weeks) (Kennedy, 2022) across subgroups for five treatment strategies, analyzed by chi-square tests.
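
The responder-rate comparison can be sketched as follows for a single treatment strategy. The scores and subgroup labels are invented for illustration, and the helper names are assumptions; in the study this comparison is described as being run for each of the five strategies.

```python
import numpy as np
from scipy.stats import chi2_contingency

def percent_reduction(baseline, week4):
    """Percentage reduction in 24-HAMD total score from baseline to week 4."""
    baseline = np.asarray(baseline, dtype=float)
    return 100.0 * (baseline - np.asarray(week4, dtype=float)) / baseline

def responder_table(reduction, subgroup, threshold=20.0):
    """Rows: subgroups; columns: responders, non-responders."""
    responder = reduction >= threshold
    groups = np.unique(subgroup)
    return np.array([[np.sum(responder & (subgroup == g)),
                      np.sum(~responder & (subgroup == g))] for g in groups])

# Hypothetical scores for 12 patients in three subgroups.
baseline = np.full(12, 30)
week4 = np.array([20, 22, 23, 28, 29, 27, 28, 18, 15, 21, 27, 29])
subgroup = np.array([1, 1, 1, 1, 2, 2, 2, 2, 3, 3, 3, 3])

table = responder_table(percent_reduction(baseline, week4), subgroup)
chi2, p_value, dof, expected = chi2_contingency(table)
```

With cell counts as small as in this toy table, an exact test would normally be preferred; the chi-square call is shown only to mirror the analysis described above.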

Predicting antidepressant treatment outcomes across MDD subgroups

Symptom-WM associations in discovery cohort

We employed sCCA to investigate associations between the 7 clinical symptom profiles (the above-mentioned 5 symptom domain profiles, plus weight and diurnal variations) and WM microstructural patterns within each MDD subgroup. The analytical pipeline comprised four stages:

Dimensionality reduction: Given the high feature-to-sample ratio (> 5,000 voxels vs. subgroup sizes), we implemented region-of-interest (ROI) clustering to reduce dimensionality. WM patterns were parcellated into spatially contiguous clusters (10–20 voxels) (Zalesky et al., 2010), with FA values averaged within each cluster. This ROI-based feature extraction was repeated 10 times per subgroup to account for clustering stochasticity.
sCCA implementation: Reduced-dimension data were analyzed via sCCA with cross-validation (CV) to identify maximally correlated linear combinations between clinical factors and WM features, generating canonical variate pairs.
Statistical validation: Permutation testing (1,000 iterations) assessed the significance of canonical correlations against null distributions.
Feature stability assessment: Bootstrapping (1,000 resamples) identified consistently contributing clinical and neuroimaging features across sCCA solutions.

To assess the robustness of our findings, the established sCCA workflow (Xia et al., 2018) was iterated over 10 different ROI parcellations per subgroup. We examined the stability of clinical loadings and model performance in held-out data. The technical details are provided in Supplementary Methods.
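
To make the sCCA and permutation steps concrete, the sketch below fits a single pair of sparse canonical vectors by alternating soft-thresholded power iterations and then builds a permutation null by shuffling the rows of one data block. It is a simplified variant written for illustration, not the implementation used in this study: the thresholding rule, the penalty values, the function names, and the use of 200 permutations (instead of 1,000) are all assumptions.

```python
import numpy as np

def soft_threshold(x, amount):
    """Shrink entries toward zero and set small ones exactly to zero."""
    return np.sign(x) * np.maximum(np.abs(x) - amount, 0.0)

def sparse_cca(X, Y, penalty_x=0.3, penalty_y=0.3, n_iter=50):
    """Rank-1 sparse CCA; thresholds are fractions of the largest absolute entry."""
    X = (X - X.mean(0)) / X.std(0)
    Y = (Y - Y.mean(0)) / Y.std(0)
    C = X.T @ Y
    v = np.ones(Y.shape[1]) / np.sqrt(Y.shape[1])
    for _ in range(n_iter):
        u = C @ v
        u = soft_threshold(u, penalty_x * np.abs(u).max())
        u /= np.linalg.norm(u)
        v = C.T @ u
        v = soft_threshold(v, penalty_y * np.abs(v).max())
        v /= np.linalg.norm(v)
    r = np.corrcoef(X @ u, Y @ v)[0, 1]
    return u, v, r

def permutation_p_value(X, Y, n_perm=200, seed=0):
    """Compare the observed canonical correlation with a row-shuffled null."""
    rng = np.random.default_rng(seed)
    observed = sparse_cca(X, Y)[2]
    null = np.array([sparse_cca(X, Y[rng.permutation(len(Y))])[2]
                     for _ in range(n_perm)])
    return observed, (1 + np.sum(null >= observed)) / (1 + n_perm)

# Simulated data: 60 subjects, 10 WM ROI features, 7 symptom factors, one shared signal.
rng = np.random.default_rng(3)
z = rng.normal(size=60)
wm = rng.normal(size=(60, 10))
wm[:, :2] += 1.5 * z[:, None]
symptoms = rng.normal(size=(60, 7))
symptoms[:, 0] += 1.5 * z

r_obs, p_perm = permutation_p_value(wm, symptoms)
```

Bootstrap stability could be assessed in the same spirit by refitting on resampled subjects and recording how often each loading remains non-zero.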

Following sCCA analysis, we identified the WM ROIs (WM_cca) associated with the 7 clinical symptom factors in each subgroup, retaining ROIs that appeared in ≥ 2/10 iterations. Fiber tracts passing through these WM_cca regions were reconstructed into subgroup-specific WM networks representing symptom-related neurocircuitry. Each network comprised 288 nodes (nodal definitions are given in Supplementary Excel 1), based on three atlases: the Yeo 17-network parcellation, the SUIT cerebellum atlas, and Tian’s subcortex atlas (Diedrichsen et al., 2009; Schaefer et al., 2018; Tian et al., 2020).

Subgroup-specific treatment outcome prediction

The nodes in the symptom-related network were categorized into 19 functional subsystems (i.e., the Yeo 17 functional networks, a cerebellar network, and a subcortical network). Degree centrality values for each subsystem (calculation details in Supplementary Methods) served as predictive features in support vector regression (SVR) models to forecast six treatment outcomes (i.e., percentage reductions in the 24-HAMD total score and the five core symptom domain profiles). All features were separately adjusted for age/sex covariates in both datasets prior to the prediction.
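
One possible way to turn a structural network into subsystem-level degree features is sketched below. The exact calculation is given in the Supplementary Methods and is not reproduced here; this example assumes a binarized adjacency matrix and averages node degree within each subsystem, and the function name and toy network are hypothetical.

```python
import numpy as np

def subsystem_degree(adjacency, subsystem_of_node, n_subsystems):
    """Mean binary degree of the nodes belonging to each subsystem (assumed aggregation)."""
    binary = (np.asarray(adjacency) > 0).astype(float)
    np.fill_diagonal(binary, 0.0)            # ignore self-connections
    degree = binary.sum(axis=1)
    features = np.zeros(n_subsystems)
    for s in range(n_subsystems):
        members = subsystem_of_node == s
        if members.any():
            features[s] = degree[members].mean()
    return features

# Toy network: 6 nodes in 3 subsystems, edge weights standing in for streamline counts.
adjacency = np.zeros((6, 6))
for i, j, count in [(0, 1, 5), (0, 2, 3), (2, 3, 1), (4, 5, 2)]:
    adjacency[i, j] = adjacency[j, i] = count
subsystem_of_node = np.array([0, 0, 1, 1, 2, 2])

features = subsystem_degree(adjacency, subsystem_of_node, n_subsystems=3)
```

For the networks described above, the same function would be called with a 288 × 288 matrix and 19 subsystem labels, giving one 19-dimensional feature vector per patient.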

Model development employed epsilon-insensitive SVR with a sigmoid kernel (Chang and Lin, 2011). Hyperparameter optimization via grid search explored γ (default = 1), coef0 (0.1–2, step = 0.01), and ε (0.01–2, step = 0.01). Prediction performance was evaluated through nested leave-one-out CV (LOOCV), with grid search external to CV folds to prevent data leakage. Final model performance was quantified via Pearson correlation between predicted and observed values across all LOOCV iterations.
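
The sketch below shows one common way to organize a nested LOOCV for sigmoid-kernel SVR with scikit-learn. It assumes that the hyperparameters are tuned using only the training portion of each outer fold, which may differ from the exact arrangement used in this study; the grid is a coarse subset of the ranges above, and the data are simulated.

```python
import numpy as np
from scipy.stats import pearsonr
from sklearn.model_selection import GridSearchCV, LeaveOneOut
from sklearn.svm import SVR

def nested_loocv_predictions(X, y, param_grid):
    """Outer LOOCV for prediction; inner LOOCV grid search on each training set."""
    predictions = np.zeros(len(y))
    for train_idx, test_idx in LeaveOneOut().split(X):
        search = GridSearchCV(
            SVR(kernel="sigmoid", gamma=1.0),
            param_grid,
            cv=LeaveOneOut(),
            scoring="neg_mean_absolute_error",
        )
        search.fit(X[train_idx], y[train_idx])
        predictions[test_idx] = search.predict(X[test_idx])
    return predictions

# Simulated features (e.g., subsystem degree values) and one treatment outcome.
rng = np.random.default_rng(0)
X = rng.normal(scale=0.5, size=(14, 4))
y = X @ np.array([1.0, -0.5, 0.3, 0.0]) + rng.normal(scale=0.1, size=14)

param_grid = {"coef0": [0.1, 1.0], "epsilon": [0.01, 0.1]}   # coarse illustrative grid
predicted = nested_loocv_predictions(X, y, param_grid)
r, _ = pearsonr(predicted, y)
```

Because each held-out subject is never seen during tuning in this arrangement, the resulting correlation is not inflated by hyperparameter selection.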

Model validation in independent cohort

The 18 subgroup-specific SVR models (3 subgroups × 6 treatment outcomes) trained in the discovery cohort were rigorously validated using the independent dataset. Fixed hyperparameters, optimized in the discovery phase, were retained during training of the SVR models. The external independent dataset underwent an identical feature preprocessing pipeline (including age/sex regression and degree centrality calculation) as applied to the discovery cohort. Model performance was quantified using the correlation between predicted and observed values for each model. This validation framework ensured generalizability while preventing overfitting through a complete separation between training and validation datasets.

Data availability

The code for the analyses presented in this paper is openly accessible on GitHub: [https://github.com/qinjiaolong/MDDsubtype]. The subgroup-related data are accessible at [https://osf.io/3gauj].

Acknowledgements

Sincere appreciation is extended to the patients and control subjects for their valuable participation. We sincerely thank Professor Chen Gong for his valuable suggestions on the NMF-clustering method.

Additional information

CRediT Author Statement

Jiaolong Qin: Conceptualization, Methodology, Software, Formal analysis, Validation, Writing - Original Draft, Writing - Review & Editing

Xinyi Wang, Huangjing Ni, Ye Wu: Formal analysis, Data Curation

Haiyan Liu, Lingling Hua, Rui Yan, Hao Tang, Peng Zhao: Conceptualization, Investigation, Resources, Data Curation

Zhijian Yao: Conceptualization, Resources, Supervision

Qing Lu: Conceptualization, Methodology, Supervision, Writing - Review & Editing

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

Funding

National Natural Science Foundation of China (81701346)

Jiaolong Qin
Ye Wu

National Natural Science Foundation of China (81871066)

Qing Lu
Ye Wu

National Natural Science Foundation of China (82151315)

Qing Lu
Ye Wu

National Natural Science Foundation of China (82271568)

Qing Lu
Ye Wu

Natural Science Foundation of Jiangsu Province (BK20190736)

Huangjing Ni
Ye Wu

National Key R&D Program of China (2023YFF1204803)

Ye Wu

National Natural Science Foundation of China (62201265)

Ye Wu

Key Project of Jiangsu Provincial Natural Science Fund (BK20253028)

Ye Wu

Additional files

Supplementary file 1

Supplementary file 2

References

American Psychiatric Association, DSM-5 Task Force (2013) Diagnostic and statistical manual of mental disorders: DSM-5™, 5th ed. American Psychiatric Publishing. https://doi.org/10.1176/appi.books.9780890425596
Arnow BA, Blasey C, Williams LM, Palmer DM, Rekshan W, Schatzberg AF, Etkin A, Kulkarni J, Luther JF, Rush AJ (2015) Depression Subtypes in Predicting Antidepressant Response: A Report From the iSPOT-D Trial. American Journal of Psychiatry 172:743–750. https://doi.org/10.1176/appi.ajp.2015.14020181
American Psychiatric Association (1994) Diagnostic Criteria from DSM-IV. American Psychiatric Association.
Benear SL, Ngo CT, Olson IR (2020) Dissecting the Fornix in Basic Memory Processes and Neuropsychiatric Disease: A Review. Brain Connectivity 10:331–354. https://doi.org/10.1089/brain.2020.0749
Brunet J-P, Tamayo P, Golub TR, Mesirov JP (2004) Metagenes and molecular pattern discovery using matrix factorization. Proceedings of the National Academy of Sciences 101:4164–4169. https://doi.org/10.1073/pnas.0308531101
Bubb EJ, Metzler-Baddeley C, Aggleton JP (2018) The cingulum bundle: Anatomy, function, and dysfunction. Neuroscience & Biobehavioral Reviews 92:104–127. https://doi.org/10.1016/j.neubiorev.2018.05.008
Castrén E, Hen R (2013) Neuronal plasticity and antidepressant actions. Trends in Neurosciences 36:259–267. https://doi.org/10.1016/j.tins.2012.12.010
Catani M, Thiebaut De Schotten M (2008) A diffusion tensor imaging tractography atlas for virtual in vivo dissections. Cortex 44:1105–1132. https://doi.org/10.1016/j.cortex.2008.05.004
Chang C-C, Lin C-J (2011) LIBSVM: A library for support vector machines. ACM Trans Intell Syst Technol 2:1–27. https://doi.org/10.1145/1961189.1961199
Ciapponi C, Li Y, Osorio Becerra DA, Rodarie D, Casellato C, Mapelli L, D’Angelo E (2023) Variations on the theme: focus on cerebellum and emotional processing. Frontiers in Systems Neuroscience 17. https://doi.org/10.3389/fnsys.2023.1185752
Cohen SE, Zantvoord JB, Wezenberg BN, Bockting CLH, van Wingen GA (2021) Magnetic resonance imaging for individual prediction of treatment response in major depressive disorder: a systematic review and meta-analysis. Translational Psychiatry 11:1–10. https://doi.org/10.1038/s41398-021-01286-x
Cole J, Chaddock CA, Farmer AE, Aitchison KJ, Simmons A, McGuffin P, Fu CHY (2012) White matter abnormalities and illness severity in major depressive disorder. The British Journal of Psychiatry 201:33–39. https://doi.org/10.1192/bjp.bp.111.100594
Diedrichsen J, Balsters JH, Flavell J, Cussans E, Ramnani N (2009) A probabilistic MR atlas of the human cerebellum. NeuroImage 46:39–46. https://doi.org/10.1016/j.neuroimage.2009.01.045
Drysdale AT, Grosenick L, Downar J, Dunlop K, Mansouri F, Meng Y, Fetcho RN, Zebley B, Oathes DJ, Etkin A, Schatzberg AF, Sudheimer K, Keller J, Mayberg HS, Gunning FM, Alexopoulos GS, Fox MD, Pascual-Leone A, Voss HU, Casey BJ, Dubin MJ, Liston C (2017) Resting-state connectivity biomarkers define neurophysiological subtypes of depression. Nature Medicine 23:28–38. https://doi.org/10.1038/nm.4246
Feczko E, Miranda-Dominguez O, Marr M, Graham AM, Nigg JT, Fair DA (2019) The Heterogeneity Problem: Approaches to Identify Psychiatric Subtypes. Trends in Cognitive Sciences 23:584–601. https://doi.org/10.1016/j.tics.2019.03.009
Fried E (2017) Moving forward: how depression heterogeneity hinders progress in treatment and research. Expert Review of Neurotherapeutics 17:423–425. https://doi.org/10.1080/14737175.2017.1307737
Fried EI, Nesse RM (2015) Depression is not a consistent syndrome: An investigation of unique symptom patterns in the STAR*D study. Journal of Affective Disorders 172:96–102. https://doi.org/10.1016/j.jad.2014.10.010
Gazerani P (2025) The neuroplastic brain: current breakthroughs and emerging frontiers. Brain Research 1858:149643. https://doi.org/10.1016/j.brainres.2025.149643
Godsil BP, Kiss JP, Spedding M, Jay TM (2013) The hippocampal–prefrontal pathway: The weak link in psychiatric disorders? European Neuropsychopharmacology 23:1165–1181. https://doi.org/10.1016/j.euroneuro.2012.10.018
Guevara M, Román C, Houenou J, Duclap D, Poupon C, Mangin JF, Guevara P (2017) Reproducibility of superficial white matter tracts using diffusion-weighted imaging tractography. NeuroImage 147:703–725. https://doi.org/10.1016/j.neuroimage.2016.11.066
Haghshomar M, Dolatshahi M, Ghazi Sherbaf F, Sanjari Moghaddam H, Shirin Shandiz M, Aarabi MH (2018) Disruption of Inferior Longitudinal Fasciculus Microstructure in Parkinson’s Disease: A Systematic Review of Diffusion Tensor Imaging Studies. Frontiers in Neurology 9. https://doi.org/10.3389/fneur.2018.00598
Hamilton M (1960) A Rating Scale for Depression. Journal of Neurology, Neurosurgery & Psychiatry 23:56–62. https://doi.org/10.1136/jnnp.23.1.56
Herbet G, Zemmoura I, Duffau H (2018) Functional Anatomy of the Inferior Longitudinal Fasciculus: From Historical Reports to Current Hypotheses. Frontiers in Neuroanatomy 12. https://doi.org/10.3389/fnana.2018.00077
James SL, Abate D, Abate KH, et al. (2018) Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. The Lancet 392:1789–1858. https://doi.org/10.1016/S0140-6736(18)32279-7
Jenkinson M, Beckmann CF, Behrens TEJ, Woolrich MW, Smith SM (2012) FSL. NeuroImage 62:782–790. https://doi.org/10.1016/j.neuroimage.2011.09.015
Kaiser RH, Andrews-Hanna JR, Wager TD, Pizzagalli DA (2015) Large-Scale Network Dysfunction in Major Depressive Disorder: A Meta-analysis of Resting-State Functional Connectivity. JAMA Psychiatry 72:603–611. https://doi.org/10.1001/jamapsychiatry.2015.0071
Kato M, Asami Y, Wajsbrot DB, Wang X, Boucher M, Prieto R, Pappadopulos E (2020) Clustering patients by depression symptoms to predict venlafaxine ER antidepressant efficacy: Individual patient data analysis. Journal of Psychiatric Research 129:160–167. https://doi.org/10.1016/j.jpsychires.2020.06.011
Kautzky A, Möller H-J, Dold M, Bartova L, Seemüller F, Laux G, Riedel M, Gaebel W, Kasper S (2021) Combining machine learning algorithms for prediction of antidepressant treatment response. Acta Psychiatrica Scandinavica 143:36–49. https://doi.org/10.1111/acps.13250
Kellner E, Dhital B, Kiselev VG, Reisert M (2016) Gibbs-ringing artifact removal based on local subvoxel-shifts. Magnetic Resonance in Medicine 76:1574–1581. https://doi.org/10.1002/mrm.26054
Kennedy SH (2022) Beyond Response: Aiming for Quality Remission in Depression. Advances in Therapy 39:20–28. https://doi.org/10.1007/s12325-021-02030-z
Kennis M, Gerritsen L, van Dalen M, Williams A, Cuijpers P, Bockting C (2020) Prospective biomarkers of major depressive disorder: a systematic review and meta-analysis. Molecular Psychiatry 25:321–338. https://doi.org/10.1038/s41380-019-0585-z
Kraus C, Kadriu B, Lanzenberger R, Zarate CA, Kasper S (2019) Prognosis and improved outcomes in major depression: a review. Translational Psychiatry 9:1–17. https://doi.org/10.1038/s41398-019-0460-3
Li Y, Ngom A (2013) The non-negative matrix factorization toolbox for biological data mining. Source Code for Biology and Medicine 8:1–15. https://doi.org/10.1186/1751-0473-8-10
Liang S, Wang Q, Kong X, Deng W, Yang X, Xiaojing Li, Zhang Z, Zhang J, Zhang C, Xin-min Li, Ma X, Shao J, Greenshaw AJ, Li T (2019) White Matter Abnormalities in Major Depression Biotypes Identified by Diffusion Tensor Imaging. Neuroscience Bulletin 35:867–876. https://doi.org/10.1007/s12264-019-00381-w
Liao Y, Huang X, Wu Q, Yang C, Kuang W, Du M, Lui S, Yue Q, Chan RCK, Kemp GJ, Gong Q (2013) Is depression a disconnection syndrome? Meta-analysis of diffusion tensor imaging studies in patients with MDD. Journal of Psychiatry and Neuroscience 38:49–56. https://doi.org/10.1503/jpn.110180
Long F, Chen Y, Zhang Q, Li Q, Yaxuan Wang, Yitian Wang, Li H, Zhao Y, McNamara RK, DelBello MP, Sweeney JA, Gong Q, Li F (2025) Predicting treatment outcomes in major depressive disorder using brain magnetic resonance imaging: a meta-analysis. Molecular Psychiatry 30:825–837. https://doi.org/10.1038/s41380-024-02710-6
Luttenbacher I, Phillips A, Kazemi R, Hadipour AL, Sanghvi I, Martinez J, Adamson MM (2022) Transdiagnostic role of glutamate and white matter damage in neuropsychiatric disorders: A Systematic Review. Journal of Psychiatric Research 147:324–348. https://doi.org/10.1016/j.jpsychires.2021.12.042
Mao Y, Fan L, Feng C, Dai Z (2025) Predicting responses of neuromodulation and psychotherapies for major depressive disorder: A coordinate-based meta-analysis of functional magnetic resonance imaging studies. Neuroscience & Biobehavioral Reviews 172:106120. https://doi.org/10.1016/j.neubiorev.2025.106120
Marzola P, Melzer T, Pavesi E, Gil-Mohapel J, Brocardo PS (2023) Exploring the Role of Neuroplasticity in Development, Aging, and Neurodegeneration. Brain Sciences 13:1610. https://doi.org/10.3390/brainsci13121610
Menon V (2011) Large-scale brain networks and psychopathology: a unifying triple network model. Trends in Cognitive Sciences 15:483–506. https://doi.org/10.1016/j.tics.2011.08.003
Pascalau R, Popa Stănilă R, Sfrângeu S, Szabo B (2018) Anatomy of the Limbic White Matter Tracts as Revealed by Fiber Dissection and Tractography. World Neurosurgery 113:e672–e689. https://doi.org/10.1016/j.wneu.2018.02.121
Pasternak O, Kelly S, Sydnor VJ, Shenton ME (2018) Advances in microstructural diffusion neuroimaging for psychiatric disorders. NeuroImage 182:259–282. https://doi.org/10.1016/j.neuroimage.2018.04.051
Schaefer A, Kong R, Gordon EM, Laumann TO, Zuo X-N, Holmes AJ, Eickhoff SB, Yeo BTT (2018) Local-Global Parcellation of the Human Cerebral Cortex from Intrinsic Functional Connectivity MRI. Cerebral Cortex 28:3095–3114. https://doi.org/10.1093/cercor/bhx179
Schmahmann JD, Guell X, Stoodley CJ, Halko MA (2019) The Theory and Neuroscience of Cerebellar Cognition. Annual Review of Neuroscience 42:337–364. https://doi.org/10.1146/annurev-neuro-070918-050258
Smith RE, Tournier J-D, Calamante F, Connelly A (2015) The effects of SIFT on the reproducibility and biological accuracy of the structural connectome. NeuroImage 104:253–265. https://doi.org/10.1016/j.neuroimage.2014.10.004
Smith RE, Tournier J-D, Calamante F, Connelly A (2013) SIFT: Spherical-deconvolution informed filtering of tractograms. NeuroImage 67:298–312. https://doi.org/10.1016/j.neuroimage.2012.11.049
Smith SM, Jenkinson M, Johansen-Berg H, Rueckert D, Nichols TE, Mackay CE, Watkins KE, Ciccarelli O, Cader MZ, Matthews PM, Behrens TEJ (2006) Tract-based spatial statistics: Voxelwise analysis of multi-subject diffusion data. NeuroImage 31:1487–1505. https://doi.org/10.1016/j.neuroimage.2006.02.024
Szegedi A, Jansen WT, van Willigenburg AP, van der Meulen E, Stassen HH, Thase ME (2009) Early Improvement in the First 2 Weeks as a Predictor of Treatment Outcome in Patients With Major Depressive Disorder: A Meta-Analysis Including 6562 Patients. The Journal of Clinical Psychiatry 70:5290. https://doi.org/10.4088/JCP.07m03780
Tian Y, Margulies DS, Breakspear M, Zalesky A (2020) Topographic organization of the human subcortex unveiled with functional connectivity gradients. Nature Neuroscience 23:1421–1432. https://doi.org/10.1038/s41593-020-00711-6
Tokuda T, Yoshimoto J, Shimizu Y, Okada G, Takamura M, Okamoto Y, Yamawaki S, Doya K (2018) Identification of depression subtypes and relevant brain regions using a data-driven approach. Scientific Reports 8:14082. https://doi.org/10.1038/s41598-018-32521-z
Tournier J-D, Smith R, Raffelt D, Tabbara R, Dhollander T, Pietsch M, Christiaens D, Jeurissen B, Yeh C-H, Connelly A (2019) MRtrix3: A fast, flexible and open software framework for medical image processing and visualisation. NeuroImage 202:116137. https://doi.org/10.1016/j.neuroimage.2019.116137
Tustison NJ, Avants BB, Cook PA, Zheng Y, Egan A, Yushkevich PA, Gee JC (2010) N4ITK: Improved N3 Bias Correction. IEEE Transactions on Medical Imaging 29:1310–1320. https://doi.org/10.1109/TMI.2010.2046908
Veraart J, Novikov DS, Christiaens D, Ades-aron B, Sijbers J, Fieremans E (2016) Denoising of diffusion MRI using random matrix theory. NeuroImage 142:394–406. https://doi.org/10.1016/j.neuroimage.2016.08.016
Whelan R, Garavan H (2014) When Optimism Hurts: Inflated Predictions in Psychiatric Neuroimaging. Biological Psychiatry 75:746–748. https://doi.org/10.1016/j.biopsych.2013.05.014
Xia CH, Ma Z, Ciric R, Gu S, Betzel RF, Kaczkurkin AN, Calkins ME, Cook PA, de la Garza A García, Vandekar SN, Cui Z, Moore TM, Roalf DR, Ruparel K, Wolf DH, Davatzikos C, Gur RC, Gur RE, Shinohara RT, Bassett DS, Satterthwaite TD (2018) Linked dimensions of psychopathology and connectivity in functional brain networks. Nature Communications 9:3003. https://doi.org/10.1038/s41467-018-05317-y
Yildiz O, Kabatas S, Yilmaz C, Altinors N, Agaoglu B (2010) Cerebellar mutism syndrome and its relation to cerebellar cognitive and affective function: Review of the literature. Annals of Indian Academy of Neurology 13:23. https://doi.org/10.4103/0972-2327.61272
Zalesky A, Fornito A, Harding IH, Cocchi L, Yücel M, Pantelis C, Bullmore ET (2010) Whole-brain anatomical networks: Does the choice of nodes matter? NeuroImage 50:970–983. https://doi.org/10.1016/j.neuroimage.2009.12.027
Zemmoura I, Burkhardt E, Herbet G (2021) The inferior longitudinal fasciculus: anatomy, function and surgical considerations. Journal of Neurosurgical Sciences 65:590–604. https://doi.org/10.23736/S0390-5616.21.05391-1

Article and author information

Author information

Jiaolong Qin
PCA Lab, Key Lab of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education, School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China
ORCID iD: 0000-0003-1324-6053
- For correspondence: jiaolongq@njust.edu.cn
Xinyi Wang
School of Psychology, Nanjing Normal University, Nanjing, China
Huangjing Ni
School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing, China
Ye Wu
PCA Lab, Key Lab of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education, School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China
Haiyan Liu
Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China, Nanjing Brain Hospital, Medical School of Nanjing University, Nanjing, China
Lingling Hua
Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China, Nanjing Brain Hospital, Medical School of Nanjing University, Nanjing, China
Rui Yan
Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China, Nanjing Brain Hospital, Medical School of Nanjing University, Nanjing, China
Hao Tang
Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China, Nanjing Brain Hospital, Medical School of Nanjing University, Nanjing, China
Peng Zhao
Nanjing Drum Tower Hospital, Nanjing, China
Zhijian Yao
Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China, Nanjing Brain Hospital, Medical School of Nanjing University, Nanjing, China
- For correspondence: zjyao@njmu.edu.cn
Qing Lu
School of Biological Sciences and Medical Engineering, Southeast University, Nanjing, China; Child Development and Learning Science, Key Laboratory of Ministry of Education, Nanjing, China
- For correspondence: luq@seu.edu.cn

Author Notes

Competing interests: No competing interests declared

Funding statement: This work was supported by the National Key R&D Program of China (No. 2023YFF1204803), the Chinese National Science Foundation (No. 81701346, 81871066, 82151315, 82271568, 62201265), the Natural Science Foundation of Jiangsu Province (Grant No. BK20190736), and the Key Project of Jiangsu Provincial Natural Science Fund (No. BK20253028).

Version history

Sent for peer review: December 6, 2025
Preprint posted: December 18, 2025
Reviewed Preprint version 1: March 6, 2026

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.110078. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
