Relationship between cognitive abilities and mental health as represented by cognitive abilities at the neural and genetic levels of analysis

eLife Assessment

This important study examines the relationship between cognition and mental health and investigates how brain, genetics, and environmental measures mediate that relationship. The methods and results are compelling and well-executed. Overall, this study will be of interest in the field of population neuroscience and in studies of mental health.

https://doi.org/10.7554/eLife.105537.3.sa0

Significance of the findings:

Important: Findings that have theoretical or practical implications beyond a single subfield

Landmark
Fundamental
Important
Valuable
Useful

Strength of evidence:

Compelling: Evidence that features methods, data and analyses more rigorous than the current state-of-the-art

Exceptional
Compelling
Convincing
Solid
Incomplete
Inadequate

During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife Assessments

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

Cognitive abilities are closely tied to mental health from early childhood. This study explores how neurobiological units of analysis of cognitive abilities—multimodal neuroimaging and polygenic scores (PGS)—represent this connection. Using data from over 11,000 children (ages 9–10) in the Adolescent Brain Cognitive Development (ABCD) Study, we applied multivariate models to predict cognitive abilities from mental health, neuroimaging, PGS, and environmental factors. Neuroimaging included 45 MRI-derived features (e.g. task/resting-state fMRI, structural MRI, diffusion imaging). Environmental factors encompassed socio-demographics (e.g. parental income/education), lifestyle (e.g. sleep, extracurricular activities), and developmental adverse events (e.g. parental use of alcohol/tobacco, pregnancy complications). Cognitive abilities were predicted by mental health (r = 0.36), neuroimaging (r = 0.54), PGS (r = 0.25), and environmental factors (r = 0.49). Commonality analyses showed that neuroimaging (66%) and PGS (21%) explained most of the cognitive–mental health link. Environmental factors accounted for 63% of the cognitive–mental health link, with neuroimaging and PGS explaining 58% and 21% of this environmental contribution, respectively. These patterns remained consistent over two years. Findings highlight the importance of neurobiological units of analysis for cognitive abilities in understanding the cognitive–mental health connection and its overlap with environmental factors.

Introduction

Cognitive abilities across various domains, such as attention, working memory, declarative memory, verbal fluency, and cognitive control, are often altered in several psychiatric disorders (Millan et al., 2012). This is evident in recent meta-analyses of case-control studies involving patients with mood and anxiety disorders, obsessive-compulsive disorder, posttraumatic stress disorder, and attention-deficit/hyperactivity disorder (ADHD), among others (Abramovitch et al., 2021; East-Richard et al., 2020). Beyond typical case-control studies, the association between cognitive abilities and mental health is also observed when mental health varies from normal to abnormal in normative samples (Morris et al., 2022). For instance, our study Pat et al., 2022a found an association between cognitive abilities and mental health in a relatively large, non-referred sample of 9–10 year-old children from the ABCD study (Casey et al., 2018). In this study, we measured cognitive abilities using behavioural performance across cognitive tasks (Luciana et al., 2018) while measuring mental health using a broad range of emotional and behavioural problems (Achenbach et al., 2017). Thus, cognitive abilities are frequently considered crucial for understanding mental health issues throughout life, beginning in childhood (Abramovitch et al., 2021; Hankin et al., 2016; Morris and Cuthbert, 2012).

According to the National Institute of Mental Health’s Research Domain Criteria (RDoC) framework (Insel et al., 2010), cognitive abilities should be investigated not only behaviourally but also neurobiologically, from the brain to genes. It remains unclear to what extent the relationship between cognitive abilities and mental health is represented in part by different neurobiological units of analysis -- such as neural and genetic levels measured by multimodal neuroimaging and polygenic scores (PGS). To fully comprehend the role of neurobiology in the relationship between cognitive abilities and mental health, we must also consider how these neurobiological units capture variations due to environmental factors, such as socio-demographics, lifestyles, and childhood developmental adverse events (Morris et al., 2022). Our study investigated the extent to which (a) environmental factors explain the relationship between cognitive abilities and mental health, and (b) cognitive abilities at the neural and genetic levels capture these associations due to environmental factors. Specifically, we conducted these investigations in a large normative group of children from the ABCD study (Casey et al., 2018). We chose to examine children because, while their emotional and behavioural problems might not meet full diagnostic criteria (Kessler et al., 2007), issues at a young age often forecast adult psychopathology (Reef et al., 2010; Roza et al., 2003). Moreover, the associations among different emotional and behavioural problems in children reflect transdiagnostic dimensions of psychopathology (Michelini et al., 2019; Pat et al., 2022a), making children an appropriate population to study the transdiagnostic aetiology of mental health, especially within a framework that emphasises normative variation from normal to abnormal, such as the RDoC (Morris et al., 2022).

Recently, several neuroscientists have developed predictive models using neuroimaging data from brain magnetic resonance imaging (MRI) of various modalities in the so-called Brain-Wide Association Studies (BWAS) (Marek et al., 2022; Sui et al., 2020). BWAS aims to create models from MRI data that can accurately predict behavioural phenotypes in participants not included in the model-building process (Dadi et al., 2021). In one of the most extensive BWAS benchmarks to date, Marek et al., 2022 concluded, ‘More robust BWAS effects were detected for functional MRI (versus structural), cognitive tests (versus mental health questionnaires), and multivariate methods (versus univariate).’ This benchmark has significant implications for using neuroimaging as a neural unit of analysis for cognitive abilities. First, while current BWAS may not be robust enough to predict mental health directly, it is more suitable for predicting cognitive abilities (see Zhi et al., 2024 for a similar conclusion). This aligns with the Research Domain Criteria (RDoC) framework, which emphasises neurobiological units of analysis for functional domains, such as cognitive abilities, rather than mental health itself (Cuthbert and Insel, 2013). RDoC’s functional domains capture basic human functioning and include cognitive abilities along with negative/positive valence, arousal, and regulation, and social and sensory processes (Morris et al., 2022). Accordingly, the current study conducted BWAS to capture cognitive abilities rather than mental health.

The second implication of Marek et al., 2022 benchmark is the support it provides for using multivariate algorithms, which draw MRI information simultaneously across regions/voxels, over massively univariate algorithms that draw data from one area/voxel at a time. Similar to Marek et al., 2022 study, which focused on resting-state functional MRI (rs-fMRI), our recent study on task-fMRI also found that multivariate algorithms performed superiorly, up to several folds, in predicting cognitive abilities compared to massively univariate algorithms (Pat et al., 2023). The third implication is that the performance of neuroimaging in predicting cognitive abilities depends on MRI modalities. Previous research has used brain MRI data of different modalities to predict cognitive abilities (Vieira et al., 2022). For instance, many studies have used rs-fMRI, which reflects functional connectivity between regions during rest (Dubois et al., 2018; Keller et al., 2023; Rasero et al., 2021; Sripada et al., 2020; Sripada et al., 2021). Others have utilised structural MRI (sMRI), which reflects anatomical morphology based on thickness, area, and volume in cortical/subcortical areas, and diffusion tensor imaging (DTI), which reflects diffusion distribution within white matter tracts (Mihalik et al., 2019; Rasero et al., 2021). While less common, task-fMRI, which reflects blood-oxygen-level-dependent (BOLD) activity relevant to each task condition, shows relatively good predictive performance, especially from specific contrasts, such as the 2-Back vs 0-Back from the N-Back working-memory task (Barch et al., 2013) nor (Makowski et al., 2024; Pat et al., 2023; Pat et al., 2022b; Sripada et al., 2020; Tetereva et al., 2022; Zhao et al., 2023). A recent meta-analysis estimated the performance of multivariate methods in predicting cognitive abilities from MRI of different modalities at around an out-of-sample r of 0.42 (Vieira et al., 2022). However, we and others found that this predictive performance could be further boosted by drawing information across different MRI modalities, rather than relying on only one modality (Pat et al., 2022b; Rasero et al., 2021; Tetereva et al., 2022; Tetereva and Pat, 2024). Therefore, the current study used opportunistic stacking (Engemann et al., 2020; Pat et al., 2022b). This multivariate modelling technique allowed us to combine information across MRI modalities with the added benefit of handling missing values. With opportunistic stacking, we created a ‘proxy’ measure of cognitive abilities (i.e. predicted value from the model) at the neural unit of analysis using multimodal neuroimaging.

Geneticists, like neuroscientists, have conducted Genome-Wide Association Studies (GWAS) to explore the links between single-nucleotide polymorphisms (SNPs) and various behavioural phenotypes (Bogdan et al., 2018). Similar to BWAS, GWAS can develop predictive models from genetic profiles, resulting in polygenic scores (PGS) that predict behavioural phenotypes in participants not included in the model-building process (Choi et al., 2020). Several large-scale GWAS on cognitive abilities have been conducted, with some studies involving over 250,000 participants (Davies et al., 2018; Lee et al., 2018; Savage et al., 2018). Recently, researchers have used these large-scale GWAS to compute PGS for cognitive abilities and applied these scores to predict cognitive abilities in children (Allegrini et al., 2019; Pat et al., 2022b). For example, Allegrini et al., 2019 found that PGS based on Savage et al.’s (2018) GWAS accounted for approximately 5.3% of the variance in cognitive abilities among 12-year-old children. The current study adopted this approach with children of a similar age in the ABCD study, creating a proxy measure of cognitive abilities at the genetic unit of analysis using PGS.

Environmental factors, broadly defined, significantly influence cognitive abilities (Duyme et al., 1999; Pietschnig and Voracek, 2015). A classic example is the Flynn Effect (Flynn, 1984; Flynn, 2009; Rundquist, 1936; Williams, 2013), which describes the observed rise in cognitive abilities, as measured by various cognitive tasks, across generations in the general population over time, particularly in high-income countries during the 20th century (Pietschnig and Voracek, 2015; Trahan et al., 2014; Wongupparaj et al., 2017). Experts attribute the Flynn Effect to environmental factors such as improved living standards and better education (Baker et al., 2015; Rindermann et al., 2017). Recently, researchers have used multivariate algorithms to create proxy measures of cognitive abilities in children based on environmental factors, similar to approaches used in neuroimaging and polygenic scores (PGS) (Kirlic et al., 2021; Pat et al., 2022b). These environmental factors often include socio-demographic variables (e.g., parental income/education, area deprivation index, parental marital status), lifestyle factors (e.g. screen/video game use, extracurricular activities), and developmental adverse events (e.g. parental use of alcohol/tobacco before and after pregnancy, birth complications). Studies, including ours, Kirlic et al., 2021; Pat et al., 2022b have applied multivariate algorithms to predict cognitive abilities from various environmental factors in the ABCD study (Casey et al., 2018). In these predictive models, parental income/education, area deprivation index, and extracurricular activities are particularly important predictors of cognitive abilities (Kirlic et al., 2021; Pat et al., 2022b). Following this approach, the current study created another proxy measure of cognitive abilities based on socio-demographics, lifestyles, and developmental adverse events.

In this study, inspired by RDoC (Insel et al., 2010), we (a) focused on cognitive abilities as a functional domain, (b) created predictive models to capture the continuous individual variation (as opposed to distinct categories) in cognitive abilities, (c) computed two neurobiological units of analysis of cognitive abilities: multimodal neuroimaging and PGS, and (d) investigated the potential contributions of environmental factors. To operationalise cognitive abilities, we estimated a latent variable representing behavioural performance across various cognitive tasks, commonly referred to as general cognitive ability or the g-factor (Deary, 2012). The g-factor was computed from various cognitive tasks pertinent to RDoC constructs, including attention, working memory, declarative memory, language, and cognitive control. However, using the g-factor to operationalise cognitive abilities caused this study to diverge from the original conceptualisation of RDoC, which emphasises studying separate constructs within cognitive abilities (Morris et al., 2022; Morris and Cuthbert, 2012). Recent studies suggest that including a general factor, such as the g-factor, in the model, rather than treating each construct separately, improved model fit (Beam et al., 2021; Quah et al., 2025). The g-factor in children is also longitudinally stable and can forecast future health outcomes (Calvin et al., 2017; Deary et al., 2013). Notably, our previous research found that neuroimaging predicts the g-factor more accurately than predicting performance from separate individual cognitive tasks (Pat et al., 2023). Accordingly, we decided to conduct predictive models on the g-factor while keeping the RDoC’s holistic, neurobiological, and basic-functioning characteristics.

Using the ABCD study (Casey et al., 2018), we first developed predictive models to estimate the cognitive abilities of unseen children based on their mental health. These models enabled us to quantify the relationship between cognitive abilities and mental health, thereby creating a proxy measure of cognitive abilities derived from mental health data. The mental health variables included children’s emotional and behavioural problems (Achenbach et al., 2017) and temperaments, such as behavioural inhibition/activation (Carver and White, 1994) and impulsivity (Zapolski et al., 2010). These temperaments are linked to externalising and internalising aspects of mental health and are associated with disorders like depression, anxiety, and substance use (Carver and Johnson, 2018; Johnson et al., 2003). Next, we built predictive models of cognitive abilities using neuroimaging, polygenic scores (PGS), and socio-demographic, lifestyle, and developmental adverse event data, resulting in various proxy measures of cognitive abilities. For neuroimaging, we included 45 types of brain MRI data from task-fMRI, rs-fMRI, sMRI, and DTI. For PGS, we used three definitions of cognitive abilities based on previous large-scale GWAS (Davies et al., 2018; Lee et al., 2018; Savage et al., 2018). For socio-demographic, lifestyle, and developmental adverse events, we included 44 features, covering variables such as parental income/education, screen use, and birth/pregnancy complications. Finally, we conducted a series of commonality analyses (Nimon et al., 2008) using these proxy measures of cognitive abilities to address three specific questions. First, we examined the extent to which the relationship between cognitive abilities and mental health was represented in part by cognitive abilities at the neural and genetic levels, as measured by multimodal neuroimaging and PGS, respectively. Second, we assessed the extent to which this relationship was partly explained by environmental factors, as measured by socio-demographic, lifestyle, and developmental adverse events. Third, we tested whether the two neurobiological units of analysis for cognitive abilities, measured by multimodal neuroimaging and PGS, could account for the variance due to environmental factors. To ensure the stability of our results, we repeated the analyses at two time points (ages 9–10 and 11–12).

Results

Predictive modelling

Predicting cognitive abilities from mental health

Figure 1a and Table 1 illustrate the predictive performance of the Partial Least Square (PLS) models in predicting cognitive abilities from mental health features. These features included: (1) emotional and behavioral problems assessed by the Child Behaviour Checklist (CBCL) (Achenbach et al., 2017), and (2) children’s temperaments assessed by the Behavioural Inhibition System/Behavioural Activation System (BIS/BAS) (Carver and White, 1994) and the Urgency, Premeditation, Perseverance, Sensation seeking, and Positive urgency (UPPS-P) impulsive behaviour scale (Zapolski et al., 2010). Using these two sets of mental health features separately resulted in moderate predictive performance, with correlation coefficients ranging from r=0.24 to r=0.31. Combining them into a single set of features, termed ‘mental health,’ improved the performance to approximately r=0.36, consistent across the two time points.

Figure 1

Download asset Open asset

Predictive models, predicting cognitive abilities from mental-health features via Partial Least Square (PLS).

(a) Predictive performance of the models, indicated by scatter plots between observed vs predicted cognitive abilities based on mental health. Cognitive abilities are based on the second-order latent variable, the g-factor, based on a confirmatory factor analysis of six cognitive tasks. All data points are from test sets. r is the average Pearson’s r across 21 test sites. The parentheses following the r indicate bootstrapped 95% CIs, calculated based on observed vs predicted cognitive abilities from all test sites combined. UPPS-P Impulsive and Behaviour Scale and the Behavioural Inhibition System/Behavioural Activation System (BIS/BAS) were used for child temperaments, conceptualised as risk factors for mental issues. Mental health includes features from CBCL and child temperaments. (b) Feature importance of mental health, predicting cognitive abilities via PLS. The features were ordered based on the loading of the first PLS component. Univariate correlations were Pearson’s r between each mental-health feature and cognitive abilities. Error bars reflect 95% CIs of the correlations. CBCL = Child Behavioural Checklist (in green), reflecting children’s emotional and behavioural problems; UPPS-P = Urgency, Premeditation, Perseverance, Sensation seeking, and Positive urgency Impulsive Behaviour Scale; BAS = Behavioural Activation System (in orange).

Table 1

Performance metrics for predictive models, predicting cognitive abilities from mental health, neuroimaging, polygenic scores, and socio-demographics, lifestyles, and developments.

The metrics were averaged across test sites with standard deviations in parentheses.

Features	Correlation	R²	MAE	RMSE
Baseline
Mental Health	0.353 (0.051)	0.124 (0.038)	0.736 (0.019)	0.934 (0.02)
CBCL	0.272 (0.048)	0.074 (0.028)	0.758 (0.014)	0.961 (0.015)
Child personality	0.268 (0.058)	0.071 (0.034)	0.759 (0.019)	0.962 (0.017)
Neuroimaging	0.539 (0.073)	0.291 (0.082)	0.658 (0.039)	0.839 (0.05)
Polygenic scores	0.252 (0.056)	0.02 (0.075)	0.696 (0.055)	0.884 (0.066)
Socio-demo Life Dev Adv	0.486 (0.081)	0.239 (0.084)	0.686 (0.041)	0.87 (0.049)
Follow-up
Mental Health	0.36 (0.07)	0.116 (0.061)	0.715 (0.043)	0.903 (0.051)
CBCL	0.24 (0.056)	0.043 (0.034)	0.746 (0.045)	0.94 (0.053)
Child personality	0.311 (0.076)	0.084 (0.059)	0.728 (0.046)	0.919 (0.051)
Neuroimaging	0.524 (0.097)	0.266 (0.112)	0.645 (0.038)	0.818 (0.053)
Polygenic scores	0.25 (0.075)	0.031 (0.068)	0.672 (0.053)	0.854 (0.068)
Socio-demo Life Dev Adv	0.488 (0.093)	0.226 (0.096)	0.664 (0.044)	0.843 (0.05)

R²=coefficient of determination; MAE = mean-absolute error; RMSE = root mean square error.

Figure 1b illustrates the loadings and the proportion of variance in cognitive abilities explained by each PLS components. The first PLS component accounted for the highest proportion of variance, ranging from 22.3 to 25.7%. This component was primarily influenced by factors such as attention and social problems, rule-breaking and aggressive behaviours and behavioural activation system drive. A similar pattern was observed across both time points.

Predicting cognitive abilities from neuroimaging

Figure 2, Figure 2—figure supplements 1 and 2, and Tables 1–3 illustrate the predictive performance of the opportunistic stacking models in predicting cognitive abilities from 45 sets of neuroimaging features. The predictive performance of each set of neuroimaging features varied significantly, with correlation coefficients ranging from approximately 0 (ENBack: Negative vs. Neutral Face) to around 0.4 (ENBack: 2-Back vs. 0-Back). Combining information from all 45 sets of neuroimaging features into a stacked model improved the performance to approximately r=0.54, consistent across both time points. The stacked model (R² ≈0.29) explained almost twice as much variance in cognitive abilities as the model based on the best single set of neuroimaging features (ENBack: 2-Back vs. 0-Back, R² ≈0.15). Figures 2 and 3, Figure 3—figure supplements 1–11 highlight the feature importance of the opportunistic stacking models. Across both time points, the top contributing neuroimaging features, as indicated by SHAP values, were ENBack task-fMRI contrasts, rs-fMRI, and cortical thickness.

Figure 2 with 2 supplements see all

Download asset Open asset

Predictive models predicting cognitive abilities from neuroimaging via opportunistic stacking and polygenic scores via Elastic Net.

(a) Scatter plots between observed vs predicted cognitive abilities based on neuroimaging and polygenic scores. Cognitive abilities are based on the second-order latent variable, the g-factor, based on a confirmatory factor analysis of six cognitive tasks. The parentheses following the r indicate the bootstrapped 95% CIs, calculated based on observed vs predicted cognitive abilities from all test sites combined. All data points are from test sets. r is the average Pearson’s r across 21 test sites. The parentheses following the r indicate bootstrapped 95% CIs, calculated based on observed vs predicted cognitive abilities from all test sites combined. (b) Feature importance of the stacking layer of neuroimaging, predicting cognitive abilities via Random Forest. For the stacking layer of neuroimaging, the feature importance was based on the absolute value of SHapley Additive exPlanations (SHAP), averaged across test sites. A higher absolute value of SHAP indicates a higher contribution to the prediction. Error bars reflect standard deviations across sites. Different sets of neuroimaging features were filled with different colours: pink for dMRI, orange for fMRI, purple for resting-state functional MRI (rsMRI), and green for structural MRI (sMRI). (c) Feature importance of polygenic scores, predicting cognitive abilities via Elastic Net. For polygenic scores, the feature importance was based on the Elastic Net coefficients, averaged across test sites. We also plotted Pearson’s correlations between each polygenic score and cognitive abilities computed from the full data. Error bars reflect 95% CIs of these correlations.

Figure 3 with 11 supplements see all

Download asset Open asset

Feature importance of each set of neuroimaging features, predicting cognitive abilities in the baseline data.

The feature importance was based on the Elastic Net coefficients, averaged across test sites. We did not order these sets of neuroimaging features according to their contribution to the stacking layer (see Figure 2). Larger versions of the feature importance for each set of neuroimaging features can be found in Figure 3—figure supplements 1–11. MID = Monetary Incentive Delay task; SST = Stop Signal Task; DTI = Diffusion Tensor Imaging; FC = functional connectivity.

Table 2

Performance metrics for predictive models, predicting cognitive abilities from the 45 sets of neuroimaging features in the baseline data.

The metrics were averaged across test sites with standard deviations in parentheses.

Features	Correlation	$R^{2}$	MAE	RMSE
Neuroimaging	0.539 (0.073)	0.291 (0.082)	0.658 (0.039)	0.839 (0.05)
ENback 2back vs 0back	0.393 (0.048)	0.147 (0.042)	0.661 (0.038)	0.841 (0.045)
ENback 2back	0.367 (0.06)	0.128 (0.048)	0.667 (0.036)	0.848 (0.043)
rsfMRI temporal variance	0.3 (0.094)	0.09 (0.054)	0.728 (0.04)	0.921 (0.045)
rsfMRI cortical FC	0.299 (0.055)	0.088 (0.034)	0.734 (0.027)	0.929 (0.032)
ENback emotion	0.277 (0.06)	0.07 (0.041)	0.689 (0.031)	0.876 (0.035)
Cortical thickness	0.265 (0.1)	0.072 (0.055)	0.756 (0.026)	0.96 (0.03)
T2 gray matter avg intensity	0.264 (0.106)	0.069 (0.064)	0.752 (0.032)	0.953 (0.035)
T1 gray matter avg intensity	0.263 (0.103)	0.063 (0.071)	0.761 (0.033)	0.965 (0.039)
ENback 0back	0.261 (0.058)	0.061 (0.038)	0.688 (0.031)	0.878 (0.035)
T1 white matter avg intensity	0.26 (0.103)	0.067 (0.063)	0.76 (0.029)	0.963 (0.035)
rsfMRI subcortical-network FC	0.258 (0.083)	0.066 (0.043)	0.743 (0.033)	0.94 (0.035)
ENback place	0.239 (0.065)	0.049 (0.041)	0.695 (0.032)	0.886 (0.038)
T2 white matter avg intensity	0.238 (0.103)	0.056 (0.056)	0.756 (0.03)	0.96 (0.031)
T2 normalised intensity	0.236 (0.082)	0.057 (0.041)	0.755 (0.021)	0.96 (0.024)
DTI	0.23 (0.074)	0.042 (0.048)	0.762 (0.027)	0.967 (0.029)
Cortical volume	0.228 (0.095)	0.053 (0.044)	0.767 (0.02)	0.971 (0.024)
MID Small Rew vs Neu anticipation	0.223 (0.049)	0.048 (0.022)	0.743 (0.017)	0.938 (0.02)
Cortical area	0.218 (0.101)	0.049 (0.046)	0.768 (0.021)	0.973 (0.025)
T1 normalised intensity	0.215 (0.109)	0.047 (0.049)	0.769 (0.022)	0.974 (0.028)
MID Reward vs Neutral anticipation	0.214 (0.062)	0.043 (0.028)	0.745 (0.022)	0.944 (0.024)
MID Loss vs Neutral anticipation	0.214 (0.075)	0.043 (0.034)	0.745 (0.025)	0.944 (0.028)
MID Small Loss vs Neu anticipation	0.203 (0.073)	0.038 (0.03)	0.747 (0.026)	0.945 (0.026)
MID Pos vs Neg Punishment Feedback	0.202 (0.066)	0.037 (0.027)	0.745 (0.021)	0.945 (0.026)
T1 subcortical avg intensity	0.2 (0.087)	0.037 (0.043)	0.773 (0.023)	0.979 (0.026)
MID Large Rew vs Neu anticipation	0.2 (0.072)	0.037 (0.03)	0.747 (0.021)	0.946 (0.024)
MID Pos vs Neg Reward Feedback	0.198 (0.05)	0.036 (0.02)	0.748 (0.022)	0.945 (0.028)
T1 summations	0.196 (0.08)	0.009 (0.059)	0.784 (0.029)	0.992 (0.033)
Sulcal depth	0.18 (0.095)	0.032 (0.039)	0.777 (0.02)	0.984 (0.026)
MID Large Loss vs Neu anticipation	0.173 (0.066)	0.026 (0.026)	0.749 (0.022)	0.95 (0.025)
subcortical volume	0.17 (0.078)	0.028 (0.029)	0.775 (0.018)	0.982 (0.021)
SST Any Stop vs Correct Go	0.164 (0.065)	0.022 (0.025)	0.736 (0.038)	0.935 (0.043)
T2 subcortical avg intensity	0.158 (0.057)	0.023 (0.023)	0.77 (0.018)	0.977 (0.02)
ENback Face vs Place	0.148 (0.076)	0.014 (0.028)	0.712 (0.027)	0.904 (0.034)
SST Incorrect Stop vs Correct Go	0.147 (0.059)	0.017 (0.02)	0.738 (0.035)	0.937 (0.04)
SST Correct Stop vs Correct Go	0.145 (0.056)	0.017 (0.018)	0.739 (0.033)	0.936 (0.038)
SST Correct Go vs Fixation	0.145 (0.053)	0.017 (0.017)	0.74 (0.033)	0.938 (0.036)
MID Large Rew vs Small anticipation	0.133 (0.05)	0.015 (0.014)	0.757 (0.022)	0.956 (0.025)
T2 summations	0.114 (0.053)	0.008 (0.022)	0.777 (0.018)	0.984 (0.016)
SST Incorrect Go vs Correct Go	0.11 (0.061)	0.008 (0.015)	0.744 (0.034)	0.94 (0.038)
SST Correct Stop vs Incorrect Stop	0.096 (0.068)	0.005 (0.018)	0.744 (0.033)	0.943 (0.036)
MID Large vs Small Loss anticipation	0.093 (0.063)	0.006 (0.014)	0.756 (0.024)	0.96 (0.026)
SST Incorrect Go vs Incorrect Stop	0.061 (0.039)	0 (0.008)	0.744 (0.032)	0.943 (0.036)
ENback Positive vs Neutral Face	0.024 (0.06)	–0.007 (0.012)	0.716 (0.027)	0.908 (0.034)
ENback Emotion vs Neutral Face	0.019 (0.058)	–0.007 (0.01)	0.716 (0.026)	0.908 (0.033)
ENback Negative vs Neutral Face	0.002 (0.058)	–0.007 (0.009)	0.718 (0.024)	0.911 (0.03)

R²=coefficient of determination; MAE = mean-absolute error; RMSE = root mean square error.

Table 3

Performance metrics for predictive models, predicting cognitive abilities from the 45 sets of neuroimaging features in the follow-up data.

Features	Correlation	$R^{2}$	MAE	RMSE
Neuroimaging	0.524 (0.097)	0.266 (0.112)	0.645 (0.038)	0.818 (0.053)
ENback 2back vs 0back	0.402 (0.092)	0.15 (0.075)	0.671 (0.032)	0.844 (0.041)
ENback 2back	0.39 (0.083)	0.14 (0.071)	0.676 (0.036)	0.848 (0.045)
ENback place	0.32 (0.073)	0.089 (0.049)	0.695 (0.038)	0.874 (0.047)
ENback emotion	0.319 (0.076)	0.089 (0.05)	0.696 (0.04)	0.876 (0.047)
rsfMRI cortical FC	0.309 (0.093)	0.081 (0.071)	0.718 (0.037)	0.908 (0.046)
ENback 0back	0.299 (0.078)	0.077 (0.057)	0.7 (0.045)	0.881 (0.052)
rsfMRI temporal variance	0.297 (0.111)	0.077 (0.071)	0.718 (0.045)	0.903 (0.052)
rsfMRI subcortical-network FC	0.265 (0.092)	0.056 (0.059)	0.732 (0.039)	0.92 (0.048)
Cortical thickness	0.259 (0.106)	0.055 (0.062)	0.738 (0.034)	0.932 (0.041)
Cortical volume	0.243 (0.091)	0.046 (0.049)	0.744 (0.034)	0.936 (0.039)
T1 white matter avg intensity	0.243 (0.09)	0.044 (0.057)	0.742 (0.035)	0.937 (0.042)
T1 gray matter avg intensity	0.241 (0.105)	0.04 (0.069)	0.742 (0.039)	0.939 (0.047)
Cortical area	0.233 (0.092)	0.041 (0.05)	0.746 (0.032)	0.939 (0.04)
T2 gray matter avg intensity	0.226 (0.112)	0.04 (0.064)	0.743 (0.037)	0.939 (0.049)
DTI	0.218 (0.065)	0.022 (0.052)	0.747 (0.034)	0.944 (0.041)
T2 white matter avg intensity	0.213 (0.099)	0.033 (0.057)	0.747 (0.036)	0.942 (0.045)
T1 summations	0.213 (0.062)	0.011 (0.046)	0.756 (0.039)	0.954 (0.044)
MID Pos vs Neg Punish Feedback	0.208 (0.058)	0.025 (0.033)	0.743 (0.044)	0.933 (0.049)
MID Pos vs Neg Reward Feedback	0.196 (0.071)	0.021 (0.042)	0.742 (0.038)	0.933 (0.042)
T2 normalised intensity	0.195 (0.077)	0.025 (0.035)	0.749 (0.039)	0.946 (0.045)
T1 subcortical avg intensity	0.191 (0.094)	0.002 (0.083)	0.759 (0.039)	0.957 (0.046)
sulcal depth	0.185 (0.087)	0.018 (0.048)	0.756 (0.034)	0.95 (0.043)
MID Reward vs Neutral anticipation	0.185 (0.078)	0.016 (0.039)	0.746 (0.037)	0.937 (0.04)
SST Any Stop vs Correct Go	0.184 (0.079)	0.018 (0.034)	0.745 (0.047)	0.934 (0.054)
T1 normalised intensity	0.181 (0.077)	0.018 (0.036)	0.752 (0.038)	0.95 (0.045)
ENback Face vs Place	0.179 (0.075)	0.019 (0.03)	0.721 (0.039)	0.907 (0.044)
subcortical volume	0.178 (0.062)	0.016 (0.032)	0.752 (0.036)	0.949 (0.041)
SST Correct Stop vs Correct Go	0.175 (0.062)	0.015 (0.026)	0.746 (0.048)	0.936 (0.053)
MID Large Rew vs Neu anticipation	0.172 (0.055)	0.012 (0.028)	0.747 (0.04)	0.939 (0.044)
SST Incorrect Stop vs Correct Go	0.17 (0.085)	0.015 (0.032)	0.746 (0.051)	0.936 (0.059)
T2 subcortical avg intensity	0.157 (0.085)	0.011 (0.033)	0.755 (0.039)	0.952 (0.043)
MID Small Rew vs Neu anticipation	0.154 (0.086)	0.007 (0.04)	0.75 (0.04)	0.941 (0.044)
MID Loss vs Neutral anticipation	0.147 (0.07)	0.004 (0.024)	0.75 (0.04)	0.942 (0.043)
SST Correct Go vs Fixation	0.138 (0.065)	0.005 (0.026)	0.749 (0.046)	0.938 (0.054)
SST Incorrect Go vs Correct Go	0.122 (0.072)	0.001 (0.03)	0.752 (0.053)	0.944 (0.059)
MID Large Loss vs Neu anticipation	0.121 (0.074)	–0.004 (0.03)	0.752 (0.04)	0.942 (0.044)
T2 summations	0.116 (0.07)	–0.003 (0.029)	0.763 (0.041)	0.96 (0.048)
MID Small Loss vs Neu Anticipation	0.106 (0.071)	–0.005 (0.021)	0.755 (0.041)	0.948 (0.044)
SST Correct Stop vs Incorrect Stop	0.09 (0.086)	–0.006 (0.023)	0.754 (0.049)	0.947 (0.057)
MID Large vs Small Loss Anticipation	0.064 (0.07)	–0.012 (0.025)	0.756 (0.043)	0.948 (0.048)
MID Large vs Small Rew anticipation	0.063 (0.059)	–0.012 (0.018)	0.759 (0.042)	0.952 (0.046)
SST Incorrect Go vs Incorrect Stop	0.038 (0.067)	–0.014 (0.019)	0.756 (0.052)	0.95 (0.059)
ENback Positive vs Neutral Face	0.006 (0.069)	–0.013 (0.018)	0.732 (0.037)	0.919 (0.044)
ENback Negative vs Neutral Face	–0.012 (0.031)	–0.012 (0.015)	0.735 (0.039)	0.923 (0.043)
ENback Emotion vs Neutral Face	–0.027 (0.067)	–0.014 (0.016)	0.733 (0.038)	0.921 (0.045)

The metrics were averaged across test sites with standard deviations in parentheses. R²=coefficient of determination; MAE = mean-absolute error; RMSE = root mean square error.

Predicting cognitive abilities from polygenic scores

Figure 2a and Table 1 illustrate the predictive performance of the Elastic Net models in predicting cognitive abilities using three polygenic scores (PGSs). The predictive accuracy of these PGSs was r=0.25 at baseline and r=0.25 at follow-up. (Figure 2c) highlights the feature importance within these models, indicating a stronger contribution from the PGS based on Savage et al., 2018 GWAS.

Predicting cognitive abilities from socio-demographics, lifestyles, and developmental adverse events

Figure 4a and Table 1 illustrate the predictive performance of the PLS models in predicting cognitive abilities from socio-demographics, lifestyles, and developmental adverse events. Using 44 features covering these areas, the predictive performance was around r=0.49, consistent across the two time points. (Figure 4b) shows the loadings and the proportion of variance explained by these PLS models. The first PLS component accounted for the highest proportion of variance (around 10%).

Figure 4

Download asset Open asset

Predictive models, predicting cognitive abilities from socio-demographics, lifestyles, and developmental adverse events via Partial Least Square (PLS).

(a) Scatter plots between observed vs predicted cognitive abilities based on socio-demographics, lifestyles, and developmental adverse events. Cognitive abilities are based on the second-order latent variable, the g-factor, based on a confirmatory factor analysis of six cognitive tasks. All data points are from test sets. r is the average Pearson’s r across 21 test sites. The parentheses following the r indicate bootstrapped 95% CIs, calculated based on observed vs predicted cognitive abilities from all test sites combined. (b) Feature importance of socio-demographics, lifestyles, and developmental adverse events, predicting cognitive abilities via Partial Least Square. The features were ordered based on the loading of the first component. Univariate correlations were Pearson’s correlation between each feature and cognitive abilities. Error bars reflect 95% CIs of the correlations. Different types of environmental factors were filled with different colours: orange for socio-demographics, purple for developmental adverse events and green for lifestyle. A dashed horizontal line in the follow-up feature importance figure distinguishes whether the variables were collected at baseline or follow-up.

Based on its loadings, this first component was: (a) Positively influenced by features such as parental income and education, neighbourhood safety, and extracurricular activities, (b) Negatively influenced by features such as area deprivation, having a single parent, screen use, economic insecurities, lack of sleep, playing mature video games, watching mature movies, and lead exposure.

Commonality analyses

We separately conducted the four sets of commonality analyses.

Commonality analyses for proxy measures of cognitive abilities based on mental health and neuroimaging

At baseline, having both proxy measures based on mental health and neuroimaging in a linear mixed model explained 27% of the variance in cognitive abilities. Specifically, 9.8% of the variance in cognitive abilities was explained by mental health, which included the common effect between the two proxy measures (6.48%) and the unique effect of mental health (3.32%) (see Tables 4–5 and Figure 5). This indicates that 66% of the relationship between cognitive abilities and mental health, i.e., (6.48 ÷ 9.8)×100, was shared with neuroimaging. The common effects varied considerably across different sets of neuroimaging features, ranging from approximately 0.08 to 2.78%, with the highest being the ENBack task fMRI: 2-Back vs. 0-Back (see Figure 5—figure supplement 1). The pattern of results was consistent across both time points.

Table 4

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health and/or neuroimaging as regressors to explain cognitive abilities across test sites in the baseline.

Response	Cognitive abilities			Cognitive abilities			Cognitive abilities
Regressors	Estimates	CI	p	Estimates	CI	p	Estimates	CI	p
(Intercept)	0.02	–0.00–0.03	0.058	0.02	–0.00–0.04	0.057	0.02	–0.00–0.03	0.067
mental savg	0.00	–0.02–0.02	0.895	0.00	–0.02–0.02	0.985
mental cws	0.19	0.17–0.20	<0.001	0.31	0.29–0.33	<0.001
neuroimaging savg	–0.01	–0.02–0.01	0.507				–0.01	–0.02–0.01	0.523
neuroimaging cws	0.43	0.41–0.44	<0.001				0.48	0.47–0.50	<0.001
Random Effects
σ²	0.55			0.54			0.57
τ₀₀	0.17 _{SITE_ID_L:REL_FAMILY_ID}			0.35 _{SITE_ID_L:REL_FAMILY_ID}			0.18 _{SITE_ID_L:REL_FAMILY_ID}
ICC	0.24			0.39			0.24
N	21 _{SITE_ID_L}			21 _{SITE_ID_L}			21 _{SITE_ID_L}
	9001 _{REL_FAMILY_ID}			9001 _{REL_FAMILY_ID}			9001 _{REL_FAMILY_ID}
Observations	10728			10728			10728
Marginal R²	0.272			0.098			0.238
Conditional R²	0.444			0.452			0.423

cws = values centred within each site; savg = values averaged within each site.

Table 5

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health and/or neuroimaging as regressors to explain cognitive abilities across test sites in the follow-up.

Response	Cognitive abilities			Cognitive abilities			Cognitive abilities
Regressors	Estimates	CI	p	Estimates	CI	p	Estimates	CI	p
(Intercept)	0.82	0.80–0.84	<0.001	0.82	0.80–0.85	<0.001	0.82	0.80–0.84	<0.001
mental savg	0.02	0.00–0.04	0.047	0.02	0.00–0.05	0.037
mental cws	0.19	0.17–0.21	<0.001	0.31	0.29–0.33	<0.001
neuroimaging savg	0.02	0.00–0.05	0.021				0.03	0.01–0.05	0.012
neuroimaging cws	0.42	0.40–0.44	<0.001				0.47	0.45–0.49	<0.001
Random Effects
σ²	0.41			0.45			0.42
τ₀₀	0.24 _{SITE_ID_L:REL_FAMILY_ID}			0.37 _{SITE_ID_L:REL_FAMILY_ID}			0.27 _{SITE_ID_L:REL_FAMILY_ID}
ICC	0.37			0.46			0.40
N	21 _{SITE_ID_L}			21 _{SITE_ID_L}			21 _{SITE_ID_L}
	5434 _{REL_FAMILY_ID}			5434 _{REL_FAMILY_ID}			5434 _{REL_FAMILY_ID}
Observations	6315			6315			6315
Marginal R²	0.286			0.104			0.245
Conditional R²	0.552			0.513			0.545

cws = values centred within each site; savg = values averaged within each site.

Figure 5 with 1 supplement see all

Download asset Open asset

Venn diagrams showing common and unique effects of proxy measures of cognitive abilities based on mental health, neuroimaging, polygenic scores, and/or socio-demographics, lifestyles and developmental adverse events in explaining cognitive abilities across test sites.

We computed the common and unique effects in % based on the marginal $R^{2}$ of four sets of linear-mixed models.

Commonality analyses for proxy measures of cognitive abilities based on mental health and PGSs

At baseline, having both proxy measures based on mental health and PGSs in a linear mixed model explained 11.8% of the variance in cognitive abilities. Specifically, 9.21% of the variance in cognitive abilities was explained by mental health, which included the common effect between the two proxy measures (1.93%) and the unique effect of mental health (7.28%) (see Tables 6–7 and Figure 5). This indicates that 21% of the relationship between cognitive abilities and mental health, i.e., (1.93 ÷ 9.21) × 100, was shared with PGSs. The pattern of results was consistent across both time points.

Table 6

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health and/or polygenic scores as regressors to explain cognitive abilities across test sites in the baseline.

Response	Cognitive abilities			Cognitive abilities			Cognitive abilities
Regressors	Estimates	CI	p	Estimates	CI	p	Estimates	CI	p
(Intercept)	0.23	0.21–0.26	<0.001	0.23	0.21–0.25	<0.001	0.23	0.21–0.26	<0.001
mental savg	0.06	0.02–0.09	0.004	0.13	0.10–0.15	<0.001
mental cws	0.25	0.23–0.27	<0.001	0.25	0.23–0.27	<0.001
PGS savg favg	–0.08	–0.12 to –0.05	<0.001				–0.13	–0.15 to –0.10	<0.001
PGS cws cwf	0.05	0.03–0.07	<0.001				0.06	0.04–0.08	<0.001
Random Effects
σ²	0.51			0.52			0.53
τ₀₀	0.27 _{SITE_ID_L:REL_FAMILY_ID}			0.26 _{SITE_ID_L:REL_FAMILY_ID}			0.32 _{SITE_ID_L:REL_FAMILY_ID}
ICC	0.34			0.33			0.38
N	21 _{SITE_ID_L}			21 _{SITE_ID_L}			21 _{SITE_ID_L}
	4734 _{REL_FAMILY_ID}			4734 _{REL_FAMILY_ID}			4734 _{REL_FAMILY_ID}
Observations	5766			5766			5766
Marginal R²	0.098			0.092			0.026
Conditional R²	0.408			0.394			0.394

cws = values centred within each site; savg = values averaged within each site; cws,cwf = values centred within each family first and then within each site; savg,favg = values averaged within each family first and then within each site. PGS = polygenic scores.

Table 7

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health and/or polygenic scores as regressors to explain cognitive abilities across test sites in the follow-up.

Response	Cognitive abilities			Cognitive abilities			Cognitive abilities
Predictors	Estimates	CI	p	Estimates	CI	p	Estimates	CI	p
(Intercept)	1.06	1.03–1.09	<0.001	1.06	1.03–1.09	<0.001	1.06	1.03–1.09	<0.001
mental savg	0.03	–0.00–0.07	0.063	0.07	0.05–0.10	<0.001
mental cws	0.22	0.19–0.25	<0.001	0.22	0.20–0.25	<0.001
PGS savg favg	–0.07	–0.10 to –0.04	<0.001				–0.09	–0.12 to –0.06	<0.001
PGS cws cwf	0.04	0.02–0.06	<0.001				0.05	0.03–0.07	<0.001
Random Effects
σ²	0.42			0.43			0.43
τ₀₀	0.32 _{SITE_ID_L:REL_FAMILY_ID}			0.31 _{SITE_ID_L:REL_FAMILY_ID}			0.37 _{SITE_ID_L:REL_FAMILY_ID}
ICC	0.43			0.42			0.46
N	21 _{SITE_ID_L}			21 _{SITE_ID_L}			21 _{SITE_ID_L}
	3370 _{REL_FAMILY_ID}			3370 _{REL_FAMILY_ID}			3370 _{REL_FAMILY_ID}
Observations	4036			4036			4036
Marginal R²	0.075			0.068			0.013
Conditional R²	0.470			0.460			0.469

cws = values centred within each site; savg = values averaged within each site; cws,cwf = values centred within each family first and then within each site; savg,favg = values averaged within each family first and then within each site. PGS = polygenic scores.

Commonality analyses for proxy measures of cognitive abilities based on mental health and socio-demographics, lifestyles, and developmental adverse events

At baseline, having both proxy measures based on mental health and socio-demographics, lifestyles, and developmental adverse events in a linear mixed model explained 24.9% of the variance in cognitive abilities. Specifically, 9.75% of the variance in cognitive abilities was explained by mental health, which included the common effect between the two proxy measures (6.12%) and the unique effect of mental health (3.63%) (see Tables 8–9 and Figure 5). This indicates that over 63% of the relationship between cognitive abilities and mental health, i.e., (6.12 ÷ 9.75) × 100, was shared with socio-demographics, lifestyles, and developmental adverse events. The pattern of results was consistent across both time points.

Table 8

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health and/or socio-demographics, lifestyles, and developmental adverse events as regressors to explain cognitive abilities across test sites in the baseline.

Response	Cognitive abilities			Cognitive abilities			Cognitive abilities
Regressors	Estimates	CI	p	Estimates	CI	p	Estimates	CI	p
(Intercept)	0.01	–0.01–0.02	0.525	0.01	–0.01–0.03	0.385	0.01	–0.01–0.02	0.558
mental savg	–0.00	–0.02–0.02	0.917	–0.00	–0.02–0.02	0.930
mental cws	0.20	0.18–0.22	<0.001	0.31	0.29–0.33	<0.001
sdl savg	0.00	–0.02–0.02	0.819				0.00	–0.01–0.02	0.792
sdl cws	0.40	0.38–0.41	<0.001				0.46	0.44–0.48	<0.001
Random Effects
σ²	0.52			0.53			0.54
τ₀₀	0.22 _{SITE_ID_L:REL_FAMILY_ID}			0.35 _{SITE_ID_L:REL_FAMILY_ID}			0.24 _{SITE_ID_L:REL_FAMILY_ID}
ICC	0.30			0.40			0.31
N	21 _{SITE_ID_L}			21 _{SITE_ID_L}			21 _{SITE_ID_L}
	9390 _{REL_FAMILY_ID}			9390 _{REL_FAMILY_ID}			9390 _{REL_FAMILY_ID}
Observations	11294			11294			11294
Marginal R²	0.249			0.098			0.213
Conditional R²	0.474			0.458			0.456

cws = values centred within each site; savg = values averaged within each site; sdl = socio-demographics, lifestyles and developmental adverse events.

Table 9

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health and/or socio-demographics, lifestyles and developmental adverse events as regressors to explain cognitive abilities across test sites in the follow-up.

Response	Cognitive abilities			Cognitive abilities			Cognitive abilities
Regressors	Estimates	CI	p	Estimates	CI	p	Estimates	CI	p
(Intercept)	0.83	0.81–0.85	<0.001	0.83	0.81–0.86	<0.001	0.83	0.81–0.85	<0.001
mental savg	0.01	–0.01–0.03	0.185	0.01	–0.01–0.04	0.198
mental cws	0.20	0.18–0.22	<0.001	0.30	0.28–0.32	<0.001
sdl savg	0.00	–0.02–0.02	0.957				0.00	–0.02–0.02	0.757
sdl cws	0.39	0.37–0.41	<0.001				0.44	0.42–0.47	<0.001
Random Effects
σ²	0.42			0.45			0.43
τ₀₀	0.27 _{SITE_ID_L:REL_FAMILY_ID}			0.37 _{SITE_ID_L:REL_FAMILY_ID}			0.30 _{SITE_ID_L:REL_FAMILY_ID}
ICC	0.39			0.45			0.41
N	21 _{SITE_ID_L}			21 _{SITE_ID_L}			21 _{SITE_ID_L}
	6217 _{REL_FAMILY_ID}			6217 _{REL_FAMILY_ID}			6217 _{REL_FAMILY_ID}
Observations	7382			7382			7382
Marginal R²	0.256			0.099			0.213
Conditional R²	0.543			0.508			0.535

cws = values centred within each site; savg = values averaged within each site; sdl = socio-demographics, lifestyles and developmental adverse events.

Commonality analyses for proxy measures of cognitive abilities based on mental health, neuroimaging, PGSs and socio-demographics, lifestyles, and developmental adverse events

At baseline, having all four proxy measures based on mental health, neuroimaging, PGSs, and socio-demographics, lifestyles, and developmental adverse events in a linear mixed model explained 24.2% of the variance in cognitive abilities. Of the 8.97% of the variance in cognitive abilities explained by mental health, 7.05% represented common effects with the other proxy measures. This indicates that 79%, i.e., (7.05 ÷ 8.97) × 100, of the relationship between cognitive abilities and mental health was shared with the three other proxy measures (see Tables 10–11 and Figure 5). Additionally, among the variance that socio-demographics, lifestyles, and developmental adverse events accounted for in the relationship between cognitive abilities and mental health, neuroimaging could capture 58%, while PGSs could capture 21%. The pattern of results was consistent across both time points.

Table 10

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health, neuroimaging, polygenic scores and/or socio-demographics, lifestyles and developmental adverse events as regressors to explain cognitive abilities across test sites in the baseline.

Response	Cognitive abilities			Cognitive abilities
Regressors	Estimates	CI	p	Estimates	CI	p
(Intercept)	0.24	0.21–0.26	<0.001	0.24	0.21–0.26	<0.001
mental savg	0.00	–0.05–0.05	0.975	0.09	0.05–0.12	<0.001
mental cws	0.14	0.11–0.16	<0.001	0.18	0.15–0.20	<0.001
neuroimaging savg	0.01	–0.03–0.05	0.533	0.05	0.01–0.09	0.006
neuroimaging cws	0.26	0.24–0.29	<0.001	0.31	0.28–0.33	<0.001
PGS savg favg	–0.04	–0.08–0.00	0.070
PGS cws cwf	0.05	0.03–0.07	<0.001
sdl savg	0.09	0.03–0.16	0.006
sdl cws	0.18	0.16–0.21	<0.001
σ²	0.50			0.52
τ₀₀	0.15 _{SITE_ID_L:REL_FAMILY_ID}			0.17 _{SITE_ID_L:REL_FAMILY_ID}
ICC	0.23			0.25
N	21 _{SITE_ID_L}			21 _{SITE_ID_L}
Observations	5520			5520
Marginal R²	0.241			0.197
Conditional R²	0.416			0.395
Regressors	Estimates	CI	p	Estimates	CI	p
(Intercept)	0.24	0.21–0.26	<0.001	0.24	0.21–0.26	<0.001
mental savg	0.06	0.03–0.10	0.001	0.00	–0.04–0.05	0.890
mental cws	0.24	0.22–0.27	<0.001	0.19	0.16–0.21	<0.001
neuroimaging savg
neuroimaging cws
PGS savg favg	–0.08	–0.12 to –0.05	<0.001
PGS cws cwf	0.06	0.04–0.08	<0.001
sdl savg				0.14	0.09–0.19	<0.001
sdl cws				0.25	0.22–0.27	<0.001
σ²	0.51			0.52
τ₀₀	0.27 _{SITE_ID_L:REL_FAMILY_ID}			0.20 _{SITE_ID_L:REL_FAMILY_ID}
ICC	0.34			0.28
N	21 _{SITE_ID_L}			21 _{SITE_ID_L}
	4571 _{REL_FAMILY_ID}			4571 _{REL_FAMILY_ID}
Observations	5520			5520
Marginal R²	0.097			0.163
Conditional R²	0.408			0.395

cws = values centred within each site; savg = values averaged within each site; cws,cwf = values centred within each family first and then within each site; savg,favg = values averaged within each family first and then within each site; PGS = polygenic scores; sdl = socio-demographics, lifestyles and developmental adverse events.

Table 11

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health, neuroimaging, polygenic scores and/or socio-demographics, lifestyles, and developmental adverse events as regressors to explain cognitive abilities across test sites in the follow-up.

Response	Cognitive abilities			Cognitive abilities
Regressors	Estimates	CI	p	Estimates	CI	p
(Intercept)	1.05	1.02–1.08	<0.001	1.05	1.02–1.08	<0.001
mental savg	0.05	–0.01–0.10	0.100	0.06	0.03–0.10	<0.001
mental cws	0.13	0.11–0.16	<0.001	0.17	0.14–0.20	<0.001
neuroimaging savg	0.00	–0.06–0.06	0.935	0.03	–0.01–0.06	0.146
neuroimaging cws	0.27	0.24–0.30	<0.001	0.31	0.28–0.33	<0.001
PGS savg favg	0.00	–0.03–0.04	0.833
PGS cws cwf	0.04	0.02–0.06	<0.001
sdl savg	0.04	–0.04–0.12	0.349
sdl cws	0.20	0.17–0.23	<0.001
σ²	0.38			0.40
τ₀₀	0.23 _{SITE_ID_L:REL_FAMILY_ID}			0.25 _{SITE_ID_L:REL_FAMILY_ID}
ICC	0.38			0.39
N	21 _{SITE_ID_L}			21 _{SITE_ID_L}
	2930 _{REL_FAMILY_ID}			2930 _{REL_FAMILY_ID}
Observations	3423			3423
Marginal R²	0.242			0.190
Conditional R²	0.527			0.506
Regressors	Estimates	CI	p	Estimates	CI	p
(Intercept)	1.05	1.02–1.08	<0.001	1.05	1.02–1.08	<0.001
mental savg	0.08	0.04–0.11	<0.001	0.05	–0.00–0.10	0.074
mental cws	0.23	0.20–0.26	<0.001	0.18	0.15–0.21	<0.001
neuroimaging savg
neuroimaging cws
PGS savg favg	0.00	- 0.03–0.04	0.844
PGS cws cwf	0.05	0.03–0.07	<0.001
PGS savg favg	0.00	- 0.03–0.04	0.844
PGS cws cwf	0.05	0.03–0.07	<0.001
sdl savg				0.04	–0.01–0.09	0.092
sdl cws				0.25	0.22–0.28	<0.001
σ²	0.41			0.42
τ₀₀	0.33 _{SITE_ID_L:REL_FAMILY_ID}			0.27 _{SITE_ID_L:REL_FAMILY_ID}
ICC	0.45			0.39
N	21 _{SITE_ID_L}			21 _{SITE_ID_L}
	2930 _{REL_FAMILY_ID}			2930 _{REL_FAMILY_ID}
Observations	3423			3423
Marginal R²	0.076			0.153
Conditional R²	0.491			0.486

cws = values centred within each site; savg = values averaged within each site; cws,cwf = values centred within each family first and then within each site; savg,favg = values averaged within each family first and then within each site; PGS = polygenic scores; sdl = socio-demographics, lifestyles and developmental adverse events.

Discussion

We aim to understand the extent to which the relationship between cognitive abilities and mental health is represented in part by cognitive abilities at the neural and genetic levels of analysis. We began by quantifying the relationship between cognitive abilities and mental health, finding a medium-sized out-of-sample correlation of approximately r=0.36. This relationship was shared with neuroimaging (66% at baseline) and PGS (21% at baseline), based on two separate sets of commonality analyses. This suggests the significant roles of these two neurobiological units of analysis in shaping the relationship between cognitive abilities and mental health (Morris and Cuthbert, 2012). We also found that the relationship between cognitive abilities and mental health was partly shared with environmental factors, as measured by socio-demographics, lifestyles, and developmental adverse events (63% at baseline). In another set of commonality analysis, this variance due to socio-demographics, lifestyles, and developmental adverse events was explained by neuroimaging and PGS at 58% and 21%, respectively, at baseline. Accordingly, the neurobiological units of analysis for cognitive abilities captured the environmental factors, consistent with RDoC’s viewpoint (Morris et al., 2022). Notably, this pattern of results remained stable over two years in early adolescence.

Our predictive modelling revealed a medium-sized predictive relationship between cognitive abilities and mental health. This finding aligns with recent meta-analyses of case-control studies that link cognitive abilities and mental disorders across various psychiatric conditions (Abramovitch et al., 2021; East-Richard et al., 2020). Unlike previous studies, we estimated the predictive, out-of-sample relationship between cognitive abilities and mental disorders in a large normative sample of children. Although our predictive models, like other cross-sectional models, cannot determine the directionality of the effects, the strength of the relationship between cognitive abilities and mental health estimated here should be more robust than when calculated using the same sample as the model itself, known as in-sample prediction/association (Marek et al., 2022; Yarkoni and Westfall, 2017). Examining the PLS loadings of our predictive models revealed that the relationship was driven by various aspects of mental health, including thought and externalising symptoms, as well as motivation. This suggests that there are multiple pathways—encompassing a broad range of emotional and behavioural problems and temperaments—through which cognitive abilities and mental health are linked.

Our predictive modelling created proxy measures of cognitive abilities based on two neurobiological units of analysis: neuroimaging and PGS (Morris and Cuthbert, 2012). For neuroimaging, inspired by recent BWAS benchmarks (Engemann et al., 2020; Marek et al., 2022), we used a multivariate modelling technique called opportunistic stacking, which integrates information across various MRI features and modalities. Combining 45 sets of neuroimaging features resulted in relatively high predictive performance (out-of-sample r=0.54 at baseline), compared to using any single set. This finding aligns with previous research that pooled multiple neuroimaging modalities (Engemann et al., 2020; Rasero et al., 2021; Tetereva et al., 2022). This level of predictive performance is numerically higher than that found in a recent meta-analysis, which mainly included studies using only one set of neuroimaging features, with an r of 0.42 (Vieira et al., 2022). Moreover, this performance level in predicting cognitive abilities is nearly the same as our previous attempt using a similar stacking technique to integrate MRI modalities in young adult samples from the Human Connectome Project (HCP) (Van Essen et al., 2013), which achieved an out-of-sample r=0.57 (Tetereva et al., 2022). Similarly, in the current study, the top contributing set of neuroimaging features, the 2-Back vs. 0-Back task fMRI, was consistent with previous studies using the HCP (Sripada et al., 2020; Tetereva et al., 2022). Altogether, this demonstrates the robustness of our proxy measure of cognitive abilities based on multimodal neuroimaging. In addition to predictive performance, opportunistic stacking offers the added benefit of handling missing values (Engemann et al., 2020; Pat et al., 2022b), allowing us to retain data from 10,754 participants who completed the cognitive tasks at baseline and has at least one set of neuroimaging features. Consequently, with opportunistic stacking, we were more likely to retain MRI data from participants with higher fMRI noise, such as those with socioeconomic disadvantages (Cosgrove et al., 2022). More importantly, we demonstrated that the proxy measure based on multimodal neuroimaging explained the majority of the variance in the relationship between cognitive abilities and mental health, underscoring its significant role as a neurobiological unit of analysis for cognitive abilities (Morris and Cuthbert, 2012).

For PGS, we created a proxy measure based on three large-scale GWAS on cognitive abilities (Davies et al., 2018; Lee et al., 2018; Savage et al., 2018). Using PGS resulted in a numerically weaker predictive performance (out-of-sample r=0.25 at baseline) compared to multimodal neuroimaging. However, this predictive strength is still comparable to previous research. For instance, Allegrini et al., 2019 used a different cohort of children and found R²=0.053 when applying PGS based on Savage et al., 2018 to predict the cognitive abilities of 12-year-old children. Given that PGS based on Savage et al., 2018 also drove the prediction in the current study, as seen in its feature importance, this similar level of predictive performance between Allegrini et al., 2019 and our study suggests consistency in the predictive performance of PGS. Despite this level of performance, PGS was able to explain some variance (21% at baseline) in the relationship between cognitive abilities and mental health, indicating some capacity of PGS as a neurobiological unit of analysis for cognitive abilities.

There are multiple potential reasons why PGS performed much poorer than multimodal neuroimaging. First, unlike genes, the brain changes throughout development and lifespan (Bethlehem et al., 2022), and so do cognitive abilities (Hartshorne and Germine, 2015). This dynamic nature might make multimodal neuroimaging a better tool for tracing cognitive abilities. Second, there might be a mismatch in the age of participants between the original GWAS (Davies et al., 2018; Lee et al., 2018; Savage et al., 2018) and the current study. While the original GWAS conducted meta-analyses pooling data from participants aged 5–102, these studies might draw more heavily from older cohorts with large participant numbers, such as the UK Biobank (Sudlow et al., 2015). Allegrini et al., 2019 also demonstrated that PGS performs better in predicting cognitive abilities in older children (aged 16) compared to younger ones (aged 12). Therefore, a more child-specific PGS might be needed to explain more variance in children. Thirdly, the PGS used here included only common SNPs and not rare variants. Recent studies using whole-genome sequence data have found that rare variants contribute to the heritability of complex traits, such as height and body mass index (Wainschtein et al., 2022). Given that cognitive abilities are also complex traits, future studies might need to examine if including rare variants can improve the predictive performance of PGS.

Similarly, our predictive modelling created proxy measures of cognitive abilities for environmental factors based on socio-demographics, lifestyles, and developmental adverse events. In line with previous work (Kirlic et al., 2021; Pat et al., 2022b), we could predict unseen children’s cognitive abilities based on their socio-demographics, lifestyles, and developmental adverse events with a medium-to-high out-of-sample r=0.49 (at baseline). This prediction was driven more strongly by socio-demographics (e.g. parent’s income and education, neighbourhood safety, area deprivation, single parenting), somewhat weaker by lifestyles (e.g. extracurricular activities, sleep, screen time, video gaming, mature movie watching, and parental monitoring), and much weaker by developmental adverse events (e.g. pregnancy complications). Importantly, proxy measures based on socio-demographics, lifestyles, and developmental adverse events captured a large proportion of the relationship between cognitive abilities and mental health. Furthermore, this variance captured by socio-demographics, lifestyles, and developmental adverse events overlapped mainly with the neurobiological proxy measures. This reiterates RDoC’s central tenet that understanding the neurobiology of a functional domain, such as cognitive abilities, could help us understand the extent to which environments influence mental health (Cuthbert and Insel, 2013; Insel et al., 2010). More importantly, all the results regarding neuroimaging, PGS, and socio-demographics, lifestyles, and developmental adverse events were reliable across two years during a sensitive period for adolescents.

This study has several limitations that might affect its generalisability. Firstly, the range of mental health variables was not exhaustive. While we covered various emotional and behavioural problems (Achenbach et al., 2017) and temperaments, including behavioural inhibition/activation (Carver and White, 1994) and impulsivity (Zapolski et al., 2010), we may still miss other critical mental health variables, such as psychotic-like experiences, eating disorder symptoms, and mania. Similarly, our ABCD samples were young and community-based, likely limiting the severity of their psychopathological issues (Kessler et al., 2007). Future work needs to test if the results found here are generalisable to adults and participants with stronger severity. Next, for cognitive abilities, while the six cognitive tasks (Luciana et al., 2018; Thompson et al., 2019) covered most of the RDoC cognitive abilities/systems constructs, we still missed variability in some domains, such as perception (Morris and Cuthbert, 2012). Additionally, several children (3274) did not complete all six cognitive tasks at follow-up, which might create a discrepancy between baseline and follow-up samples. However, the differences in social demographics, lifestyles, and developmental adverse events between participants who provided cognitive scores in the follow-up were minimal (Cohen’s d ranging from 0.007 to 0.092, see Table 12). Moreover, given that we found a similar pattern of predictive performance across the two time points, we believe excluding the children who did not complete the cognitive tasks at follow-up should not alter our conclusions.

Table 12

The differences in social demographics, lifestyles, and developmental adverse events between participants who provided cognitive scores in the follow-up.

We used social demographics, lifestyles, and developmental adverse events collected at baseline.

Variable names	Having cognitive scores in the follow-up.	Not having cognitive scores in the follow-up.	Test statistics
Age in months	Mean (sd): 119.3 (7.5)	Mean (sd): 118.3 (7.6)	Yuen’s t(3783)=6.05, p < 0.001, Cohen’s d = 0.092
Sex	Male = 3918 (52.4%) Female = 3564 (47.6%) Intersex-Male=1 (0.0%) Intersex-female=0 (0.0%) Do not know = 0 (0.0%)	Male = 1776 (53.2%) Female = 1563 (46.8%) Intersex-Male=2 (0.1%) Intersex-female=0(0.0%) Do not know = 0 (0.0%)	(X² = 4, N = 10824)=6, p=0.199
Body Mass Index	Mean (sd): 18.7 (4.1)	Mean (sd): 18.9 (4.4)	Yuen’s t (3658)=1.605, p=0.109, Cohen’s d=0.023
Race	White = 4190 (56.0%) Black = 918 (12.3%) Hispanic = 1441 (19.3%) Asian = 157(2.1%) Other = 777 (10.4%)	White = 1611 (48.2%) Black = 612 (18.3%) Hispanic = 689 (20.6%) Asian = 68(2.0%) Other = 360 (10.8%)	X²(16, N=10823)=20, p = 0.22
Bilingual Use	Mean (sd): 1 (1.7)	Mean (sd): 1 (1.7)	Yuen’s t(3776)=0.696, p=0.486, Cohen’s d=0.011
Parent Marital Status	Married = 5239 (70.5%) Widowed = 59(0.8%) Divorced = 684 (9.2%) Separated = 264 (3.6%) NeverMarried = 806(10.8%) LivingWithPartner = 381 (5.1%)	Married = 2194 (66.0%) Widowed = 29(0.9%) Divorced = 290 (8.7%) Separated = 135 (4.1%) NeverMarried = 460(13.8%) LivingWithPartner = 214 (6.4%)	X²(25, N=10755)=30, p=0.224
Parents' Education	Mean (sd): 16.6 (2.6)	Mean (sd): 16.3 (2.8)	Yuen’s t(3262)=4.175, p<0.001, Cohen’s d=0.068
Parents' Income	Mean (sd): 7.4 (2.3)	Mean (sd): 7.2 (2.5)	Yuen’s t(2854)=2.243, p=0.025, Cohen’s d=0.034
Household Size	Mean (sd): 4.7 (1.5)	Mean (sd): 4.7 (1.6)	Yuen’s t(3718)=0.39, p=0.697, Cohen’s d=0.007
Economics Insecurities	Mean (sd): 0.4 (1.1)	Mean (sd): 0.5 (1.1)	Yuen’s t(1982)=2.65, p=0.008, Cohen’s d=0.033
Area Deprivation Index	Mean (sd): 94.6 (20.7)	Mean (sd): 94.9 (21.2)	Yuen’s t(3297)=1.686, p=0.092, Cohen’s d=0.029
Lead Risk	Mean (sd): 5 (3.1)	Mean (sd): 5.1 (3.1)	Yuen’s t(3374)=1.797, p=0.072, Cohen’s d=0.027
Uniform Crime Reports	Mean (sd): 12.1 (5.5)	Mean (sd): 12 (6.1)	Yuen’s t(3370)=0.873, p=0.383, Cohen’s d=0.014
Parent reported Neighbourhood Safety	Mean (sd): 11.8 (2.9)	Mean (sd): 11.6 (3)	Yuen’s t(3382)=1.799, p=0.072, Cohen’s d=0.025
Child reported Neighbourhood Safety	Mean (sd): 4.1 (1.1)	Mean (sd): 4 (1.1)	Yuen’s t(3786)=2.258, p=0.024, Cohen’s d=0.036
School Environment	Mean (sd): 20 (2.8)	Mean (sd): 19.8 (2.9)	Yuen’s t(3787)=1.763, p=0.078, Cohen’s d=0.029
School Involvement	Mean (sd): 13.1 (2.3)	Mean (sd): 12.9 (2.4)	Yuen’s t(3790)=3.203, p=0.001, Cohen’s d=0.05
School Disengagement	Mean (sd): 3.7 (1.4)	Mean (sd): 3.8 (1.5)	Yuen’s t(3800)=2.171, p=0.03, Cohen’s d=0.035
Lack of Sleep	Mean (sd): 1.7 (0.8)	Mean (sd): 1.7 (0.8)	Yuen’s t(3860)=3.084, p=0.002, Cohen’s d=0.05
Sleep Disturbance	Mean (sd): 1.9 (Abramovitch et al., 2021)	Mean (sd): 1.9 (Abramovitch et al., 2021)	Yuen’s t(3877)=1.567, p=0.117, Cohen’s d=0.025
Sleep Initiating Maintaining	Mean (sd): 11.7 (3.7)	Mean (sd): 11.9 (3.8)	Yuen’s t(3862)=2.481, p=0.013, Cohen’s d=0.038
Sleep Breathing Disorders	Mean (sd): 3.7 (1.2)	Mean (sd): 3.8 (1.3)	Yuen’s t(3834)=1.43, p=0.153, Cohen’s d=0.022
Sleep Arousal Disorders	Mean (sd): 3.4 (0.9)	Mean (sd): 3.4 (Abramovitch et al., 2021)	Yuen’s t(3885)=0.966, p=0.334, Cohen’s d=0.013
Sleep Wake Transition Disorders	Mean (sd): 8.2 (2.6)	Mean (sd): 8.1 (2.6)	Yuen’s t(3828)=1.198, p=0.231, Cohen’s d=0.022
Sleep Excessive Somnolence	Mean (sd): 6.9 (2.4)	Mean (sd): 7 (2.5)	Yuen’s t(3836)=0.131, p=0.896, Cohen’s d=0.007
Sleep Hyperhidrosis	Mean (sd): 2.4 (1.2)	Mean (sd): 2.5 (1.2)	Yuen’s t(4375)=1.755, p=0.079, Cohen’s d=0.029
Individual Physical Extracurricular Activities	Mean (sd): 5 (5.7)	Mean (sd): 4.7 (5.4)	Yuen’s t(4173)=2.933, p=0.003, Cohen’s d=0.044
Team Physical Extracurricular Activities	Mean (sd): 8.4 (7.7)	Mean (sd): 7.8 (7.4)	Yuen’s t(4007)=3.604, p<0.001, Cohen’s d=0.055
Non Physical Extracurricular Activities	Mean (sd): 5.1 (6.3)	Mean (sd): 4.8 (6.1)	Yuen’s t(4075)=2.961, p=0.003, Cohen’s d=0.047
Physically Active	Mean (sd): 3.5 (2.3)	Mean (sd): 3.4 (2.3)	Yuen’s t(3838)=2.094, p=0.036, Cohen’s d=0.033
Mature Video Games Play	Mean (sd): 0.5 (0.8)	Mean (sd): 0.6 (0.9)	Yuen’s t(3816)=1.396, p=0.163, Cohen’s d=0.022
Mature Movies Watch	Mean (sd): 0.4 (0.6)	Mean (sd): 0.4 (0.7)	Yuen’s t(3728)=4.038, p<0.001, Cohen’s d=0.065
Weekday Screen Use	Mean (sd): 3.3 (3)	Mean (sd): 3.6 (3.3)	Yuen’s t(3220)=4.161,p<0.001, Cohen’s d=0.069
Weekend Screen Use	Mean (sd): 4.5 (3.5)	Mean (sd): 4.8 (3.7)	Yuen’s t(3521)=3.218, p=0.001, Cohen’s d=0.053
Tobacco Before Pregnant	No = 6328 (86.7%) Yes = 974 (13.3%)	No = 2838 (86.7%) Yes = 436 (13.3%)	X²(1,=10576)=0, p=1
Tobacco After Pregnant	No = 6968 (95.2%) Yes = 351 (4.8%)	No = 3081 (94.2%) Yes = 190 (5.8%)	X²(1,=10590)=0, p=1
Alcohol Before Pregnant	No = 5174 (73.4%) Yes = 1871 (26.6%)	No = 2380 (75.4%) Yes = 775 (24.6%)	X²(1,=10200)=0, p=1
Alcohol After Pregnant	No = 7096 (97.1%) Yes = 210 (2.9%)	No = 3175 (97.4%) Yes = 85 (2.6%)	X²(1,=10566)=0, p=1
Marijuana Before Pregnant	No = 6874 (94.5%) Yes = 399 (5.5%)	No = 3044 (93.9%) Yes = 199 (6.1%)	X²(1,=10516)=0, p=1
Marijuana After Pregnant	No = 7182 (98.2%) Yes = 130 (1.8%)	No = 3191 (97.7%) Yes = 74 (2.3%)	X²(1,=10577)=0, p=1
Developmental Prematurity	No = 5945 (80.3%) Yes = 1458 (19.7%)	No = 2735 (83.0%) Yes = 561 (17.0%)	X²(1, N=10699)=0, p=1
Birth Complications	Mean (sd): 0.4 (0.8)	Mean (sd): 0.4 (0.7)	Yuen’s t(3591)=0.121, p=0.904, Cohen’s d=0.007
Pregnancy Complications	Mean (sd): 0.6 (Abramovitch et al., 2021)	Mean (sd): 0.6 (Abramovitch et al., 2021)	Yuen’s t(3543)=1.19, p=0.234, Cohen’s d=0.018
Parental Monitoring	Mean (sd): 4.4 (0.5)	Mean (sd): 4.4 (0.5)	Yuen’s t(3810)=0.451, p=0.652, Cohen’s d=0.009
Parent-reported Family Conflict	Mean (sd): 2.5 (1.9)	Mean (sd): 2.6 (2)	Yuen’s t(3805)=1.404, p=0.16, Cohen’s d=0.023
Child report Family Conflict	Mean (sd): 2 (1.9)	Mean (sd): 2.1 (2)	Yuen’s t(3809)=1.751, p=0.08, Cohen’s d=0.026
Parent reported Prosocial	Mean (sd): 1.8 (0.4)	Mean (sd): 1.8 (0.4)	Yuen’s t(3817)=0.288, p=0.774, Cohen’s d=0.007
Child reported Prosocial	Mean (sd): 1.7 (0.4)	Mean (sd): 1.7 (0.4)	Yuen’s t(3849)=2.529, p=0.011, Cohen’s d=0.041

Furthermore, while we used comprehensive multimodal MRI from 45 sets of features for neuroimaging, three fMRI tasks were not chosen based on their relevance to cognitive abilities (Casey et al., 2018). It is possible to obtain higher predictive performance based on other fMRI tasks. For all analyses involving PGS, we limited our participants to children of European ancestry due to the lack of summary statistics from well-powered GWAS for cognitive abilities in non-European participants. This prevented us from fully leveraging the diverse samples in the ABCD study (Garavan et al., 2018). Future GWAS work with more diverse samples is needed to ensure equity and fairness in developing neurobiological units of analysis for cognitive abilities. Lastly, we relied on 44 variables of socio-demographics, lifestyles, and developmental adverse events included in the study, which might have missed some variables relevant to cognitive abilities (e.g. nutrition). The ABCD study (Casey et al., 2018) is ongoing, and future data might address some of these limitations.

Overall, aligning with the RDoC perspective (Morris and Cuthbert, 2012), our findings support the use of neurobiological units of analysis for cognitive abilities, as assessed through multimodal neuroimaging and Polygenic Scores (PGS). These measures explain (a) the relationship between cognitive abilities and mental health and (b) the variance in this cognitive-ability-and-mental-health relationship attributable to environmental factors. Our results emphasise the importance of considering both neurobiology and environmental factors, such as socio-demographics, lifestyles, and adverse childhood events, to gain a comprehensive understanding of the aetiology of mental health (Insel et al., 2010; Morris et al., 2022).

Neuroimaging features	Data provided	Did not pass quality control	Had vision problems	From site 22	Had any missing feature	Flagged as outliers	Observations kept
ENback 0back	11771	3996	38	21	8	292	7416
ENback 2back	11771	3996	38	21	12	281	7423
ENback 2back vs 0back	11771	3996	38	21	10	397	7309
ENback emotion	11771	3996	38	21	10	303	7403
ENback Emotion vs Neutral Face	11771	3996	38	21	11	480	7225
ENback Face vs Place	11771	3996	38	21	10	391	7315
ENback Negative vs Neutral Face	11771	3996	38	21	11	454	7251
ENback Positive vs Neutral Face	11771	3996	38	21	10	500	7206
ENback place	11771	3996	38	21	11	331	7374
MID Reward vs Neutral anticipation	11771	2596	51	22	11	250	8841
MID Loss vs Neutral anticipation	11771	2596	51	22	11	245	8846
MID Positive vs Negative Reward Feedback	11771	2596	51	22	12	338	8752
MID Positive vs Negative Punishment Feedback	11771	2596	51	22	10	334	8758
MID Large Reward vs Neutral anticipation	11771	2596	51	22	12	241	8849
MID Small Reward vs Neutral anticipation	11771	2596	51	22	10	270	8822
MID Large Reward vs Small Reward anticipation	11771	2596	51	22	13	266	8823
MID Large Loss vs Neutral anticipation	11771	2596	51	22	11	250	8841
MID Small Loss vs Neutral anticipation	11771	2596	51	22	11	282	8809
MID Large Loss vs Small Loss anticipation	11771	2596	51	22	12	307	8783
SST Any Stop vs Correct Go	11771	3672	45	20	14	227	7793
SST Correct Go vs Fixation	11771	3672	45	20	13	262	7759
SST Correct Stop vs Correct Go	11771	3672	45	20	13	236	7785
SST Correct Stop vs Incorrect Stop	11771	3672	45	20	14	292	7728
SST Incorrect Go vs Correct Go	11771	3672	45	20	15	481	7538
SST Incorrect Go vs Incorrect Stop	11771	3672	45	20	14	366	7654
SST Incorrect Stop vs Correct Go	11771	3672	45	20	13	246	7775
rsfMRI temporal variance	11771	2397	62	25	14	682	8591
rsfMRI subcortical-network FC	11771	2397	62	25	14	1	9272
rsfMRI cortical FC	11771	2397	62	25	14	3	9270
T1 subcortical avg intensity	11771	501	66	27	0	60	11117
T1 white matter avg intensity	11771	501	66	27	12	13	11152
T1 gray matter avg intensity	11771	501	66	27	12	11	11154
T1 normalised intensity	11771	501	66	27	12	2	11163
T1 summations	11771	501	66	27	12	34	11131
cortical thickness	11771	501	66	27	12	2	11163
cortical area	11771	501	66	27	12	1	11164
cortical volume	11771	501	66	27	12	0	11165
subcortical volume	11771	501	66	27	0	215	10962
sulcal depth	11771	501	66	27	12	1106	10059
T2 subcortical avg intensity	11771	1217	58	25	0	67	10404

Neuroimaging features	Data provided	Did not pass quality control	Had vision problems	Had any missing feature	Flagged as outliers	Observations kept
ENback 0back	8123	1804	35	11	216	6057
ENback 2back	8123	1804	35	13	186	6085
ENback 2back vs 0back	8123	1804	35	14	294	5976
ENback emotion	8123	1804	35	13	202	6069
ENback Emotion vs Neutral Face	8123	1804	35	13	347	5924
ENback Face vs Place	8123	1804	35	13	295	5976
ENback Negative vs Neutral Face	8123	1804	35	11	355	5918
ENback Positive vs Neutral Face	8123	1804	35	13	342	5929
ENback place	8123	1804	35	12	234	6038
MID Reward vs Neutral anticipation	8123	1379	40	8	153	6543
MID Loss vs Neutral anticipation	8123	1379	40	8	154	6542
MID Positive vs Negative Reward Feedback	8123	1379	40	9	192	6503
MID Positive vs Negative Punishment Feedback	8123	1379	40	9	197	6498
MID Large Reward vs Neutral anticipation	8123	1379	40	8	142	6554
MID Small Reward vs Neutral anticipation	8123	1379	40	8	163	6533
MID Large Reward vs Small Reward anticipation	8123	1379	40	8	155	6541
MID Large Loss vs Neutral anticipation	8123	1379	40	8	150	6546
MID Smal Loss vs Neutral anticipation	8123	1379	40	9	179	6516
MID Large Loss vs Small Loss anticipation	8123	1379	40	9	173	6522
SST Any Stop vs Correct Go	8123	2036	33	7	123	5924
SST Correct Go vs Fixation	8123	2036	33	7	173	5874
SST Correct Stop vs Correct Go	8123	2036	33	7	163	5884
SST Correct Stop vs Incorrect Stop	8123	2036	33	7	187	5860
SST Incorrect Go vs Correct Go	8123	2036	33	7	345	5702
SST Incorrect Go vs Incorrect Stop	8123	2036	33	7	267	5780
SST Incorrect Stop vs Correct Go	8123	2036	33	7	131	5916
rsfMRI temporal variance	8123	1152	49	14	512	6396
rsfMRI subcortical-network FC	8123	1152	49	14	3	6905
rsfMRI cortical FC	8123	1152	49	14	3	6905
T1 subcortical avg intensity	8123	227	51	0	32	7813
T1 white matter avg intensity	8123	227	51	10	8	7827
T1 gray matter avg intensity	8123	227	51	10	9	7826
T1 normalised intensity	8123	227	51	10	0	7835
T1 summations	8123	227	51	10	19	7816
cortical thickness	8123	227	51	10	2	7833
cortical area	8123	227	51	10	0	7835
cortical volume	8123	227	51	10	0	7835
subcortical volume	8123	227	51	0	112	7733
sulcal depth	8123	227	51	10	890	6945
T2 subcortical avg intensity	8123	600	50	0	39	7434

Functionality	Baseline	Follow-up
Alimentary tract and metabolism	144	145
Blood and blood forming organs	12	22
Cardiovascular system	124	142
Dermatologicals	108	64
Genitourinary system and sex hormones	72	76
Systemic hormonal preparations, excl. sex hormones and insulins	26	24
Anti-infectives for systemic use	56	35
Antineoplastic and immunomodulating agents	5	5
Musculo-skeletal system	145	183
Nervous system	710	729
Antiparasitic products, insecticides and repellents	5	4
Respiratory system	721	538
Sensory organs	42	42
Various	1	3

Share this article

Cite this article

Predictive models, predicting cognitive abilities from mental-health features via Partial Least Square (PLS).

Performance metrics for predictive models, predicting cognitive abilities from mental health, neuroimaging, polygenic scores, and socio-demographics, lifestyles, and developments.

Predictive models predicting cognitive abilities from neuroimaging via opportunistic stacking and polygenic scores via Elastic Net.

Feature importance of each set of neuroimaging features, predicting cognitive abilities in the baseline data.

Performance metrics for predictive models, predicting cognitive abilities from the 45 sets of neuroimaging features in the baseline data.

Performance metrics for predictive models, predicting cognitive abilities from the 45 sets of neuroimaging features in the follow-up data.

Predictive models, predicting cognitive abilities from socio-demographics, lifestyles, and developmental adverse events via Partial Least Square (PLS).

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health and/or neuroimaging as regressors to explain cognitive abilities across test sites in the baseline.

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health and/or neuroimaging as regressors to explain cognitive abilities across test sites in the follow-up.

Venn diagrams showing common and unique effects of proxy measures of cognitive abilities based on mental health, neuroimaging, polygenic scores, and/or socio-demographics, lifestyles and developmental adverse events in explaining cognitive abilities across test sites.

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health and/or polygenic scores as regressors to explain cognitive abilities across test sites in the baseline.

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health and/or polygenic scores as regressors to explain cognitive abilities across test sites in the follow-up.

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health and/or socio-demographics, lifestyles, and developmental adverse events as regressors to explain cognitive abilities across test sites in the baseline.

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health and/or socio-demographics, lifestyles and developmental adverse events as regressors to explain cognitive abilities across test sites in the follow-up.

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health, neuroimaging, polygenic scores and/or socio-demographics, lifestyles and developmental adverse events as regressors to explain cognitive abilities across test sites in the baseline.

Results of linear-mixed models using proxy measures of cognitive abilities based on mental health, neuroimaging, polygenic scores and/or socio-demographics, lifestyles, and developmental adverse events as regressors to explain cognitive abilities across test sites in the follow-up.

The differences in social demographics, lifestyles, and developmental adverse events between participants who provided cognitive scores in the follow-up.

Flow diagram of participants’ inclusion and exclusion criteria.

Exclusion criteria for neuroimaging features in the baseline.

Exclusion criteria for neuroimaging features in the follow-up.

Standardised weights of the second-order ‘g-factor’ model.

Predictive performance of leave one site out cross-validation vs 10-fold cross validation.

Illustration of data missingness (black) versus presence (grey) across different sets of neuroimaging features.

Medication reports in the baseline and follow-up.

Author details

Yue Wang

Contribution

Competing interests

Richard Anney

Contribution

Competing interests

Narun Pat

Contribution

For correspondence

Competing interests

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism