(a) Central hypothesis. (b) Number of genes in elastic net regression models that explain at least 1% of variance. Colors represent different categories of variables. (c) Pearson’s correlations …
GLMnet model weights for the transcriptional signatures.
The transcriptional signatures are in columns, while the genes are in rows.
Color indicates the strength and direction of correlation; white indicates p-value>0.05. Hierarchical clustering is represented on the top and side of the heatmap. Variables are grouped by colors …
Color represents significance of the correlation (gray: nonsignificant correlations in observed variables or metagenes; blue: significant correlation between observed variables only; green: …
(a) Gene Ontology, (b) Kyoto Encyclopedia of Genes and Genomes, and (c) Reactome Pathway Database. Variable symbols: cddstf: self-disclosure; cdmcfl: child-reported conflict with mother; cdres: …
k0: probability of sharing zero alleles IBD; k1: probability of sharing one allele IBD.
Each dot is one sample.
(a) Proportion of global variance in gene expression explained by each covariate tested within a single linear model. (b) Proportion of variance explained by each covariate for each analyzed gene.
Heatmap of Pearson’s correlations between transcriptional signatures explaining at least 1% of variance. Heatmap color indicates strength and direction of correlation; white indicates p-value>0.05. …
(a) Interaction expression quantitative trait locus (eQTL) results. GxE genes: number of significant GxE interactions with transcriptional signatures at 10% FDR; OR: odds ratio of enrichment of GxE …
Results of cis-expression quantitative trait locus (cis-eQTL) mapping permutation pass with FastQTL correcting for top 18 gene expression principal components [PCs] (space-delimited file).
Columns are as follows. 1: Ensembl ID of the tested gene; 2: number of variants tested in cis for this gene; 3: maximum likelihood estimation (MLE) of the shape1 parameter of the beta distribution; 4: MLE of the shape2 parameter of the beta distribution; 5: dummy; 6: ID of the best variant found for this molecular phenotypes (i.e., with the smallest p-value); 7: distance between the molecular phenotype-variant pair; 8: the nominal p-value of association that quantifies how significant from 0, the regression coefficient is; 9: the slope associated with the nominal p-value of association; 10: a first permutation p-value directly obtained from the permutations with the direct method (corrected nominal p-value that accounts for the fact that multiple variants are tested per molecular phenotype); 11: a second permutation p-value obtained via beta approximation (used to calculate FDR).
Results of cis-expression quantitative trait locus (cis-eQTL) mapping permutation pass with FastQTL without correction for gene expression principal components [PCs] (space-delimited file).
Columns are as follows. 1: Ensembl ID of the tested gene; 2: number of variants tested in cis for this gene; 3: maximum likelihood estimation (MLE) of the shape1 parameter of the beta distribution; 4: MLE of the shape2 parameter of the beta distribution; 5: dummy; 6: ID of the best variant found for this molecular phenotypes (i.e., with the smallest p-value); 7: distance between the molecular phenotype-variant pair; 8: the nominal p-value of association that quantifies how significant from 0, the regression coefficient is; 9: the slope associated with the nominal p-value of association; 10: a first permutation p-value directly obtained from the permutations with the direct method (corrected nominal p-value that accounts for the fact that multiple variants are tested per molecular phenotype); 11: a second permutation p-value obtained via beta approximation (used to calculate FDR).
Overlap of eGenes with psychosocial effects and significant probabilistic transcriptome-wide association studies (PTWAS) association results (5% FDR) for asthma and allergic disease.
Columns are as follows. 1: GWAS trait tested in PTWAS; 2: Ensembl ID of the tested gene; 3: most significant tissue in PTWAS; 4: multitissue PTWAS p-value; 5–24: as in Figure 3—source data 4.
Results of cis-interaction-expression quantitative trait locus (eQTL) mapping (tab-delimited file).
Columns are as follows. 1: Ensembl ID of the tested gene; 2: ID of the tested variant; 3: p-value of the GxE model intercept; 4: p-value of the genotype dosage effect; 5: p-value of the transcriptional signature effect; 6: p-value of the interaction between genotype dosage and transcriptional signature effect; 7: intercept; 8: dosage effect; 9: transcriptional signature effect; 10: interaction between genotype dosage and transcriptional signature effect; 11: intercept standard error (SE); 12: genotype dosage effect SE; 13: transcriptional signature effect SE; 14: interaction between genotype dosage and transcriptional signature effect SE; 15: q-value of the GxE model intercept; 16: q-value of the genotype dosage effect; 17: q-value of the transcriptional signature effect; 18: q-value of the interaction between genotype dosage and transcriptional signature effect; 19: tested transcriptional signature; 20: permutation-corrected p-value of the interaction between genotype dosage and transcriptional signature effect; 21: q-value of the permutation-corrected p-value of the interaction between genotype dosage and transcriptional signature effect.
Overlap of GxE interactions and significant probabilistic transcriptome-wide association studies (PTWAS) association results (5% FDR).
Columns as in Figure 3—source data 3.
Results of cis-interaction-expression quantitative trait locus (eQTL) mapping after correcting for cell composition effects (tab-delimited file).
Columns as in Figure 3—source data 4.
Results of cis-interaction-expression quantitative trait locus (eQTL) mapping with measured variables (tab-delimited file).
Columns as in Figure 3—source data 4.
Color indicates whether the equivalent interaction test with observed variable was nominally significant (p-corrected <0.01, coral) or not significant (p-corrected >0.01, black).
Scatterplots depict the following. (a) Self-disclosure interacts with expression quantitative trait locus (eQTL) rs6458333:G:C to alter expression of SRF. (b) Self-disclosure interacts with eQTL …
Boxplots depict the following. (a) Self-disclosure interacts with expression quantitative trait locus (eQTL) rs6458333:G:C to alter expression of SRF. (b) Self-disclosure interacts with eQTL …
Psychosocial and biological variables are in the left column, eGenes in the central column, and complex traits in the right column. A connecting line represents either a causal link between eGene …
Supplementary tables.
(a) Basic demographic information on 119 individuals used to train the transcriptional signature models (‘training’) and the entire cohort of 251 participants (‘total’). Reported are the count (N) and percentage of non-missing values per each category. For each variable in parentheses, we reported the p-values for significant differences in variable distribution between the training group and the group not included in the model. (b) List of variables collected for the current study. DD: daily diary; SD: sleep diary; EAR: coded from Electronically Activated Recorder; YR: youth reported; PR: parent reported; CD: census data; GC: glucocorticoid; SD: standard deviation; α: Chronbach's alpha measuring reliability as the average correlation between scale items, as a function of the number of items included in the scale, N initial – sample size used for differential gene expression analysis and building transcriptional signatures (mean, SD, and α are reported for this subset), N longitudinal – number of samples with measurements from two time points available, N expanded – final sample size used for transcriptional signature and GxE validations. (c) Evaluation of transcriptional signatures derived using elastic net regression. For each variable, we report Pearson’s correlation coefficient, p-value, cross-validated percent variance explained, and sample size for the training dataset (columns 2–5), and Pearson’s correlation coefficient, p-value, and sample size for the entire cohort (columns 6–8). (d) Longitudinal replication of transcriptional signatures. For each variable, we report Spearman’s correlation coefficient between longitudinal change in observed variable and change in the transcriptional signature, p-value, coefficient of variation of longitudinal change in observed variable, and sample size. (e) Differentially expressed genes (DEGs) for all psychosocial variables (10% FDR; N: sample size). (f) Full DESeq results for differential gene expression analyses for psychosocial variables. (g) Correlations between z-scores of GxE interaction effects measured using transcriptional signatures and observed data. For each variable, we report Spearman’s correlation coefficient and its p-value, and odds ratio and p-value from Fisher’s test for enrichment of nominally significant results (permutation-corrected p-value<0.01) between GxE expression quantitative trait locus (eQTL) testing using transcriptional signatures and observed data. (h) Correlations between the top three principal components of covariate matrix and individual covariates (Pearson's product-moment correlations with numeric variables, polyserial correlations with bivariate variables; ns: correlation p-value>0.05). (i) DEGs associated with blood cell composition (10% FDR; N: sample size). (j) Full DESeq results for differential gene expression analyses for blood composition variables.
Detailed descriptions of methods for psychosocial data collection.
STREGA reporting recommendations, extended from STROBE Statement.