Figures and data

Workflow indicates the data acquisition and relations between the methods.

Enrichment heatmap of hallmark gene sets in transcriptional components associated with patient overall survival.
Gene Set Enrichment Analysis for 14 TCs associated with OS identified through univariate or multivariate survival analyses are presented. Only Hallmark gene sets with significant enrichment (Bonferroni-corrected p-value) for at least one TC are shown. The heatmap displays Z-scores, which indicate the relative enrichment strength, with values truncated at a maximum of 4 for visualization purposes. The gene sets were clustered based on Pearson correlation using the Ward D2 method, providing insights into related biological processes captured by different TCs. In the right column, chromosomal locations of copy number alterations (CNAs) are shown, reflecting the downstream effects on gene expression that each TC captures. This integration of CNA information highlights the biological relevance of each TC and its contribution to gene expression variability and patient outcomes.

Survival tree analysis of patients with platinum-treated HGSOC defines survival cohorts with distinct clinicopathologic and biological characteristics.
The results of survival tree analysis of 294 patients with high-grade serous ovarian cancer (HGSOC) treated with platinum-based chemotherapy are presented. The analysis utilized 14 transcriptional components (TCs) associated with overall survival (OS), along with other clinicopathologic factors, including age, tumor stage, grade, and debulking status. The resulting tree identified nine distinct survival cohorts, each represented as a bar in the Sankey diagram, where the bar height corresponds to the number of patients in each cohort. Kaplan-Meier survival curves with accompanying number-at-risk tables are shown for each cohort, with survival data censored at 10 years. The names of the survival cohorts were based on enriched biological processes in the TCs, as determined by the chromosomal location of genes captured by a TC, GSEA, and co-functionality analysis of the top genes. The p-values in the Kaplan-Meier plots were derived from log-rank tests comparing survival distributions between groups. Abbreviations: TC = transcriptional component, ECM = extracellular matrix.

Co-functionality network of top 100 absolute weighted genes in TC121.
Co-functionality network for the top 100 genes with the highest absolute weights in TC121 is presented. Genes were clustered based on predicted co-functionality (r > 0.7) across datasets, with clusters identified using both Gene Ontology (GO) Biological Processes and Cellular Components databases. One primary cluster, containing more than five genes, exhibited strong enrichment for synaptic signaling in the GO Biological Processes database and for synaptic membranes in the GO Cellular Components database. This highlights the biological specificity of TC121 in regulating gene expression linked to synaptic functions.

The activity of TC121 in bulk transcriptomes of CCLE, GDSC cell lines, and GEO and TCGA patient-derived samples. A. Cross-study TC projection of TC121 on CCLE and GDSC cell lines. The boxplots display the activity scores of TC121 in different tissue types, which are ordered based on their corresponding median activity scores. B. Cross-study TC projection of TC121 on GEO and TCGA bulk transcriptomes resulted in the activity scores presented in the boxplots. Cancer types were ordered based on corresponding medians of TC121 activity scores. Abbreviations: TC = transcriptional component.

Spatial transcriptomic profiles in ovarian cancer samples. A. We employed a permutation-based approach to pinpoint the areas of significant TC activity in spatial transcriptomic profiles. We ran 5,000 permutations for each TC-profile combination, yielding a p-value that indicates the extent to which the TC activity in the corresponding profile differs from what would be expected by chance (the null distribution). We then transformed these p-values into logarithmic values and represented them using a heatmap. Heatmaps of activity scores of the TCs are presented in individual rows for the HGSOC, serous papillary, and endometrioid adenocarcinoma of ovary samples. The first column represents the stained images of the samples. The second to seventh columns show heatmaps corresponding to the mentioned TCs. B. The heatmap illustrates the colocalization between two TC activities on spatial transcriptomic profiles from ovarian cancer samples. For each cell, the colocalization scores of the TCs at each of the three spatial transcriptomics samples OC 1, OC 2, and OC 3 are arranged in columns. A colocalization score of 4 between two TCs (red) indicates that the positively (+) and negatively (−) active regions of both TCs are perfectly colocalized. Conversely, a colocalization score of -4 between two TCs (blue) also indicates colocalization. Still, with inverse activity, i.e., the positively active regions of one TC are colocalized with the negatively active regions of the other TC or vice versa. A colocalization score close to 0 between two TCs (white) indicates that the activities of two TCs are spatially separated. The dashed and solid circle in the panel on the right side of the color bar represents two different TCs. Abbreviations: TC = transcriptional component.