Pathway activation model for personalized prediction of drug synergy

Quang Thinh Trac; Yue Huang; Tom Erkers; Päivi Östling; Anna Bohlin; Albin Österroos; Mattias Vesterlund; Rozbeh Jafari; Ioannis Siavelis; Helena Bäckvall; Santeri Kiviluoto; Lukas M Orre; Mattias Rantalainen; Janne Lehtiö; Sören Lehmann; Olli Kallioniemi; Yudi Pawitan; Trung Nghia Vu

doi:10.7554/eLife.100071.1

eLife assessment

This study presents a valuable report on a machine-learning tool for predicting synergistic drug combinations for cancer treatment. However, the evidence supporting the claims of the authors is incomplete, as the reported model shows some evidence of overfitting, and the claims of the authors could be strengthened if additional validation experiments were performed. The work will be of interest to oncologists and medical biologists working on cancer.

https://doi.org/10.7554/eLife.100071.1.sa3

Significance of findings

valuable: Findings that have theoretical or practical implications for a subfield

landmark
fundamental
important
valuable
useful

Strength of evidence

incomplete: Main claims are only partially supported

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

Targeted monotherapies for cancer often fail due to inherent or acquired drug resistance. By aiming at multiple targets simultaneously, drug combinations can produce synergistic interactions that increase drug effectiveness and reduce resistance. Computational models based on the integration of omics data have been used to identify synergistic combinations, but predicting drug synergy remains a challenge. Here, we introduce DIPx, an algorithm for personalized prediction of drug synergy based on biologically motivated tumor- and drug-specific pathway activation scores (PASs). We trained and validated DIPx in the AstraZeneca-Sanger (AZS) DREAM Challenge dataset using two separate test sets: Test Set 1 comprised the combinations already present in the training set, while Test Set 2 contained combinations absent from the training set, thus indicating the model’s ability to handle novel combinations. The Spearman correlation coefficients between predicted and observed drug synergy were 0.50 (95% CI: 0.47–0.53) in Test Set 1 and 0.26 (95% CI: 0.22–0.30) in Test Set 2, compared to 0.38 (95% CI: 0.34–0.42) and 0.18 (95% CI: 0.16–0.20), respectively, for the best performing method in the Challenge. We show evidence that higher synergy is associated with higher functional interaction between the drug targets, and this functional interaction information is captured by PAS. We illustrate the use of PAS to provide a potential biological explanation in terms of activated pathways that mediate the synergistic effects of combined drugs. In summary, DIPx can be a useful tool for personalized prediction of drug synergy and exploration of activated pathways related to the effects of combined drugs.

Introduction

Targeted therapies such as specific inhibitors are the most promising class of cancer drugs, but often fail or achieve only temporary remission due to inherent or acquired resistance. Theoretically, by aiming at multiple targets simultaneously, drug combinations can produce a synergistic interaction that increases drug effectiveness and reduces resistance and the chances of relapse (Medicine, 2017; Pemovska et al., 2018; Plana et al., 2022). This is illustrated in the combination of a BRAF inhibitor dabrafenib with a MEK inhibitor trametinib, which suppresses paradoxical reactivation and resistance observed in patients with BRAF-mutated melanoma treated with dabrafenib alone (Zhong et al., 2022; Banzi et al., 2016). This recently approved combination has been shown to improve progression-free and overall survival rates (Subbiah et al., 2023).

The discovery of effective drug combinations has traditionally relied on expert knowledge and understanding of known biological mechanisms (Li et al., 2015). However, this expert-based approach has limited scope to come up with novel combinations. Furthermore, ideally, novel combinations are clinically tested, but it is practically impossible to test all reasonable combinations in a clinical setting. Computational models of drug synergy have shown some potential for personalized prediction of synergistic combinations (Güvenç Paltun et al., 2021; Wu et al., 2022; Kong et al., 2022). These models are typically based on the integration of patient-specific molecular data, such as mutation profiles, gene expression, and drug response information (Güvenç Paltun et al., 2021). For example, TAJI, the best performing method in the AstraZeneca-Sanger DREAM Challenge, uses these diverse data types to predict drug synergy (Li et al., 2018). The drug combinations predicted to be effective will expand the therapeutic options while maintaining the same level of adverse effects profile. However, despite the advantages offered by modern machine learning methodologies and the availability of large-scale datasets, prediction of synergistic combinations and validating computational models remains challenging. For example, drug screening protocols often vary across studies, and there is a limited overlap in tested drugs and cell lines, complicating the external validation of these models. Additionally, the reliance on ‘black-box’ machine learning approaches hinders the exploration of underlying molecular mechanisms driving synergistic combinations.

To address this limitation, several studies have introduced statistical and computational approaches to infer the mechanisms of action of synergistic combinations within cancer signaling pathways. For example, Liu et al. proposed TranSynergy, a drug synergy prediction model that uses the interaction between drug target genes in a protein-protein interaction (PPI) network (Liu and Xie, 2021). However, TranSynergy only relies on target gene information, neglecting information on upstream and downstream activities of the targets and their differential contributions to synergy. More recently, Tang et al. developed SynPathy, a deep learning model for drug synergy prediction that incorporates drug-pathway associations (Tang and Gottlieb, 2022). SynPathy calculates pathway enrichment scores as a measure of the distance between target genes of each drug in a combination and pathway genes in the PPI network. These pathway enrichment scores, along with chemical structure information, are then combined to fit the model and infer pathway importance scores for each combination. More recently, Wu et al. introduced ForSyn (Wu et al., 2023), a deep forest-based method. Although ForSyn implemented a gene enrichment analysis to identify cancer-related pathways, it does not directly identify them through prediction.

Here we present a Drug synergy Interaction Prediction (DIPx) based on tumor- and drug-specific pathway activation scores (PASs). PASs are biologically motivated features that provide potentially relevant information on the underlying mechanisms of synergistic combinations. We trained and validated DIPx using the AstraZeneca-Sanger (AZS) DREAM Challenge dataset (Menden et al., 2019), and compared its performance with the best performing method in the Challenge. Furthermore, we assessed the generalizability of the model by validating it on the ONeil dataset (O’Neil et al., 2016), and provided illustrations of pathways that could mediate the synergistic combinations found by DIPx. DIPx is publicly available at https://www.github.com/tracquangthinh/DIPx.

Results

A pathway based drug synergy prediction model

Figure 1 provides an overview of DIPx, which uses gene expression, mutation profiles, and drug synergy data from the AZS dataset to train and validate its prediction model. The test set comprised two subsets: (i) Test Set 1 includes combinations from the training set, and (ii) Test Set 2 includes combinations absent from the training set. Together, both sets assess the generalizability of the prediction for new patients and new combinations. The analysis involved a total of 75 cell lines tested in 910 combinations in the AZS dataset. DIPx was also validated using an external dataset, as shown in Figure 1a.

Overview of DIPx. a) The AZS omics data were used to train and validate the model. The test set was split into two subsets: Test Set 1 contained combinations found in the training set, while Set 2 comprised combinations not found in the training set. The model was also externally validated using the ONeil dataset. b) A cartoon illustration of the ERBB pathway in a breast cancer cell line treated with the combination of Capivasertib + Sapitinib. Capivasertib targets the AKT gene, whereas Sapitinib targets the ERBB genes. Pathway genes were classified into upstream and downstream genes relative to the position of the target genes in the network. c) The drug synergy prediction model was trained using pathway activation scores (PAS) of the upstream, downstream and driver genes. d) The predicted and observed Loewe scores of a cell line achieved the median Spearman correlation in Test Set 1 of the AZS dataset. The color of each bar shows the confidence score information with the threshold of 0.75. e) The main pathways that contribute to the prediction of the synergy of the Capivasertib + Sapitinib combination. f) The proportion of validated high synergistic predictions (Loewe score ≥ 20) increases with higher confidence scores. The x-axis presents four groups defined by quartiles of confidence scores.

Figure 1b illustrates the ERBB signaling pathway in relation to the Capivasertib + Sapitinib combination, where the genes belonging to the pathway are classified into upstream and downstream genes relative to the position of the target genes: AKT targeted by Capivasertib and ERBB targeted by Sapitinib. Putative driver mutations were identified in each sample based on a well-characterized list of frequently mutated genes in cancer; see Section 4.3. We first calculate the PAS of the upstream and downstream part of the pathway relative to the driver genes; see the Methods section for details. The PAS values are then combined to train a random forest regression model. Given a new drug combination experiment, DIPx predicts the Loewe score for drug synergy, as shown in Figure 1c.

Figure 1d presents the predicted and observed synergies for the SW900 lung cancer cell line, which has a median correlation of 0.50 among the cell lines in Test Set 1; each bar in the figure represents a drug combination. The best predicted combinations include BCL2L1 + AZD5582, AZD5582 + etoposide and doxorubicin + AZD5582, with predicted Loewe scores of 42.34, 26.60, and 25.72, respectively, and high confidence scores of 1.0, 0.90, and 0.82, respectively. A combination with Loewe score greater than 20 is considered highly synergistic (Menden et al., 2019). Although the combination of doxorubicin + AZ12623380 is predicted to have high synergy, it is a low confidence prediction with a confidence score of 0.33. Indeed, the observed Loewe synergy score for this combination is near zero.

The use of PAS allows DIPx to infer the potential biological mechanisms of synergistic drug combinations. Figure 1e shows pathways with the highest contribution to prediction of drug synergy of the Capivasertib + Sapitinib combination: these include the ERBB-related pathways (ERBB2 signaling pathway, ERBB pathway), and tumor-related pathway (lymph-node metastates, focal adhesion).

Figure 1f demonstrates the association between the confidence scores and the validation of predictions. The x-axis represents the confidence scores grouped into quartiles, while the y-axis displays the proportion of validated high synergy (Loewe score ≥ 20). Predictions with higher confidence scores are expected to exhibit a greater level of validation. Indeed, in this figure, the proportion of high synergistic predictions that are validated in the combination of Test Set 1 and 2 of the AZS dataset increases as the confidence score rises.

Validation and comparisons in the AZS dataset

We evaluated the performance of DIPx in the AZS test sets and compared it with TAJI, which was the best performing method in the AZS DREAM Challenge (Li et al., 2018). TAJI was trained using both monotherapy drug-response and molecular data. Since DIPx uses only molecular data, to make a fair comparison, we trained TAJI using only molecular features and referred to it as TAJI-M. The extra information from the use of monotherapy data in TAJI is rather small, approximately 10% increase in the overall Spearman correlation, and, of course, we could also use such data in DIPx, so it is more convenient and informative to focus the comparisons on prediction based on molecular data alone. For instance, this allows us to compare DIPx with TAJI-M on the prediction of combinations that contain un-trained drug(s), which is not possible with TAJI.

Figure 2a shows the correlation between the predicted and observed Loewe scores of 963 experiments in Test Set 1 (r = 0.5, 95% CI: 0.47–0.53), where each experiment represents a combination drug A + drug B tried on cell line C, yielding one data point. In comparison, TAJI-M gives r = 0.38 (95% CI: 0.34–0.42). We also bootstrapped the training set (n = 100 times) and for each bootstrap replicate calculated the Spearman correlation between the predicted and observed scores of all experiments. As illustrated in Figure 2b, DIPx achieved stable Spearman correlations across all bootstrap replicates, which are significantly higher than that of TAJI-M. The bootstrap distribution actually indicates that the Spearman correlation from DIPx is negatively biased, while from TAJI it is slightly positively biased. This means that the gap between the bias-corrected estimates of the Spearman correlations from DIPx and TAJI-M would be even larger; see the Method section for a theoretical explanation.

Performance of DIPx in the AZS dataset. This includes both Test Set 1 (panels a, b, c, f) and Test Set 2 (panels d, e, g). a) Comparison of predicted vs observed synergy scores for all experiments in Test Set 1. b) Comparison of DIPx vs TAJI-M in terms of the correlation between predicted and observed synergy scores from all experiments in Test Set 1. Each boxplot shows the results based on 100 bootstrap replicates of the training set. c) Comparison of DIPx and TAJI-M performance across cell lines in Test Set 1. Each point represents the correlation between the predicted and observed synergy for a given cell line. d) Comparison of DIPx vs TAJI-M in Test Set 2. Each boxplot displays the correlations between the predicted and observed values obtained from 100 bootstrap replicates of the training set. e) Comparison of performance between DIPx and TAJI-M in Test Set 2 in relation to the number of drugs in common (x-axis) between the combinations in the test set and the training set. f) and g) DIPx vs TAJI-M in three groups classified by monotherapy sensivitity of two drugs in a combination in Test Set 1 (f) and Test Set 2 (g).

Furthermore, we compared the performance of DIPx and TAJI-M across all cell lines in Test Set 1 using a Spearman correlation between the predicted and observed synergy scores, as shown in Figure 2c. A majority of the cell lines (63%) are below the diagonal line, indicating that DIPx outperforms TAJI-M in predicting synergy scores for these cell lines.

We also compared the performance of DIPx and TAJI-M in Test Set 2. As expected, the prediction performance of both methods was worse in Test Set 2 than in Test Set 1 since Test Set 2 consists of new combinations absent from the training set. The Spearman correlation of the observed vs predicted synergy using DIPx is 0.26 (95% CI: 0.22–0.30), which is greater than 0.18 (95% CI: 0.16– 0.20) using TAJI-M. Figure 2d show that this result is stable across 100 bootstrap replications of the training set. A similar downward bias for DIPx is observed in the bootstrap distribution.

To investigate the effect of unseen combinations on prediction performance, we divided each combination (drug A + drug B) in Test Set 2 into one of three groups based on the number of individual drugs present in the training set: (i) neither drug A nor drug B in the training set (“no drug”), (ii) either drug A or drug B in the training set (‘one drug’), (iii) and both drugs A and B in the training set (‘two drugs’), as shown in Figure 2e. Overall, both DIPx and TAJI-M showed improved performance as the number of drugs present in the training set increased. For experiments in which both drugs were not in the training set (n = 262), TAJI-M achieved a median correlation of 0.11, while DIPx performed worse with a median correlation of –0.03. For experiments with at least one drug in the training set (n = 2, 499), both methods showed improved performance with median correlations of 0.16 and 0.12 for DIPx and TAJI-M, respectively. When both drugs in an experiment were present in the training set (n = 4, 370), DIPx achieved a median correlation of 0.30, which was better than TAJI-M’s performance (r = 0.22).

Monotherapy drug response profiles have been shown to correlate with synergistic effects and contribute to improving prediction performance, e.g., in TAJI (Li et al., 2018). Here, we compared the performance of DIPx and TAJI-M in relation to monotherapy sensitivity as measured by the IC50 value. We categorized each experiment in the AZS test sets into three groups according to the monotherapy response. Briefly, we first calculated the median sensitivity to monotherapy for each drug A (T_A) across all experiments. Measuring the response of a cell line to drug A in an experiment by S_A, the drug is considered sensitive if S_A ≥ T_A. We then compared the synergy of a combination of drugs A and B in relation to the monotherapy sensitivity to both drugs, only one drug, or neither drug.

In Test Set 1, DIPx outperformed TAJI-M in all three groups of monotherapy sensitivity, with the highest performance in the group sensitive to both drugs (median r = 0.48, P-value < 1.79 × 10⁻²⁷), see Figure 2f. In Test Set 2, TAJI-M performed slightly better in the group with no sensitive drug (median r = 0.21 vs r = 0.20 by DIPx, P-value < 1.26 × 10⁻⁵). Interestingly, we found that, while the performance of DIPx improved as the number of monotherapy-sensitive drugs in a combination increased, the performance of TAJI-M decreased, see Figure 2g.

External validation of DIPx in the ONeil dataset

We used a similar computational approach to evaluate the prediction performance of DIPx in relation to the sensitivity of the constituent monotherapies and the impact of unseen combinations in the ONeil dataset. As shown in Figure 3a, the performance of DIPx improved with an increasing number of monotherapy-sensitive drugs in the combination, consistent with the results of Test Set 2 of the AZS data. The highest Spearman correlation between the predicted and observed scores was seen in combinations with two sensitive drugs (median r = 0.11). In relation to the number of drugs in a combination present in the training set, DIPx achieved better performance for combinations with none or one drug in the training set (middle box plot, Figure 3 - figure supplement 1). Poor performance for combinations with two drugs in the training set could be due to the limited number of drug combinations (42/583). We also analyzed the prediction performance of DIPx across the 29 cell lines from 6 different cancer tissues (Figure 3b). Colon cancer (yellow boxplots) and lung cancer cell lines (purple boxplots) showed better validation compared to cell lines from breast, ovarian, melanoma, and prostate cancers.

Prediction performance of DIPx in the ONeil dataset. a) monotherapy sensitivity, b) 29 cell lines from 6 cancer tissues. The y-axis in all box plots shows the Spearman correlation between predicted and observed values in 100 bootstrap replicates.
Figure 3—figure supplement 1. Prediction performance of DIPx in the ONeil dataset, grouped by unseen combinations.

Inference of the mechanism of action based on PAS

The use of PAS in DIPx allows us to infer the potential mechanisms of action of drug combinations while maintaining the prediction performance of the model. For instance, in Test Set 1 of the AZS data, DIPx suggests the involvement of ERBB2 signaling pathways in the Capivasertib + Sapitinib combination, as illustrated by the top pathways depicted in Figure 1e and marked yellow in Figure 4a. This combination therapy has shown promise in overcoming resistance to anti-ERBB2 monotherapy in HER2+ breast cancer (Fujimoto et al., 2020), and ERBB2 has been identified as a key biomarker associated with synergistic responses to this combination in the AZS DREAM Challenge study (Menden et al., 2019).

Inference of pathway importance scores in the AZS dataset. a) Scatter plot showing feature importance (x-axis) vs PAS (y-axis) for the Capivasertib + Sapitinib combination. Pathways with high PAS and feature importance (top 5%) are of particular interest. **b, c**) Top pathways contributing to the prediction of the combinations in Test Set 1 (b) and Test Set 2 (c). For each pathway, the bar plots show its feature importance. **d, f**) Functional interaction between the pathway vs driver genes (x-axis) and the pathway vs target genes (y-axis) of the top pathways suggested by DIPx in the SW900 cell line treated with synergistic combination BCL2L1 + AZD5582 (d) and the non-synergistic combination Doxorubicin + AZ12623380 (f). The z-score from network enrichment analysis (NEA) is a measure of functional interaction between two gene sets. A higher z-score indicates a stronger interaction compared to a random permutation of the network. The upper right quadrant (z-score > 1.96) represents pathways that are potentially interesting. **e, g**) Cartoon illustration of the potential pathways mediated by the synergistic combination of BCL2L1 + AZD5582 (e) and the non-synergistic combination Doxorubicin + AZ12623380 (g).
Figure 4—figure supplement 2. A cartoon illustration of the RAS pathway mediated by the Selumetinib + MK-2206 combination
Figure 4—figure supplement 3. Observed vs predicted inhibition in the SW900 cell line treated by BCL2L1 + AZD5582 and Doxorubicin + AZ12623380 combinations
Figure 4—figure supplement 4. Functional interaction between driver genes, target genes, and top pathways suggested by DIPx in the SW900 cell line treated with BCL2L1 + AZD5582.
**Figure 4—figure supplement 5.** Functional interaction between driver genes, target genes, and top pathways suggested by DIPx in the SW900 cell line treated with Doxorubicin + AZ12623380.

Figure 4a further shows the distribution of feature importance versus PAS for all pathways for Capivasertib + Sapitinib combination. The feature importance value is calculated using the permutation method of Ishwaran et al. (Ishwaran and Lu, 2019). Our focus is on pathways with high feature importance (e.g., the top 5%) as well as highly activated (top 5% PAS). Therefore, the top-right section of Figure 4a is the interesting region. We present additional examples to further demonstrate the capabilities of DIPx. Figure 4b gives the top pathways of MEDI3622, an ADAM17 inhibitor, in combinations with AKT inhibitors including Capivasertib and MK-2206. These ADAM17 + AKT combinations target multiple parts of the PI3K/AKT pathway through ERBB activation (Menden et al., 2019), which aligns with the potential pathway candidates suggested by DIPx.

One of the key strengths of DIPx is its ability to infer potential mechanisms of both known and novel drug combinations, even in cases where limited biological or clinical information is available. This capability is particularly valuable for new combinations that have not been included in the training set. For instance, in Figure 4c, we present the key pathways identified for the Selumetinib + MK-2206 combination from Test Set 2 of the AZS data. We observe the involvement of RAS signaling, with Selumetinib targeting MEK and MK-2206 targeting AKT, as shown in Figure 4-figure supplement 2. A recent clinical study has used Selumetinib + MK-2206 to target downstream components of the RAS pathway (Chung et al., 2017).

If the drugs in a combination have the same target, the efficacy of the combination is likely similar to that of each individual drug at higher doses, i.e., they will only have an additive effect. So it seems reasonable to hypothesize that a synergistic combination is more likely to occur when the two drugs have different targets (Chen et al., 2015). But how should the targets be related to each other? To investigate this, we examine the pathways suggested by DIPx. First, we choose a synergistic combination of BCL2L1 + AZD5582 in the SW900 cell line for further illustration. The contour plot of the BCL2L1 + AZD5582 inhibition in the SW900 cell line is illustrated in Figure 4 - figure supplement 3a. We first collected the top 15 pathways (ranked by feature importance) for this BCL2L1 + AZD5582 combination suggested by DIPx. The full list of these pathways is shown in Figure 4 - figure supplement 4. Figure 4d illustrates the functional interaction between the genes of the top 15 pathways and the driver genes of the SW900 cell line (x-axis); the target genes of the combination BCL2L1 + AZD5582 (y-axis). To assess the strength of this interaction, we used the network enrichment analysis (NEA) (Alexeyenko et al., 2012), which provides z-score, an enrichment score, indicating the degree of interaction. A higher z-score reflects a stronger interaction between the two gene sets. The top pathways exhibiting high functional interaction with both the driver genes and target genes (z-score > 1.96) are particularly notable, located in the upper right quadrant of Figure 4d. In particular, the apoptosis pathway via NF-kB (highlighted in green) has the highest pathway-target interaction among these pathways. Figure 4e shows the cartoon illustration of the pathway in which the drug BCL2L1 targets BCL-xL and AZD5582 targets XIAP. This suggests an explanation for the observed synergy between the two drugs. Thus, it appears that in this case we get synergy when the drugs target different parts of a driving pathway, either directly or via other functional interactions.

As a negative control, we examine the non-synergistic combination Doxorubicin + AZ12623380, which targets the same gene TOP2; see Figure 4f and g and Figure 4 - figure supplement 3b. We similarly obtain 15 top-ranking pathways according to feature importance, but now we do not expect to see anything obviously relevant to the SW900 cell line (more details in Figure 4 - figure supplement 5). Some pathways that have a high functional interaction with the target genes (upper-left quadrant) have little interaction with the drivers. There are no clearly outlying points in the upper-right quadrant; the two pathways near the boundary are (i) Shen_Smarca2_targets_up, containing genes whose expression negatively correlated with the expression of the SMARCA2 gene in prostate cancer samples, discovered in relation to androgen-induced proliferation in the prostate; and (ii) Kokkinakis_Methione _deprivation_48hr_up, which contains up-regulated genes in melanoma cell-line MEWO cells after 48h of methionine deprivation. They do not appear to be relevant for the lung cancer cell line SW900.

PAS captures the functional interaction of drug targets

As we discuss above, to get synergy, the two drugs in a combination theoretically should not have the same target. However, there is of course no guarantee that two drugs that do not share target genes can produce synergy. In Figure 5a, using the AZS data, we compare the observed drug synergy of combinations of two drugs that share some target genes vs those that do not share any target genes. No significant differences were observed (p-value > 0.72), suggesting that non-overlapping drugs in terms of their targets do not necessarily result in improved drug synergy.

a) Comparison of drug synergy between combinations (drug A + drug B) with vs without overlapping target genes. The numbers in parentheses show the sample sizes of each group. b) Drug synergy between four groups in relation to increased functional interaction between the target genes of the two drugs. c) Comparison between the observed functional interaction (z-score in the network enrichment analysis) and the predicted z-score by PAS.

However, we also observed synergy when the two drugs target different genes in the same pathway. More generally, we hypothesize that synergistic effects occur when the targets have functional interaction. As before, the functional interaction is assessed using NEA (Alexeyenko et al., 2012), where a higher z-score value indicates a stronger functional interaction between the two drugs. Figure 5b shows the observed drug synergy (y-axis) in the AZS data for the four groups defined by the quartile values of the z-scores (x-axis). It indicates that combinations with higher functional interaction are more likely to achieve higher drug synergy, with the highest z-score group (z ∈ (2.97, 29.3]) exhibiting the most favorable drug synergy (median Loewe score = 10.34).

However, when added to the prediction model, the functional interaction z-score did not improve the prediction of synergy (data not shown). Statistically, this can happen if PAS already captures the functional interaction information. To show this, using the AZS training data, we trained a prediction model using PAS as the feature and the functional interaction z-score as the output. We then evaluated the performance of the model in the test set. As shown in Figure 5c, we observed a significant correlation between the predicted and observed z-scores, with a Spearman correlation coefficient of 0.46. This explains why the functional interaction does not give additional predictive power in our model.

Discussion

We have developed and validated DIPx, an advanced computational model that incorporates gene expression and mutation profiles to predict synergistic drug combinations. DIPx performs well against the best performing method in the AstraZeneca-Sanger DREAM Challenge. Through the use of tumor- and patient-specific pathway activation scores, DIPx also provides valuable information on the potential underlying pathways associated with an observed synergistic drug interaction. In addition to rigorous validation using the AZS dataset, DIPx is further validated on the independent ONeil dataset. This comprehensive validation ensures the robustness and reliability of DIPx in predicting drug synergy across different cancers.

The recent availability of large-scale drug combination assay data has allowed the development of realistic prediction models for drug synergy. These datasets offer a substantial number of samples encompassing hundreds of combinations, allowing for extensive validation studies. However, it is important to note that these datasets were generated using different protocols and drug screening techniques. For instance, the AZS data used a 5-by-5 concentration matrix, while the study by ONeil et al. used a 4-by-4 format. In addition, there is limited overlap in the cell lines used among the datasets. These differences pose challenges to the proper validation of prediction methods (Menden et al., 2019).

A particular strength of our study is that we use the best-performing method in the Challenge as a benchmark. This is a convenient but robust benchmarking, as there were 160 teams that participated in the Challenge (73 teams submitted in the final round). Altogether, these teams used practically all of the commonly used machine learning tools; see the summary in Menden et al(Menden et al., 2019). Another strength is our use and validation of the confidence score metric, which captures the statistical uncertainty in the predicted synergy by a single number. This is more convenient for clinical interpretation than the standard prediction interval, because there is a target level for which a combination is considered synergistic, so the score measures our confidence in achieving the target.

Despite promising results, our study has several limitations. First, the use of cell lines as training and validation samples from the AZS and ONeil datasets may not fully capture the heterogeneity present in actual tumors. Second, the computation of PAS relies solely on the primary target genes of the drug combinations, potentially disregarding valuable information from non-primary targets. There could also be off-targets that we do not know about. This limitation might lead to the loss of information about the broader effects of drug combinations. Third, cancer is a heterogeneous disease that occurs in many tissues. Even within a single tissue, cancer exhibits distinct (molecular) subtypes with varying biological mechanisms and clinical outcomes. Since DIPx was developed using pan-cancer datasets, it may not be optimal for tissue-specific predictions.

Last but not least, prediction of previously untrained combinations remains a great challenge. The worst case is for combinations of drugs that were not previously trained, with the Spearman correlation only around 0.1. However, from a clinical perspective, it is perhaps more realistic to look for combinations among drugs previously trained in monotherapy or in other combinations. Improving the prediction for the combination of such drugs would be worthwhile.

Methods and Materials

Pathway activation score for drug combinations

Pathway activation scores (PASs) are the key features in DIPx. The PAS of pathway P in cell line C is calculated for each drug combination (drug A + drug B) and pathway P. Genes in pathway P are grouped into three subgroups: (a) G_u, which includes all the target genes of drugs A and B, as well as the upstream genes of pathway P; (b) G_d , which includes the downstream genes of pathway P; and (c) G_dr, which consists of all the driver genes of cell line C in pathway P. In the example of the ERBB pathway targeted by Capivasertib + Sapitinib. (Figure 1b), G_u consists of ERBB, PI3K, and also AKT; G_d contains MTOR, RAS and MAPK, while G_dr includes TP53 and ERBB2.

The score for upstream activity (PAS_u) is calculated by the sum of mRNA expression for genes in G_u. Similarly, the scores for the downstream activity (PAS_d ) and the set of driver genes (PAS_dr) are calculated from G_d and G_dr. In practice, the genes of the N = 4,762 curated human pathways are provided from the MsigDB database (Liberzon et al., 2015). The target genes of the drugs are collected from the AZS dataset and extended from the DrugBank database (Wishart et al., 2018) and the ChEMBL database (Zdrazil et al., 2024). The extraction of the driver genes of the cell lines is described in the Datasets section.

A pathway based model for drug synergy prediction

The training features of DIPx consist of three components: upstream activity (PAS_u), downstream activity (PAS_d ), and driver genes (PAS_dr), as shown in Figure 1b. The final training matrix has a size of K experiments by 14,286 PASs, where each row corresponds to a specific experiment (drug A + drug B, cell line C).

To address potential sparsity in the training matrix caused by pathways with no target or driver genes, we explored an alternative model with N = 4,762 additional features. Each feature corresponds to a pathway P and is calculated as S(g) * (w1 + w2), where S(g) represents the sum of mRNA expression for all genes in pathway P, and w1 and w2 denote the functional interactions between gene sets: (pathway genes ↔ target genes) and (pathway genes ↔ driver genes), respectively. The functional interactions were estimated using NEA and converted into normal probability scores for w1 and w2. The feature value is zero only when the pathway lacks both targets and driver genes, as well as any interactions with drug targets and driver genes. Additionally, we incorporated the NEA enrichment score between target genes and driver genes into the final matrix. Despite adding these new features, the alternative model did not exhibit any significant improvements in predictive power (data not shown).

For the predictor, we used the random forest algorithm implemented in the randomforestRSC package (with default parameters) in R version 4.0.4. During the development of DIPx, we experimented with various machine learning methods, such as the support vector machine (SVM) and the elastic net. However, we found that these other methods yielded comparable results and that tuning their parameters did not significantly improve prediction performance while requiring extensive additional computations (data not shown). The random forest algorithm in the randomforestRSC package also offers multiple options to calculate the importance of features. In this study, we used the permutation (or Breiman-Cutler) method (Ishwaran and Lu, 2019) to infer the importance of each PAS.

The confidence score (CS) is used to assess the statistical quality of synergy prediction; see Pawitan (2001, Section 5.6) (Pawitan, 2001) for the confidence concept in general. First, as previously defined for example in (Menden et al., 2019), a combination is considered synergistic if the Loewe score is greater than or equal to 20. For each sample s, we have the actual predicted synergy P_s. We then generate N_b = 100 bootstrap replicates of the training data and obtain the bootstrap predictions for the sample: . The CS of P_s is defined as follows:

The bootstrap replicates are also used to evaluate the standard errors (se) of the Spearman correlation between the observed and predicted synergy scores in the test sets. The 95% confidence intervals are computed by the usual formula: se, where is the observed Spearman correlation. Though less frequently used, the bootstrap can also be used for bias correction (Pawitan, 2001, Section 5.2). Bias occurs if there is a nontrivial gap between the observed estimate and the mean of the bootstrap replications. Theoretically,

where F is the underlying data distribution. So, the bias-corrected estimate should be

In practice, the bias is estimated by

where are the bootstrap replicates of . When the estimated bias is negative, as we observed for DIPx, the bias-corrected estimate is shifted upward. And vice versa, if the bias is positive, as observed for TAJI-M, the corrected estimate is shifted downward.

Datasets

AstraZeneca-Sanger (AZS) DREAM Challenge dataset

The AZS DREAM Challenge is a rigorous competition in the effort to systematically develop and validate drug synergy prediction methods. Indicating the strong interest in the topic, 160 international teams (Menden et al., 2019) participated in the Challenge. It was organized into two subchallenges: i) Prediction for known (tested) combination and ii) Prediction for unknown (untested) drug combinations. The final dataset comprised 11,576 experiments from 85 cell lines and 910 combinations. The gene expression data of these cell lines was obtained from Affymetrix microarray ( Menden et al., 2019). H owever, to ensure consistency between the AZS dataset and the Oneil dataset (O’Neil et al., 2016) (which did not provide gene expression profiles of cell lines), we utilized gene expression data from the Cancer Cell Line Encyclopedia (CCLE) cohort (Ghandi et al., 2019).

Out of the 85 cell lines, we identified 75 cell lines with available gene expression data in the CCLE cohort, resulting in a total of 10,154 experiments involving 910 combinations used in our study. Supplementary File S1 shows the list of 75 cell lines. For the validation of the prediction model, the data were split into a training set (n = 2060) and two test sets (n = 963 and 7131) according to subchallenges 1 and 2, respectively. The first test set contains experiments from 167 combinations (of 69 single drugs) that are also in the training set. The second test set includes experiments with 729 drug combinations that are not in the training set.

We collected gene expression data of 75 cell lines, measuring the transcript per million (TPM) of 37,222 genes, of the CCLE cohort downloaded from the DepMap Portal (Tsherniak et al., 2017). The gene expression data was logarithmically transformed to the base 2 scale for downstream analysis. Additionally, we obtained potential driver genes for these cell lines, including both mutations and fusion genes, from the DepMap Portal. The portal provides information on mutations in 1,637 protein-coding genes associated with cancer biology in a collection of 1,030 cell lines.

To filter the list of mutations, we focused on those occurring in at least 2.5% of the total cell lines. Subsequently, we extracted the list of mutations specific to the 75 cell lines under investigation. For fusion genes, we focused on those present in the Miltelman database (Mitelman, 2022) and occurring at least twice, considering them as relevant for our analysis. The final list of potential driver genes for the 75 cell lines can be found in Supplementary File S1. On average, each cell line had a median of 29 potential driver genes.

For the drug synergy data, we used a 5-by-5 concentration matrix provided by the Challenge. Drug synergy values were estimated using the Loewe reference model from Combenefit (Di Veroli et al., 2016).

ONeil dataset

ONeil dataset is a large-scale drug synergy screening dataset from Merck&Co company (O’Neil et al., 2016). A total of 23,062 experiments with 583 unique drug combinations (38 monotherapy drugs) was carried out on 38 cancer cell lines by a 4-by-4 drug concentration matrix. Out of 38 cell lines, we found 29 cell lines with available gene expression data from the DepMap Portal. The detail of 29 cell lines is described in Supplementary File S1. The gene expression data of 37,222 genes from 29 cell lines, as well as the driver genes of these cell lines, were collected from the DepMap Portal using the same procedure as in the AZS dataset. The original release of this dataset provides only the raw data on drug synergy. Here, we calculated the Loewe synergy score for each experiment using Combenefit (Di Veroli et al., 2016). In total, we obtained 16,907 experiments for 583 combinations in 29 cell lines for further analysis. Drug targets of 38 monotherapy drugs were collected from the DrugBank database (Wishart et al., 2018) and the ChEMBL database (Zdrazil et al., 2024).

Data Availability

The implementation of DIPx, and related data are publicly available in https://www.github.com/tracquangthinh/DIPx. Drug synergy data are available from their original studies: Synapse database at synapse.org/DrugCombinationChallenge for the AZS dataset (Menden et al., 2019), raw data from the supplementary data for the ONeil dataset (O’Neil et al., 2016).

Acknowledgements

This work was partially supported by funding from the Swedish Research Council (VR), Cancer-Fonden, and the Swedish Foundation for Strategic Research (SSF). The computations were enabled by resources provided by the National Academic Infrastructure for Supercomputing in Sweden (NAISS) at UPPMAX partially funded by the Swedish Research Council through grant agreement no. 2018-05973. We acknowledge the investigators of the AstraZeneca-Sanger DREAM Challenge for data access.

Conflict of interest statement

The authors declare no competing interests.

Prediction performance of DIPx in the ONeil dataset, grouped by unseen combinations in the training set (x-axist). The y-axis in all box plots shows the Spearman correlation between predicted and observed values in 100 bootstrap replicates.

A cartoon illustration of the RAS pathway mediated by the Selumetinib + MK-2206 combination

Observed (red lines) vs predicted inhibition (black, dash lines) from Loewe reference model in the SW900 cell line treated by the synergistic BCL2L1 + AZD5582 combination (a) and the non-synergistic Doxorubicin + AZ12623380 combination (b). The number in each line presents the percentage of inhibition..

Functional interaction (x-axis) between the pathway vs driver genes (1st column), the pathway vs all target genes (2nd), the pathway vs BCL2L1 target genes (3th), and the pathway vs AZD5582 target genes (4th) of the top pathways suggested by DIPx in the SW900 cell line treated with synergistic combination BCL2L1 + AZD5582.

Functional interaction (x-axis) between the pathway vs driver genes (1st column), the pathway vs all target genes (2nd), the pathway vs Doxorubicin target genes (3th), and the pathway vs AZ12623380 target genes (4th) of the top pathways suggested by DIPx in the SW900 cell line treated with non-synergistic combination Doxorubicin + AZ12623380.

References

1. Alexeyenko A
2. Lee W
3. Pernemalm M
4. Guegan J
5. Dessen P
6. Lazar V
7. Lehtiö J
8. Pawitan Y.
2012Network enrichment analysis: extension of gene-set enrichment analysis to gene networksBMC bioinformatics 13:1–11Google Scholar
1. Banzi M
2. De Blasio S
3. Lallas A
4. Longo C
5. Moscarella E
6. Alfano R
7. Argenziano G.
2016Dabrafenib: a new opportunity for the treatment of BRAF V600-positive melanomaOncoTargets and therapy :2725–2733Google Scholar
1. Chen D
2. Liu X
3. Yang Y
4. Yang H
5. Lu P.
2015Systematic synergy modeling: understanding drug synergy from a systems biology perspectiveBMC systems biology 9:1–10Google Scholar
1. Chung V
2. McDonough S
3. Philip PA
4. Cardin D
5. Wang-Gillam A
6. Hui L
7. Tejani MA
8. Seery TE
9. Dy IA
10. Al Baghdadi T
11. et al.
2017Effect of selumetinib and MK-2206 vs oxaliplatin and fluorouracil in patients with metastatic pancreatic cancer after prior therapy: SWOG S1115 study randomized clinical trialJAMA oncology 3:516–522Google Scholar
1. Di Veroli GY
2. Fornari C
3. Wang D
4. Mollard S
5. Bramhall JL
6. Richards FM
7. Jodrell DI
2016Combeneﬁt: an interactive platform for the analysis and visualization of drug combinationsBioinformatics 32:2866–2868Google Scholar
1. Fujimoto Y
2. Morita TY
3. Ohashi A
4. Haeno H
5. Hakozaki Y
6. Fujii M
7. Kashima Y
8. Kobayashi SS
9. Mukohara T.
2020Com-bination treatment with a PI3K/Akt/mTOR pathway inhibitor overcomes resistance to anti-HER2 therapy in PIK3CA-mutant HER2-positive breast cancer cellsScientiﬁc reports 10:21762Google Scholar
1. Ghandi M
2. Huang FW
3. Jané-Valbuena J
4. Kryukov GV
5. Lo CC
6. McDonald ER
7. Barretina J
8. Gelfand ET
9. Bielski CM
10. Li H
11. et al.
2019Next-generation characterization of the cancer cell line encyclopediaNature 569:503–508Google Scholar
1. Güvenç Paltun B
2. Kaski S
3. Mamitsuka H.
2021Machine learning approaches for drug combination therapiesBriefings in bioinformatics 22:bbab293Google Scholar
1. Ishwaran H
2. Lu M.
2019Standard errors and conﬁdence intervals for variable importance in random forest regression, classiﬁcation, and survivalStatistics in medicine 38:558–582Google Scholar
1. Kong W
2. Midena G
3. Chen Y
4. Athanasiadis P
5. Wang T
6. Rousu J
7. He L
8. Aittokallio T.
2022Systematic review of computational methods for drug combination predictionComputational and structural biotechnology journal 20:2807–2814Google Scholar
1. Li H
2. Li T
3. Quang D
4. Guan Y.
2018Network propagation predicts drug synergy in cancersCancer research 78:5446–5457Google Scholar
1. Li P
2. Huang C
3. Fu Y
4. Wang J
5. Wu Z
6. Ru J
7. Zheng C
8. Guo Z
9. Chen X
10. Zhou W
11. et al.
2015Large-scale exploration and analysis of drug combinationsBioinformatics 31:2007–2016Google Scholar
1. Liberzon A
2. Birger C
3. Thorvaldsdóttir H
4. Ghandi M
5. Mesirov JP
6. Tamayo P.
2015The molecular signatures database hallmark gene set collectionCell systems 1:417–425Google Scholar
1. Liu Q
2. Xie L.
2021TranSynergy: Mechanism-driven interpretable deep neural network for the synergistic prediction and pathway deconvolution of drug combinationsPLoS computational biology 17:e1008653Google Scholar
1. Medicine N.
2017Rationalizing combination therapiesNature Medicine 23:1113https://doi.org/10.1038/nm.4426 Google Scholar
1. Menden MP
2. Wang D
3. Mason MJ
4. Szalai B
5. Bulusu KC
6. Guan Y
7. Yu T
8. Kang J
9. Jeon M
10. Wolﬁnger R
11. et al.
2019Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screenNature communications 10:2674Google Scholar
1. Mitelman F.
2022Mitelman database of chromosome aberrations and gene fusions in cancerhttps://mitelmandatabase.isb-cgc.org/
1. O’Neil J
2. Benita Y
3. Feldman I
4. Chenard M
5. Roberts B
6. Liu Y
7. Li J
8. Kral A
9. Lejnine S
10. Loboda A
11. et al.
2016An unbiased oncology compound screen to identify novel combination strategiesMolecular cancer therapeutics 15:1155–1162Google Scholar
1. Pawitan Y.
2001In all likelihood: statistical modelling and inference using likelihoodOxford University Press Google Scholar
1. Pemovska T
2. Bigenzahn JW
3. Superti-Furga G.
2018Recent advances in combinatorial drug screening and synergy scoringCurrent opinion in pharmacology 42:102–110Google Scholar
1. Plana D
2. Palmer AC
3. Sorger PK
2022Independent drug action in combination therapy: implications for precision oncologyCancer discovery 12:606–624Google Scholar
1. Subbiah V
2. Kreitman RJ
3. Wainberg ZA
4. Gazzah A
5. Lassen U
6. Stein A
7. Wen PY
8. Dietrich S
9. de Jonge MJ
10. Blay JY
11. et al.
2023Dabrafenib plus trametinib in BRAFV600E-mutated rare cancers: the phase 2 ROAR trialNature medicine 29:1103–1112Google Scholar
1. Tang YC
2. Gottlieb A.
2022SynPathy: Predicting drug synergy through drug-associated pathways using deep learningMolecular Cancer Research 20:762–769Google Scholar
1. Tsherniak A
2. Vazquez F
3. Montgomery PG
4. Weir BA
5. Kryukov G
6. Cowley GS
7. Gill S
8. Harrington WF
9. Pantel S
10. Krill-Burger JM
11. et al.
2017Deﬁning a cancer dependency mapCell 170:564–576Google Scholar
1. Wishart DS
2. Feunang YD
3. Guo AC
4. Lo EJ
5. Marcu A
6. Grant JR
7. Sajed T
8. Johnson D
9. Li C
10. Sayeeda Z
11. et al.
2018DrugBank 5.0: a major update to the DrugBank database for 2018Nucleic acids research 46:D1074–D1082Google Scholar
1. Wu L
2. Gao J
3. Zhang Y
4. Sui B
5. Wen Y
6. Wu Q
7. Liu K
8. He S
9. Bo X.
2023A hybrid deep forest-based method for predicting synergistic drug combinationsCell Reports Methods 3Google Scholar
1. Wu L
2. Wen Y
3. Leng D
4. Zhang Q
5. Dai C
6. Wang Z
7. Liu Z
8. Yan B
9. Zhang Y
10. Wang J
11. et al.
2022Machine learning methods, databases and tools for drug combination predictionBrieﬁngs in bioinformatics 23:bbab355Google Scholar
1. Zdrazil B
2. Felix E
3. Hunter F
4. Manners EJ
5. Blackshaw J
6. Corbett S
7. de Veij M
8. Ioannidis H
9. Lopez DM
10. Mosquera JF
11. et al.
2024The ChEMBL Database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periodsNucleic acids research 52:D1180–D1192Google Scholar
1. Zhong J
2. Yan W
3. Wang C
4. Liu W
5. Lin X
6. Zou Z
7. Sun W
8. Chen Y.
2022BRAF inhibitor resistance in melanoma: mechanisms and alternative therapeutic strategiesCurrent Treatment Options in Oncology 23:1503–1521Google Scholar

Article and author information

Author information

Quang Thinh Trac
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
ORCID iD: 0000-0003-2429-0287
- These authors contributed equally to this work
Yue Huang
Department of Health Statistics, School of Public Health, Weifang Medical University, Weifang, Shandong, China
- These authors contributed equally to this work
Tom Erkers
Department of Oncology Pathology, Karolinska Institutet, Science for Life Laboratory, Stockholm, Sweden
Päivi Östling
Department of Oncology Pathology, Karolinska Institutet, Science for Life Laboratory, Stockholm, Sweden, Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
Anna Bohlin
Department of Medicine Huddinge, Karolinska Institutet, Unit for Hematology, Karolinska University Hospital Huddinge, Stockholm, Sweden
Albin Österroos
Department of Medical Sciences, Hematology, Uppsala University Hospital, Uppsala, Sweden
Mattias Vesterlund
Department of Oncology Pathology, Karolinska Institutet, Science for Life Laboratory, Stockholm, Sweden
ORCID iD: 0000-0001-9471-6592
Rozbeh Jafari
Department of Oncology Pathology, Karolinska Institutet, Science for Life Laboratory, Stockholm, Sweden
ORCID iD: 0000-0002-3396-4709
Ioannis Siavelis
Department of Oncology Pathology, Karolinska Institutet, Science for Life Laboratory, Stockholm, Sweden
Helena Bäckvall
Department of Oncology Pathology, Karolinska Institutet, Science for Life Laboratory, Stockholm, Sweden
Santeri Kiviluoto
Department of Oncology Pathology, Karolinska Institutet, Science for Life Laboratory, Stockholm, Sweden
Lukas M Orre
Department of Oncology Pathology, Karolinska Institutet, Science for Life Laboratory, Stockholm, Sweden
Mattias Rantalainen
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Janne Lehtiö
Department of Oncology Pathology, Karolinska Institutet, Science for Life Laboratory, Stockholm, Sweden
ORCID iD: 0000-0002-8100-9562
Sören Lehmann
Department of Medicine Huddinge, Karolinska Institutet, Unit for Hematology, Karolinska University Hospital Huddinge, Stockholm, Sweden, Department of Medical Sciences, Hematology, Uppsala University Hospital, Uppsala, Sweden
Olli Kallioniemi
Department of Oncology Pathology, Karolinska Institutet, Science for Life Laboratory, Stockholm, Sweden, Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
Yudi Pawitan
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Trung Nghia Vu
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
ORCID iD: 0000-0001-7945-5750
- For correspondence:⠀trungnghia.vu@ki.se (TNV)

Version history

Sent for peer review: June 5, 2024
Preprint posted: June 8, 2024
Reviewed Preprint version 1: August 9, 2024
Reviewed Preprint version 2: March 20, 2025
Reviewed Preprint version 3: May 20, 2025
Version of Record published: June 3, 2025

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.100071. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, and public reviews.

Reviewing Editor
Alan Talevi
Universidad Nacional de La Plata, La Plata, Argentina
Senior Editor
Aleksandra Walczak
CNRS, Paris, France

Reviewer #1 (Public Review):

The authors introduce DIPx, a deep learning framework for predicting synergistic drug combinations for cancer treatment using the AstraZeneca-Sanger (AZS) DREAM Challenge dataset. While the approach is innovative, I have the following concerns and comments which hopefully will improve the study's rigor and applicability, making it a more powerful tool in the real clinical world.

(1) Test Set 1 comprises combinations already present in the training set, likely leading overfitting issue. The model might show inflated performance metrics on this test set due to prior exposure to these combinations, not accurately reflecting its true predictive power on unknown data, which is crucial for discovering new drug synergies. The testing approach reduces the generalizability of the model's findings to new, untested scenarios.

(2) The model struggles with predicting synergies for drug combinations not included in its training data (showing only a Spearman correlation of 0.26 in Test Set 2). This limits its potential for discovering new therapeutic strategies. Utilizing techniques such as transfer learning or expanding the training dataset to encompass a wider range of drug pairs could help to address this issue.

(3) The use of pan-cancer datasets, while offering broad applicability, may not be optimal for specific cancer subtypes with distinct biological mechanisms. Developing subtype-specific models or adjusting the current model to account for these differences could improve prediction accuracy for individual cancer types.

(4) Line 127, "Since DIPx uses only molecular data, to make a fair comparison, we trained TAJI using only molecular features and referred to it as TAJI-M.". TAJI was designed to use both monotherapy drug-response and molecular data, and likely won't be able to reach maximum potential if removing monotherapy drug-response from the training model. It would be critical to use the same training datasets and then compare the performances. From Figure 6 of TAJI's paper (Li et al., 2018, PMID: 30054332) , i.e., the mean Pearson correlation for breast cancer and lung cancer is around 0.5 - 0.6.

The following 2 concerns have been included in the Discussion section which is great:

(1) Training and validating the model using cell lines may not fully capture the heterogeneity and complexity of in vivo tumors. To increase clinical relevance, it would be beneficial to validate the model using primary tumor samples or patient-derived xenografts.

(2) The Pathway Activation Score (PAS) is derived exclusively from primary target genes, potentially overlooking critical interactions involving non-primary targets. Including these secondary effects could enhance the model's predictive accuracy and comprehensiveness.

https://doi.org/10.7554/eLife.100071.1.sa2

Reviewer #2 (Public Review):

Trac, Huang, et al used the AZ Drug Combination Prediction DREAM challenge data to make a new random forest-based model for drug synergy. They make comparisons to the winning method and also show that their model has some predictive capacity for a completely different dataset. They highlight the ability of the model to be interpretable in terms of pathway and target interactions for synergistic effects. While the authors address an important question, more rigor is required to understand the full behavior of the model.

Major Points

(1) The authors compare DIPx to the winning method of the DREAm challenge, TAJI to show that from molecular features alone they retrain TAJI to create TAJI-M without the monotherapy data inputs. They mention that "of course, we could also use such data in DIPx...", but they never show the behaviour of DIPx with these data. The authors need to demonstrate that this statement holds true or else compare it to the full TAJI.

(2) It would be neat to see how the DIPx feature importance changes with monotherapy input. For most realistic scenarios in which these models are used robust monotherapy data do exist.

(3) In Figure 2, the authors compare DIPx and TAJI-M on various test sets. If I understood correctly, they also bootstrapped the training set with n=100 and reported all the model variants in many of the comparisons. While this is a nice way of showing model robustness, calculating p-values with bootstrapped data does not make sense in my opinion as by increasing the value of n, one can make the p-value arbitrarily small. The p-value should only be reported for the original models.

(4) From Figures 2 and 3, it appears DIPx is overfit on the training set with large gaps in Spearman correlations between Test Set 2/ONeil set and Test Set 1. It also features much better in cases where it has seen both compounds. Could the authors also compare TAJI on the ONeil dataset to show if it is as much overfit?

https://doi.org/10.7554/eLife.100071.1.sa1

Reviewer #3 (Public Review):

Summary:

Predicting how two different drugs act together by looking at their specific gene targets and pathways is crucial for understanding the biological significance of drug combinations. Such combinations of drugs can lead to synergistic effects that enhance drug efficacy and decrease resistance. This study incorporates drug-specific pathway activation scores (PASs) to estimate synergy scores as one of the key advancements for synergy prediction. The new algorithm, Drug synergy Interaction Prediction (DIPx), developed in this study, uses gene expression, mutation profiles, and drug synergy data to train the model and predict synergy between two drugs and suggests the best combinations based on their functional relevance on the mechanism of action. Comprehensive validations using two different datasets and comparing them with another best-performing algorithm highlight the potential of its capabilities and broader applications. However, the study would benefit from including experimental validation of some predicted drug combinations to enhance its reliability.

Strengths:

The DIPx algorithm demonstrates the strengths listed below in its approach for personalized drug synergy prediction. One of its strengths lies in its utilization of biologically motivated cancer-specific (driver genes-based) and drug-specific (target genes-based) pathway activation scores (PASs) to predict drug synergy. This approach integrates gene expression, mutation profiles, and drug synergy data to capture information about the functional interactions between drug targets, thereby providing a potential biological explanation for the synergistic effects of combined drugs. Additionally, DIPx's performance was tested using the AstraZeneca-Sanger (AZS) DREAM Challenge dataset, especially in Test Set 1, where the Spearman correlation coefficient between predicted and observed drug synergy was 0.50 (95% CI: 0.47-0.53). This demonstrates the algorithm's effectiveness in handling combinations already in the training set. Furthermore, DIPx's ability to handle novel combinations, as evidenced by its performance in Test Set 2, indicates its potential for extrapolating predictions to new and untested drug combinations. This suggests that the algorithm can adapt to and make accurate predictions for previously unencountered combinations, which is crucial for its practical application in personalized medicine. Overall, DIPx's integration of pathway activation scores and its performance in predicting drug synergy for known and novel combinations underscore its potential as a valuable tool for personalized prediction of drug synergy and exploration of activated pathways related to the effects of combined drugs.

Weaknesses:

While the DIPx algorithm shows promise in predicting drug synergy based on pathway activation scores, it's essential to consider its limitations. One limitation is that the algorithm's performance was less accurate when predicting drug synergy for combinations absent from the training set. This suggests that its predictive capability may be influenced by the availability of training data for specific drug combinations. Additionally, further testing and validation across different datasets (more than the current two datasets) would be necessary to assess the algorithm's generalizability and robustness fully. It's also important to consider potential biases in the training data and ensure that DIPx predictions are validated through empirical studies including experimental testing of predicted combinations. Despite these limitations, DIPx represents a valuable step towards personalized prediction of drug synergy and warrants continued investigation and improvement. It would benefit if the algorithm's limitations are described with some examples and suggest future advancement steps.

https://doi.org/10.7554/eLife.100071.1.sa0

Significance of findings

Strength of evidence

Abstract

Introduction

Results

A pathway based drug synergy prediction model

Validation and comparisons in the AZS dataset

External validation of DIPx in the ONeil dataset

Inference of the mechanism of action based on PAS

PAS captures the functional interaction of drug targets

Discussion

Methods and Materials

Pathway activation score for drug combinations

A pathway based model for drug synergy prediction

Datasets

AstraZeneca-Sanger (AZS) DREAM Challenge dataset

ONeil dataset

Data Availability

Acknowledgements

Conflict of interest statement

References

Article and author information

Author information

Quang Thinh Trac†

Yue Huang†

Tom Erkers

Päivi Östling

Anna Bohlin

Albin Österroos

Mattias Vesterlund

Rozbeh Jafari

Ioannis Siavelis

Helena Bäckvall

Santeri Kiviluoto

Lukas M Orre

Mattias Rantalainen

Janne Lehtiö

Sören Lehmann

Olli Kallioniemi

Yudi Pawitan

Trung Nghia Vu

Version history

Cite all versions

Copyright

Peer review process

Editors

Quang Thinh Trac

Yue Huang