Peer review process
Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, public reviews, and a provisional response from the authors.
Read more about eLife’s peer review process.Editors
- Reviewing EditorYaoting JiWuhan University, Wuhan, China
- Senior EditorCaigang LiuShengjing Hospital of China Medical University, Shenyang, China
Reviewer #1 (Public review):
Summary:
The authors aimed to classify hepatocellular carcinoma (HCC) patients into distinct subtypes using a comprehensive multi-omics approach. They employed an innovative consensus clustering method that integrates multiple omics data types, including mRNA, lncRNA, miRNA, DNA methylation, and somatic mutations. The study further sought to validate these subtypes by developing prognostic models using machine learning algorithms and extending the findings through single-cell RNA sequencing (scRNA-seq) to explore the cellular mechanisms driving subtype-specific prognostic differences.
Strengths:
(1) Comprehensive Data Integration: The study's integration of various omics data provides a well-rounded view of the molecular characteristics underlying HCC. This multi-omics approach is a significant strength, as it allows for more accurate and detailed classification of cancer subtypes.
(2) Innovative Methodology: The use of a consensus clustering approach that combines results from 10 different clustering algorithms is a notable methodological advancement. This approach reduces the bias that can result from relying on a single clustering method, enhancing the robustness of the findings.
(3) Machine Learning-Based Prognostic Modeling: The authors rigorously apply a wide array of machine learning algorithms to develop and validate prognostic models, testing 101 different algorithm combinations. This comprehensive approach underscores the study's commitment to identifying the most predictive models, which is a considerable strength.
(4) Validation Across Multiple Cohorts: The external validation of findings in independent cohorts is a critical strength, as it increases the generalizability and reliability of the results. This step is essential for demonstrating the clinical relevance of the proposed subtypes and prognostic models.
Weaknesses:
(1) Inconsistent Storyline:
Despite the extensive data mining and rigorous methodologies, the manuscript suffers from a lack of a coherent and consistent narrative. The transition between different sections, particularly from multi-omics data integration to single-cell validation, feels disjointed. A clearer articulation of how each analysis ties into the overall research question would improve the manuscript.
(2) Questionable Relevance of Immune Cell Activity Analysis:
The evaluation of immune cell activities within the cancer cell model raises concerns about its meaningfulness. The methods used to assess immune function in the tumor microenvironment may not be fully appropriate, potentially limiting the insights gained from this part of the study.
(3) Incomplete Single-Cell RNA-Seq Validation:
The validation of the findings using single-cell RNA-seq data appears insufficient to fully support the study's claims. While the authors make an effort to extend their findings to the single-cell level, the analysis lacks depth. A more comprehensive validation is necessary to substantiate the robustness of the identified subtypes.
(4) Figures and Visualizations:
Several figures in the manuscript are missing necessary information, which affects the clarity of the results. For instance, the pathways in Figure 3A could be clustered to enhance interpretability, the blue bar in Figure 4A is unexplained, and Figure 4B is not discussed in the text. Additionally, the figure legend in Figure 7C lacks detail, and many figure descriptions merely repeat the captions without providing deeper insights.
(5) Appraisal of the Study's Aims and Results:
The authors have set out to achieve an ambitious goal of classifying HCC patients into distinct prognostic subtypes and validating these findings through both bulk and single-cell analyses. While the methodologies employed are innovative and the data integration comprehensive, the study falls short of fully achieving its aims due to inconsistencies in the narrative and incomplete validation. The results partially support the conclusions, but the lack of coherence and depth in certain areas limits the overall impact of the study.
(6) Impact on the Field:
If the identified weaknesses are addressed, this study has the potential to significantly impact the field of HCC research. The multi-omics approach combined with machine learning is a powerful framework that could set a new standard for cancer subtype classification. However, the current state of the manuscript leaves some uncertainty regarding the practical applicability of the findings, particularly in clinical settings.
(6) Additional Context
For readers and researchers, this study offers a valuable look into the potential of integrating multi-omics data with machine learning to improve cancer classification and prognostication. However, readers should be aware of the noted weaknesses, particularly the need for more consistent narrative development and comprehensive validation of the methods. Addressing these issues could greatly enhance the study's utility and relevance to the community.
Reviewer #2 (Public review):
Summary:
Overall, this is a well-executed and insightful study. With some refinement to the presentation and a deeper exploration of the implications, the manuscript will make a significant contribution to the field of cancer genomics and personalized medicine.
Strengths:
The manuscript integrates multi-omics data with machine learning to address the significant heterogeneity of hepatocellular carcinoma (HCC). The use of multiple clustering algorithms and a consensus method strengthens the robustness of the findings. The study successfully develops a prognostic model with excellent predictive accuracy, validated across independent datasets. This adds considerable value to the field, particularly in providing individualized treatment strategies. The identification of two distinct liver cancer subtypes with different biological and metabolic characteristics is well-supported by the data, offering a promising direction for personalized medicine.
Weaknesses:
(1) Consider streamlining the presentation of methods, especially regarding the clustering algorithms and machine learning models. Readers may find it difficult to follow the exact process unless more clearly outlined.
(2) Some figures, such as the signaling pathways and heatmaps, are critical to understanding the study's findings. Ensure that all figures are high quality, easy to interpret, and adequately labeled. You may also want to highlight the key findings within the figure captions more explicitly.
(3) While the manuscript does compare its prognostic model to those previously published, the novelty of the findings could be emphasized more clearly. Discussing the potential limitations of the study (e.g., the reliance on computational models and small sample sizes for scRNA-seq) could strengthen the manuscript.
(4) The manuscript mentions that the data was split into training and validation datasets in a 1:1 ratio. How was the performance verified? Is there an independent test set?
(5) The role of the MIF signaling pathway in subtype differentiation is intriguing, but further mechanistic insights into how this pathway drives the differences between CS1 and CS2 could be discussed in more detail. If experimental evidence for this pathway exists in the literature, it should be mentioned.
(6) Some sentences are quite long and complex, which can affect readability. Breaking them down into shorter, clearer sentences would improve the flow.