Signature-scoring methods developed for bulk samples are not adequate for cancer single-cell RNA sequencing data

  1. Nighat Noureen
  2. Zhenqing Ye
  3. Yidong Chen
  4. Xiaojing Wang
  5. Siyuan Zheng  Is a corresponding author
  1. The University of Texas Health Science Center at San Antonio, United States
  2. University of Texas Health Science Center at San Antonio, United States

Abstract

Quantifying the activity of gene expression signatures is common in analyses of single-cell RNA sequencing data. Methods originally developed for bulk samples are often used for this purpose without accounting for contextual differences between bulk and single-cell data. More broadly, these methods have not been benchmarked. Here we benchmark five such methods, including single sample gene set enrichment analysis (ssGSEA), Gene Set Variation Analysis (GSVA), AUCell, Single Cell Signature Explorer (SCSE), and a new method we developed, Jointly Assessing Signature Mean and Inferring Enrichment (JASMINE). Using cancer as an example, we show cancer cells consistently express more genes than normal cells. This imbalance leads to bias in performance by bulk-sample-based ssGSEA in gold standard tests and down sampling experiments. In contrast, single-cell-based methods are less susceptible. Our results suggest caution should be exercised when using bulk-sample-based methods in single-cell data analyses, and cellular contexts should be taken into consideration when designing benchmarking strategies.

Data availability

The current manuscript is a computational study, so no data have been generated for this manuscript. Single cell data sets used in this study including their downloading sources were listed in Supplementary Table 1. Gene sets were downloaded from MSigDB v.7.2. JASMINE source code is available on Github (https://github.com/NNoureen/JASMINE). Source Data contain the numerical data used to generate the figures.

The following data sets were generated
The following previously published data sets were used

Article and author information

Author details

  1. Nighat Noureen

    Greehey Children's Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, United States
    Competing interests
    The authors declare that no competing interests exist.
  2. Zhenqing Ye

    Greehey Children's Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, United States
    Competing interests
    The authors declare that no competing interests exist.
  3. Yidong Chen

    Greehey Children's Cancer Research Institute, University of Texas Health Science Center at San Antonio, San Antonio, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Xiaojing Wang

    Greehey Children's Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, United States
    Competing interests
    The authors declare that no competing interests exist.
  5. Siyuan Zheng

    Greehey Children's Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, United States
    For correspondence
    zhengs3@uthscsa.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-1031-9424

Funding

Cancer Prevention and Research Institute of Texas (RR170055)

  • Siyuan Zheng

Cancer Prevention and Research Institute of Texas (RP170345)

  • Nighat Noureen

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

© 2022, Noureen et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 13,397
    views
  • 768
    downloads
  • 34
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Nighat Noureen
  2. Zhenqing Ye
  3. Yidong Chen
  4. Xiaojing Wang
  5. Siyuan Zheng
(2022)
Signature-scoring methods developed for bulk samples are not adequate for cancer single-cell RNA sequencing data
eLife 11:e71994.
https://doi.org/10.7554/eLife.71994

Share this article

https://doi.org/10.7554/eLife.71994

Further reading

    1. Cancer Biology
    Pierluigi Scerbo, Benjamin Tisserand ... Bertrand Ducos
    Research Article

    Why does a normal cell possibly harboring genetic mutations in oncogene or tumor suppressor genes becomes malignant and develops a tumor is a subject of intense debate. Various theories have been proposed but their experimental test has been hampered by the unpredictable and improbable malignant transformation of single cells. Here, using an optogenetic approach we permanently turn on an oncogene (KRASG12V) in a single cell of a zebrafish brain that, only in synergy with the transient co-activation of a reprogramming factor (VENTX/NANOG/OCT4), undergoes a deterministic malignant transition and robustly and reproducibly develops within 6 days into a full-blown tumor. The controlled way in which a single cell can thus be manipulated to give rise to cancer lends support to the ‘ground state theory of cancer initiation’ through ‘short-range dispersal’ of the first malignant cells preceding tumor growth.

    1. Cancer Biology
    Han V Han, Richard Efem ... Richard Z Lin
    Research Article

    Most human pancreatic ductal adenocarcinoma (PDAC) are not infiltrated with cytotoxic T cells and are highly resistant to immunotherapy. Over 90% of PDAC have oncogenic KRAS mutations, and phosphoinositide 3-kinases (PI3Ks) are direct effectors of KRAS. Our previous study demonstrated that ablation of Pik3ca in KPC (KrasG12D; Trp53R172H; Pdx1-Cre) pancreatic cancer cells induced host T cells to infiltrate and completely eliminate the tumors in a syngeneic orthotopic implantation mouse model. Now, we show that implantation of Pik3ca−/− KPC (named αKO) cancer cells induces clonal enrichment of cytotoxic T cells infiltrating the pancreatic tumors. To identify potential molecules that can regulate the activity of these anti-tumor T cells, we conducted an in vivo genome-wide gene-deletion screen using αKO cells implanted in the mouse pancreas. The result shows that deletion of propionyl-CoA carboxylase subunit B gene (Pccb) in αKO cells (named p-αKO) leads to immune evasion, tumor progression, and death of host mice. Surprisingly, p-αKO tumors are still infiltrated with clonally enriched CD8+ T cells but they are inactive against tumor cells. However, blockade of PD-L1/PD1 interaction reactivated these clonally enriched T cells infiltrating p-αKO tumors, leading to slower tumor progression and improve survival of host mice. These results indicate that Pccb can modulate the activity of cytotoxic T cells infiltrating some pancreatic cancers and this understanding may lead to improvement in immunotherapy for this difficult-to-treat cancer.