Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq

  1. Dylan Kotliar  Is a corresponding author
  2. Adrian Veres
  3. M Aurel Nagy
  4. Shervin Tabrizi
  5. Eran Hodis
  6. Douglas A Melton
  7. Pardis C Sabeti
  1. Harvard Medical School, United States
  2. Massachusetts Institute of Technology, United States
  3. Broad Institute of MIT and Harvard, United States
  4. Harvard University, United States

Abstract

Identifying gene expression programs underlying both cell-type identity and cellular activities (e.g. life-cycle processes, responses to environmental cues) is crucial for understanding the organization of cells and tissues. Although single-cell RNA-Seq (scRNA-Seq) can quantify transcripts in individual cells, each cell's expression profile may be a mixture of both types of programs, making them difficult to disentangle. Here we benchmark and enhance the use of matrix factorization to solve this problem. We show with simulations that a method we call consensus non-negative matrix factorization (cNMF) accurately infers identity and activity programs, including their relative contributions in each cell. To illustrate the insights this approach enables, we apply it to published brain organoid and visual cortex scRNA-Seq datasets; cNMF refines cell types and identifies both expected (e.g. cell cycle and hypoxia) and novel activity programs, including programs that may underlie a neurosecretory phenotype and synaptogenesis.

Data availability

All of the analyzed real datasets are publicly available and the relevant GEO accession codes are included in the manuscript. All of the simulated and real data can be accessed through Code Ocean at the following URL: https://doi.org/10.24433/CO.9044782e-cb96-4733-8a4f-bf42c21399e6

The following previously published data sets were used

Article and author information

Author details

  1. Dylan Kotliar

    Department of Systems Biology, Harvard Medical School, Boston, United States
    For correspondence
    dylan_kotliar@hms.harvard.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-7968-645X
  2. Adrian Veres

    Department of Systems Biology, Harvard Medical School, Boston, United States
    Competing interests
    The authors declare that no competing interests exist.
  3. M Aurel Nagy

    Harvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, Cambridge, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Shervin Tabrizi

    Viral Computational Genomics, Broad Institute of MIT and Harvard, Cambridge, United States
    Competing interests
    The authors declare that no competing interests exist.
  5. Eran Hodis

    Harvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, Cambridge, United States
    Competing interests
    The authors declare that no competing interests exist.
  6. Douglas A Melton

    Department of Stem Cell and Regenerative Biology, Harvard Stem Cell Institute, Harvard University, Cambridge, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-1623-5504
  7. Pardis C Sabeti

    Department of Systems Biology, Harvard Medical School, Boston, United States
    Competing interests
    The authors declare that no competing interests exist.

Funding

National Institute of General Medical Sciences (T32GM007753)

  • Dylan Kotliar
  • Adrian Veres
  • M Aurel Nagy
  • Eran Hodis

National Institute of Allergy and Infectious Diseases (R01AI099210)

  • Pardis C Sabeti

U.S. Food and Drug Administration (HHSF223201810172C)

  • Dylan Kotliar
  • Pardis C Sabeti

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. Alfonso Valencia, Barcelona Supercomputing Center - BSC, Spain

Version history

  1. Received: November 21, 2018
  2. Accepted: July 7, 2019
  3. Accepted Manuscript published: July 8, 2019 (version 1)
  4. Version of Record published: July 18, 2019 (version 2)

Copyright

© 2019, Kotliar et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 36,284
    Page views
  • 3,250
    Downloads
  • 151
    Citations

Article citation count generated by polling the highest count across the following sources: Scopus, Crossref, PubMed Central.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Dylan Kotliar
  2. Adrian Veres
  3. M Aurel Nagy
  4. Shervin Tabrizi
  5. Eran Hodis
  6. Douglas A Melton
  7. Pardis C Sabeti
(2019)
Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq
eLife 8:e43803.
https://doi.org/10.7554/eLife.43803

Share this article

https://doi.org/10.7554/eLife.43803

Further reading

    1. Computational and Systems Biology
    2. Neuroscience
    Domingos Leite de Castro, Miguel Aroso ... Paulo Aguiar
    Research Article Updated

    Closed-loop neuronal stimulation has a strong therapeutic potential for neurological disorders such as Parkinson’s disease. However, at the moment, standard stimulation protocols rely on continuous open-loop stimulation and the design of adaptive controllers is an active field of research. Delayed feedback control (DFC), a popular method used to control chaotic systems, has been proposed as a closed-loop technique for desynchronisation of neuronal populations but, so far, was only tested in computational studies. We implement DFC for the first time in neuronal populations and access its efficacy in disrupting unwanted neuronal oscillations. To analyse in detail the performance of this activity control algorithm, we used specialised in vitro platforms with high spatiotemporal monitoring/stimulating capabilities. We show that the conventional DFC in fact worsens the neuronal population oscillatory behaviour, which was never reported before. Conversely, we present an improved control algorithm, adaptive DFC (aDFC), which monitors the ongoing oscillation periodicity and self-tunes accordingly. aDFC effectively disrupts collective neuronal oscillations restoring a more physiological state. Overall, these results support aDFC as a better candidate for therapeutic closed-loop brain stimulation.

    1. Cancer Biology
    2. Computational and Systems Biology
    Sara Latini, Veronica Venafra ... Francesca Sacco
    Research Article

    Currently, the identification of patient-specific therapies in cancer is mainly informed by personalized genomic analysis. In the setting of acute myeloid leukemia (AML), patient-drug treatment matching fails in a subset of patients harboring atypical internal tandem duplications (ITDs) in the tyrosine kinase domain of the FLT3 gene. To address this unmet medical need, here we develop a systems-based strategy that integrates multiparametric analysis of crucial signaling pathways, and patient-specific genomic and transcriptomic data with a prior knowledge signaling network using a Boolean-based formalism. By this approach, we derive personalized predictive models describing the signaling landscape of AML FLT3-ITD positive cell lines and patients. These models enable us to derive mechanistic insight into drug resistance mechanisms and suggest novel opportunities for combinatorial treatments. Interestingly, our analysis reveals that the JNK kinase pathway plays a crucial role in the tyrosine kinase inhibitor response of FLT3-ITD cells through cell cycle regulation. Finally, our work shows that patient-specific logic models have the potential to inform precision medicine approaches.