Self-assembling manifolds in single-cell RNA sequencing data
Abstract
Single-cell RNA sequencing has spurred the development of computational methods that enable researchers to classify cell types, delineate developmental trajectories, and measure molecular responses to external perturbations. Many of these technologies rely on their ability to detect genes whose cell-to-cell variations arise from the biological processes of interest rather than transcriptional or technical noise. However, for datasets in which the biologically relevant differences between cells are subtle, identifying these genes is challenging. We present the self-assembling manifold (SAM) algorithm, an iterative soft feature selection strategy to quantify gene relevance and improve dimensionality reduction. We demonstrate its advantages over other state-of-the-art methods with experimental validation in identifying novel stem cell populations of Schistosoma mansoni, a prevalent parasite that infects hundreds of millions of people. Extending our analysis to a total of 56 datasets, we show that SAM is generalizable and consistently outperforms other methods in a variety of biological and quantitative benchmarks.
Data availability
The schistosome stem cell scRNAseq data generated in this study is available through the Gene Expression Omnibus (GEO) under accession number GSE116920.
-
Single-cell RNA sequencing of proliferative stem cell population from juvenile Schistosoma mansoniNCBI Gene Expression Omnibus, GSE116920.
-
Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cellsNCBI Gene Expression Omnibus, GSE36552.
-
Adult mouse cortical cell taxonomy revealed by single cell transcriptomicsNCBI Gene Expression Omnibus, GSE71585-GPL17021.
-
The transcriptome and DNA methylome landscapes of human primordial germ cellsNCBI Gene Expression Omnibus, GSE63818.
-
Single-cell analysis uncovers clonal acinar cell heterogeneity in the adult pancreasNCBI Gene Expression Omnibus, GSE80032.
-
Single-cell RNA-seq reveals dynamic, random monoallelic gene expression in mammalian cellsNCBI Gene Expression Omnibus, GSE45719.
-
Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastomaNCBI Gene Expression Omnibus, GSE57872.
-
Single-cell topological RNA-seq analysis reveals insights into cellular differentiation and developmentNCBI Gene Expression Omnibus, GSE94883.
-
Innate-like functions of natural killer T cell subsets result from highly divergent gene programsNCBI Gene Expression Omnibus, GSE74596.
-
Single-cell RNA-Seq resolves cellular complexity in sensory organs from the neonatal inner earNCBI Gene Expression Omnibus, GSE71982.
-
Cell fate inclination within 2-cell and 4-cell mouse embryos revealed by single-cell RNA sequencingNCBI Gene Expression Omnibus, GSE57249.
-
The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cellsNCBI Gene Expression Omnibus, GSE52529-GPL16791.
-
Single-cell RNA-seq reveals dynamic paracrine control of cellular variationNCBI Gene Expression Omnibus, GSE48968-GPL13112.
-
Oscope identifies oscillatory genes in unsynchronized single-cell RNAseq experimentsNCBI Gene Expression Omnibus, GSE64016.
-
Reconstructing lineage hierarchies of the distal lung epithelium using single-cell RNA-seqNCBI Gene Expression Omnibus, GSE52583-GPL13112.
-
Single-cell analysis of mixed-lineage states leading to a binary cell fate choiceNCBI Gene Expression Omnibus, GSE70245.
-
Single-cell RNA-seq with waterfall reveals molecular cascades underlying adult neurogenesisNCBI Gene Expression Omnibus, GSE71485.
-
Deciphering cell lineage specification during male sex determination with single-cell RNA sequencingNCBI Gene Expression Omnibus, GSE97519.
-
A molecular atlas of cell types and zonation in the brain vasculatureNCBI Gene Expression Omnibus, GSE99235.
-
Multipotent peripheral glial cells generate neuroendocrine cells of the adrenal medullaNCBI Gene Expression Omnibus, GSE99933.
-
Defining the earliest step of cardiovascular lineage segregation by single-cell RNA-seqNCBI Gene Expression Omnibus, GSE100471.
-
Temporal tracking of microglia activation in neurodegeneration at single-cell resolutionNCBI Gene Expression Omnibus, GSE103334.
-
Early emergence of cortical interneuron diversity in the mouse embryoNCBI Gene Expression Omnibus, GSE109796.
-
Single-cell transcriptional dynamics of flavivirus infectionNCBI Gene Expression Omnibus, GSE110496.
-
Single-cell RNA-seq supports a developmental hierarchy in human oligodendrogliomaNCBI Gene Expression Omnibus, GSE70630.
-
Transcriptional heterogeneity and lineage commitment in myeloid progenitorsNCBI Gene Expression Omnibus, GSE72857.
-
A survey of human brain transcriptome diversity at the single cell levelNCBI Gene Expression Omnibus, GSE67835.
-
Single-cell transcriptomics of the human endocrine pancreasNCBI Gene Expression Omnibus, GSE83139.
-
Cell type transcriptome atlas for the planarian Schmidtea mediterraneaNCBI Gene Expression Omnibus, GSE111764.
-
A Single-cell transcriptome atlas of the human pancreasNCBI Gene Expression Omnibus, GSE85241.
Article and author information
Author details
Funding
Burroughs Wellcome Fund
- Bo Wang
Arnold and Mabel Beckman Foundation
- Bo Wang
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Ethics
Animal experimentation: In adherence to the Animal Welfare Act and the Public Health Service Policy on Humane Care and Use of Laboratory Animals, all experiments with and care of mice were performed in accordance with protocols approved by the Institutional Animal Care and Use Committees (IACUC) of Stanford University (protocol approval number 30366).
Reviewing Editor
- Alex K Shalek, Broad Institute of MIT and Harvard, United States
Publication history
- Received: June 3, 2019
- Accepted: September 16, 2019
- Accepted Manuscript published: September 16, 2019 (version 1)
- Version of Record published: October 16, 2019 (version 2)
Copyright
© 2019, Tarashansky et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 6,553
- Page views
-
- 908
- Downloads
-
- 20
- Citations
Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Computational and Systems Biology
- Neuroscience
To decide whether a course of action is worth pursuing, individuals typically weigh its expected costs and benefits. Optimal decision-making relies upon accurate effort cost anticipation, which is generally assumed to be performed independently from goal valuation. In two experiments (n = 46), we challenged this independence principle of standard decision theory. We presented participants with a series of treadmill routes randomly associated to monetary rewards and collected both ‘accept’ versus ‘decline’ decisions and subjective estimates of energetic cost. Behavioural results show that higher monetary prospects led participants to provide higher cost estimates, although reward was independent from effort in our design. Among candidate cognitive explanations, they support a model in which prospective cost assessment is biased by the output of an automatic computation adjusting effort expenditure to goal value. This decision bias might lead people to abandon the pursuit of valuable goals that are in fact not so costly to achieve.
-
- Computational and Systems Biology
- Neuroscience
Synaptic communication relies on the fusion of synaptic vesicles with the plasma membrane, which leads to neurotransmitter release. This exocytosis is triggered by brief and local elevations of intracellular Ca2+ with remarkably high sensitivity. How this is molecularly achieved is unknown. While synaptotagmins confer the Ca2+ sensitivity of neurotransmitter exocytosis, biochemical measurements reported Ca2+ affinities too low to account for synaptic function. However, synaptotagmin's Ca2+ affinity increases upon binding the plasma membrane phospholipid PI(4,5)P2 and, vice versa, Ca2+-binding increases synaptotagmin's PI(4,5)P2 affinity, indicating a stabilization of the Ca2+/PI(4,5)P2 dual-bound syt. Here we devise a molecular exocytosis model based on this positive allosteric stabilization and the assumptions that (1.) synaptotagmin Ca2+/PI(4,5)P2 dual binding lowers the energy barrier for vesicle fusion and that (2.) the effect of multiple synaptotagmins on the energy barrier is additive. The model, which relies on biochemically measured Ca2+/PI(4,5)P2 affinities and protein copy numbers, reproduced the steep Ca2+ dependency of neurotransmitter release. Our results indicate that each synaptotagmin dual binding Ca2+/PI(4,5)P2 lowers the energy barrier for vesicle fusion by ~5 kBT and that allosteric stabilization of this state enables the synchronized engagement of several (typically three) synaptotagmins for fast exocytosis. Furthermore, we show that mutations altering synaptotagmin’s allosteric properties may show dominant-negative effects, even though synaptotagmins act independently on the energy barrier, and that dynamic changes of local PI(4,5)P2 (e.g. upon vesicle movement) dramatically impact synaptic responses. We conclude that allosterically stabilized Ca2+/PI(4,5)P2 dual binding enables synaptotagmins to exert their coordinated function in neurotransmission.