1. Chromosomes and Gene Expression
  2. Computational and Systems Biology
Download icon

Coverage-dependent bias creates the appearance of binary splicing in single cells

  1. Carlos F Buen Abad Najar
  2. Nir Yosef  Is a corresponding author
  3. Liana F Lareau  Is a corresponding author
  1. University of California, Berkeley, United States
Research Article
  • Cited 6
  • Views 4,264
  • Annotations
Cite this article as: eLife 2020;9:e54603 doi: 10.7554/eLife.54603

Abstract

Single cell RNA sequencing provides powerful insight into the factors that determine each cell's unique identity. Previous studies led to the surprising observation that alternative splicing among single cells is highly variable and follows a bimodal pattern: a given cell consistently produces either one or the other isoform for a particular splicing choice, with few cells producing both isoforms. Here we show that this pattern arises almost entirely from technical limitations. We analyze alternative splicing in human and mouse single cell RNA-seq datasets, and model them with a probabilistic simulator. Our simulations show that low gene expression and low capture efficiency distort the observed distribution of isoforms. This gives the appearance of binary splicing outcomes, even when the underlying reality is consistent with more than one isoform per cell. We show that accounting for the true amount of information recovered can produce biologically meaningful measurements of splicing in single cells.

Data availability

All sequencing data reanalyzed in this study were acquired from GEO.

The following previously published data sets were used

Article and author information

Author details

  1. Carlos F Buen Abad Najar

    Center for Computational Biology, University of California, Berkeley, Berkeley, United States
    Competing interests
    The authors declare that no competing interests exist.
  2. Nir Yosef

    Department of Electrical Engineering and Computer Science and the Center for Computational Biology, University of California, Berkeley, Berkeley, United States
    For correspondence
    niryosef@berkeley.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-9004-1225
  3. Liana F Lareau

    Department of Bioengineering, University of California, Berkeley, Berkeley, United States
    For correspondence
    lareau@berkeley.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-3223-3426

Funding

UC MEXUS-Conacyt (Doctoral Fellowship)

  • Carlos F Buen Abad Najar

Chan Zuckerberg Biohub

  • Nir Yosef

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. L Stirling Churchman, Harvard Medical School, United States

Publication history

  1. Received: December 19, 2019
  2. Accepted: June 28, 2020
  3. Accepted Manuscript published: June 29, 2020 (version 1)
  4. Version of Record published: September 17, 2020 (version 2)

Copyright

© 2020, Buen Abad Najar et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 4,264
    Page views
  • 458
    Downloads
  • 6
    Citations

Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Download citations (links to download the citations from this article in formats compatible with various reference manager tools)

Open citations (links to open the citations from this article in various online reference manager services)

Further reading

    1. Chromosomes and Gene Expression
    2. Genetics and Genomics
    Zeyang Shen et al.
    Research Article

    Regulation of gene expression requires the combinatorial binding of sequence-specific transcription factors (TFs) at promoters and enhancers. Prior studies showed that alterations in the spacing between TF binding sites can influence promoter and enhancer activity. However, the relative importance of TF spacing alterations resulting from naturally occurring insertions and deletions (InDels) has not been systematically analyzed. To address this question, we first characterized the genome-wide spacing relationships of 73 TFs in human K562 cells as determined by ChIP-seq. We found a dominant pattern of a relaxed range of spacing between collaborative factors, including 45 TFs exclusively exhibiting relaxed spacing with their binding partners. Next, we exploited millions of InDels provided by genetically diverse mouse strains and human individuals to investigate the effects of altered spacing on TF binding and local histone acetylation. These analyses suggested that spacing alterations resulting from naturally occurring InDels are generally tolerated in comparison to genetic variants directly affecting TF binding sites. To experimentally validate this prediction, we introduced synthetic spacing alterations between PU.1 and C/EBPβ binding sites at six endogenous genomic loci in a macrophage cell line. Remarkably, collaborative binding of PU.1 and C/EBPβ at these locations tolerated changes in spacing ranging from 5-bp increase to >30-bp decrease. Collectively, these findings have implications for understanding mechanisms underlying enhancer selection and for the interpretation of non-coding genetic variation.

    1. Chromosomes and Gene Expression
    2. Developmental Biology
    Graham JM Hickey et al.
    Research Article Updated

    Vertebrate embryos achieve developmental competency during zygotic genome activation (ZGA) by establishing chromatin states that silence yet poise developmental genes for subsequent lineage-specific activation. Here, we reveal the order of chromatin states in establishing developmental gene poising in preZGA zebrafish embryos. Poising is established at promoters and enhancers that initially contain open/permissive chromatin with ‘Placeholder’ nucleosomes (bearing H2A.Z, H3K4me1, and H3K27ac), and DNA hypomethylation. Silencing is initiated by the recruitment of polycomb repressive complex 1 (PRC1), and H2Aub1 deposition by catalytic Rnf2 during preZGA and ZGA stages. During postZGA, H2Aub1 enables Aebp2-containing PRC2 recruitment and H3K27me3 deposition. Notably, preventing H2Aub1 (via Rnf2 inhibition) eliminates recruitment of Aebp2-PRC2 and H3K27me3, and elicits transcriptional upregulation of certain developmental genes during ZGA. However, upregulation is independent of H3K27me3 – establishing H2Aub1 as the critical silencing modification at ZGA. Taken together, we reveal the logic and mechanism for establishing poised/silent developmental genes in early vertebrate embryos.