An improved zebrafish transcriptome annotation for sensitive and comprehensive detection of cell type-specific genes

  1. Nathan D Lawson (corresponding author)
  2. Rui Li
  3. Masahiro Shin
  4. Ann Grosse
  5. Onur Yukselen
  6. Oliver A Stone
  7. Alper Kucukural
  8. Lihua Zhu

  1. University of Massachusetts Medical School, United States
  2. University of Oxford, United Kingdom

Abstract

The zebrafish is ideal for studying embryogenesis and is increasingly applied to model human disease. In these contexts, RNA-sequencing (RNA-seq) provides mechanistic insights by identifying transcriptome changes between experimental conditions. Application of RNA-seq relies on accurate transcript annotation for a genome of interest. Here, we find discrepancies in analysis from RNA-seq datasets quantified using Ensembl and RefSeq zebrafish annotations. These issues were due, in part, to variably annotated 3' untranslated regions and thousands of gene models missing from each annotation. Since these discrepancies could compromise downstream analyses and biological reproducibility, we built a more comprehensive zebrafish transcriptome annotation that addresses these deficiencies. Our annotation improves detection of cell type-specific genes in both bulk and single cell RNA-seq datasets, where it also improves resolution of cell clustering. Thus, we demonstrate that our new transcriptome annotation can outperform existing annotations, providing an important resource for zebrafish researchers.

Data availability

All data generated in this study are available in accompanying source data files. Transcriptome annotation files described in this study are available for download at zf-transcriptome.umassmed.edu. Raw and processed RNA-seq data generated in this study are available at GEO (GSE152759).

Article and author information

Author details

  1. Nathan D Lawson

    Department of Molecular, Cell, and Cancer Biology, University of Massachusetts Medical School, Worcester, United States
    For correspondence
    nathan.lawson@umassmed.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID: 0000-0001-7788-9619
  2. Rui Li

    Department of Molecular, Cell, and Cancer Biology, University of Massachusetts Medical School, Worcester, United States
    Competing interests
    The authors declare that no competing interests exist.
  3. Masahiro Shin

    Department of Molecular, Cell, and Cancer Biology, University of Massachusetts Medical School, Worcester, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Ann Grosse

    Department of Molecular, Cell, and Cancer Biology, University of Massachusetts Medical School, Worcester, United States
    Competing interests
    The authors declare that no competing interests exist.
  5. Onur Yukselen

    Department of Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, United States
    Competing interests
    The authors declare that no competing interests exist.
  6. Oliver A Stone

    Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  7. Alper Kucukural

    Department of Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID: 0000-0001-9983-394X
  8. Lihua Zhu

    Department of Molecular, Cell, and Cancer Biology, University of Massachusetts Medical School, Worcester, United States
    Competing interests
    The authors declare that no competing interests exist.

Funding

National Heart, Lung, and Blood Institute (R35HL140017)

  • Nathan D Lawson

National Human Genome Research Institute (U01HG007910)

  • Onur Yukselen
  • Alper Kucukural

National Center for Advancing Translational Sciences (UL1TR001453)

  • Onur Yukselen
  • Alper Kucukural

National Institute of Neurological Disorders and Stroke (R21NS105654)

  • Nathan D Lawson

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Ethics

Animal experimentation: Zebrafish studies were performed in accordance with protocols #A2613 and #A2632 approved by the University of Massachusetts institutional animal care and use committee (IACUC).

Copyright

© 2020, Lawson et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 10,330 views
  • 877 downloads
  • 103 citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Cite this article

Nathan D Lawson, Rui Li, Masahiro Shin, Ann Grosse, Onur Yukselen, Oliver A Stone, Alper Kucukural, Lihua Zhu (2020)
An improved zebrafish transcriptome annotation for sensitive and comprehensive detection of cell type-specific genes
eLife 9:e55792.
https://doi.org/10.7554/eLife.55792
