Identification of protein-protected mRNA fragments and structured excised intron RNAs in human plasma by TGIRT-seq peak calling

  1. Jun Yao
  2. Douglas C Wu
  3. Ryan M Nottingham
  4. Alan M Lambowitz  Is a corresponding author
  1. University of Texas at Austin, United States

Abstract

Human plasma contains >40,000 different coding and non-coding RNAs that are potential biomarkers for human diseases. Here, we used thermostable group II intron reverse transcriptase sequencing (TGIRT-seq) combined with peak calling to simultaneously profile all RNA biotypes in apheresis-prepared human plasma pooled from healthy individuals. Extending previous TGIRT-seq analysis, we found that human plasma contains largely fragmented mRNAs from >19,000 protein-coding genes, abundant full-length, mature tRNAs and other structured small non-coding RNAs, and less abundant tRNA fragments and mature and pre-miRNAs. Many of the mRNA fragments identified by peak calling correspond to annotated protein-binding sites and/or have stable predicted secondary structures that could afford protection from plasma nucleases. Peak calling also identified novel repeat RNAs, miRNA-sized RNAs, and putatively structured intron RNAs of potential biological, evolutionary, and biomarker significance, including a family of full-length excised introns RNAs, subsets of which correspond to mirtron pre-miRNAs or agotrons.

Data availability

Code availability: All scripts used for data processing are deposited in GitHub: https://github.com/wckdouglas/cfNADate deposition: The TGIRT-seq datasets in this manuscript are listed in the Supplementary File and have been deposited in the National Center for Biotechnology Information Sequence Read Archive (accession number: PRJNA640428).

The following data sets were generated
The following previously published data sets were used

Article and author information

Author details

  1. Jun Yao

    Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, United States
    Competing interests
    Jun Yao, is an inventors on a patent application filed by the University of Texas at Austin for the use of full-length excised intron RNAs and intron RNA fragments as biomarkers. US patent application 63/014,429.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-1232-1587
  2. Douglas C Wu

    Institute for Cellular and Molecular Biology, Department of Molecular Biosciences, University of Texas at Austin, Austin, United States
    Competing interests
    Douglas C Wu, is an inventor on a patent application filed by the University of Texas at Austin for the use of full-length excised intron RNAs and intron RNA fragments as biomarkers. US patent application 63/014,429; is currently an employee of QIAGEN..
  3. Ryan M Nottingham

    Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, United States
    Competing interests
    No competing interests declared.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-6937-5394
  4. Alan M Lambowitz

    Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, United States
    For correspondence
    lambowitz@austin.utexas.edu
    Competing interests
    Alan M Lambowitz, Thermostable group II intron reverse transcriptase (TGIRT) enzymes and methods for their use are the subject of patents and patent applications that have been licensed by the University of Texas and East Tennessee State University to InGex, LLC.Is a minority equity holder in InGex, LLC and receive royalty payments from the sale of TGIRT-enzymes and kits and from the sublicensing of intellectual property by InGex to other companies.Is an inventor on a patent application filed by the University of Texas at Austin for the use of full-length excised intron RNAs and intron RNA fragments as biomarkers. US patent application 63/014,429.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-6036-2423

Funding

National Institute of General Medical Sciences (R01 GM37949)

  • Alan M Lambowitz

National Institute of General Medical Sciences (R35 GM136216)

  • Alan M Lambowitz

Welch Foundation (F-1607)

  • Alan M Lambowitz

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. Timothy W Nilsen, Case Western Reserve University, United States

Version history

  1. Received: July 6, 2020
  2. Accepted: September 1, 2020
  3. Accepted Manuscript published: September 2, 2020 (version 1)
  4. Accepted Manuscript updated: September 3, 2020 (version 2)
  5. Version of Record published: September 25, 2020 (version 3)

Copyright

© 2020, Yao et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 3,527
    Page views
  • 395
    Downloads
  • 15
    Citations

Article citation count generated by polling the highest count across the following sources: PubMed Central, Crossref, Scopus.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Jun Yao
  2. Douglas C Wu
  3. Ryan M Nottingham
  4. Alan M Lambowitz
(2020)
Identification of protein-protected mRNA fragments and structured excised intron RNAs in human plasma by TGIRT-seq peak calling
eLife 9:e60743.
https://doi.org/10.7554/eLife.60743

Share this article

https://doi.org/10.7554/eLife.60743

Further reading

    1. Chromosomes and Gene Expression
    2. Genetics and Genomics
    Erandi Velazquez-Miranda, Ming He
    Insight

    Endothelial cell subpopulations are characterized by unique gene expression profiles, epigenetic landscapes and functional properties.

    1. Cell Biology
    2. Chromosomes and Gene Expression
    Monica Salinas-Pena, Elena Rebollo, Albert Jordan
    Research Article

    Histone H1 participates in chromatin condensation and regulates nuclear processes. Human somatic cells may contain up to seven histone H1 variants, although their functional heterogeneity is not fully understood. Here, we have profiled the differential nuclear distribution of the somatic H1 repertoire in human cells through imaging techniques including super-resolution microscopy. H1 variants exhibit characteristic distribution patterns in both interphase and mitosis. H1.2, H1.3, and H1.5 are universally enriched at the nuclear periphery in all cell lines analyzed and co-localize with compacted DNA. H1.0 shows a less pronounced peripheral localization, with apparent variability among different cell lines. On the other hand, H1.4 and H1X are distributed throughout the nucleus, being H1X universally enriched in high-GC regions and abundant in the nucleoli. Interestingly, H1.4 and H1.0 show a more peripheral distribution in cell lines lacking H1.3 and H1.5. The differential distribution patterns of H1 suggest specific functionalities in organizing lamina-associated domains or nucleolar activity, which is further supported by a distinct response of H1X or phosphorylated H1.4 to the inhibition of ribosomal DNA transcription. Moreover, H1 variants depletion affects chromatin structure in a variant-specific manner. Concretely, H1.2 knock-down, either alone or combined, triggers a global chromatin decompaction. Overall, imaging has allowed us to distinguish H1 variants distribution beyond the segregation in two groups denoted by previous ChIP-Seq determinations. Our results support H1 variants heterogeneity and suggest that variant-specific functionality can be shared between different cell types.