Identification of protein-protected mRNA fragments and structured excisedintron RNAs in human plasma by TGIRT-seq peak calling

  1. Jun Yao
  2. Douglas C Wu
  3. Ryan M Nottingham
  4. Alan M Lambowitz  Is a corresponding author
  1. University of Texas at Austin, United States

Abstract

Human plasma contains >40,000 different coding and non-coding RNAs that are potential biomarkers for human diseases. Here, we used thermostable group II intron reverse transcriptase sequencing (TGIRT-seq) combined with peak calling to simultaneously profile all RNA biotypes in apheresis-prepared human plasma pooled from healthy individuals. Extending previous TGIRT-seq analysis, we found that human plasma contains largely fragmented mRNAs from >19,000 protein-coding genes, abundant full-length, mature tRNAs and other structured small non-coding RNAs, and less abundant tRNA fragments and mature and pre-miRNAs. Many of the mRNA fragments identified by peak calling correspond to annotated protein-binding sites and/or have stable predicted secondary structures that could afford protection from plasma nucleases. Peak calling also identified novel repeat RNAs, miRNA-sized RNAs, and putatively structured intron RNAs of potential biological, evolutionary, and biomarker significance, including a family of full-length excised introns RNAs, subsets of which correspond to mirtron pre-miRNAs or agotrons.

Data availability

Code availability: All scripts used for data processing are deposited in GitHub: https://github.com/wckdouglas/cfNADate deposition: The TGIRT-seq datasets in this manuscript are listed in the Supplementary File and have been deposited in the National Center for Biotechnology Information Sequence Read Archive (accession number: PRJNA640428).

The following data sets were generated
The following previously published data sets were used

Article and author information

Author details

  1. Jun Yao

    Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, United States
    Competing interests
    Jun Yao, is an inventors on a patent application filed by the University of Texas at Austin for the use of full-length excised intron RNAs and intron RNA fragments as biomarkers. US patent application 63/014,429.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-1232-1587
  2. Douglas C Wu

    Institute for Cellular and Molecular Biology, Department of Molecular Biosciences, University of Texas at Austin, Austin, United States
    Competing interests
    Douglas C Wu, is an inventor on a patent application filed by the University of Texas at Austin for the use of full-length excised intron RNAs and intron RNA fragments as biomarkers. US patent application 63/014,429; is currently an employee of QIAGEN..
  3. Ryan M Nottingham

    Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, United States
    Competing interests
    No competing interests declared.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-6937-5394
  4. Alan M Lambowitz

    Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, United States
    For correspondence
    lambowitz@austin.utexas.edu
    Competing interests
    Alan M Lambowitz, Thermostable group II intron reverse transcriptase (TGIRT) enzymes and methods for their use are the subject of patents and patent applications that have been licensed by the University of Texas and East Tennessee State University to InGex, LLC.Is a minority equity holder in InGex, LLC and receive royalty payments from the sale of TGIRT-enzymes and kits and from the sublicensing of intellectual property by InGex to other companies.Is an inventor on a patent application filed by the University of Texas at Austin for the use of full-length excised intron RNAs and intron RNA fragments as biomarkers. US patent application 63/014,429.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-6036-2423

Funding

National Institute of General Medical Sciences (R01 GM37949)

  • Alan M Lambowitz

National Institute of General Medical Sciences (R35 GM136216)

  • Alan M Lambowitz

Welch Foundation (F-1607)

  • Alan M Lambowitz

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. Timothy W Nilsen, Case Western Reserve University, United States

Version history

  1. Received: July 6, 2020
  2. Accepted: September 1, 2020
  3. Accepted Manuscript published: September 2, 2020 (version 1)
  4. Accepted Manuscript updated: September 3, 2020 (version 2)
  5. Version of Record published: September 25, 2020 (version 3)

Copyright

© 2020, Yao et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 3,607
    views
  • 405
    downloads
  • 19
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Jun Yao
  2. Douglas C Wu
  3. Ryan M Nottingham
  4. Alan M Lambowitz
(2020)
Identification of protein-protected mRNA fragments and structured excisedintron RNAs in human plasma by TGIRT-seq peak calling
eLife 9:e60743.
https://doi.org/10.7554/eLife.60743

Share this article

https://doi.org/10.7554/eLife.60743

Further reading

    1. Chromosomes and Gene Expression
    Rupam Choudhury, Anuroop Venkateswaran Venkatasubramani ... Axel Imhof
    Research Article

    Eukaryotic chromatin is organized into functional domains, that are characterized by distinct proteomic compositions and specific nuclear positions. In contrast to cellular organelles surrounded by lipid membranes, the composition of distinct chromatin domains is rather ill described and highly dynamic. To gain molecular insight into these domains and explore their composition, we developed an antibody-based proximity-biotinylation method targeting the RNA and proteins constituents. The method that we termed Antibody-Mediated-Proximity-Labelling-coupled to Mass Spectrometry (AMPL-MS) does not require the expression of fusion proteins and therefore constitutes a versatile and very sensitive method to characterize the composition of chromatin domains based on specific signature proteins or histone modifications. To demonstrate the utility of our approach we used AMPL-MS to characterize the molecular features of the chromocenter as well as the chromosome territory containing the hyperactive X-chromosome in Drosophila. This analysis identified a number of known RNA binding proteins in proximity of the hyperactive X and the centromere, supporting the accuracy of our method. In addition, it enabled us to characterize the role of RNA in the formation of these nuclear bodies. Furthermore, our method identified a new set of RNA molecules associated with the Drosophila centromere. Characterization of these novel molecules suggested the formation of R-loops in centromeres, which we validated using a novel probe for R-loops in Drosophila. Taken together, AMPL-MS improves the selectivity and specificity of proximity ligation allowing for novel discoveries of weak protein-RNA interactions in biologically diverse domains.

    1. Cancer Biology
    2. Chromosomes and Gene Expression
    Gregory Caleb Howard, Jing Wang ... William P Tansey
    Research Article

    The chromatin-associated protein WD Repeat Domain 5 (WDR5) is a promising target for cancer drug discovery, with most efforts blocking an arginine-binding cavity on the protein called the ‘WIN’ site that tethers WDR5 to chromatin. WIN site inhibitors (WINi) are active against multiple cancer cell types in vitro, the most notable of which are those derived from MLL-rearranged (MLLr) leukemias. Peptidomimetic WINi were originally proposed to inhibit MLLr cells via dysregulation of genes connected to hematopoietic stem cell expansion. Our discovery and interrogation of small-molecule WINi, however, revealed that they act in MLLr cell lines to suppress ribosome protein gene (RPG) transcription, induce nucleolar stress, and activate p53. Because there is no precedent for an anticancer strategy that specifically targets RPG expression, we took an integrated multi-omics approach to further interrogate the mechanism of action of WINi in human MLLr cancer cells. We show that WINi induce depletion of the stock of ribosomes, accompanied by a broad yet modest translational choke and changes in alternative mRNA splicing that inactivate the p53 antagonist MDM4. We also show that WINi are synergistic with agents including venetoclax and BET-bromodomain inhibitors. Together, these studies reinforce the concept that WINi are a novel type of ribosome-directed anticancer therapy and provide a resource to support their clinical implementation in MLLr leukemias and other malignancies.