Identification of protein-protected mRNA fragments and structured excised intron RNAs in human plasma by TGIRT-seq peak calling
Abstract
Human plasma contains >40,000 different coding and non-coding RNAs that are potential biomarkers for human diseases. Here, we used thermostable group II intron reverse transcriptase sequencing (TGIRT-seq) combined with peak calling to simultaneously profile all RNA biotypes in apheresis-prepared human plasma pooled from healthy individuals. Extending previous TGIRT-seq analysis, we found that human plasma contains largely fragmented mRNAs from >19,000 protein-coding genes, abundant full-length, mature tRNAs and other structured small non-coding RNAs, and less abundant tRNA fragments and mature and pre-miRNAs. Many of the mRNA fragments identified by peak calling correspond to annotated protein-binding sites and/or have stable predicted secondary structures that could afford protection from plasma nucleases. Peak calling also identified novel repeat RNAs, miRNA-sized RNAs, and putatively structured intron RNAs of potential biological, evolutionary, and biomarker significance, including a family of full-length excised introns RNAs, subsets of which correspond to mirtron pre-miRNAs or agotrons.
Data availability
Code availability: All scripts used for data processing are deposited in GitHub: https://github.com/wckdouglas/cfNADate deposition: The TGIRT-seq datasets in this manuscript are listed in the Supplementary File and have been deposited in the National Center for Biotechnology Information Sequence Read Archive (accession number: PRJNA640428).
Article and author information
Author details
Funding
National Institute of General Medical Sciences (R01 GM37949)
- Alan M Lambowitz
National Institute of General Medical Sciences (R35 GM136216)
- Alan M Lambowitz
Welch Foundation (F-1607)
- Alan M Lambowitz
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Copyright
© 2020, Yao et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 4,004
- views
-
- 436
- downloads
-
- 25
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Chromosomes and Gene Expression
- Genetics and Genomics
Among the major classes of RNAs in the cell, tRNAs remain the most difficult to characterize via deep sequencing approaches, as tRNA structure and nucleotide modifications can each interfere with cDNA synthesis by commonly-used reverse transcriptases (RTs). Here, we benchmark a recently-developed RNA cloning protocol, termed Ordered Two-Template Relay (OTTR), to characterize intact tRNAs and tRNA fragments in budding yeast and in mouse tissues. We show that OTTR successfully captures both full-length tRNAs and tRNA fragments in budding yeast and in mouse reproductive tissues without any prior enzymatic treatment, and that tRNA cloning efficiency can be further enhanced via AlkB-mediated demethylation of modified nucleotides. As with other recent tRNA cloning protocols, we find that a subset of nucleotide modifications leave misincorporation signatures in OTTR datasets, enabling their detection without any additional protocol steps. Focusing on tRNA cleavage products, we compare OTTR with several standard small RNA-Seq protocols, finding that OTTR provides the most accurate picture of tRNA fragment levels by comparison to "ground truth" Northern blots. Applying this protocol to mature mouse spermatozoa, our data dramatically alter our understanding of the small RNA cargo of mature mammalian sperm, revealing a far more complex population of tRNA fragments - including both 5′ and 3′ tRNA halves derived from the majority of tRNAs – than previously appreciated. Taken together, our data confirm the superior performance of OTTR to commercial protocols in analysis of tRNA fragments, and force a reappraisal of potential epigenetic functions of the sperm small RNA payload.
-
- Chromosomes and Gene Expression
O-GlcNAcylation is the reversible post-translational addition of β-N-acetylglucosamine to serine and threonine residues of nuclear and cytoplasmic proteins. It plays an important role in several cellular processes through the modification of thousands of protein substrates. O-GlcNAcylation in humans is mediated by a single essential enzyme, O-GlcNAc transferase (OGT). OGT, together with the sole O-GlcNAcase OGA, form an intricate feedback loop to maintain O-GlcNAc homeostasis in response to changes in cellular O-GlcNAc using a dynamic mechanism involving nuclear retention of its fourth intron. However, the molecular mechanism of this dynamic regulation remains unclear. Using an O-GlcNAc responsive GFP reporter cell line, we identify SFSWAP, a poorly characterized splicing factor, as a trans-acting factor regulating OGT intron detention. We show that SFSWAP is a global regulator of retained intron splicing and exon skipping that primarily acts as a negative regulator of splicing. In contrast, knockdown of SFSWAP leads to reduced inclusion of a ‘decoy exon’ present in the OGT retained intron which may mediate its role in OGT intron detention. Global analysis of decoy exon inclusion in SFSWAP and UPF1 double knockdown cells indicate altered patterns of decoy exon usage. Together, these data indicate a role for SFSWAP as a global negative regulator of pre-mRNA splicing and positive regulator of intron retention.