Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins
Abstract
Recent functional, proteomic and ribosome profiling studies in eukaryotes have concurrently demonstrated the translation of alternative open reading frames (altORFs) in addition to annotated protein coding sequences (CDSs). We show that a large number of small proteins could in fact be encoded by these altORFs. The putative alternative proteins translated from altORFs have orthologs in many species and contain functional domains. Evolutionary analyses indicate that altORFs often show more extreme conservation patterns than their CDSs. Thousands of alternative proteins are detected in proteomic datasets by reanalysis using a database containing predicted alternative proteins. We illustrate this with specific examples, including altMiD51, a 70-amino-acid mitochondrial fission-promoting protein encoded in MiD51/Mief1/SMCR7L, a gene encoding an annotated protein promoting mitochondrial fission. Our results suggest that many genes are multicoding genes and code for a large protein and one or several small proteins.
Article and author information
Funding
Canadian Institutes of Health Research (MOP-137056)
- Xavier Roucou
Canada Research Chairs
- Aïda Ouangraoua
- Christian R Landry
- Xavier Roucou
Fonds de Recherche du Québec - Nature et Technologies (2015-PR-181807)
- Christian R Landry
- Xavier Roucou
Merck Sharp and Dohme
- Xavier Roucou
Canadian Institutes of Health Research (MOP-136962)
- Xavier Roucou
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Copyright
© 2017, Samandi et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.