Circular RNA repertoires are associated with evolutionarily young transposable elements
Abstract
Circular RNAs (circRNAs) are found across eukaryotes and can function in post-transcriptional gene regulation. Their biogenesis through a circle-forming backsplicing reaction is facilitated by reverse-complementary repetitive sequences that promote pre-mRNA folding. Orthologous genes from which circRNAs arise contain, overall, more strongly conserved splice sites and exons than other genes, yet it remains unclear to what extent this conservation reflects purifying selection acting on the circRNAs themselves. Our analyses of circRNA repertoires from five species representing three mammalian lineages (marsupials and the eutherian lineages rodents and primates) reveal that surprisingly few circRNAs arise from orthologous exonic loci across all species. Even the circRNAs from orthologous loci are associated with young, recently active and species-specific transposable elements, rather than with common, ancient transposon integration events. These observations suggest that many circRNAs emerged convergently during evolution, as a byproduct of splicing in orthologs prone to transposon insertion. Overall, our findings argue against widespread functional circRNA conservation.
Data availability
Sequencing data have been deposited in GEO under accession code GSE162152.
- Suppl. Table 4. Mouse circRNA summary. Journal of Molecular and Cellular Cardiology, doi.org/10.1016/j.yjmcc.2016.07.007.
- Suppl. Table 5. Human circRNA summary. Journal of Molecular and Cellular Cardiology, doi.org/10.1016/j.yjmcc.2016.07.007.
- DNA replication time of the human genome G1 phase. Sequence Read Archive, SRA052697.
- Suppl. Table S2. Haploinsufficiency predictions without study bias. Nucleic Acids Research, https://doi.org/10.1093/nar/gkv474.
- The evolution of gene expression levels in mammalian organs. NCBI Gene Expression Omnibus, GSE30352.
Article and author information
Funding
Swiss Institute of Bioinformatics (SIB PhD Fellowship)
- Franziska Gruhl
Human Frontiers Science Program (LT000158/2013-L)
- Peggy Janich
European Research Council (242597,SexGenTransEvolution)
- Henrik Kaessmann
European Research Council (615253,OntoTransEvol)
- Henrik Kaessmann
Swiss National Science Foundation (NCCR RNA & Disease (141735,182880))
- David Gatfield
Swiss National Science Foundation (individual grant 179190)
- David Gatfield
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Ethics
Animal experimentation: Mouse samples were collected by the Kaessmann lab at the Center for Integrative Genomics in Lausanne. Rat samples were kindly provided by Carmen Sandi, EPFL, Lausanne. Opossum samples were kindly provided by Peter Giere, Museum für Naturkunde, Berlin. All animal procedures were performed in compliance with national and international ethical guidelines and regulations for the care and use of laboratory animals and were approved by the local animal welfare authorities (Vaud Cantonal Veterinary office, Berlin State Office of Health and Social Affairs). The rhesus macaque samples were provided by the Suzhou Experimental Animal Center (China); the Biomedical Research Ethics Committee of Shanghai Institutes for Biological Sciences reviewed the use and care of the animals in the research project (approval ID: ER-SIBS-260802P). All rhesus macaques used in this study suffered sudden deaths for reasons other than their participation in this study and without any relation to the organ sampled. The use of all samples for the work described in this study was approved by an ERC Ethics Screening panel (associated with H.K.'s ERC Consolidator Grant 615253, OntoTransEvol).
Human subjects: The human post-mortem samples were provided by the NICHD Brain and Tissue Bank for Developmental Disorders at the University of Maryland (USA). They originated from individuals with diverse causes of death that, given the information available, were not associated with the organ sampled. Written consent for the use of human tissues for research was obtained from all donors or their next of kin by this tissue bank. The use of these samples was approved by an ERC Ethics Screening panel (associated with H.K.'s ERC Consolidator Grant 615253, OntoTransEvol), and, in addition, by the local ethics committee in Lausanne (authorization 504/12).
Copyright
© 2021, Gruhl et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.