Transcriptomics: Revisiting the genomes of herpesviruses

  1. Bhupesh K Prusty (corresponding author)
  2. Adam W Whisnant
  1. Julius-Maximilians-Universität Würzburg, Germany
Insight
Cite this article as: eLife 2020;9:e54037 doi: 10.7554/eLife.54037

Abstract

Combining integrative genomics and systems biology approaches has revealed new and conserved features in the genome of human herpesvirus 6.

Main text

Herpesviruses cause a range of human diseases, but many factors complicate efforts to precisely map the size and origin of the RNA transcripts encoded by these pathogens. For example, some mRNAs can code for more than one protein, coding sequences may overlap with each other, and the genes that are expressed may change depending on the cell type or the stage of the viral cycle. Moreover, the level of expression can vary greatly from gene to gene, which makes it difficult to distinguish between rare viral transcripts and other genetic products that accumulate in infected cells and during viral replication. In fact, in most herpesviruses, the majority of the genome is transcribed to some degree, yet only the most highly expressed or genomically isolated units are readily detectable.

Several new techniques have allowed researchers to bypass these problems to better annotate the genomes of herpesviruses. A tailored RNA sequencing method called cRNA-Seq, which enriches for the 5’ ends of RNA transcripts, has allowed the mapping of transcription start sites; in parallel, ribosome profiling (Ribo-Seq) has helped to highlight translational start sites. Combined, these approaches have revealed dozens to hundreds of new genes in herpesviruses such as the human cytomegalovirus (Stern-Ginossar et al., 2012) and the Kaposi’s sarcoma-associated herpesvirus (Arias et al., 2014). When paired with long-read sequencing platforms (which provide additional information about the 3’ ends of transcripts), the new methods have also led to a better understanding of a number of pathogens in the herpes family. Now, in eLife, Noam Stern-Ginossar and colleagues at the Weizmann Institute of Science and the Hebrew University Hadassah Medical School – including Yaara Finkel as first author – report new insights into human herpesvirus 6A and 6B (Finkel et al., 2020).

The results help to correct and complement previous textbook genome annotations for herpesviruses. Due to the technical limitations of the time, the exact beginnings of many transcripts and coding sequences were assigned a priori, and inclusion in published gene lists relied on rather conservative criteria. For instance, a sequence was classified as an open reading frame (the part of a genetic sequence that can potentially be translated) if it encoded more than 100 amino acids and started with an AUG codon. In contrast, Finkel et al. demonstrate that roughly one-third of open reading frames in human herpesvirus 6A and 6B use alternative start codons, which are also used by eukaryotes and other herpesviruses (Kearse and Wilusz, 2017; Arias et al., 2014). Strains of human cytomegalovirus, for example, can have different start codons for a given gene, which may influence biological properties (Brondke et al., 2007); such questions can now be investigated in herpesvirus 6A and 6B.
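To make the difference between the two annotation rules concrete, the following is a minimal sketch of a forward-strand open reading frame scanner. It is a toy illustration and not the pipeline used by Finkel et al.: the function name `find_orfs`, the short example sequence and the small length thresholds are all hypothetical, and treating the nine codons that differ from AUG at one position as permitted alternative starts is an assumption made for the example.

```python
# Toy ORF scanner for illustration only; not the method of Finkel et al.
STOP_CODONS = {"UAA", "UAG", "UGA"}

# The nine codons that differ from AUG at exactly one position.
# Allowing all of them as start codons is an assumption of this sketch.
NEAR_COGNATE_STARTS = {
    "CUG", "GUG", "UUG",  # first position changed
    "ACG", "AAG", "AGG",  # second position changed
    "AUA", "AUC", "AUU",  # third position changed
}


def find_orfs(rna, start_codons, min_codons):
    """Return sorted (start, end, start_codon) tuples for forward-strand ORFs.

    Coordinates are 0-based and end-exclusive, and the span includes the
    stop codon. `min_codons` counts sense codons (the stop is excluded).
    In each frame, the first permitted start codon after the previous
    stop opens the ORF, so the longest candidate is reported.
    """
    rna = rna.upper().replace("T", "U")
    orfs = []
    for frame in range(3):
        open_start = None  # position of the currently open start codon
        for i in range(frame, len(rna) - 2, 3):
            codon = rna[i:i + 3]
            if open_start is None:
                if codon in start_codons:
                    open_start = i
            elif codon in STOP_CODONS:
                n_codons = (i - open_start) // 3
                if n_codons >= min_codons:
                    orfs.append((open_start, i + 3, rna[open_start:open_start + 3]))
                open_start = None
    return sorted(orfs)


# Hypothetical 29-nucleotide sequence with a short CUG-initiated ORF
# directly upstream of a longer AUG-initiated ORF in the same frame.
rna = "GGCUGAAACCCUAGAUGAAAUUUGGGUAA"

# Conservative rule: AUG only, with a length cut-off (4 codons here,
# standing in for the "more than 100 amino acids" criterion).
conservative = find_orfs(rna, {"AUG"}, min_codons=4)

# Relaxed rule: AUG plus near-cognate starts and a lower length cut-off.
relaxed = find_orfs(rna, {"AUG"} | NEAR_COGNATE_STARTS, min_codons=2)

print(conservative)  # [(14, 29, 'AUG')]
print(relaxed)       # [(2, 14, 'CUG'), (14, 29, 'AUG')]
```

Under the conservative rule only the AUG-initiated frame is reported, whereas the relaxed rule also recovers the short upstream frame. A purely sequence-based scan of this kind would produce many false candidates on a real genome, which is why experimental evidence such as Ribo-Seq is needed to decide which frames are actually translated.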

Another exciting finding is the identification of hundreds of short, internal or upstream open reading frames (Figure 1). The proteins encoded by many of these sequences are likely to be too small to have direct functions. However, some of these short open reading frames are close to (or overlap with) longer coding sequences, suggesting that they may regulate translation – particularly during the later stage of viral gene expression, when homeostasis in the host cells is most disrupted. Finkel et al. observed that several of these open reading frames are also transcribed in human cytomegalovirus, indicating important conserved roles across the family of viruses that herpesvirus 6A and 6B belong to.

Figure 1. Taking a closer look at the genomes of human herpesviruses 6.

Finkel et al. have used a combination of techniques to reannotate the genomes of human herpesviruses 6A and 6B. They have identified new open reading frames (268 in human herpesvirus 6A and 216 in human herpesvirus 6B) and corrected the annotation of existing frames (10 in human herpesvirus 6A and 11 in human herpesvirus 6B). The figure shows how an open reading frame called U30, which codes for an important protein in both human herpesvirus 6A and 6B, was reannotated. Data from Ribo-Seq (orange) revealed that the start of the open reading frame was downstream of what was expected based on the previous annotation (black) or cRNA-Seq information (blue), leading to a new, more accurate annotation for this sequence (green).

Combining several methods that can pinpoint both translational and transcriptional start sites – as Finkel et al. did – is particularly important because modern sequencing protocols are sensitive enough to identify rare transcription events, but they cannot distinguish between ‘real’ transcriptional units and biological artifacts. Whole-genome conclusions based on one technique or method of analysis are heavily influenced by experimental noise, technical limitations and even the specific algorithm used to interpret the data. For instance, estimates of the exact number of transcriptional start sites in human cytomegalovirus vary by thousands between studies that use different methods (Stern-Ginossar et al., 2012; Parida et al., 2019); in herpes simplex virus, these numbers can vary by over six-fold (Tombácz et al., 2019; Depledge et al., 2019).
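One simple way to picture this kind of cross-validation is to keep only those candidate start sites from one assay that have a nearby call in an independent assay. The sketch below assumes each method yields a plain list of genomic coordinates on a single strand; the function name `supported_sites`, the coordinates and the 5-nucleotide tolerance are hypothetical and are not taken from any of the studies cited here.

```python
from bisect import bisect_left


def supported_sites(primary, secondary, window):
    """Return positions in `primary` with a `secondary` call within `window` nt.

    Both inputs are iterables of integer coordinates on the same strand.
    """
    secondary = sorted(secondary)
    kept = []
    for pos in sorted(primary):
        # Index of the first secondary call at or after (pos - window).
        i = bisect_left(secondary, pos - window)
        # That call supports `pos` only if it also lies at or before (pos + window).
        if i < len(secondary) and secondary[i] <= pos + window:
            kept.append(pos)
    return kept


# Hypothetical start-site calls from two independent methods.
method_a_calls = [100, 250, 900, 1500]
method_b_calls = [103, 600, 1498]

consensus = supported_sites(method_a_calls, method_b_calls, window=5)
print(consensus)  # [100, 1500]
```

The choice of tolerance directly changes how many sites survive, which is one reason counts can differ so widely between studies that use different methods and analysis settings.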

While our appreciation of the coding capacity of pathogens increases, efforts must be made to integrate newly identified gene products into already established nomenclatures. The first waves of new annotations using high-throughput techniques will probably be revised as sequencing technology and analysis techniques improve, and as the results are validated in the lab. In particular, new algorithms that can better distinguish signal from noise could help to identify hundreds of additional peptides in a second revision of the human cytomegalovirus genome (Erhard et al., 2018). As our ability to sequence deeper develops, multifaceted studies such as the one by Finkel et al. will provide an excellent framework to help distinguish between rare functional events and technical noise when re-examining herpesvirus genome annotations.

Article and author information

Author details

  1. Bhupesh K Prusty

    Bhupesh K Prusty is in the Institute for Virology and Immunobiology, Julius-Maximilians-Universität Würzburg, Würzburg, Germany

    For correspondence
    bhupesh.prusty@biozentrum.uni-wuerzburg.de
    Competing interests
    No competing interests declared
    ORCID: 0000-0001-7051-4670
  2. Adam W Whisnant

    Adam W Whisnant is in the Institute for Virology and Immunobiology, Julius-Maximilians-Universität Würzburg, Würzburg, Germany

    Competing interests
    No competing interests declared
    ORCID: 0000-0003-2039-2809

Publication history

  1. Version of Record published: January 16, 2020 (version 1)

Copyright

© 2020, Prusty and Whisnant

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
