Transcriptomics: Revisiting the genomes of herpesviruses
Herpesviruses cause a range of human diseases but many factors complicate the efforts made to precisely map the size and origin of RNA transcripts coded by these pathogens. For example, some mRNAs can code for more than one protein, coding sequences may overlap with each other, and the genes that are expressed may change depending on cell types or stages in the viral cycle. Moreover, the level of expression can greatly vary from gene to gene, which makes it difficult to distinguish between rare viral transcripts and other genetic products that accumulate in infected cells and during viral replication. In fact, in most herpesviruses, the majority of the genome is transcribed to some degree, yet only the most highly expressed or genomically isolated units are readily detectable.
Several new techniques have allowed researchers to bypass these problems to better annotate the genomes of herpesviruses. A tailored RNA sequencing method called cRNA-Seq, which enriches for the 5’ ends of RNA transcripts, has allowed the mapping of transcription start sites; in parallel, ribosome profiling (Ribo-Seq) has helped to highlight translational start sites. Combined, these approaches have revealed dozens to hundreds of new genes in herpesviruses such as the human cytomegalovirus (Stern-Ginossar et al., 2012) and the Kaposi’s sarcoma-associated herpesvirus (Arias et al., 2014). When paired with long-read sequencing platforms (which provide additional information about the 3’ ends of transcripts), the new methods have also led to a better understanding of a number of pathogens in the herpes family. Now, in eLife, Noam Stern-Ginossar and colleagues at the Weizmann Institute of Science and the Hebrew University Hadassah Medical School – including Yaara Finkel as first author – report new insights into human herpesvirus 6A and 6B (Finkel et al., 2020).
The results help to correct and complement previous textbook genome annotations for herpesviruses. Due to the technical limitations of the time, the exact beginnings of many transcripts and coding sequences were assigned a priori, and inclusion into published gene lists relied on rather conservative criteria. For instance, a sequence was classified as an open reading frame (the part of a genetic sequence that can potentially be translated) if it had more than 100 amino acids and started with an AUG codon. Instead, Finkel et al. demonstrate that roughly one-third of open reading frames in human herpesvirus 6A and 6B contain alternative start codons, which are also used by eukaryotes and other herpesviruses (Kearse and Wilusz, 2017; Arias et al., 2014). For instance, strains of human cytomegalovirus can have different start codons for a given gene, which may influence biological properties (Brondke et al., 2007); such questions can now be investigated in herpesvirus 6A and 6B .
Another exciting finding is the identification of hundreds of short, internal or upstream open reading frames (Figure 1). The proteins encoded by many of these sequences are likely to be too small to have direct functions. However, some of these short open reading frames are close to (or overlap with) longer coding sequences, suggesting that they may regulate translation – particularly during the later stage of viral gene expression, when homeostasis in the host cells is most disrupted. Finkel et al. observed that several of these open reading frames are also transcribed in human cytomegalovirus, indicating important conserved roles across the family of viruses that herpesvirus 6A and 6B belong to.
Combining several methods that can pinpoint both translational and transcriptional start sites – as Finkel et al. did – is particularly important because modern sequencing protocols are sensitive enough to identify rare transcription events, but they cannot distinguish between ‘real’ transcriptional units and biological artifacts. Whole-genome conclusions based on one technique or method of analysis are heavily influenced by experimental noise, technical limitations and even the specific algorithm used to interpret the data. For instance, estimates of the exact number of transcriptional start sites in human cytomegalovirus vary by thousands between studies that use different methods (Stern-Ginossar et al., 2012; Parida et al., 2019); in herpes simplex virus, these numbers can vary by over six-fold (Tombácz et al., 2019; Depledge et al., 2019).
While our appreciation of the coding capacity of pathogens increases, efforts must be made to integrate newly identified gene products into already established nomenclatures. The first waves of new annotations using high-throughput techniques will probably be revised as sequencing technology and analysis techniques improve, and the results are validated in the lab. In particular, new algorithms that can better distinguish signal-to-noise values could help to identify hundreds of additional peptides in a second revision of the human cytomegalovirus genome (Erhard et al., 2018). As our ability to sequence deeper develops, multifaceted studies such as the one by Finkel et al. will provide an excellent framework to help distinguish between rare functional events and technical noise when re-examining herpesvirus genome annotations.
References
-
Non-AUG translation: a new start for protein synthesis in eukaryotesGenes & Development 31:1717–1731.https://doi.org/10.1101/gad.305250.117
Article and author information
Author details
Publication history
- Version of Record published: January 16, 2020 (version 1)
Copyright
© 2020, Prusty and Whisnant
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 925
- views
-
- 113
- downloads
-
- 3
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Epidemiology and Global Health
- Microbiology and Infectious Disease
Background:
Few national-level studies have evaluated the impact of ‘hybrid’ immunity (vaccination coupled with recovery from infection) from the Omicron variants of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).
Methods:
From May 2020 to December 2022, we conducted serial assessments (each of ~4000–9000 adults) examining SARS-CoV-2 antibodies within a mostly representative Canadian cohort drawn from a national online polling platform. Adults, most of whom were vaccinated, reported viral test-confirmed infections and mailed self-collected dried blood spots (DBSs) to a central lab. Samples underwent highly sensitive and specific antibody assays to spike and nucleocapsid protein antigens, the latter triggered only by infection. We estimated cumulative SARS-CoV-2 incidence prior to the Omicron period and during the BA.1/1.1 and BA.2/5 waves. We assessed changes in antibody levels and in age-specific active immunity levels.
Results:
Spike levels were higher in infected than in uninfected adults, regardless of vaccination doses. Among adults vaccinated at least thrice and infected more than 6 months earlier, spike levels fell notably and continuously for the 9-month post-vaccination. In contrast, among adults infected within 6 months, spike levels declined gradually. Declines were similar by sex, age group, and ethnicity. Recent vaccination attenuated declines in spike levels from older infections. In a convenience sample, spike antibody and cellular responses were correlated. Near the end of 2022, about 35% of adults above age 60 had their last vaccine dose more than 6 months ago, and about 25% remained uninfected. The cumulative incidence of SARS-CoV-2 infection rose from 13% (95% confidence interval 11–14%) before omicron to 78% (76–80%) by December 2022, equating to 25 million infected adults cumulatively. However, the coronavirus disease 2019 (COVID-19) weekly death rate during the BA.2/5 waves was less than half of that during the BA.1/1.1 wave, implying a protective role for hybrid immunity.
Conclusions:
Strategies to maintain population-level hybrid immunity require up-to-date vaccination coverage, including among those recovering from infection. Population-based, self-collected DBSs are a practicable biological surveillance platform.
Funding:
Funding was provided by the COVID-19 Immunity Task Force, Canadian Institutes of Health Research, Pfizer Global Medical Grants, and St. Michael’s Hospital Foundation. PJ and ACG are funded by the Canada Research Chairs Program.
-
- Microbiology and Infectious Disease
Antimicrobial resistance (AMR) poses a significant threat to human health. Although vaccines have been developed to combat AMR, it has proven challenging to associate specific vaccine antigens with AMR. Bacterial plasmids play a crucial role in the transmission of AMR. Our recent research has identified a group of bacterial plasmids (specifically, IncHI plasmids) that encode large molecular mass proteins containing bacterial immunoglobulin-like domains. These proteins are found on the external surface of the bacterial cells, such as in the flagella or conjugative pili. In this study, we show that these proteins are antigenic and can protect mice from infection caused by an AMR Salmonella strain harboring one of these plasmids. Furthermore, we successfully generated nanobodies targeting these proteins, that were shown to interfere with the conjugative transfer of IncHI plasmids. Considering that these proteins are also encoded in other groups of plasmids, such as IncA/C and IncP2, targeting them could be a valuable strategy in combating AMR infections caused by bacteria harboring different groups of AMR plasmids. Since the selected antigens are directly linked to AMR itself, the protective effect extends beyond specific microorganisms to include all those carrying the corresponding resistance plasmids.