Short Report

Neolithic and medieval virus genomes reveal complex evolution of hepatitis B

Kiel University, Germany
Max Planck Institute for the Science of Human History, Germany
University Hospital Zurich, Switzerland
Broad Institute, United States
Massachusetts General Hospital, United States
Broad Institute of MIT & Harvard, United States
Heidelberg University, Germany
University of Bern, Switzerland
State Office for Heritage Management and Archaeology Saxony-Anhalt, State Museum of Prehistory, Germany
Danube Private University, Austria
University Hospital Basel, University of Basel, Switzerland
University of Basel, Switzerland
University Hospital Schleswig-Holstein, Germany

May 10, 2018

Open access
Copyright information

Abstract
Introduction
Results and discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

The hepatitis B virus (HBV) is one of the most widespread human pathogens known today, yet its origin and evolutionary history are still unclear and controversial. Here, we report the analysis of three ancient HBV genomes recovered from human skeletons found at three different archaeological sites in Germany. We reconstructed two Neolithic and one medieval HBV genome by de novo assembly from shotgun DNA sequencing data. Additionally, we observed HBV-specific peptides using paleo-proteomics. Our results demonstrated that HBV has circulated in the European population for at least 7000 years. The Neolithic HBV genomes show a high genomic similarity to each other. In a phylogenetic network, they do not group with any human-associated HBV genome and are most closely related to those infecting African non-human primates. The ancient viruses appear to represent distinct lineages that have no close relatives today and possibly went extinct. Our results reveal the great potential of ancient DNA from human skeletons in order to study the long-time evolution of blood borne viruses.

https://doi.org/10.7554/eLife.36666.001

Introduction

The hepatitis B virus (HBV) is one of the most widespread human pathogens, with worldwide over 250 million people being infected, and an annual death toll of about 1 million globally (WHO, 2017). Infection of liver cells with HBV leads to acute hepatitis B, which is self-limiting in about 90–95% of cases. In about 5–10% of infected individuals virus clearance fails and patients develop chronic infection of hepatitis B, which puts them at lifelong elevated risk for liver cirrhosis and liver cancer (hepatocellular carcinoma). HBV is usually transmitted by contact with infectious blood, in highly endemic countries often during birth (WHO, 2017).

HBV has a circular, partially double-stranded DNA genome of about 3.2kbp that encodes four overlapping open reading frames (P, pre-S/S, pre-C/C, and X). Based on the genomic sequence diversity, HBVs are currently classified into eight genotypes (A-H) and numerous subgenotypes that show distinct geographic distributions (Castelhano et al., 2017). All genotypes are hypothesised to be primarily the result of recombination events (Littlejohn et al., 2016; Simmonds and Midgley, 2005). To a lesser extent, HBV evolution is also driven by the accumulation of point mutations (Schaefer, 2007; Araujo, 2015).

Despite being widespread and well-studied, the origin and evolutionary history of HBV are still unclear and controversial (Littlejohn et al., 2016; Souza et al., 2014). HBVs in non-human primates (NHP), for instance in chimpanzees and gorillas, are phylogenetically closely related to, and yet distinct from, human HBV isolates, supporting the notion of an Africa origin of the virus (Souza et al., 2014). Molecular-clock-based analyses dating the origin of HBV have resulted in conflicting estimates with some as recent as about 400 years ago (Zhou and Holmes, 2007; Souza et al., 2014). These observations have raised doubts about the suitability of molecular dating approaches for reconstructing the evolution of HBV (Bouckaert et al., 2013 , Souza et al., 2014). Moreover, ancient DNA (aDNA) research on HBV-infected mummies from the 16^th century AD revealed a very close relationship between the ancient and modern HBV genomes (Kahila Bar-Gal et al., 2012; Patterson Ross et al., 2018), indicating a surprising lack of temporal genetic changes in the virus during the last 500 years (Patterson Ross et al., 2018). Therefore, diachronic aDNA HBV studies are necessary, in which both the changes in the viral genome over time as well as the provenance and age of the archaeological samples are investigated, to better understand the origin and evolutionary history of the virus.

Here, we report the analysis of three complete HBV genomes recovered from human skeletal remains from the prehistoric Neolithic and Medieval Periods in Central Europe. Our results show that HBV already circulated in the European population more than 7000 years ago. Although the ancient forms show a relationship to modern isolates they appear to represent distinct lineages that have no close modern relatives and are possibly extinct today.

Results and discussion

We detected evidence for presence of ancient HBV in three human tooth samples as part of a metagenomic screening for viral pathogens that was performed on shotgun sequencing data from 53 skeletons using the metagenomic alignment software MALT (Vågene et al., 2018). The remains of the individuals were excavated from the Neolithic sites of Karsdorf (Linearbandkeramik [LBK], 5056–4959 cal BC) and Sorsum (Tiefstichkeramik group of the Funnel Beaker culture, 3335–3107 cal BC) as well as from the medieval cemetery of Petersberg/Kleiner Madron (1020–1116 cal AD), all located in Germany (Figure 1, Figure 1—figure supplements 1–3). After the three aDNA extracts had appeared HBV-positive in the initial virus screening, they were subjected to deep-sequencing without any prior enrichment resulting in 367 to 419 million reads per sample (Table 1). A principal component analysis (PCA) of the human DNA recovered from Karsdorf (3-fold genomic coverage) revealed that the sample clusters tightly with other contemporary early Neolithic individuals from the LBK (Figure 1—figure supplement 4). The genetic makeup of the early LBK agriculturalists was previously found quite distinct from the preceding western hunter-gatherers of Europe. The genetic shift between both populations was interpreted as a result of early farmers migrating from Western Anatolia into Central Europe introducing agriculture (Lazaridis et al., 2014; Haak et al., 2015). The almost 2000 years younger Sorsum individual (1.2-fold genomic coverage) clusters in the PCA most closely with individuals from the contemporary Funnel Beaker culture that inhabited Northern Germany at the end of the fourth millennium BCE (Figure 1—figure supplement 4). This population was previously shown to be quite admixed, as a result of a spatial and temporal overlap of early Neolithic farmers and remaining western hunter-gatherers for almost 2000 years (Bollongino et al., 2013; Haak et al., 2015). The Petersberg individual (2.9-fold genomic coverage), however, showed genetic affinities in the PCA with modern day central European populations. All three ancient human individuals are therefore in agreement with the archeological evidence and radiocarbon dates for their respective time of origin. Together with typical aDNA damage patterns (Figure 1—figure supplements 5–6), the human population genetic investigation supports the ancient origin of the obtained datasets.

Figure 1 with 9 supplements see all

Download asset Open asset

Origin of samples.

Geographic location of the samples from which ancient HBV genomes were isolated. Radiocarbon dates of the specimens is given in two sigma range. Icons indicate the sample material (tooth or mummy). HBV genomes obtained in this study are indicated by black frame.

https://doi.org/10.7554/eLife.36666.002

Table 1

Results of the genome reconstruction

https://doi.org/10.7554/eLife.36666.012

	*Merged reads	Length of HBV consensus sequence	Mean HBV coverage	Gaps in the consensus sequence at nt position	*Mapped reads HBV	*Mapped reads human	Mean human coverage	Human genomes/HBVgenomes
Karsdorf	386,780,892	3183	104X	2157–2175; 3107–3128; 3133–3183	10,718	122,568,310	2.96X	1: 35.1
Sorsum	367,574,767	3182	47X	-	3249	9,856,001	1.17X	1: 40.2
Petersberg	419,413,082	3161	46X	880–1000; 1232–1329; 1331–1415; 1420–1581; 1585–1598	2125	105,476,677	2.88X	1: 16

*number.

nt, nucleotide.

For successful HBV genome reconstruction, we mapped all metagenomic sequences to 16 HBV reference genomes (eight human genotypes (A-H) and 8 NHPs from Africa and Asia) that are representative of the current HBV strain diversity (Supplementary file 6). The mapped reads were used for a de novo assembly, resulting in contigs from which one ancient HBV consensus sequence per sample was constructed. The consensus genomes are 3161 (46-fold coverage), 3182 (47-fold coverage), and 3183 (104-fold coverage) nucleotides in length, which falls in the length range of modern HBV genomes and suggests that we successfully reconstructed the entire ancient HBV genomes (Table 1, Figure 2—figure supplements 1–3). Further, when we conducted liquid chromatography-mass spectrometry (LC-MS) based bottom-up proteomics on tooth material from the three individuals, we identified in the Karsdorf and Petersberg samples a peptide that is part of the very stable HBV core protein, supporting the presence and active replication of HBV in the individuals’ blood (Figure 1—figure supplement 7).

Phylogenetic network analysis was carried out with a dataset comprised of 493 modern HBV strains representing the full genetic diversity. Strikingly, the Neolithic HBV genomes did not group with any human strain in the phylogeny. Instead, they branched off in two lineages and were most closely related to the African NHP genomes (Figure 2, 93% similarity). Although the two Neolithic strains were recovered from humans who had lived about 2000 years apart, they showed a higher genomic similarity to each other than to any other human or NHP genotype. Still, their genomes differed by 6% from each other and may therefore be considered representatives of two separate lineages. They did, however, differ less than 8% from the African NHP strains and should therefore not be called a separate genotype (Figure 2—figure supplement 4). The genome from the 1000-year-old Petersberg individual clustered with modern D4 genotypes.

Figure 2 with 9 supplements see all

Download asset Open asset

Network.

Network of 493 modern, two published ancient genomes (light yellow box), and three ancient hepatitis B virus (HBV) obtained in this study (grey box). Colors indicate the eight human HBV genotypes (**A–H**), two monkey genotypes (Monkeys I, African apes and Monkeys II, Asian monkeys) and ancient genomes (red).

https://doi.org/10.7554/eLife.36666.013

Figure 2—source data 1 Results of the recombination analysis using the methods RDP, GENECOV, Chimera, MaxChi, BootScan, SiScan, 3Seq within the RDP v4 software package with all modern full reference genomes (n = 493) and five ancient genomes.: https://doi.org/10.7554/eLife.36666.023
Download elife-36666-fig2-data1-v2.rdp
Figure 2—source data 2 Multiple sequence alignment of the 493 representative and five ancient HBV genomes. The multiple sequence alignment was stripped of any sites that had gaps in more than 95%.: https://doi.org/10.7554/eLife.36666.024
Download elife-36666-fig2-data2-v2.fasta
Figure 2—source data 3 Maximum-likelihood tree based on the multiple sequence alignment of the 493 representative and five ancient HBV genomes with 2000 replicates.: https://doi.org/10.7554/eLife.36666.025
Download elife-36666-fig2-data3-v2.txt
Figure 2—source data 4 Neighbour-Joining tree based on the multiple sequence alignment of the 493 representative modern and five ancient HBV genomes with 10000 replicates.: https://doi.org/10.7554/eLife.36666.026
Download elife-36666-fig2-data4-v2.txt

Owing to continuous recombination over time, different gene segments or modules of the ancestral genomes can show up in various subsequent virus generations. Such precursors have been postulated (Simmonds and Midgley, 2005) and their existence is supported by the results of our recombination analysis (Figure 2—figure supplements 5–8, Figure 2—source data 1). Some fragments of the Karsdorf sequences appeared to be very similar to modern human (G, E) and African NHP genotypes, and the Sorsum genome partially showed a high similarity to the human genotypes G, E and B. (Figure 2—figure supplements 5–8, Figure 2—source data 1). Given the close relationship between the two Neolithic virus genomes, it is also conceivable that the older HBV from Karsdorf could have been a distant source for the younger Sorsum virus (Figure 2—figure supplements 5–8, Figure 2—source data 1). The closer relationship between the Neolithic and the NHP strains compared to other human strains is noteworthy and may have involved reciprocal cross-species transmission at one or possibly several times in the past (Simmonds and Midgley, 2005; Souza et al., 2014; Rasche et al., 2016).

Taken together, our results demonstrate that HBV already existed in Europeans 7000 years ago and that its genomic structure closely resembled that of modern hepatitis B viruses. Both Neolithic viruses fall between the present-day modern human and the known NHP diversity. Therefore, it can be hypothesized that although the two Neolithic HBV strains are no longer observed today and thus may reflect two distinct clades that went extinct, they could still be closely related to the remote ancestors of the present-day genotypes, which is supported by signs of ancient recombination events. More ancient precursors, intermediates and modern strains of both humans and NHPs need to be sequenced to disentangle the complex evolution of HBV. As this evolution is characterized by recombination and point mutations and may further be complicated by human-ape host barrier crossing (Simmonds and Midgley, 2005; Souza et al., 2014; Rasche et al., 2016), genetic dating is not expected to yield meaningful results. This is additionally supported by a TempEst analysis (Rambaut et al., 2016) that shows very little temporal signal (Figure 2—figure supplement 9). It should, however, be noted that the oldest genome (Karsdorf) was found in an individual that belonged to a population of early farmers that had migrated in the previous few hundred years from the Near East into central Europe. One might speculate that the close proximity to recently domesticated animals, changes in subsistence strategy as well as the adopted sedentary lifestyle might have contributed to the spread of HBV within Neolithic human populations.

Based on our analysis, HBV DNA can reliably be detected in tooth samples that are up to 7000 years old. Ancient HBV has so far only been identified in soft tissue from two 16^th-century mummies (Kahila Bar-Gal et al., 2012; Patterson Ross et al., 2018). The aDNA analysis of HBV from prehistoric skeletons, which facilitates evolutionary studies on a far-reaching temporal scale, has not been described up to now. One explanation for the difficulty of a molecular HBV diagnosis in bones is that the virus infection does not leave lesions on skeletal remains that would allow researchers to select affected individuals a priori, as it is the case for instance for leprosy (Schuenemann et al., 2013). The diagnosis of an HBV infection in skeletal populations is purely a chance finding and is thus more probable in a large-scale screening.

Overall, HBV biomolecules seem to be well preserved in teeth: Avoiding biases from DNA capture and reference-based mapping we could reconstruct three HBV genomes by de novo assembly from shotgun data and even observed HBV-specific peptides. The ratio of HBV genomes to the human genome in our samples was rather high and similar in all three samples (Karsdorf 35:1, Sorsum 40.2:1 and Petersberg 16:1). As there is no evidence that HBV DNA is more resistant to postmortem degradation than human DNA, the high rate of HBV compared to human DNA may reflect the disease state in the infected individuals at the time of death. High copy numbers of viral DNA in the blood of infected individuals are associated with acute HBV infection, or reactivation of chronic HBV. Thus, it seems likely that the death of the ancient individuals is related to the HBV infection, but might not be the direct cause of death as fulminant liver failure is rather rare in modern day patients. The HBV infection might have instead contributed to other forms of lethal liver failure such as cirrhosis or liver cancer.

In view of the unexpected complexity of our findings, we envisage future diachronic HBV studies that go beyond the temporal and geographic scope of our current work.

Share this article

Cite this article

Origin of samples.

Results of the genome reconstruction

Network.

Figure 2—source data 1

Figure 2—source data 2

Figure 2—source data 3

Figure 2—source data 4

Author details

Ben Krause-Kyora

Contribution

Contributed equally with

For correspondence

Competing interests

Julian Susat

Contribution

Contributed equally with

Competing interests

Felix M Key

Contribution

Competing interests

Denise Kühnert

Contribution

Competing interests

Esther Bosse

Contribution

Competing interests

Alexander Immel

Contribution

Competing interests

Christoph Rinne

Contribution

Competing interests

Sabin-Christin Kornell

Contribution

Competing interests

Diego Yepes

Contribution

Competing interests

Sören Franzenburg

Contribution

Competing interests

Henrike O Heyne

Contribution

Competing interests

Thomas Meier

Contribution

Competing interests

Sandra Lösch

Contribution

Competing interests

Harald Meller

Contribution

Competing interests

Susanne Friederich

Contribution

Competing interests

Nicole Nicklisch

Contribution

Competing interests

Kurt W Alt

Contribution

Competing interests

Stefan Schreiber

Contribution

Competing interests

Andreas Tholey

Contribution

Competing interests

Alexander Herbig

Contribution

Competing interests

Almut Nebel

Contribution

Competing interests

Johannes Krause

Contribution

For correspondence

Competing interests