Transmission Histories: Traversing missing links in the spread of HIV

Combining clinical and genetic data can improve the effectiveness of virus tracking with the aim of reducing the number of HIV cases by 2030.
  1. Erin Brintnell
  2. Art Poon  Is a corresponding author
  1. Department of Pathology and Laboratory Medicine, Western University, Canada
  2. Department of Computer Science, Western University, Canada

The human immunodeficiency virus type 1 (HIV-1), which can lead to acquired immune deficiency syndrome (AIDS), remains a leading cause of death and a health threat worldwide, with over 38.4 million people currently living with the virus. Global health sector strategies strive to end HIV-1 epidemics by 2030 (Duncombe et al., 2019). This requires significant investment in resources to treat and prevent the disease, such as reducing the number of people who do not know they are carrying the virus and improving the availability and affordability of effective treatments.

In cities that have scaled up HIV-1 treatment and prevention, it will be crucial to establish whether new HIV-1 infections are due to ongoing local transmission or to infections acquired abroad. This means reconstructing the spread of HIV-1 between individuals through contact tracing: interviewing people recently diagnosed with HIV, and locating and notifying their intimate partners. However, contact tracing is both time-consuming and intrusive (El-Sadr et al., 2022).

A cost-effective alternative to contact tracing is to compare the genomic sequences of HIV-1 samples from different patients, which are often collected to screen for mutations that confer drug resistance. Infections that are genetically similar are more likely to be related through recent transmissions. This is especially true for HIV-1, a rapidly evolving virus that becomes genetically unique within months of an infection (Williamson, 2003).

These genetic sequences can be used to build a tree that represents the shared evolutionary history of the infections and approximates the history of recent transmissions (De Maio et al., 2018; Romero-Severson et al., 2014). Furthermore, the spread of infections from one place to another can be extrapolated by reconstructing locations of ‘ancestral’ infections at deeper nodes of the tree from the known locations at the tips (Faria et al., 2011). The accuracy of these estimates, however, is impeded by the unknown number of people with undiagnosed infections, or with diagnosed infections that have not been sequenced (Didelot et al., 2017). In addition, reconstructing transmission patterns from HIV-1 sequences comes with its own ethical challenges because HIV-1 transmission is criminalized in many countries (Dawson et al., 2020).

Now, in eLife, Oliver Ratmann at the HIV Transmission Elimination Amsterdam Consortium and colleagues – including Alexandra Blenkinsop as first author – report an innovative approach to overcome the disadvantages of sequence analysis (Blenkinsop et al., 2022). Blenkinsop et al. combined different data sources to reconstruct the transmission histories of HIV-1 in Amsterdam, which has the highest concentration of HIV-1 cases in the Netherlands. Amsterdam is also part of the Fast-Track city network, which provides funds to expand effective HIV prevention, testing and treatment services.

Blenkinsop et al. extended the standard approach of reconstructing transmission histories from HIV-1 sequences by incorporating additional information from clinical biomarkers (biological indicators of disease progression or response to treatment) and other patient data (Figure 1). A statistical model was fitted to two biomarkers: the number of HIV-1 particles circulating in the blood (the viral load) and the number of white blood cells targeted by HIV-1. Based on how these biomarkers changed over time, it was possible to estimate the length of time between a person’s HIV-1 infection and diagnosis. These estimates were then used to infer how many cases were transmitted from people with unsequenced infections, adjusting for factors like route of transmission (e.g., injection drug use), age group, and place of birth.

Estimating the number of unsampled HIV-1 infections.

The top panel illustrates how a chain of HIV-1 infections may be partially sampled over time. The top dashed line shows an infection (represented by the virus particle symbol) that is transmitted (red arrow) before it is sequenced (DNA symbol), with the time between the infection occurring and sequencing taking place indicated by the two-headed arrow. The dashed line in the centre shows an infection resulting from transmission from the first infection, which is transmitted (red arrow) but never sequenced. The dashed line on the bottom represents a third infection resulting from the second infection, that is sequenced (DNA symbol) more quickly than the original infection. The bottom panel depicts two phylogenetic trees. The first tree (green) is inferred from the available sequences (in this case, the two infections sequenced in the top panel). By fitting a statistical model to HIV-1 cases with estimated dates of infection and clinical data, the number of unsampled infections (‘missing links’) in the new tree (red) can be extrapolated for different populations.

Despite extensive measures to curb the transmission of HIV in Amsterdam, results from Blenkinsop et al. suggest that many HIV-1 infections have remained undiagnosed for a long time, especially among heterosexual residents and recent arrivals from sub-Saharan Africa. Further, they provide evidence of ongoing HIV-1 transmission within the city over the duration of the five-year study. These results suggest that, while Amsterdam has made significant progress in reducing the spread of HIV-1, closing the final gap to end the local epidemic by 2030 remains a challenge.

The study also highlights the importance of linking HIV-1 sequences to both clinical and demographic information to determine which groups have been neglected by the generalized scale-up of public health testing and treatment. This may also be a critical step for other cities in the FastTrack initiative. Furthermore, the work of Blenkinsop et al. mirrors ongoing challenges in tracking and controlling other infectious diseases like COVID-19, which is characterised by an abundance of viral genome sequences but a lack of linked contextual information, including clinical outcomes, travel histories and sampling strategies (Chiara et al., 2021; Chen et al., 2022).


Article and author information

Author details

  1. Erin Brintnell

    Erin Brintnell is in the Department of Pathology and Laboratory Medicine, Western University, London, Canada

    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-5042-7799
  2. Art Poon

    Art Poon in the Department of Pathology and Laboratory Medicine, the Department of Microbiology and Immunology, and the Department of Computer Science, Western University, London, Canada

    For correspondence
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-3779-154X

Publication history

  1. Version of Record published: September 30, 2022 (version 1)


© 2022, Brintnell and Poon

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.


  • 289
  • 41
  • 0

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Erin Brintnell
  2. Art Poon
Transmission Histories: Traversing missing links in the spread of HIV
eLife 11:e82610.

Further reading

    1. Computational and Systems Biology
    2. Epidemiology and Global Health
    Javier I Ottaviani, Virag Sagi-Kiss ... Gunter GC Kuhnle
    Research Article

    The chemical composition of foods is complex, variable, and dependent on many factors. This has a major impact on nutrition research as it foundationally affects our ability to adequately assess the actual intake of nutrients and other compounds. In spite of this, accurate data on nutrient intake are key for investigating the associations and causal relationships between intake, health, and disease risk at the service of developing evidence-based dietary guidance that enables improvements in population health. Here, we exemplify the importance of this challenge by investigating the impact of food content variability on nutrition research using three bioactives as model: flavan-3-ols, (–)-epicatechin, and nitrate. Our results show that common approaches aimed at addressing the high compositional variability of even the same foods impede the accurate assessment of nutrient intake generally. This suggests that the results of many nutrition studies using food composition data are potentially unreliable and carry greater limitations than commonly appreciated, consequently resulting in dietary recommendations with significant limitations and unreliable impact on public health. Thus, current challenges related to nutrient intake assessments need to be addressed and mitigated by the development of improved dietary assessment methods involving the use of nutritional biomarkers.

    1. Epidemiology and Global Health
    Caroline Krag, Maria Saur Svane ... Tinne Laurberg
    Research Article


    Comorbidity with type 2 diabetes (T2D) results in worsening of cancer-specific and overall prognosis in colorectal cancer (CRC) patients. The treatment of CRC per se may be diabetogenic. We assessed the impact of different types of surgical cancer resections and oncological treatment on risk of T2D development in CRC patients.


    We developed a population-based cohort study including all Danish CRC patients, who had undergone CRC surgery between 2001 and 2018. Using nationwide register data, we identified and followed patients from date of surgery and until new onset of T2D, death, or end of follow-up.


    In total, 46,373 CRC patients were included and divided into six groups according to type of surgical resection: 10,566 Right-No-Chemo (23%), 4645 Right-Chemo (10%), 10,151 Left-No-Chemo (22%), 5257 Left-Chemo (11%), 9618 Rectal-No-Chemo (21%), and 6136 Rectal-Chemo (13%). During 245,466 person-years of follow-up, 2556 patients developed T2D. The incidence rate (IR) of T2D was highest in the Left-Chemo group 11.3 (95% CI: 10.4–12.2) per 1000 person-years and lowest in the Rectal-No-Chemo group 9.6 (95% CI: 8.8–10.4). Between-group unadjusted hazard ratio (HR) of developing T2D was similar and non-significant. In the adjusted analysis, Rectal-No-Chemo was associated with lower T2D risk (HR 0.86 [95% CI 0.75–0.98]) compared to Right-No-Chemo.

    For all six groups, an increased level of body mass index (BMI) resulted in a nearly twofold increased risk of developing T2D.


    This study suggests that postoperative T2D screening should be prioritised in CRC survivors with overweight/obesity regardless of type of CRC treatment applied.


    The Novo Nordisk Foundation (NNF17SA0031406); TrygFonden (101390; 20045; 125132).