More effective drugs lead to harder selective sweeps in the evolution of drug resistance in HIV-1

  1. Alison F Feder  Is a corresponding author
  2. Soo-Yon Rhee
  3. Susan P Holmes
  4. Robert W Shafer
  5. Dmitri A Petrov
  6. Pleuni S Pennings
  1. Stanford University, United States
  2. San Francisco State University, United States
5 figures, 2 tables and 3 additional files

Figures

Prediction of drug resistance acquisition with more and less effective treatments.

Among patients treated with more effective treatments (top), we predict HIV populations to have a lower probability of acquiring resistance per generation. As a result, the population must wait a long time for a beneficial genotype, so when resistance does occur, it will spread through the population in a hard selective sweep before other resistant genotypes emerge (A). Since resistance only occurs on a single genetic background (background mutations in grey), all sequences with resistance will be similar (B) and diversity following this type of selective sweep will be reduced (C). We can use the reduction of diversity to determine that a selective sweep is hard. In patients treated with less effective treatments (bottom), we predict HIV populations should have a higher probability of acquiring resistance per generation, so resistance will be acquired more quickly and selective sweeps of drug resistance mutations will be soft (D). We can detect these soft selective sweeps, because diversity remains high when resistance mutations on different genetic backgrounds rise in frequency simultaneously (E,F).

https://doi.org/10.7554/eLife.10670.003
Figure 2 with 2 supplements
Effect of DRMs on sequence diversity.

(A) For the most common reverse transcriptase and protease mutations, 95% confidence intervals are drawn for the difference in diversity associated with a single derived mutation. For each DRM, the mean diversity among patients with a fixed ancestral state at the focal locus is compared to those patients with that fixed non-ambiguous DRM. All sequences have no additional DRMs. All DRMs occurring at least 5 times with these specifications are included. (B) The effect of multiple DRMs on diversity is shown as the average diversity level of sequences decreases conditional on number of fixed drug resistance mutations present. Means ± SE are plotted among all patients in the D-PCR dataset.

https://doi.org/10.7554/eLife.10670.004
Figure 2—figure supplement 1
Diversity and the number of drug resistance mutations by treatment categories.

The relationship between diversity and the number of drug resistance mutations is plotted separated out by the major treatment categories included in our analysis: (A) NRTI(s) without any other drugs, (B) NRTIs with unboosted PI, (C) NRTIs and an NNRTI and (D) NRTIs and PI/r. Data plotted is just the abundant treatment dataset.

https://doi.org/10.7554/eLife.10670.005
Figure 2—figure supplement 2
Effect of multiple DRMs on sequence diversity separated by subtype.

Average diversity level of sequences are plotted conditioned on number of fixed drug resistance mutations present separately by all the subtypes with more than 100 associated HIV populations. Means ± SE are plotted among all the D-PCR dataset.

https://doi.org/10.7554/eLife.10670.006
Figure 3 with 2 supplements
Drug resistance mutations are correlated with diversity reduction differently in different types of treatments.

Treatment effectiveness from literature review (percentage of patients with virologic suppression after ~48 weeks) showed positive correspondence with clinical recommendation among RTI regimens (A) and PI+RTI regimens (B). Diversity reduction accompanying a DRM (ΔDRM from the GLMM) lower among the more effective and clinically recommended treatments among RTI treatments (C) and RTI+PI treatments (D). 95% confidence intervals are plotted by excluding the highest 2.5% and lowest 2.5% of GLMM random effect fits of the 1000 subsampled datasets and treatments are ordered by mean ΔDRM within treatment categories. Generalized linear model fits show significantly different slopes for NNRTI treatments versus NRTI treatments (E) and PI/r treatments versus PI treatments (F). Confidence intervals are plotted by excluding the highest 2.5% and lowest 2.5% of GLM fits to 1000 subsampled datasets.

https://doi.org/10.7554/eLife.10670.007
Figure 3—figure supplement 1
Drug resistance mutations are correlated with diversity reduction differently in different types of treatments when all years are included.

This figure is analogous with Figure 3 from the main text, but in this case, the data include 45 sequences sampled before 1995. The inclusion of these sequences substantially changes the effect of the subsampling procedure, which explains the difference in scales of these figures (see Materials and methods: p-thinning to adjust for the effect of year). The figure caption is otherwise shared with Figure 3.

https://doi.org/10.7554/eLife.10670.008
Figure 3—figure supplement 2
Drug resistance mutations are correlated with diversity reduction differently in different types of treatments with un-truncated data.

This figure is analogous with Figure 3 from the main text, but in this case, the data are not truncated to only include the patients with 4 or fewer DRMs. The figure caption is otherwise shared with Figure 3.

https://doi.org/10.7554/eLife.10670.009
Figure 4 with 2 supplements
Nonparametric test shows negative correlation between treatment effectiveness and ΔDRM.

Negative relationship between treatment effectiveness (percentage of patients with virologic suppression after ~48 weeks) and ΔDRM as fit within the GLMM among all treatments for which we have recorded effectiveness information. Treatments included are plotted at the mean ΔDRM from GLMMs fit to 1000 iterations of data subsampling. Points for each treatment are sized in proportion to the number of sequences from patients given that treatment (see legend in the lower left), and are colored based on the treatment type (see legend in the upper right). The black line shows the mean effect of treatment effectiveness on ΔDRM from linear regressions fits to the 1000 subsampled datasets and the grey band shows a 95% confidence interval (excluding the top and bottom 2.5% of fits).

https://doi.org/10.7554/eLife.10670.012
Figure 4—figure supplement 1
Nonparametric test shows negative correlation between treatment effectiveness and ΔDRM when 45 sequences from before 1995 are included.

This figure is analogous with Figure 4 from the main text, but in this case, 45 sequences before 1995 are included in the analysis. The ΔDRM coefficients are taken from the GLMM shown in Figure 3—figure supplement 1 A and B. The figure caption is otherwise shared with Figure 4.

https://doi.org/10.7554/eLife.10670.013
Figure 4—figure supplement 2
Nonparametric test shows negative correlation between treatment effectiveness and ΔDRM when sequences with any number of DRMs are included.

This figure is analogous with Figure 4 from the main text, but in this case, the data is not truncated to only include the patients with 4 or fewer DRMs. The DeltaDRM coefficients are taken from the GLMM shown in Figure 3—figure supplement 2A and B. The figure caption is otherwise shared with Figure 4.

https://doi.org/10.7554/eLife.10670.014
Figure 5 with 5 supplements
Summary of data subgroups and filtering.

Summary of all sequences from patients receiving first line therapy used throughout the analysis. The dataset is broken down into patients receiving protease inhibitor (PI) therapy with reverse transcriptase inhibitors (RTIs) and patients receiving only RTIs. The abundant treatment dataset shows treatments given to many patients within our dataset. For full filtering parameters for the abundant dataset, see Materials and methods: Data collection & filtering. Counts of sequences for each treatment are given.

https://doi.org/10.7554/eLife.10670.015
Figure 5—figure supplement 1
Distribution of number of DRMs by treatment.

The distribution of the number of drug resistance mutations is plotted separated out by the major treatments included in our analysis. The y-axes are not standardized, and show the number of consensus sequences per treatment with each number of fixed DRMs within the abundant treatment dataset.

https://doi.org/10.7554/eLife.10670.016
Figure 5—figure supplement 2
Data is heterogenous with respect to year in terms of the number of drugs per treatment, the diversity of treatments and the number of ambiguous reads per sequence.

(A) Percentage of patients receiving 1, 2 or 3 or more drugs (red, orange and blue bars, respectively) changes over the timecourse of our sampling period. (B) The cumulative number of unique treatment regimens is shown over the course of our sample period. (C) Barplots demonstrating the number of ambiguous amino acids per sequence by year with interquartile ranges. One outlier is excluded for visibility.

https://doi.org/10.7554/eLife.10670.017
Figure 5—figure supplement 3
p-thinning controls for systematic changes in the called number of ambiguous reads over time A.

The mean number of ambiguous calls has increased over time among sequences with 0 DRMs. Linear model fits predicting the number of ambiguous reads from year among sequences with 0 DRMs are shown in black for sequences from 1989 or later (Equation 1) and red for sequences from 1995 or later (Equation 2). These model fits are used to compute the subsample effect, shown in B.

https://doi.org/10.7554/eLife.10670.018
Figure 5—figure supplement 4
The number of ambiguous reads is approximately distributed according to a negative binomial when p-thinned relative to 1989.

The subsampled number of ambiguous reads (relative to 1989) relative to the best fit Poisson distribution (A) and best fit negative binomial distribution (B). The fits to one particular subsample are plotted, but the graphs are representative. These distributions were fit using vcd (Meyer et al., 2015).

https://doi.org/10.7554/eLife.10670.019
Figure 5—figure supplement 5
The number of ambiguous reads is approximately distributed according to a negative binomial when p-thinned relative to 1995.

The subsampled number of ambiguous reads (relative to 1995) relative to the best fit Poisson distribution (A) and best fit negative binomial distribution (B). The fits to one particular subsample are plotted, but the graphs are representative. These distributions were fit using vcd (Meyer et al., 2015). Note, many more ambiguous calls are retained with p-thinning relative to 1995 as compared to relative to 1989 (See Figure 5—figure supplement 4).

https://doi.org/10.7554/eLife.10670.020

Tables

Table 1

Model fits for the fixed effects from GLMMs fit to subsampled data. See Materials and methods: Quantifying the relationship between clinical effectiveness and diversity reduction for further explanations of coefficients. Means of model fits for 1000 independent subsamples are reported for the three different subsampling and model-fitting regimes. 95% confidence intervals (excluding the top 2.5% and bottom 2.5% of coefficent fits) are given in parentheses.

https://doi.org/10.7554/eLife.10670.010
αall (Intercept)Δ (Number of DRMs)γ (Length)
1995+, 4 DRMs-0.82-0.120.0030
(-0.94,-0.68)(-0.13,-0.10)(0.0028,0.0031)
1995+, All DRMs-0.88-0.0830.0030
(-1.00,-0.76)(-0.094,-0.073)(0.0028,0.0031)
1989+, 4 DRMs-1.33-0.120.0030
(-1.50,-1.16)(-0.13,-0.097)(0.0028,0.0032)
Table 2

Coefficients from GLMs fit to subsampled data. See Materials and methods: Quantifying the relationship between clinical effectiveness and diversity reduction for further explanations of coefficient descriptions. Means for 1000 independent subsamples are reported for the three different subsampling and model-fitting regimes. 95% confidence intervals (excluding the top 2.5% and bottom 2.5% of coefficent fits) are given in parentheses.

https://doi.org/10.7554/eLife.10670.011
αall (Intercept)γ (Length)Δ1,2,3NRTIΔ2NRTI+NNRTIΔ2NRTI+PI/rΔ2NRTI+PI
1995+, 4 DRMs-0.510.0025-0.21-0.053-0.270.013
(-0.62,-0.40)(0.0024,0.0027)(-0.23,-0.20)(-0.070,-0.036)(-0.36,-0.18)(-0.009,0.035)
1995+, All DRMs-0.660.0026-0.13-0.039-0.140.033
(-0.77,-0.56)(0.0025,0.0028)(-0.14,-0.12)(-0.053,-0.026)(-0.21,-0.096)(0.019,0.047)
1989+, 4 DRMs-1.030.0026-0.22-0.047-0.270.016
(-1.18,-0.88)(0.0024,0.0028)(-0.23,-0.20)(-0.069,-0.025)(-0.39,-0.17)(-0.013,0.043)

Additional files

Supplementary file 1

Detailed description of treatment effectiveness computation including references.

(A) Detailed description (organized by study) of how we extracted treatment successes versus failures, as well as length of study and viral load limit for each study. Part (B) is a reorganization of (A), but it excludes specific details on how we calculated each number.

https://doi.org/10.7554/eLife.10670.021
Supplementary file 2

Detailed description of treatment effectiveness computation including references.

(B) Chart summary (organized alphabetically by treatment) detailing all studies considered for our treatment effectiveness analysis. For each study, we recorded the number of weeks, the virologic load limit under which was considered a ‘‘success,’’ the number of successes and the total number of patients on on-treatment analysis considered.

https://doi.org/10.7554/eLife.10670.022
Supplementary file 3

Part (C) is a further summary of the final treatment effectiveness used throughout our analysis.

This supplement has a full reference section describing the studies used.

https://doi.org/10.7554/eLife.10670.023

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Alison F Feder
  2. Soo-Yon Rhee
  3. Susan P Holmes
  4. Robert W Shafer
  5. Dmitri A Petrov
  6. Pleuni S Pennings
(2016)
More effective drugs lead to harder selective sweeps in the evolution of drug resistance in HIV-1
eLife 5:e10670.
https://doi.org/10.7554/eLife.10670