Improved characterisation of MRSA transmission using within-host bacterial sequence diversity

  1. Matthew D Hall  Is a corresponding author
  2. Matthew TG Holden
  3. Pramot Srisomang
  4. Weera Mahavanakul
  5. Vanaporn Wuthiekanun
  6. Direk Limmathurotsakul
  7. Kay Fountain
  8. Julian Parkhill
  9. Emma K Nickerson
  10. Sharon J Peacock
  11. Christophe Fraser
  1. University of Oxford, United Kingdom
  2. University of St Andrews, United Kingdom
  3. Sunpasitthiprasong Hospital, Thailand
  4. Mahidol University, Thailand
  5. University of Bath, United Kingdom
  6. University of Cambridge, United Kingdom
  7. Cambridge University Hospitals NHS Foundation, United Kingdom
9 figures, 2 tables and 4 additional files

Figures

50% majority-rule consensus tree of the posterior distribution of ExaBayes phylogenies.

Branch lengths are in substitutions per site. Eight clades are highlighted. Clades 1 to 5 correspond to those identified by Tong et al. (2015), while the additional three are newly designated.

Figure 2 with 1 supplement
The median number of SNPs separating sequences from each colonisation from the most similar sequence from an isolate acquired before the date of the first positive sample from that colonisation.

Blue dots are colonisations isolated from a single swab only (trace colonisations), while red were acquired from colonisations where multiple swabs were isolated. For three colonisations (two trace) …

Figure 2—figure supplement 1
The median number of SNPs separating sequences from each colonisation, restricted to those obtained at the time of the first positive swab, from the most similar sequence from an isolate acquired before the date of the first positive sample from that colonisation.

Blue dots are colonisations isolated from a single swab only (trace colonisations), while red were acquired from colonisations where multiple swabs were isolated. For three colonisations (two trace) …

phyloscanner identifies transmission pairs by ancestral state reconstruction of hosts (left) and subsequent classification of the topological relationships between the subgraphs reconstructed to each host (right).

In this example the hosts are designated (A to D). Here host A is inferred to be the infector of hosts (B and C). The transmission from (A to C) was of only a single pathogen lineage, while that …

Figure 4 with 1 supplement
The reconstruction of the transmission process using the consensus tree overlaid on a timeline of hospital and ICU stays and sampling events.

Each row represents a colonisation, with thin lines representing the colonised subject’s presence in the hospital and thick lines their presence as a patient in an ICU. Colours of the lines and the y

Figure 4—figure supplement 1
The reconstruction of the transmission process using the consensus tree overlaid on a timeline of hospital and ICU stays and sampling events, with trace colonisations included.

Each row represents a colonisation, with thin lines representing the colonised subject’s presence in the hospital and thick lines their presence as a patient in an ICU. Colours of the lines and the y

Figure 5 with 2 supplements
Concordance of inferred infectors for each colonisation with recorded timings of hospital stays and sampling dates.

Each bar represents a colonisation, and the colours represent the proportions of the posterior set of trees where the transmission chain prior to that host involves no sampled subjects (blue), …

Figure 5—figure supplement 1
Concordance of inferred infectors for each colonisation with recorded timings of hospital stays and sampling dates, with trace colonisations included.

Each bar represents a colonisation, and the colours represent the proportions of the posterior set of trees where the transmission chain prior to that host involves no sampled subjects (blue), …

Figure 5—figure supplement 2
Concordance of inferred infectors for each colonisation with recorded timings of hospital stays and sampling dates, from a secondary phyloscanner analysis in which the tip set from each subject was randomly downsampled until a maximum of five remained.

Each bar represents a colonisation, and the colours represent the proportions of the posterior set of trees where the transmission chain prior to that host involves no sampled subjects (blue), …

Figure 6 with 1 supplement
The phyloscanner host relationship diagram.

Each node represents all the sequences for one colonisation. Node fill colours designate patients in the two hospital ICUs and the HCWs. Edges appear where colonisations share a relationship with …

Figure 6—figure supplement 1
The phyloscanner host relationship diagram, with trace colonisations included.

Each node represents all the sequences for one colonisation. Node fill colours designate patients in the two hospital ICUs and the HCWs and nodes with faded colours representing trace colonisations. …

Figure 7 with 1 supplement
The phyloscanner host relationship diagram for a separate analysis where samples taken from distinct body sites on the same subject were treated as separate ‘hosts’.

Each node represents all the sequences for one colonisation of one body site. Node fill colours designate patients in the two hospital ICUs and the HCWs. Edges appear where colonisations share a …

Figure 7—figure supplement 1
The phyloscanner host relationship diagram for the body site analysis, with trace colonisations included.

Each node represents all the sequences for one colonisation of one body site. Node fill colours designate patients in the two hospital ICUs and the HCWs and nodes with faded colours representing …

50% majority-rule consensus phylogenies for the ExaBayes phylogenetic analyses of the sequences from patients T009, T126, T232 and T249.

Tips are coloured by body site of origin. Branch lengths are in substitutions per site. Trees were rooted using the TW20 outgroup (not shown).

Figure 8—source data 1

ExaBayes consensus trees for all subjects analysed in the phyloanatomy analysis.

https://cdn.elifesciences.org/articles/46402/elife-46402-fig8-data1-v4.zip
Figure 9 with 1 supplement
Performance of SNP distance as a method for identifying transmission pairs.

(A) Plots of the sensitivity and specificity of using the number of SNPs to identify transmission pairs, for different distance thresholds. (B) ROC curves plotting true positive rate (sensitivity) …

Figure 9—figure supplement 1
Plots of the sensitivity and specificity of using the number of SNPs to identify transmission pairs, for different distance thresholds.

The gold standard for identifying transmission pairs in the version on the left is a topological relationship suggesting at least one transmitted lineage, while on the right at least two are …

Tables

Table 1
Six pairs of colonisations for whom transmission was reconstructed with a posterior score of at least 0.95 and, for those reconstructed transmissions, a posterior median of at least five tips were in the recipient subgraphs.

Each row gives the posterior median and the limits of the 95% highest posterior density (HPD) interval for the number of reconstructed transmissions between the subjects, in either direction. The …

Colonisation AColonisation BNumber of A to B transitions 95 % HPDTips in descendant B subgraphs (median)Nucleotide diversity transmitted to B (median)Number of B to A transitions 95% HPDTips in descendant A subgraphs (median)Nucleotide diversity transmitted to A (median)Direction
MedianLowerUpperMedianLowerUpper
C012C137323791.46E-060000NAA to B
C012C159b342538392.20E-0610611.40E-06A to B
C126C234333101.92E-060000NAA to B
C126C271a111213.11E-070000NAA to B
C126C327a54592.68E-060000NAA to B
C183C19415123313.41E-075019127.08E-07Unclear
Table 2
Results of the investigation of body site/phylogeny associations.

Each row corresponds to a single BATS analysis on a posterior set of phylogenies consisting just of the sequences from that subject. The p-values are given for the association index (AI), parsimony …

SubjectNumber of Sequencesp-value
AIPSMC
AxillaNoseThroatTracheaWound
T009310.02611*0.0311
T0122230.240.18110.14
T06511<0.0011*0.291*
T071130.1610.161
T092120.035111
T095110.0991*0.11*
T0991911*0.161*
T126239<0.001<0.0010.0040.10.0041*
T137790.00410.1211*
T159460.711*11
T183480.0411*0.221
T1881911*11*
T194320.1311*10.371*1*
T197350.7411111
T23211<0.001<0.0010.0010.001
T2341011*11*
T249140.002<0.0010.0170.0161
T271230.04311*0.4911*1*
T3271011*0.511*
T330811*0.261*
T358220.00410.061

Additional files

Supplementary file 1

Examples of arrangements of tips with two states (red and blue) on a phylogeny and the corresponding values of the AI, PS, and MC statistics.

Given are the values of the statistic and p-values estimated by permuting the tip states 1000 times. (A) Blue tips occur only in one clade but that clade is not exclusively blue. The AI statistic shows significant deviation from the null hypothesis of no phylogeny-state association, but the others do not. (B) Blue tips form a single clade. All statistics other than the MC for the red state show significant deviation from the null. (C) Blue tips appear randomly in the phylogeny. There is no evidence of any association using any statistic. (D) A single blue tip is basal to the remainder of the tree. The AI statistic and the MC for the red state suggest an association. (E/F) A single blue tip occurs amongst the red tips. Some evidence of association may still be identified using the AI statistic and further investigation is warranted. In cases D-F the PS and MC (blue) statistics are always equal to 1.

https://cdn.elifesciences.org/articles/46402/elife-46402-supp1-v4.docx
Supplementary file 2

Coordinates of the accessory regions of the TW20 chromosome.

https://cdn.elifesciences.org/articles/46402/elife-46402-supp2-v4.docx
Supplementary file 3

Coordinates of regions of the MRSA genome in which recombination was identified by Gubbins.

Positions are with respect to the TW20 reference strain.

https://cdn.elifesciences.org/articles/46402/elife-46402-supp3-v4.docx
Transparent reporting form
https://cdn.elifesciences.org/articles/46402/elife-46402-transrepform-v4.docx

Download links