Linking plasmid-based beta-lactamases to their bacterial hosts using single-cell fusion PCR

Version of Record: July 21, 2021
Accepted Manuscript: July 20, 2021


Part of Collection
Evolutionary Medicine: A Special Issue

Edited by George H Perry et al.

Abstract

The horizontal transfer of plasmid-encoded genes allows bacteria to adapt to constantly shifting environmental pressures, bestowing functional advantages to their bacterial hosts such as antibiotic resistance, metal resistance, virulence factors, and polysaccharide utilization. However, common molecular methods such as short- and long-read sequencing of microbiomes cannot associate extrachromosomal plasmids with the genome of the host bacterium. Alternative methods to link plasmids to host bacteria are either laborious, expensive, or prone to contamination. Here we present the One-step Isolation and Lysis PCR (OIL-PCR) method, which molecularly links plasmid-encoded genes with the bacterial 16S rRNA gene via fusion PCR performed within an emulsion. After validating this method, we apply it to identify the bacterial hosts of three clinically relevant beta-lactamases within the gut microbiomes of neutropenic patients, as they are particularly vulnerable to multidrug-resistant infections. We successfully detect the known association of a multidrug-resistant plasmid with Klebsiella pneumoniae, as well as the novel associations of two low-abundance genera, Romboutsia and Agathobacter. Further investigation with OIL-PCR confirmed that our detection of Romboutsia is due to its physical association with Klebsiella as opposed to directly harboring the beta-lactamase genes. Here we put forth a robust, accessible, and high-throughput platform for sensitively surveying the bacterial hosts of mobile genes, as well as detecting physical bacterial associations such as those occurring within biofilms and complex microbial communities.

Introduction

The emergence of multidrug-resistant (MDR) pathogens is a grave public health threat that occurs when pathogenic bacteria acquire antibiotic resistance genes (ARGs) through horizontal gene transfer (HGT) with bacteria in their proximal environment. The gut microbiome harbors a diverse repertoire of ARGs, and these genes have been proposed to serve as a reservoir for HGT with MDR pathogens (Sommer et al., 2009). ARGs are often carried on mobilizable plasmids that impose technical challenges to surveying the set of bacteria affiliated with these genes. Standard molecular tools such as PCR and next-generation sequencing often fail to associate mobile ARGs with their bacterial hosts because they cannot capture the cellular context of extrachromosomal genes. Novel untargeted sequencing methods, such as bacterial Hi-C (Kent et al., 2020) and methylation profiling (Beaulaurier et al., 2018), provide broad reconstruction of plasmid–host relationships in metagenomes, but at the cost of sensitivity. Alternatively, single-cell whole-genome sequencing offers an ideal solution to this problem, but it can be lower throughput and more expensive, and can require specialized equipment (Xu et al., 2016; Lan et al., 2017). Targeted methods, such as bacterial cell culture under antibiotic selection, require that the ARG is expressed, functional, and selective in all hosts. Culturing, applied broadly to capture the full diversity of the gut microbiome, is complicated by the need for wide-ranging media and growth conditions (Zou et al., 2019; Poyet et al., 2019).

Several targeted methods using single-cell qPCR have been used to identify the hosts of specific genes; however, each relies on specialized microfluidic devices and is limited in the bacterial taxa it can capture, and most do not allow direct sequencing of the PCR products (Ottesen et al., 2020; Zeng et al., 2010; Tadmor et al., 2011). Alternatively, epicPCR (Spencer et al., 2015) uses fusion PCR and two emulsion steps to associate a taxonomic marker with a functional gene. Sequencing the fused PCR products provides accurate and sensitive associations between 16S sequence taxonomy and a given target gene. However, this method can be challenging to execute, is difficult to scale up for multiple samples, and uses toxic and difficult-to-acquire reagents.

Here, we put forth One-step Isolation and Lysis PCR (OIL-PCR), a method that detects host–ARG associations from complex microbial communities through cellular emulsion and fusion PCR. Our streamlined method, based on the innovation of epicPCR, simplifies the procedure by combining the two emulsion steps of cell lysis and fusion PCR into a single emulsion PCR that can be performed in a 96-well format using robotic automation. Furthermore, OIL-PCR can be multiplexed to target at least three genes in the same reaction, uses non-toxic commercially available reagents, and can be performed without relying on microfluidics or specialized equipment. Validation experiments on three environmental bacterial communities reveal that OIL-PCR is highly accurate and specific. We demonstrate the utility of this approach in examining the bacterial hosts of three extended spectrum beta-lactamase genes in the gut of neutropenic patients.

Results

Development of the OIL-PCR method

OIL-PCR applies established fusion PCR methods to fuse any gene of interest to the 16S rRNA gene using three primers: two primers hybridize to the target gene and a universal 16S reverse primer hybridizes to the V4 region. Amplification of the target gene appends a universal 16S forward primer sequence to the end of the target amplicon via a tailed reverse primer. The target gene amplicon then hybridizes to the 16S rRNA gene and acts as a forward primer, producing a fused gene product containing both the target gene and the 16S V4 sequence (Figure 1a, Figure 1—figure supplement 1).
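The three-primer design described above can be illustrated with a minimal in-silico sketch. All sequences and function names here are toy placeholders chosen for illustration (they are not the primers or genes used in this study), and the sketch assumes perfect primer matches on the top strand only.

```python
# Toy in-silico fusion PCR. Every sequence below is a hypothetical placeholder.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def fusion_pcr(target_gene, target_fwd, target_rev_site,
               rrna_gene, rrna_fwd, rrna_rev_site):
    """Return (tailed fusion primer, fused top-strand product)."""
    # Step 1: amplify the target between the forward primer and the reverse site.
    start = target_gene.index(target_fwd)
    end = target_gene.index(target_rev_site) + len(target_rev_site)
    target_amplicon = target_gene[start:end]
    # The tailed reverse primer anneals to the reverse site and carries a 5' tail,
    # so the top strand of the amplicon gains the 16S forward primer sequence.
    fusion_primer = revcomp(target_rev_site + rrna_fwd)
    # Step 2: the appended 3' end anneals at the 16S forward site and is extended
    # up to the universal 16S reverse primer site.
    s = rrna_gene.index(rrna_fwd)
    e = rrna_gene.index(rrna_rev_site) + len(rrna_rev_site)
    fused = target_amplicon + rrna_gene[s:e]
    return fusion_primer, fused

TARGET_FWD = "ATGGCTAGC"
TARGET_REV_SITE = "TTACGGAGC"
RRNA_FWD = "GTGCCAGCAGCC"
RRNA_REV_SITE = "ATTAGATACCC"
TARGET_GENE = "TTGACC" + TARGET_FWD + "AAGGTCCGA" + TARGET_REV_SITE + "TTAAC"
RRNA_GENE = "CCTA" + RRNA_FWD + "GGATTCAGGTTAAC" + RRNA_REV_SITE + "GGTA"

primer, product = fusion_pcr(TARGET_GENE, TARGET_FWD, TARGET_REV_SITE,
                             RRNA_GENE, RRNA_FWD, RRNA_REV_SITE)
```

In this sketch the fused product carries the target amplicon on one side of the 16S forward primer sequence and the V4 stretch on the other, which is the junction later used to split sequencing reads.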

Figure 1 with 7 supplements

OIL-PCR can specifically link plasmid-encoded genes with their hosts.

(a) Depiction of the OIL-PCR method. (1) Nycodenz-purified cells are mixed with PCR master mix, lysozyme, and emulsion oil and shaken to create an emulsion. (2) Cells are lysed within the emulsion. (3) Fusion PCR is performed in droplets containing cells harboring the targeted gene. Fused amplicons between the gene of interest and the 16S rRNA gene are the product. (b) A boxplot showing the percent of Illumina reads containing correct fusion products, namely the fusion of plasmid-borne *cmR* and the 16S rRNA gene of *E. coli* MG1655. OIL-PCR was performed on two individuals’ and one chicken’s gut microbiome sample in triplicate, spiked with varying concentrations of *E. coli.* (c) Rarefaction analysis of chicken (left) or human gut microbiome sample (middle) with (orange) and without (blue) lysozyme treatment. At right is the rarefaction analysis performed on Firmicutes only in the human stool sample. Grayed regions in the plot represent areas where the curves, each composed of four technical replicates, are significantly different (p<0.05) from one another, according to an FDR-corrected Welch’s t-test.

For fusion PCR to accurately link target genes with host marker genes, cells must be isolated to prevent the formation of non-specific fusion products. Oil emulsions and microwells have long been used to isolate eukaryotic cells; however, it is difficult to lyse bacteria in this format, especially gram-positive bacteria due to their thick cell walls. Existing single-cell isolation methods for bacteria either do not address this problem (Zeng et al., 2010; Tadmor et al., 2011), rely on specialized microfluidics (Liu et al., 2018), or use time-consuming methods to encapsulate bacteria within hydrogel beads before performing multi-step chemical and enzymatic lysis procedures (Spencer et al., 2015; Tamminen and Virta, 2015). To address this problem, OIL-PCR combines bacterial isolation, lysis, and fusion PCR into a single streamlined reaction.

We developed a protocol that allows for the incorporation of Ready-Lyse (RL) Lysozyme into the fusion PCR master mix. Whole bacterial cells are added directly to the master mix while on ice to inhibit lytic activity during sample preparation. Vigorous shaking of the mixture then encapsulates the individual cells in an emulsion. Warming the emulsion to 30°C activates the enzyme, lysing the cells. Next, a standard PCR thermocycler carries out the fusion PCR in the single-cell emulsions. Fused PCR products are purified from the emulsion and amplified further with a nested primer to filter out off-target PCR products and add Illumina adapters. Lastly, custom indexing primers are used to index the fused products before Illumina sequencing. Our experiments confirmed that RL Lysozyme is compatible with the fusion PCR, provided that bovine serum albumin (BSA), a globular protein known to reduce protein aggregation, is added (Finn et al., 2012; Figure 1—figure supplement 2a). We found that RL retained full activity in the standard NEB Phusion HF buffer (Figure 1—figure supplement 2b).

Next, we optimized the fusion PCR master mix to maintain a stable emulsion and amplify efficiently in picoliter droplets. PCR emulsions were prepared with fluorinated oil as used in modern emulsion-based methods, such as Drop-Seq (Macosko et al., 2015) and digital qPCR (BioRad ddPCR). We combined the fusion PCR master mix with bacterial cells and emulsion oil in either a 1.5 ml tube or a 0.5 ml deep-well plate before emulsifying the mixture using a tabletop bead homogenizer. Unlike microfluidic-enabled emulsions, our protocol leverages equipment commonly found in most molecular biology laboratories. We stabilized the emulsion with detergent-free buffers and improved the efficiency of the PCR amplification within the emulsion by adding additional polymerase, BSA, dithiothreitol (DTT), and ammonium sulfate. We found that the addition of extra MgCl₂ mitigated the inhibitory effects of extremely high concentrations of cell debris within droplets after lysis (Figure 1—figure supplement 2c).

OIL-PCR accurately associates plasmid genes with the host in a binary community

In any emulsion-based method, it is essential to optimize the concentration of input cells to prevent the encapsulation of two or more cells in the same droplet. When using a monodisperse emulsion such as those generated using microfluidics, the ideal concentration of input cells is chosen using a Poisson distribution (Ottesen et al., 2020; Zeng et al., 2010; Tadmor et al., 2011). However, these calculations are not reliable in the case of a polydisperse emulsion, as employed here to avoid the need for microfluidic devices. We therefore developed a probe-based TaqMan qPCR assay to experimentally verify the optimal concentration of input cells that prevented non-specific gene fusions (Figure 1—figure supplement 3a). OIL-PCR was performed on a binary community consisting of E. coli carrying the chloramphenicol resistance gene cmR on a plasmid and WT V. cholerae. The two strains were mixed 1:1, and we performed OIL-PCR with a fusion primer set specific to cmR and universally targeting the 16S rRNA gene (Spencer et al., 2015; Supplementary file 2 and 3). A gradient of cell input concentrations was used, and the final PCR products were recovered and purified. We then performed probe-based qPCR on the purified product using a nested primer for cmR, two blocking primers to inactivate any unfused amplicons, and two distinct fluorescent TaqMan probes (Thermo Fisher 4316034) to specifically target the V4 region of either E. coli or V. cholerae (Supplementary file 2). The fluorescent signal from each probe measured the relative ratio of specific to non-specific gene fusions present in the final amplicon pool (Figure 1—figure supplement 3a). When the input concentration of cells was at or below 400 cells/μl (40,000 cells per reaction), non-specific gene fusions were reduced to undetectable levels (Figure 1—figure supplement 3b).

In addition to confirming that bacterial cells were isolated within the emulsion, we confirmed that droplets did not coalesce by performing the TaqMan assay on OIL-PCR products from E. coli and V. cholerae cells combined after they were individually emulsified (Figure 1—figure supplement 3b). Our results confirmed that the emulsion is highly stable, with no detectable coalescence.
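For reference, the Poisson loading calculation mentioned above for monodisperse emulsions can be sketched as follows. This is an illustration only: the 100 pl droplet volume is a hypothetical value, not a measurement from this study, and the calculation is not expected to hold for the polydisperse emulsions used in OIL-PCR, which is why the input concentration was determined empirically.

```python
import math

def poisson_pmf(k, lam):
    """Probability of exactly k cells in a droplet with mean occupancy lam."""
    return math.exp(-lam) * lam ** k / math.factorial(k)

def multi_cell_fraction(cells_per_ul, droplet_volume_pl):
    """Mean occupancy and the fraction of occupied droplets holding >= 2 cells.

    Assumes uniform (monodisperse) droplets and randomly distributed cells.
    """
    lam = cells_per_ul * droplet_volume_pl * 1e-6  # 1 pl = 1e-6 ul
    if lam == 0:
        return 0.0, 0.0
    p0 = poisson_pmf(0, lam)
    p1 = poisson_pmf(1, lam)
    occupied = 1.0 - p0
    return lam, (occupied - p1) / occupied

# 400 cells/ul is the input concentration reported in the text;
# the droplet volume is an assumed placeholder.
mean_occupancy, multi_fraction = multi_cell_fraction(400, 100)
```

For small mean occupancy, the fraction of occupied droplets containing more than one cell is approximately half the mean occupancy, which is why dilute inputs are used in droplet-based single-cell methods.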

Application of OIL-PCR to environmental microbial communities allows robust and sensitive association of extrachromosomal elements with their host

Using OIL-PCR on environmental microbial communities requires clean bacterial cell preparations free of environmental contaminants, which may inhibit PCR. To address this concern, cells were purified using Nycodenz density gradient centrifugation (Holmsgaard et al., 2011; Hevia et al., 2015), a simple method that can isolate clean bacterial fractions with minimal handling time to reduce contamination. Additionally, concerned that cell-free DNA can stick to the membranes and cell walls of bacteria (Vorkapic et al., 2016), thus introducing noisy associations in the data, we treated cells with heat-labile double-strand-specific DNase (dsDNase). This enzyme only digests unprotected double-stranded genomic DNA present in the samples without degrading single-strand primers. By controlling the enzyme concentration, temperature, and speed at which cells were processed, we were able to digest extracellular DNA without impacting PCR efficiency of cellular contents. Using our TaqMan assay, we demonstrated that including dsDNase treatment has the potential to increase the total cell input per reaction tenfold (Figure 1—figure supplement 3c).

To test the accuracy of our method on environmental samples, we spiked Escherichia coli MG1655 (Blattner et al., 1997) containing plasmid pBAD33 (Guzman et al., 1995) harboring the cmR gene into two human and one chicken stool samples that lacked the gene according to PCR screening. We performed OIL-PCR in triplicate with primers targeting the cmR gene and sequenced using MiSeq 2x250 reads. Paired-end reads were merged and quality filtered before splitting them at the fusion primer junction. The target portion of each read was confirmed to match the cmR gene and taxonomy was assigned to the 16S portion of each read (Figure 1—figure supplement 4). Our results show that when the test strain of E. coli was incorporated at 0.1%, or about 20 cells total, 97.8% of the reads (or 99.2%, excluding a single outlier) demonstrated the correct association (Figure 1b), highlighting the sensitivity of OIL-PCR to detect the associations of genes in low-abundance species across different sample types. The accuracy of OIL-PCR decreases slightly when the targeted sequence increases to 10% of the community composition, although associations were still 97% correct on average.
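The read-processing steps described above (splitting merged reads at the fusion junction, checking the target portion, and assigning taxonomy to the 16S portion) can be sketched as below. This is a simplified illustration under stated assumptions: the junction sequence, the target reference, and the 16S lookup table are all hypothetical toy values, and exact string matching stands in for the alignment and taxonomic classification that a real pipeline would use.

```python
from collections import Counter

# All sequences below are toy placeholders, not the primers or genes from the study.
JUNCTION = "GTGCCAGCAGCC"            # hypothetical fusion-junction sequence
TARGET_REF = "ATGGAGAAAAAAATCACTGG"  # hypothetical target-gene reference
TOY_16S_DB = {                       # hypothetical 16S fragment -> taxon lookup
    "TACGGAGGGTGC": "Escherichia",
    "TACGTAGGTGGC": "OtherTaxon",
}

def split_read(read, junction=JUNCTION):
    """Split a merged read into (target part, 16S part); None if no junction."""
    target_part, found, rrna_part = read.partition(junction)
    return (target_part, rrna_part) if found else None

def tally_associations(reads):
    """Count taxa among reads whose target portion matches the reference."""
    counts = Counter()
    for read in reads:
        parts = split_read(read)
        if parts is None:
            continue  # no fusion junction: not a fused product
        target_part, rrna_part = parts
        if not target_part or target_part not in TARGET_REF:
            continue  # target portion does not match the expected gene
        counts[TOY_16S_DB.get(rrna_part, "unclassified")] += 1
    return counts

def fraction_assigned(counts, taxon):
    """Fraction of retained fusion reads assigned to the given taxon."""
    total = sum(counts.values())
    return counts[taxon] / total if total else 0.0
```

The fraction returned by `fraction_assigned` for the spiked-in host corresponds to the "percent of reads with the correct association" reported in Figure 1b.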

Lysozyme improves capture of difficult-to-lyse gram-positive bacteria

To achieve our goal of robust lysis and amplification to screen all bacteria within a complex community, we measured the effect lysozyme had on bacterial detection. We performed standard 16S sequencing on human and chicken stool communities using OIL-PCR, testing three variables: the effect of lysozyme, dsDNase, and heat inactivation of dsDNase on total bacterial recovery (Figure 1—figure supplement 5). All eight combinations of the three variables were tested in duplicate for two stool samples using robotic automation. For our analysis, we chose to focus on the total number of operational taxonomic units (OTUs) captured in our data rather than relative abundance metrics, as this better reflects our goal of detecting species, rather than recapitulating the starting community structure.

First, we assayed how each of the three variables (RL, dsDNase, and heat inactivation) affected OTU recovery. Based on rarefaction curves, we found that dsDNase and heat inactivation had no significant effect on OTU recovery in human and chicken stool, while RL lysozyme significantly increased OTU recovery in chicken stool based on Welch’s t-test with Benjamini–Hochberg FDR correction (Figure 1c, Figure 1—figure supplement 5). RL was the only variable that significantly changed OTU recovery, and therefore, it was the only variable considered for further analysis.
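The two statistical ingredients named above, rarefaction curves and Benjamini–Hochberg correction, can be sketched with the Python standard library as follows. This is a generic illustration, not the analysis code used in this study: the function names, the number of subsampling iterations, and the example inputs are assumptions. The per-depth Welch’s t-test itself is omitted here; one option would be `scipy.stats.ttest_ind` with `equal_var=False`.

```python
import random

def rarefaction_curve(otu_counts, depths, n_iter=50, seed=0):
    """Mean number of distinct OTUs observed when subsampling reads to each depth.

    otu_counts maps OTU name -> read count; each depth must not exceed total reads.
    """
    rng = random.Random(seed)
    pool = [otu for otu, n in otu_counts.items() for _ in range(n)]
    curve = []
    for depth in depths:
        richness = [len(set(rng.sample(pool, depth))) for _ in range(n_iter)]
        curve.append(sum(richness) / n_iter)
    return curve

def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

In an analysis of this kind, the p-values from comparing the two treatments at each sampling depth would be passed together to `benjamini_hochberg`, and depths with adjusted values below 0.05 would correspond to the grayed regions in Figure 1c.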

Next, we looked to see which taxonomic groups were being enriched or depleted with the addition of lysozyme. Technical replicate OTU tables were combined for analysis to allow for deeper sampling depth. Rarefaction curves were generated for both lysozyme treatments at each taxonomic level containing 10 or more OTUs from Phylum to Genus. Results show that no taxonomic group was significantly depleted in either human or chicken stool samples (Figure 1—figure supplement 6). Chicken rarefaction curves trended higher with lysozyme for every taxonomic group tested, with significant improvements for the phyla Firmicutes, Bacteroidetes, and Cyanobacteria (Figure 1—figure supplement 6). Overall, 14 taxonomic groups were significantly enriched in chicken stool, mostly from Firmicutes and Bacteroidetes.

The effect of lysozyme on human stool was not as pronounced as for chicken stool, but it did significantly enrich for the Firmicutes phylum as well as the Lachnospiraceae family. The only group that trended worse with lysozyme was the Bacteroidetes (p = 0.45), with the family Bacteroidaceae accounting for most of the effect. Interestingly, the closely related and biologically important family Prevotellaceae was enriched with p = 0.07 (Figure 1—figure supplement 6b). While we cannot fully explain why lysozyme generally improves capture of Bacteroidetes in chicken but not human stool, the overall benefit of lysozyme is apparent, especially for capturing the breadth of diversity within the Firmicutes phylum.

We noticed that the total number of OTUs recovered from OIL-PCR was significantly lower than 16S sequencing of the input community at the same sampling depth (Figure 1—figure supplement 7). We hypothesized that this reduction in OTUs was due to subsampling bias introduced through low cell input and variable amplification efficiency in OIL-PCR. To test our hypothesis, we combined OTU tables from two, four, and eight technical replicates and found a consistent up-shift for each rarefaction curve as we combined more tables. This up-shift was not observed when combining the input Nycodenz sequencing, indicating that the reduced OTU counts were due in part to subsampling bias and not an inherent failure to capture bacterial taxa (Figure 1—figure supplement 7). We therefore recommend that OIL-PCR be performed in replicates to increase the total number of cells being sampled.

Increased throughput through automation and multiplexing

To further improve the efficiency and throughput of OIL-PCR, we sought to transition the method from 1.5 ml centrifuge tubes to a 96-well plate format using the Eppendorf epMotion liquid handling robot. The liquid handling robot can perform certain parts of the PCR preparation as well as DNA recovery and purification. The automated workflow allowed us to process up to 48 samples simultaneously with fewer manual steps overall.

We next tested whether OIL-PCR could simultaneously target multiple genes through multiplexing. We repeated the previously described TaqMan assay using a strain of V. cholerae containing the ampicillin resistance gene ampR and E. coli with cmR, both on a plasmid (Figure 1—figure supplement 3d). Our results demonstrate that OIL-PCR can be multiplexed while still accurately maintaining the correct associations of target genes with their host bacteria.

Bacterial hosts are identified for several clinically important β-lactamase genes

We analyzed metagenomic sequencing of stool samples that were collected from a cohort of patients who were neutropenic because of chemotherapy administered for a hematopoietic cell transplant. Two patients, B335 and B314, were chosen for OIL-PCR based on the presence of three class-A beta-lactamase genes, bla_TEM, bla_SHV, and bla_CTX-M, in the metagenomes (Kent et al., 2020). We tested a three-sample time course from patient B335: before antibiotic treatment, after 4 days of trimethoprim-sulfamethoxazole and 1 day of levofloxacin, and lastly after an additional 2 days of levofloxacin (Figure 2a). Patient B335 carried all three genes across three time points, with bla_TEM and bla_CTX-M on a metagenomic scaffold which blasted to an 80 kb Klebsiella plasmid and bla_SHV on a contig that blasted to the K. pneumoniae genome (Figure 2b). Previously published Hi-C sequencing of the stool samples identified an association between K. pneumoniae and the 80 kb plasmid, as well as transfer to Citrobacter braakii between time points 1 and 2 (Kent et al., 2020). We tested one sample from patient B314 from before antibiotic treatment which carried multiple bla_SHV genes. We hypothesized that OIL-PCR could be used to sensitively and accurately detect additional hosts of these genes.

Figure 2 with 1 supplement

Extended spectrum beta-lactamase genes are associated with both pathogenic and commensal species.

(a) Summary of treatment and sample time points for patient B335. (b) Depiction of an 80 kb plasmid carried by K. pneumoniae harboring the bla_CTX-M, bla_TEM, Tn3 transposase and resolvase genes. The bla_SHV gene is presumed to be carried within the K. pneumoniae genome. Placement of these genes was inferred from metagenomic assemblies of patient B335’s gut microbiome sample. (c) OIL-PCR results for each of the genes depicted in (b) for patient B335 at three time points. For all gene-taxa associations, the percent of total OIL-PCR reads for that gene-time point is plotted. All species passing our detection threshold of 0.5% (dotted line) at any of the three time points are included in this plot. (d) A table summarizing the results in (c). All gene-taxa associations for each time point passing our detection thresholds are listed. Two SNP variants of TEM were detected and denoted with subscript numbering. Gene-taxa associations which did not consistently pass our detection threshold across all technical replicates are noted (*).

We designed three degenerate fusion primer sets to broadly target most variants of bla_TEM, bla_SHV, and bla_CTX-M (Supplementary file 2 and 3), and performed multiplexed OIL-PCR with robotic automation. Samples were processed in quadruplicate. We defined a positive gene–taxa association as one accounting for at least 0.5% of total reads across the four technical replicates.
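One way to apply such a threshold is sketched below. It reflects one reading of the rule, stated here as an assumption: reads are pooled across replicates to call an association, and an association is additionally flagged when any individual replicate falls below the threshold (compare the asterisked entries in Figure 2d). The function name and data layout are hypothetical.

```python
THRESHOLD = 0.005  # 0.5% of total reads, as stated in the text

def call_associations(replicates, threshold=THRESHOLD):
    """Call gene-taxa associations for one gene at one time point.

    replicates: list of dicts mapping taxon -> fused read count, one per
    technical replicate. Returns taxon -> {"fraction", "consistent"}.
    """
    total = sum(sum(rep.values()) for rep in replicates)
    if total == 0:
        return {}
    taxa = set().union(*replicates)
    calls = {}
    for taxon in taxa:
        pooled = sum(rep.get(taxon, 0) for rep in replicates) / total
        if pooled < threshold:
            continue  # below the pooled detection threshold
        # Flag whether every individual replicate also passes the threshold.
        consistent = all(
            sum(rep.values()) > 0
            and rep.get(taxon, 0) / sum(rep.values()) >= threshold
            for rep in replicates
        )
        calls[taxon] = {"fraction": pooled, "consistent": consistent}
    return calls
```

Pooling favors sensitivity for low-abundance hosts, while the per-replicate flag gives a simple indication of how reproducible each call is.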

Our OIL-PCR results largely confirm findings in the metagenomic assemblies from Kent et al., 2020. In B314, we found bla_SHV associated with Klebsiella as suggested by metagenomic assemblies. However, we also detected two other class-A beta-lactamase genes, bla_LEN and bla_OXY, which were present in the metagenomes but which we did not expect to amplify with our primers. bla_LEN amplified with the primers designed for bla_SHV and bla_OXY amplified with primers for bla_CTX-M. Curiously, bla_OXY is an exceptionally poor match for our bla_CTX-M primers, having a mismatch one base away from the 5’ end of the fusion primer. We hypothesize that the low annealing temperature and modified buffer used in the emulsion PCR are highly permissive to priming mismatches. We see permissive annealing as an advantage for the method because it allows for amplification of unknown variants of target genes while amplification due to off-target priming is filtered out during the nested PCR step (Figure 1—figure supplement 1), leaving only the true amplicons in the final sequencing. This permissive annealing behavior of OIL-PCR can be leveraged in the future to design broad-range primers for diverse gene groups such as metallo-beta-lactamases (Somboro et al., 2018).

Results from patient B335’s time course also matched the metagenomic and Hi-C sequencing from Kent et al., associating bla_TEM, bla_SHV, and bla_CTX-M with Klebsiella in all three time points (Figure 2c,d). We also found that all three genes strongly associated with the commensal genus Romboutsia in time points T2 and T3 and to a lesser extent with Agathobacter in time point T1 (Figure 2c,d). Citrobacter braakii, which was detected as a recipient of the Klebsiella plasmid in the Hi-C sequencing, did not initially show up in our analysis because it was clustered with Klebsiella, from which its 16S sequence differs by only a single base pair. However, upon closer inspection, the C. braakii strain does appear to be associated with the three genes in time points T2 and T3 only. These results indicate that, using manual inspection or by modifying our computational pipeline, higher resolution associations can be obtained by OIL-PCR. A strain of Escherichia with a distinct variant of bla_TEM was detected at time point T2, but did not pass the detection threshold across all replicates in time point T3. We repeated OIL-PCR on all three samples from B335, this time in triplicate without multiplexing to further confirm these results. The singleplex experiment perfectly mirrored the multiplex results, excluding one replicate of T2/CTX-M which failed to sequence, indicating that these genes may be linked with organisms other than Klebsiella. As further confirmation of this result, we targeted two Tn3-like transposon genes situated in close proximity to bla_TEM and bla_CTX-M on the 80 kb Klebsiella plasmid. We hypothesized that these genes should also be associated with the same genera as the ARGs. Remarkably, we observed the identical pattern with Klebsiella, Romboutsia, and Agathobacter as with the three beta-lactamases, but not Escherichia, which carried a distinct variant of bla_TEM (Figure 2c,d).

OIL-PCR provides further evidence of the association of beta-lactamases with the commensal Romboutsia

We next investigated whether OIL-PCR could be used to further confirm the association between Romboutsia and the three beta-lactamases. We focused specifically on Romboutsia because of the strong signal in the OIL-PCR results compared to Agathobacter. For this experiment, instead of fusing the ARG sequence to the 16S rRNA gene using universal primers, we used primers designed to specifically detect the Romboutsia 16S rRNA (Gerritsen et al., 2014) and fused the 16S gene specifically to bla_TEM (Figure 3a, Supplementary file 2 and 3). In this design, no amplification is possible unless Romboutsia is encased in the same droplet as the bla_TEM gene, which rules out false-positive associations due to chimera formation. Amplification and sequencing products were obtained only from time points T2 and T3, with no signal detected at time point T1, confirming the presence of bla_TEM within Romboutsia at time points T2 and T3, but not T1 (Figure 3b).

Figure 3

R. timonensis strains associated with the three beta-lactamase genes appear over the patient’s time course.

(a) Depiction of the reverse OIL-PCR in which *Romboutsia*-specific 16S rRNA sequences (blue) are fused with the *bla_TEM* sequences (red). (b) OIL-PCR read counts of the reaction shown in (a) are plotted. (c) The percent sequence identity of assembled *R. timonensis* marker genes between genes identified in time points 2 and 3 (top) and between time point 1 and the sequences shared at time points 2 and 3 (bottom). (d) RPKM-normalized abundance values for the assembled marker genes for each strain assembled in time point 1 and the major strain present in time points 2 and 3.

We next explored the metagenomic data for clues as to whether the Romboutsia strain was present at time point T1, but below the detection threshold, or whether the strain linked with the genes was acquired sometime between time points T1 and T2. Based on the 16S data from OIL-PCR and metagenomic sequencing, we identified the Romboutsia species as R. timonensis. Genus-level abundance data showed R. timonensis to be present in all three time points in patient B335. Due to the overall low abundance of this organism, we were unable to assemble a Romboutsia genome from these samples. Instead, we aligned patient B335’s three samples to the R. timonensis (PRJEB14233) genome from NCBI, assembled the aligned reads, and examined similarities between the R. timonensis taxonomic markers over the three time points (Figure 3c). We found that B335 was colonized by at least two independent strains of R. timonensis during the first time point, but that only one R. timonensis strain persisted during time points T2 and T3. One of the R. timonensis strains from T1 was identical to the strain from T2 and T3 across 15/30 AMPHORA marker genes, and >99% identical in 24/30 genes (Figure 3c), suggesting that the strain of R. timonensis from time points T2 and T3 was also present at time point T1. We found no significant difference in the normalized abundance of Romboutsia between time point T1 and time points T2 and T3 (Figure 3d), although our data suggest that the persistent strain is the minor variant at time point T1. Despite the sensitivity of OIL-PCR, which can detect cells at least 0.1% abundant (Figure 1b), we cannot rule out the possibility that the stochasticity of sampling in OIL-PCR and the low abundance of this particular strain of R. timonensis precluded our ability to observe this association at the beginning of the time course.
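The two quantities compared in Figure 3c,d, percent identity between marker genes and RPKM-normalized abundance, can be computed as in the minimal sketch below. The function names are hypothetical, the RPKM expression is the standard definition (reads per kilobase of gene per million sequenced reads), and the identity calculation assumes the two sequences are already aligned to equal length without gaps.

```python
def rpkm(mapped_reads, gene_length_bp, total_reads):
    """Reads per kilobase of gene per million sequenced reads."""
    return mapped_reads * 1e9 / (gene_length_bp * total_reads)

def percent_identity(seq_a, seq_b):
    """Percent of matching positions between two pre-aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)

def count_shared_markers(identities, cutoff):
    """Number of marker genes whose percent identity is at or above the cutoff."""
    return sum(identity >= cutoff for identity in identities)
```

With one identity value per marker gene, `count_shared_markers(identities, 100.0)` and `count_shared_markers(identities, 99.0)` give the kind of tallies reported in the text for the 30 AMPHORA marker genes.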

OIL-PCR confirms the physical association between Romboutsia timonensis and Klebsiella pneumoniae

Our analysis clearly shows an association between Romboutsia and the three beta-lactamase genes; however, there are two plausible explanations for these results. Either Romboutsia acquired all three genes from Klebsiella through HGT, or Romboutsia and Klebsiella are physically linked together, causing them to consistently emulsify within the same droplet, thus allowing the Romboutsia 16S gene to fuse with the three ARGs. To distinguish between these two possibilities, we designed OIL-PCR primers targeting two Klebsiella pneumoniae housekeeping genes rpoB and glmS, and two Romboutsia timonensis genes rpoB and nusA. Using these primer sets, along with bla_CTX-M primers as a control, we ran OIL-PCR to see if we found Klebsiella marker gene sequences fused to Romboutsia 16S or vice versa, suggesting capture of the two species within the same droplet. Klebsiella primers were multiplexed with bla_CTX-M and the Romboutsia primers were assayed in separate reactions to rule out the possibility of PCR chimeras during library preparation.

The results show that both Klebsiella and Romboutsia 16S sequences were fused to both of the Klebsiella marker genes, mirroring the pattern seen for the ARGs and transposase genes assayed. The bla_CTX-M control also presented the same pattern as previously demonstrated (Figure 2—figure supplement 1). This result indicates that Klebsiella and Romboutsia are being emulsified together, suggesting a physical association and not gene transfer. Based on these results, we would also expect to find the Klebsiella 16S sequence fused to the Romboutsia marker genes. Primers targeting nusA failed to amplify; however, the Romboutsia-specific rpoB primers did fuse to Klebsiella 16S sequences in two of the nine total replicates across three time points. These results, taken with our previous OIL-PCR experiments, present compelling evidence that the observations can be explained as a novel physical association between Klebsiella pneumoniae and Romboutsia timonensis that developed between strains present after time point 2 in patient B335.

Discussion

Here we show the ease with which OIL-PCR can identify carriers of known resistance markers on extrachromosomal elements within complex bacterial communities. We applied it to a neutropenic patient’s gut microbiome and showed the correct association of three beta-lactamases with K. pneumoniae, and also discovered novel associations between these beta-lactamases and two gut commensals, R. timonensis and Agathobacter spp. Two of the genes, bla_CTX-M and bla_TEM, were both found on a large Klebsiella plasmid within the metagenome, suggesting the possible transfer of these genes to R. timonensis during the time course. However, analysis of the plasmid sequence showed that while it does contain an origin of transfer, it does not have the genes necessary to transfer itself, meaning it would require a second ‘helper plasmid’ to mobilize. Additionally, bla_SHV was only found on a contig belonging to the Klebsiella genome without any known mobilizable transposons or integrative conjugative elements nearby, severely limiting its transfer potential. An alternative explanation for our results is that Romboutsia and Klebsiella became physically associated within the gut, and thus consistently emulsified together. Using OIL-PCR targeting species-specific marker genes, we showed that our results were indeed due to a novel physical interaction between K. pneumoniae and R. timonensis.

Whether they reflect mobilization of ARG-containing plasmids or novel physical associations within the gut, our results highlight the strength of OIL-PCR for unraveling the intricate dynamics of the gut microbiome. The ability of OIL-PCR to detect two kinds of ecologically and clinically important interactions, as well as distinguish between them, is a major strength of the method. Additionally, these two interaction types are deeply entwined, with close physical association being a known activator for conjugal transfer of genes (Clark et al., 2018), as well as a mechanism for resistance in multispecies biofilms (Burmølle et al., 2014). OIL-PCR is a practical and transportable protocol that requires neither specialized equipment nor specialized expertise. We identify improvements in performing single-cell analysis on stool, namely the use of a Nycodenz purification step and the incorporation of lysozyme plus heat-induced lysis. Additionally, we increased throughput at least threefold through primer multiplexing and developed an automated protocol to process at least 48 samples concurrently, allowing a total of 144 gene-sample association tests per batch.

Additional improvements to OIL-PCR could be explored to further increase throughput and sensitivity. Although we tested multiplexing three genes per reaction, this number could likely be increased as we have found no sign of false positives due to multiplexing as demonstrated by associating a novel bla_TEM variant with only Escherichia in time point T2 of patient B335 (Figure 2b,c). Furthermore, we show that the OIL-PCR master mix facilitates permissive annealing of primers, allowing a mismatch one base from the primer’s 3’ end as demonstrated when bla_OXY was detected in sample B324-2 with bla_CTX-M primers. These results could allow for the development of highly degenerate primers to target a broad range of gene variants. Non-specific priming during OIL-PCR is not of concern because the nested PCR specifically filters out undesired fusion products. Because OIL-PCR uses three primers for each target gene, primers designed for OIL can easily be adapted for probe-based qPCR pre-screening of samples instead of using metagenomic sequencing as was done in this study.

While the startup cost of using OIL-PCR is low compared to other methods, it currently uses large amounts of Phusion polymerase and magnetic beads for DNA purification, which inflates the cost (~$15/100 µl reaction). The amount of Phusion needed could be reduced with further optimization of the PCR master mix, and the number of purification steps could be cut by treating PCR products with exonuclease I instead of purifying them. Lastly, the method described currently allows 40,000 cells total per reaction; however, our probe-based qPCR assays suggest that the input concentration could be increased 10-fold by pretreating cells with dsDNase (Figure 1—figure supplement 3c). Combined with our result showing that OIL-PCR is more accurate when detecting low-abundance taxa (Figure 1b), we feel confident that cell input can be increased to improve sensitivity without sacrificing accuracy.
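The cell-input figures above follow from simple arithmetic, sketched here in Python. The constant and function names are illustrative. The 400 cells/µl value is the final template concentration listed in the master mix table, and the 10-fold factor is the increase suggested by the qPCR assays, not a validated operating point.

```python
CELLS_PER_UL = 400        # final template concentration in the reaction
REACTION_VOLUME_UL = 100  # volume of one OIL-PCR reaction


def cells_per_reaction(cells_per_ul: int, volume_ul: int) -> int:
    """Total cells loaded into one reaction."""
    return cells_per_ul * volume_ul


def scaled_input(cells: int, fold: int) -> int:
    """Cell input after a fold increase in loading concentration."""
    return cells * fold


current = cells_per_reaction(CELLS_PER_UL, REACTION_VOLUME_UL)
proposed = scaled_input(current, 10)
print(current, proposed)  # 40000 400000
```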

OIL-PCR is a highly versatile platform that could be applied across fields to address a multitude of questions. While we were interested in plasmid-borne ARGs in the gut, the method could be used to target any gene of interest that is difficult to associate with a host using metagenomics. As mobile genetic elements are notoriously difficult to assemble due to their promiscuity, which complicates de Bruijn graph assembly (Antipov et al., 2019), this method could be applied to find the hosts of integrated and non-integrated mobile elements. Similarly, as metavirome sequencing has revealed a massive number of viral genomes with unknown hosts (Shkoporov and Hill, 2019), OIL-PCR may be particularly useful in addressing this gap in understanding. Additionally, viral and plasmid host-range is an important determinant for understanding and modeling bacterial ecology of predation and HGT (Flores et al., 2011). As we have shown here, OIL-PCR can detect direct physical associations of bacteria. Such interactions are important for understanding biofilm composition (Shi et al., 2020), identifying endosymbionts, or detecting cross-feeding bacteria which require the direct exchange of nutrients to grow (D'Souza et al., 2018; Goyal et al., 2021), information which could allow for culturing of these often unculturable species. In cases when physical associations are not of interest, samples may be filtered to remove clumped bacteria. Furthermore, targeting functional metabolic genes detected in metagenomes, but present at low abundance in bacterial communities, could identify novel bacteria involved in nutrient cycling, which has remained a persistent challenge in the field of bacterial ecology (Preheim et al., 2016). Finally, when combined with microfluidics, direct lysis of bacteria in an emulsion, as shown here, could be used to develop or simplify single-cell genome sequencing or single-cell RNA-seq for bacteria.

Reagent	Stock concentration	Final concentration	Volume (µl)
H₂O			to 100
DF Buffer	5×	1×	20
dNTPs	10 mM	250 µM	2.5
16S-R AP27	100 µM	2 µM	2
pForward	100 µM	1 µM	1–3
pfuse	10 µM	0.01 µM	0.1–0.3
MgCl₂	50 mM	1.5 mM	3
AmSulfate	100 mM	5 mM	5
DTT	100 mM	5 mM	5
BSA	20 mg/ml	4 mg/ml	20
Lysozyme full	variable	300 U/µl	0.792
Polymerase	2000 U/ml	100 U/ml	5
template	10⁴ cells/µl	400 cells/µl	4
Total			100
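The volumes in the table can be cross-checked with the standard dilution relation C1·V1 = C2·V2. The Python sketch below is illustrative only: the function name and the `REAGENTS` dictionary are not from the protocol, and the dNTP entry assumes a final concentration of 250 µM, which is what 2.5 µl of a 10 mM stock gives in a 100 µl reaction.

```python
import math


def stock_volume_ul(stock_conc: float, final_conc: float,
                    total_volume_ul: float = 100.0) -> float:
    """Volume of stock needed to reach final_conc in total_volume_ul.

    Both concentrations must be given in the same unit (C1*V1 = C2*V2).
    """
    if stock_conc <= 0:
        raise ValueError("stock concentration must be positive")
    return final_conc * total_volume_ul / stock_conc


# (stock, final) pairs in matching units, with the volume listed in the table.
REAGENTS = {
    "DF Buffer (x)":     (5.0, 1.0, 20.0),
    "dNTPs (uM)":        (10_000.0, 250.0, 2.5),  # 10 mM stock, assumed 250 uM final
    "16S-R AP27 (uM)":   (100.0, 2.0, 2.0),
    "MgCl2 (mM)":        (50.0, 1.5, 3.0),
    "AmSulfate (mM)":    (100.0, 5.0, 5.0),
    "DTT (mM)":          (100.0, 5.0, 5.0),
    "BSA (mg/ml)":       (20.0, 4.0, 20.0),
    "Polymerase (U/ml)": (2000.0, 100.0, 5.0),
}

for name, (stock, final, listed_ul) in REAGENTS.items():
    computed = stock_volume_ul(stock, final)
    status = "ok" if math.isclose(computed, listed_ul) else "MISMATCH"
    print(f"{name:20s} computed {computed:5.2f} ul, listed {listed_ul:5.2f} ul  {status}")
```

A check like this is a cheap way to catch unit slips when the master mix is scaled to a different reaction volume.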

Cycles	Time (min:s)	Temperature (°C)
	5:00	30
	10:00	95
38×	0:05	95
	0:30	54
	0:30	72
	2:00	72
	hold	4
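As a rough planning aid, the programmed duration of this cycling profile can be computed as below. This is a sketch under stated assumptions: the columns are read as cycle count, time (min:s), and temperature (°C); the 38× repeat is assumed to cover the 95/54/72 °C steps; ramp times and the final hold are ignored. The `PROGRAM` structure and function names are illustrative.

```python
def to_seconds(mmss: str) -> int:
    """Convert a 'min:s' string such as '0:30' to seconds."""
    minutes, seconds = mmss.split(":")
    return int(minutes) * 60 + int(seconds)


# Each entry: (number of repeats, [(time, temperature in deg C), ...]).
PROGRAM = [
    (1, [("5:00", 30)]),                                # initial incubation step
    (1, [("10:00", 95)]),                               # initial denaturation
    (38, [("0:05", 95), ("0:30", 54), ("0:30", 72)]),   # assumed cycled block
    (1, [("2:00", 72)]),                                # final extension
]


def programmed_seconds(program) -> int:
    """Sum of all programmed step times, excluding ramping and the 4 deg C hold."""
    return sum(
        repeats * sum(to_seconds(time) for time, _temp in steps)
        for repeats, steps in program
    )


total = programmed_seconds(PROGRAM)
print(divmod(total, 60))  # (58, 10), i.e. 58 min 10 s of programmed time
```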

Reagent	Stock concentration	Final concentration	Volume (µl)
H₂O			to 20
Luna Buffer	2×	1×	10
Nest Primer	100 µM	300 nM	0.06
16S-R AP28	100 µM	300 nM	0.06
Template			2–5

Cycles	Time (min:s) or ramp rate	Temperature (°C)
	2:00	95
38×	0:15	95
	0:15	55
	0:20	68
	1:00	65
	0.15 °C/s	95

OIL-PCR can specifically link plasmid-encoded genes with their hosts.

Extended spectrum beta-lactamase genes are associated with both pathogenic and commensal species.

R. timonensis strains associated with the three beta-lactamase genes appear over the patient’s time course.

Author details

Peter J Diebold
Felicia N New
Michael Hovan
Michael J Satlin
Ilana L Brito (corresponding author)
