Abstract
SARS-CoV-2 is spread through exhaled breath of infected individuals. A fundamental question in understanding transmission of SARS-CoV-2 is how much virus an individual is exhaling into the environment while they breathe, over the course of their infection. Research on viral load dynamics during COVID-19 infection has focused on internal swab specimens, which provide a measure of viral loads inside the respiratory tract, but not on breath. Therefore, the dynamics of viral shedding on exhaled breath over the course of infection are poorly understood. Here, we collected exhaled breath specimens from COVID-19 patients and used RTq-PCR to show that numbers of exhaled SARS-CoV-2 RNA copies during COVID-19 infection do not decrease significantly until day 8 from symptom-onset. COVID-19-positive participants exhaled an average of 80 SARS-CoV-2 viral RNA copies per minute during the first 8 days of infection, with significant variability both between and within individuals, including spikes over 800 copies a minute in some patients. After day 8, there was a steep drop to levels nearing the limit of detection, persisting for up to 20 days. We further found that levels of exhaled viral RNA increased with self-rated symptom-severity, though individual variation was high. Levels of exhaled viral RNA did not differ across age, sex, time of day, vaccination status or viral variant. Our data provide a fine-grained, direct measure of the number of SARS-CoV-2 viral copies exhaled per minute during natural breathing—including 312 breath specimens collected multiple times daily over the course of infection—in order to fill an important gap in our understanding of the time course of exhaled viral loads in COVID-19.
Introduction
SARS-CoV-2, the causative agent of COVID-19, spreads through exhaled breath during coughing, talking, singing, and breathing (1–7). Levels of SARS-CoV-2 over the course of infection have been extensively characterized in upper and lower respiratory tract specimens such as nasopharyngeal and oropharyngeal swabs (8–11, 38–39), whereas the dynamics of levels on breath over the course of infection remain virtually unexplored. This is despite the fact that quantifying levels of viral shedding on exhaled breath would allow for a direct approximation of the amount of virus an individual is shedding into the environment (12–17), thereby exposing others to risk of infection. We know particularly little about the dynamics of viral shedding on breath during unlabored natural breathing, which serves as a baseline for viral transmission on breath.
While the dynamics of viral load inside the host respiratory tract has direct relevance to viral pathology, dynamics of viral load on the host’s breath has direct relevance to infectiousness. Understanding the dynamics of viral shedding on breath is important for prevention of transmission of disease. Measuring viral load characteristics of the primary route of onward transmission is critical to inform isolation times in the clinic, where isolation consumes scarce resources, and to inform public health transmission control protocols. In addition, variables that impact level of viral shedding on breath remain unclear, but may vary by a multitude of factors including symptom severity, days since symptom onset, co-morbidities, viral genotype, and other unknowns. Understanding of these factors requires quantification of exhaled viral loads, which cannot be inferred from internal viral loads.
Current techniques for measuring viral load in exhaled breath have successfully detected SARS-CoV-2 in specimens (18–27), though with variable detection rates ranging from 26.9% to 86%. Recent work has also shown that SARS-CoV-2 can be isolated from exhaled breath, confirming that it contains replication-competent virus (17). However, prior work has focused on exertive breathing conditions (talking, singing, coughing), and we therefore have less understanding of viral loads in exhaled breath during natural breathing. Furthermore, prior work has not analyzed exhaled breath collected longitudinally over the course of infection relative to the day of symptom onset (potentially due to expense and lack of portability of breath collection devices), which would allow for a better understanding of the time course of changes in viral loads on breath. An inexpensive, portable device that allows patients to self-collect breath samples at home would facilitate research into factors that contribute to virus transmission from breath, how exhaled virus levels change over the course of infection, and whether therapeutics and other interventions reduce levels of exhaled virus.
Here, we developed a portable, disposable exhaled breath condensate collection device (EBCD) (Fig S1) and used it to collect 312 specimens from 60 COVID-19-tested outpatients who were treated at Northwestern Memorial Hospital (NMH). Specimens were analyzed using RT-qPCR. Our data set included breath specimens collected multiple times per day over the course of infection. We report numbers of SARS-CoV-2 RNA copies exhaled per minute, during natural breathing, over the course of infection and across a range of factors including self-reported symptom severity, age, sex, presence of co-morbidities, vaccination status and viral variant.
Results
Detection of SARS-CoV-2 RNA in exhaled breath
To collect exhaled breath specimens, we developed a new exhaled breath collection device (EBCD) that is inexpensive, portable and disposable, allowing at-home collection of samples by participants. The device consists of a repurposed syringe tube fitted with one-way valve and mouth piece (Fig S1). A cooling sleeve placed over the tube causes exhaled breath to condense on the inner wall during a 10-minute breath collecting session. Participants were instructed to breathe naturally out of their mouth through the tube at a normal pace. To validate our device as a tool to measure SARS-CoV-2 in exhaled breath, we collected specimens from 60 study participants recruited after receiving a clinical, PCR-based SARS-CoV-2 diagnostic test via nasopharyngeal swab (typically Abbott ID NOW platform; see Table 1 for patient demographics), within 10 days of symptom onset. We analyzed our specimens with RT-qPCR. SARS-CoV-2 RNA was detected in 43 of the 43 breath specimens collected from participants who were positive for COVID-19 by diagnostic testing conducted within the first 10 days from symptom onset (100%). SARS-CoV-2 RNA was not detected in any of the 16 participants who were negative for COVID-19 by diagnostic testing (100%) (Fig 1A). Thus, the correspondence between our test and the clinical diagnostic test was 100% accurate within the first 10 days of symptoms. The total number of SARS-CoV-2 RNA copies detected over the 10-minute breathing session ranged from 5 to 8757, with a mean of 834 (95% CI: [259, 1409]). To confirm the specificity of the amplified target, sequencing of resultant PCR product was performed on two specimens, validating proper target amplification. To confirm that the EBCD collects breath condensate without contamination from saliva, we performed viscosity measurements on EBCD samples collected during nasal breathing and oral breathing separately and compared them to viscosity measurements of saliva and water. If oral breath samples collected with EBCD were contaminated with saliva, then their viscosity should be increased compared to nasal breath samples. We found no statistical difference between the viscosity of nasal and oral breath samples, or between either sample type and water, validating our sample type as exhaled breath condensate (Fig S2). Thus, EBCD accurately detects SARS-CoV-2 RNA in exhaled breath.
Average numbers of SARS-CoV-2 RNA copies on breath did not decrease until day 8 from symptom onset
To examine the dynamics of exhaled viral loads over the course of infection, we invited COVID-19-positive individuals to participate in voluntary longitudinal home collection of additional exhaled breath specimens, typically two per day, for a duration determined by participant availability (ranging from 2 to 20 days, see Table S1 for the number of samples provided by each participant). At the time of each at-home sample collection session, participants recorded time of day, current symptoms, and subjective self-assessed overall symptom severity (resolved/none, mild, moderate, severe). This resulted in a total of 294 specimens, each associated with a day-since-symptom-onset and symptom-severity score. To examine the time-course of viral shedding on exhaled breath, we plotted SARS-CoV-2 viral copies against days since symptom onset for all samples, which showed that the number of SARS-CoV-2 viral copies exhaled per minute decreased sharply on day 8 (Fig 2A). To quantify this effect, we fitted the moving-average time course of viral RNA exhaled per minute to an S curve model (R2 = 0.7657, adjusted R2 = 0.7567), which yielded a modelled inflection point of maximal decrease at day 8 (RMSE: 0.3755) (Fig 2A, green dashed line). In a second confirmatory analysis, samples were binned according to their days from symptom onset (1–2, 3–4, 5–6, 7–8, 9–10, 11–12 days) and mean exhaled viral RNA copies were computed for each bin (bin:(mean [%95 CI]); 1–2 days: 37 [6, 68]; 3–4 days: 80 [43, 117]; 5–6 days: 58 [19, 97]; 7–8 days: 13 [4, 23]; 9–10 days: 2 [1, 2]; 11–12 days: 2 [1, 3]) (Fig 2B). Levels were similar across bins including days 1 to 6 (Tukey test, all P’s > 0.05), with the first significant difference observed between the third (days 5-6 from symptom onset) and fourth (days 7-8 from symptom onset) bins (Tukey test, P < 0.05). Taken together, these findings suggest that the number of viral RNA copies does not statistically decrease until day 8 from symptom onset. After day 8, exhaled viral RNA levels dropped near the threshold of detection, and in many cases to undetectable levels. Very low levels of viral RNA were intermittently detected up to 20 days from symptom onset, with occasional spikes in a few participants (Fig 2A). These findings suggest that in outpatients with a low rate of co-morbidities (Table 1), viral loads on breath are substantially decreased after day 8 from the onset of symptoms.
Vaccinated and unvaccinated individuals exhaled similar levels of SARS-CoV-2 RNA
Whether vaccinated individuals who experience breakthrough infection with COVID-19 shed lower levels of virus on breath is unknown. Our data set included 57 samples collected by 11 vaccinated participants with breakthrough infections (Table 1). We found that vaccinated and unvaccinated participants exhaled similar numbers of SARS-CoV-2 RNA copies (Fig 2C), when accounting for age, sex, presence of co-morbidities, days since symptom onset and symptom severity (Supplementary Appendix, Wilcoxon rank sum test, P = 0.31, z = 1.02) (Fig 3A). Viral RNA levels in 57 samples collected by vaccinated individuals on days 1 to 16 from symptom onset ranged from 0 to 549 exhaled copies per minute (%95 CI: [17, 78]) while viral loads for unvaccinated individuals on days 1 to 16 ranged from 0 to 876 exhaled copies per minute (%95 CI: [25, 55]).
Levels of exhaled SARS-CoV-2 RNA were similar across viral variants
Whether infection by different viral variants impacts levels of virus shed on breath is poorly understood. In particular, whether infection with later SARS-CoV-2 variants considered to be more transmissible, such as Delta and Omicron, produces higher levels of viral shedding on breath than earlier variants is unknown. Genome sequencing was performed on viral RNA extracted from 9 exhaled breath condensate samples from 7 participants in order to determine SARS-CoV-2 viral clade (Fig 1C). For all other participants, clade was putatively assigned based on the variant prevalence in Chicago over a rolling 7-day average at time of symptom onset (n = 35) or date of diagnosis (n = 2; P15 and P42) (Fig 1C). Our data set included 194 samples from individuals infected by early SARS-CoV-2 variants between April 2020 and March 2021 (clades 20A, 20B, 20C, and 20G), 35 samples from individuals infected by Delta between July 2021 and October 2021 (clade 21J), and 65 samples from individuals infected by Omicron in December 2021 and January 2022 (clade 21K) (Fig 1C, with additional detail in the online data supplement Table S1). Levels of viral shedding on breath did not differ between these three groups when accounting for age, sex, presence of co-morbidities, and days since symptom onset (Tukey’s test, P > 0.05) (Fig 3B). Viral loads on breath in individuals infected with the early variants ranged from 0 to 876 RNA copies exhaled per minute (%95 CI: [17, 48]), while viral loads in individuals infected with the Delta variant ranged from 0 to 549 RNA copies exhaled per minute (%95 CI: [28, 126]), and viral loads for individuals infected with the Omicron variant ranged from 0 to 264 RNA copies exhaled per minute (%95 CI: [4, 25]) (Fig 2D). In a second confirmatory analysis, we compared peak levels of viral shedding on breath over infection course in individuals infected with each variant. Peak levels of viral shedding on breath were similar across variants (One-way ANOVA; F2,41 = 2.56, P = 0.089) (Fig 1D).
Exhaled SARS-CoV-2 RNA levels increased with self-reported symptom severity
Relationships between viral load and symptom severity have been studied using internal swabs to estimate levels of viral RNA (9, 38). However, whether exhaled SARS-CoV-2 viral loads correlate with symptom severity has been unexplored. We examined the number of SARS-CoV-2 RNA copies exhaled per minute across self-reported symptom severity groups (group: (mean [%95 CI]); Resolved/None: 19 [6, 31]; Mild: 14 [6, 22]; Moderate 33 [12, 53]; Severe: 183 [76, 289]) (Fig 3C), accounting for age, sex, presence of co-morbidities, and days since symptom onset. On average, exhaled viral RNA copies were significantly higher in samples collected when participants reported that their symptoms were Severe compared to those collected when symptoms were reported as Resolved/None, Mild, or Moderate (Tukey test; Resolved/None: P = 0.002, Mild: P < 0.001, or Moderate: P = 0.005). However, there was considerable individual variation reflected by large confidence intervals, and as can be seen in Fig 3C. For example, some individuals who were asymptomatic at the time of specimen collection were exhaling around 400 viral copies per minute (Fig 3C, left-most column), highlighting the importance of understanding the factors that contribute to individual variation in levels of viral shedding on breath (39).
Discussion
Our study used a newly-developed portable exhaled breath condensate collection device to capture breath specimens from COVID-19-tested outpatient participants. We used this data to characterize the dynamics of viral load on exhaled breath from the onset of symptoms through day 20. We found that COVID-19 infected participants exhaled an average of 80 viral RNA copies per minute during the first 8 days of infection, with significant variability, including peaks as high as 876 copies per minute. The average amount of viral RNA being exhaled into the environment did not statistically decrease until day 8 following symptom onset, when we found a steep drop to levels near the limit of detection, which persisted for up to 20 days from symptom onset, and included occasional spikes in a few participants. We further found that participants who rated their symptoms as severe were exhaling significantly more viral RNA copies per minute than participants who rated their symptoms as mild or moderate, though with high individual variance. We found no impact of vaccination status, viral variant, age, sex, or presence of co-morbidities.
Given the long-term effects of COVID-19 infection (30–32), and the potential cumulative risks of re-infection (33), reducing the burden on society due to COVID-19 will require strategies for infection and re-infection prevention (34); characterization of the dynamics of exhaled viral loads—especially during natural, unlabored breathing—is important for better understanding of the risk of onward infection.
Our study has several limitations. First, we had a small number (n = 11) of vaccinated participants, most of whom received the same vaccine (BNT162b2, n = 7), limiting our ability to measure the effect of vaccination on viral shedding on breath. Notably, because vaccines became available later in our study period, our vaccinated participants were likely infected with a later-emerging variant of SARS-CoV-2, potentially impacting comparisons with earlier participants (17). That said, we did not find any differences in exhaled viral loads across viral variants in this study. Second, self-reported symptom severity data is variable and adds uncertainty to those findings. While a majority of participants (95%) were in an outpatient setting, assessing severity through the use of defined clinical severity scores would have yielded a more reliable measure.
Despite these limitations, our findings make a significant contribution to understanding the dynamics of COVID-19 infection in an outpatient population. The majority of research on the dynamics viral load have focused on hospitalized inpatient populations, typically characterizing viral dynamics from day of diagnosis, relative to clinically-assessed disease severity. In contrast, relatively little is known about viral dynamics among outpatients from time of symptom onset, relative to symptom severity (9), and dynamics of viral load on breath over the course of infection are poorly understood. Most COVID-19 infections are experienced as outpatients in household and community settings. Therefore, in order to inform community transmission mitigation strategies—the environment in which the vast majority of transmission occurs—we need to understand the dynamics of the virus in an outpatient population. Significant effort has gone into characterizing viral shedding dynamics, which has led to the development and improvement of models of transmission and methods of infection mitigation. Notably, however, the lack of congruency in viral load dynamics between different swab locations within individuals erodes confidence in respiratory tract viral loads as indicators of infectiousness (39). Here we add a new element to this effort by directly characterizing viral shedding on breath, the major vector of transmission of COVID-19.
Our findings can be used to demonstrate future feasibility of estimating the amount of time it takes for an individual to exhale an infectious dose of SARS-CoV-2: For example, assuming a conservative estimate of infectious dose for COVID-19 of 300 virions (35, 36), a person exhaling 900 viral RNA copies per minute (as did peak shedders in our study) could be conservatively estimated to exhale an infectious dose in around 20 seconds (not safe for an elevator ride), whereas a person who is exhaling 10 viral copies per minute could be estimated to exhale an infectious dose in over 30 minutes (probably ok for an elevator ride) (37). Along these same lines, once the relationship between exhaled viral RNA and replication-competent virus is established, if combined with rapid SARS-CoV-2 detection methods, our device could be used to assist in development of an at-home infectiousness test.
Understanding the dynamics of exhaled viral loads during natural breathing over the course of infection is an important step toward understanding onward of SARS-CoV-2. Our findings support the possibility that an individual becomes less likely to infect others through breath beyond day 8 after symptom onset, though further work is needed to confirm that infectivity parallels viral RNA levels on breath. Notably, it is unknown why some patients exhale large amounts of viral loads and others do not. The method used here exploits a low-cost, portable, disposable, and straightforward-to-use device, allowing participants to complete multiple, unsupervised, at-home sample collection sessions after a single brief training session. This method has the potential to pave the way for large-scale, individualized research into viral shedding on breath for COVID-19, and for other existing and yet-to-come respiratory pathogens.
Materials and methods
Cohort Description
This was a prospective observational study of participants recruited from the Northwestern Memorial Hospital (NMH) Emergency Department (ED) and Immediate Care Clinics (ICCs) in Chicago, Illinois from April 27, 2020 until January 8, 2022. Participants were identified after an order for COVID-19 testing was placed by the clinical care team and they were recruited by study staff as a convenience sample. Any adult (at least 18 years) undergoing COVID-19 testing in the NMH ED or ICCs was eligible for study enrollment regardless of symptom severity. In general, the COVID-19 testing procedure in the clinic included collection of a nasopharyngeal swab and analysis of the direct swab using Abbott ID NOW, a qualitative rapid non-PCR nucleic acid-based test which provides binary results (positive or negative). COVID-19-positive participants had a median age of 34 years (range 20 to 62 years) and 45.45% were female (Table 1). All participants were outpatients, with the exception of 2 ED patients who were admitted. All participants provided voluntary informed consent, and all research procedures were approved by the Northwestern University IRB, Study #STU00212292.
Exhaled breath condensate collection device
EBCD is a simple device that can be built for less than US $5.00 using readily available parts (Fig S1). It consists of a modified 50 CC catheter-tip syringe body and plunger, a plastic mouth piece (or nose piece), a one-way flutter valve/saliva trap and a cooling sleeve with insulating cover (Fig S1A). To assemble the device, the one-way valve is placed inside the mouth piece, oriented to prevent the participant from inhaling room air through the tube. The mouthpiece assembly is then placed into the wide end of the syringe body. The cooling sleeve with insulating cover is placed over the syringe body. To produce a sample, the participant simply breathes naturally and exhales through the EBCD for 10 minutes. As the participant exhales through the device, any saliva that may enter the tube is trapped by the valve casing while moisture in warm exhaled breath condenses on the cooled inner wall of the device, producing a liquid condensate sample (Fig S1B). After the breathing session, the valve and mouth piece are removed, the plunger is inserted, and the liquid sample is plunged into a vial, which contains 1 mL of transport medium (Fig S1C). The resulting sample can be mailed to the lab for analysis. A typical 10-minute breathing session yields around 1 mL of breath condensate.
The EBCD has oral and nasal breath collection configurations (Fig S2). For this study, because mouth pieces are cheaper and more readily commercially available, the device was configured with a mouth piece and participants provided oral breathing samples. Because we were interested in SARS-CoV-2 RNA levels in exhaled breath during natural breathing as a baseline condition of potential viral transmission, we instructed participants to breathe at a natural pace, and avoid talking, shouting, singing, or any other exertive breathing. Natural breathing should produce condensate samples with minimal liquid saliva contamination: Liquid saliva is distinct from saliva which is volatilized naturally during exhale; volatilized saliva is an expected constituent of our sample, while liquid saliva is a contaminant which can enter the sample via spitting or drooling. To avoid saliva contamination via drooling or spittle, the configuration of the mouthpiece and one-way valve were designed to prevent spitting or drooling into the device and to trap any minimal saliva that enters the mouthpiece as a liquid. We confirmed the efficacy of this design in eliminating liquid saliva contamination by performing viscosity measurements on control samples using an rheometer (Anton Paar MCR302) equipped with a Peltier temperature controller set to 25°C and a CP50-1 cone and plate fixture. If oral samples collected with our device are contaminated by saliva, we would expect oral samples to have higher viscosity compared to nasal samples. Two healthy participants produced 3 samples each: nasal and oral breath samples using the EBCD, and saliva samples by spitting into a sample tube, resulting in 6 samples. 0.6 mL of each sample was added to the Peltier stage at 25°C, and was subjected to a shear rate sweep from 5 to 500 s-1 at 5 s-1 increment. Three measurements were performed on each sample. MilliQ water was also measured for comparison. We found that nasal breath, oral breath and water were indistinguishable in terms of viscosity, with saliva having much higher viscosity (Fig S2). Thus EBCD oral breath samples were not contaminated by saliva.
Specimen collection
Participants were asked to provide an initial breath sample either in the clinic, from home following in-person study-team instruction on how to use the EBCD device, or from home following study-team instruction provided remotely over videoconferencing software or via online video. Participants were then invited to participate in voluntary longitudinal home collection of additional exhaled breath samples, typically two per day, for variable duration ranging from 2 to 20 days. Participant demographics were collected at baseline along with the date of symptom onset as determined by clinical notes and participant interviews. At the time of each sample collection, participants recorded time of day, symptoms, and subjective self-assessed overall symptom severity (resolved/none, mild, moderate, severe). Exhaled breath condensate samples were taken from 60 participants within an average of 19 hours (range 1–120, %95 CI: [13.3, 25.1]) after clinical diagnostic COVID-19 testing. A subset of 31 COVID-19-positive participants (70.5%) produced an additional 250 samples longitudinally from home (range 2–30, %95 CI: [6, 12]). In total, 60 participants produced 312 samples, two of which were unusable due to accidental spillage, leaving 310 samples for analysis.
Sample analysis
Samples were collected into vials containing 1 mL molecular transport medium (Primestore, Longhorn Vaccines and Diagnostics) and stored at 4°C. For analysis, the full volume of each sample, representing 10 minutes of natural breathing, was concentrated to a final volume of 200 µL using Amicon Ultra filters (3 kDa). RNA was extracted using the QIAamp MinElute virus spin kit using the full concentrated volume of each sample as an input, and using a 40 µL elution. Reverse-Transcriptase quantitative Polymerase Chain Reaction (RT-qPCR) was performed in triplicate, using TaqMan probes and N1-targeted primers (IDT 2019-nCoV kit, catalog #10006606) following the CDC 2019-Novel Coronavirus (2019-nCoV) Real-Time RT-PCR Diagnostic Panel (CDC-006-00019). Cycle threshold (CT) values were obtained using ThermoFisher QuantStudio Real-Time PCR software. The estimated number of viral copies represented by the CT was determined using a standard curve generated with a dilution series of the 2019-nCoV_N_positive control plasmid (IDT catalog #10006625) (Fig 1B). To convert CT value to the number of viral copies, we used Matlab’s polyfit function to fit a first-degree polynomial to the standard data. The data were modeled as y = a ∗ log(x) + b, where y is the CT value, x is the number of viral copies, a and b and are the polynomial coefficients (a = – 3.63, b = 38.51). Use of the full volume of sample collected over a 10-minute breathing session normalized all samples to represent 10 minutes of natural breathing. This allowed us to convert CT values to numbers of viral copies that were exhaled per minute over a ten-minute breathing session.
Sequencing library preparation, Illumina sequencing, and genome assembly
Complementary DNA (cDNA) synthesis was performed with the SuperScript IV First Strand Synthesis Kit (Thermo) using random hexamer primers according to manufacturer’s specifications. Direct amplification of viral genome DNA was performed as previously described using the Artic Network version 4 primers. Sequencing library preparation of genome amplicon pools was performed using the SeqWell plexWell 384 kit per manufacturer’s instructions. Pooled libraries of up to 96 genomes were sequenced on the Illumina MiSeq using the V2 500 cycle kit. Sequencing reads were trimmed to remove adapters and low-quality sequences using Trimmomatic v0.36. Trimmed reads were aligned to the reference genome sequence of SARS-CoV-2 (accession MN908947.3) using bwa v0.7.15. Pileups were generated from the alignment using samtools v1.9 and consensus sequence determined using iVar v1.2.2 with a minimum depth of 10, a minimum base quality score of 20, and a consensus frequency threshold of 0 (i.e., majority base as the consensus). SARS-CoV-2 clades were assessed using Nextclade (clades.nextstrain.org) and Pango lineages were assigned to the consensus sequences using pangolin software.
Putative clade designations
After determining viral clade for 7 participants by sequencing, we assigned a putative clade to each remaining participant based on time-matched SARS-CoV-2 sequencing data. We downloaded all available SARS-CoV-2 genome data from Chicago between April 2020 and February 2021 (n = 8,155 genomes) from GISAID (https://www.gisaid.org/) as of July 27th, 2022 (Shu et al., 2017). Clade designations for sequences were determined using Nextclade’s phylogenetic nomenclature system (Aksamentov et al., 2021) and a 7-day rolling average was used to estimate clade frequency by date. Participants were assigned putative clade designations based on the most prevalent clade at the date of symptom onset (when available, n = 35) or date of diagnosis (n = 2; P15 and P42). We performed all calculations in base R and visualized the data using the R package ggplot2 33.
Statistical analysis
To examine the effect of self-reported symptom severity on exhaled viral loads, we used a generalized linear mixed model to control for age, sex, presence of co-morbidities, and days since symptom onset. To examine the effect of days since symptom onset on exhaled viral loads, we used a generalized linear mixed model to control for age, sex, presence of co-morbidities, and self-reported symptom severity. Subsequent analysis of variance was performed, and pair-wise comparisons between self-reported symptom groups (Resolved/None, Mild, Moderate, Severe) and between two-day bins of days since symptom onset (days 1–2, 3–4, 5–6, 7–8, 9–10, 11–12) were performed using Tukey’s Honest Significant Difference test on the estimated marginal means of each group or bin. Statistical analyses were performed on the logarithmic-transformed number of viral RNA copies. Not Detected (ND) values were set to zero and a constant (1) was added to all values prior to log transformation. Statistical significance threshold was P = 0.05. To determine the timing of reduction of viral shedding on breath, longitudinal data were modeled using Matlab’s cftool toolbox, which showed a best-fit to the S-curve model. The inflection point of the modeled curve was considered to represent the time of steepest drop in viral RNA copy numbers. We used a Wilcoxon rank sum test to examine the difference in exhaled viral RNA copies between vaccinated and unvaccinated participants. Details of statistical analysis are included in the Supplementary Methods.
Data Availability
All data produced in the present study are available upon reasonable request to the authors
Acknowledgements
Thanks to Goodwin A. Lane for extensive contributions in study planning and data collection, and thanks to Carol O’Dea and Samuel Newton. This research was also supported in part through the computational resources and staff contributions provided for the Quest high performance computing facility at Northwestern University, which is jointly supported by the Office of the Provost, the Office for Research, and Northwestern University Information Technology. Clinical data collection was supported in part by the Northwestern Medicine Enterprise Data Warehouse. This work was supported by the Northwestern University NUSeq Core Facility. Viscosity measurements were performed in the Analytical bioNanoTechnology Equipment Core Facility (ANTEC) of the Simpson Querrey Institute for BioNanotechnology at Northwestern University. ANTEC is currently supported by the Soft and Hybrid Nanotechnology Experimental (SHyNE) Resource (NSF ECCS-2025633) and Feinberg School of Medicine, Northwestern University. The Simpson Querrey Institute for BioNanotechnology, Northwestern University Office for Research, U.S. Army Research Office, and the U.S. Army Medical Research and Materiel Command have also provided funding to develop this facility. A special thanks to the participants whose willingness to participate during their illness made this work possible.
Funding
This study was supported by the following funding: Robert J. Havey, MD Institute for Global Health Research Catalyzer Funding grant (C.Z and G.T.), NIH grant U19 AI135964 (E.A.O.), NIH grant R21 AI163912 (J.F.H.), Walder Foundation (J.F.H., E.A.O., R.L.R.).
Competing interests
JFH reports research support, paid to Northwestern University, from Gilead Sciences. MGI Reports research support, paid to Northwestern University, from AiCuris, GlaxoSmithKline, Janssen and Shire; he is a paid consultant for Adagio, AlloVir, Celltrion, Cidara, Genentech, Roche, Janssen, Shionogi, Viracor Eurofins; he is also a paid member of DSMBs from Janssen, Merck, SAB Biotherapeutics, Sequiris, Takeda and Vitaeris; he also receives royalties from UpToDate. CZ and GL filed pending patent application #WO2022/060917 A1 for EBCD.
Supplementary Materials for
This PDF file includes
Supplementary Text
Fig. S1
Fig. S2
Tables S1 to S8
References (1 to 2)
Supplementary Text
Cohort Description
This was a prospective observational study of participants recruited from the Northwestern Memorial Hospital (NMH) Emergency Department (ED) and Immediate Care Clinics (ICCs) in Chicago, Illinois from April 27, 2020 until January 8, 2022. Participants were identified after an order for COVID-19 testing was placed by the clinical care team and they were recruited by study staff as a convenience sample. Any adult (at least 18 years) undergoing COVID-19 testing in the NMH ED or ICCs was eligible for study enrollment regardless of symptom severity. In general, the COVID-19 testing procedure in the clinic included collection of a nasopharyngeal swab and analysis of the direct swab using Abbott ID NOW, a qualitative rapid non-PCR nucleic acid-based test which provides binary results (positive or negative). COVID-19-positive participants had a median age of 34 years (range 20 to 62 years) and 45.45% were female (Table 1). All participants were outpatients, with the exception of 2 ED patients who were admitted. All participants provided informed consent, and all research procedures were approved by the Northwestern University IRB, Study #STU00212292.
Exhaled breath condensate collection device
Exhaled breath condensate samples were collected using the Exhaled Breath Collection Device (EBCD), developed by our lab. EBCD is a simple device that can be built for less than US $5.00 using readily available parts (Fig S1). It consists of a modified 50 CC catheter-tip syringe body and plunger, a plastic mouth piece, a one-way flutter valve/saliva trap and a cooling sleeve with insulating cover (Fig S1A). To assemble the device, the one-way valve/saliva trap is placed inside the mouth piece, oriented to trap saliva while preventing the participant from inhaling room air through the tube. The mouthpiece assembly is then placed into the wide end of the syringe body. The cooling sleeve with insulating cover is placed over the syringe body. To produce a sample, the participant simply breathes naturally and exhales through the EBCD for 10 minutes. As the participant exhales through the device, moisture in warm exhaled breath condenses on the cooled inner wall of the device, producing a liquid condensate sample (Fig S1B). After the breathing session, the valve and mouth piece are removed, the plunger is inserted, and the liquid sample is plunged into a vial, which contains 1 mL of transport medium (Fig S1C). The resulting sample can be mailed to the lab for analysis. A typical 10-minute breathing session yields around 1 mL of breath condensate.
Specimen collection
Participants were asked to provide an initial breath sample either in the clinic, from home following in-person study-team instruction on how to use the EBCD device, or from home following study-team instruction provided remotely over videoconferencing software or via online video. Participants were then invited to participate in voluntary longitudinal home collection of additional exhaled breath samples, typically two per day, for variable duration ranging from 2 to 20 days. Participant demographics were collected at baseline along with the date of symptom onset as determined by clinical notes and participant interviews. At the time of each sample collection, participants recorded time of day, symptoms, and subjective self-assessed overall symptom severity (resolved/none, mild, moderate, severe). Exhaled breath condensate samples were taken from 60 participants within an average of 19 hours (range 1–120, %95 CI: [13, 25]) after clinical diagnostic COVID-19 testing (Fig 1A). A subset of 31 COVID-19-positive participants (70.5%) produced an additional 250 samples longitudinally from home (range 2–30, %95 CI: [6, 12]). In total, 60 participants produced 312 samples, two of which were unusable due to accidental spillage, leaving 310 samples for analysis.
Detailed list of steps for assembly and use of Exhaled Breath Collection Device
1 Prepare device:
Trim syringe tip to a length of approximately 6 mm.
Remove plunger from syringe body.
Insert flutter valve inside the mouth piece until the mouthpiece hits the stop on the flutter valve, which should be oriented such that air will flow from the smaller end through the larger end of the mouthpiece.
Insert the assembled mouthpiece approximately 15 mm into the large end of the syringe body.
Roll a cooling sleeve (> 2 hrs at –20 °C) over the syringe tube starting from the tip end.
Cover the cooling sleeve with the insulating cover.
2 Produce a sample:
Seated comfortably, place the mouth piece into the mouth.
Breathing naturally over ten minutes, inhale through the nose and exhale through the mouth and thus through the device.
Over the course of the session, keep the device horizontal.
3 Recover a sample:
Keeping the device horizontal, remove the mouthpiece assembly and cooling sleeve, and insert the syringe plunger.
Orient the tip of the device into the collection vial, and using a slow and controlled motion, plunge the liquid sample into the collection vial.
Sample analysis
Samples were collected into vials containing 1 mL molecular transport medium (Primestore, Longhorn Vaccines and Diagnostics) and stored at 4°C. For analysis, samples were concentrated to a final volume of 200 µL using Amicon Ultra filters (3 kDa). RNA was extracted using the QIAamp MinElute virus spin kit in a 40 µL elution. Reverse-Transcriptase quantitative Polymerase Chain Reaction (RT-qPCR) was performed in triplicate, using TaqMan probes and N1-targeted primers (IDT 2019-nCoV kit, catalog #10006606) following the CDC 2019-Novel Coronavirus (2019-nCoV) Real-Time RT-PCR Diagnostic Panel (CDC-006-00019). Cycle threshold (CT) values were obtained using ThermoFisher QuantStudio Real-Time PCR software. The estimated number of viral copies represented by the CT was determined using a standard curve generated with a dilution series of the 2019-nCoV_N_positive control plasmid (IDT catalog #10006625) (Fig 1B). To convert CT value to the number of viral copies, we used Matlab’s polyfit function to fit a first-degree polynomial to the standard data. The data were modeled as y = a ∗ log(x) + b, where y is the CT value, x is the number of viral copies, and a and b are the polynomial coefficients (a = – 3.63, b = 38.51). This allowed us to convert CT values to numbers of viral copies that were exhaled over a ten-minute breathing session.
Sequencing library preparation, Illumina sequencing, and genome assembly
Complementary DNA (cDNA) synthesis was performed with the SuperScript IV First Strand Synthesis Kit (Thermo) using random hexamer primers according to manufacturer’s specifications. Direct amplification of viral genome DNA was performed as previously described using the Artic Network version 4 primers. Sequencing library preparation of genome amplicon pools was performed using the SeqWell plexWell 384 kit per manufacturer’s instructions. Pooled libraries of up to 96 genomes were sequenced on the Illumina MiSeq using the V2 500 cycle kit. Sequencing reads were trimmed to remove adapters and low-quality sequences using Trimmomatic v0.36. Trimmed reads were aligned to the reference genome sequence of SARS-CoV-2 (accession MN908947.3) using bwa v0.7.15. Pileups were generated from the alignment using samtools v1.9 and consensus sequence determined using iVar v1.2.2 with a minimum depth of 10, a minimum base quality score of 20, and a consensus frequency threshold of 0 (i.e., majority base as the consensus). SARS-CoV-2 clades were assessed using Nextclade (clades.nextstrain.org) and Pango lineages were assigned to the consensus sequences using pangolin software.
Putative clade designations
After determining viral clade for 7 participants by sequencing, we assigned a putative clade to each remaining participant based on time-matched SARS-CoV-2 sequencing data. We downloaded all available SARS-CoV-2 genome data from Chicago between April 2020 and February 2021 (n = 8,155 genomes) from GISAID (https://www.gisaid.org/) as of July 27th, 2022 (1). Clade designations for sequences were determined using Nextclade’s phylogenetic nomenclature system (1) and a 7-day rolling average was used to estimate clade frequency by date. Participants were assigned putative clade designations based on the most prevalent clade at the date of symptom onset (when available, n = 35) or date of diagnosis (n = 2; P15 and P42). We performed all calculations in base R and visualized the data using the R package ggplot2 (2).
Statistics pertaining to Result 1: Average levels of SARS-CoV-2 RNA on breath decreased significantly around day 8 from symptom onset
To model the change of the number of viral copies across days since symptom onset, we fitted an S-Curve model to the average time course of viral RNA exhaled per minute. The average time course was smoothed using a moving average with a window of 3. The curve fitting was performed using Matlab’s cftool toolbox. The S-Curve model was y = a/(b + exp(C ∗ x)), where y is the log-transform of the number of viral RNA copies, x is the days since symptom onset, and a, b, and c are coefficients of the model. The inflection point of the modeled curve was considered to represent the time of steepest drop in viral RNA copies. The inflection point was calculated by first using the model parameters to find the fitted number of viral RNA copies from day 1 to day 20 with a step of 0.5 days, then calculating the point of maximal change from the fitted vector. The goodness of the model was qualified using R-squared and adjusted R-squared. We also compared the results to other models including Polynomial, Exponential, Fourier, Gaussian and Sum of Sine (Table S2).
Statistics pertaining to Result 2: Exhaled SARS-CoV-2 RNA levels did not differ in 11 vaccinated individuals
The number of exhaled SARS-CoV-2 RNA copies for vaccinated participants ranged from 0 to 549 (Mean: 47, %95 CI: [17, 78]). A Wilcoxon rank-sum test suggested that the number of exhaled viral RNA copies for unvaccinated participants (Range 0–876, Mean: 32, %95 CI: [18, 45]) was not statistically significant different from that of vaccinated participants (Wilcoxon rank sum test; P = 0.30, z = 1.03). In a second confirmatory analysis, we used a generalized linear mixed model to control for potential confounds (age, sex, the presence of comorbidity, days since symptom onset and self-reported symptom severity), followed by a post-hoc Tukey test on the estimated marginal means (P = 0.75)
Statistics pertaining to Result 3: Levels of exhaled SARS-CoV-2 RNA were similar across viral variants
To compare the number of exhaled SARS-CoV-2 RNA copies for samples infected by early SARS-CoV-2 variants between April 2020 and March 2021 (clades 20A, 20B, 20C, and 20G), Delta July 2021 and October 2021 (clade 21J), and Omicron in December 2021 and January 2022 (clade 21K), we used a generalized linear mixed model to control for potential confounds (age, sex, the presence of comorbidity, days since symptom onset and self-reported symptom severity). Post-hoc Tukey test revealed that the number of exhaled SARS-CoV-2 RNA copies were not statistically different across variants (P > 0.05; Table S3, S4)
Statistics pertaining to Result 4: Exhaled SARS-CoV-2 RNA levels increased with self-reported symptom severity
To examine the effect of self-reported symptom severity on exhaled viral loads, we used a generalized linear mixed model to control for age, sex, the presence of co-morbidities, and days since symptom onset. Subsequent analysis of variance was performed, and pair-wise comparisons between symptom groups (Resolved/None, Mild, Moderate, Severe) were performed using Tukey’s Honest Significant Difference test on the estimated marginal means of each group. Statistical analyses were performed on the logarithmic-transformed number of viral RNA copies. Not Detected (ND) values were set to zero and a constant 1 added to all values prior to log transformation. Statistical significance threshold was P = 0.05. The ANOVA analysis revealed a statistically significant effect of self-reported symptom severity on the number of SARS-CoV-2 RNA copies exhaled per minute (F3, 283 = 14.76, P < 0.001). Post-hoc Tukey’s test using the estimated marginal mean suggested that the number of viral RNA copies exhaled were statistically higher when the participants reported that their symptoms were Severe, as compared to Resolved/None (P = 0.002), Mild (P < 0.001), or Moderate (P = 0.005) (Table S5, S6).
In a second confirmatory analysis to examine the effect of days since symptom onset on exhaled viral RNA loads, samples were binned according to their days from symptom onset (1–2, 3–4, 5– 6, 7–8, 9–10, 11–12 days). We used a generalized linear mixed model to control for age, sex, presence of co-morbidities, and self-reported symptom severity. Subsequent analysis of variance was performed, and pair-wise comparisons between bins was performed using Tukey’s Honest Significant Difference test on the estimated marginal means of each bin. Statistical analyses were performed on the logarithmic-transformed number of viral RNA copies. Not Detected (ND) values were set to zero and a constant 1 was added to all values prior to log transformation. Statistical significance threshold was P = 0.05. The mean viral RNA copies across bins were compared using ANOVA analysis, revealing a main effect of bins (F5, 242 = 11.15, P < 0.001). Levels were similar across bins including days 1 to 6, with significant decreases evident with the bin containing days 7 to 8 (Tukey’s test; P < 0.05; Fig 2C; Tables S7, S8).
References
- 1.Ten scientific reasons in support of airborne transmission of SARS-CoV-2Lancet 397:1603–1605
- 2.Turbulent gas clouds and respiratory pathogen emissions: Potential implications for reducing transmission of COVID-19JAMA 323:1837–1838
- 3.Viral load of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in respiratory aerosols emitted by patients with coronavirus disease 2019 (COVID-19) while breathing, talking, and singingClin Infect Dis 74:1722–1728
- 4.Aerodynamic analysis of SARS-CoV-2 in two Wuhan hospitalsNature 582:557–560
- 5.Breathing is enough: For the spread of influenza virus and SARS-CoV-2 by breathing onlyJ Aerosol Med Pulm Drug Deliv 33:230–234
- 6.The airborne lifetime of small speech droplets and their potential importance in SARS-CoV-2 transmissionProc Natl Acad Sci 117:11875–11877
- 7.Aerosol and surface stability of SARS-CoV-2 as compared with SARS-CoV-1N Engl J Med 382:1564–1567
- 8.Temporal dynamics in viral shedding and transmissibility of COVID-19Nat Med 26:672–675
- 9.Trajectory of viral RNA load among persons with incident SARS-CoV-2 G614 infection (Wuhan Strain) in association with COVID-19 symptom onset and severityJAMA Netw Open 5
- 10.Estimating infectiousness throughout SARS-CoV-2 infection courseScience 373
- 11.Longitudinal analysis of SARS-CoV-2 vaccine breakthrough infections reveals limited infectious virus shedding and restricted tissue distributionOpen Forum Infect Dis 9
- 12.Particle sizes of infectious aerosols: implications for infection controlLancet Respir Med 8:914–924
- 13.Airborne Transmission of SARS-CoV-2JAMA 324:441–442
- 14.SARS-CoV-2: Viral loads of exhaled breath and oronasopharyngeal specimens in hospitalized patients with COVID-19Int J Infect Dis 110:105–110
- 15.Low exhaled breath droplet formation may explain why children are poor SARS-CoV-2 transmittersAerosol Air Qual Res 20:1513–1515
- 16.Isolation of SARS-CoV-2 from the air in a car driven by a COVID patient with mild illnessInt J Infect Dis 108:212–216
- 17.Evolution of SARS-CoV-2 shedding in exhaled breath aerosolsmedRxiv [preprint] https://doi.org/101101/2022072722278121
- 18.Multi-route transmission potential of SARS-CoV-2 in healthcare facilitiesJ Hazard Mater 402
- 19.Viable SARS-CoV-2 in the air of a hospital room with COVID-19 patientsInt J Infect Dis 100:476–482
- 20.Detection of SARS-CoV-2 in exhaled breath from non-hospitalized COVID-19-infected individualsSci Rep 12
- 21.Respiratory virus shedding in exhaled breath and efficacy of face masksNat Med 26:676–680
- 22.Exhaled breath is a significant source of SARS-CoV-2 emissionmedRxiv [preprint https://doi.org/101101/2020053120115154
- 23.Use of exhaled breath condensate (EBC) in the diagnosis of SARS-COV-2 (COVID-19)Thorax 76:86–88
- 24.RT-PCR diagnosis of COVID-19 from exhaled breath condensate: A clinical studyJ Breath Res 15
- 25.Variation in severe acute respiratory syndrome coronavirus 2 bioaerosol production in exhaled breathOpen Forum Infect Dis 9
- 26.Exhaled SARS-CoV-2 quantified by face-mask sampling in hospitalised patients with COVID-19J Infect 82:253–259
- 27.Breath-, air-and surface-borne SARS-CoV-2 in hospitalsJ Aerosol Sci 152
- 28.Approximate inference in generalized linear mixed modelsJ Am Stat Assoc 88:9–25
- 29.Comparing Individual Means in the Analysis of VarianceBiometrics 5:99–114
- 30.COVID-19 and olfactory dysfunction: a looming wave of dementia?J Neurophysiol 128:436–444
- 31.Persistent neurologic symptoms and cognitive dysfunction in non-hospitalized Covid-19 “long haulers.”Ann Clin Transl Neurol 8:1073–1085
- 32.Long-term neurologic outcomes of COVID-19Nat Med
- 33.Outcomes of SARS-CoV-2 reinfectionRes Sq [preprint https://doi.org/1021203/rs3.rs-1749502/v1
- 34.Burden of PCR-Confirmed SARS-CoV-2 reinfection in the U.S. veterans administration, March 2020 – January 2022medRxiv https://doi.org/101101/2022032022272571
- 35.Review of infective dose, routes of transmission and outcome of COVID-19 caused by the SARS-COV-2: comparison with other respiratory virusesEpidemiol Infect 149
- 36.Finding the infectious dose for COVID-19 by applying an airborne-transmission model to superspreader eventsPLoS One 17
- 37.Reducing aerosol transmission of SARS-CoV-2 in hospital elevatorsIndoor Air 30:1065–1066
- 38.Onset and window of SARS-CoV-2 infectiousness and temporal correlation with symptom onset: a prospective, longitudinal community cohort studyThe Lancet Respiratory Medicine 10:1061–1073
- 39.Daily longitudinal sampling of SARS-CoV-2 infection reveals substantial heterogeneity in infectiousnessNature Microbiology 7:640–652
- 1.GISAID: Global initiative on sharing all influenza data – from vision to realityEurosurveillance 22
- 2.ggplot2Wiley Interdiscip Rev Comput Stat 3:180–185
Article and author information
Author information
Version history
- Sent for peer review:
- Preprint posted:
- Reviewed Preprint version 1:
Copyright
© 2024, Lane et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 506
- downloads
- 20
- citations
- 0
Views, downloads and citations are aggregated across all versions of this paper published by eLife.