Prediction of SARS-CoV-2 transmission dynamics based on population-level cycle threshold values: An epidemic transmission and machine learning modeling study
Abstract
Polymerase chain reaction (PCR) cycle threshold (Ct) values can be used to estimate the viral burden of Severe Acute Respiratory Syndrome Coronavirus type 2 (SARS-CoV-2) and predict population-level epidemic trends. We investigated the use of epidemic transmission modeling and machine learning (ML) based on Ct value distribution for SARS-CoV-2 incidence prediction in British Columbia, Canada during an Omicron subvariant BA.1-predominant period from November 2021 to January 2022. Using real-world data, we developed an epidemic transmission model that was first validated on outbreak data, and subsequently fitted to province-level data to predict incidence. Using simulated data, we developed a ML pipeline including five models to predict the reproductive number as a measure of transmission potential based on Ct value distribution, and validated it on out-of-sample province-level data. The epidemic transmission model demonstrated accurate prediction with the real incidence falling within the 95% credible interval of the predicted MCMC chains for both the long-term care facility outbreak, and province-level data. The ML models demonstrated good performance with a median mean squared error (MSE) lower than 0.17 across all models, and improved performance with increasing sample size. The variability of the Ct distribution around the mean was the strongest predictor of the reproductive number. These modeling approaches demonstrated utility for incidence and reproductive number prediction, and have potential to complement traditional surveillance in real time to guide public health interventions.
Data availability
The genomic sequencing data are publicly available in GISAID under the submitter British Columbia Center for Disease Control Public Health Laboratory (BCCDC PHL). The individual level demographic and epidemiological data can be made accessible following the data governance and data access policy guidelines (http://www.bccdc.ca/about/accountability/data-access-requests). The code for this study is available separately (https://github.com/Afraz496/Vital-E-paper).
Article and author information
Author details
Funding
BCCDC Foundation for Public Health (Public Health Rapid SARS-CoV-2 Vaccine Research Initiative in BC)
- Catherine A Hogan
Genome British Columbia (Public Health Rapid SARS-CoV-2 Vaccine Research Initiative in BC)
- Catherine A Hogan
Michael Smith Health Research BC (Public Health Rapid SARS-CoV-2 Vaccine Research Initiative in BC)
- Catherine A Hogan
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Copyright
© 2026, Khan et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 0
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.