Investigating phenotypes of pulmonary COVID-19 recovery: A longitudinal observational prospective multicenter trial
Abstract
Background:
The optimal procedures to prevent, identify, monitor, and treat long-term pulmonary sequelae of COVID-19 are elusive. Here, we characterized the kinetics of respiratory and symptom recovery following COVID-19.
Methods:
We conducted a longitudinal, multicenter observational study in ambulatory and hospitalized COVID-19 patients recruited in early 2020 (n = 145). Pulmonary computed tomography (CT) and lung function (LF) readouts, symptom prevalence, and clinical and laboratory parameters were collected during acute COVID-19 and at 60, 100, and 180 days follow-up visits. Recovery kinetics and risk factors were investigated by logistic regression. Classification of clinical features and participants was accomplished by unsupervised and semi-supervised multiparameter clustering and machine learning.
Results:
At the 6-month follow-up, 49% of participants reported persistent symptoms. The frequency of structural lung CT abnormalities ranged from 18% in the mild outpatient cases to 76% in the intensive care unit (ICU) convalescents. Prevalence of impaired LF ranged from 14% in the mild outpatient cases to 50% in the ICU survivors. Incomplete radiological lung recovery was associated with increased anti-S1/S2 antibody titer, IL-6, and CRP levels at the early follow-up. We demonstrated that the risk of perturbed pulmonary recovery could be robustly estimated at early follow-up by clustering and machine learning classifiers employing solely non-CT and non-LF parameters.
Conclusions:
The severity of acute COVID-19 and protracted systemic inflammation is strongly linked to persistent structural and functional lung abnormality. Automated screening of multiparameter health record data may assist in the prediction of incomplete pulmonary recovery and optimize COVID-19 follow-up management.
Funding:
The State of Tyrol (GZ 71934), Boehringer Ingelheim/Investigator initiated study (IIS 1199-0424).
Clinical trial number:
ClinicalTrials.gov: NCT04416100
Editor's evaluation
This is an informative paper describing the incidence and predictors of long-term radiological and functional lung abnormalities following COVID-19. Congratulations on the importance of the work!
https://doi.org/10.7554/eLife.72500.sa0Introduction
The ongoing COVID-19 pandemic challenges health-care systems. As of December 2021, the John Hopkins dashboard (Dong et al., 2020) reports 276 million cases and 5.4 million COVID-19-related deaths worldwide (Johns Hopkins Coronavirus Resource Center, 2021). Although the vast majority of COVID-19 patients display mild disease, approximately 10–15% of cases progress to a severe condition and approximately 5% suffer from critical illness (Perez-Saez, 2021; Huang et al., 2020). Similar to severe acute respiratory syndrome (SARS) (Hui et al., 2005; Ng et al., 2004; Ngai et al., 2010; Lam et al., 2009), a significant portion of COVID-19 patients report lingering or recurring clinical impairment and cardiopulmonary recovery may take several months to years (Sonnweber et al., 2021; Sahanic et al., 2021; Caruso et al., 2021; Huang et al., 2021b; Huang et al., 2021a; Faverio et al., 2021; Hellemons et al., 2021; Zhou et al., 2021; Venkatesan, 2021). This observation has led to the introduction of the term ‘long COVID,’ defined by the persistence of COVID-19 symptoms for more than 4 weeks, and the ‘post-acute sequelae of COVID-19’ (PASC) referring to symptom persistence for more than 12 weeks (Sahanic et al., 2021; Shah et al., 2021; Sudre et al., 2021b). Evidence-based strategies for prediction, monitoring, and treatment of PASC are urgently needed (Raghu and Wilson, 2020).
We herein prospectively analyzed the prevalence of nonresolving structural and functional lung abnormalities and persistent COVID-19-related symptoms 6 months after diagnosis. Using univariate risk modeling as well as multiparameter clustering and machine learning (ML), we investigated sets of risk factors and tested the operability of ML classifiers at predicting protracted lung and symptom recovery. The classification and prediction procedures were implemented in an open-source risk assessment tool (https://im2-ibk.shinyapps.io/CovILD/).
Methods
Study design
The CovILD (‘Development of interstitial lung disease in COVID-19’) multicenter, longitudinal observational study (Sonnweber et al., 2021) was initiated in April 2020. Adult residents of Tyrol, Austria, with symptomatic, PCR-confirmed SARS-CoV-2 infection (WHO, 2021) were enrolled by the Department of Internal Medicine II at the Medical University of Innsbruck (primary follow-up center), St. Vinzenz Hospital in Zams, and the acute rehabilitation facility in Münster (Table 1). The participants were diagnosed with COVID-19 between 3 March and 29 June 2020. In course of the study, including the 2020 SARS-CoV-2 outbreak and follow-up visits, the regional health system was able to guarantee an unrestricted, optimal standard of diagnostics and care for all participants. Corticosteroids were not standard of care during the recruitment period of the study, thus were not administered as a therapy of acute COVID-19. Some participants with nonresolving pneumonia received systemic steroids beginning from week 4 post diagnosis at the discretion of the physician (Table 2). The analysis endpoints were the presence of any, mild (severity score ≤ 5), and moderate-to-severe (severity score > 5) lung computed tomography (CT) abnormalities, impaired lung function (LF), and persistent COVID-19 symptoms at the 180-day follow-up visit (Table 3).
In total, 190 COVID-19 patients were screened for participation. Thereof, n = 18 subjects refused to give informed consent, n = 27 declared difficulties to appear at the study follow-ups. Data of n = 145 participants were eligible for analysis (Figure 1). All participants gave written informed consent. The study was approved by the Institutional Review Board at the Medical University of Innsbruck (approval number: 1103/2020) and registered at ClinicalTrials.gov (NCT04416100).
Procedures
We retrospectively assessed patient characteristics during acute COVID-19 and performed follow-up investigations at 60 days (63 ± 23 days [mean ± SD]; visit 1), 100 days (103 ± 21 days; visit 2), and 180 days (190 ± 15 days; visit 3) after diagnosis of COVID-19. Each visit included symptom and physical performance assessment with a standardized questionnaire, LF testing, standard laboratory testing, and a CT scan of the chest. The variables available for analysis with their stratification schemes are listed in Appendix 1—table 1.
Serological markers were determined in certified laboratories (Central Institute of Clinical and Chemical Laboratory Diagnostics, Rheumatology and Infectious Diseases Laboratory, both at the University Hospital of Innsbruck). C-reactive protein (CRP), interleukin-6 (IL-6), N-terminal pro natriuretic peptide (NT-proBNP), and serum ferritin were measured using a Roche Cobas 8000 analyzer. D-dimer was determined with a Siemens BCS-XP instrument using the Siemens D-Dimer Innovance reagent. Anti-S1/S2 protein SARS-CoV-2 immunoglobulin gamma (IgG) were quantified with LIAISON chemoluminescence assay (DiaSorin, Italy), expressed as binding antibody units (BAU, conversion factor = 5.7) and stratified by quartiles (Ferrari et al., 2021).
Low-dose (100 kVp tube potential) craniocaudal CT scans of the chest were acquired without iodine contrast and without ECG gating on a 128-slice multidetector CT (128 × 0.6 mm collimation, 1.1 spiral pitch factor, SOMATOM Definition Flash, Siemens Healthineers, Erlangen, Germany). In case of clinically suspected pulmonary embolism, CT scans were performed with a contrast agent. Axial reconstructions were done with 1 mm slices. CT scans were evaluated for ground-glass opacities, consolidations, bronchial dilation, and reticulations as defined by the Fleischner Society. Lung findings were graded with a semi-quantitative CT severity score (0–25 points) (Sonnweber et al., 2021).
Impaired LF was defined as (1) forced vital capacity (FVC) < 80% or (2) forced expiratory volume in 1 s (FEV1) < 80%, or (3) FEV1:FVC < 70% or (4) total lung capacity (TLC) < 80% or (5) diffusing capacity of carbon monoxide (DLCO) < 80% predicted.
Statistical analysis
Statistical analyses were performed with R version 4.0.5 (Figure 1). Data transformation and visualization were accomplished by tidyverse (Wickham et al., 2019), ggplot2 (Wickham, 2016), ggvenn, plotROC (Sachs, 2017), and cowplot (Wilke, 2019) packages. The recorded variables were binarized as shown in Appendix 1—table 1. Acute COVID-19 severity strata were defined as presented in Table 1. p-Values were corrected for multiple comparisons with the Benjamini–Hochberg method (Benjamini and Hochberg, 1995), and effects were termed significant for p<0.05.
Variable overlap, kinetics, and risk modeling
Overlap between the 180-day follow-up outcome features was assessed by analysis of quasi-proportional Venn plots (package nVennR) (Pérez-Silva et al., 2018) and calculation of the Cohen’s κ statistic (package vcd) (Fleiss et al., 1969). Kinetics of binary outcome variables in participants subsets with the complete longitudinal data record was modeled with mixed-effect logistic regression (random effect: individual, fixed effect: time, packages lme4 [Bates et al., 2015] and lmerTest [Kuznetsova et al., 2017]). Analyses in the severity groups were done with separate models. Significance was assessed by the likelihood ratio test (LRT) against the random-term-only model. Univariate risk modeling was performed with fixed-effect logistic regression (Appendix 1—table 2). Odds ratio (OR) significance was determined by Wald Z test. In-house-developed linear modeling wrappers around base R tools are available at https://github.com/PiotrTymoszuk/lmqc.
Cluster analysis
Clustering of non-CT and non-LF binary clinical features (Appendix 1—table 1) was accomplished with PAM algorithm (partitioning around medoids, package cluster) (Amato et al., 2019) and simple matching distance (SMD, package nomclust) (Boriah et al., 2008). Association analysis for the participants was performed with a combined procedure involving clustering of the observations by the self-organizing map algorithm (SOM, 4 × 4 hexagonal grid, SMD distance, kohonen package), followed by clustering of the SOM nodes by the Ward.D2 hierarchical clustering algorithm (Euclidean distance, hclust() function, package stats) (Vesanto and Alhoniemi, 2000; Kohonen, 1995; Wehrens and Kruisselbrink, 2018). Clustering analyses were performed in the participant subset with the complete set of clustering variables. The selection of the optimal clustering algorithm was motivated by the highest ratio of between-cluster to total variance and the best stability measured by mean classification error in 20-fold cross-validation (CV) (Figure 6—figure supplement 1A and B, Figure 7—figure supplement 1A and B; Lange et al., 2004). The optimal cluster number was determined by the bend of the within-cluster sum-of-squares curve (function fviz_nbclust(), package factoextra) and by the stability in 20-fold CV (Figure 6—figure supplement 1C and D, Figure 7—figure supplement 1D and F; Lange et al., 2004; Wang, 2010), as well as by a visual inspection of the SOM node clustering dendrograms (Figure 7—figure supplement 1E). Assignment of 180-day follow-up outcome features to the clusters of clinical parameters was accomplished with a k-nearest neighbor (kNN) label propagation algorithm (Appendix 1—table 3; Sahanic et al., 2021; Leng et al., 2013). Cluster assignment visualization in a four-dimensional principal analysis score plot was done with the PCAproj() tool (package pcaPP) (Croux et al., 2007). To determine the importance of particular clustering variables, the variance (between-cluster to total variance ratio) between the initial cluster structure and the structure with random resampling of the variable was compared, as initially proposed for the random forests ML classifier (Breiman, 2001). Frequencies of the outcome events in the participant clusters were compared with χ2 test. In-house-developed association analysis wrappers are available at https://github.com/PiotrTymoszuk/clustering-tools-2.
Machine learning
ML classifiers C5.0 (package C50) (Quinlan, 1993), random forests (randomForest) (Breiman, 2001), support vector machines with radial kernel (kernlab) (Weston and Watkins, 1998), neural networks (nnet) (Ripley, 2014), and elastic net (glmnet) (Friedman et al., 2010) were trained to predict the 180-day follow-up outcomes employing non-CT and non-LF binary explanatory features (Appendix 1—table 1). The ML training was performed in the participant subsets with the complete set of explanatory and outcome variables. The training, optimization, and CV (20-fold, five repetitions) were accomplished by the train() tool from caret package, with the Cohen’s κ statistic as a model selection metric (Appendix 1—table 4; Kuhn, 2008). Classifier ensembles were constructed with the elastic net procedure (caretStack() function, caretEnsemble package, Appendix 1—table 4; Deane-Mayer and Knowles, 2019). Classifier performance in the training cohort and CV was assessed by receiver-operating characteristics (ROCs), Cohen’s κ and accuracy (packages caret and vcd, Appendix 1—table 5; Fleiss et al., 1969; Kuhn, 2008). Variable importance measures were extracted from the C5.0 (percent variable usage, c5imp() function, package C50) (Quinlan, 1993), random forests (Δ Gini index, importance(), package randomForest) (Breiman, 2001), and elastic net classifiers (regression coefficient β, coef(), package glmnet) (Friedman et al., 2010).
Pulmonary recovery assessment app
Participant clustering and ML classifiers trained in the CovILD cohort were implemented in an open-source online pulmonary assessment R shiny app (https://im2-ibk.shinyapps.io/CovILD/; code: https://github.com/PiotrTymoszuk/COVILD-recovery-assessment-app). Prediction of the cluster assignment based on the user-provided patient data is done by the kNN label propagation algorithm (Sahanic et al., 2021; Leng et al., 2013).
Results
Patient characteristics
The CovILD study participants (n = 145) were predominantly male (57.8%), age ranging between 19 and 87 years. 77.2% of participants displayed preexisting comorbidity, predominantly cardiovascular and metabolic disease. The cohort included mild (outpatient care, 24.8%), moderate (hospitalization without oxygen supply, 25.5%), severe (hospitalization with oxygen supply, 27.6%), and critical (intensive care unit [ICU] treatment, 22.1%) cases of acute COVID-19 (Table 1). The majority of hospitalized participants received anti-infectives during acute COVID-19, anticoagulative, and/or antiplatelet treatment introduced primarily in the ventilated patients. Systemic steroid administration was initiated at the discretion of the physician beginning from week 4 after diagnosis (Table 2).
Clinical recovery after COVID-19
Most patients, irrespective of the acute COVID-19 severity, showed a significant resolution of disease symptoms over time (Figure 1, Figure 2A). Persistent complaints at the 6-month follow-up were reported by 49% of the study subjects (Table 3), with self-reported impaired physical performance (34.7%), sleep disorders (27.1%), and exertional dyspnea (22.8%) as leading manifestations. The frequency of all investigated symptoms declined significantly, even though the pace of their resolution was remarkably slower in the late (100- and 180-day follow-ups) than in the early recovery phase (acute COVID-19 till 60-day follow-up) (Figure 2B).
Impaired LF was observed in 33.6% of the participants at the 6-month follow-up (Table 3). Except for the critical COVID-19 survivors (60 days: 66.7%; 180 days post-COVID-19: 50%), no significant reduction in the frequency of LF impairment over time was observed (Figure 3). At the 6-month follow-up, structural lung abnormalities were found in 48.5% of patients and moderate-to-severe radiological lung alterations (CT severity score > 5) were present in 19.4% of participants (Table 3). The majority of the participants with impaired LF displayed radiological lung findings. However, a substantial fraction of CT abnormalities, especially mild ones, were accompanied neither by persistent symptoms nor by LF deficits (Figure 3—figure supplement 1, Figure 3—figure supplement 2, Figure 3—figure supplement 3A).
The frequency, scoring, and recovery of CT lung findings were related to the severity of acute infection. Pulmonary lesions scored > 5 CT severity points at the 180-day follow-up were most frequent in the individuals with severe and critical acute COVID-19 (Figure 3—figure supplement 3). Notably, the hospitalized group with oxygen therapy demonstrated the fastest recovery kinetics. As for the symptom resolution, LF and CT lung recovery decelerated in the late phase of COVID-19 convalescence (Figure 3).
Risk factors of protracted recovery
To identify risk factors of delayed recovery at the 6-month follow-up, we screened a set of 52 binary clinical parameters (Appendix 1—table 1) recorded during acute COVID-19 and at the 60-day visit by univariate modeling (Appendix 1—table 2). By this means, no significant correlates for long-term symptom persistence were identified. Risk factors and readouts of severe and critical COVID-19 including multimorbidity, malignancy, male sex, prolonged hospitalization, ICU stay, and immunosuppressive therapy were significantly associated with persistent CT (Figure 4) and LF abnormalities (Figure 5). Persistently elevated inflammatory markers, IL-6 (>7 ng/L) and CRP (>0.5 mg/L), were strong unfavorable risk factors for incomplete radiological and functional pulmonary recovery. Additionally, the biochemical readout of microvascular inflammation, D-dimer (>500 pg/mL) was significantly linked to LF deficits. Low serum anti-S1/S2 IgG titers at the 60-day follow-up and ambulatory acute COVID-19 correlated with an improved pulmonary recovery (Figures 4 and 5).
Clusters of clinical features linked to persistent symptoms and lung abnormalities
Employing the unsupervised PAM algorithm (Amato et al., 2019), three clusters of co-occurring non-CT and non-LF clinical features of acute COVID-19 and early convalescence (Appendix 1—table 1) were identified (Figure 6—figure supplement 1, Appendix 1—table 3): (1) cluster 1 with male sex, hypertension, and cardiovascular and metabolic comorbidity; (2) cluster 2, including characteristics of acute COVID-19 severity and inflammatory markers; and (3) cluster 3 consisting of acute and persistent COVID-19 symptoms (Figure 6—figure supplement 2, Appendix 1—table 3).
The 6-month follow-up outcome variables were incorporated in the cluster structure using kNN prediction (Leng et al., 2013). Long-term symptom persistence was associated with acute and long-lasting COVID-19 symptoms in cluster 3, whereas pulmonary outcome parameters were grouped with cluster 2 features (Figure 6A, Figure 6—figure supplement 2, Appendix 1—table 3). Preexisting comorbidities such as malignancy, kidney, lung and gastrointestinal disease, obesity, and diabetes were found the closest cluster neighbors of mild CT abnormalities (severity score ≤ 5). Moderate-to-severe structural alterations (severity score > 5) and LF deficits were, in turn, tightly linked to markers of protracted systemic inflammation (IL-6, CRP, anemia of inflammation) (Sonnweber et al., 2020; Figure 6B).
Risk stratification for perturbed pulmonary recovery by unsupervised clustering
Next, we tested whether subsets of patients at risk of an incomplete 6-month recovery may be defined by a similar clustering procedure employing exclusively non-CT and non-LF clinical variables (Appendix 1—table 1). Applying a combined SOM – hierarchical clustering approach, three clusters of the study participants were identified (Figure 7, Figure 7—figure supplement 1; Vesanto and Alhoniemi, 2000; Kohonen, 1995). Prolonged hospitalization, anti-infective therapy, overweight or obesity, pain during acute COVID-19, and low anti-S1/S2 titers at the 60-day follow-up were found the most influential clustering features (Figure 7—figure supplement 2; Breiman, 2001). The patient subsets identified by the SOM approach differed significantly in frequency of radiological lung abnormalities and substantially, yet not significantly, in the frequency of LF impairment at the 180-day follow-up. In particular, most of the individuals assigned to the largest, low-risk (LR) subset were CT and LF abnormality-free. The frequency and severity of radiological pulmonary findings were elevated in the smallest intermediate-risk subset (IR) and peaked in the high-risk (HR) group (Figure 8A). Despite a comparable frequency of long-term symptoms between the LR, IR, and HR subsets (Figure 8A), the HR collective showed the lowest prevalence of dyspnea, cough, night sweating, pain, gastrointestinal manifestations, and complete absence of hyposmia at the 180-day follow-up (Figure 8B). Although the LR subset primarily comprised mild COVID-19 cases and the HR subset ICU survivors, the cluster assignment (IR vs. LR, HR vs. LR) remained an independent correlate of persistent CT and LF abnormalities after adjustment for the acute COVID-19 severity (Figure 8—figure supplement 1).
Prediction of persistent symptoms and pulmonary abnormalities by machine learning
Finally, we investigated if the 6-month follow-up outcome may be predicted by ML classifiers trained with a set of non-CT and non-LF variables recorded during acute COVID-19 and at the 60-day follow-up (Appendix 1—table 1). To this end, five technically unrelated ML classifiers were tested (Appendix 1—table 4; Kuhn, 2008): C5.0 (Quinlan, 1993), random forests (RF) (Breiman, 2001), support vector machines with radial kernel (SVM-R) (Weston and Watkins, 1998), shallow neural network (Nnet) (Ripley, 2014), and elastic net generalized linear regression (glmNet) (Friedman et al., 2010). In addition, the single classifiers with varying outcome-specific accuracy (Figure 9—figure supplement 1) were bundled into ensembles by the elastic net procedure (Figure 9—figure supplement 2, Appendix 1—table 4; Kuhn, 2008; Deane-Mayer and Knowles, 2019). Finally, the classifier and ensemble performance was investigated in the training cohort and 20-fold CV by ROC (Appendix 1—table 5).
All tested ML algorithms and ensembles demonstrated good accuracy (area under the curve [AUC] > 0.78) and sensitivity (>0.84) at predicting any lung CT abnormalities at the 6-month follow-up in the study cohort serving as a training data set. Their efficiency in CV was moderate (AUC: 0.69–0.81; sensitivity: 0.69–0.78) (Figure 9, Figure 9—figure supplement 3, Appendix 1—table 5). In turn, moderate-to-severe structural lung findings were recognized with markedly lower sensitivity both in the training data set (>0.43) and the CV (0.39–0.48). Even though impaired LF and persistent symptoms were common at the 6-month follow-up in the training data set (Figures 2 and 3), nearly half of the cases were not identified by any of the tested ML algorithms and their ensembles in the CV setting (Figure 9, Figure 9—figure supplement 3, Appendix 1—table 5). The sensitivity of the ensembles and single classifiers at predicting CT and LF abnormalities was substantially better in severe and critical COVID-19 survivors than in ambulatory and moderate cases (Figure 10, Appendix 1—table 6).
The most important explanatory variables for pulmonary abnormalities by three unrelated classifiers (C5.0, RF, and glmNet) included preexisting malignancy, multimorbidity, markers of systemic inflammation (IL-6 and CRP), and anti-S1/S2 antibody levels at the 60-day follow-up (Figure 9—figure supplement 4, Figure 9—figure supplement 5, Figure 9—figure supplement 6). The highly influential parameters at prediction of symptoms at the 180-day follow-up encompassed symptom presence at the 60-day follow-up, as well as obesity and dyspnea during acute COVID-19 (Figure 9—figure supplement 7).
Discussion
Herein, we prospectively evaluated trajectories of COVID-19 recovery in an observational cohort enrolled in the Austrian CovILD study (Sonnweber et al., 2021). Despite the resolution of symptoms and pulmonary abnormalities at the 6-month follow-up in a large fraction of the study participants, the recovery pace was substantially slower in the late convalescence when compared with the first three months after diagnosis (Sonnweber et al., 2021; Huang et al., 2021a). Persistent symptoms and CT findings were detected in more than 40% and reduced LF in approximately one-third of the cohort, which is in line with recovery kinetics and signs of lung lesion chronicity reported by others (Caruso et al., 2021; Huang et al., 2021b; Huang et al., 2021a; Faverio et al., 2021; Hellemons et al., 2021; Zhou et al., 2021). By comparison, similar protracted pulmonary recovery was reported for SARS (Hui et al., 2005; Ng et al., 2004; Ngai et al., 2010; Lam et al., 2009) and non-COVID-19 acute respiratory distress syndrome (Wilcox et al., 2013; Masclans et al., 2011). Of note, treatment approaches for hospitalized patients in our cohorts and similar cohorts recruited at the pandemic onset in early 2020 (Caruso et al., 2021; Huang et al., 2021b; Huang et al., 2021a; Faverio et al., 2021; Hellemons et al., 2021) differ significantly from the current standard of care for acute COVID-19, which includes early systemic steroid use and antiviral and various immunomodulatory medications. How improved standardized therapy and anti-SARS-CoV-2 vaccination affect the clinical and pulmonary recovery needs to be investigated.
In roughly half of our study participants with abnormal lung CT findings, and especially in those with low-grade structural abnormalities, no overt LF impairment at follow-up was discerned. Still, even subclinical lung alterations may bear the potential for clinically relevant progression of interstitial lung disease (Suliman et al., 2015; Hatabu et al., 2020) requiring systematic CT and LF monitoring. Conversely, symptom persistence was weakly associated with incomplete functional or structural pulmonary recovery.
Since PASC are found in as many as 10% of COVID-19 patients (Sahanic et al., 2021; Venkatesan, 2021; Sudre et al., 2021b), robust, resource-saving tools assessing the individual risk of pulmonary complications are urgently needed (Shah et al., 2021; Raghu and Wilson, 2020). Covariates and characteristics of severe acute COVID-19 such as male sex, age, and preexisting comorbidities, hospitalization, ventilation, and ICU stay were proposed as the risk factors of persistent pulmonary impairment (Sonnweber et al., 2021; Caruso et al., 2021; Huang et al., 2021a; Faverio et al., 2021; Raghu and Wilson, 2020). However, their applicability in predicting complications of pulmonary recovery from mild or moderate COVID-19 is limited. Our results of univariate modeling, clustering, and ML prediction point towards a distinct long-term pulmonary risk phenotype that manifests during acute COVID-19 and early recovery and whose central components are protracted systemic (IL-6, CRP, anemia of inflammation) and microvascular inflammation (D-dimer), and strong humoral response (anti-S1/S2 IgG) demographic risk factors and comorbidities (Sonnweber et al., 2020). Hence, consecutive monitoring of systemic inflammatory parameters analogous to concepts of interstitial lung disease in autoimmune disorders (Khanna et al., 2020) and anti-S1/S2 antibody levels may improve identification of the individuals at risk of chronic pulmonary damage irrespective of the acute COVID-19 severity.
Clustering and ML have been employed for deep phenotyping and predicting acute and post-acute COVID-19 outcomes in multivariable data sets (Sahanic et al., 2021; Sudre et al., 2021a; Estiri et al., 2021; Demichev et al., 2021; Benito-León et al., 2021). We demonstrate that subsets of COVID-19 patients that significantly differ in the risk for long-term CT abnormalities may be defined by an easily accessible clinical parameter set available at the early post-COVID-19 assessment. This approach did not involve any CT or LF variables. Furthermore, the cluster classification correlated with the risk of long-term pulmonary abnormalities independently of the acute COVID-19 severity. Thus, these characteristics provide a useful tool for broad screening of convalescent populations, including individuals who experienced mild or moderate COVID-19.
We show that technically unrelated ML classifiers and their ensemble trained without CT and LF explanatory variables can predict lung CT findings independently of their grading at the 6-month follow-up with good specificity and sensitivity in the training collective and CV. By contrast, the more specific prediction of moderate-to-severe lung CT or risk estimation for LF deficits demonstrated a limited sensitivity. For the moderate-to-severe CT abnormalities, this can be primarily traced back to their low frequency resulting in a suboptimal classifier training, especially in CV. A substantial fraction of the participants (20.7%, n = 30) suffered from a preexisting respiratory condition (pulmonary disease, asthma, or COPD) likely paralleled by LF reduction, which possibly confounded the prediction of the post-COVID-19 LF deficits both by clustering and ML. Accumulating evidence suggests that post-acute COVID-19 symptoms are highly heterogeneous conditions with multiorgan, neurocognitive, and psychological manifestations (Sahanic et al., 2021; Evans et al., 2021; Davis et al., 2021), which may differ in risk factor constellations. This could explain why univariate modeling, clustering, and ML failed to estimate persistent symptom risk in our small study cohort. In general, the ML prediction quality may greatly benefit from a larger training data set and inclusion of additional explanatory variables such as cellular readouts of inflammation, in-depth medication, and broader acute symptom data. Nevertheless, the herein described cluster- and ML classifiers represent resource-effective tools that may assist in the screening of medical record data and identification of COVID-19 patients requiring systematic CT and LF monitoring. To facilitate the identification of patients at risk for protracted respiratory recovery and enable validation in an external collective, we implemented the clustering and prediction procedures in an open-source risk assessment application (https://im2-ibk.shinyapps.io/CovILD/).
Our study bears limitations primarily concerning the low sample size and the cross-sectional character of the trial. Because of the impaired availability of the patients and the prolonged inpatient rehabiliation, the 60- and 100-day follow-up visits in part showed a temporal overlap that may have impacted the accuracy of the longitudinal data. Missingness of the consecutive outcome variable record and the participant dropout, particularly of mild and moderate COVID-19 cases, may have also potentially confounded the participant clustering results and ML risk estimation for CT abnormalities and LF impairment since prolonged hospitalization was found to be a crucial cluster-defining and influential explanatory feature. Additionally, even though the reproducibility of the risk assessment algorithms was partially addressed by CV, cluster and ML classifiers call for verification in a larger, independent multicenter collective of COVID-19 convalescents.
In summary, in our CovILD study cohort we found a high frequency of CT and LF abnormalities and persistent symptoms at the 6-month follow-up, and a flattened recovery kinetics after 3 months post-COVID-19. Systematic risk modeling reveled a set of clinical variables linked to protracted pulmonary recovery apart from the severity of acute infection such as inflammatory markers, anti-S1/S2 IgG levels, multimorbidity, and male sex. We demonstrate that clustering and ML classifiers may help to identify individuals at risk of persistent lung lesions and to relocate medical resources to prevent long-term disability.
Appendix 1
Data availability
The complete R analysis pipeline and the anonymized study data in form of stratified study variables are available as a public GitHub repository: https://github.com/PiotrTymoszuk/CovILD_6_Months (copy archived at swh:1:rev:df521ede1d284e074a0484d3e4d0ce71097d00c3). The R code for the key tools used for uni-variate modeling and model quality control (Figures 4 and 5, https://github.com/PiotrTymoszuk/lmqc; copy archived at swh:1:rev:a020119d8f23b60901115c5c2ce6f6c71998ed31), cluster analysis and its quality control (Figures 6–7, https://github.com/PiotrTymoszuk/clustering-tools-2; copy archived at swh:1:rev:64141197ca28838a8978dce9093443537157d79f) and the risk assessment applicaiton (https://github.com/PiotrTymoszuk/COVILD-recovery-assessment-app; copy archived at swh:1:rev:95f02215f4c13425d3b76f6a13b7862a53279ab9) is available at GitHub. Source data for Figures 2–10 has been included as Source data 1.
References
-
Faster K-Medoids Clustering: Improving the PAM, CLARA, and CLARANS171–187, Similarity Search and Applications, Faster K-Medoids Clustering: Improving the PAM, CLARA, and CLARANS, Cham, Springer, 10.1007/978-3-030-32047-8.
-
Fitting linear mixed-effects models using lme4Journal of Statistical Software 67:1–48.
-
Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal StudyJournal of Medical Internet Research 23:e25988.https://doi.org/10.2196/25988
-
Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple TestingJournal of the Royal Statistical Society 57:289–300.https://doi.org/10.1111/j.2517-6161.1995.tb02031.x
-
ConferenceProceedings of the 2008 SIAM International Conference on Data MiningSimilarity Measures for Categorical Data: A Comparative Evaluation. pp. 243–254.https://doi.org/10.1137/1.9781611972788.22
-
Algorithms for Projection–Pursuit robust principal component analysisChemometrics and Intelligent Laboratory Systems 87:218–225.https://doi.org/10.1016/j.chemolab.2007.01.004
-
An interactive web-based dashboard to track COVID-19 in real timeThe Lancet. Infectious Diseases 20:533–534.https://doi.org/10.1016/S1473-3099(20)30120-1
-
Six-Month Pulmonary Impairment after Severe COVID-19: A Prospective, Multicentre Follow-Up StudyRespiration; International Review of Thoracic Diseases 100:1078–1087.https://doi.org/10.1159/000518141
-
Harmonization of six quantitative SARS-CoV-2 serological assays using sera of vaccinated subjectsClinica Chimica Acta; International Journal of Clinical Chemistry 522:144–151.https://doi.org/10.1016/j.cca.2021.08.024
-
Large sample standard errors of kappa and weighted kappaPsychological Bulletin 72:323–327.https://doi.org/10.1037/h0028106
-
Regularization Paths for Generalized Linear Models via Coordinate DescentJournal of Statistical Software 33:1–22.https://doi.org/10.18637/jss.v033.i01
-
Interstitial lung abnormalities detected incidentally on CT: a Position Paper from the Fleischner SocietyThe Lancet. Respiratory Medicine 8:726–737.https://doi.org/10.1016/S2213-2600(20)30168-5
-
Clinical features of patients infected with 2019 novel coronavirus in Wuhan, ChinaLancet (London, England) 395:497–506.https://doi.org/10.1016/S0140-6736(20)30183-5
-
6-month consequences of COVID-19 in patients discharged from hospital: a cohort studyLancet (London, England) 397:220–232.https://doi.org/10.1016/S0140-6736(20)32656-8
-
1-year outcomes in hospital survivors with COVID-19: a longitudinal cohort studyLancet (London, England) 398:747–758.https://doi.org/10.1016/S0140-6736(21)01755-4
-
Etiology, Risk Factors, and Biomarkers in Systemic Sclerosis with Interstitial Lung DiseaseAmerican Journal of Respiratory and Critical Care Medicine 201:650–660.https://doi.org/10.1164/rccm.201903-0563CI
-
Building predictive models in R using the caret packageJournal of Statistical Software 28:1–26.https://doi.org/10.18637/jss.v028.i05
-
lmerTest Package: Tests in Linear Mixed Effects ModelsJournal of Statistical Software 82:1–26.https://doi.org/10.18637/jss.v082.i13
-
Mental morbidities and chronic fatigue in severe acute respiratory syndrome survivors: long-term follow-upArchives of Internal Medicine 169:2142–2147.https://doi.org/10.1001/archinternmed.2009.384
-
Stability-based validation of clustering solutionsNeural Computation 16:1299–1323.https://doi.org/10.1162/089976604773717621
-
Adaptive Semi-Supervised Clustering Algorithm with Label PropagationJournal of Software Engineering 8:14–22.https://doi.org/10.3923/jse.2014.14.22
-
Serology-informed estimates of SARS-CoV-2 infection fatality risk in Geneva, SwitzerlandThe Lancet. Infectious Diseases 21:e69–e70.https://doi.org/10.1016/S1473-3099(20)30584-3
-
nVenn: generalized, quasi-proportional Venn and Euler diagramsBioinformatics (Oxford, England) 34:2322–2324.https://doi.org/10.1093/bioinformatics/bty109
-
BookC4.5: Programs for Machine LearningSan Francisco, CA, USA: Morgan Kaufmann Publishers Inc.
-
COVID-19 interstitial pneumonia: monitoring the clinical course in survivorsThe Lancet. Respiratory Medicine 8:839–842.https://doi.org/10.1016/S2213-2600(20)30349-0
-
BookPattern Recognition and Neural NetworksCambridge: Cambridge University Press.https://doi.org/10.1017/CBO9780511812651
-
plotROC: A Tool for Plotting ROC CurvesJournal of Statistical Software 79:1–19.https://doi.org/10.18637/jss.v079.c02
-
Managing the long term effects of covid-19: summary of NICE, SIGN, and RCGP rapid guidelineBMJ (Clinical Research Ed.) 372:136.https://doi.org/10.1136/bmj.n136
-
Cardiopulmonary recovery after COVID-19: an observational prospective multicentre trialThe European Respiratory Journal 57:2003481.https://doi.org/10.1183/13993003.03481-2020
-
Attributes and predictors of long COVIDNature Medicine 27:626–631.https://doi.org/10.1038/s41591-021-01292-y
-
Brief Report: Pulmonary Function Tests: High Rate of False-Negative Results in the Early Detection and Screening of Scleroderma-Related Interstitial Lung DiseaseArthritis & Rheumatology (Hoboken, N.J.) 67:3256–3261.https://doi.org/10.1002/art.39405
-
NICE guideline on long COVIDThe Lancet. Respiratory Medicine 9:129.https://doi.org/10.1016/S2213-2600(21)00031-X
-
Clustering of the self-organizing mapIEEE Transactions on Neural Networks 11:586–600.https://doi.org/10.1109/72.846731
-
Flexible self-organizing maps in kohonen 3.0Journal of Statistical Software 87:1–18.https://doi.org/10.18637/jss.v087.i07
-
ConferenceProceedings, European Symposium on Artificial Neural NetworksMulti-Class Support Vector Machines.
-
BookGgplot2: Elegant Graphics for Data AnalysisCham: Springer-Verlag.https://doi.org/10.1007/978-3-319-24277-4
-
Welcome to the TidyverseJournal of Open Source Software 4:1686.https://doi.org/10.21105/joss.01686
-
BookFundamentals of Data Visualization: A Primer on Making Informative and Compelling FiguresSebastopol: O’Reilly Media.
-
Assessment of Sequelae of COVID-19 Nearly 1 Year After DiagnosisFrontiers in Medicine 8:717194.https://doi.org/10.3389/fmed.2021.717194
Article and author information
Author details
Funding
Land Tirol (GZ 71934)
- Judith Löffler-Ragg
Boehringer Ingelheim (IIS 1199-0424)
- Ivan Tancevski
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We acknowledge the commitment of the staff and providers of our institutions through the COVID-19 crisis and the suffering and loss of our patients as well as their families. PT is (from May 2021 on) a freelance data scientist working in his own enterprise ‘Data Analytics as a Service Tirol’. He received an honorary for the study data management, curation and analysis and minor manuscript work. The other authors declare no conflict of interest related to this study. The study was funded by the research fund of the state of Tyrol (Project GZ 71934, JLR) and an Investigator-Initiated Study grant by Boehringer Ingelheim (IIS 1199-0424, IT). The funding bodies did not influence the development of the research and manuscript.
Ethics
Human subjects: All participants gave written informed consent. The study was approved by the institutional review board at the Medical University of Innsbruck (approval number: 1103/2020), and registered at ClinicalTrials.gov (NCT04416100).
Copyright
© 2022, Sonnweber et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,736
- views
-
- 288
- downloads
-
- 35
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Medicine
- Microbiology and Infectious Disease
- Epidemiology and Global Health
- Immunology and Inflammation
eLife has published articles on a wide range of infectious diseases, including COVID-19, influenza, tuberculosis, HIV/AIDS, malaria and typhoid fever.
-
- Epidemiology and Global Health
Background: The role of circulating metabolites on child development is understudied. We investigated associations between children's serum metabolome and early childhood development (ECD).
Methods: Untargeted metabolomics was performed on serum samples of 5,004 children aged 6-59 months, a subset of participants from the Brazilian National Survey on Child Nutrition (ENANI-2019). ECD was assessed using the Survey of Well-being of Young Children's milestones questionnaire. The graded response model was used to estimate developmental age. Developmental quotient (DQ) was calculated as the developmental age divided by chronological age. Partial least square regression selected metabolites with a variable importance projection ≥ 1. The interaction between significant metabolites and the child's age was tested.
Results: Twenty-eight top-ranked metabolites were included in linear regression models adjusted for the child's nutritional status, diet quality, and infant age. Cresol sulfate (β = -0.07; adjusted-p < 0.001), hippuric acid (β = -0.06; adjusted-p < 0.001), phenylacetylglutamine (β = -0.06; adjusted-p < 0.001), and trimethylamine-N-oxide (β = -0.05; adjusted-p = 0.002) showed inverse associations with DQ. We observed opposite directions in the association of DQ for creatinine (for children aged -1 SD: β = -0.05; p =0.01; +1 SD: β = 0.05; p =0.02) and methylhistidine (-1 SD: β = - 0.04; p =0.04; +1 SD: β = 0.04; p =0.03).
Conclusion: Serum biomarkers, including dietary and microbial-derived metabolites involved in the gut-brain axis, may potentially be used to track children at risk for developmental delays.
Funding: Supported by the Brazilian Ministry of Health and the Brazilian National Research Council.