Research Article

Medicine

Infrared molecular fingerprinting of blood-based liquid biopsies for the detection of cancer

Ludwig Maximilians University Munich (LMU), Department of Laser Physics, Germany
Max Planck Institute of Quantum Optics (MPQ), Laboratory for Attosecond Physics, Germany
Asklepios Biobank for Lung Diseases, Department of Thoracic Surgery, Member of the German Center for Lung Research, DZL, Asklepios Fachkliniken München-Gauting, Germany
University Hospital of the Ludwig Maximilians University Munich (LMU), Department of Internal Medicine V, Germany
University Hospital of the Ludwig Maximilians University Munich (LMU), Department of Obstetrics and Gynecology, Breast Center and Comprehensive Cancer Center (CCLMU), Germany
University Hospital of the Ludwig Maximilians University Munich (LMU), Department of Urology, Germany
University Hospital of the Ludwig Maximilians University Munich (LMU), Department of Clinical Radiology, Germany

Oct 26, 2021

Open access
Copyright information

Abstract
Introduction
Results
Discussion
Methods
Data availability
References
Article and author information
Metrics

Abstract

Recent omics analyses of human biofluids provide opportunities to probe selected species of biomolecules for disease diagnostics. Fourier-transform infrared (FTIR) spectroscopy investigates the full repertoire of molecular species within a sample at once. Here, we present a multi-institutional study in which we analysed infrared fingerprints of plasma and serum samples from 1639 individuals with different solid tumours and carefully matched symptomatic and non-symptomatic reference individuals. Focusing on breast, bladder, prostate, and lung cancer, we find that infrared molecular fingerprinting is capable of detecting cancer: training a support vector machine algorithm allowed us to obtain binary classification performance in the range of 0.78–0.89 (area under the receiver operating characteristic curve [AUC]), with a clear correlation between AUC and tumour load. Intriguingly, we find that the spectral signatures differ between different cancer types. This study lays the foundation for high-throughput onco-IR-phenotyping of four common cancers, providing a cost-effective, complementary analytical tool for disease recognition.

Introduction

To address the ever-growing cancer incidence and mortality rates, effective treatment methods are indispensable (Bray et al., 2018). They rely on detection of the disease at the earliest possible stage to allow antitumour interventions and thus improve survival rates (Bannister and Broggio, 2016; Schiffman et al., 2015). Early detection is therefore a crucial factor in the global fight against cancer.

However, the clinical benefits versus the potential harms and costs of several cancer detection approaches remain controversial (Schiffman et al., 2015). Due to the limited sensitivity and specificity of current medical diagnostics, cancer can either be overlooked (false negatives) or falsely detected (false positives), leading to either delayed interventions or unnecessary, potentially harmful investigations or psychological stress (Srivastava et al., 2019). Hence, there is a high need to complement current medical diagnostics with time- and cost-efficient, non-invasive or minimally invasive methods that could possibly lead to new screening and detection approaches, prior to tissue-biopsy-based molecular profiling or prognosis (Wan et al., 2017).

Molecular analyses of human serum and plasma provide systemic molecular information and enabling novel routes of diagnostics (Amelio et al., 2020; Wan et al., 2017). So far, most liquid biopsies predominantly rely on the analysis of a few pre-selected analytes and biomarkers. Although the emergence of highly sensitive and molecule-specific methods in the fields of proteomics (Geyer et al., 2019; Geyer et al., 2017; Uzozie and Aebersold, 2018), metabolomics (Roig et al., 2017; Xia et al., 2013), and genomics (Abbosh et al., 2017; Han et al., 2017; Otandault et al., 2019) has led to the discovery of thousands of different biomarker candidates, only a few of them have been validated and transferred to the clinic so far (Poste, 2011).

Technological developments of the last decade brought a paradigm change regarding liquid biopsies. Instead of relying on a single molecular marker, recent approaches focus on combining information across a broad range of molecules to investigate changes in molecular patterns and identify disease-specific physiologies. However, the combination of various omics techniques (i.e. multi-omics) still requires complex and target-specific sample preparation as well as elaborate ways of merging different datasets (Hasin et al., 2017; Karczewski and Snyder, 2018; Malone et al., 2020; Yoo et al., 2018). Moreover, increasing the number of analytical methods involved often leads to unfeasibly high costs for broad clinical use.

This is where infrared molecular spectroscopy prevails – it captures signals from all types of molecules in a sample in a single time- and cost-effective measurement in a label-free fashion. When applied to blood serum or plasma samples, infrared spectroscopy provides a so-called infrared molecular fingerprint (IMF) reflecting the chemical composition of a sample, that is, the person’s molecular blood phenotype (Huber et al., 2021). Even though the IMF of a highly complex biofluid such as blood serum and plasma can only partially be traced back to its molecular origin, it may deliver a plethora of information sensitive and specific to the health state of the individual. In a recent longitudinal study, we have shown that defined workflows to collect, store, process, and measure human liquid biopsies lead to reproducible IMFs in healthy, non-symptomatic individuals that are stable over clinically relevant time scales (Huber et al., 2021).

Numerous studies have shown the potential of blood-based IMFs for the detection of cancer, notably brain (Butler et al., 2019; Hands, 2014; Sala et al., 2020b), breast (Backhaus et al., 2010; Elmi et al., 2017; Ghimire et al., 2020; Zelig et al., 2015), bladder (Ollesch et al., 2014), lung (Ollesch et al., 2016), prostate (Medipally et al., 2020), and other cancer entities (Anderson et al., 2020; Ollesch et al., 2014; Sala et al., 2020a), with some of the studies reporting specificities and sensitivities higher than 90% (Anderson et al., 2020; Backhaus et al., 2010; Butler et al., 2019; Ghimire et al., 2020; Medipally et al., 2020; Ollesch et al., 2014). Despite these promising initial results, only a few studies involved more than 75 individuals per group (Anderson et al., 2020). Additionally, the majority of these studies had a high risk of bias due to patient selection applied (Anderson et al., 2020). In fact, it was shown that IMFs are susceptible to external confounding factors, such as those related to sample handling and data collection, as well as to inherent biological variations (e.g. age and gender) unrelated to cancer (Diem, 2018). Furthermore, the differences observed in IMFs may be due to the innate immune response and other concomitant factors (Diem, 2018; Fabian et al., 2005). Thus, the specificity of IMF to a certain cancer must be evaluated by investigation of appropriate, carefully selected reference groups.

Altogether, there is a need for studies that address the issues listed above by (i) systematically investigating the pre-analytical factors (Cameron et al., 2020; Huber et al., 2021), (ii) studying the molecular origin of the infrared fingerprints (Voronina et al., 2021), and (iii) adequately applying machine learning tools with involvement of a sufficient number of participants. To date, the latter requirement has only been met in studies investigating the applicability of infrared fingerprinting to bladder (Ollesch et al., 2014), breast (Backhaus et al., 2010), and brain cancer detection (Butler et al., 2019). In addition to the capacity to detect cancer, whether different cancer entities have sufficiently different infrared spectral signatures to be distinguished from each other has so far not been evaluated.

Our present multi-institutional, multi-disease study addresses the above issues to rigorously assess the feasibility of IMFs for high-throughput detection of four common cancer entities as phenotypes, thus referred to as ‘onco-IR-phenotyping.’ Using Fourier-transform infrared (FTIR) transmission spectroscopy of liquid samples, we measured blood serum and plasma samples from 1927 individuals, among these 161 breast cancer, 118 bladder cancer, 278 prostate cancer, and 214 lung cancer patients, prior to any cancer-related therapy, along with non-symptomatic reference individuals and study participants with diseases and/or benign pathologies of the same organ (i.e. organ-specific symptomatic references). By applying support vector machine (SVM) to train models for binary classification, we obtained detection efficiencies in the range of 0.78–0.89 (area under the receiver operating characteristic [ROC] curve [AUC]), with the detection efficiency strongly correlating with the severity of the disease. The results of this prospectively conducted study suggest that infrared fingerprinting of liquid plasma and serum may offer a means of robust and reliable detection of different types of cancer. Furthermore, we reveal that the spectral signatures attributable to different cancer types differ significantly from each other, which facilitates classification between different states and thus carries a translational potential not previously reported.

Results

Study setup and workflow

In this study, we tested infrared molecular spectroscopy for medically relevant blood profiling in a prototypical multi-institutional setting, assessing the usefulness of IMFs as a source of complementary information for cancer diagnostics. The study included cohorts of therapy-naïve, lung, prostate, bladder, and breast cancer patients (cases), and organ-specific symptomatic references as well as non-symptomatic reference individuals (Figure 1a, Figure 1—source data 1).

Figure 1

Download asset Open asset

Infrared molecular fingerprinting workflow and clinical study design.

(a) Cohorts of therapy-naïve, lung, breast, prostate, and bladder cancer patients (cases), and organ-specific symptomatic references as well as non-symptomatic reference individuals were recruited at three different clinical sites – in total, 1927 individuals. (b) Blood samples from all individuals were drawn, and sera and plasma were prepared according to well-defined standard operating procedures. (c) Automated Fourier-transform infrared spectroscopy of liquid bulk sera and plasma were used to obtain IMFs. The displayed IMFs were pre-processed using water correction and normalization (see Methods). (d) For each clinical question studied, the characteristics of the case and the reference cohorts were matched for age, gender, and body mass index (BMI) to avoid patient selection bias. This resulted in total number of 1639 individuals upon matching. (e) Machine learning models were built on training datasets and evaluated on test datasets to separately evaluate the efficiency of classification for each of the four cancer entities.

Figure 1—source data 1 Breakdown of the overall participant pool used within the study. All the following analyses were carried out on subsets of this participant pool; see also other source data files for further details. When selecting the sub-cohorts, special care was taken to match the case and reference cohorts separately, for each question – according to age, gender, and body mass index (BMI) – in order to avoid possible bias in patient selection.: https://cdn.elifesciences.org/articles/68758/elife-68758-fig1-data1-v1.xlsx
Download elife-68758-fig1-data1-v1.xlsx

Blood sera and plasma were collected at several clinical sites according to well-defined standard operating procedures to minimize pre-analytical errors (Figure 1b; Huber et al., 2021). An automated sample delivery system was applied for high-throughput, high-reproducibility, and cost-efficient infrared fingerprinting of liquid sera and plasma of 1927 individuals with an FTIR spectrometer (Figure 1c). Special care was taken to match the characteristics of the case and reference cohorts for each question separately – by age, gender, and body mass index (BMI) – to avoid patient selection bias, although this step reduced the number of individuals analysed within this study to 1639 (Figure 1d). The acquired IMFs were used for training machine learning models to perform binary classification of the samples (Figure 1e) into case and reference groups, allowing the investigation of various clinically relevant questions (see below). Model training was performed by applying SVM algorithm to pre-processed IMFs by splitting the data into train and test sets, employing 10-fold cross-validation, repeated 10-times with randomization. For assessing the classification performance, we evaluated the AUC of the respective ROC curves for the test sets.

Diagnostic performance of infrared molecular fingerprinting for cancer detection

In a first step, we evaluated the diagnostic performance of IMFs obtained from serum samples for the binary classification of each of the four common cancer types individually against matched non-symptomatic reference groups (see Table 1 and Figure 2—source data 1 for details on the characteristics of the individual cohorts). Since our approach produces results in terms of continuous variables (disease likelihood) rather than binary outcomes (disease, non-disease), we use the AUC of the ROC as the main performance metric, and thus take advantage of incorporating information across multiple operating points, not limited to a particular clinical scenario.

The highest detection efficiencies in the test sets were obtained for the lung and breast cancer cohort SVM models, with a ROC AUC of 0.89 and 0.88, respectively (Figure 2a). A lower classification performance of 0.79 and 0.78 (ROC AUC) was obtained for the prostate and bladder cancer cohorts, respectively. Table 1 also lists the optimal combination (see Methods) of sensitivity and specificity for all cancer entities. For making our results comparable to other studies and possibly to gold standards in cancer detection, we present lists with sensitivity/specificity pairs (see Table 1). In particular, we present the optimal pairs extracted by minimizing the distance between the ROC curve and the upper-left corner – a standard practice in studies of this type. In addition, we set the specificity to 95% and present the resulting sensitivities.

Figure 2 with 2 supplements see all

Download asset Open asset

Diagnostic performance of lung, prostate, bladder, and breast cancer detection based on infrared molecular fingerprints (IMFs) of blood sera.

Receiver operating characteristic (ROC) curves for the binary classification of the test set with support vector machine (SVM) models trained on water-corrected and vector-normalized IMFs. The different cancer entities were tested against (a) non-symptomatic references, (b) mixed references that also include organ-specific symptomatic references, and (c) organ-specific symptomatic references only. Detailed cohort characteristics can be found in Figure 2—source data 1. (d) Area under the receiver operating characteristic curve (AUC) for the test sets according to different spectral pre-processing of the IMFs. The error bars show the standard deviation of the individual results of the cross-validation (LuCa: lung cancer; PrCa: prostate cancer; BrCa: breast cancer; BlCa: bladder cancer; NSR: non-symptomatic references; MR: mixed references; SR: symptomatic references).

Figure 2—source data 1 Characteristics of the matched groups of individuals utilized for the analysis as presented in Table 1, Figures 2 and 3a-c.: https://cdn.elifesciences.org/articles/68758/elife-68758-fig2-data1-v1.xlsx
Download elife-68758-fig2-data1-v1.xlsx
Figure 2—source data 2 Zipped folder with trained machine learning models and application instructions.: https://cdn.elifesciences.org/articles/68758/elife-68758-fig2-data2-v1.zip
Download elife-68758-fig2-data2-v1.zip
Figure 2—source data 3 Potential impact of clinical site to classification performance.: https://cdn.elifesciences.org/articles/68758/elife-68758-fig2-data3-v1.xlsx
Download elife-68758-fig2-data3-v1.xlsx

Table 1

Detection efficiency for different binary classifications.

Different cancer types were compared to each other, as well as the impact of using different reference groups was analysed. Detailed cohort characteristics can be found in Figure 2—source data 1 (NSR: non-symptomatic references; MR: mixed references; SR: symptomatic references; AUC: area under the receiver operating characteristic curve; *sensitivity and specificity values are obtained by minimizing the distance of the receiver operating characteristic [ROC] curve to the upper-left corner).

Clinical question for binary classification	# of Individuals	AUC	Sensitivity/specificity*	sensitivity at95% specificity
Lung cancer vs. NSR	214/193	0.89 ± 0.05	0.86/0.79	0.45
Lung cancer vs. MR	214/208	0.77 ± 0.06	0.72/0.67	0.36
Lung cancer vs. SR	214/143	0.74 ± 0.07	0.67/0.71	0.24
Prostate cancer vs. NSR	278/278	0.78 ± 0.06	0.71/0.71	0.36
Prostate cancer vs. MR	278/278	0.75 ± 0.06	0.71/0.68	0.23
Prostate cancer vs. SR	278/278	0.70 ± 0.06	0.65/0.68	0.20
Breast cancer vs. NSR	161/161	0.88 ± 0.06	0.82/0.81	0.35
Bladder cancer vs. NSR	118/118	0.79 ± 0.09	0.72/0.73	0.23

In clinical practice, however, patients may suffer from pathologies that affect the same organ as the cancer under scrutiny. Therefore – in a second step, we tested the capability of IMFs to classify cancer, when organ-specific comorbidities (e.g. chronic obstructive pulmonary disease [COPD] in the lung cancer cohort) and organ-specific benign conditions (e.g. hamartoma of the lung in the lung cancer cohort or benign prostate hyperplasia [BPH] in the prostate cancer cohort – see Figure 1—source data 1 for details) were added to the reference group. In this case, the detection efficiency decreased significantly, from 0.89 to 0.77, for lung cancer and slightly, from 0.79 to 0.75, for prostate cancer (Figure 2b). If the reference group contained only organ-specific symptomatic references, the detection efficiency was reduced further, to 0.74 for lung cancer and 0.70 for prostate cancer (Figure 2c).

To test whether sample collection, handling, and storage have a potential influence on classification results, we examined data from matched, non-symptomatic, healthy individuals from the three major clinics using principal component analysis (PCA). Considering the first five principal components (responsible for 95% of the explained variance), we could not observe any clustering effect related to data from different clinics (Figure 2—figure supplement 1). However, potential bias due to the above-mentioned influences cannot be fully excluded at the present stage. To this end, samples at different clinical sites are being collected to form a large independent test dataset, specifically designed to allow us evaluate the effects of clinical covariates – as well as measurement-related ones – relevant for the proposed IMF-based medical assay. One typically obtains a different AUC by using different control groups, collected at different sites (Figure 2—source data 3). These variations have many potential causes, including measurement-related effects, differences in sample handling, unobserved differences between the clinical populations recruited at different clinical sites, and of course the size of the training sets used for model training, which can significantly affect the model performance. Although important, it is currently not feasible to rigorously disentangle these effects. Furthermore, we investigated the influence of different pre-processing of the IMFs on the classification results and reassuringly found that these are not significantly affected by the applied pre-processing (Figure 2d). Model diagnostics yielded no signs of overfitting as we added different layers of pre-processing into the pipeline (see Methods for details). Since water-corrected and vector-normalized spectra typically resulted in slightly higher AUCs but still low overfitting, this pre-processing was kept in all other analyses.

It is generally known that blood serum and blood plasma provide largely overlapping molecular information, and both can be used as a basis for many further investigations. The extent to which this also applies to infrared fingerprinting has not been extensively studied. In a previous comparative study, we were able to show that healthy phenotypes can be better identified on the basis of serum (Huber et al., 2021).

Here we compare the diagnostic performance of IMFs from serum and plasma collected from the same individuals for the detection of lung and prostate cancer compared to non-symptomatic and symptomatic organ-specific references. Given that plasma samples were only available for a subset of the lung and prostate cohorts, the results for serum slightly deviate from those presented above due to the different cohort characteristics (Figure 2—figure supplement 2—source data 1). The detection efficiency based on IMFs from plasma samples was 3% higher in the case of lung cancer and 2% higher in the case of prostate cancer than the same analysis based on IMFs from serum samples. In both cases, the difference in AUC was only of low significance. It is noteworthy that the corresponding ROC curves show similar behaviour (Figure 2—figure supplement 2). These results suggest that either plasma or serum samples could in principle be used for detection of these cancer conditions. However, for carefully assessing whether (i) the same amount of information is contained in both biofluids and (ii) whether this information is encoded in a similar way across the entire spectra requires yet an additional dedicated study with higher sample numbers.

Investigation of cancer-specific infrared signatures

In many clinical settings, a simple binary classification may not be sufficient; instead, a simple, quick, and reliable test that indicates a specific cancer or disease is preferred. To investigate the possible existence of cancer-specific IMFs (or onco-IR-phenotypes), we first examined and compared the spectral signatures that are relevant for distinguishing cancer cases from non-symptomatic references. For this purpose, we evaluated the differential fingerprints (defined as the difference between the mean IMF of the case cohort and that of the reference cohort), determined the two-tailed p-value of Student’s t-test, and calculated the AUC per wavenumber using the U statistic of a Mann–Whitney U test (see Methods) for all cohorts (Figure 3a–c). The obtained patterns differed significantly for all four cancer entities.

Figure 3 with 1 supplement see all

Download asset Open asset

Infrared spectral signatures of lung, prostate, bladder, and breast cancer.

(**a-a'''**) Differential fingerprints (standard deviations of the reference cohorts are displayed as grey areas), (**b-b'''**) two-tailed p-value of Student’s t-test, and (**c-c'''**) area under the receiver operating characteristic curve (AUC) per wavenumber (extracted by application of Mann–Whitney U test) compared to the AUC of the combined model (dashed horizontal lines). Confusion matrix summarizing the per-class accuracies of multiclass classification of (d) lung, bladder, and breast cancer (matched female cohort) with overall model accuracy of 0.73 ± 0.11, and (e) lung, bladder, and prostate cancer (matched male cohort) with overall model accuracy of 0.74 ± 0.13. Detailed cohort characteristics can be found in Figure 3—source data 1. Chance level for the three-class classification corresponds to 0.33 (LuCa: lung cancer; PrCa: prostate cancer; BrCa: breast cancer; BlCa: bladder cancer).

Figure 3—source data 1 Characteristics of the matched groups utilized for the analysis presented in Figure 3d and e.: https://cdn.elifesciences.org/articles/68758/elife-68758-fig3-data1-v1.xlsx
Download elife-68758-fig3-data1-v1.xlsx

It is noteworthy that for lung and breast cancer the magnitude of the differential fingerprint compared to the variation of the IMFs of the reference group (grey area in Figure 3a) is more pronounced than for bladder and prostate cancer. This is also reflected in the p-values (Figure 3b), reaching many orders of magnitude lower levels for the former cancer entities, and higher spectrally resolved AUCs (Figure 3c). Compared to evaluation based on the entire spectral range, spectral containment significantly reduces detection efficiency for all cancer entities, although the reduction is smaller for lung and breast cancer. For these two cancer entities, classification based on a few selected spectral regions is possible. By contrast, for prostate and bladder cancer, the cancer-relevant information appears to be distributed over the entire spectral range and a high classification rate relies on the entire spectral range accessible.

The fact that the cancer entities studied here have different spectral signatures raises the question of whether it is possible to get first indication of the type of cancer detected, which can become relevant, for example, if the primary origin of a cancer type is unknown. Therefore, we performed a multiclass classification aiming to distinguish between lung, bladder, and breast cancer for a matched female cohort (Figure 3d) and between lung, bladder, and prostate cancer for a matched male cohort (Figure 3e). Note that the number of included cancer cases had to be significantly reduced in multiclass classification, as compared to the binary classification, in order to preserve balanced cohort characteristics. Details on this are given in Figure 3—source data 1. Overall, the classification accuracy was 73% and 74%, respectively. These findings suggest that primary tumours evolving in different organs indeed induce differing changes in the overall molecular composition of blood sera – as reflected in differing spectral signatures – and thus offering potential for cancer stratification in future. However, due to the small dataset, these findings need to be verified with larger, independent cohorts.

Often, a patient may reveal symptoms suggestive of a certain cancer entity (e.g. lung cancer), but same time also symptoms indicative of additional further diseases or benign conditions. Therefore, we tested the ability of infrared fingerprinting to detect signatures that would be specific to lung and prostate cancer in comparison to organ-specific (benign) diseases in each case. To this end, we evaluated the differential fingerprints of the different organ-specific symptomatic references compared to non-symptomatic individuals and compared these signatures to the cancer-related IMFs, respectively (Figure 3—figure supplement 1). We found that the differential fingerprint for asthma and lung hamartoma clearly differs from the ones obtained for lung cancer and COPD. However, the differential fingerprints of the latter two diseases, although distinguishable, exhibit strong similarities in their main spectral features. This explains why the presence of COPD in the reference group lowers the detection efficiency of lung cancer (Figure 2a vs. Figure 2b). In contrast, the differential fingerprints of BPH and prostate cancer differ considerably. Consequently, BPH in the reference group does not strongly affect the detection efficacy of prostate cancer.

Lung cancer is often accompanied by COPD, and the previous analysis showed that the differential fingerprint of COPD and lung cancer exhibits similarities. Thus, we investigated whether infrared fingerprinting could possibly identify any infrared signals specific only to lung cancer (and not to COPD). Towards that end, we separated individuals from the above analysis into sub-cohorts with subjects negative and positive for COPD. We found that the detection of lung cancer was less efficient when the reference cohort contained only COPD-positive individuals (Figure 4—figure supplement 1,). Both conditions (lung cancer and COPD) are often accompanied by an inflammatory response. Considering that the spectral signatures relevant for cancer detection are based on typical molecular changes that also occur in inflammatory conditions (Voronina et al., 2021), the presence of COPD likely masks, at least in part, cancer-relevant signals.

Another relevant question is whether a distinction between cancer and corresponding organ-specific benign pathologies can be made. Here, we evaluated to what extent this was possible for lung and prostate cancer. In both cases, we observed that the cancer detection was only moderately higher against a group of non-symptomatic individuals as compared to a group of patients with a benign condition (lung hamartoma and BPH, respectively; see Figure 4a and b).

Figure 4 with 1 supplement see all

Download asset Open asset

Detection efficiency of benign conditions and multiclass classification.

(a) Pairwise classification performance results between lung cancer (LuCa), hamartoma (Hamart.) and non-symptomatic reference group (NSR) with overall model accuracy of 0.46 ± 0.18, and (b) pairwise classification performance between prostate cancer (PrCa), benign prostate hyperplasia (BPH), and NSR with overall model accuracy of 0.43 ± 0.06. The error bars show the standard deviation of the individual results of the cross-validation. Confusion matrix summarizing the per-class accuracies of multiclass classification in (c) the LuCa cohort and (d) the PrCa cohort. The characteristics of the cohort used for this analysis are given in Figure 4—source data 1. Chance level for the three-class classification corresponds to 0.33.

Figure 4—source data 1 Characteristics of the matched groups utilized for the analysis presented in Figure 4.: https://cdn.elifesciences.org/articles/68758/elife-68758-fig4-data1-v1.xlsx
Download elife-68758-fig4-data1-v1.xlsx

Finally, we explored the possibility of creating multiclass classification models to simultaneously discriminate between multiple groups: cancer patients, individuals with benign conditions, and non-symptomatic reference subjects (Figure 4c and d). In both cases, the classification accuracy was well above chance. Although the accuracy may not yet be sufficient for clinical use, these accuracies may significantly improve with more samples available for training.

Dependence of cancer detection performance on tumour progression

Challenges for cancer detection include the enormous biological and clinical complexity of cancer, and detection is further complicated by the significant intratumour heterogeneity (McGranahan and Swanton, 2017) as well as by the impact of the tumour microenvironment (Boothby and Rickert, 2017). To evaluate whether the blood-based IMFs are sensitive to tumour progression, we first investigated whether the binary classification efficiency depends on tumour size, characterized in terms of clinical TNM staging (Amin et al., 2017).

In general, we observe that the classification efficiency exhibits a positive correlation with tumour size or tumour grade. In the case of lung cancer, when compared to the non-symptomatic references, the classification efficiency for T4 tumours is (in terms of AUC) 9% higher than that for T1 tumours (Figure 5a). Also, for breast and bladder cancer, a significantly higher detection efficiency for T3 tumours was observed. This is also reflected by the fact that a more pronounced differential fingerprint can be found in these cancers in higher T classes (Figure 5a–c). Although the absolute (integrated) deviation – between the cases and the matched references – increases for all four cancer phenotypes, the spectral features are partly different for the different T stages. This could be due to the fact that, due to the moderate number of individuals considered, the actual onco-IR-phenotype is masked by biological variability, or that the heterogeneity of tumour growth leads to different molecular changes and thus to different IMFs.

Figure 5 with 1 supplement see all

Download asset Open asset

Efficiency of binary classification and infrared spectral changes in dependence of tumour progression.

(**a–d**) Binary classification performance of lung, breast, bladder, and prostate cancer against references as a function of T-classification (of TNM-staging). (**a′–d′**) Differential fingerprints in relation with the tumour size (TNM class T) for all four cancer entities. (**a′′–d′′**) Area under the absolute differential fingerprints in relation with the tumour size for all dour cancer entities. The y-axes of the diagrams in the panels (**a'–d'**) and (**a''–d''**) each have the same linear scaling, thus directly comparable. (e) Classification performance of prostate cancer versus references as a function of tumour grade score. (f) Classification performance of prostate cancer as a function of the Gleason score (Gs). (g) Classification performance of lung cancer versus references as a function of the metastasis status. The detailed cohort breakdown and classification results are given as Figure 5—source data 1, Figure 5—source data 2, Figure 5—source data 3, Figure 5—source data 4. Some cohorts did not include sufficient number of participants so that a reliable machine learning model could not be built and were therefore not evaluated. LuCa: lung cancer; PrCa: prostate cancer; BrCa: breast cancer; BlCa: bladder cancer; NSR: non-symptomatic references; MR: mixed references; n.s.: not significant; *p<10^–2; **p<10^–3; ***p<10^–4; ****p<10^–5; The error bars show the standard deviation of the individual results of the cross-validation.

Figure 5—source data 1 Characteristics of the matched groups utilized for the analysis presented in Figure 5a-d, a'-d' and a"-d".: https://cdn.elifesciences.org/articles/68758/elife-68758-fig5-data1-v1.xlsx
Download elife-68758-fig5-data1-v1.xlsx
Figure 5—source data 2 Characteristics of the matched groups utilized for the analysis presented in Figure 5e.: https://cdn.elifesciences.org/articles/68758/elife-68758-fig5-data2-v1.xlsx
Download elife-68758-fig5-data2-v1.xlsx
Figure 5—source data 3 Characteristics of the matched groups utilized for the analysis presented in Figure 5f.: https://cdn.elifesciences.org/articles/68758/elife-68758-fig5-data3-v1.xlsx
Download elife-68758-fig5-data3-v1.xlsx
Figure 5—source data 4 Characteristics of the matched groups utilized for the analysis presented in Figure 5e.: https://cdn.elifesciences.org/articles/68758/elife-68758-fig5-data4-v1.xlsx
Download elife-68758-fig5-data4-v1.xlsx

In contrast, prostate cancer with higher T stage shows neither a significantly better AUC nor a more pronounced differential fingerprint (Figure 5d). Instead, the detection efficiency does increase significantly with tumour grade score (Amin et al., 2017; Figure 5e). A strong correlation between the AUC and Gleason score (Figure 5f) could also not be observed.

Finally, the size of the lung cohort allowed us to also investigate the possible effect of metastasis (TNM M1) on the IMFs and their classification performance. As expected from our previous findings, higher AUCs (although not statistically significantly higher) were found in the cohort of locally advanced and metastatic lung cancer as compared to cohort of non-metastatic cancer patients only (Figure 5g).

Overall, we observe a consistent pattern in agreement with the hypothesis that the signal utilized by the learning algorithm increases with more progressed disease stage (either larger tumour volume, metastatic spread, or tumour grade score). This suggests that the information retrieved from the measured differences between the IMFs of cases and references is connected to tumour-related molecular changes. These changes may be due to larger tumour load leaving a more extensive footprint on the composition of peripheral blood, or to the fact that tumour progression could have caused a higher systemic response, or to a combination of both. While the correlation between AUC and tumour size was most evident for lung, breast, and bladder cancer, spectral signatures relevant for prostate cancer detection were more strongly connected to the tumour grade score. It is important to note, however, that the observed relation – between the spectrum of the disease and classification efficiency – is not conclusively proven by the current analysis, but only suggested.

Discussion

We demonstrated the feasibility of blood-based IMF to detect lung, breast, bladder, and prostate cancer with good efficiency. Although previous smaller studies have yielded fairly high classification efficiencies (Backhaus et al., 2010; Elmi et al., 2017; Ghimire et al., 2020; Medipally et al., 2020; Ollesch et al., 2016; Ollesch et al., 2014; Zelig et al., 2015), they were either based on low number of participants or might have been affected by confounding factors. Here we provided a rigorous multi-institutional study setup with more than 100 individuals in each case and reference group, 1927 individuals in total, with all case and reference cohorts matched for major confounding factors (n = 1639 individuals upon matching). In addition, we observed that visible infrared spectral signatures correlate with tumour stage, suggesting that the IMFs are significantly affected not only by the presence of tumours but also by the progression of the disease. Furthermore, similar cancer detection efficiencies were achieved with IMFs obtained from blood serum and plasma. This not only confirms the robustness of the results, but also reveals that the method is applicable to both biofluids.

This study provides strong indications that blood-based IMF patterns can be utilized to identify various cancer entities, and therefore provides a foundation for a possible future in vitro diagnostic method. However, IMF-based testing is still at the evaluation stage of essay development, and further steps have to be undertaken to evaluate the clinical utility, reliability, and robustness of the IMF approach (Ignatiadis et al., 2021).

First, the machine learning models built within this work will have to be tested with fully independent sample sets. Although the study was designed to account for and minimize the effect of confounding factors, we are aware that these cannot be fully excluded, especially considering that machine learning algorithms are susceptible to them (Zhao et al., 2020). To this end, we freeze the current machine learning models, each trained on the data of the entire cohorts of the current study (Figure 2—source data 2), and will apply them to a consecutively prospective sample collection to better rule out potential confounders.

Second, it needs to be studied in more detail whether IMFs pick up molecular patterns that are specific to a primary disease process or, more generally, to any secondary inflammatory response. At the current stage, both options seem possible as altered immune responses are also known as primary disease drivers in the context of cancer and may affect genome instability, cancer cell proliferation, anti-apoptotic signalling, angiogenesis, and, last but not least, cancer cell dissemination (Hanahan and Weinberg, 2011). This link to systemic effects makes it still difficult to distinguish cancer-specific IMFs (onco-IR-phenotypes) from comorbidities with a strong immune signature (like COPD) in humans in vivo. Nevertheless, we did obtain distinct spectral patterns for all four common cancer entities (Figure 3b) indicating different, potentially disease-specific molecular alterations of the IMFs. These changes are likely to be linked to cancer-induced changes since classification accuracy is higher with more advanced cancer stages, which is also reflected in more pronounced differential fingerprints with larger tumour size.

To gain deeper understanding of the specificity of observed spectral changes to the disease patterns studied, it is helpful to investigate their molecular origin. In this context, we do not consider the approach of assigning spectral positions/features to characteristic vibrational modes of functional molecular entities most appropriate. Although widely used in the IR community, due to the very many molecular assignments possible for each spectral position, unambiguous statements of molecular changes herein are not feasible. Instead, a much deeper analysis is required, as recently revealed by Sangster et al., 2006; Voronina et al., 2021. The latter work involves combination of infrared spectroscopy and quantitative mass spectrometry on the part of the lung cancer sample set used in the current study as well, identifying the molecular origin of the differential infrared fingerprints. These can be partially explained by a characteristic change of proteins that are known to also change due to systemic inflammatory signals. Thereby we highlight the need for further biochemical investigations into the molecular origin of the observed spectral signatures, generally required in the field to address this question conclusively.

In-depth information about the molecular origin of the observed spectral disease patterns will help identify the clinical setting(s) where infrared fingerprinting can make largest contributions to cancer care (e.g. screening, diagnosis, prognosis, treatment monitoring, or surveillance). The specificity of spectral signatures to cancer, along with the obtained sensitivity and specificity in the binary classification (Table 1), will influence whether the approach may best complement primary diagnostics, be possibly suited for screening, or even be used for molecular profiling and prognostication. When further validated, blood-based IMFs could aid residing medical challenges: More specifically, it may complement radiological and clinical chemistry examinations prior to invasive tissue biopsies. Given less than 60 μl of sample are required, sample preparation time and effort are negligible, and measurement time is within minutes, the approach may be well suited for high-throughput screening or provide additional information for clinical decision process. Thus, minimally invasive IMF cancer detection could integratively help raise the rate of pre-metastatic cancer detection in clinical testing. However, further detailed research (e.g. as performed for an FTIR-based blood serum test for brain tumour; Gray et al., 2018) is needed to identify an appropriate clinical setting in which the proposed test can be used with the greatest benefit (in terms of cost-effectiveness and clinical utility).

Moreover, given the recent evidence of high within-person stability of IMFs over time (Huber et al., 2021), serial longitudinal liquid biopsies and infrared fingerprinting of respective samples could eliminate between-person variability by self-referencing and thereby facilitate even more efficient and possibly earlier cancer detection. Once (i) a precise clinical setting is defined and (ii) large-scale, stratified clinical studies controlled for comorbidities can be realized, a systematic, direct comparison to established diagnostics will become feasible and the full potential of infrared fingerprinting can be quantitatively assessed.

For further improvements in the accuracy of the envisioned medical assay, the IR fingerprinting methodology needs to be improved in parallel. Molecular specificity is inherently limited in IR spectroscopy due to the spectral overlap of absorption bands of individual molecules. This might be tackled by chemical pre-fractionation (Hughes et al., 2014; Petrich et al., 2009; Voronina et al., 2021) or by combining IR spectroscopy to methods like liquid chromatography. However, such a pre-fractioning, but also IR fingerprinting itself, would benefit even more from increased spectroscopic sensitivity. Sensitivity of the current commercially available FTIR spectrometer is however limited to detection of highly abundant molecules. Recent developments in infrared spectroscopy demonstrate the possibility to increase the detectable molecular dynamic range to five orders of magnitude (Pupeza et al., 2020) and therefore have the potential to improve the efficiency of infrared fingerprinting.

In summary, infrared fingerprinting reveals the potential for effective detection and distinction of various common cancer types already at its current stage and implementation. Future developments, in terms of instrumentation as well as methodology, have the potential to further improve the detection efficiency. This study presents a general high-throughput and cost-effective framework, and along this, highlights the possibility for extending infrared fingerprinting to other disease entities.

Methods

Study design

The objective of this study was to evaluate whether infrared molecular fingerprinting of human blood serum and plasma from patients, reference individuals, and healthy persons has any capacity to detect cancer, specifically targeting detection of four common cancer entities (lung, breast, bladder, and prostate cancer). A statistical power calculation for the sample size was performed prior to the study and is included in the study protocol. Based on preliminary results, it was determined that with a sample size of 200 cases and 200 controls, the detection power in terms of AUC can be estimated within a marginal error of 0.054. Therefore, the aim was to include more than 200 cases for each cancer type. However, upon matching (see also below), it was not always possible to include 200 individuals per group for all analyses of this study. In the analyses where the sample size of 200 individuals per group could not be reached, the uncertainty obtained increased accordingly (as seen in the obtained errors and error bars). The full sample size calculation is available on request from the corresponding authors.

The multi-institutional study on lung, breast, bladder, and prostate cancer also includes subjects with corresponding benign pathologies in the same organs as well as non-symptomatic subjects. Participants provided written informed consent for the study under research study protocol #17-141 and broad consent under research study protocol #17-182, both of which were approved by the Ethics Committee of the Ludwig-Maximillian-University (LMU) of Munich. Our study complies with all relevant ethical regulations and was conducted according to Good Clinical Practice (ICH-GCP) and the principles of the Declaration of Helsinki. The clinical trial is registered (ID DRKS00013217) at the German Clinical Trails Register (DRKS). The following clinical centres were involved in subject recruitment and sample collections of the prospective clinical study: Department of Internal Medicine V for Pneumology, Urology Clinic, Breast Center, Department of Obstetrics and Gynecology, and Comprehensive Cancer Centre Munich (CCLMU), all affiliated with the LMU. The Asklepios Lung Clinic (Gauting), affiliated to the Comprehensive Pneumology Centre (CPC) Munich, and the German Centre for Lung Research, DZL, were further study sites in the Munich region, Germany. In total, blood samples from 1927 individuals were collected and measured (see below). The full breakdown of all participants is listed in Figure 1—source data 1.

From the existing dataset, the recorded IMFs were selected for further analysis according to the following criteria:

Only data from cancer patients with clinically confirmed carcinoma of lung, prostate, bladder, or breast prior to any cancer-related therapy were considered.
Healthy references were non-symptomatic individuals not suffering from any cancer-related disease nor being under any medical treatment.
Symptomatic references included patients with COPD or pulmonary hamartoma for lung cancer, and BPH patients for prostate cancer.

From this pre-selected dataset, a further subset was created for each binary classification examined (e.g. lung cancer vs. non-symptomatic references). This selection was done using statistical matching (see below) in such a way that it provides a balanced distribution of gender, age, and BMI. This was to ensure that there is no bias towards any of these factors within the analysis of machine learning. The selection step reduced the number of analysed samples to 1639. It is important to note that given that we have performed evaluations addressing more than one main question, depending on some types of questions, some control samples are appropriately used as matched references for multiple questions.

A full breakdown of all included participants (sample pool) along with the breakdown for each of the investigated binary classification is provided as source data files.

Statistical matching

Achieving covariate balance between cases and references is an important procedure in observational studies for neutralizing the effect of confounding factors and limiting the bias in the results. In this work, we deploy optimal pair matching using the Mahalanobis distance within propensity score callipers (Rosenbaum, 2010). The implementation was done in R (v. 3.5.1). In evaluations where pair matching was not sufficient, optimal matching with multiple references was performed instead.

Sample collection and storage

Blood samples were collected, processed, and stored according to the same standard operating procedures at each clinical site. Blood draws were all performed using Safety-Multifly needles of at least 21 G (Sarstedt) and collected with 4.9 ml or 7.5 ml serum and plasma Monovettes (Sarstedt). For the blood clotting process to take place, the tubes were stored upright for at least 20 min and then centrifuged at 2000 g for 10 min at 20°C. The supernatant was carefully aliquoted into 0.5 ml fractions and frozen at –80°C within 5 hr after collection. Samples were transported to the analysis site on dry ice and again stored at –80°C until sample preparation.

Sample preparation and FTIR measurements

In advance of the FTIR measurements, one 0.5 ml aliquot per serum or plasma sample was thawed in a water bath at 4°C and again centrifuged for 10 min at 2000 g. The supernatant was distributed into the measurement tubes (50 µl per tube) and refrozen at –80°C. All the FTIR measurements were performed upon two freeze-thaw cycles.

The samples were mostly measured in the order in which they arrived at the measurement site. As sample collection and delivery is to some extent a stochastic process (both cases and references were continuously collected over the entire period), no additional randomization of the measurement order was performed.

The samples were aliquoted and measured in blinded fashion, that is, the person performing the measurements did not know about any clinical information about the samples. The spectroscopic measurements were performed in liquid phase with an automated FTIR device (MIRA-Analyzer, micro-biolytics GmbH) with a flow-through transmission cuvette (CaF₂ with ~8 µm path length). The spectra were acquired with a resolution of 4 cm^–1 in a spectral range between 950 cm^–1 and 3050 cm^–1. A water reference spectrum was recorded after each sample measurement to reconstruct the IR absorption spectra. Each measurement sequence usually contained up to 40 samples, resulting in measurement times of up to 3 hr. After each measurement batch, the instrument was carefully cleaned and re-qualified according to the manufacturer’s recommendations.

To track experimental errors over extended time periods (Sangster et al., 2006), a measurement of quality control serum (pooled human serum, BioWest, Nuaillé, France) was performed after every five samples. The spectra of the QC samples were also used to evaluate the measurement error. We found in a previous study that the measurement error is small when compared to the between-person biological variability of human serum IMFs (Huber et al., 2021). A relevant analysis comparing the variability between biological samples and QCs is presented in Figure 2—figure supplement 1b-b". In addition, the results obtained on a subset from plasma and serum samples from the same individuals were similar, indicating that no technical variance or device variation affected the measurement results. Thus, individual samples were not measured as replicates.

Outlier detection

If an air bubble was present during the measurement, this was immediately noticeable by saturation of the detector. In such cases, the measurement was considered faulty and another aliquot of the sample was measured. After the entire dataset was collected, we performed an additional outlier removal. For this, we used the method of Local Outlier Factor (LOF), as implemented in Scikit-Learn (v. 0.23.2) (Pedregosa et al., 2011). LOF is based on k-nearest neighbours and is appropriate for (moderately) high-dimensional data. LOF succeeds in removing samples with spectral anomalies such as abnormally low absorbance or contamination signatures. Using this procedure, a total of 28 spectra were removed from the dataset.

Pre-processing of infrared absorption spectra

Negative absorption, which occurs if the liquid sample contains less water than the reference (pure water), was corrected for by a previously described approach (Yang et al., 2015). It is known from measurements of dried serum or plasma that there is no significant absorption in the wavenumber region 2000–2300 cm^–1, resulting in a flat absorption baseline. We used this fact as a criterion for adding to each spectrum a previously measured water absorption spectrum (as provided in Figure 2—source data 2) to account for the missing water in the sample measurement and minimize the average slope in this region in order to obtain a flat baseline. All spectra were truncated to 1000–3000 cm^–1 and the ‘silent region,’ between 1750 cm^–1 and 2800 cm^–1, was removed. Finally, all spectra were normalized using Euclidean (L₂) norm. The calculation of the second derivative of the normalized spectra was included in some cases as an additional (optional) pre-processing step.

Machine learning and classification

To derive classification models, we used Scikit-Learn (Pedregosa et al., 2011; v. 0.23.2), an open-source machine learning framework in Python (v.3.7.6). We trained various binary classification as well as multiclass classification models using linear SVM. Performance evaluation was carried out using repeated stratified k-fold cross-validation and its visualization using the notion of the ROC curve for binary problems and the confusion matrix for multiclass classification. The results of the cross-validation are reported in terms of descriptive statistics, that is, the mean value of the resulting AUC distribution and its standard deviation. The calculation of optimal pair of sensitivity and specificity is done by minimizing the distance of the ROC curve to the upper-left corner.

Statistical analysis

For statistically comparing two groups of spectra (i.e. cases, references), we followed three approaches. First, we calculated the ‘differential fingerprint,’ defined as the sample mean of the cases minus the sample mean of the reference group. We plot this quantity contrasted against the standard deviation of the reference group for obtaining a visual understanding of which wavenumbers are potentially useful for distinguishing/classifying the two populations. Such a graph serves as a visual representation of what is known as the ‘effect size,’ which can be obtained by standardizing the differential fingerprint and, as shown in Figure 5—figure supplement 1, has an evident relation to the AUC per wavenumber. Secondly, we performed t-test (testing the hypothesis that two populations have equal means) for extracting two-tailed p-values per wavenumber. As a last step, we make use of Mann–Whitney U test (also known as Wilcoxon rank-sum test) for extracting the U statistic and calculating the AUC per wavenumber by the relation AUC = U/(n1*n2), where n1 and n2 are the sizes of the two groups.

Data availability

The datasets analysed within the scope of the study cannot be published publicly due to privacy regulations under the General Data Protection Regulation (EU) 2016/679. The raw data includes clinical data from patients, including textual clinical notes and contain information that could potentially compromise subjects' privacy or consent, and therefore cannot be shared. However, the trained machine learning models for the binary classification of bladder, breast, prostate, and lung cancer are provided within Figure 2—source data 4, along with description and code for importing them in a python script. The custom code used for the production of the results presented in this manuscript is stored in a persistent repository at the Leibniz Supercomputing Center of the Bavarian Academy of Sciences and Humanities (LRZ), located in Garching, Germany. The entire code can only be shared upon reasonable request, as its correct use depends heavily on the settings of the experimental setup and the measuring device and should therefore be clarified with the authors.

References

1. Abbosh C
2. Birkbak NJ
3. Wilson GA
4. Jamal-Hanjani M
5. Constantin T
6. Salari R
7. Le Quesne J
8. Moore DA
9. Veeriah S
10. Rosenthal R
11. Marafioti T
12. Kirkizlar E
13. Watkins TBK
14. McGranahan N
15. Ward S
16. Martinson L
17. Riley J
18. Fraioli F
19. Al Bakir M
20. Grönroos E
21. Zambrana F
22. Endozo R
23. Bi WL
24. Fennessy FM
25. Sponer N
26. Johnson D
27. Laycock J
28. Shafi S
29. Czyzewska-Khan J
30. Rowan A
31. Chambers T
32. Matthews N
33. Turajlic S
34. Hiley C
35. Lee SM
36. Forster MD
37. Ahmad T
38. Falzon M
39. Borg E
40. Lawrence D
41. Hayward M
42. Kolvekar S
43. Panagiotopoulos N
44. Janes SM
45. Thakrar R
46. Ahmed A
47. Blackhall F
48. Summers Y
49. Hafez D
50. Naik A
51. Ganguly A
52. Kareht S
53. Shah R
54. Joseph L
55. Marie Quinn A
56. Crosbie PA
57. Naidu B
58. Middleton G
59. Langman G
60. Trotter S
61. Nicolson M
62. Remmen H
63. Kerr K
64. Chetty M
65. Gomersall L
66. Fennell DA
67. Nakas A
68. Rathinam S
69. Anand G
70. Khan S
71. Russell P
72. Ezhil V
73. Ismail B
74. Irvin-Sellers M
75. Prakash V
76. Lester JF
77. Kornaszewska M
78. Attanoos R
79. Adams H
80. Davies H
81. Oukrif D
82. Akarca AU
83. Hartley JA
84. Lowe HL
85. Lock S
86. Iles N
87. Bell H
88. Ngai Y
89. Elgar G
90. Szallasi Z
91. Schwarz RF
92. Herrero J
93. Stewart A
94. Quezada SA
95. Peggs KS
96. Van Loo P
97. Dive C
98. Lin CJ
99. Rabinowitz M
100. Aerts HJWL
101. Hackshaw A
102. Shaw JA
103. Zimmermann BG
104. TRACERx consortium
105. PEACE consortium
106. Swanton C
(2017) Phylogenetic CTDNA analysis depicts early-stage lung cancer evolution
Nature 545:446–451.

https://doi.org/10.1038/nature22364
- PubMed
- Google Scholar
1. Amelio I
2. Bertolo R
3. Bove P
4. Buonomo OC
5. Candi E
6. Chiocchi M
7. Cipriani C
8. Di Daniele N
9. Ganini C
10. Juhl H
11. Mauriello A
12. Marani C
13. Marshall J
14. Montanaro M
15. Palmieri G
16. Piacentini M
17. Sica G
18. Tesauro M
19. Rovella V
20. Tisone G
21. Shi Y
22. Wang Y
23. Melino G
(2020) Liquid biopsies and cancer omics
Cell Death Discovery 6:131.

https://doi.org/10.1038/s41420-020-00373-0
- PubMed
- Google Scholar
Book
1. Amin MB
2. Edge SB
3. Greene FL
4. Byrd DR
5. Brookland RK
6. Washington MK
7. Gershenwald JE
8. Compton CC
9. Hess KR
10. Sullivan DC
11. Jessup JM
12. Brierley JD
13. Gaspar LE
14. Schilsky RL
15. Balch CM
16. Winchester DP
17. Asare EA
18. Madera M
19. Gress DM
20. Meyer LR
(2017) AJCC Cancer Staging Manual
Springer.

https://doi.org/10.1007/978-3-319-40618-3
- Google Scholar
(2020) Liquid biopsy for cancer diagnosis using vibrational spectroscopy: systematic review
BJS Open 4:554–562.

https://doi.org/10.1002/bjs5.50289
- PubMed
- Google Scholar
1. Backhaus J
2. Mueller R
3. Formanski N
4. Szlama N
5. Meerpohl HG
6. Eidt M
7. Bugert P
(2010) Diagnosis of breast cancer with infrared spectroscopy from serum samples
Vibrational Spectroscopy 52:173–177.

https://doi.org/10.1016/j.vibspec.2010.01.013
- Google Scholar
Book
1. Bannister N
2. Broggio J
(2016)
Cancer Survival by Stage at Diagnosis for England (Experimental Statistics)

Office for National Statistics.
- Google Scholar
1. Boothby M
2. Rickert RC
(2017) Metabolic Regulation of the Immune Humoral Response
Immunity 46:743–755.

https://doi.org/10.1016/j.immuni.2017.04.009
- PubMed
- Google Scholar
1. Bray F
2. Ferlay J
3. Soerjomataram I
4. Siegel RL
5. Torre LA
6. Jemal A
(2018) Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries
CA 68:394–424.

https://doi.org/10.3322/caac.21492
- PubMed
- Google Scholar
1. Butler HJ
2. Brennan PM
3. Cameron JM
4. Finlayson D
5. Hegarty MG
6. Jenkinson MD
7. Palmer DS
8. Smith BR
9. Baker MJ
(2019) Development of high-throughput ATR-FTIR technology for rapid triage of brain cancer
Nature Communications 10:4501.

https://doi.org/10.1038/s41467-019-12527-5
- PubMed
- Google Scholar
1. Cameron JM
2. Butler HJ
3. Anderson DJ
4. Christie L
5. Confield L
6. Spalding KE
7. Finlayson D
8. Murray S
9. Panni Z
10. Rinaldi C
11. Sala A
12. Theakstone AG
13. Baker MJ
(2020) Exploring pre-analytical factors for the optimisation of serum diagnostics: Progressing the clinical utility of ATR-FTIR spectroscopy
Vibrational Spectroscopy 109:103092.

https://doi.org/10.1016/j.vibspec.2020.103092
- Google Scholar
1. Diem M
(2018) Comments on recent reports on infrared spectral detection of disease markers in blood components
Journal of Biophotonics 11:e201800064.

https://doi.org/10.1002/jbio.201800064
- PubMed
- Google Scholar
(2017) Application of FT-IR spectroscopy on breast cancer serum analysis
Spectrochim Acta Part A Mol Biomol Spectrosc 187:87–91.

https://doi.org/10.1016/j.saa.2017.06.021
- Google Scholar
(2005) Analysis of biofluids in aqueous environment based on mid-infrared spectroscopy
Journal of Biomedical Optics 10:031103.

https://doi.org/10.1117/1.1917844
- PubMed
- Google Scholar
1. Geyer PE
2. Holdt LM
3. Teupser D
4. Mann M
(2017) Revisiting biomarker discovery by plasma proteomics
Molecular Systems Biology 13:942.

https://doi.org/10.15252/msb.20156297
- PubMed
- Google Scholar
1. Geyer PE
2. Voytik E
3. Treit P
4. Doll S
5. Kleinhempel A
6. Niu L
7. Müller JB
8. Buchholtz M
9. Bader JM
10. Teupser D
11. Holdt LM
12. Mann M
(2019) Plasma Proteome Profiling to detect and avoid sample‐related biases in biomarker studies
EMBO Molecular Medicine 11:1–12.

https://doi.org/10.15252/emmm.201910427
- PubMed
- Google Scholar
1. Ghimire H
2. Garlapati C
3. Janssen EAM
4. Krishnamurti U
5. Qin G
6. Aneja R
7. Perera AGU
(2020) Protein Conformational Changes in Breast Cancer Sera Using Infrared Spectroscopic Analysis
Cancers 12:1708.

https://doi.org/10.3390/cancers12071708
- PubMed
- Google Scholar
1. Gray E
2. Butler HJ
3. Board R
4. Brennan PM
5. Chalmers AJ
6. Dawson T
7. Goodden J
8. Hamilton W
9. Hegarty MG
10. James A
11. Jenkinson MD
12. Kernick D
13. Lekka E
14. Livermore LJ
15. Mills SJ
16. O’Neill K
17. Palmer DS
18. Vaqas B
19. Baker MJ
(2018) Health economic evaluation of a serum-based blood test for brain tumour diagnosis: Exploration of two clinical scenarios
BMJ Open 8:e017593.

https://doi.org/10.1136/bmjopen-2017-017593
- PubMed
- Google Scholar
1. Han X
2. Wang J
3. Sun Y
(2017) Circulating Tumor DNA as Biomarkers for Cancer Detection
Genomics, Proteomics & Bioinformatics 15:59–72.

https://doi.org/10.1016/j.gpb.2016.12.004
- PubMed
- Google Scholar
1. Hanahan D
2. Weinberg RA
(2011) Hallmarks of Cancer: The Next Generation
Cell 144:646–674.

https://doi.org/10.1016/j.cell.2011.02.013
- PubMed
- Google Scholar
1. Hands JR
(2014) Attenuated Total Reflection Fourier Transform Infrared (ATR-FTIR) spectral discrimination of brain tumour severity from serum samples
Journal of Biophotonics 7:189–199.

https://doi.org/10.1002/jbio.201300149
- PubMed
- Google Scholar
1. Hasin Y
2. Seldin M
3. Lusis A
(2017) Multi-omics approaches to disease
Genome Biology 18:83.

https://doi.org/10.1186/s13059-017-1215-1
- PubMed
- Google Scholar
1. Huber M
2. Kepesidis K
3. Voronina L
4. Božić M
5. Trubetskov M
6. Harbeck N
7. Krausz F
8. Žigman M
(2021) Stability of person-specific blood-based infrared molecular fingerprints opens up prospects for health monitoring
Nature Communications 12:1511.

https://doi.org/10.1038/s41467-021-21668-5
- PubMed
- Google Scholar
1. Hughes C
2. Brown M
3. Clemens G
4. Henderson A
5. Monjardez G
6. Clarke NW
7. Gardner P
(2014) Assessing the challenges of Fourier transform infrared spectroscopic analysis of blood serum
Journal of Biophotonics 7:180–188.

https://doi.org/10.1002/jbio.201300167
- PubMed
- Google Scholar
(2021) Liquid biopsy enters the clinic — implementation issues and future challenges
Nature Reviews. Clinical Oncology 18:297–312.

https://doi.org/10.1038/s41571-020-00457-x
- PubMed
- Google Scholar
1. Karczewski KJ
2. Snyder MP
(2018) Integrative omics for health and disease
Nature Reviews. Genetics 19:299–310.

https://doi.org/10.1038/nrg.2018.4
- PubMed
- Google Scholar
1. Malone ER
2. Oliva M
3. Sabatini PJB
4. Stockley TL
5. Siu LL
(2020) Molecular profiling for precision cancer therapies
Genome Medicine 12:8.

https://doi.org/10.1186/s13073-019-0703-1
- PubMed
- Google Scholar
1. McGranahan N
2. Swanton C
(2017) Clonal Heterogeneity and Tumor Evolution: Past, Present, and the Future
Cell 168:613–628.

https://doi.org/10.1016/j.cell.2017.01.018
- PubMed
- Google Scholar
1. Medipally DKR
2. Cullen D
3. Untereiner V
4. Sockalingum GD
5. Maguire A
6. Nguyen TNQ
7. Bryant J
8. Noone E
9. Bradshaw S
10. Finn M
11. Dunne M
12. Shannon AM
13. Armstrong J
14. Meade AD
15. Lyng FM
(2020) Vibrational spectroscopy of liquid biopsies for prostate cancer diagnosis
Therapeutic Advances in Medical Oncology 12:175883592091849.

https://doi.org/10.1177/1758835920918499
- PubMed
- Google Scholar
1. Ollesch J
2. Heinze M
3. Heise HM
4. Behrens T
5. Brüning T
6. Gerwert K
(2014) It’s in your blood: spectral biomarker candidates for urinary bladder cancer from automated FTIR spectroscopy
Journal of Biophotonics 7:210–221.

https://doi.org/10.1002/jbio.201300163
- Google Scholar
1. Ollesch J
2. Theegarten D
3. Altmayer M
4. Darwiche K
5. Hager T
6. Stamatis G
7. Gerwert K
(2016) An infrared spectroscopic blood test for non-small cell lung carcinoma and subtyping into pulmonary squamous cell carcinoma or adenocarcinoma
Biomedical Spectroscopy and Imaging 5:129–144.

https://doi.org/10.3233/BSI-160144
- Google Scholar
1. Otandault A
2. Anker P
3. Al Amir Dache Z
4. Guillaumon V
5. Meddeb R
6. Pastor B
7. Pisareva E
8. Sanchez C
9. Tanos R
10. Tousch G
11. Schwarzenbach H
12. Thierry AR
(2019) Recent advances in circulating nucleic acids in oncology
Annals of Oncology 30:374–384.

https://doi.org/10.1093/annonc/mdz031
- PubMed
- Google Scholar
(2011) Scikit-learn: Machine Learning in Python
Journal of Machine Learning Research 12:2825–2830.

https://doi.org/10.1007/s13398-014-0173-7.2
- Google Scholar
1. Petrich W
2. Lewandrowski KB
3. Muhlestein JB
4. Hammond MEH
5. Januzzi JL
6. Lewandrowski EL
7. Pearson RR
8. Dolenko B
9. Früh J
10. Haass M
11. Hirschl MM
12. Köhler W
13. Mischler R
14. Möcks J
15. Ordóñez–Llanos J
16. Quarder O
17. Somorjai R
18. Staib A
19. Sylvén C
20. Werner G
21. Zerback R
(2009) Potential of mid-infrared spectroscopy to aid the triage of patients with acute chest pain
The Analyst 134:1092.

https://doi.org/10.1039/b820923e
- PubMed
- Google Scholar
1. Poste G
(2011) Bring on the biomarkers
Nature 469:156–157.

https://doi.org/10.1038/469156a
- PubMed
- Google Scholar
1. Pupeza I
2. Huber M
3. Trubetskov M
4. Schweinberger W
5. Hussain SA
6. Hofer C
7. Fritsch K
8. Poetzlberger M
9. Vamos L
10. Fill E
11. Amotchkina T
12. Kepesidis K
13. Apolonski A
14. Karpowicz N
15. Pervak V
16. Pronin O
17. Fleischmann F
18. Azzeer A
19. Žigman M
20. Krausz F
(2020) Field-resolved infrared spectroscopy of biological systems
Nature 577:52–59.

https://doi.org/10.1038/s41586-019-1850-7
- PubMed
- Google Scholar
1. Roig B
2. Rodríguez-Balada M
3. Samino S
4. Ewf L
5. Guaita-Esteruelas S
6. Gomes AR
7. Correig X
8. Borràs J
9. Yanes O
10. Gumà J
(2017) Metabolomics reveals novel blood plasma biomarkers associated to the BRCA1-mutated phenotype of human breast cancer
Scientific Reports 7:17831.

https://doi.org/10.1038/s41598-017-17897-8
- PubMed
- Google Scholar
Book
1. Rosenbaum PR
(2010) Design of Observational Studies, Springer Series in Statistics
Springer.

https://doi.org/10.1007/978-1-4419-1213-8
- Google Scholar
1. Sala A
2. Anderson DJ
3. Brennan PM
4. Butler HJ
5. Cameron JM
6. Jenkinson MD
7. Rinaldi C
8. Theakstone AG
9. Baker MJ
(2020a) Biofluid diagnostics by FTIR spectroscopy: A platform technology for cancer detection
Cancer Letters 477:122–130.

https://doi.org/10.1016/j.canlet.2020.02.020
- Google Scholar
1. Sala A
2. Spalding KE
3. Ashton KM
4. Board R
5. Butler HJ
6. Dawson TP
7. Harris DA
8. Hughes CS
9. Jenkins CA
10. Jenkinson MD
11. Palmer DS
12. Smith BR
13. Thornton CA
14. Baker MJ
(2020b) Rapid analysis of disease state in liquid human serum combining infrared spectroscopy and “digital drying
Journal of Biophotonics 13:118.

https://doi.org/10.1002/jbio.202000118
- Google Scholar
1. Sangster T
2. Major H
3. Plumb R
4. Wilson AJ
5. Wilson ID
(2006) A pragmatic and readily implemented quality control strategy for HPLC-MS and GC-MS-based metabonomic analysis
The Analyst 131:1075–1078.

https://doi.org/10.1039/b604498k
- PubMed
- Google Scholar
(2015) Early Detection of Cancer: Past, Present, and Future
Am Soc Clin Oncol Educ B 10:57–65.

https://doi.org/10.14694/EdBook_AM.2015.35.57
- Google Scholar
1. Srivastava S
2. Koay EJ
3. Borowsky AD
4. De Marzo AM
5. Ghosh S
6. Wagner PD
7. Kramer BS
(2019) Cancer overdiagnosis: a biological challenge and clinical dilemma
Nature Reviews. Cancer 19:349–358.

https://doi.org/10.1038/s41568-019-0142-8
- PubMed
- Google Scholar
1. Uzozie AC
2. Aebersold R
(2018) Advancing translational research and precision medicine with targeted proteomics
Journal of Proteomics 189:1–10.

https://doi.org/10.1016/j.jprot.2018.02.021
- PubMed
- Google Scholar
1. Voronina L
2. Leonardo C
3. Mueller‐Reif JB
4. Geyer PE
5. Huber M
6. Trubetskov M
7. Kepesidis K
8. Behr J
9. Mann M
10. Krausz F
11. Žigman M
(2021) Molecular Origin of Blood‐Based Infrared Spectroscopic Fingerprints**
Angew Chemie Int Ed Anie 60:17060–17069.

https://doi.org/10.1002/anie.202103272
- PubMed
- Google Scholar
1. Wan JCM
2. Massie C
3. Garcia-Corbacho J
4. Mouliere F
5. Brenton JD
6. Caldas C
7. Pacey S
8. Baird R
9. Rosenfeld N
(2017) Liquid biopsies come of age: Towards implementation of circulating tumour dna
Nature Reviews. Cancer 17:223–238.

https://doi.org/10.1038/nrc.2017.7
- PubMed
- Google Scholar
(2013) Translational biomarker discovery in clinical metabolomics: An introductory tutorial
Metabolomics 9:280–299.

https://doi.org/10.1007/s11306-012-0482-9
- PubMed
- Google Scholar
1. Yang H
2. Yang S
3. Kong J
4. Dong A
5. Yu S
(2015) Obtaining information about protein secondary structures in aqueous solution using Fourier transform IR spectroscopy
Nature Protocols 10:382–396.

https://doi.org/10.1038/nprot.2015.024
- PubMed
- Google Scholar
1. Yoo BC
2. Kim KH
3. Woo SM
4. Myung JK
(2018) Clinical multi-omics strategies for the effective cancer management
Journal of Proteomics 188:97–106.

https://doi.org/10.1016/j.jprot.2017.08.010
- PubMed
- Google Scholar
1. Zelig U
2. Barlev E
3. Bar O
4. Gross I
5. Flomen F
6. Mordechai S
7. Kapelushnik J
8. Nathan I
9. Kashtan H
10. Wasserberg N
11. Madhala-Givon O
(2015) Early detection of breast cancer using total biochemical analysis of peripheral blood components: a preliminary study
BMC Cancer 15:408.

https://doi.org/10.1186/s12885-015-1414-7
- PubMed
- Google Scholar
1. Zhao Q
2. Adeli E
3. Pohl KM
(2020) Training confounder-free deep learning models for medical applications
Nature Communications 11:6010.

https://doi.org/10.1038/s41467-020-19784-9
- PubMed
- Google Scholar

Article and author information

Author details

Marinus Huber
1. Ludwig Maximilians University Munich (LMU), Department of Laser Physics, Garching, Germany
2. Max Planck Institute of Quantum Optics (MPQ), Laboratory for Attosecond Physics, Garching, Germany
Contribution
Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review and editing

Contributed equally with
Kosmas V Kepesidis

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0001-5309-4475
Kosmas V Kepesidis
1. Ludwig Maximilians University Munich (LMU), Department of Laser Physics, Garching, Germany
2. Max Planck Institute of Quantum Optics (MPQ), Laboratory for Attosecond Physics, Garching, Germany
Contribution
Conceptualization, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review and editing

Contributed equally with
Marinus Huber

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-6391-7743
Liudmila Voronina

Ludwig Maximilians University Munich (LMU), Department of Laser Physics, Garching, Germany

Contribution
Data curation, Formal analysis, Methodology, Writing – review and editing

Competing interests
No competing interests declared
Frank Fleischmann

Ludwig Maximilians University Munich (LMU), Department of Laser Physics, Garching, Germany

Contribution
Data curation, Investigation, Methodology, Supervision

Competing interests
No competing interests declared
Ernst Fill

Max Planck Institute of Quantum Optics (MPQ), Laboratory for Attosecond Physics, Garching, Germany

Contribution
Conceptualization, Formal analysis, Investigation, Methodology

Competing interests
No competing interests declared
Jacqueline Hermann

Ludwig Maximilians University Munich (LMU), Department of Laser Physics, Garching, Germany

Contribution
Methodology, Project administration, Supervision, Validation

Competing interests
No competing interests declared
Ina Koch

Asklepios Biobank for Lung Diseases, Department of Thoracic Surgery, Member of the German Center for Lung Research, DZL, Asklepios Fachkliniken München-Gauting, Munich, Germany

Contribution
Data curation, Investigation, Resources, Validation

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-8766-017X
Katrin Milger-Kneidinger

University Hospital of the Ludwig Maximilians University Munich (LMU), Department of Internal Medicine V, Munich, Germany

Contribution
Resources

Competing interests
No competing interests declared
Thomas Kolben

University Hospital of the Ludwig Maximilians University Munich (LMU), Department of Obstetrics and Gynecology, Breast Center and Comprehensive Cancer Center (CCLMU), Munich, Germany

Contribution
Resources, Supervision

Competing interests
No competing interests declared
Gerald B Schulz

University Hospital of the Ludwig Maximilians University Munich (LMU), Department of Urology, Munich, Germany

Contribution
Investigation

Competing interests
No competing interests declared
Friedrich Jokisch

University Hospital of the Ludwig Maximilians University Munich (LMU), Department of Urology, Munich, Germany

Contribution
Investigation

Competing interests
No competing interests declared
Jürgen Behr

University Hospital of the Ludwig Maximilians University Munich (LMU), Department of Internal Medicine V, Munich, Germany

Contribution
Conceptualization, Resources, Supervision, Writing – review and editing

Competing interests
No competing interests declared
Nadia Harbeck

University Hospital of the Ludwig Maximilians University Munich (LMU), Department of Obstetrics and Gynecology, Breast Center and Comprehensive Cancer Center (CCLMU), Munich, Germany

Contribution
Conceptualization, Resources, Supervision, Writing – review and editing

Competing interests
No competing interests declared
Maximilian Reiser

University Hospital of the Ludwig Maximilians University Munich (LMU), Department of Clinical Radiology, Munich, Germany

Contribution
Conceptualization, Resources, Writing – review and editing

Competing interests
No competing interests declared
Christian Stief

University Hospital of the Ludwig Maximilians University Munich (LMU), Department of Urology, Munich, Germany

Contribution
Conceptualization, Resources, Writing – review and editing

Competing interests
No competing interests declared
Ferenc Krausz
1. Ludwig Maximilians University Munich (LMU), Department of Laser Physics, Garching, Germany
2. Max Planck Institute of Quantum Optics (MPQ), Laboratory for Attosecond Physics, Garching, Germany
Contribution
Conceptualization, Funding acquisition, Investigation, Methodology, Resources, Supervision, Visualization, Writing – review and editing

Competing interests
No competing interests declared
Mihaela Zigman
1. Ludwig Maximilians University Munich (LMU), Department of Laser Physics, Garching, Germany
2. Max Planck Institute of Quantum Optics (MPQ), Laboratory for Attosecond Physics, Garching, Germany
Contribution
Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Visualization, Writing – original draft, Writing – review and editing

For correspondence
mihaela.zigman@mpq.mpg.de

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0001-8306-1922

Funding

No external funding was received for this work.

Acknowledgements

We thank Prof. Dr. Gabriele Multhoff, Dr. Stefan Jungblut, Katja Leitner, Dr. Sigrid Auweter, Daniel Meyer, Beate Rank, Sabine Witzens, Christina Mihm, Sabine Eiselen, Tarek Eissa, and Dr. Incinur Zellhuber for their help with this study. In particular, we wish to acknowledge the efforts of many individuals who participated as volunteers in the clinical study reported here. We also thank the Asklepios Biobank for Lung Diseases, member of the German Center for Lung Research (DZL), for providing clinical samples and data.

Ethics

Clinical trial registration DRKS00013217.

The multi-institutional study on lung, breast, bladder and prostate cancer includes cancer patients as well as subjects with corresponding benign pathologies in the same organs as well as non-symptomatic subjects. Participants provided written informed consent for the study under research study protocol #17-141 and broad consent under research study protocol #17-182, both of which were approved by the Ethics Committee of the Ludwig-Maximillian-University (LMU) of Munich. Our study complies with all relevant ethical regulations, and was conducted according to Good Clinical Practice (ICH-GCP) and the principles of the Declaration of Helsinki. The clinical trial is registered (ID DRKS00013217) at the German Clinical Trails Register (DRKS).

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.