Serum RNAs can predict lung cancer up to 10 years prior to diagnosis

  1. Sinan Uğur Umu  Is a corresponding author
  2. Hilde Langseth
  3. Verena Zuber
  4. Åslaug Helland
  5. Robert Lyle
  6. Trine B Rounge  Is a corresponding author
  1. Cancer Registry of Norway, Norway
  2. Imperial College London, United Kingdom
  3. Oslo University Hospital, Norway

Abstract

Lung cancer (LC) prognosis is closely linked to the stage of disease when diagnosed. We investigated the biomarker potential of serum RNAs for the early detection of LC in smokers at different prediagnostic time intervals and histological subtypes. In total, 1061 samples from 925 individuals were analyzed. RNA sequencing with an average of 18 million reads per sample was performed. We generated machine learning models using normalized serum RNA levels and found that smokers later diagnosed with LC in 10 years can be robustly separated from healthy controls regardless of histology with an average area under the ROC curve (AUC) of 0.76 (95% CI, 0.68-0.83). Furthermore, the strongest models that took both time to diagnosis and histology into account successfully predicted non-small cell LC (NSCLC) between 6 to 8 years, with an AUC of 0.82 (95% CI, 0.76-0.88), and SCLC between 2 to 5 years, with an AUC of 0.89 (95% CI, 0.77-1.0), before diagnosis. The most important separators were microRNAs, miscellaneous RNAs, isomiRs and tRNA-derived fragments. We have shown that LC can be detected years before diagnosis and manifestation of disease symptoms independently of histological subtype. However, the highest AUCs were achieved for specific subtypes and time intervals before diagnosis. The collection of models may therefore also predict the severity of cancer development and its histology. Our study demonstrates that serum RNAs can be promising prediagnostic biomarkers in a LC screening setting, from early detection to risk assessment.

Data availability

The datasets generated for this manuscript are not readily available because of the principles and conditions set out in articles 6 (1) (e) and 9 (2) (j) of the General Data Protection Regulation (GDPR). National legal basis as per the Regulations on population-based health surveys and ethical approval from the Norwegian Regional Committee for Medical and Health Research Ethics (REC) is also required. Requests to access the datasets should be directed to the corresponding authors with a project proposal. Please refer to our project website for the latest information on data sharing (kreftregisteret.no/en/janusrna). Our scripts, plot data, and bioinformatics workflow files can be accessed from our Github repo (https://github.com/sinanugur/LCscripts).

The following data sets were generated

Article and author information

Author details

  1. Sinan Uğur Umu

    Department of Research, Cancer Registry of Norway, Oslo, Norway
    For correspondence
    sinan.ugur.umu@kreftregisteret.no
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-8081-7819
  2. Hilde Langseth

    Department of Research, Cancer Registry of Norway, Oslo, Norway
    Competing interests
    The authors declare that no competing interests exist.
  3. Verena Zuber

    Department of Epidemiology and Biostatistics, Imperial College London, London, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  4. Åslaug Helland

    Department of Oncology, Oslo University Hospital, Oslo, Norway
    Competing interests
    The authors declare that no competing interests exist.
  5. Robert Lyle

    Department of Medical Genetics, Oslo University Hospital, Oslo, Norway
    Competing interests
    The authors declare that no competing interests exist.
  6. Trine B Rounge

    Department of Research, Cancer Registry of Norway, Oslo, Norway
    For correspondence
    trine.rounge@kreftregisteret.no
    Competing interests
    The authors declare that no competing interests exist.

Funding

The Research Council of Norway (Human Biobanks and Health Data,[229621/H10,248791/H10])

  • Hilde Langseth
  • Trine B Rounge

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. YM Dennis Lo, The Chinese University of Hong Kong, Hong Kong

Ethics

Human subjects: This study was approved by the Norwegian Regional Committee for medical and health researchethics (REC no: 19892 previous 2016/1290) and was based on broad consent from participants in the Janus cohort. The work has been carried out in compliance with the standards set by the Declaration of Helsinki.

Version history

  1. Received: June 6, 2021
  2. Accepted: February 9, 2022
  3. Accepted Manuscript published: February 11, 2022 (version 1)
  4. Version of Record published: February 28, 2022 (version 2)

Copyright

© 2022, Umu et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 2,006
    views
  • 373
    downloads
  • 14
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Sinan Uğur Umu
  2. Hilde Langseth
  3. Verena Zuber
  4. Åslaug Helland
  5. Robert Lyle
  6. Trine B Rounge
(2022)
Serum RNAs can predict lung cancer up to 10 years prior to diagnosis
eLife 11:e71035.
https://doi.org/10.7554/eLife.71035

Share this article

https://doi.org/10.7554/eLife.71035

Further reading

    1. Cancer Biology
    2. Genetics and Genomics
    Ting Zhang, Alisa Ambrodji ... Steven M Offer
    Research Article

    Enhancers are critical for regulating tissue-specific gene expression, and genetic variants within enhancer regions have been suggested to contribute to various cancer-related processes, including therapeutic resistance. However, the precise mechanisms remain elusive. Using a well-defined drug-gene pair, we identified an enhancer region for dihydropyrimidine dehydrogenase (DPD, DPYD gene) expression that is relevant to the metabolism of the anti-cancer drug 5-fluorouracil (5-FU). Using reporter systems, CRISPR genome-edited cell models, and human liver specimens, we demonstrated in vitro and vivo that genotype status for the common germline variant (rs4294451; 27% global minor allele frequency) located within this novel enhancer controls DPYD transcription and alters resistance to 5-FU. The variant genotype increases recruitment of the transcription factor CEBPB to the enhancer and alters the level of direct interactions between the enhancer and DPYD promoter. Our data provide insight into the regulatory mechanisms controlling sensitivity and resistance to 5-FU.

    1. Cancer Biology
    2. Epidemiology and Global Health
    Lijun Bian, Zhimin Ma ... Guangfu Jin
    Research Article

    Background:

    Age is the most important risk factor for cancer, but aging rates are heterogeneous across individuals. We explored a new measure of aging-Phenotypic Age (PhenoAge)-in the risk prediction of site-specific and overall cancer.

    Methods:

    Using Cox regression models, we examined the association of Phenotypic Age Acceleration (PhenoAgeAccel) with cancer incidence by genetic risk group among 374,463 participants from the UK Biobank. We generated PhenoAge using chronological age and nine biomarkers, PhenoAgeAccel after subtracting the effect of chronological age by regression residual, and an incidence-weighted overall cancer polygenic risk score (CPRS) based on 20 cancer site-specific polygenic risk scores (PRSs).

    Results:

    Compared with biologically younger participants, those older had a significantly higher risk of overall cancer, with hazard ratios (HRs) of 1.22 (95% confidence interval, 1.18–1.27) in men, and 1.26 (1.22–1.31) in women, respectively. A joint effect of genetic risk and PhenoAgeAccel was observed on overall cancer risk, with HRs of 2.29 (2.10–2.51) for men and 1.94 (1.78–2.11) for women with high genetic risk and older PhenoAge compared with those with low genetic risk and younger PhenoAge. PhenoAgeAccel was negatively associated with the number of healthy lifestyle factors (Beta = –1.01 in men, p<0.001; Beta = –0.98 in women, p<0.001).

    Conclusions:

    Within and across genetic risk groups, older PhenoAge was consistently related to an increased risk of incident cancer with adjustment for chronological age and the aging process could be retarded by adherence to a healthy lifestyle.

    Funding:

    This work was supported by the National Natural Science Foundation of China (82230110, 82125033, 82388102 to GJ; 82273714 to MZ); and the Excellent Youth Foundation of Jiangsu Province (BK20220100 to MZ).