Risk Modeling: Predicting cancer risk based on family history

A new software package provides more accurate cancer risk prediction profiles and has the ability to integrate more genes and cancer types in the future.
Michelle F Jacobs (corresponding author)
Internal Medicine, University of Michigan, United States

Countless hours have been dedicated to researching cancer – how to prevent it, how to diagnose it early, and how to treat it. Yet, cancer remains a leading cause of death worldwide, accounting for almost 10 million fatalities in 2020.

Most cancers are caused by changes to genes that happen over a person’s lifetime. In rarer cases (about 5–10%), they start due to inherited genetic mutations that produce a predisposition to cancer. In these instances, also known as familial or hereditary cancer syndromes, the mutation is passed down from generation to generation. In these families, more members tend to develop cancers than expected – often of the same or related type – which can also start at a particularly early age.

It is important to identify people with such genetic mutations so that they – and any family members at higher risk – can undergo enhanced cancer screening. Family history can be a useful predictor of hereditary cancer risk (Blackford and Parmigiani, 2010). As such, risk prediction models that incorporate family history to estimate a person’s chance of having a mutation in a cancer predisposition gene or of developing cancer have been employed for many years (Chen et al., 2004).

Historically, such models have been particularly valuable for deciding whom to offer genetic testing when only a few, often costly, genetic tests were available (Fasching et al., 2007). In some cases, insurance companies require the estimated risk of carrying a cancer-related genetic mutation to exceed a certain threshold (typically 5 or 10%) before they reimburse the cost of a genetic test (Chen et al., 2006). As research advances, the number of genes available for cancer-related genetic testing has reached over 100 and is likely to continue increasing. Nevertheless, older risk modeling programs generally include only a small number of genes in their predictions. Now, in eLife, Danielle Braun and colleagues – including Gavin Lee and Jane Liang as joint first authors – report on a new software package that has the capacity to evolve alongside advances in cancer research (Lee et al., 2021).

The researchers, who are based at ETH Zürich, EPFL, Harvard, the Dana-Farber Cancer Institute, and the Broad Institute, developed PanelPRO, a tool that uses evidence gathered from extensive literature reviews to model the complex interplay between genes and cancer risk. PanelPRO’s workflow consists of four main parts: input, preprocessing, algorithm, and output (Figure 1).

Figure 1. Workflow for PanelPRO.

First, information on family history, including cancer diagnoses, age of relatives, and cancer risk factors is added into the risk modeling software PanelPRO (input, blue box on the left). Then, PanelPRO validates data formatting (preprocessing, grey oval), and analyses information about frequency and cancer risks for family cancer syndromes (algorithm, grey box) to estimate the likelihood of a person in a family having a mutation in a gene linked to an increased risk of cancer (output, green box on the right). Mutation probability and cumulative cancer risk are given as a probability between 0.0 (no risk) and 1.0 (100% risk).

The user first adds information about a history of cancers in a family – such as ages and cancer diagnoses – and other factors that might affect cancer risk. These include any risk-reducing surgeries in relatives, or tumors with biomarkers that might indicate a potential hereditary cause of their cancer. The software then adds information on the frequency of different hereditary cancer syndromes and assesses their associated cancer risks. PanelPRO can currently accommodate 18 types of cancer and generate predictions of probable mutations for 24 genes, but its code allows for the addition of new cancers or cancer-related genes that may be identified in the future.

During the preprocessing stage, the software checks the input for missing information and for family relationships that it does not support, such as ‘double cousins’, which occur when two siblings have children with two siblings from another family. If any issues are detected, the user may receive messages, warnings, or errors.
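The kind of check described above can be sketched in a few lines of Python. This is a minimal illustration, not PanelPRO's actual preprocessing: the `Relative` record, its field names, and the `check_pedigree` function are hypothetical, and the sketch looks only for missing ages, unknown parents, and inconsistent ages. It does not attempt to detect unsupported relationships such as double cousins.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Relative:
    # Hypothetical record layout; not PanelPRO's actual input format.
    person_id: str
    mother_id: Optional[str] = None
    father_id: Optional[str] = None
    current_age: Optional[int] = None
    cancer_age: Optional[int] = None  # age at diagnosis; None if unaffected or unknown


def check_pedigree(relatives: list[Relative]) -> tuple[list[str], list[str]]:
    """Return (warnings, errors) found in a family history."""
    warning_msgs: list[str] = []
    error_msgs: list[str] = []
    known_ids = {r.person_id for r in relatives}
    for r in relatives:
        # Missing information is reported but does not block the analysis.
        if r.current_age is None:
            warning_msgs.append(f"{r.person_id}: current age missing")
        # A parent that is referenced but never described is treated as an error.
        for parent_id in (r.mother_id, r.father_id):
            if parent_id is not None and parent_id not in known_ids:
                error_msgs.append(f"{r.person_id}: parent {parent_id} not in pedigree")
        # A diagnosis cannot happen after the person's current age.
        if (
            r.current_age is not None
            and r.cancer_age is not None
            and r.cancer_age > r.current_age
        ):
            error_msgs.append(f"{r.person_id}: diagnosis age exceeds current age")
    return warning_msgs, error_msgs


family = [
    Relative("mother", current_age=62, cancer_age=45),
    Relative("proband", mother_id="mother", father_id="father"),
]
warning_msgs, error_msgs = check_pedigree(family)
```

Separating warnings from errors mirrors the distinction in the text between issues the user should know about and issues that must be fixed before the model can run.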

After the information has been checked and modified as needed, the model proceeds to the algorithm stage. To calculate the output, the algorithm uses probabilities based on the family history, the frequency of hereditary cancer syndromes in the population, and the cancer history that would be expected if a cancer syndrome were present. The program then estimates the likelihood that a person in the family has a mutation in a gene linked to an increased risk of cancer, and provides a personalized estimate of that person’s future cancer risks. The same calculations can easily be run for other family members using the existing information, and users can choose which cancer types and genes to display.
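The reasoning in this stage is essentially Bayesian: a population frequency acts as the prior, and the observed family history is weighed against what would be expected with and without a mutation. The sketch below illustrates the idea for a toy family of mother, unobserved father, and child, with a single gene and a single cancer. It is a simplified illustration rather than PanelPRO's algorithm, and every number and function name in it is hypothetical.

```python
from itertools import product

# All values are hypothetical and chosen only for illustration.
CARRIER_PREVALENCE = 0.005  # prior probability that a parent carries the mutation
RISK_IF_CARRIER = 0.60      # chance of cancer by a given age for carriers
RISK_IF_NONCARRIER = 0.10   # chance of cancer by the same age for non-carriers


def founder_prior(is_carrier: int) -> float:
    """P(carrier status) for a parent, taken from the population frequency."""
    return CARRIER_PREVALENCE if is_carrier else 1.0 - CARRIER_PREVALENCE


def transmission(child: int, mother: int, father: int) -> float:
    """P(child carrier status | parents); a carrier parent transmits with probability 0.5."""
    p_child_carrier = 1.0 - (1.0 - 0.5 * mother) * (1.0 - 0.5 * father)
    return p_child_carrier if child else 1.0 - p_child_carrier


def phenotype_likelihood(is_carrier: int, affected: bool) -> float:
    """P(observed cancer status | carrier status)."""
    risk = RISK_IF_CARRIER if is_carrier else RISK_IF_NONCARRIER
    return risk if affected else 1.0 - risk


def child_carrier_probability(mother_affected: bool, child_affected: bool) -> float:
    """Posterior P(child is a carrier | cancer status of mother and child)."""
    joint = {0: 0.0, 1: 0.0}
    # Enumerate every combination of carrier statuses; the father is unobserved,
    # so he contributes only his prior and his effect on transmission.
    for mother, father, child in product((0, 1), repeat=3):
        joint[child] += (
            founder_prior(mother)
            * founder_prior(father)
            * transmission(child, mother, father)
            * phenotype_likelihood(mother, mother_affected)
            * phenotype_likelihood(child, child_affected)
        )
    return joint[1] / (joint[0] + joint[1])


with_history = child_carrier_probability(mother_affected=True, child_affected=False)
without_history = child_carrier_probability(mother_affected=False, child_affected=False)
```

With these toy inputs, `with_history` is larger than `without_history`, which is the sense in which a family history of cancer raises the estimated chance of carrying a mutation. Brute-force enumeration works for three people but grows exponentially with family size, so a real tool needs a more efficient strategy for large pedigrees and many genes.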

However, some outstanding issues remain. Misreported family history information, such as an inaccurate cancer diagnosis or an unknown age at diagnosis, can significantly affect estimates, so the accuracy of patient-reported information is key to producing correct estimates (Katki, 2006). While patients have been shown to generally provide accurate information on cancer history for first-degree relatives, the accuracy of these reports decreases for more distant relations (Augustinsson et al., 2018; Murff et al., 2004).

Moreover, analyses with similar risk modeling software have revealed that strict adherence to a 10% risk threshold to qualify for a test for a probable mutation in the BRCA genes (which are linked to an increased risk of developing breast, ovarian, and other cancers) would miss around 25% of individuals carrying a mutation when compared to genetic testing outcomes (Varesco et al., 2013). This is likely because cancer risks associated with hereditary cancer syndromes are more variable than initially appreciated, and not all family histories may exhibit a predictable pattern of cancer, even when a mutation is present (Okur and Chung, 2017). This complicates risk assessments and argues against making decisions about genetic testing solely based on risk prediction models. Today, broader insurance coverage guidelines and lower costs for genetic tests have increased clinicians’ ability to order these tests, even if certain risk thresholds are not met based on family history.
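The arithmetic behind a missed-carrier figure of this kind is simple to reproduce. The sketch below uses invented predictions, not data from Varesco et al., and the function name `missed_carrier_fraction` is hypothetical. The toy values were chosen so that one of four carriers falls below a 10% threshold.

```python
def missed_carrier_fraction(predictions, threshold):
    """Fraction of true carriers whose predicted probability is below the threshold.

    `predictions` is a list of (predicted_probability, is_carrier) pairs, where
    is_carrier is the result of genetic testing.
    """
    carrier_scores = [p for p, is_carrier in predictions if is_carrier]
    missed = [p for p in carrier_scores if p < threshold]
    return len(missed) / len(carrier_scores)


# Invented example: four carriers and three non-carriers.
toy_predictions = [
    (0.32, True),
    (0.15, True),
    (0.11, True),
    (0.06, True),   # a carrier whose family history looks unremarkable
    (0.12, False),
    (0.04, False),
    (0.02, False),
]
fraction_missed = missed_carrier_fraction(toy_predictions, threshold=0.10)
```

Here `fraction_missed` is 0.25: the carrier scored at 0.06 would not qualify for testing under a strict 10% rule, even though the mutation is present.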

Nevertheless, the higher number of genes and cancer types supported by PanelPRO compared to other risk models is impressive, and its ability to incorporate new genes and cancer types as testing advances is key in this fast-paced, constantly advancing field.

References

  1. Blackford A, Parmigiani G (2010) Familial cancer risk assessment using BayesMendel. In: Michael F. O, John T. C, editors. Biomedical Informatics for Cancer Research. Springer. pp. 301–314. https://doi.org/10.1007/978-1-4419-5714-6

Article and author information

Author details

Michelle F Jacobs is in the Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, United States

For correspondence: mfjac@med.umich.edu
Competing interests: No competing interests declared
ORCID: 0000-0002-0458-1952

Copyright

© 2021, Jacobs

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Cite this article

Michelle F Jacobs (2021) Risk Modeling: Predicting cancer risk based on family history. eLife 10:e73380. https://doi.org/10.7554/eLife.73380
