Cancer: Beware the algorithm

Spliced peptides present on tumor cells can help to mount an immune response, but algorithms offer limited help in predicting which ones actually exist and perform this role in vivo.
Peter van Endert (corresponding author)
Institut National de la Santé et de la Recherche Médicale, Unité 1151, Université de Paris, Centre National de la Recherche Scientifique, UMR 8253, France

The human immune system is a formidable surveillance system that helps to keep cancers in check. Killer T cells, for example, can spot and deactivate tumors: more precisely, they can recognize short peptides which are displayed on the surface of harmful cells by a group of molecules called human leukocyte antigens or HLA (Klein and Sato, 2000). Many variations of the HLA genes exist, each coding for a slightly different molecule that can only bind to a limited set of peptides. In turn, these peptides are created inside target cells through a complex protein degradation process supported by a large enzyme known as the proteasome (Rock et al., 2010). For killer T cells to specifically deactivate tumors, cancer cells should be carrying at least one type of HLA molecule that can bind to peptides produced exclusively or primarily in these diseased cells. It is very rare, however, to find a peptide that is only present on tumors.

One way to overcome this obstacle is to focus on the altered peptides produced by driver mutations in genes that regulate cell growth, and are therefore often changed in cancer (Blankenstein et al., 2015). Algorithms could help in that search. These computer-implementable instructions are developed using existing data to ‘automatically’ predict the outcomes of complex biological processes, such as which peptides could be generated by the protein degradation process. Yet algorithms are never failsafe, and they can even be treacherous when fed sketchy data. Now, in eLife, Gerald Willimsky, Peter Kloetzel and colleagues at the Charité hospital in Berlin and various German institutions report having experienced this the hard way (Willimsky et al., 2021).

The team was hunting peptides that could trigger or boost the activity of killer T cells against tumors, seeking to exploit the KRASG12V and RAC2P29L driver mutations. But they found that the peptides coded by the mutated genes could not bind to HLA-A2, the most frequent HLA variant in Caucasians. This led the researchers to turn to a published algorithm that predicted the production of ‘spliced peptides’ that fit the HLA-A2 molecule (Mishto et al., 2019).

Peptide splicing is a fairly new and partly controversial concept in immunology. It proposes that the proteasome sometimes produces two peptides which can fuse, resulting in a ‘spliced peptide’ containing two fragments of the source protein but lacking several amino acids in-between (Vigneron et al., 2017). Solid data show that a small number of these peptides are actually produced in vitro, in isolated live cells, and in vivo: according to some authors, up to 25% of all peptides that bind to HLA molecules are thought to be spliced peptides, but this value could be much lower (Liepe et al., 2016; Mylonas et al., 2018). A small number of spliced peptides have been shown to activate specific killer cell responses in mouse models (Hanada et al., 2004; Warren et al., 2006).
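To make the idea concrete, here is a minimal Python sketch (not the published algorithm) that enumerates candidate spliced peptides by joining two fragments of one protein in their original order while skipping at least one residue. The toy sequence, the 9-residue peptide length, the 25-residue cap on the skipped stretch, and the names cis_spliced_peptides and covers_position are all illustrative assumptions; real predictors also have to account for proteasome cleavage preferences and HLA binding, which this sketch ignores.

```python
from typing import Iterator, NamedTuple


class SplicedPeptide(NamedTuple):
    sequence: str  # fused peptide sequence
    first: tuple   # (start, end) of the N-terminal fragment; 0-based, end exclusive
    second: tuple  # (start, end) of the C-terminal fragment; 0-based, end exclusive


def cis_spliced_peptides(protein: str, length: int = 9, max_gap: int = 25,
                         min_fragment: int = 1) -> Iterator[SplicedPeptide]:
    """Yield peptides made of two fragments of `protein`, kept in their
    original order, with 1..max_gap intervening residues removed."""
    n = len(protein)
    for len1 in range(min_fragment, length - min_fragment + 1):
        len2 = length - len1
        for s1 in range(0, n - len1 + 1):
            e1 = s1 + len1
            # The second fragment starts after skipping at least one residue
            # and must still fit inside the protein.
            last_start = min(e1 + max_gap, n - len2)
            for s2 in range(e1 + 1, last_start + 1):
                e2 = s2 + len2
                yield SplicedPeptide(protein[s1:e1] + protein[s2:e2],
                                     (s1, e1), (s2, e2))


def covers_position(peptide: SplicedPeptide, position: int) -> bool:
    """True if the 0-based source position (e.g. a mutated residue)
    lies inside either fragment of the spliced peptide."""
    return any(start <= position < end
               for start, end in (peptide.first, peptide.second))


# Toy sequence and mutation index, both hypothetical (not KRAS or RAC2).
toy_protein = "ACDEFGHIKLMNPQRSTVWY"
mutated_index = 5
candidates = [p for p in cis_spliced_peptides(toy_protein)
              if covers_position(p, mutated_index)]
print(len(candidates), "candidate spliced 9-mers cover the mutated residue")
```

Even this naive enumeration returns many more candidates than there are contiguous peptides of the same length, which hints at why each predicted sequence needs experimental confirmation.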

When Willimsky et al. used the algorithm to predict which spliced peptides could match the HLA-A2 allele, several sequences were returned both for KRASG12V and RAC2P29L. This prompted the team to embark on a series of in vitro and in vivo experiments to check whether these peptides could actually bind to HLA-A2. And indeed, when mice that had been genetically modified to express human HLA-A2 were exposed to the peptides, this led to the production of killer T cells that could react to these sequences. Willimsky et al. then genetically modified certain human immune cells to express specific T cell receptors, and these could spot and kill HLA-A2-expressing cells that had been pre-incubated with the relevant peptides. Both mouse and human killer cells were therefore fully able to respond to the mutant tumor peptides.

However, further in vitro experiments showed that proteasome digestions only produced the RAC2P29L spliced peptide. More importantly, highly sensitive killer T cells were unable to recognize and deactivate tumor cell lines that expressed the mutant proteins, even when the cells overexpressed pieces of the mutant proteins containing the two fragments that fuse together to form the spliced peptide. This means that, in live cells, the splicing either did not happen or it did not create enough peptide to activate a response by the killer T cells (Figure 1).

Figure 1. Algorithms poorly predict which spliced peptides can help the immune system recognize cancer cells.

Two proteins that often carry a mutation (red dot) that drives cancer (KRASG12V and RAC2P29L) are chosen for further exploration (A). An algorithm predicts multiple potential spliced peptides encompassing the mutations for each protein (B). A second algorithm identifies a small number of putative spliced peptides predicted to bind to HLA-A2 on the surface of target cells (C). In vitro, the proteasome does actually generate a predicted spliced peptide carrying the mutation for RAC2 but not for KRAS (D). Exposing mice to the predicted spliced peptides generates killer T cells that identify the peptides with high affinity (E). The T cell receptors that bind to the spliced peptides are successfully transferred to human immune cells called lymphocytes (F). These ‘transformed’ cells efficiently recognize tumor cells pulsed with the synthetic spliced peptides (G). However, different tumor cell lines that express the mutant proteins (but are not artificially equipped with the spliced peptides) are not recognized by the transformed human immune cells. This suggests that, despite the algorithm’s prediction, these peptides are not produced (or are not produced in large enough numbers) in actual cells (H).

What can be learnt from what Willimsky et al. certainly considered a setback? These results could be dismissed simply as bad luck: after all, algorithmic predictions of non-spliced peptides are not foolproof either. Even without considering peptide splicing, the outcome of protein degradation in cells is notoriously difficult to predict. In future research, it is certainly sensible to test early on whether predicted spliced peptides are actually produced in live cells.

Nevertheless, it is likely that using algorithms to predict spliced peptide production is still premature. High-quality data verifying that these putative sequences are indeed produced in vitro under physiological conditions, as well as in live cells, are lacking. Such studies are sorely needed to improve future algorithms and find new targets for cancer treatment.

References

Klein J, Sato A (2000) The HLA system. New England Journal of Medicine 343:702–709. https://doi.org/10.1056/NEJM200009073431006

Article and author information

Author details

  1. Peter van Endert

    Peter van Endert is in the Institut National de la Santé et de la Recherche Médicale, Unité 1151, Université de Paris, Centre National de la Recherche Scientifique, UMR 8253, Paris, France

    For correspondence: peter.van-endert@inserm.fr
    Competing interests: No competing interests declared
    ORCID: 0000-0003-3782-0750

Publication history

  1. Version of Record published: May 26, 2021 (version 1)

Copyright

© 2021, van Endert

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Cite this article

Peter van Endert (2021) Cancer: Beware the algorithm. eLife 10:e69657. https://doi.org/10.7554/eLife.69657
