In vitro proteasome processing of neo-splicetopes does not predict their presentation in vivo
Abstract
Proteasome-catalyzed peptide splicing (PCPS) of cancer-driving antigens could generate attractive neoepitopes to be targeted by T cell receptor (TCR)-based adoptive T cell therapy. Based on a spliced peptide prediction algorithm, TCRs were generated against putative KRASG12V- and RAC2P29L-derived neo-splicetopes with high HLA-A*02:01 binding affinity. TCRs generated in mice with a diverse human TCR repertoire specifically recognized the respective target peptides with high efficacy. However, we failed to detect any neo-splicetope-specific T cell response when testing the in vivo neo-splicetope generation and obtained no experimental evidence that the putative KRASG12V- and RAC2P29L-derived neo-splicetopes were naturally processed and presented. Furthermore, only the putative RAC2P29L-derived neo-splicetopes was generated by in vitro PCPS. The experiments pose severe questions on the notion that available algorithms or the in vitro PCPS reaction reliably simulate in vivo splicing and argue against the general applicability of an algorithm-driven ‘reverse immunology’ pipeline for the identification of cancer-specific neo-splicetopes.
Introduction
Defined anti-tumor CD8+ T cell responses require the proteasome-dependent processing of intracellular proteins and the efficient generation of antigenic peptides presented in the context of HLA class I molecules at the cell surface for TCR recognition. An important step in defining the proteasome as HLA class I epitope generation machine was the early observation that purified 20S proteasomes in combination with synthetic polypeptide substrates encompassing the epitope of interest reproduced the in vivo generation of these epitopes with high fidelity (Boes et al., 1994; Guillaume et al., 2012; Kessler et al., 2006; Niedermann et al., 1995; van der Bruggen and Van den Eynde, 2006). Thus, in vitro antigen processing experiments in combination with specific CD8+ T cells to monitor HLA class I binding and immune recognition are a widely used reliable tool to verify the generation efficiency of antigenic peptides of viral, bacterial, and human origin.
Our view on antigen processing was significantly extended by analysis of cancer patient-derived CD8+ T cells revealing that by proteasome-catalyzed peptide splicing (PCPS) proteasomes can also fuse excised peptide fragments in a reverse proteolysis reaction, thereby generating new immune reactive spliced epitopes (splicetopes) with an amino acid sequence that differs from that of the substrate protein (Hanada et al., 2004; Vigneron et al., 2004). The isolation of splicetope-specific CD8+ T cells from cancer patients and the finding that splicetope-specific CD8+ T cells derived from tumor-infiltrating lymphocytes (TILs) inhibited the engraftment of human acute myeloid leukemia cells in SCID mice indicated the potential immune relevance of such tumor antigen-derived splicetopes (Robbins et al., 1994).
Importantly, for the fibroblast growth factor (FGF)-5 and several splicetopes derived from the tumor differentiation antigen gp100mel, in vitro proteasome splicing reactions were also found to mimic the in vivo splicing reactions (Ebstein et al., 2016; Warren et al., 2006), suggesting that in vitro PCPS reactions may be a useful tool to discover new spliced epitopes generated from tumor antigens of interest. To be able to identify splicetopes in in vitro PCPS experiments independent of the availability of patient-derived CD8+ T cells, we developed the prediction algorithms ProteaJ (Liepe et al., 2010) and the here-described ProtAG. Using these algorithms, we established an inclusion list of potentially immune-relevant spliced peptides theoretically generated from a given antigen, which in combination with the mass spectrometric analysis of the in vitro digest should allow the identification of new splicetopes. Testing the feasibility of such an algorithm-aided ‘reverse immunology’ approach, we had isolated CD8+ T cells from Listeria monocytogenes–infected mice that specifically recognized two phospholipase PlcB-derived splicetopes generated by the proteasome in vitro and in vivo (Platteel et al., 2017).
Targeting somatic cancer-specific driver mutations derived from neoantigens by TCR-mediated adoptive T cell transfer (ATT) represents a promising approach for personalized cancer therapy (Blankenstein et al., 2015). One drawback of this approach is that often neoepitopes may not exhibit HLA class I binding affinities sufficient to trigger an efficient T cell response or not be generated efficiently by the proteasome. In fact, even if a suitable neoepitope is generated, its HLA haplotype specificity frequently does not match with the patient's HLA class I allele, consequently excluding these tumor patients from ATT.
As outlined above, taking advantage of PCPS for the identification of spliced neoepitopes (neo-splicetopes) may therefore represent an interesting approach to identify suitable TCR targets when the recurrent somatic mutations in a tumor antigen do not result in the production of a non-spliced tumor neoepitope either exhibiting a sufficient HLA class I binding affinity or the appropriate HLA class I haplotype. Furthermore, due to the ligation of two distant generated peptide fragments PCPS not only possesses the interesting potential to generate high-affinity neo-splicetopes harboring the respective somatic mutation but also to extend the HLA haplotype diversity of epitopes generated from a given neoantigen.
In a proof-of-principle ‘reverse immunology’ study, we here identified HLA-A*02:01 restricted putative neo-splicetopes predicted by spliced peptide prediction algorithm derived from the two recurrent somatic mutations KRASG12V and RAC2P29L. TCRs specific for the putative neo-splicetopes were generated in huTCR-α/huTCR-β gene loci transgenic HLA-A*02:01 mice (Li et al., 2010). TCRs recognized the respective putative neo-splicetope with high efficacy when tested in vitro. However, we failed to detect a neo-splicetope-specific T cell response when testing the in vivo (in cellulo) generation of the predicted neo-splicetopes and thus failed to gain evidence that the two KRASG12V and RAC2P29Lderived neo-splicetopes were also generated in vivo as predicted by algorithm-aided studies. In addition, only the predicted neo-splicetope for RAC2P29L could be confirmed by in vitro proteasomal digest. The experiments pose severe questions on the applicability of the previously highlighted pipeline (Mishto et al., 2019) for the identification of immune-relevant neo-splicetopes.
Results
Prediction of KRASG12V-derived neo-splicetope
KRAS is one of the most frequently mutated genes in human cancer with the G12X (X = V, S, D, A, C) substitution accounting for most of the mutations found in this position. However, none of these mutations result in the formation of a non-spliced (linear) high-affinity HLA-A*02:01 binding neoepitope (IC50 <100 nM). Since KRAS harbors the oncogenic mutation G12V in approximately 30% of pancreatic ductal adeno carcinoma and 20% of the colon and non-small cell lung cancers, we used the algorithm ProtAG for theoretical prediction of spliced peptides delineated from the KRASG12V2-35 protein sequence. The algorithm-predicted spliced 9mer peptides were submitted to netMHCpan 4.0 (Jurtz et al., 2017) to screen for putative neo-splicetopes with predicted HLA-A*02:01 binding affinity IC50 <100 nM and carrying the mutant V12 amino acid residue. Using this approach led to the identification of putative nonamer neo-splicetopes (KRASG12V5-8/10-14, KLVV/GAVGV, IC50 33.4; sp1), (KRASG12V5-9/11-14, KLVVV/AVGV, IC50 37.50; sp2), and (KRASG12V5-10/12-14, KLVVVG/VGV, IC50 72.90; sp4), of which sp1 and sp2 allowed for TCR generation (Blankenstein et al., 2019).
Generation of KRASG12Vsplicetope-specific TCRs in a humanized mouse model
To analyze the immunogenicity of spliced epitopes, we utilized transgenic mice (ABabDII mice) that harbor the human TCRαβ gene loci as a source for a diverse human TCR repertoire that is selected by chimeric HLA-A*02:01 (Li et al., 2010). Upon immunization with the peptides KLVVGAVGV and KLVVVAVGV (representing sp1 and sp2, respectively), these mice mounted a CD8+ T cell response detected by in vitro re-stimulation of peripheral blood lymphocytes 7 days after the last immunization, whereas mice without immunization did not show reactivity (Figure 1A, Figure 1—figure supplement 1A). Both peptides induced a specific CD8+ T cell response. By sorting IFNγ-positive sp1- and sp2-reactive CD8+ T cells from splenocytes of responder mice using IFNγ-capture assay (not shown), specific TCRs were isolated upon rapid amplification of cDNA end (5′RACE)-PCR and cloning of the most abundant rearranged TCR-α and TCR-β genes for each individual mouse. One TCR directed against sp1 epitope (1376) and two TCRs specific for sp2 epitope (9383B2 and 9383B14) were isolated. Codon-optimized sequences encoding the α- and β-chains were linked with a P2A element and inserted into retroviral expression vector pMP71, transduced into human peripheral blood mononuclear cells (PBMC) (Figure 1B, Figure 1—figure supplement 1B) and tested for specificity measuring release of IFNγ in a co-culture with TAP-deficient T2 cells loaded with titrated amounts of sp1 (Figure 1C) or sp2 peptides (Figure 1—figure supplement 1C), respectively. All three TCRs induced robust IFNγ release at peptide concentrations of up to 10−10 M, suggesting high functional avidity for these TCRs. For TCR1376 and TCR9383B2, cross-reactivity to the in silico predicted linear KRASG12V KLVVVGAVGV peptide was only seen for the highest peptide concentrations (Figure 1D and not shown, respectively).
KRASG12Vsplicetope-specific TCRs do not recognize cancer cells endogenously expressing mutant KRASG12V
One of the critical tests for the usefulness of therapeutic TCRs in genetically modified T cells is the recognition of cancer cells that endogenously express the respective mutation. This approach was even more decisive for our approach because so far the predicted neo-splicetopes had been predicted in silico but not detected in cells. Therefore, PBMC genetically engineered to express sp1- and sp2-specific TCRs were co-cultured with a series of cancer cell lines that harbored the G12V mutation within the KRAS gene. MCF7 and Mel624 cells with two KRAS wildtype copies served as controls. Whereas some of the cell lines used expressed HLA-A*02:01 (Figure 2A, Figure 2—figure supplement 1A, B), the HLA-A*02:01-negative cell lines were transduced with an HLA-A*02:01 expressing retroviral construct (Figure 2B). Presence of sufficient amounts of HLA-A*02:01 for T cell recognition was analyzed by prior loading of the tumor cells with 10−6 M of the respective peptide. In all cases, peptide-loaded cancer cells were recognized by TCR1376 (Figure 2A, B, red bars) or TCR9383B2 (Figure 2—figure supplement 1A, B, red bars) and TCR9383B14 (Figure 2—figure supplement 1A, B, orange bars) transduced T cells, respectively. In contrast, IFNγ release by TCR1376 (Figure 2A, B) or TCR9383B2 and TCR9383B14 engineered T cells (Figure 2—figure supplement 1A, B) was not above background when cancer cells were co-cultured without prior peptide loading, indicating that the endogenous KRASG12V protein is not recognized. Cancer cells were also treated with IFNγ 48 hr prior to co-culture with the respective TCR-modified PBMCs. As exemplarily shown for sp2-specific TCRs, again only peptide-loaded tumor cells were recognized (Figure 2—figure supplement 1B).
T cells harboring KRASG12Vsplicetope-specific TCRs do not recognize overexpressed KRASG12V
One challenge of targeting neoepitopes with T cells is the low abundance of many neoantigens on the surface of the respective HLA class I molecules that may hamper recognition by T cells. To exclude low expression level as one reason for the failure of TCR1376-engineered T cells to recognize the spliced form of the KRASG12V peptide on the cancer cells, we generated cancer cells (MCF7, Mel624, and mouse NIH-HHD) that ectopically overexpressed triple minigenes encoding three copies of the KRASG12V mutation interconnected by an AAY sequence that ensures proteasomal cleavage (Spiotto et al., 2002). We therefore generated triple minigene cassettes that either encoded the N-terminal 35mer polypeptide of KRASG12V1-35 or as control triple minigenes that encoded the predicted non-spliced 10mer KRASG12V5-14 peptide epitope, the spliced 9mer KRASG12V5-8/10-14, or KRASwt5-8/10-14 peptide epitope, respectively (Figure 3A). As shown in Figure 3B–D, TCR1376-positive T cells efficiently recognized the KRASG12V5-8/10-14 peptide when loaded either onto MCF7 (Figure 3B), Mel624 (Figure 3C), and mouse NIH-HHD (Figure 3D) cells or when expressed as a triple epitope. In contrast, no IFNγ release was elicited with cells expressing the triple KRASG12V1-35 35mer polypeptide (Figure 3B–D). Quantitative PCR analysis of KRASG12V triple minigene 35mer and KRASG12V triple epitope spliced nonamer revealed that KRASG12V triple minigene 35mer is expressed almost twice as high as the KRASG12V triple epitope spliced nonamer (Figure 3E). Altogether, this indicates that the spliced peptide, theoretically predicted, is either not generated in vivo or, despite the overexpression of the KRASG12V1-35 substrate, is produced at amounts insufficient to be recognized by KRASG12V5-8/10-14-specific high-affinity T cells.
KRASG12V splice peptide-specific TCR1376 cross-reacts with HLA-C07 allele
We initially identified two cell lines with the G12V mutation (SW480 and SW620) that induced IFNγ release by TCR1376-transduced PBMC upon co-culture (Figure 3—figure supplement 1A). Upon co-culture with a panel of lymphoblastoid B cell lines (BLCLs) that harbor a series of different HLA class I molecules, a test for potential HLA allo-reactivity that we routinely perform with TCRs obtained from ABabDII mice, we uncovered reactivity to several BLCLs (Figure 3—figure supplement 1B), all of them having in common the expression of HLA-C*07 (Supplementary file 1). Reanalysis of the tumor cell lines SW480 and SW620, which originated from the same patient, confirmed expression of HLA-C*07 molecule (Supplementary file 1). To finally prove HLA allo-reactivity of the TCR1376 to HLA-C*07, we performed co-culture with the HLA-deficient myelogenous leukemia cell line K562 that had been transduced with HLA-C*07:01, HLA-C*07:02, and HLA-A*02:01 molecules, respectively. The experiments showed that K562-C*07:01 and K562-C*07:02 cell lines were recognized by three independent TCR1376-transduced PBMC donors irrespective of loading with peptide sp1, whereas K562-A02:01 cells only induced IFNγ release when these cells were loaded with sp1 peptide prior to co-culture (Figure 3—figure supplement 1C). These results clearly indicate that the TCR1376 directly recognizes members of the HLA-C*07 sub-family and/or peptides bound therein as well as sp1 peptide bound to HLA-A*02:01.
Triple 35mer polypeptide of KRASG12V1-35 minigenes are not immunogenic in vivo
In order to analyze whether triple 35mer polypeptide of KRASG12V1-35 minigenes would induce a CD8+ T cell response in vivo, we performed immunizations of ABabDII mice with an adenovirus expressing the N-terminal 35mer polypeptide of KRASG12V1-35 as a triple minigene. Despite multiple immunizations, neither restimulation with the linear 10mer KLVVVGAVGV nor with the spliced epitopes sp1 KLVVGAVGV, sp2 KLVVVAVGV, sp3 YLVVVGAVGV, or sp4 KLVVVGVGV induced IFNγ release by CD8+ T cells in an intracellular cytokine staining of PBMCs 7 days after the last immunization (Figure 3—figure supplement 2). This supports the notion that the KRASG12V mutation is not immunogenic in the context of HLA-A*02:01, irrespective of whether splicing events occur or not.
Predicted KRASG12V spliced peptides are not generated in in vitro PCPS reactions
Our failure to detect immune-reactive KRASG12V-derived neo-splicetopes under in vivo conditions raised doubts with respect to the reliability of the previously proposed solely algorithm-based pipeline for identification of immune-relevant neo-splicetopes (Mishto et al., 2019). Therefore, we studied the generation of the KRASG12V-derived neo-splicetopes in more detail in in vitro PCPS assays. Accordingly, the polypeptide substrates KRASG12V2-35, KRASG12V2-32, KRASG12V2-21, and KRASG12V2-14 were synthesized. However, due to the extreme hydrophobicity of the KRAS protein, the designed longer polypeptide substrates KRASG12V2-35 and KRASG12V2-32 encountered considerable difficulties during synthesis and subsequent purification, resulting in a highly impure product not suited for in vitro digestion experiments (Figure 3—figure supplement 3). Consequently, we used the polypeptides KRASG12V2-21 and KRASG12V2-14 for the in vitro PCPS reactions. KRASG12V2-14 was chosen based on previous data showing that C-terminal cleavage generating the C-terminal anchor residue is not essentially required to generate a spliced gp100-derived epitope (Liepe et al., 2010; Vigneron et al., 2004). Monitoring the kinetics of proteasomal spliced peptide generation represents an essential parameter for assessing the fidelity of in vitro PCPS reactions. To search for spliced peptides, a fasta data file generated with ProtAG was loaded onto PD2.1 and the kinetics analyzed with LC Quan 2.7 (Willimsky et al., 2021). At t = 0, none of the predicted spliced neo-splicetopes was identified. However, following the generation of the KRASG12V-derived putative spliced neoepitopes from the polypeptide substrates KRASG12V2-14 and KRASG12V2-21 over time in in vitro PCPS reactions (Figure 3—figure supplements 4 and 5), all three predicted 9mer KRASG12V spliced peptides (KRASG12V5-8/10-14 sp1, KRASG12V5-9/11-14, sp2, KRASG12V5-10/12-14, sp4) were found to be generated from the KRASG12V2-14 substrate (Figure 3—figure supplement 4). This corroborated previous findings of Mishto et al., 2019, who reported the in vitro generation of KRASG12V5-8/10-14 and KRASG12V5-9/11-14 from a longer KRASG12V2-35 polypeptide substrate. Using the longer KRASG12V2-21 polypeptide substrate for the in vitro PCPS reactions, only the putative neo-splicetopes KRASG12V5-8/10-14 and KRASG12V5-10/12-14 were generated (Figure 3—figure supplement 5A). The apparent contradiction between our in vivo experiments reported above and the results of the in vitro PCPS reactions was unexpected, considering that for the several spliced epitopes published so far there seemed to be a good correlation between the in vitro and in vivo results (Dalet et al., 2011; Ebstein et al., 2016; Michaux et al., 2014; Mishto et al., 2012; Platteel et al., 2017).
This led us to perform a more detailed MS analysis of the polypeptide substrate used for the in vitro PCPS experiments. Indeed, we found that most likely the accumulation of hydrophobic amino acid residues within the KRASG12V polypeptide substrates had led to mistakes during polypeptide synthesis, resulting in the synthesis of faulty polypeptides (Supplementary file 2) mimicking in sequence the results of the predicted splicing reaction (Figure 3—figure supplement 5B, Willimsky et al., 2021). Therefore, it was impossible to decide whether the candidate KRASG12V-derived spliced peptides identified in vitro were true splicing products or as it appeared the product of normal proteasomal cleavage of already preexisting faulty polypeptides inappropriately simulating a splicing event. Furthermore, depending on the substrate, in vitro generation of non-spliced epitopes can be by orders more efficient than the generation of spliced epitopes (Mishto et al., 2019). Thus, polypeptide substrates with mistakes in their sequence that are degraded at a rate similar to the rate of the correct substrate (Figure 3—figure supplement 5C) may become a prevalent source for the generation of faked spliced peptides. We eventually obtained a KRASG12V1-21 polypeptide substrate (JPT Peptide Technologies, Berlin, Germany) without contaminants mimicking the predicted KRASG12V5-8/10-14 (sp1) and KRASG12V5-10/12-14 (sp4) splicing events. However, using this new KRASG12V1-21 polypeptide as substrate for kinetic in vitro PCPS experiments, we now failed to identify generation of either predicted KRASG12V 9mer neo-splicetope. Although our experiments cannot completely exclude the generation of minor amounts of KRASG12V-derived spliced peptides, they are in line with our failure to detect any immune-reactive KRASG12V-derived spliced epitopes in vivo.
Identification and functional characterization of RAC2P29L-derived neo-splicetopes
RAC2 is a small GTPase belonging to the Rho family of GTPases. The RAC2P29L mutation is another so-called driver mutation facilitating tumor growth as well as metastasis and thus presents a potential target in ATT. The linear RAC2P29L FLGEYIPTV epitope has been predicted with an IC50 of 2 nM. To identify RAC2P29L-specific neo-splicetopes, we applied the ProtAG algorithm to predict all theoretically possible RAC2P29L20-44-derived spliced peptides. From this initial screen, we selected all theoretical linear spliced 9mer peptides with a calculated HLA-A*02:01 binding affinity of IC50 < 100 nM (Jurtz et al., 2017). To establish a cleavage map and identify all linear proteasomal cleavage products generated from the RAC2P29L20-44 polypeptide substrate, we performed in vitro digestions for 24 hr and 48 hr using erythrocyte and LcL 20S proteasomes (Willimsky et al., 2021). In these digests, also the non-spliced RAC2P29L28-36 neoepitope FLGEYIPTV was identified (Figure 4A). To search for spliced peptides, a fasta data file generated with ProtAG was loaded onto PD2.1 (Willimsky et al., 2021). In this search, the spliced RAC2P29L28-34/36-37 (28FLGEYIP34/36VF37) peptide, with a calculated HLA-A*02:01 binding affinity of IC50 = 24,7 nM, was found to be the only HLA-A*02:01 restricted putative RAC2P29L neo-splicetope with an IC50 < 100 nM generated. To confirm the initial identification of RAC2P29L28-34/36-37 kinetic in vitro, PCPS reactions were performed and analyzed by applying the LC Quan software version 2.5 (Thermo Fisher) (Figure 4B, Figure 4—figure supplement 1).
The amounts of peptides generated in an in vitro processing reaction can vary dramatically depending on the assay conditions allowing only a relative estimation. However, judging by ion counts in vitro generation of the non-spliced RAC2P29L28-36 neoepitope was approximately 200-fold more efficient than generation of the putative neo-splicetope RAC2P29L28-34/36-37 (Figure 4A).
To exclude that generation of RAC2P29L28-34/36-37 was the result of an accidental singular splicing event we screened the digests for additional PCPS products. Interestingly, the putative RAC2P29L28-34/36-37 neo-splicetope seemed to be the result of the excision of a single aa residue (T35) and the C-terminal ligation of the dipeptide 36VF37 to the N-terminal 28FLGEYIP34 fragment. However, repetitive specific ligation of a dipeptide in a PCPS reaction would require the unlikely existence of a corresponding specific dipeptide binding site close to the active site and the respective acceptor fragment.
Therefore, we hypothesized that generation of the RAC2P29L28-34/36-37 spliced peptide was the result of a multistep process involving a larger already spliced precursor product. Indeed, as shown in Figure 4B in in vitro kinetic experiments and by detailed mass spectrometric analyses, we identified the RAC2P29L28-34/36-40 FLGEYIP/VFDNY polypeptide being the largest already spliced epitope precursor peptide. Supporting that RAC2P29L28-34/36-37 was generated via precursor peptides, we also detected the corresponding N-terminal splice acceptor peptide RAC2P29L28-34 (FLGEYIP) and the C-terminal splice donor peptide RAC2P29L36-40 (VFDNY) (Figure 4C). This suggests that generation of the final RAC2P29L28-34/36-37 neo-splicetope requires an additional C-terminal proteasomal cleavage step for spliced epitope liberation (Figure 4A).
RAC2P29L28-34/36-37 splicetope-specific TCR does not recognize RAC2P29L triple epitope 45mer
Spliced RAC2P29L28-34/36-37 peptide-specific TCRs were generated by immunizing ABabDII mice with the corresponding synthetic 9mer peptides (not shown). For analysis of in vivo generation and presentation of the spliced RAC2P29L28-34/36-37 peptide, we transduced Mel21a cells to express a triple RAC2P29L1-45 45mer polypeptide minigene (Figure 5A) and monitored HLA-A*02:01 epitope presentation by T cell recognition. As shown in Figure 5B, no IFNγ release was obtained for the putative neo-splicetope RAC2P29L28-34/36-37 using TCR20967A2-transduced T cells, while peptide-loaded Mel21a cells were readily recognized and IFNγ production demonstrated the target specificity of the TCR. Thus, despite the overexpression of the RAC2P29L1-45 45mer substrate peptide, we failed to verify the in vivo generation of the RAC2P29L28-34/36-37 peptide. We also raised a TCR (TCR22894) in ABabDII mice against the linear RAC2P29L peptide. Because Mel21a cells, transduced to express a triple RAC2P29L1-45 45mer (Figure 5B) or RAC2P29L cDNA (Figure 5C), were readily recognized by T cells transduced with TCR22894, we could exclude that our inability to detect cell surface expression of the RAC2P29L28-34/36-37 peptide was due to defects in the antigen presentation pathway. We repeated the experiments with TCR-engineered mouse T cells derived from TCR1xCD45.1xRag1-/- mouse splenocytes (expressing a monoclonal irrelevant TCR against SV40 large T) to monitor RAC2P29L28-34/36-37 peptide cell surface expression using mouse NIH-HHD cells expressing a chimeric HLA-A02:01 (HHD) molecule. As observed in Mel21a cells, the non-spliced RAC2P29L neoepitope (derived from RAC2P29L1-45 45mer as well as cDNA) was efficiently presented also by NIH-HHD cells, excluding potential differences in the catalytic properties of mouse and human proteasomes (Figure 5D). More importantly, again the spliced peptide-specific TCR20967A2 did not confer any reactivity to T cells upon co-culture without prior peptide loading of the target cells or overexpression of the spliced epitope (Figure 5B–D). Quantitative PCR analysis of the triple RAC2P29L1-45 45mer polypeptide minigene and RAC2P29L triple epitope spliced nonamer expressed in mouse NIH-HHD cells revealed that RAC2P29L1-45 45mer polypeptide minigene is expressed almost fivefold higher than the RAC2P29L triple epitope spliced nonamer (Figure 5E). These experiments do not categorically exclude any in vivo generation of the in vitro identified RAC2P29L-derived spliced peptide. However, they clearly demonstrate that even if the RAC2P29L28-34/36-37 neo-splicetope is derived from an overexpressed substrate protein, its amounts are negligible and insufficient to allow its recognition by T cells.
In summary, our results strongly question the idea that in vitro PCPS reaction simulates the in vivo situation with the same high fidelity as the in vitro generation of non-spliced epitopes and contradicts the previously highlighted idea that an algorithm-supported identification of in vitro-generated spliced epitopes is a suitable general approach for the facilitated identification of tumor-specific immune-relevant neo-splicetopes for consecutive TCR generation.
Discussion
Effective CD8+ T cell-induced immune responses depend on both the quality and the amount of proteasome-generated antigenic peptides available for presentation by HLA class I molecules to peptide-specific TCR at the cell surface (Niedermann et al., 1999; Princiotta et al., 2003). Not neglecting TCR affinity or the HLA class I binding affinity of an epitope, in each case the amount of a specific epitope generated by proteasomes from a given antigen has to rise above a certain threshold to elicit a relevant T cell response.
In addition to substrate amounts and protein turnover rates, epitope generation efficiency of both non-spliced and spliced epitopes is determined by the sequence of the epitope, its surrounding protein sequence, and connected with the cleavage site usage and cleavage strength of proteasomes (Mishto et al., 2014; Niedermann et al., 1995; Sijts and Kloetzel, 2011). Thus, even high-affinity peptides, if embedded in a non-favorable protein sequence, will not surpass the necessary threshold for eliciting a T cell response.
The cleavage properties of proteasomes have been intensively studied. However, due to the complexity of protein sequences and lacking information on cleavage strength that decisively determines epitope generation efficiency, algorithms predicting proteasomal generation of immune-relevant non-spliced epitopes still do not reach prediction efficiencies sufficient for large-scale ‘reverse immunology’ approaches (Calis et al., 2015; Di Carluccio et al., 2018; Singh and Mishra, 2016). Thus, many predicted epitopes may be false-positives, which could impede immunotherapy, for example, neoantigen vaccines. Prediction algorithms for spliced epitopes, which are based on protein sequence and proteasomal cleavage properties, do not even yet exist.
Therefore, in vitro generation of epitopes using purified 20S proteasomes and synthetic polypeptide substrates encompassing the epitope(s) of interest, in combination with mass spectrometric analyses and both in vitro and in vivo CD8+ T cell assays, still represents the most frequently used tool to validate the generation of immune-relevant non-spliced peptides, assuming that it closely simulates the in vivo (in cellulo) situation with respect to both quality and relative amounts of the epitope (Kessler et al., 2001; Kessler and Melief, 2007; Sijts and Kloetzel, 2011).
While a number of virus- or tumor-derived non-spliced epitopes have been validated by in vitro experiments and correlated to the in vivo situation, examples for spliced HLA class I epitopes are still very limited. Nevertheless, what applied to non-spliced epitopes also seemed to be valid for spliced HLA class I epitopes generated in vitro by PCPS. Thus, FGF-5, SP110, and several gp100-derived spliced epitopes that are recognized by CD8+ T cells on the cell surface were demonstrated to be produced also in in vitro PCPS assays (Ebstein et al., 2016; Vigneron et al., 2019). Although in these cases the generation efficiency of spliced epitopes in in vitro PCPS assays seemed to be in line with the in vivo situation, it should be noted, however, that the abundance of spliced epitopes presented at the cell surface is a matter of substantial controversy (Liepe et al., 2016; Liepe et al., 2019; Mylonas et al., 2018; Paes et al., 2019; Rolfs et al., 2019). Thus, in light of recent reports (Mylonas et al., 2018; Rolfs et al., 2019) the amount of cell surface-presented spliced epitope seems to be considerably less than initially estimated.
On the other hand and supporting a potential immune relevance, the initial discovery of the splicing event and spliced epitopes was based on the identification of patient-derived CD8+ T cells reactive towards spliced epitopes generated from tumor antigens (Hanada et al., 2004; Vigneron et al., 2004). Widespread identification of spliced epitopes is however limited by the rare availability of corresponding specific CD8+ T cells. Therefore, we developed prediction algorithms allowing the mass spectrometric identification of predicted and in vitro proteasome-generated spliced peptides. Indeed, applying such an algorithm-aided ‘reverse immunology’ approach successfully led previously to the identification of two spliced phospholipase PlcB epitopes that primed antigen-specific CD8+ T cells in L. monocytogenes-infected mice (Platteel et al., 2017).
Because somatic mutations in tumor antigens frequently do not result in the generation of neoepitopes suitable for generation of TCRs for ATT therapy, we applied spliced peptide prediction algorithms to identify neo-splicetopes with HLA-A*02:01 binding affinity predicted to be generated from the mutant tumor antigens KRASG12V and RAC2P29L and used those for TCR generation.
In vitro PCPS experiments in combination with MS analysis aiming at the identification of the algorithm-predicted putative KRASG12V-derived neo-splicetopes, however, gave no final evidence for their in vitro generation. In the initial kinetic splicing reactions, we seemed to have identified the predicted spliced peptides, thereby corroborating also data obtained with a KRASG12V2-35 polypeptide substrate reported by Mishto et al., 2019.
However, we found that most likely the extreme hydrophobicity of the KRAS amino acid composition had led to faulty polypeptide synthesis, resulting in polypeptide substrates mimicking in sequence the results of the predicted splicing reaction (Figure 3—figure supplement 5, Supplementary file 2). Considering that in general the generation of spliced peptides is significantly less efficient than that of non-spliced peptides (Figure 4A), even minor amounts of faulty peptide substrates will become prevalent in in vitro splicing reactions (see also Figure 3—figure supplement 5B). Because high-quality peptide synthesis reaches a purity of 95–99% at most, a thorough substrate analysis appears essential to avoid false-positive results in in vitro PCPS experiments, particularly for chemically difficult substrates. However, when we used a newly synthesized KRASG12V substrate not contaminated with peptides mimicking the KRASG12V5-8/9-14 and KRASG12V5-10/12-14 splicing reactions, we failed to identify the in silico-predicted neo-splicetope. This negative result cannot finally prove the non-existence of the KRASG12V5-8/9-14 and KRASG12V5-10/12-14, but being in line with the in vivo experiments, one has to conclude that if these KRASG12V-derived spliced epitopes are generated, then their amount is below detectable level. In contrast to the negative results obtained with respect to KRASG12V, the analysis of RAC2P29L led to the identification of the in silico-predicted RAC2P29L28-34/36-37 peptide in in vitro PCPS experiments. Nevertheless, the putative RAC2P29L28-34/36-37 neo-splicetope was generated significantly less efficient than the non-spliced RAC2P29L28-36 neoepitope.
Testing the in vivo generation of the spliced KRASG12V5-8/10-14 and RAC2P29L28-34/36-37 peptides using the respective peptide-specific TCRs, which were of high affinity and recognized as little as 10−10 M peptide, we obtained no T cell signal and no evidence for the immune relevance of the two candidate neo-splicetopes, independent of the experimental conditions. Our experiments provide no evidence that either KRASG12V5-8/10-14 or RAC2P29L28-34/36-37 are produced in vivo or presented at the cell surface. However, even in case both neo-splicetopes were generated in vivo, their generation efficiency and the total amount presented by HLA-A*02:01 molecules on the cell surface are too low to be of any immune significance. One possible explanation for the failure to verify the in vitro PCPS reaction for RAC2P29L28-34/36-37 in in vivo settings could be the high substrate and proteasome concentration as used for in vitro PCPS, thereby forcing splicing reactions that do either not or only inefficiently occur under in vivo conditions where only a single substrate protein enters the catalytic cavity of the proteasome for processing at a given time.
Thus, quite in contrast to the experience resulting from proteasome-dependent processing of non-spliced epitopes in vitro, in vitro generation of spliced epitopes by PCPS may not exhibit the same fidelity and does not always simulate the efficacy of in vivo spliced epitope generation. This of course strongly questions the general application of the recently highlighted experimental pipeline for the identification of cancer-specific neo-splicetopes (Mishto et al., 2019). Reconsidering the workflow for the identification of neo-splicetopes, it thus seems that in vitro PCPS even when combined with peptide binding and TAP transport assays are not sufficient for the prediction of their immune relevance. We therefore believe that it is mandatory to first prove the cell surface presentation of algorithm-predicted candidate neo-splicetopes, either in humanized mice under conditions requiring processing and presentation or by peptide elution experiments, before TCR generation. Our data also support the notion (Mylonas et al., 2018) that the frequency of spliced epitopes is largely overestimated.
Materials and methods
Peptides, proteasome, and PCPS
Request a detailed protocolThe polypeptides substrates KRASG12V2-14 (BIH 107) (TEYKLVVVGAVGV), KRASG12V2-21 (Kloe 1178) (TEYKLVVVGAVGVGKSALTI), KRASG12V2-32(TEYKLVVVGAVGVGKSALTIQLIQNHFVDEY), KRASG12V2-35(TEYKLVVVGAVGVGKSALTIQLIQNHFVDEYDPT), as well as the RAC2P29L20-44 (BIH 5) polypeptide substrate (LISYTTNAFLGEYIPTVFDNYSANV) were synthesized by the core facility of the Institute of Biochemistry (Dr. Petra Henklein) using standard Fmoc (N-(9-fluorenyl) methoxycarbonyl) methodology (0.1 mmol) on an Applied Biosystems 433A automated synthesizer. The peptide was purified by HPLC and analyzed by mass spectrometry (ABI Voyager DE PRO). The KRASG12V1-21 (MDC 27) (MTEYKLVVVGAVGVGKSALTI) polypeptide substrate was obtained from JPT Peptide Technologies (Berlin, Germany). 20S proteasomes were purified from human red blood cells, LcL or T2.7 cells in principle following the procedure as previously described (Textoris-Taube et al., 2019). Proteasome composition of LcL and T2.7 cells, which express immunoproteasomes, however, may vary dependent on batch and culture conditions. For kinetic experiments and better comparison, therefore only the results obtained with erythrocyte 20S proteasomes were used for the kinetic experiments. Proteasome digests of the synthetic RAC2P29L and KRASG12V polypeptides were performed in 100 μl of TEAD buffer (20 mM Tris, 1 mM EDTA, 1 mM NaN3, 1 mM DTT, pH 7.2) over time at 37°C. For establishing a cleavage map for RAC2P29L20-44, processing times were 24 hr and 48 hr. The RAC2P29L20-44 and KRASG12V1-21 synthetic polypeptide at a concentration of 60 μM was digested by 8 μg 20S proteasome. Proteasomal processing of the synthetic KRASG12V2-21 and KRASG12V2-14 polypeptides was performed at a substrate concentration of 40 μM or 60 μM and in the presence of 4 μg or 8 μg 20S proteasome, respectively. 10 µl digested sample was loaded for 5 min onto a trap column (PepMap C18, 5 mm × 300 μm × 5 μm, 100 Ǻ, Thermo Fisher Scientific) with 2:98 (v/v) acetonitrile/water containing 0.1% (v/v) trifluoroacetic acid (TFA) at a flow rate of 20 μl/min and analyzed by nanoscale LC-MS/MS using an Ultimate 3000 and LTQ Orbitrap XL mass spectrometer (Thermo Fisher Scientific). The system comprises a 75 µm i.d. ×250 mm nano LC column (Acclaim PepMap C18, 2 μm; 100 Å; Thermo Fisher Scientific) or a 200 mm PicoFrit analytical column (PepMap C18, 3 µm, 100 Å, 75 µm; New Objective). The mobile phase (A) is 0.1% (v/v) formic acid in water and (B) is 80:20 (v/v) acetonitrile/water containing 0.1% (v/v) formic acid. For elution, a gradient 3–45% B in 85 min with a flow rate of 300 nl/min was used. Full MS spectra (m/z 300–1800) were acquired on an Orbitrap instrument at a resolution of 60,000 (FWHM). At first, the most abundant precursor ion was selected for either data-dependent collision-induced dissociation (CID) fragmentation with parent list (1+, 2+ charge state included). Fragment ions were detected in an Ion Trap instrument. Dynamic exclusion was enabled with a repeat count of 2 and 60 s exclusion duration. Additionally, the theoretically calculated precursor ions of the expected spliced peptides were pre-elected for two Orbitrap CID (resolution 7500) and higher energy collisional dissociation (HCD) (resolution 15,000) fragmentation scans. The maximum ion accumulation time for MS scans was set to 200 ms and for MS/MS scans to 500 ms. Background ions at m/z 371.1000 and 445.1200 act as lock mass.
For LC-MS/MS runs using a Q Exactive Plus mass spectrometer coupled with an Ultimate 3000 RSLCnano (Thermo Fisher Scientific), samples were trapped as described above and then analyzed by the system that comprised a 250 mm nano LC column (Acclaim PepMap C18, 2 μm; 100 Å; 75 µm Thermo Fisher Scientific). A gradient of 3–40% B (alternatively 3–45% B) in 85 min was used for elution. The mobile phase (A) was 0.1% (v/v) formic acid in water and (B) 80% acetonitrile in water containing 0.1% (v/v) formic acid. The Q Exactive Plus instrument was operated in the data-dependent mode to automatically switch between full-scan MS and MS/MS acquisition. Full MS spectra (m/z 200–2000) were acquired at a resolution of 70,000 (FWHM) followed by HCD MS/MS fragmentation of the top 10 precursor ions (resolution 17,500, 1+, 2+, 3+, charge state included, isolation window of 1.6 m/z, normalized collision energy of 27%). The ion injection time for MS scans was set to maximum 50 ms, automatic gain control (AGCs) target value of 1 × 106 ions and for MS/MS scans to 100 ms, AGCs 5 × 104, dynamic exclusion was set to 20 s. Background ions at m/z 391.2843 and 445.1200 act as lock mass.
Peptides were identified by PD2.1 software (Thermo Fisher Scientific) based on their merged tandem mass spectra (MS/MS) of CID and HCD. For peptide identification, we set mass tolerances of either 10 ppm (for XL mass spectrometer) or 6 ppm (for Q Exactive mass spectrometer) on precursor masses and either 0.6 Da for fragment ions using Ion Trap or 0.06 Da using Orbitrap for fragmentation (for XL mass spectrometer) or 0.02 Da (for Q Exactive mass spectrometer).
In addition, for spliced peptides we compared the retention time and the merged MS/MS of CID and HCD with the fragmentation pattern of their synthetic counterparts. To identify spliced peptides, a fasta data file was generated with ProtAG for the KRASG12V and RAC2P29L polypeptide substrates and loaded onto PD2.1. The kinetics were analyzed with LC Quan 2.7. HLA-A*02:01 binding affinity of putative spliced epitopes was calculated by the netMHCpan 4.0 algorithm (Jurtz et al., 2017).
Cell lines
Request a detailed protocolT2 cells (ATCC: CRL-1992) were kept in RPMI supplemented with 10% fetal calf serum (FCS). The viral producer cell line HEK-GALV (HEK-293 cells stably expressing GALV-env and MLV-gag/pol) was cultured in DMEM supplemented with 10% FCS. Human peripheral blood mononuclear cells (hPBMCs) and Epstein–Barr virus–transformed lymphoblastoid B cell lines (B-LCLs; Obenaus et al., 2015) were cultured in RPMI 1640 supplemented with 10% FCS, 50 μM 2-mercaptoethanol, 1 mM sodium pyruvate, and non-essential amino acids. The human cell lines with KRASG12V mutation were obtained from ATTC (AsPC-1 [ATCC: CRL-1682]; Capan-1 [ATCC: HTB-79]; CFPAC-1 [ATCC: CRL-1918]; NCI-H441 [ATCC: HTB-174]; NCI-H727 [ATCC: CRL-5815]; Panc 03.27 [ATCC: CRL-2549]; SW480 [ATCC: CRL-228]; SW620 [ATCC: CRL-227]) or Sigma-Aldrich (Colo668). The human cell lines carrying wildtype KRAS gene were MCF-7 (Sigma-Aldrich) and 624-Mel (RRID:CVCL_8054). K562 cells expressing HLA-A02:01, HLA-C07:01, and HLA-C07:02 molecules were obtained after transduction of K562 cells with retroviral vectors MP71 encoding the respective HLA molecules. The human melanoma cell line UKRV-Mel-21a (referred to hereafter as Mel21a) and the mouse cell line NIH-HHD have been described in Sun et al., 2002 and Popovic et al., 2011, respectively. Cancer cell lines were kept in RPMI (Gibco) supplemented with 10% FCS (Pan Biotech), 1 mM L-glutamine, 1 mM sodium pyruvate, and non-essential amino acids. For HLA-A02:01 negative cell lines AsPC-1, Capan-1, NCI-H727, and Colo668 HLA-A02:01 expression was achieved by transient transfection with plasmid pMP71-A2 (retroviral vector encoding HLA-A*0201). All human tumor cell lines were authenticated by sequencing for the presence of the mutant or wildtype KRASG12V configuration (Chang et al., 2013), and all experiments were performed with mycoplasma‐free cells.
Generation of neo-splicetope-specific T cells in ABabDII mice
Request a detailed protocolFor immunization (priming and successive boosts), ABabDII mice were injected subcutaneously with 100 µg of peptide (KLVVVGAVGV [KRASG12V-lin], KLVVGAVGV [KRASG12V-sp1], KLVVVAVGV [KRASG12V-sp2], YLVVVGAVGV [KRASG12V-sp3], KLVVVGVGV [KRASG12V-sp4], FLGEYIPVF [RAC2P29L spliced], JPT) in a 200 µl 1:1 solution of incomplete Freund’s adjuvant and PBS supplemented with 50 µg CpG. Repetitive immunizations were performed with the same mixture at least 3 weeks apart. KRAS/RAC2-specific CD8+ T cells in the peripheral blood of immunized animals were assessed by in vitro peptide restimulation and subsequent intracellular cytokine staining (IFNγ) 7 days after each boost.
Isolation and cloning of KRASG12V-sp1, KRASG12V-sp2-specific, and RAC2P29L splice-specific TCRs
Request a detailed protocolSplenocytes and lymphocytes from inguinal lymph nodes were prepared from responding animals at day 8 after the last boost. For in vitro culture, CD4+ T cells were depleted by CD4 microbeads (Miltenyi Biotech, Bergisch Gladbach, Germany) and 1 × 106 splenocytes were seeded per well of a 24-well plate and expanded for 10 days in RPMI 1640 medium supplemented with 10% FBS gold, HEPES, NEAA, sodium pyruvate, 50 µM β-mercaptoethanol, 20 IU/ml human IL-2, and 10−8 M sp1, sp2 or RAC2 peptide, respectively. Splenocytes were stimulated with 10−6 M peptide for 4 hr before mouse IFNγ secretion assay (Miltenyi Biotech). The cells were treated with Fc Block, stained with antibodies against mouse CD3-APC and mouse CD8-PerPC (BD Biosciences, San Jose, CA, USA). IFNγ secreting CD8+ T cells were sorted with BD FACS Aria III (BD Biosciences) into RTL lysis buffer for RNA isolation with RNeasy Micro Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. First-strand cDNA synthesis and 5′-RACE PCR were carried out using SMARTer RACE cDNA Amplification Kit (Clontech Laboratories) according to the manufacturer’s instructions. In particular, subsequent TCR-specific amplification was carried out with 1–2 µl of the reverse transcription reaction, 1 U Phusion HotStart II polymerase (Thermo Scientific), 0.1 µM of either hTRAC (5′-cggccactttcaggaggaggattcggaac-3′) or hTRBC (5′-ccgtagaactggacttgacagcggaagtgg-3′) primers and 0.1 µM 5′ primer (5′-ctaatacgactcactatagggcaagcagtggtatcaacgcagagt-3′). The amplified TCR genes were analyzed on an agarose gel and specific bands were cut out and cloned using a Zero Blunt TOPO PCR cloning kit (Life Technologies). Plasmids from individual clones were isolated and sequenced using a T3 primer (5′--3′) at Eurofins Genomics. Dominant TCR-α/β chains were selected and paired as follows: 1376 TCR (TRAV5*01 – CAESTDSWGKLQF – TRAJ24*02, TRBV4-1*01 – CASSQDLAGYEQYF – TRBJ2-7*01), 9383B2 TCR (TRAV17*01 – CATDEDTGNQFYF – TRAJ49*01, TRBV12-3*01 – CASSLWGYEQYF – TRBD1*01 – TRBJ2-7*01), 9383B14 TCR (TRAV17*01 – CATDEDTGNQFYF – TRAJ49*01, TRBV12-3*01 – CASSLVGYEQYF – TRBD1*01 – TRBJ2-7*01), and 20967A2 TCR (TRAV20*02 – CAVQAPDSGNTPLVF – TRAJ29*01, TRBV2*01 – CASSDRGAYNEQFF – TRBD1*01 – TRBJ2-1*01). The TCR constant regions were replaced with mouse counterparts. Paired TCR-α/β chains were linked with a P2A element. TCR cassette was codon optimized, synthesized by GeneArt (Thermo Fisher Scientific, Waltham, MA, USA) and cloned into pMP71 by NotI/EcoRI restriction site cloning.
TCR gene transfer
Request a detailed protocolTCR gene transfer was carried out as described before (Niedermann et al., 1995). In brief, packaging cell line HEK-293-GALV (amphotropic) or Plat-E (ecotropic) were grown to approximately 80% confluence and transfected with pMP71 vector carrying the TCR cassette using Lipofectamine2000 (Life Technologies), and retrovirus-containing supernatant was harvested 48 hr and 72 hr after transfection.
Human PBMCs were isolated from healthy donors by Ficoll gradient centrifugation. 1 × 106 freshly isolated or frozen hPBMCs were stimulated with 5 µg/ml anti-CD3 (OKT3) and 1 µg/ml anti-CD28 (CD28.2) (BioLegend)-coated plates in the presence of 300 U/ml recombinant human interleukin 2 (hIL-2, Peprotech). Transductions were performed 48 hr and 72 hr after stimulation by addition of retrovirus-containing supernatant and 4 µg/ml protamine sulfate followed by spinoculation. Transduced T cells were kept in the presence of 300 U/ml hIL-2 for a total of 2 weeks followed by at least 2 days of culture in the presence of 30 U/ml hIL-2, before they were used for experiments.
For transduction of mouse T cells, spleen cells were isolated from TCR1xCD45.1xRag1-/- mice (expressing a monoclonal-irrelevant TCR against SV40 large T), erythrocytes were lysed, and cells were stimulated in medium (RPMI 1640, 10% FCS, 100 IU/ml penicillin-streptomycin, 1 mM sodium pyruvate, 1× non-essential amino acids, 50 μM 2-mercaptoethanol) supplemented with 1 μg/ml anti-mouse CD3, 0.1 μg/ml anti-mouse CD28 antibodies (both BD Biosciences (BD), Franklin Lakes, NJ, USA), and 10 IU/ml human IL-2 (Proleukin S, Novartis, Basel, Switzerland) at a concentration of 2 × 106/ml. 1 × 106 cells were transduced twice by spinoculation in the presence of 10 IU/ml IL-2 and 4 × 105 mouse T-Activator beads (Life Technologies). T cells were expanded in medium (+50 ng/ml IL-15; Miltenyi Biotec) for 10 days before co-culture.
Functional assays
Request a detailed protocolIFNγ production was measured by ELISA after 16 hr co-culture of 1 × 104 TCR-positive T cells with 1 × 104 target cells (human/mouse tumor cell lines or peptide-loaded T2 cells). Stimulation with phorbol myristate acetate (PMA) and ionomycin was used as a positive control. All samples were measured in duplicate.
Flow cytometry
Request a detailed protocolThe following conjugated antibodies were obtained from BioLegend: anti-hCD3 (HIT3α), anti-hCD8 (HIT8α), anti-hHLA-A2 (BB7.2), anti-hHLA-ABC (W6/32), anti-mCD3 (145-2 C11), anti-mCD8 (53-6.7), anti-mIFNγ (XMG1.2), and anti-mTCR-β (H57-597). Samples were analyzed using MACSQuant (Miltenyi) or FACSCalibur (BD Biosciences). Data analysis was performed using FlowJo (Treestar).
Quantitative PCR (qPCR)
Request a detailed protocolRNA was isolated from 1 × 106 NIH-HHD cells using the NucleoSpin TriPrep (MACHEREY-NAGEL). After RNA quality and integrity were verified, 2.8 μg of total RNA were used as template for cDNA synthesis with random hexamers, using SuperScript III Reverse Transcriptase (Invitrogen). Samples were diluted 1:5 and 5 μl used as template in a 20 μl qPCR reaction using SYBR Green PCR Master Mix (Thermo), with 500 nM primer concentration. Primer sequences: GFP: F:5′-acgacggcaactacaagacc-3′, R:5′-tgaagtcgatgcccttcag-3′; Gapdh: F:5′-tggagaaacctgccaagtatg-3′, R:5′-gttgaagtcgcaggagacaac-3′. Samples were run on the QuantStudio 3 Real-Time PCR system (Thermo Fisher) and analyzed according to the comparative ΔΔCt method.
ProtAG prediction algorithm
The ProtAG prediction algorithm was used in combination with mass spectroscopy to identify peptides and spliced peptides derived from an oligomeric protein substrate. The peaks of the MS intensity profile were approximated by Gaussian functions. Goodness of fit was used as one criterion for assessing the reliability of a mass peak. Only peaks above a user-defined noise threshold were compiled together with their HPLC retention times. The likelihood for correctly assigning a peptide to an MS peak was scored by the correspondence between computational and experimental values for peptide mass, occurrence of expected m/z values, similarity of retention times, and tandem MS/MS data. Chemically modified peptides (e.g., by oxidation) were identified by adding to the theoretical mass the masses of possible modifiers. Such modified peptides were included into the list of identified peptides only if the non-modified peptide could also be identified. After assigning the MS peaks to all direct or chemically modified fragments that theoretically can be derived from the protein substrate by one or multiple cleavages, a group of significant but ‘unexplained’ MS peaks remains, which may represent possible spliced products, that is, peptides composed of fragments distant in the parental protein substrate. The likelihood for correctly assigning a splice peptide to unexplained MS peaks was computed in the same way as for conventional peptides, including the additional criterion, that the two fragments merged together in the presumed splice peptide were also present in the set of identified conventional peptides.
Proteasomal cleavage products (PCP) of a substrate peptide can clearly be described by the numbers of the first and last amino acid within the substrate: P(i,j) denominates the peptide of length j-i +1 starting with amino acid i and ending with amino acid j. Proteasomal splice products (PSP) consist of two such fragments, therefore denominated by SP(i,j,k,l), consisting of the peptides P(i,j) and P(k,l) and having a length of (j + l-i-k + 2). They can be in normal order (i < j < k < l), inverse order (k < l < i < j), or overlapping. Overlapping splice product means that there exists a position m that is both part of P(i,j) and P(k,l), therefore max(i,j) <= min(j,l), meaning that the splice product consists of parts of two substrate peptides.
Splice peptides in normal order with j + 1 = k are identical to the original PCP P(i,l) and should therefore be excluded. Splice peptides can have the same sequence like PCPs, for example, if the sequence of the substrate is redundant, or the length of one of the parts is short – such peptides can be excluded from the database. Splice peptides can have the same sequence, so the splice peptides SP(i,j,k,l) and SP(i,j + 1,k + 1,l) have the same sequence if the amino acid in position j + 1 and in position k are the same. Nevertheless, both versions should be kept within the database because if the original peptides P(i,j) and P(k,l) are found within the cleavage products, and P(i,j + 1) not, the first version of the PSP is more likely.
When searching for splice products that are at the same time epitopes for MHC class I or MHC class II, the length of the predicted splice products should be limited. Therefore, according limits are included into the algorithm. Also, the database of spliced products can become very large if you try to evaluate all possible splice products of a large substrate without limits. The number of all possible splice products of a substrate of length 100 consists of about 25 million peptides, and results into a database of nearly 3 GB, so you can predict the size of the database without evaluating it, and avoid the evaluation.
The ProtAG program evaluates a database of splice products in fasta format according different parameters:
Sequence of the substrate of length L(sub).
Minimal/maximal length of the parts of the splice peptides (MinP,MaxP).
Minimal/maximal length of the gap between the parts of the splice peptide (only used if you evaluate splice peptides in normal or inverse order) (MinG,MaxG).
Minimal/maximal length of the whole splice peptide (MinS,MaxS).
Do you want to include PCPs into the database (recommended)?
Do you want to exclude PSPs with sequences identical to PCPs (recommended)?
Do you want to evaluate only PSPs in normal order, or PSPs in normal or inverse order (coming from the same substrate), or all PSPs including PSPs from different substrates?
Algorithm
Request a detailed protocolAlgorithm for the evaluation of PSPs in normal or inverse order – ignoring MaxG for simplification:
– evaluate all possible lengths of splice products actL with MinS<=actL<=MaxS
– evaluate possible lengths for the parts: MinP<=actL1<=min(actL-MinP,MaxP), actL2=actL-actL1
– evaluate starting points for the first fragment of the splice peptide: 1<=Start1<=L(sub)-actL-MinG+1
endpoint for the first fragment will be End1=Start1+actL1-1
– evaluate starting points for the second fragment: End1+1+MinG<=Start2<=L(sub)-actL2+1
endpoint for the second fragment will be End2=Start2+actL2-1
– evaluate the normal PSP SP(Start1,End1,Start2,End2) if Start2 >End1+1
check if the sequence of the PSP is a sub sequence of the substrate
– if not: write out the PSP in fasta format with the name line containing information to the positions
Start1, End1, Start2, End2, mass, m/z values for z=1,2,3
– if inverse PSPs should be included: evaluate the inverse PSP SP (Start2, End2, Start1, End1)
– check if the sequence of the PSP is a sub sequence of the substrate
if not: write out the PSP in fasta file format
The ProtAG algorithm, together with an instruction sheet is available on Dryad.
Data availability
Additional source data comprising databases for ProteomDiscoverer, KRAS/RAC2 kinetics, cleavage maps and PD2.1 result files have been submitted to Dryad under https://doi.org/10.5061/dryad.jq2bvq88b. The ProtAG algorithm and an instruction sheet are also available on Dryad.
-
Dryad Digital RepositoryWillimsky_et_al_11-08-2020-RA-eLife-62019_Supplementary data files.https://doi.org/10.5061/dryad.jq2bvq88b
References
-
Targeting cancer-specific mutations by T cell receptor gene therapyCurrent Opinion in Immunology 33:112–119.https://doi.org/10.1016/j.coi.2015.02.005
-
PatentT cell receptors for tumor specific proteasome splice variants and uses thereof ( WO/2019/129892)CHARITÉ - UNIVERSITÄTSMEDIZIN. Germany.
-
Interferon gamma stimulation modulates the proteolytic activity and cleavage site preference of 20S mouse proteasomesJournal of Experimental Medicine 179:901–909.https://doi.org/10.1084/jem.179.3.901
-
Perpetual complexity: predicting human CD8+ T-cell responses to pathogenic peptidesImmunology and Cell Biology 96:358–369.https://doi.org/10.1111/imcb.12019
-
Analysis of the processing of seven human tumor antigens by intermediate proteasomesThe Journal of Immunology 189:3538–3547.https://doi.org/10.4049/jimmunol.1103213
-
Transgenic mice with a diverse human T cell antigen receptor repertoireNature Medicine 16:1029–1034.https://doi.org/10.1038/nm.2197
-
The 20S proteasome splicing activity discovered by SpliceMetPLOS Computational Biology 6:e1000830.https://doi.org/10.1371/journal.pcbi.1000830
-
Mapping the MHC class I-Spliced immunopeptidome of Cancer cellsCancer Immunology Research 7:62–76.https://doi.org/10.1158/2326-6066.CIR-18-0424
-
Driving forces of proteasome-catalyzed peptide splicing in yeast and humansMolecular & Cellular Proteomics 11:1008–1023.https://doi.org/10.1074/mcp.M112.020164
-
Proteasome isoforms exhibit only quantitative differences in cleavage and epitope generationEuropean Journal of Immunology 44:3508–3521.https://doi.org/10.1002/eji.201444902
-
Estimating the Contribution of Proteasomal Spliced Peptides to the HLA-I Ligandome*Molecular & Cellular Proteomics 17:2347–2357.https://doi.org/10.1074/mcp.RA118.000877
-
Recognition of tyrosinase by tumor-infiltrating lymphocytes from a patient responding to immunotherapyCancer Research 54:3124–3126.
-
The role of the proteasome in the generation of MHC class I ligands and immune responsesCellular and Molecular Life Sciences 68:1491–1502.https://doi.org/10.1007/s00018-011-0657-y
-
Expression of the proteasome activator PA28 rescues the presentation of a cytotoxic T lymphocyte epitope on melanoma cellsCancer Research 62:2875–2882.
-
Analysis of Proteasome-Generated antigenic peptides by mass spectrometryMethods in Molecular Biology 1988:15–29.https://doi.org/10.1007/978-1-4939-9450-2_2
-
Processing and presentation of tumor antigens and vaccination strategiesCurrent Opinion in Immunology 18:98–104.https://doi.org/10.1016/j.coi.2005.11.013
-
Production of spliced peptides by the proteasomeMolecular Immunology 113:93–102.https://doi.org/10.1016/j.molimm.2018.03.030
-
SoftwareWillimsky_et_al_11-08-2020-RA eLife-62019_Supplementary data filesDryad Digital Repository.
Article and author information
Author details
Funding
German Research Foundation (SFB-TR36)
- Gerald Willimsky
- Thomas Blankenstein
Deutsche Krebshilfe (111546)
- Gerald Willimsky
Einstein Stiftung Berlin (A-2013-174)
- Peter M Kloetzel
German Cancer Research Center (NEO-ATT)
- Gerald Willimsky
Berliner Krebsgesellschaft
- Peter M Kloetzel
Helmholtz Association (ZT-0027)
- Thomas Blankenstein
- Gerald Willimsky
Berlin Institute of Health (CRG-1)
- Thomas Blankenstein
- Peter M Kloetzel
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank Sabrina Horn, Kathrin Borgwald, and Mathias Pippow for excellent technical support. We acknowledge the participation of Michele Mishto (postdoc in CRG-1) and Juliane Liepe (Imperial College, London) in the initial splicing experiments. This work was supported by grants from the German Research Foundation (SFB-TR36), the Berlin Institute of Health (CRG-1), Einstein Stiftung (A-2013-174), Berliner Krebsgesellschaft, Deutsche Krebshilfe (111546), DKTK joint funding (NEO-ATT), and the Helmholtz-Gemeinschaft, Zukunftsthema ‘Immunology and Inflammation’ (ZT-0027).
Ethics
Animal experimentation: All animal experiments were performed according to institutional and national guidelines and regulations. The experiments were approved by the governmental authority (Landesamt für Gesundheit und Soziales, Berlin, H0086/16).
Copyright
© 2021, Willimsky et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,956
- views
-
- 268
- downloads
-
- 13
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Biochemistry and Chemical Biology
The conformational ensemble and function of intrinsically disordered proteins (IDPs) are sensitive to their solution environment. The inherent malleability of disordered proteins, combined with the exposure of their residues, accounts for this sensitivity. One context in which IDPs play important roles that are concomitant with massive changes to the intracellular environment is during desiccation (extreme drying). The ability of organisms to survive desiccation has long been linked to the accumulation of high levels of cosolutes such as trehalose or sucrose as well as the enrichment of IDPs, such as late embryogenesis abundant (LEA) proteins or cytoplasmic abundant heat-soluble (CAHS) proteins. Despite knowing that IDPs play important roles and are co-enriched alongside endogenous, species-specific cosolutes during desiccation, little is known mechanistically about how IDP-cosolute interactions influence desiccation tolerance. Here, we test the notion that the protective function of desiccation-related IDPs is enhanced through conformational changes induced by endogenous cosolutes. We find that desiccation-related IDPs derived from four different organisms spanning two LEA protein families and the CAHS protein family synergize best with endogenous cosolutes during drying to promote desiccation protection. Yet the structural parameters of protective IDPs do not correlate with synergy for either CAHS or LEA proteins. We further demonstrate that for CAHS, but not LEA proteins, synergy is related to self-assembly and the formation of a gel. Our results suggest that functional synergy between IDPs and endogenous cosolutes is a convergent desiccation protection strategy seen among different IDP families and organisms, yet the mechanisms underlying this synergy differ between IDP families.
-
- Biochemistry and Chemical Biology
- Structural Biology and Molecular Biophysics
Dynamic conformational and structural changes in proteins and protein complexes play a central and ubiquitous role in the regulation of protein function, yet it is very challenging to study these changes, especially for large protein complexes, under physiological conditions. Here, we introduce a novel isobaric crosslinker, Qlinker, for studying conformational and structural changes in proteins and protein complexes using quantitative crosslinking mass spectrometry. Qlinkers are small and simple, amine-reactive molecules with an optimal extended distance of ~10 Å, which use MS2 reporter ions for relative quantification of Qlinker-modified peptides derived from different samples. We synthesized the 2-plex Q2linker and showed that the Q2linker can provide quantitative crosslinking data that pinpoints key conformational and structural changes in biosensors, binary and ternary complexes composed of the general transcription factors TBP, TFIIA, and TFIIB, and RNA polymerase II complexes.