The genetic landscape of a physical interaction

Abstract
eLife digest
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

A key question in human genetics and evolutionary biology is how mutations in different genes combine to alter phenotypes. Efforts to systematically map genetic interactions have mostly made use of gene deletions. However, most genetic variation consists of point mutations of diverse and difficult to predict effects. Here, by developing a new sequencing-based protein interaction assay – deepPCA – we quantified the effects of >120,000 pairs of point mutations on the formation of the AP-1 transcription factor complex between the products of the FOS and JUN proto-oncogenes. Genetic interactions are abundant both in cis (within one protein) and trans (between the two molecules) and consist of two classes – interactions driven by thermodynamics that can be predicted using a three-parameter global model, and structural interactions between proximally located residues. These results reveal how physical interactions generate quantitatively predictable genetic interactions.

https://doi.org/10.7554/eLife.32472.001

eLife digest

Proteins, the molecular workhorses of the cell, are made of small units called amino acids attached together like the links of a chain. Each protein is composed of a unique combination of amino acids, which is determined by a specific sequence of DNA called a gene. A change in a gene – a mutation – can create a variation in the protein it codes for, for instance by swapping a type of amino acid for another. Different mutations in the same gene can alter a protein in different ways. Some of these changes are harmless, but other can hinder how the protein performs its role. For example, a small change in the structure of a protein could affect how it will bind to other molecules.

It is possible for people to have identical mutations in the same genes, but experience different consequences. For instance, two persons could carry the same disease-inducing mutation, but one has a severe version of the condition and the other only mild symptoms. One reason is that changes in other genes cancel out or enhance the effect of a mutation. This phenomenon is known as a genetic interaction and it remains poorly understood, especially at the molecular level.

Here, Diss and Lehner developed a method, called deepPCA, to study the consequences of mutations in proteins in the laboratory. The experiments focused on two human genes which code for two proteins that normally attach to each other. Two mutations were artificially created, either one in each gene, or two in one of them. Diss and Lehner then examined how strongly the two mutated proteins could still attach to each other. By repeating this process with over 120,000 different pairs of mutations, it became possible to study how one mutation can have different effects depending on the presence of other mutations in the same protein or in the binding partner.

Overall, Diss and Lehner found that genetic interactions are the result of two mechanisms. In the first one, the two mutations together cause specific structural changes that modify how proteins bind to each other. In the second one, the changes solely depend on the magnitude of the initial, thermodynamic effects of individual mutations, but not on their specific physical and chemical properties. To predict the consequences of this second type of genetic interactions, knowing the identity or the exact effects of the two mutations is not necessary.

Understanding and predicting genetic interactions is important to develop personalized medicine, where treatments are tailored based on the genetic make up of an individual. This knowledge will also help to study how genes have evolved together.

https://doi.org/10.7554/eLife.32472.002

Introduction

Mutations often have outcomes that change depending upon additional genetic variation carried by an individual, making their effects difficult to predict (Lehner, 2011). The unexpected outcomes obtained when two or more mutations are combined are referred to as genetic interactions or epistasis (Phillips, 2008).

One approach that has been taken to better understand how mutations interact to alter phenotypes has been to systematically combine together gene deletions or representative hypomorphic alleles (Baryshnikova et al., 2013). In budding yeast, this has been undertaken on a genomic scale, with the resulting network of interactions referred to as the ‘genetic landscape’ of a cell (Costanzo et al., 2010, 2016; Tong et al., 2004).

However, gene deletions are rare in nature – most genetic variation consists of point mutations not deletions or null alleles. Point mutations can have very diverse and difficult to predict effects (Shendure and Akey, 2015). These range from no consequence, through partial loss-of-function, to very strong effects or the creation of new functions. To date, however, there has been no systematic effort to map how point mutations in two genes combine together to alter biological functions.

Protein-protein interactions (PPIs) represent the backbone of a cell’s functional organization. Mutations affecting PPIs lead to disease, to functional innovations, and hence are subject to selection (Diss et al., 2013). It has long been appreciated that pairs of mutations in two physically interacting proteins can have non-additive outcomes (Horovitz, 1996; Lehner, 2011). However, to date, the effects of mutations on PPIs have only been quantified for deep mutant libraries of one protein in combination with a small number of targeted mutants in a physical interaction partner (Aakre et al., 2015; Araya et al., 2012; Raman et al., 2016). A thorough understanding of the patterns of mutation outcome between interacting proteins requires a non-biased, systematic mutagenesis of both interacting proteins.

Here, we present a high-throughput technique based on the protein fragment complementation assay (Tarassov et al., 2008; Diss et al., 2017) (PCA) called deepPCA that quantifies how mutations of diverse individual effect combine to alter protein interactions. We used the assay to systematically and comprehensively determine the effects of combinations of mutations in the proto-oncogenes FOS and JUN on the formation of the AP-1 transcription factor complex (Shaulian and Karin, 2002). Fos and Jun interact through their leucine zipper domains that consist of five heptad repeats (Figure 1A); this interaction has been previously extensively investigated (Mason et al., 2006; Ransone et al., 1989). We first quantified the consequences of combining thousands of pairs of mutations in trans between the two proteins. We then compared these results to the effects of thousands of pairs of mutations in cis within one of the proteins (Fos).

Figure 1 with 1 supplement see all

Download asset Open asset

*deepPCA*.

(A) Leucine zipper domains (colored) and heptad positions of human Fos and Jun. (B) Overview of the assay. Single amino acid variants of Fos and Jun were constructed by overlap-extension PCR using NNS primers and cloned in a head-tail orientation. In PCA, interacting proteins expressed in yeast lead to complementation between their fused DHFR fragments, which is resistant to methotrexate and produces tetrahydrofolate (THF) from dihydrofolate (DHF) to promote growth. Paired-end deep sequencing then allows the frequencies of each variant in the input and output populations to be measured and a PPI score that represents the number of generation of each variant relative to the wild-type interaction to be computed. (C) Scatter plot of PPI scores between biological replicates 1 and 2. (D) Confirmation of single mutants by individual PCA growth assays. Single mutants were reconstructed, sequence confirmed and their PPI scores were derived from their growth curves measured in a plate reader (see Materials and methods). Error bars represent 95% confidence intervals.

https://doi.org/10.7554/eLife.32472.003

The resulting dataset presents a global view of how hundreds of mutations of diverse individual effects in different genes combine to alter a biological function through two major mechanisms related to the thermodynamics of a PPI and the structural interactions between proximal residues.

Results

Quantifying thousands of protein interactions in parallel using deepPCA

To quantify how mutations of diverse individual effect combine to alter protein interactions, we developed deepPCA, a protein-protein interaction assay that uses PCA and deep sequencing to quantify thousands of protein-protein interactions in parallel in a single assay (Figure 1B, see Materials and methods). deepPCA uses deep sequencing to quantify the effects on a PPI of thousands of combinations of point mutations within one or both physically interacting proteins. The method is inspired by deep mutation scanning experiments on individual proteins (Fowler et al., 2010; Fowler and Fields, 2014) and uses physical linkage on a plasmid to read out the frequency of each pair of mutations after a competitive selection for growth dependent on the physical interaction between two proteins (Figure 1B, see Materials and methods). Briefly, the two proteins of interest are fused to complementary halves of a methotrexate-resistant variant of murine dihydrofolate reductase and expressed in yeast. If the two proteins interact, the two fragments complement each other and reform an active enzyme, allowing growth in the presence of methotrexate. PCA is highly quantitative because the growth rate is correlated to the abundance of the complementation complex (Freschi et al., 2013; Levy et al., 2014; Schlecht et al., 2012) so cells expressing strongly interacting variants of the two proteins will hence grow faster and be enriched in the population while cells expressing weakly interacting variants and variants that don’t interact will be depleted. These changes in frequency between the pre- and post-selection populations (input and output, respectively) are then quantified by paired-end deep sequencing. The final PPI score quantifies the strength of interaction relative to the wild-type protein (Figure 1B).

We used deepPCA to quantify the effects of systematically mutating the leucine zipper domains of FOS and JUN. We obtained reliable (input reads > 10 and output reads > 0; Figure 1—figure supplement 1A,B; see Materials and methods) measurements for 607 and 608 of the 608 (32 positions x 19 substitutions) possible single amino acid (aa) changes within the targeted regions of Fos and Jun, respectively. PPI scores measured by deepPCA are highly reproducible between biological replicates (mean Pearson correlation R = 0.95 between the three pairs of replicates, n = 108,840 mutation combinations, Figure 1C, Figure 1—figure supplement 1C and Supplementary file 1) and also with mutation effects tested individually (R = 0.95 for 14 variants chosen randomly, Figure 1D).

The PPI scores for single amino acid changes in both proteins show a bimodal distribution (Figure 2—figure supplement 1A), with ~20% and 15% of substitutions severely detrimental for the interaction and significantly different from the wild-type (PPI score ≤ 0.64, FDR < 0.05, one sample t-test against a mean of 1; Figure 2—figure supplement 1B). However, the individual substitutions altered the interaction across the entire dynamic range, with 25 and 10 aa changes in each protein strengthening the interaction (PPI score > 1.04, FDR < 0.05, average SEM of these 35 variants = 0.0054; Figure 2—figure supplement 1C).

Determinants of single mutant outcome

Mutations in the hydrophobic core of the interaction interface (heptad positions a and d) are most detrimental, followed by mutations at salt-bridge positions (positions e and g, Figure 2A–B). Mutations in the hydrophilic far side of the zipper (positions b, c and f) were generally of small effect (Figure 2A–C). Changes in the physico-chemical properties of the amino acids (hydrophobicity, charge, α-helical stability etc, see Supplementary file 2) provide good prediction of the mutation effects (percentage of variance explained from 35% to 98% across Fos and Jun positions), with properties related to α-helical stability most informative for predicting single mutation outcomes (Figure 2—figure supplement 1D and Supplementary file 2).

Figure 2 with 1 supplement see all

Download asset Open asset

Effects of single mutants.

(A) Heatmap of single mutant PPI scores averaged between the three replicates. Letters inside the heatmap represent the wild-type amino acid. White represents missing data. (B) Distribution of PPI scores per position types. p-Values from Welch t-test. (C) Average PPI score per position overlaid on the crystal structure (pdb: 1fos). Black and gray represent positions not mutated in Fos and Jun, respectively. (D) Scatter plot between PPI scores of corresponding single mutations at the same positions in Fos and Jun. Error bars represent 95% confidence intervals. (E) Scatter plot between average PPI score per corresponding positions in Fos and Jun. The number represents the heptad and the letter the position inside the heptad.

https://doi.org/10.7554/eLife.32472.005

Identical substitutions in the same positions in Fos and Jun often had similar effects. For instance, mutations in one protein that disrupted the interaction (PPI scores < 0.64) were also very likely to disrupt the interaction when made in the other protein (odds ratio = 31.6, p<2.2×10⁻¹⁶, Fisher’s exact test; Figure 2D). Near neutral or strengthening mutations (PPI scores > 0.96) in one protein were also more likely to have a similar effect in the other one (odds ratio = 7.3, p < 2.2×10⁻¹⁶, Fisher’s exact test). However, a substantial number of substitutions had effects that differed between the two proteins (n = 381 out of 581, FDR < 0.05, paired t-test between the three replicate measurements in Fos and Jun), underlining the importance the structural context in which they occur. These mutations are enriched in intermediate effects in one or both proteins (odds ratio = 8.1, p < 2.2×10⁻¹⁶, Fisher’s exact test). The average PPI score per position was also generally conserved between the two proteins, but revealed positions asymmetrically involved in the interaction such as the salt bridge positions (Figure 2E).

trans genetic interactions between mutations in Fos and Jun

Considering pairs of substitutions in the two proteins, we obtained data for 107,625 of the 369,664 possible double mutants (input read count above 10 and output read count above 0, Supplementary file 1, see Materials and methods). The double mutant PPI scores also show a bimodal distribution, but with proportionally more severely detrimental (~26%) and fewer near-neutral outcomes (~21%) than for the single mutants (Figure 3A).

Figure 3 with 2 supplements see all

Download asset Open asset

A thermodynamic model predicts double mutant outcomes.

(A) Distribution of double mutants PPI scores compared to Fos and Jun single mutants. (B) Observed double mutants PPI scores against the scores predicted by a multiplicative model. (C) Pie chart array of genetic interaction score bins by Fos (*x-axis*) and Jun (*y-axis*) single mutant PPI score bins for genetic interactions calculated from the multiplicative model. (D) Fitted thermodynamic model (see Materials and methods). Red arrows illustrate how the sigmoidal function can lead to a different prediction than the multiplicative model. The three fitted free parameters are shown (A_T/AB_wt and B_T/AB_wt represent the total concentration of the two proteins relative to the concentration of the wild-type complex and b/AB_wt represents the background growth relative to the concentration of the wild-type complex, see Materials and methods). ΔΔGs1, ΔΔGs2 and ΔΔGd represent the change in free energy relative to the wild-type complex of the two single mutants and the double mutant, repsectively. (E) 3D scatter plot of double mutants PPI scores (*z-axis*) as a function of the corresponding Fos (*x-axis*) and Jun (*y-axis*) single mutants PPI scores with the fitted surface from the thermodynamic model. Dot color represents genetic interaction scores according to the color scale in (C). (F) Pie chart array of genetic interaction score bins by Fos (*x-axis*) and Jun (*y-axis*) single mutants PPI score bins for genetic interactions calculated from the thermodynamic model. (G) Average genetic interaction score across all double mutants involving a given Fos (*left*) or Jun (*right*) single mutant in function of its PPI score for the multiplicative (*top*) and thermodynamic (*bottom*) models. Data in all panels is for replicate 1.

https://doi.org/10.7554/eLife.32472.007

The outcome of the double mutations was well predicted by multiplying the PPI scores of the constitutive single mutants (percentage of variance explained of 85–86% in all three replicates, Figure 3B), that is, by assuming no genetic interaction between mutations. We calculated a genetic interaction score for each double mutant as the difference between the observed and predicted PPI scores (Supplementary file 1). Negative and positive genetic interactions (16,394 and 11,653 cases, respectively, at a 20% FDR, one-sample t-test) thus represent double mutants with lower or higher interaction strength than expected, respectively. The genetic interaction scores are well correlated between replicates with a distribution centered on zero and long tails of positive and negative scores (Figure 3—figure supplement 1). Thus, as observed in other systems (Araya et al., 2012; Olson et al., 2014), genetic interactions make an important contribution to the outcome of double mutations.

Global dependencies in the genetic interaction landscape

The genetic interaction scores are, however, strongly dependent on the single mutant PPI scores (Figure 3C). Combining two mutants that both moderately reduce PPI strength is likely to result in a negative genetic interaction (Figure 3C). Positive genetic interactions are, however, generally detected between two mutations that greatly weaken the interaction and also often when combining strength-increasing and strength-decreasing mutations (Figure 3C).

A thermodynamic model accurately predicts double mutant outcome

To account for these trends, we considered the thermodynamics of a PPI, relating the concentration of the bound and total subunits to the free energy of a dimeric complex (equation 9 in the Materials and methods). This model has only three free parameters that need to be fitted from the data, representing the total concentration of each protein and the background growth in the PCA selection (see Materials and methods). In the model the changes in free energy (ΔΔG, expressed in arbitrary units) for the mutations are additive but there is a sigmoidal relationship between PPI scores and ΔΔGs (Figure 3D).

Fitting the three parameters from the data (Figure 3—figure supplement 2A–B) reveals that the model provides very good prediction of how mutations in the two proteins combine together (percentage of variance explained of 89–90% in all three replicates, n = 107,618 mutation combinations, Figure 3E). The model also removes the systematic trend in the genetic interaction scores across mutation pairs with different individual effects (Figure 3F–G). Indeed, because of the sigmoidal nature of the model, a single mutant that decreases ΔΔG will increase PPI scores to a lower extent in the wild-type context than when combined with a mutation that destabilized the complex because of the saturation effect caused by the plateau of the sigmoid.

Specific interactions between structurally-related mutations

To investigate the remaining genetic interactions not accounted for by the thermodynamic model, we calculated residual genetic interaction scores as the difference between the observed double mutant PPI score and the thermodynamic model prediction. These new genetic interaction scores also correlate well amongst the three replicates, with a narrow peak centered on zero interaction and long tails of rare strong positive and negative genetic interactions (Figure 3—figure supplement 2C).

We observed more cases of strong negative (1711, 1.6%) than positive (883, 0.82%) genetic interactions (absolute score > 0.1, FDR = 0.2, Figure 3—figure supplement 2D–G). These strong interactions are enriched between particular Fos and Jun residues (Figure 4—figure supplement 1), with positive genetic interactions concentrated between positions close in the sequence of heptad positions (along the diagonal of the matrix in Figure 4A) and negative genetic interactions more spread-out in the structure and less enriched between specific pairs of positions (Figure 4A–B and Figure 4—figure supplement 2). Both directions of interaction are enriched between residues at the interface of the PPI and between residues close in space (Figure 4C–D and Figure 4—figure supplement 3), with this stronger for positive than for negative interactions. Positive interactions are therefore particularly enriched between contacting residues, identifying ‘lock and key’ specificity residues (Horovitz, 1996). We refer to these interactions beyond the interactions predictable from the global thermodynamic model as structural genetic interaction. For instance, in the wild-type PPI, the Glu residue in position 3g of Fos establishes a salt-bridge with the Arg residue in position 4e of Jun (Figure 4E). The individual mutations Glu3gLys and Arg4eGlu both destabilize the PPI by replacing the salt-bridge by repulsive electro-static interactions (Glu-Arg replaced by Lys-Arg and Glu-Glu with average PPI scores of 0.84 and 0.71, respectively). However, the two mutations compensate each other by recreating the salt-bridge (Glu-Arg replaced by Lys-Glu) and restoring a neutral PPI score of 0.98. Additional examples are shown in Figure 4E.

Figure 4 with 3 supplements see all

Download asset Open asset

Structural genetic interactions.

(A) Heatmap (*top*) and distribution (*bottom*) of percentage of significantly (absolute genetic interaction score > 0.1, FDR < 0.2) positive (*left*) or negative (*right*) genetic interactions per pairs of position. Pairs of positions without any significant interactions were excluded from the distribution (*bottom*). (B) Pairs of position significantly enriched in positive (*green*) or negative (*purple*) genetic interactions (Fisher’s exact test, FDR = 10%). Each pair of position was classified according to its heptad position type and the distance between the two position (bottom matrix, yellow cells). (**C–D**) Enrichment for significantly positive (*green*) or negative (*purple*) genetic interactions between different position types (C) and at different distance threshold between the two positions (D). *, FDR < 0.1. **, FDR < 0.01. ***, FDR < 0.001. n.s., non-significant. (E) Example of structural interactions in the Fos-Jun complex (pdb: 1fos). Dashed yellow lines represent polar interactions predicted by pymol and the mutant structures were drawn using the pymol mutagenesis function.

https://doi.org/10.7554/eLife.32472.010

Comparing genetic interactions in cis and trans

In addition to combining pairs of mutations in the two different proteins, we also quantified the effects of 17,688 double amino acid changes within Fos alone (99% of cis double mutant combinations reachable through combinations of single nucleotide changes; for all comparisons between the trans and cis libraries below, we only consider mutants reachable by single nucleotide changes in both libraries; Figure 5A, Figure 5—figure supplement 1A and Supplementary file 3 and 4). The PPI scores for single mutants correlate very well between the two libraries (R = 0.96, Figure 5—figure supplement 1B), further validating the reproducibility of the deepPCA method.

Figure 5 with 9 supplements see all

Download asset Open asset

Comparison of double mutant mutation outcome and genetic interactions in *cis* and *trans*.

(A) Cartoon illustrating how the *cis* library differs from the *trans* one. NNS, whole codon substitution. Asterisk, point mutation. (B) Scatter plot between Average PPI scores of identical pairs of mutations at the same positions in *cis* and *trans*. (C) Distribution of double mutants PPI scores in the original *cis* and *trans* libraries or after matching their single mutant effects distributions. Error bars represents 95% confidence intervals around the mean of 1000 sub-samplings when matching the two libraries. (D) Stacked bar-plot showing the proportion of the non-random variance in double mutant PPI scores that is not accounted for by the multiplicative model, explained by the thermodynamic model and the residual structural genetic interactions. Error bars represent the standard error of the mean. (E) Proportion of significant positive and negative genetic interactions in the two original libraries. See Figure 5—figure supplements 2, 3, 7 and 9 for sub-sampled libraries with matched single mutant effect distributions.

https://doi.org/10.7554/eLife.32472.014

There is a good correlation between the PPI scores of cis and trans double mutants consisting of exactly the same pairs of substitutions at the same positions (R = 0.77, n = 5451 identical double mutants quantified in cis and trans, Figure 5B). This correlation indicates that the structural determinants of mutation effects in FOS and JUN remain well conserved despite sequence divergence over long evolutionary timescales. However, the distributions of double mutant effects are quite different for the cis and trans combinations (Figure 5C). This could be either due to different levels of genetic interactions or merely to the combination of different distribution of single mutant effects (FOS x FOS in cis and FOS x JUN in trans). To control for differences in the distributions of single mutant effects in the two libraries, we sub-sampled the libraries to match their single mutant effect distributions (Figure 5—figure supplement 2A, see Materials and methods). This revealed that, even when controlling for single mutant effect sizes, two mutations within Fos are more likely to increase the strength of the PPI than one mutation in Fos combined with a second mutation in Jun (p < 10⁻³ over 1000 sub-samplings, 5.2% vs 3.4%, respectively, for PPI scores > 1.04, Figure 5—figure supplement 2B). Two mutations in Fos are slightly less likely to destroy the PPI than a trans mutation combination (25.5% vs 27.8%, respectively, p < 10⁻³ for PPI scores < 0.64, Figure 5—figure supplement 2C) but have slightly more intermediate negative effects (p < 10⁻³, 39.4% vs 35.3%, respectively, for PPI scores between 0.64 and 0.92, Figure 5—figure supplement 2D–E). Whether mutations of the same individual effect sizes combine together in cis or in trans therefore influences the double mutant outcome.

The thermodynamic model also accurately predicts interactions in cis

Because leucine zippers, including Fos and Jun, fold upon binding (Patel et al., 1990; Thompson et al., 1993), the same thermodynamic model based on a two-state equilibrium between the two unfolded proteins and the complex can describe how mutations combine in cis as well as in trans. We tested how well the thermodynamic model with parameters fitted on the trans double mutants predicted the cis library data and found that it gave very good prediction (percentage of variance explained of 82–83% for cis vs. 90–91% for trans combinations, Figure 5—figure supplement 3A–B). Similarly, fitting the thermodynamic model on the cis double mutants gave very good prediction of the trans library data (percentage of variance explained of 84% for cis and 90% for trans, Figure 5—figure supplement 3A–B). A common thermodynamic model therefore accounts very well for how mutations combine in both cis and trans to change the PPI (Figure 5D and Figure 5—figure supplement 3C). Therefore, just as in trans-, cis-genetic interactions have a component that results from the non-linear relationship between free energy and protein complex concentration.

Structural interactions in the cis interaction landscape

We then tested whether the residual component of cis-genetic interactions are also enriched for structural interactions. The strongest cases of structural cis interactions (absolute genetic interaction score > 0.1 and FDR < 0.2 in both libraries, Figure 5—figure supplement 4) are indeed also enriched between proximally located residues but are less restricted to pairs of positions that are both at the PPI interface and involve more far side positions compared to trans-genetic interactions (Figure 5—figure supplement 5). cis-genetic interactions are also less enriched at specific positions and more dispersed throughout the structure (Figure 5—figure supplement 6). These results are robust to the magnitude threshold used to call strong genetic interactions and to the differences in single mutant effects between the two libraries (Figure 5—figure supplement 7 and Figure 5—figure supplement 8). Thus, cis-genetic interactions can be subdivided into the same two components as trans-genetic interactions, genetic interactions that results from the non-linearity of the general relationship between protein complex concentration and free energy and specific structural interactions.

Structural interactions are more abundant in cis than in trans

Interestingly, structural genetic interactions explain more of the variance in double mutant PPI scores when mutations are combined in cis than in trans (Figure 5D). Indeed, both positive and negative structural genetic interactions (genetic interactions not accounted for by the thermodynamic model; Supplementary files 3 and 4) are more abundant in cis than in trans (1493 vs. 835 true cases of positive and 1319 vs. 1128 true cases of negative genetic interactions in cis and trans, respectively, at p < 0.031, one sample t-test, FDR < 0.15 and 0.2; Figure 5E and Figure 5—figure supplement 9). This higher prevalence of genetic interactions in cis can potentially be explained by a higher number of contacts between positions within FOS than across the interface (1040 cis vs. 518 trans mutants pairs at positions within 5 Å of each other), supporting our previous result that proximity is a major determinant of genetic interactions both in cis and trans in a PPI interface.

Discussion

Here, we have presented a protein-protein interaction assay – deepPCA – that allowed us to quantify the effects of >120,000 combinations of mutations in both cis and trans on the physical interaction between the products of the FOS and JUN proto-oncogenes. This provided a comprehensive and systematic data for how a very large number of different mutations in two genes combine to alter a biological function, allowing us to investigate the causes of genetic interactions between mutated genes and the extent to which genetic interactions can be quantitatively predicted. In its current form, deepPCA is limited to small domains because of the limit in the amplicon size for paired-end sequencing. However, the use of barcodes (Hiatt et al., 2010) would allow the assay to be applied to longer proteins.

Our data reveal that physical interactions in the cell generate two distinct types of genetic interaction: interactions due to the sigmoidal relationship between the concentration of a protein complex and the free energy of an interaction (Figure 3) and specific, structural interactions (Figure 4).

The general genetic interactions that arise in the physical interaction between molecules is one of several non-linear mappings that can occur between changes in genotype and changes in phenotype. Additional non-linearities occur in the folding of individual proteins or RNAs (‘threshold robustness’) (Bershtein et al., 2006; Olson et al., 2014; Tokuriki and Tawfik, 2009), in saturating enzyme flux (Kacser and Burns, 1973; Stiffler et al., 2015), and in regulatory dynamics (Gjuvsland et al., 2007; Omholt et al., 2000).

This type of genetic interaction is cumulative and easily predictable, for example a three parameter thermodynamic model accounts for ~90% of the variance in our dataset of >120,000 genotypes. The magnitude of this type of genetic interaction can also be predicted when combining three or more mutations together. A better knowledge of all the sources of non-linearities between the genotype and the phenotype is therefore critical to model how genotypic variation translates into phenotypic changes.

The second type of genetic interactions generated by molecular interactions is thermodynamically non-additive. These interactions are enriched between physically contacting and proximal residues, but can also involve some long-range indirect interactions (Halabi et al., 2009). Structural genetic interactions have a more complex basis and are therefore more difficult to predict. Gathering comprehensive data similar to that described here for additional PPIs will help to further elucidate the structural determinants of genetic interactions and the rules for predicting them.

This second type of genetic interactions generated by a protein-protein interaction was more important when combining mutations in cis within the same protein than when combining mutations in trans between the two molecules. This results in a different distribution of double mutant outcomes when combining mutations in cis and trans. Whether a second mutation happens in cis or in trans can therefore impact an evolutionary outcome.

A substantial fraction of genetic interactions could however not be explained by structural contacts. Some other mechanisms not accounted for by the model could be at play. For instance, non-linearities between the growth rate and complementation complex in the protein-complementation assay could artificially produce genetic interactions. However, such saturation effects are unlikely in the range of expression and binding affinities in this study because they would lead to diminishing returns when combining two strength-increasing mutations, which is not observed (Figure 3C). Moreover, Levy et al. have shown that growth is correlated to complementation complex concentration over a wide-range of concentrations (Levy et al., 2014). A more likely source of actual genetic interactions could come from JUN’s ability to form homodimers, which are however less stable than the Fos-JunN heterodimer (Chinenov and Kerppola, 2001). Mutations affecting the equilibrium between the Jun-Jun homodimer and the Fos-Jun heterodimer could indeed have effects that would not be predicted by our thermodynamic model. Elucidating the remaining mechanisms of genetic interactions will thus require further studies that take these effects into account.

Our approach complements the large-scale efforts to comprehensively map genetic interactions between gene deletions or representative alleles of yeast genes (Tong et al., 2004; Costanzo et al., 2010; Costanzo et al., 2016). Gene deletions are, however, rare in nature, with most genetic variation consisting of point mutations of diverse and difficult to predict effects. Our data provides a comprehensive view of how point mutations within two genes interact to affect a biological function. It will be interesting to extend this strategy to quantifying the effects of point mutation combinations on additional phenotypes beyond PPIs, including applying it to gene pairs that do not encode directly physically interacting proteins but instead participate in regulatory interactions or the same biological process.

Materials and methods

All experiments were performed in triplicates starting from the transformation of the variant libraries into yeast.

Deep sequencing data is available at GEO with accession number GSE102901 (reviewer token: yvgbkwqajvypvah).

All perl and R scripts used to analyze the data are available at https://github.com/gdiss/Diss_et_al_eLife_2018 (Diss, 2018; copy archived at https://github.com/elifesciences-publications/Diss_et_al_eLife_2018)

Share this article

Cite this article

deepPCA.

Effects of single mutants.

A thermodynamic model predicts double mutant outcomes.

Structural genetic interactions.

Comparison of double mutant mutation outcome and genetic interactions in cis and trans.

Author details

Guillaume Diss

Contribution

Competing interests

Ben Lehner

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organisms

Further reading