Coevolution-based inference of amino acid interactions underlying protein function

  1. Victor H Salinas
  2. Rama Ranganathan  Is a corresponding author
  1. UT Southwestern Medical Center, United States
  2. The University of Chicago, United States
8 figures, 1 table and 2 additional files

Figures

Figure 1 with 2 supplements
A deep coupling scan (DCS) for the PDZ binding pocket.

(A), The thermodynamic double mutant cycle (TDMC), a formalism for studying the energetic coupling of pairs of mutations in a protein. Given two mutations (x at postion i and y at position j), …

https://doi.org/10.7554/eLife.34300.002
Figure 1—figure supplement 1
The bacterial two-hybrid assay for PDZ ligand binding.

(A) The bacterial two-hybrid system consists of three parts, (1) a PDZ domain fused to the C-terminus of a λ-cI DNA binding domain, (2) a ligand peptide as a C-terminal fusion to the α-subunit of …

https://doi.org/10.7554/eLife.34300.003
Figure 1—figure supplement 2
Reproducibility and quality of the bacterial two-hybrid assay.

The panels show 2D histograms of all single and double mutant binding free energies for four independent experimental trials of the bacterial two-hybrid assay for PSD95pdz3. The data show that the …

https://doi.org/10.7554/eLife.34300.004
Figure 2 with 5 supplements
Distributions of pairwise thermodynamic couplings in a single PDZ homolog (PSD95pdz3).

Each subplot shows the distribution of coupling free energies (ΔΔG, see Equation 1, main text) for all measured mutants at one pair of positions in the α2-helix (numbering per Figure 1C) in PSD95pdz3.…

https://doi.org/10.7554/eLife.34300.006
Figure 2—figure supplement 1
Distributions of double mutant cycles for each α2 helix position pair (numbering as in Figure 1C) for PSD95pdz2.

As in Figures 23, distributions are fit to single or double Gaussians, using the Bayes Information Criterion to choose the model. The position of zero coupling is indicated by the solid line and …

https://doi.org/10.7554/eLife.34300.007
Figure 2—figure supplement 2
Distributions of double mutant cycles for each α2 helix position pair (numbering as in Figure 1C) for Shank3pdz.

As in Figure 3. 2–3, distributions are fit to single or double Gaussians, using the Bayes Information Criterion to choose the model. The position of zero coupling is indicated by the solid line and …

https://doi.org/10.7554/eLife.34300.008
Figure 2—figure supplement 3
Distributions of double mutant cycles for each α2 helix position pair (numbering as in Figure 1C) for Syntrophinpdz.

As in Figures 23, distributions are fit to single or double Gaussians, using the Bayes Information Criterion to choose the model. The position of zero coupling is indicated by the solid line and …

https://doi.org/10.7554/eLife.34300.009
Figure 2—figure supplement 4
Distributions of double mutant cycles for each α2 helix position pair (numbering as in Figure 1C) for ZO1pdz.

As in Figures 23, distributions are fit to single or double Gaussians, using the Bayes Information Criterion to choose the model. The position of zero coupling is indicated by the solid line and …

https://doi.org/10.7554/eLife.34300.010
Figure 2—figure supplement 5
Distributions of coupling in relative enrichment (ΔΔE) for a sampling of position pairs in the GB1 domain of the immunoglobulin-binding protein G.

The data are from (Olson et al., 2014), and position numbering is as given in that work; note that coupling is calculated here directly from enrichment scores. As in Figures 1 and 2, distributions …

https://doi.org/10.7554/eLife.34300.011
Figure 3 with 1 supplement
Homolog-averaged pairwise thermodynamic couplings in the PDZ domain.

Each subplot shows the distribution of coupling free energies (ΔΔG, see Equation 1, main text) for all measured mutants at one pair of positions in the α2-helix, but here averaged over the five …

https://doi.org/10.7554/eLife.34300.012
Figure 3—figure supplement 1
Amino acid contributions to distributions of double mutant cycle coupling energies for the PDZ α2 helix.

(A), Boxplots showing the couplings for each amino acid substitution with every other amino acid substitution in the homolog-averaged dataset. The data show that every amino acid substitution is …

https://doi.org/10.7554/eLife.34300.013
A basic model for observed distributions of double mutant cycle coupling energies.

(A), A schematic representation of two coupled equilibria in a protein molecule – a reaction with equilibrium constant Kx corresponding to function (here, binding), an internal two-state …

https://doi.org/10.7554/eLife.34300.014
Conservation and idiosyncrasy in the pattern of energetic couplings over PDZ homologs.

(A–E), Matrices of mutation averaged pairwise thermodynamic couplings for the α2-helix in each PDZ homolog. The color scale is chosen to represent the full range of measured energetic couplings. The …

https://doi.org/10.7554/eLife.34300.015
Figure 6 with 2 supplements
Coevolution-based inference of energetic couplings - SCA.

(A), Coevolution of sequence positions corresponding to the top eigenmode of the SCA matrix, derived from an alignment of 1656 eukaryotic PDZ domains (the 'Poole' alignment). The data show that a …

https://doi.org/10.7554/eLife.34300.016
Figure 6—figure supplement 1
Robustness of the SCA to alignment size.

(A–F) The graphs show scatterplots of homolog-averaged experimental couplings in the α2 helix against the first eigenmode of the SCA coevolution matrix for six independent trials of randomly …

https://doi.org/10.7554/eLife.34300.017
Figure 6—figure supplement 2
The relationship between mutation-averaged couplings and predictions from SCA for individual domains.

(A–E), Scatterplots of experimental couplings in the α2 helix of each individual PDZ homolog against the first eigenmode of the SCA coevolution matrix. Linear fits are shown in red lines with …

https://doi.org/10.7554/eLife.34300.018
Homolog-averaged thermodynamic couplings and protein sectors.

Analysis of the top eigenmodes of the SCA coevolution matrix exposes groups of coevolving amino acids that empirically are found to form physically contiguous networks in the tertiary structure, …

https://doi.org/10.7554/eLife.34300.019
Figure 8 with 2 supplements
Coevolution-based inference of energetic couplings - DCA.

(A), The matrix of direct couplings (Jij) from the DCA method, with tertiary contacts in the PDZ structure (1BE9) indicated by white or black circles. By convention (Morcos et al., 2011), trivial …

https://doi.org/10.7554/eLife.34300.020
Figure 8—figure supplement 1
Robustness of DCA to alignment size.

(A–F), The graphs show scatterplots of homolog-averaged experimental couplings in the α2 helix against computed DCA model coupling energies for six independent trials of randomly sub-sampling the …

https://doi.org/10.7554/eLife.34300.021
Figure 8—figure supplement 2
The relationship between mutation-averaged couplings and predictions from DCA for individual domains.

(A–E), scatterplots of experimental couplings for individual PDZ homologs against the coupling values computed from the DCA model. Linear fits are shown in red lines with adjusted Pearson …

https://doi.org/10.7554/eLife.34300.022

Tables

Table 1
Summary of data collection.

For each PDZ homolog, we indicate the target ligand, the wild-type affinity, the top-hit sequence identity within the ensemble of homologs, and the assay/sequencing statistics.

https://doi.org/10.7554/eLife.34300.005
PDZ homologLigandLigand sequenceAffinityTop ID% (PDZ)No. of single
mutants
(out of 171)
No. of double mutants
(out of 12,996)
Mean cycles/position pair
(out of 361)
Sequence readsunsel/sel
PSD95pdz3CRIPTTKNYKQTSV0.8 μM
(McLaughlin et al., 2012)
41.0 (Syntrophinpdz)17111,53132012,190,079/14,358,962
PSD95pdz2NMDAR2AKMPSIESDV3.6 μM
(Stiffler et al., 2006)
40.5 (PSD95pdz3)17112,0723359,402,209/14,965,473
Shank3pdzDlgap1/2/3YIPEAQTRL0.2 μM
(Stiffler et al., 2006)
25.0 (PSD95pdz2)17110,45429017,232,329/6,429,999
SyntrophinpdzScn5a (Nav1.5)PDRDRESIV1.6 μM
(Stiffler et al., 2006)
41.0 (PSD95pdz3)17110,7572988,227,200/15,248,680
ZO-1pdzClaudin8SIYSKSQYV4.6 μM
(Zhang et al., 2006)
37.5 (PSD95pdz3)171118803305,365,041/11,523,044

Additional files

Download links