Figures and data in Intrinsically disordered linkers determine the interplay between phase separation and gelation in multivalent proteins

Figures
Videos
Tables
Additional files

11 figures, 2 videos, 2 tables and 2 additional files

Figures

Figure 1

Download asset Open asset

Depiction of gelation without phase separation as opposed to phase separation plus gelation.

(a) Schematic of a synthetic multivalent system. SH3 domains bind to proline-rich modules (PRMs). Multivalent SH3 and PR proteins result from the tethering of multiple SH3 domains (or PRMs) by linkers. (b) Schematic of gelation without phase separation: If the bulk concentration of interaction domains is above the gel point but below the saturation concentration then a system spanning network forms across the entire system volume. In this scenario, a percolation transition is realized without phase separation. (c) Schematic of phase separation plus gelation. Linker-mediated cooperative interactions of multivalent proteins drive phase separation, depicted here as a confinement of molecules into a smaller volume (gray envelope) when compared to the system volume (dashed bounding box). If the bulk concentration of interaction domains is higher than a saturation concentration then a dense phase comprising of multivalent SH3 and PRM proteins will be in equilibrium with a dispersed phase of unbound proteins. A droplet-spanning network will form because the concentration of interaction domains within the dense phase is above the gel point.

https://doi.org/10.7554/eLife.30294.003

Figure 2

Download asset Open asset

Illustration of the impact of linker effective solvation volumes on the conformational fluctuations and inter-domain distances in linear multivalent proteins.

(a) Schematic of three SH3 domains connected by positive v_es linkers. In a cartoon schematic, the SH3 domains are shown as blue squares and the linkers are depicted as red tethers. The bidirectional arrows indicate the mapping between the molecular structures and the cartoon schematic. (b) Comparative schematics of SH3 domains connected by different types of linkers. The top row shows a pair of domains connected by linkers of high positive effective solvation volumes. For linkers with near zero effective solvation volumes, the inter-domain distances are characterized by large fluctuations and this engenders large concentration fluctuations. The bottom row shows the scenario for domains connected by linkers with negative v_es values. In this scenario, the inter-domain distances seldom exceed the sum of the individual radii of gyration.

https://doi.org/10.7554/eLife.30294.004

Figure 3

Download asset Open asset

Effective solvation volumes for disordered linkers from the human proteome.

(a) Inter-residue distance profiles for fourteen representative sequences, each 40-residues long. The legend shows the fraction of charged residues within each linker. The green dashed curve shows the inter-residue distance profile for the reference FRC limit. (b) Summary of the variation of ∆ as a function of the fraction of charged residues for the fourteen representative sequences. Here, ∆ $� = \frac{1}{N} \sum_{k} \frac{⟨ R_{k} ⟩ - ⟨ R_{k}^{FRC} ⟩}{⟨ R_{k}^{FRC} ⟩}$ , N is the number of linker residues, $⟨ R_{k} ⟩$ is the average spatial separation between residue pairs that are k apart in the linear sequence, $⟨ R_{k}^{FRC} ⟩$ is the corresponding spatial separation for a FRC chain, and the summation index k runs across all sequence-separations. Linkers for which ∆ < –0.1 will have negative effective solvation volumes (v_es < 0); linkers for which –0.1 ≤ ∆≤0.1 will have near zero effective solvation volumes (v_es ≈ 0); and linkers for which ∆>0.1, will have positive effective solvation volumes (v_es > 0). For the self-avoiding random coil (SARC) linkers, ∆ ≈ 0.5 and this is shown as a horizontal red line. (c) Length distribution of all 226 unique disordered linkers. (d) Distribution of ∆ values extracted from all-atom simulations of all 226 linkers. Based on the results shown in panel (B), we delineate the ∆-distribution into three regimes: ∆ < –0.1 (blue bars), –0.1 ≤ ∆≤0.1 (green bars), and ∆>0.1 (red bars). These regimes correspond, respectively to linkers for which v_es is less than zero, near zero, or greater than zero.

https://doi.org/10.7554/eLife.30294.005

Figure 4

Download asset Open asset

Coarse-grained bead-tether lattice models for modeling the phase behavior of multivalent proteins.

All simulations were performed using 3-dimensional cubic lattice models. In these models, poly-SH3 and poly-PRM proteins were modeled as bead-tether polymers where the red beads mimic an SH3 domain, the blue beads mimic PRMs, and the black or gold tethers mimic linkers that connect domains/modules to one another. Two beads cannot occupy the same lattice site. Panel (a) shows an implicit linker model. To mimic FRC linkers, implicit linkers ensure that two tethered beads cannot move apart beyond a maximum distance, but the linker itself does not occupy any lattice sites. Panel (b) shows the explicit linker model. To mimic SARC linkers, explicit linkers consist of non-interacting beads corresponding to a prescribed number of lattice sites. The explicit linkers tether two folded domains together, but other than occupying sites on the lattice they do not engage in interactions with one another or with the interaction domains. Note that in the explicit linker model each linker bead and interaction domain occupies a single lattice site. This choice was motivated by previous analysis of the comparative effective solvation volumes of FRC and SARC linkers (Mittal et al., 2014). In the figure, the linker beads are represented as being smaller than the interaction beads to emphasize that they are linkers. The real simulation box used is much larger than the lattice dimensions pictured here, which is just for illustration purposes.

https://doi.org/10.7554/eLife.30294.008

Figure 5

Download asset Open asset

Illustration of how ρand ϕ_c are calculated.

(a) *The scenario where ρ >>1.* The radius of gyration over all proteins is the root mean square distance of each of the proteins from the center of mass of the system of proteins and is depicted as the radius of the dashed red envelope. Although the red envelope is centered on the cluster, it extends beyond the cluster boundary due to the presence of proteins outside of the cluster; that is, *R_g^proteins* is always calculated over *all* proteins in the system. When a majority of the proteins are spatially clustered, the calculated *R_g^proteins* is considerably smaller than the radius of the lattice, and hence the ratio ρ >>1. *R_g^lattice* is shown as a black dashed envelope. In panel (a) a majority of the proteins are found within a single droplet-spanning cluster. This cluster encompasses ~ 80% of the modules, hence ϕ_c ~80%. Modules belonging to the single largest system spanning clusters are shown in yellow, the crosslinks are shown in green, and the ‘system’ here refers to the droplet. (b) *The scenario where ρ ≈ 1*. In this case, the modules are dispersed across the lattice volume as shown by the fact that the dashed red envelope is essentially coincident with the dashed black envelope. Here, we depict a scenario where 80% of the modules are incorporated into the single largest system-spanning cluster, where the ‘system’ volume corresponds to that of the entire lattice.

https://doi.org/10.7554/eLife.30294.009

Figure 6

Download asset Open asset

Comparative analysis of the connectivity and density transitions for multivalent proteins of fixed linker lengths.

(a) Heat maps showing ϕ_c as a function of changes to SH3 and PRM concentrations for multivalent proteins with FRC linkers. Progression from cool to hot colors leads to the incorporation of most of the modules into the single largest cluster. The module concentrations at which sharp changes in connectivity are realized will decrease with increasing valence. (b) Heat maps equivalent to those of panel (a) for multivalent proteins with SARC linkers. (c) Analysis of how ϕ_c changes with module concentration for equal concentrations SH3 modules to PRMs. The solid curves plot ϕ_c for proteins with SARC linkers and the dashed curves are results for FRC linkers. The legend provides an annotation of the color scheme for the different curves. (d) Heat maps showing ρ as a function of changes to SH3 and PRM concentrations for multivalent proteins with FRC linkers. Comparison to panel (a) shows the congruence between changes to ρ and ϕ_c, especially for the 5:5, 5:7, 7:5, and 7:7 systems. (e) Heat maps showing ρ as a function of changes to SH3 and PRM concentrations for multivalent proteins with SARC linkers. The value of ρ does not change and remains close to one irrespective of the valence or module concentration. (f) Analysis of how ρ changes with module concentration for equal concentrations SH3 modules to PRMs. The solid curves are for proteins with SARC linkers and this shows that ρ ≈ 1, irrespective of the module concentrations. As discussed in the text and summarized in Figure 7, phase separation is suppressed for systems with SARC linkers and this is reflected in the invariance of ρ. The dashed curves, for the 5:5 and 7:7 systems with FRC linkers show a sharp change above a threshold concentration of the modules. The behavior at high module concentrations is partly an artifact of our approach to increasing concentrations in the simulations, which involves fixing the number of modules and decreasing the volume of the simulation box. Accordingly, the radius of the lattice will decrease, thus decreasing ρ. However, ρ is greater than one above a critical concentration, thus emphasizing the coupling between phase separation and gelation for proteins with FRC linkers.

https://doi.org/10.7554/eLife.30294.010

Figure 7

Download asset Open asset

Representative, post-equilibration, snapshots for the 7:7 system above the gel points with FRC, panel (a), and SARC linkers, panel (b) of length n = 5.

In panel (a), the SH3 modules are shown in red and the PRMs in blue. In panel (b), the coloring is similar to panel (a). Additionally, molecules that are part of the single largest, system-spanning cluster are shown in orange. The main message conveyed here is that the SARC linkers suppress phase separation whereas the FRC linkers lead to gelation driven by phase separation.

https://doi.org/10.7554/eLife.30294.013

Figure 8

Download asset Open asset

Quantifying cooperativity and the coupling between phase separation and gelation.

(a) Plot of c* as a function of linker length for three symmetric multivalent systems connected by FRC linkers. There is an optimal range for linker lengths where c*<1, implying positive global cooperativity that gives rise to phase separation plus gelation. For long linkers, c* converges to unity, implying an absence of cooperativity and pure sol-gel transitions, in accord with Flory-Stockmayer theories. (b) Plot of c* as a function of linker length for three symmetric multivalent systems connected by SARC linkers. The value of c* is greater than unity for all linker lengths. This points to the suppression of phase separation by linkers with positive effective solvation volumes, and a shifting of the gel point to higher concentrations compared to the Flory-Stockmayer threshold. The linker length in terms of number of amino acids can be written as N ≈ 7 n, where n is the number of lattice sites and N is the number of residues.

https://doi.org/10.7554/eLife.30294.014

Figure 9

Download asset Open asset

Phase diagram for a 5:5 system with a hybrid five-site linker.

Here, for each linker, two of the linker beads were modeled explicitly, while the other three were modeled implicitly. For low binding affinities between SH3 domains and PRMs (<3 *_kBT*), the system undergoes a sol-gel transition as a function of module concentration, and the affinity-specific gel points lie on the green dashed line. The red asterisk denotes the critical point located at an interaction affinity of ~3 *_kBT* and a module concentration of ~10^–3polymers/voxel. Above an interaction affinity of ~3 *_kBT*, the system undergoes phase separation plus gelation. Phase separation is characterized by a coexistence curve with two arms, shown in blue and purple. A solution with a bulk concentration that falls within the yellow region will never form a one-phase solution. Instead, it will separate into coexisting dilute and dense phases. The concentrations within these phases are equal to the concentrations taken from coexistence curves that intersect with the corresponding tie line (red dotted line). This is illustrated for interaction strengths of 4.5k_BT. Any solution with a bulk concentration along the tie line will phase separate into a dense phase and a dilute phase of a fixed concentration *c_sl* and *c_sh*, respectively. For this system, the high concentration arm of the coexistence curve always lies beyond the gel-line, and therefore, the dense phase will always form a gel. The gel line within the two-phase region is calculated based on the percolation threshold and is shown as a dotted green line, which is really an extrapolation of the green dashed line. It highlights the fact that *c_sl* <*c_g <* _csh throughout the two-phase regime. The callouts on the right show schematics of the dilute sol coexisting with a dense gel (top right) and a system spanning gel that forms via gelation without phase separation (bottom right).

https://doi.org/10.7554/eLife.30294.015

Figure 10

Download asset Open asset

Impact of linker v_es values on coupling between phase separation and gelation for 5:5 systems with linkers of length n = 5.

Progressing from panel a) to panel f), the value of v_es for each of the linkers increases from 0 to 5 in terms of number of lattice units. The widths of the regimes that correspond to phase separation (yellow regions) shrink as the effective solvation volumes of linkers increase. For the fully implicit, FRC linker (panel a), gelation without phase separation either requires shorter linkers or interaction affinities that are weaker than 2k_BT. The sol-gel lines are shown as dashed lines in each panel. Accordingly, for a) and b) the gelation without phase separation are realized for SH3: PRM affinities that are weaker than 2k_BT and hence they are not shown in these panels. Each panel is annotated with a schematic to show the design of hybrid linkers and each schematic we shown only a single linker for clarity.

https://doi.org/10.7554/eLife.30294.016

Figure 11

Download asset Open asset

Estimating ϕ_cc – the critical value of the fraction of molecules in the largest cluster, ϕ_c that defines the gel point: To estimate **ϕ_cc,** we plot **ϕ_c** against the fraction of SH3 domains and PRMs that are bound.

ϕ_c was calculated using a random network model (see Materials and methods) and for a prescribed affinity between interaction domains. ϕ_c shows a sigmoidal transition that shifts to the right for systems of lower valence (V). For each system, the dashed vertical lines quantify the percolation thresholds, which refer to the fraction of modules for a given valence V that must be bound in order to make a percolated network as prescribed by the theories of Flory and Stockmayer. For a given system of multivalent proteins, the intersection between the solid sigmoidal curve and the dashed vertical line quantifies the value of ϕ_cc.

https://doi.org/10.7554/eLife.30294.017

Videos

Video 1

Download asset

posterframe for video — Demonstration of gelation driven by phase separation for the 7:7 system of poly-SH3 and poly-PRM.

The color-coding is such that SH3 domains are in red and PRMs are in blue. The simulations start with the molecules dispersed uniformly across the simulation volume. The movie shows droplet formation leading to gelation for bulk concentrations of SH3 domains and PRMs that lie above the saturation concentration *c_sl*.

https://doi.org/10.7554/eLife.30294.011

Video 2

Download asset

Tables

Table 1

Summary of the parameters, the physical description of these parameters, and the default values used for the parameters of the lattice model.

https://doi.org/10.7554/eLife.30294.006

Parameter	Physical interpretation	Default value
Valence	Number of PRMs and SH3 domains per poly-PRM and poly-SH3	5 (but titrated for results in Figure 6)
Interaction Strength	Intrinsic affinity between PRMs and SH3 domains	–2k_BT
Linker Length	Length of disordered linker between interaction domains	5 (but titrated for results in Figure 8)
Effective solvation volume (v_es)	Degree to which the Linker Prefers Interacting with Solvent	Proportional to the number of explicitly modeled linker beads

Table 2

Details of the fourteen sequences chosen at random from the human proteome.

All sequences have identical lengths (40 residues) and are enriched in disorder promoting residues. The sequences are listed in descending order of the fraction of charged residues.

https://doi.org/10.7554/eLife.30294.007

Sequence	FCR*	NCPR^†	Fraction of disorder promoting residues	UNIPROT identifier of protein from which the sequence was drawn
EDEDSEKEEEEEDKEMEELQEEKECEKPQGDEEEEEEEEE	0.80	–0.60	0.93	P37275
DEEGNAYGSEREEEDEEEDEEDGKRELELEEEELGGEEED	0.70	–0.55	0.88	P78415
REKDREKYSQREQERDRQQNDQNRPSEKGEKEEKSKAKEE	0.65	0.00	0.93	Q9H0G5
DRVVVTDDSDERRLKGAEDKSEEGEDNRSSESEEESEGEE	0.60	–0.30	0.88	Q9BQG0
EAYRLSLEADRAKREAHEREMAEQFRLEQIRKEQEEEREA	0.55	–0.10	0.88	Q9UNN5
RRQRRWEDIFNQHEEELRQVDKDKEDESSDNDEVFHSIQA	0.50	–0.15	0.73	Q7Z2Y5
NNRKGRGGNRGREFRGEENGIDCNQVDKPSDRGKRARGRG	0.45	0.15	0.76	Q5T6F2
QKQKLRLLSSVKPKTGEKSRDDALEAIKGNLDGFSRDAKM	0.40	0.10	0.75	Q9UMZ2
AEMKVLESPENKSGTFKAQEAEAGVLGNEKGKEAEGSLTE	0.35	–0.10	0.78	Q8N3D4
MAAAESDKDSGFSDGSSECLSSAEQMESEDMLSALGWSRE	0.30	–0.20	0.78	Q9C0C6
DHFMKSGFASGRNFGNRDAGECNKRDNTSTMGGFGVGKSF	0.25	0.05	0.68	Q9NQI0
TAVSTSGPEDICSSSSSHERGGEATWSGSEFEVSFLDSPG	0.20	–0.15	0.80	Q9BQQ3
FSTLGRLRNGIGGAAGIPRANASRTNFSSHTNQSGGSELR	0.15	0.10	0.73	Q9Y252
KSSSQTSGSLVSKSTSLASVSQLASKSSSQTSTSQLPSKS	0.10	0.10	0.85	Q9NXV6

*FCR: Fraction of charged residues defined as (f₊+f_–) where f₊ and f_– denote the fraction of positive and negative charges, respectively;

†NCPR: Net charge per residue defined as (f₊ – f_–)

Additional files

Supplementary file 1 Excel spreadsheet summarizing the proteome-wide analysis of naturally occurring intrinsically disordered linkers in linear multivalent proteins. Data include the Uniprot ID, the name of the protein from which the linker sequence is drawn, the linker length in terms of number of amino acids, the start and end positions in terms of amino acid numbers for each linker, the disorder score on a scale of 0 to 1, the value of ∆ (see Figure 3), the fraction of positively charged residues (f₊ or Fpos), the fraction of negatively charged residues (f_– or Fneg), location of the diagram-of-states developed by Das and Pappu (Das et al., 2015; Holehouse et al., 2017; Das and Pappu, 2013), amino acid sequence of the linker, Gene Ontology molecular function annotation, Gene Ontology biological process annotation, and Gene Ontology cellular location/component annotation.: https://doi.org/10.7554/eLife.30294.018
Download elife-30294-supp1-v2.xlsx
Transparent reporting form: https://doi.org/10.7554/eLife.30294.019
Download elife-30294-transrepform-v2.pdf

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Article PDF

Open citations (links to open the citations from this article in various online reference manager services)

Mendeley

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Tyler S Harmon
Alex S Holehouse
Michael K Rosen
Rohit V Pappu

(2017)

Intrinsically disordered linkers determine the interplay between phase separation and gelation in multivalent proteins

eLife 6:e30294.

https://doi.org/10.7554/eLife.30294

Figures

Depiction of gelation without phase separation as opposed to phase separation plus gelation.

Illustration of the impact of linker effective solvation volumes on the conformational fluctuations and inter-domain distances in linear multivalent proteins.

Effective solvation volumes for disordered linkers from the human proteome.

Coarse-grained bead-tether lattice models for modeling the phase behavior of multivalent proteins.

Illustration of how ρand ϕ_c are calculated.

Comparative analysis of the connectivity and density transitions for multivalent proteins of fixed linker lengths.

Representative, post-equilibration, snapshots for the 7:7 system above the gel points with FRC, panel (a), and SARC linkers, panel (b) of length n = 5.

Quantifying cooperativity and the coupling between phase separation and gelation.

Phase diagram for a 5:5 system with a hybrid five-site linker.

Impact of linker v_es values on coupling between phase separation and gelation for 5:5 systems with linkers of length n = 5.

Estimating ϕ_cc – the critical value of the fraction of molecules in the largest cluster, ϕ_c that defines the gel point: To estimate ϕ_cc, we plot ϕ_c against the fraction of SH3 domains and PRMs that are bound.

Videos

Demonstration of gelation driven by phase separation for the 7:7 system of poly-SH3 and poly-PRM.

Demonstration of gelation without phase separation for the 7:7 system of poly-SH3 and poly-PRM.

Tables

Summary of the parameters, the physical description of these parameters, and the default values used for the parameters of the lattice model.

Details of the fourteen sequences chosen at random from the human proteome.

Additional files

Supplementary file 1

Transparent reporting form

Download links

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Be the first to read new articles from eLife

Share this article

Cite this article

Depiction of gelation without phase separation as opposed to phase separation plus gelation.

Illustration of the impact of linker effective solvation volumes on the conformational fluctuations and inter-domain distances in linear multivalent proteins.

Effective solvation volumes for disordered linkers from the human proteome.

Coarse-grained bead-tether lattice models for modeling the phase behavior of multivalent proteins.

Illustration of how ρand ϕc are calculated.

Comparative analysis of the connectivity and density transitions for multivalent proteins of fixed linker lengths.

Representative, post-equilibration, snapshots for the 7:7 system above the gel points with FRC, panel (a), and SARC linkers, panel (b) of length n = 5.

Quantifying cooperativity and the coupling between phase separation and gelation.

Phase diagram for a 5:5 system with a hybrid five-site linker.

Impact of linker ves values on coupling between phase separation and gelation for 5:5 systems with linkers of length n = 5.

Estimating ϕcc – the critical value of the fraction of molecules in the largest cluster, ϕc that defines the gel point: To estimate ϕcc, we plot ϕc against the fraction of SH3 domains and PRMs that are bound.

Demonstration of gelation driven by phase separation for the 7:7 system of poly-SH3 and poly-PRM.

Demonstration of gelation without phase separation for the 7:7 system of poly-SH3 and poly-PRM.

Summary of the parameters, the physical description of these parameters, and the default values used for the parameters of the lattice model.

Details of the fourteen sequences chosen at random from the human proteome.

Supplementary file 1

Transparent reporting form

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Illustration of how ρand ϕ_c are calculated.

Impact of linker v_es values on coupling between phase separation and gelation for 5:5 systems with linkers of length n = 5.

Estimating ϕ_cc – the critical value of the fraction of molecules in the largest cluster, ϕ_c that defines the gel point: To estimate ϕ_cc, we plot ϕ_c against the fraction of SH3 domains and PRMs that are bound.