Abstract
Liquid-liquid phase separation (LLPS) involving intrinsically disordered protein regions (IDRs) is a major physical mechanism for biological membraneless compartmentalization. The multifaceted electrostatic effects in these biomolecular condensates are exemplified here by experimental and theoretical investigations of the different salt- and ATP-dependent LLPSs of an IDR of messenger RNA-regulating protein Caprin1 and its phosphorylated variant pY-Caprin1, exhibiting, e.g., reentrant behaviors in some instances but not others. Experimental data are rationalized by physical modeling using analytical theory, molecular dynamics, and polymer field-theoretic simulations, indicating in general that interchain salt bridges enhance LLPS of polyelectrolytes such as Caprin1 and that the high valency of ATP-magnesium is a significant factor for its colocalization with the condensed phases, as similar trends are observed for several other IDRs. Our findings underscore the role of biomolecular condensates in modulating ion concentrations and its functional ramifications.
Introduction
Broad-based recent efforts have uncovered many intriguing features of biomolecular condensates, revealing and suggesting myriad known and potential biological functions [1–3]. These assemblies are underpinned substantially, though not exclusively, by liquid-liquid phase separation (LLPS) of intrinsically disordered regions (IDRs) as well as folded domains of proteins and nucleic acids [4, 5], while more complex equilibrium and non-equilibrium mechanisms also contribute [6–14].
Electrostatics is important for IDR LLPS [15, 16], which is often also facilitated by π-related interactions [17, 18], hydrophobicity, hydrogen bonding [19, 20], and is modulated by temperature [10, 21], hydrostatic pressure [22], osmolytes [4], RNA [23–26], salt, pH [27], and post-translational modifications (PTMs) [1, 28, 29]. Multivalency underlies many aspects of IDR properties [30–34]. Here, we focus primarily on how PTM- and salt-modulated multivalent charge-charge interactions might alter IDR condensate behaviors and their possible functional ramifications. In general, electrostatic effects on IDR LLPS [15, 25, 27, 35] are dependent upon their sequence charge patterns [36–40]. Intriguingly, some IDRs undergo reentrant phase separation [4] or dissolution [41] when temperature, pressure [4], salt [42, 43], RNA [41, 44], or concentrations of small molecules such as heparin [45] is varied. Reentrance, especially when induced by salt and RNA, suggest a subtle interplay between multivalent sequence-specific charge-charge interactions and hydrophobic, non-ionic [42, 43], cation-π [41, 44], or π-π interactions.
An important modulator of biomolecular LLPS is adenosine triphosphate (ATP). As energy currency, ATP hydrolysis is utilized to synthesize or break chemical bonds and drive transport to regulate “active liquid” properties such as concentration gradients and droplet sizes [8, 46]. Examples include ATP-driven assembly of stress granules [47], splitting of bacterial biomolecular condensates [48], and destabilization of nucleolar aggregates [49]. ATP can also influence biomolecular LLPS without hydrolysis, akin to other LLPS promotors or suppressors [50] that are effectively ligands of the condensate scaffold [51], or through ATP’s effect on lowering free [Mg2+] [52]. Notably, as an amphiphilic hydrotrope [53] with intracellular concentrations much higher than that required for an energy source, ATP is also seen to afford an important function independent of hydrolysis by solubilizing proteins, preventing LLPS and destabilizing aggregates, as exemplified by measurements on several proteins including fused in sarcoma (FUS) [54].
Subsequent investigations indicate, however, that hydrolysis-independent [ATP] effects on biomolecular LLPS are neither invariably monotonic for a given system nor universal across different systems. For instance, ATP promotes, not suppresses, LLPS of an IgG1 antibody [55] and of basic IDPs [56], and ATP enhances LLPS of full-length and the C-terminal domain (CTD) of FUS at low [ATP] but prevents LLPS at high [ATP] [57]. The latter reentrant behavior has been surmised to arise from ATP binding bivalently [57, 58] or trivalently [59] to charged residues arginine (R) or lysine (K) by a combination of cation-π and electrostatic interactions, an effect also seen in the ATP-mediated LLPS of basic IDPs [56]. A similar scenario was invoked for the reentrant phase behavior of transactive response DNA-binding protein of 43 kDa (TDP-43) [60].
While n-related interactions are important for biomolecular LLPS in general [17, 18] and their interplay with electrostatics likely underlies reentrant biomolecular phase behaviors modulated by RNA [41, 44] or simple salts [42], the degree to which electrostatics alone can rationalize hydrolysis-independent ATP-modulated biomolecular phase reentrance has not been sufficiently appreciated. This question deserves attention. For instance, the suppression of cold-inducible RNA-binding protein condensation by ATP has been suggested to be electrostatically driven [61]. The aforementioned ATP-modulated reentrant phase behavior of FUS [57, 58] is reminiscent of the 236-residue N-terminal IDR of DEAD-box RNA helicase Ddx4’s lack of LLPS at low [NaCl] (< 15–20 mM), LLPS at higher [NaCl] [62] and decreasing LLPS propensity when [NaCl] is further increased [15, 16]. Indeed, the finding that FUS CTD (net charge per residue (NCPR) = 15/156 = 0.096) exhibits ATP-dependent reentrant phase behaviors while the N-terminal domain (NCPR = 3/267 = 0.011) does not [58] is consistent with electrostatics-based theory for the difference in salt-dependent LLPS of polyelectrolytes and polyampholytes [62].
With this in mind, we seek to delineate the degree to which theories focusing primarily on electrostatics can rationalize experimental ATP-related LLPS data on the 103-residue C-terminal IDR of human cytoplasmic activation/proliferation-associated protein-1 (Caprin1). Full-length Caprin1 (709 amino acid residues) is a ubiquitously expressed phosphoprotein that regulates stress [63–66] and neuronal [67] granules, is necessary for normal cellular proliferation [68, 69], and may be essential for long-term memory [70]. Caprin1 dysfunction leads to multiple diseases such as nasopharyngeal carcinoma [71] as well as language impairment and autism spectrum disorder [72], via, e.g., Caprin1’s modulation of the function of the fragile X mental retardation protein (FMRP) [24, 28, 67]. The C-terminal 607–709 Caprin1 IDR, referred to simply as Caprin1 below, is biophysically and functionally significant: It is sufficient for LLPS in vitro [28], important for assembling stress granules in the cell [63, 64], and has a substantial body of experiments [28, 73–75] for comparison with theory. Since tyrosine phosphorylations of Caprin1 in vivo [76] may regulate translation in neurons [28], the Caprin1 system is also useful for gaining insights into phosphorregulation of biomolecular condensates [77–79].
Recent advances in theory and computation enable modeling of sequence-specific IDR LLPS [16, 78, 80–86]. Among the approaches, explicit-chain simulation affords more realistic geometric and energetic representations while analytical theory offers advantages in numerical tractability [87]. The analytical rG-RPA formulation [62], which synthesizes Kuhn-lengh renormalization (renormalized Gaussian, rG) and random phase approximation (RPA) [16] to treat both high-net-charge polyelectrolytes and essentially net-neutral polyampholytes [62], is particularly well suited for Caprin1 and its phosphorylated variant. We hereby leverage a methodological combination of rG-RPA [62], field-theoretic simulation [81, 88], and coarse-grained explicit-chain molecular dynamics (MD) [78, 84] to elucidate the effects of salt, phosphorylation, and ATP on Caprin1 LLPS.
Results
Physical theories of Caprin1 and phosphorylated Caprin1 LLPSs as those of polyelectrolytes and polyampholytes
The 103-residue Caprin1 is a highly charged IDR with 19 charged residues (Fig. 1a and SI Appendix, Fig. S1): 15 R, 1 K, and 3 aspartic acids (D); fraction of charged residues = 19/103 = 0.184 and NCPR = 13/103 = 0.126. With a substantial positive net charge, Caprin1’s phase behaviors are markedly different from those of polyampholytic IDRs with nearly zero net charge such as Ddx4 to which early sequence-specific LLPS theories were targeted [15, 16]. Instead, Caprin1 behaves like chemically synthesized polyelectrolytes [89]. In contrast, when most or all of the seven tyrosines (Y) in the Caprin1 IDR are phosphorylated (pY), negative charges are added to produce a near-net-neutral polyampholyte. Mass spectrometry indicates that the experimental sample of phosphorylated Caprin1 consists mainly of a mixture of IDRs with six or seven phosphorylations (SI Appendix, Fig. S2). We refer to this experimental sample as pY-Caprin1 below. For simplicity, we use only the Caprin1 IDR with seven pYs to model the behavior of this highly phosphorylated experimental sample in our theoretical and computational formulations, partly to avoid the combinatoric complexity of sequences with 5 or 6 pYs. Accordingly, since the charge of a pY is ≈ —2 at the experimental pH = 7.4, -14 charges are added to Caprin1 for our model pY-Caprin1, resulting in a polyampholyte with a very small NCPR ≈ -1/103 = -0.00971 (Fig. 1b). Both the experimental pY-Caprin1 (NCPR « ±1/103 = ±0.00971) and model pY-Caprin1 are expected to exhibit phase properties similar to other polyampholytic IDRs.
While sequence-specific RPA has been applied successfully to model electrostatic effects on the LLPSs of various polyampholytic IDRs [10, 16, 84, 90, 91], RPA is less appropriate for polyelectrolytes with large NCPR [92–94] because of its treatment of polymers as ideal Gaussian chains [95]. Traditionally, theories for polyelectrolytes tackle their peculiar conformations by various renormalized blob constructs [89, 92], two-loop polymer field theory [96], modified thermodynamic perturbation theory [97], and renormalized Gaussian fluctuation (RGF) theory [98, 99], among others. As such, these formulations are mostly designed for homopolymers, making it difficult to apply directly to heteropolymeric biopolymers. In order to analyze Caprin1 and pY-Caprin1 LLPSs, we utilize rG-RPA [62], which combines Gaussian chains of effective (renormalized) Kuhn length with the key idea of RGF [37].
Phase properties predicted by rG-RPA theory for Caprin1 and pY-Caprin1 with monovalent counterions and salt are in agreement with experiment
Fig. 1c and d show that the salt- and temperature (T)-dependent phase diagrams predicted by rG-RPA with an augmented Flory-Huggins (FH) mean-field χ(T) = Δh/T — Δs parameter for nonelectrostatic interactions [16, 62] (“rG-RPA+FH” theory in SI Appendix) are in reasonable agreement with experiment using bulk [Caprin1] ≈ 200 μM. The rG-RPA+FH results in Fig. 1c indicate that (i) Caprin1 undergoes LLPS below 20°C with 100 mM NaCl, and that (ii) LLPS propensity, quantified by the upper critical solution temperature (UCST), increases with [NaCl]. These predictions are consistent with experimental data, including the observation that Caprin1 does not phase separate at room temperature without salt, ATP, RNA or other proteins, though Caprin1 LLPS can be triggered by adding wildtype (WT) and phosphorylated FMRP and/or RNA (bulk [Caprin1] ≥ 10 μM) [28], NaCl [73], or ATP (bulk [Caprin1] = 400 μM) [74]. The trend here is also in line with other theories of polyelectrolytes [99]. In contrast, rG-RPA+FH results in Fig. 1d for pY-Caprin1 shows decreasing LLPS propensity with increasing [NaCl], consistent with experimental data and the expected salt dependence of LLPS of nearly net-neutral polyampholytic IDRs such as Ddx4 [16]. Interestingly, the decrease in some of the condensed-phase [pY-Caprin1]s with decreasing T (orange and green symbols for ≤ 20°C in Fig. 1d trending toward slightly lower [pY-Caprin1]) is suggestive of a lower critical solution temperature (LCST)-like [10, 21] reduction of LLPS propensity as temperature approaches ~ 0°C.
Salt-IDR two-dimensional phase diagrams are instrumental for exploring broader phase properties
Fig. 1c and d, though informative, are computed by a restricted rG-RPA that assumes a spatially uniform [Na+]. For a more comprehensive physical picture, we now examine possible differences in salt concentration between the IDR-dilute and condensed phases by applying unrestricted rG-RPA to compute two-dimensional salt-Caprin1/pY-Caprin1 phase diagrams (Fig. 2).
As stated in Materials and Methods and SI Appendix, here we define “counterions” and “salt ions”, respectively, as the small ions with charges opposite and identical in sign to that of the net charge, Q, of a given polymer. For the Caprin1/NaCl system, since Caprin1’s net charge is positive, Na+ is salt ion and Cl– is counterion. Overall electric neutrality of the system implies that the concentrations (ρ’s) of polymer (ρp), counterions (ρc), and salt ions (ρs) are related by
where zs and zc are, respectively, the valencies of salt ions and counterions. For Caprin1 and pY-Caprin1, Q = +13 and -1, respectively, and (zs, zc) = (1, 1), (1, 2), and (2, 4) are models for different small-ion species in the system. Specifically, in Fig. 2, we identify the zs = 1 salt ion as Na+ (Fig. 2a-f) and the zc =1 counterion as Cl– (Fig. 2a-d), the zc = 2 counterion as (ATP-Mg)2- (Fig. 2g,h), the zs = 2 salt ion as Mg2+ and the zc = 4 counterion as ATP4- (Fig. 2i-l).
Behavioral trends of rG-RPA-predicted Na+-Caprin1 two-dimensional phase diagrams are consistent with experiment
Notably, Fig. 2a,b (zs = zc = 1) predicts that Caprin1 does not phase separate without Na+ , consistent with experiment, indicating that monovalent counterions alone (Cl– in this case) are insufficient for Caprin1 LLPS. When [Na+] is increased, the system starts to phase separate at a small [Na+] ≤ 0.1 M, with LLPS propensity increasing to a maximum at [Na+] ~1 M before decreasing at higher [Na+], in agreement with experiment (Fig. 3a, blue data points) and consistent with Caprin1 LLPS propensity increasing with [NaCl] from 0.1 to 0.5 M (Fig. 1c). The predicted reentrant dissolution of Caprin1 condensate at high [Na+] in Fig. 2a is consistent with measurement up to [Na+] ≈ 4.6 M indicating a significant decrease in LLPS propensity when [Na+] ≥ 2.5 M (Fig. 3a), though the gradual decreasing trend suggests that complete dissolution of condensed droplets is not likely even when NaCl reaches its saturation concentration of ~ 6 M.
The negative tieline slopes in Fig. 2a,b predict that Na+ is partially excluded from the Caprin1 condensate. This “salt partitioning” is most likely caused by Caprin1’s net positive charge and is consistent with published research on polyelectrolytes with monovalent salt [99–101]. Here, the rG-RPA predicted trend is consistent with our experiment showing significantly reduced [Na+] in the Caprin1-condensed compared to the dilute phase (Table 1), although the larger experimental reduction of [Na+] in the Caprin1 condensed droplet relative to our theoretical prediction remains to be elucidated. In contrast, for the near-neutral, very slightly negative model pY-Caprin1 (Fig. 2c,d), rG-RPA predicts LLPS at [Na+] ≈ 0, and the positive tieline slopes indicate that [Na+] is higher in the condensed than in the dilute phase. Consistent with Fig. 1d, Fig. 2c shows that pY-Caprin1 LLPS propensity always decreases with increasing [Na+].
rG-RPA-predicted salt-IDR two-dimensional phase diagrams underscore effects of counterion valency on LLPS
Interestingly, a different salt dependence of Caprin1 LLPS is predicted when the salt ion remains monovalent but Cl– is replaced by a divalent zc = 2 anion modeling (ATP-Mg)2- under the simplifying assumption that ATP4- and Mg2+ do not dissociate in solution. The corresponding rG-RPA results (Fig. 2e-h) indicate that, with divalent counterions (Fig. 2g,h), Caprin1 can undergo LLPS without the monvalent salt (Na+) ions (LLPS regions extend to [Na+] = 0 in Fig. 2e,f), which is likely casued by the fact that the configurational entropic cost of concentrating counterions in the Caprin1 condensed phase is lesser for divalent than for monovalent counterions.
Other predicted differences between monovalent (Fig. 2a,b) and divalent (Fig. 2e,f) counterions’ impact on Caprin1 LLPS include: (i) The maximum condensed-phase [Caprin1] at low [Na+] is lower with monovalent than with divalent counterions ([Caprin1] ~ 400 mM vs. ~ 750 mM). (ii) The [Na+] at the commencement of reentrance (i.e., at the maximum condensed-phase [Caprin1]) is much higher with monovalent than with divalent counterions ([Na+] ~ 1 M vs. ~ 0.1 M). (iii) [Na+] is depleted in the Caprin1 condensate with both monovalent and divalent counterions when overall [Na+] is high (negative tieline slopes for [Na+]≤ 2 M in Fig. 2a,e). However, for lower overall [Na+], [Na+] is slightly higher in the Caprin1 condensate with divalent but not with monovalent counterions (slightly positive tieline slopes for [Na+]. 2 M in Fig.2e,f). This prediction suggests that under physiological [Na+]=150~170 mM, monovalent positive salt ions such as Na+ can be attracted, somewhat counterintuitively, into biomolecular condensates scaffolded by positively-charged polyelectrolytic IDRs in the presence of divalent counterions.
rG-RPA is consistent with experimental [ATP-Mg]-dependent Caprin1 reentrant phase behaviors
For the zs = 2, zc = 4 case in Fig. 2i-l modeling (ATP-Mg)2- complex dissociating completely in solution into Mg2+ salt ions and ATP4- counterions, rG-RPA predicts Caprin1 LLPS with ATP4- (Fig. 2k,l) in the absence of Mg2+ (the LLPS region includes the horizontal axes in Fig. 2i,j), likely because the configurational entropy loss of tetravalent counterions in the Caprin1 condensate is less than that of divalent and monovalent counterions. Tetravalent counterions also increase the maximum condensed-phase [Caprin1] to ≥1200 mM. At the commencement of reentrance (maximum condensed-phase [Caprin1] in Fig. 2i,j), [Mg2+] ~400 mM, which is intermediate between the corresponding [Na+] ~ 1.0 and 0.1 M, respectively, for monovalent and divalent counterions with (zs, zc) = (1, 2) and (1, 1). All tieline slopes for Mg2+ and ATP4- in Fig. 2i-l are significantly positive, except in an extremely high-salt region with [Mg2+]> 8M, indicating that [ATP-Mg] is almost always substantially enhanced in the Caprin1 condensate. Despite the tendency for polymer field theories to overestimate LLPS propensity and condensed-phase concentrations quantitatively because they do not account for ion condensation [98]—which can be severe for small ions with more than ±1 charge valencies, our rG-RPA-predicted semi-quantitative trends are consistent with experiments indicating [ATP-Mg]-dependent reentrant phase behavior of Caprin1 (Fig. 3a, red data points, and Fig. 3b) and that [Mg2+] as well as [ATP4-] are significantly enhanced in the Caprin1 condensate by a factor of ~ 5–60 for overall [ATP-Mg] = 3–30 mM (Table 2).
Coarse-grained MD with explicit small ions is useful for investigating subtle salt dependence in biomolecular LLPS
To gain deeper insights, we extend the widely-utilized coarse-grained explicit-chain MD model for biomolecular condensates [78, 84, 102] to include explicit small cations and anions (Materials and Methods). For computational efficiency, we neglect solvation effects that can arise from the directional hydrogen bonds among water molecules (see, e.g., ref. [103]) by treating other aspects of the aqueous solvent implicitly as in most, though not all [87, 91] applications of the methodology [78]. Several coarse-grained interaction schemes were used in recent MD simulations of biomolecular LLPS [78, 84, 86, 104–107]. Since we are primarily interested in general principles rather than quantitative details of the phase behaviors of Caprin1 and its ariginine-to-lysine (RtoK) mutants, here we adopt the Kim-Hummer (KH) energies for pairwise amino acid interactions derived from contact statistics of folded protein structures [78], which can largely capture the experimental effects of R vs K on LLPS [84].
Explicit-ion MD rationalizes experimentally observed [NaCl]-dependent Caprin1 reentrant phase behaviors and depletion of Na+ in Caprin1 condensate
Consistent with experiment (Fig. 3) and rG-RPA (Fig. 2a-d), explicit-ion coarse-grained MD shows [NaCl]-dependent reentrant phase behavior for Caprin1 but not for pY-Caprin1 (non-monotonic and monotonic trends indicated, respectively, by the grey arrows in Fig. 4a,b). In other words, the critical temperature Tcr, which is defined as the maximum temperature (UCST) of a given phase diagram (binodal, or coexistence curve), increases then decreases with addition of NaCl for Caprin1 but Tcr always decreases with increasing [NaCl] for pY-Caprin1. Moreover, consistent with the rG-RPA-predicted tielines in Fig. 2a-d (negative slopes for Caprin1 and positive slopes for pY-Caprin1), Fig. 4c shows that Na+ is slightly depleted in the Caprin1 condensed droplet, exhibiting the same trend as that in experiment (Fig. 3a, blue data points; and Table 1) but is enhanced in the pY-Caprin1 droplet (Fig. 4d). Because model temperatures in Fig. 4a,b and subsequent MD results are given in units of the MD-simulated Tcr of WT Caprin1 at [NaCl] = 0 (denoted as here), the Tcrs of systems with higher or lower LLPS propensities than WT Caprin1 at zero [NaCl] is characterized, respectively, by or < 1.
Fig. 4c shows that [Cl–] is enhanced while [Na+] is depleted in the Caprin1 droplet. This phenomenon is further illustrated in Fig. 5a-d by comparing the entire simulation box (Fig. 5a) with individual distributions of the Caprin1 IDR (Fig. 5b), Na+ (Fig. 5c), and Cl– (Fig. 5d). A similar trend, also attributed to charge effects, was observed in explicit-water, explicit-ion MD simulations in the presence of a preformed condensate of the N-terminal RGG domain of LAF-1 with a positive net charge [108]. For Caprin1, Fig. 5e,f suggests that, as counterion, Cl– can coordinate two positively charged R residues and thereby stabilize indirect counterion-bridged interchain contacts among polycationic Caprin1 molecules to promote LLPS, consistent with an early lattice-model analysis of generic polyelectrolytes [94].
Explicit-ion MD rationalizes [NaCl]-dependent phase properties of argininelysine mutants of Caprin1
We apply our MD methodology also to four RtoK Caprin1 variants, termed 15Rto15K, 4Rto4KN, 4Rto4KM, and 4Rto4KC (SI Appendix, Fig. S1), which involve 15 or 4 RtoK substitutions [73]. The simulated phase diagrams in Fig. 6 exhibit reentrant phase behaviors for all three 4Rto4K variants. While these results are consistent with experiments showing LLPS of these 4Rto4K variants commencing at different nonzero [NaCl]s [73], the simulated reentrant dissolution is not observed experimentally, probably because the actual [NaCl] needed is beyond the experimentally investigated or physically possible range of salt concentration. Simulated reentrant phase behaviors are also seen for 15Rto15K; but as will be explained below, its much lower simulated UCST is consistent with no experimental LLPS for this variant [73]. Since our main focus here is on general physical principles, we do not attempt to fine-tune the MD parameters for a quantitative match between simulation and experiment. Experimentally, only WT exhibits a clear trend toward reentrant dissolution of condensed droplets (with a LLPS propensity plateau at [NaCl] ≈ 1.55–2.5 M, Fig. 3a, blue data points), whereas the LLPS of 4Rto4KM and 4Rto4KC commences at [NaCl] ~ 1.3 M, LLPS propensity then increases with [NaCl] (a trend consistent with the MD-predicted increasing LLPS propensity at low [NaCl]s in Fig. 6b,c), but no sign of reentrant dissolution is seen up to the maximum [NaCl] = 2 M investigated experimentally for the RtoK variants (Fig. 9B of ref. [73]). In contrast, the MD phase diagrams in Fig. 6 show a maximum LLPS propensity (highest Tcr) at [NaCl] ≈ 0.5 M. This qualitative agreement with quantitative mismatch suggests that real Caprin1 LLPS is somewhat less sensitive to small monovalent ions than that stipulated by the present MD model. This question should be tackled in future studies by considering, for example, alternate pairwise amino acid interaction energies [78, 84, 86, 104–107] and their temperature dependence [4, 21].
Limitations notwithstanding, the MD-simulated trend agree largely with experiment. Predicted LLPS propensities quantified by the Tcrs in Fig. 6 follow the rank order of WT > 4Rto4KM > 4RtoKN ≈ 4Rto4KC > 15Rto15K, which is essentially identical to that measured experimentally, viz., WT > 4Rto4KM > 4RtoKC > 4Rto4KN > 15Rto15K (Fig. 9B of ref. [73]). In comparing theoretical and experimental LLPS, a low theoretical Tcr can practically mean no experimental LLPS when the theoretical Tcr is below the freezing temperature of the real system [16, 109]. Fig. 6a shows that even the highest Tcr for 15Rto15K (at model [NaCl] = 480 mM) is essentially at the same level as for WT at [NaCl] = 0 (). This MD prediction is consistent with the combined experimental observations of no LLPS for 15Rto15K up to at least [NaCl] = 2 M and no LLPS for WT Caprin1 at [NaCl] = 0 (Fig. 9B,C of ref. [73]).
Field-theoretic simulation is an efficient tool for studying multiple-component phase properties
We next turn to modeling of Caprin1 or pY-Caprin1 LLPS modulated by both ATP-Mg and NaCl. Because tackling such many-component LLPS systems using rG-RPA or explicit-ion MD is numerically challenging, here we adopt the complementary field-theoretic simulation (FTS) approach [110] outlined in Materials and Methods for this aspect of our investigation. FTS is based on complex Langevin dynamics [111, 112], which is related to an earlier formulation for stochastic quantization [113, 114] and has been applied extensively to polymer solutions [110, 115]. Recently, FTS has provided insights into charge-sequence-dependent LLPS of IDRs [81, 87, 88, 107, 116]. The starting point of FTS is identical to that of rG-RPA. FTS invokes no RPA and is thus advantageous over rG-RPA in this regard, though it is still limited by the lattice size used for simulation and its restricted treatment of excluded volume [88]. Here we apply the protocol detailed in refs. [87, 88].
A simple model of ATP-Mg for FTS
We adopt a 6-bead polymeric representation of (ATP-Mg)2- (Fig. 7a) in which four negative and two positive charges serve to model ATP4- and Mg2+ respectively. Modeling (ATP-Mg)2- as a short charged polymer enables application of existing FTS formulations for multiple charge sequences to systems with IDRs and (ATP-Mg)2-. While the model in Fig. 7a does not capture structural details, its charge distribution does correspond roughly to that of the chemical structure of (ATP-Mg)2-. In developing FTS models involving IDR, (ATP-Mg)2- , and NaCl, we first assume for simplicity that (ATP-Mg)2- does not dissociate and consider systems consisting of any given bulk concentrations of IDR and (ATP-Mg)2- wherein all positive and negative charges on the IDR and (ATP-Mg)2- are balanced, respectively, by Cl– and Na+ to maintain overall electric neutrality (Fig. 7a).
Phase behaviors can be probed by FTS density correlation functions
LLPS of FTS systems can be monitored by correlation functions [88]. Here, we compute intra-species IDR self-correlation functions Gpp(r) (Fig. 7b,c) and inter-species crosscorrelation functions Gpq(r) between the IDR and (ATP-Mg)2- or NaCl (Fig. 7d,e) at three different bulk [(ATP-Mg)2-] = 10-4b-3, 0.03b-3, and 0.5b-3, where b may be taken as the peptide virtual bond length ≈ 3.8Å (Materials and Methods). The correlation functions in Fig. 7b-e are normalized by bulk densities of the IDR and for (ATP-Mg)2-, Na+ or Cl–, wherein density is the bead density for the given molecular species in units of b-3. LLPS of the IDR is signaled by in Fig. 7b,c dropping below the unity baseline (dashed) at large distance r because it implies a spatial region with depleted IDR below the bulk concentration, which is possible only if the IDR is above the bulk concentration in at least another spatial region. In other words, for large r indicates that IDR concentration is heterogeneous and thus the system is phase separated. For small r, is generally expected to increase because IDR chain connectivity facilitates correlation among residues local along the chain. On top of this, LLPS propensity may be quantified by for small r because a higher value indicates a higher tendency for different chains to associate and thus a higher LLPS propensity [88].
FTS rationalizes [ATP-Mg]-modulated Caprin1 reentrant phase behaviors and their colocalization in the condensed phase
[(ATP-Mg)2-]-modulated reentrance is predicted by FTS for Caprin1 but not for pY-Caprin1: When [(ATP-Mg)2-]/b-3 varies from 10-4 to 0.03 to 0.5, small-r values of the Caprin1 Gpp(r) in Fig. 7b initially increase then decrease, whereas the corresponding small-r values of the pY-Caprin1 Gpp(r) in Fig. 7c decrease monotonically, consistent with rG-RPA (Fig. 2g,h,k,l) and experiment (Fig. 3). The inter-species cross-correlations in Fig. 7d,e show further that when an IDR condensed phase is present at [(ATP-Mg)2-] = 0.03b-3 (as indicated by large-r behaviors of in Fig. 7b,c), (ATP-Mg)2- is colocalized with Caprin1 or pY-Caprin1 (high value of for small r) in the IDR-condensed droplet. By comparison, the variation of [Na+] and [Cl–] is much weaker. For Caprin1, Cl– is enhanced over Na+ in the Caprin1 condensed phase (small-r of the former larger than the latter in Fig. 7d), but the reverse is seen for pY-Caprin1 (Fig. 7e). This FTS-predicted difference, most likely arising the positive net charge on Caprin1 and the smaller negative net charge on pY-Caprin1, is consistent with the MD results in Fig. 4c,d and SI Appendix, Fig. S3.
FTS rationalizes experimentally observed residue-specific binding of Caprin1 with ATP-Mg
The propensities for (ATP-Mg)2- , Na+ , and Cl– to associate with each residue i along the Caprin1 IDR (i = 1, 2, … , 103) in FTS are quantified by the residue-specific integrated correlation in Fig. 7f, which is the integral of the corresponding from r = 0 to a relative short cutoff distance r = rcontact to provide a relative contact frequency for residue i and ionic species q to be in spatial proximity (Materials and Methods and SI Appendix). Notably, the residue-position-dependent integrated correlation for (ATP-Mg)2- varies significantly, exhibiting much larger values near the N-terminal and a little before the C-terminal but weaker correlation elsewhere (Fig. 7f, red symbols). The two regions of high integrated correlation (i.e., favorable association) coincide with regions with high sequence concentration of positively charged residues. This FTS prediction is remarkably similar to the experimental NMR finding that binding between (ATP-Mg)2- and Caprin1 occurs strongly at the arginine-rich N- and C-terminal regions, as indicated by the volume ratio V/V0 data in Fig. 1C of ref. [74] that quantifies the ratio of peaks in NMR spectra in the presence and absence of trace amounts of ATP-Mn. For comparison with the FTS results, this set of experimental data is replotted as 1 - V/V0 in Fig. 7f (grey symbols, right vertical axis) to illustrate the similarity in experimental and theoretical trends because 1 - V/V0 is expected to trend with contact frequency. Corresponding FTS results for Na+ and Cl– in Fig. 7f exhibit much less residue-position-dependent variation, with Cl– displaying only slightly enhanced association in the same arginine-rich regions, and Na+ showing even less variation, presumably because the positive charges on Caprin1 are already essentially neuralized by the locally associated (ATP-Mg)2- or Cl– ions. The theory-experiment agreement in Fig. 7f regarding ATP-Caprin1 interactions indicates once again that electrostatics is an important driving force underlying many aspects of experimentally observed Caprin1-(ATP-Mg)2- association.
FTS snapshots of [ATP-Mg]-modulated reentrant phase behaviors and Caprin1- ATP-Mg colocalization
The above FTS-predicted trends are further illustrated in Fig. 8 by field snapshots. Such FTS snapshots are generally useful for visualization and heuristic understanding [81, 88, 107], including insights into subtler aspects of spatial arrangements exemplified by recent studies of subcompartmentalization entailing either co-mixing or demixing in multiple-component LLPS that are verifable by explicit-chain MD [88, 107]. Now, trends deduced from the correlation functions in Fig. 7 are buttressed by the representative snapshots in Fig. 8: As the bead density of (ATP-Mg)2- is increased from 10-4b-3 to 0.03b-3 to 0.5b-3, the spatial distribution of Caprin1 evolves from an initially dispersed state to a concentrated droplet to a (reentrant) dispersed state again (Fig. 8a), whereas the initial dense pY-Caprin1 droplet becomes increasingly dispersed monotonically (Fig. 8b). Colocalization of (ATP-Mg)2- with both the Caprin1 (Fig. 8c) and pY-Caprin1 (Fig. 8d) droplets is clearly visible at [(ATP-Mg)2-] = 0.03b-3, though the degree of colocalization is appreciably higher for Caprin1 than for pY-Caprin1. This is likely because the positive net charge of Caprin1 is more attractive to (ATP-Mg)2-. By comparison, variations in Na+ and Cl– distribution between Caprin1/pY-Caprin1 dilute and condensed phases are not so discernible in Fig. 8e-h, consistent with the small differences in the corresponding FTS correlation functions (Fig. 7d,e).
Robustness of general trends predicted by FTS
We have also assessed the generality of the results in Figs. 7 and 8 by considering three variations in the molecular species treated by FTS: (i) Caprin1 or pY-Caprin1 with only Na+ and Cl– but no (ATP-Mg)2- (SI Appendix, Fig. S4), (ii) Caprin1 with (ATP-Mg)2- and either Na+ or Cl– (but not both) to maintain overall charge neutrality or pY-Caprin1 with (ATP-Mg)2- and Na+ as counterion but no Cl– (SI Appendix, Fig. S5), and (iii) Caprin1 or pY-Caprin1 with ATP4- , Mg2+ , Na+ and Cl– (SI Appendix, Fig. S6). Despite these variations in FTS models, SI Appendix Figs. S4–S6 consistently show reentrant behavior for Caprin1 but not pY-Caprin1 and Figs.S5 and S6 both exhibit colocalization of ATP with condensed Caprin1, suggesting that these features are robust consequences of the basic electrostatics at play in Caprin1/pY-Caprin1 + ATP-Mg + NaCl systems.
Discussion
It is reassuring that, in agreement with experiment, all of our electrostatics-based theoretical approaches consistently predict salt-dependent reentrant phase behaviors for Caprin1, whereas pY-Caprin1 LLPS propensity decreases monotonically with increasing salt (Figs. 2, 4, 7, and 8). This effect applies to small monovalent salts exemplified by Na+ and Cl– as well as to our electrostatics-based models of (ATP-Mg)2- or ATP4- , with ATP exhibiting a significant colocalization with the Caprin1 condensed phase (Figs. 2g,h,k,l and 8c).
Related studies of electrostatic effects on biomolecular condensates
Our theoretical predictions are also largely in agreement with recent computational studies on salt concentrations in the dilute versus condensed phases [108] and salt-dependent reentrant behaviors [42] of other biomolecular condensates, including explicit-water, explicit-ion atomic simulations with preformed condensates of the N-terminal RGG domain of LAF-1 [108] and of the highly positive proline-arginine 25-repeat dipeptide PR25 [117].
A recent study examines salt-dependent reentrant LLPSs of full-length FUS (WT and G156E mutant), TDP-43, bromodomain-containing protein 4 (Brd4), sex-determining region Y-box 2 (Sox2), and annexin A11 [42]. Unlike the requirement of a nonzero monovalent salt concentration for Caprin1 LLPS, LLPS is observed for all these six proteins with KCl, NaCl or other salts at concentrations as low as 50 mM. Also unlike Caprin1, their protein condensates dissolve at intermediate salt then re-appear at higher salt, a phenomenon the authors rationalize by a tradeoff between decreasing favorability of cation-anion interactions and increasing favorability of cation-cation, cation-π, hydrophobic, and other interactions with increasing monovalent salt [42].
Two reasons may account for this difference. First, Caprin1 does not phase separate at low salt because it is a relatively strong polyelectrolyte (NCPR = +13/103 = +0.126). By comparison, five of the six proteins in ref. [42] are much weaker polyelectrolytes or not at all, with NCPR = +14/526 = +0.0266, +13/526 = +0.0247, -7/80 = -0.0875, 0, and +3/326 = +0.00920, respectively, for FUS (WT, mutant), TDP-43, Brd4, and A11. Apparently, their weak electrostatic repulsions can be overcome by favorable nonelectrostatic interactions alone to enable LLPS.
Second, compared to Caprin1, the proteins in ref. [42] are either significantly larger (WT and mutant FUS) or significantly more hydrophobic and aromatic (the other four proteins), both properties are conducive to LLPS. For instance, although Sox2’s NCPR = +14/88 = +0.159 is higher than that of Caprin1, among Sox2’s amino acid residues, 21/88 = 23.9% are large hydrophobic or aromatic residues leucine (L), isoleucine (I), valine (V), methionine (M), phenylalanine (F), or tryptophan (W), and 17/88 = 19.3% are large aliphatic residues L, I, V, or M. This amino acid composition suggests that hydrophobic or n-related interactions in Sox2 can be sufficient to overcome electrostatic repulsion to effectuate LLPS at zero salt. In contrast, the Caprin1 IDR contains merely one L; only 10/103 = 9.7% of the residues of Caprin1 are in the L, I, V, M, F, W hydrophobic/aromatic category and only 6/103 = 5.8% are in the L, I, V, M aliphatic category. The corresponding aliphatic fractions of TDP-43, Brd4 and A11, at 21/80 = 26.3%, 33/132 = 25%, and 90/326 = 27.6%, respectively, are also significantly higher than that of Caprin1.
Effects of salt on biomolecular LLPS
Effects of salts on LLPS, including partition of salt into polymer-rich phases, are of long-standing interest in polymer physics [118]. In the biomolecular condensate context, the versatile functional roles of salts are highlighted by the interplay between electrostatic and cation-π interactions [119, 120], salts’ modulating effects on heat-induced LLPSs of RNAs [121], their regulation of condensate liquidity [122], and even their potential impact in extremely high-salt exobiological environments [123]. While some of these recent studies focus primarily on salts’ electrostatic screening effects without changing the signs of the effective polymer charge-charge interaction [119], effective attractions between like charges bridged by salt or other oppositely-charged ions [94] as illustrated by Caprin1 (Fig. 5f) are likely needed to account for phenomena such as salt-induced dimerization of highly charged, medically relevant arginine-rich cell-penetrating short peptides [124, 125].
Tielines in protein-salt phase diagrams
In view of Caprin1’s polyelectrolytic nature, the mildly negative tieline slopes in Fig. 2a,b are consistent with rG-RPA predictions for a fully charged polyelectrolyte (Fig. 10a of ref. [62]). This depletion of monovalent salt in the condensed phase is similar to that observed in the complex coacervation of oppositely charged polyelectrolytes [126–128]. By comparison, the positive rG-RPA tieline slopes for polyampholytic pY-Caprin1 (Fig. 2c,d), confirmed by MD in Fig. 4d, are appreciably steeper than that predicted for fully charged (±1) diblock polyampholytes by rG-RPA and the essentially flat tielines predicted by FTS (Fig. 10b of ref. [62] and Fig. 7 of ref. [82]). Whether this difference originates from the presence of divalently charged (-2) phosphorylated sites in pY-Caprin1 remains to be elucidated. In any event, tieline analysis is generally instrumental for revealing details, such as stoichiometry, of the interactions driving multiple-component biomolecular LLPSs [14, 129], rG-RPA should be broadly useful as a computationally efficient tool for this purpose [62].
Counterion valency
Our rG-RPA prediction that the maximum condensed-phase [Caprin1] at low [Na+] is substantially higher with divalent than with monovalent counterions is in line with early findings that higher-valency counterions are more effective in bridging polyelectrolyte interactions to favor LLPS [130] and recent observations that salt ions with higher valencies enhance biomolecular LLPS [131, 132]. The possibility that this counterion/salt effect on LLPS may be exploited more generally for biological functions and/or biomedical applications remains to be further explored. In this regard, while recognizing that ATP can engage in n-related interactions [56–58], our electrostatics-based perspective of ATP-dependent reentrant phase behaviors is consistent with recent observations on polylysine LLPS modulated by enzymatically catalyzed ATP turnovers [128, 133].
Prospective extensions of the present theoretical methodology
Beyond the above comparisons, further experimental testing of other aspects of our theoretical predictions should be pursued, especially those pertaining to pY-Caprin1. Future theoretical efforts should address a broader range of scenarios by independent variations of [ATP4-], [Mg2+], [Na+], [Cl–] and to account for nonelectrostatic aspects of ATP-Mg dissociation [134] with predictions such as tieline slopes analyzed in detail to delineate effects of configurational entropy of salt ions [135] and solvent quality [136]. In addition to our basic modeling constructs, the impact of excluded volume and solvent/cosolute-mediated temperature-dependent effective interactions should be incorporated. Excluded volume is known to affect LLPS [82], demixing of IDP species in condensates [88], and partition of salt ions in polymer LLPS [127]. Moreover, LCST can be driven not only by hydrophobicity [4, 10, 21] but also by electrostatics, as suggested by experiment on complex coacervates of oppositely charged polyelectrolytes [137]. Bringing together these features into a comprehensive formulation will afford a more accurate physical picture.
Summary
To recapitulate, we have employed three complementary theoretical and computational approaches to account for the interplay between sequence pattern, phosphorylation, counterion, and salt in the phase behaviors of IDPs. Application to the Caprin1 IDR and its phosphorylated variant pY-Caprin1 provides physical rationalization for a variety of trends observed in experiments, including reentrance behaviors and very substantial ATP colocalization. These findings support a significant—albeit not exclusive—role of electrostatics in these biophysical phenomena, providing physical insights into effects of sequence-specific charge-charge interactions on ATP-modulated physiological functions of biomolecular condensates such as regulation of ion concentrations. The approach developed here should be of general utility as a computationally efficient tool for hypothesis generation, design of new experiments, exploration and testing of biophysical scenarios, as well as a starting point for more sophisticated theoretical/computational modeling.
Materials and Methods
Further details of the experimental and theoretical/computational methodologies outlined below are provided in SI Appendix.
Experimental sample preparation
The low complexity 607–709 domain of Caprin1 was expressed and purified as before [28, 74]. WT Caprin1 was used in all experiments except those on [NaCl] dependence reported in Table 1 and Fig. 3a, for which a double mutant was used because residue pairs N623-G624 and N630-G631 in WT Caprin1 form isoaspartate (IsoAsp) glycine linkages over time which alters the charge distribution of the IDR [73].
Phosphorylation of the Caprin1 IDR
Phosphorylation of the WT Caprin1 IDR was performed as described in our prior study [28] by using the kinase domain of mouse Eph4A (587–896) [138] with an N-terminal His-SUMO tag.
Determination of phase diagrams
We established phase diagrams for Caprin1 and pY-Caprin1 by measuring the protein concentrations in dilute and condensed phases across a range of [NaCl]s (Fig. 1c,d). Initially homogenizing the two phases of the demixed samples into a milky dispersion through vortexing, ~ 200 μL aliquots were then incubated in a PCR thermocycler with a heated lid at 90°C, in triplicate, for a minimum of one hour. During incubation, the condensed phase settled and formed a clear phase at the bottom. For concentration measurements, the samples were diluted in 6 M GdmCl and 20 mM NaPi (pH 6.5). The dilute phase (top layer) was analyzed through a tenfold dilution of 10 μL samples, and the condensed phase (bottom layer) was analyzed through 250- to 500-fold dilution of 2 or 10 μL samples.
Concentrations of salt and ATP-Mg in dilute and condensed phases
Inductively coupled plasma optical emission spectroscopy (ICP-OES) measurements of [Na+] were performed using a Thermo Scientific iCAP Pro ICP-OES instrument in axial mode. ICP-OES was also used to determine [ATP] and [Mg2+] (Table 2). The detection of phosphorus and magnesium served as proxies for quantifying ATP and Mg2+ levels, respectively. Standard curves were prepared using solutions with known [ATP] and [Mg2+], ranging from 0 to 90 ppm for ATP and 0 to 25 ppm for Mg2+.
Caprin1 phase separation propensity at high salt concentrations
A 6 mM solution of double-mutant Caprin1 IDR (see above) in buffer (25 mM sodium phosphate, pH 7.4) was prepared by exchanging (3 times) the purified protein after size exclusion chromatography using centrifugal concentrators (3 kDa, EMD Millipore). Caprin samples for turbidity measurements were prepared by taking 0.5 μL of the above solution and diluting it into buffer (25 mM sodium phosphate, pH 7.4) containing varying [NaCl]s ranging from 0 to 4.63 M, in a sample volume of 9 μL, so as to achieve [Caprin1] of 300 μM. After rigorous mixing, 5 μL samples were loaded into a μCuvette G1.0 (Eppendorf). OD600 measurements (Fig. 3a) were recorded three times using a BioPhotometer D30 (Eppendorf).
[ATP-Mg]-dependent Caprin1 phase behaviors
Turbidity assays were conducted using the method we described previously [73].
Sequence-specific theory of heteropolymer phase separation
As detailed in refs. [87, 107], an example of the sequence-specific polymer theories [16, 62] is that for a solution with a single species of charged heteropolymers in np copies, nc counterions (same type), and ns salt ions (same type, but different from the counterions). Each polymer chain has N monomers (residues) with charge sequence in vector notation, where σi ∈ {0, ±1, -2} is the charge of the ith residue. The counterions and salt ions are monomers carrying zc and zs charges, respectively. The particle-based partition function is given by
where nw denotes the number of water (solvent) molecules, Rα,i is the position vector of the ith residue of the ath polymer, ra is the position vector of the ath small ion. T accounts for polymer chain connectivity modeled by a Gaussian elasticity potential with Kuhn length l. U describes the interactions among all molecular components of the system, here consisting only of Coulomb electrostatics (el) and excluded-volume (ex) for simplicity, viz., U = Uel + Uex. Their interaction strengths are governed by the Bjerrum length lB and the two-body excluded volume parameter v2. By introducing conjugate fields ψ(r), w(r) and applying the Hubbard-Stratonovich transformation, the system defined by the particle-based partition function in Eq. (2) is recast as a field theory of ψ, w in which their interactions with polymer, salt, and counterion are described, respectively, by single molecule partition functions Qp, Qs, and Qc. For instance,
where Hp is the single-polymer Hamiltonian and the chain label α is dropped.
Renormalized-Gaussian random-phase-approximation (rG-RPA)
Following refs. [37, 62], Hp can be separated into a Gaussian-chain Hamiltonian with an effective (renormalized) Kuhn length l1 = xl and a remaining term, , where
with i2 = -1. By requiring the observable polymer square end-to-end distance be properly quantified by , x can be approximated by variational theory [37]. RPA can then be applied to the renormalized Gaussian (rG) chain system with l → xl and a corresponding scaling of the contour length to arrive at an improved theory, rG-RPA, for sequence-specific LLPS.
Explicit-ion coarse-grained molecular dynamics (MD)
The MD model in this work augments a class of implicit-water coarse-grained models [78, 84] that utilize a “slab” approach for efficient equilibration [102] by incorporating explicit small ions. As before [84], the total MD potential energy UT is the sum of long-spatial-range electrostatic (el) and short-spatial-range (sr) interactions of the Lennard-Jones (LJ) type as well as bond interactions, i.e., UT = Uel + Usr + Ubond. With small ions, the electrostatic component is given by a sum of polymer-polymer (pp), polymer-ion (pi), and ion-ion (ii) contributions: Uel = Uel,pp + Uel,pi + Uel,ii. Details of these terms are provided in SI Appendix.
Field-theoretic simulation (FTS)
FTS is useful for sequence-specific multiple-component LLPSs encountered in biomolecular settings. The new applications developed here are based on recent advances (see, e.g., refs. [81, 82, 87, 88, 110, 115]). Consider the field theoretic Hamiltonian
where Qm is single-molecule partition function [here m labels the components in the system, cf. Eq. (3)] and the breves denote convolution with Γ, i.e., for a generic field ϕ, ; here ϕ = w, ψ, and Γ is a Gaussian smearing function [87]. FTS utilizes the Complex-Langevin (CL) method [111, 112] by introducing an artifical CL time variable (t), viz., w(r) → w(r,t), ψ(r) → ψ(r,t) and letting the system evolve in CL time in accordance with a collection of Langevin equations
where the Gaussian noise ηϕ(r, t) satisfies . Thermal averages of thermodynamic observables are then computed as asymptotic CL time averages of the corresponding field operators. Spatial information about condensation and proximity of various components is readily gleaned from density-density correlation functions [87, 88],
where m, n are labels for the components in the model system. For instance, m may represent all polymer beads (denoted “p”) irrespective of the sequence positions of the beads , and n may represent all six beads in our ATP-Mg model (Fig. 7a). One may also define
where (i) represents the ith residue along a protein chain , and q = (ATP-Mg)2- , Na+, or Cl–. With this definition, residue-specific relative contact frequencies are estimated by integrating Eq. (8) over a spherical volume within a small inter-component distance rcontact:
For the normalized plotted in Fig. 7f, and are bulk (overall) densities, respectively, of the ith protein residue and of (ATP-Mg)2- or small ions, and rcontact ≈ 1.5b is used to characterize contacts. Further details are provided in SI Appendix.
Acknowledgements
This work was supported by Canadian Institutes of Health Research (CIHR) grant NJT-155930 and Natural Sciences and Engineering Research Council of Canada (NSERC) grant RGPIN-2018-04351 to H.S.C., CIHR grant FDN-148375, NSERC grant RGPIN-2016-06718, and Canada Research Chairs Program to J.D.F.-K. as well as CIHR grant FDN-503573 to L.E.K. A.K.R. was supported by a CIHR postdoctoral fellowship. We are grateful for the computational resources provided generously by Compute/Calcul Canada and the Digital Research Alliance of Canada.
Supporting Materials and Methods
Experimental information additional to that in the maintext
Sample preparation — Wildtype (WT) Caprin1.
As stated in the maintext, WT Caprin1 was used in all reported experiments except those on [NaCl] dependence presented in maintext Table 1 and Fig. 3a. The amino acid sequence of WT Caprin1 is given in Fig. S1 (all supporting figures with figure numbers prefixed by “S” are provided in this SI Appendix). The preparation of sample WT Caprin1 is now briefly described as follows: Caprin1 with an N-terminal His-SUMO tag was produced in BL21 (DE3)-RIPL Codon Plus E. coli cells. These cells were cultured until an optical density at 600 nm (OD600) of 0.6 at 37°C and then induced with 0.5 mM IPTG for overnight expression at 23°C. The harvested cells were suspended in a lysis buffer containing 6 M guanidine hydrochloride (GuHCl), 25 mM Tris, 500 mM NaCl, 20 mM imidazole, 2 mM β-mercaptoethanol (BME), at pH 8.0, and lysed via sonication. The supernatant, post-sonication, was applied to Ni-NTA (Cytiva) and washed with lysis, wash (25 mM Tris, 500 mM NaCl, 20 mM imidazole, 2 mM BME, at pH 8.0), and elution (25 mM Tris, 500 mM NaCl, 300 mM imidazole, 2 mM BME, at pH 8.0) buffers. Post-elution, the sample was treated with ULP1 during dialysis against a dialysis buffer (25 mM Tris, 250 mM NaCl, and 2 mM BME at pH 8.0). This step was followed by His-SUMO tag removal through Ni-NTA column chromatography. Final purification of Caprin1 was performed using FPLC with a Superdex 75 16/60 column, equilibrated with a gel filtration buffer (3 M GuHCl, 25 mM Tris, 500 mM NaCl, 2 mM BME, pH 8.0). The protein fractions were then dialyzed twice to remove GuHCl before use in experiments.
Sample preparation — Double-mutant (N623T, N630T) variant of Caprin1.
Our sample preparation for the double-mutant variant used in [NaCl] dependence studies reported in maintext Table 1 and Fig. 3a proceeded as follows. To abolish IsoAsp formation for the salt concentration measurements (Table 1) and the [NaCl]-dependent turbidity measurements in Fig. 3a, we used a double mutant of the Caprin1 IDR (N623T,N630T) in which the two asparagine residues are mutated to threonine. This double mutant has been shown to exhibit a similar propensity to phase separate as the WT Caprin1 IDR [75] purified as described previously [73, 75].
Purified Caprin1 was first exchanged into buffer (25mM sodium phosphate, pH 7.4) via dialysis and was concentrated to ~ 6 mM using 3 kDa centrifugal Amicon concentrators (EMD Millipore). The pH of the concentrated protein was adjusted to 7.4 using concentrated hydrochloric acid. Phase separated samples of Caprin1 were prepared by addition of a concentrated stock solution of NaCl (25mM sodium phosphate, pH 7.4, 4M NaCl) to achieve a bulk salt concentration of 300 mM NaCl. Condensed and dilute phases of Caprin1 were transferred all together into an Eppendorf tube using a syringe. After rigorous vor-texing, the phase separated samples were incubated at the desired temperatures using a thermocycler with a heated lid (95°C). At least one hour was required to allow droplets to form a large condensed phase droplet at the bottom of the tube.
2 μL of condensed and dilute phases were pipetted into 48 μL of a 2.8 M urea solution (U4883, Sigma) in MilliQ water in a 15 mL falcon tube, using a positive and an air displacement pipette, respectively. The outside of the tips were wiped with a KimWipe to remove excess protein, prior to transferring into the urea solutions. Following transfer, the samples were digested for inductively coupled plasma optical emission spectroscopy (ICP-OES) measurements by the addition of 630 μL of concentrated nitric acid (67%, NX0407, Sigma) and 630 μL hydrogen peroxide (95321, Sigma), and incubated in an oven at 60° C for 54 hours. Post digestion, the sample tubes were cooled at room temperature and centrifuged. 40 μL of hydrogen peroxide was then added to the samples. No bubbles were observed, indicating the completion of the digestion process. The samples were then bought up to 12 mL using MilliQ water, to achieve a final nitric acid concentration of 3.5%. Blank samples for the condensed and dilute phases were prepared by pipetting 2 μL of MilliQ water using a positive and an air displacement pipette, respectively, and subsequently following the digestion protocol described above. Sodium standards (0.1, 0.2, 0.5, 1, 2, 4, 8 and 10 ppm) in 3.5% nitric acid for ICP-OES measurements were prepared by dilution of sodium standard solution (00462, Sigma) with MilliQ water and concentrated nitric acid (67%, NX0407, Sigma). All samples were filtered using 0.22 μm syringe filters prior to ICP-OES. Condensed and dilute phases were drawn in triplicate at each temperature.
Phosphorylation of the Caprin1 IDR.
The purified protein was initially concentrated to 25–50 μM in a reaction buffer comprising 25 mM Tris pH 7.4, 50 mM KCl, 10 mM MgCl2 , 3 mM ATP and 2 mM DTT. This mixture was then placed into a dialysis tubing with a 3 kDa cut-off. To the protein sample, purified His-SUMO-Eph4A was added to 5–10 μM, and the reaction mixture was subsequently dialyzed against 4 liters of the same reaction buffer, either at room temperature or at 4° C overnight. Mass spectrometry indicates that the resulting sample consists mainly of a mixture of Caprin1 IDRs with six or seven phosphorylations and a very small fraction of IDRs with five phosphorylations (Fig. S2).
Determination of phase diagrams.
The initial homogenization described in the maintext ensures that the condensed phase in small droplets can rapidly equilibrate with the dilute phase. Absorbance at 280 nm was measured and converted to concentration using the Beer-Lambert law, with an extinction coefficient (ε) of 10,430 M-1cm-1 , based on the molecular masses of 11,108 Da for Caprin1 and 11,668 Da for pY-Caprin1. The reported concentration values and uncertainties, calculated as means and standard deviations, were derived from triplicate measurements.
Concentrations of salt and ATP-Mg in dilute and condensed phases.
ICP-OES measurements were performed in triplicate for each sample. Mean value and uncertainty for the salt concentration were obtained by taking the average and standard deviation over the triplicate samples at the given temperature (maintext Table 1). Specific details of sample preparation for the set of ATP-Mg-dependent experiments are provided above in this SI Appendix. As for salt-dependent experiments, these measurements were performed in triplicate and standard deviations were calculated to assess experimental uncertainties (maintext Table 2).
Caprin1 phase separation propensity at high salt concentrations.
Averages and standard deviations over the three OD600 measurements were reported by the blue symbols in maintext Fig. 3a.
[ATP-Mg]-dependent Caprin1 phase behaviors.
A brief summary of the turbility assays in ref. [73] that we utilized is as follows: The WT Caprin1 IDR was diluted to a 200 μM concentration using a buffer composed of 25 mM HEPES and 2 mM DTT at pH 7.4, with varying levels of ATP-Mg. Samples were prepared with ATP-Mg concentrations ranging from 0 to 40 mM. Following thorough mixing, 5 μL. of each sample was placed into a μCuvette G1.0 (Eppendorf), and OD600 was measured using a BioPhotometer D30 (Eppendorf). This procedure was performed three times for analysis of experimental uncertainties (red symbols in maintext Fig. 3a).
Sequence-specific theory of heteropolymer phase separation — Summary of key steps in the field-theoretic formulation
The following is a more extensive summary to supplement the brief outline in Materials and Methods of the maintext provided under the heading “squence-specific theory of heteropolymer phase separation”. In general, sequence-specific polymer field theories [16, 62] are constructed to model systems of polymers with various salt and counterions. Further details are available from our recent publications (e.g., refs. [87, 107] and references therein).
Using the same notation for the partition function Z in Eq. (2), with T + U being the Hamiltonian in units of the product kBT of Boltzmann’s constant kB and absolute temperature T, the connectivity term T is given by
with [R] being shorthand for [{Rα,i}]. Considering the case when the total potential energy U is taking one of its simplest forms, in that it serves only to model the two-body (pairwise) interactions among polymer residues, salt ions, and counterions, we further confine the interaction types in our formulation to Coulomb electrostatics (el) and excluded-volume (ex). As stated in the maintext, their interaction strengths are governed by the Bjerrum length lB (lB = e2/4πε0εrkBT, where e is protonic charge, ε0 is vacuum permittivity, and εr is relative permittivity), and the two-body excluded volume parameter v2. We now provide the precise field-theoretic forms for the Coulomb electrostatic potential Uel and two-body excluded volume interaction Uex terms in U = Uel + Uex by introducing the number density operators
for the monomers (residues or beads) of the polymer (p), salt (s), and counterion (c), respectively, where δ represents the Dirac δ distribution. Solvent (water) degrees of freedom [nw in Eq. (2) for Z in the maintext] are not included in Eq. (S2) above because they are only used for the incompressibility constraint in our rG-RPA formulation as an approximate treatment of excluded volume, and solvents are not treated explicitly at all in the present field-theoretic simulation (FTS), i.e., the nw factor is dropped for FTS. The corresponding charge density operators for the number density operators in Eq. (S2) are
For for arginine and lysine, σi = —1 for aspartic and glutamic acids, σi = —2 for phosphorylated tyrosine, and σi = 0 for all other amino acid residues in the Caprin1/pY-Caprin1 sequences studied (as stated in the maintext). Uel and Uex are now given by
As mentioned in the maintext, two conjugates fields, ψ(r) for Coulomb interaction and w(r) for excluded volume, are then introduced to linearize the density operators that are quadratic in Uel and Uex by applying the Hubbard-Stratonovich transformation [139, 140], resulting in a reformulated partition function Z’ ≡ (np!ns!nc!nw!)Z [with Z given by Eq. (2) of maintext] expressed as a functional integral over the fields ψ and w:
where Qp, Qs, and Qc are single-molecule partition functions of polymer, salt ion, and counterion, respectively, which are all functionals of ψ and w:
wherein 1 is the imaginery unit, i.e., 12 = —1.
The Z’ in Eq. (S5) can be analyzed via various field theoretic approaches. Two approaches are utilized in the present work: (i) one-loop perturbation expansion is employed to derive analytical theories based upon random-phase-approximation (RPA), and (ii) field-theoretic simulation (FTS) is conducted to compute observables numerically.
Renormalized-Gaussian random-phase-approximation (rG-RPA)
As mentioned in the maintext, sequence-specific random phase approximation (RPA) has been applied successfully to model electrostatic effects on the LLPSs of various polyam-pholytic IDRs [10, 16, 84, 90, 91] to obtain behaviorial trends consistent with experiments and explicit-chain simulations; but RPA is less appropriate for polyelectrolytes with large net charge per residue (NCPR) [92–94] because of RPA’s treatment of polymers as ideal Gaussian chains [95]. This approximation is reasonable for polyampholytes but not for polyelectrolytes. While overall intrachain electrostatic effects in polyampholytes can be mild because of the polymers’ nearly zero net charge and thus entail only a minor perturbation on conformational statistics, repulsive electrostatics in polyelectrolytes with significant net charge is strong, leading to more rod-like conformations with statistics deviating significantly from that of Gaussian chains. Consequently, treating polyelectrolytes as Gaussian chains can lead to large errors in theoretical intrachain and interchain residue-residue (monomer-monomer) correlations, resulting in drastically overestimated LLPS propensities [95].
The rG-RPA theory was put forth by some of the present authors [62]. For a broad overview, we briefly summarize here the major methodological steps and key results of the theory. Interested readers are referred to ref. [62] for further details. As rG-RPA has been designed and verified to tackle polyeletrolyte conformations appropriately [62], we apply it here to the polyelectrolytic Caprin1 IDR. Because rG-RPA allows for a smooth crossover between polyelectrolytic and polyampholytic systems, Caprin1 and pY-Caprin1 can now be analyzed in a universal theoretical formulation without invocation of ad hoc treatments for their different conformational statistics.
In our formulation of rG-RPA theory, simplifying assumptions are made to the effect that excluded volume is taken into account only between pairs of different polymer chains (no consideration of intrachain excluded volume) and small ions are treated as point charges. Denoting the input “bare” Kuhn length as l, and the total free energy and volume of the system as F and Ω respectively, the system free energy in units of kBT per volume l3 is given by
Here s is translational entropy
where ϕp, ϕs, ϕc, and ϕw = 1-ϕp–ϕs–ϕc are, respectively, volume fractions of polymers, salt ions, counterions, and solvent, with the last equality following from the incompressibility condition that we have stipulated. The fion term in Eq. (S7) accounts for the free energy of the small ions via the form
where is the Debye screening length. The term f0 in Eq. (S7) is the zeroth-order excluded volume effect given by
where ρm = npN/Ω is the average monmer (residue or bead) density of the polymers in the system and the expression fp = -(l3/Ω) ln Zp for the last term in Eq. (S7) is derived from the polymer partition function
An analytical perturbative field theory may now be derived from Zp by considering the Taylor expansion of In Qp up to the second order of ψ and w while omitting terms that do not affect the relative energies of the configurations, viz.,
where and are the overall average charge and number densities, respectively, of the polymer [cf. Eqs. (S2) and (S3) above and Eq. (A37) of ref. [62]]. It follows that Zp can then be approximated as a Gaussian integral in the Fourier-transformed k-space,
where , , and are charge-charge, mass-mass (i.e., matter-matter), and masscharge (matter-charge) correlation functions in k-space, and . The free energy fp is then given by
The correlation functions in Eq. (S14) may be estimated by various field-theory approximations. In rG-RPA, they are evaluated using a variational approach to the single-polymer partition functon Qp by first expressing the Hamiltonian Hp in Eq. (3) of the maintext as the sum of an approximate Gaussian-chain Hamiltonian with an effective (renormalized) Kuhn length xl (recall l is the original “bare” Kuhn length) plus the remaining term:
where
are the same equations as Eqs. (4a) and (4b) in the maintext and adopt essentially the same form as the unrenormalized Eq. (S6a). To make progress, we take the polymer square end-to-end distance as a key physical observable and require its thermodynamic average to be produced by in good approximation. Based on this premise, a variational theory as described in ref. [37] is applied to calculate the x parameter in the above Eq. (S16). Details of the derivation are given in the Appendices of ref. [62]. Here we show only the variational equation for solving x:
where is the 2 × 2 matrix in Eq. (S13) with l → xl (renormalized Kuhn length), |i — j| → |i — j|/x (renormalized contour length) and thus l2|i — j| → l2x|i — j| such that the correlation functions in become [62]
and the in the integrand on the right hand side of Eq. (S17) is now given by
where
The rG-RPA+FH formulation.
Because the above field theory is formulated to focus only on sequence-specific electrostatics and excluded volume, in the form presented above it does not account for short-range attractions such as those arising from π-related and hydrophobic effects; but we need to take these effects into consideration to arrive at a more direct comparison between rG-RPA predictions and experiments, e.g., as those provided in Fig. 1c,d of the maintext. To account for these interactions approximately in Caprin1, particularly the interactions involving π-electrons [74], we introduce, as before [16], a temperature-dependent Flory-Huggins (FH) interaction to augmented the free energy f in Eq. (S7) [62, 84], resulting in an overall total free energy
where Δh and Δs are the enthalpic and entropic components, respectively, of the mean-field Flory-Huggins interaction for favorable non-electrostatic attraction. With this augmented rG-RPA+FH system free energy f in hand, we solve the solute and solvent concentrations in dilute and condensed phases by balancing the chemical potentials of each solute components and the osmotic pressures in the two phases. When the salt concentration is assumed for simplicity to be uniform throughout the system (Fig. 1c,d of the maintext), the system only has ρm as a variable and the binodal phase separation concentrations are readily obtained by solving the standard common tangent conditions [87, 141]. The Flory-Huggins parameters Δh and Δs fitted to the experimental data are Δh = 1.08 kcal mol-1, Δs = 0 for the Caprin1 phase diagrams in Fig. 1c and Δh = 1.08 kcal mol-1, Δs = -29.8 kcal mol-1K-1 for the pY-Caprin1 phase diagrams in Fig. 1d. When the uniform salt concentration restriction is removed to allow for fully varying salt and polymer concentrations (Fig. 2 of the maintext), the final concentrations in the two phases depend on their initial bulk (overall) concentrations. The corresponding two-dimensional, polymer-salt phase diagram (at a given temperature T) is obtained by similar balancing conditions [142]. As stated in the maintext, the two-dimensional rG-RPA phase diagrams in Fig. 2 are for T = 300 K.
rG-RPA-predicted effects of counterion valency on Caprin1 LLPS.
As discussed in the maintext, with monovalent salt (Na+), rG-RPA predicts that Caprin1 does not undergo LLPS at [Na+] = 0 when the counterion (Cl–) is monovalent (maintext Fig. 2a,b), but Caprin1 LLPS is possible at [Na+] = 0 when the counterion is divalent (maintext Fig. 2e,f). As mentioned in the maintext, a likely physical reason for this effect is the difference in configurational entropy loss of monovalent vs divalent counterions in the Caprin1-condensed phase. Apparently, when [Cl–] is just sufficient to balance the net positive charge of Caprin1 (i.e., when [Na+] = 0), the entropic cost of concentrating Cl– in a Caprin1-condensed phase cannot be overcome for Caprin1 LLPS to occur. The entropic cost will be lessened (and thus more favorable to Caprin1 LLPS) when there are more Cl– ions beyond what is necessary to balance the net positive charge of Caprin1, corresponding to a situation with nonzero [Na+] from the added NaCl to supply the additional Cl– ions. In comparison, when the counterion is divalent [(ATP-Mg)2- in our case], the number of counterions needed for balancing the positive net charge of Caprin1 is half of that when the counterion is monovalent. It follows that the entropic cost of concentrating the divalent counterion in the Caprin1-condensed phase is less and consequently, at least in the present situation, no added salt is needed for Caprin1 LLPS.
Explicit-ion coarse-grained explicit-chain molecular dynamics (MD) simulation
Coarse-grained MD simulations are performed with the GPU version of HOOMD-Blue software [143, 144] using the slab method that has been developed recently to allow for simulations of relatively large number of polymers [102] and applied to liquid-liquid phase separation (LLPS) of intrinsically disordered proteins (IDPs) [78]. This general MD protocol has been utilized extensively [86, 145], including by our group [84, 146].
Within the methodological framework of this coarse-grained simulation protocol, we introduce explicit small ions into our present simulations because they are necessary to account for subtle experimental observations that are not readily reproduced by using implicit-ion electrostatic screening. Simulations in the present study are performed with 100 chains of the Caprin1 IDR (wildtype or variants) at four salt ([NaCl]) concentrations: (i) at [NaCl] = 0 where the system is neutralized by adding appropriate number (1,300) of chloride (Cl–) ions, (ii) neutralized and at [NaCl] = 200 mM by adding 15,000 pairs of explicit Na+ and Cl– ions, (iii) neutralized and at [NaCl] = 480 mM by adding the same number of 15,000 pairs of explicit Na+ and Cl– ions, and (iv) neutralized and at [NaCl] = 960 mM again with 15,000 pairs of explicit Na+ and Cl– ions. As will be described below, specific small-ion concentrations in (ii), (iii), and (iv) are implemented by varying the size of the final simulation box. A similar procedure is used for simulation of pY-Caprin1 IDR phase behaviors under these four [NaCl] values. Because each pY-Caprin1 IDR has a net -1 charge, the only difference with the Caprin1 IDR simulation is that neutralization of the pY-Caprin1 chain requires 100 Na+ ions instead of 1,300 Cl– ions. The amino acid sequences simulated using coarse-grained MD in the present study are provided in Fig. S1. Note that the experimental pY-Caprin1 sample is highly phosphorylated, consisting mainly of a mixture of Caprin1 IDRs with six or seven phosphorlations, with only a very small fraction of IDRs with five phosphorylations, and essentially no population with fewer than five phosphorylations (Fig. S2). As stated in the maintext, for the sake of simplicity in our theoretical/computational models, we use only the Caprin1 IDR with all seven tyrosines phosphorylated (referred to simply as pY-Caprin1 in Fig. S1) to model the behaviors of this experimental sample, partly to avoid the combinatoric complexity of sequences with five or six phosphorlations, which would entail 21 and 7 possible different sequences, respectively, with currently unknown population fractions.
Coarse-grained MD interaction potentials.
Following prior works [78, 84], each amino acid residue is modeled by a single bead. Beads representing different amino acid residues have different masses, sizes, and engage in pairwise interactions with different strengths [78, 84]. Following the notations of our earlier simulation works [80, 84, 146], we consider np number of polymers labeled as μ, v = 1, 2, …np, with each polymer consisting of N beads labeled by i, j = 1, 2, … , N, and nc counterions to neutralize the charged polymers. Coarse-grained MD is readily applicable to studying variations in LLPS properties among RtoK variants [84] considered here for Caprin1 (Fig. S1). In contrast, since RtoK substitutions do not change the sequence charge pattern of any given sequence, rG-RPA theory as formulated above does not account for their effects on LLPS, though polymer field theory can be extended to incorporate such effects in more sophisticated formulations [107].
For salt-dependent LLPS (ns ≠ 0), we consider n+ small cations and n– small anions. These small cations and anions are classified as salt ions or counterions depending on the net charge of the polymer (see maintext as well as discussion below in this SI Appendix). The small ions are labeled, respectively, by γ = 1,2,…, n+ and β = 1,2,…, n–, and they correspond to Na+ and Cl– in the present coarse-grained MD simulations. As stated in the maintext, our total MD potential energy is the sum of electrostatic (el), short-spatial-range (sr) Lennard-Jones (LJ)-type, and bonding (bond) interactions:
For our systems of interest, the electrostatic part is a sum of polymer-polymer (pp), polymer-small-ion (pi), and small-ion-small-ion (ii) contributions:
As before [107], the polymer-polymer potential energy is given by
where, as in the above field-theoretic formulation, e is protonic charge, ε0 is vacuum permittivity, and εr is relative permittivity (dielectric constant). Here, rμi,vj is the spatial distance between the ith residue of the μth polymer and the jth residue of the vth polymer. The Kronecker symbol δ signals exclusion of the self-interacting μ = v, i = j terms in the summations (irrespective of the values of these terms) because 1 — δxy = 0 if x = y and 1 - δxy = 1 otherwise. In units of the protonic charge e, σi of unphosphorylated amino acid residue beads are taken from ref. [78]. Except lysine and arginine (σi = +1), glutamic and aspartic acid (σi = —1), histidine (σi = +0.5)—but note that there is no histidine in the Caprin1/pY-Caprin1 sequences simulated here, and phosphorylated tyrosine (σi = —2), all other amino acid residues are assigned zero charge. Similarly, the interaction between polymers and small ions is given by
where, in the term after the first equality, the summation ∑k (enclosed in curly brackets) is over small ion types, summation indices γ(+) and γ(—) label, respectively, the positively and negatively charged small ions, nγ(+) and nγ(—) are the total numbers of these ions, and rμi,γ(k) is the spatial distance between the ith residue of the μth polymer chain and the small ion labeled by γ(k). After the second equality, the two terms in ∑k are written explicity, now with rμi,γ/β being the spatial distance between the ith residue of the pth polymer chain and the γ/β (γ or β)-labeled positive/negative small ion as well as σ+ and σ– being the charges of the small positive and negative small ions, respectively. Following ref. [147], we take σ+ = +1 for Na+ and σ– = —1 for Cl–. Depending on the net charge of the polymer, counterions can be included in either the n+ or n– count. For instance, for the positive charged Caprin1 IDR, the total number n– of negatively charged small ions (Cl–) includes the numbers nc counted as counterions and ns counted as salt ions.
The interaction between the small ions is given by
where rγ(k),γ’(k’) is the spatial distance between two small ions. For the MD simulations in this work, the positively and negatively charged small ions correspond to Na+ and Cl– respectively. Eq. (S26) is equivalent to
where rxy is the spatial distance between a pair of small ions labeled by x and y. As rationalized previously [84] in the context of experimental measurements of dielectric properties of biological systems [148], we use εr = 40, a value lower than the ~ 80 dielectric constant of bulk water, for all simulations reported in the present work.
Short-spatial-range non-bonded LJ interactions are similarly constituted by three components, viz., those for polymer-polymer (Usr,pp), polymer-small-ion (Usr,pi), and small-ion- small-ion (Usr,ii) interactions:
Here we adopt the Kim-Hummer (KH) [149] interaction scheme for Usr,pp. KH is based on the Miyazawa-Jernigan (MJ) statistical potential [150] derived from folded globular protein structures in the Protein Data Bank (PDB) and is therefore expected to reflect the energetics of polypeptides, especially the driving forces pertinent to protein folding, its limitations [84] notwithstanding. Our previous work shows that the KH potential is adequate for rationalizing the rank ordering of LLPS propensities of the N-terminal IDR of DEAD-box RNA helicase Ddx4 and its charge scrambled and arginine-to-lysine (RtoK) variants. KH also rationalizes the rank ordering of LLPS propensities of WT and an RtoK variant of LAF-1 [84]. We therefore stipulate that the KH interaction scheme is appropriate, at least as a first approximation, to address the LLPS propensities of Caprin1 and its RtoK variants. The degree to which the differences in simulated LLPS propensity among these Caprin1 variants are affected by how interactions involving K and R are treated differently by the model potential function [84, 86, 106] should be further explored in the future. As before, Usr,pp takes the following form [78, 84]:
in which
and aμi,vj = aij = (ai + aj)/2, where the van der Waals diameters ai and aj, depend only, respectively, on the amino acid residue type (one of twenty) for residue i and residue j (Table S1 of ref. [78]). In contrast, the parameters and εμi,vj = εij depend on both the residue types of residues i and j. Values for εij are provided in Table S3 of ref. [78]. The formula for is given by Eq. (5) of ref. [78] as well as Eqs. (S10) and (S11) of ref. [84].
For the LJ interactions between polymers and small ions, we recognize that while the coarse-grained KH parameters are based on statistical analysis of known folded protein structures, LJ interaction parameters for small ions are typically scaled to match certain physical and chemical properties [147]. Thus it is not straightforward to postulate an interaction scheme based upon first principles. To make progress and to maintain simplicity of our model, we adopt the LJ form [ULJ in Eq. (S30)] for Usr,pi with a uniform εμi,vj = εij = 0.142 (denoted εp± = 0.142) for all residue-small ion pairs. This εij = εp± value is equal to that for a pair of alanine residues in the KH potential and is neither too strong nor too weak among εij values for pairwise interactions between amino acid residues. Accordingly,
where ai+ = (ai + a+)/2, ai– = (ai + a–)/2, with a+ and a– being, respectively, the van der Waals diameters of the positively and negatively charged small ions. For the present MD simulations, , and their values are adopted from ref. [147]. Similarly, for small-ion-small-ion LJ interactions,
where ε+ and ε– are, respectively, the LJ interaction energy parameter for the positively and negatively charged small ions, ε+- = (ε+ε–)1/2, and a+- = (a+ + a–)/2. For the present MD simulations, the and values are also adopted from ref. [147].
As before, the bond-length energy term Ubond in Eq. (S22) for chain connectivity is modeled by a harmonic potential,
Following previous studies [78, 84], Kuhn length l = 3.8 Å is taken as the Cα—Cα virtual bond length for trans polypeptides and Kbond = 10 kJmol-1Å—2.
Simulation protocol.
In each of our coarse-grained MD simulations, the IDR chains and ions are initially placed randomly in a sufficiently large cubic box of dimensions 300 × 300 × 300 Å3. Energy minimization is then performed using the FIRE algorithm (available in the HOOMD-Blue package) which includes removal of steric clashes among the initially placed amino acid beads. Next, the system is compressed at a low temperature of 100 K at 1 atm pressure for a period of 50 ns using the Martyna-Tobias-Klein (MTK) thermostat and barostat [151, 152] with a coupling constant of 1 ps. The equations of motion are integrated using velocity-Verlet algorithm with a timestep of 20 fs. Periodic boundary conditions are applied in all three directions. The electrostatic interaction is computed using the PPPM algorithm [153] available in the package. We use a cut-off distance of 15 Å for the short-spatial-range non-bonded interactions. After this initial NPT step, we compress the simulation box again along the three dimensions for a period of 50 ns until it reaches a sufficiently high density, using Langevin dynamics for an NVT ensemble with a friction coefficient of 1 ps-1. At the end of this compression step, the dimensions of the simulation box for Caprin1 and its four RtoK variants are 115 × 115 × 115 Å3 for [NaCl] = 0 (no small ions beside counterions) and 155 × 155 × 155 Å3 for [NaCl] = 200 mM, 480 mM, and 960 mM. For pY-Caprin1, the corresponding dimensions are 115 × 115 × 115 Å3 for [NaCl] = 0 (no small ions beside counterions) and 150 × 150 × 150 Å3 for [NaCl] = 200 mM, 480 mM, and 960 mM. Next, the system is expanded along one of the spatial dimensions (taken as the z-axis) using isotropic linear scaling for 10 ns while keeping the temperature constant at 100 K. For Caprin1 and its four RtoK variants, the simulation box length in the z-direction is expanded 14 times for [NaCl] = 0, 33.6 times for [NaCl] = 200 mM, 14 times for [NaCl] = 480 mM, and 7 times for [NaCl] = 960 mM. For pY-Caprin1, the expansion factors along the z-direction are 10 times for [NaCl] = 0, 37.07 times for [NaCl] = 200 mM, 15.47 times for [NaCl] = 480 mM, and 7.73 times for [NaCl] = 960 mM. Note that the simulation box volumes for Caprin1, its RtoK variants, and pY-Caprin1 after this last expansion are identical for the same [NaCl] because the same numbers of polymer chains and small ions are used. The practical reason for keeping the number of Na+ and Cl– ions constant for the higher salt concentration is to minimize computational cost. In other words, the three salt concentrations (200 mM, 480 mM and 960 mM) are achieved here by using different box dimensions. After the last box expansion, NVT equilibration using the Langevin thermostat with a friction coefficient of 1 ps-1 is performed for 2 μs at various temperatures. Final production run is then carried out for another 4 μs with the same Langevin thermostat using a much lower friction coefficient of 0.01 ps-1 for sampling efficiency. The snapshots are saved every 1 ns for further analysis. Detailed descriptions of how to construct a phase diagram from simulation trajectories are provided in ref. [78] and our previous works [84, 87]. This simulation protocol and the above-described coarse-grained MD model are used to produce the phase diagrams, distributions, and snapshots in Figs. 4–6 of the maintext and Fig. S3.
Comparison with atomic simulations with a preformed condensate.
As mentioned in the maintext, explicit-water, explicit-ion atomic simulations in the presence of a preformed condensate of the N-terminal RGG domain of LAF-1 with a net charge of +4 produce enhanced Cl– and depleted Na+ in the IDR-condensed phase [108]. This trend is consistent with our implicit-water, explicit-ion MD result for Caprin1 with net charge +13 (maintext Fig. 4c). By comparison, corresponding atomic simulations in the presence of a preformed condensate of the low complexity domain of FUS with a net charge of -2 produce a significant depletion of Cl– and a minor depletion of Na+ in the IDR-condensed phase [108], which is opposite to the trend seen here for pY-Caprin1 with net charge -1 in maintext Fig. 4d and Fig. S3. Whether this difference is caused by the multiple phosphorylated sites with a -2 charge in pY-Caprin1 remains to be elucidated.
Field-theoretic simulation (FTS) for multiple-component LLPS
Biomolecular condensates in vivo can contain hundreds of protein and nucleic acid species. Therefore, to address their biophysical properties and biochemical functions, theories—starting with rudimentary constructions—are needed for multiple-component LLPS. As a first appproximation and a tool for conceptualization, we find it valuable to utilize FTS—especially recently developed FTS approaches for biomolecular LLPS [81, 88, 110, 115] and their extensions—to gain insights into the energetic basis of sequencespecific spatial distributions of various biomolecular components in and out of phase-separated condensates.
FTS enjoys the fundamental advantage that it is not limited by some of the approximations in analytical theories such as RPA and rG-RPA because FTS accounts fully for all field fluctuations in principle. FTS is thus a valuable alternative to analytical theories, though it is computationally more costly in practice and can be impeded by lattice-related artifacts and limitations arising from the small FTS simulation box sizes necessitated by numerical tractability. We view analytical theories and FTS as complementary.
The starting point of FTS is a statistical field theory [e.g., Eq. (S5), which is equivalent to maintext Eq. (5)]. To avoid numerical instabilities, we treat polymer beads and ions as (smeared) Gaussian distributions [154] instead of the point particles stipulated by the Dirac δ-functions in Eqs. (S2) and (S3). For simplicity, this regularization is implemented using a common component-independent width ā irrespective of chemical species by making the general replacements δ(r — ra) → Γ(r — ra) and δ(r — Rα,i) → Γ(r — Rα,i), in Eqs. (S2) and (S3) where .
A general field-theoretic Hamiltonian applicable to a system comprising of one or more charged polymer species including Caprin1, pY-Caprin1, (ATP-Mg)2- (maintext Fig. 7a), ATP4- and small ions such as Na+, Cl– , and Mg2+ is given by
where the Qm functionals (m labels system components) are in general complex when evaluated beyond quadratic order in the fields. Equation (S34) above is identical to Eq. (5) of maintext and formally equivalent to the Hamiltonian in Eq. (S5). Because of the above-described Gaussian smearing, the fields in the arguments of Qm are now convoluted with Γ, i.e. , where the generic ϕ = w, ψ.
Complex Langevin evolution in fictitious time.
A simulation approach developed in the 1980s to handle the complex nature of H[w,ψ] and obtain statistical (Boltzmann) averages is the Complex Langevin (CL) method [111, 112], which analytically continues the fields w and ψ into their respective complex planes and introduces an fictitious (artificial, unphysical) time-coordinate t on which w(r,t) and ψ(r,t) now depend. The CL time evolution is governed by stochastic Langevin differential equations [maintext Eq. (6)], as follows:
where ηϕ(r,t) is real-valued Gaussian noise with zero mean:
Thermal averages of any thermodynamic observable can then be computed in the field picture (indicated by “<… >F” with subscript “F”) using a corresponding field operator through averages over all possible equilibrium field configurations, which in turn translate into asymptotic CL time averages with no final dependence on the fictitious time variable t, i.e.,
The Langevin Eq. (S35) involves functional derivatives of the Hamiltonian with respect to the complex fields, which are formally evaluated as
where
are field operators corresponding, respectively, to number- and charge density of chemical component m.
Number density correlation functions.
Information about the polymer-polymer, polymer-ion, ion-ion association and ion partitioning into the condensate can be gleaned from number density-number density correlation functions [maintext Eq. (7)]
which can be computed in field theory [88] as
The Gm,n functions are useful for assessing Caprin1 and pY-Caprin1 phase separation and the colocalization of ATP-Mg with the polymer condensed droplet (maintext Fig. 7b-e). Information with higher spatial resolution can also be provided by Gm,n if we identify component m with individual polymer bead (labeled by i) along a chain sequence.
In some situations, the physical implications of density-density correlation functions Gm,n(r) are more apparent when normalized by the component bulk densities and , as discussed in the maintext in connection with the correlation functions shown in Fig. 7.
For small ions that are each represented by a single Gaussian distribution, the density operator is given by
and the charge density operator is , where zm is the charge of ion species m and, as defined above, Ω is the system volume. For polymers (denoted “p”), the density and charge-density operators are calculated using forward (subscript “F”) and backward (subscript “B”) chain propagators qF (r, i) and qB (r, i) as follows, with i being the label for the beads/monomers along the polymer chain:
and the chain propagators are constructed iteratively as
with the starting , , and we have used b to denote Kuhn length (b = l) in the present FTS formulation to conform to the notation in our published FTS studies [87, 88, 107]. In the present work, the correlation functions in maintext Fig. 7 and Figs. S4–S6 are computed by integrating pertinent CL fictitious-time evolution equations defined in Eq. (S35) using the first order semi-implicit method of ref. [155].
Residue-specific Caprin1-(ATP-Mg) association.
As outlined in Materials and Methods of the maintext, residue-specific properties of the polymers in our FTS systems can be gleaned from the Gm,n function in Eq. (S40) by identifying m as individual polymer beads (indexed by i) along the polymer chain sequence, viz., define where , and q is another component in the FTS system. Accordingly [maintext Eq. (9)],
with a reasonably small residue-q distance rcontact (spatial separation between residue i and the positions of particles belonging to component q) can be used to represent residue-specific relative residue-q contact frequencies. A normalized version of this quantity is defined by
where and are the bulk (overall) densities, respectively, of the q-component and the ith residue along the given polymer species. Values of in the above Eq. (S46) for p = Caprin1 and q = (ATP-Mg)2- , Na+, or Cl– under the simulation conditions we considered are provided in maintext Fig. 7f. The variation in for (ATP-Mg)2- with residue position i is largely consistent with the experimental trend of NMR-measured volume ratios on Caprin1-ATP association in ref. [74].
In all the FTS simulations in this study except for a part of the model with no (ATP-Mg)2- described immediately below, we use a cubic simulation box of length L = NLΔx where NL = 32 (i.e., a 32 × 32 × 32 lattice) and is the lattice resolution. In view of the periodic boundary conditions implemented for all three spatial dimensions, the maximum possible physical distance between two volume elements is . All possible physical distances r = ri,j,k on this cubic simulation box satisfy the relation , for some i, j, k = 0, 1, 2, … , 31. There is thus a finite number of discretized distances between 0 and . One of these discretized distances, r = 1.47b ≈ 1.50b is used for the integrations in maintext Eq. (9) and Eqs. (S45) and (S46) above.
Further details of the main FTS model utilized for the results in maintext Figs. 7 and 8 as well as alternate FTS models discussed under the heading “Robustness of general trends predicted by field-theoretic simulation” in the maintext are provided below in ascending order of number of components treated by the model:
FTS models of Caprin1/pY-Caprin1 with Na+ and Cl– but no ATP-Mg.
FTS is conducted for Caprin1 and pY-Caprin1 at various concentrations of explicit Na+ and Cl– ions. For all systems, polymer bead concentration is fixed at npN/Ω = 0.4b-3. The salt concentrations, here referring to the concentration of the small ion species with same sign charge as the net charge of the polymer, are set to [NaCl] b3 = 0, 10-6, 10-5, … 10-1, 100. Additional small ions of charge opposite to the polymer net charge are added to achieve overall electric neutrality of the system. For the results in Fig. S4e-h, simulations are performed in an elongated simulation box on a 24 × 24 × 80 lattice with lattice spacing set to the Gaussian smearing length, . The Complex-Langevin (CL) evolution equations [Eq. (S35)] are integrated using a time-step Δt = 10-3b3 for 6 × 104 steps and the system is sampled every 50th step. An equilibration period of 3 × 104 CL steps, determined by monitoring the equilibration of the chemical potentials for each molecular species, is discarded from each trajectory. Eight independent simulations are run for each combination of salt concentration and Caprin1 or pY-Caprin1. All simulations are performed at lB = 7b and v2 = 0.0068b3. The density profiles shown in Fig. S4e,f are obtained by averaging the real part of the field theoretic polymer bead density operator over the x- and y dimensions (i.e. the dimensions corresponding to the short sides of the simulation box). The resulting one-dimensional density snapshots were then individually centred around their center-of-mass z-coordinate zc.o.m. before taking the trajectory average to give the profiles in Fig. S4e,f. The shaded uncertainty bands in these plots indicate the root-mean-squared difference among the eight independent runs. The density profiles are then used to estimate the coexisting condensed and dilute phases shown in Fig. S4g,h. Here, the condensed-phase concentration is obtained as the average density at zc.o.m. , whereas the dilute-phase concentrations are estimated as the average density among the 10 and 60 z-coordinates (for Caprin1 and pY-Caprin1, respectively) furthest from zc.o.m.. Consistent with rG-RPA and explicit-MD, these FTS models show reentrant behavior—albeit subtle—for Caprin1 at low [protein]s (an LLPS region is seen in Fig. S4g at intermediate [NaCl]s but not at higher or lower [NaCl]s) but not for pY-Caprin1 (no such feature in Fig. S4h).
FTS models for Caprin1/pY-Caprin1 with (ATP-Mg)2- and either Na+ or Cl–.
Simulations are performed at lB = 5b on a periodic 32 × 32 × 32 grid (see above) with CL time step Δt = 0.002. All ATP4-s and Mg2+s are assumed to be in the complex (ATP-Mg)2- form with charge sequence (-1 -1 -1 -1 +1 +1) as depicted in maintext Fig. 7a. Bulk Caprin1 and pY-Caprin1 bead densities in the their respective simulation systems are both set at 0.1b-3. For Caprin1, depending on the concentration of (ATP-Mg)2- , counterions Na+ or Cl– (but not both) are added to maintain overall electric neutrality of the FTS system. For pY-Caprin1, Na+ is added as counterions to maintain overall electric neutrality. Results from this set of models are provided in Fig. S5. The bands representing sampling uncertainties in the correlation function plots in maintext Fig. 7b-e and Fig. S5 (top) and Fig. S6a,b are standard deviations across eight independent simulations. If we take the model Kuhn length b in FTS as the Cα-Cα virtual bond length ≈ 3.8 Å of polypeptides, a unit bead concentration of b-3 is equivalent to ~ 30 M. Because (ATP-Mg)2- is modeled by six beads (Fig. 7a of maintext), a model bead concentration of (ATP-Mg)2- (denoted as [(ATP-Mg)2-] in our FTS results) which is equal to b-3 reported for the present FTS results is equivalent to a molar concentration of ~ 5 M of (ATP-Mg)2-. As mentioned above, since excluded volume is often significantly underestimated in FTS [88], we do not directly compare FTS model (ATP-Mg)2- concentrations with experimental (ATP-Mg)2- concentrations, which tend to be substantially lower. Instead, physical insights are gleaned from the trend of variation of model concentrations.
FTS models for Caprin1/pY-Caprin1 with (ATP-Mg)2-, Na+ and Cl–.
We use this set of models for the results in maintext Figs. 7 and 8. Simulations are performed at lB = 7b on a periodic 32 × 32 × 32 grid (see above) with CL time step Δt = 0.005. Again, all ATP4-s and Mg2+s are assumed to be in the complex (ATP-Mg)2- form with charge sequence (-1 -1 -1 -1 +1 +1) as depicted in maintext Fig. 7a. Bulk Caprin1 and pY-Caprin1 bead densities in the their respective simulation systems are both set at 0.1b-3. Three concentrations of (ATP-Mg)2- are studied: [(ATP-Mg)2-]= 0.0001b-3, 0.03b-3, and 0.5b-3. With overall electric neutrality of the simulation system in mind, for Caprin1 (WT), the bulk densities for Na+ and Cl– are [Na+] = 4[(ATP-Mg)2-]/6 and [Cl–] = 2[(ATP-Mg)2-]/6 + 13[Caprin1]/103. For pY-Caprin1, [Na+] = 4[(ATP-Mg)2-]/6 + [pY-Caprin1]/103 and [Cl–] = 2[(ATP-Mg)2-]/6.
FTS models for Caprin1/pY-Caprin1 with ATP4-, Mg2+, Na+ and Cl–.
In contrast to the above models, here we consider ATP4- and Mg2+ as independent components. That is, they can freely dissociate if the favorable electric interaction between them is insufficiently strong. In this set of models, ATP4- is taken as a four-bead charge sequence (-1 -1 -1 -1) whereas Mg2+ is modeled by a single bead with charge 2+ [instead of the two -1 beads in the (ATP-Mg)2- model in maintext Fig. 7a]. As before, bulk Caprin1 and pY-Caprin1 bead densities in the their respective simulation systems are both set at 0.1b-3, and the same three [(ATP-Mg)2-] = 0.0001b-3, 0.03b-3, and 0.5b-3 are studied. For Caprin1 (WT), the bulk densities for Mg2+ , ATP4- , Na+ and Cl– are given by [Mg2+] = [ATP4-]/4, [Na+] = [ATP4-], and [Cl–] = 2[Mg2+] + 13[Caprin1]/103. For pY-Caprin1, [Mg2+] = [ATP4-]/4, [Na+] = [ATP4-] + [pY-Caprin1]/103, and [Cl–] = 2[Mg2+]. Results from this set of FTS models are provided in Fig. S6.
Supporting Figures
References
- [1]Liquid phase condensation in cell physiology and diseaseScience 357
- [2]Phase separation as a missing mechanism for interpretation of disease mutationsCell 183:1742–1756
- [3]A framework for understanding the functions of biomolecular condensates across scalesNat. Rev. Mol. Cell Biol 22:215–235
- [4]Temperature, hydrostatic pressure, and osmolyte effects on liquid-liquid phase separation in protein condensates: Physical chemistry and biological implicationsChem. Eur. J 25:13049–13069
- [5]Deciphering how naturally occurring sequence features impact the phase behaviours of disordered prion-like domainsNat. Chem 14:196–207
- [6]A solid-state conceptualization of information transfer from gene to message to proteinAnnu. Rev. Biochem 87:351–390
- [7]Evaluating phase separation in live cells: Diagnosis, caveats, and functional consequencesGenes Dev 33:1619–1634
- [8]Stress granule formation via ATP depletion-triggered phase separationNew J. Phys 20
- [9]Intrinsically disordered linkers determine the interplay between phase separation and gelation in multivalent proteinseLife 6
- [10]Theories for sequence-dependent phase behaviors of biomolecular condensatesBiochemistry 57:2499–2508
- [11]A conceptual framework for understanding phase separation and addressing open questions and challengesMol. Cell 82:2201–2214
- [12]Biological condensates form percolated networks with molecular motion properties distinctly different from dilute solutionseLife 12
- [13]Phase transitions of associative biomacromoleculesChem Rev 123:8945–8987
- [14]Assembly of model postsynaptic densities involves interactions auxiliary to stoichiometric bindingBiophys. J 121:157–171
- [15]Phase transition of a disordered nuage protein generates environmentally responsive membraneless organellesMol. Cell 57:936–947
- [16]Sequence-specific polyampholyte phase separation in membraneless organellesPhys. Rev. Lett 117
- [17]A molecular grammar governing the driving forces for phase separation of prion-like RNA binding proteinsCell 174:688–699
- [18]Pi-Pi contacts are an overlooked protein feature relevant to phase separationeLife 7
- [19]Molecular interactions underlying liquid-liquid phase separation of the FUS low-complexity domainNat. Struct. Mol. Biol 26:637–648
- [20]An interpretable machine-learning algorithm to predict disordered protein phase separation based on biophysical interactionsBiomolecules 12
- [21]Temperature-controlled liquid-liquid phase separation of disordered proteinsACS Cent. Sci 5:821–830
- [22]Pressure sensitivity of SynGAP/PSD-95 condensates as a model for postsynaptic densities and its biophysical and neurological ramificationsChem. Eur. J 26:11024–11031
- [23]RNA buffers the phase separation behavior of prion-like RNA binding proteinsScience 360:918–921
- [24]Phosphoregulated FMRP phase separation models activity-dependent translation through bidirectional control of mRNA granule formationProc. Natl. Acad. Sci. U.S.A 116:4218–4227
- [25]Charge-driven condensation of RNA and proteins suggests broad role of phase separation in cytoplasmic environmentseLife 10
- [26]RNA chain length and stoichiometry govern surface tension and stability of protein-RNA condensatesiScience 25
- [27]Liquid-Liquid Phase Separation in Oligomeric Peptide SolutionsLangmuir 33:7715–7721
- [28]Phospho-dependent phase separation of FMRP and CAPRIN1 recapitulates regulation of translation and deadenylationScience 365:825–829
- [29]The control centers of biomolecular phase separation: how membrane surfaces, PTMs, and active processes regulate condensationMol. Cell 76:295–305
- [30]Polyelectrostatic interactions of disordered ligands suggest a physical basis for ultrasensitivityProc. Natl. Acad. Sci. U.S.A 104:9650–9655
- [31]Probing the diverse landscape of protein flexibility and bindingCurr. Opin. Struct. Biol 22:643–650
- [32]Polycation-n interactions are a driving force for molecular recognition by an intrinsically disordered oncoprotein familyPLoS Comput. Biol 9
- [33]Polymer physics of intracellular phase transitionsNat. Phys 11:899–904
- [34]Theoretical perspectives on nonnative interactions and intrinsic disorder in protein folding and bindingCurr. Opin. Struct. Biol 30:32–42
- [35]Charge segregation in the intrinsically disordered region governs VRN1 and DNA liquid-like phase separation robustnessJ. Mol. Biol 433
- [36]Conformations of intrinsically disordered proteins are influenced by linear sequence distributions of oppositely charged residuesProc. Natl. Acad. Sci. U.S.A 110:13392–13397
- [37]A theoretical method to compute sequence dependent configurational properties in charged polymers and proteinsJ. Chem. Phys 143
- [38]Sequence and entropy-based control of complex coacervatesNat. Comm 8
- [39]Analytical theory for sequence-specific binary fuzzy complexes of charged intrinsically disordered proteinsJ. Phys. Chem. B 124:6709–6720
- [40]Functional partitioning of transcriptional regulators by patterned charge blocksCell 186:327–345
- [41]Reentrant phase transition drives dynamic substructure formation in ribonucleoprotein dropletsAgnew. Chem. Int. Ed 56:11354–11359
- [42]Reentrant liquid condensate phase of proteins is stabilized by hydrophobic and non-ionic interactionsNat. Comm 12
- [43]Hydrophobicity of arginine leads to reentrant liquid-liquid phase separation behaviors of arginine-rich proteinsNat. Comm 13
- [44]Interplay between short-range attraction and long-range repulsion controls reentrant liquid condensation of ribonucleoprotein-RNA complexesJ. Am. Chem. Soc 141:14593–14602
- [45]Small molecules as potent biphasic modulators of protein liquidliquid phase separationNat. Comm 11
- [46]Diversity of phase transitions and phase separations in active fluidsPhys. Rev. Res 4
- [47]ATPase-modulated stress granules contain a diverse proteome and substructureCell 164:487–498
- [48]ATP-driven separation of liquid phase condensates in bacteriaMol. Cell 79:293–303
- [49]Dual roles for ATP in the regulation of phase separated protein aggregates in Xenopus oocyte nucleolieLife 7
- [50]Three archetypical classes of macromolecular regulators of protein liquid-liquid phase separationProc. Natl. Acad. Sci. U.S.A 116:19474–1948
- [51]Ligand effects on phase separation of multivalent macromoleculesProc. Natl. Acad. Sci. U.S.A 118
- [52]ATP, Mg2+, nuclear phase separation, and genome accessibilityTrends Biochem. Sci 44:P565–574
- [53]Assessing the hydrotropic effect in the presence of electrolytes: Competition between solute salting-out and salt-induced hydrotrope aggregationPhys. Chem. Chem. Phys 24:21645–21654
- [54]ATP as a biological hydrotropeScience 356:753–756
- [55]Adenosine triphosphate-induced rapid liquid-liquid phase separation of a model IgG1 mAbMol. Pharmaceutircs 18:267–274
- [56]Adensosine triphosphate mediates phase separation of disordered basic proteins by bridging intermolecular interaction networksJ. Am. Chem. Soc 146:1326–1336
- [57]ATP enhances at low concentrations but dissolves at high concentrations liquid-liquid phase separation (LLPS) of ALS/FTD-causing FUSBiochem. Biophys. Res. Comm 504:545–551
- [58]A unified mechanism for LLPS of ALS-FTLD-causing FUS as well as its modulation by ATP and oligonucleic acidsPLoS Biol 17
- [59]Uncovering the molecular mechanism for dual effect of ATP on phase separation in FUS solutionSci. Adv 8
- [60]ATP biphasically modulates LLPS of TDP-43 PLD by specifically binding arginine residuesComm. Biol 4
- [61]ATP regulates RNA-driven cold inducible RNA binding protein phase separationProtein Sci 30:1438–1453
- [62]A unified analytical theory of heteropolymers for sequence-specific phase behaviors of polyelectrolytes and polyampholytesJ. Chem. Phys 152
- [63]Distinct structural features of caprin-1 mediate its interaction with G3BP-1 and its induction of phosphorylation of eukaryotic translation initiation factor 2alpha, entry to cytoplasmic stress granules, and selective interaction with a subset of mRNAsMol. Cell. Biol 27:2324–2342
- [64]Caprin-1 is a target of the deafness gene Pou4f3 and is recruited to stress granules in cochlear hair cells in response to ototoxic damageJ. Cell Sci 124:1145–1155
- [65]Defining the Caprin-1 interactome in unstressed and stressed conditionsJ. Proteome Res 20:3165–3178
- [66]Yin and yang regulation of stress granules by Caprin-1Proc. Natl. Acad. Sci. U.S.A 119
- [67]Fragile X mental retardation protein interacts with the RNA-binding protein Caprin1 in neuronal RiboNucleoProtein complexesPLoS ONE 7
- [68]Identification and characterization of a novel protein (p137) which transcytoses bidirectionally in Caco-2 cellsJ. Biol.Chem 270:20717–20723
- [69]Absence of Caprin-1 results in defects in cellular proliferationJ. Immunol 175:4274–4282
- [70]RNG105/caprin1, an RNA granule protein for dendritic mRNA localization, is essential for long-term memory formationeLife 6
- [71]TRESS granule-associated RNA-binding protein CAPRIN1 drives cancer progression and regulates treatment response in nasopharyngeal carcinomaMed. Oncol 40
- [72]CAPRIN1 haploinsufficiency causes a neurodevelopmental disorder with language impairment, ADHD and ASDBrain 146:534–548
- [73]NMR experiments for studies of dilute and condensed protein phases: Application to the phaseseparating protein CAPRIN1J. Am. Chem. Soc 142:2471–2489
- [74]Interaction hot spots for phase separation revealed by NMR studies of a CAPRIN1 condensed phaseProc. Natl. Acad. Sci. U.S.A 118
- [75]Mapping the per-residue surface electrostatic potential of CAPRIN1 along its phase-separation trajectoryProc. Natl. Acad. Sci. U.S.A 119
- [76]PhosphoSitePlus, 2014: mutations, PTMs and recalibrationsNucl. Acids Res 43:D512–520
- [77]Phosphorylation of the FUS low-complexity domain disrupts phase separation, aggregation, and toxicityEMBO J 36:2951–2967
- [78]Sequence determinants of protein phase behavior from a coarse-grained modelPLoS Comput. Biol 14
- [79]Phosphoregulation of phase separation by the SARS-CoV-2 N protein suggests a biophysical basis for its dual functionsMol. Cell 80:1092–1103
- [80]A lattice model of charge-pattern-dependent polyampholyte phase separationJ. Phys. Chem. B 122:5418–5431
- [81]Complete phase diagram for liquid-liquid phase separation of intrinsically disordered proteinsJ. Phys. Chem. Lett 10:1644–1652
- [82]Small ion effects on self-coacervation phenomena in block polyampholytesJ. Chem. Phys 151
- [83]LASSI: A lattice model for simulating phase transitions of multivalent proteinsPLoS Comput. Biol 15
- [84]Comparative roles of charge, n, and hydrophobic interactions in sequence-dependent phase separation of intrinsically disordered proteinsProc. Natl. Acad. Sci. U.S.A 117:28795–28805
- [85]Charge pattern affects the structure and dynamics of polyampholyte condensatesPhys. Chem. Chem. Phys 22:19368–19375
- [86]Physics-driven coarse-grained model for biomolecular phase separation with near-quantitative accuracyNat. Comput. Sci 1:732–743
- [87]Numerical techniques for applications of analytical theories to sequence-dependent phase separations of intrinsically disordered proteinsPhase-Separated Biomolecular Condensates, Methods and Protocols; Methods in Molecular Biology Humana Press :51–94
- [88]Subcompartmentalization of polyampholyte species in organelle-like condensates is promoted by charge-pattern mismatch and strong excluded-volume interactionPhys. Rev. E 103
- [89]Theory of polyelectrolytes in solutions and at surfacesProg. Polym. Sci 30:1049–1118
- [90]Phase separation and single-chain compactness of charged disordered proteins are strongly correlatedBiophys. J 112:2043–2046
- [91]A simple explicit-solvent model of polyampholyte phase behaviors and its ramifications for dielectric effects in biomolecular condensatesJ. Phys. Chem. B 125:4337–4358
- [92]Phase diagrams of salt-free polyelectrolyte semidilute solutionsMacromolecules 33:7649–7654
- [93]A modified random phase approximation of polyelectrolyte solutionsMacromolecules 36:7824–7832
- [94]Monte Carlo study of Coulombic criticality in polyelectrolytesPhys. Rev. Lett 90
- [95]50th Anniversary perspective: A perspective on polyelectrolyte solutionsMacromolecules 50:9528–9560
- [96]Double screening in polyelectrolyte solutions: Limiting laws and crossover formulasJ. Chem. Phys 105:5183–5199
- [97]A new equation of state of a flexible-chain polyelectrolyte solution: Phase equilibria and osmotic pressure in the salt-free caseJ. Chem. Phys 142
- [98]Electrostatic correlations and the polyelectrolyte self energyJ. Chem. Phys 146
- [99]Polyelectrolyte chain structure and solution phase behaviorMacromolecules 51:1706–1717
- [100]Aqueous solutions of polyvinylsulfonic acid: Phase separation and specific interactions with ions, viscosity, conductance and potentiometryJ. Phys. Chem 63:671–680
- [101]Salting-out and salting-in of polyelectrolyte solutions: A liquid-state theory studyMacromolecules 49:9720–9730
- [102]Vapor-liquid phase equilibrium and surface tension of fully flexible Lennard-Jones chainsMol. Phys 115:320–327
- [103]Configuration-dependent heat capacity of pairwise hydrophobic interactionsJ. Am. Chem. Soc 129:2083–2084
- [104]Improved coarse-grained model for studying sequence dependent phase separation of disordered proteinsProtein Sci 30:1371–1379
- [105]A data-driven hydrophobicity scale for predicting liquid-liquid phase separation of proteinsJ. Phys. Chem. B 125:4046–4056
- [106]Accurate model of liquidliquid phase behavior of intrinsically disordered proteins from optimization of single-chain propertiesProc. Natl. Acad. Sci. U.S.A 118
- [107]Analytical formulation and field-theoretic simulation of sequence-specific phase separation of proteinlike heteropolymers with short- and long-spatial-range interactionsJ. Phys. Chem. B 126:9222–9245
- [108]Molecular details of protein condensates probed by microsecond long atomistic simulationsJ. Phys. Chem. B 124:11671–11679
- [109]Structural and hydrodynamic properties of an intrinsically disordered region of a germ cell-specific protein on phase separationProc. Natl. Acad. Sci. U.S.A 114
- [110]The Equilibrium Theory Of Inhomogeneous PolymersNew York: Oxford University Press Inc.
- [111]On complex probabilitiesPhys. Lett. B 131:393–395
- [112]A Langevin approach to fermion and quantum spin correlation functionsJ. Phys. A: Math. Gen 16:L317–L319
- [113]Perturbation theory without gauge fixingScientia Sinica 24:483–496
- [114]New ghost-free infrared-soft gaugesPhys. Rev. D 33:540–547
- [115]Field-theoretic computer simulation methods for polymers and complex fluidsMacromolecules 35:16–39
- [116]Narrow equilibrium window for complex coacervation of tau and RNA under cellular conditionseLife 8
- [117]Salt dependent phase behavior of intrinsically disordered proteins from a coarse-grained model with explicit water and ionsJ. Chem. Phys 155
- [118]The effect of monomer polarizability on the stability and salt partitioning in model coacervatesSoft Matter https://doi.org/10.1039/D3SM00706E
- [119]Cross-talk of cation-n interactions with electrostatic and aromatic interactions: A salt-dependent trade-off in biomolecular condensatesJ. Phys. Chem. Lett 14:8460–8469
- [120]Salt triggers the simple coacervation of an underwater adhesive when cations meet aromatic n electrons in seawaterACS Nano 11:6764–6772
- [121]RNAs undergo phase transitions with lower critical solution temperaturesNat. Chem 15:1693–1704
- [122]Sodium ion influx regulates liquidity of biomolecular condensates in hyperosmotic stress responseCell Rep 42
- [123]Biomolecular condensates under extreme Martian salt conditionsJ. Am. Chem. Soc 143:5247–5259
- [124]Self-association of a highly charged arginine-rich cell-penetrating peptideProc. Natl. Acad. Sci. U.S.A 114:11428–11433
- [125]Bio-membrane internalization mechanisms of arginine-rich cell-penetrating peptides in various speciesMembranes
- [126]Molecular connectivity and correlation effects on polymer coacervationMacromolecules 50
- [127]Phase behavior and salt partitioning in polyelectrolyte complex coacervatesMarcomolecules 51:2988–2995
- [128]Polyelectrolyte-multivalent molecule complexes: physicochemical properties and applicationsSoft Matter 19:2013–2041
- [129]Tie-line analysis reveals interactions driving heteromolecular condensate formationPhys. Rev. X 12
- [130]Precipitation of highly charged polyelectrolyte solutions in the presence of multivalent saltsJ. Chem. Phys 103:5781–5791
- [131]Impact of arginine-phosphate interactions on the reentrant condensation of disordered proteinsBiomacromolecules 22:1532–1544
- [132]Ion binding with charge inversion combined with screening modulates DEAD box helicase phase transitionsCell Rep 42
- [133]Reversible generation of coacervate droplets in an enzymatic networkSoft Matter 14:361–367
- [134]Field theory description of ion association in phase separation of polyampholytesJ. Chem. Phys 156
- [135]Polyelectrolyte complex coacervation by electrostatic dipolar interactionsJ. Chem. Phys 149
- [136]Effect of solvent quality on the phase behavior of polyelectrolyte complexesMarcomolecules 54:105–114
- [137]Lower critical solution temperature in polyelectrolyte complex coacervatesACS Marco Lett 8:289–293
- [138]A change in conformational dynamics underlies the activation of Eph receptor tyrosine kinasesEMBO J 25:4686–4696
- [139]On a method of calculating quantum distribution functionsSoviet Physics Doklady 2:416–419
- [140]Calculation of partition functionsPhys. Rev. Lett 3:77–78
- [141]Random-phase-approximation theory for sequence-dependent, biologically functional liquid-liquid phase separation of intrinsically disordered proteinsJ. Mol. Liq 228:176–193
- [142]Charge pattern matching as a ‘fuzzy’ mode of molecular recognition for the functional phase separations of intrinsically disordered proteinsNew J. Phys 19
- [143]HOOMD-blue: A Python package for highperformance molecular dynamics and hard particle Monte Carlo simulationsComput. Mater. Sci 173
- [144]General purpose molecular dynamics simulations fully implemented on graphics processing unitsJ. Comput. Phys 227:5342–5359
- [145]Simulation methods for liquid-liquid phase separation of disordered proteinsCurr. Opin. Chem. Eng 23:92–98
- [146]Coarse-grained residue-based models of disordered protein condensates: Utility and limitations of simple charge pattern parametersPhys. Chem. Chem. Phys 20:28558–28574
- [147]Determination of alkali and halide monovalent ion parameters for use in explicitly solvated biomolecular simulationsJ. Phys. Chem. B 112:9020–9041
- [148]Picosecond orientational dynamics of water in living cellsNat. Commun 8
- [149]Coarse-grained models for simulations of multiprotein complexes: Application to ubiquitin bindingJ. Mol. Biol 375:1416–1433
- [150]Residue-residue potentials with a favourable contact pair term and an unfavourable high packing density term, for simulation and threadingJ. Mol. Biol 256:623–644
- [151]Constant pressure molecular dynamics algorithmsJ. Chem. Phys 101:4177–4189
- [152]A Liouville-operator derived measure-preserving integrator for molecular dynamics simulations in the isothermal-isobaric ensembleJ. Phys. A: Math. Gen 39:5629–5651
- [153]Self-assembly of coarse-grained ionic surfactants accelerated by graphics processing unitsSoft Matter 8:2385–2397
- [154]Fluctuation in electrolyte solutions: The self energyPhys. Rev. E 81
- [155]Numerical solutions of the complex Langevin equations in polymer field theoryMultiscale Modeling & Simulation 6:1347–1370
Article and author information
Author information
Version history
- Sent for peer review:
- Preprint posted:
- Reviewed Preprint version 1:
Copyright
© 2024, Lin et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 240
- downloads
- 22
- citations
- 0
Views, downloads and citations are aggregated across all versions of this paper published by eLife.