Conformational distributions of isolated myosin motor domains encode their mechanochemical properties
Abstract
Myosin motor domains perform an extraordinary diversity of biological functions despite sharing a common mechanochemical cycle. Motors are adapted to their function, in part, by tuning the thermodynamics and kinetics of steps in this cycle. However, it remains unclear how sequence encodes these differences, since biochemically distinct motors often have nearly indistinguishable crystal structures. We hypothesized that sequences produce distinct biochemical phenotypes by modulating the relative probabilities of an ensemble of conformations primed for different functional roles. To test this hypothesis, we modeled the distribution of conformations for 12 myosin motor domains by building Markov state models (MSMs) from an unprecedented two milliseconds of all-atom, explicit-solvent molecular dynamics simulations. Comparing motors reveals shifts in the balance between nucleotide-favorable and nucleotide-unfavorable P-loop conformations that predict experimentally measured duty ratios and ADP release rates better than sequence or individual structures. This result demonstrates the power of an ensemble perspective for interrogating sequence-function relationships.
Introduction
Myosin motors (Figure 1A) perform an extraordinary diversity of biological functions despite sharing a common mechanochemical cycle. For example, myosin-II motors power muscle contraction, whereas myosin-V motors engage in intracellular transport. This diversity is in part due to differences in myosins’ tails and light chain-binding domains, which influence properties like localization and multimerization (Krendel and Mooseker, 2005). However, some of this diversity is encoded in the motor domains themselves (Greenberg et al., 2016). These differences stem from variations in the tunings of the thermodynamics and kinetics of the individual steps of the myosins’ conserved mechanochemical cycle, which couples ATP hydrolysis to actin binding and the swing of a lever arm (De La Cruz and Ostap, 2004).
Two important and highly variable parameters for motor function are the rate of ADP release, which sets the speed of movement along actin, and the duty ratio, which is the fraction of time a myosin spends attached to actin during one full pass through its mechanochemical cycle. For example, in muscle, myosin-II motors are arranged into multimeric arrays called thick filaments and the individual motors typically have a strong preference for the actin free state (i.e. low duty ratio). These motors quickly detach after pulling on the actin filament to avoid creating drag for other motors in the array, much as a rower quickly removes their oar from the water to minimize drag. In contrast, individual myosin-Va motors have high duty ratios (i.e. prefer the actin-bound state), helping them to processively walk along actin filaments in intracellular transport. Similarly, the speed of myosin movement along actin (in the absence of opposing forces) is set by the rate of ADP dissociation (De La Cruz and Ostap, 2004), and it varies by four orders of magnitude from ~0.4 s−1 for non-muscle myosin-IIb (Nagy et al., 2013) to >2800 s−1 for myosin-XI (Ito et al., 2007).
Unfortunately, inferring the relationship between a motor’s sequence and its biochemical properties is not trivial. For example, one cannot simply predict the duty ratio or ADP release rate of a motor based on phylogeny. The myosin-V family contains both high duty ratio motors, like myosin-Va (De La Cruz et al., 1999), and low duty ratio motors, like myosin-Vc (Takagi et al., 2008). Similarly, ADP release rates within the myosin-II family vary from ~0.4 s−1 (non-muscle myosin-IIb) (Nagy et al., 2013) to >400 s−1 (extraocular myosin-II) (Bloemink et al., 2013; Johnson et al., 2019). Insertions and deletions in the myosin motor domain sequence also convey useful, but typically incomplete, information. For instance, pioneering biochemical work (Sweeney et al., 1998) demonstrated a correlation between the length of loop 1 and ADP release rates in myosin-II motors. However, this observation does not explain how other myosin isoforms that have virtually the same loop 1 lengths have ADP release rates that differ by an order of magnitude (Deacon et al., 2012). It is also difficult to predict the effects of mutations implicated in human disease, as these effects do not follow simply from the location of the mutation. For example, in human β-cardiac myosin, an A223T mutation causes a dilated cardiomyopathy (Ujfalusi et al., 2018) while an I263T mutation has the opposite effect, resulting in a hypertrophic cardiomyopathy (Tesson et al., 1998), despite the two sites being separated by less than 6 Å (Planelles-Herrero et al., 2017).
Structural studies have provided detailed pictures of many key states in the mechanochemical cycle, but have yet to enable the routine prediction of a motor’s biochemical properties from its sequence. For example, high-resolution structures have illuminated many shared features of myosin motor domains, such as the lever arm swing (Fischer et al., 2005) and conformational rearrangements associated with changes in nucleotide binding (Coureux et al., 2004; Rayment et al., 1993). They have also revealed the strain-sensing elements of myosin-I motors (Greenberg et al., 2015; Mentes et al., 2018; Shuman et al., 2014) and the binding modes of many small molecules (Allingham et al., 2005; Planelles-Herrero et al., 2017; Winkelmann et al., 2015). However, the structures of motor domains with vastly different biochemical properties are often nearly indistinguishable. Similarly, computer simulations have begun to reveal aspects of motor function (Blanc et al., 2018; Chinthalapudi et al., 2017; Hashem et al., 2017; Powers et al., 2019). However, simulating an individual motor domain (~700 residues) is a huge computational expense, so most simulation studies have been based on less than a microsecond of data. Thus, adding binding partners like actin to simulate the full mechanochemical cycle and infer properties like duty ratio is currently infeasible, especially if one wanted to compare multiple isoforms to infer sequence-function relationships.
Here, we investigate the possibility that the distribution of structures that an isolated motor domain explores correlates with its biochemical properties, allowing the prediction of sequence-function relationships. This hypothesis was inspired by a growing body of work showing that protein dynamics encode function (Henzler-Wildman and Kern, 2007; Knoverek et al., 2019), even in the absence of relevant binding partners (Bowman and Geissler, 2012; Hart et al., 2016; Porter et al., 2019a). In the case of myosin, we reasoned that as sequence changes modulate motors’ preferences for different states of the mechanochemical cycle, they likely also have a systematic effect on the distribution of conformations explored by the motor, even in the absence of binding partners. Therefore, comparing the distribution of conformations that isolated motor domains sample in solution should reveal signatures of their biochemical differences.
To test this hypothesis, we ran an unprecedented two milliseconds of all-atom, explicit-solvent molecular dynamics (MD) simulations of twelve myosin motors with diverse but well-established biochemical properties (Figure 1B, Tables 1, 2). Such simulations are adept at identifying excited states, which are lower probability conformational states that are often invisible to other structural techniques. Indeed, our simulations reveal a surprising degree of conformational heterogeneity, particularly in the P-loop (or Walker A motif), a common structural element for nucleotide binding that is highly conserved across myosin motor domains (Saraste et al., 1990). Because of its high conservation, we reasoned that the P-loop would report on the conformation of the nucleotide binding site while still being comparable between motors with otherwise differing sequences. To enable quantitative comparisons, we constructed Markov state models (MSMs) from the MD data for each motor. MSMs are network models of protein free energy landscapes composed of many conformational states and the probabilities of transitioning between these states. They are a powerful means to capture phenomena far beyond the reach of any individual simulation by integrating information from many independent trajectories (Bowman et al., 2013; Chodera and Noé, 2014). Analyzing our MSMs, we find they capture sufficient information about myosin motor domains’ thermodynamics and kinetics to produce reasonable estimates of duty ratio and ADP release rates. Thus, MD and MSMs constitute a powerful platform for identifying relationships between the sequence of individual motor domains and their mechanochemical cycles.
Results and discussion
In simulation, the P-loop adopts conformational states that are rare in crystal structures
We reasoned that any differences between myosin motor domains in nucleotide handling—ADP release rate or duty ratio, for instance—must somehow be manifest at the active site to have an effect. The P-loop is a highly conserved element of the myosin active site that plays an important role in interacting with the phosphates of the ATP substrate (Gulick et al., 1997). Consequently, we reasoned that the P-loop would report on the conformation of the nucleotide binding site while still being comparable between motors whose sequences differ elsewhere in the protein. To assess the degree of conformational heterogeneity captured by crystal structures, we first analyzed structures deposited in the PDB (Figure 2A). We queried the PDB (Berman et al., 2000) for myosin motor domains (see Materials and methods), yielding 114 crystal structures. Using sequence alignments (see Materials and methods) we identified the P-loop in each of these models and computed the backbone root mean square deviation (RMSD) of each of these models to a reference structure (β-cardiac myosin, PDB ID 4PA0) (Winkelmann et al., 2015). We found very little structural diversity among crystal structures, which rarely sample any conformations with P-loop backbone RMSD >0.6 Å away (Figure 2A).
Then, to assess the capacity of the P-loop to adopt conformations not observed in crystal structures, we used molecular dynamics to simulate the myosin motor domain. These simulations of human β-cardiac myosin (Hs MYH7) were performed in the actin-free, nucleotide-free state for roughly a quarter-millisecond in all-atom, explicit-solvent detail and were used to construct an MSM (see Materials and methods). All simulations were conducted using the same force fields and conditions that we have previously used to analyze other systems’ conformational distributions, including β-lactamases (Bowman et al., 2015; Porter et al., 2019a; Zimmerman et al., 2017), E. coli catabolite activator protein (Singh and Bowman, 2017), Ebola virus nucleoprotein (Su et al., 2018), and G-proteins (Sun et al., 2018). Then, using the MSM, we computed the distribution of backbone RMSDs of the P-loop relative to the reference crystal structure.
In contrast to the relative uniformity among crystal structures, simulations revealed extensive conformational heterogeneity in the P-loop (Figure 2B). Where crystal structures rarely sampled conformations with RMSD >0.6 Å, in simulation we observe broad sampling (i.e. high-probability density) in regions from 0.2 Å RMSD all the way to ~1.5 Å RMSD from the starting structure. Only 10 of 114 (9%) crystal structures’ conformations were >0.6 Å RMSD from the reference conformation, whereas fully 58% of the distribution observed in silico is above 0.6 Å RMSD from the reference conformation. These results suggest our simulations may provide mechanistic insight not previously accessible from crystal structures alone.
Simulations suggest that the nucleotide-free motor explores distinct nucleotide-favorable and nucleotide-unfavorable states
We reasoned that P-loop conformations identified by our simulations might have important implications for motors’ nucleotide handling. For example, modulating the relative probabilities of these conformations would provide a facile mechanism by which sequence variation might tune the mechanochemical cycle.
To assess the nucleotide compatibility of the P-loop conformations we observe in simulation, we sought to systematically compare these conformations with crystal structures with and without nucleotide. To do this, we built a map of P-loop conformational space using the dimensionality reduction algorithm Principal Components Analysis (PCA) to learn a low-dimensional representation of the pairwise interatomic distances between P-loop atoms that retains as much of the geometric diversity in the input as possible (see Figure 3—figure supplements 1–3, and Materials and methods for details) (Shlens, 2014). We then projected the states of our MSM built from our MYH7 simulations onto principal components (PCs) one and three to visualize the free energy surface sampled by our simulations (Figure 3A, green level sets). Principal component two chiefly reported on geometric differences between low-probability conformations (Figure 3—figure supplement 1). Using the same PCA, we then projected each crystal structure’s P-loop conformation into the PC1/PC3 space, plotting each as a point (Figure 3A, points). Points labeled with PDB IDs represent crystal structures with P-loops >0.6 Å backbone RMSD away from the reference structure 4PA0 used above. We also classified each structure (see Materials and methods) as nucleotide-bound (yellow points) or nucleotide-free (purple points). Then, we compared the frequency at which nucleotide-bound and nucleotide-free P-loop conformations were found in different regions of this map.
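The featurization and projection described above can be illustrated with a minimal sketch. It is not the pipeline used for the figures: the coordinates are hypothetical toy values, only the first principal component is extracted (by power iteration, for brevity), and the function names are illustrative.

```python
import math
from itertools import combinations

def pairwise_distances(coords):
    """Flatten one conformation (a list of (x, y, z) tuples) into its interatomic distances."""
    return [math.dist(a, b) for a, b in combinations(coords, 2)]

def first_principal_component(features, n_iter=200):
    """Return (mean, unit vector) of the top principal component via power iteration."""
    n, d = len(features), len(features[0])
    mean = [sum(row[j] for row in features) / n for j in range(d)]
    centered = [[row[j] - mean[j] for j in range(d)] for row in features]
    # Deterministic, non-uniform start vector to avoid accidental orthogonality.
    v = [float(j + 1) for j in range(d)]
    for _ in range(n_iter):
        # Apply the (unnormalized) covariance matrix implicitly: w = X^T (X v).
        s = [sum(c[j] * v[j] for j in range(d)) for c in centered]
        w = [sum(s[i] * centered[i][j] for i in range(n)) for j in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            raise ValueError("features have no variance along the search direction")
        v = [x / norm for x in w]
    return mean, v

def project(row, mean, v):
    """Coordinate of one feature vector along the principal component."""
    return sum((x - m) * c for x, m, c in zip(row, mean, v))

# Hypothetical toy "loop" of three atoms in which the third atom moves away.
conformations = [[(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.5, h, 0.0)]
                 for h in (1.0, 1.5, 2.0, 2.5)]
features = [pairwise_distances(c) for c in conformations]
mean, pc1 = first_principal_component(features)
scores = [project(f, mean, pc1) for f in features]
```

Because the same mean and component vector are reused for every input, simulated states and crystal structures projected this way land in one shared coordinate system, which is what makes the comparison in the text possible.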
This analysis revealed two dominant conformational states that likely constitute nucleotide-favorable and nucleotide-unfavorable states (Figure 3A and B). Once the distribution of P-loop conformations is projected onto two PCs (the green level sets in Figure 3A), we observe two broad minima in the P-loop conformational landscape. We refer to these apparent minima as the upper and lower basin for brevity but recognize that other minima may exist and be obscured by the projection of a high-dimensional space into a low-dimensional space. The lower basin (<0.6 Å RMSD from the reference structure) contains 91% of crystal structures (104/114) and, because 81% (84/104) of these structures are bound to nucleotide, it is highly likely to represent a nucleotide-compatible conformation. In contrast, despite being populated roughly equally in simulation, regions outside the lower basin (≥0.6 Å RMSD) contain only 9% (10/114) of crystal structures. And, because only one (10%) of these structures is nucleotide bound, these regions are significantly depleted in nucleotide-bound structures (odds ratio = 0.03, p<1.3×10−5 by Fisher’s exact test), strongly implying that they are less or not at all nucleotide compatible. Interestingly, this single exception (PDB ID 2Y8I, Dictyostelium discoideum myosin-II G680V) is a highly perturbed motor that has been shown to have low ATPase activity, low motility and a disordered allosteric network (Kinose et al., 1996; Patterson et al., 1997), potentially contributing to its aberrant conformation.
To characterize the structural differences between nucleotide-favorable and nucleotide-unfavorable states captured in the simulations, we coarse-grained our MSM into a model with just five states, called A-E. We used hierarchical clustering to group the thousands of states explored by Hs MYH7 into five states based only on their P-loop conformations (see Materials and methods). Then, using the assignment of each frame from our simulations to one of these five states, we fit a five-state MSM (Figure 3D, node sizes indicate equilibrium probabilities, arrow weights indicate transition probabilities). The most probable single state is the A state (49%), which encompasses the entire lower basin and, as we will see below, appears to form favorable interactions with nucleotide based on the conformation of the P-loop. The excited, apparently nucleotide-disfavoring conformations in the upper basin are split into 3 states, B-D, which together account for 50% of the equilibrium probability. Thus, β-cardiac myosin spends about equal time in nucleotide-favorable (state A) and nucleotide-unfavorable states (states B-D) in simulations. Finally, state E (1%, too low to be seen clearly in Figure 3A), involves a condensation of the P-loop into an extension of the HF helix, similar to the crystal structure 4L79 (Shuman et al., 2014). The reduced number of states in this MSM allowed us to inspect a small number of high-probability conformations near the mean of each P-loop state, which we took as exemplars of each of the five P-loop states.
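To make concrete how equilibrium probabilities follow from a coarse-grained MSM, the sketch below computes the stationary distribution of a row-stochastic transition matrix by repeated propagation. The two-state matrix is a hypothetical toy, not the fitted five-state model, and the function name is illustrative.

```python
def stationary_distribution(T, n_iter=10_000, tol=1e-12):
    """Equilibrium probabilities of a row-stochastic matrix T via power iteration."""
    n = len(T)
    pi = [1.0 / n] * n  # start from the uniform distribution
    for _ in range(n_iter):
        # One propagation step: new_j = sum_i pi_i * T[i][j]
        new = [sum(pi[i] * T[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(new, pi)) < tol:
            return new
        pi = new
    return pi

# Hypothetical two-state example: T[i][j] is the probability of moving
# from state i to state j within one lag time.
T_toy = [[0.9, 0.1],
         [0.2, 0.8]]
pi_toy = stationary_distribution(T_toy)  # approaches (2/3, 1/3)
```

In a five-state model, the same calculation would yield the node sizes of a diagram like Figure 3D, with the off-diagonal entries of the matrix corresponding to the arrow weights.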
Comparing the states of our MSM reveals that the dominant geometrical difference between nucleotide-favorable and nucleotide-unfavorable P-loop states is the orientation of the peptide bond between S180 and G181 (Figure 3C). In the nucleotide-favorable state A (Figure 3D, lower right inset), the S180 backbone carbonyl (shown in pink sticks with a white arrow) is oriented away from the phosphates of the nucleotide, enabling the nucleotide to bind to the active site. In contrast, nucleotide-disfavoring states (labeled B-D in Figure 3D) orient the S180 backbone carbonyl toward the phosphate groups of the nucleotide. This positions the carbonyl oxygen in a way that appears to sterically clash with the phosphates of nucleotide. It also orients the negative end of the carbonyl bond’s electric dipole toward the nucleotide binding site and the negatively charged phosphates of ADP and ATP. Taken together, our observations about the geometry of the excited, nucleotide-disfavoring state in the upper basin are consistent with a lowered capacity for nucleotide binding.
The balance between nucleotide-favorable and nucleotide-unfavorable P-loop states predicts duty ratio
We reasoned that motors with a higher probability of adopting nucleotide-favorable P-loop conformations in isolation are likely to have an increased affinity for nucleotide and, therefore, spend more time in nucleotide-bound states of the mechanochemical cycle. Our reasoning is that motors that prefer nucleotide-favorable P-loop conformations in isolation pay a lower energetic cost to adopt these same conformations when they form a complex with nucleotide. Supporting this logic, it has been observed that, absent load, a large free energy difference between ADP-bound and nucleotide-free states is associated with a high duty ratio (Bloemink and Geeves, 2011; Nyitrai and Geeves, 2004). Thus, we hypothesized that a preference for the nucleotide-favorable A state should correlate with high duty ratio.
To test whether differences in the probability of excited states encode information about duty ratio, we simulated an additional seven myosin isoforms of differing duty ratio for a total of ~2 ms of aggregate simulation in all-atom, explicit-solvent detail. Specifically, we simulated four human low duty ratio myosin motor domains (from myosin-II genes MYH13, MYH7, MYH10, and myosin-I gene MYO1B) and four human high duty ratio myosin motor domains (from genes MYO5A, MYO6, MYO7A, and MYO10), for between 125 and 325 µs each (see Materials and methods). These motors were selected because extensive kinetic characterization (Bloemink et al., 2013; De La Cruz et al., 2001; De La Cruz et al., 1999; Deacon et al., 2012; Homma and Ikebe, 2005; Lewis et al., 2012; Nagy et al., 2013; Watanabe et al., 2006) has revealed very diverse kinetic tuning, providing a robust test of our hypotheses. Because no crystal structure of the human sequence was available for any of these proteins except MYH7, homology models were built in each case and used as starting points for simulations (see Materials and methods and Table 1). To allow for direct comparisons between motors, we used the same PCA and state definitions as described above for MYH7.
As expected, high duty ratio motors have a stronger in silico preference for nucleotide-favoring P-loop states than low duty ratio motors (Figure 4A). Figure 4A shows an example of this effect on the P-loop conformational distributions of high duty ratio motor MYO6 and low duty ratio motor MYH7. The low duty ratio motor explores both upper and lower basins (Figure 4A, left) while the high duty ratio motor strongly prefers the lower basin (Figure 4A, right). Provocatively, when motors are crystallized without ligand, only motors with low unloaded duty ratios have been crystallized with P-loops outside the nucleotide-favorable conformation (Figure 4A, red and blue points). Of 29 unliganded crystal structures, 8/20 (40%) of low duty ratio motors’ P-loops crystallized outside the A state, whereas 0/9 (0%) high duty ratio motors’ P-loops crystallized outside state A (p<0.034 by Fisher’s exact test, see Materials and methods).
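The Fisher’s exact test quoted above can be reproduced from the stated counts with a short, dependency-free sketch. It assumes a two-sided test that sums the probabilities of all tables no more likely than the observed one, which is a common convention but is not confirmed by the text; the function name is illustrative.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(k):
        # Hypergeometric probability that the top-left cell equals k.
        return comb(row1, k) * comb(row2, col1 - k) / denom

    k_min, k_max = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # Sum every table at least as extreme (no more probable) than the observed one;
    # the small relative tolerance guards against floating-point ties.
    return sum(p for p in (prob(k) for k in range(k_min, k_max + 1))
               if p <= p_obs * (1 + 1e-9))

# Unliganded crystal structures, counts taken from the text:
# rows = low / high duty ratio, columns = P-loop outside / inside state A.
p_value = fisher_exact_two_sided(8, 12, 0, 9)
```

Under this convention the counts above give a p-value just below the reported bound of 0.034.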
Given this trend, we reasoned that the relative free energies of the nucleotide-favorable state and the nucleotide-disfavoring excited states would provide a useful predictor of a motor’s duty ratio. We assigned every whole-motor MSM state to one of the five P-loop states and used these assignments to compute the free energies of each of the five states for each of the eight motors (see Materials and methods). We then took the difference in free energy between states A and B, which are the two best sampled states and therefore give statistically robust results, and compared it with each motor’s experimentally measured duty ratio. Numerical values and references for the experimental duty ratios can be found in Table 2.
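As a hedged illustration of this step, the sketch below converts equilibrium probabilities of two states into a free energy difference through the Boltzmann relation, ΔG = −kT ln(p_A/p_B). The sign convention (negative when A is favored) follows the discussion of Figure 4B; the temperature of 300 K, the units, and the example probabilities are assumptions for illustration only.

```python
import math

K_B = 0.0019872041  # Boltzmann constant in kcal/(mol*K)

def free_energy_difference(p_a, p_b, temperature=300.0):
    """G_A - G_B in kcal/mol from equilibrium probabilities (negative if A is favored).

    The default temperature is an assumed value, not one taken from the text.
    """
    if p_a <= 0.0 or p_b <= 0.0:
        raise ValueError("both probabilities must be positive")
    return -K_B * temperature * math.log(p_a / p_b)

# Hypothetical probabilities for two motors.
dg_prefers_a = free_energy_difference(0.80, 0.10)  # strong A preference
dg_prefers_b = free_energy_difference(0.30, 0.45)  # mild B preference
```

Because only the ratio of the two probabilities enters, the result is insensitive to how much probability sits in the remaining, more sparsely sampled states.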
As expected, we find a strong correlation between motors’ duty ratios and their preferences for the nucleotide-favorable A state over the nucleotide-unfavorable B state (Figure 4B). Specifically, high duty ratio motors have a strong preference for the A state (negative free energy difference) while low duty ratio motors spend more time in state B (positive free energy difference). Decreased stability of the nucleotide-favorable conformation in these low duty ratio motors could explain this observation.
Simulations predict ADP release rates better than loop 1 length does by capturing sequence-specific effects
Because ADP release allows a motor to adopt nucleotide-incompatible P-loop conformations, we reasoned that the rate at which a motor can transition to these conformations in silico might correlate with in vitro ADP release kinetics. While we expect a correlation, we acknowledge that the absolute rates will almost certainly differ, since the rates themselves likely differ in the presence and absence of nucleotide. To test for a correlation, we first focus on data sets that examine several motors under the same experimental conditions. Identical conditions are important because in vitro biochemical rates depend strongly on experimental conditions such as salt and temperature (Chizhov et al., 2013; De La Cruz and Ostap, 2009; Lewis et al., 2012). We focus on low duty ratio motors, since their frequent transitions to nucleotide-unfavorable states make it possible to estimate their transition rates with confidence. In contrast, in high duty ratio motors, transitions between these states are sufficiently rare that their rates cannot be estimated with confidence.
An especially useful dataset for comparing relative ADP release rates was created by Sweeney et al., 1998, which carefully dissected the effect of variation in loop 1 length and sequence on ADP release rates using the same experimental conditions. These authors established a positive relationship between loop 1 length and ADP release rate using engineered constructs of chicken gizzard myosin-II (shown in Figure 5A, henceforth Gg MYH11). A notable exception, however, was the myosin with wild-type loop 1, which had an ADP release rate more than three times faster than predicted by the length-based model (Figure 5B). This deviation from a purely length-driven ADP release rate led these authors to hypothesize that there must also be sequence-specific effects of loop 1 on ADP release rate. They then identified an alanine mutant that ablated the sequence-specific effects of the wild-type loop (henceforth Gg MYH11-ala).
To assess the capacity of in silico P-loop kinetics to capture the experimentally measured ADP release rates in the constructs investigated by Sweeney et al., we simulated and analyzed four Gg MYH11 constructs. These constructs are a subset of the variants considered by Sweeney et al. We selected the wild-type loop (Gg MYH11-wt) because it was the primary outlier in their length-only model. We selected the alanine mutant (Gg MYH11-ala) because it, with just five mutations, shifted the wild-type loop in line with the length-only model proposed by Sweeney et al. Then, we selected the extreme points that were well fit by the loop length-only model: the loop 1 deletion (Gg MYH11-∆loop1) and the construct using the loop 1 from Xenopus non-muscle myosin (Gg MYH11-xeno). We simulated these four constructs for 6–16 µs each beginning from a homology model (see Materials and methods and Table S1) and built whole-motor MSMs which, as before, were used to compute five-state P-loop MSMs. Each P-loop MSM contains a parameter P(A→B) which captures the probability that a conformation in state A transitions to state B within a fixed period of time (known as the lag time of the model). We then compared P(A→B) to ADP release rates measured in vitro for these four constructs.
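The meaning of P(A→B) can be illustrated with a simple counting estimate: among all frames assigned to state A, the fraction found in state B one lag time later. This is a minimal sketch with hypothetical state sequences; the MSMs described here may have been fit with a different estimator (for example, one that enforces detailed balance), so the function below is an illustration rather than the method used.

```python
from collections import Counter

def transition_probability(trajectories, src, dst, lag):
    """Row-normalized count estimate of P(src -> dst) at a lag of `lag` frames."""
    counts = Counter()
    for traj in trajectories:
        # Sliding window over each trajectory; never count across trajectories.
        for i in range(len(traj) - lag):
            if traj[i] == src:
                counts[traj[i + lag]] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError(f"state {src!r} was never observed with a successor")
    return counts[dst] / total

# Hypothetical P-loop state assignments for two short trajectories.
toy_trajectories = ["AAABBA", "AABAAA"]
p_ab = transition_probability(toy_trajectories, "A", "B", lag=1)  # 2 of 7 -> 2/7
```

Counting within each trajectory separately is what lets many short, independent simulations be pooled into a single estimate.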
As expected, there is a strong positive relationship (Pearson’s R = 0.99) between the P(A→B) fit by our MSMs and in vitro ADP release rate (Figure 5C). This is stronger than the equivalent correlation for the length-based model (Pearson’s R = 0.72). Importantly, the rank order of the four isoforms is correct, whereas using a loop 1 length-only model dramatically underestimates the ADP release rate for the wild-type motor. Rank order is used because, as noted above, the timescales of the transition (mean first passage times from state A to B are on order of 5–500 nanoseconds) are not directly comparable to experimentally measured values because nucleotide is absent in the simulations. Together, the fact that the sequence change is small (only five residues differ between wild type and the alanine mutant) and the change is distant (~25 Å) from the P-loop indicate that our model is exquisitely sensitive to sequence, even at sites distant from the active site.
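The two comparisons used here, Pearson’s R and agreement in rank order, can be sketched as follows. The numbers are placeholder values chosen only to exercise the functions; they are not the measured rates or the fitted P(A→B) values, and the function names are illustrative.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def same_rank_order(xs, ys):
    """True if sorting by xs and sorting by ys orders the items identically."""
    idx = range(len(xs))
    return sorted(idx, key=lambda i: xs[i]) == sorted(idx, key=lambda i: ys[i])

# Placeholder values only (not data from this study).
predicted = [1.0, 2.0, 3.0, 4.0]
measured = [2.0, 4.0, 5.0, 9.0]
r = pearson_r(predicted, measured)
ranks_agree = same_rank_order(predicted, measured)
```

Rank agreement is the weaker but more robust criterion: it survives any monotonic rescaling between simulated and experimental timescales, whereas Pearson’s R additionally assumes the relationship is linear.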
P-loop kinetics in silico correlate with ADP release rates across conditions
To further assess the generalizability of our model, we considered several additional datasets that relax constraints placed on datasets in the previous section. First, we relaxed the constraint that motors differ by just one structural element (loop 1). Specifically, we considered several skeletal myosin isoforms, including MYH7 and MYH13, that Johnson et al., 2019 studied under the same conditions (Figure 5D and E, yellow points). These motor domains are an interesting case because, at 80% sequence identity, their sequences differ much more than Sweeney et al.’s constructs, and these differences are distributed throughout the protein. Crucially, and despite having roughly the same loop 1 length, their ADP release rates differ by about an order of magnitude (59 s−1 vs 400 s−1). Because Johnson et al.’s data were collected under different experimental conditions than Sweeney et al.’s data (5 mM MgCl2 at 25°C vs 1 mM MgCl2 at 20°C with different light chains), we only expect a general trend to hold, since motors’ properties are very sensitive to magnesium, temperature, and light chain identity (Chizhov et al., 2013; Heissler and Sellers, 2014; Lewis et al., 2012). Second, we assessed the trend in two human non-muscle motor domains, MYO1B and MYH10, with measurements carried out under different conditions. Notably, because they both release ADP very slowly, they test our model’s capacity to evaluate very slow ADP release rates.
Consistent with our expectations, and despite the diverse experimental conditions, we still observe a reasonable correlation between P(A→B) and ADP release across all datasets (Figure 5E, Pearson’s R = 0.75). This dramatically improves on the length-based model (Pearson’s R = 0.14). Importantly, under the matched experimental conditions for MYH7 and MYH13 we still find the correct order of ADP release rates (Figure 5E, yellow points), suggesting that this method generalizes well to the larger phylogenetic distances between myosin isoforms. Furthermore, MYO1B and MYH10 are correctly identified as very slow releasers of ADP, although the point estimates appear to be quite noisy. MYH10 is known to be exquisitely sensitive to light chains (Heissler and Sellers, 2014), so it is not surprising that it is one of the greatest outliers given that we did not include light chains in our simulations.
Structural models provide insight into the mechanism by which sequence influences P-loop conformational distributions
Even though the sequences of motors’ P-loops are identical, their conformational distributions differ. This suggests that interactions with other structural elements in the motor domain bias the P-loop’s conformational distribution and that our models capture these effects.
Although no single interaction is likely to completely explain the difference between conformational distributions, to investigate the mechanisms that contribute to this effect we examined the interactions of the P-loop with nearby sidechains. We then compared them between motors to understand how their presence or absence might bias the balance between A and B states for each motor. While an exhaustive analysis is beyond the scope of this work, we have highlighted two examples of such interactions in Figure 6.
First, we observed that the A state of the P-loop in the high duty ratio motor MYO6 is stabilized by an interaction between the backbone carbonyl oxygen of the P-loop serine (homologous to S180 in MYH7) and the sidechain amine group of the switch-II residue K670 (Figure 6A). A notable difference occurs in the low duty ratio myosin-II motors in our study (MYH13, MYH7, and MYH10), all of which feature an isoleucine at this position (MYH7 I674). Thus, where the strong interaction between the lysine sidechain and P-loop backbone stabilizes the nucleotide-compatible A state in MYO6, this interaction does not exist at all in MYH7, presumably destabilizing this state. Figure 6B shows that the sidechain of I674 in MYH7 almost never forms a direct interaction (distance <0.35 nm) with S180 even in P-loop state A, whereas K670 of MYO6 almost always does when the P-loop occupies the A state. We propose that the substitution of an aliphatic residue at this position in myosin-II motors destabilizes the nucleotide-favoring A state, leading to an increased preference for the nucleotide-disfavoring B state, ultimately resulting in a lower predicted duty ratio. Notably, however, many other low duty ratio myosin classes, such as MYO1B, which we simulated here and correctly identify as a low duty ratio motor, feature a lysine at this position, implying that this substitution may be a peculiar innovation limited to myosin-IIs.
Second, we also observed that the B state of the P-loop in the low duty ratio motor MYH7 is stabilized by an interaction between the backbone carbonyl oxygen of S242 (in the Switch-I loop) and the S180 sidechain hydroxyl group (Figure 6C), but that this interaction does not occur in the high duty ratio motor MYO7A (Figure 6D). This interaction is specific to the nucleotide-disfavoring B state of MYH7 (Figure 6D), and hence presumably stabilizes that P-loop state in MYH7 relative to MYO7A. As shown in Figure 6C, this interaction in MYH7 requires the Switch-I loop to move ‘inwards,’ toward the peptide bond between G464 and A463, with which it also sometimes interacts. At the position homologous to A463, however, the high duty ratio motor MYO7A features a phenylalanine (F439). We propose that the bulky, aromatic sidechain in MYO7A (F439) prevents the Switch-I loop from engaging the P-loop serine’s sidechain, whereas the small aliphatic one (MYH7’s A463) does not. On net, this leads to a lower overall preference for the nucleotide-disfavoring B state in MYO7A and thus a higher overall duty ratio prediction. Interestingly, MYO6 has an alanine at this position, indicating that this substitution is not strictly required for high duty ratio.
These examples demonstrate how physically realistic, atomically detailed models can provide mechanistic insight into how sequence variation modulates specific interactions to alter a protein’s function. Of course, there are many interactions at play, and consideration of multiple interactions is necessary to fully explain duty ratio. Therefore, a successful pipeline for predicting duty ratio will probably require additional molecular dynamics simulations for any new variant. Specifically, we suggest that a fruitful design strategy would be to select mutations based on logic like that outlined above, then to simulate the newly designed sequences in silico to check that they behave as intended, and then to perform experimental tests of these predictions. Such an approach has proved powerful in past applications to other proteins (Hart et al., 2016; Zimmerman et al., 2017).
Conclusions
In this work, we used computer simulations of isolated myosin motor domains to predict the in vitro ADP release rate and duty ratio of unloaded myosin motors. To do this, we identified systematic shifts in the distribution of conformations that a motor explores that correlate with changes in biochemistry, rather than by directly simulating the biochemical processes themselves, which would have been prohibitively expensive. While binding partners (actin and nucleotide, for instance) and structural elements outside the motor domain almost certainly affect the distribution of conformations, our results demonstrate that it is nevertheless possible to extract reasonable estimates for at least some unloaded biochemical properties from only the isolated motor domain’s conformational distribution. The ability of the isolated motor domain’s fluctuations to predict these parameters likely stems from a link between the isolated and bound conformational distributions. In other words, because the motor domain active site must adopt certain key conformations during its functional interactions with binding partners (i.e. nucleotide and actin), it is nearly guaranteed to at least transiently sample those conformations even in the absence of those binding partners. Importantly, our simulations only require a reasonable homology model as a starting point, so our methods should be applicable to a broad range of motor variants, including mutations implicated in disease.
Given the high degree of structural conservation of the myosin motor domain, it was not previously possible to directly predict the duty ratio or kinetics for a given myosin isoform from the sequence or structure of a motor domain alone. Our studies demonstrate that the duty ratio and the rate of ADP release are not captured by a single structural element, but rather by the distribution of conformations that the motor explores in solution. Throughout our simulations, we observed that the distribution of P-loop conformations is sensitive to relevant sequence changes, both large and small, throughout the myosin motor domain. Presumably, these changes are allosterically propagated through the myosin motor domain through complex networks of coupled motions. Thus, capturing the difference between the wild-type and alanine-substituted chicken gizzard myosins (Figure 5C), for instance, required the model to capture the allosteric perturbation induced by a change of a few dozen atoms in a molecule of ~12,500 atoms at a distance of ~25 Å (Figure 1A). Meanwhile, classifying the duty ratio of diverse myosin motors requires the P-loop to integrate signals from across the molecule into a single overall conformational preference. This underscores a key advantage of physics-based simulations, which is the ability to represent these allosteric networks by modeling in detail the complex, nonlinear couplings throughout the molecule.
One tantalizing interpretation of the excited states of the P-loop we observe in silico is that they may be related to the biochemically observed ‘open’ and ‘closed’ states that nucleotide-free myosin motors populate in vitro (Geeves et al., 2000). In our simulations, we see that the P-loop fluctuates between conformations that are nucleotide-compatible and conformations that probably are not. In biochemical experiments, at least some myosin isoforms in the nucleotide-free actin-bound state fluctuate between a state that binds nucleotide and a state that does not. It has also been shown that the equilibrium between these two biochemical states (Kα) correlates with duty ratio, and that the transition rate from the nucleotide-binding-incompetent state to the nucleotide-binding-competent state (k+α) correlates with the ADP release rate (Bloemink and Geeves, 2011). Similarly, we showed that the equilibrium between nucleotide-favorable and nucleotide-disfavorable conformations predicted duty ratio, while the rate of transition predicted ADP release rate. A simple explanation for these similarities is that there may be a correspondence between these biochemical states and the structural states that we observe in our MSMs in silico.
Finally, our results highlight the general capacity of computational modeling to link sequence and function. One immediate application of our work here is to estimate in silico the biochemical parameters of new or difficult-to-study myosins. In the near term, constructing such models could help us learn more about the atomic basis for healthy functional diversity in myosin motors, and how small changes can give rise to malfunction and disease. Indeed, in the coming years it may prove possible to use these models as a tool for studying patient-specific mutations by understanding the atomic basis for diseases caused by dysfunction of myosin motors or to aid in developing therapeutics. Finally, because we find no reason to believe our approach’s applicability is limited to myosin motors, we expect the techniques we have presented here to be of use for any protein where the physics that maps sequence to biochemistry is not straightforward.
Materials and methods
Preparation of homology models
For simulations, the initial structure of each myosin motor domain was prepared by first obtaining the full-length protein’s sequence from PubMed Protein, trimming the sequence down to include only the motor domain using crystal structure 4PA0 of MYH7 as a guide, and submitting that sequence to SWISS-MODEL for homology modeling (Waterhouse et al., 2018). Templates were chosen with a preference for those that were high-resolution, high sequence similarity, and in the rigor state. A complete list of sequences, templates, and motor domains can be found in Table 1.
Preparation of example myosin conformation
In Figure 1A, the position of ATP is based on ligand-bound crystal structure 1MMA (Münnich et al., 2014). The actin binding region was defined by all atoms within 10 Å of the actin filament after alignment to 6BNP chain K (Gurel et al., 2017).
Sequence alignments
All sequence alignments were performed with MUSCLE 3.8.1551 (Edgar, 2004b) using default parameters. Phylogenetic trees were inferred with the neighbor joining method using these alignments. Distances between sequences were k-mer distances (Edgar, 2004a).
Molecular dynamics simulations
GROMACS (Abraham et al., 2015; Berendsen et al., 1995) was used to prepare and to simulate all proteins. The protein structure was solvated in a dodecahedron box of TIP3P water (Jorgensen et al., 1983) that extended 1 nm beyond the protein in every dimension. Thereafter, sodium and chloride ions were added to produce a neutral system at 0.1 M NaCl.
Each system was minimized using steepest descents until the maximum force on any atom decreased below 1000 kJ/(mol × nm). The system was then equilibrated with all atoms restrained in place at 300 K, maintained by the Bussi-Parrinello thermostat (Bussi et al., 2007). After these equilibration runs, the restraints on heavy atoms were removed.
Molecular dynamics were performed using the AMBER03 force field (Duan et al., 2003). All covalent bonds involving hydrogen were constrained using LINCS (Hess et al., 1997). Virtual sites were used to allow for a 4 fs time step (Feenstra et al., 1999).
Production simulations were performed on a mixture of Folding@home (Shirts and Pande, 2000) and an in-house supercomputing cluster. The GPUs used were a mix of Tesla K20, Titan Xp, Tesla P100, and Quadro RTX 6000 cards, and the CPUs were Intel Xeon E5-2650 v2, Intel Xeon E5-2630 v3, Intel Xeon E5-2690 v4, and Intel Xeon Gold 6148 processors clocked at 2.4–2.6 GHz. Using GROMACS 2019.2, nodes featuring a Tesla K20 or Titan Xp produced ~22 ns/day, nodes featuring a Tesla P100 produced ~61 ns/day, and nodes featuring a Quadro RTX 6000 produced ~95 ns/day.
Markov state models
Fine-grained, whole-motor domain Markov state models were constructed by first defining microstates using the k-hybrid clustering algorithm with five rounds of k-medoids refinement, using the Euclidean distance between residue sidechain solvent accessible surface areas (scSASA) as a distance metric. This approach first appeared in Porter et al., 2019a and was chosen because it scales well for extremely large datasets compared to traditional RMSD clustering. The reasons for this are discussed in Porter et al., 2019b but, briefly, although scSASA calculations are initially expensive, they realize substantial performance gains in clustering because each frame’s scSASA need only be computed once. Each frame can be computed independently, allowing for massive parallelization. The approach also reduces the size of the input data, since only a single floating point number represents an entire residue, and allows the use of a cheaper distance metric (Euclidean distance rather than RMSD).
Markov state models were then fit for each variant by applying a 1/n pseudocount to each element of the transition counts matrix and row-normalizing, as recommended in Zimmerman et al., 2018. Lag times were chosen by the implied timescales test and by examining the equilibrium probability distribution for unrealistically overpopulated states (suggesting insufficient sampling of a particular transition or internal energy barriers). Important hyperparameters are listed in Table 3.
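The fitting step described above can be illustrated with a minimal sketch. It assumes a small, made-up transition counts matrix and uses only NumPy; the function names `fit_msm` and `equilibrium_populations` are hypothetical and are not taken from the software used in this work.

```python
import numpy as np

def fit_msm(counts):
    """Add a 1/n pseudocount to every element of an n x n transition
    counts matrix, then row-normalize to get transition probabilities."""
    counts = np.asarray(counts, dtype=float)
    n = counts.shape[0]
    smoothed = counts + 1.0 / n
    return smoothed / smoothed.sum(axis=1, keepdims=True)

def equilibrium_populations(tprobs):
    """Stationary distribution: the left eigenvector of the transition
    matrix with eigenvalue 1, normalized to sum to one."""
    evals, evecs = np.linalg.eig(tprobs.T)
    idx = np.argmax(evals.real)
    pi = np.abs(evecs[:, idx].real)
    return pi / pi.sum()

# Hypothetical counts for a three-state system (illustration only).
counts = np.array([[90, 10, 0],
                   [5, 80, 15],
                   [0, 20, 80]])
T = fit_msm(counts)
pi = equilibrium_populations(T)
```

Because the pseudocount makes every transition probability strictly positive, the stationary distribution in this sketch is unique even when some transitions were never observed.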
Construction of the P-loop free energy surface
Pairwise interatomic distances in the P-loop were computed using MDTraj (McGibbon et al., 2015), selecting all possible pairs of a backbone amide nitrogen and a backbone carbonyl oxygen atom in the GESGAG portion of the Walker A motif (i.e., the conserved P-loop sequence) that makes up the P-loop.
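As a rough illustration of this feature definition, the sketch below enumerates every nitrogen-oxygen pair across six residues, giving 6 × 6 = 36 distances per conformation. It works on plain coordinate arrays with NumPy rather than MDTraj, the coordinates are random placeholders, and the function name `ploop_distance_features` is hypothetical.

```python
import itertools
import numpy as np

def ploop_distance_features(n_xyz, o_xyz):
    """Distances between every backbone N and every backbone O.

    n_xyz, o_xyz: arrays of shape (n_residues, 3), one atom per residue.
    Returns the list of (N index, O index) pairs and their distances.
    """
    n_xyz = np.asarray(n_xyz, dtype=float)
    o_xyz = np.asarray(o_xyz, dtype=float)
    pairs = list(itertools.product(range(len(n_xyz)), range(len(o_xyz))))
    dists = np.array([np.linalg.norm(n_xyz[i] - o_xyz[j]) for i, j in pairs])
    return pairs, dists

# Placeholder coordinates (nm) for the six GESGAG residues; not real data.
rng = np.random.default_rng(0)
n_coords = rng.normal(size=(6, 3))
o_coords = rng.normal(size=(6, 3))
pairs, dists = ploop_distance_features(n_coords, o_coords)
```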
Principal components analysis (PCA) was performed on the 36-dimensional pairwise atomic distance vectors for each MSM microstate using the PCA implementation in sklearn (Pedregosa et al., 2011). No whitening was employed and the full SVD was calculated.
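For readers unfamiliar with this step, an unwhitened PCA computed from a full SVD can be sketched in a few lines of NumPy. This is an illustrative stand-in for the sklearn implementation named above, not the code used in this work; the input matrix is random placeholder data and `pca_full_svd` is a hypothetical name.

```python
import numpy as np

def pca_full_svd(features, n_components=4):
    """Unwhitened PCA via a full SVD of the mean-centered data.

    features: array of shape (n_samples, n_features).
    Returns the projected data, the principal axes (as rows), and the
    variance explained by each retained component.
    """
    X = np.asarray(features, dtype=float)
    centered = X - X.mean(axis=0)
    _, S, Vt = np.linalg.svd(centered, full_matrices=False)
    components = Vt[:n_components]
    explained_variance = (S ** 2) / (X.shape[0] - 1)
    projected = centered @ components.T  # no rescaling, i.e. no whitening
    return projected, components, explained_variance[:n_components]

# Placeholder input: 200 microstates x 36 P-loop distances (random values).
rng = np.random.default_rng(0)
features = rng.normal(size=(200, 36))
projected, components, variance = pca_full_svd(features)
```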
The surface was then estimated by constructing a weighted two-dimensional histogram in the PC1/PC3 plane with 50 bins between the minimum and the maximum data in each direction. The resulting array of probabilities was then converted into free energies in units of kT by taking the negative natural logarithm of each value. It was then convolved with a Gaussian of variance 0.3 per grid cell using scipy’s gaussian_filter method (Oliphant, 2007). The resulting array was then partitioned into six level sets.
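The histogram-to-free-energy conversion can be sketched as follows. This minimal example uses NumPy only and deliberately omits the Gaussian smoothing and level-set steps; the coordinates and weights are random placeholders standing in for projected microstates and their equilibrium probabilities, and `free_energy_surface` is a hypothetical name.

```python
import numpy as np

def free_energy_surface(x, y, weights, bins=50):
    """Weighted 2D histogram converted to free energies in units of kT.

    Bins span the minimum to the maximum of the data in each direction.
    Empty bins have zero probability and therefore infinite free energy.
    """
    hist, xedges, yedges = np.histogram2d(x, y, bins=bins, weights=weights)
    prob = hist / hist.sum()
    with np.errstate(divide="ignore"):
        free_energy = -np.log(prob)
    return free_energy, xedges, yedges

# Placeholder data: 1000 points with normalized random weights.
rng = np.random.default_rng(0)
pc1 = rng.normal(size=1000)
pc3 = rng.normal(size=1000)
w = rng.random(1000)
w /= w.sum()
F, xedges, yedges = free_energy_surface(pc1, pc3, w)
```

In this sketch, lower values of `F` mark more probable regions, and exponentiating the negative of the finite entries recovers the bin probabilities.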
Selection of myosin motor domain PDB crystal structures
We selected crystal structures to map onto the P-loop free energy landscape by querying the PDB (Berman et al., 2000) for all structures with a sequence identity to the motor domain of Hs MYH7 greater than 10%, a resolution of 5.0 Å or better, and a BLAST E-value less than 10⁻¹⁰. We then selected the largest chain in each crystal structure, used MUSCLE (Edgar, 2004b) to align that chain’s sequence to the motor domain of Hs MYH7, and used the resulting alignment to identify the P-loop. P-loop distances were computed and projected into the low-dimensional space as described above. Sequence bookkeeping and I/O relied heavily on scikit-bio (github.com/biocore/scikit-bio; scikit-bio development team, 2014).
Crystal structures were classified as bound to a nucleotide or nucleotide analogue if they contained a residue with the name ADP, ATP, ANP, MNQ, MNT, ONP, PNQ, DAE, DAQ, NMQ, AGS, AD9, AOV, or FLC.
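This classification rule amounts to a set-membership test on residue names. A minimal sketch, assuming the residue names of a structure have already been extracted into a list (the helper name `is_nucleotide_bound` is hypothetical):

```python
# Residue names treated as nucleotides or nucleotide analogues (from the text).
NUCLEOTIDE_RESNAMES = frozenset({
    "ADP", "ATP", "ANP", "MNQ", "MNT", "ONP", "PNQ",
    "DAE", "DAQ", "NMQ", "AGS", "AD9", "AOV", "FLC",
})

def is_nucleotide_bound(residue_names):
    """Return True if any residue name matches a listed nucleotide name."""
    return any(name.strip().upper() in NUCLEOTIDE_RESNAMES
               for name in residue_names)

# Made-up residue lists for illustration.
bound = is_nucleotide_bound(["GLY", "SER", "ADP", "HOH"])
unbound = is_nucleotide_bound(["GLY", "SER", "HOH"])
```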
Hierarchical clustering of the P-loop
The five coarse-grained MSM microstates for MYH7 were learned using agglomerative clustering on the four-dimensional P-loop features learned by PCA for the free energy surface. Ward linkage and a Euclidean distance metric were used. Briefly, the states are recursively combined in a way that minimizes the within-cluster variance until the specified number of clusters is reached. The number of clusters was increased until no obvious internal free energy barriers were seen in the four PC dimensions. Agglomerative clustering was implemented by sklearn 0.21.2 (Pedregosa et al., 2011).
Assignment of new conformations to P-loop states
P-loop state assignments for conformations of motors other than Hs MYH7 were made using a k-nearest neighbors (Pedregosa et al., 2011) approach. In this approach, a query conformation is assigned to a cluster based on the assignments of the nearest k points in the labeled dataset (i.e. MYH7). In other words, the nearest k points to the query point ‘vote’ on the assignment of the query point to a cluster. In our case, k was 5, but we did not observe any differences for values of k from 3 to 15.
Implementation of k-nearest neighbors was from sklearn 0.21.2. A ball tree was used to speed the search for neighbors (Omohundro, 1989).
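The voting scheme can be illustrated with a brute-force sketch in NumPy. Unlike the sklearn implementation described above, it does not use a ball tree, and the labeled points here are synthetic placeholders rather than MYH7 data; `knn_assign` is a hypothetical name.

```python
import numpy as np

def knn_assign(labeled_points, labels, queries, k=5):
    """Assign each query to the majority label among its k nearest
    labeled points (Euclidean distance, brute-force search).

    labels must be non-negative integers. Ties go to the lowest label.
    """
    labeled_points = np.asarray(labeled_points, dtype=float)
    labels = np.asarray(labels)
    queries = np.atleast_2d(np.asarray(queries, dtype=float))
    # Distance from every query to every labeled point.
    dists = np.linalg.norm(
        queries[:, None, :] - labeled_points[None, :, :], axis=2)
    nearest = np.argsort(dists, axis=1)[:, :k]
    votes = labels[nearest]
    return np.array([np.bincount(row).argmax() for row in votes])

# Two synthetic, well-separated clusters in a four-dimensional space.
rng = np.random.default_rng(0)
cluster0 = rng.normal(0.0, 0.1, size=(20, 4))
cluster1 = rng.normal(5.0, 0.1, size=(20, 4))
points = np.vstack([cluster0, cluster1])
point_labels = np.array([0] * 20 + [1] * 20)
assigned = knn_assign(points, point_labels,
                      [[0.05, 0.0, 0.0, 0.0], [5.0, 5.0, 5.0, 5.0]], k=5)
```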
Estimation of equilibrium probability of P-loop states
For each motor, the probability of a P-loop state was calculated by summing the equilibrium probabilities of all states in the whole-motor MSM assigned to that P-loop state.
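This summation is a weighted count over state assignments. A minimal sketch with made-up numbers, assuming P-loop states are encoded as integers starting at zero (the name `ploop_state_probabilities` is hypothetical):

```python
import numpy as np

def ploop_state_probabilities(eq_probs, assignments, n_ploop_states):
    """Sum MSM equilibrium probabilities over each P-loop state.

    eq_probs: equilibrium probability of each whole-motor MSM state.
    assignments: integer P-loop state assigned to each MSM state.
    """
    return np.bincount(assignments, weights=eq_probs,
                       minlength=n_ploop_states)

# Four MSM states assigned to three P-loop states (illustrative values).
eq_probs = np.array([0.1, 0.2, 0.3, 0.4])
assignments = np.array([0, 1, 0, 2])
state_probs = ploop_state_probabilities(eq_probs, assignments, 3)
```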
Biochemical properties of myosin motors
For each of the human myosin motors we simulated, an experimental duty ratio is available for either human or a vertebrate relative (e.g. cow, chicken) motor. Thus, wherever numerical duty ratios are reported (e.g. Figure 4B), these biochemical measurements are used. The experimentally-measured duty ratios and ADP release rates used in this work are shown in Table 2.
In our analysis of duty ratio and P-loop crystal position in Figure 4A, some constructs’ unloaded duty ratios have not been measured. For these motors, it was therefore necessary to infer whether they have high or low duty ratios from phylogeny. Specifically, we plotted: 4DBP, 2MYS, 3I5H, 2Y0R, 2BKH, 6I7D, 1DFK, 1OE9, 3I5I, 2OS8, 4P7H, 5V7X, 4ZLK, 1MNE, 1FMV, 2AKA, 3MYL, 2EC6, 4L79, 3L9I, 2BKI, 2Y9E, 1KK7, 1W8J, 2X51, 4PA0, 4PD3, 3I5G, and 1SR6. Based upon previous biochemical experiments, myosin-Is and IIs were assumed to have low duty ratios. Myosin-VIs were assumed to have high duty ratio. Myosin-Va and Vb from all organisms were assumed to have high duty ratios and Myosin-Vc was assumed to have a low duty ratio. Plasmodium falciparum MyoA (6I7D) has been shown to have a high duty ratio (Robert-Paganin et al., 2019).
Myosin class was inferred as follows. Where a Roman numeral was given in the PDB description (e.g. Myosin-II), this classification was used. Otherwise, if ‘muscle’ or ‘striated’ appeared in the PDB polymerDescription field, the myosin was classified as a myosin-II. Finally, in the absence of other indicators, myosins from Doryteuthis pealeii, Placopecten magellanicus, and Argopecten irradians were classified as Myosin-IIs, and myosins from Plasmodium falciparum were classified as Myosin-XIVs.
Visualization
Protein structures were visualized and rendered with PyMOL. Data plots were constructed with matplotlib (Hunter, 2007). Free energy surface colormaps were constructed with the cubehelix color system (Green, 2011).
Code and model availability
MSMs and starting conformations for each of the myosin constructs studied in this work have been uploaded to the Open Science Framework as project ID 54G7P, along with the parameters for the PCA used in Figures 2 and 3. This OSF project also includes a CSV that lists the P-loop definition, P-loop RMSD from the reference state, and assignment to P-loop state A-E for each crystal structure.
Data availability
MSMs and starting conformations for each of the myosin constructs studied in this work have been uploaded to the Open Science Framework as project ID 54G7P, along with the parameters for the PCA used in Figures 2 and 3. This OSF project also includes a CSV that lists the P-loop definition, P-loop RMSD from the reference state, and assignment to P-loop state A-E for each crystal structure.
References
- The structural basis of blebbistatin inhibition and specificity for myosin II. Nature Structural & Molecular Biology 12:378–379. https://doi.org/10.1038/nsmb908
- GROMACS: a message-passing parallel molecular dynamics implementation. Computer Physics Communications 91:43–56. https://doi.org/10.1016/0010-4655(95)00042-E
- The superfast human extraocular myosin is kinetically distinct from the fast skeletal IIa, IIb, and IId isoforms. Journal of Biological Chemistry 288:27469–27479. https://doi.org/10.1074/jbc.M113.488130
- Shaking the myosin family tree: biochemical kinetics defines four types of myosin motor. Seminars in Cell & Developmental Biology 22:961–967. https://doi.org/10.1016/j.semcdb.2011.09.015
- An Introduction to Markov State Models and Their Application to Long Timescale Molecular Simulation (book). Springer Science & Business Media. https://doi.org/10.1007/978-94-007-7606-7
- Canonical sampling through velocity rescaling. The Journal of Chemical Physics 126:014101. https://doi.org/10.1063/1.2408420
- Markov state models of biomolecular conformational dynamics. Current Opinion in Structural Biology 25:135–144. https://doi.org/10.1016/j.sbi.2014.04.002
- Jug: software for parallel reproducible computation in Python. Journal of Open Research Software 5:022109. https://doi.org/10.5334/jors.161
- Kinetic mechanism and regulation of myosin VI. Journal of Biological Chemistry 276:32373–32381. https://doi.org/10.1074/jbc.M104136200
- Relating biochemistry and function in the myosin superfamily. Current Opinion in Cell Biology 16:61–67. https://doi.org/10.1016/j.ceb.2003.11.011
- Kinetic and equilibrium analysis of the myosin ATPase. Methods in Enzymology 455:157–192. https://doi.org/10.1016/S0076-6879(08)04206-7
- Identification of functional differences between recombinant human α and β cardiac myosin motors. Cellular and Molecular Life Sciences 69:2261–2277. https://doi.org/10.1007/s00018-012-0927-3
- A point-charge force field for molecular mechanics simulations of proteins based on condensed-phase quantum mechanical calculations. Journal of Computational Chemistry 24:1999–2012. https://doi.org/10.1002/jcc.10349
- MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32:1792–1797. https://doi.org/10.1093/nar/gkh340
- Improving efficiency of large time-scale molecular dynamics simulations of hydrogen-rich systems. Journal of Computational Chemistry 20:786–798. https://doi.org/10.1002/(SICI)1096-987X(199906)20:8<786::AID-JCC5>3.0.CO;2-B
- Kinetic analyses of a truncated mammalian myosin I suggest a novel isomerization event preceding nucleotide binding. Journal of Biological Chemistry 275:21624–21630. https://doi.org/10.1074/jbc.M000342200
- A perspective on the role of myosins as mechanosensors. Biophysical Journal 110:2568–2576. https://doi.org/10.1016/j.bpj.2016.05.021
- Modelling proteins' hidden conformations to predict antibiotic resistance. Nature Communications 7:12965. https://doi.org/10.1038/ncomms12965
- Allosteric modulation of cardiac myosin dynamics by omecamtiv mecarbil. PLOS Computational Biology 13:e1005826. https://doi.org/10.1371/journal.pcbi.1005826
- Myosin light chains: teaching old dogs new tricks. BioArchitecture 4:169–188. https://doi.org/10.1080/19490992.2015.1054092
- LINCS: a linear constraint solver for molecular simulations. Journal of Computational Chemistry 18:1463–1472. https://doi.org/10.1002/(SICI)1096-987X(199709)18:12<1463::AID-JCC4>3.0.CO;2-H
- Myosin X is a high duty ratio motor. Journal of Biological Chemistry 280:29381–29391. https://doi.org/10.1074/jbc.M504779200
- Matplotlib: a 2D graphics environment. Computing in Science & Engineering 9:90–95. https://doi.org/10.1109/MCSE.2007.55
- Kinetic mechanism of the fastest motor protein, Chara myosin. The Journal of Biological Chemistry 282:19534–19545. https://doi.org/10.1074/jbc.M611802200
- The ATPase cycle of human muscle myosin II isoforms: adaptation of a single mechanochemical cycle for different physiological roles. Journal of Biological Chemistry 294:14267–14278. https://doi.org/10.1074/jbc.RA119.009825
- Comparison of simple potential functions for simulating liquid water. The Journal of Chemical Physics 79:926–935. https://doi.org/10.1063/1.445869
- Glycine 699 is pivotal for the motor activity of skeletal muscle myosin. The Journal of Cell Biology 134:895–909. https://doi.org/10.1083/jcb.134.4.895
- Advanced methods for accessing protein Shape-Shifting present new therapeutic opportunities. Trends in Biochemical Sciences 44:351–364. https://doi.org/10.1016/j.tibs.2018.11.007
- Mechanism of action of myosin X, a membrane-associated molecular motor. Journal of Biological Chemistry 280:15071–15083. https://doi.org/10.1074/jbc.M500616200
- Calcium regulation of myosin-I tension sensing. Biophysical Journal 102:2799–2807. https://doi.org/10.1016/j.bpj.2012.05.014
- MDTraj: a modern open library for the analysis of molecular dynamics trajectories. Biophysical Journal 109:1528–1532. https://doi.org/10.1016/j.bpj.2015.08.015
- Kinetic characterization of nonmuscle myosin IIb at the single molecule level. Journal of Biological Chemistry 288:709–722. https://doi.org/10.1074/jbc.M112.424671
- Adenosine diphosphate and strain sensitivity in myosin motors. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences 359:1867–1877. https://doi.org/10.1098/rstb.2004.1560
- Python for scientific computing. Computing in Science & Engineering 9:10–20. https://doi.org/10.1109/MCSE.2007.58
- Cold-sensitive mutants G680V and G691C of Dictyostelium myosin II confer dramatically different biochemical defects. Journal of Biological Chemistry 272:27612–27617. https://doi.org/10.1074/jbc.272.44.27612
- Scikit-learn: machine learning in Python. Journal of Machine Learning Research 12:2825–2830.
- Enspara: modeling molecular ensembles with scalable data structures and parallel computing. The Journal of Chemical Physics 150:044108. https://doi.org/10.1063/1.5063794
- A structural model for actin-induced nucleotide release in myosin. Nature Structural & Molecular Biology 10:826–830. https://doi.org/10.1038/nsb987
- The P-loop--a common motif in ATP- and GTP-binding proteins. Trends in Biochemical Sciences 15:430–434. https://doi.org/10.1016/0968-0004(90)90281-F
- Quantifying allosteric communication via both concerted structural changes and conformational disorder with CARDS. Journal of Chemical Theory and Computation. https://doi.org/10.1021/acs.jctc.6b01181
- Kinetic tuning of myosin via a flexible loop adjacent to the nucleotide binding pocket. Journal of Biological Chemistry 273:6262–6270. https://doi.org/10.1074/jbc.273.11.6262
- Human myosin Vc is a low duty ratio, nonprocessive molecular motor. Journal of Biological Chemistry 283:8527–8537. https://doi.org/10.1074/jbc.M709150200
- Dilated cardiomyopathy myosin mutants have reduced force-generating capacity. Journal of Biological Chemistry 293:9017–9029. https://doi.org/10.1074/jbc.RA118.001938
- Drosophila myosin VIIA is a high duty ratio motor with a unique kinetic mechanism. Journal of Biological Chemistry 281:7151–7160. https://doi.org/10.1074/jbc.M511592200
- SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Research 46:W296–W303. https://doi.org/10.1093/nar/gky427
- Choice of adaptive sampling strategy impacts state discovery, transition probabilities, and the apparent mechanism of conformational changes. Journal of Chemical Theory and Computation 14:5459–5475. https://doi.org/10.1021/acs.jctc.8b00500
Article and author information
Author details
Funding
National Institutes of Health (R01GM12400701)
- Gregory R Bowman
National Institutes of Health (R01HL141086)
- Michael J Greenberg
National Institutes of Health (T32GM02700)
- Artur Meller
National Institutes of Health (F30HL146052)
- Justin R Porter
National Science Foundation (MCB-1552471)
- Gregory R Bowman
Burroughs Wellcome Fund (Career Award at the Scientific Interface)
- Gregory R Bowman
David and Lucile Packard Foundation (Fellowship for Science and Engineering)
- Gregory R Bowman
Monsanto Company (Graduate Fellowship)
- Maxwell I Zimmerman
Washington University in St. Louis (Center for Biological Systems Engineering Fellowship)
- Maxwell I Zimmerman
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We are extremely grateful to the citizen scientists of Folding@home for their generous donation of computing resources. We are also grateful to Prof. Eric Galburt, Prof. John Edwards, and Dr. Joshua Alinger for their insight and helpful comments about this work. We are also grateful to the Center for High Performance Computing at the Mallinckrodt Institute of Radiology for computer time.
This work was funded by National Institutes of Health grants R01GM12400701 (GRB), R01HL141086 (MJG), T32GM02700 (AM), and F30HL146052 (JRP), and by National Science Foundation CAREER Award MCB-1552471 (GRB). GRB holds a Career Award at the Scientific Interface from the Burroughs Wellcome Fund and a Packard Fellowship for Science and Engineering from the David and Lucile Packard Foundation. MIZ was supported in part by a Monsanto Graduate Fellowship and a Center for Biological Systems Engineering Fellowship.
Copyright
© 2020, Porter et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.