Investigating the trade-off between folding and function in a multidomain Y-family DNA polymerase

  1. Xiakun Chu
  2. Zucai Suo
  3. Jin Wang  Is a corresponding author
  1. Department of Chemistry, State University of New York at Stony Brook, United States
  2. Department of Biomedical Sciences, College of Medicine, Florida State University, United States
18 figures, 2 tables and 1 additional file

Figures

Figure 1 with 1 supplement
DPO4 folding thermodynamics.

(A) The 2D folding free energy landscape of DPO4 projected onto the fractions of total (Q(Total)) and interdomain (Q(Inter)) native contacts for the default SBM parameter ρ0 at the folding temperature Tfρ0. There are six folding states identified on the free energy landscape, which are denoted by U, I3, I3, I2, I1, and N, with corresponding typical DPO4 structures shown. The all-atom representations of DPO4 were reconstructed based on the Cα structures from the simulations (Rotkiewicz and Skolnick, 2008). Domains of DPO4 are labeled with different colors: the finger domain (F, blue, residues 11–77), the palm domain (P, red, residues 1–10 and 78–166), the thumb domain (T, green, residues 167–229), the little finger domain (LF, purple, residues 245–341), and the flexible linker (gray, residues 230–244), which connects the T and LF domains. (B) The 1D folding free energy landscape of DPO4 projected onto Q(Total) for different values of the ratio of interdomain to intradomain native contact strength, denoted by ρ and ranging from 0.5 to 1.5 (indicated by different colors). It is worth noting that the I3 and I3 states cannot be distinguished by Q(Total). The black line corresponds to the free energy landscape for the default SBM parameter ρ0. We set the zeros of the free energies at the lowest Q(Total) detected from the simulations. (C) Change in stability of each folding state for ρ relative to that for ρ0 at the corresponding folding temperature Tfρ according to the expression ΔΔF(ρ)S=ΔF(ρ)S-ΔF(ρ0)S, where S represents the folding state (I3, I2, I1, or U) and ΔF(ρ)S is the stability of the folding state S at ρ. ΔF(ρ)S is calculated as the free energy difference between the folding state S and the unfolded state U:ΔF(ρ)S=F(ρ)SF(ρ)U, where F is the free energy obtained from (B).

Figure 1—figure supplement 1
Native contact maps of DPO4 for the folding states U, I3, I3, I2, I1, and N.

Native contact maps of DPO4 for the folding states U, I3, I3, I2, I1, and N at the folding temperature Tfρ0.

Figure 2 with 7 supplements
DPO4 folding free energy landscapes, backtracking, and folding cooperativity for different values of ρ.

(a, B) 2D folding free energy landscapes of DPO4 for (A) ρ=0.5 and (B) ρ=1.5. (C) Averaged Q(Inter) versus Q(Total) for different values of ρ at the corresponding folding temperature Tfρ calculated from the 2D free energy landscapes. The black line corresponds to the result with the default parameter ρ0. The two shaded regions show the increase followed by the decrease in Q(Inter) with increasing Q(Total) for ρ close to its default value ρ0, indicating possible backtracking during DPO4 folding. (D) Averaged Q(Inter) versus Q(Total) at a temperature 0.96Tf and for the default parameter ρ0, as calculated from the kinetic simulations. The colored lines are the observations for the individual folding events in all the simulations (a total number of 100), and the black line indicates the average. The domain structure in DPO4 in the backtracking regions is described by the fractions of native contacts Q(LF), Q(Palm), Q(Finger), and Q(Thumb), the last two of which are shown in (E) and (F), respectively. There are two separate regions in backtracking region II, implying two distinct DPO4 structures. (G) Thermodynamic coupling index (TCI) for the default parameter ρ0. (H) Dependence of the mean thermodynamic coupling index (MTCI) on ρ. The black and gray lines show the MTCI of the intact DPO4 and the four domains in DPO4, respectively. The dashed line indicates the MTCI calculated from the independent folding simulations of the four domains of DPO4, reminiscent of the extreme case with ρ=0.

Figure 2—figure supplement 1
2D free energy landscapes for DPO4 folding with different values of ρ.

2D folding free energy landscapes of DPO4 for different values of ρ at the corresponding folding temperatures Tfρ.

Figure 2—figure supplement 2
Averaged Q(Inter) and Q(Total) at a temperature 0.96Tf for different values of ρ.

One hundred independent simulations starting from different unfolded states of DPO4 for each value of ρ.

Figure 2—figure supplement 3
Representative DPO4 folding trajectory to illustrate backtracking.

(A) DPO4 folding trajectory projected onto Q(Total) and Q(Inter). The two shaded regions represent two backtracking processes. (B) DPO4 folding trajectory, including two backtracking and one successful folding processes. (C) Averaged Q(Inter) versus Q(Total). The magnitude of the increase relative to the subsequent decrease in Q(Inter) as Q(Total) increases, which is denoted by ΔQ, is used to characterize the backtracking process. (In practice, we define backtracking by ΔQ>0.1.) The backtracking and folding processes are shown in terms of the fraction of each intradomain folding (D) and interdomain formation (E).

Figure 2—figure supplement 4
Number of backtracking observed for different values of ρ.

(A) Number of backtracking observed versus Q(Total) for different values of ρ. NBacktracking is the number of backtracking characterized by the averaged Q(Inter) and Q(Total) (Figure 2—figure supplement 3) and NTraj is the number of folding events in the simulations at a temperature 0.96Tf. (B) Number of backtracking occurring in various regions for different values of ρ. (C) and (D) are the same as (A) and (B), but at room temperature Tr.

Figure 2—figure supplement 5
Native contact maps and structures of DPO4 at backtracking.

All-atom representations of DPO4 reconstructed based on the Cα structures from the simulations (Rotkiewicz and Skolnick, 2008).

Figure 2—figure supplement 6
Melting curve of each domain/interface and two-state sigmoidal fitting.

(A) Melting curves of Q for each intradomain, interdomain, and linker of DPO4 for ρ0. The dashed line indicates the folding temperature. (B) One example of a melting curve of the P domain of DPO4 and its fitting to the two-state sigmoidal transition to obtain the folding probability curve PI(T), where I is the index of the intra- or interdomain and T is the temperature. The data for the P domain shown here are chosen for the purposes of illustration. Similar results can be observed for the other intra- and interdomains for different values of ρ. The fitting procedure follows the standard protocol instructed by Sborgi et al., 2012.

Figure 2—figure supplement 7
TCI for different values of ρ.

TCII,J between different domains/interfaces (I,J) for different values of ρ.

Figure 3 with 3 supplements
DPO4 folding kinetics at room temperature Tr.

(A–C) Kinetic rates quantified by the mean first passage time (MFPT) for (A) intradomain, interdomain, and total folding, (B) individual intradomain folding, and (C) interdomain formation. MFPT is in units of τ, which is the reduced time unit used in our simulations. Error bars represent the standard deviations at the corresponding MFPT. A folding is defined as when the corresponding Q is higher than 0.75. (D–F) Folding order probability OPkI of individual intra- and interdomain occurs during a successful DPO4 folding event with (D) ρ=0.5, (E) the default parameter ρ=ρ0=1.0, and (F) ρ=1.5. k is the order index of the folding step and I is the index of the domain/interface of DPO4. There are eight domains/interfaces in DPO4, and thus eight folding steps are defined. (G) Standard deviation of OPk from its mean OPk=18IOPkI18 at folding step k for different values of ρ. The black line shows the average σOPk versus ρ.

Figure 3—figure supplement 1
OPkI for different values of ρ.

Folding order probability OPkI of intra- and interdomains for different values of ρ.

Figure 3—figure supplement 2
DPO4 folding kinetics at a temperature 0.96Tf.

Kinetic rates and folding order of individual domains/interfaces during a successful DPO4 folding event for different values of ρ at a temperature 0.96Tf. The data at ρ=0.50 are absent because an SBM with such a parameter value could not accumulate a statistically sufficient number of folding events for analysis.

Figure 3—figure supplement 3
OPkI for different values of ρ at a temperature 0.96Tf.

Folding order probability OPkI of intra- and interdomains for different values of ρ at a temperature 0.96Tf.

Figure 4 with 5 supplements
DPO4–DNA binding thermodynamics and kinetics.

(A) 1D free energy landscape of DPO4–DNA binding versus the distance root-mean-square deviation of DPO4–DNA binding native contacts dRMS for different values of ρ. dRMS has units of length and deviates from 0 nm as unbinding proceeds. The binding process can be divided into four states: US, EC, IS, and BS. The insert at the top left shows the binding affinity for different values of ρ, with the gray line corresponding to the experimental measurements (3–10 nM). The insert at the bottom right is a magnified view of the binding free energy landscape focusing on the transition between the BS and the IS. The error analysis was done by performing four independent umbrella sampling simulations with different initial conditions. Error bars are omitted from this insert for clarity. (B) The four-state DPO4–DNA binding process. Typical corresponding DPO4–DNA structures obtained from the simulations for different binding states are shown from two perpendicular viewpoints. There are three transitions in the four-state DPO4–DNA binding process. The ρ-dependent DPO4–DNA binding kinetics in terms of these three transitions are shown in (CE). (C) Diffusion coefficient D of free DPO4 (in the absence of DNA) for different values of ρ. D reflects the essence of KCap. Error bars represent the standard deviations from the corresponding mean values of D. τ is the reduced time unit. (D) Encounter times of the EC evolving to the IS with different values of ρ. The encounter time was calculated from the expression kDis/kEvo+1. The insert is a plot of encounter times versus the mean root-mean-square fluctuation (RMSF) of DPO4 in the free state. The mean RMSF was obtained from the free DPO4 simulations by averaging all the residues in DPO4 (Figure 4—figure supplement 3). (E) Transition times τtrans between the BS and the IS for different values ρ. τtrans was calculated from 100 independent simulations, and the errors were estimated by a bootstrap analysis with 50 subsamples.

Figure 4—figure supplement 1
Conformational distributions of DPO4 in the IS and BS.

Conformational distributions of DPO4 in the IS and BS during the DNA-binding process. QCT is defined as QbinaryDPO4(F-LF)-Q(T-LF), where Q(T-LF) is the fraction of native contacts between the T and LF domains in the apo-DPO4 structure and QbinaryDPO4(F-LF) is the fraction of native contacts between the F and LF domains in the binary DPO4–DNA structure. QCT=-1.0 when DPO4 is in the apo form, and QCT=1.0 when DPO4 is in the binary form. DPO4 with QCT=0.0 has fully broken the LF interdomain interaction and thus possesses an extended LF domain with flexible linker. Typical structures of DPO4 are shown for illustration at QCT-1.0, 0.0, and 1.0. The native contacts of the LF interdomain are shown when QCT-1.0 and 1.0 by the yellow dashed lines.

Figure 4—figure supplement 2
Free DPO4 diffusion for different values of ρ.

(A) Distributions of the diffusion coefficient D of DPO4 for different values of ρ. D is calculated as r2/6t, where r2 is the mean square displacement (MSD) for the center of mass of DPO4 with respect to the initial reference structure. In practice, D was calculated directly by using the command “g_msd” implemented in Gromacs (4.5.7) (Hess et al., 2008). 1000 trajectories (each 1000τ long, giving an accumulated length of 106τ ) were used for each value of ρ to calculate D. (B) The most probable D obtained from (A) for different values of ρ. The values of the most probable D at different values of ρ are very similar.

Figure 4—figure supplement 3
Structural fluctuations of free DPO4 for different values of ρ.

Structural fluctuations of free DPO4 for different values of ρ in terms of (A) RMSF, (B) mean RMSF averaged over all residues, and (C) RMSD from the native PDB structure. Error bars represent the standard deviations from the corresponding mean values.

Figure 4—figure supplement 4
Thermodynamic barrier heights and kinetic times for the transitions between the IS and the BS during DPO4–DNA binding for different values of ρ.

(A) Free energy barriers of the transitions between the IS and the BS for different values of ρ. The error bars represent the standard deviations from the corresponding mean values. (B, C) Correlations between thermodynamic free energy barrier heights and kinetic times for transitions (B) from the IS to BS and (C) from the BS to the IS for different values of ρ.

Figure 4—figure supplement 5
Comparison of thermodynamic and kinetic stabilities of the IS and BS during DPO4–DNA binding for different values of ρ.

A direct comparison between the thermodynamic and kinetic results can be made by comparing the thermodynamic and kinetic stabilities as suggested previously (Wang et al., 2017). The thermodynamic stability is defined as ΔFBSIS=FBS-FIS, where F is the free energy obtained from the free energy landscape, while the kinetic stability is defined as kTrln(τISBS/τBSIS), where τtrans is the transition time calculated from the kinetic simulations. (A) Correlation between thermodynamic and kinetic stability. (B) Correlation between the kinetic stabilities of the IS and BS. τtrans* was calculated from the free energy landscape (see Appendix 1) and thus is the result from the thermodynamic simulation.

Illustration of intra- and interdomain interactions in the trade-off between DPO4 folding and function.

The four domains of DPO4 are indicated using the same color scheme as in Figure 1A. The sizes of the arrows indicate the magnitudes of the rates or fluxes for folding (binding). DPO4 and DPO4–DNA binding complexes shown in lighter shades are less stable or populated than the others.

Appendix 3—figure 1
Extracting folding temperature Tfρ from the heat capacity curves and melting curves of Q(Total) for different values of ρ.

Tf can be determined as the position of the prominent peak on heat capacity curve (black lines) or the midpoint of melting curves of Q(Total) (grey lines).

These two methods generate very similar Tf, so Tf from heat capacity was finally chosen.

Appendix 3—figure 2
Comparisons of the thermodynamics between the reweighing method (solid lines) and direct REMD simulations (dots).

(A) Comparisons of folding temperatures. The REMD simulations with directly applying four different ρ, which are (B) ρ=0.5, (C) ρ=0.8, (D) ρ=1.2 and (E) ρ=1.5, were performed and analyzed. In (B-E), comparisons of heat capacity curves (blue) and melting curves of Q(Total) (red) with these four different ρ are shown.

Appendix 3—figure 3
Assessment of shifting folding temperature with ρ to that with the default parameter ρ0 in order to perform kinetic simulations.

Comparisons are made between the reweighting method and the direct REMD simulations with rescaled VSBMapoDPO4.

Appendix 3—figure 4
Folding thermodynamics of intradomain in DPO4.

(A) Folding free energy landscapes of intradomain in DPO4 for different values of ρ. Temperatures are the corresponding folding temperatures varied by different ρ. Folded and unfolded states are marked at the Q with the free energy minima. (B) The stability of the folded states to unfolded states varied by different ρ. The stability is defined as the free energy difference between the folded and unfolded states, obtained in (A).

Appendix 3—figure 5
Folding thermodynamics of interdomain in DPO4.

(A) Folding free energy landscapes of interdomain in DPO4 for different values of ρ. (B) The stability of the folded states to unfolded states varied by different ρ.

Appendix 3—figure 6
The averaged Q(Inter) along with Q(Total) at temperature Tr for different values of ρ.
Appendix 3—figure 7
Melting curves of Q for each intra-, interdomain of DPO4 for different values of ρ.
Appendix 3—figure 8
MTCI along with ρ.

The dashed line represents the MTCI of removing the melting curves of the F and F-P from the intact DPO4.

Appendix 3—figure 9
MTCI for independent folding of the four individual domains with different strengths of VSBMDomain modulated by a pre-factor γ.

The reweighting method is used for calculations of MTCI at different γ via the REMD simulations performed at γ=1.0.

Appendix 3—figure 10
The coarse-grained Cα structures for the P domain from PDB (left) and modeling (right).

Five glycines were added to connect the residue 10 to 77. The structure was used for the independent folding simulation of the P domain.

Appendix 3—figure 11
Biased DPO4–DNA binding simulations performed by the umbrella sampling.

The overall overlaps of the probability distribution on dRMS between neighboring replicas are apparent, indicating a sufficient sampling.

Appendix 3—figure 12
The calculated binding affinity Kd along with dRMSb/u at different values of ρ.

The shadow region corresponds to the experimental binding affinity of 3–10 nM. The dashed line indicates the dRMSb/u (3.25 nm) used in our study. The theoretical Kd at ρ=0.7 is equal to 2.24 nM, approximating to the experimental measurements (3–10 nM).

Appendix 3—figure 13
The number of contacts (NC) formed by the same charged beads in DPO4 during the DNA binding.

The contacts are further classified into the (A) two-body and (B) three-body types. A contact is considered to form when the two beads are within the range of 0.5 nm, which is the distance for the two oppositely charged beads having DH potential strength equal to the native contact strength (1.0). The two-body (pairwise) contacts can be formed with a relatively low chance indicated by both the average and maximum contact numbers. At the same time, DPO4 is almost devoid of the three-body contacts formed by the same charged residues. This features no abnormally high number of same charges accumulated in a limited spatial space.

Tables

Key resources table
Reagent type
(species) or resource
DesignationSource or referenceIdentifiersAdditional
information
Software algorithmGROMACS(version 4.5.7)DOI:10.1002/jcc.20291RRID:SCR_014565
Software, algorithmPLUMED(version 2.5.0)DOI:10.1016/j.cpc.2013.09.018https://www.plumed.org/
Software, algorithmMATLABMathWorksRRID:SCR_001622
Software, algorithmGnuplot(version 5.2)http://www.gnuplot.info/RRID:SCR_008619
Software, algorithmPymol(version 1.8)Schrdödinger,IncRRID:SCR_000305
Appendix 2—table 1
The native contact numbers of the intra- and interdomain, as well as the flexible Linker in apo-DPO4 PDB structure (PDB: 2RDI).

The total native contact number of the apo-DPO4 structure is 933, among which the intradomain native contact number is 787 and the number of interdomain native contacts that are mostly formed by the sequential neighbor domains, is 90. There are only two interdomain contacts formed by the non-sequential neighbor domains (P-LF interdomain interface). The number of contacts formed by the flexible Linker is 54. In DPO4–DNA binary PDB structure (PDB: 2RDJ), the T-LF contacts in apo-DPO4 PDB structure are fully broken and at the same time, 11 contacts are formed between the F and LF domains. These contacts are termed as the specific binary DPO4–DNA native contacts in constructing the 'double-basin’ SBM potential (VSBMbinaryDPO4). The other contacts remain the same in apo-DPO4 and binary DPO4–DNA structures.

F domainP domainT domainLF domainLinker
F domain14436011∗0
P domain3625632232
T domain032130229
LF domain11∗22225711
Linker0329112
  1. * These 11 contacts are only formed in binary DPO4–DNA structure.

Additional files

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Xiakun Chu
  2. Zucai Suo
  3. Jin Wang
(2020)
Investigating the trade-off between folding and function in a multidomain Y-family DNA polymerase
eLife 9:e60434.
https://doi.org/10.7554/eLife.60434