Abstract
The rational design of proteins that maintain structural integrity under concurrent thermal, mechanical, and chemical stress remains a challenge in molecular engineering. We present a hierarchical framework that transforms an α-helical domain into an ultrastable scaffold by integrating AI-guided design with foundational chemical principles. This approach progresses from global architectural reinforcement, using multiple AI tools to create a stabilized four-helix bundle, to local chemical tuning, where AlphaFold3 guides the installation of salt bridges and metal-coordination motifs. A computational pipeline using physics-based screening such as molecular dynamics (MD) simulations efficiently distilled millions of designs into a minimal candidate set. The resulting α-helical proteins exhibit unprecedented multi-axis stability, with mechanical unfolding forces exceeding 200 pN, thermal resilience >100 °C, and high resistance to chemical denaturants. By systematically dissecting the contributions of hydrophobic packing, electrostatics, and metal coordination, we establish a general blueprint for imparting extreme robustness. This work bridges AI-driven structural generation with chemical precision, advancing the creation of durable proteins for mechanistic studies and synthetic biology.
Introduction
Proteins that maintain their structural integrity when heated, stretched, or exposed to harsh solvents are foundational to both life (Manteca, et al., 2017; Neupane, et al., 2024; Rief, et al., 1997) and serve as invaluable building blocks for advanced biomaterials (Fu, et al., 2023; Miserez, et al., 2023; Sivasankaran, et al., 2024), molecular devices (Devaux, et al., 2021; Yang, et al., 2024), and operations in extreme environments (Bustamante, et al., 2004; Pepelnjak, et al., 2024). Among natural scaffolds, helical architectures are particularly appealing targets due to their ubiquity and mechanical functionality: coiled-coils contribute to muscle and other cytoskeletal assemblies (Baumann, et al., 2017; Schwaiger, et al., 2002), talin rod domains act as mechanosensors that unfold under load (del Rio, et al., 2009; Le, et al., 2019; Tapia-Rojo, et al., 2023), and arrays of α-helical bundles underlie the erythrocyte membrane skeleton (Rief, et al., 1999; Takahashi, et al., 2018; Wensley, et al., 2010). These precedents motivate the goal of transforming common α-helical folds into programmable components with superior, multi-axis stability for diverse applications.
However, the rational design of such multi-faceted stability into isolated helical domains remains challenging (Baker, et al., 2017). The inherent mechanical fragility of the α-helix is a fundamental limitation; under tensile load, its hydrogen bonds rupture sequentially, leading to low unfolding forces of just tens of piconewtons (pN) (Carrion-Vazquez, et al., 2000; Hoffmann, et al., 2013; Law, et al., 2003; LeBlanc, et al., 2021; Muller, et al., 2021; Neuman and Nagy, 2008; Randles, et al., 2007), well below the hundreds of pN reached by mechanically robust β-sheet topologies pulled in shear (Brockwell, et al., 2003). This inherent fragility persists even in the presence of stable hydrophobic cores (Collet, et al., 2005; Craig, et al., 2004; Lee, et al., 2006; Lim, et al., 2008; Milles, et al., 2017; Minin, et al., 2017; Ng, et al., 2007).
While decades of research have identified key chemical determinants of protein stability like hydrophobic packing, electrostatics, and metal coordination, with both natural and engineered examples showing these factors can enhance resilience (Baker, et al., 2015; Goldenzweig and Fleishman, 2018; Kellis, et al., 1988; Marqusee and Baldwin, 1987; Nandi and Ainavarapu, 2022; Wang, et al., 2021; Zhang, et al., 2025), progress in increasing mechanical strength of isolate helical proteins has been a meaningful yet incremental process (Lopez-Garcia, et al., 2021; Tunn, et al., 2018). For example, only a few natural coiled-coils (e.g., in fibrinogen) approach 100 pN, while canonical α-helical bundles such as spectrin and ankyrin repeats typically unfold near 50 pN under comparable loading. The key challenge we address is thus to move beyond adding individual stabilizing features, and instead to precisely combine these different chemical interactions within a single architecture so they work together to resist multiple forms of stress.
The recent rise of generative AI for protein design has dramatically expanded the accessible fold space (Dauparas, et al., 2022; Jumper, et al., 2021; Lin, et al., 2023; Sakuma, et al., 2024; Watson, et al., 2023), offering a path to overcome the architectural limitations of natural scaffolds. However, a critical gap persists. AI models excel at global structure generation but often lack the understanding of local chemical interaction networks required for extreme, multi-axis stability. These models generate vast candidate libraries, creating a screening bottleneck that impedes the experimental validation essential for iterative, chemically rational design. A method that seamlessly integrates generative design with highly efficient screening is needed to bridge this gap.
Here, we treat multi-axis protein stabilization as a problem of hierarchical chemical engineering. We introduce an AI-guided blueprint that progresses systematically from global architectural reinforcement to atomistically precise chemical installation. First, AI-enabled backbone construction generates stable scaffolds with optimized hydrophobic cores, establishing a foundational stability layer. We then apply precision functionalization, installing a minimal set of inter-helical salt bridges and bi-histidine metal-coordination motifs through rational, site-specific edits. This stepwise approach allows us to quantitatively dissect the mechanical contribution of each engineered chemical interaction (Sakuma, et al., 2024).
To make this hierarchical chemical design tractable, we implement a computational pipeline that leverages developability filters (Kyte and Doolittle, 1982), foldability assessment (Jumper, et al., 2021; Lin, et al., 2023), and critically molecular dynamics (MD) simulations (Karplus and McCammon, 2002; Milles, et al., 2018; Rennekamp, et al., 2024) to rank candidates based on proxies for mechanical and thermal resilience. This pipeline compresses ∼106 in silico designs to a manageable number for experimental validation. The result is a family of ultrastable α-helical proteins where we can quantitatively attribute stability gains to specific chemical engineering steps: hydrophobic core optimization, electrostatic interaction, and metal-coordination clamping. This work provides a general, rational blueprint for the precision engineering of protein energy landscapes, transforming a mechanically weak architecture into a durable platform.
Results
Hierarchical design of ultrastable proteins by integrating AI and chemical principles
To establish a framework for programming multi-axis stability, we developed a hierarchical design strategy that integrates architectural and chemical stabilization principles. We selected the human spectrin repeat R15 (PDB code: 3F57) (Ipsaro, et al., 2009), a three-helix bundle that unfolds at ∼50 pN and melts near 50 °C, as our design template (Figure 1a-b). These moderate baseline properties provide an ideal starting point for quantitatively evaluating the contribution of each engineered stabilization layer. Our approach comprises two distinct stages (Figure 1c): Stage I establishes a stabilized architectural framework through AI-guided backbone construction and computational screening, while Stage II applies precision functionalization with site-specific salt bridges and metal-coordination motifs to reinforce key structural interfaces.

A hierarchical chemical blueprint for multi-axis stabilization of α-helical protein.
a, The erythrocyte membrane skeleton, which relies on spectrin repeats to withstand shear stress. Inset: Structure of a single, naturally fragile spectrin repeat (PDB 3F57), with hydrophobic core residues shown. b, The mechanical unfolding pathway of a spectrin repeat under tensile force, probed computationally by steered molecular dynamics (SMD) to generate force-extension curves. c, Overview of the two-stage design strategy. Stage I (Architectural Stabilization): AI-guided backbone construction and computational screening generate four-helix designs with optimized hydrophobic cores. Stage II (Precision Functionalization): Rational installation of inter-helical salt bridges and metal-coordination motifs reinforces specific mechanical interfaces. d, Stage I, Backbone construction. RFdiffusion appends a fourth helix to the native three-helix template, generating 100 initial backbones. Five optimal four-helix scaffolds are selected, and ProteinMPNN is used to generate 100,000 sequences per scaffold. e, Stage I, Computational screening. A multi-step funnel prioritizes candidates through successive filters: developability (GRAVY score ≤ −0.3), foldability (ESMFold triage followed by AlphaFold2 refinement with RMSD ≤ 2.0 Å and pLDDT ≥ 90), and stability assessed via molecular dynamics. The process efficiently narrows ∼106 initial designs to an experimentally tractable shortlist.
In Stage I, we expanded the native architecture by appending a fourth helix to the spectrin template using RFdiffusion (Figure 1d) (Watson, et al., 2023). We generated 100 backbone variants (50 each for N- and C-terminal extensions) and selected five optimal four-helix scaffolds based on bundle geometry, helical registry, and the potential for hydrophobic core densification without steric clash (Lombardi, et al., 2019). For each selected scaffold, we used ProteinMPNN to generate 100,000 sequences optimized for the remodeled core, producing a total library of 5 × 105 candidate sequences (Liu, et al., 2025).
To identify the best designs fulfilling our stability criteria, we implemented a multi-stage computational funnel that progresses from coarse filtering to high-fidelity functional prediction (Figure 1e). The library was first subjected to a developability screen using physicochemical criteria (GRAVY < −0.3), retaining about 300,000 sequences with high predicted solubility and yield. Foldability was then assessed through a two-tiered model: ESMFold provided an initial screening to 357 designs, which were subsequently refined with AlphaFold2 under strict confidence thresholds (pLDDT > 90, RMSD < 2.0 Å). This process yielded 211 high-confidence designs for further analysis.
Structural characterization revealed that the designed proteins exhibited enhanced packing, with a lower relative solvent-accessible surface area (Porollo, et al., 2013) (rSASA ≈ 0.35) compared to the natural spectrin repeat (rSASA ≈ 0.40). This was further supported by a higher proportion of residues with rSASA < 0.2 (designs: >30% vs. natural spectrin: 23%, see Raw Data 1), indicating successful hydrophobic core optimization (Porollo, et al., 2013).
We then implemented MD simulations to prioritize candidates based on functional stability metrics. Steered MD (SMD) simulations served as a mechanical proxy, stretching each domain under constant-velocity pulling to obtain relative stability rankings (Figure 1b) (Ni, et al., 2024). Complementary annealing MD (AMD) simulations assessed thermal resilience through temperature ramping (310 K to 473 K) while monitoring structural integrity. These physics-based simulations provided the discriminatory power necessary to select candidates exhibiting target multi-axis stability. By integrating SMD and AMD results, we selected a final experimental shortlist that balances predicted mechanical and thermal resilience.
From this refined set, three variants, originally designated SpecAI87788, SpecAI16941, and SpecAI95889 (Figure 1—Figure supplement 1) based on their AI-generated identifiers, were selected for experimental validation based on their superior combined SMD and AMD performance (Figure 1—Figure supplement 2 and Figure 1—Figure supplement 3, Figure 1—video 1-Figure 1—video 10). For clarity in subsequent analyses, these were renamed SpecAI88, SpecAI41, and SpecAI89, reflecting the last two digits of their original IDs.
The funnel compressed ∼106 AI-generated sequences to 3 best candidates, preserving both scaffold and sequence diversity while prioritizing top performers. This selection process demonstrates how computational methods can efficiently navigate vast design spaces to identify optimal candidates for experimental characterization.
Experimental verification of multi-axis stability in designed proteins
From the computational shortlist, we selected three representative designs, SpecAI88, SpecAI41, and SpecAI89, for experimental validation. All three expressed soluble in E. coli with high yields (∼500 mg/L in TB, Table 1). SDS-PAGE showed the expected band near 19 kDa (Figure 2—Figure supplement 1), and mass spectroscopy confirmed molecular weights of 19,760, 19,108 and 19,285 Da, respectively (Figure 2a). Far-UV circular dichroism (CD) spectroscopy was performed, revealing that all three proteins adopted predominantly α-helical structures, as indicated by the characteristic double minima at 208 nm and 222 nm (Figure 2b) (Greenfield, 2007).

Stability of AI-designed spectrin with four α-helices

Thermal and chemical robustness of AI-designed spectrin variants.
a, MALDI-TOF mass spectrometry confirm molecular weights for SpecAI88, SpecAI41, and SpecAI89. b, Far-UV circular dichroism (190–260 nm) shows α-helical signatures with minima at 208 and 222 nm. Temperature-dependent CD (20–100 °C, blue to red) indicates substantial retention of ellipticity at 195 nm, with melting temperatures exceeding 100 °C. c, CD at 222 nm recorded in GdnHCl demonstrates persistence of α-helical signal at high denaturant concentrations (∼3 M), indicating high chemical resistance.
To evaluate thermal stability, we performed temperature-dependent circular dichroism (CD) measurements from 20°C to 100°C. All three designed proteins maintained substantial helical structure at 100°C, as evidenced by the retention of characteristic ellipticity at 195 nm, demonstrating exceptional thermodynamic stability (Figure 2b, inset). We further challenged the proteins by performing thermal denaturation in the presence of the chemical denaturant guanidine hydrochloride (GdnHCl) (Liu and Thirumalai, 2025). Notably, all designs preserved their secondary structure at denaturant concentrations up to 3 M and maintained significant helical content under elevated temperature conditions (∼50°C, 222 nm), demonstrating remarkable resistance to combined chaotropic stress (Figure 2c). These results confirm not only the proper folding of these monomeric α-helical scaffolds but also validate that our Stage I designs, featuring optimized hydrophobic cores, achieve exceptional stability against both thermal and chemical denaturation.
Next, we directly quantified their mechanical stability using atomic force microscopy-based single-molecule force spectroscopy (AFM–SMFS). Each designed SpecAI domain was fused in series with three GB1 fingerprint domains and pulled via the high-strength Coh–Doc handle (rupture force ∼400 pN), enabling unambiguous single-molecule identification (Figure 3a) (Deng, et al., 2019; Shi, et al., 2022; Stahl, et al., 2012). All experiments were conducted in a standard buffer at a pulling speed of 1000 nm/s unless otherwise specified. Force-extension curves were fitted with the worm-like chain (WLC) model to determine the contour length increment (ΔLc) for each unfolding event (Marko and Siggia, 1995).

Stage I four-helix bundle designs exhibit enhanced mechanical stability by single-molecule force spectroscopy.
a, Schematic of the AFM–SMFS experimental setup. A dockerin (Doc)-functionalized AFM tip engages a cohesin (Coh)-tagged SpecAI construct immobilized on the surface, enabling single-molecule pulling. The construct includes three GB1 domains as fingerprint markers. b, Representative force-extension curves. Curve 1: Unfolding of the natural spectrin repeat (ΔLc ≈ 32 nm, red), followed by three GB1 (ΔLc ≈ 18 nm per domain, black). Curves 2-4: Unfolding of SpecAI variants (ΔLc ≈ 53 nm). Sometimes, the Doc unfolds showing additional peak (Curve 3). Dashed lines are worm-like chain model fit. c, Unfolding force histograms of SpecAI (bin size= 30 pN) with Gaussian fits demonstrate a significant increase in mechanical stability compared to the native spectrin repeat (56 ± 3 pN, n = 212, bin size=13.75 pN): SpecAI88, 116 ± 2 pN (n = 224); SpecAI41, 156 ± 4 pN (n = 218) and SpecAI89, 121 ± 4 pN (n = 174). The corresponding ΔLc distributions are centered near 52 nm, consistent with the full unfolding of the designed domain.
As expected, the natural spectrin control (102 amino acids in length) exhibited a ΔLc of 32 nm, in addition to the unfolding steps of the three GB1 domains (18 nm each) (Cao, et al., 2006). In contrast, the designed SpecAI proteins showed a significantly larger ΔLc of ≈52 nm (Figure 3b, Figure 1—Figure supplement 1), consistent with the full unraveling of their ∼150-residue domain. The measured values: 52 ± 2 nm (SpecAI88), 53 ± 1 nm (SpecAI41), and 52 ± 2 nm (SpecAI89) (mean ± s.e.m., from Gaussian fit), closely matched theoretical predictions, thereby confirming the correct assignment of the designed domain’s unfolding (Figure 3c).
Most importantly, all three SpecAI variants exhibited markedly elevated unfolding forces compared to natural spectrin (56 ± 3 pN; n = 212). The unfolding forces reached 116 ± 2 pN (SpecAI88; n = 224), 156 ± 4 pN (SpecAI41; n = 218), and 121 ± 4 pN (SpecAI89; n = 174) (Figure 3c, Table 1). The combination of accurate ΔLc, the presence of three GB1 fingerprint domains, and consistent force statistics confirms single-molecule specificity and establishes a new mechanical baseline for α-helical proteins created through Stage I backbone design.
Precision scaffold functionalization via hierarchical chemical design
Building upon the stabilized architectural frameworks from Stage I, we implemented a precision functionalization strategy to further enhance stability through the rational design of specific chemical interactions at inter-helical interfaces. This stage involved two complementary approaches: engineering of electrostatic salt bridges and incorporation of metal-coordination motifs (Figure 4a).

Precision stabilization of designed proteins via electrostatic interactions and metal coordination.
a, Schematic of Stage II design: introducing inter-helical ion pairs and metal-coordination sites into AI-designed backbones to stabilize specific interfaces. b, AlphaFold3-predicted structure of variant SpecAI41-9K152D, showing an engineered salt bridge (Lys9-Asp152, 3 Å) designed for electrostatic stabilization without perturbing the core. The color of protein is based on the pLDDT value, showing a high confidence structure prediction (>80). c, A representative force-extension curve for the salt-bridge variant shows an unfolding event with a ΔLc of 53 nm, consistent with the parent scaffold. d, Unfolding force histograms reveal a ∼25 pN increase in mechanical stability for salt-bridge variants compared to their Stage-I parents. e, Introduction of a metal-binding site in variant SpecAI41-9K152D-6H153H, with two histidines positioned 2 Å apart, compatible with Ni2+ coordination. f, A representative force-extension curve recorded in 200 µM Ni2+ shows the unfolding event (ΔLc ≈ 53 nm). g, Unfolding force histograms confirm enhanced mechanical stability, with forces reaching ∼200 pN, a significant gain over the salt-bridge parent.
Our design strategy followed a rigorous hierarchical selection process. First, we identified potential mutation sites by excluding residues critical for hydrophobic core integrity, focusing instead on positions at inter-helical interfaces that could accommodate new interactions without perturbing the existing structural stability. Second, we employed AlphaFold3 (AF3) structure prediction to evaluate potential mutations, requiring that designs not only maintain high overall structural confidence (pLDDT > 80 for all mutated residues, with many exceeding 90 but also satisfy precise geometric criteria for the intended molecular interactions, specifically in terms of inter-atomic distances.
We designed and incorporated single inter-helical salt bridges into each optimized parent scaffold: SpecAI41-9K152D (Figure 4b, Figure 1—Figure supplement 1),
SpecAI88-49E102K, and SpecAI89-25E48K (Figure 4—Figure supplement 3). AF3 models confirmed that the engineered residues were positioned at interface-facing turns with favorable i↔i’ or i±1 helical register arrangements without introducing steric clashes. Crucially, all designed salt bridges showed charged atom distances <0.4 nm in the predicted structures, and the contacts remained stable following backbone relaxation simulations. This consistency indicates genuine local interface reinforcement rather than global structural reorganization. All mutated backbone atoms maintained high pLDDT scores (>80), confirming the preservation of structural integrity. For example, the pLDDT scores for the Cα atoms were 90.1 and 80.7 for SpecAI41-9K152D.
AFM-SMFS measurements revealed ∼30 pN increases relative to Stage-I parents with ΔLc unchanged (∼53 nm), as expected. Unfolding forces were 180 ± 4 pN (n = 197) for SpecAI41-9K152D (Figure 4c–d), 141 ± 3 pN (n = 202) for SpecAI88-49E102K, and 153 ± 4 pN (n = 264) for SpecAI89-25E48K (Figure 4—Figure supplement 3, Table 2). The constant ΔLc across parents and salt-bridge variants indicates that electrostatic installation does not change topology or domain length, but raises the mechanical barrier along the established pathway.

SpecAI stability further enhanced by site-specific functionalization
To achieve additional mechanical reinforcement, we designed bi-histidine motifs capable of forming intra-domain Ni(II)-coordination sites63. These motifs were strategically positioned on adjacent helices to create bidentate metal-binding pockets without perturbing the hydrophobic core. Our design criteria targeted Ni-N(His) distances of ∼2.0 Å, Cα-Cα spacing compatible with inter-helical bridging rotamers, and proper side-chain orientation (Nδ1/Nε2) for optimal imidazole coordination geometry. This approach yielded three metal-coordination variants: SpecAI41-9K152D-6H153H, SpecAI88-49E102K-6H149H and SpecAI89-25E48K-24H135H (Figure 4, Figure 1—Figure supplement 1 and Figure 4—Figure supplement 3), whose structures were validated using AF3 again.
In the presence of 200 µM Ni2+, AFM-SMFS revealed a further significant increase in mechanical stability over the salt-bridge parents, with unfolding forces reaching 208 ± 3 pN (n = 222) for SpecAI41-9K152D-6H153H, 190 ± 3 pN (n = 232) for SpecAI88-49E102K-6H149H, and 173 ± 3 pN (n = 204) for SpecAI89-25E48K-24H135H, and an unchanged ΔLc of ∼53 nm (Table 2). These results are consistent with a short, inner-interface coordination clamp that reinforces the native unfolding pathway without introducing alternative topologies.
We further characterized the unfolding kinetics of the computationally designed proteins via dynamic force spectroscopy across a range of pulling speeds, focusing on the SpecAI41 series due to its highest unfolding force. The unfolding forces of the three SpecAI41 exhibited the expected linear dependence on the logarithm of the loading rate (Figure 4—Figure supplement 4). Fitting the data to the Bell-Evans model yielded a consistent distance to the transition state (Δx ≈0.19 nm) for all three constructs, while the spontaneous unfolding rate (koff) decreased progressively from SpecAI41 (0.44 s-1) to the salt-bridge (0.09 s-1) and metal-coordination (0.05 s-1) variants. This kinetic profile further confirms that the engineered stabilizations raise the activation barrier for unfolding without altering the unfolding pathway.
Similarly, we assessed the thermal stability of the Stage II variants (Figure 4—Figure supplement 5). The results demonstrate that the engineered salt bridges and metal-binding sites maintained excellent thermodynamic stability with Tm above 100 °C. In chemical stability tests, the Stage II variants still retained considerable helical content under elevated temperature but their tolerance to GdnHCl decreased slightly compared to the Stage I variants.
This hierarchical computational design methodology (Figure 5), progressing from architectural stabilization to precision functionalization, enabled the predictable enhancement of protein stability through the additive contribution of distinct chemical interaction networks.

Hierarchical computational design of ultrastable proteins through multi-scale stabilization.
Schematic summarizing the two-stage design strategy for additive mechanical reinforcement. Stage I establishes a stable architectural framework through MD screening of AI-designed hydrophobically optimized cores. Stage II introduces precision functionalization via inter-helical salt bridges and bi-histidine metal-coordination motifs, guided by AlphaFold3 structural models. The integration of global architectural stability with local chemical cross-linking produces additive mechanical reinforcement, enabling the creation of ultrastable protein domains.
Discussion
The rational design of proteins capable of withstanding concurrent thermal, chemical, and mechanical stress remains a fundamental challenge. Here, we present a hierarchical strategy that integrates chemically distinct stabilization mechanisms within a single protein architecture. Our approach not only provides a robust blueprint for creating ultrastable proteins but also decouples the quantitative contributions of hydrophobic packing, electrostatic interactions, and metal coordination to mechanical resilience.
Our work builds upon established principles of α-helical protein stability while introducing substantive advances. Whereas most isolated α-helical domains unfold below 20 pN due to sequential hydrogen bond rupture (Wolny, et al., 2014), our designed variants achieve forces up to 200 pN. This represents an advance beyond canonical α-helical bundles such as spectrin and ankyrin repeats, which typically unfold near 50 pN (Serquera, et al., 2010). Previous engineering strategies have demonstrated promising improvements potential (Lopez-Garcia, et al., 2021), and our contribution lies in the systematic integration of global architectural optimization with local chemical reinforcement within a unified design framework enabled by AI protein design and computational screening (Thomas and Elcock, 2004; Wolny, et al., 2017). The hierarchical organization of our approach proved effective. We first established a stable structural framework through computational design of an optimized hydrophobic core, providing baseline mechanical stability of ∼120 pN. Subsequent precision functionalization with inter-helical salt bridges contributed an additional ∼30 pN, while metal coordination sites provided further gains of ∼50 pN depending on geometric parameters. This modular strategy demonstrates that distinct chemical interaction networks can be engineered to operate additively within a single protein structure (Cao, et al., 2011).
Notably, conventional structural metrics such as pLDDT scores, while predictive of foldability, show limited correlation with mechanical stability. This divergence underscores that dynamic functional properties require physics-based simulation. Although steered molecular dynamics (SMD) overestimates absolute unfolding forces due to computationally mandated high pulling speeds (Merkel, et al., 1999) (e.g., ∼600 pN in silico versus ∼50 pN for spectrin in experiment), it served as a powerful semi-quantitative screen. Together, these results indicate that SMD simulations provide a useful qualitative reference for comparing relative mechanostability among designs and for guiding experimental prioritization, despite intrinsic differences in force scales and sampling between simulations and experiments.
In conclusion, our hierarchical framework provides a generalizable route to programming protein stability. By modular integration of architectural and chemical stabilization strategies, we can now tailor mechanical, thermal, and chemical resistance in α-helical proteins. This approach will accelerate the development of robust protein-based materials and molecular devices for demanding applications in biotechnology and synthetic biology (Klein and Nanda, 2025; Ling, et al., 2018).
Materials and methods
Plasmids construction and proteins engineering
The gene fragments encoding the designed proteins were synthesized (General Biosystems, Anhui) and cloned into the pET-30a vector in the format Coh-GB1-SpecAI-GB1-His6-NAL for use in AFM-SMFS experiments (Shi, et al., 2022). All point mutation constructs introducing electrostatic and metal-ligand interactions were generated by PCR-based site-directed mutagenesis. A construct, pET30a-SpecAI-His6, was prepared via seamless cloning for circular dichroism (CD) measurements.
Both protein constructs were expressed and purified following identical protocols. The plasmids were transformed into Escherichia coli BL21(DE3) and plated on Luria-Bertani (LB) agar medium (2.5% LB, 2% agar) containing 100 μg/mL kanamycin. Single colonies were inoculated into 20 mL of LB broth supplemented with kanamycin and incubated overnight at 37°C. These were subsequently expanded into 800 mL of LB medium and grown until the optical density at 600 nm (OD600) reached 0.7–0.8. After cooling to room temperature, protein expression was induced by adding 0.5 mM IPTG, followed by incubation at 18°C for 20 hours. Cells were harvested by centrifugation at 4,000 rpm for 10 minutes and stored at −80°C prior to purification.
For purification, cell pellets were resuspended in 30 mL of lysis buffer (50 mM Tris, pH 7.4) supplemented with DNase I, RNase A, and PMSF. Bacterial disruption was carried out using a high-pressure homogenizer (700 bar, 3 minutes), and the lysate was centrifuged at 13,000 rpm for 15 minutes to collect the supernatant. This was incubated with Co-NTA affinity resin pre-equilibrated with binding buffer (20 mM Tris, 400 mM NaCl, 2 mM imidazole, pH 7.4) for 30 minutes at room temperature. After washing with 3–5 column volumes of binding buffer, the target protein was eluted with 15 mL of elution buffer (20 mM Tris, 400 mM NaCl, 250 mM imidazole, pH 7.4).
The Coh-GB1-SpecAI-GB1-His6-NAL protein was concentrated via ultrafiltration and buffer-exchanged into a low-salt storage buffer (50 mM Tris, 100 mM NaCl, pH 7.4). The SpecAI-His6 variant was similarly processed and exchanged into a CD assay buffer (10 mM K2HPO4, 1.8 mM KH2PO4, pH 7.4).
The GL-His6-GB1-Xmod-Doc, GL-His6-ELP20-Cys and His6-Cys-ELP20-NGL protein used for subsequent single-molecule force spectroscopy experiments was expressed and purified using a similar procedure. The expression and purification protocols of OaAEP1(C247A) can be found in references (Deng, et al., 2019; Yang, et al., 2017).
Structural modeling and MD simulations
First, based on spectrin structure, a total of 100 different protein structures were generated using RFdiffusion by appending new structural elements to either the N- or C-terminus (Watson, et al., 2023). Among these, five structurally reasonable models exhibiting four helical topologies were manually selected for their potential to improve hydrophobic interactions. For each selected structure, 100,000 sequence variants were generated using ProteinMPNN, using a fixed random seed of 37 to ensure reproducibility (Dauparas, et al., 2022). 319,747 sequences with GRAVY score below –0.3 were selected. The structural compatibility of the generated sequences was then quickly assessed using ESMFold and compared with the corresponding initial structures predicted by Rfdiffusion (Lin, et al., 2023). Only 357 Sequences exhibiting high structural agreement, defined by pLDDT scores above 90 and root mean square deviation (RMSD) values below 2.0 Å, were further modeled using AlphaFold2 (Jumper, et al., 2021). During AlphaFold2 modeling, six recycles were performed, with template mode enabled. For each sequence, five structural models were generated and ranked according to their pLDDT scores. Ultimately, only the ones with significantly higher scores were retained, and models with pLDDT >90 and RMSD < 2 Å were selected for subsequent analysis, resulting in a total of 211 models.
To assess the mechanical stability of designed proteins, Steered MD (SMD) simulations were conducted by GROMACS 2023.2 (single-precision build) (Van Der Spoel, et al., 2005). The systems were constructed using GROMACS built-in tools, with the CHARMM36m force field and the TIP3P water model applied (Lee, et al., 2015). Rectangular simulation boxes were used with a minimum protein-box distance of 1.2 nm in all directions; for SMD simulations, the z dimension was extended to the theoretical stretching length (initial N-C distance plus 0.36 nm per stretched residue). All simulations were performed assuming a physiological pH of 7.0, with standard protonation states assigned accordingly. The systems were neutralized with sodium and chloride ions to achieve a physiological NaCl concentration of 0.15 M. Simulations were equilibrated under NVT conditions at 310 K using the V-rescale thermostat, followed by a 5 ns NpT production run with C-rescale pressure coupling (1.0 bar), while maintaining temperature control as described above (Kim, et al., 2019).
Prior to SMD simulations, each system was first subjected to 5,000 steps of energy minimization using the steepest descent algorithm, with all protein backbone atoms positionally restrained. Subsequently, during a 1.0 ns NVT equilibration, these restraints were maintained with a harmonic force constant of 1000 kJ·mol−1·nm2 to allow for the thorough relaxation of the solvent and ions around the protein. After this, 125 ps equilibrium MD runs and 5 ns production MD simulations were run to stabilize the temperature and pressure, ensuring system equilibrium. During SMD simulations, the N-terminus of each protein was restrained using a harmonic potential with a force constant of 1000 kJ·mol−1·nm2, while the C-terminus was pulled along the z-axis at constant velocity using an umbrella restraint with the same force constant (k = 1000 kJ·mol−1·nm2).
SMD simulations were performed at a pulling speed of 1 nm/ns on the 211 predicted protein structures selected from the previous design step (Gräter, et al., 2005; Lu, et al., 1998). Each simulation was repeated three times to ensure reproducibility. Since the designed helical proteins could unfold at multiple sites along their structure, the total extension distance was limited to 65% of the theoretical full length, a value that was previously validated through full-length stretching simulations of individual proteins, which confirmed that no significant unfolding events occurred beyond this distance. During SMD simulations, the C-terminus was harmonically restrained while a harmonic pulling potential was applied to the N-terminus in umbrella mode, directed along the z-axis. The pulled group was unconstrained in the xy-plane, allowing for lateral relaxation during extension. The top three proteins exhibiting the highest average unfolding forces in SMD simulations were selected as candidate constructs. To validate their thermal stability, annealing MD (AMD) simulations were conducted on each protein, with temperature gradually increased from 310 to 473 K over a 1 μs trajectory using a 2 fs time step. Structural deviation was quantified by calculating the RMSD between the final structure and the original predicted conformation. These AMD simulations served to validate the selected proteins, ensuring that the mechanically robust candidates also exhibit favorable thermal stability for downstream applications.
AFM-SMFS experiments
AFM-SMFS protein unfolding experiments were performed on a ForceRobot300 AFM (JPK) at room temperature. Detailed protocols are provided in the literature (Liu, et al., 2024). In brief, proteins were immobilized on both glass substrate and AFM cantilever via strain-promoted azide-alkyne cycloaddition (SPAAC) and Michael addition chemistry (Deng, et al., 2019; Shi, et al., 2022). Elastin-like polypeptide (ELP20, (VPGXG)20) was employed as a spacer and single-molecule signature for the experiment (Ott, et al., 2017). Prior to the measurements, ELP20 with an OaAEP1(C247A) specific recognition site C-terminal NGL (or NAL) or N-terminal GL (His6-Cys-ELP20-NGL/NAL, GL-His6-ELP20-Cys) were separately immobilized on the AFM tips and glass substrates through maleimide-thiol reactions. OaAEP1 catalyzes the formation of a covalent bond between a C-terminal -NGL/-NAL tag and an N-terminal -GL motif (Deng, et al., 2019; Yang, et al., 2017). Then GB1-Doc (GB1-Xmodule-dockerin) was linked at cantilevers by enzymatic ligation using OaAEP1 to provide X-module-dockerin (Doc) domains (Stahl, et al., 2012). Coh-GB1-SpecAI-GB1-His6-NAL fusion protein was immobilized on a modified glass. The AFM tip was pressed onto the surface with a force of 400 pN to capture the target protein via the specific and high-affinity Coh–Doc interaction, enabling reliable single-molecule attachment.
AFM-SMFS measurements were conducted in buffer composed of 50 mM Tris-HCl and 100 mM NaCl at pH 7.4. For proteins engineered with metal coordination sites, 200 μM NiCl2 was supplemented to facilitate Ni2+ binding. The tip was retracted at a constant pulling speed of 1000 nm/s if not specified. For dynamic force spectroscopy experiments, pulling speeds of 400, 1000, 2000, and 4000 nm/s were used.
Bruker MLCT-D or MLCT-E cantilevers were used, and each cantilever was calibrated in buffer prior to data acquisition using the thermal noise method (equipartition theorem) via the instrument’s built-in software. These curves were analyzed by Igor Pro 6.12 (Wavemetrics). Only traces exhibiting at least three consecutive GB1 unfolding peaks followed by a final high force Coh–Doc rupture peak were selected for further analysis. Unfolding forces and contour length increments (ΔLc) were extracted by fitting each individual peak using the worm-like chain (WLC) model of polymer elasticity. Additionaly, the measured unfolding forces were further calibrated using the fingerprint protein GB1 domain as an internal standard, based on its well-established unfolding force (213 pN at 1000 nm/s) (Cao, et al., 2006). For each construct, unfolding forces from multiple independent traces were pooled and fitted with a Gaussian function. The mean unfolding force and the standard error of the mean (SEM) were extracted from the fits and used for subsequent comparison and analysis.
To investigate the effect of loading rate on protein unfolding behavior, AFM measurements were performed not only at the retraction speed of 1000 nm·s−1 but also at constant cantilever retraction speeds of 400, 2000, and 4000 nm·s−1. For each pulling speed, multiple force–extension curves were collected. For each unfolding event, the loading rate r was calculated as the product of the slope of the force peak and the cantilever retraction speed, and calibrated using the unfolding force of the GB1 domain as an internal standard (Cao, et al., 2006). For each speed, the average r and peak unfolding force F were determined, with standard deviations reported as error bars, and plotted in a force–loading rate scatter plot. The zero-force off-rate (koff) and the distance to the transition state (Δx) were subsequently extracted by linear fitting based on the Bell–Evans model (Evans and Ritchie, 1997):

where kB is the Boltzmann constant and T is the temperature.
Site-directed mutagenesis
To further improve the mechanical stability of helical proteins, salt bridge mutations were introduced into the top three candidates from Stage I. Two mutation strategies were employed: introducing salt bridges either at the N- or C-terminus of the protein, or near the regions where force peaks were observed in the SMD simulation. Both strategies targeted areas subjected to mechanical stress. Subsequently, we explored metal coordination as an additional stabilization strategy. Double-histidine mutations were introduced at the N- and C-termini of the parent proteins that exhibited high unfolding forces in previous experiments. Structural predictions using AlphaFold3 confirmed that the overall conformations remained stable and that the histidine residues were properly positioned to satisfy the geometric requirements for Ni2+ coordination. These mutated proteins were then subjected to single-molecule force spectroscopy experiments following the same protocol described above.
MALDI-TOF MS
Measurements were performed on an ultrafleXtreme mass spectrometer (Bruker Daltonics). Protein samples were prepared using a two-layer matrix method at a concentration of approximately 3 mg/mL. Each sample was analysed in at least three independent replicates to ensure data reliability. Mass spectra were acquired in the m/z range of 18,000–22,000.
Circular dichroism
CD measurements were performed on a Chirascan (Applied Photophysics) instrument using a 1 mm pathlength cuvette (Hellma). The purified protein was diluted to 0.1–0.15 mg/mL in CD assay buffer. Far-UV CD spectra were recorded from 190 nm to 260 nm at 20°C to evaluate the secondary structure of SpecAI proteins. To assess thermal stability, temperature-dependent CD measurements were conducted, with spectra collected at 5°C intervals from 20°C to 100°C. Thermal unfolding was monitored by tracking the mean residue ellipticity (MRE) at 195 nm (a characteristic positive peak for α-helices) across a temperature gradient from 20°C to 100°C.
For the chemical denaturation study of SpecAI, thermal scanning was conducted in the presence of GdnHCl at concentrations ranging from 1 M to 5 M. A set of 4–6 concentrations was selected based on their ability to reliably determine the sample’s Tm and produce high-quality fitting curves. Thermal denaturation was performed by heating from 20°C upward in 5°C increments until complete denaturation was achieved. The terminal temperature varied depending on both the specific sample and the concentration of GdnHCl.
At each temperature plateau, full spectra were recorded between 200 nm and 260 nm. Changes in signal were monitored at approximately 222 nm across all temperatures and GdnHCl concentrations. All measurements were repeated in at least three independent replicates to ensure reproducibility.
Data availability
All data are present in the main text or supplementary information.
Acknowledgements
The numerical calculations in this work have been done on the computing facilities in the High-Performance Computing Center (HPCC) of Nanjing University.
Additional files
References
- Local and macroscopic electrostatic interactions in single α-helicesNat. Chem. Biol 11:221–228Google Scholar
- Engineering protein stability with atomic precision in a monomeric miniproteinNat. Chem. Biol 13:764–770Google Scholar
- Increasing evidence of mechanical force as a functional regulator in smooth muscle myosin light chain kinaseeLife 6:e26473https://doi.org/10.7554/eLife.26473Google Scholar
- Pulling geometry defines the mechanical resistance of a beta-sheet proteinNat. Struct. Mol. Biol 10:731–737Google Scholar
- Mechanical processes in biochemistryAnnu. Rev. Biochem 73:705–748Google Scholar
- Nonmechanical protein can have significant mechanical stabilityAngew Chem Int 45:642–645Google Scholar
- Enhancing the mechanical stability of proteins through a cocktail approachBiophysical Journal 100:1794–1799Google Scholar
- Mechanical design of proteins-studied by single-molecule force spectroscopy and protein engineeringProg. Biophys. Mol. Biol 74:63–91Google Scholar
- The elasticity of an individual fibrin fiber in a clotProc. Natl. Acad. Sci. U. S. A 102:9133–9137Google Scholar
- Tuning the Mechanical Stability of Fibronectin Type III Modules through Sequence VariationsStructure 12:21–30Google Scholar
- Robust deep learning–based protein sequence design using ProteinMPNNScience 378:49–56Google Scholar
- Stretching single talin rod molecules activates vinculin bindingScience 323:638–641Google Scholar
- Enzymatic biosynthesis and immobilization of polyprotein verified at the single-molecule levelNat. Commun 10:2775–2785Google Scholar
- Single-molecule mechanics of synthetic aromatic amide helices: Ultrafast and robust non-dissipative windingChem 7:1333–1346Google Scholar
- Dynamic strength of molecular adhesion bondsBiophys J 72:1541–1555Google Scholar
- Cartilage-like protein hydrogels engineered via entanglementNature 618:740–747Google Scholar
- Principles of Protein Stability and Their Application in Computational DesignAnnu. Rev. Biochem 87:105–129Google Scholar
- Mechanically Induced Titin Kinase Activation Studied by Force-Probe Molecular Dynamics SimulationsBiophys. J 88:790–804Google Scholar
- Using circular dichroism spectra to estimate protein secondary structureNat. Protoc 1:2876–2890Google Scholar
- Towards design principles for determining the mechanical stability of proteinsPhys. Chem. Chem. Phys 15:15767–15780Google Scholar
- Structures of the spectrin-ankyrin interaction binding domainsBlood 113:5385–5393Google Scholar
- Highly accurate protein structure prediction with AlphaFoldNature 596:583–589Google Scholar
- Molecular dynamics simulations of biomoleculesNat. Struct. Biol 9:646–652Google Scholar
- Contribution of hydrophobic interactions to protein stabilityNature 333:784–786Google Scholar
- New Method for Constant-NPT Molecular DynamicsJ. Phys. Chem. A 123:1689–1699Google Scholar
- The proteins that could beNat. Rev. Chem 9:283–284Google Scholar
- A simple method for displaying the hydropathic character of a proteinJ. Mol. Bio 157:105–132Google Scholar
- Cooperativity in Forced Unfolding of Tandem Spectrin RepeatsBiophys. J 84:533–544Google Scholar
- Direct single-molecule quantification reveals unexpectedly high mechanical stability of vinculin-talin/alpha-catenin linkagesSci. Adv 5:eaav2720Google Scholar
- Type III secretion system effector proteins are mechanically labileProc. Natl. Acad. Sci. U. S. A 118:e2019566118Google Scholar
- Nanospring behaviour of ankyrin repeatsNature 440:246–249Google Scholar
- CHARMM-GUI Input Generator for NAMD, GROMACS, AMBER, OpenMM, and CHARMM/OpenMM Simulations Using the CHARMM36 Additive Force FieldJ. Chem. Theory Comput 12:405–413Google Scholar
- Molecular Basis of Fibrin Clot ElasticityStructure 16:449–459Google Scholar
- Evolutionary-scale prediction of atomic-level protein structure with a language modelScience 379:1123–1130Google Scholar
- Nanofibrils in nature and materials engineeringNat. Rev. Mater 3:18016Google Scholar
- De Novo Design Mechanostable Titin Ig Domain Using Artificial Intelligence and Steered Molecular Dynamics SimulationsCCS Chem 7:1629–1640Google Scholar
- Click Chemistry-Based Force Spectroscopy Revealed Enhanced Binding Dynamics of Phosphorylated HMGB1 to Cisplatin-DNAJ. Am. Chem. Soc 146:13126–13132Google Scholar
- Impact of Guanidinium Hydrochloride on the Shapes of Prothymosin-α and α-Synuclein Is Dramatically DifferentBiochemistry 64:105–113Google Scholar
- De Novo Design of Four-Helix Bundle Metalloproteins: One Scaffold, Diverse ReactivitiesAcc Chem Res 52:1148–1159Google Scholar
- Fortified Coiled Coils: Enhancing Mechanical Stability with Lactam or Metal StaplesAngew Chem Int 60:232–236Google Scholar
- Unfolding of Titin Immunoglobulin Domains by Steered Molecular Dynamics SimulationBiophysical Journal 75:662–671Google Scholar
- Mechanochemical evolution of the giant muscle protein titin as inferred from resurrected proteinsNat. Struct. Mol. Biol 24:652–657Google Scholar
- Stretching DNAMacromolecules 28:8759–8770Google Scholar
- Helix stabilization by Glu-…Lys+ salt bridges in short peptides of de novo designProc. Natl. Acad. Sci. U. S. A 84:8898–8902Google Scholar
- Energy landscapes of receptor-ligand bonds explored with dynamic force spectroscopyNature 397:50–53Google Scholar
- Mechanical Stability of a High-Affinity Toxin Anchor from the Pathogen Clostridium perfringensJ. Phys. Chem. B 121:3620–3625Google Scholar
- Molecular mechanism of extreme mechanostability in a pathogen adhesinScience 359:1527–1533Google Scholar
- Dynamic Transition from α-Helices to β-Sheets in Polypeptide Coiled-Coil MotifsJ. Am. Chem. Soc 139:16168–16177Google Scholar
- Protein-Based Biological Materials: Molecular Design and Artificial ProductionChem. Rev 123:2049–2111Google Scholar
- Atomic Force Microscopy-Based Force Spectroscopy and Multiparametric Imaging of Biomolecular and Cellular SystemsChem. Rev 121:11701–11725Google Scholar
- Native Salt Bridges Are a Key Regulator of Ubiquitin’s Mechanical StabilityJ. Phys. Chem. B 126:3505–3511Google Scholar
- Single-molecule force spectroscopy: optical tweezers, magnetic tweezers and atomic force microscopyNat. Methods 5:491–505Google Scholar
- Direct observation of prion-like propagation of protein misfolding templated by pathogenic mutantsNat. Chem. Biol 20:1220–1226Google Scholar
- Designing an extracellular matrix protein with enhanced mechanical stabilityProc. Natl. Acad. Sci. U. S. A 104:9633–9637Google Scholar
- ForceGen: End-to-end de novo protein generation based on nonlinear mechanical unfolding responses using a language diffusion modelSci. Adv 10:eadl4000Google Scholar
- Elastin-like Polypeptide Linkers for Single-Molecule Force SpectroscopyAcs Nano 11:6346–6354Google Scholar
- In situ analysis of osmolyte mechanisms of proteome thermal stabilizationNat. Chem. Biol 20:1053–1065Google Scholar
- Maximum Allowed Solvent Accessibilites of Residues in ProteinsPLoS ONE 8Google Scholar
- Spectrin Domains Lose Cooperativity in Forced UnfoldingBiophys. J 92:571–577Google Scholar
- Hidden length lets collagen buffer mechanical and chemical stressPhys. Rev. E 110:054408Google Scholar
- Reversible unfolding of individual titin immunoglobulin domains by AFMScience 276:1109–1112Google Scholar
- Single molecule force spectroscopy of spectrin repeats: low unfolding forces in helix bundlesJ. Mol. Bio 286:553–561Google Scholar
- Design of complicated all-α protein structuresNat. Struct. Mol. Biol 31:275–282Google Scholar
- The myosin coiled-coil is a truly elastic protein structureNature Materials 1:232–235Google Scholar
- Mechanical Unfolding of an Ankyrin Repeat ProteinBiophys. J 98:1294–1301Google Scholar
- Combination of Click Chemistry and Enzymatic Ligation for Stable and Efficient Protein Immobilization for Single-Molecule Force SpectroscopyCCS Chem 4:598–604Google Scholar
- Polymer-mediated protein/peptide therapeutic stabilization: Current progress and future directionsProg. Polym. Sci 156:101867Google Scholar
- Single-molecule dissection of the high-affinity cohesin-dockerin complexProc. Natl. Acad. Sci. U. S. A 109:20431–20436Google Scholar
- alpha-Helix Unwinding as Force Buffer in SpectrinsAcs Nano 12:2719–2727Google Scholar
- Enhanced statistical sampling reveals microscopic complexity in the talin mechanosensor folding energy landscapeNat. Phys 19:52–60Google Scholar
- Molecular Simulations Suggest Protein Salt Bridges Are Uniquely Suited to Life at High TemperaturesJ. Am. Chem. Soc 126:2208–2214Google Scholar
- Tuning coiled coil stability with histidine-metal coordinationNanoscale 10:22725–22729Google Scholar
- GROMACS: fast, flexible, and freeJ. Comput. Chem 26:1701–1718Google Scholar
- Transforming de novo protein a3D into a mechanically stable protein by zinc bindingChem. Commun 57:11489–11492Google Scholar
- De novo design of protein structure and function with RFdiffusionNature 620:1089–1100Google Scholar
- Experimental evidence for a frustrated energy landscape in a three-helix-bundle protein familyNature 463:685–688Google Scholar
- Stable Single α-Helices Are Constant Force Springs in ProteinsJ. Bio. Chem 289:27825–27835Google Scholar
- Characterization of long and stable de novo single alpha-helix domains provides novel insight into their stabilitySci. Rep 7:44341Google Scholar
- Engineering a Catalytically Efficient Recombinant Protein LigaseJ. Am. Chem. Soc 139:5351–5358Google Scholar
- DNA-functionalized artificial mechanoreceptor for de novo force-responsive signalingNat. Chem. Biol 20:1066–1077Google Scholar
- Direct Detection of the Chain Trajectory and Long-Range Correlation in Crystalline–Amorphous Polymer Networks by Single-Molecule Force SpectroscopyMacromolecules 58:6170–6177Google Scholar
Article and author information
Author information
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
Cite all versions
You can cite all versions using the DOI https://doi.org/10.7554/eLife.109753. This DOI represents all versions, and will always resolve to the latest one.
Copyright
© 2026, Qiu et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 222
- downloads
- 14
- citations
- 0
Views, downloads and citations are aggregated across all versions of this paper published by eLife.