Research Article

Structural Biology and Molecular Biophysics

Functionally important residues from graph analysis of coevolved dynamic couplings

UCL School of Pharmacy, United Kingdom
Department of Computer Science, Brunel University London, United Kingdom
Centre for Host-Microbiome Interactions, Faculty of Dentistry, Oral & Craniofacial Sciences, King’s College London, United Kingdom
Research Service, Louis Stokes Cleveland Department of Veterans Affairs Medical Center, United States
Department of Molecular Biology and Microbiology, Case Western Reserve University School of Medicine, United States
Department of Medicine, Case Western Reserve University School of Medicine, United States
Departments of Pharmacology, Biochemistry, and Proteomics and Bioinformatics Case Western Reserve University School of Medicine, United States
CWRU-Cleveland VAMC Center for Antimicrobial Resistance and Epidemiology (Case VA CARES), United States
University of Tabuk (PFSCBR), Saudi Arabia
UCL Center for Advanced Research Computing, University College London, United Kingdom

Mar 28, 2025

https://doi.org/10.7554/eLife.105005.3

Open access
Copyright information

eLife Assessment

This article reports the analysis of coevolutionary patterns and dynamical information for identifying functionally relevant sites. These findings are considered important due to the broad utility of the unified framework and network analysis capable of revealing communities of key residues that go beyond the residue-pair concept. The data are solid and the results are clearly presented.

https://doi.org/10.7554/eLife.105005.3.sa0

Significance of the findings:

Important: Findings that have theoretical or practical implications beyond a single subfield

Landmark
Fundamental
Important
Valuable
Useful

Strength of evidence:

Solid: Methods, data and analyses broadly support the claims with only minor weaknesses

Exceptional
Compelling
Convincing
Solid
Incomplete
Inadequate

During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife Assessments

Abstract
Introduction
Results and discussion
Methods
Appendix 1
Appendix 2
Data availability
References
Article and author information
Metrics

Abstract

The relationship between protein dynamics and function is essential for understanding biological processes and developing effective therapeutics. Functional sites within proteins are critical for activities such as substrate binding, catalysis, and structural changes. Existing computational methods for the predictions of functional residues are trained on sequence, structural, and experimental data, but they do not explicitly model the influence of evolution on protein dynamics. This overlooked contribution is essential as it is known that evolution can fine-tune protein dynamics through compensatory mutations either to improve the proteins’ performance or diversify its function while maintaining the same structural scaffold. To model this critical contribution, we introduce DyNoPy, a computational method that combines residue coevolution analysis with molecular dynamics simulations, revealing hidden correlations between functional sites. DyNoPy constructs a graph model of residue–residue interactions, identifies communities of key residue groups, and annotates critical sites based on their roles. By leveraging the concept of coevolved dynamical couplings—residue pairs with critical dynamical interactions that have been preserved during evolution—DyNoPy offers a powerful method for predicting and analysing protein evolution and dynamics. We demonstrate the effectiveness of DyNoPy on SHV-1 and PDC-3, chromosomally encoded β-lactamases linked to antibiotic resistance, highlighting its potential to inform drug design and address pressing healthcare challenges.

Introduction

Quantifying the contribution of individual residues or residue groups to protein function is important to estimate the pathogenic effect of mutations (Stenson et al., 2017). Identifying the functional roles of individual residues has primarily been done through mutagenesis experiments (Matreyek et al., 2018). Bioinformatics methods have complemented these approaches through analysis of multiple sequence alignments (MSA) of homologous proteins and structural data (Poelwijk et al., 2016; Høie et al., 2022; Blaabjerg et al., 2023; Dunham and Beltrao, 2021; Hopf et al., 2017; Radivojac et al., 2013). Among these methods, computational techniques that can decode inter-residue evolutionary relationships from MSAs have paved the way for machine learning (ML)-based strategies that can predict protein structure (Jumper et al., 2021; Lin et al., 2023; Baek et al., 2021; Marks et al., 2012), stability (Broom et al., 2020), and function (Hopf et al., 2017) and extend the scope of computational protein design (Ding et al., 2024; Wu et al., 2019; Russ et al., 2020). A most recent approach has combined experimental data from three proteins, NUDT15, PTEN, and CYP2C9, on the stability and function with sequence and structural features to train an ML model to predict functional sites (Cagiada et al., 2023).

Functional sites are often regulated by both local and global interactions. Changes in these interactions are instrumental for functional events like substrate binding, catalysis, and conformational changes (Wodak et al., 2019). The development of physical models of protein dynamics and the increase in available computational power has stimulated the adoption of computational techniques (Campitelli et al., 2020; Rodrigues et al., 2021) to investigate the conformational dynamics of proteins, an essential component of the many biological functions (Henzler-Wildman and Kern, 2007; James and Tawfik, 2003). Different models have been proposed to describe the interactions between residues during simulations and network models have been particularly popular, including methods on single structures and molecular dynamics (MD) simulations data built by analysing the response to external forces on residue networks (Nevin Gerek et al., 2013), estimating the prevalence of non-covalent energy interaction networks in homologous proteins (Yehorova et al., 2024), or analysing linear or non-linear correlation in atomic fluctuations (Lange and Grubmüller, 2006; Osuna, 2021). These techniques have demonstrated their usefulness in extracting allosteric networks from structural data with applications in enzyme design (Osuna, 2021).

However, none of these techniques incorporate information on residue evolution into the computational approach, while it has been established that evolution through compensatory mutations in dynamic regions, like hinges and loops, can fine-tune protein structural dynamics and introduce promiscuity, thereby diversifying biological function. Assuming that protein functional dynamics is conserved during evolution, significant information on dynamic regions and substrate recognition sites should be recoverable using inter-residue coevolution scores extracted from MSAs (Granata et al., 2017; Liu and Bahar, 2012). Coevolution analysis and MD simulations have independently (Parente et al., 2015) and synergistically been combined in the past to identify important residues for function (Ponzoni et al., 2015; Sutto et al., 2015; Estabrook et al., 2005; Chen et al., 2011; Wang et al., 2013). Yet a method that combines hidden information on dynamics from evolution with direct information on local and global dynamics from conformational ensembles from MD is not yet available.

Here, we present DyNoPy, a computational method that can extract hidden information on functional sites from the combination of pairwise residue coevolution data and powerful descriptors of dynamics extracted from the analysis of MD ensembles. The method can detect coevolved dynamic couplings, that is, residue pairs with critical dynamical interactions that have been preserved during evolution. These pairs are extracted from a graph model of residue–residue interactions. Communities of important residue groups are detected, and critical sites are identified by their eigenvector centrality in the graph (Figure 1). We demonstrate the power of this approach on SHV-1 and PDC-3 β-lactamases of major clinical importance (Olehnovics et al., 2021; Chen et al., 2024). DyNoPy successfully detects residue couplings that align with previous studies, guide in the explanations of mutation sites with previously unexplained mechanisms, and provide predictions on plausible important sites for the emergence of clinically relevant variants.

Figure 1

Download asset Open asset

Results and discussion

β-Lactamases are a group of enzymes capable of hydrolysing β-lactams, conferring resistance to β-lactam antibiotics (Poole, 2004). These enzymes are evolving rapidly as single amino acid substitutions are sufficient to drive their evolution and increase their catalytic spectrum and inhibitor resistance profile (Bush, 2018). The widespread dissemination of β-lactamases across different bacterial species and their extensive emergence highlight their global impact on antibiotic resistance (Bush, 2013). The rapid evolution of β-lactamases and their clinical significance (Bush, 2018) makes them an ideal target for evaluating the robustness of DyNoPy.

In this study, we applied DyNoPy to two model enzymes from different β-lactamase families: class A β-lactamase SHV-1 (a chromosomally encoded enzyme in Klebsiella pneumoniae) and class C β-lactamase PDC-3 (a chromosomally encoded enzyme in Pseudomonas aeruginosa) (Olehnovics et al., 2021; Chen et al., 2024; Figure 2). Both class A and class C β-lactamases comprise an α/β domain and an α helical domain, with the active site situated in between (Matagne et al., 1998; Philippon et al., 2022). Moreover, both enzymes target the carbonyl carbon of the β-lactams using a highly conserved serine residue (Palzkill, 2018; Jacoby, 2009). Despite these similarities, the structures of class A and class C β-lactamases are remarkably different (Figure 2—figure supplements 1 and 2). In class A β-lactamases, the active site is surrounded by three loops: the α3-α4 loop (residues 101–111), the Ω-loop (residues 164–179), and the hinge region (residues 213–218) (Galdadas et al., 2021). The Ω-loop is particularly critical as it positions N₁₇₀ to hydrogen bond with the E₁₆₆ via a conserved water molecule, which is essential for initiating the deacylation step (Kuzin et al., 1999). Compared to class A β-lactamases, the active site of class C β-lactamases is wider, conferring a broader substrate binding capability (Medeiros, 1997; Figure 2—figure supplement 3). The active site of class C β-lactamases can be divided into two parts: the R1 site and the R2 site (Jacoby, 2009). The R1 region is surrounded by the extended Ω-loop (residues 183–226), while the R2 site is enclosed by the R2-loop (residues 280–310) (Chen et al., 2024). The Ω-loop in class C β-lactamases is significantly longer than that in class A, enhancing the active site ability to accommodate diverse substrates and contributing to the extended spectrum profile of some class C enzymes (Chen et al., 2024).

Figure 2 with 3 supplements see all

Download asset Open asset

Structural Comparison of SHV-1 (PDB ID: 3N4I) and PDC-3 (PDB ID: 4HEF) β-lactamases.

Catalytic serine S₇₀ (SHV-1) and S₆₄ (PDC3) are highlighted using stick representation. Important loops surrounding the active site are highlighted in red. In SHV-1, highlighted loops are the α3-α4 loop (residues 101–111), the Ω-loop (residues 164–179), and the hinge region (residues 213–218). In PDC-3, highlighted loops are the Ω-loop (residues 183–226) and the R2-loop (residues 280–310).

SHV-1 is a very well-characterized enzyme with a wealth of information on mutations and their corresponding effects on protein function. In contrast, the information available on PDC-3 remains limited. Essential catalytic residues in SHV-1 are S₇₀, K₇₃, S₁₃₀, E₁₆₆, N₁₇₀, K₂₃₄, G₂₃₆, and A₂₃₇ (Ambler et al., 1991), and conserved catalytic residues in PDC-3 include S₆₄, K₆₇, Y₁₅₀, N₁₅₂, K₃₁₅, T₃₁₆, and G₃₁₇. Highly conserved stretches of 3–9 hydrophobic residues, annotated as hydrophobic nodes, exist in class A β-lactamases and have been proven to be essential for protein stability (Galdadas et al., 2018). Residues defined as belonging to hydrophobic nodes within SHV-1 are listed in Supplementary file 1a.

In SHV-1, the predominant extended spectrum β-lactamase (ESBL) substitutions occur at L₃₅, G₂₃₈, and E₂₄₀, while R₄₃, E₆₄, D₁₀₄, A₁₄₆, G₁₅₆, D₁₇₉, R₂₀₂, and R₂₀₅ appear in ESBLs with lower frequency (Liakopoulos et al., 2016). Mutations at M₆₉, S₁₃₀, A₁₈₇, T₂₃₅, and R₂₄₄ are known to induce inhibitor resistance in the enzyme (Pagan-Rodriguez et al., 2004). In PDC-3, substitutions primarily occur on the Ω-loop, enhancing its flexibility to accommodate the bulky side chains of antibiotics, while deletions are more common in the R2-loop (Jacoby, 2009). The predominant Ω-loop mutations isolated from clinics are found at positions V₂₁₁, G₂₁₄, E₂₁₉, and Y₂₂₁ (Barnes et al., 2018).

Emergence of highly conserved dynamic couplings

DyNoPy builds a pairwise model of conserved dynamic couplings detected by combining coevolution scores and information on functional motions into a score J_ij (see ‘Methods’ and Figure 1). To this end, a dynamic descriptor should be selected. When the descriptor is associated with functional conformational changes, it is expected that functionally relevant couplings will report higher scores. Dynamic descriptors can be selected from commonly used geometrical collective variables (CVs) for the analysis of MD trajectories (see ‘Methods’). As expected, the average J matrix score varies across the different CVs, with some of them showing no signal of dynamic coupling (Figure 3A).

Figure 3 with 1 supplement see all

Download asset Open asset

Features and residues in each community.

(A) Average J matrix score varies across different collective variables (CVs). Left: SHV-1; right: PDC-3. (B) Number of non-zero couplings detected by scaled coevolution scores ( $γ_{i j}$ ) and J values calculated by DyNoPy ( $J_{i j}$ ). (C) Number of residues of each community in SHV-1. (D) Number of residues of each community in PDC-3. A reasonable residue community should contain at least three residues.

SHV-1 and PDC-3 exhibit distinct dynamics, requiring a different choice of the CV that best captures the functional dynamics. For SHV-1, the global first principal component (PC1) proved to be the most effective feature, identifying 571 residue pairs with a J_ij value greater than 0. Conversely, PDC-3 requires selection of more localized features that can extract the Ω-loop dynamics from the overall protein motion. Among the dynamic descriptors, the partial first time-lagged component (TC1_partial) performed best for PDC-3, detecting 216 residue pairs with a J_ij value greater than 0. Consequently, PC1 and TC1_partial were selected to build the J matrix for SHV-1 and PDC-3, respectively. The performance of all 12 CVs for each protein was assessed and listed in Supplementary file 1b.

The importance of dynamical information is evident when coevolution couplings ( $γ_{i j}$ ) and conserved dynamic couplings (J_ij) are compared: the number of non-zero couplings decreases from 40% to <2% of total residue pairs in the protein (Figure 3B) when information from the dynamics descriptor is added. Thus, the inclusion of protein dynamics in coevolution studies acts as an effective filter that rules out residue pairs that do not have significant correlations with functional motions. Moreover, when relying only on $γ_{i j}$ , all the residues in SHV-1 and PDC-3 are included within four identified communities (Supplementary file 1c), suggesting that coevolution scores ( $γ_{i j}$ ) alone do not effectively discriminate residues relevant for protein functions. Furthermore, it would be hard to distinguish critical core residues for each community using only $γ_{i j}$ as the eigenvector centrality (EVC) values for the residues do not show remarkable differences (Figure 3—figure supplement 1A and B). This means that detailed dynamic investigation of the top residues is needed to determine which pairs should be picked up and further analysed. On the other hand, it is much easier to identify essential residues based on J scores calculated as clear outliers with significantly higher EVC values could be seen for almost all communities (Figure 3—figure supplement 1C and D; Parente et al., 2015; Negre et al., 2018). In conclusion, the lack of specificity in the statistically based coevolution analysis supports the choice of incorporating a score for the correlation between residue interactions and dynamic behaviours that enables deconvolution of community information.

DyNoPy reveals critical residues and predicts evolutionary pathways in SHV-1

DyNoPy identified eight meaningful communities, each consisting of at least three strongly coupled residues within SHV-1 (Figure 3C). All crucial catalytic residues and critical substitution sites previously mentioned participating in one of these communities with the exceptions of R₄₃, R₂₀₂, and S₁₃₀. Residues previously known to have critical role in function or conferring ESBLs/IRBLs phenotype are either directly coupled to protein dynamics or act as a central hub. The hubs interact with residues with either a role in catalysis or structural stability through their membership of hydrophobic nodes (Olehnovics et al., 2021). Furthermore, DyNoPy identified key positions (L₁₆₂ and N₁₃₆) within some communities that are known to undergo substitutions, conferring an ESBL phenotype in other class A β-lactamases. These substitutions have not yet emerged in the SHV family, providing insightful predictions about the potential future evolution of the enzyme. Detailed discussions of communities with secondary importance for protein function (communities 3, 8, and 9) is provided in the Appendix (Appendix 2—figure 1).

DyNoPy predicts mutation hotspots in SHV-1

DyNoPy detects critical mutation sites (L₁₆₂ and N₁₃₆) that are known to extend the range of substrates in other class A β-lactamases but have not yet emerged as variants in the SHV family. These sites have not been modified in the SHV family because of their plausible central role within the communities as they are mediating couplings with key functional residues essential for catalytic activity and structural stability, indicating their critical role in protein function and the potential lower mutation rate. These findings provide insightful predictions about the potential future evolution of the enzyme, as well as plausible explanations for why these mutations have not yet appeared.

L_162, positioned at the start of the Ω-loop and adjacent to the crucial catalytic residue E_166, is assigned as the core residue for community 1 (Figure 4A). While it remains conserved in the SHV family, variants of L₁₆₂ have been isolated in other class A β-lactamase and are known to expand the enzyme catalytic spectrum. Single amino acid substitution at L₁₆₂ can intensify antibiotic resistance in BEL-1 (Pozzi et al., 2016), a class A ESBL clinical variant, exhibiting robust resistance to ticarcillin and ceftazidime (Bogaerts et al., 2007). BEL-2 diverges from BEL-1 by single amino acid substitution (L₁₆₂F), which alters the kinetic properties of the enzyme significantly and increases its affinity towards expanded-spectrum cephalosporins (Poirel et al., 2010). The relationship between L₁₆₂ and protein catalytic functions can be explained using DyNoPy model as there are couplings with catalytic important residues M₆₉, K₇₃, E₁₆₆, and K₂₃₄. Moreover, the BEL case has confirmed that L₁₆₂F mutation significantly destabilizes the overall protein structure, highlighting the crucial role of L₁₆₂ in maintaining protein stability (Pozzi et al., 2016). DyNoPy accurately identifies the centrality of L₁₆₂ by reporting its connections with 28 backbone residues, including 9 hydrophobic node residues critical for protein stability. Among these, five hydrophobic residues are part of the α2 node: V₇₅, L₇₆, G₇₈, V₈₀, and L₈₁, highlighting the contribution of L₁₆₂ to the stability of the α2 helix (Olehnovics et al., 2021).

Figure 4

Download asset Open asset

Communities 1, 4, and 5 of SHV-1 β-lactamase.

All the residues are depicted as spheres on the protein structure. The core residue for each community is highlighted in red, while purple is used to emphasize the secondary core residue. Residues that interact with both cores are coloured in light yellow. Functional important residues are marked in cyan. Hydrophobic nodes are enclosed with cyan boxes. (A) Community 1 of SHV-1, comprising 33 residues with L₁₆₂ being the primary core residue. (B) Community 4 of SHV-1, containing 12 residues and is centred by G₁₅₆. G₁₅₆ and A₁₄₆ are two functional important residues distant from the active site. G₁₅₆ is 21.3 Å away from the catalytic S₇₀. A₁₄₆ is 16.8 Å away from S₇₀. (C) Community 5 of SHV-1, embracing 48 residues and showing a strong correlation between V₁₀₃ and S₁₀₆.

Just like L₁₆₂, N₁₃₆ undergoes advantageous mutations in other class A β-lactamases while remains highly conserved within the SHV family. It is the core residue for community 7 (Figure 5B). This residue forms a hydrogen bond with E₁₆₆, stabilizing the Ω-loop (Bös and Pleiss, 2008). Although DyNoPy did not detect this direct interaction between N₁₃₆ and E₁₆₆, the established relationship between N₁₃₆ and N₁₇₀ highlights the role of N₁₃₆ in influencing E₁₆₆. N₁₇₀, an essential catalytic residue located on the Ω-loop, contributes to priming the water molecule for the deacylation step with E₁₆₆ (Agarwal et al., 2023), and is directly coupled with N₁₃₆. Due to the essential contribution of N₁₃₆ in facilitating E₁₆₆ to maintain its proper orientation, it was previously thought to be intolerant to mutations as substitution of asparagine to alanine at this position would make the enzyme lose its function completely (Cao et al., 2020). However, N₁₃₆D substitution has emerged as a new clinical variant very recently in PenL, a class A β-lactamase, by increasing its ability in hydrolysing ceftazidime (Cao et al., 2020), suggesting that this site has potential to mutate. This gain of function is mainly triggered by the increased flexibility of the Ω-loop (Cao et al., 2020). DyNoPy correctly detects a dynamical relationship between N₁₃₆ and the Ω-loop (residues 164–179). Six residues present in the Ω-loop participate within this community, including R₁₆₄ and D₁₇₉. These two residues are critical as they are forming the ‘bottleneck’ of the Ω-loop which is essential for the correct position of E₁₆₆ (Parwana et al., 2024). D₁₇₉ is also a critical mutation site for SHV-1. Single amino acid substitutions like D₁₇₉A, D₁₇₉N, and D₁₇₉G are enough for the extended spectrum phenotype (Liakopoulos et al., 2016).

Figure 5

Download asset Open asset

Communities 6 and 7 of SHV-1 β-lactamase.

All the residues are depicted as spheres on the protein structure. The core residue for each community is highlighted in red, while purple is used to emphasize the secondary core residue. Residues that interact with both cores are coloured in light yellow. Functional important residues are marked in cyan. (A) Community 6 of SHV-1, comprising 30 residues with Y₁₀₅ being the primary core residue. R₂₀₅ is a functional important residue that is 20.6 Å away from the active site S₇₀. (B) Community 7 of SHV-1, containing 34 residues and is centred by N₁₃₆.

DyNoPy detects residue couplings essential for protein stability

DyNoPy identifies residue couplings critical for protein functional motions, particularly associated with protein stability. These residue pairs exhibit strong relationships as they are not only directly coupled with each other but also forms various indirect couplings via other residues. As a result, both residues are considered as core residues inside these communities. It is expected that disruption of these couplings through mutation could compromise collective motions essential for enzyme activity.

As the secondary core residues in community 1 (Figure 4A), F₇₂ is showing a strong coupling with the primary core residue L₁₆₂ and also forms nine indirect couplings with L₁₆₂, including via the catalytic K₂₃₄. This network of direct and indirect relationships reveals the importance of F₇₂ and L₁₆₂ coupling in maintaining protein functional motions. Interestingly, previous studies identified a small hydrophobic cavity formed by L₁₆₂ and F₇₂, together with L₁₃₉, and L₁₄₈, which is essential for the stability of the active site (Pozzi et al., 2016). Notably, DyNoPy successfully recovers the key residues of this local hydrophobic cavity (L₁₆₂, F₇₂, and L₁₄₈).

The strong interplay between V₁₀₃ and S₁₀₆, which are both residues on the α3-α4 loop, is seen in community 5 (Figure 4C). These residues not only interact with each other directly but are also indirectly coupled via 22 other residues. This community emphasizes the significance of hydrophobic nodes in SHV stability and dynamics. Within the analysed 48 residues, 27 are hydrophobic, out of which 15 residues act as nodes critical for enzyme stabilization. Hydrophobic nodes stabilize their own secondary structures and interconnect to stabilize the overall protein (Galdadas et al., 2021). V₁₀₃ and S₁₀₆ themselves are hydrophobic nodes, stabilizing α3 helix and α4 helix, respectively, and are strongly coupled with each other. In CTX-M, another class A enzyme, N₁₀₆S is a common substitution that results in improved thermodynamic stability and compensate for the loss in stability of the variants (Lu et al., 2022). Interestingly, this residue is already a serine in SHV but still implies its pivotal role in protein stability.

DyNoPy provides valid explanations for mutation sites

During the evolution of β-lactamases, single mutations on specific sites that are distant from the functional sites have been observed to significantly alter protein catalytic functions. Additionally, single mutations on some surface exposed residues can dramatically increase protein stability. Understanding how these distant mutations impact function and stability becomes a major challenge in understanding protein evolutionary pathways. Communities extracted by DyNoPy show these residues linked with functional important residues, providing a rational for these mutation sites with unknown functions.

Mutations of G₁₅₆ are limited but they lead to ESBL phenotype in the SHV family (Liakopoulos et al., 2016). G₁₅₆ is the central residue for community 4 (Figure 4B), but it is distant from the active site, over 20 Å away from the catalytic serine S₇₀. Clinical variant SHV-27 has extended resistance ability towards cefotaxime, ceftazidime, and aztreonam (Corkill et al., 2001). It differs from SHV-1 by single amino acid substitution G₁₅₆D, suggesting that it has directly evolved from SHV-1 (Corkill et al., 2001). Limited research has been done on position G₁₅₆, and the understanding of how it affects the enzyme catalytic properties given that it is far away from the active site is still unclear. Based on our results, we suggest that this residue is essential for the overall protein function because of its 11 coevolved dynamic couplings with protein dynamics, including A₁₄₆, another ESBL substitution site.

SHV-38, another ESBL that is capable of hydrolysing carbapenems, harbours a single A₁₄₆V substitution compared to SHV-1 (Poirel et al., 2003). Like G₁₅₆, A₁₄₆ is 16.8 Å away from S₇₀ but shows an ability in altering protein catalytic function. The A₁₄₆-G₁₅₆ residue pair shows a strong coevolutionary signal and strong correlation with protein overall dynamics, implying that there may compensatory mutations at these sites with potential to emerge in the SHV family in the future. These two residues are not connected to any catalytic residues but their coupling to functional dynamics can offer plausible explanation to ESBL activity of these two mutations.

Unlike other substitution sites that are adjacent to the active site, R₂₀₅ is situated more than 20 Å away from catalytic serine S₇₀. Its side chain points outwards from the protein, exposing to the solvent. The R₂₀₅L substitution often co-occurs with other ESBL mutations and is thought to indirectly contribute to the ESBL phenotype by compensating for stability loss induced by other mutations (Ben Achour et al., 2009). SHV-3 is an ESBL that exhibits significant resistance to cefotaxime and ceftriaxone (Nicolas et al., 1989). Two substitutions in this enzyme, R₂₀₅L and G₂₃₈S, extend its resistance profile (Nicolas et al., 1989). Thus, it is promising to see that DyNoPy detected these two mutation sites together within community 6 (Figure 5A).

Y₁₀₅ and R₂₆₆ are the core residues for community 6 (Figure 5A). Y₁₀₅ is situated on the α3-α4 loop positioned at the left side of the binding pocket. It is an important catalytic residue that recognizes and binds to the thiazolidine ring of penicillins or β-lactamase inhibitors (Bethel et al., 2006). There is very limited information on the role of R₂₆₆, except that it may stabilize the Ω-loop in the SHV family similar to the analogous T₂₆₆ in TEM (Kuzin et al., 1999). G₂₃₈ is coupled with an essential catalytic residue Y₁₀₅, which further links with other catalytic functional residues: S₇₀ and A₂₃₇, and R₂₆₆, a residue that is known to stabilize the Ω-loop. This indicates that mutations on G₂₃₈ would result in an alteration on protein catalytic function, as well as an increased flexibility of the protein, which strongly aligns with previous finding (Nicolas et al., 1989). Its linked mutation site R₂₀₅ does not show direct coupling with any catalytic residues. Instead, it is directly coupled with R_266, which we mentioned as an Ω-loop stabilizer. Thus, it is not surprising that R₂₀₅ substitution alone is never observed in nature (Neubauer et al., 2020) as it would not give significant evolutionary advantage to the protein.

Insights into the unexplained functional sites of PDC-3

Unlike the extensively studied SHV-1, the functional roles of individual amino acids in PDC-3 remain largely unexplored. This gap in understanding serves as welcome challenge for interpreting the effects of mutations and the dynamic behaviour of PDC-3 from our results. Although several mutation hotspots, such as those on the Ω-loop (Barnes et al., 2018), have been identified, very little is known about the specific contributions of individual amino acids on the functionality of PDC-3.

In PDC-3, mutations have primarily been reported in the Ω-loop. They enhance its flexibility to accommodate the bulky side chains of antibiotics, while deletions are more common in the R2-loop (Jacoby, 2009). DyNoPy detected five communities in total (Figure 3D), with all the four predominant Ω-loop mutations appearing in these communities. Communities 3, 4 and 5 are discussed in the Appendix (Appendix 2—figure 2). Furthermore, DyNoPy also detected several previously unexplored Ω-loop residues.

G₂₁₄, a known mutation site in PDC-3, is the core residue in community 1. Another two essential mutation sites, E₂₁₉ and Y₂₂₁, also participate in this community, directly coupled with G₂₁₄ (Figure 6A). G₂₁₄ also has direct couplings with four other Ω-loop residues: A₁₉₅, A₁₉₇, G₂₁₂, and L₂₁₆. Previous results have demonstrated that substitutions of glycine to alanine or arginine at 214 significantly destabilize the Ω-loop (Chen et al., 2024). The strong correlation between G₂₁₄ and these Ω-loop residues emphasizes the significant contribution of G₂₁₄ towards the stability of the Ω-loop, which corroborates with previous results (Chen et al., 2024). Moreover, substitutions such as G₂₁₄A and G₂₁₄R and mutations on E₂₁₉ and Y₂₂₁ do not affect R2 loop flexibility, resulting in the smaller active site volume among variants (Chen et al., 2024) because none of the residues from the R2 loop are detected in this community, offering plausible explanation to the previously unexplained phenomenon.

Figure 6

Download asset Open asset

Communities 1 and 2 of PDC-3 β-lactamase.

All the residues are depicted as spheres on the protein structure. The core residue for each community is highlighted in red. Functional important residues are marked in cyan. (A) Community 1 of PDC-3, comprising 36 residues with G₂₁₄ being the primary core residue. (B) Community 2 of PDC-3, containing 74 residues and is centred by G₂₀₄.

G₂₀₄ is the core residue of community 2, coupled with 73 other residues, most of which are distant from the catalytic site, suggesting plausible crucial role in the overall protein stability like L₁₆₂ in SHV-1 (Figure 6B). G₂₀₄, a newly emerged mutation site in the PDC family (67), is located on the short β-sheet β5a within the Ω-loop, near the hinge region between β8 and β9 just above the active site. The only known variant of G₂₀₄ is PDC-466, which was derived from PDC-462 (A₈₉V, Q₁₂₀K, V₂₁₁A, N₃₂₀S), with an addition of G₂₀₄D (Colque et al., 2021). Coupling of G₂₀₄ to several catalytically important residues, including K₆₇, K₃₁₅, and T₃₁₆ can suggest that mutations at this site can negatively impact catalytic power. This offers a plausible explanation of seeing fewer variants at this site, and mutations at this site could have an impact on the hydrolysing capabilities of PDC variants. This should be confirmed by further experimental studies of variants of G₂₀₄. Unlike G₂₁₄, E₂₁₉, and Y₂₂₁ mutations which do not influence the dynamics of the R2 loop, substitutions on V₂₁₁, a member of Ω-loop, have an impact on the dynamics of the R2 loop because of its indirect couplings, through G₂₀₄ to R2-loop residues (Chen et al., 2024). Two less critical substitution sites, H₁₈₈ and V₃₂₉, were also observed in community 2.

Conclusions

DyNoPy offers two distinct advantages over existing computational tools (Yehorova et al., 2024; Osuna, 2021): (a) information on residue–residue coevolution can be directly used to detect the components of protein dynamics that have been preserved during evolution and (b) dynamic descriptors extracted from the MD ensembles can be used to identify the function-specific conserved dynamic couplings. These couplings are then easily modelled as a graph, and network analysis is used to extract epistatic communities and assign roles to residues based on their importance in the graph model. The choice of a relevant descriptor of functional dynamics has an impact on the ability to detect couplings that are involved in functional dynamics.

Here we demonstrated how the choice of relevant global and local descriptors returns a higher number of effective couplings (greater than 0), and in turn leads to interpretable graph models and communities. In other systems, when multiple descriptors can be used to quantify functional conformational change, it is expected that they will differently modulate the effect of coevolution coupling, which will be reflected in a different structure of the associated graph models. This suggests the use of DyNoPy to generate comparative models in proteins with multiple functions associated to distinct dynamical changes.

Mutations of L₁₆₂ and N₁₃₆ have not yet emerged in SHV-1, but they are detected by DyNoPy as core residues for communities. These residues are strongly coupled with other functional important residues, which play critical roles in protein stability and catalytic activity. The identification of these couplings shows high consistency with previous studies and highlights the importance of L₁₆₂ and N₁₃₆ in SHV-1 functional dynamics. Given their central role in these communities, mutations in L₁₆₂ and N₁₃₆ can significantly alter protein function, suggesting their potential for future evolutionary changes. However, their strong relationships with these critical functional residues also suggest that mutation at these sites would need to be balanced to maintain protein function, providing an explanation for why such mutations have not yet emerged in SHV-1 (Soskine and Tawfik, 2010). The ability of DyNoPy in detecting functionally important mutation sites was demonstrated via well-characterized mutation sites including R₂₀₅ and G₂₃₈ from SHV-1. Moreover, DyNoPy shows predictive ability on less-studied mutation sites such as G₁₅₆ and A_146, by detecting critical residue couplings that coevolved with functional motions.

Based on the knowledge we have gained from the analysis of SHV-1 functional protein dynamics, we suggest that in PDC-3, mutations at G₂₀₄ because of its significantly conserved dynamic couplings can lead to new ESBL/IRBL clinical variants. We suggest that DyNoPy can be used as a predictive tool to identify potential functional residues within this enzyme and guide future mutagenesis studies.

In summary, by integrating hidden evolutionary information with direct dynamic interactions, DyNoPy provides a powerful framework for identifying and analysing functional sites in proteins. The tool not only identifies key residues involved in local and global interactions but also improves our ability to predict silent residues with previously unknown roles for future experimental testing. Our application of DyNoPy to broad-spectrum β-lactamases ESBLs and IRBLs demonstrates its potential to address key medical challenges such as antibiotic resistance by providing valid predictions on protein evolution.

Methods

DyNoPy generates a graph representation of the protein structure that captures the couplings between amino acid residues contributing to the functional dynamics of the protein. Residues are represented as graph nodes, and conserved dynamic couplings are recorded as edges. Edge weights quantify the strength of these couplings. The model is built on two assumptions: residue pairs should have (a) coevolved and their (b) time-dependent interactions correlate with a functional conformational change.

Therefore, edge weights ( $J_{i j}$ ) for residue $i$ and $j$ are calculated as

J_{i j} = α γ_{i j} + β ρ_{i j} w h e r e α + β = 1

where $γ_{i j}$ is the scaled coevolution score and $ρ_{i j}$ is the degree of correlation with the selected functional conformational change. α and β are weights assigned to $γ_{i j}$ and $ρ_{i j}$ that have a sum of 1. The relative weight of the scaled coevolution score (α) is set to 0.5 in this study. When either of the assumptions listed above is not met, $J_{i j}$ is set to zero.

Scaled coevolution scores

The occurrence of residue–residue coevolution can be estimated and quantified using probabilistic models of correlated mutations from deep MSA. DyNoPy supports generation of the MSA using the HH-Suite package (Remmert et al., 2011) and calculation of scaled coevolution score ( $γ_{i j}$ ) using CCMpred (Seemayer et al., 2014) as per the protocol described in Bibik et al., 2024. For SHV-1 and PDC-3, hhblits returned 18,174 sequences (N_eff: 11.082) and 27,892 sequences (N_eff: 9.951). Sequences were detected from the UniRef30 (v2022_02) database (Mirdita et al., 2017). First, a pairwise residue coevolution matrix (C) is calculated, then these raw scores ( $c_{i j}$ ) are divided by the matrix mean (Equation 2). All scores ( $s_{i j}$ ) smaller than 1 are set to zero, and the remaining values are normalized by the maximum value (Equation 3):

s_{i j} = \frac{c_{i j}}{⟨ C ⟩}

γ_{i j} = {\begin{cases} 0, & s_{i j} < 1 \\ \frac{s_{i j}}{s_{m a x}}, & s_{i j} \geq 1 \end{cases}

Correlation with functional motions

The contribution of a residue pair to a selected functional motion is estimated by how much the change in interaction energy between the two residues over time is correlated with a CV describing the functional motion:

r_{i j} = c o r (ε_{i j} {(t)}_{d} (t))

ρ_{i j} = {\begin{cases} 0, & r_{i j} \leq 0.5 \\ r_{i j}, r_{i j} > 0.5 \end{cases}

where $ε_{i j} (t)$ is the pairwise non-bonded interaction energy (see details in Appendix 1) and $d (t)$ is the time-dependent value of the CV. Examples of CV and a discussion on the choice of the most relevant CV are presented in the ‘Results’ section. Correlation values smaller than 0.5 are set to 0. In the absence of detectable contributions to the functional dynamics of the system, the couplings extracted by DyNoPy will describe a pure evolutionary model, and the community detection method presented below will be equivalent to a direct decomposition of the residue coevolution network into units.

Graph representation and analysis of conserved dynamic couplings

All pairwise conserved dynamic couplings (Equation 1) are collected into a square matrix J. A graph is built from J using python-igraph v0.11 library (Csárdi and Nepusz, 2006). Nodes represent residues, and edges are drawn between nodes with positive $J_{i j}$ . Edge weights are set to $J_{i j}$ . The relative importance of the residues in this model of protein dynamics is calculated as EVC of the nodes (Newman, 2004). The residues involved in extensive correlated dynamics with other highly connected residues have higher EVC scores. Groups of residues contributing to important collective motions are detected by community analysis of the graph structure. The Girvan–Newman algorithm is used to extract the community structure (Newman, 2006). A meaningful community should contain at least three residues. Applying network analysis on the combined dynamics-coevolution matrix helps us extract higher-order interactions beyond pairwise coupling and detecting critical residues, which show multiple interactions with each other. Moreover, indirect long-range relationships, which would be hard to identify from numerical data, could be detected through community clustering. Community-based analysis offers a more comprehensive understanding of residue relationships and enables the visualization of residue couplings on the protein structure.

Adaptive sampling molecular dynamics simulations

MD simulation data was sourced from our previous studies (Olehnovics et al., 2021; Chen et al., 2024). To summarize, SHV-1 structural coordinates (PDB ID: 3N4I) were obtained from the Protein Data Bank and modified to the wild type by introducing the E104D mutation. Similarly, the PDC-3 structure was derived from PDC-1 (PDB ID: 4HEF) by a T105A substitution. Both enzymes were protonated at pH 7.0 using PropKa from the PlayMolecule platform (Martínez-Rosell et al., 2017). One disulfide bond between C₇₇ and C₁₂₃ was specified in SHV-1. Both structures were solvated with TIP3P water molecules in a periodic box with a box size of 10 Å. Ions were added to neutralize the overall charge of each system at 150 mM KCl. Amber force field ff14SB was used for all MD simulations (Maier et al., 2015). After an initial minimization of 1000 steps, both the enzymes were equilibrated for 5 ns in the NPT ensemble at 1 atmospheric pressure using the Berendsen barostat (Berendsen et al., 1984). The initial velocities for each simulation were sampled from the Boltzmann distribution at 300 K. Multiple Markov state model (MSM)-based adaptively sampled simulations were performed for both proteins based on the ACEMD engine (Doerr et al., 2016; Harvey et al., 2009). A canonical (NVT) ensemble with a Langevin thermostat (Davidchack et al., 2009) (damping coefficient of 0.1 ps−1) and a hydrogen mass repartitioning scheme were employed to achieve time steps of 4 fs. For SHV-1, each trajectory spanned 60 ns with a time step of 0.1 ns, with a total of 593 trajectories. In the case of PDC-3, 100 trajectories were collected, each containing 3000 frames, lasting 300 ns. To manage the extensive datasets efficiently, trajectories were strategically stridden to ensure that a minimum of 30,000 frames were preserved for each system. The resulting trajectories are summarized in Supplementary file 1d.

Calculation and selection of collective variables

DyNoPy works on the assumption that time-dependent interactions between critical residues, either having significant structural change or not, will correlate with functional conformational motions. Since MD simulation data is high-dimensional, a time-dependent CV is required to extract the most relevant information for the process under study. The usefulness of DyNoPy is dependent on the choice of the CVs. To guide the selection of CVs, we selected 12 distinct features: radius of gyration (R_g), the first principal component (PC1), partial PC1 (PC1_partial), the first time-lagged independent component (TC1), partial TC1 (TC1_partial), global root mean square deviation (gRMSD), partial RMSD (pRMSD), dynamical RMSD (dRMSD), global solvent-accessible surface area (gSASA), partial SASA (pSASA), active site pocket volume, and the number of hydrogen bonds (hbond). A description of the CVs, including the calculation methods and the residues used to calculate the partial variables, is detailed in Appendix 1. CVs were subsequently used as input features for DyNoPy. A good CV should appropriately describe protein functional motions. Thus, a CV that detects the highest number of residue couplings is expected to be the most suitable descriptor. The length of the MD simulations should be appropriate to effectively sample the desired functional process as described by the selected CV.

Appendix 1

Interaction energies (E)

The pairwise non-bonded interaction energies for the upper triangle of the N×N residue matrix are calculated using dyno_pwie.py, which is a wrapper to ccptraj from ambertools. The list of pairs is generated and depending on the number of available threads (n_t), ‘n’ sets of pairs are created, and each pair set is assigned to an instance of ccptraj for calculation of pairwise interaction energies. As this is a RAM-dependent operation, based on the size of the MD trajectory, care must be taken to not spawn too many instances of ccptraj as it will slow down each instance of the ccptraj calculation. For each residue pair ( $r_{i j}$ ), a separate file with coulomb ( $q_{i j}$ ) and van der Waals ( $w_{i j}$ ) interaction energies, as two separate columns, are saved to a h5py compressed file and the total interaction energy ( $ϵ_{i j})$ for each time step can be calculated from this data using Equation A1.

ϵ_{i j} = q_{i j} + w_{i j}

Collective variables description

Radius of gyration (Rg)

The radius of gyration (Rg) measures the compactness of the protein structure through the calculation of the root mean square distance of all atoms from the centre of mass. A lower Rg indicates a more compact structure, while a higher Rg suggests a more expanded or unfolded state. This CV was calculated using the MDAnalysis v2.7.0 (Gowers et al., 2016; Michaud-Agrawal et al., 2011), focusing on the whole protein for both enzymes.

The first principal component (PC1)

Principal component analysis identifies the major dynamical feature of a protein by reducing the dimensionality directly from molecular dynamics simulation data. PC1 represents the largest variance on the selected features across the simulation. Global PC1 was computed using PyEMMA v2.5.12 (Scherer et al., 2015), focusing on backbone torsion angles and χ1 angles for all residues, capturing the global dominant structural changes.

The partial PC1 (PC1_partial)

PC1_partial was calculated using the same methodology as PC1 but only focusing on functionally significant residues and essential secondary structures. This approach prioritizes the dynamics within regions critical to the protein’s function, thereby providing a more targeted analysis of pertinent conformational changes. It is particularly well-suited for studying proteins that are predominantly rigid with localized flexible regions.

Time-lagged independent component 1 (TC1)

Time-lagged independent component analysisidentifies slow, independent processes within the protein dynamics. TC1 captures the slowest conformational changes in the protein over time. It was computed globally using backbone torsion angles and χ1 angles for all residues, with PyEMMA v2.5.12 (Scherer et al., 2015).

Partial time-lagged independent component 1 (TC1_partial)

Similarly, TC1_partial was calculated, but focused on specific, functionally important regions. By analysing these key residues and structures, TC1_partial represents the slow conformational changes crucial within essential regions, which are highly corresponding with protein functions.

Global root mean square deviation (gRMSD)

The RMSD measures the average deviation of a protein’s atomic positions from a reference structure, typically the starting conformation. gRMSD provides an overview of the protein’s structural deviation over time. In our case, trajectories were first aligned, and Cα RMSD was calculated using MDAnalysis v2.7.0 (Gowers et al., 2016; Michaud-Agrawal et al., 2011), utilizing the starting conformation of each protein as a reference.

Partial root mean square deviation (pRMSD)

This CV is calculated similarly to gRMSD but focuses on specific residues or regions of interest. pRMSD provides insight into the structural stability of functionally important areas of the protein by focusing on catalytical residues and essential loops.

Dynamical root mean square deviation (dRMSD)

dRMSD is an extension of RMSD that considers the dynamic nature of protein motions by analysing fluctuations over time rather than static deviations. More specifically, instead of calculating RMSD to a fixed reference structure, it focuses on the difference between adjacent frames. This approach allows the analysis of short-timescale fluctuations and provides insight into the dynamic nature of protein motions over time. It was also calculated using MDAnalysis v2.7.0 (Gowers et al., 2016; Michaud-Agrawal et al., 2011) focusing on the Cα and offers a more nuanced understanding of the protein’s conformational landscape.

Global solvent-accessible surface area (gSASA)

SASA quantifies the surface area of the protein accessible to the solvent. gSASA was calculated using GROMACS v2020.1 (Abraham et al., 2015) and provides information about the protein’s overall exposure to the solvent, which is relevant for understanding folding and binding interactions.

Partial solvent-accessible surface area (pSASA)

pSASA focuses on the solvent exposure of specific active site residues, offering insights into how binding sites or functional regions of the protein interact with the solvent.

Active site pocket volume

The volume of the active site pocket was calculated using Mdpocket (Schmidtke et al., 2011). This CV is important for understanding the size and shape changes in the binding site, which can influence ligand binding and protein function. Upon importing the topology file and trajectories, Mdpocket autonomously identifies potential binding pockets within the protein structure. Subsequently, the relevant pocket was manually selected (Figure 2—figure supplement 3)

Number of hydrogen bonds (hbond)

The number of hydrogen bonds was calculated per frame using VMD v1.9.3 (Humphrey et al., 1996), with a distance threshold of 3.5 Å and an angle cut-off of 40°. This CV provides insight into the stability of the protein structure and interactions that are critical for maintaining its conformation and function.

Residues and regions used in partial CVs

For SHV-1, the backbone torsion angles and χ1 angles for catalytic important residues—S70, T71, K73, S130, N132, K234, and T235—and Ω-loop residues (R164-D179) were used as the input feature to calculate the partial CVs. Similarly, for PDC-3, the backbone torsion angles and χ1 angles for catalytic important residues—K67, Y150, N152, K315, T316, and G317—Ω-loop residues (G183-S226), and R2-loop residues (L280-Q310) were utilized to calculate all the partial variables.

Appendix 2

Other SHV-1 communities

Core residues for some of the communities identified by DyNoPy have never been studied in class A β-lactamases before. DyNoPy predicts these residues as essential due to their coevolution trends with many other residues and their critical role in class A β-lactamase dynamics (Appendix 2—figure 1).

D₂₆₇ is the core residue in community 3 (Appendix 2—figure 1A). This residue is located on the loop connecting β-sheet β9 and α-helix α12. It is outside the catalytic site and has not undergone essential substitutions, thus remaining unexplored in class A β-lactamases. However, DyNoPy indicates its potential importance due to its relationships with three known essential mutation sites: L₃₅, E₂₄₀, and A₁₈₇. L₃₅ and E₂₄₀ are predominant ESBL mutation sites, while A₁₈₇ substitution confers an inhibitor-resistant phenotype (Chang et al., 2001). The relationship between D₂₆₇ and these mutation sites suggests a trend towards coevolution, and interactions between D₂₆₇ and these mutation sites are important for protein dynamics.

Similarly, I₂₇₉, located also on α12, is the core residue of community 8 (Appendix 2—figure 1B). This residue forms relationships with R₂₄₄, a critical inhibitor resistant mutation site (Giakkoupi et al., 1998). Other residues within this community are predominantly positioned on the α1 and α12 helices, near the protein’s terminus, highlighting the significant contribution of I₂₇₉ to protein integrity and stability.

For community 9, I₁₅₅ serves as the core residue (Appendix 2—figure 1C). This community is relatively localized and does not encompass any known essential residues. Most of the residues involved in this community are spatially close to each other, suggesting that I₁₅₅ plays a vital role in the local dynamics, especially surrounding α-helices α7 and α9.

Appendix 2—figure 1

Download asset Open asset

Communities 3, 8, and 9 of SHV-1 β-lactamase.

All the residues are depicted as spheres on the protein structure. The core residue for each community is D267, I279, and I155, respectively. They are highlighted in red. Functional important residues are marked in cyan. (A) Community 3 of SHV-1, comprising 31 residues. (B) Community 8 of SHV-1, containing 14 residues. (C) Community 9 of SHV-1, with 7 residues.

Other PDC-3 communities

The core residues in the other PDC-3 communities have not been extensively studied. However, their detection by DyNoPy suggests a significant trend in co-evolution and highlights their crucial role in protein dynamics (Appendix 2—figure 2). This emphasizes the capability of DyNoPy to predict essential residues in previously unexplored proteins, potentially offering valuable insights for future experimental research.

E₄₉, D_206, and R₂₁₀ are core residues for community 3, a small community containing only 14 residues (Appendix 2—figure 2A). R₂₁₀ is the primary core residue in this community, linking to six residues, while both E₄₉ and R₂₁₀ are secondary core residues that show a relationship with four residues. Unlike other communities with widespread interactions, community 3 illustrates a localized relationship among Ω-loop residues, primarily on the short β5a and β5b β-sheets and the short helix α7a. Also, 8 out of 14 residues are Ω-loop residues, with the remainder located on adjacent loops or near the terminals of adjoining secondary structures, all of which are flexible regions. This community indicates that R₂₁₀ is crucial for maintaining local structural integrity and the stability of the Ω-loop.

G₂₀₂, the core residue of community 4, appears essential for active site stability by interacting with residues whose side chains point into the active site (Appendix 2—figure 2B). It also stabilizes two α-helices, α2 and α5, located adjacent to the active site. P₁₅₄ is a special mutation site in the PDC family. P₁₅₄L occur in PDC-73 and PDC-81, giving the protein a mild increase in resistance to ceftazidime (Berrazeg et al., 2015).

K_204a and R₂₀₇ are the central residues in community 5, each establishing six interactions and thus sharing an equal position of importance within this community. K₂₈₁ and K₃₅₁, which interact with both core residues, are highlighted in light yellow (Appendix 2—figure 2C). K_204a forms direct interactions with six residues, primarily located on the opposite side of the active site, either on the R2 loop or at the beginning of helix α11. This suggests that K_204a plays a crucial role in maintaining the conformation of the R2 site. Additionally, K_204a shows direct correlations with the catalytically significant residue K₃₁₅. Although R₂₀₇ is spatially close to K_204a, it interacts with residues on loops that are distant from the catalytic site.

Appendix 2—figure 2

Download asset Open asset

Communities 3, 4, and 5 of PDC-3 β-lactamase.

All the residues are depicted as spheres on the protein structure. The core residue for each community is highlighted in red, while purple is used to emphasize the secondary core residue. Residues that interact with both cores are coloured in light yellow. Functional important residues are marked in cyan. (A) Community 3 of PDC-3, comprising 14 residues with R₂₁₀ being the primary core residue. (B) Community 4 of PDC-3, containing 25 residues and is centred by G₂₀₂. (C) Community 5 of PDC-3, embracing 10 residues and having two core residues K₂₀₄ and R₂₀₇.

Data availability

All files required to run the simulations (topology, coordinates, input), processed trajectories (xtc), corresponding coordinates (pdb), can be downloaded from https://doi.org/10.57760/sciencedb.15876 (PDC-3) and https://doi.org/10.5281/zenodo.13693144 (SHV-1). DyNoPy is available at https://github.com/alepandini/DyNoPy, (copy archived at Pandini, 2024).

The following previously published data sets were used

1. Haider S
(2024) Science Data Bank
Ω-Loop mutations control the dynamics of the active site by modulating a network of hydrogen bonds in PDC-3 β-lactamase.

https://doi.org/10.57760/sciencedb.15876
1. Haider S
(2024) Zenodo
Functionally Important Residues from Graph Analysis of Co-evolved Dynamical Couplings.

https://doi.org/10.5281/zenodo.13693144

References

1. Abraham MJ
2. Murtola T
3. Schulz R
4. Páll S
5. Smith JC
6. Hess B
7. Lindahl E
(2015) GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers
SoftwareX 1–2:19–25.

https://doi.org/10.1016/j.softx.2015.06.001
- Google Scholar
(2023) Detailed investigation of catalytically important residues of class A β-lactamase
Journal of Biomolecular Structure & Dynamics 41:2046–2073.

https://doi.org/10.1080/07391102.2021.2023645
- PubMed
- Google Scholar
1. Ambler RP
2. Coulson AF
3. Frère JM
4. Ghuysen JM
5. Joris B
6. Forsman M
7. Levesque RC
8. Tiraby G
9. Waley SG
(1991) A standard numbering scheme for the class A beta-lactamases
The Biochemical Journal 276 (Pt 1):269–270.

https://doi.org/10.1042/bj2760269
- PubMed
- Google Scholar
1. Baek M
2. DiMaio F
3. Anishchenko I
4. Dauparas J
5. Ovchinnikov S
6. Lee GR
7. Wang J
8. Cong Q
9. Kinch LN
10. Schaeffer RD
11. Millán C
12. Park H
13. Adams C
14. Glassman CR
15. DeGiovanni A
16. Pereira JH
17. Rodrigues AV
18. van Dijk AA
19. Ebrecht AC
20. Opperman DJ
21. Sagmeister T
22. Buhlheller C
23. Pavkov-Keller T
24. Rathinaswamy MK
25. Dalwadi U
26. Yip CK
27. Burke JE
28. Garcia KC
29. Grishin NV
30. Adams PD
31. Read RJ
32. Baker D
(2021) Accurate prediction of protein structures and interactions using a three-track neural network
Science 373:871–876.

https://doi.org/10.1126/science.abj8754
- PubMed
- Google Scholar
1. Barnes MD
2. Taracila MA
3. Rutter JD
4. Bethel CR
5. Galdadas I
6. Hujer AM
7. Caselli E
8. Prati F
9. Dekker JP
10. Papp-Wallace KM
11. Haider S
12. Bonomo RA
(2018) Deciphering the evolution of cephalosporin resistance to ceftolozane-tazobactam in Pseudomonas aeruginosa
mBio 9:e02085-18.

https://doi.org/10.1128/mBio.02085-18
- PubMed
- Google Scholar
(2009) Characterization of a novel extended-spectrum TEM-type beta-lactamase, TEM-164, in a clinical strain of Klebsiella pneumoniae in Tunisia
Microbial Drug Resistance 15:195–199.

https://doi.org/10.1089/mdr.2009.0900
- PubMed
- Google Scholar
(1984) Molecular dynamics with coupling to an external bath
The Journal of Chemical Physics 81:3684–3690.

https://doi.org/10.1063/1.448118
- Google Scholar
(2015) Mutations in β-Lactamase AmpC Increase resistance of Pseudomonas aeruginosa isolates to antipseudomonal cephalosporins
Antimicrobial Agents and Chemotherapy 59:6248–6255.

https://doi.org/10.1128/AAC.00825-15
- PubMed
- Google Scholar
(2006) Role of Asp104 in the SHV β-Lactamase
Antimicrobial Agents and Chemotherapy 50:4124–4131.

https://doi.org/10.1128/AAC.00848-06
- Google Scholar
1. Bibik P
2. Alibai S
3. Pandini A
4. Dantu SC
(2024) PyCoM: a python library for large-scale analysis of residue-residue coevolution data
Bioinformatics 40:btae166.

https://doi.org/10.1093/bioinformatics/btae166
- PubMed
- Google Scholar
(2023) Rapid protein stability prediction using deep learning representations
eLife 12:e82593.

https://doi.org/10.7554/eLife.82593
- PubMed
- Google Scholar
(2007) Emergence and dissemination of BEL-1-producing Pseudomonas aeruginosa isolates in Belgium
Antimicrobial Agents and Chemotherapy 51:1584–1585.

https://doi.org/10.1128/AAC.01603-06
- PubMed
- Google Scholar
1. Bös F
2. Pleiss J
(2008) Conserved water molecules stabilize the Omega-loop in class A beta-lactamases
Antimicrobial Agents and Chemotherapy 52:1072–1079.

https://doi.org/10.1128/AAC.01035-07
- PubMed
- Google Scholar
(2020) Computational modeling of protein stability: quantitative analysis reveals solutions to pervasive problems
Structure 28:717–726.

https://doi.org/10.1016/j.str.2020.04.003
- Google Scholar
1. Bush K
(2013) Proliferation and significance of clinically relevant β-lactamases
Annals of the New York Academy of Sciences 1277:84–90.

https://doi.org/10.1111/nyas.12023
- PubMed
- Google Scholar
1. Bush K
(2018) Past and present perspectives on β-lactamases
Antimicrobial Agents and Chemotherapy 62:e01076-18.

https://doi.org/10.1128/AAC.01076-18
- PubMed
- Google Scholar
(2023) Discovering functionally important sites in proteins
Nature Communications 14:4175.

https://doi.org/10.1038/s41467-023-39909-0
- PubMed
- Google Scholar
1. Campitelli P
2. Modi T
3. Kumar S
4. Ozkan SB
(2020) The role of conformational dynamics and allostery in modulating protein evolution
Annual Review of Biophysics 49:267–288.

https://doi.org/10.1146/annurev-biophys-052118-115517
- PubMed
- Google Scholar
1. Cao T-P
2. Yi H
3. Dhanasingh I
4. Ghosh S
5. Choi JM
6. Lee KH
7. Ryu S
8. Kim HS
9. Lee SH
(2020) Non-catalytic-region mutations conferring transition of class a β-lactamases into ESBLs
Frontiers in Molecular Biosciences 7:598998.

https://doi.org/10.3389/fmolb.2020.598998
- PubMed
- Google Scholar
1. Chang FY
2. Siu LK
3. Fung CP
4. Huang MH
5. Ho M
(2001) Diversity of SHV and TEM beta-lactamases in Klebsiella pneumoniae: gene evolution in Northern Taiwan and two novel beta-lactamases, SHV-25 and SHV-26
Antimicrobial Agents and Chemotherapy 45:2407–2413.

https://doi.org/10.1128/AAC.45.9.2407-2413.2001
- PubMed
- Google Scholar
1. Chen Z
2. Rappert S
3. Sun J
4. Zeng AP
(2011) Integrating molecular dynamics and co-evolutionary analysis for reliable target prediction and deregulation of the allosteric inhibition of aspartokinase for amino acid production
Journal of Biotechnology 154:248–254.

https://doi.org/10.1016/j.jbiotec.2011.05.005
- PubMed
- Google Scholar
Preprint
1. Chen S
2. Mack AR
3. Hujer AM
4. Bethel CR
5. Bonomo RA
6. Haider S
(2024) Ω-Loop Mutations Control the Dynamics of the Active Site by Modulating a Network of Hydrogen Bonds in PDC-3 β-Lactamase
bioRxiv.

https://doi.org/10.1101/2024.02.04.578824
- Google Scholar
Preprint
1. Colque CA
2. Tomatis PE
3. Albarracín Orio AG
4. Dotta G
5. Moreno DM
6. Hedemann LG
7. Hickman RA
8. Sommer LM
9. Feliziani S
10. Moyano AJ
11. Bonomo RA
12. Johansen HK
13. Molin S
14. Vila AJ
15. Smania AM
(2021) Development of antibiotic resistance reveals diverse evolutionary pathways to face the complex and dynamic environment of a long-term treated patient
bioRxiv.

https://doi.org/10.1101/2021.05.14.444257
- Google Scholar
(2001) SHV-27, a novel cefotaxime-hydrolysing beta-lactamase, identified in Klebsiella pneumoniae isolates from a Brazilian hospital
The Journal of Antimicrobial Chemotherapy 47:463–465.

https://doi.org/10.1093/jac/47.4.463
- PubMed
- Google Scholar
Book
1. Csárdi G
2. Nepusz T
(2006)
The Igraph Software Package for Complex Network Research

Semantic Scholar.
- Google Scholar
(2009) Langevin thermostat for rigid body dynamics
The Journal of Chemical Physics 130:234101.

https://doi.org/10.1063/1.3149788
- PubMed
- Google Scholar
1. Ding D
2. Shaw AY
3. Sinai S
4. Rollins N
5. Prywes N
6. Savage DF
7. Laub MT
8. Marks DS
(2024) Protein design using structure-based residue preferences
Nature Communications 15:1639.

https://doi.org/10.1038/s41467-024-45621-4
- PubMed
- Google Scholar
(2016) HTMD: high-throughput molecular dynamics for molecular discovery
Journal of Chemical Theory and Computation 12:1845–1852.

https://doi.org/10.1021/acs.jctc.6b00049
- Google Scholar
1. Dunham AS
2. Beltrao P
(2021) Exploring amino acid functions in a deep mutational landscape
Molecular Systems Biology 17:e10305.

https://doi.org/10.15252/msb.202110305
- PubMed
- Google Scholar
1. Estabrook RA
2. Luo J
3. Purdy MM
4. Sharma V
5. Weakliem P
6. Bruice TC
7. Reich NO
(2005) Statistical coevolution analysis and molecular dynamics: identification of amino acid pairs essential for catalysis
PNAS 102:994–999.

https://doi.org/10.1073/pnas.0409128102
- PubMed
- Google Scholar
(2018) Defining the architecture of KPC-2 Carbapenemase: identifying allosteric networks to fight antibiotics resistance
Scientific Reports 8:12916.

https://doi.org/10.1038/s41598-018-31176-0
- PubMed
- Google Scholar
1. Galdadas I
2. Qu S
3. Oliveira ASF
4. Olehnovics E
5. Mack AR
6. Mojica MF
7. Agarwal PK
8. Tooke CL
9. Gervasio FL
10. Spencer J
11. Bonomo RA
12. Mulholland AJ
13. Haider S
(2021) Allosteric communication in class A β-lactamases occurs via cooperative coupling of loop dynamics
eLife 10:e66567.

https://doi.org/10.7554/eLife.66567
- PubMed
- Google Scholar
(1998) Substitution of Arg-244 by Cys or Ser in SHV-1 and SHV-5 beta-lactamases confers resistance to mechanism-based inhibitors and reduces catalytic efficiency of the enzymes
FEMS Microbiology Letters 160:49–54.

https://doi.org/10.1111/j.1574-6968.1998.tb12889.x
- PubMed
- Google Scholar
Conference
1. Gowers R
2. Linke M
3. Barnoud J
4. Reddy T
5. Melo M
6. Seyler S
7. Domański J
8. Dotson D
9. Buchoux S
10. Kenney I
11. Beckstein O
(2016) MDAnalysis: A Python Package for the Rapid Analysis of Molecular Dynamics Simulations
Python in Science Conference.

https://doi.org/10.25080/Majora-629e541a-00e
- Google Scholar
(2017) Patterns of coevolving amino acids unveil structural and dynamical domains
PNAS 114:E10612–E10621.

https://doi.org/10.1073/pnas.1712021114
- PubMed
- Google Scholar
(2009) ACEMD: accelerating biomolecular dynamics in the microsecond time scale
Journal of Chemical Theory and Computation 5:1632–1639.

https://doi.org/10.1021/ct9000685
- PubMed
- Google Scholar
1. Henzler-Wildman K
2. Kern D
(2007) Dynamic personalities of proteins
Nature 450:964–972.

https://doi.org/10.1038/nature06522
- PubMed
- Google Scholar
(2022) Predicting and interpreting large-scale mutagenesis data using analyses of protein stability and conservation
Cell Reports 38:110207.

https://doi.org/10.1016/j.celrep.2021.110207
- PubMed
- Google Scholar
1. Hopf TA
2. Ingraham JB
3. Poelwijk FJ
4. Schärfe CPI
5. Springer M
6. Sander C
7. Marks DS
(2017) Mutation effects predicted from sequence co-variation
Nature Biotechnology 35:128–135.

https://doi.org/10.1038/nbt.3769
- PubMed
- Google Scholar
(1996) VMD: visual molecular dynamics
Journal of Molecular Graphics 14:33–38.

https://doi.org/10.1016/0263-7855(96)00018-5
- PubMed
- Google Scholar
1. Jacoby GA
(2009) AmpC beta-lactamases
Clinical Microbiology Reviews 22:161–182.

https://doi.org/10.1128/CMR.00036-08
- PubMed
- Google Scholar
1. James LC
2. Tawfik DS
(2003) Conformational diversity and protein evolution--a 60-year-old hypothesis revisited
Trends in Biochemical Sciences 28:361–368.

https://doi.org/10.1016/S0968-0004(03)00135-X
- PubMed
- Google Scholar
1. Jumper J
2. Evans R
3. Pritzel A
4. Green T
5. Figurnov M
6. Ronneberger O
7. Tunyasuvunakool K
8. Bates R
9. Žídek A
10. Potapenko A
11. Bridgland A
12. Meyer C
13. Kohl SAA
14. Ballard AJ
15. Cowie A
16. Romera-Paredes B
17. Nikolov S
18. Jain R
19. Adler J
20. Back T
21. Petersen S
22. Reiman D
23. Clancy E
24. Zielinski M
25. Steinegger M
26. Pacholska M
27. Berghammer T
28. Bodenstein S
29. Silver D
30. Vinyals O
31. Senior AW
32. Kavukcuoglu K
33. Kohli P
34. Hassabis D
(2021) Highly accurate protein structure prediction with AlphaFold
Nature 596:583–589.

https://doi.org/10.1038/s41586-021-03819-2
- PubMed
- Google Scholar
1. Kuzin AP
2. Nukaga M
3. Nukaga Y
4. Hujer AM
5. Bonomo RA
6. Knox JR
(1999) Structure of the SHV-1 beta-lactamase
Biochemistry 38:5720–5727.

https://doi.org/10.1021/bi990136d
- PubMed
- Google Scholar
1. Lange OF
2. Grubmüller H
(2006) Generalized correlation for biomolecular dynamics
Proteins 62:1053–1061.

https://doi.org/10.1002/prot.20784
- PubMed
- Google Scholar
(2016) A review of SHV extended-spectrum β-Lactamases: neglected yet ubiquitous
Frontiers in Microbiology 7:1374.

https://doi.org/10.3389/fmicb.2016.01374
- PubMed
- Google Scholar
1. Lin Z
2. Akin H
3. Rao R
4. Hie B
5. Zhu Z
6. Lu W
7. Smetanin N
8. Verkuil R
9. Kabeli O
10. Shmueli Y
11. Dos Santos Costa A
12. Fazel-Zarandi M
13. Sercu T
14. Candido S
15. Rives A
(2023) Evolutionary-scale prediction of atomic-level protein structure with a language model
Science 379:1123–1130.

https://doi.org/10.1126/science.ade2574
- PubMed
- Google Scholar
1. Liu Y
2. Bahar I
(2012) Sequence evolution correlates with structural dynamics
Molecular Biology and Evolution 29:2253–2263.

https://doi.org/10.1093/molbev/mss097
- PubMed
- Google Scholar
1. Lu S
2. Hu L
3. Lin H
4. Judge A
5. Rivera P
6. Palaniappan M
7. Sankaran B
8. Wang J
9. Prasad BVV
10. Palzkill T
(2022) An active site loop toggles between conformations to control antibiotic hydrolysis and inhibition potency for CTX-M β-lactamase drug-resistance enzymes
Nature Communications 13:6726.

https://doi.org/10.1038/s41467-022-34564-3
- Google Scholar
(2015) ff14SB: improving the accuracy of protein side chain and backbone parameters from ff99SB
Journal of Chemical Theory and Computation 11:3696–3713.

https://doi.org/10.1021/acs.jctc.5b00255
- PubMed
- Google Scholar
(2012) Protein structure prediction from sequence variation
Nature Biotechnology 30:1072–1080.

https://doi.org/10.1038/nbt.2419
- PubMed
- Google Scholar
(2017) PlayMolecule proteinprepare: a web application for protein preparation for molecular dynamics simulations
Journal of Chemical Information and Modeling 57:1511–1516.

https://doi.org/10.1021/acs.jcim.7b00190
- Google Scholar
(1998) Catalytic properties of class A beta-lactamases: efficiency and diversity
The Biochemical Journal 330 (Pt 2):581–598.

https://doi.org/10.1042/bj3300581
- PubMed
- Google Scholar
1. Matreyek KA
2. Starita LM
3. Stephany JJ
4. Martin B
5. Chiasson MA
6. Gray VE
7. Kircher M
8. Khechaduri A
9. Dines JN
10. Hause RJ
11. Bhatia S
12. Evans WE
13. Relling MV
14. Yang W
15. Shendure J
16. Fowler DM
(2018) Multiplex assessment of protein variant abundance by massively parallel sequencing
Nature Genetics 50:874–882.

https://doi.org/10.1038/s41588-018-0122-z
- PubMed
- Google Scholar
1. Medeiros AA
(1997) β-Lactamases: quality and resistance
Clinical Microbiology and Infection 3:4S2–4S9.

https://doi.org/10.1016/S1198-743X(14)65030-8
- Google Scholar
(2011) MDAnalysis: a toolkit for the analysis of molecular dynamics simulations
Journal of Computational Chemistry 32:2319–2327.

https://doi.org/10.1002/jcc.21787
- PubMed
- Google Scholar
(2017) Uniclust databases of clustered and deeply annotated protein sequences and alignments
Nucleic Acids Research 45:D170–D176.

https://doi.org/10.1093/nar/gkw1081
- PubMed
- Google Scholar
1. Negre CFA
2. Morzan UN
3. Hendrickson HP
4. Pal R
5. Lisi GP
6. Loria JP
7. Rivalta I
8. Ho J
9. Batista VS
(2018) Eigenvector centrality for characterization of protein allosteric pathways
PNAS 115:E12201–E12208.

https://doi.org/10.1073/pnas.1810452115
- PubMed
- Google Scholar
1. Neubauer S
2. Madzgalla S
3. Marquet M
4. Klabunde A
5. Büttner B
6. Göhring A
7. Brandt C
8. Feller K-H
9. Pletz MW
10. Makarewicz O
(2020) A genotype-phenotype correlation study of shv β-lactamases offers new insight into SHV resistance profiles
Antimicrobial Agents and Chemotherapy 64:e02293-19.

https://doi.org/10.1128/AAC.02293-19
- PubMed
- Google Scholar
(2013) Structural dynamics flexibility informs function and evolution at a proteome scale
Evolutionary Applications 6:423–433.

https://doi.org/10.1111/eva.12052
- PubMed
- Google Scholar
1. Newman MEJ
(2004) Detecting community structure in networks
The European Physical Journal B - Condensed Matter 38:321–330.

https://doi.org/10.1140/epjb/e2004-00124-y
- Google Scholar
1. Newman MEJ
(2006) Finding community structure in networks using the eigenvectors of matrices
Physical Review E 74:036104.

https://doi.org/10.1103/PhysRevE.74.036104
- Google Scholar
1. Nicolas MH
2. Jarlier V
3. Honore N
4. Philippon A
5. Cole ST
(1989) Molecular characterization of the gene encoding SHV-3 beta-lactamase responsible for transferable cefotaxime resistance in clinical isolates of Klebsiella pneumoniae
Antimicrobial Agents and Chemotherapy 33:2096–2100.

https://doi.org/10.1128/AAC.33.12.2096
- PubMed
- Google Scholar
1. Olehnovics E
2. Yin J
3. Pérez A
4. De Fabritiis G
5. Bonomo RA
6. Bhowmik D
7. Haider S
(2021) The role of hydrophobic nodes in the dynamics of class a β-lactamases
Frontiers in Microbiology 12:720991.

https://doi.org/10.3389/fmicb.2021.720991
- PubMed
- Google Scholar
1. Osuna S
(2021) The challenge of predicting distal active site mutations in computational enzyme design
WIREs Computational Molecular Science 11:1502.

https://doi.org/10.1002/wcms.1502
- Google Scholar
1. Pagan-Rodriguez D
2. Zhou X
3. Simmons R
4. Bethel CR
5. Hujer AM
6. Helfand MS
7. Jin Z
8. Guo B
9. Anderson VE
10. Ng LM
11. Bonomo RA
(2004) Tazobactam inactivation of SHV-1 and the inhibitor-resistant Ser130 -->Gly SHV-1 beta-lactamase: insights into the mechanism of inhibition
The Journal of Biological Chemistry 279:19494–19501.

https://doi.org/10.1074/jbc.M311669200
- PubMed
- Google Scholar
1. Palzkill T
(2018) Structural and mechanistic basis for extended-spectrum drug-resistance mutations in altering the specificity of tem, ctx-m, and kpc β-lactamases
Frontiers in Molecular Biosciences 5:16.

https://doi.org/10.3389/fmolb.2018.00016
- PubMed
- Google Scholar
Software
1. Pandini A
(2024) DyNoPy (dynamics based network comparisons in python), version swh:1:rev:2291e30a69c7e8ac743a62ad7a9c79a0bdcd0bc9
Software Heritage.

https://archive.softwareheritage.org/swh:1:dir:2a4390c9a521fd4a7e2260a5f738cdcd2570c6ea;origin=https://github.com/alepandini/DyNoPy;visit=swh:1:snp:604ebe1ee6675b0f9cc531b176ebc232693cad98;anchor=swh:1:rev:2291e30a69c7e8ac743a62ad7a9c79a0bdcd0bc9
(2015) Amino acid positions subject to multiple coevolutionary constraints can be robustly identified by their eigenvector network centrality scores
Proteins 83:2293–2306.

https://doi.org/10.1002/prot.24948
- PubMed
- Google Scholar
1. Parwana D
2. Gu J
3. Chen S
4. Bethel CR
5. Marshall E
6. Hujer AM
7. Bonomo RA
8. Haider S
(2024) The structural role of N170 in substrate-assisted deacylation in KPC-2 β-Lactamase
Angewandte Chemie 63:e202317315.

https://doi.org/10.1002/anie.202317315
- PubMed
- Google Scholar
1. Philippon A
2. Arlet G
3. Labia R
4. Iorga BI
(2022) Class c β-lactamases: molecular characteristics
Clinical Microbiology Reviews 35:e0015021.

https://doi.org/10.1128/cmr.00150-21
- PubMed
- Google Scholar
(2016) The context-dependence of mutations: a linkage of formalisms
PLOS Computational Biology 12:e1004771.

https://doi.org/10.1371/journal.pcbi.1004771
- PubMed
- Google Scholar
(2003) Emergence in Klebsiella pneumoniae of a chromosome-encoded shv β-lactamase that compromises the efficacy of imipenem
Antimicrobial Agents and Chemotherapy 47:755–758.

https://doi.org/10.1128/AAC.47.2.755-758.2003
- Google Scholar
1. Poirel L
2. Docquier J-D
3. De Luca F
4. Verlinde A
5. Ide L
6. Rossolini GM
7. Nordmann P
(2010) BEL-2, an extended-spectrum β-lactamase with increased activity toward expanded-spectrum cephalosporins in Pseudomonas aeruginosa
Antimicrobial Agents and Chemotherapy 54:533–535.

https://doi.org/10.1128/AAC.00859-09
- Google Scholar
(2015) SPECTRUS: a dimensionality reduction approach for identifying dynamical domains in protein complexes from limited structural datasets
Structure 23:1516–1525.

https://doi.org/10.1016/j.str.2015.05.022
- Google Scholar
1. Poole K
(2004) Resistance to beta-lactam antibiotics
Cellular and Molecular Life Sciences 61:2200–2223.

https://doi.org/10.1007/s00018-004-4060-9
- PubMed
- Google Scholar
1. Pozzi C
2. De Luca F
3. Benvenuti M
4. Poirel L
5. Nordmann P
6. Rossolini GM
7. Mangani S
8. Docquier J-D
(2016) Crystal structure of the Pseudomonas aeruginosa bel-1 extended-spectrum β-lactamase and its complexes with moxalactam and imipenem
Antimicrobial Agents and Chemotherapy 60:7189–7199.

https://doi.org/10.1128/AAC.00936-16
- PubMed
- Google Scholar
1. Radivojac P
2. Clark WT
3. Oron TR
4. Schnoes AM
5. Wittkop T
6. Sokolov A
7. Graim K
8. Funk C
9. Verspoor K
10. Ben-Hur A
11. Pandey G
12. Yunes JM
13. Talwalkar AS
14. Repo S
15. Souza ML
16. Piovesan D
17. Casadio R
18. Wang Z
19. Cheng J
20. Fang H
21. Gough J
22. Koskinen P
23. Törönen P
24. Nokso-Koivisto J
25. Holm L
26. Cozzetto D
27. Buchan DWA
28. Bryson K
29. Jones DT
30. Limaye B
31. Inamdar H
32. Datta A
33. Manjari SK
34. Joshi R
35. Chitale M
36. Kihara D
37. Lisewski AM
38. Erdin S
39. Venner E
40. Lichtarge O
41. Rentzsch R
42. Yang H
43. Romero AE
44. Bhat P
45. Paccanaro A
46. Hamp T
47. Kaßner R
48. Seemayer S
49. Vicedo E
50. Schaefer C
51. Achten D
52. Auer F
53. Boehm A
54. Braun T
55. Hecht M
56. Heron M
57. Hönigschmid P
58. Hopf TA
59. Kaufmann S
60. Kiening M
61. Krompass D
62. Landerer C
63. Mahlich Y
64. Roos M
65. Björne J
66. Salakoski T
67. Wong A
68. Shatkay H
69. Gatzmann F
70. Sommer I
71. Wass MN
72. Sternberg MJE
73. Škunca N
74. Supek F
75. Bošnjak M
76. Panov P
77. Džeroski S
78. Šmuc T
79. Kourmpetis YAI
80. van Dijk ADJ
81. ter Braak CJF
82. Zhou Y
83. Gong Q
84. Dong X
85. Tian W
86. Falda M
87. Fontana P
88. Lavezzo E
89. Di Camillo B
90. Toppo S
91. Lan L
92. Djuric N
93. Guo Y
94. Vucetic S
95. Bairoch A
96. Linial M
97. Babbitt PC
98. Brenner SE
99. Orengo C
100. Rost B
101. Mooney SD
102. Friedberg I
(2013) A large-scale evaluation of computational protein function prediction
Nature Methods 10:221–227.

https://doi.org/10.1038/nmeth.2340
- PubMed
- Google Scholar
(2011) HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment
Nature Methods 9:173–175.

https://doi.org/10.1038/nmeth.1818
- PubMed
- Google Scholar
(2021) DynaMut2: Assessing changes in stability and flexibility upon single and multiple point missense mutations
Protein Science 30:60–69.

https://doi.org/10.1002/pro.3942
- PubMed
- Google Scholar
1. Russ WP
2. Figliuzzi M
3. Stocker C
4. Barrat-Charlaix P
5. Socolich M
6. Kast P
7. Hilvert D
8. Monasson R
9. Cocco S
10. Weigt M
11. Ranganathan R
(2020) An evolution-based model for designing chorismate mutase enzymes
Science 369:440–445.

https://doi.org/10.1126/science.aba3304
- PubMed
- Google Scholar
(2015) PyEMMA 2: a software package for estimation, validation, and analysis of markov models
Journal of Chemical Theory and Computation 11:5525–5542.

https://doi.org/10.1021/acs.jctc.5b00743
- PubMed
- Google Scholar
(2011) MDpocket: open-source cavity detection and characterization on molecular dynamics trajectories
Bioinformatics 27:3276–3285.

https://doi.org/10.1093/bioinformatics/btr550
- PubMed
- Google Scholar
(2014) CCMpred--fast and precise prediction of protein residue-residue contacts from correlated mutations
Bioinformatics 30:3128–3130.

https://doi.org/10.1093/bioinformatics/btu500
- PubMed
- Google Scholar
1. Soskine M
2. Tawfik DS
(2010) Mutational effects and the evolution of new protein functions
Nature Reviews. Genetics 11:572–582.

https://doi.org/10.1038/nrg2808
- PubMed
- Google Scholar
1. Stenson PD
2. Mort M
3. Ball EV
4. Evans K
5. Hayden M
6. Heywood S
7. Hussain M
8. Phillips AD
9. Cooper DN
(2017) The Human Gene Mutation Database: towards a comprehensive repository of inherited mutation data for medical research, genetic diagnosis and next-generation sequencing studies
Human Genetics 136:665–677.

https://doi.org/10.1007/s00439-017-1779-6
- PubMed
- Google Scholar
(2015) From residue coevolution to protein conformational ensembles and functional dynamics
PNAS 112:13567–13572.

https://doi.org/10.1073/pnas.1508584112
- PubMed
- Google Scholar
1. Wang J
2. Zhao Y
3. Wang Y
4. Huang J
(2013) Molecular dynamics simulations and statistical coupling analysis reveal functional coevolution network of oncogenic mutations in the CDKN2A-CDK6 complex
FEBS Letters 587:136–141.

https://doi.org/10.1016/j.febslet.2012.11.001
- PubMed
- Google Scholar
1. Wodak SJ
2. Paci E
3. Dokholyan NV
4. Berezovsky IN
5. Horovitz A
6. Li J
7. Hilser VJ
8. Bahar I
9. Karanicolas J
10. Stock G
11. Hamm P
12. Stote RH
13. Eberhardt J
14. Chebaro Y
15. Dejaegere A
16. Cecchini M
17. Changeux J-P
18. Bolhuis PG
19. Vreede J
20. Faccioli P
21. Orioli S
22. Ravasio R
23. Yan L
24. Brito C
25. Wyart M
26. Gkeka P
27. Rivalta I
28. Palermo G
29. McCammon JA
30. Panecka-Hofman J
31. Wade RC
32. Di Pizio A
33. Niv MY
34. Nussinov R
35. Tsai C-J
36. Jang H
37. Padhorny D
38. Kozakov D
39. McLeish T
(2019) Allostery in its many disguises: from theory to applications
Structure 27:566–578.

https://doi.org/10.1016/j.str.2019.01.003
- PubMed
- Google Scholar
1. Wu Z
2. Kan SBJ
3. Lewis RD
4. Wittmann BJ
5. Arnold FH
(2019) Machine learning-assisted directed protein evolution with combinatorial libraries
PNAS 116:8852–8858.

https://doi.org/10.1073/pnas.1901979116
- PubMed
- Google Scholar
(2024) Key interaction networks: Identifying evolutionarily conserved non-covalent interaction networks across protein families
Protein Science 33:e4911.

https://doi.org/10.1002/pro.4911
- PubMed
- Google Scholar

Article and author information

Author details

Manming Xu

UCL School of Pharmacy, London, United Kingdom

Contribution
Formal analysis, Validation, Investigation, Visualization, Writing - original draft

Contributed equally with
Sarath Chandra Dantu and Shozeb Haider

Competing interests
No competing interests declared
Sarath Chandra Dantu

Department of Computer Science, Brunel University London, Uxbridge, United Kingdom

Contribution
Conceptualization, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing - original draft, Writing – review and editing

Contributed equally with
Manming Xu and Shozeb Haider

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0003-2019-5311
James A Garnett

Centre for Host-Microbiome Interactions, Faculty of Dentistry, Oral & Craniofacial Sciences, King’s College London, London, United Kingdom

Contribution
Conceptualization, Investigation, Writing – review and editing

Competing interests
No competing interests declared
Robert A Bonomo
1. Research Service, Louis Stokes Cleveland Department of Veterans Affairs Medical Center, Cleveland, United States
2. Department of Molecular Biology and Microbiology, Case Western Reserve University School of Medicine, Cleveland, United States
3. Department of Medicine, Case Western Reserve University School of Medicine, Cleveland, United States
4. Departments of Pharmacology, Biochemistry, and Proteomics and Bioinformatics Case Western Reserve University School of Medicine, Cleveland, United States
5. CWRU-Cleveland VAMC Center for Antimicrobial Resistance and Epidemiology (Case VA CARES), Cleveland, United States
Contribution
Validation, Project administration, Writing – review and editing

Competing interests
No competing interests declared
Alessandro Pandini

Department of Computer Science, Brunel University London, Uxbridge, United Kingdom

Contribution
Conceptualization, Software, Formal analysis, Supervision, Validation, Investigation, Visualization, Methodology, Project administration, Writing – review and editing

For correspondence
alessandro.pandini@brunel.ac.uk

Competing interests
No competing interests declared
Shozeb Haider
1. UCL School of Pharmacy, London, United Kingdom
2. University of Tabuk (PFSCBR), Tabuk, Saudi Arabia
3. UCL Center for Advanced Research Computing, University College London, London, United Kingdom
Contribution
Conceptualization, Resources, Formal analysis, Supervision, Validation, Investigation, Visualization, Methodology, Project administration, Writing – review and editing

Contributed equally with
Manming Xu and Sarath Chandra Dantu

For correspondence
shozeb.haider@ucl.ac.uk

Competing interests
Reviewing editor, eLife

"This ORCID iD identifies the author of this article:" 0000-0003-2650-2925

Funding

Leverhulme Trust (RPG-2017-222)

James A Garnett
Alessandro Pandini

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

SCD was supported by Leverhulme Trust grant RPG-2017-222 awarded to AP and JAG. The authors would like to thank Arianna Fornili for insightful suggestions on the design of DyNoPy methodology.

Version history

Preprint posted: November 3, 2024
Sent for peer review: November 8, 2024
Reviewed Preprint version 1: January 16, 2025
Reviewed Preprint version 2: March 12, 2025
Version of Record published: March 28, 2025

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.105005. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.