Structure-guided secretome analysis of gall-forming microbes offers insights into effector diversity and evolution

eLife Assessment

This study presents an important discovery regarding the diversity and evolution of gall-forming microbial effectors. Supported by convincing computational structural predictions and analyses, the research provides insights into the unique mechanisms by which gall-forming microbes exert their pathogenicity in plants. This study also offers guidance that is of value for future studies on pathogen effector function and co-evolution with host plants.

https://doi.org/10.7554/eLife.105185.3.sa0

Significance of the findings:

Important: Findings that have theoretical or practical implications beyond a single subfield

Landmark
Fundamental
Important
Valuable
Useful

Strength of evidence:

Convincing: Appropriate and validated methodology in line with current state-of-the-art

Exceptional
Compelling
Convincing
Solid
Incomplete
Inadequate

During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife Assessments

Abstract
eLife digest
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

Phytopathogens secrete effector molecules to manipulate host immunity and metabolism. Recent advances in structural genomics have identified fungal effector families whose members adopt similar folds despite sequence divergence, highlighting their importance in virulence and immune evasion. To extend the scope of comparative structure-guided analysis to more evolutionarily distant phytopathogens with similar lifestyles, we used AlphaFold2 to predict the 3D structures of the secretome from selected plasmodiophorid, oomycete, and fungal gall-forming pathogens. Clustering protein folds based on structural homology revealed species-specific expansions and a low abundance of known orphan effector families. We identified novel sequence-unrelated but structurally similar (SUSS) effector clusters, rich in conserved motifs such as 'CCG' and 'RAYH'. We demonstrate that these motifs likely play a central role in maintaining the overall fold. We also identified a SUSS cluster adopting a nucleoside hydrolase-like fold conserved among various gall-forming microbes. Notably, ankyrin proteins (ANK) were significantly expanded in gall-forming plasmodiophorids, with most being highly expressed during clubroot disease, suggesting a role in pathogenicity. Subsequently, we screened ANK proteins against Arabidopsis immunity hubs using AlphaFold-Multimer and verified one of the positive results by Y2H and BiFC assays to show that the ankyrin effector PbANK1 targets host MPK3 and a zinc-binding dehydrogenase. These findings suggest a potential new mechanism in which ANK effectors target multiple host proteins involved in stress sensing, opening a novel avenue to study the role of ANK in host–pathogen interactions. Altogether, this study advances our understanding of secretome landscapes in gall-forming microbes and provides a valuable resource for broadening structural phylogenomic studies across diverse phytopathogens.

eLife digest

Microbes can cause a variety of plant diseases and pose a major threat to global food production. To infect plants, many of these microbes release small proteins called effectors. Once inside the plant cell, the effectors can disarm the plant's immune defences and also reprogram its growth. In some cases, they trigger abnormal swellings called galls, which can seriously reduce harvests, such as the clubroot disease of canola, caused by the protist Plasmodiophora brassicae.

Effectors produced by gall-forming microbes remain poorly understood because these pathogens are hard to grow in the laboratory, and many of their proteins have no known function. Mukhopadhyay et al. provide new insights into how gall-forming microbes infect plants and how their effectors have evolved using artificial intelligence tools, such as AlphaFold. These tools can predict the three-dimensional structures of proteins, allowing researchers to look beyond gene sequences and uncover hidden patterns in protein shapes.

To explore the molecular strategies used by different gall-forming organisms to infect plants, the researchers studied the three-dimensional structure and properties of thousands of secreted proteins from fungi, protists and oomycetes.

The results showed that each group showed unique expansions – that is, unusually large numbers – of particular effector families. For example, the clubroot pathogen had expanded families of ankyrin-repeat proteins, a class of proteins characterized by repeating structural motifs that mediate protein-protein interactions. They also discovered clusters of proteins that shared similar shapes despite having little or no genetic similarity, highlighting that protein structure can be more conserved than genetic sequence. Notably, one ankyrin-repeat protein was found to interact with central components of plant immunity, suggesting a direct role in disabling host defenses. Together, these findings provide the first structural map of effectors in gall-forming microbes.

By understanding how effectors work, researchers can identify plant genes that confer stronger resistance to pathogens such as the clubroot microbe. While more experiments are needed to confirm the roles of effectors in plants, this structural resource already offers a powerful tool for scientists. It could help predict which microbial proteins are most likely to manipulate plant health and guide the development of durable crop protection strategies.

Introduction

The evolutionary arms race between host plants and their microbial pathogens is a fascinating example of adaptation and counteradaptation. Central to this battlefield are the effectors — secreted proteins from pathogens that manipulate host cellular processes to facilitate infection and colonization (Toruño et al., 2016). Effectors can also be recognized by plant receptor and trigger immunity and defense responses in the host (Chen et al., 2022). Due to their central role in pathogen-host interactions, effectors must continually evolve to evade detection by the host’s immune system (Martel et al., 2021). This arms race drives rapid changes in effector sequences, often resulting in high mutation rates and diversification (Liu et al., 2019). Additionally, effectors should maintain structural integrity for functionality while altering surface residues to avoid immune recognition (Derbyshire and Raffaele, 2023). Recent studies have defined these groups of effectors as Sequence-Unrelated Structurally Similar (SUSS), which despite lacking sequence similarity, share significant structural homology (Seong and Krasileva, 2021; Seong and Krasileva, 2023). For example, the MAX effector family, which includes proteins from various fungal pathogens like Magnaporthe oryzae and Pyrenophora tritici-repentis, exhibits a conserved β-sandwich fold (de Guillen et al., 2015). Other examples include the RXLR-WY effector families in oomycetes (Combier et al., 2022; Win et al., 2012), the LARS effectors in Cladosporium fulvum and Leptosphaeria maculans (Lazar et al., 2022), the RALPH effectors in Blumeria graminis (Cao et al., 2023), and the FOLD effectors in Fusarium oxysporum f. sp. lycopersici (Yu et al., 2024) demonstrating structural conservation. FOLD effectors have also recently been found in the secretome of unrelated pathogenic and symbiotic fungi, pointing to the fold’s relevance in plant colonization and expansion in different evolutive groups (Teulet et al., 2023). A recent study that classified orphan effector candidates (OECs) from Ascomycota into 62 main structural groups proposes that such structural conservation can be explained by changes in thermodynamic frustration at surface residues, which increase the robustness of the protein structure while altering potential interaction sites (Derbyshire and Raffaele, 2023).

Some of the discussed studies have been fueled by the emergence of machine-learning tools like AlphaFold (Jumper et al., 2021), which has revolutionized the field of protein modeling and enabled the computational prediction of pathogen effector structures. The utility of AlphaFold in plant-microbe interactions has been further demonstrated by its ability to predict the structure of the highly conserved AvrE-family of effector proteins (Nomura et al., 2023). These proteins are crucial in the pathogenesis of various phytopathogenic bacteria but are challenging to study due to their large size, toxicity to plant and animal cells, and low sequence similarity to known proteins (Xin et al., 2015). The structure prediction revealed β-barrel structures similar to bacterial porins, allowing for modulating host cell functions by facilitating the movement of small molecules across plasma membrane. AlphaFold Multimer (Evans, 2022), an extension of AlphaFold designed to predict in-silico protein-protein interactions, has been recently used to identify 15 effector candidates capable of targeting the active sites of chitinases and proteases (Homma et al., 2023), expanding the applicability of these tools to advance the tailored functional characterization of effector proteins.

Recent advances in structure-guided secretome analysis have mostly focused on fungal pathogens due to their significant economic impact and the availability of robust genomic and transcriptomic resources. However, in recent years, there is a growing interest in protist pathogens, which are quickly becoming a threat to agriculture and environment (Mukhopadhyay et al., 2024). For example, the gall-forming Plasmodiophora brassicae, a protist belonging to the class Plasmodiophorid, can cause significant yield loss in canola fields (Javed et al., 2023; Ochoa et al., 2023). Moreover, these obligate biotrophic protists are impossible to culture in axenic media and therefore difficult to transform (González-García and Pérez-López, 2021). Although the effector repertory for some of these protists has been predicted, the majority of their secretome remains uncharacterized due to the absence of known protein domains (Mukhopadhyay et al., 2024). To gain more insights into the secretome composition and effector biology of these understudied pathogens, we conducted structural similarity-based clustering of the predicted effectors of selected plasmodiophorid, oomycete, and fungal gall-farming pathogens. Here we examined (i) if the primary secretome families present in each pathogen share common folds with known fungal effector families; (ii) if they share common folds that could be associated with their biotrophic lifestyle and gall-forming pathogenicity strategy; and (iii) if some of the known effectors from these pathogens are part of SUSS effector families. By comparing the secretome of gall-forming pathogens from distant lineages, we provide a comprehensive overview of the uniqueness and commonality of the secretome landscape and offer more insights into the protist effector families by bringing them into the structural genomics era.

Results

Secretome prediction and structural modeling of gall-forming pathogens

Based on genome availability, phylogenetic distance, and economic significance, six gall-forming pathogens were selected for secretome analysis: two plasmodiophorids, Plasmodiophora brassicae and Spongospora subterranea, oomycete Albugo candida, and three fungi from different lineages, Taphrina deformans, Ustilago maydis, and Synchytrium endobioticum (Figure 1a). To gain a better understanding of plasmodiophorids, for which structural data is scarce, we also included Polymyxa betae, a non-gall-forming plasmodiophorid vector of the beet necrotic yellow vein virus, causing Rhizomania disease (Decroës, 2022; Figure 1a). To identify the putative secretome of these plant pathogens, we first employed SignalP Teufel et al., 2022 to predict sequences with N-terminal signal peptide. Sequences carrying a signal peptide were subjected to DeepTMHMM (Hallgren et al., 2022) search to remove sequences with transmembrane domains. Mature protein sequences greater than 1000 amino acid length were also removed (Figure 1b). The Ustilago maydis secretome and corresponding structures were obtained from a recent study (Seong and Krasileva, 2023) using similar filtering steps. This resulted in a total of 4197 proteins from seven gall-forming or related plant pathogens (Supplementary file 1). Next, the structures of 3575 proteins, excluding the U. maydis secretome which was already available, were modeled using AlphaFold2. InterproScan (Jones et al., 2014) was performed on all 4197 proteins against pfam, Gene3D, and SUPERFAMILY databases, identifying 41.59% of the proteins analyzed carrying a known protein domain (Supplementary file 1). Next, 2349 structures with pLDDT >65 were selected for further analysis (Supplementary file 1). Although pLDDT >70 is recommended as the threshold for structures with reliable confidence by AlphaFold developers, we selected a score of 65 or higher to include AvrSen1 (pLDDT 67), the only known S. endobioticum avirulent gene (van de Vossenberg et al., 2019a; Figure 1c). This resulted in the inclusion of 298 candidate effectors which would have otherwise been excluded (Supplementary file 1). IUPred3 Erdős et al., 2021 was also used to score the proteins as disordered or not based on whether 50% of the sequence position was predicted to be disordered (Supplementary file 1). Most of the proteins (97%) with pLDDT >70 were not disordered, while 26% of the secretome with pLDDT <70 was disordered, thus showing the limitation of AlphaFold 2 in modeling such effectors (Supplementary file 1).

Figure 1

Download asset Open asset

Description of the pathogens included in the study, workflow overview and statistics of structure prediction.

(A) Cladogram of the pathogens used in the study. The schematics represent the disease symptoms on their respective hosts, with the white areas representing galls. The secretome count indicates the number of proteins per species predicted to be secreted, and the functional annotation shows the percentage of the secretome predicted to contain a known protein domain in the Pfam, SUPERFAMILY, or Gene3D databases. (B) Flowchart of the workflow used to predict the secretome and the corresponding 3D structures. (C) Raincloud plot showing the median and density distribution of pLDDT scores of the predicted structures in each pathogen.

Structure-based clustering reveals species-specific folds and low homology with known fungal effector families

Structural similarity among the predicted structures was assessed using TMAlign Zhang and Skolnick, 2005 by scoring all structures against each other. To study the similarity with known effector families, we also included 19 crystal structures from the DELD, FOLD, LARS, MAX, KP6, RALPH, NTF2-like, ToxA, C2-like, and Zn-binding effector families, as previously utilized in a recent study (Teulet et al., 2023; Supplementary file 2). Additionally, we also included 62 structural families of orphan candidate effectors (OCE) recently identified in the Ascomycota lineage (Derbyshire and Raffaele, 2023; Supplementary file 2). Comparisons with TMScore greater than 0.5 were considered positive for structural similarity. Markov clustering (Enright et al., 2002; Van Dongen, 2008) with an inflation value of two was applied to cluster the secretome based on structural similarity score. This resulted in 254 structural clusters with at least two members (Figure 2, Supplementary file 3).

Figure 2 with 1 supplement see all

Download asset Open asset

Visualization of dominant protein folds present in each pathogen.

(Top) Network plot of structurally similar secretome clusters with at least 15 members. Not all 255 clusters are shown to reduce complexity. Each node represents a single protein, and an edge between two nodes represents structural similarity (TMScore >0.5). (Bottom) Representative structure of the dominant fold in each pathogen. Since Ankyrin repeats are common in both *P. brassicae* and *S. subterranea*, they are represented only once.

Ankyrin repeat-containing proteins were the largest cluster for plasmodiophorid P. brassicae (n=42) and S. subterranea (n=39) (Figure 2). The largest cluster detected on A. candida secretome is formed by ‘CCG’ (Furzer et al., 2022) class of effectors (n=48), while U. maydis largest cluster was composed of the Tin2-like proteins (Tanaka et al., 2019) (n=31). For S. synchytrium, the largest cluster (n=64) contains AvrSen1 virulence factor (Figure 2a). P. betae and T. deformans do not carry large (n>30) effector clusters which could be due to P. betae’s vector-like nature and T. deformans’s reduced genome size compared to other fungi (Cissé et al., 2013). T. deformans’s largest cluster is primarily composed of various glycoside hydrolases, while P. betae’s largest cluster consists of orphan helical proteins (Supplementary file 3). A 20-member kinase family (cluster 13) was noted in P. brassicae, which was absent in A. candida and three other fungi. Chitin deacetylases, which have been reported to convert chitin to less immunogenic chitosan (Gao et al., 2019), were expanded (n=11) in P. brassicae (Supplementary file 3). Chitin deacetylases were found in five out of seven pathogens tested, except in oomycete A. candida and ascomycete T. deformans, which despite being a fungus contains very little chitin in the cell wall (Petit and Schneider, 1983; Supplementary file 3). In S. subterranea and P. brassicae, a unique TauD/TfdA protein cluster was also identified, typically found in bacteria for taurine utilization as a sulfur source (Eichhorn et al., 1997). P. betae carries a six-member G-domain protein cluster, with similarity to its host Beta vulgari’s GTPases (Figure 2—figure supplement 1, Supplementary file 3). Among the well-characterized fungal effector families, only three KP6 fold Thynne et al., 2024 and one RALPH-like fold were found in U. maydis (Figure 2—figure supplement 1, Supplementary file 3). T. deformans carries two ToxA homologs (Figure 2—figure supplement 1, Supplementary file 3).

Effector folds conserved across kingdoms

Six protein folds, Hydrolases (clusters 2, 8, 9), Carboxypeptidases (cluster 12), Aspartyl proteases (cluster 3), Lectins (cluster 38), SCP domain (cluster 24), and an orphan group (cluster 5), contained proteins from all the pathogens investigated in this study, indicating deep evolutionary conservation of these folds across kingdoms (Supplementary file 3). In fact, 20 out of 62 Ascomycete orphan effector groups had at least one structural homolog in plasmodiophorids and oomycetes tested in this study, although a complete evolutionary connection would require the comparison of a much larger number of pathogens (Supplementary file 3). We did not find any specific fold (n>1 per species) conserved only among the gall-forming pathogens studied, indicating that this virulence strategy can be achieved by different mechanisms without necessarily converging onto common effector folds.

Structural search identifies a nucleoside hydrolase-like fold conserved in some gall-forming pathogens

Effectors are notorious for carrying unknown domains, making it difficult to predict the putative function of promising candidates (Lovelace et al., 2023). We searched for uncharacterized P. brassicae candidate effectors within the same cluster that were also overexpressed during infection. Two candidate effectors, PBTT_09143 and PBTT_07479, which are the first and fifth most expressed proteins at 16 days post inoculation (dpi) during clubroot disease, belong to the cluster 21 grouping with PBTT_0412, none of which carries a known domain (Figure 3a–c). Interestingly, the cluster also includes ten members from A. candida secretome, with eight carrying a predicted nucleoside hydrolase domain (Supplementary file 4). We subjected the three P. brassicae candidates to a FoldSeek-mediated (van Kempen et al., 2024) structural search against the PDB100 and AFD-proteome databases. Nucleoside hydrolases always emerged as the top hit (E-value <10^–5, Prob ~1, TMscore ≥ 0.5; Supplementary file 4). Next, searching the AFDB cluster web tool (Barrio-Hernandez et al., 2023), which allows for the identification of structural homologs across the known protein space, AFDB clusters A0A0G4IP88 and A0A024FV66 emerged as the top hits. The members of these clusters are often gall-forming, carry predicted signal peptide, and belong to various biotrophs like Melanopsichium pennsylvanicum, Ustilago maydis, Albugo candida, Sporisorium scitamineum, Testicularia cyperi, Colletotrichum orbiculare, among others (Figure 3b and d). Some of the members also do not carry identifiable domains (Supplementary file 4) and show limited sequence similarity among themselves (Figure 3b, Supplementary file 4). Interestingly, the cluster also includes bacterial Type III effector HopQ1 from Pseudomonas syringae (Supplementary file 4). Unlike P. brassicae effectors, HopQ1 is predicted to carry a nucleoside hydrolase domain, although the domain has been reported to be unable to bind standard nucleosides (Li et al., 2013). HopQ1 has been reported to be associated with 14-3-3 plant proteins to promote virulence (Li et al., 2013). Thus, it’s possible that the nucleoside hydrolase-like fold present in various gall-forming fungal, protist, and oomycete pathogens might have also neo-functionalized and be involved in new molecular strategies during the infection.

Figure 3

Download asset Open asset

Sequence and structural similarity among HopQ1 homologs in *U. maydis*, *A. candida,* and *P. brassicae*.

(A) Network plot showing the structural similarity between the members of cluster 21. Edges denote structural similarity (TMScore >0.5). (B) Pairwise sequence identity between selected HopQ1 structural homologs from plasmodiophorids, oomycetes, and fungi, illustrating sequence dissimilarity between some proteins despite structural homology. (C) Gene expression values (log2 TPM) of two highly induced *P. brassicae* genes at 16 and 26 dpi. (D) 3D structure of the mature protein sequences, assuming a HopQ1-like fold.

A fungal effector family shows structural homology despite extreme sequence divergence

The Mig1 protein in Ustilago maydis is a maize-induced effector that plays a crucial role in the biotrophic interaction between the fungus and its host (Basse et al., 2000). It is specifically induced during the biotrophic phase, contributing to the fungus’s pathogenicity. Cluster 30, specifically found in Ustilago maydis, contains this effector (Supplementary file 5). Upon examining the sequence and structural similarities between the members of the cluster, we discovered instances of structural homology (TMScore >0.5) despite pairwise sequence identity being as low as 0.6% and no higher than 30% (Figure 4a–d, Supplementary file 5). When aligning the protein sequences of 13 members, four cysteine residues were found to be strongly conserved (Figure 4e). These four residues form two disulfide bridges (Figure 4f), likely playing a crucial role in maintaining the overall fold despite significant sequence dissimilarity. All 13 members of the cluster were expressed upon infection (Supplementary file 6). A variable expression was observed for the Mig1-like genes located in the same genomic region, including members with completely opposite patterns of induction (Supplementary file 6), suggesting a possible regulatory role or the acquisition of new functions.

Figure 4

Download asset Open asset

Sequence and structural similarity among Mig1 homologs in *Ustilago maydis*.

(A) Similarity matrix showing the pairwise sequence identity (%) between Mig1 cluster members. (B) Similarity matrix showing the pairwise structural homology scores (TMScore) between Mig1 cluster members. (C) Superimposition of two Mig1 homologues, illustrating structural similarity despite extreme sequence divergence. (D) Differential gene expression patterns of two Mig1 tandem duplicates. (E) Multiple alignment of protein sequences, highlighting the conservation of cysteine residues (marked in yellow). (F) Visualization of the conserved cysteine residues forming disulfide bridges.

Identification of new SUSS effector families enriched in known motifs

The fungi S. endobioticum and A. candida encode large effector clusters which included previously identified avirulent factors AvrSen1 and CCG28/31/33/70, respectively (Supplementary file 7; Redkar et al., 2023). It was previously shown that the N-terminal region of the CCG28,33 and 70 share structural homology (Redkar et al., 2023). To examine if these clusters represent SUSS families, we carried out the sequence-based clustering of the 4197 proteins from the seven pathogens investigated here. We performed a BlastP search in all-vs-all mode and kept only those results with E-value lower than 10^–04 and bidirectional coverage of 50%. Markov clustering revealed 642 sequence-based clusters with at least two members (Supplementary file 7). We searched for sequence clusters having members from the same structural clusters. This revealed the presence of 12 sequence-related clusters associated with the AvrSen1 structural cluster and 11 sequence-related clusters associated with the CCG structural cluster (Figure 5a, Supplementary file 7). It was previously reported that CCG-containing effectors share limited similarity around the CCG motif and can be grouped in several clades based on sequence similarity (Furzer et al., 2022). Thus, AvrSen1 and CCG represent novel SUSS effector families whose similarities can’t be delineated by sequence search alone. Integration of sequence and structure data increased the member count of AvrSen1 and CCG clusters to 124 and 50 from previous 64 and 48, respectively (Supplementary file 8).

Figure 5 with 5 supplements see all

Download asset Open asset

SUSS effector families are enriched in common motifs.

(A) Network plots demonstrating that the two primary effector families in *A. candida* and *S. endobioticum* can only be grouped together when structural data is incorporated into the sequence-based clustering. The plots also indicate which sequence-based clusters contain the known effectors from these groups. (B) ‘RAYH’ and ‘CCG’ motif patterns identified by MEME scan. (C) Disulfide bridges in the ‘CCG’ motif, likely playing a pivotal role in structural maintenance, are highlighted in the virulence factor CCG30. A zoomed-in view of the ‘CCG module’ shows the four conserved cysteine residues forming disulfide bridge. (D) The ‘RAYH’ motif, occupying the central position in the core alpha-helix bundle, is highlighted in six sequence-based subclusters within the AvrSen1-like cluster in *S. endobioticum*.

It has been reported that S. endobioticum secretome is enriched in RAYH (van de Vossenberg et al., 2019b). While it is understood that the CCG class of effectors derived its name due to the presence of the CCG motif, it was not immediately clear if Avrsen1 cluster was also the source of the conserved RAYH motif. To verify that, we subjected the mature protein sequences of the two pathogen secretome to motif search using MEME (Bailey and Gribskov, 1998). Here we identified a 16 amino acid long RAYH motif present in 118 S. endobioticum proteins and 15 amino acid long ‘CCG’ motif present in 74 A. candida proteins (E<0.1, combined match p<0.001; Figure 5b, Figure 5—figure supplement 1, Figure 5—figure supplement 2, Supplementary file 8). Of the 118 sequences proteins carrying the RAYH motif, 79 were members of the AvrSen1 cluster in S. endobioticum (Supplementary file 8). Thus, the expansion of SUSS effectors in A. candida and S. synchytrium has resulted in the enrichment of common motifs, something that has recently been observed for the Y/F/WxC motif in Blumeria graminis RNA-like effector cluster (Seong and Krasileva, 2023).

Selection pressure analysis on A. candida CCG members shows that both cysteine residues in the 'CCG' motifs and two additional cysteine residues within 50 amino acids of the motif are often under purifying selection (Figure 5—figure supplement 3, Figure 5—figure supplement 4). Visualizing these four cysteines on the predicted structure shows that they form disulfide bridges and probably play a crucial role in overall maintenance of the fold (Figure 5c). The CCG motif seems to be a crucial part of a module consisting of two parallel alpha-helices joined to a beta-sheet, and CCG effectors are often composed of several of these modules (Figure 5c). The RAYH motif was also found to be part of the core structure of most AvrSen1-like effectors, forming a long alpha-helix (Figure 5d). Apart from the RAYH motif region, several surrounding hydrophobic residues were also strongly conserved, likely playing a role in maintaining the structure (Figure 5—figure supplement 5). Examining the sequence-related subclusters of the AvrSen1-like family, we found that the effectors are evolving by keeping the core alpha-helix bundle fixed while diversifying the peripheral stretches (Figure 5d).

Ankyrin repeat-containing proteins are a common feature of gall-forming plasmodiophorids

The largest structural cluster in P. brassicae and S. subterranea consists of ankyrin repeat-containing proteins (hereafter ANK proteins; Figure 6a). The presence of only five members in the non-gall-forming plasmodiophorid P. betae, in contrast to over 40 members in the gall-forming P. brassicae and S. subterranea, highlights the importance of this domain in their pathogenicity strategies (Supplementary file 9). Interestingly, InterProScan identified 32 additional P. brassicae proteins with ANK domains, while only two S. subterranea ANK proteins could be identified outside of the cluster (Supplementary file 9). We found that the P. brassicae secretome is richer in repeat proteins compared to S. subterranean ANK proteins repertory (Figure 6a), with 19 additional leucine-rich repeats (LRRs), ten tetratricopeptide repeats, and three MORN repeats (Supplementary file 9). Although 40 out of 74 ankyrins are overexpressed (TPM >10) at 16- and 26 dpi in Arabidopsis thaliana, only five LRRs are induced (Supplementary file 9). Notably, 17 ankyrin-repeat proteins also carry the SKP1/BTB/POZ domain, which is often involved in ubiquitination (Geyer et al., 2003; Figure 6b, Supplementary file 9).

Figure 6

Download asset Open asset

Diversity, structural features, and host immune targets of ankyrin repeats in Plasmodiophorids.

(A) Frequency of repeat-containing proteins in *P. brassicae* and *S. subterranea*. (B) Network plot showing structural homology within *P. brassicae* Ankyrin repeats, also highlighting the Ankyrin repeats with SKP1/BTB/POZ superfamily domains. (C) Alignment of Ankyrin motifs from *P. brassicae* and *S. subterranea*. (D) Visualization of conserved hydrophobic residues in a single Ankyrin repeat module. (E) Number of Ankyrin repeat proteins predicted to target *Arabidopsis* immune proteins. (F) AlphaFold Multimer predicted complex of MPK3 and PbANK1 (PBTT_00818), highlighting the predicted aligned errors of surface contacts under 4 Ångströms.

A MEME motif scan with the members of the ankyrin cluster identified a 33 amino acid long motif in P. brassicae and a 32 amino acid length motif in S. spongospora (Figure 6c, Supplementary file 9). Aligning the MEME profile of the two identified ankyrin motifs shows a strong conservation of two leucine and one alanine residues (Figure 6d) and upon visualization, those residues form the hydrophobic pocket between the two alpha helices of the ankyrin repeat, stabilizing the structure (Figure 6d). The rest of the non-conserved residues were found to be highly polymorphic (Figure 6d).

Ankyrin repeat-containing proteins interact with multiple host targets

Finally, to have an idea of the possible role of ANK proteins in plasmodiophorid virulence, we selected 70 ankyrin domain-containing proteins from P. brassicae and screened them against 20 key immune-related genes in Arabidopsis using AlphaFold-Multimer (Supplementary file 10). Protein-protein interactions were considered significant if the inter-chain Predicted Aligned Error (PAE) value was below 10, and the iPTM +pTM score was 70 or higher. Among the identified interactions, MPK3, MAPK4, MAPK6, SnRK1, NPR1, XCP1, CNGC4, and BAK1 were targeted by a total of ten ankyrin domain proteins (Figure 6e, Supplementary file 11). This dataset should serve as a valuable starting point for further understanding the role of ANKs in the virulence mechanisms of plasmodiophorids.

To experimentally validate the AlphaFold-Multimer predictions, we selected the PBTT_00818 (hereafter PbANK1)–MPK3 pair for targeted yeast two-hybrid (Y2H) assays. This pair was prioritized due to its high combined iPTM +pTM confidence score (0.82). Surprisingly, the one-on-one Y2H assay did not detect an interaction between PbANK1 and MPK3 (Figure 7a). To broaden our search, we next employed a large-scale Y2H screen using PbANK1 as bait against a randomly primed Arabidopsis seedling cDNA library, screening approximately 60 million clones. From this screen, 98 His⁺ colonies were recovered on selective medium lacking tryptophan, leucine, and histidine and supplemented with 50 mM 3-aminotriazole to suppress bait autoactivation. Among the recovered clones, several candidate interactors of PbANK1 were identified, with the GroES-like zinc-binding alcohol dehydrogenase (AT3G56460) emerging as the top candidate (Supplementary file 12). The PBTT_00818–GroES-like interaction was subsequently confirmed via one-on-one Y2H assay (Figure 7b). To further investigate these interactions in planta, we performed bimolecular fluorescence complementation (BiFC) assays in Nicotiana benthamiana leaves. These assays confirmed that PbANK1 interacts with both MPK3 (Figure 7c, Figure 7—figure supplement 1) and GroES-like (Figure 7d, Figure 7—figure supplement 2), with both interactions occurring in the nucleus. Together, these findings illustrate how in silico protein–protein interaction predictions can serve as a powerful tool to generate testable hypotheses about effector functions in host-pathogen systems.

Figure 7 with 2 supplements see all

Download asset Open asset

Validation of PbANK1-MPK3 and PbANK1-GroES-like interactions through Yeast two-hybrid (Y2H) and bimolecular fluorescence complementation (BiFC).

(A) 1-by-1 Y2H assay results evaluating the interaction PbANK1-MPK3 (AT3G45640) predicted through AlphaFold-Multimer. N=3. (B) 1-by-1 Y2H assay results evaluating the interaction PbANK1-GroES-like (AT3G56460) predicted through a Y2H screening of an *Arabidopsis* seedling library. N=3. (C) BiFC assay results evaluating the interaction PbANK1-MPK3. N=3, presented in Figure 7—figure supplement 1. Bar = 50 μm. (D) BiFC assay results evaluating the interaction PbANK1- GroES-like. N=3, presented in Figure 7—figure supplement 2. Bar = 50 μm.

Discussion

This study identified the primary protein folds in gall-forming pathogens' secretome supporting the idea that pathogen secretome is often dominated by expansion of specific folds which have been adopted and diversified over the course of evolution. Characterizing these primary effector folds in understudied plasmodiophorids like P. brassicae and S. subterranea would offer valuable insights into their virulence strategies, which remain largely enigmatic (Mukhopadhyay et al., 2024; Pérez-López et al., 2018). Here, we found that the ankyrin proteins, which are significantly expanded in gall-forming plasmodiophorids but less so in the related species P. betae, may be key to their ability to manipulate host immune responses and promote gall formation (Figures 2 and 6). This finding aligns with previous research indicating the importance of repeat-containing proteins in the virulence of plant pathogens (Mesarich et al., 2015). Examples of that are Phytophthora spp. effectors containing tandem repeats of the ‘(L)WY’ motif, whose modularity and elaborate mimicry of a host phosphatase helps to promote infection (Li et al., 2023). ANK motifs are well-known for mediating protein-protein interactions (Li et al., 2006), and have been identified as type IV effectors in the intracellular human pathogens Legionella pneumophila and Coxiella burnetii (Pan et al., 2008). Thus, based on the predicted structure and the validated interactions, we hypothesize that the highly polymorphic surface residues of plasmodiophorid ANK motifs, along with their variable frequency of occurrence across effector proteins, result in diverse binding interfaces for distinct host targets. However, further studies are needed to elucidate the mechanistic basis of how ANK proteins can engage multiple targets, including those reported here like MPK3 involved in immunity Bradley et al., 2022 and GroES-like protein implicated in peroxisomal functions (Xu et al., 2016).

This study also identified conserved protein folds across multiple kingdoms, particularly the hydrolase, carboxypeptidase, and aspartyl protease folds (Supplementary file 3). These folds appear to play a fundamental role in the virulence strategies of these pathogens, likely due to their ability to perform essential biochemical functions that facilitate infection (Reumann et al., 2007). The conservation of these folds across diverse species highlights their evolutionary significance and suggests that they may represent mechanisms of host manipulation that have been retained through speciation. We also identified a nucleoside hydrolase-like fold in evolutionary distant gall-forming pathogens which has homology to bacterial effector HopQ1 (Figure 3). Nucleoside hydrolases are involved in the purine salvage pathway in various pathogens (Hofer, 2023), but similar to HopQ1’s mode of action, these gall-forming biotrophs might also be targeting 14-3-3 proteins, which are implicated in hormonal signaling (Camoni et al., 2018). Although HopQ1 is widely conserved within the Pseudomonas species complex, our FoldSeek-mediated search identified hits in Pseudomonas savastanoi isolates, some of which are known to form galls on woody plants (Harmon et al., 2018; Ramos et al., 2012). It remains to be seen if some of the HopQ1 homologs have been specifically adapted in these bacteria to support a particular lifestyle.

Here we also provided further evidence for the divergent evolution model of effector evolution, which describes how members of the same effector family can exhibit extreme sequence dissimilarity over a long period while retaining the core fold intact (Seong and Krasileva, 2023). We show that this evolution mechanism occurs in both fungi and oomycetes and often involves the conservation of cysteine or hydrophobic residues to maintain the original fold (Figure 4, Figure 5). Given that sequence dissimilarity between homologs can be extreme, as exemplified by the Mig1 family in U. maydis, it would become common practice among researchers to incorporate structural knowledge into sequence searches to accurately gauge the diversity of the effector families (Figure 4).

Overall, our study underscores the power of structural genomics and machine learning tools like AlphaFold2 in uncovering the complexities of pathogen effector repertoires. The findings presented here open new avenues for research into the evolution of virulence strategies in phytopathogens and highlight the potential for these insights to inform the development of novel approaches to plant disease management. As we continue to expand our understanding of effector biology, particularly in under-studied pathogens, it will be crucial to integrate these structural insights with functional studies to fully elucidate the roles of these proteins in host-pathogen interactions.

Materials and methods

Secretome prediction

Request a detailed protocol

The proteome of P. brassicae was derived from our recent study generating the first complete genome of the clubroot pathogen (Javed et al., 2024). We are thankful to Prof. Anne Legrève for providing the updated annotation of the P. betae genome and proteome previously published by them (Decroës et al., 2019; Decroës et al., 2022). The rest of the proteome for S. subterranea (Ciaghi et al., 2018), A. candida (McMullan et al., 2015), S. endobioticum (van de Vossenberg et al., 2019b), U. maydis (Kämper et al., 2006), and T. deformans (Cissé et al., 2013) were downloaded from Uniprot database. SignalP 6 was utilized to identify sequences with predicted signal peptides, which were subsequently removed. DeepTMHMM was run through the pybiolib package. InterProScan 5.61–93.0 was used to confirm the presence of known domains using the Pfam, Gene3D, and SUPERFAMILY databases.

Structure prediction

Request a detailed protocol

A total of 3615 mature protein sequences were selected to be modeled using AlphaFold 2, but 40 predictions repeatedly failed at the MSA construction step, resulting in 3575 structures. To expedite the process, ParaFold 2.0 (Zhong et al., 2022) was used, which employs AlphaFold 2.3.1 internally but distributes the CPU and GPU tasks to facilitate parallelization. ‘Valeria’ compute cluster (https://valeria.science/accueil) of Université Laval was used for the structure prediction. The full database was used to construct the MSA (multiple sequence alignment), and models were predicted in 'monomer' mode, resulting in five PDB structures sorted by pLDDT scores. The Rank_0 PDB was used for subsequent studies. Finally, 2000 models with pLDDT scores over 65 were selected for downstream analysis.

Similarity search, clustering, and network plots

Request a detailed protocol

TM-Align was used to perform an all-versus-all structural comparison of 2000 models, and those comparisons with a normalized TM-score above 0.5 were considered significant. All-versus-all sequence comparison was performed using BlastP (Camacho et al., 2009) with an E-value <10^–4 and bidirectional coverage of at least 50%. Structure and sequence similarity data, represented by three columns with the first two as target and source IDs, and the third one being TM-score/E-value, were clustered using the Markov clustering with an inflation value of two. For sequence clustering, E-values were loaded following the recommendation of the MCL workflow [mcxload -abc seq.abc --stream-mirror --stream-neg-log10 -stream-tf 'ceil(200)']. Custom Python scripts were written to find the sequence-related subclusters belonging to the same structural cluster and to count the occurrence of cluster members (https://github.com/Edelab/AlphaFold_effector_paper, copy archived at Perez-Lopez, 2025). Plots were generated using Chiplot (https://www.chiplot.online/) and ggplot2 (Wickham, 2016).

Sequence alignment

Request a detailed protocol

Pairwise alignment of protein sequences was done by EMBOSS Needle (Rice et al., 2000). Clustal Omega was used to generate multiple protein sequence alignment (Sievers and Higgins, 2018). Kalign 3.4.0 (Lassmann, 2020) was used to generate alignment of extremely divergent Mig1 cluster.

Selection pressure analysis and structure visualization

Request a detailed protocol

Coding sequences (CDS) were obtained from the Ensembl Fungi/ENA database. Multiple nucleotide sequences were aligned using the Kc-Align codon-aware aligner (Nicholas, 2020). Positions with more than 50% gaps were removed using the Clipkit (Steenwyk et al., 2020) online tool in 'gappy' mode. The trimmed alignments were manually analyzed using Geneious (http://www.geneious.com/) for correct codon alignment. The resulting alignment was uploaded to the Datamonkey server (http://www.datamonkey.org/), which hosts the HyPhy package (Pond et al., 2005). All the branches were used as input for FEL (Kosakovsky Pond and Frost, 2005) to identify sites under purifying selection (p<0.01). The ESPript 3.0 web server was used to generate multiple sequence alignments (Robert and Gouet, 2014). Multiple structures were aligned using mTm-Align (Dong et al., 2018). PyMOL 3.0.4 (Schrödinger, 2015) was used to visualize the PDB files and color conserved sites or disulfide bridges.

Expression analysis

Request a detailed protocol

Datasets for the RNA-Seq reads obtained at 16- and 26 dpi during P. brassicae infection were downloaded from the EBI server (accession number PRJEB12261). The reads from the infected samples were mapped to the A. thaliana genome TAIR10 (Genbank accession number GCF_000001735.4) using HiSAT2 (Kim et al., 2019) to remove host contaminant sequences. To use Salmon (Patro et al., 2017), an index file was created by concatenating the P. brassicae genome and CDS sequences. The remaining RNA-Seq reads were quasi-mapped to the index using Salmon 1.10.0 to generate normalized transcripts per million (TPM) counts for all 10,521 genes. TPM values from three replicates were averaged. Pre-processed gene expression data for U. maydis was publicly available (Lanver et al., 2018).

Motif scanning

Request a detailed protocol

MEME 5.5.5 was used to scan the list of amino acid sequences in '-anr' mode (E<0.1) to discover motifs of any length and frequency. MAST 5.5.5 was used to protein sequences with MEME motifs. Sequence profiles of P. brassicae and S. subterranea ANK motifs were aligned using Tomtom 5.5.5 (Gupta et al., 2007).

Structural homology search

Request a detailed protocol

FoldSeek (https://search.foldseek.com/search) was used to search for structural homology against Uniprot50, Swiss-Prot, the AlphaFold proteome, and PDB. The AFDB cluster database (https://cluster.foldseek.com/) was searched to find cluster members in the AlphaFold database.

In silico protein-protein interaction prediction

Request a detailed protocol

ANK proteins were screened for interaction against a list of A. thaliana immune genes using AlphaPulldown v1.0.4 (Yu et al., 2023). It utilizes AlphaFold Multimer but separates the CPU and GPU jobs and reuses the MSA to reduce compute time. The full AlphaFold 2.3.0 database was used for MSA creation. The resulting models from the AlphaPulldown run were parsed with the supplied singularity image alpha-analysis_jax_0.4.sif to produce the final iPTM +pTM score table. ChimeraX 1.8 (Meng et al., 2023) was used to visualize the predicted aligned error for residue pairs under 4 Ångströms at the interface between the two chains produced by AlphaFold-Multimer.

Yeast two-hybrid assay

Request a detailed protocol

Yeast two-hybrid (Y2H) screenings were performed by Hybrigenics Services (https://hybrigenics-services.com/). The coding sequence of Plasmodiophora brassicae PBTT_00818 (amino acids 1–490) was synthesized by Twist Biosciences (https://www.twistbioscience.com) and cloned into the pTwist ENTR vector. This construct was then used as a template to amplify and subclone the gene into the pB66 vector as a C-terminal fusion to the Gal4 DNA-binding domain (Gal4-bait). All constructs were verified by sequencing. The bait construct was used to screen a random-primed Arabidopsis thaliana seedling (1-week-old) cDNA library [ATH], cloned into the pP6 vector. For the PBTT_00818 screen, approximately 60 million clones, equivalent to six times the complexity of the library, were screened using a mating-based approach with the CG1945 yeast strain, following previously described protocols (Fromont-Racine et al., 1997). A total of 98 His⁺ colonies were selected on selective medium lacking tryptophan, leucine, and histidine and supplemented with 50 mM 3-aminotriazole (3-AT) to suppress bait autoactivation. Prey fragments from positive colonies were PCR-amplified and sequenced at both 5′ and 3′ ends, and the resulting sequences were used to identify corresponding proteins via automated BLAST searches against the GenBank (NCBI) database.

For one-by-one (1-by-1) Y2H assays, bait and prey constructs were transformed separately into CG1945 and YHGX13 yeast strains, respectively. Interaction tests were performed using the HIS3 reporter gene system, with growth assays on selective media. As negative controls, bait plasmids were tested with empty prey vectors (pP7), and prey plasmids with empty bait vectors (pB66). The SMAD–SMURF interaction served as a positive control (Colland et al., 2004). Each interaction and control was assessed using streaks of three independent yeast clones. Two selective media were used: DO-2 (lacking tryptophan and leucine) served as a control to confirm the presence of both bait and prey plasmids, while DO-3 (lacking tryptophan, leucine, and histidine) was used to detect protein–protein interactions. Increasing concentrations of 3-AT were added to the DO-3 plates to enhance stringency and reduce false positives due to bait autoactivation. All 1-by-1 Y2H experiments were performed in triplicate to ensure reproducibility.

BiFC assay

Request a detailed protocol

Bimolecular fluorescence complementation (BiFC) assays were performed by PronetBio (https://www.pronetbio.com). The coding sequences of P. brassicae PBTT_00818 and Arabidopsis genes AT3G45640 and AT3G56460 were synthesized and subcloned into the BiFC expression vectors pCAMBIA1300-nYFP and pCAMBIA1300-cYFP, respectively. Recombinant plasmids were first transformed into Escherichia coli TOP10 cells for propagation and then introduced into Agrobacterium tumefaciens strain GV3101 via electroporation.

Nicotiana benthamiana plants (4–6 weeks old, five-leaf stage) were cultivated under greenhouse conditions with a 14 hr light/10 hr dark photoperiod (22–25°C Day / 18–20°C night). For transient expression, healthy leaves (3rd to 6th from the apex) were infiltrated with 30–50 µL of Agrobacterium suspension per site using a needleless syringe to apply the mixture to the abaxial surface. Following infiltration, plants were incubated for 36–48 hr before leaf samples were excised from the marked infiltration zones. Fluorescence signals indicative of protein–protein interaction were detected and imaged using a laser scanning confocal microscope. All BiFC experiments were performed in triplicate to ensure reproducibility. In each assay, the positive control used was the interactive pair HAI1–MPK6 as previously described (Mine et al., 2017).

Data availability

The datasets used in this study can be downloaded from Zenodo and the scripts from GitHub (copy archived Perez-Lopez, 2025).

The following data sets were generated

(2024) Zenodo
Dataset related to "Structure-guided secretome analysis of gall-forming microbes offers insights into effector diversity and evolution".

https://doi.org/10.5281/zenodo.11152389

The following previously published data sets were used

1. Schwelm A
(2018) NCBI Nucleotide
ID OUQQ00000000. Spongospora subterranea, whole genome shotgun sequencing project.

https://www.ncbi.nlm.nih.gov/nuccore/OUQQ00000000
1. Kamper J
2. Kahmann R
3. Bolker M
4. Brefort T
5. Saville BJ
6. Banuett F
7. Kronstad JW
8. Gold SE
9. Muller O
10. Perlin MH
11. Wosten HA
12. de Vries R
13. Ruiz-Herrera J
14. Reynaga-Pena CG
15. Snetselaar K
16. McCann M
17. Perez-Martin J
18. Feldbrugge M
19. Basse CW
20. Steinberg G
21. Ibeas JI
22. Holloman W
23. Guzman P
24. Farman M
25. Stajich JE
26. Sentandreu R
27. Gonzalez-Prieto JM
28. Kennell JC
29. Molina L
30. Schirawski J
31. Mendoza-Mendoza A
32. Greilinger D
33. Munch K
34. Rossel N
35. Scherer M
36. Vranes M
37. Ladendorf O
38. Vincon V
39. Fuchs U
40. Sandrock B
41. Meng S
42. Cahill MJ
43. Boyce KJ
44. Klose J
45. Klosterman SJ
46. Deelstra HJ
47. Ortiz-Castellanos L
48. Li W
49. Sanchez-Alonso P
50. Schreier PH
51. Hauser-Hahn I
52. Vaupel M
53. Koopmann E
54. Friedrich G
55. Voss H
56. Schluter T
57. Margolis J
58. Platt D
59. Swimmer C
60. Gnirke A
61. Chen F
62. Vysotskaia V
63. Mannhaupt G
64. Guldener U
65. Munsterkotter M
66. Haase D
67. Oesterheld M
68. Mewes HW
69. Mauceli EW
70. DeCaprio D
71. Wade CM
72. Butler J
73. Young S
74. Jaffe DB
75. Calvo S
76. Nusbaum C
77. Galagan J
78. Birren BW
79. Ma LJ
80. Ho EC
(2006) NCBI Nucleotide
ID AACP00000000. Mycosarcoma maydis strain 521, whole genome shotgun sequencing project.

https://www.ncbi.nlm.nih.gov/nuccore/AACP00000000

References

1. Bailey TL
2. Gribskov M
(1998) Combining evidence using p-values: application to sequence homology searches
Bioinformatics 14:48–54.

https://doi.org/10.1093/bioinformatics/14.1.48
- PubMed
- Google Scholar
1. Barrio-Hernandez I
2. Yeo J
3. Jänes J
4. Mirdita M
5. Gilchrist CLM
6. Wein T
7. Varadi M
8. Velankar S
9. Beltrao P
10. Steinegger M
(2023) Clustering predicted structures at the scale of the known protein universe
Nature 622:637–645.

https://doi.org/10.1038/s41586-023-06510-w
- PubMed
- Google Scholar
(2000) Characterization of a Ustilago maydis gene specifically induced during the biotrophic phase: evidence for negative as well as positive regulation
Molecular and Cellular Biology 20:329–339.

https://doi.org/10.1128/MCB.20.1.329-339.2000
- PubMed
- Google Scholar
(2022) Secreted glycoside hydrolase proteins as effectors and invasion patterns of plant-associated fungi and oomycetes
Frontiers in Plant Science 13:853106.

https://doi.org/10.3389/fpls.2022.853106
- PubMed
- Google Scholar
1. Camacho C
2. Coulouris G
3. Avagyan V
4. Ma N
5. Papadopoulos J
6. Bealer K
7. Madden TL
(2009) BLAST+: architecture and applications
BMC Bioinformatics 10:421.

https://doi.org/10.1186/1471-2105-10-421
- PubMed
- Google Scholar
1. Camoni L
2. Visconti S
3. Aducci P
4. Marra M
(2018) 14-3-3 proteins in plant hormone signaling: doing several things at once
Frontiers in Plant Science 9:297.

https://doi.org/10.3389/fpls.2018.00297
- PubMed
- Google Scholar
1. Cao Y
2. Kümmel F
3. Logemann E
4. Gebauer JM
5. Lawson AW
6. Yu D
7. Uthoff M
8. Keller B
9. Jirschitzka J
10. Baumann U
11. Tsuda K
12. Chai J
13. Schulze-Lefert P
(2023) Structural polymorphisms within a common powdery mildew effector scaffold as a driver of coevolution with cereal immune receptors
PNAS 120:e2307604120.

https://doi.org/10.1073/pnas.2307604120
- Google Scholar
1. Chen J
2. Zhang X
3. Rathjen JP
4. Dodds PN
(2022) Direct recognition of pathogen effectors by plant NLR immune receptors and downstream signalling
Essays in Biochemistry 66:471–483.

https://doi.org/10.1042/EBC20210072
- PubMed
- Google Scholar
(2018) Draft genome resource for the potato powdery scab pathogen Spongospora subterranea
Molecular Plant-Microbe Interactions 31:1227–1229.

https://doi.org/10.1094/MPMI-06-18-0163-A
- PubMed
- Google Scholar
1. Cissé OH
2. Almeida J
3. Fonseca A
4. Kumar AA
5. Salojärvi J
6. Overmyer K
7. Hauser PM
8. Pagni M
(2013) Genome sequencing of the plant pathogen Taphrina deformans, the causal agent of peach leaf curl
mBio 4:e00055.

https://doi.org/10.1128/mBio.00055-13
- PubMed
- Google Scholar
1. Colland F
2. Jacq X
3. Trouplin V
4. Mougin C
5. Groizeleau C
6. Hamburger A
7. Meil A
8. Wojcik J
9. Legrain P
10. Gauthier JM
(2004) Functional proteomics mapping of a human signaling pathway
Genome Research 14:1324–1332.

https://doi.org/10.1101/gr.2334104
- PubMed
- Google Scholar
(2022) Candidate effector proteins from the oomycetes Plasmopara viticola and Phytophthora parasitica share similar predicted structures and induce cell death in Nicotiana species
PLOS ONE 17:e0278778.

https://doi.org/10.1371/journal.pone.0278778
- PubMed
- Google Scholar
(2019) First Draft genome sequence of a polymyxa genus member, polymyxa betae, the protist vector of rhizomania
Microbiol Resour Announc 8:10.

https://doi.org/10.1128/MRA.01509-18
- Google Scholar
1. Decroës A
(2022) Rhizomania: Hide and seek of polymyxa betae and the beet necrotic yellow vein virus with Beta vulgaris
Molecular Plant-Microbe Interactions 35:989–1005.

https://doi.org/10.1094/MPMI-03-22-0063-R
- PubMed
- Google Scholar
(2022) Metagenomics approach for Polymyxa betae genome assembly enables comparative analysis towards deciphering the intracellular parasitic lifestyle of the plasmodiophorids
Genomics 114:9–22.

https://doi.org/10.1016/j.ygeno.2021.11.018
- PubMed
- Google Scholar
(2015) Structure analysis uncovers a highly diverse but structurally conserved effector family in phytopathogenic fungi
PLOS Pathogens 11:e1005228.

https://doi.org/10.1371/journal.ppat.1005228
- PubMed
- Google Scholar
1. Derbyshire MC
2. Raffaele S
(2023) Surface frustration re-patterning underlies the structural landscape and evolvability of fungal orphan candidate effectors
Nature Communications 14:5244.

https://doi.org/10.1038/s41467-023-40949-9
- PubMed
- Google Scholar
1. Dong R
2. Peng Z
3. Zhang Y
4. Yang J
(2018) mTM-align: an algorithm for fast and accurate multiple protein structure alignment
Bioinformatics 34:1719–1725.

https://doi.org/10.1093/bioinformatics/btx828
- PubMed
- Google Scholar
(1997) Characterization of alpha-ketoglutarate-dependent taurine dioxygenase from Escherichia coli
The Journal of Biological Chemistry 272:23031–23036.

https://doi.org/10.1074/jbc.272.37.23031
- PubMed
- Google Scholar
(2002) An efficient algorithm for large-scale detection of protein families
Nucleic Acids Research 30:1575–1584.

https://doi.org/10.1093/nar/30.7.1575
- PubMed
- Google Scholar
(2021) IUPred3: prediction of protein disorder enhanced with unambiguous experimental annotation and visualization of evolutionary conservation
Nucleic Acids Research 49:W297–W303.

https://doi.org/10.1093/nar/gkab408
- PubMed
- Google Scholar
Preprint
1. Evans R
(2022) Protein Complex Prediction with AlphaFold-Multimer
bioRxiv.

https://doi.org/10.1101/2021.10.04.463034
- Google Scholar
(1997) Toward a functional analysis of the yeast genome through exhaustive two-hybrid screens
Nature Genetics 16:277–282.

https://doi.org/10.1038/ng0797-277
- PubMed
- Google Scholar
1. Furzer OJ
2. Cevik V
3. Fairhead S
4. Bailey K
5. Redkar A
6. Schudoma C
7. MacLean D
8. Holub EB
9. Jones JDG
(2022) An improved assembly of the Albugo candida Ac2V genome reveals the expansion of the “CCG” class of effectors
Molecular Plant-Microbe Interactions 35:39–48.

https://doi.org/10.1094/MPMI-04-21-0075-R
- PubMed
- Google Scholar
1. Gao F
2. Zhang B-S
3. Zhao J-H
4. Huang J-F
5. Jia P-S
6. Wang S
7. Zhang J
8. Zhou J-M
9. Guo H-S
(2019) Deacetylation of chitin oligomers increases virulence in soil-borne fungal pathogens
Nature Plants 5:1167–1176.

https://doi.org/10.1038/s41477-019-0527-4
- PubMed
- Google Scholar
1. Geyer R
2. Wee S
3. Anderson S
4. Yates J
5. Wolf DA
(2003) BTB/POZ domain proteins are putative substrate adaptors for cullin 3 ubiquitin ligases
Molecular Cell 12:783–790.

https://doi.org/10.1016/s1097-2765(03)00341-1
- PubMed
- Google Scholar
1. González-García M
2. Pérez-López E
(2021) Looking for a cultured surrogate for effectome studies of the clubroot pathogen
Frontiers in Microbiology 12:650307.

https://doi.org/10.3389/fmicb.2021.650307
- PubMed
- Google Scholar
(2007) Quantifying similarity between motifs
Genome Biology 8:R24.

https://doi.org/10.1186/gb-2007-8-2-r24
- PubMed
- Google Scholar
Preprint
(2022) DeepTMHMM predicts alpha and beta transmembrane proteins using deep neural networks
bioRxiv.

https://doi.org/10.1101/2022.04.08.487609
- Google Scholar
1. Harmon CL
2. Timilsina S
3. Bonkowski J
4. Jones DD
5. Sun X
6. Vallad GE
7. Sepulveda LR
8. Bull C
9. Jones JB
(2018) Bacterial Gall of Loropetalum chinense caused by Pseudomonas amygdali pv. loropetali pv. nov
Plant Disease 102:799–806.

https://doi.org/10.1094/PDIS-04-17-0505-RE
- PubMed
- Google Scholar
1. Hofer A
(2023) Targeting the nucleotide metabolism of Trypanosoma brucei and other trypanosomatids
FEMS Microbiology Reviews 47:1–20.

https://doi.org/10.1093/femsre/fuad020
- PubMed
- Google Scholar
(2023) AlphaFold-Multimer predicts cross-kingdom interactions at the plant-pathogen interface
Nature Communications 14:6040.

https://doi.org/10.1038/s41467-023-41721-9
- PubMed
- Google Scholar
(2023) The clubroot pathogen Plasmodiophora brassicae: A profile update
Molecular Plant Pathology 24:89–106.

https://doi.org/10.1111/mpp.13283
- PubMed
- Google Scholar
(2024) Telomere-to-telomere genome assembly of the clubroot pathogen plasmodiophora brassicae
Genome Biology and Evolution 16:evae122.

https://doi.org/10.1093/gbe/evae122
- PubMed
- Google Scholar
1. Jones P
2. Binns D
3. Chang HY
4. Fraser M
5. Li W
6. McAnulla C
7. McWilliam H
8. Maslen J
9. Mitchell A
10. Nuka G
11. Pesseat S
12. Quinn AF
13. Sangrador-Vegas A
14. Scheremetjew M
15. Yong SY
16. Lopez R
17. Hunter S
(2014) InterProScan 5: genome-scale protein function classification
Bioinformatics 30:1236–1240.

https://doi.org/10.1093/bioinformatics/btu031
- PubMed
- Google Scholar
1. Jumper J
2. Evans R
3. Pritzel A
4. Green T
5. Figurnov M
6. Ronneberger O
7. Tunyasuvunakool K
8. Bates R
9. Žídek A
10. Potapenko A
11. Bridgland A
12. Meyer C
13. Kohl SAA
14. Ballard AJ
15. Cowie A
16. Romera-Paredes B
17. Nikolov S
18. Jain R
19. Adler J
20. Back T
21. Petersen S
22. Reiman D
23. Clancy E
24. Zielinski M
25. Steinegger M
26. Pacholska M
27. Berghammer T
28. Bodenstein S
29. Silver D
30. Vinyals O
31. Senior AW
32. Kavukcuoglu K
33. Kohli P
34. Hassabis D
(2021) Highly accurate protein structure prediction with AlphaFold
Nature 596:583–589.

https://doi.org/10.1038/s41586-021-03819-2
- PubMed
- Google Scholar
1. Kämper J
2. Kahmann R
3. Bölker M
4. Ma L-J
5. Brefort T
6. Saville BJ
7. Banuett F
8. Kronstad JW
9. Gold SE
10. Müller O
11. Perlin MH
12. Wösten HAB
13. de Vries R
14. Ruiz-Herrera J
15. Reynaga-Peña CG
16. Snetselaar K
17. McCann M
18. Pérez-Martín J
19. Feldbrügge M
20. Basse CW
21. Steinberg G
22. Ibeas JI
23. Holloman W
24. Guzman P
25. Farman M
26. Stajich JE
27. Sentandreu R
28. González-Prieto JM
29. Kennell JC
30. Molina L
31. Schirawski J
32. Mendoza-Mendoza A
33. Greilinger D
34. Münch K
35. Rössel N
36. Scherer M
37. Vranes M
38. Ladendorf O
39. Vincon V
40. Fuchs U
41. Sandrock B
42. Meng S
43. Ho ECH
44. Cahill MJ
45. Boyce KJ
46. Klose J
47. Klosterman SJ
48. Deelstra HJ
49. Ortiz-Castellanos L
50. Li W
51. Sanchez-Alonso P
52. Schreier PH
53. Häuser-Hahn I
54. Vaupel M
55. Koopmann E
56. Friedrich G
57. Voss H
58. Schlüter T
59. Margolis J
60. Platt D
61. Swimmer C
62. Gnirke A
63. Chen F
64. Vysotskaia V
65. Mannhaupt G
66. Güldener U
67. Münsterkötter M
68. Haase D
69. Oesterheld M
70. Mewes H-W
71. Mauceli EW
72. DeCaprio D
73. Wade CM
74. Butler J
75. Young S
76. Jaffe DB
77. Calvo S
78. Nusbaum C
79. Galagan J
80. Birren BW
(2006) Insights from the genome of the biotrophic fungal plant pathogen Ustilago maydis
Nature 444:97–101.

https://doi.org/10.1038/nature05248
- PubMed
- Google Scholar
1. Kim D
2. Paggi JM
3. Park C
4. Bennett C
5. Salzberg SL
(2019) Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype
Nature Biotechnology 37:907–915.

https://doi.org/10.1038/s41587-019-0201-4
- PubMed
- Google Scholar
1. Kosakovsky Pond SL
2. Frost SDW
(2005) Not so different after all: a comparison of methods for detecting amino acid sites under selection
Molecular Biology and Evolution 22:1208–1222.

https://doi.org/10.1093/molbev/msi105
- PubMed
- Google Scholar
1. Lanver D
2. Müller AN
3. Happel P
4. Schweizer G
5. Haas FB
6. Franitza M
7. Pellegrin C
8. Reissmann S
9. Altmüller J
10. Rensing SA
11. Kahmann R
(2018) The biotrophic development of Ustilago maydis Studied by RNA-Seq Analysis
The Plant Cell 30:300–323.

https://doi.org/10.1105/tpc.17.00764
- PubMed
- Google Scholar
1. Lassmann T
(2020) Kalign 3: multiple sequence alignment of large datasets
Bioinformatics 36:1928–1929.

https://doi.org/10.1093/bioinformatics/btz795
- Google Scholar
1. Lazar N
2. Mesarich CH
3. Petit-Houdenot Y
4. Talbi N
5. Li de la Sierra-Gallay I
6. Zélie E
7. Blondeau K
8. Gracy J
9. Ollivier B
10. Blaise F
11. Rouxel T
12. Balesdent MH
13. Idnurm A
14. van Tilbeurgh H
15. Fudal I
(2022) A new family of structurally conserved fungal effectors displays epistatic interactions with plant resistance proteins
PLOS Pathogens 18:e1010664.

https://doi.org/10.1371/journal.ppat.1010664
- PubMed
- Google Scholar
1. Li J
2. Mahajan A
3. Tsai MD
(2006) Ankyrin repeat: a unique motif mediating protein-protein interactions
Biochemistry 45:15168–15178.

https://doi.org/10.1021/bi062188q
- PubMed
- Google Scholar
1. Li W
2. Yadeta KA
3. Elmore JM
4. Coaker G
(2013) The Pseudomonas syringae effector HopQ1 promotes bacterial virulence and interacts with tomato 14-3-3 proteins in a phosphorylation-dependent manner
Plant Physiology 161:2062–2074.

https://doi.org/10.1104/pp.112.211748
- PubMed
- Google Scholar
1. Li H
2. Wang J
3. Kuan TA
4. Tang B
5. Feng L
6. Wang J
7. Cheng Z
8. Skłenar J
9. Derbyshire P
10. Hulin M
11. Li Y
12. Zhai Y
13. Hou Y
14. Menke FLH
15. Wang Y
16. Ma W
(2023) Pathogen protein modularity enables elaborate mimicry of a host phosphatase
Cell 186:3196–3207.

https://doi.org/10.1016/j.cell.2023.05.049
- PubMed
- Google Scholar
1. Liu L
2. Xu L
3. Jia Q
4. Pan R
5. Oelmüller R
6. Zhang W
7. Wu C
(2019) Arms race: diverse effector proteins with conserved motifs
Plant Signaling & Behavior 14:1557008.

https://doi.org/10.1080/15592324.2018.1557008
- PubMed
- Google Scholar
1. Lovelace AH
2. Dorhmi S
3. Hulin MT
4. Li Y
5. Mansfield JW
6. Ma W
(2023) Effector identification in plant pathogens
Phytopathology 113:637–650.

https://doi.org/10.1094/PHYTO-09-22-0337-KD
- PubMed
- Google Scholar
(2021) The ETS-ETI cycle: evolutionary processes and metapopulation dynamics driving the diversification of pathogen effectors and host immune factors
Current Opinion in Plant Biology 62:102011.

https://doi.org/10.1016/j.pbi.2021.102011
- PubMed
- Google Scholar
(2015) Evidence for suppression of immunity as a driver for genomic introgressions and host range expansion in races of Albugo candida, a generalist parasite
eLife 4:e04550.

https://doi.org/10.7554/eLife.04550
- PubMed
- Google Scholar
1. Meng EC
2. Goddard TD
3. Pettersen EF
4. Couch GS
5. Pearson ZJ
6. Morris JH
7. Ferrin TE
(2023) UCSF ChimeraX: Tools for structure building and analysis
Protein Science 32:e4792.

https://doi.org/10.1002/pro.4792
- PubMed
- Google Scholar
(2015) Repeat-containing protein effectors of plant-associated organisms
Frontiers in Plant Science 6:872.

https://doi.org/10.3389/fpls.2015.00872
- PubMed
- Google Scholar
1. Mine A
2. Berens ML
3. Nobori T
4. Anver S
5. Fukumoto K
6. Winkelmüller TM
7. Takeda A
8. Becker D
9. Tsuda K
(2017) Pathogen exploitation of an abscisic acid- and jasmonate-inducible MAPK phosphatase and its interception by Arabidopsis immunity
PNAS 114:7456–7461.

https://doi.org/10.1073/pnas.1702613114
- PubMed
- Google Scholar
(2024) Decoding the arsenal: protist effectors and their impact on photosynthetic hosts
Molecular Plant-Microbe Interactions 37:498–506.

https://doi.org/10.1094/MPMI-11-23-0196-CR
- PubMed
- Google Scholar
Software
1. Nicholas K
(2020) Avebx/kc-align: codon-aware aligner, version f6932b5
Github.

https://github.com/davebx/kc-align
1. Nomura K
2. Andreazza F
3. Cheng J
4. Dong K
5. Zhou P
6. He SY
(2023) Bacterial pathogens deliver water- and solute-permeable channels to plant cells
Nature 621:586–591.

https://doi.org/10.1038/s41586-023-06531-5
- PubMed
- Google Scholar
(2023) Natural variation in Arabidopsis responses to Plasmodiophora brassicae reveals an essential role for Resistance to Plasmodiophora brasssicae 1 (RPB1)
The Plant Journal 116:1421–1440.

https://doi.org/10.1111/tpj.16438
- PubMed
- Google Scholar
(2008) Ankyrin repeat proteins comprise a diverse family of bacterial type IV effectors
Science 320:1651–1654.

https://doi.org/10.1126/science.1158160
- PubMed
- Google Scholar
1. Patro R
2. Duggal G
3. Love MI
4. Irizarry RA
5. Kingsford C
(2017) Salmon provides fast and bias-aware quantification of transcript expression
Nature Methods 14:417–419.

https://doi.org/10.1038/nmeth.4197
- PubMed
- Google Scholar
(2018) Identification of Plasmodiophora brassicae effectors - A challenging goal
Virulence 9:1344–1353.

https://doi.org/10.1080/21505594.2018.1504560
- PubMed
- Google Scholar
Software
1. Perez-Lopez E
(2025) AlphaFold_effector_paper, version swh:1:rev:ab6a6377b9a29ac3d6a31516e9e55e3d3ecf2d7e
Software Heritage.

https://archive.softwareheritage.org/swh:1:dir:4518d9f1e26cf8523701c5b130bdfffbf5556ab1;origin=https://github.com/Edelab/AlphaFold_effector_paper;visit=swh:1:snp:2d0370d8fd67b81c29c193bd63e1211bfa044aa8;anchor=swh:1:rev:ab6a6377b9a29ac3d6a31516e9e55e3d3ecf2d7e
1. Petit M
2. Schneider A
(1983) Chemical analysis of the wall of the yeast form of Taphrina deformans
Archives of Microbiology 135:141–146.

https://doi.org/10.1007/BF00408024
- Google Scholar
(2005) HyPhy: hypothesis testing using phylogenies
Bioinformatics 21:676–679.

https://doi.org/10.1093/bioinformatics/bti079
- Google Scholar
1. Ramos C
2. Matas IM
3. Bardaji L
4. Aragón IM
5. Murillo J
(2012) Pseudomonas savastanoi pv. savastanoi: some like it knot
Molecular Plant Pathology 13:998–1009.

https://doi.org/10.1111/j.1364-3703.2012.00816.x
- PubMed
- Google Scholar
1. Redkar A
2. Cevik V
3. Bailey K
4. Zhao H
5. Kim DS
6. Zou Z
7. Furzer OJ
8. Fairhead S
9. Borhan MH
10. Holub EB
11. Jones JDG
(2023) The Arabidopsis WRR4A and WRR4B paralogous NLR proteins both confer recognition of multiple Albugo candida effectors
The New Phytologist 237:532–547.

https://doi.org/10.1111/nph.18378
- PubMed
- Google Scholar
1. Reumann S
2. Babujee L
3. Ma C
4. Wienkoop S
5. Siemsen T
6. Antonicelli GE
7. Rasche N
8. Lüder F
9. Weckwerth W
10. Jahn O
(2007) Proteome analysis of Arabidopsis leaf peroxisomes reveals novel targeting peptides, metabolic pathways, and defense mechanisms
The Plant Cell 19:3170–3193.

https://doi.org/10.1105/tpc.107.050989
- PubMed
- Google Scholar
(2000) EMBOSS: the European molecular biology open software suite
Trends in Genetics 16:276–277.

https://doi.org/10.1016/s0168-9525(00)02024-2
- PubMed
- Google Scholar
1. Robert X
2. Gouet P
(2014) Deciphering key features in protein structures with the new ENDscript server
Nucleic Acids Research 42:W320–W324.

https://doi.org/10.1093/nar/gku316
- PubMed
- Google Scholar
Software
1. Schrödinger LLC
(2015) The pymol molecular graphics system, version 1.8
Pymol.

https://www.pymol.org/
1. Seong K
2. Krasileva KV
(2021) Computational structural genomics unravels common folds and novel families in the secretome of fungal phytopathogen Magnaporthe oryzae
Molecular Plant-Microbe Interactions 34:1267–1280.

https://doi.org/10.1094/MPMI-03-21-0071-R
- PubMed
- Google Scholar
1. Seong K
2. Krasileva KV
(2023) Prediction of effector protein structures from fungal phytopathogens enables evolutionary analyses
Nature Microbiology 8:174–187.

https://doi.org/10.1038/s41564-022-01287-6
- PubMed
- Google Scholar
1. Sievers F
2. Higgins DG
(2018) Clustal Omega for making accurate alignments of many protein sequences
Protein Science 27:135–145.

https://doi.org/10.1002/pro.3290
- PubMed
- Google Scholar
1. Steenwyk JL
2. Buida TJ
3. Li Y
4. Shen XX
5. Rokas A
(2020) ClipKIT: A multiple sequence alignment trimming software for accurate phylogenomic inference
PLOS Biology 18:e3001007.

https://doi.org/10.1371/journal.pbio.3001007
- PubMed
- Google Scholar
1. Tanaka S
2. Schweizer G
3. Rössel N
4. Fukada F
5. Thines M
6. Kahmann R
(2019) Neofunctionalization of the secreted Tin2 effector in the fungal pathogen Ustilago maydis
Nature Microbiology 4:251–257.

https://doi.org/10.1038/s41564-018-0304-6
- PubMed
- Google Scholar
(2022) SignalP 6.0 predicts all five types of signal peptides using protein language models
Nature Biotechnology 40:1023–1025.

https://doi.org/10.1038/s41587-021-01156-3
- PubMed
- Google Scholar
1. Teulet A
2. Quan C
3. Evangelisti E
4. Wanke A
5. Yang W
6. Schornack S
(2023) A pathogen effector FOLD diversified in symbiotic fungi
The New Phytologist 239:1127–1139.

https://doi.org/10.1111/nph.18996
- PubMed
- Google Scholar
1. Thynne E
2. Ali H
3. Seong K
4. Abukhalaf M
5. Guerreiro MA
6. Flores‐Nunez VM
7. Hansen R
8. Bergues A
9. Salman MJ
10. Rudd JJ
11. Kanyuka K
12. Tholey A
13. Krasileva KV
14. Kettles GJ
15. Stukenbrock EH
(2024) An array of Zymoseptoria tritici effectors suppress plant immune responses
Molecular Plant Pathology 25:e13500.

https://doi.org/10.1111/mpp.13500
- Google Scholar
(2016) Plant-pathogen effectors: cellular probes interfering with plant defenses in spatial and temporal manners
Annual Review of Phytopathology 54:419–441.

https://doi.org/10.1146/annurev-phyto-080615-100204
- PubMed
- Google Scholar
(2019a) The Synchytrium endobioticum AvrSen1 triggers a hypersensitive response in Sen1 potatoes while natural variants evade detection
Molecular Plant-Microbe Interactions 32:1536–1546.

https://doi.org/10.1094/MPMI-05-19-0138-R
- PubMed
- Google Scholar
(2019b) Comparative genomics of chytrid fungi reveal insights into the obligate biotrophic and pathogenic lifestyle of Synchytrium endobioticum
Scientific Reports 9:8672.

https://doi.org/10.1038/s41598-019-45128-9
- PubMed
- Google Scholar
1. Van Dongen S
(2008) Graph clustering via a discrete uncoupling process
SIAM Journal on Matrix Analysis and Applications 30:121–141.

https://doi.org/10.1137/040608635
- Google Scholar
1. van Kempen M
2. Kim SS
3. Tumescheit C
4. Mirdita M
5. Lee J
6. Gilchrist CLM
7. Söding J
8. Steinegger M
(2024) Fast and accurate protein structure search with Foldseek
Nature Biotechnology 42:243–246.

https://doi.org/10.1038/s41587-023-01773-0
- PubMed
- Google Scholar
Book
1. Wickham H
(2016) Ggplot2
Springer.

https://doi.org/10.1007/978-3-319-24277-4
- Google Scholar
(2012) Sequence divergent RXLR effectors share a structural fold conserved across plant pathogenic oomycete species
PLOS Pathogens 8:e1002400.

https://doi.org/10.1371/journal.ppat.1002400
- PubMed
- Google Scholar
1. Xin X-F
2. Nomura K
3. Ding X
4. Chen X
5. Wang K
6. Aung K
7. Uribe F
8. Rosa B
9. Yao J
10. Chen J
11. He SY
(2015) Pseudomonas syringae effector avirulence protein E localizes to the host plasma membrane and down-regulates the expression of the nonrace-specific disease resistance1/harpin-induced1-like13 gene required for antibacterial immunity in arabidopsis
Plant Physiology 169:793–802.

https://doi.org/10.1104/pp.15.00547
- PubMed
- Google Scholar
1. Xu J
2. Meng J
3. Meng X
4. Zhao Y
5. Liu J
6. Sun T
7. Liu Y
8. Wang Q
9. Zhang S
(2016) Pathogen-responsive MPK3 and MPK6 reprogram the biosynthesis of indole glucosinolates and their derivatives in arabidopsis immunity
The Plant Cell 28:1144–1162.

https://doi.org/10.1105/tpc.15.00871
- PubMed
- Google Scholar
(2023) AlphaPulldown-a python package for protein-protein interaction screens using AlphaFold-Multimer
Bioinformatics 39:btac749.

https://doi.org/10.1093/bioinformatics/btac749
- PubMed
- Google Scholar
1. Yu DS
2. Outram MA
3. Smith A
4. McCombe CL
5. Khambalkar PB
6. Rima SA
7. Sun X
8. Ma L
9. Ericsson DJ
10. Jones DA
11. Williams SJ
(2024) The structural repertoire of Fusarium oxysporum f. sp. lycopersici effectors revealed by experimental and computational studies
eLife 12:RP89280.

https://doi.org/10.7554/eLife.89280
- PubMed
- Google Scholar
1. Zhang Y
2. Skolnick J
(2005) TM-align: a protein structure alignment algorithm based on the TM-score
Nucleic Acids Research 33:2302–2309.

https://doi.org/10.1093/nar/gki524
- PubMed
- Google Scholar
Preprint
1. Zhong B
2. Su X
3. Wen M
4. Zuo S
5. Hong L
6. Lin J
(2022) ParaFold: paralleling AlphaFold for large-scale predictions
arXiv.

https://doi.org/10.48550/arXiv.2111.06340
- Google Scholar

Article and author information

Author details

Soham Mukhopadhyay
1. Départment de Phytologie, Faculté des sciences de l’agriculture et de l’alimentation, Université Laval, Québec, Canada
2. Centre de recherche et d’innovation sur les végétaux (CRIV), Université Laval, Québec, Canada
3. L’Institute EDS, Université Laval, Québec, Canada
Contribution
Conceptualization, Formal analysis, Investigation, Methodology, Writing – original draft, Writing – review and editing

For correspondence
soham.mukhopadhyay.1@ulaval.ca

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-5279-0396
Muhammad Asim Javed
1. Départment de Phytologie, Faculté des sciences de l’agriculture et de l’alimentation, Université Laval, Québec, Canada
2. Centre de recherche et d’innovation sur les végétaux (CRIV), Université Laval, Québec, Canada
3. L’Institute EDS, Université Laval, Québec, Canada
Contribution
Investigation, Visualization, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0003-1658-3565
Jiaxu Wu
1. Départment de Phytologie, Faculté des sciences de l’agriculture et de l’alimentation, Université Laval, Québec, Canada
2. Centre de recherche et d’innovation sur les végétaux (CRIV), Université Laval, Québec, Canada
3. L’Institute EDS, Université Laval, Québec, Canada
Contribution
Validation, Investigation, Writing – review and editing

Competing interests
No competing interests declared
Edel Perez-Lopez
1. Départment de Phytologie, Faculté des sciences de l’agriculture et de l’alimentation, Université Laval, Québec, Canada
2. Centre de recherche et d’innovation sur les végétaux (CRIV), Université Laval, Québec, Canada
3. L’Institute EDS, Université Laval, Québec, Canada
Contribution
Conceptualization, Supervision, Funding acquisition, Methodology, Writing – original draft, Writing – review and editing

For correspondence
edel.perez-lopez.1@ulaval.ca

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-3708-8558

Funding

Canola Council of Canada (2021.4)

Edel Perez-Lopez

Natural Sciences and Engineering Research Council of Canada (RGPIN-2021-02518)

Edel Perez-Lopez

Western Grain Research Foundation

Edel Perez-Lopez

Manitoba Canola Growers Association

Edel Perez-Lopez

Alberta Canola Producers Commission

Edel Perez-Lopez

Fonds de recherche du Québec (Doctoral scholarships)

Muhammad Asim Javed
Jiaxu Wu

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We are grateful to the bioinformatics support personnel and infrastructure at IBIS, Université Laval, for their constant assistance throughout this project, and to Prof. Sylvain Raffaele for providing a dataset used in the study. We also thank Elisa Fantino and Anne-Sophie Brochu for their support in coordinating the interaction validations with PronetBio and Hybrigenics Services. This work was funded by the Canola Agronomic Research Program (Grant ID 2021.4), Western Grain Research Foundation, Canola Council of Canada, Alberta Canola, and Manitoba Canola Growers Association, as well as by the Discovery Program (Grant ID RGPIN-2021–02518) of the Natural Sciences and Engineering Research Council of Canada. We are also thankful to the FRQ – Nature et technologies division – for supporting MAJ and JW through doctoral scholarships.

Version history

Preprint posted: November 12, 2024
Sent for peer review: December 19, 2024
Reviewed Preprint version 1: February 21, 2025
Reviewed Preprint version 2: August 18, 2025
Version of Record published: October 7, 2025

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.105185. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.