Abstract
Mutations in the human PURA gene cause the neuro-developmental PURA syndrome. In contrast to several other mono-genetic disorders, almost all reported mutations in this nucleic acid binding protein result in the full disease penetrance. In this study, we observed that patient mutations across PURA impair its previously reported co-localization with processing bodies. These mutations either destroyed the folding integrity, RNA binding or dimerization of PURA. We also solved the crystal structures of the N- and C-terminal PUR domains of human PURA and combined them with molecular dynamics simulations and NMR measurements. The observed unusually high dynamics and structural promiscuity of PURA indicated that this protein is particularly susceptible to mutations impairing its structural integrity. It offers an explanation why even conservative mutations across PURA result in the full penetrance of symptoms in patients with PURA syndrome.
Introduction
The family of purine-rich element binding (PUR) protein is highly conserved from plants to humans (Molitor et al, 2021). Amongst the three vertebrate paralogs PURA, PURB, and PURG, the best-studied family member is PURA (Bergemann & Johnson, 1992; Haas et al, 1993; Haas et al, 1995). It is ubiquitously expressed and has been implicated in several cellular processes, including transcriptional and translational gene regulation as well as mRNA transport in neurons (Chepenik et al, 1998; Gallia et al, 2001; Haas et al., 1993; Haas et al., 1995; Johnson et al, 2006; Kobayashi et al, 2000; Mitsumori et al, 2017; Tretiakova et al, 1999). Results from two independent knock-out mouse models demonstrated that PURA is important for postnatal brain development (Hokkanen et al, 2012; Khalili et al, 2003).
Accordingly, PURA has been implicated as a modulator of neurodegenerative disorders, such as fragile X-associated tremor/ataxia syndrome (FXTAS), and the amyotrophic lateral sclerosis (ALS) frontotemporal dementia (FTD) spectrum disorder (Swinnen et al, 2020). PURA was reported to be present in the pathological RNA foci of both disorders and to protect against RNA toxicity upon ectopic overexpression (Mori et al, 2013; Rossi et al, 2015; Shen et al, 2018; Swinnen et al, 2018; Xu et al, 2013). In addition, PURA was shown to co-localize with an ALS-causing mutant variant of the FUS protein in stress granules of ALS patients. Importantly, overexpression of PURA reduced the toxicity of mutant FUS by preventing its mis-localization (Daigle et al, 2016).
In 2014, two studies reported that a monogenetic neurodevelopmental disorder is caused by sporadic mutations in the PURA gene (Hunt et al, 2014; Lalani et al, 2014). Hallmarks of this so-called PURA syndrome are developmental delay, moderate to severe intellectual disability, hypotonia, epileptic seizures, and feeding difficulties, amongst others (Reijnders et al, 2018) (Johannesen et al, 2021). Mutations causing PURA syndrome are often frame-shift events but also point mutations distributed over the entire sequence.
Based on the crystal structures of PURA from Drosophila melanogaster (dmPURA), three conserved sequence regions termed PUR repeats I, II, and III were identified to fold into globular domains (Graebsch et al, 2010; Graebsch et al, 2009; Weber et al, 2016). Whereas PUR repeat I and II assemble into an N-terminal PUR domain via intramolecular interactions, two C-terminal PUR repeats III from different molecules interact with each other and thus mediate dimerization of PURA. All these PUR domains belong to the class of the PC4-like protein family, which mainly bind single-stranded nucleic acids and can unwind double-stranded DNA and RNA (Janowski & Niessing, 2020). Although both PUR domains possess the typical PC4-like β-β-β-β-α-(linker)-β-β-β-β-α topology, the N-terminal domain has been suggested to be the main RNA/DNA interaction entity (Weber et al., 2016). In a previous study, homology models based on the dmPURA crystal structures (Graebsch et al., 2009; Weber et al., 2016) were used to predict in silico the effects of PURA syndrome-causing mutations on the structural integrity of the human PURA protein (Reijnders et al., 2018). While these mutations could be classified into groups that likely either do or do not impair the structural integrity of PURA, these predictions remained speculative (Reijnders et al., 2018). Surprisingly, in contrast to phenotypically related genetic diseases such as the Rett syndrome (Lombardi et al, 2015), no hot-spot regions could be identified in the protein sequence that trigger the PURA syndrome. With the exception of the unstructured N-terminal region and the very C-terminus, almost all mutations across the protein sequence appear to result in the full disease spectrum (Johannesen et al., 2021; Reijnders et al., 2018). To date, we fail to understand why there is such an underrepresentation of mild phenotypes in patients with PURA syndrome.
In this study, we experimentally assessed the structure and function of human PURA as well as the impact of representative, PURA syndrome-causing mutations on the protein’s integrity. When studying the impact of patient mutations on PURA’s subcellular localization, we observed impaired processing body (P-body) association but normal localization to stress granules, indicating a potential importance of P-bodies for PURA-syndrome pathology. Two particularly interesting disease-causing mutations, K97E and R140P, were further analyzed by structural means. The results indicated that the N-terminal PUR domain is unusually flexible in its fold and prone to misfolding upon mutation. In summary, we provide a structure-based explanation for the underrepresentation of mutations with mild symptoms in patients and a rational approach for predicting and functionally testing mutations of uncertain significance.
Results
Mutations in hsPURA affect stress granule association
Several patient-related mutations are frame-shift events and thus predictably destroy the integrity of PURA. In addition, a number of point and indel mutations have been reported for which the pathology-causing effects are less easy to predict (Johannesen et al., 2021; Reijnders et al., 2018). To understand the functional impact of such mutations, we decided to first study the effect of three known patient mutations, K97E, I206F, and F233del (Hunt et al., 2014; Lalani et al., 2014; Reijnders et al., 2018), on the subcellular localization of hsPURA.
Previous studies in human cell cultures indicated that hsPURA localizes to stress granules upon cellular stress and may be essential for their formation (Daigle et al., 2016; Markmiller et al, 2018). To analyze whether the afore-mentioned patient-related mutations affect the localization of hsPURA to stress granules, the mutant and wild-type versions of full-length FLAG-tagged hsPURA were overexpressed in HeLa cells using a doxycycline-inducible expression cassette (Fig 1A) and cells were stressed for one hour with 500 µM sodium arsenite. To find out whether stress-granule localization of hsPURA depends on its nucleic-acids binding properties, we also utilized an RNA-binding deficient version of hsPURA bearing eleven structure-guided point mutations (m11) as a control (for rationale and design of this mutant, see Appendix Fig S1). Using an anti-FLAG tag antibody, we observed wild-type hsPURA to be distributed within the cytoplasm and accumulated in stress-granule structures, as seen by co-staining with the stress-granule marker G3BP1 (Fig 1A). In contrast, the nucleic acid-binding deficient mutant hsPURA m11 failed to co-localize to G3BP1-positive granules, indicating that RNA-binding by hsPURA is necessary for its co-localization with stress granules. Furthermore, all three versions of hsPURA bearing patient-derived mutations localized to stress granules upon cellular stress (Fig 1A). While quantification showed a reduced association of hsPURA K97E mutant with G3BP1-positive granules (Fig 1B), the two other mutants, I206F and F233del, showed the same co-localization to stress granules as the wild type control.
To assess the previously reported influence of hsPURA on stress-granule formation at endogenous PURA levels (Daigle et al., 2016), we performed a knockdown of hsPURA in HeLa cells and stained with the stress-granule marker G3BP1 and PURA. As expected, endogenous hsPURA co-localized with G3BP1 upon stress treatment with 500 µM arsenite PURA (Appendix Fig S2A and S2B) and thus behaved similarly to overexpressed hsPURA. A PURA knockdown did not alter the total number of microscopically visible stress granules but resulted in a greater ratio of smaller stress granules (Appendix Fig S2C). Although this finding suggests that PURA is not essential for stress granule formation, it should be noted that the knockdown did not completely eliminate PURA levels and hence a stronger effect on stress granules could occur in a complete knockout of PURA. However, PURA-syndrome patients bear a heterozygous mutation, suggesting that an incomplete knockdown likely resembles the pathogenic situation more closely than a full knockout. Together these findings suggest that (i) reduced stress-granule association of mutant hsPURA variants may not be a core feature of the PURA syndrome and (ii) impaired stress-granule formation is not very likely to be directly responsible for the etiology of this disorder.
Localization of hsPURA to P-bodies is impaired by PURA syndrome-causing mutations
In order to assess the localization of hsPURA to cytoplasmic granules in unstressed conditions, we first recapitulated its recently reported localization to P-bodies (Molitor et al, 2023). Co-staining with the marker DCP1A indeed confirmed a co-localization of overexpressed hsPURA in P-bodies of HeLa cells (Fig 2A and 2B). As for stress granules, the PURA m11 mutant failed to accumulate in P-bodies, indicating that RNA binding of PURA contributes to its P-body localization. In addition, the K97E and F233del mutants of hsPURA showed significantly reduced co-localization to P-bodies (Fig 2A and 2B). In summary, these experiments indicate that P-body localization of hsPURA requires its RNA-binding activity, and that the majority of tested patient mutations result in impaired P-body localization. Furthermore, since F233del mutant has been predicted to have impaired dimerization properties (Reijnders et al., 2018), these findings suggest dimerization to be important for PURA’s P-body association.
hsPURA does not readily undergo RNA-driven phase separation in vitro
hsPURA accumulates in an RNA-dependent fashion to stress granules as well as to P-bodies. Both types of granules are liquid-phase separated entities, indicating that the recruitment to such granules might occur via phase separation of RNA-bound hsPURA. Since hsPURA was recently shown to be required for P-body formation in HeLa cells and fibroblasts (Molitor et al., 2023), PURA-dependent liquid phase separation could potentially also directly contribute to the formation of these granules. Furthermore, a loss of such a function could potential contribute to the etiology of PURA syndrome. In order to test this hypothesis, we used a microscope-based approach to visualize the ability of RNA-bound hsPURA to phase separate in vitro. As a positive control for RNA-mediated liquid-phase separation, the protein FUS RGG3 was used (Hofweber et al, 2018). In contrast to this control, no indication of phase separation was observed for hsPURA either in the presence or absence of total HeLa cell RNA (Appendix Fig S3). These results indicate that hsPURA does not readily phase separate with total cellular RNA under the tested experimental conditions. Of note, this observation does not exclude the possibility that hsPURA can undergo phase separation under different experimental conditions, for instance in presence of additional co-factors. However, when putting this observation in perspective with previous reports, it seems unlikely that P-body formation directly depends on phase separation by hsPURA, but rather on its recently reported function as gene regulator of the essential P-body core factors LSM14a and DDX6 (Molitor et al., 2023).
Recombinant expression tests of PURA protein with disease-causing mutations
In order to understand the impact of mutations on the molecular and structural integrity of hsPURA, we recombinantly expressed the above-described mutant hsPURA proteins K97E, I206F, and F233del in E. coli. In addition, hsPURA with either of the mutations R140P, R199P, and R245P were recombinantly expressed (Fig 3A). All three mutations exchange an arginine against a proline, albeit in different positions of the protein and very different structural contexts. R140P was initially reported to be a disease-causing mutation (Lee et al, 2018) but later re-interpreted as a mutation for which it is unclear if it can trigger PURA Syndrome, i.e. of uncertain significance (https://www.ncbi.nlm.nih.gov/clinvar/variation/192343/). In contrast, the mutations R199P and R245P have been described as disease-causing, with matching symptoms for patients with PURA syndrome (Reijnders et al., 2018). Hence, R140P was chosen in this study to explore the diagnostic potential of our approach, whereas R199P and R245P served as a reference point for pathology-inducing mutations.
Even after extensive optimization, recombinantly expressed hsPURA fragments with the mutations R199P and I206F (full-length variant and N-terminal PUR domain consisting of repeats I-II; PURA I-II) as well as F233del and R245P (full-length variant and C-terminal PUR domain consisting of two repeats III; PURA III) could not be purified due to protein insolubility (Appendix Table S1). In contrast, N-terminal PUR domains (PURA I-II) with either of the two remaining mutations K97E and R140P were soluble and could be purified to near homogeneity. The observed solubility of recombinant hsPURA I-II K97E is also in agreement with the previous in silico prediction (Reijnders et al., 2018) as K97 has a surface-exposed side chain that is unlikely to cause folding issues upon mutation into another flexible, polar side chain. For the R140P mutation with unclear pathological significance, no conclusive prediction on its impact on protein folding could be made in the past.
The C-terminal dimerization PUR domain adopt a classical PC4-like fold
Several PURA-Syndrome-causing mutations have been reported to be located in the C-terminal, dimerization-mediating PUR domain (Johannesen et al., 2021; Reijnders et al., 2018). Due to the lack of an experimental structure of human PURA, previous predictions of the impact of patient mutations on the structural integrity of hsPURA had to rely on a human homology model derived from dmPURA (Reijnders et al., 2018). Since Drosophila and human PURA have a rather moderate sequence identity of 46%, we decided to establish a better experimental basis for predicting the structural effects of patient mutations and determined the crystal structure of the fragment P215-K280 (i.e. hsPURA III) at 1.7 Å resolution (Fig 3B and Appendix Table S2). This fragment corresponds to the domain boundaries suggested by the previously solved invertebrate dmPURA III structure (Weber et al., 2016). The structure of the C-terminal PUR domain possesses the typical four antiparallel β-strands and one α-helix. Like in the previously published dmPURA III structure (Weber et al., 2016), the C-terminal PUR domain of hsPURA is built from two PUR repeats III of independent protein chains that interact with each other to form an intermolecular homodimer. The hsPURA III dimer shows high structural similarity to dmPURA III with r.m.s.d. of 1.26 Å for 125 superimposed Cα atoms and 50% of sequence identity (Appendix Fig S4A).
Dimerization domain of hsPURA shows nucleic acid binding and unwinding activities
PUR repeats III adopt a classical PC4-like fold, suggesting that it might also bind to nucleic acids. Analysis of the crystal structure of the hsPURA III homodimer revealed electrostatic surface potentials favorable for interactions with nucleic acids (Appendix Fig S4B). When performing gel electrophoresis mobility shift assays (EMSAs) we indeed observed RNA binding (Fig 3C and 3D).
It had been previously reported that PURA can separate double-stranded (ds) nucleic acids in an ATP-independent fashion (Darbinian et al, 2001; Weber et al., 2016). This so-called unwindase activity has also been reported for several other PC4-like domains from different organisms (Janowski & Niessing, 2020). For dmPURA it had already been suggested that dsDNA strands are melted by intercalation of the β-ridge between both strands, followed by binding of both single strands to the cavities formed by the curved β-sheets on both sides of the β-ridge (Appendix Fig S5) (Weber et al., 2016). Since we had already observed RNA binding by PUR repeats III (Fig 3C and 3D), we examined its potential unwindase activity by utilizing a previously reported in vitro unwinding assay (Darbinian et al., 2001; Weber et al., 2016) with dsDNA 5’-end labelled with a FAM fluorophore and 3’-end labelled with Dabcyl quencher (Appendix Fig S6). In this assay, double-stranded DNA results in low fluorescence signal due to the proximity of the quencher to the fluorophore. Upon strand separation their distance and fluorescence signal increase. We indeed observed strand separation in presence of the hsPURA III fragment (Fig 3E). Of note, RNA binding as well as dsDNA strand separation by full-length hsPURA was much more efficient than by the PUR III domain (Fig 3E and 3F), indicating potential cooperativity of the N- and C-terminal PUR domains in unwinding double-stranded nucleic acids.
PURA syndrome-causing mutations in the C-terminal PUR domain impair dimerization
Next, we used the hsPURA III structure (Appendix Fig 3B and Table S2) to perform an in-silico analysis of the impact of patient-derived mutations F233del and R245P on the structural integrity of the C-terminal PUR domain. While failure to obtain soluble recombinant PURA with F233del mutation prevented a direct biophysical assessment of its dimerization state in vitro (Appendix Table S1), this observation already indicated a misfolding of the C-terminal PUR domain upon mutation. Consistent with this observation is also the interpretation from homology modeling based on the previously published dmPURA III structure (Reijnders et al., 2018). There, the mutation was predicted to impair the structural integrity of the C-terminal hsPURA dimerization domain. F233 is part of the β-sheet and interacts with the α-helix of the other PUR repeat III within the PUR domain, contributing to dimerization (Appendix Fig S7A). Since in the PURA F233del variant, one residue is deleted, the orientation of all subsequent amino acids in the β-strand adopt the opposite orientation, likely resulting in a fold disruption of the β-sheet and possibly in impaired dimerization. In the case of R245P mutation (Appendix Fig S7B), the exchange into proline also likely impairs β-strand folding by introducing a kink, which could also impact dimerization.
To provide direct proof for impaired dimerization upon the introduction of patient-derived F233del and R245P mutations, we utilized the FRET-based NanoBRET assay in HEK293 cells (Fig 3G). With this assay, we assessed the ability of hsPURA bearing the F233del and R245P patient mutations to dimerize with wild-type hsPURA by measuring corresponding milli BRET Units (mBU). Of note, this combination of wild-type and mutant PURA protein is likely to recapitulate the heterozygous situation in patients. As a control for background signal, non-dimerizing hsPURA I-II was measured (negative control). As a control for positive interaction, hsPURA I-III was used. While for hsPURA I-III the signal intensities above the background level clearly indicated dimerization (Fig 3G), protein fragments with either the F233del or the R245P mutation showed strongly reduced signal intensities. This observation implies impairment of PURA dimerization in patients bearing either of these heterozygous mutations.
N-terminal PUR domain shows great flexibility in its domain fold
In order to better understand the structural impact of mutations in the N-terminal PUR domain (hsPURA I-II), we determined its structure (fragment E57-E212) by X-ray crystallography (Fig 4A and Appendix Table S2). The N-terminal PUR domain resembles the previously published PURA structure from D. melanogaster (Graebsch et al., 2009; Weber et al., 2016) (dmPURA I-II WT) with an average root mean square deviation (r.m.s.d.) of 1.2 Å (Appendix Fig S8). However, in contrast to this invertebrate structure, we observed for hsPURA I-II that the four molecules within its asymmetric unit of the crystal lattice showed marked differences in its loop regions and in particular at the edges of the β-sheets (Fig 4B). This suggested greater structural flexibility than average domain folds and most likely than its D. melanogaster ortholog.
Effects of patient mutations in the N-terminal PUR domain on RNA binding
To assess the interaction of hsPURA I-II with nucleic acids and to understand how selected patient-derived mutations affect this function, we performed EMSAs for this protein fragment (Fig 4C and 4D). While hsPURA I-II quantitatively bound a 24-mer CGG-repeat RNA, the hsPURA I-II K97E mutant protein showed moderately but significantly decreased RNA binding (Fig 4C and 4D). Impaired RNA binding upon K97E mutation is not surprising as a change of the surface electrostatic potential from positive to negative may impact its interaction with nucleic acids (Appendix Fig S4B). In contrast, hsPURA I-II R140P did not show impaired RNA binding (Fig 4C and 4D). This finding indicates that R140P might be a benign mutation and does not have disease-causing properties. As a negative control, the N-terminal PUR domain containing eleven selected point mutations was employed (hsPURA I-II m11; Appendix Fig S1). Since these mutations were introduced to abolish the interaction of hsPURA with nucleic acids, this mutant protein showed no detectable RNA binding in EMSA.
PURA-dependent strand separation of dsDNA is affected by disease-causing mutations
We recapitulated the previously reported unwindase activity (Darbinian et al., 2001; Weber et al., 2016) with human PURA I-II and asked if patient-derived mutations in this domain impair PURA’s strand-separating activity. In vitro unwinding experiments with dsDNA were performed for hsPURA I-II and compared to the patient mutants hsPURA I-II K97E and hsPURA I-II R140P as well as the hsPURA I-II m11 variant. While hsPURA I-II showed considerable strand-separating activity in this assay, the nucleic acid-binding deficient m11 mutant variant expectedly failed to separate dsDNA (Fig 4E and 4F). With an effect considerably stronger than observed for RNA binding (Fig 4C and 4D), the hsPURA I-II K97E variant showed a loss of strand-separation activity by 90% (Fig 4E and 4F). In contrast, hsPURA I-II R140P mutant showed wild-type-like separation of dsDNA (Fig 4E and 4F), further supporting our previous interpretation from RNA-binding analyses that the mutation R140P is likely functional.
CD spectroscopic measurements indicate impaired domain folding upon mutations
In order to confirm the structural integrity of the soluble hsPURA variants, we performed circular dichroism (CD) spectroscopy experiments. The spectra for both, hsPURA I-II as well as hsPURA I-II R140P mutant showed a typical alpha-beta profile (Fig 5A), suggesting a proper folding of the protein. In contrast, the spectrum for hsPURA I-II K97E indicated a strongly altered secondary-structure content. This observation was unexpected as we had predicted that this surface mutation would not affect the protein’s structural integrity (this study and (Reijnders et al., 2018). It also indicates that more detailed structural information of hsPURA I-II is needed to better understand its susceptibility to disease-causing mutations of surface-exposed residues.
hsPURA bearing mutation R140P with uncertain clinical significance shows wild-type-like domain folding
The mutation R140P had been previously described to be of uncertain clinical significance and it remained to be shown if this mutation affects the protein’s domain folding. CD spectra had already suggested that this mutation has no dramatic effect on the overall domain folding (Fig 5A). To assess if this mutation has a rather local effect on the structural integrity of PURA that may not be detectable by CD spectroscopy, we solved the crystal structure of hsPURA I-II R140P at 2.15 Å resolution (Appendix Fig S9A, S9B and Table S2). The crystals of R140P mutant protein exhibited the same space group, unit cell parameters, and crystal packing as the wild-type protein (Appendix Table S2). The R140P mutation is located at the end of the flexible linker between PUR repeat I and II connecting helix α1 to the strand β5 (Fig 3A, Appendix Fig S9A and S9B). Because of its position in a flexible region, the mutated residue is partially visible only in one out of four PURA chains in the asymmetric unit. The overlap of the R140P mutant structure with the PURA I-II WT fragment confirmed our previous observation about the unusually high flexibility of the strands β4 and β8 forming the β-ridge (Appendix Fig S9A and S9B). Since no major structural differences were visible between hsPURA I-II WT and R140P mutant (average r.m.s.d. of 0.89 Å for 133 Cα atoms aligned, Appendix Fig S9C), we concluded that this mutation is unlikely to impair the structural integrity of hsPURA.
hsPURA I-II with the patient mutation K97E reveals promiscuous domain folding
To understand how exactly the surface-exposed K97E mutation alters the domain folding of PURA (Fig 5A), we solved the crystal structure of hsPURA I-II K97E (Fig 5B-E, Appendix Table S2). The asymmetric unit of the crystal contained two molecules of hsPURA I-II K97E and showed good overall fold similarity to the hsPURA I-II structure (Fig 5B and 5C). However, an overlay of both molecules of the K97E-mutant protein from the asymmetric unit also revealed unusual differences between them with r.m.s.d of 1.48 Å for 125 superimposed Cα atoms. While one of these molecules (chain A, Fig 5B) showed a fold very similar to hsPURA I-II (average r.m.s.d. 1.09 Å, Appendix Fig S10A), the second molecule (chain B) displayed much greater structural changes (average r.m.s.d. 1.61 Å, Fig 5C and Appendix Fig S10A). These differences were especially pronounced in PUR repeat II (Fig 5D-E), where we observed a shift of register in the β-sheet by one amino acid in strand β5 (Fig 5D) and by two amino acids in strand β6 (Fig 5E). More C-terminally, the loop linking β6 and β7 is shorter by two residues, thereby correcting again the sequence register within the next β-strands (β7 and β8).
Already the different molecules of wild-type hsPURA I-II in the crystal unit cell revealed considerable conformational freedom (Fig 4B, Appendix Fig S10B and S10C), not only for narrow movements of the loops but also in the folding of the β-strands. For instance, in some of the molecules in the asymmetric unit of the wild-type hsPURA I-II crystal, β4 and β8 strands formed bulges (Fig 4B, Appendix Fig S10B and S10C). From all the areas of the structure, the β-ridge appears to be structurally the most divergent part. It is surprising that the K97E point mutation, which is located on the β4 strand of the β-ridge, induces the observed unusual change in the β-sheet arrangement (Fig 5C-E) without destroying the domain’s overall scaffold. Together these findings indicate great structural flexibility of the N-terminal PUR domain and promiscuity in folding, which results in misfolding upon mutation of a surface-exposed residue.
While CD measurements suggest a difference in folding between wild-type and K97E mutated PURA (Fig 5A) and the crystal structure of hsPURA I-II K97E indicates folding promiscuity (Fig 5C-E), these data do not fully exclude unfolding of the domain in solution. To distinguish between a mutation-induced unfolding and an altered folded state, we performed NMR measurements with the 15N-isotope-labeled samples of wild-type and K97E mutant hsPURA I-II (E57-E212 fragment). First, 1H,15N-correlation spectra (HSQC) of both wild-type and K97E constructs showed a clear presence of the structured core, as evidenced by the well-dispersed resonances, with a significant number of residues adopting strand conformation (1H 8.5 - 9.5 ppm) (Fig 5F). Intriguingly, arginine side chain resonances Nε-Hε were only visible in the K97E variant, while they were broadened out in the wild-type spectrum. Next, we carefully compared the two HSQC spectra for individual resonances/residues. When the two spectra were overlaid, we observed that a significant number of resonances, especially in the region corresponding to the β-strand, were perturbed, which is clearly more extensive than just affecting the residues adjacent to the mutational site of residue 97 (Fig 5F). Finally, we performed the 15N-heteronuclear NOE experiment on the K97E mutant to measure the residue-level backbone dynamics (Appendix Fig S11A). The overlaid spectra of the absence (control) and presence (NOE) of 1H-saturation shows most resonances retaining their signal intensities (rigid, hetNOE ∼ 0.8), while the resonances corresponding to the dynamic regions, likely N-/C-terminal & loop regions, with significantly reduced signal intensities or even negative (flexible). Importantly, all the resonances perturbed upon K97E mutation remained in the presence of 1H-saturation, confirming that the structural rearrangement does not induce protein unfolding. Together, this shows that the K97E mutation does not induce global unfolding of hsPURA I-II, but rather triggers structural rearrangements mostly in the β-strand region.
To obtain a better understanding of the intrinsic dynamics of hsPURA I-II we performed in-silico molecular dynamics simulations. The hsPURA I-II WT and K97E showed a similar overall profile (Appendix Fig S11B), where most of the flexibility or fluctuations concentrated in the loops connecting strands β3 with β4 and β7 with β8. However, there are some statistically significant differences between the two proteins. The most significant differences concentrate on the two ends of the α1 helix, and the β5, β6, and β7 segments. In these regions, the K97E mutant shows higher flexibility than the WT, with the N-terminal part of the α1 helix and the C-terminal end of β5 and β6 having the greatest gap between the two variants. There are also large differences in the residues surrounding the mutation spot (residues 90 to 100), where the K97E mutant also shows much greater fluctuation. However, these differences are not significant due to the large standard deviations. The movie (S1 for hsPURA I-II WT and S2 for hsPURA I-II K97E) of the simulation confirms a large degree of fluctuation in this area, which can adopt many different conformations in both WT and mutant. Finally, there is one region where the WT displays a higher flexibility than the mutant, the loop connecting β7 with β8 (residues 175 to 185), although only a few of those differences are significant. Of note, while these simulations are consistent with the increased flexibility suggested by the crystal structure, the time window of these simulations was too short to observe alterations between the two folding states as seen in this experimental structure.
Discussion
Currently, more than 270 different pathogenic mutations in the PURA gene have been identified in over 600 patients with confirmed PURA syndrome (personal communication: PURA Foundation Australia). The aim of this study was to understand how mutations causing this neurodevelopmental disorder affect the functional and structural integrity of the hsPURA protein. Towards this goal, we performed cellular, biophysical, and structural analysis with wild-type hsPURA as well as with versions of hsPURA bearing representative patient-derived mutations. To understand how PUR-syndrome-causing mutations affect the subcellular localization of hsPURA, we performed immunofluorescence microscopic analyses in HeLa cells by overexpressing full-length hsPURA as well as variants bearing a disease-causing mutation in each of the three PUR repeats, i.e. K97E, I206F, and F233del. Consistent with previous reports (Daigle et al., 2016; Markmiller et al., 2018), we observed that hsPURA co-localizes to stress granules in HeLa cells (Fig 1). The observation that only one of the patient-related mutations of hsPURA, K97E, showed reduced stress granule association indicates that this feature may not constitute a major hallmark of the PURA syndrome. It should be noted however that this interpretation must be considered with some caution as the experiments were performed in a PURA wild-type background. What is more, the knockdown of PURA resulted in reduced sizes of those stress granules (Appendix Fig S2), indicating that PURA indeed does have an influence on these granules. Future experiments will be necessary to unambiguously clarify the importance of PURA’s localization to stress granules for the etiology of PURA syndrome.
Very recently, hsPURA was reported to localize to P-bodies (Molitor et al., 2023). While we recapitulated co-localization of hsPURA to P-bodies (Fig 2), hsPURA K97E and F233del mutants failed to fully localize to P-bodies. Since the F233del mutation impairs the dimerization of hsPURA (Fig 3G), this finding suggests the requirement of dimerization for efficient P-body association.
To address the question of what drives hsPURA accumulation in these phase-separated granules, we utilized a hsPURA version lacking RNA-binding activity (m11, Appendix Fig S1). This mutant failed to localize to stress granules or P-bodies, indicating that RNA binding is essential for both co-localization events. A potential RNA-dependent mechanism to recruit hsPURA into stress granules or P-bodies could be liquid phase separation of hsPURA when bound to RNAs. In vitro experiments testing this potential property of hsPURA failed to yield phase-separated droplets. In summary, these findings indicate that RNA binding of hsPURA is important for its co-localization into stress granules and P-bodies, while we have no evidence that phase separation of hsPURA plays a role.
In a recent study, it was shown that a hsPURA knock-down in HeLa and in fibroblast cells resulted in down-regulation of the essential P-body proteins LSM14a and DDX6 and, most likely as a direct consequence, in strongly impaired P-body formation (Molitor et al., 2023). In light of this report, our finding that patient-derived mutations impair the P-body association of hsPURA (Fig 2) provide further evidence for a potential role of P-body localization of hsPURA for the etiology of PURA syndrome.
It has been reported that protein domains with PC4-like fold interact with RNA and DNA and are responsible for the separation of the double-stranded nucleic acids (Darbinian et al., 2001; Weber et al., 2016) (reviewed in (Janowski & Niessing, 2020). In this study, we show that both, the N- and C-terminal PUR domains of human PURA are able to separate double-stranded nucleic acids (Fig 4E, 4F and 3E) and to bind single-stranded nucleic acids (Fig 4C, 4D, 3C and 3D). We also tested patient-derived mutant versions of hsPURA to find out whether they exhibit impaired interaction with nucleic acids. For the K97E mutated version, we observed in EMSAs a reduced affinity to RNA (Fig 4C and 4D). Similar behavior of the K97E mutant was observed in the strand-separation assay where it showed an almost complete loss of activity (Fig 4E and 4F). The latter is of interest since previous experiments had demonstrated that strand separation contributes to the neuroprotective activity of PURA in a Drosophila FXTAS disease model (Weber et al., 2016). It is therefore also probable that loss of strand-separation activity contributes to cellular abnormalities in hsPURA patients harboring the K97E mutation.
To establish a structural framework for these observations, we solved the crystal structures of hsPURA III (Fig 3B) and of hsPURA I-II (Fig 4A). These human structures were overall very similar to previously published crystal structures from dmPURA (Appendix Fig S4A and S8) (Graebsch et al., 2009; Weber et al., 2016). Based on the crystal structures of hsPURA III (Fig 3B) we were able to predict folding defects in the C-terminal PUR domain (hsPUR III) that are caused by the patient-derived mutations F233del and R245P (Appendix Fig S7). Deletion of phenylalanine 233 within the β2-strand changes the orientation of all subsequent side chains within the β-sheet and likely destroys the interaction network required for the assembly of this secondary structure. A mutation of arginine 245 into proline likely disrupts β-strand folding by introducing a kink. Since repeat III mediates homodimer formation through swapping of its α-helices, the dimerization of hsPURA is potentially affected by folding defects induced by either the F233del or the R245P mutation. Indeed, we could experimentally confirm by nanoBRET measurements in HEK293 cells that dimerization of hsPURA I-III F233del and hsPURA I-III R245 is impaired when compared to wild-type hsPURA (Fig 3G).
Determination of the crystal structure of hsPURA I-II revealed larger flexible regions than the previously reported invertebrate structures (Graebsch et al., 2010; Graebsch et al., 2009; Weber et al., 2016), with considerable differences in the four molecules of hsPURA I-II in the asymmetric unit of the crystals (Fig 4B, Appendix Fig S10B and S10C). These observations provide clear evidence that in particular strands β4 and β8 of the N-terminal PUR domain are dynamic and can adopt considerable local differences within the β-sheet. This flexibility already suggests structural promiscuity of PUR domains. It also indicates that they can at least partially compensate for structural perturbance by mutations, resulting in folded domains of mutant proteins. However, since these mutations were reported to cause PURA syndrome, such modest structural alterations must nevertheless result in functional impairment.
The interpretation of structural flexibility and folding promiscuity of PUR domains found experimental support from the crystal structure of its K97E variant, whose mutated residue has a solvent-exposed side chain (Fig 5B-E). The corresponding structure shows in the β5 strand a shift by one (Fig 5D) and in the β6 strand by two amino acids (Fig 5E) in one of the molecules of the asymmetric unit. Surprisingly, these structural distortions did not result in major unfolding events of this PUR domain. Furthermore, NMR measurements with the 15N- and 1H-labeled N-terminal PUR domain yielded many spectral differences between the wild-type and K97E variants in particular within their β-sheets (Fig 5F and Appendix Fig S11A). Despite these considerable structural changes, the NMR data also confirmed that the K97E-mutant protein remains folded in solution and indicates that PUR domains may be able to partially compensate for mutations by folding promiscuity. These findings were further supported by molecular dynamics simulations, indicating high intrinsic motions of the N-terminal PUR domain (Appendix Fig S11B).
Instead of hot-spot regions for disease-causing mutations, as reported for other genetic disorders such as neurodevelopmental Rett syndrome (Lombardi et al., 2015), mutations of PURA-syndrome patients are found along the entire protein sequence (Johannesen et al., 2021; Reijnders et al., 2018). Also, in contrast to disorders such as the Rett syndrome, the vast majority of these mutations cause the full spectrum of disease symptoms. Our findings suggest that the flexibility and promiscuity of hsPURA domain folding render this protein particularly susceptible to mutations, even when amino-acid substitutions impose only moderate changes to the local charge and stereochemistry. This property of PUR domains might offer a structural explanation for why so many mutations across the sequence of hsPURA result in the full disease spectrum.
Not in all cases is the pathogenic effect of a given mutation as clear as in the case of K97E. For instance, the replacement of arginine 140 by proline in hsPURA was initially reported to cause PURA syndrome (Lee et al., 2018). Even though the corresponding patient did show symptoms, the severity and type of symptoms raised doubts about this initial diagnosis. In our assessment, neither the functional in vitro experiments nor the structural assessment of hsPURA I-II R140P yielded any significant differences from the wild-type protein (Fig 4C-F and Appendix Fig S9). These experimental findings are consistent with R140P being rather a benign mutation. Furthermore, after the initial description of R140P to be causative for PURA Syndrome (Lee et al., 2018), subsequent genomic assessment of a relative of this patient uncovered that this person had the same mutation without any clear symptoms. As a consequence, the R140P mutation is now listed in the NCBI ClinVar database to be of “uncertain significance” (https://www.ncbi.nlm.nih.gov/clinvar/variation/192343/). Our data clearly suggest that this mutation is rather benign. In summary, an assessment of the different mutations in hsPURA indicates a clear correlation between defects observed in in vitro analyses and the reported pathogenicity in patients. Hence, in future such in vitro analyses may serve as a diagnostic tool to distinguish pathogenic from benign mutations.
Recently, two publications resorted to in silico predicted, AI-based structures to model the potential impact of PURA-syndrome mutations on the structural integrity of hsPURA. Instead of assessing the physiologically correct dimer consisting of two PUR repeats, both publications analyzed in silico-predicted hsPURA half-structure consisting of a single PUR repeat III with incomplete dimerization domain (Dai et al, 2023; Lopez-Rivera et al, 2022). Such half PURA structures are very unlikely to exist in solution and hence predictions based on them are of very limited value or worse, prone to yield wrong interpretations. We expect that our experimentally determined structures of the human N- and C-terminal PUR domains offer more physiological structural reference points for future assessments of novel hsPURA mutations and their potential pathogenicity in patients.
In summary, this study provides the first functional and experimental structural assessment of the effects of PURA-syndrome causing mutations on folding, nucleic-acid binding, and unwinding, as well as on subcellular localization to stress granules and P-bodies. Some of the selected patient variants of hsPURA were previously predicted to have folding defects (Reijnders et al., 2018). Of note, for all these cases expression tests with recombinant proteins resulted in insoluble samples, indicating that these mutant proteins indeed have impaired folding (Appendix Table S1). Whereas predictions of such misfolding events appear reliable, the examples of R140P (benign) and K97E (pathogenic) indicate that structural predictions of surface-exposed residues seem more complicated. In fact, the observed great promiscuity of folding of PUR domains indicates that also moderate surface mutations such as K97E can result in promiscuous folding and the full spectrum of disease symptoms. Our study may serve as proof-of-principle that in vitro and structural studies are suitable tools to classify mutations in the hsPURA gene as potentially pathogenic or benign.
Materials and methods
In-depth descriptions of the sample preparation techniques and comprehensive details of the experimental procedures can be found in the Appendix section of this document.
Acknowledgements
We acknowledge the use of the X-ray Crystallography Platform at Helmholtz Munich, the Core Facility Bioimaging of the BMC of the Ludwig-Maximilians University Munich, as well as the Bavarian NMR Centre (BNMRZ). We would like to thank Vera Roman for her excellent technical support as well as Melinda Anderson from the PURA Foundation Australia and the PURA Syndrome Foundation for their precious support.
Funding
Deutsche Forschungsgemeinschaft FOR2333 sub-project Ni-1110/6-2 (DN) Care-for-Rare Science Award (DN)
Disclosure and competing interests statement
Authors declare that they have no competing interests.
Data and materials availability
All data are available in the main text or the supplementary materials. Structural models and diffraction data are available at the Protein Data Base (PDB; https://www.rcsb.org/) under the accession codes 8CHT (hsPURA I-II), 8CHU (hsPURA I-II K97E), 8CHV (hsPURA I-II R140P) and 8CHW (hsPURA III).
References
- The HeLa Pur factor binds single-stranded DNA at a specific element conserved in gene flanking regions and origins of DNA replicationMol Cell Biol 12:1257–1265
- The single-stranded DNA binding protein, Pur-alpha, binds HIV-1 TAR RNA and activates HIV-1 transcriptionGene 210:37–44
- A 25 Mainland Chinese cohort of patients with PURA-related neurodevelopmental disorders: clinical delineation and genotype-phenotype correlationsEur J Hum Genet 31:112–121
- Pur-alpha regulates cytoplasmic stress granule dynamics and ameliorates FUS toxicityActa Neuropathol 131:605–620
- Helix-destabilizing properties of the human single-stranded DNA- and RNA-binding protein PuralphaJ Cell Biochem 80:589–595
- Single-stranded nucleic acid-binding protein, Pur alpha, interacts with RNA homologous to 18S ribosomal RNA and inhibits translation in vitroJ Cell Biochem 83:355–363
- Of bits and bugs--on the use of bioinformatics and a bacterial crystal structure to solve a eukaryotic repeat-protein structurePLoS One 5
- X-ray structure of Pur-alpha reveals a Whirly-like fold and an unusual nucleic-acid binding surfaceProc Natl Acad Sci U S A 106:18521–18526
- A developmentally regulated DNA-binding protein from mouse brain stimulates myelin basic protein gene expressionMol Cell Biol 13:3103–3112
- A 39-kD DNA-binding protein from mouse brain stimulates transcription of myelin basic protein gene in oligodendrocytic cellsJ Cell Biol 130:1171–1179
- Phase Separation of FUS Is Suppressed by Its Nuclear Import Receptor and Arginine MethylationCell 173:706–719
- Lack of Pur-alpha alters postnatal brain development and causes megalencephalyHum Mol Genet 21:473–484
- Whole exome sequencing in family trios reveals de novo mutations in PURA as a cause of severe neurodevelopmental delay and learning disabilityJ Med Genet 51:806–813
- The large family of PC4-like domains - similar folds and functions throughout all kingdoms of lifeRNA Biol 17:1228–1238
- PURA-Related Developmental and Epileptic Encephalopathy: Phenotypic and Genotypic SpectrumNeurol Genet 7
- Role of Pur alpha in targeting mRNA to sites of translation in hippocampal neuronal dendritesJ Neurosci Res 83:929–943
- Puralpha is essential for postnatal brain development and developmentally coupled cellular proliferation as revealed by genetic inactivation in the mouseMol Cell Biol 23:6857–6875
- Neural BC1 RNA associates with pur alpha, a single-stranded DNA and RNA binding protein, which is involved in the transcription of the BC1 RNA geneBiochem Biophys Res Commun 277:341–347
- Mutations in PURA cause profound neonatal hypotonia, seizures, and encephalopathy in 5q31.3 microdeletion syndromeAmerican journal of human genetics 95:579–583
- Expanding the neurodevelopmental phenotype of PURA syndromeAm J Med Genet A 176:56–67
- MECP2 disorders: from the clinic to mice and backJ Clin Invest 125:2914–2923
- Structural Protein Effects Underpinning Cognitive Developmental Delay of the PURA p.Phe233del Mutation Modelled by Artificial Intelligence and the Hybrid Quantum Mechanics-Molecular Mechanics FrameworkBrain Sci 12
- Context-Dependent and Disease-Specific Diversity in Protein Interactions within Stress GranulesCell 172:590–604
- Components of RNA granules affect their localization and dynamics in neuronal dendritesMol Biol Cell 28:1412–1417
- The Molecular Function of PURA and Its Implications in Neurological DiseasesFront Genet 12
- Depletion of the RNA-binding protein PURA triggers changes in posttranscriptional gene regulation and loss of P-bodiesNucleic Acids Res 51:1297–1316
- The C9orf72 GGGGCC repeat is translated into aggregating dipeptide-repeat proteins in FTLD/ALSScience 339:1335–1338
- PURA syndrome: clinical delineation and genotype-phenotype study in 32 individuals with review of published literatureJ Med Genet 55:104–113
- Nuclear accumulation of mRNAs underlies G4C2-repeat-induced translational repression in a cellular model of C9orf72 ALSJ Cell Sci 128:1787–1799
- Puralpha Repaired Expanded Hexanucleotide GGGGCC Repeat Noncoding RNA-Caused Neuronal Toxicity in Neuro-2a CellsNeurotox Res 33:693–701
- A zebrafish model for C9orf72 ALS reveals RNA toxicity as a pathogenic mechanismActa Neuropathol 135:427–443
- RNA toxicity in non-coding repeat expansion disordersEMBO J 39
- Regulation of myelin basic protein gene transcription by Sp1 and Puralpha: evidence for association of Sp1 and Puralpha in brainJ Cell Physiol 181:160–168
- Structural basis of nucleic-acid recognition and double-strand unwinding by the essential neuronal protein Pur-alphaElife
- Expanded GGGGCC repeat RNA associated with amyotrophic lateral sclerosis and frontotemporal dementia causes neurodegenerationProc Natl Acad Sci U S A 110:7778–7783
Article and author information
Author information
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
- Version of Record published:
Copyright
© 2024, Proske et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.