The neuronal DNA-/RNA-binding protein Pur-alpha is a transcription regulator and core factor for mRNA localization. Pur-alpha-deficient mice die after birth with pleiotropic neuronal defects. Here, we report the crystal structure of the DNA-/RNA-binding domain of Pur-alpha in complex with ssDNA. It reveals base-specific recognition and offers a molecular explanation for the effect of point mutations in the 5q31.3 microdeletion syndrome. Consistent with the crystal structure, biochemical and NMR data indicate that Pur-alpha binds DNA and RNA in the same way, suggesting binding modes for tri- and hexanucleotide-repeat RNAs in two neurodegenerative RNAopathies. Additionally, structure-based in vitro experiments resolved the molecular mechanism of Pur-alpha's unwindase activity. Complementing in vivo analyses in Drosophila demonstrated the importance of a highly conserved phenylalanine for Pur-alpha's unwinding and neuroprotective function. By uncovering the molecular mechanisms of nucleic-acid binding, this study contributes to understanding the cellular role of Pur-alpha and its implications in neurodegenerative diseases.https://doi.org/10.7554/eLife.11297.001
Some proteins perform several different tasks inside cells. This is the case for a protein called Pur-alpha, which is essential for neurons to work correctly. For example, Pur-alpha can bind to DNA to regulate gene activity. It also binds to RNA molecules, which are copies of a gene, and helps to distribute them within the neuron. In humans, there are several neurodegenerative diseases in which Pur-alpha is involved. One example is the Fragile X-associated Tremor/Ataxia Syndrome (FXTAS), which causes memory and movement problems.
Experiments with isolated proteins and double-stranded DNA show that Pur-alpha is able to separate the two DNA strands. But it was not clear how this DNA unwinding occurs, and the biological significance of this activity was unknown. Other questions also remained unanswered: how does Pur-alpha recognize DNA and RNA? Does the loss of Pur-alpha’s binding to DNA and RNA contribute to neurodegenerative diseases?
To address these questions, Weber et al. obtained Pur-alpha from the fruit fly and crystallized the protein bound to DNA. A technique called X-ray crystallography was then used to determine the three-dimensional structure of the Pur-alpha/DNA complex in fine enough detail to work out the position of individual atoms.
Based on this structure, Weber et al. could introduce mutations that alter the DNA- and RNA-binding region of the protein to investigate the binding mechanism. The crystal structure and experiments with normal and mutant Pur-alpha protein revealed how it unwinds double-stranded DNA: binding of Pur-alpha to DNA causes a strong twist of the DNA molecule, which contributes to separating the strands. Further experiments in fruit flies revealed that both the DNA-unwinding activity and the ability of Pur-alpha to bind DNA/RNA are needed for the protein to work correctly in neurons.
Because Pur-alpha is involved in a range of different processes inside cells, a future goal is to identify the DNA and RNA sequences it specifically binds to. This information, together with the insights gained from Weber et al.’s study, should advance our understanding of why Pur-alpha is essential for maintaining neurons.https://doi.org/10.7554/eLife.11297.002
Purine-rich element-binding protein A (Pur-alpha) plays a crucial role in postnatal brain development. Pur-alpha-deficient mice appear normal at birth but develop severe neurological abnormalities after 2 weeks and die shortly after birth (Hokkanen et al., 2012; Khalili et al., 2003). These mice show fewer cells in the brain cortex, hippocampus, and cerebellum as a consequence of decreased proliferation of the precursor cells. Further studies indicated that Pur-alpha co-localizes with Staufen and FMRP and that Pur-alpha (-/-) mice display dendritic mislocalization of both proteins (Johnson et al., 2006). In support of its important neuronal function, point mutations in the human Pur-alpha gene have been found to cause the so-called 5q31.3 microdeletion syndrome, which is characterized by neonatal hypotonia, encephalopathy, and severe developmental delay (Lalani et al., 2014; Hunt et al., 2014; Tanaka et al., 2015).
Pur-alpha is an ubiquitously expressed, multifunctional protein that binds to both DNA and RNA and is known to regulate replication, transcription, and translation (Johnson et al., 2013). It has been shown that Pur-alpha binds to single- and double-stranded nucleic acids that contain GGN motifs. Such regions are found at origins of DNA replication and enhancers of TATA-box lacking genes, such as c-myc or the myelin-basic protein, which Pur-alpha regulates. Pur-alpha has also been routinely purified from cytoplasmic kinesin-containing ribonucleoprotein particles (RNPs) (Kanai et al., 2004; Ohashi et al., 2000), further supporting its role in mRNA localization and showing that Pur-alpha is a core factor in localizing mRNPs.
Besides its ability to bind RNA and DNA, Pur-alpha possesses dsDNA-destabilizing activity in an ATP-independent fashion (Darbinian et al., 2001). This function has been suggested as important for DNA replication and transcription regulation. It was postulated that Pur-alpha, being a transcription activator, contacts the purine-rich strand of promoter regions and displaces the pyrimidine-rich strand, which would allow the binding of other proteins and activation of transcription (Darbinian et al., 2001; Wortman et al., 2005). The role of Pur-alpha-dependent unwinding activity in RNA localization and in RNA-based neuropathological disorders is currently unknown.
One particularly interesting interaction partner of Pur-alpha is the RNA helicase Rm62, the Drosophila ortholog of p68. It is implicated in transcriptional regulation, pre-mRNA splicing, RNA interference, and nucleo-cytoplasmic shuttling (Qurashi et al., 2011). Thus, their joint function could be the initial unwinding of short dsRNA regions by Pur-alpha followed by helicase-dependent melting of larger regions for the regulation of RNA processing, translational control, and transport.
Nucleic acid-binding of Pur-alpha is mediated by three central PUR repeats (Graebsch et al., 2010; Graebsch et al., 2009), which are N-terminally flanked by unstructured, glycine-rich sequences and C-terminally by glutamine- and glutamate-rich regions (Figure 1A; Johnson et al., 2013). In the recently published crystal structure of Pur-alpha each of both PUR repeats I and II consist of a four-stranded antiparallel beta-sheet, followed by a single alpha-helix (Graebsch et al., 2009). Repeat I and II fold into an intramolecular dimer that serves as a DNA-/RNA-binding domain. The third repeat leads to intermolecular dimerization (Figure 1A; Graebsch et al., 2009). Despite these insights, it remains unclear how Pur-alpha interacts with its nucleic-acid targets to mediate its cellular functions. Furthermore, the mechanistic basis and physiological importance of its unwinding activity remains unresolved.
Pur-alpha has been implicated in two so-called RNA repeat-expansion diseases, which have been the focus of a number of recent studies. The first one contains expansions in the well-studied fmr1 gene. Individuals with 55 to 200 CGG repeats, termed pre-mutation, develop the neurodegenerative Fragile X-associated Tremor/Ataxia Syndrome (FXTAS) (Hagerman et al., 2001), whereas healthy individuals have less than 54 trinucleotide CGG repeats in their 5’-UTR region (Oostra and Willemsen, 2009). It is generally accepted that expression of FMR1 mRNA with abnormal trinucleotide-repeat expansions are the main cause of FXTAS. The second Pur-alpha related disease is caused by repeat expansions of G4C2-hexanucleotides in the first intron of the c9orf72 transcript. These repeat expansions are considered as the most common genetic abnormality in amyotrophic lateral sclerosis (ALS) and familial frontotemporal lobal degeneration (FTLD) (Stepto et al., 2014). The diseases associated with both types of repeat expansions are accompanied by the formation of repeat RNA-containing protein inclusions (Sareen et al., 2013; Stepto et al., 2014,; Xu et al., 2013), suggesting sequestration of proteins as potential mechanism of pathology. Pur-alpha is incorporated into the inclusions of both types of disease and associates directly with the repeat RNAs (Jin et al., 2007,; Xu et al., 2013; Rossi et al., 2015). In fly and mouse models, the overexpression of Pur-alpha can overcome repeat-dependent neurodegeneration of both diseases (Jin et al., 2007,; Xu et al., 2013), suggesting a direct contribution of Pur-alpha to neuropathology.
Expression of 95 CGG repeats in human neuroblastoma-derived SK-N-MC cells not only induced the formation of nuclear inclusions but also impairs the architecture of the nuclear laminar and activates DNA repair-associated histone variants (Hoem et al., 2011). The expression of G4C2-repeat expansions cause nuclear trafficking defects, which contribute to neurotoxicity in ALS/FTLD (Freibaum et al., 2015; Jovicic et al., 2015; Zhang et al., 2015). Recent studies also showed that repeat-associated non-AUG (RAN) translation occurs from CGG- as well as from G4C2-repeat RNAs and that the resulting proteins can form cytoplasmic aggregates, potentially contributing to pathology (Mori et al., 2013; Todd et al., 2013). It is likely that a combination of RNA toxicity and RAN-derived protein aggregates contribute to the full manifestation of FXTAS.
Here, we used NMR chemical shift titrations together with in vitro-binding assays to demonstrate that the nucleic acid-binding domain of Pur-alpha binds RNA and DNA in the same manner. We present the co-crystal structure of Pur-alpha with a CGG trinucleotide-repeat DNA, providing a detailed structural explanation for nucleotide recognition. Pur-alpha interacts with this single-stranded DNA fragment in a sequence-specific manner with guanines and additional contacts to the phosphordiester backbone. The observed binding mode of Pur-alpha also explains its interaction with G4C2-hexanucleotide repeats. Mutational analyses as well as determination of the complex stoichiometry confirm that the DNA-/RNA-binding domain of Pur-alpha has two nucleic acid-binding sites. The structure also revealed that a highly conserved phenylalanine causes disruption of the normal base stacking and leads to a strong torsion of the DNA strand, which plays a central role in Pur-alpha’s dsDNA-unwinding activity. In vivo analyses of mutant proteins reveal that nucleic-acid binding and unwinding studied in vitro are both essential for Pur-alpha’s function in vivo. This information together with the crystal structure of its C-terminal dimerization domain allows us to propose a mechanism of how full-length Pur-alpha binds and unwinds dsDNA regions.
In order to assess if Pur-alpha has a binding preference for ssDNA or ssRNA, we performed electrophoretic mobility shift assays (EMSA) with the nucleic acid-binding domain of Drosophila Pur-alpha, consisting of repeats I-II (PUR repeat I-II; Figure 1A; Figure 1—figure supplement 1A, B) and radiolabeled DNA or RNA oligonucleotides (24 nt) of identical sequence. The MF0677 sequence was chosen as a physiological Pur-alpha target found upstream of the human c-myc gene (Haas et al., 1993; Haas et al., 1995). In addition, we used a CGG-repeat sequence because Pur-alpha binds to these repeats in the 5’UTR of the FMR1 mRNA upon incorporation into FXTAS inclusions (Jin et al., 2007; Sofola et al., 2007). In these EMSA, the affinity for the physiological Pur-alpha target MF0677 is much higher (KD ~200 nM) than for the disease-related CGG-repeat sequence (KD ~2 µM) (Figure 1B,C; KD estimated from EMSA). However, the binding affinities for ssDNA and ssRNA of the same sequence showed no major differences.
Since full-length Pur-alpha contains a third PUR repeat, which mediates its dimerization, and additional N- and C-terminal sequences (Figure 1A), we also compared DNA and RNA binding of full-length Pur-alpha (Figure 1—figure supplement 1E). For quantification of the nucleic acid-binding affinity, we performed fluorescence-polarization experiments. Full-length Pur-alpha showed a two-fold preference in binding to MF0677 ssRNA (KD = 0.7 µM) over MF0677 ssDNA (KD = 1.4 µM; Figure 1D). Thus, sequences outside PUR repeats I-II seem to moderately affect nucleic-acid binding.
For a more comprehensive, residue-resolved comparison of ssDNA and ssRNA binding, we performed NMR chemical shift titration experiments with 15N-labeled Drosophila Pur-alpha repeat I-II (Figure 1—figure supplement 1C) and short unlabeled GCGGA (5 nt) DNA and RNA fragments. The 1H,15N HSQC NMR spectrum of Pur-alpha alone shows well separated cross peaks (Figure 1E; Figure 1—figure supplement 2A, B), indicating that the protein is correctly folded. Addition of either ssDNA or ssRNA resulted in almost identical, well-localized chemical shift perturbations of backbone and sidechain amide protons (Figure 1E; Figure 1—figure supplement 2A, B). Most NMR signals of residues involved in binding disappeared upon addition of DNA/RNA, thus pointing toward an intermediate exchange regime, which is characteristic for binding affinities in the high nanomolar to micromolar range. In summary, the NMR titration experiments indicate identical binding modes of PUR repeat I-II for ssDNA and for ssRNA involving the same residues in both cases.
In order to obtain high-resolution structural information of Pur-alpha binding to nucleic acids, we performed co-crystallization experiments of Pur-alpha repeat I-II with either CGG-repeat DNA or RNA. Crystals of Pur-alpha repeat I-II with a GCGGCGG trinucleotide-repeat ssDNA diffracted to a resolution of 2.0 Å. The structure was solved by molecular replacement and refined to Rwork and Rfree of 16.3% and 21.5%, respectively (Table 1).
The DNA-bound protein shows the typical intramolecular dimer with two PUR repeats tightly intertwined with each other, forming a globular PUR domain (Figure 2A; Video 1; Figure 2—figure supplement 1A; Graebsch et al., 2009). Each PUR repeat consists of a N-terminal four-stranded antiparallel beta sheet followed by an alpha helix. A superposition of the previously published Pur-alpha repeat I-II apo-structure (PDB ID 3K44) (Graebsch et al., 2009) with the structure of the protein-DNA co-complex showed only a root-mean-square deviation (RMSD) of atomic positions of 1.14 Å (Figure 2—figure supplement 1B). When a flexible loop region from residues L107 to K120 was excluded, the RMSD improved to 0.83 Å. Thus, no major conformational changes occur in the PUR domain upon nucleic-acid binding, which is consistent with the results obtained from NMR chemical shift titrations.
In the crystal structure, the DNA molecule 1 (DNA 1) is clamped between residues of PUR repeat I and II (Figure 2A, B; Video 1). PUR repeat II binds DNA 1 with the residue K138 of its β-sheet, and residues N140 and R142 of the short linker (Figure 2B, C), whereas PUR repeat I contacts the DNA 1 via residues Q52, S53, and K54 in its short linker (Figure 2B, D). Pur-alpha mainly binds to stacking guanine bases, but also to one of the cytosines (C5) and to the sugar phosphate backbone (Figure 2B).
Within the crystal lattice the first two bases (G1 and C2) of the 5’-end of DNA 1 are base pairing with the 5’-end of the symmetry related DNA molecule (DNA 1’; Figure 2—figure supplement 2). The cytosine C5 in the middle of the DNA 1 strand is twisted and does not stack with the neighboring guanines (Figure 2E). Instead, F145 from the β-sheet of PUR repeat II stacks with the neighboring guanine G4 and thereby blocks the space for the cytosine C5 (Figure 2E, Video 1).
In the crystal structure, an additional DNA-binding event was observed for PUR repeat I. The residues Y57, D59, K61, K70, and R80 of the β-sheet interact with the 3’-end of the second DNA molecule (DNA 2) (Figure 2B, F; Video 1). This interface is similar but not identical to the DNA 1-binding site on PUR repeat II. The three DNA-contacting amino acids K138, N140, and R142 of PUR repeat II are also found in corresponding positions of PUR repeat I (Figure 2—figure supplement 3). However, in PUR repeat I only K61 but not N63 or R65 contact the DNA 2 molecule. Thus, although there is a conservation of DNA-contacting residues on both PUR repeats, in the crystal structure their modes of binding are not identical. This observation hints toward a potentially asymmetric binding of nucleic acids on both protein surfaces of Pur-alpha I-II.
To test if Pur-alpha also interacts with two DNA oligonucleotides in solution, we performed filter-binding assays with Pur-alpha repeat I-II and MF0677 ssDNA (24 nt). Pur-alpha repeat I-II was titrated at near-stoichiometric concentrations to a constant amount (1 µM) of radiolabeled DNA and blotted onto a nitrocellulose membrane (Figure 2G). Plots of the signal intensities against the protein concentrations yielded a mean saturation at 0.58 ± 0.1 µM (n=3) of Pur-alpha (Figure 2G). This indicates a stoichiometric ratio of 1:2 (protein:DNA) and confirms that like in the crystal structure (Figure 2A) Pur-alpha repeat I-II binds two molecules of ssDNA in solution.
All amino acids involved in DNA binding within the crystal structure (Figure 2B) are conserved (Figure 2—figure supplement 4A). To assess the importance of these contacts in solution, we generated structure-guided mutations and tested their effect on DNA/RNA binding. The binding motif consisting of K138, N140, R142, and F145 on PUR repeat II (KNR II and F II, respectively) is also found on PUR repeat I (K61, N63, R65, and F68; KNR I and F I, respectively). Hence, these residues were replaced by alanines and tested for nucleic acid-binding in vitro. For the QSK I – KNR II mutant the residues Q52, S53, K54, were replaced by glycine and the residues K138, N140, R142 by alanines, since a pure alanine mutant tended to aggregate. Correct folding of all generated Pur-alpha mutants was verified by circular dichroism (CD) spectroscopy (Figure 1—figure supplement 1B).
First, radioactive EMSA were performed with CGG-repeat and MF0677 DNA/RNA oligomers (24 nt). Except for Pur-alpha mutant F I, all other mutants showed decreased binding to DNA and RNA oligonucleotides with both motifs (Figure 3A–E, G; Figure 3—figure supplement 1A–E). In order to quantify these interactions, we performed fluorescence-polarization experiments with fluorescein-labeled MF0677 DNA and different variants of Pur-alpha. The effects observed in EMSA of mutations in Pur-alpha I-II were confirmed by these experiments (Figure 3H; Figure 3—figure supplement 2). Of note, mutations in PUR repeat I (KNR I, F I) had less severe effects on DNA binding than mutations in repeat II (KNR II, F II).
A large portion of the ssDNA 1 strand in the co-complex is stabilized in its conformation by aromatic stacking of G1, C2, G3, G4 and G6, G7 (Figure 2C–F). F145 of Pur-alpha shows particularly unusual characteristics by undergoing aromatic stacking with G4 (Figure 2E). This protein-DNA interaction blocks additional DNA-base stacking events and forces the DNA to flip out its cytosine base (C5), leading to a strong twist of the DNA 1 strand.
It was previously reported that Pur-alpha unwinds short stretches of dsDNA in an ATP-independent manner (Darbinian et al., 2001). However, the molecular basis of this function has not been understood to date. Since the sequence-specific interactions of Pur-alpha with DNA and the aromatic stacking of DNA with F145 seem incompatible with binding to dsDNA, we wondered which interactions are of foremost importance for the unwinding of dsDNA. Using a previously described unwinding assay (Darbinian et al., 2001), we compared ATP-independent unwinding activity of wild-type and mutant Pur-alpha repeat I-II on a dsDNA substrate.
When the main binding sites on PUR repeat I and II were mutated (QSK I – KNR II) unwinding was abolished (Figure 3F, G), most likely due to impaired DNA binding (Figure 3D, G; Figure 3—figure supplement 1A). In contrast, mutation of F145 (F II) abolished the unwinding activity without a complete loss of DNA binding (Figure 3E–G; Figure 3—figure supplement 1E). All other mutations showed reduced DNA binding (Figure 3A–C, G; Figure 3—figure supplement 1B–D) and only decreased unwinding (Figure 3—figure supplement 1F). Together these observations suggest that the heterotypic stacking of DNA-bases with F145 in PUR repeat II stabilizes the single-stranded conformation of DNA and enforces a twist of the bases that is important for its unwinding activity.
To understand the role of the third repeat of Pur-alpha (Figure 1A; Figure 1—figure supplement 1D) for DNA/RNA binding, we determined its crystal structure. Initial datasets were obtained from native crystals at 2.7 Å resolution, from which electron-density maps were calculated by molecular replacement with the apo-structures of Pur-alpha from Borrelia and Drosophila as search templates (PDB-IDs: 3NM7 and 3K44, respectively). The final structure model was obtained in the same way from selenomethionine-derivatized crystals at 2.6 Å resolution (Table 1; Figure 4A; Figure 4—figure supplement 1). The structure consisting of two repeat III molecules shows the same overall fold as repeat I-II with an RMSD of 1.5 Å, and only few differences in the amino acid composition of its putative nucleic-acid-binding surface (Figure 4A; Figure 2—figure supplement 4B).
PUR repeat III was previously suggested to mainly mediate dimerization of Pur-alpha (Graebsch et al., 2009). However to date, no binding of PUR repeat III to nucleic acids has been measured. We therefore performed EMSA and observed that Pur-alpha repeat III bound with weaker affinities to CGG repeats and to MF0677 than Pur-alpha repeat I-II (Figures 3G and 4B). Also in fluorescence-polarization experiments, PUR repeat III bound MF0677 ssDNA over 30-times weaker than PUR repeat I-II (Figure 3H; Figure 3—figure supplement 2). The main DNA/RNA interactions of full-length Pur-alpha might therefore occur via the first two PUR repeats.
Although Pur-alpha repeat III does not have a phenylalanine in the corresponding position of F145 of PUR repeat II, it also contains a conserved aromatic residue (Y219), which could potentially undergo stacking with DNA bases and support dsDNA unwinding (Figure 2—figure supplement 4B). However, in unwinding assays almost no activity was observed for PUR repeat III (Figures 3G and 4C). These observations confirm that PUR repeats I-II mediate the main nucleic-acid-binding and unwinding activities and suggest that repeat III might predominantly mediate dimerization.
To assess the physiologic relevance of our in vitro findings, we relied on a previously reported Drosophila model. Overexpression of CGG-repeat RNA in the Drosophila eye induces neuronal degeneration and as a consequence the rough eye phenotype (compare Figure 5A with 5B; Jin et al., 2003). Overexpression of Pur-alpha can rescue the eye phenotype in a dose-dependent manner, suggesting that this protein is sequestered into the inclusions (Jin et al., 2007). We compared the rescue by wild-type Pur-alpha with DNA-/RNA-binding and unwinding mutants. Whereas the wild-type protein achieved a full rescue (Figure 5C), expression of the QSK I – KNR II mutant failed to ameliorate the rCGG repeat-induced neuronal toxicity (Figure 5D). On the other hand, a previously reported double mutant R80A/R158A (R I – R II) that impairs nucleic-acid binding (Figure 3H; Figure 3—figure supplement 2; Graebsch et al., 2009) was still able to suppress the rCGG repeat-mediated toxicity (Figure 5E). Thus, there might be differences in the binding for ssCGG repeats and the requirements for neuronal rescue. Most interestingly, however, is the observation that also the mutant F II, which still binds DNA/RNA but fails to unwind dsDNA, is unable to rescue neurodegeneration (Figure 5F). Together these observations confirm the physiologic importance of the nucleic-acid-protein contacts observed in the crystal structure. In addition, these findings formally establish that the binding and unwinding of nucleic acids is required to modulate toxicity caused by pathogenic CGG RNA.
Pur-alpha repeat I-II shows strong and specific binding to its physiological target MF0677 DNA located upstream of the c-myc gene (Bergemann et al., 1992), but much weaker binding to CGG-repeat RNA (Graebsch et al., 2009). For this reason, it has been suggested that the binding of Pur-alpha to DNA is stronger than to RNA and, as a consequence, that there might be differences in the binding modes to both nucleic-acid targets. In this study, we directly compared Pur-alpha binding to RNA and DNA oligonucleotides of the same sequence and found no major differences (Figure 1B–D). This suggests that the higher affinity for MF0677 (KD ~200 nM; Figure 1B) over CGG repeats (KD ~2 µM; Figure 1C) is due to differences in sequence and not the absence of the 2’ OH group in the DNA. This interpretation found further support from NMR titrations with 15N-labeled Pur-alpha repeat I-II and oligonucleotides. The spectra showed similar chemical shift perturbations, regardless of whether it was DNA or RNA, indicating that both nucleic acids are bound in the same way (Figure 1E). Finally, the crystal structure of the Pur-alpha/DNA co-complex showed that a hydroxyl-group on the 2’ position of the pentose ring of the RNA sugar backbone would not cause steric clashes (Figure 2A,C-F). Together, our biochemical, NMR, and X-ray crystallographic insights indicate that Pur-alpha binds DNA and RNA in the same way and thus will interact equally with both types of nucleic acids in the cell. It is also consistent with the previously suggested Pur-alpha-dependent gene regulation by competitive RNA binding (Tretiakova et al., 1998).
Previous findings implied that the positively charged β-sheets mediate DNA/RNAbinding, whereas the amphipathic helices might contribute to protein-protein interactions (Graebsch et al., 2009). The crystal structure of the protein-DNA co-complex confirms that the β-sheets, together with their short linkers, are involved in DNA binding, in contrast to the α-helices that show no interaction (Figure 2A). A comparison of the Pur-alpha repeat I-II apo-structure (PDB ID 3K44) with the co-structure presented here revealed no significant conformational changes (Figure 2—figure supplement 1B).
In the crystal structure, Pur-alpha interacts with nucleic acids by clamping them between its two repeats, mostly by interacting with the guanine bases (Figure 2A, B). Only R142 interacts with the cytosine base C2. K54 and K138 additionally stabilize the DNA binding by interacting with the sugar phosphate backbone of guanine G4 and cytosine C5, respectively (Figure 2B–D). Binding therefore occurs sequence specifically and confirms the GGN-binding motif postulated before (Bergemann and Johnson, 1992).
Mutation of the interacting residues resulted in a decreased binding affinity (Figure 3G, H) and therefore confirmed the interaction sites seen in the crystal structure. Also mutation of the corresponding KNR motif on PUR repeat I (KNR I) caused a decrease in affinity (Figure 3G, H). However, in fluorescence-polarization experiments, the mutation of KNR I had a less severe effect on DNA binding (KD = 1.3 µM) than mutation of KNR II (KD = 4.1 µM; Figure 3H). This is consistent with the observation that in the crystal structure all three residues of KNR II make contacts with DNA 1 (Figure 2B, C), whereas in KNR I only a single amino acid binds to DNA 2 (Figure 2—figure supplement 3). Also, the F I mutation in repeat I had a less severe effect on MF0677 ssDNA binding than the F II mutation (Figure 3G, H). In summary, these observations suggest that the MF0677 ssDNA is bound asymmetrically by PUR repeats I-II.
In FXTAS patients, Pur-alpha binds to CGG-repeat expansions that cause the formation of nuclear inclusions and neurodegeneration (Oostra and Willemsen, 2003). Pur-alpha is also incorporated into inclusion triggered by G4C2-repeat RNA of patients with ALS and FTLD. The nucleic-acid binding of Pur-alpha observed in the crystal structure can explain both binding events, as it makes sequence-specific interactions with a GGC motif found in both repeat RNAs.
The structural model of Pur-alpha repeat I-II forming a PUR domain has two nucleic-acid-binding surfaces. PUR repeats I and II share the identical binding motif (KNR), and adopt the same fold, despite moderate sequence identity of about ~30% (Figure 2—figure supplement 4; Graebsch et al., 2010; Graebsch et al., 2009). Consistent with this finding, we observed a stoichiometric ratio of 1:2 for the PUR domain with ssDNA in filter-binding assays (Figure 2G). Both binding events appear at overlapping but non-identical surface regions (Figure 2A, B), which might prefer different GGN-motifs (GGA, GGG, GGC, GGT) as has been previously suggested (Aumiller et al., 2012). This might also explain why CGG repeats bind less strongly to Pur-alpha than the MF0677 sequence, which mostly consists of GGA and GGT motifs.
Pur-alpha has been previously reported to unwind dsDNA in an ATP-independent manner (Darbinian et al., 2001; Wortman et al., 2005). However, so far, it has not been shown how unwinding is achieved on a molecular level and that this function is physiologically relevant. The crystal structure of our Pur-alpha/DNA co-complex offers a mechanistic explanation: phenylalanine in position 145 of PUR repeat II undertakes base stacking with the guanine G4 and thereby blocks the space for the neighboring cytosine C5 (Figure 2E). Thereupon, the cytosine flips out and the 3’-end of the DNA strand becomes distorted. The interaction of K54 and K138 with the phosphate backbone upstream of the cytosine C5 enforces this strong turn (Figure 2B–D). F145 is highly conserved throughout different species (Figure 2—figure supplement 4A) and its mutation (F II) abolishes unwinding of dsDNA (Figure 3F, G).
Phenylalanine 145 has its structural counterpart in PUR repeat I in position F68. Although F68 is also highly conserved, in the crystal structure the guanine base stacking is not mediated by this residue. Instead, the conserved Y57 in repeat I stacks with G7 (Figure 2B, F). As mentioned before, the two binding sites of Pur-alpha seen in the crystal structure are asymmetric and might account for sequence-specific binding to nucleic acids with different GGN motifs.
To assess the physiological importance of the interactions observed in the crystal structure and validated in vitro, we used the previously reported FXTAS fly model (Jin et al., 2007; Jin et al., 2003). Expression of pre-mutation CGG-repeat RNA in Drosophila induces neurodegeneration, which is easily detectable in abnormalities in the facet eye (compare Figure 5A with 5B). While we observed that overexpression of wild-type Pur-alpha rescues the eye phenotype (Figure 5C), the RNA-binding mutant QSK I - KNR II failed to do so (Figure 5D). Surprisingly, a second, previously published RNA-binding mutant (Pur-alpha R I – R II), which showed strongly reduced MF0677 ssDNA binding (Figure 3H), was able to fully rescue the eye phenotype (Figure 5E). This observation indicates that arginine 80 and 158 are not required for the binding to nucleic acids important for neuroprotection. While the neuroprotection by the R I – R II mutant indicates flexibility in nucleic-acid recognition, the loss of rescue by the QSK I - KNR II mutant formally establishes the requirement of nucleic-acid binding for Pur-alpha-dependent neuroprotection. Additionally, the F II mutation of Pur-alpha, which abolishes its dsDNA-unwinding activity, also impairs the neuroprotective function in the fly model (Figure 5F). These findings indicate that unwinding is important for neuroprotection by Pur-alpha.
Recently, de novo mutations in Pur-alpha have been found to cause the so-called 5q31.3 microdeletion syndrome. This disease is characterized by neonatal hypotonia, encephalopathy, and severe developmental delay (Lalani et al., 2014; Hunt et al., 2014; Tanaka et al., 2015). Of the reported mutations (Figure 6—source data 1), two missense mutations (A89P, K97E) are of particular interest from a structure-to-function point of view (Lalani et al., 2014). Sequence alignment of Pur-alpha from different species shows that the residues A89 and K97 of the human Pur-alpha protein correspond to the residues A72 and R80 of the Drosophila protein, respectively. These residues are highly conserved (Figure 2—figure supplement 4A). In the crystal structure of the protein/DNA co-complex, A72 does not directly interact with the DNA molecule. Instead it forms backbone hydrogen bonds between the β-strands of PUR repeat I to stabilize the nucleic-acid binding β-sheet (Figure 6A, top) (this study and Graebsch et al., 2009). When A72 and its disease-causing counterpart A98 in the human protein (Figure 6A, middle) are substituted by a proline, the backbone interactions that stabilize the β-sheet very likely become disrupted (Figure 6A, bottom) and thus the protein misfolds.
The Drosophila equivalent R80 of the disease-associated human K97 directly binds to the guanine base G7 (Figure 2B, F and 6B, top) and its mutation results in reduced nucleic-acid binding (Graebsch et al., 2009). It is therefore conceivable that a mutation of K97 to glutamate impairs nucleic-acid interaction because of repulsive forces and causes dysfunction of Pur-alpha (Figure 6B, middle, bottom). Although in our fly model the double mutant R80A/R158A (R I – R II) was still able to rescue neurodegeneration (Figure 5E), the reported effect of the K97E mutation in the microdeletion syndrome indicates that nucleic-acid binding by this residue is important at least in humans. Additional interesting disease-causing point mutations in human Pur-alpha are I188T and I206F (Figure 6—source data 1), which likely impair the intramolecular dimerization of PUR repeats I and II (Hunt et al., 2014;, Tanaka et al., 2015). Taken together, the crystal structure of the Pur-alpha/DNA co-complex presented in this study provides a molecular explanation for the effects of missense mutations in the 5q31.3 microdeletion syndrome.
Wild-type Pur-alpha binds to origins of replication and promoter regions (Bergemann and Johnson, 1992,; Bergemann et al., 1992) and regulates the transcription of more than 20 genes (White et al., 2009). Pur-alpha’s ability to unwind dsDNA might therefore play an important role in the initiation of replication and transcription. One recently reported interaction partner of Pur-alpha that might play a role in this context is the RNA helicase Rm62 (Qurashi et al., 2011). In the light of the dsDNA-unwinding activity an intriguing speculation is that Pur-alpha also unwinds dsCGG-repeat RNA. This initial unwinding by Pur-alpha could allow interacting helicases to subsequently regulate RNA processing, transport, and translation. Therefore, it will be important to assess Pur-alpha’s role in unwinding of dsRNA and its interaction with Rm62.
Considering that Pur-alpha repeat I-II has two nucleic-acid-binding sites, it is conceivable that each PUR repeat binds to one of the strands of a duplex DNA molecule thereby unwinding short stretches of dsDNA (Figure 7A–C, top). The insertion of Pur-alpha between both DNA strands might be achieved through spontaneous breathing of the dsDNA helix (Peyrard et al., 2009,; Jose et al., 2012). Intercalating residues (phenylalanine, tyrosine) might cause further separation of the two DNA strands via base stacking with the guanines and thereby causing the strong twist of the DNA strands. The partly melted duplex DNA could then be further unwound by DNA helicases, which are required for initiation of transcription and replication. In the crystal structure, base pairing is observed between the 5’-G1-C2 bases of two symmetry-related DNA molecules (Figure 2—figure supplement 2), indicating, that a PUR domain would unwind a short stretch of approximately four to six bases.
We also solved the crystal structure of PUR repeat III (Figure 4A; Figure 4—figure supplement 1) and found that it binds only weakly to DNA/RNA (Figure 3G, H) and unwinds dsDNA only slightly (Figure 4C). Since in the crystal structure the C-terminal end of PUR repeat I-II is located on the opposite side of its nucleic-acid-binding surface (Figure 7A, B, bottom), it is unlikely that PUR repeat III causes steric clashes interfering with the nucleic-acid binding by PUR repeat I-II. Hence, PUR repeat III might only facilitate dimerization, thereby guiding a second DNA-/RNA-binding domain (PUR repeat I-II) to another GGN motif further upstream or downstream on the dsDNA, where additional DNA-unwinding events could take place (Figure 7C, bottom). How this effect of dimeric Pur-alpha is achieved on a molecular level and if unwinding of longer dsDNA fragments requires its joint action with helicases are main questions to be addressed in future.
Escherichia coli BL21 (DE3) cells transformed with pGEX-6P-1::Pur-alpha fragments were grown at 37°C in LB medium supplemented with 100 µg/ml ampicillin. For 15N-labeling of protein cells were grown in M9 minimal medium supplemented with 0.5 g/l 15NH4Cl. For selenomethionine-substituted protein, cells were grown in M9 minimal medium supplemented with an amino-acid mix of L-alanine, L-arginine, L-aspartic acid, L-cysteine, L-glutamate, L-glycine, L-histidine, L-isoleucine, L-leucine, L-lysine, L-phenylalanine, L-proline, L-serine, L-threonine, L-tyrosine, L-valine, and selenomethionine (100 mg/l each).
After reaching an OD600 of 0.8, cell cultures were cooled down to 18°C and expression was induced by adding 0.25 mM IPTG. Cells were harvested after 18 hr of expression. GST-tagged proteins were purified by GST-affinity chromatography (GE Healthcare, Munich, Germany). After protease cleavage, the GST tag was removed by a glutathione-sepharose column. Nucleic acids were removed by using an anion-exchange Q column (GE Healthcare) followed by size exclusion chromatography with buffer containing 250 mM NaCl, 20 mM Hepes pH 8.0. For cysteine-containing and for selenomethionine-substituted proteins 2 mM DTT was added to the buffer. For NMR experiments size exclusion chromatography was performed in 50 mM potassium phosphate buffer pH 7.0 and 200 mM NaCl (NMR buffer). Absence of nucleic-acid contamination was confirmed by measuring the ratio of absorption at 260/280 nm (Edelmann et al., 2014).
To confirm proper protein folding of the Pur-alpha mutants CD spectra (wavelength 190–260 nm) were recorded with a JASCO-715 spectropolarimeter at 5°C in a 0.1-cm cuvette. Proteins were diluted in buffer containing 250 mM NaCl, 20 mM Hepes pH 8.0, and 2 mM DTT to a final protein concentration of 30 µM in 300 µl total volume. Five scans were taken with a speed of 50 nm/min.
Crystallization was carried out with freshly prepared selenomethionine-substituted Pur-alpha repeat I-II (residues 40–185) in size exclusion buffer (250 mM NaCl, 20 mM Hepes pH 8, 2 mM DTT). The protein was mixed with commercially purchased GCGGCGG ssDNA oligonucleotides, dissolved in Milli-Q H2O at a ratio 1:2.2 (protein:DNA). The final protein concentration was 1.77 mg/ml. A drop size of 3 µl and a 2:1 mixture of protein-DNA complex and crystallization solution were used for hanging-drop vapor-diffusion at 21°C using 24-well EasyXtal Crystal Support plates (Qiagen, Hilden, Germany). The crystallization solution contained 50 mM MES pH 5.2, 500 mM (NH4)2SO4, 1 mM TCEP, and 16% PEG400. The total reservoir volume was 500 µl. Rod-shaped crystals of 160 x 20 µm size appeared within 4 days. Prior to data collection, crystals were cryoprotected in mother liquor and flash frozen in liquid nitrogen. Native dataset was recorded at 100 K at beamline ID23-2 (European Synchrotron Radiation Facility [ESRF] Grenoble, France). Crystals diffracted up to 2.0 Å. Data were integrated and scaled with XDS (Kabsch, 1993). Structure was solved by molecular replacement with PHASER (McCoy et al., 2007) using the apo-structure of Drosophila Pur-alpha 40–185 (PDB ID 3K44) as template and model building was manually completed using COOT (Emsley et al., 2010). Refinement of the native data was performed with PHENIX (Adams et al., 2010) using NCS and TLS. The final model was analyzed with SFCHECK (Vaguine et al., 1999), PHENIX, and REFMAC (Murshudov et al., 1997;, Terwilliger, 2002). Superpositioning of the apo-structure with the DNA-complexed structure of Pur-alpha was performed with the superpose algorithm (Krissinel and Henrick, 2004) of the program COOT. Images and movie of the crystal structure, superimpositions of the co-complex and apo-structure, as well as electrostatic surface potentials were prepared with PyMol (Version 1.7; Schrodinger LLC.; http://www.pymol.org/). All crystallographic software was used from the SBGRID software bundle (Morin et al., 2013). Structural model and dataset is available http://www.rcsb.org (PDB-ID: 5FGP).
Selenomethionine-substituted crystals of Pur-alpha repeat III (residues 188–258) were grown at 4°C with a protein concentration of 0.5–2 mg/ml. The crystallization solution contained 50 mM MES pH 6.5, 200 mM NaCl, 16% PEG 3350, and 6% MPD. Plate-shaped crystals of approximately 70 × 70 × 10 µm size appeared within 2–4 days. For cryo-protection, crystals were shortly incubated in reservoir solution containing 30% ethylene glycol in two steps and then flash frozen in liquid nitrogen.
Native dataset was recorded at 100 K at beamline ID14-1 [ESRF]. Crystals showed good diffraction up to 2.6 Å and belonged to space group P21 (see Table 1). The data were integrated and scaled with the XDS program package. Phases were obtained by molecular replacement using PHASER together with Borrelia burgdorferi Pur-alpha and Drosophila melanogaster Pur-alpha repeat I-II structures as a search model. Best results were achieved using a truncated version of the search models lacking the loop regions and poly-serine as amino-acid sequence. Parts of the initial model were built automatically with Buccaneer (Cowtan, 2006) and manually completed using COOT. Refinement was performed with PHENIX using NCS with 6 monomers per asymmetric unit. Structural model and dataset is available http://www.rcsb.org (PDB-ID: 5FGO).
For RNA-labeling RNase-free buffers, materials, and reagents were used. Ten picomol of chemically synthesized DNA or RNA oligonucleotides were phosphorylated at the 5’-end with 10 pmol γ-32P ATP by T4 polynucleotide kinase (New England Biolabs, Frankfurt, Germany) with buffer A in a final volume of 20 µl. Labeling reaction was carried out at 37°C and stopped after 30 min by incubation at 70°C for 10 min. Labeled oligonucleotides were purified by a NucAway™ Spin column (Ambion, Ulm, Germany) and stored at -20°C.
The protein-nucleic acid complexes were formed in RNase-free binding buffer containing 250 mM NaCl, 20 mM Hepes pH 8.0, 3 mM MgCl2, 4% glycerol, 2 mM DTT). Serial protein dilutions and a constant amount of radiolabeled nucleic acid (2.5 nM) were incubated in a total reaction volume of 20 µl for 20 min at 21°C. DNA-binding experiments contained 25 µg/ml Salmon Sperm DNA, and RNA-binding experiments contained 100 µg/ml yeast tRNA competitor. Ten microliter of the reactions were loaded onto 6% TBE polyacrylamide gels. After electrophoresis (45 min, 100 V), gels were incubated for 15 min in fixing solution ([v/v] 10% acetic acid, [v/v] 30% methanol), dried in a gel dryer (BioRad, Munich, Germany) and analyzed with radiograph films in a Protec Optimax developer (Hohmann, Hannover, Germany).
Sequences of oligonucleotides were as follows: MF0677 ssDNA/RNA, 5’-GGAGGTGGTGGAGGGAGAGAAAAG-3’; CGG ssDNA/RNA, 5’-(CGG)8–3’.
For fluorescence-polarization measurements, protein-nucleic acid complexes were formed in buffer containing 500 mM NaCl, 20 mM Hepes pH 7.5, 3 mM MgCl2, 2 mM DTT). In comparison to EMSA, higher salt concentrations were used (500 mM versus 250 mM) to allow for binding experiments at higher protein concentrations without aggregation of Pur-alpha. Serial protein dilutions and a constant amount of fluorescein-labeled MF0677 ssDNA or ssRNA (100 nM) were incubated for 20 min at 21°C in a total reaction volume of 40 µl. DNA-binding reactions contained 25 µg/ml Salmon Sperm DNA and RNA-binding reactions contained 100 µg/ml yeast tRNA as competitor. Measurements were performed on an Envision Multilabel reader (Perkinelmer). The excitation and emission wavelengths were 485 nm and 535 nm, respectively. The dissociation constant was calculated by fitting the data with the one-site binding model included in the program origin (OriginLab). The experiment was performed as triplicates.
Equation for one-site binding: y=Bmax*x/(k1+x). y = specific binding, x = ligand concentration, Bmax = maximum specific binding, k1 = equilibrium binding constant.
All NMR spectra were recorded in NMR buffer with 5% D2O at 298 K using a Bruker Avance III spectrometer equipped with a TCI cryogenic probe head, at field strengths corresponding to 900 MHz proton Larmor frequency. To study DNA/RNA binding 1H,15N HSQC NMR spectra were recorded of 15N-labeled protein (50 µM) titrated with nucleic acids with different stoichiometric ratio of protein:nucleic acid (1:0.25, 1:0.5, 1:0.75, 1:1, 1:1.25, 1:1.5, 1:2.5, and 1:5). For every spectrum, 256 increments in the 15N indirect dimension with eight scans and an interscan delay of 1 s were acquired. Spectra were recorded and processed with Topspin 3.2 (Bruker) and analyzed with CCPNMR analysis (Vranken et al., 2005).
Unwinding assays were carried out according to reference (Darbinian et al., 2001). A dsDNA substrate was prepared by annealing a complementary 18-mer oligonucleotide to a GGN motif of the M13mp18 ssDNA plasmid. The 18-mer was labeled with γ-32P ATP. Protein dilutions were added to a constant amount of dsDNA substrate (100 ng) in binding buffer composed of 150 mM NaCl, 20 mM Hepes pH 8.0. Samples were incubated at 37°C for 1 hr. The unwinding reaction was stopped by adding SDS to a final concentration of (v/v) 0.3%. Samples were run on 9% native polyacrylamide gels in 1x TBE buffer for 150 min at 200 V. Gels were incubated for 15 min in fixing solution ([v/v] 10% acetic acid, [v/v] 30% methanol), dried and analyzed with radiograph films. The sequence of the 18-mer oligonucleotide was as follows: 5’-TCAGAGCCGCCACCCTCA-3’.
Filter-binding assays were performed as described (Wong and Lohman, 1993). Protein was titrated to a constant amount of 1 µM MF0677 ssDNA (thereof 2.5 nM radiolabeled) in a final volume of 80 µl and incubated for 20 min at 21 °C in binding buffer 150 mM NaCl, 20 mM Hepes pH 8.0. Nitrocellulose filter (Roth, Karlsruhe, Germany) was presoaked for 10 min in 0.4 M KOH followed by intensive washing with Milli-Q H2O. Nitrocellulose and nylon filters (Roth) were then equilibrated in binding buffer for 15 min. Both filters (nitrocellulose, top; nylon filter, bottom) were placed into a dot-blot apparatus (BioRad). Vacuum was applied and the wells were washed once with 80-µl binding buffer before and after samples were loaded. The nitrocellulose filters were analyzed using a phosphor imager system to measure the retained radiolabeled oligonucleotides on the nitrocellulose filter. Quantification was done using the dot blot analyzer plug-in of the ImageJ 1.47v software (National Institute of Health, USA).
Transgenic flies expressing rCGG90 repeats were obtained as previously described (Jin et al., 2003). The pUAST constructs were generated by cloning cDNA of full-length Drosophila Pur-alpha into the pUAST transformation vectors. The constructs were confirmed by DNA sequencing and then injected in a w1118 strain by standard methods. Fly lines were grown on standard medium with yeast paste added. All crosses were performed at 25°C.
For scanning electron microscopy (SEM) images, whole flies were dehydrated in ethanol, dried with hexamethyldisilazane (Sigma-Aldrich, Hamburg and Seezle, Germany), and analyzed with an ISI DS-130 LaB6 SEM/STEM microscope.
PHENIX: a comprehensive python-based system for macromolecular structure solutionActa Crystallographica Section D Biological Crystallography 66:213–221.https://doi.org/10.1107/S0907444909052925
The HeLa pur factor binds single-stranded DNA at a specific element conserved in gene flanking regions and origins of DNA replicationMolecular and Cellular Biology 12:1257–1265.https://doi.org/10.1128/MCB.12.3.1257
Sequence of cDNA comprising the human pur gene and sequence-specific single-stranded-DNA-binding properties of the encoded proteinMolecular and Cellular Biology 12:5673–5682.https://doi.org/10.1128/MCB.12.12.5673
The buccaneer software for automated model building. 1. tracing protein chainsActa Crystallographica Section D Biological Crystallography 62:1002–1011.https://doi.org/10.1107/S0907444906022116
Helix-destabilizing properties of the human single-stranded DNA- and RNA-binding protein pur?Journal of Cellular Biochemistry 80:589–595.https://doi.org/10.1002/1097-4644(20010315)80:4<589::AID-JCB1013>3.0.CO;2-0
X-ray structure of pur- reveals a whirly-like fold and an unusual nucleic-acid binding surfaceProceedings of the National Academy of Sciences of the United States of America 106:18521–18526.https://doi.org/10.1073/pnas.0907990106
A developmentally regulated DNA-binding protein from mouse brain stimulates myelin basic protein gene expressionMolecular and Cellular Biology 13:3103–3112.https://doi.org/10.1128/MCB.13.5.3103
A 39-kD DNA-binding protein from mouse brain stimulates transcription of myelin basic protein gene in oligodendrocytic cellsThe Journal of Cell Biology 130:1171–1179.https://doi.org/10.1083/jcb.130.5.1171
CGG-repeat length threshold for FMR1 RNA pathogenesis in a cellular model for FXTASHuman Molecular Genetics 20:2161–2170.https://doi.org/10.1093/hmg/ddr101
Lack of pur-alpha alters postnatal brain development and causes megalencephalyHuman Molecular Genetics 21:473–484.https://doi.org/10.1093/hmg/ddr476
The pur protein family: genetic and structural features in development and diseaseJournal of Cellular Physiology 228:930–937.https://doi.org/10.1002/jcp.24237
Role of pur in targeting mRNA to sites of translation in hippocampal neuronal dendritessJournal of Neuroscience Research 83:929–943.https://doi.org/10.1002/jnr.20806
Breathing fluctuations in position-specific DNA base pairs are involved in regulating helicase movement into the replication forkProceedings of the National Academy of Sciences of the United States of America 109:14428–14433.https://doi.org/10.1073/pnas.1212929109
Automatic processing of rotation diffraction data from crystals of initially unknown symmetry and cell constantsJournal of Applied Crystallography 26:795–800.https://doi.org/10.1107/S0021889893005588
Secondary-structure matching (sSM), a new tool for fast protein structure alignment in three dimensionsActa Crystallographica Section D Biological Crystallography 60:2256–2268.https://doi.org/10.1107/S0907444904026460
Mutations in PURA cause profound neonatal hypotonia, seizures, and encephalopathy in 5q31.3 microdeletion syndromeThe American Journal of Human Genetics 95:579–583.https://doi.org/10.1016/j.ajhg.2014.09.014
Refinement of macromolecular structures by the maximum-likelihood methodActa Crystallographica Section D Biological Crystallography 53:240–255.https://doi.org/10.1107/S0907444996012255
Nonlinear analysis of the dynamics of DNA breathingJournal of Biological Physics 35:73–89.https://doi.org/10.1007/s10867-009-9127-2
Targeting RNA foci in iPSC-derived motor neurons from ALS patients with a C9ORF72 repeat expansionScience Translational Medicine 5:208ra149.https://doi.org/10.1126/scitranslmed.3007529
De novo mutations in PURA are associated with hypotonia and developmental delayMolecular Case Studies 1:a000356.https://doi.org/10.1101/mcs.a000356
Automated structure solution, density modification and model buildingActa Crystallographica Section D Biological Crystallography 58:1937–1940.https://doi.org/10.1107/S0907444902016438
Association of pur with RNAs homologous to 7 SL determines its binding ability to the myelin basic protein promoter DNA sequenceJournal of Biological Chemistry 273:22241–22247.https://doi.org/10.1074/jbc.273.35.22241
SFCHECK : a unified set of procedures for evaluating the quality of macromolecular structure-factor data and their agreement with the atomic modelActa Crystallographica Section D Biological Crystallography 55:191–205.https://doi.org/10.1107/S0907444998006684
A double-filter method for nitrocellulose-filter binding: application to protein-nucleic acid interactionsProceedings of the National Academy of Sciences of the United States of America 90:5428–5432.https://doi.org/10.1073/pnas.90.12.5428
Mechanism of DNA binding and localized strand separation by pur and comparison with pur family memberr,purβBiochimica et Biophysica Acta 1743:64–78.https://doi.org/10.1016/j.bbamcr.2004.08.010
Expanded GGGGCC repeat RNA associated with amyotrophic lateral sclerosis and frontotemporal dementia causes neurodegenerationProceedings of the National Academy of Sciences of the United States of America 110:7778–7783.https://doi.org/10.1073/pnas.1219643110
Karsten WeisReviewing Editor; ETH Zürich, Switzerland
In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.
Thank you for submitting your work entitled "Structural basis of nucleic-acid recognition and double-strand unwinding by the essential neuronal protein Pur-alpha" for consideration by eLife. Your article has been favorably evaluated by Richard Aldrich (Senior Editor), Karsten Weis (Reviewing Editor) and by two peer reviewers.
The reviewers have discussed the reviews with one another and the Reviewing editor has drafted this decision to help you prepare a revised submission.
In this manuscript, Weber et al. report a 2.0 Å crystal structure of the Pur-alpha repeat I-II in complex with a ssDNA oligo, and a 2.6 Å crystal structure of the Pur-alpha repeat III. In addition, the authors biochemically characterize the nucleic acid binding properties and the unwinding activity of Pur-alpha and its repeats. These in vitro data are complemented with work performed in Drosophila.
There was agreement amongst the reviewers that this study is of general interest and that, in general, the conclusions are well-founded. However, there were some concerns about the functional analysis of the protein and revisions are needed before the paper can be accepted for publication in eLife.
1) There is a concern regarding the discussion of the nucleic acid specificity of Pur-alpha: "Together, our biochemical, NMR, and x-ray crystallographic insights confirm that Pur-alpha binds DNA and RNA in the same way and thus will interact equally with both types of nucleotides (should be nucleic acids) in the cell." The authors show that the Pur-alpha repeats I-II do not display any specificity towards ssRNA or ssDNA in vitro. Furthermore, they detected only a weak binding of repeat III to nucleic acids. However, the authors did not test the full-length protein in their assays. There might be a significant contribution of the additional parts of the protein (i.e. the Gly-rich N-terminus and the Gln/Glu-rich C-terminus) towards nucleic acid specificity. Furthermore, repeat I-II together with repeat III in the context of the full-length protein might show nucleic acid specificity.
In the event that the authors cannot express the full-length protein and test it in their assays, the discussion needs be written more cautiously, and it should be made clearer what conclusions could be drawn concerning full-length Pur-alpha.
2) It remains somewhat confusing whether the nucleic acid binding activity of the mutants correlates with their abilities to rescue the neurodegeneration phenotype. The nucleic acid binding of the rescuing RI-RII mutation should be tested in vitro and compared to the affinity of non-rescuing mutations. To this end, EMSAs should be quantified and apparent KD values should be extracted.
3) DNA binding of the Pur-alpha repeats I and II occurs via largely equivalent surfaces. However, most of the residues that directly contact DNA are non-equivalent (except for the equivalent K61 in repeat I and K138 in repeat II). Mutating residues K61, N63 and R65 in repeat I (triple mutant KNR I) leads to deficiency in nucleic acid binding and unwinding. However, only K61 of this motif directly binds DNA. The contributions of these three residues to DNA binding should be discussed in more detail (direct effects by altering K61, indirect or no effects by altering N63 and R65?). To allow better comprehension/interpretation of the effects of the KNR I triple mutant, the authors should provide an additional figure panel that shows the positions and conformations of all three residues with respect to bound DNA.
4) Table 1 needs to be corrected, as it contains a few careless mistakes:
In the data collection part, the numbers for the resolution do not seem to be fully correct. The data on the Pur-alpha repeat III range from 50 to 0 Å, while the highest resolution shell is indicated as ranging from 47.7-2.6 Å, which does not correlate with the statistical values, such as I/σ etc. Furthermore, it is odd that the completeness of the data is greater for the highest resolution shell than for the full data set.
Similarly, the highest resolution shell for the Pur-alpha I-II/DNA complex is mis-indicated as ranging from 41.9 – 2.0 Å.
Although described in the Methods section, the stereochemistry outliers should be included in the table.https://doi.org/10.7554/eLife.11297.026
Essential revisions: 1) There is a concern regarding the discussion of the nucleic-acid specificity of Pur-alpha: "Together, our biochemical, NMR, and x-ray crystallographic insights confirm that Pur-alpha binds DNA and RNA in the same way and thus will interact equally with both types of nucleotides (should be nucleic acids) in the cell." The authors show that the Pur-alpha repeats I-II do not display any specificity towards ssRNA or ssDNA in vitro. Furthermore, they detected only a weak binding of repeat III to nucleic acids. However, the authors did not test the full-length protein in their assays. There might be a significant contribution of the additional parts of the protein (i.e. the Gly-rich N-terminus and the Gln/Glu-rich C-terminus) towards nucleic-acid specificity. Furthermore, repeat I-II together with repeat III in the context of the full-length protein might show nucleic acid specificity. In the event that the authors cannot express the full-length protein and test it in their assays, the discussion needs be written more cautiously, and it should be made clearer what conclusions could be drawn concerning full-length Pur-alpha.
As suggested, we expressed full-length Pur-alpha and quantified its binding to ssDNA and ssRNA using fluorescence-polarization experiments. This information is included in Figure 1D and explained in the second paragraph of the subsection “Pur-alpha binds RNA and DNA with similar affinities”.
The results show that both types of nucleic acids are bound by full-length Pur-alpha in the same affinity range. MF0677 ssRNA is bound about 2-fold stronger than ssDNA (KD for DNA = 1.4 µM and for RNA = 0.7 µM), suggesting that sequences outside the PUR repeats I-II moderately contribute to the binding.
2) It remains somewhat confusing whether the nucleic-acid-binding activity of the mutants correlates with their abilities to rescue the neurodegeneration phenotype. The nucleic-acid binding of the rescuing R I - R II mutation should be tested in vitro and compared to the affinity of non-rescuing mutations. To this end, EMSA should be quantified and apparent KD values should be extracted.
Because quantification of EMSA is usually not very accurate, in particular when more than one shifted band is observed (please see Figures 1B,C, 3A-E, and Figure 4B), we decided to perform fluorescence-polarization experiments instead. Binding experiments were performed with wild-type Pur-alpha I-II and with repeat III, as well as with all requested mutants. The results are included as a new table in Figure 3H and representative binding curves of all proteins are shown in Figure 3—figure supplement 2. The observed KD values are consistent with our results from EMSA. Interestingly, mutations in PUR repeat II have stronger effects than mutations in PUR repeat I. This observation is consistent with the co-structure, in which also repeat II makes the main interactions with DNA. As shown in a previous publication (Graebsch et al. PNAS 2009), the R I – R II mutant failed to bind DNA in vitro. In summary, the quantifications provide strong support for our conclusions drawn in the initial manuscript.
We are discussing these results and their relation to in vivo rescue experiments in the Discussion.
3) DNA binding of the Pur-alpha repeats I and II occurs via largely equivalent surfaces. However, most of the residues that directly contact DNA are non-equivalent (except for the equivalent K61 in repeat I and K138 in repeat II). Mutating residues K61, N63 and R65 in repeat I (triple mutant KNR I) leads to deficiency in nucleic-acid binding and unwinding. However, only K61 of this motif directly binds DNA. The contributions of these three residues to DNA binding should be discussed in more detail (direct effects by altering K61, indirect or no effects by altering N63 and R65?). To allow better comprehension/interpretation of the effects of the KNR I triple mutant, the authors should provide an additional figure panel that shows the positions and conformations of all three residues with respect to bound DNA.
We address this issue with a more detailed description and discussion of these residues in the Results and Discussion sections. As suggested, we also provide an additional figure, in which residues K61, N63, and R65 are shown as close-up (Figure 2—figure supplement 4). They can now be directly compared with the close-up of residues K138, N140, and R142 in Figure 2C. We are grateful for this suggestion, as this aspect was indeed not sufficiently covered.
4) Table 1 needs to be corrected, as it contains a few careless mistakes: In the data collection part, the numbers for the resolution do not seem to be fully correct. The data on the Pur-alpha repeat III range from 50 to 0 Å, while the highest resolution shell is indicated as ranging from 47.7-2.6 Å, which does not correlate with the statistical values, such as I/σ etc. Furthermore, it is odd that the completeness of the data is greater for the highest resolution shell than for the full data set.
Similarly, the highest resolution shell for the Pur-alpha I-II/DNA complex is mis-indicated as ranging from 41.9 – 2.0 Å.
Although described in the Methods section, the stereochemistry outliers should be included in the table.
We are grateful to the reviewers for pointing us to these obvious errors. We have corrected and double-checked all numbers, and apologize for these mistakes.
Regarding the “greater completeness of the data for the highest resolution shell than for the full data set”, we would like to emphasize that this is in fact correct. The likely reason is that low-resolution data are less complete than the average dataset. Since low-resolution data have a considerable impact on crystallographic statistics, effects as in our datasets can sometimes be observed. We would like to add that we verified the correctness of these statistics (including the identical redundancies for the dataset of Pur-alpha repeat III).
As suggested, we moved the statistics of the stereochemistry from the Methods section to Table 1 and provide the wavelength of data collection in this table. We also added information on the beamline, detector distance, number of images, and oscillation range. We hope that this table now includes all relevant data.https://doi.org/10.7554/eLife.11297.027
- Dierk Niessing
- Janine Weber
- Dierk Niessing
- Dierk Niessing
- Dierk Niessing
- Tobias Madl
- Tobias Madl
- Tobias Madl
- Tobias Madl
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
We thank Marietta Truger and Stephane Roche for support during structure determination.
- Karsten Weis, ETH Zürich, Switzerland
© 2016, Weber et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.