Structure of catalase determined by MicroED
Abstract
MicroED is a recently developed method that uses electron diffraction for structure determination from very small three-dimensional crystals of biological material. Previously we used a series of still diffraction patterns to determine the structure of lysozyme at 2.9 Å resolution with MicroED (Shi et al., 2013). Here we present the structure of bovine liver catalase determined from a single crystal at 3.2 Å resolution by MicroED. The data were collected by continuous rotation of the sample under constant exposure and were processed and refined using standard programs for X-ray crystallography. The ability of MicroED to determine the structure of bovine liver catalase, a protein that has long resisted atomic analysis by traditional electron crystallography, demonstrates the potential of this method for structure determination.
https://doi.org/10.7554/eLife.03600.001Introduction
MicroED is an emerging method, which uses electron diffraction to obtain structural information from extremely small three-dimensional (3D) crystals of biological material. In the original MicroED proof of concept paper (Shi et al., 2013), electron diffraction data were collected from stationary lysozyme microcrystals and the structure was determined to 2.9 Å resolution from a series of still diffraction patterns. The method was substantially improved by employing ‘continuous rotation’ where data were recorded as the crystals were continuously rotated, resulting in higher quality data and allowing simple integration with existing processing programs used for X-ray crystallography (Nannenga et al., 2014). This led to the structure of lysozyme being determined to 2.5 Å resolution with improved statistics and data quality relative to the original lysozyme MicroED study.
In this work, we used thin bovine liver catalase 3D microcrystals for structure determination by MicroED. Catalase is a more difficult target than lysozyme, because it has a much larger unit cell, lower symmetry, and four molecules in the asymmetric unit. Moreover, each catalase monomer contains one heme group as well as a bound NADP molecule (Fita and Rossmann, 1985). Catalase is one of the earliest samples studied by EM, but despite extensive efforts spanning decades, the 3D structure of catalase has not been solved using electron diffraction. This is because the crystals have variable thicknesses (6–10 protein layers have been reported [Dorset and Parsons, 1975b]), and therefore these crystals were not suitable for 3D structure determination by traditional electron crystallography procedures (Longley, 1967; Matricardi et al., 1972; Unwin, 1975; Unwin and Henderson, 1975; Dorset and Parsons, 1975a). Here we report the 3.2 Å structure of catalase determined by MicroED. This is an important next step for the MicroED method as the analysis was rapid, taking a total of 3 weeks from crystal growth to final structural refinement, and used data from only a single catalase microcrystal.
Results and discussion
Sample preparation and data collection
Catalase was chosen for this study as it readily forms thin 3D microcrystals, which can be analyzed by transmission electron microscopy (TEM) (Sumner and Dounce, 1937; Dorset and Parsons, 1975a; Baker et al., 2010). Well-ordered catalase microcrystals were grown by solubilizing crystalline catalase from an aqueous suspension followed by overnight dialysis in 0.05 M Sodium phosphate, pH 6.3. Following crystal formation, crystals were deposited on holey carbon grids and the sample was blotted and vitrified in liquid ethane prior to TEM analysis. The grids were screened in over-focused diffraction mode for the presence of thin 3D crystals (Figure 1, inset). The average crystal dimensions were found to be on the order of 8 µm by 4 µm in length and width and approximately 150 nm thick, in good agreement with previous crystallization results (Dorset and Parsons, 1975b). Suitable crystals were assessed by collecting a single still diffraction pattern at a total dose of 0.05 e−/Å2, where well-ordered crystals showed sharp reflections extending to approximately 3.0 Å for untilted crystals (Figure 1). When high-quality diffraction was observed for an untilted crystal, the crystal was then tilted to 60° to check the diffraction quality at higher tilt angles as crystal flatness and embedding could affect the diffraction quality at higher tilt (Gonen, 2013). Typically, well-embedded and relatively flat crystals yielded data to ∼2.8 Å resolution untilted but only ∼3.2 Å at high tilt. Data sets were then collected as a sequence of 6 s exposures per frame. An example data set is shown in Video 1 and was recorded as the stage was continuously rotated as described previously (Nannenga et al., 2014).
Data analysis, processing, and structure refinement
We sought to analyze the levels of dynamic scattering in our catalase data in order to validate whether kinematical scattering could be assumed. We quantified the dynamic scattering of the catalase crystals using the ratio of the strongest diffracted intensity to the unscattered incident beam (Unwin and Henderson, 1975), as well as the ratio of the sum of all diffracted intensities on an image to the incident beam intensity as described by Dorset and Parsons (1975a). For the kinematical theory to apply, these ratios must be low. A representative crystal, which was approximately 200 nm thick, showed a ratio of 8.4 × 10−3 for the maximum intensity and 0.18 for the sum of all intensities, which are close to the previously reported values (Unwin and Henderson, 1975; Dorset and Parsons, 1975a), indicating that kinematical assumption is valid for these diffraction data.
Data sets from five microcrystals were each integrated with MOSFLM (Leslie and Powell, 2007) followed by merging and scaling with POINTLESS (Evans, 2011) and AIMLESS (Evans and Murshudov, 2013). All crystals yielded comparable resolution but varied in data completeness (Table 1). We combined all five data sets in an effort to increase completeness, multiplicity and to improve the quality of the data and in parallel we processed the data from crystal 4 separately for comparison. Crystal 4 was chosen for processing separately because it had a good compromise between resolution and completeness. The data sets were processed to a range of resolutions (Table 2) and phases were determined by molecular replacement (MR) as implemented in MOLREP (Vagin and Teplyakov, 1997) with PDB ID: 3NWL (Foroughi et al., 2011) as a search model. Following MR, refinement using PHENIX and REFMAC (Murshudov et al., 1997; Adams et al., 2010) with electron scattering factors was performed. Merging data from multiple crystals did not have a significant effect on data completeness, because the catalase crystals all orient on the grid with the c-axis parallel to the electron beam (Table 2). When comparing the statistics presented in Table 2, it was clear that merging multiple crystals had a negative impact on the final refinement statistics, most likely due to non-isomorphism between crystals, and therefore the multiple crystal data sets were disregarded. Crystals 1, 2, 3, and 5 were relatively isomorphous and their diffraction data merged well, but the completeness of this data set was low (∼62%), and we chose not to use it.
The single crystal 4 dataset was used for the remainder of the study. Close analysis indicated that the information content in the 3–3.2 Å resolution shell was too low for inclusion, which was not surprising as the crystals did not diffract well beyond 3.2 Å at high tilt angles. We therefore truncated the resolution to 3.2 Å yielding a final model with acceptable refinement statistics (Rwork/Rfree = 26.2%/30.8%) and geometry (Table 3, Figure 2A, Video 2). The final 2mFobs-DFcalc density map shows well-defined density surrounding the final refined model (overall map CC = 92.3%), both around the backbone and the side-chains (Figure 2B, Video 2), without significant peaks in the mFobs-DFcalc difference density map (Figure 2C). Additionally, the solvent channels between the tetramers in the crystal lattice show very little density (Figure 2D), further evidence of the quality of the model and data. The final structure of catalase at 3.2 Å resolution determined by MicroED agrees well with previously solved X-ray structures, with an RMSD of 0.358 Å and 0.440 Å between the MicroED structure and PDB ID: 3NWL (Foroughi et al., 2011) and PDB ID: 4BLC (Ko et al., 1999), respectively.
Model validation
The orientation of our crystals prevented the full sampling of reciprocal space leading to systematic incompleteness (missing wedge or missing cone). While the incompleteness of data is significant, the resulting maps are still expected to be good enough for proper interpretation (Glaeser et al., 1989). To test the quality of the data and the resulting maps, and to identify any significant model bias or negative effects of the data incompleteness, the data were put through several validation tests. First, the robustness of the MR solution was tested by repeating the MR with a single monomeric chain of PDB ID: 3NWL (Foroughi et al., 2011) instead of the complete tetramer that was used originally. Even with a single chain, a strong solution was found with all four molecules successfully placed to recreate the complete tetramer (Figure 2—figure supplement 1; top MOLREP contrast score = 35.8, where a score of >3.0 is considered a strong solution. An identical test with synchrotron X-ray data yielded a top score of 23.7). We also phased the data with a poly-alanine model derived from PDB ID: 3NWL, and the resulting maps show clear density beyond the model where the correct side-chains could be rebuilt (Figure 2—figure supplement 1B,C).
In order to test the quality of the resulting maps following MR, autobuilding with Buccaneer (Cowtan, 2006) was performed and yielded 2120 residues in 160 fragments (Figure 2—figure supplement 1D). Out of the built residues, 1280 traced the correct backbone and 467 side-chains were correctly assigned. Manual curation in Coot (Emsley et al., 2010), exploiting the fourfold non-crystallographic symmetry, resulted in a nearly complete model indicating the maps initially produced from our data were of good quality.
The next validation test performed involved removing sections of the model and analyzing the effect this had on the resulting refined density maps. This was done in order to examine the strength of the data and to find any potential model bias introduced by the MR search model. For this test, two validation models were used, in which the same sections of all four monomers of the final tetrameric structure were removed. The first model had residues 181 to 185 removed (∆181–185) and the second model lacked the four heme groups (∆heme) that are normally found in catalase. Following refinement and simulated annealing, the resulting difference maps from the ∆181–185 model (Figure 3A) and ∆heme model (Figure 3C) both showed significant positive difference density corresponding to the deleted regions of the model. Additionally, automated ligand identification was performed on the ∆heme maps using phenix.ligand_identification (Terwilliger et al., 2006, 2007), and the program was able to correctly place two of the four heme groups present in the structure. The results of these tests indicate that the data do not suffer from bias introduced by the MR search model.
Next, we sought to determine whether data from MicroED was of sufficient accuracy to locate small molecule ligands and corresponding protein conformational changes in the ligand-binding pocket. Bovine catalase binds four NADP cofactors, one per monomer through several side-chain interactions including F197 (Kirkman and Gaetani, 1984; Fita and Rossmann, 1985). Recently, the structure of bovine catalase was solved lacking the NADP cofactor (PDB ID: 3RGP) (Purwar et al., 2011), and in the NADP-free structure F197 underwent a conformational change as it no longer interacted with NADP. The crystals used for our MicroED analysis do contain NADP. Therefore, we used PDB ID: 3RGP (NADP-free structure) as a molecular replacement model against the MicroED data to determine whether NADP could be visualized in our catalase crystals using difference maps. When analyzing the difference maps, positive density was observed in the location expected for NADP although it appeared fragmented even at lower contour levels (Figure 3E,G). For visual comparison, structure factors from PDB ID: 3NWL, which was solved by X-ray crystallography, were truncated to 3.2 Å, and difference maps were calculated for ∆181–185, ∆heme and NADP-free model (Figure 3B,D,F,H). Maps from both MicroED and synchrotron X-ray diffraction appear fragmented around the NADP even at lower contour levels (Figure 3G,H). The difference maps for both the MicroED and X-ray synchrotron data suggest that F197 should change its orientation to assume its correct position for NADP binding.
These results demonstrate the MicroED data is of sufficient quality to detect subtle differences among structures at atomic resolution. At the current level of methodology with samples that suffer from missing data like catalase, MicroED produces lower quality maps than synchrotron X-ray diffraction. However, the catalase crystals used for MicroED were approximately 1000 times smaller in volume than those used at the synchrotron (Foroughi et al., 2011), and the resulting maps are still of high enough quality to determine the structure.
Concluding Remarks
We present here the second protein structure determined by the emerging MicroED method. Bovine liver catalase resisted structural determination by traditional electron crystallography for decades, but the structure was readily determined by MicroED in 3 weeks from crystal formation to final structure determination using a single crystal. This is the second example where a single crystal was sufficient for structure determination by MicroED (Nannenga et al., 2014). Moreover, the continuous rotation method yields data similar in quality to X-ray diffraction allowing simple processing with existing X-ray data reduction software and further accelerating structure analysis by MicroED (Nannenga et al., 2014). The resulting maps allow us to distinguish between subtly different protein conformations and to identify of small-molecule ligands such as NADP. This study shows that MicroED can be used as an alternative to X-ray crystallography using extremely small crystals for both mechanistic studies as well as structure-based drug design studies where small ligands are assayed.
Materials and methods
Catalase crystallization and sample preparation
Request a detailed protocolCatalase was recrystallized from a commercial aqueous suspension of catalase (C100; Sigma–Aldrich, St. Louis, MO) by first centrifuging the crystalline suspension and dissolving the pellet in 1.7 M NaCl. The solubilized catalase was then centrifuged and the supernatant was dialyzed against 50 mM sodium phosphate pH 6.3 overnight at 4°C. Crystals were removed from dialysis, stored in an Eppendorf tube, and incubated an additional 24 hr at 4°C. Catalase crystals were stored at 4°C and were washed with water prior to sample preparation. To prepare samples for the TEM, crystals were resuspended and the undiluted catalase crystal suspension was applied, blotted and vitrified in liquid ethane as described previously (Shi et al., 2013).
Collection of electron diffraction data
Request a detailed protocolAll electron diffraction was performed on a FEI Tecnai F20 TEM operated at 200 kV with a selected area aperture (6 μm in diameter at the specimen) and data were collected with 4k × 4k TVIPS F416 CMOS cameras (15.6 μm pixel size). Diffraction data were collected with a frame rate of 1 frame per 6 s as the sample was continuously rotated from high to low tilt angle at ∼0.09° s−1 (0.54°/frame) as described previously (Nannenga et al., 2014). A data set of approximately 61° was collected from a single crystal. Crystal thickness was estimated by measuring the intensity of the crystal (I) relative to the intensity of a hole in the carbon film (I0) from an image and using Beer's law:
where ε is the molar absorptivity, c is the molar concentration, and t is the crystal thickness. As an approximation, the value of εc for catalase was assumed to be the same as those for calculated for lysozyme. The lysozyme coefficients were determined using images of lysozyme microcrystals with a known thickness as described previously (Nannenga et al., 2014).
Data processing and structure refinement
Request a detailed protocolRaw TEM diffraction data were converted and processed using MOSFLM v7.1.0 (Leslie and Powell, 2007) and it's graphical interface iMOSFLM v1.0.7 (Battye et al., 2011), POINTLESS (Evans, 2006), and AIMLESS (Evans and Murshudov, 2013) as described in previous work (Nannenga et al., 2014). MOLREP (Vagin and Teplyakov, 1997) was used to perform molecular replacement using catalase PDB ID: 3NWL (Foroughi et al., 2011) as a search model (MOLREP contrast score = 40.8), and the molecular replacement solution was refined in PHENIX (Adams et al., 2010) and REFMAC (Murshudov et al., 1997) using a 5% free data set. Maps in Figure 3E,F,G and H were calculated using BUSTER-TNT (Blanc et al., 2004). Maps and models were displayed using the UCSF Chimera package (Pettersen et al., 2004).
Data availability
-
The crystal structure of the P212121 form of bovine liver catalase previously characterized by electron microscopyPublicly available at RCSB Protein Data Bank.
-
The structure of orthorhombic crystals of beef liver catalasePublicly available at RCSB Protein Data Bank.
-
Structural and kinetic analysis of the beef liver catalase complexed with nitric oxidePublicly available at RCSB Protein Data Bank.
References
-
PHENIX: a comprehensive Python-based system for macromolecular structure solutionActa Crystallographica Section D, Biological Crystallography 66:213–221.https://doi.org/10.1107/S0907444909052925
-
The resolution dependence of optimal exposures in liquid nitrogen temperature electron cryomicroscopy of catalase crystalsJournal of Structural Biology 169:431–437.https://doi.org/10.1016/J.Jsb.2009.11.014
-
iMOSFLM: a new graphical interface for diffraction-image processing with MOSFLMActa Crystallographica Section D, Biological Crystallography 67:271–281.https://doi.org/10.1107/S0907444910048675
-
Refinement of severely incomplete structures with maximum likelihood in BUSTER-TNTActa Crystallographica Section D, Biological Crystallography 60:2210–2221.https://doi.org/10.1107/S0907444904016427
-
MolProbity: all-atom structure validation for macromolecular crystallographyActa Crystallographica Section D, Biological Crystallography 66:12–21.https://doi.org/10.1107/S0907444909042073
-
The Buccaneer software for automated model building. 1. Tracing protein chainsActa Crystallographica Section D, Biological Crystallography 62:1002–1011.https://doi.org/10.1107/S0907444906022116
-
Electron-diffraction from single, fully-hydrated, ox liver catalase microcrystalsActa Crystallographica Section D, Biological Crystallography 31:210–215.https://doi.org/10.1107/S0567739475000423
-
Thickness measurements of wet protein crystals in electron-microscopeJournal of Applied Crystallography 8:12–14.https://doi.org/10.1107/S0021889875009430
-
Features and development of CootActa Crystallographica Section D, Biological Crystallography 66:486–501.https://doi.org/10.1107/S0907444910007493
-
Scaling and assessment of data qualityActa Crystallographica Section D, Biological Crystallography 62:72–82.https://doi.org/10.1107/S0907444905036693
-
An introduction to data reduction: space-group determination, scaling and intensity statisticsActa Crystallographica Section D, Biological Crystallography 67:282–292.https://doi.org/10.1107/S090744491003982x
-
How good are my data and what is the resolution?Acta Crystallographica Section D, Biological Crystallography 69:1204–1214.https://doi.org/10.1107/S0907444913000061
-
The NADPH binding site on beef liver catalaseProceedings of the National Academy of Sciences of USA 82:1604–1608.https://doi.org/10.1073/pnas.82.6.1604
-
The collection of high-resolution electron diffraction dataMethods in Molecular Biology 955:153–169.https://doi.org/10.1007/978-1-62703-176-9_9
-
Catalase: a tetrameric enzyme with four tightly bound molecules of NADPHProceedings of the National Academy of Sciences of USA 81:4343–4347.https://doi.org/10.1073/pnas.81.14.4343
-
Structure of orthorhombic crystals of beef liver catalaseActa Crystallographica Section D, Biological Crystallography 55:1383–1394.https://doi.org/10.1107/S0907444999007052
-
Processing diffraction data with MOSFLMNATO Science Series II: Mathematics, Physics and Chemistry 245:41–51.https://doi.org/10.1007/978-1-4020-6316-9_4
-
Crystal structure of bovine liver catalase - a combined study by x-ray diffraction and electron microscopyJournal of Molecular Biology 30:323–327.https://doi.org/10.1016/S0022-2836(67)80042-1
-
Refinement of macromolecular structures by the maximum-likelihood methodActa Crystallographica Section D, Biological Crystallography 53:240–255.https://doi.org/10.1107/S0907444996012255
-
UCSF Chimera–a visualization system for exploratory research and analysisJournal of Computational Chemistry 25:1605–1612.https://doi.org/10.1002/jcc.20084
-
Ligand identification using electron-density map correlationsActa Crystallographica Section D, Biological Crystallography 63:101–107.https://doi.org/10.1107/S0907444906046233
-
Automated ligand fitting by core-fragment fitting and extension into densityActa Crystallographica Section D, Biological Crystallography 62:915–922.https://doi.org/10.1107/S0907444906017161
-
Beef liver catalase structure: interpretation of electron micrographsJournal of Molecular Biology 98:235–242.https://doi.org/10.1016/S0022-2836(75)80111-2
-
Molecular structure determination by electron microscopy of unstained crystalline specimensJournal of Molecular Biology 94:425–440.https://doi.org/10.1016/0022-2836(75)90212-0
-
MOLREP: an automated program for molecular replacementJournal of Applied Crystallography 30:1022–1025.https://doi.org/10.1107/S0021889897006766
Article and author information
Author details
Funding
Howard Hughes Medical Institute
- Brent L Nannenga
- Dan Shi
- Johan Hattne
- Francis E Reyes
- Tamir Gonen
The funder had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
The authors wish to thank Garib Murshudov (MRC LMB) for providing a version of REFMAC with support for electron scattering factors and Andrew Leslie (MRC LMB) for data processing support and advice. We also would like to thank Steven Sawtelle (HHMI Janelia Research Campus) for technical support and Joanita Jakana (Baylor) for the protocol for catalase crystallization. Work in the Gonen lab is supported by the Howard Hughes Medical Institute.
Copyright
© 2014, Nannenga et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 5,037
- views
-
- 611
- downloads
-
- 108
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Biochemistry and Chemical Biology
- Genetics and Genomics
Yerba mate (YM, Ilex paraguariensis) is an economically important crop marketed for the elaboration of mate, the third-most widely consumed caffeine-containing infusion worldwide. Here, we report the first genome assembly of this species, which has a total length of 1.06 Gb and contains 53,390 protein-coding genes. Comparative analyses revealed that the large YM genome size is partly due to a whole-genome duplication (Ip-α) during the early evolutionary history of Ilex, in addition to the hexaploidization event (γ) shared by core eudicots. Characterization of the genome allowed us to clone the genes encoding methyltransferase enzymes that catalyse multiple reactions required for caffeine production. To our surprise, this species has converged upon a different biochemical pathway compared to that of coffee and tea. In order to gain insight into the structural basis for the convergent enzyme activities, we obtained a crystal structure for the terminal enzyme in the pathway that forms caffeine. The structure reveals that convergent solutions have evolved for substrate positioning because different amino acid residues facilitate a different substrate orientation such that efficient methylation occurs in the independently evolved enzymes in YM and coffee. While our results show phylogenomic constraint limits the genes coopted for convergence of caffeine biosynthesis, the X-ray diffraction data suggest structural constraints are minimal for the convergent evolution of individual reactions.
-
- Biochemistry and Chemical Biology
- Structural Biology and Molecular Biophysics
The SARS-CoV-2 main protease (Mpro or Nsp5) is critical for production of viral proteins during infection and, like many viral proteases, also targets host proteins to subvert their cellular functions. Here, we show that the human tRNA methyltransferase TRMT1 is recognized and cleaved by SARS-CoV-2 Mpro. TRMT1 installs the N2,N2-dimethylguanosine (m2,2G) modification on mammalian tRNAs, which promotes cellular protein synthesis and redox homeostasis. We find that Mpro can cleave endogenous TRMT1 in human cell lysate, resulting in removal of the TRMT1 zinc finger domain. Evolutionary analysis shows the TRMT1 cleavage site is highly conserved in mammals, except in Muroidea, where TRMT1 is likely resistant to cleavage. TRMT1 proteolysis results in reduced tRNA binding and elimination of tRNA methyltransferase activity. We also determined the structure of an Mpro-TRMT1 peptide complex that shows how TRMT1 engages the Mpro active site in an uncommon substrate binding conformation. Finally, enzymology and molecular dynamics simulations indicate that kinetic discrimination occurs during a later step of Mpro-mediated proteolysis following substrate binding. Together, these data provide new insights into substrate recognition by SARS-CoV-2 Mpro that could help guide future antiviral therapeutic development and show how proteolysis of TRMT1 during SARS-CoV-2 infection impairs both TRMT1 tRNA binding and tRNA modification activity to disrupt host translation and potentially impact COVID-19 pathogenesis or phenotypes.