Structure of the human heparan-α-glucosaminide N-acetyltransferase (HGSNAT)

Vikas Navratna; Arvind Kumar; Shyamal Mosalaganti

doi:10.7554/eLife.93510.1

eLife assessment

This important work describes the first high-resolution structure of HGSNAT, a lysosomal membrane protein required for the degradation of heparan sulfate (HS). Through careful structural analysis, this work proposes potential reasons why certain mutations in HGSNAT lead to lysosomal storage disorders and outlines the enzyme's catalytic mechanism. The experimental evidence presented provides incomplete support for the proposed molecular mechanism of the HS acetylation reaction and the impact of disease-causing mutations.

https://doi.org/10.7554/eLife.93510.1.sa3

Significance of findings

important: Findings that have theoretical or practical implications beyond a single subfield

landmark
fundamental
important
valuable
useful

Strength of evidence

incomplete: Main claims are only partially supported

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

Degradation of heparan sulfate (HS), a glycosaminoglycan (GAG) comprised of repeating units of N-acetylglucosamine and glucuronic acid, begins in the cytosol and is completed in the lysosomes. Acetylation of the terminal non-reducing amino group of α-D-glucosamine of HS is essential for its complete breakdown into monosaccharides and free sulfate. Heparan-α-glucosaminide N-acetyltransferase (HGSNAT), a resident of the lysosomal membrane, catalyzes this essential acetylation reaction by accepting and transferring the acetyl group from cytosolic acetyl-CoA to terminal α-D-glucosamine of HS in the lysosomal lumen. Mutation-induced dysfunction in HGSNAT causes abnormal accumulation of HS within the lysosomes and leads to an autosomal recessive neurodegenerative lysosomal storage disorder called mucopolysaccharidosis IIIC (MPS IIIC). There are no approved drugs or treatment strategies to cure or manage the symptoms of, MPS IIIC. Here, we use cryo-electron microscopy (cryo-EM) to determine a high-resolution structure of the HGSNAT-acetyl-CoA complex in an open-to-lumen conformation, the first step in HGSNAT catalyzed acetyltransferase reaction. In addition, we map the known MPS IIIC mutations onto the structure and elucidate the molecular basis for mutation-induced HGSNAT dysfunction.

Introduction

Degradation of the extracellular matrix proteoglycans begins in the cytosol, where they are proteolyzed to glycosaminoglycans (GAGs) such as heparan sulfate (HS), chondroitin sulfate, dermatan sulfate, and keratan sulfate. The proteolyzed GAGs are further degraded to monosaccharides and free sulfate residues within the lysosomes in GAG specific multi-enzyme degradation pathways. Dysfunction, often inherited, of the enzymes involved in GAG degradation causes abnormal accumulation of the partial degradation products within the lysosomes, resulting in mucopolysaccharidoses (MPSs), a collective term for the group of rare autosomal recessive lysosomal storage disorders characterized by the accumulation of partially degraded GAGs (Coutinho et al, 2012; Huizing & Gahl, 2020; Klein et al, 1978; Platt et al, 2018; Ruivo et al, 2009). HS is a GAG made of repeating units of N-acetylglucosamine and glucuronic acid, and impairment of enzymes in the HS degradation pathway causes mucopolysaccharidosis III (MPS III), or Sanfilippo’s syndrome that is characterized by abnormal storage of HS degradation pathway intermediates within the lysosomes in all organs and excretion of these intermediates in the urine. The onset of the symptoms of MPS III, which include organomegaly, abnormal joint mobility, compromised cardiac function, degeneration of vision and hearing, and dementia, often begins in juveniles and ends in premature death. Currently, no therapies for MPS III are available (Coutinho et al, 2012; Huizing & Gahl, 2020; McBride & Flanigan, 2021; Pshezhetsky et al, 2018).

Depending on the dysfunctional enzyme, MPS III is classified into four subtypes – MPS IIIA-IIID. Of these four MPS III subtypes, MPS IIIC (1 in 1.4 million incidence) is caused by the dysfunction of heparan-α-glucosaminide N-acetyltransferase (HGSNAT, EC 2.3.1.78). HGSNAT is the only enzyme of the GAG pathway that is not a hydrolase. It catalyzes the only known biosynthetic reaction of the GAG degradation pathway within the lysosome, that is, the acetyl-CoA (ACO) mediated N-acetylation of the terminal non-reducing amino group of α-D-glucosamine (Fan et al, 2006; Hrebicek et al, 2006; Huizing & Gahl, 2020; Klein et al, 1978; Nagel et al, 2019; Pshezhetsky et al, 2018). Acetylation of the terminal α-D-glucosamine group is essential for subsequent HS degradation within the lysosomes. HGSNAT is not homologous to any known proteins, including acetyl-CoA binding proteins, N-acetyltransferases, and other lysosomal proteins. HGSNAT mRNA has two translation start sites (M1 and M29), yielding two functionally active isoforms (73 kDa and 70 kDa) of HGSNAT, both targeted to the lysosomal membrane (Durand et al, 2010; Fan et al, 2011). In this study, we use isoform 2 of HGSNAT, which is produced as a 635 amino acid protein containing a 30 amino acid N-terminal signal peptide, a ∼110 amino acid long luminal domain, and 11 transmembrane helices (TMs). HGSNAT is believed to be expressed in the cell as an immature precursor protein that localizes as an N-glycosylated dimer on the lysosomal membrane (Durand et al, 2010; Fan et al, 2011). The localization of HGSNAT happens via the adaptor protein-mediated pathway, aided by the lysosomal targeting motifs [DE]-XXXL[LI] (²⁰⁴ETDRLI²⁰⁹) and YXXØ (⁶²⁴YILYRKK⁶³⁰) present towards the N- and C-terminus of HGSNAT respectively (Rudnik & Damme, 2021; Schwake et al, 2013). Deleting the C-terminal lysosomal sorting signal in HGSNAT retains the protein in the plasma membrane (Bonifacino & Traub, 2003; Durand et al, 2010). Once targeted to the lysosomal membrane, HGSNAT, like a few other lysosomal membrane proteins, is proteolyzed by unidentified acid proteases in the lysosomal lumen into two unequal fragments - a smaller luminal N-terminal α-HGSNAT and a larger transmembrane C-terminal β-HGSNAT that continue to co-localize despite the proteolysis (Durand et al, 2010; Fan et al, 2011; Rudnik & Damme, 2021; Steenhuis et al, 2012). Upon proteolytic maturation, it is believed that HGSNAT is assembled as a hetero oligomer of α-& β-HGSNAT chains (Durand et al, 2010; Feldhammer et al, 2009b). However, the essentiality of proteolytic cleavage and oligomerization for HGSNAT activity remains debated as endoplasmic reticulum (ER) retained monomeric HGSNAT was also shown to be active (Durand et al, 2010; Fan et al, 2011).

So far, over 70 unique mutations in the HGSNAT (TMEM76) gene have been identified. These mutations span the entire sequence and include deletions, nonsense mutations, splice-site variants, and silent and missense mutations (Canals et al, 2011; Fan et al, 2006; Fedele & Hopwood, 2010; Feldhammer et al, 2009a; Feldhammer et al, 2009b; Hrebicek et al, 2006; Huizing & Gahl, 2020). The majority of the mutations that cause MPS IIIC are missense mutations that result in HGSNAT folding and localization defects, making them ideal targets for pharmacochaperone therapy. Site-specific inhibitors of HGSNAT activity have been explored as agents to rescue the misfolded conformation of certain missense variants, thereby restoring the partial N-acetyltransferase activity (Coutinho et al, 2012; Fedele & Hopwood, 2010; Feldhammer et al, 2009b; Pan et al, 2022). Despite its clinical relevance, the structure of HGSNAT and the mechanism of N-acetyltransferase activity are poorly understood. Here, using single-particle cryo-electron microscopy (cryo-EM), we report the high-resolution structure of full-length HGSNAT in a complex with acetyl-CoA. This is the first structure of a member of the transmembrane acyl transferase (TmAT) superfamily (classes 9.B.97 and 9.B.169 in the Transporter Classification Database (TCDB)) (Saier et al, 2021). The HGSNAT-acetyl-CoA complex structure presented here is in an open-to-lumen conformation and reveals critical cofactor binding amino acids in the HGSNAT active site. In addition, our structure also provides molecular insights into the impact of MPS IIIC-causing mutations on acetyl-CoA binding and the overall architecture of the protein.

Results

Purification of dimeric HGSNAT

Two groups have previously reported purification of HGSNAT in varying oligomeric forms (Durand et al, 2010; Fan et al, 2011; Feldhammer et al, 2009b). The first group used non-ionic surfactants, NP-40 and Triton X-100, in their purification experiments and observed dimers and hexamers of HGSNAT. The PEG-based headgroup in these detergents is relatively bulky and yields micelles of variable sizes (45-100 kDa) and has been known to cause sample heterogeneity in eukaryotic membrane proteins (Orwick-Rydmark et al, 2016). The second group purified monomeric HGSNAT using non-ionic detergent DDM, which forms a relatively uniform, albeit large, micelle (98kDa). However, the purification process involved two overnight incubation steps – the first in 1% and the second in 0.2% DDM. Both groups used transient transfection to express the protein in COS-7 or HeLa cells, respectively. All these factors could impact the monodispersity of purified HGSNAT. To identify the ideal conditions for expression and purification of monodisperse HGSNAT, we transfected HEK293 GnTI^- cells with plasmids expressing either N-terminal or C-terminal GFP-fusion of HGSNAT and monitored GFP fluorescence in detergent-solubilized HGSNAT lysates by fluorescence-detection size-exclusion chromatography (FSEC) (Kawate & Gouaux, 2006). We noticed that the position of GFP did not alter the expression of HGSNAT (Fig S1A). However, the HGSNAT dimer model predicted using ColabFold suggested that the C-termini of the protomers lie at the dimer interface (Mirdita et al, 2022). Thus, for large scale production of HGSNAT for structural studies, we expressed N-terminal StrepII-tag-GFP-HGSNAT fusion. Large quantities of GFP-HGSNAT fusion were produced using baculovirus mediated transduction of HEK293 GnTI^- cells (Goehring et al, 2014). We found that reducing the temperature of the suspension cell culture to 32°C at about 8-10 hr after transduction, or transient transfection, resulted in a better yield than continuing to grow cells at 37°C (Fig S1B). To identify the ideal conditions for the purification of HGSNAT, we screened various detergents. Interestingly, in most of the detergent conditions we tested, HGSNAT was predominantly dimeric (Fig S1C-H). Then, we performed a single-point thermal melt test of solubilized cell lysates to compare the relative stability of HGSNAT in different detergents. The rationale being that heating should exacerbate the instability of the protein in a said condition. HGSNAT appeared most stable in digitonin (Fig S1I-L). Based on our FSEC analysis of relative thermal stability and homogeneity, we purified HGSNAT in digitonin. We observed a dimer in our chromatography experiments and a band corresponding to a monomer on SDS-PAGE (Fig S1M-O).

Architecture of HGSNAT

We solved the structure of HGSNAT in complex with acetyl-CoA (HGSNAT-ACO complex) to a global resolution of 3.26 Å, with the transmembrane domain (TMD) being better resolved than the luminal domain (LD) (Fig S2 and Fig S3A-C). This is the first experimentally determined structure of a member of the transmembrane acetyltransferase protein family (TmAT). Our structure reveals that HGSNAT is a dimer where the protomers are related to each other by 2-fold rotational symmetry, and the C2 axis of rotation is perpendicular to the plane of the membrane (Figs 1A-1C). Each polypeptide of HGSNAT has an N-terminal LD followed by 11 TMs that comprise the TMD, which is ensconced in the lysosomal membrane. We could model all the secondary structure elements of the protein unambiguously except the first 48 amino acids including the N-terminal signal peptide and the cytoplasmic loop 1 (CL1; L183-L236). The regions that are poorly resolved are the β-turn that connects β7 and β8 (N140-E148) the C-terminal half of TM1 (V176-F181) (Fig 1A, Fig S3D, and Fig S3E). The C-terminus of the protein lies at the dimer interface but is unlikely to be directly involved in dimerization as the C-termini of the protomers lie ∼25 Å away from each other, pointing towards the central acetyl-CoA binding site (ACOS) (Fig 1B and Fig 1C).

Structure of HGSNAT.
Panels **(A)** and **(B)** show two different orientations of HGSNAT dimer that highlight (dashed lines) the LD-TMD interface and dimer interface respectively. Micelle is displayed in gray. Chain A is displayed as a cartoon and chain B as orange surface. All the luminal loops (LLs), cytosolic loops (CLs), and the loops that connect β-sheets are shown in black. The top and bottom sheets in the luminal domain (LD) are colored blue and gray, respectively. The two-fold rotation axis is displayed as a dashed line with an ellipsoid. **(C)** Luminal (top) and cytosolic (bottom) views of the protein. The surface representation of chain B suggests that the acetyl-CoA binding site (ACOS) is more accessible from the luminal side (top) than the cytosolic side (bottom). **(D)** 2D topology of HGSNAT and YeiB family. The helices and strands in the topology are colored similarly to the 3D structure. TMs 2-5 and 6-9 form two bundles (4+4), highlighted by green parallelograms, that are related to each other by a 2-fold rotation parallel to the plane of the membrane. TMs 1, 10, and 11 do not seem involved in this internal symmetry. TM10, interestingly, is bent in the plane of the membrane, allowing a chance for the two halves TM10a and TM10b to move independently of each other. The relative position of bound ACO and active site H269 of LL1 are indicated. **(E)** Luminal (top) and cytosolic (bottom) views of the protein topology. TMs 2-5 and TM10 enclose ACOS (red hexagon) and are referred to as catalytic core (blue dashed oval). TMs 6-9 will be referred to as scaffold domain (gray dashed oval). **(F)** 4+4 bundle formed by TMs 2-5 (black) and TMs 6-9 (gray) are related by a 2-fold rotation. The last sub-panel (bottom left) shows a superposition of TMs 2-5 on TMs 6-9.

Cryo-EM data collection, processing, and validation statistics

Transmembrane domain (TMD)

The TMD of HGSNAT protomer comprises of a central 4+4-fold, where the TMs 2-5 are related to TMs 6-9 by a 2-fold rotational pseudo-symmetry with the axis of rotation being perpendicular to the plane of the membrane (Figs 1D-F). TMs 2-5, along with TM10, form a ‘catalytic core’ and enclose the ACOS accessible via the cytosol and lumen along the dimer interface. ACOSs of the protomers lie on either side of the dimer interface axis. TMs 6-9, along with TM11, form a ‘scaffold domain’ separated from the dimer interface by the catalytic core (Fig 1E). The third luminal loop (LL3) connecting TM6 and TM7 forms a lid that limits the access to the cavity between the catalytic core and the scaffold domain from the luminal side (Figs 1A, 1C, and 1D). The TMD region comprising TM2-TM11 of HGSNAT is predicted to be evolutionarily conserved across HGSNATs from other kingdoms. Owing to the unique architecture we have named this novel fold as the transmembrane N-acetyltransferase (TNAT) fold (Fig S4A and Fig 4A). Although the resolution for the C-terminal half of TM1 is poor, it is sufficient to position the helix separately from the rest of the TMD (Fig 1A). TM1 is connected to TM2 by ∼50 amino acid long CL1 (Fig 1D). LL1-LL3 and LL5 form the boundaries for the luminal entrance of ACOS. The C-terminus, CL2, and a part of CL1 form the boundaries of the cytosolic entrance of ACOS. The ACOS in our structure is relatively more accessible from the luminal side than the cytosolic side, as the nucleoside head group of the bound acetyl-CoA blocks the cytosolic entrance of ACOS (Fig 1C). Thus, the HGSNAT-ACO complex is in a conformation open to the lumen (luminal-open; LO).

Luminal domain (LD)

The LDs of the protomers lie diagonally opposite to each other ∼45 Å away from the dimer interface and are not involved in dimerization (Fig 1B). Located between the N-terminal signal peptide and the TM1, LD is a ∼110 amino acid long β-sandwich made of two beta sheets of 4 strands each where strands β1, β4, β7, & β8 form the mixed β-sheet on top (Figs 1A-E, blue), and the strands β2, β3, β5, & β6 arrange as bottom anti-parallel β-sheet (Figs 1A-E, gray). The mixed β-sheet is arranged such that the order of the strands is β4-β1-β7-β8, with β1 & β7 being parallel. β8 of LD is connected to TM1 (Fig 1B, 1D, and 2A). LD has two predicted disulfide bonds – one in the β-turn that connects β2 and β3 (C76-C79), and the other between the strands β6 and β8 (C123-C151) holding the two sheets together. The resolution of the LD domain in our structure allows us to model only one (C76-C79) of these unambiguously (Fig 2B). HGSNAT is produced as a pro-protein that is believed to get proteolyzed into two fragments, HGSNAT-α and HGSNAT-β, of unequal sizes, which remain together (Durand et al, 2010; Fan et al, 2011). It is unclear if HGSNAT-α is made of just LD or LD and TM1, and if HGSNAT-β is made of all TMs or only TMs2-11 (Fig 2A and Fig S4). While our structure has relatively poor local resolution at the predicted protease sites, making it difficult to map them unambiguously, the FSEC nor the SDS-PAGE analyses of the purified protein indicate that the recombinant HGSNAT is not proteolyzed (Fig S1). We believe the relatively low local resolution of LD is because of the flexibility introduced by GFP fused to the N-term of HGSNAT.

Domain organization, and LD-TMD and dimer interfaces of HGSNAT.
**(A)** HGSNAT is predicted to be proteolyzed into two chains of unequal size - α-HGSNAT (dark magenta cartoon, gray shaded area) and β-HGSNAT (purple cartoon, yellow shaded area). The site for proteolysis remains debated. Based on our structure and prediction of HGSNAT structures from other kingdoms (Fig S4), we have represented α- and β-HGSNAT fragments as shown in panel A. The inset (dashed oval) shows the luminal domain (dark magenta) fit to cryo-EM density (blue; display level 0.21 of the composite map in ChimeraX) (Fig S3). The lysosomal membrane is shown as a dashed gray line. **(B)** LD-TMD interface is highlighted (dashed line). Inset highlights the residues that interact at the LD-TMD interface, and cryo-EM density for the same (blue; display level 0.25 of the 3.26 Å C2 refined map in ChimeraX). C76-C79 disulfide of β2-β3 turn is shown as yellow sticks, while the residue sidechains are colored the same as their secondary structure elements, with heteroatoms highlighted. **(C)** Luminal-view of the protein with dimer interface highlighted (dashed line). Inset (dashed rectangle) highlights LL2 and LL5 that line the dimer interface, and the C334-C334 inter-chain disulfide (yellow) between the chains A (purple) and B (orange). The dashed oval inset shows one-half of the dimer interface with LL2 and LL5 of chains A and B, respectively, contributing other hydrophobic interactions that stabilize the dimer interface. The cryo-EM density in panel C is displayed as blue mesh (display level 0.22 of the C2 refine map in ChimeraX).

LD-TMD interface

LD interacts with TMD at multiple sites via an extensive interaction network that spans ∼35 Å, comprised of a salt bridge (D52-H605), hydrogen bonds, and dipole-dipole and hydrophobic interactions (Fig 2B and Fig S6A). The N-terminus of TM11 interacts with the N-terminus of β1 and LL1. LL1 also interacts with the N-terminus of TM1 and the β4-β5 turn. The β2-β3 turn, including the C76-C79 disulfide, interacts extensively with LL3 and a part of LL4. The β2-β3 turn-LL3-LL4 interaction is also stabilized by a 3π-network formed by the stacking of Y77, H78, & F428 side chains. The hydrogen bond network that stabilizes the LD-TMD interaction is formed between the amino acids M51, H78, and S162 of LD with H605 and Q603, D427 and F428, and Y264 of the TMD respectively (Figs 2B and S6A). The LD-TMD interface is separated from the dimer interface by the central catalytic core (Figs 2B and 2C).

Dimer interface

The dimer interface is spread across ∼ 35 Å towards the luminal side of the protein, perpendicular to the C2 rotation axis (Fig 2C). Although the TMs 4, 10, and 11 from both protomers lie on either side of the dimer interface, they do not directly interact to stabilize the dimer interface. The primary mediator of dimerization is the disulfide between C334s within the LL2s of each protomer (Fig 2C, rectangle inset). In addition to the C334-C334 disulfide, the dimer interface is also stabilized by an extensive π-π interaction network formed between the aromatic residues of LL2 (332-340) of one protomer and the aromatic residues of LL5 (588-598) of the other protomer (Fig 2C and Fig S6B). Notably, the aromatic bulky side chains at the interface are buried in the membrane, and the hydrophilic residues face the lumen (Fig 2C, oval inset). Y333, L338, and S339 of LL2 from one protomer form a series of hydrogen bonds with F593 and K598 of LL5 of the other protomer, which adds to the stabilization of the dimer interface (Figs 2C and S6B). In all the detergents we tested, HGSNAT eluted as a dimer, a testimony to the extensive side-chain interaction network. The 2-fold rotational symmetry between the protomers juxtaposes the ACOSs of protomers connecting them with each other on the cytosolic side (Figs 1 and 2C). This interconnected space is partitioned by lipids, preventing the diffusion of ligands from one active site to the other within a dimer (Fig S7).

Acetyl-CoA binding site (ACOS)

We used MOLEonline to predict the presence of a ∼75Å pore within the TMD, that begins at the cytosolic side and extends to the luminal side (Pravda et al, 2018) (Fig S5A). The pore predicted by MOLEonline is lined by highly conserved residues of the TMs 2-5 and TM 10 (Fig 3 and Fig S5C). Interestingly, helix TM10 is bent by ∼80° such that the luminal-side 2/3^rd (TM10b) of the helix is parallel to TM2 and the cytosolic-side 1/3^rd (TM10a) of the helix protrudes between TM2 and TM11 (Fig 3A). The bending of TM10 is aided by P575 and G576, residues known to induce helix breaks and kinks (Javadpour et al, 1999; Ulmschneider & Sansom, 2001). We find a density in our cryo-EM map overlapping the MOLEonline prediction in both protomers, spanning the entire length of TMD, that allowed us to unambiguously model a single acetyl-CoA, such that the 3’, 5’-ADP nucleoside head group is towards the cytosol and the acetyl group is hydrogen bonded (∼3.5Å) to N258 of TM2 that is open to the lumen. On the cytosolic side, the orientation of TM10a between TM2 and TM11 seemed to have allowed the positioning of the 3’, 5’-ADP nucleoside head group in the space created by bending away of TM10a from the central axis of the catalytic core. The catalytic H269, which is predicted to be acetylated during the acetyltransferase reaction, is ∼4.5 Å away from the acetyl group of acetyl-CoA, nestled within the negative charge on the luminal opening, and is ergonomically positioned to perform acetyltransferase reaction in the presence of the acetyl group acceptor (Fig 3A). The binding pocket, like the rest of the protein, shows a polarity of charge. As we move from the cytosolic side to the luminal side, the charge changes from positive to neutral to negative (Figs 3B, S5A, D). The cytosolic opening of the site is comprised of basic and nonpolar amino acids, making it an ideal pocket for binding the 3’, 5’-ADP head group. The pantothenate group of acetyl-CoA is supported by a network of nonpolar amino acids at the center of the binding pocket (Figs S5A, D). A series of conserved salt bridges stabilize the cytosolic and luminal entrances of the pore. The luminal entrance is lined with salt bridges between the residues H269, D279, R344, D469, E471, H586, and E587. The cytosolic entrance is lined with salt bridges between the residues R239, D244, R247, R317, E363, K491, and K634 (Figs S5C, D). A portion of CL1 towards the TM2 and C-terminus of the protein also seems to be a part of the cytosolic entrance of the ACOS.

Conservation and homology

Sequence conservation between HGSNAT and other known acetyl-CoA binding proteins or transferases is poor. There are no structural homologs of HGSNAT, and a search within the database of known structures using the HGSNAT TMD by the Dali server resulted in hits with sequence identity of <17% across alignments of <25% sequence length (Holm et al, 2023). Even the search with just the luminal domain, which appeared to be a classic two-sheet β-sandwich, yields hits with <15% identity over alignments of at least 70% sequence lengths (Table S1, Fig S4 C, D). We used ModelAngelo, a machine-learning based de novo automatic model building algorithm to build initial HGSNAT model into the cryo-EM density (Jamali et al, 2023). Interestingly, the model built by ModelAngelo into the experimental data superposed well with the HGSNAT model as predicted by AlphaFold with a Cα RMSD of ∼1.7 Å over 533 amino acids (Fig S4A) (Jumper et al, 2021; Varadi et al, 2022). However, unlike the AlphaFold model of Isoform 1, our expression construct lacks the extended signal peptide (N-terminal 28 amino acids). In addition, we do not observe any density to model CL1 that connects TM1 and TM2. HGSNATs of representatives from archaea, bacteria, and plants suggest that LD and TM1 are absent in other kingdoms. These homologs superpose onto the TMD part of HGSNAT, especially TM2-11 with Cα RMSDs of 1-1.4 Å over 350 amino acids (Fig S4). TmAT superfamily of membrane proteins consists of two subfamilies of integral membrane proteins consisting of 8-12 TMs – the acyltransferase-3/acetyl-CoA transporter (ATAT) family (TCDB: 9.B.97) and the integral membrane acetyltransferase (YeiB) family (TCDB: 9.B.169). HGSNAT belongs to the YeiB family. A meaningful sequence alignment could not be generated between the members of these two families. ATAT family of transporters seem to have a single extra-membranous domain, like HGSNAT (Fig S4B). However, based on AlphaFold prediction, this extra-membranous domain seems to be a α-β-α sandwich with a single 5-stranded β-sheet sandwiched between two helical domains. Interestingly, a bent helix near the predicted acetyl-CoA binding site is observed in the ATAT family, suggesting that this could be a conserved feature amongst the members of TmAT superfamily (Newman et al, 2023). We used ConSurf to estimate sequence conservation in HGSNAT based on an automatic sequence alignment algorithm (Ashkenazy et al, 2016). It appears that the LD and TM1 regions have poor sequence conservation compared to the rest of the protein (Fig 4A).

Molecular basis for MPS IIIC mutation-induced dysfunction.
**(A)** Evolutionary sequence conservation of HGSNAT. Amino acids are color coded according to the conservation scores generated by ConSurf webserver using a Clustal multiple sequence alignment of homologs identified by PSI-BLAST (Ashkenazy *et al,* 2016). The positions of the mutations - missense (orange), nonsense (black), and polymorphisms (purple) – are indicated on the sequence by triangles. **(B)** MPS IIIC-causing mutations mapped on the HGSNAT structure. The color coding of the positions is the same as in panel A. Some of the missense mutants are highlighted in the insets (dashed ovals). We grouped them based on their position within the protein – LD-TMD interface, catalytic core, scaffold domain, and other C-terminal mutations. The insets show the 3D environment of the mutant sites on the wild-type HGSNAT color coded as per their evolutionary sequence conservation scores, and the potential disturbance to it caused by the mutation (orange side chains). The coordinates for mutant side chains were generated based on wild-type HGSNAT structure as input in FoldX webserver (Schymkowitz *et al,* 2005).

The basis for mutation-induced dysfunction and destabilization of HGSNAT

We mapped the known clinical mutations (missense, nonsense, and polymorphisms) of HGSNAT onto its three-dimensional structure (Canals et al, 2011; Fan et al, 2006; Fedele & Hopwood, 2010; Feldhammer et al, 2009a; Feldhammer et al, 2009b; Hrebicek et al, 2006; Huizing & Gahl, 2020). Almost all the mutations fall within the conserved regions on HGSNAT (Fig 4A). The nonsense mutations seem to be located more on the cytosolic side, and the missense mutations seem to be populated on the luminal side of HGSNAT (Fig 4B). We estimated the extent of destabilization or stabilization introduced by these variants on HGSNAT structure by FoldX, a force field algorithm to evaluate the effect of mutations on the stability and dynamics of proteins (Table S2) (Schymkowitz et al, 2005). We classified these mutations into four groups based on their position on the structure and found that most destabilizing mutations appear to be concentrated near the LD-TMD interface (Fig 4 and Table S2). Our structure provides the basis for destabilization or dysfunction induced by missense mutations in HGSNAT.

Catalytic core mutations

Five missense mutations - E471K (LL3), R344C/R344H (LL2), P283L (TM3), and N258I (TM2) - have been identified in the residues that line the ACOS (Figs 4B, 3A, and S5C). E471 is close to the acetyl group of acetyl-CoA, and acidic to basic side chain substitution in an E471K mutation could impact the binding affinity of acetyl-CoA. N258 side chain can act as both hydrogen acceptor or donor and is hydrogen bonded to acetyl-CoA in the structure. N258I mutation could directly impact the binding of acetyl-CoA and acetyltransferase activity. R344 forms salt bridges with E469 and E471 that stabilize the luminal entrance of ACOS. The mutations of R344C/R344H could affect the integrity of the luminal entrance of ACOS. In addition to being involved in acetyl-CoA binding, all these three positions – E471, R344, and N258 – are also highly conserved (Fig 4A, B). P283 is not directly involved in the binding of acetyl-CoA. However, the residues flanking P283 – V281, F282, F285, I288, and M289 – all interact with the pantothenate group of acetyl-CoA (Fig 3A and Fig S5D). Proline residues define helix conformation, and mutation of a relatively conserved proline to an aliphatic leucine could alter the TM3 conformation on the luminal side and thus affect acetyl-CoA binding. FoldX prediction indicates that P283L is the most destabilizing of all ACOS mutations perhaps because it impacts the TM3 conformation, while others show a potential to only impact the binding of acetyl-CoA.

Mutations at the LD-TMD interface

C76F, G262R, N273K, G423W, and G424S/G424V are all missense mutations that are on the LD-TMD interface, and our FoldX based analysis indicates these mutations to be the most destabilizing among the ones that we listed. N273K is predicted to be least destabilizing by FoldX, as all these mutations, except N273K, result in charge reversal and drastic change in the side chain size, leading to steric clashes and breakdown of existing interactions (Fig 4B and Fig 2B). Although none of these missense positions at the LD-TMD interface, except C76, are directly involved in LD-TMD interface contacts, the drastic side chain changes in the vicinity of the residues directly involved are expected to destabilize the interaction (Fig 2 and Fig S6). For example, the glycine residues (G262, G423, and G424) lie within pockets lined by aromatic side chain containing amino acids, and the substitution of such a residue with a bulky side chain will cause a steric clash, destabilizing those pockets of interaction (Fig 4B).

Scaffold domain mutations

W403C, M482K, G486E, A489E, S518F, S539C, and S541L are all mutations that occur in the scaffold domain (TM6-TM9) of HGSNAT. FoldX predicts these mutations to be only mildly destabilizing. For example, the change in the charge and size of S539C or W403C mutation is not drastic enough to destabilize TM6 or TM9 helices. Even in the cases where there are large substitutions, for example, G486E or A489E, the substituted side chains face away from the core of the protein and are involved in minimal interactions or clashes (Fig 4B).

Other mutations

Mutations in the LD domain, such as L113P, G133A, and L137P, seem to face the lumen and are involved in minimal interactions. Interestingly, within the mutations that we analyzed, none of them were on the dimer interface. However, two mutations, P571L and Y627C, towards the cytosolic ends of TM10 and TM11 seem closer to the dimer interface on the cytosolic side. However, they are not directly involved in dimerization. These ring side chain amino acids are part of an elaborate π-π-network stabilizing TM10a and the C-terminus in positions that make space for the nucleoside head group at the ACOS (Fig 4B and Fig 3A). Drastic mutations in this region could directly impact acetyl-CoA binding or conformational transition of the protein to a different conformation.

Mechanism of acetyltransferase reaction

HGSNAT, in this report, was purified at pH 7.5. During the purification process, we did not add acetyl-CoA to the buffer. It has been demonstrated, previously, that acetyl-CoA binding happens on the cytosolic side and is optimal around pH 7.0-8.0, and the acetyl transfer activity happens on the luminal side and is optimal at pH 5.5-6.0. The K_m of N-acetyltransferase reaction in the presence of acetyl-CoA as the substrate is in the 0.2-0.6 mM range at pH 5-5.8, and 2.5-20 μM range at pH 7.0, suggesting that HGSNAT has a high affinity for acetyl-CoA at pH >7.0 (Bame & Rome, 1985, 1986a; Meikle et al, 1995). The transfer of the acetyl group from acetyl-CoA to the terminal non-reducing amino group of α-D-glucosamine is believed to be catalyzed by H269 located on the luminal entrance of ACOS (Fig 3A and Fig S5). It is unclear if H269 gets acetylated and forms a stable acetylated intermediate in the process of acetyl transfer. However, for the H269 to be acetylated or for it to transfer the acetyl group to its putative acceptor, protonation of the imidazole sidechain amine group is necessary. The catalytic H269 is primarily uncharged at pH 7.5, and the acetyl-CoA remains an acetate ion. The carboxamide group of N258 can act as both an acceptor and donor in a hydrogen bond. In our structure, ACOS is more accessible from the luminal side than the cytosolic entrance. We note that the acetyl-CoA is hydrogen-bonded to N258 (Fig 3A and Fig S5D). Based on these observations, we conclude that the structure we have is of an acetyl-CoA primed HGSNAT waiting for the protonation of H269 and the availability of an acetyl group acceptor to carry out acetyltransferase reaction (Fig 5). Because the cytosolic pH is favorable for acetyl-CoA binding and the protein we purified endogenously pulled down acetyl-CoA with it, we believe this is the most stable open-to-lumen conformation of HGSNAT in the absence of an acetyl group acceptor.

Proposed mechanism of acetyl transfer by HGSNAT.
HGSNAT catalyzes a bisubstrate reaction. Enzyme-catalyzed bisubstrate reactions could either be sequential reactions (top) or ping pong reactions (bottom), depending on the order of binding and release of substrates and products. The mechanism of bisubstrate reaction catalyzed by HGSNAT to transfer acetyl group from cytosolic acetyl-CoA (red lightning) to terminal non-reducing α-D-Glucosamine (blue hexagon) of luminal heparan sulfate has been a longstanding debate. After the acetyl group transfer, CoA (gray lightning) and acetylated glucosamine (red hexagon) are believed to be released to cytosol and lumen respectively. We believe that the acetyl-CoA bound HGSNAT structure presented in this work (dashed box) is in a cofactor primed open-to-lumen conformation which could proceed by either of the two bisubstrate reaction mechanisms. However, the reaction schema of both mechanisms indicates that a stable acetyl-CoA and HGSNAT complex is an indicative of the enzyme favoring Ping Pong Bi Bi mechanism of action. We hypothesize that both TMD, especially TM10, and LD undergo conspicuous conformational changes, as they transit through the reaction cycle, to make cytosolic and luminal sides of HGSNAT accessible via cytosol and lumen respectively. The function of LD is unclear, and we believe it plays essential role in recognition of substrate and its positioning at the active site.

Discussion

Direct involvement of acetyl-CoA in the heparan sulfate degradation was shown in the early 1980s, by incubating the extracted intact lysosomes with radiolabeled acetyl-CoA and by subsequently monitoring the incorporation of radiolabeled acetate into lysosomal HS (Rome & Crain, 1981; Rome et al, 1983). Using purified lysosomal membranes, Rome and colleagues showed that the acetyl-CoA dependent N-acetyltransferase activity, which is required for acetylation of HS, resides on the lysosomal membrane. In these studies, they also suggested that the HS acetylation reaction occurs in two steps - acetylation of the lysosomal membranes and transfer of acetyl from the membranes to HS. By performing acetylation and acetyltransferase reactions at different pH, and in the presence of different amino acid modification reagents, they showed that the acetylation process is optimal at pH 7, acetyltransferase activity is optimal at pH 5.5, and the amino acid on the lysosomal membrane that gets acetylated is a histidine. They proposed that the N-acetyltransferase activity follows a Ping Pong Bi Bi mechanism of reaction, where acetyl-CoA from the cytosol acetylates the active site histidine of the N-acetyltransferase located on the lysosomal membrane, inducing a conformational change in the transmembrane enzyme that enables the access of active site histidine from the luminal side (Fig 5). Due to the change in pH of the active site cavity, the acetyl-histidine interaction is destabilized, and the acetyl group is transferred to the α-D-glucosaminide on the luminal side (Bame & Rome, 1985, 1986a, b).

Pshezhetsky and colleagues showed that acetylation of lysosomal membranes expressing N-acetyltransferase activity occurred even in the absence of the acetyl group acceptor, suggesting the formation of an acetylated enzyme intermediate (Ausseil et al, 2006; Durand et al, 2010). Here, we purified HGSNAT at a pH favorable for acetyl-CoA binding, but not acetyl transfer reaction. In our structure, we find that the HGSNAT ACOS is more accessible via the lumen as opposed to the cytosol and that the His269 of LL1 is not acetylated. Instead, we find an acetyl-CoA molecule in the ACOS, hydrogen bonded to N258 of TM2, a residue within a close vicinity (∼ 5 Å) of H269 (Fig 3 and Fig S5). We believe N258 holds onto acetyl-CoA and aids H269 in the acetyltransferase reaction, at low pH and in the availability of acetyl group acceptor.

Meikle and colleagues argued that the N-acetyltransferase reaction occurs via a random-order mechanism (Fig 5). They demonstrated that both N-acetyl-α-D-glucosamine and CoA are required for the reverse reaction, suggesting that the formation of a ternary complex of acetyl-CoA, α-D-glucosamine, and the enzyme enables the reaction to proceed in a single step without the requirement of an acetylated-enzyme intermediate. They also report two K_m values, that differ by 5-10 times, for both acetyl-CoA and glucosamine substrates, suggesting that both protomers, perhaps, bind to the substrates at varying affinity and only one of the protomers is catalyzing the acetyl transfer at any given point of time (Meikle et al, 1995). Mahuran and colleagues showed that acetylated enzyme intermediate could not be identified by affinity pulldown of purified transmembrane N-acetyltransferase after incubating it with [³H] acetyl-CoA and proposed that the formation of a stable acetylated enzyme intermediate is not required for the reaction to proceed (Fan et al, 2011). Our structure does not provide evidence of H269 acetylation (Fig 3). However, we observe that acetyl-CoA binds tightly to HGSNAT endogenously resulting in a stable HGSNAT-acetyl-CoA intermediate that stayed as an intact complex all through the affinity purification, size-exclusion chromatography, and cryo-EM sample preparation steps even without the addition of exogenous acetyl-CoA to any of the buffers. Perhaps, this tight endogenous binding of HGSNAT is the reason why Mahuran and colleagues could not observe labeling of purified transmembrane N-acetyltransferase by [³H] acetyl-CoA. Interestingly, the structure that we obtained does not by itself settle the debate of HGSNAT mediated acetylation following a Ping Pong Bi Bi mechanism or a random-order mechanism. Our structure is in a conformation that could proceed through either of these two reaction mechanisms to completion (Fig 5, dotted rectangle). A structure of HGSNAT at pH 5.0 in the absence of an acetyl group acceptor will demonstrate if the enzyme gets acetylated during the reaction.

HGSNAT is believed to be produced as a pro-protein that gets proteolytically processed into its mature form where two fragments of unequal sizes, the α-HGSNAT and β-HGSNAT, continue to stay together because of a disulfide bond mediated interaction between C123 (β6) in the N-terminal luminal domain (LD) of the α-HGSNAT and C434 in the LL3 of the C-terminal transmembrane domain (TMD) of β-HGSNAT (Durand et al, 2010). However, according to the structure, these amino acids are ∼ 34 Å apart making such a bond impossibility in the acetyl-CoA bound HGSNAT conformation reported here (Fig 1A, 1D and 2B). Moreover, we notice that C434 is disulfide bonded to C415 on LL3. C123 of β6 is predicted to form a disulfide bond with C151 of β8, and we do not see density to model such a disulfide. A bond between C123 of LD and C434 of LL3 could be a possibility upon drastic conformational change during the reaction cycle that brings LD closer to the LL3. Thus, we believe that the α-HGSNAT and β-HGSNAT stay together because of a series of hydrogen bonds, dipole-dipole and hydrophobic interactions between β2-β3 turn, β4-β5 turn, LL1, LL3, and LL4 (Fig 2 and Fig S6).

The protease that cleaves HGSNAT into α-HGSNAT and β-HGSNAT is unknown, and the site for proteolysis is unclear. Fan et al, proposed that the site of proteolysis is between amino acids N144 and G145 in the LD (Fan et al, 2011). N144-G145 is located on the β7-β8 turn of the LD, a region poorly resolved in our structure (Fig 1A and Fig 2A). Although proteolysis at this position will cleave the LD into two parts (β1-7 and β8), the LD should remain attached to TMD because of extensive network of interactions along the LD-TMD interface (Fig 2B and Fig S6). Durand et al, proposed that the site of proteolysis is between the end of the luminal domain and beginning of LL1 (Durand et al, 2010). Sequence-based protease site prediction in Procleave and Prosper servers, using the sequence between β8 of LD and LL1, suggests multiple potential sites of proteolysis, for example, L154-A155 at the C-term of β8, S162-N163 in the loop connecting LD to TM1, F170-L171 at the N-term of TM1, L187-S188 and W231-R232 at the N- and C-term of the CL1 (Li et al, 2020; Song et al, 2012). It is hard to speculate which of these are most probable, as the lysosomes are rich in aspartic, serine, and cysteine proteases, including intra-membrane proteases that are involved in protein degradation and proprotein processing. In addition, some matrix metalloproteinases have also been shown to act in intracellular compartments and have been implicated in MPS diseases (Batzios et al, 2012; Jobin et al, 2017; Muller et al, 2012; Schroder & Saftig, 2016; Turk et al, 2000). The recombinant HGSNAT that we produced is a non-proteolyzed form of HGSNAT, as we see a band corresponding to full-length HGSNAT in our SDS-PAGE analysis and a peak corresponding to an intact full-length dimer in our FSEC analysis. HGSNAT is encoded by 18 exons, where the first 6 exons encode an N-terminal luminal domain (LD) & TM1, which are only present in the metazoans. The remaining 10 TMs and the C-terminus are well conserved in archaea, bacteria, and plants (Fan et al, 2006; Hrebicek et al, 2006). Based on the evolutionary sequence and structure conservation and conformational flexibility observed in CL1, we speculate that LD & TM1 together form α-HGSNAT and TMs 2-11 form β-HGSNAT (Fig 2A, Fig 4A and Fig S4A).

There are no known homologs of the LD, and even the strand arrangement in two sheets appears to be rare (Koch et al, 1992). A structure-based search in the Dali server lists hits with <15% identity, with immunoglobulin-like & transthyretin folds as top hits (Table S1) (Beale et al, 2015; Felisberto-Rodrigues et al, 2011; Parker et al, 2021; Rajasekar et al, 2019). Unlike the LD, a typical immunoglobulin-like fold is a disulfide-containing β-sandwich made of 7-9 β-strands arranged in two anti-parallel β-sheets, with a conserved core formed by four strands β2, β3, β5, & β6 (Fig S4C and S4D). In the Ig-like fold, the β2 and β5 strands are in one sheet, and the β3 and β6 strands are in the second one, with β2 and β6 being linked by a disulfide bond. The remaining 3-5 strands comprise the remaining variable region of the Ig-like fold (Bork et al, 1994; Chidyausiku et al, 2022). Although the % identity of LD and the transthyretin fold containing hits is <15%, the strand composition of the two sheets that make the β-sandwich is identical. However, a typical transthyretin fold contains a conserved short α-helix between β5 & β6 and often does not contain disulfide bonds (Fig S4C, D) (Hornberg et al, 2000). Another β-sandwich that shows a similar secondary structure assignment as LD is the type-II C2 domain, where the top sheet is made of β1, β4, β7, & β8 strands while the bottom sheet is made of β2, β3, β5, & β6 (Hirano et al, 2019; Kretsinger et al, 2013). However, both sheets in the type-II C2 domain are anti-parallel, and the arrangement of the top strand is β4-β1-β8-β7. In LD, one sheet is anti-parallel, and the other is a mixed sheet where the arrangement in the top sheet is β4-β1-β7-β8 (Fig S4C and S4D). Because of these non-trivial dissimilarities of LD with three major types of β-sandwiches, we believe that LD is a new fold of β-sandwich that could be best described as a ‘transthyretin-like’ fold. In fact, two proteins in the top hits from a structure-based similarity search on Dali server show the same sheet arrangement as LD (Table S1 and Fig S4D). One of these proteins is AlgF (PDB ID: 6CZT), an adaptor protein from Pseudomonas believed to be involved in O-acetylation of alginate exopolysaccharides and the other is an SPH (self-incompatibility protein homolog) domain (PDB ID: 6G7G) that is described as transthyretin-like and is believed to be a plant secreted protein involved in cell death (Fig S4D).

The function of LD remains unknown. DeepSite predicts the presence of two ligand binding sites in HGSNAT, one that overlaps with the ACOS and the other in the cavity between the two sheets of LD (Fig S5B) (Jimenez et al, 2017). However, it is unclear if LD binds to a ligand and/or a metal ion. It has been shown that the N-acetyltransferase activity of HGSNAT is enhanced in the presence of anionic phospholipids (Fan et al, 2011). However, we expect lipids to bind towards the cytosolic side at the dimer interface. We see unexplained density in these regions that could be best explained as ordered lipids or digitonin (Fig S7). A recent report of a computational modeling and molecular dynamics simulation study conducted on a model of OafB, a bacterial O-antigen modifying transmembrane acetyltransferase of the ATAT family within the TmAT superfamily, showed that the periplasmic SGNH domain undergoes large conformational changes and aid in O-antigen acetylation (Newman et al, 2023). OafB has two domains – the transmembrane acetyltransferase-3 domain (AT-3) and periplasmic SGNH domain. Although the domain organization is similar to HGSNAT, there are marked differences in the structure (Fig S4B). AT-3 domain and HGSNAT TMD are not homologous, even though they share a similar architecture of the ACOS. The LD of HGSNAT is different from the SGNH domain, in the sense that LD is a β-sandwich at the N-terminus of the protein while SGNH is a α-β-α sandwich with a single β-sheet sandwiched between two helical domains at the C-terminus of the protein. There are non-trivial differences between the predicted structures of ATAT and YeiB family members. However, it is possible that the TMDs and the extra-membranous domains function similarly. It is likely that LD of HGSNAT binds to HS, stabilizing the terminal non-reducing sugar of HS near the luminal side of ACOS for catalysis and remains to be structurally investigated. In fact, it has been shown that as the size of acetyl group acceptor was increased in an acetyltransferase reaction from mono-to di-to tetra-saccharide, the K_m values decreased from 0.6 mM to 7 μM (Meikle et al, 1995).

The high-resolution structure of HGSNAT reported in this study heralds a new beginning for structural exploration of the TmAT superfamily. While the conformation of HGSNAT observed in our cryo-EM studies does not put to rest the debate on the kind of bi-substrate reaction mechanism, it favors the Ping Pong Bi Bi mechanism. Also, our structure underscores the requirement to characterize the remaining states of the enzymatic reaction. The molecular basis we delineated for the mutation-induced dysfunction in HGSNAT will serve as a blueprint for the structure-based design of novel therapeutic modulators that could rescue the function of milder variants.

Materials and Methods

Cloning and site-directed mutagenesis

The codon optimized gene encoding the isoform 2 of full-length human HGSNAT was synthesized by GenScript. The synthesized gene was then cloned into the pEG BacMam expression vector (Addgene plasmid # 160683) between EcoRI and NotI restriction sites, to be expressed via baculoviral transduction in HEK293S GnTI^- cells (ATCC # CRL-3022) as a fusion protein containing an N-terminal Strep-tag-II-GFP. The integrity of the clone was confirmed by Sanger sequencing (Plasmidsaurus, OR).

Cell culturing, transient transfection, and transduction

Adherent HEK293S GnTI^- cells were grown in Dulbecco’s Modified Eagle Medium (DMEM, Gibco) supplemented with 10% fetal bovine serum (FBS, Gibco) at 37°C. Immediately preceding transfection, the cells were washed with 1X PBS (Gibco) and supplied with fresh pre-warmed DMEM containing 10% FBS. 1×10⁶ cells were transfected with 1μg DNA using TurboFect (Thermo Fisher Scientific) suspended in serum-free DMEM as suggested by the manufacturer’s protocol. The transfected cells were grown at 37°C and 5% CO₂ for 8-10 hr. The cell-culture media was then replaced with fresh pre-warmed DMEM containing 10% FBS and 10 mM of sodium butyrate, and the cells were grown at 32°C and 5% CO₂ for an additional 24-36 hr, before harvesting. All transfections were done at 80% confluency. Transfected cells were used to screen for ideal expression and purification conditions.

Baculovirus preparation was done as described by Goehring and colleagues (Goehring et al, 2014). Briefly, DH10Bac cells (Thermo Fisher Scientific) were transformed with an HGSNAT expression vector, and lacZ^- colonies were selected on gentamycin-kanamycin-tetracycline LB agar plates for bacmid DNA isolation. 1×10⁶ adherent Sf9 cells (ATCC # 12659017) grown in serum-free Sf-900 III media (Gibco) were transfected with 1 μg of bacmid using Cellfectin II reagent (Gibco) per the manufacturer’s protocol. Transfected cells were grown at 27°C for 96 hr. The supernatant media from the cells was harvested, filtered through a 0.2 μm filter, and stored as P1 virus. 100 μL of P1 virus was added to 1L of Sf9 cells at a cell density of 1×10⁶/mL in serum-free Sf-900 III media. The cells were grown at 96 hr at 27°C while shaking at 120 rpm. The cells were spun down at 4000 xg for 20 min, and the supernatant media was filtered through a 0.2 μm filter and stored as P2 baculovirus. P2 baculovirus was used for large-scale transduction of HEK293S GnTI^- cells. Large-scale expression of HGSNAT was done by baculoviral transduction of HEK293S GnTI^- cells suspended in FreeStyle 293 expression media containing 2% FBS. Mammalian cell culture at a density of 3×10⁶ cells/mL was transduced using P2 baculovirus at a multiplicity of infection of 1.5-2 and was incubated on an orbital shaker at 37°C and 5% CO₂ for 8-10 hr. The cells were supplemented with 10 mM sodium butyrate and were then incubated in a shaker at 32°C and 5% CO₂ for an additional 38-40 hr. The cells were harvested by centrifugation at 4000 xg for 10 min, and the cell pellet was stored at −80°C until further use.

Fluorescence-detection size-exclusion chromatography (FSEC) analysis

To identify suitable conditions for large-scale expression and solubilization of HGSNAT, we employed FSEC (Kawate & Gouaux, 2006). Briefly, 100,000 transfected cells were solubilized in 500 μl of 1 % detergent, 25 mM Tris-HCl, pH 7.5, 200 mM NaCl, 1 mM PMSF, 0.8 μM aprotinin, 2 μg/mL leupeptin, and 2 μM pepstatin A at 4°C for 1 hr on an end-end rotator. After solubilization, the lysate was centrifuged at 185,000 xg for 1 hr at 4°C. The supernatant was filtered through 0.45 μm filter, and 100 μl filtrate was analyzed on a Superose 6 Increase 10/300 GL column (Cytiva Life Sciences) pre-equilibrated with 0.15 mM LMNG, 25 mM Tris-HCl, pH 7.5, and 200 mM NaCl. GFP fluorescence in the eluate was monitored by a fluorometer (Shimadzu scientific instruments) set at Ex/Em of 485/510 nm. To test the stability of HGSNAT in various conditions, the solubilized samples were heated at desired temperature for 15 min, centrifuged at 10,000 rpm for 10 min, filtered through 0.45 μm filter, and were analyzed again on a Superose 6 Increase 10/300 GL column.

Purification of HGSNAT

Cell debris from the sonicated lysate of HEK293S GnTI^- cells expressing N-terminal GFP fusion of HGSNAT was removed by centrifugation at 2400 xg for 10 min. The supernatant from the low-speed centrifugation step was subjected to ultra-centrifugation at 185,000 xg for 1 hr at 4°C to harvest membranes. Membranes were resuspended using a dounce homogenizer in suspension buffer (25 mM Tris-HCl, pH 7.5, 200 mM NaCl, 1 mM PMSF, 0.8 μM aprotinin, 2 μg/mL leupeptin, and 2 μM pepstatin A). To this membrane suspension, an equal volume of 2% digitonin solution prepared in membrane suspension buffer was added. The membrane suspension was solubilized for 90 min. at 4°C. The solubilized membrane suspension was centrifuged at 185,000 xg for 1 hr at 4°C. The solubilized supernatant was passed through Strep-Tactin affinity resin (IBA Life Sciences) column at a flow rate of ∼0.3-0.5 ml/min. Affinity resin saturated with HGSNAT was washed with 5-6 column volumes of 0.5% digitonin, 25 mM Tris-HCl, pH 7.5, and 200 mM NaCl. The bound protein was eluted using 5 mM D-desthiobiotin prepared in the wash buffer. The purity and homogeneity of purified HGSNAT was confirmed by performing FSEC and SDS-PAGE. The amino acid sequence of purified HGSNAT was verified by peptide mass fingerprinting of the bands on SDS-PAGE. For FSEC experiments, ∼10 μL of elution fractions were loaded onto a Superose 6 Increase 10/300 GL column pre-equilibrated with 0.15 mM LMNG, 25 mM Tris-HCl, pH 7.5, and 200 mM NaCl. The elution fractions containing homogeneous and pure protein were pooled and concentrated to 2 mg/mL. The concentrated protein was further purified by size-exclusion chromatography (SEC) using Superose 6 Increase 10/300 GL column pre-equilibrated with 0.5% digitonin, 25 mM Tris-HCl, pH 7.5, and 200 mM NaCl. HGSNAT corresponding to the dimeric peak from the SEC step was pooled and concentrated to 0.9 mg/mL for cryo-EM sample preparation.

Cryo-EM sample preparation and data collection

UltrAuFoil holey-gold 300 mesh 1.2/1.3 μm size/hole space grids (UltrAuFoil, Quantifoil) were glow discharged, using a PELCO glow discharger, for 1 min at 15 mA current and 0.26 mBar air pressure and were immediately used for sample vitrification. 2.5 μL of HGSNAT at 0.9 mg/ml was applied to the glow-discharged grids, and subsequently blotted for 2s at 18°C and 100% humidity using a Vitrobot (mark IV, ThermoFisher Scientific). Without any wait time, the grids were plunge-frozen in liquid ethane. Movies were recorded on a Gatan K3 direct electron detector in super-resolution counting mode with a binned pixel size of 0.85 Å per pixel using Serial EM on a FEI Titan Krios G4i transmission electron microscope operating at an electron beam energy of 300 KeV with a Gatan Image Filter slit width set to 20 eV. Exposures of ∼2s were dose fractionated into 50 frames, attaining a total dose of ∼50 e^- Å^-2, and the defocus values were varied from −1.0 to −2.5 μm.

Cryo-EM data processing

All image processing was done using CryoSPARC (v 4.2.1) unless specified otherwise (Punjani et al, 2017). Briefly, motion correction was carried out on the raw movies, and subsequently, the contrast transfer function (CTF) was estimated using patch-CTF correction on dose-weighted motion-corrected averages within cryoSPARC (Fig S2). A total of 15,067 micrographs were collected, of which 10,394 were selected for further processing based on the CTF fit. Both blob picking and template picking were used to pick particles, and duplicates were removed. Particles were extracted initially using a box size of 288 pixels. A subset of the particles picked by the blob picker was used to generate a low-resolution ab initio reconstruction, which was later used as a reference for iterative 3D classification and heterogenous refinement. The pooled particles were subjected to multiple rounds of reference-free 2D classification and heterogenous refinement to get a cleaned subset of particles which resulted in classes with recognizable features. The cleaned particles were re-extracted with a box size of 360 pixels, and a stack of 94,875 good particles was selected and subjected to an initial round of NU refinement using an ab initio model as a reference (Punjani et al, 2020). 86,500 particles resulted in a 3.89 Å map, which had well-resolved luminal domain architecture, and visible secondary structure features of TM helices. To improve the resolution further, only the particles belonging to micrographs with CTF fit <4 Å were selected for subsequent refinements. Both C1 and C2 symmetry-imposed maps overlapped well, and we used C2 symmetry in the final rounds of processing (Fig S2 and S3C). A final subset of 57,739 particles was used for NU-refinement, yielding a 3.26 Å map (Fourier shell coefficient (FSC) = 0.143 criterion). Symmetry expansion and local refinement with a focused mask on LD and TMD domain was performed to improve local density in these domains. The map obtained by symmetry expansion was only used to assess the fit of the model to density. A composite map generated by combining these local maps was used to finalize the fit of the side chains during model building and refinement (Fig S3D). Local resolutions were estimated using the RESMAP software (Kucukelbir et al, 2014).

Model building, refinement, and structure analysis

ModelAngelo was used to build HGSNAT isoform 2 into the final map, and final refinement was performed in Phenix (Jamali et al, 2023; Liebschner et al, 2019). Coot was used to rebuild certain portions of the protein and to analyze the Ramachandran outliers (Emsley et al, 2010). ChimeraX was used for making the figures (Meng et al, 2023). Ligand binding site prediction on HGSNAT was performed using MOLEonline and DeepSite (Jimenez et al, 2017; Pravda et al, 2018). To analyze the effect of missense mutations on HGSNAT and to calculate free energies indicating the relative mutation-induced destabilization, we used FoldX (Schymkowitz et al, 2005).

Supporting information

Supplementary Data

Acknowledgements

We thank the University of Michigan (UM) cryo-EM facility staff for assistance with cryo-EM data collection, and Drs. Michael Cianfrocco, Farzad Jalali-Yazdi, Jonathan Coleman, and Prashant Rao for helpful comments on cryo-EM analysis. UM cryo-EM facility receives generous support from UM BSI and UM LSI. We thank all the members of Mosalaganti lab and Baldridge lab for their educative discussions on our work. This work was supported by the start-up funding provided to SM by the University of Michigan. SM is a recipient of the NIH Director’s New Innovator Award (DP2) (1DP2GM150019-01).

Author contributions

VN designed the project, performed the experiments, collected the cryo-EM data, and wrote the first draft of the manuscript. AK, VN, and SM processed and analyzed the cryo-EM data. VN and AK built the model and made the figures. AK deposited the map and coordinates. VN, AK, and SM contributed to the editing and preparation of the final draft of the manuscript.

Conflict of interest

The authors declare no conflict of interest.

Data availability

The data presented in this manuscript are available upon request. All correspondence for materials and data presented in this paper should be addressed to mosalaga@umich.edu. The cryo-EM maps and 3D coordinates of HGSNAT have been deposited in the Electron Microscopy Data Bank (EMDB) and Protein Data Bank (PDB) under the accession codes EMD-41620 and PDB-8TU9. All the half-maps and the masks used for refinement have also been deposited in the EMDB under the same accession code.

References

1. Ashkenazy H
2. Abadi S
3. Martz E
4. Chay O
5. Mayrose I
6. Pupko T
7. Ben-Tal N
2016ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromoleculesNucleic Acids Res 44:W344–350Google Scholar
1. Ausseil J
2. Landry K
3. Seyrantepe V
4. Trudel S
5. Mazur A
6. Lapointe F
7. Pshezhetsky AV
2006An acetylated 120-kDa lysosomal transmembrane protein is absent from mucopolysaccharidosis IIIC fibroblasts: a candidate molecule for MPS IIICMol Genet Metab 87:22–31Google Scholar
1. Bame KJ
2. Rome LH
1985Acetyl coenzyme A: alpha-glucosaminide N-acetyltransferase. Evidence for a transmembrane acetylation mechanismJ Biol Chem 260:11293–11299Google Scholar
1. Bame KJ
2. Rome LH
1986aAcetyl-coenzyme A:alpha-glucosaminide N-acetyltransferase. Evidence for an active site histidine residueJ Biol Chem 261:10127–10132Google Scholar
1. Bame KJ
2. Rome LH
1986bGenetic evidence for transmembrane acetylation by lysosomesScience 233:1087–1089Google Scholar
1. Batzios SP
2. Zafeiriou DI
3. Vargiami E
4. Karakiulakis G
5. Papakonstantinou E
2012Differential expression of matrix metalloproteinases in the serum of patients with mucopolysaccharidosesJIMD Rep 3:59–66Google Scholar
1. Beale JH
2. Parker JL
3. Samsudin F
4. Barrett AL
5. Senan A
6. Bird LE
7. Scott D
8. Owens RJ
9. Sansom MSP
10. Tucker SJ
11. et al
2015Crystal Structures of the Extracellular Domain from PepT1 and PepT2 Provide Novel Insights into Mammalian Peptide TransportStructure 23:1889–1899Google Scholar
1. Bonifacino JS
2. Traub LM
2003Signals for sorting of transmembrane proteins to endosomes and lysosomesAnnu Rev Biochem 72:395–447Google Scholar
1. Bork P
2. Holm L
3. Sander C
1994The immunoglobulin fold. Structural classification, sequence patterns and common coreJ Mol Biol 242:309–320Google Scholar
1. Canals I
2. Elalaoui SC
3. Pineda M
4. Delgadillo V
5. Szlago M
6. Jaouad IC
7. Sefiani A
8. Chabas A
9. Coll MJ
10. Grinberg D
11. et al
2011Molecular analysis of Sanfilippo syndrome type C in Spain: seven novel HGSNAT mutations and characterization of the mutant allelesClin Genet 80:367–374Google Scholar
1. Chidyausiku TM
2. Mendes SR
3. Klima JC
4. Nadal M
5. Eckhard U
6. Roel-Touris J
7. Houliston S
8. Guevara T
9. Haddox HK
10. Moyer A
11. et al
2022Nat Commun13:5661Google Scholar
1. Coutinho MF
2. Lacerda L
3. Alves S
2012Glycosaminoglycan storage disorders: a reviewBiochem Res Int 2012:471325Google Scholar
1. Durand S
2. Feldhammer M
3. Bonneil E
4. Thibault P
5. Pshezhetsky AV
2010Analysis of the biogenesis of heparan sulfate acetyl-CoA:alpha-glucosaminide N-acetyltransferase provides insights into the mechanism underlying its complete deficiency in mucopolysaccharidosis IIICJ Biol Chem 285:31233–31242Google Scholar
1. Emsley P
2. Lohkamp B
3. Scott WG
4. Cowtan K
2010Features and development of CootActa Crystallogr D Biol Crystallogr 66:486–501Google Scholar
1. Fan X
2. Tkachyova I
3. Sinha A
4. Rigat B
5. Mahuran D
2011Characterization of the biosynthesis, processing and kinetic mechanism of action of the enzyme deficient in mucopolysaccharidosis IIICPLoS One 6:e24951Google Scholar
1. Fan X
2. Zhang H
3. Zhang S
4. Bagshaw RD
5. Tropak MB
6. Callahan JW
7. Mahuran DJ
2006Identification of the gene encoding the enzyme deficient in mucopolysaccharidosis IIIC (Sanfilippo disease type C)Am J Hum Genet 79:738–744Google Scholar
1. Fedele AO
2. Hopwood JJ
2010Functional analysis of the HGSNAT gene in patients with mucopolysaccharidosis IIIC (Sanfilippo C Syndrome)Hum Mutat 31:E1574–1586Google Scholar
1. Feldhammer M
2. Durand S
3. Mrazova L
4. Boucher RM
5. Laframboise R
6. Steinfeld R
7. Wraith JE
8. Michelakakis H
9. van Diggelen OP
10. Hrebicek M
11. et al
2009aSanfilippo syndrome type C: mutation spectrum in the heparan sulfate acetyl-CoA: alpha-glucosaminide N-acetyltransferase (HGSNAT) geneHum Mutat 30:918–925Google Scholar
1. Feldhammer M
2. Durand S
3. Pshezhetsky AV
2009bProtein misfolding as an underlying molecular defect in mucopolysaccharidosis III type CPLoS One 4:e7434Google Scholar
1. Felisberto-Rodrigues C
2. Durand E
3. Aschtgen MS
4. Blangy S
5. Ortiz-Lombardia M
6. Douzi B
7. Cambillau C
8. Cascales E
2011Towards a structural comprehension of bacterial type VI secretion systems: characterization of the TssJ-TssM complex of an Escherichia coli pathovarPLoS Pathog 7:e1002386Google Scholar
1. Goehring A
2. Lee CH
3. Wang KH
4. Michel JC
5. Claxton DP
6. Baconguis I
7. Althoff T
8. Fischer S
9. Garcia KC
10. Gouaux E
2014Screening and large-scale expression of membrane proteins in mammalian cells for structural studiesNat Protoc 9:2574–2585Google Scholar
1. Hirano Y
2. Gao YG
3. Stephenson DJ
4. Vu NT
5. Malinina L
6. Simanshu DK
7. Chalfant CE
8. Patel DJ
9. Brown RE
2019Structural basis of phosphatidylcholine recognition by the C2-domain of cytosolic phospholipase A(2)alphaElife :8Google Scholar
1. Holm L
2. Laiho A
3. Toronen P
4. Salgado M
2023DALI shines a light on remote homologs: One hundred discoveriesProtein Sci 32:e4519Google Scholar
1. Hornberg A
2. Eneqvist T
3. Olofsson A
4. Lundgren E
5. Sauer-Eriksson AE
2000A comparative analysis of 23 structures of the amyloidogenic protein transthyretinJ Mol Biol 302:649–669Google Scholar
1. Hrebicek M
2. Mrazova L
3. Seyrantepe V
4. Durand S
5. Roslin NM
6. Noskova L
7. Hartmannova H
8. Ivanek R
9. Cizkova A
10. Poupetova H
11. et al
2006Mutations in TMEM76* cause mucopolysaccharidosis IIIC (Sanfilippo C syndrome)Am J Hum Genet 79:807–819Google Scholar
1. Huizing M
2. Gahl WA
2020Inherited disorders of lysosomal membrane transportersBiochim Biophys Acta Biomembr 1862:183336Google Scholar
1. Jamali K
2. Kall L
3. Zhang R
4. Brown A
5. Kimanius D
6. Scheres SHW
2023Automated model building and protein identification in cryo-EM mapsbioRxiv Google Scholar
1. Javadpour MM
2. Eilers M
3. Groesbeek M
4. Smith SO
1999Helix packing in polytopic membrane proteins: role of glycine in transmembrane helix associationBiophys J 77:1609–1618Google Scholar
1. Jimenez J
2. Doerr S
3. Martinez-Rosell G
4. Rose AS
5. De Fabritiis G
2017DeepSite: protein-binding site predictor using 3D-convolutional neural networksBioinformatics 33:3036–3042Google Scholar
1. Jobin PG
2. Butler GS
3. Overall CM
2017New intracellular activities of matrix metalloproteinases shine in the moonlightBiochim Biophys Acta Mol Cell Res 1864:2043–2055Google Scholar
1. Jumper J
2. Evans R
3. Pritzel A
4. Green T
5. Figurnov M
6. Ronneberger O
7. Tunyasuvunakool K
8. Bates R
9. Zidek A
10. Potapenko A
11. et al
2021Highly accurate protein structure prediction with AlphaFoldNature 596:583–589Google Scholar
1. Kawate T
2. Gouaux E
2006Fluorescence-detection size-exclusion chromatography for precrystallization screening of integral membrane proteinsStructure 14:673–681Google Scholar
1. Klein U
2. Kresse H
3. von Figura K
1978Sanfilippo syndrome type C: deficiency of acetyl-CoA:alpha-glucosaminide N-acetyltransferase in skin fibroblastsProc Natl Acad Sci U S A 75:5185–5189Google Scholar
1. Koch I
2. Kaden F
3. Selbig J
1992Analysis of protein sheet topologies by graph theoretical methodsProteins 12:314–323Google Scholar
1. Kretsinger RH
2. Kretsinger RH
3. Uversky VN
4. Permiakov EA
2013Encyclopedia of metalloproteinsNew York: Springer Google Scholar
1. Kucukelbir A
2. Sigworth FJ
3. Tagare HD
2014Quantifying the local resolution of cryo-EM density mapsNat Methods 11:63–65Google Scholar
1. Li F
2. Leier A
3. Liu Q
4. Wang Y
5. Xiang D
6. Akutsu T
7. Webb GI
8. Smith AI
9. Marquez-Lago T
10. Li J
11. et al
2020Procleave: Predicting Protease-specific Substrate Cleavage Sites by Combining Sequence and Structural InformationGenomics Proteomics Bioinformatics 18:52–64Google Scholar
1. Liebschner D
2. Afonine PV
3. Baker ML
4. Bunkoczi G
5. Chen VB
6. Croll TI
7. Hintze B
8. Hung LW
9. Jain S
10. McCoy AJ
11. et al
2019Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in PhenixActa Crystallogr D Struct Biol 75:861–877Google Scholar
1. McBride KL
2. Flanigan KM
2021Update in the MucopolysaccharidosesSemin Pediatr Neurol 37:100874Google Scholar
1. Meikle PJ
2. Whittle AM
3. Hopwood JJ
1995Human acetyl-coenzyme A:alpha-glucosaminide N-acetyltransferase. Kinetic characterization and mechanistic interpretationBiochem J 308:327–333Google Scholar
1. Meng EC
2. Goddard TD
3. Pettersen EF
4. Couch GS
5. Pearson ZJ
6. Morris JH
7. Ferrin TE
2023UCSF ChimeraX: Tools for Structure Building and AnalysisProtein Sci: e Google Scholar
1. Mirdita M
2. Schutze K
3. Moriwaki Y
4. Heo L
5. Ovchinnikov S
6. Steinegger M
2022ColabFold: making protein folding accessible to allNat Methods 19:679–682Google Scholar
1. Muller S
2. Dennemarker J
3. Reinheckel T
2012Specific functions of lysosomal proteases in endocytic and autophagic pathwaysBiochim Biophys Acta 1824:34–43Google Scholar
1. Nagel L
2. Oliveira R
3. Pshezhetsky AV
4. Morales CR
2019HGSNAT enzyme deficiency results in accumulation of heparan sulfate in podocytes and basement membranesHistol Histopathol 34:1377–1385Google Scholar
1. Newman KE
2. Tindall SN
3. Mader SL
4. Khalid S
5. Thomas GH
6. Van Der Woude MW
2023A novel fold for acyltransferase-3 (AT3) proteins provides a framework for transmembrane acyl-group transferElife :12Google Scholar
1. Orwick-Rydmark M
2. Arnold T
3. Linke D
4. 8
5. 8 35
2016The Use of Detergents to Purify Membrane ProteinsCurr Protoc Protein Sci 84:1–4Google Scholar
1. Pan X
2. Taherzadeh M
3. Bose P
4. Heon-Roberts R
5. Nguyen ALA
6. Xu T
7. Para C
8. Yamanaka Y
9. Priestman DA
10. Platt FM
11. et al
2022Glucosamine amends CNS pathology in mucopolysaccharidosis IIIC mouse expressing misfolded HGSNATJ Exp Med :219Google Scholar
1. Parker JL
2. Deme JC
3. Wu Z
4. Kuteyi G
5. Huo J
6. Owens RJ
7. Biggin PC
8. Lea SM
9. Newstead S
2021Cryo-EM structure of PepT2 reveals structural basis for proton-coupled peptide and prodrug transport in mammalsSci Adv :7Google Scholar
1. Platt FM
2. d’Azzo A
3. Davidson BL
4. Neufeld EF
5. Tifft CJ
2018Lysosomal storage diseasesNat Rev Dis Primers 4:27Google Scholar
1. Pravda L
2. Sehnal D
3. Tousek D
4. Navratilova V
5. Bazgier V
6. Berka K
7. Svobodova Varekova R
8. Koca J
9. Otyepka M
2018MOLEonline: a web-based tool for analyzing channels, tunnels and pores (2018 update)Nucleic Acids Res 46:W368–W373Google Scholar
1. Pshezhetsky AV
2. Martins C
3. Ashmarina M
2018Sanfilippo type C disease: pathogenic mechanism and potential therapeutic applicationsExpert Opin Orphan D 6:635–646Google Scholar
1. Punjani A
2. Rubinstein JL
3. Fleet DJ
4. Brubaker MA
2017cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determinationNat Methods 14:290–296Google Scholar
1. Punjani A
2. Zhang H
3. Fleet DJ
2020Non-uniform refinement: adaptive regularization improves single-particle cryo-EM reconstructionNat Methods 17:1214–1221Google Scholar
1. Rajasekar KV
2. Ji S
3. Coulthard RJ
4. Ride JP
5. Reynolds GL
6. Winn PJ
7. Wheeler MJ
8. Hyde EI
9. Smith LJ
2019Structure of SPH (self-incompatibility protein homologue) proteins: a widespread family of small, highly stable, secreted proteinsBiochem J 476:809–826Google Scholar
1. Rome LH
2. Crain LR
1981Degradation of mucopolysaccharide in intact isolated lysosomesJ Biol Chem 256:10763–10768Google Scholar
1. Rome LH
2. Hill DF
3. Bame KJ
4. Crain LR
1983Utilization of exogenously added acetyl coenzyme A by intact isolated lysosomesJ Biol Chem 258:3006–3011Google Scholar
1. Rudnik S
2. Damme M
2021The lysosomal membrane-export of metabolites and beyondFEBS J 288:4168–4182Google Scholar
1. Ruivo R
2. Anne C
3. Sagne C
4. Gasnier B
2009Molecular and cellular basis of lysosomal transmembrane protein dysfunctionBiochim Biophys Acta 1793:636–649Google Scholar
1. Saier MH
2. Reddy VS
3. Moreno-Hagelsieb G
4. Hendargo KJ
5. Zhang Y
6. Iddamsetty V
7. Lam KJK
8. Tian N
9. Russum S
10. Wang J
11. et al
2021The Transporter Classification Database (TCDB): 2021 updateNucleic Acids Res 49:D461–D467Google Scholar
1. Schroder B
2. Saftig P
2016Intramembrane proteolysis within lysosomesAgeing Res Rev 32:51–64Google Scholar
1. Schwake M
2. Schroder B
3. Saftig P
2013Lysosomal membrane proteins and their central role in physiologyTraffic 14:739–748Google Scholar
1. Schymkowitz J
2. Borg J
3. Stricher F
4. Nys R
5. Rousseau F
6. Serrano L
2005The FoldX web server: an online force fieldNucleic Acids Res 33:W382–388Google Scholar
1. Song J
2. Tan H
3. Perry AJ
4. Akutsu T
5. Webb GI
6. Whisstock JC
7. Pike RN
2012PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sitesPLoS One 7:e50300Google Scholar
1. Steenhuis P
2. Froemming J
3. Reinheckel T
4. Storch S
2012Proteolytic cleavage of the disease-related lysosomal membrane glycoprotein CLN7Biochim Biophys Acta 1822:1617–1628Google Scholar
1. Turk B
2. Turk D
3. Turk V
2000Lysosomal cysteine proteases: more than scavengersBiochim Biophys Acta 1477:98–111Google Scholar
1. Ulmschneider MB
2. Sansom MS
2001Amino acid distributions in integral membrane protein structuresBiochim Biophys Acta 1512:1–14Google Scholar
1. Varadi M
2. Anyango S
3. Deshpande M
4. Nair S
5. Natassia C
6. Yordanova G
7. Yuan D
8. Stroe O
9. Wood G
10. Laydon A
11. et al
2022AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy modelsNucleic Acids Res 50:D439–D444Google Scholar

Article and author information

Author information

Vikas Navratna
Life Sciences Institute, University of Michigan, Ann Arbor, Michigan, 48109, United States, Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, Michigan, 48109, United States
ORCID iD: 0000-0001-8599-1461
Arvind Kumar
Thermo Fisher Scientific, Waltham, Massachusetts, 02451, United States
Shyamal Mosalaganti
Life Sciences Institute, University of Michigan, Ann Arbor, Michigan, 48109, United States, Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, Michigan, 48109, United States, Department of Biophysics, College of Literature, Science and the Arts, University of Michigan, Ann Arbor, Michigan, 48109, United States
ORCID iD: 0000-0002-5934-687X
- Corresponding author. E-mail: mosalaga@umich.edu

Version history

Sent for peer review: October 24, 2023
Preprint posted: October 26, 2023
Reviewed Preprint version 1: January 10, 2024
Reviewed Preprint version 2: July 26, 2024
Version of Record published: August 28, 2024

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.93510. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, public reviews, and a provisional response from the authors.

Reviewing Editor
Andres Jara-Oseguera
The University of Texas at Austin, Austin, United States of America
Senior Editor
Merritt Maduke
Stanford University, Stanford, United States of America

Reviewer #1 (Public Review):

This article by Navratna et al. reports the first structure of human HGSNAT in an acetyl-CoA-bound state. Through careful structural analysis, the authors propose potential reasons why certain human mutations lead to lysosomal storage disorders and outline a catalytic mechanism. The structural data are of good quality, and the manuscript is clearly written. This study represents an important step toward understanding the mechanism of HGSNAT and is valuable to the field. I have the following suggestions:

1. The authors should characterize whether the purified protein is active. Otherwise, how does one know if the detergent used maintains the protein in a biologically relevant state? The authors should at least attempt to do so. If these prove to be challenging, at the very least, the authors should try a cell-based assay to demonstrate that the GFP tag does not interfere with the function.

2. In Figure 5, the authors present a detailed schematic of the catalytic cycle, which I find to be too speculative. There is no evidence to suggest that this enzyme undergoes isomerization, similar to a transporter, between open-to-lumen and open-to-cytosol states. Could it not simply involve some movements of side chains to complete the acetyl transfer?

https://doi.org/10.7554/eLife.93510.1.sa2

Reviewer #2 (Public Review):

Summary:
This work describes the structure of Heparan-alpha-glucosaminide N-acetyltransferase (HGSNAT), a lysosomal membrane protein that catalyzes the acetylation reaction of the terminal alpha-D-glucosamine group required for the degradation of heparan sulfate (HS). HS degradation takes place during the degradation of the extracellular matrix, a process required for restructuring tissue architecture, regulation of cellular function, and differentiation. During this process, HS is degraded into monosaccharides and free sulfate in lysosomes.

HGSNAT catalyzes the transfer of the acetyl group from acetyl-CoA to the terminal non-reducing amino group of alpha-D-glucosamine. The molecular mechanism by which this process occurs has not been described so far. One of the main reasons to study the mechanism of HGSNAT is that multiple mutations spanning the entire sequence of the protein, such as nonsense mutations, splice-site variants, and missense mutations lead to dysfunction that causes abnormal accumulation of HS within the lysosomes. This accumulation is a cause of mucopolysaccharidosis IIIC (MPS IIIC), an autosomal recessive neurodegenerative lysosomal storage disorder, for which there are no approved drugs or treatment strategies.

This paper provides a 3.26A structure of HGSNAT, determined by single-particle cryo-EM. The structure reveals that HGSNAT is a dimer in detergent micelles and a density assigned to acetyl-CoA. The authors speculate about the molecular mechanism of the acetylation reaction, map the mutations known to cause MPS IIIC on the structure and speculate about the nature of the HGSNAT disfunction caused by such mutations.

Strengths:
The description of the architecture of HGSNAT is the highlight of the paper since this corresponds to the first description of the structure of a member of the transmembrane acyl transferase (TmAT) superfamily. The high resolution of an HGSNAT bound to acetyl-CoA is an important leap in our understanding of the HGSNAT mechanism. The density map is of high quality, except for the luminal domain. The location of the acetyl-CoA allows speculation about the mechanistic role of multiple residues surrounding this molecule. The authors thoroughly describe the architecture of HGSNAT and map the mutations leading to MPS IIIC. The description of the dimeric interphase is a novel result, and future studies are left to confirm the importance of oligomerization for function.

Weaknesses:
Apart from the cryo-EM structure, the article does not provide any other experimental evidence to support or explain a molecular mechanism. Due to the complete absence of functional assays, mutagenesis analysis, or other structures such as a ternary complex or an acetylated enzyme intermediate, the mechanistic model depicted in Figure 5 should be taken with caution.

The authors discuss that H269 is an essential residue that participates in the acetylation reaction, possibly becoming acetylated during the process. However, there is no solid experimental evidence, e.g. mutagenesis analysis or structural analysis, in this or previous articles, that demonstrates this to be the case.

In the discussion part, the authors mention previous studies in which it was postulated that the catalytic reaction can be described by a random order mechanistic model or a Ping Pong Bi Bi model. However, the authors leave open the question of which of these mechanisms best describes the acetylation reaction. The structure presented here does not provide evidence that could support one mechanism or the other.

Although the authors map the mutations leading to MPS IIIC on the structure and use FoldX software to predict the impact of these mutations on folding and fold stability, there is no experimental evidence to support FoldX's predictions.

https://doi.org/10.7554/eLife.93510.1.sa1

Reviewer #3 (Public Review):

Summary:
Navratna et al. have solved the first structure of a transmembrane N-acetyltransferase (TNAT), resolving the architecture of human heparan-alpha-glucosaminide N-acetyltransferase (HGSNAT) in the acetyl-CoA bound state using single particle cryo-electron microscopy (cryoEM). They show that the protein is a dimer, and define the architecture of the alpha- and beta- GSNAT fragments, as well as convincingly characterizing the binding site of acetyl-CoA.

Strengths:
This is the first structure of any member of the transmembrane acyl transferase superfamily, and as such it provides important insights into the architecture and acetyl-CoA binding site of this class of enzymes.

The structural data is of a high quality, with an isotropic cryoEM density map at 3.3Å facilitating the building of a high-confidence atomic model. Importantly, the density of the acetyl-CoA ligand is particularly well-defined, as are the contacting residues within the transmembrane domain.

The open-to-lumen structure of HSGNAT presented here will undoubtedly lay the groundwork for future structural and functional characterization of the reaction cycle of this class of enzymes.

Weaknesses:
While the structural data for the open-to-lumen state presented in this work is very convincing, and clearly defines the binding site of acetyl-CoA, to get a complete picture of the enzymatic mechanism of this family, additional structures of other states will be required.

A potentially significant weakness of the study is the lack of functional validation. The enzymatic activity of the enzyme characterized was not measured, and the enzyme lacks native proteolytic processing, so it is a little unclear whether the structure represents an active enzyme.

https://doi.org/10.7554/eLife.93510.1.sa0

Author Response

eLife assessment

This important work describes the first high-resolution structure of HGSNAT, a lysosomal membrane protein required for the degradation of heparan sulfate (HS). Through careful structural analysis, this work proposes potential reasons why certain mutations in HGSNAT lead to lysosomal storage disorders and outlines the enzyme's catalytic mechanism. The experimental evidence presented provides incomplete support for the proposed molecular mechanism of the HS acetylation reaction and the impact of disease-causing mutations.

We thank the editors and reviewers for taking the time to provide a critical assessment of our manuscript. We appreciate the input and suggestions to improve the analysis. Included here are only our provisional responses. We will address the concerns raised in more detail and incorporate them in the revised version of the manuscript.

Reviewer #1 (Public Review):

This article by Navratna et al. reports the first structure of human HGSNAT in an acetyl-CoAbound state. Through careful structural analysis, the authors propose potential reasons why certain human mutations lead to lysosomal storage disorders and outline a catalytic mechanism. The structural data are of good quality, and the manuscript is clearly written. This study represents an important step toward understanding the mechanism of HGSNAT and is valuable to the field. I have the following suggestions:

We thank the reviewer for their encouraging and positive overall assessment of our work.

The authors should characterize whether the purified protein is active. Otherwise, how does one know if the detergent used maintains the protein in a biologically relevant state? The authors should at least attempt to do so. If these prove to be challenging, at the very least, the authors should try a cell-based assay to demonstrate that the GFP tag does not interfere with the function.

Thank you for highlighting this concern. The cryo-EM sample was prepared without the exogenous addition of ligand, as noted in the manuscript; the acetyl-CoA that we see in the structure was intrinsically bound to the protein, indicating the ability of GFP-tagged HGSNAT protein to bind the ligand. We purified the protein at a pH optimal for acetyl-CoA binding, as suggested by Bame, K. J. and Rome, L. H. (1985) and Meikle, P. J. et al., (1995). Because we see acetyl-CoA in a structure obtained using a GFP fusion, we argue that GFP does not interfere with protein stability and ability to bind to the co-substrate. As demonstrated by existing literature HGSNAT catalyzed reaction is compartmentalized spatially and conditionally. The binding of acetyl-CoA happens towards the cytosol and is optimal at pH 7-0.8.0, while the transfer of the acetyl group to heparan sulfate occurs towards the luminal side and is optimal at pH 5.0-6.0. We are working on establishing a robust assay to study this complicated and compartmentalized acetyl transfer assay.

In Figure 5, the authors present a detailed schematic of the catalytic cycle, which I find to be too speculative. There is no evidence to suggest that this enzyme undergoes isomerization, like a transporter, between open-to-lumen and open-to-cytosol states. Could it not simply involve some movements of side chains to complete the acetyl transfer?

The acetyl-CoA bound structure presented in the paper does not conclusively support a potential for isomerization and conformational dynamics. We agree with the reviewer that the reaction schematic presented in Figure 5 is speculative. We acknowledge in the discussion that our structure represents only a single step of the reaction, and defining the precise mechanism of acetyl transfer needs additional work. However, we will reword the discussion and change Figure 5 to address this concern raised by multiple reviewers.

Reviewer #2 (Public Review):

Summary:

This work describes the structure of Heparan-alpha-glucosaminide N-acetyltransferase (HGSNAT), a lysosomal membrane protein that catalyzes the acetylation reaction of the terminal alpha-D-glucosamine group required for the degradation of heparan sulfate (HS). HS degradation takes place during the degradation of the extracellular matrix, a process required for restructuring tissue architecture, regulation of cellular function, and differentiation. During this process, HS is degraded into monosaccharides and free sulfate in lysosomes.

HGSNAT catalyzes the transfer of the acetyl group from acetyl-CoA to the terminal non-reducing amino group of alpha-D-glucosamine. The molecular mechanism by which this process occurs has not been described so far. One of the main reasons to study the mechanism of HGSNAT is that multiple mutations spanning the entire sequence of the protein, such as nonsense mutations, splicesite variants, and missense mutations lead to dysfunction that causes abnormal accumulation of HS within the lysosomes. This accumulation is a cause of mucopolysaccharidosis IIIC (MPS IIIC), an autosomal recessive neurodegenerative lysosomal storage disorder, for which there are no approved drugs or treatment strategies.

This paper provides a 3.26A structure of HGSNAT, determined by single-particle cryo-EM. The structure reveals that HGSNAT is a dimer in detergent micelles and a density assigned to acetylCoA. The authors speculate about the molecular mechanism of the acetylation reaction, map the mutations known to cause MPS IIIC on the structure and speculate about the nature of the HGSNAT disfunction caused by such mutations.

Strengths:

The description of the architecture of HGSNAT is the highlight of the paper since this corresponds to the first description of the structure of a member of the transmembrane acyl transferase (TmAT) superfamily. The high resolution of an HGSNAT bound to acetyl-CoA is an important leap in our understanding of the HGSNAT mechanism. The density map is of high quality, except for the luminal domain. The location of the acetyl-CoA allows speculation about the mechanistic role of multiple residues surrounding this molecule. The authors thoroughly describe the architecture of HGSNAT and map the mutations leading to MPS IIIC. The description of the dimeric interphase is a novel result, and future studies are left to confirm the importance of oligomerization for function.

We thank the reviewer for their time and for highlighting both the quality and novelty of the structure presented in this work.

Weaknesses:

Apart from the cryo-EM structure, the article does not provide any other experimental evidence to support or explain a molecular mechanism. Due to the complete absence of functional assays, mutagenesis analysis, or other structures such as a ternary complex or an acetylated enzyme intermediate, the mechanistic model depicted in Figure 5 should be taken with caution.

Thank you for pointing out this concern. The proposed mechanistic model in Figure 5 is a hypothesis based on previously reported biochemical characterization of HGSNAT by Rome & Crain (1981), Rome et al, (1983), Miekle et al., (1995) and Fan et al., (2011). However, we agree with the reviewer that this schematic is not experimentally proven and is speculative at best. Especially because our structure presents only a single step of the reaction, which does not conclusively support either ping-pong or random-order bi-substrate reactions. We will rephrase this section of our discussion and edit Figure 5 to address this concern.

The authors discuss that H269 is an essential residue that participates in the acetylation reaction, possibly becoming acetylated during the process. However, there is no solid experimental evidence, e.g. mutagenesis analysis or structural analysis, in this or previous articles, that demonstrates this to be the case.

H269, as a crucial catalytic residue, was suggested by monitoring the effect of chemical modifications of amino acids on acetylation of HGSNAT membranes by Bame, K. J. and Rome, L. H. (1986). We agree that mutagenesis, catalysis, and structural evidence for the same are not currently available. We are pursuing a more thorough exploration of the role of both H269 (previous studies) and N258 (from this study) on the stability and function of HGSNAT.

In the discussion part, the authors mention previous studies in which it was postulated that the catalytic reaction can be described by a random order mechanistic model or a Ping Pong Bi Bi model. However, the authors leave open the question of which of these mechanisms best describes the acetylation reaction. The structure presented here does not provide evidence that could support one mechanism or the other.

We agree with the reviewer’s observation that the structure doesn’t indeed support one reaction mechanism or another. We are pursuing the structural and kinetic characterization of HGSNAT in the presence of other co-substrates and multiple pHs that are required to address this concern thoroughly.

Although the authors map the mutations leading to MPS IIIC on the structure and use FoldX software to predict the impact of these mutations on folding and fold stability, there is no experimental evidence to support FoldX's predictions.

We are working on assessing the impact of specific mutations on the stability of HGSNAT and will add them to the revised version of the manuscript. We thank the reviewer for this suggestion.

Reviewer #3 (Public Review):

Summary:

Navratna et al. have solved the first structure of a transmembrane N-acetyltransferase (TNAT), resolving the architecture of human heparan-alpha-glucosaminide N-acetyltransferase (HGSNAT) in the acetyl-CoA bound state using single particle cryo-electron microscopy (cryoEM). They show that the protein is a dimer and define the architecture of the alpha- and beta- GSNAT fragments, as well as convincingly characterizing the binding site of acetyl-CoA.

Strengths:

This is the first structure of any member of the transmembrane acyl transferase superfamily, and as such it provides important insights into the architecture and acetyl-CoA binding site of this class of enzymes.

The structural data is of a high quality, with an isotropic cryoEM density map at 3.3Å facilitating the building of a high-confidence atomic model. Importantly, the density of the acetyl-CoA ligand is particularly well-defined, as are the contacting residues within the transmembrane domain.

The open-to-lumen structure of HSGNAT presented here will undoubtedly lay the groundwork for future structural and functional characterization of the reaction cycle of this class of enzymes.

We thank the reviewer for their positive assessment of the data presented in this work. We really appreciate and agree with the reviewer's comment that the “structure of HSGNAT presented here will undoubtedly lay the groundwork for future structural and functional studies.”

Weaknesses:

While the structural data for the open-to-lumen state presented in this work is very convincing, and clearly defines the binding site of acetyl-CoA, to get a complete picture of the enzymatic mechanism of this family, additional structures of other states will be required.

We agree with the reviewers’ assessment and are heavily invested in pursuing the structures of all the steps of acetyl transfer by HGSNAT.

A potentially significant weakness of the study is the lack of functional validation. The enzymatic activity of the enzyme characterized was not measured, and the enzyme lacks native proteolytic processing, so it is a little unclear whether the structure represents an active enzyme.

We thank the reviewer for this comment. While the proteolytic cleavage of the protein remains debated, we find no evidence of such an event in our purification (SDS-PAGE and SEC). Studies like Durand et al., (2010) and Fan et al., (2011) suggest that even the ER retained monomeric HGSNAT is active. Because we see acetyl-CoA (co-substrate) bound to the protein in our structure, we surmise that proteolysis is not necessary for function, at least not for substrate binding. However, we are working towards the structural and kinetic characterization of recombinant α- and β-HGSNAT construct to explore the role of proteolysis on HGSNAT stability and function.

https://doi.org/10.7554/eLife.93510.1.sa4

Significance of findings

Strength of evidence

Abstract

Introduction

Results

Purification of dimeric HGSNAT

Architecture of HGSNAT

Structure of HGSNAT.

Cryo-EM data collection, processing, and validation statistics

Transmembrane domain (TMD)

Luminal domain (LD)

Domain organization, and LD-TMD and dimer interfaces of HGSNAT.

LD-TMD interface

Dimer interface

Acetyl-CoA binding site (ACOS)

Acetyl-CoA binding site (ACOS).

Conservation and homology

Molecular basis for MPS IIIC mutation-induced dysfunction.

The basis for mutation-induced dysfunction and destabilization of HGSNAT

Catalytic core mutations

Mutations at the LD-TMD interface

Scaffold domain mutations

Other mutations

Mechanism of acetyltransferase reaction

Proposed mechanism of acetyl transfer by HGSNAT.

Discussion

Materials and Methods

Cloning and site-directed mutagenesis

Cell culturing, transient transfection, and transduction

Fluorescence-detection size-exclusion chromatography (FSEC) analysis

Purification of HGSNAT

Cryo-EM sample preparation and data collection

Cryo-EM data processing

Model building, refinement, and structure analysis

Supporting information

Acknowledgements

Author contributions

Conflict of interest

Data availability

References

Article and author information

Author information

Vikas Navratna

Arvind Kumar

Shyamal Mosalaganti

Version history

Cite all versions

Copyright

Peer review process

Editors