Multiple modes of cholesterol translocation in the human Smoothened receptor

  1. Prateek D Bansal
  2. Maia Kinnebrew
  3. Rajat Rohatgi
  4. Diwakar Shukla  Is a corresponding author
  1. Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, United States
  2. Department of Biochemistry, Stanford University School of Medicine, United States
  3. Department of Medicine, Stanford University School of Medicine, United States
  4. Department of Chemistry, University of Illinois at Urbana-Champaign, United States
  5. Department of Bioengineering, University of Illinois Urbana-Champaign, United States
  6. Cancer Center at Illinois, University of Illinois at Urbana-Champaign, United States

eLife Assessment

In this important study, the authors conducted extensive sets of computational and investigations of the mechanism of cholesterol transport in the smoothened (SMO) protein. The computational component integrated multiple state-of-the-art approaches such as adaptive sampling, free energy simulations, and Markov state modeling, providing compelling support for the proposed mechanistic model, which is further validated with solid experimental mutagenesis data.

https://doi.org/10.7554/eLife.108030.3.sa0

Abstract

Smoothened (SMO), a member of the G-protein-coupled receptor superfamily, mediates Hedgehog signaling and is linked to cancer and birth defects. SMO responds to accessible cholesterol in the ciliary membrane, translocating it via a longitudinal tunnel to its extracellular domain. Reaching a complete mechanistic understanding of the cholesterol translocation process would help in the development of cancer therapies. Experimental data suggest two modes of translocation to support entry of cholesterol from outer and inner membrane leaflets, but the exact mechanism of translocation remains unclear. Using atomistic molecular dynamics simulations (∼2 ms simulations) and biochemical assays of SMO mutants, we assess the energetic feasibilities of the two modes. We show that the highest energetic barrier for cholesterol translocation from the outer leaflet is lower than that from the inner leaflet. Mutagenesis experiments and complementary simulations of SMO mutants validate the role of critical amino acid residues along the translocation pathways. Our data suggests that cholesterol can take either pathway to enter SMO, thus explaining experimental observations in the literature. Thus, our results illuminate the energetics and provide a first molecular description of cholesterol translocation in SMO.

Introduction

Smoothened (SMO) is a member of the G-protein-coupled receptor (GPCR) superfamily with a typical heptahelical transmembrane fold (Wang et al., 2013; Wang et al., 2014). SMO has been identified as an oncoprotein, and mutations that cause overactivity of SMO drive tumorigenesis in basal cell carcinoma and medulloblastoma (Raleigh and Reiter, 2019). SMO antagonists are validated anti-cancer drugs (Axelson et al., 2013; Jain et al., 2017) but are limited by drug resistance and side effects (Meani et al., 2014). SMO functions as a transmembrane signal transducer in the Hedgehog (HH) signaling pathway, which is known to mediate cell differentiation during embryonic development (Nüsslein-Volhard and Wieschaus, 1980; Ingham, 2022). It transduces signals across the cell membrane, particularly in the primary cilia (Rohatgi et al., 2007; Corbit et al., 2005). However, the mechanism of endogenous activation of SMO is still a debated question in the field. SMO activation has been linked to membrane sterols in numerous studies (Cooper et al., 2003; Stottmann et al., 2011; Nachtergaele et al., 2012; Myers et al., 2013; Blassberg et al., 2016; Luchetti et al., 2016; Huang et al., 2016; Byrne et al., 2016; Myers et al., 2017; Kinnebrew et al., 2019). Cholesterol, a steroidal molecule found abundantly in the plasma membranes of vertebrates, has been identified as the endogenous agonist for SMO (Luchetti et al., 2016; Huang et al., 2016; Byrne et al., 2016). Membrane cholesterol has been shown to interact with and modulate GPCR activity (Kumar and Chattopadhyay, 2020; Kumar et al., 2021; Serdiuk et al., 2022), but in the case of SMO, cholesterol has been uniquely shown to be necessary and sufficient for SMO activation (Luchetti et al., 2016; Huang et al., 2016). Recent work has shown that SMO is activated by a minor pool of membrane cholesterol, termed accessible cholesterol, at the primary cilium (Kinnebrew et al., 2019; Radhakrishnan et al., 2020; Kinnebrew et al., 2021; Siebold and Rohatgi, 2023). Patched-1 (PTCH1), a 12-pass transmembrane protein, sits directly upstream of SMO in the HH pathway and inhibits SMO (Ingham et al., 1991; Denef et al., 2000; Ingham et al., 2000; Taipale et al., 2002). PTCH1 modulates the availability of accessible cholesterol at the primary cilium, thereby regulating SMO, with models suggesting effects on both the CRD and 7TM pockets (Kinnebrew et al., 2019; Radhakrishnan et al., 2020; Siebold and Rohatgi, 2023).

The mechanism by which SMO is activated has been the subject of much discussion over recent years, with multiple studies theorizing the mechanism of PTCH1’s inhibition on SMO. Previous studies have suggested that PTCH1 could function as a sterol transporter (Davies et al., 2000; Taipale et al., 2002; Bidet et al., 2011; Zhang et al., 2018; Kinnebrew et al., 2019; Kinnebrew et al., 2021). A recent study using coarse-grained Molecular Dynamics simulations investigated the possibility of this process, concluding that the overall process might occur at an energetic cost of ∼3–5 kcal/mol (Ansell et al., 2023). Recently resolved structures of PTCH1 Gong et al., 2018; Qi et al., 2018; Rudolf et al., 2019; Qi et al., 2019a; Zhang et al., 2020 have reported the presence of a Sterol Binding Domain (SBD) and a hydrophobic conduit that extends from the outer leaflet to the extracellular space. Heterologous expression of PTCH1 in membranes reduces outer leaflet cholesterol accessibility, suggesting that PTCH1 reduces the access of SMO to membrane cholesterol (Kinnebrew et al., 2021).

The CRD of SMO contains a binding site for steroidal molecules (Nachtergaele et al., 2012; Nachtergaele et al., 2013; Nedelcu et al., 2013; Myers et al., 2013). The endogenous agonist cholesterol (Byrne et al., 2016), as well as the naturally occurring alkaloidal agonist cyclopamine (Nachtergaele et al., 2013; Huang et al., 2016), bind in the CRD. In addition, SMO has a pocket in the Transmembrane Domain (TMD), which is known to bind to multiple antagonists such as LY2940680 (Wang et al., 2013), SANT1 and AntaXV (Wang et al., 2014), cyclopamine (Weierstall et al., 2014), TC114 (Zhang et al., 2017), Vismodegib (Byrne et al., 2016), the synthetic agonist SAG (and variants SAG1.3, SAG1.5, SAG21k) (Wang et al., 2014; Qi et al., 2020; Deshpande et al., 2019; Kinnebrew et al., 2022), the steroidal agonists 24 S,25-epoxy cholesterol (Qi et al., 2019b), and cholesterol (Deshpande et al., 2019; Qi et al., 2020).

While cholesterol binding to both the TMD and CRD sites is required for full SMO activation, our work focuses on how cholesterol gains access to the CRD site, perched above the outer leaflet of the membrane (Luchetti et al., 2016; Kinnebrew et al., 2022). Multiple lines of evidence suggest that PTCH1-regulated cholesterol binding to the CRD plays an instructive role in SMO regulation both in cells and animals. Mutations in residues predicted to make hydrogen bonds with the hydroxyl group of cholesterol bound to the CRD reduced both the potency and efficacy of SHH in cellular signaling assays (Kinnebrew et al., 2022; Byrne et al., 2016) and, more importantly, eliminated HH signaling in mouse embryos (Xiao et al., 2017). Experiments using both covalent and photocrosslinkable sterol probes in live cells directly show that PTCH1 activity reduces sterol access to the CRD (Kinnebrew et al., 2022; Xiao et al., 2017). Notably, our simulations evaluate a path of cholesterol translocation that includes both the TMD and CRD sites: cholesterol first enters the 7-transmembrane domain bundle from the membrane; it then engages the TMD site before continuing along a conduit to the CRD site. Thus, we analyze translocation energetics and residue-level contacts along a path that includes both the TMD and the CRD.

Huang et al., 2018 theorized that SMO’s activation involved the translocation of cholesterol from the membrane via a hydrophobic conduit to the binding site in the CRD. This study resolved the structure of active Xenopus laevis SMO, which showed outward movements of transmembrane helices 5 and 6 (TM5-TM6) on the intracellular end, which opened a cavity in SMO extending to the inner leaflet laterally, between TM5 and TM6. This led to the hypothesis that the entry of cholesterol into SMO could happen from the inner leaflet of the membrane, between TM5 and TM6. There is also additional evidence showing that the activity of SMO is regulated by cholesterol in the outer leaflet, which enters SMO between TM2 and TM3. Hedger et al., 2019 reported a cholesterol binding site present at the outer leaflet, between the TM2 and TM3 helices. Using coarse-grained simulations, the authors tested a variety of membrane compositions around SMO and found that a cholesterol binding site existed in human SMO (hSMO). Kinnebrew et al., 2021 used total internal reflection fluorescence microscopy (TIRFM) to assess the effect of PTCH1’s activity on membrane cholesterol. They concluded that PTCH1 activity caused a decrease in the accessibility of cholesterol in the outer leaflet, suggesting that outer leaflet cholesterol is sensed by SMO. Furthermore, in our previous work (Bansal et al., 2023), we observed that cholesterol accumulated outside TM2-TM3 in the outer leaflet of inactive SMO. This was supported by the observation that when SMO was bound to its agonist SAG, we observed a tunnel opening between TM2-TM3 in the outer leaflet, which may facilitate the translocation of cholesterol into the core of SMO’s TMD. We therefore hypothesize that cholesterol shows two modes of translocation to enter the TMD from the membrane - (1) starting at the outer leaflet, between TM2-TM3, or (2) starting at the inner leaflet, between TM5-TM6. Alternatively, (3) cholesterol could use both pathways if they show similar energetic behaviors.

How cholesterol moves from the membrane into the core of the TMD of SMO is still poorly understood. Cholesterol traverses SMO to ultimately reach the TMD and CRD binding sites, but the mechanism of cholesterol perception has not been elucidated yet, which gives us an opportunity to explore the mechanistic aspects of this process from both computational and experimental viewpoints. In this study, we simulate SMO by embedding it in a membrane (Figure 1b). We report the entire translocation path of cholesterol from the membrane to SMO’s CRD for both modes of translocations (Figure 1c) - between TM2-TM3 in the outer leaflet of the membrane (hereafter referred to as ‘Pathway 1’), and between TM5-TM6 in the inner leaflet of the membrane (hereafter referred to as ‘Pathway 2’). We observe that cholesterol can translocate via both pathways, and the free energy barriers associated with Pathway 1 are lower than those of Pathway 2. We test mutations in SMO that can disrupt the movement along either pathway and show that the experimental results are further supported by simulations of cholesterol translocation in SMO mutants.

Figure 1 with 1 supplement see all
Overview of existing structures and hypotheses supporting the cholesterol translocation mechanisms in Smoothened.

(a) The binding sites of the sterols along the hypothesized tunnel in SMO. Sterol binding sites have been identified deep in the TMD (6XBL) (Qi et al., 2020), at the interface of CRD and TMD (6XBM; Qi et al., 2020), at the CRD sterol-binding site (5L7D; Byrne et al., 2016), and in a dual-bound mode where cholesterol is bound to both the TMD and CRD (6O3C; Deshpande et al., 2019). (b) Example simulation system showing SMO (5L7D, cyan) embedded in a membrane (white/magenta). Water is shown as a white surface, while sodium (purple), and chloride (green) ions are shown as spheres. (c) Pathway 1 and Pathway 2 investigate the translocation of cholesterol from the membrane to SMO’s TMD. (d) The Common Pathway follows the translocation of cholesterol from the TMD to the CRD. Snapshots in (a) are made from structures in the PDB, while (b–d) are frames taken from MD simulations.

After cholesterol has entered the TMD of SMO in our simulations, we observe cholesterol moving along TM6 to the TMD-CRD interface (Common Pathway, Figure 1d) to access the binding site in the CRD (Huang et al., 2018; Kinnebrew et al., 2022). One of the unique features of SMO is the presence of a long helix 6 (TM6) (Wang et al., 2013; Figure 1—figure supplement 1), which acts as a connector between the CRD and the TMD. We test mutations in SMO that can disrupt cholesterol movement along this Common Pathway and show that these mutants can halt the translocation process by loss of hydrophobic contacts. These results for SMO mutants are further validated by additional MD simulations. Therefore, in this study, the entire process of cholesterol translocation was observed using aggregate 2 ms of unbiased all-atom MD simulations. Exploring the mechanism by which SMO translocates cholesterol would provide insights into the endogenous regulation of HH activity and suggest strategies for the next generation of drugs targeting SMO.

Results and discussion

The entry of cholesterol from the outer leaflet into the TMD exhibits the highest energetic barrier along Pathway 1

We first simulated the translocation mode in which cholesterol enters the TMD of SMO from the outer leaflet (Pathway 1). To model this process, cholesterol was placed outside of the TMD in the membrane outer leaflet, between the TM2-TM3 interface. The entry of cholesterol from the membrane into the protein was steered towards the TMD to sample the entry pathway. The frames generated were then used to seed unbiased simulations for adaptive sampling (see Methods section for details). This was done to estimate the energetic barriers involved in the cholesterol entry from the outer leaflet.

To enter the TMD from the membrane, cholesterol must overcome the entropic barrier of being restrained inside the protein. This can make the entry process energetically expensive. To identify the barrier associated with entry, we projected the entire dataset on different translocation-related metrics and identified the mechanism of cholesterol translocation. The z-coordinate of cholesterol’s center of mass (z-axis is perpendicular to the plane of the membrane) was used as a proxy for the progress of the overall translocation from the membrane to the CRD. The angle of cholesterol with the x-y plane (the plane of the membrane) was calculated to identify cholesterol pose. We observe that this angle (Figure 2a and Figure 2—figure supplement 1a) shows multiple minima (α/β along the Pathway 1). This provides evidence for multiple stable poses along the pathway as observed in the multiple stable poses of cholesterol in Cryo-EM structures of SMO bound to sterols (Deshpande et al., 2019; Qi et al., 2019b; Qi et al., 2020). A reliable estimate of the barriers comes from using the time-lagged Independent Components (tICs), which project the entire dataset along the slowest kinetic degrees of freedom. Overall, the highest barrier along Pathway 1 is ∼5.8±0.7 kcal/mol, and it is associated with the entry of cholesterol into the TMD (Figure 2—figure supplement 2). Several factors contribute to this high barrier; first, cholesterol needs to be at the correct position - just outside the protein. In addition, cholesterol needs to be in the correct orientation - angled, with the isooctyl tail pointing towards the protein core. Additionally, the steric hindrances from the hydrophobic residues at the interface provide further obstacles for entry. Therefore, the TM2/3 residues at the entry point need to undergo conformational change to facilitate cholesterol entry – making it a rare event.

Figure 2 with 3 supplements see all
The molecular events as cholesterol enters the core of SMO’s TMD from the outer leaflet of the membrane.

(a) Free energy plot showing the angle of cholesterol with the x-y plane, the plane of the membrane, versus the z-coordinate of cholesterol for Pathway 1. The pathway followed by cholesterol is α. The experimental structures of SMO are shown as black polygons. (b) Free energy landscape of cholesterol’s y-coordinate plotted versus cholesterol’s x-coordinate. Cholesterol interacts with residues in TM2-TM3 while entering the core TMD of SMO. (c–f) Insets show cholesterol’s interactions with residues at the membrane-protein interface for Pathway 1. (c,e) show cholesterol outside the protein (α), while (d, f) show cholesterol entering the protein (β). All snapshots presented are frames taken from MD simulations.

In the first stage of translocation, cholesterol is in the membrane (α, Figure 2), just outside the TM2/3 helices of SMO as identified in literature (Hedger et al., 2019). At this cholesterol recognition site, SMO primarily contains hydrophobic residues (Figure 2c–f), which preferentially interact with the hydrophobic isooctyl tail of cholesterol. Therefore, the cholesterol entry involves the insertion of the tail into the TMD first, which is then followed by the hydrophilic androsterolic moiety. For the sake of clarity, we divide this entry pocket at the TM2-TM3-membrane interface into two parts - ‘lower’ and ‘upper’ pocket. The ‘upper’ pocket corresponds to the residues that coordinate with the androsterolic moiety of cholesterol (Figure 2c and d), and the ‘lower’ pocket corresponds to residues that lie closer to the isooctyl tail of cholesterol (Figure 2e and f). The upper (M286ECL1, A289ECL1, IECL2, and AECL2) and the lower (W2812.58f, F391ECL2, and M5257.45f) pocket residues undergo conformational changes to open the space for the entry of cholesterol (Figure 2c–f). Here, the superscript refers to the Wang numbering scheme (Wang et al., 2014). This flexible motion is facilitated by G2802.57f in the lower pocket and G288ECL1 in the upper pocket. Upon the entry of the cholesterol tail, an intermediate state, β (Figure 2d and f) is observed where cholesterol lies flat with respect to the membrane plane and forms extensive hydrophobic contacts with A2832.60f, I3173.28f, F3183.29f, and the disulfide bridge forming residues C3143.25f - C390ECL2.

To reach the TMD binding site, cholesterol must first ‘rock’ back to its upright pose (α*) from its flat conformation in the state β. This rocking motion is facilitated by the polar interactions between S3133.24f, Q2842.63f, and the alcoholic oxygen in cholesterol. The entry of cholesterol is thus captured by a αβα* transition. For further clarity, we have plotted the minimum energy path taken by cholesterol as it translocates along this pathway (Figure 2—figure supplement 3a, b). To further elucidate the position of cholesterol as it enters the protein, we projected the x and y coordinates of cholesterol on a free energy landscape (Figure 2b, and Figure 2—figure supplement 1a, b). In this figure, the state β clearly marks the transition state between cholesterol outside and inside the TMD. Overall, the angle between cholesterol and the x-y plane transitions from 90° → 0° → 90° for cholesterol to enter the protein.

To further dive into the details of cholesterol translocation, we designed mutations along Pathway 1 and measured the change in activity with respect to wild-type (WT) mouse Smoothened (mSMO). mSMO was chosen since there are no human SHH-responsive cell lines that can be passaged, edited, or transduced with genes. In addition, mSMO shows 92.8% sequence identity and 94.6% sequence similarity with hSMO for full-length sequences. Previous studies that have resolved hSMO structures have used mSMO for their structure-guided mutagenesis studies to comment on SMO activity (Byrne et al., 2016). Activity was measured using Gli1 mRNA fold change in the presence of SHH (Figure 3a). To further validate the mutations, simulations were performed on hSMO to compute the Potential of Mean Force (PMF) of cholesterol entry in the presence of mutations (Darve et al., 2008; Hénin et al., 2010; additional details presented in Methods). Here, the PMF can characterize the barriers associated with the translocation of cholesterol, and a difference in the peak value of PMF is presented for each mutant (Figure 3b). We mutated G2802.57f to valine - G2.57f V to test whether reducing the flexibility of TM2 prevents cholesterol entry into the TMD. Consequently, the activity of mSMO showed a decrease (Figure 3—figure supplement 1). However, this decrease could also be attributed to the steric hindrance added by the presence of a bulky propyl group in valine. We designed IECL2 A to check the importance of hydrophobic contacts during translocation along Pathway 1. However, this mutant did not affect the activity nor the barrier for translocation significantly (Figure 3b, Figure 3—figure supplement 2). Finally, we mutated A2832.60f to methionine - A2.60f M to test whether the presence of a bulkier residue would block translocation. Surprisingly, the effect on activity was not significant. When we calculated the PMF for cholesterol entry, A2.60f M mutant showed a restricted tunnel, but it did not fully block the tunnel (Figure 3—figure supplement 3). Therefore, the change in the PMF and experimentally measured activity was not significant (Figure 3b, c, Figure 3—figure supplement 2).

Figure 3 with 5 supplements see all
Effects of mutations along Pathways 1 and 2 on the activation of SMO.

(a) Gli1 mRNA fold changes show the responsiveness of SMO mutants to SHH. Untreated Gli1 levels indicate low SMO activity, while SHH-treated values correspond to the level of SMO activation induced by SHH ligand. A t-test with Welch’s correction was used to compute statistical significance. (p values: untreated vs treated: WT: 1.327 × 10-3, G2.57f V: 9.212 × 10-3, IECL2 A: 4.2 × 10-5, A2.60f M: 7.1 × 100-5, R5.64f A: 2.062 × 10-3, R5.64f Q: 1.192 × 10-3, F6.36f I: 2.163 × 10-3, L5.62f A: 1.948 × 10-3, treated WT vs treated mutant: G2.57f V: 9.1 × 10-3, IECL2 A: 0.02734, A2.60f M: 0.7477, R5.64f A: 0.08858, R5.64f Q: 0.02766, F6.36f I: 1.923 × 10-3, L5.62f A: 2.306 × 10 key: Not significant (ns) p > 0.05, *p ≤ 0.05, **p ≤ 0.01, ***p ≤ 0.001, and ****p ≤ 0.0001, All experimental data represent biological replicates, N=4.) (b) ΔGli1 mRNA fold change (SHH vs untreated) and Δ PMF (difference of peak PMF, calculated as PMFWT - PMFmutant) plotted for the mutants in Pathway 1. (c) Example mutant A2.60f M shows that cholesterol is able to enter SMO through Pathway 1 even on a bulky mutation. (d) Same as (b) but for Pathway 2 (e) Example mutant L5.62f A shows that cholesterol can enter SMO through Pathway 2 due to lesser steric hindrance. All snapshots presented are frames taken from MD simulations.

Cholesterol ‘flipping’ corresponds to the highest barrier in its translocation from the inner leaflet along Pathway 2

To quantitatively assess the validity of cholesterol translocation from the inner leaflet, we performed adaptive sampling simulations to obtain the associated free energy barriers. For Pathway 2, cholesterol first binds at the interface between TM5 and TM6 in the inner leaflet (Deshpande et al., 2019; Huang et al., 2018). In a structure resolved in 2022, cholesterol was observed at the interface between the protein and the membrane, in the inner leaflet, between TM5 and TM6 (Zhang et al., 2022). A striking observation is that this cholesterol binding site pose was never used as a starting point for simulations and was discovered independently from the pose described in Zhang et al., 2022 (Figure 4—figure supplement 1). The cholesterol in the inner leaflet has a downward orientation, with the polar hydroxyl group pointing intracellularly (η, Figure 4, Figure 4—figure supplement 1). Thus, if cholesterol has to translocate from the inner leaflet, it has to undergo ‘flipping’ motion, as the resolved structure with the cholesterol completely inside the TMD (Qi et al., 2020) shows the alcoholic moiety pointing towards the CRD (Deshpande et al., 2019; Figure 1a). The energetic barrier associated with flipping the motion of cholesterol can be observed by estimating the angle cholesterol makes with the x-y plane, and the barrier associated with translocation towards the TMD binding site can be estimated by projecting the data on the z-coordinate of cholesterol, similar to Figure 2a for Pathway 1.

Figure 4 with 1 supplement see all
The molecular events as cholesterol enters the TMD from the inner leaflet in Pathway 2.

(a) Free energy plot showing the angle of cholesterol with the x-y plane, the plane of the membrane, versus the z-coordinate of cholesterol for Pathway 2. The pathway followed by cholesterol is η. The experimental structures of SMO are shown as black polygons. (b) Free energy landscape of cholesterol’s y-coordinate plotted versus cholesterol’s x-coordinate for Pathway 2. Cholesterol interacts with residues in TM5-TM6 for Pathway 2 while entering the SMO core TMD. (e–h) Insets show cholesterol’s interactions with residues at the membrane-protein interface for Pathway 2. (c, e) show cholesterol outside the protein (η), while (d, f) show cholesterol entering the protein (θ) for Pathway 2. All snapshots presented are frames taken from MD simulations.

In Figure 4a, multiple free energy minima are observed. The state η corresponds to cholesterol outside of the TMD and pointing downwards, forming a –90° angle with the x-y plane. This observed pose is similar to the binding pose for cholesterol outlined in a previously resolved structure, further corroborating our observations (Zhang et al., 2022). The state ϵ corresponds to cholesterol at the membrane-TMD interface and forms a 45° angle with the x-y plane. Finally, the state α* corresponds to cholesterol inside the TMD and forming a+90° angle with the x-y plane. The state α*, which represents the TMD binding site, is the most stable pose of cholesterol inside SMO. According to the free-energy landscapes (Figures 2a, 4a and Figures 2c, d, 4c, d), the entry of cholesterol from the inner leaflet is associated with the highest barrier for the entire translocation process, ∼6.5 ± 0.8 kcal/mol and it corresponds to the transition between the states η and θ. These values are comparable to ATP-Binding Cassette (ABC) transporters of membrane lipids, which use ATP hydrolysis (–7.54 ± 0.3 kcal/mol; Meurer et al., 2017) to drive lipid transport from the membrane to an extracellular acceptor. Some of these transporters share the same mechanism as SMO, where the lipid from the inner leaflet is flipped and transported to the extracellular acceptor protein (Tarling et al., 2013). Additionally, for secondary active transporters that do not use ATP for the transport of substrates, a thermodynamic barrier of 5–6 kcal/mol has been reported in literature (Chan et al., 2022; Selvam et al., 2019; McComas et al., 2023; Thangapandian et al., 2025).

The computed values of free energy barriers are dependent on the projections of the data. Upon projecting the data using the z-coordinate of cholesterol versus the angle between the cholesterol and the x-y plane (Figure 4a) and the y-coordinate of cholesterol versus the x-coordinate (Figure 4b), the barrier between η and θ is different. On plotting the first two components of tICs (Figure 2—figure supplement 2a,c), we observe that the energetic barrier between η and θ is ∼6.5 ± 0.8 kcal/mol. We further validate that the slowest degrees of freedom in the model correspond to the entry of cholesterol from the inner leaflet of the membrane into SMO TMD (Figure 2—figure supplement 2b,d).

Contrary to Pathway 1, the entry of cholesterol into the TMD happens via the alcoholic moiety first, which forms a hydrogen bond with the backbone oxygen of R4215.64f (Figure 4e). Furthermore, the TM5 loses some of its helicity between residues R4215.64f-M4245.67f due to the flexibility provided by G4225.65f, which is a part of the conserved WGM motif implicated in SMO activation (Bansal et al., 2023). Here, we also designate the ‘upper’ and ‘lower’ pocket, which coordinate with the isooctyl and androsterolic moieties of cholesterol, respectively (Figure 4c–f). In the upper pocket, the isooctyl tail entry is blocked by the strong hydrophobic patch formed by Y4175.60f, L4195.62f, I4205.63f, F4576.38f, and F4606.41f (Figure 4c and e). Flexible residues such as glycine are also not present, which would allow for the fluctuations leading to the opening of the hydrophobic patch and enable entry of the cholesterol tail. Therefore, the androsterolic moiety of cholesterol enters first, followed by the isooctyl tail of cholesterol to reach the state ϵ. To stabilize and facilitate the androsterolic moiety’s entry, F4606.41f and F4556.36f form ππ contacts, and L4195.62f and L4526.33f form hydrophobic contacts (Figure 4d and f) in state θ. Once cholesterol has flipped in θ, it allows for further translocation towards the TMD binding site (α*) and finally arrival at the CRD site (ζ) with the hydroxyl group pointing extracellularly. The translocation of cholesterol from the membrane to the CRD binding site via Pathway 2 is captured by a η → θ → α* ζ transition. For further clarity, we have plotted the minimum energy path taken by cholesterol as it translocates along this pathway (Figure 2—figure supplement 3c, d). Overall, the angle between cholesterol and the x-y plane involves an entire cycle of –90° → 0° → 90° and cholesterol must flip for cholesterol to enter the protein.

Interestingly, mutants along Pathway 2 showed a significant decrease in activity compared to Pathway 1 (Figure 3a, d, Figure 3—figure supplement 4), along with an increased thermodynamic barrier for translocation (Figure 3d, Figure 3—figure supplement 5). Mutating R4215.64f to alanine or glutamine did not decrease SMO activity significantly (Figure 3d), because the interaction with cholesterol is mediated by the protein backbone, and not the side chain (Figure 4e). However, mutations like F6.36f I and L5.62f A reduce SMO activity. Their expression levels of these mutants are comparable to the wild-type mSMO (Figure 3—figure supplement 4). The mutants compared to WT SMO showed a significant increase in PMF, due to the lack of the hydrophobic π-stacking provided by F4556.36f and hydrophobic contacts provided by L4195.62f during cholesterol translocation. Overall, we report that the mutants for Pathway 2 show a decrease in the activity of SMO and show a strong correlation between the reduction in activity and the barrier for cholesterol translocation (Figure 3b and d). These results validate the role of critical residues involved in cholesterol translocation from the inner leaflet as observed in the simulations.

Based on our experimental and computational data, we conclude that cholesterol translocation can happen via either pathway. This is supported on the basis of the following observations: mutations along Pathway 2 affect SMO activity more significantly, and the presence of a direct conduit that connects the inner leaflet to the TMD binding site. In addition, a resolved structure of SMO in the presence of cholesterol (Zhang et al., 2022) shows a cholesterol situated at the entry point from the membrane into the protein between TM5 and TM6, in the inner leaflet. However, we also observe that Pathway 1 shows a lower thermodynamic barrier (5.8 ± 0.7 kcal/mol vs. 6.5 ± 0.8 kcal/mol, p = 0.0013). Additionally, PTCH1 controls cholesterol accessibility in the outer leaflet (Kinnebrew et al., 2021). This shows that there is a possibility for transport from both leaflets. One possibility that might alter the thermodynamic barriers is native membrane asymmetry, particularly the anionic lipid-rich inner leaflet. This presents as a limitation of our current model.

The pathway connecting the TMD to the CRD binding sites shows off-pathway intermediate

According to the dual-site model, to reach the binding site in the CRD (ζ), cholesterol translocates along the TMD-CRD interface from the TM binding site (α*) is required. This Common Pathway is shared by cholesterol molecules translocating from the inner and outer leaflets. The translocation of cholesterol from the TMD to the CRD binding site involves a linear movement of the androsterolic moiety through the extracellular end of the TMD, where cholesterol maintains a primarily upright position, with the polar androsterolic moiety pointing towards the CRD site. The energetic barrier associated with the transition (α* (TMD site) → ζ(CRD site)) can be visualized by plotting the z-coordinate of cholesterol versus the angle it forms with the x-y plane (Figures 2a and 4a). We observe that the highest barrier along the Common Pathway is ∼5.1 ± 0.3 kcal/mol, which is lower than the highest energetic barrier for cholesterol entry from the inner and outer leaflet. Another interesting observation is that the stability of the cholesterol in the TMD binding site (α*) is higher than the CRD binding site (ζ). This observation can be explained on the basis of the thermodynamic driving forces in the two pockets. The TMD binding site is composed mainly of hydrophobic residues, which form strong interactions with cholesterol (Figure 5—figure supplement 1) in contrast to the CRD binding site, where the cholesterol is exposed to the aqueous environment. The CRD binding site being exposed to the solvent increases the conformational entropy associated with the cholesterol compared to the restrained TMD binding site. In addition, the CRD binding site is more flexible than the TMD binding site in the absence of cholesterol, as reported previously (Kinnebrew et al., 2022). This increased flexibility of the CRD binding site further leads to the formation of multiple conformational states between the CRD and cholesterol.

Once cholesterol reaches the TMD-CRD interface, it can adopt multiple poses before reaching the CRD binding site (Figures 2a and 4a). Cholesterol at this position can be upright (δ, Figures 2a and 4a), where it interacts with F4846.65f, W4806.61f, V4886.69f, and L221LD, forming hydrophobic contacts (Figure 5a). However, there exists a thermodynamic barrier to take a completely upright path to the CRD binding site (angle ∼+90°). This is due to the presence of the long beta sheet of the linker domain (residues L197LD-I215LD) of SMO at the TMD-CRD interface, blocking the direct upright translocation of cholesterol (Figure 5—figure supplement 2). Hence, the major conformation of cholesterol at this position is slightly tilted, away from the plane of the membrane (γ, Figures 2a and 4a). Additionally, in γ, Y207LD creates a hydrophobic interaction with cholesterol, stabilizing the bent pose (Figure 5b). γ has been identified as the binding site of the synthetic agonist SAG in SMO (Qi et al., 2020). This provides further validation for γ being the major intermediate state in the Common Pathway.

Figure 5 with 2 supplements see all
Multiple positions of cholesterol as it translocates through the Common Pathway, including the off-pathway intermediate.

(a) upright (δ), (b) tilted (γ), (c) the overtilted off-pathway intermediate (ϵ), and (d) cholesterol at the CRD binding site (ζ). All snapshots presented are frames taken from MD simulations.

Since the degrees of freedom accessible to cholesterol at this point in the pathway are higher than at the TMD, cholesterol can undergo ‘overtilting’ as it approaches the CRD (ϵ, Figure 5c). This state (ϵ) is attributed to an off-pathway intermediate state in the cholesterol translocation process, raising the timescales required for cholesterol to translocate to the CRD binding site. This over-tilted pose is stabilized by hydrophobic interactions between the sterol and Y4876.78f, L221LD, and I509ECL3, and a hydrogen bond between the side chain of N511ECL3 and the alcoholic oxygen (Figure 5c). Once the cholesterol has crossed the TMD-CRD interface, it can reach the CRD sterol-binding site (ζ, Figure 5d) in the CRD. In summary, we identify numerous conformational states of cholesterol-bound SMO, which are distinct from the available structures of sterol-bound SMO.

We performed experimental mutagenesis to validate the critical residues along this pathway, from the TMD to the CRD binding site. These mutants - YLD A, F6.65f A, and IECL3 A, all showed a significant decrease in activity compared to WT SMO (Figure 6a, b, Figure 6—figure supplement 1) as well as an increase in peak PMF, suggesting that the force required to translocate cholesterol is higher than the WT residue in this position (Figure 6b, Figure 6—figure supplement 2). This further implies that the mutants reduce the activity of SMO by increasing the energetic barriers for cholesterol translocation. In particular, the effect is pronounced for YLD A, which forms hydrophobic contacts along the pathway (Figure 6c). F6.65f A showed a significant decrease in SMO activity, which can be attributed to a reduction in hydrophobic stabilizing contacts that enable cholesterol’s entry to the CRD (Figure 6d).

Figure 6 with 3 supplements see all
The effects of mutations along the translocation pathway connecting the TMD and CRD binding sites on the activation of SMO.

(a) Gli1 mRNA fold changes show the responsiveness of SMO mutants to SHH. Untreated Gli1 levels indicate low SMO activity, while SHH-treated values correspond to the level of SMO activation induced by SHH ligand. A t-test with Welch’s correction was used to compute statistical significance. (p values: untreated vs treated: WT: 3 × 10-6, YLD A: 2.46 × 10-4, F6.65f A: 1.08 × 10-3, IECL3 A: 1.12 × 10-4, treated WT vs treated mutant: F6.65f A: 1.6 × 10-5, IECL3 A: 1.6 × 10-5, YLD A: 1.4 × 10-5, key: Not significant (ns) p > 0.05, *p ≤ 0.05, **p ≤ 0.01, ***p ≤ 0.001, and ****p ≤ 0.0001, All experimental data represent biological replicates, N=4.) (b) ΔGli1 mRNA fold change (SHH vs untreated) and Δ PMF (difference of peak PMF, calculated as PMFWT - PMFmutant) are plotted for mutants along the TMD-CRD pathway. (c, d) Example mutants YLD A and F6.65f A show that cholesterol is unable to translocate through this pathway because of the loss of crucial hydrophobic contacts provided by Y207 and F484 and along the solvent-exposed pathway.

Overall, the entire cholesterol translocation process can be divided into two parts - entry from the membrane to the TMD binding site via Pathway 1 or 2, and translocation from the SMO TMD to the CRD binding site via the Common Pathway. The entire process is mapped out in Figure 6—figure supplement 3. The computational investigation shown here covers the dual-site model, where cholesterol reaches the CRD site via binding to the TM binding site first. In comparison to the CRD site, the TM site is more stable by ∼2 kcal/mol (Figure 2—figure supplement 3b, d). The experimental and computational analysis shows that both Pathway 1 and Pathway 2 are thermodynamically feasible pathways and exhibit similar energetic barriers for cholesterol to take from the membrane to the TMD binding site.

A squeezing mechanism for translocation of cholesterol in SMO

To further elucidate the structural changes that happen in SMO during cholesterol translocation, we sought to characterize the conformation of the hydrophobic tunnel inside SMO. We calculated the tunnel diameter along the channel as cholesterol traverses through the protein (Figure 7 and Figure 7—figure supplement 1). The tunnel calculations were done with cholesterol at different points along the pathway as indicated by the metastable states in the Markov state models - with cholesterol in the TMD binding site, cholesterol at the CRD-TMD interface, and cholesterol present at the binding site in the CRD. When we plot cholesterol’s position at different points along its transit through the TMD, we observe that the tunnel diameter varies along the z-coordinate as cholesterol moves through the channel. This corroborates that the tunnel radius is a function of the position of cholesterol in SMO. Furthermore, we observe that the tunnel radius shows a peak at z ∼ -7 when cholesterol is present in the core of the TMD (Figure 7a, b). In the rest of the tunnel, the average radius remains relatively small (Figure 7—figure supplement 2). This peak in diameter is seen to move along with the cholesterol position (Figure 7c, d). This provides evidence that SMO uses a squeezing mechanism to translocate cholesterol. The term ’squeezing’ here implies that the tunnel remains open only around cholesterol, and it closes as cholesterol moves away from its current position in the tunnel. Additionally, recent experimental work (Zhang et al., 2022) also suggests a squeeze-type mechanism for SMO. Once cholesterol has reached the binding site in the CRD, the tunnel in the TMD region is closed (Figure 7e, f, Figure 7—figure supplement 2).

Figure 7 with 2 supplements see all
The tunnel profile during cholesterol translocation in SMO.

(a) Free energy plot of the z-coordinate versus the tunnel diameter when cholesterol is present in the core TMD. The tunnel shows a spike in the radius in the TMD domain, indicating the presence of a cholesterol-accommodating cavity. (b) Representative figure for the tunnel when a cholesterol molecule is in the TMD. (c) Same as (a), when cholesterol is at the TMD-CRD interface. (d) same as (b), when cholesterol is at the TMD-CRD interface. (e) same as (a), when cholesterol is at the CRD binding site. (f) same as (b), when cholesterol is at the CRD binding site. Tunnel diameters are shown as spheres. Cholesterol positions are marked on plots using dotted lines. All snapshots presented are frames taken from MD simulations.

The cholesterol translocation mechanism of SMO shows similarities to the alternating access model proposed for substrate transport in membrane transporter proteins, where the transport tunnel closes behind the substrate to facilitate substrate transport across the membrane. Membrane transporters, including lipid transporters, use either an ion-based gradient or ATP hydrolysis to facilitate substrate transport. An ion binding site for SMO is currently unknown in contrast to Class A receptors, which have a sodium binding site (Katritch et al., 2014; Selvam et al., 2018; Dutta et al., 2022b). There is no experimental evidence of ion-coupling with the cholesterol export via SMO. Therefore, we posit that cholesterol translocation through SMO involves passive or concentration-dependent diffusion driven by a shift in the pool of accessible cholesterol, which rises once PTCH1 is inhibited. Thus, we provide a mechanistic overview of the dynamics of the cavity during cholesterol translocation in SMO.

The translocation of cholesterol occurs on a millisecond timescale

To give a perspective on the overall translocation process from a kinetic standpoint, we sought to calculate the timescales associated with the translocation of cholesterol from the membrane to the binding site in the CRD. Using a combination of Transition Path Theory and Markov state models, the reactive fluxes associated with each stage of the translocation cycle can be calculated. This enables us to calculate the mean first passage time (MFPT), which gives us an estimate of the timescales associated with the process (Figure 8). Pathway 1 was divided into two stages - the first being the translocation of cholesterol from the outer leaflet (Figure 8d) to the TMD - yielding a mean first passage time of 700 ± 122 μs. Here, the cholesterol tail has entered the TMD (Figure 8a). This was followed by cholesterol reaching the TMD binding site, with a timescale of 205 ± 41 μs (Figure 8b). For Pathway 2, cholesterol starts in the inner leaflet (Figure 8f). In the first stage, cholesterol undergoes flipping, which leads to a higher timescale than stage 1 of Pathway 1–823 ± 132 μs. This is followed by 122 ± 22 μs for cholesterol to reach the TMD binding site (Figure 8b).

The timescales associated with the translocation of cholesterol through SMO.

Each major intermediate state has been marked (a–f). Timescales were obtained by calculating the mean first passage time (MFPT) using the Markov state model. Errors in timescales are shown as subscripts. The arrows represent the relative flux for the translocation between subsequent steps. The overall process occurs at a timescale of ∼1 ms.

Once cholesterol has reached the TMD binding site, it is followed by translocation of cholesterol from the TMD to the CRD binding site with a timescale of 255 ± 36 μs. Overall, the calculated timescales for cholesterol to reach the CRD site from the inner/outer leaflet of the membrane are 1023 ± 223 μs (Pathway 1) and 1134 ± 188 μs (Pathway 2). These timescales are comparable to the substrate transport timescales of Major Facilitator Superfamily (MFS) transporters (Chan et al., 2022). Furthermore, several experimental studies have also resolved the millisecond-scale kinetics of MFS transporters (Blodgett and Carruthers, 2005; Körner et al., 2024; Bazzone et al., 2022; Smirnova et al., 2014; Zhu et al., 2019), further corroborating the results from our study. Interestingly, the timescales for the reverse process of translocating cholesterol from the CRD binding site to the membrane are higher for each step (Figure 8), indicating that the reverse process has a higher energetic barrier associated with it. This provides kinetic evidence that the overall translocation process from the membrane to the CRD binding site is thermodynamically favorable. Thus, it can be concluded that SMO facilitates the translocation of cholesterol.

Conclusion

In this study, we have used a combination of millisecond-scale atomistic molecular dynamics simulations, Markov state modeling, and experimental mutagenesis to describe the step-by-step process of cholesterol translocation through SMO. Previous structural studies have delineated multiple translocation pathways for cholesterol transport via SMO. In this study, we have examined the mechanism of cholesterol translocation from the membrane to the CRD sterol-binding site of SMO via two modes - the outer leaflet pathway, between TM2 and TM3, and the inner leaflet pathway, between TM5 and TM6. We quantitatively assess the thermodynamic barriers of the translocation pathways and estimate the timescales associated with the process. The key intermediate cholesterol-bound conformations of SMO and the role of specific residues in the translocation process were identified computationally and validated using both experimental and in silico mutagenesis.

We observe that cholesterol can move through two distinct conduits, starting at either the outer leaflet of the membrane (Pathway 1) or the inner leaflet (Pathway 2), followed by translocation to the cholesterol binding site in the CRD. In the first mode (Pathway 1), cholesterol enters the protein between TM2 and TM3 in the outer leaflet, and then it is translocated along the extended TM6 to reach the CRD binding site. The highest barrier along this first mode is associated with the cholesterol entry from the outer leaflet of the membrane to the TMD binding site. Similarly, the highest barrier for the second mode (Pathway 2) also involves cholesterol translocation from the inner leaflet membrane to the TMD of SMO. We show that the barriers associated with the pathway starting from the outer leaflet are lower by ∼0.7 kcal (p=0.0013). We also provide evidence that cholesterol can enter SMO via both leaflets, considering that multiple computational and experimental studies have found cholesterol entry sites and activation modulation via the outer leaflet, between TM2-TM3 (Hedger et al., 2019; Kinnebrew et al., 2021; Bansal et al., 2023). Other experimental and computational studies have proposed that cholesterol can enter SMO via the inner leaflet (Huang et al., 2018; Zhang et al., 2022). Overall, our work shows that cholesterol translocation from either pathway is feasible.

The second-highest barrier in cholesterol translocation is at the TMD-CRD interface, but it is ∼1 kcal/mol lower than the barrier for cholesterol entry. We also show that the cholesterol translocation process occurs via a squeezing mechanism that maintains the forward flux of cholesterol from the membrane to the CRD binding site. Overall, the translocation process takes place on the millisecond timescale with multiple intermediate states identified using simulations and supported by the sterol-bound structures of SMO.

Despite the extensive MD simulations reported in this study, there is still a need to further probe the endogenous “activation” of SMO by cholesterol in a position-dependent manner. Here, we have explored the role the CRD site plays in SMO activation. In addition, through simulating the CRD site-dependent SMO activation hypothesis, we have also simulated the TMD site-dependent activation. We show that the overall stability of cholesterol in the TMD site is higher than the stability of cholesterol in the CRD site by ∼2 kcal/mol. An alternative possibility states that the flexibility associated with the CRD would allow it to directly access the membrane and, consequently, cholesterol. In the extensive simulations reported in this study, the binding site of cholesterol in the CRD remains at least 20 Å away from the nearest lipid head group in the membrane, suggesting that such direct extraction and the bending of the CRD do not occur within the timescales sampled (Appendix 2—figure 6). The mechanistic details of this process are still unexplored and form the basis of future work. Additionally, GPCRs exist in a conformational equilibrium of the active and inactive states, and the job of the agonist (cholesterol) is to lower the barrier to activation and shift the equilibrium towards the active state. Our recent work (Kim et al., 2024) discusses how the binding position of cyclopamine modulates SMO activity. Cyclopamine acts as an antagonist when bound to SMO TMD and acts as an agonist when bound to SMO CRD. Cholesterol binding at different positions along the translocation pathway leads to a position-dependent modulation of SMO activity (Kinnebrew et al., 2022; Kumari et al., 2023). Kinnebrew et al. propose that binding of cholesterol to the TMD determines basal SMO activity, whereas binding to both the TMD and CRD is required for full, SHH-induced activity. In this study, we only focused on the cholesterol movement from the membrane to the CRD binding site. Therefore, a future investigation is needed to fully sample the activation process of SMO with cholesterol bound at different positions.

Our study provides computational and experimental evidence for the translocation of cholesterol along the conduit within SMO and outlines the entire translocation mechanism from a kinetic and thermodynamic perspective. Our findings provide a plausible model for how an increase in membrane cholesterol accessibility can activate SMO: by promoting the entry and movement of cholesterol along a conduit that traverses the center of the protein and ends at the CRD. Our study provides a framework for the development of drugs against oncogenic SMO that could function by blocking intermediate states of transport.

Methods

Key resources table
Reagent type (species) or resourceDesignationSource or referenceIdentifiersAdditional information
Cell line (Mus musculus)Smo-/- MEFsRohatgi et al., 2007Mouse embryonic fibroblasts lacking Smoothened
AntibodyAnti-GLI1 (Mouse monoclonal, clone L42B10)Cell Signaling TechnologyCat# 2643; RRID:AB_2294746Western Blot, 1:1000 dilution
AntibodyAnti-SMO (Rabbit polyclonal)Rohatgi et al., 2007RRID:AB_3738384Western Blot, 1:2000 dilution
AntibodyAnti-GAPDH (Mouse monoclonal, clone 1E6D9)ProteinTechCat# 60004–1-Ig; RRID:AB_2107436Western Blot, 1:10,000 dilution
Chemical compound, drugHigh-glucose DMEMThermo Fisher ScientificCat# SH30081FS
Chemical compound, drugFetal Bovine Serum (FBS)Sigma-AldrichCat# S11150
Chemical compound, drugSodium pyruvateGibcoCat# 11-360-070
Chemical compound, drugL-glutamineGeminiBioCat# 400106
Chemical compound, drugMinimum essential medium NEAA solutionGibcoCat# 11140076
Chemical compound, drugPenicillin / StreptomycinGeminiBioCat# 400109
Chemical compound, drugSigmaFast Protease inhibitor cocktail, EDTA-freeSigma-AldrichCat# S8830

Molecular Dynamics (MD) simulations

Simulation setup

Structures of SMO bound to sterols - 6XBL (Qi et al., 2020; SMO-CHL-1), 6XBM (Qi et al., 2020; SMO-CHL-2, SMO-CHL-3), and 5L7D (Byrne et al., 2016; SMO-CHL-4) were used as the starting structures for simulations. For SMO-CHL-1, the bound agonist SAG was removed. The two sterols occupying different positions in the tunnel in 6XBM were used to build 2 separate systems with cholesterol at different sites in the pathway (SMO-CHL-2, SMO-CHL-3). For SMO-CHL-2 and SMO-CHL-3, to account for the lack of CRD in the structure 6XBM, the sterol positions were aligned to the full-length SMO (6XBL, 0.8 Å RMSD from 6XBM) and the 24(S),25-epoxycholesterol for each system was replaced by cholesterol. For SMO-CHL-4, the inactivating mutation V3.40 F was mutated back to Wild Type. For all systems, any stabilizing antibodies and bound G proteins were removed. The missing residues in the intra/extracellular loop for every protein were modeled using MODELLER (Webb and Sali, 2016; Appendix 1—table 1). Termini for all proteins were capped using acetyl (ACE) and N-methylamino (NME) at the N- and C-termini to ensure neutrality. All four protein systems with cholesterols at different points along the translocation path were embedded in a lipid bilayer. The composition of the bilayer was set similar to mice cerebellum (Scandroglio et al., 2008; Appendix 1—table 2), to mimic physiological conditions, using CHARMM-GUI (Jo et al., 2008; Lee et al., 2019). Interactions between the atoms - bonded and non-bonded - were modeled using the CHARMM36 Force Field (Klauda et al., 2010; Best et al., 2012). TIP3P water (Jorgensen et al., 1983) and 0.15 M NaCl were used to solvate the system, to mimic physiological conditions. Non-protein hydrogen masses were repartitioned to 3.024 Da, to enable use of a longer timestep (4 fs; Hopkins et al., 2015). Starting points for Pathways 1 and 2 were chosen from already simulated data, according to the closest cholesterol distance from the respective helices (outer leaflet, TM2-TM3 for Pathway 1, lower leaflet, TM5-TM6 for Pathway 2). This was done once the rest of the pathway was completely explored.

Pre-production MD

All systems were subject to 50,000 steps of initial minimization. Further, minimization was performed for another 10000 steps by constraining the hydrogens using SHAKE (Andersen, 1983). Systems were then heated to 310 K to mimic physiological conditions, at NVT for 10 ns. This was followed by equilibration at NPT and 1 bar for 5 ns. Backbone constraints of 10 kcal/mol/Å2 were applied during NVT and NPT. Next, systems were equilibrated at NPT for 40 ns without constraints to ensure system stability. All pre-production steps were performed using AMBER18 (Case et al., 2020; Salomon‐Ferrer et al., 2013b; Case et al., 2005; Götz et al., 2012; Salomon-Ferrer et al., 2013a).

Production MD

Systems were then subjected to some initial sampling of 100 ns each. This was followed by clustering and performing adaptive sampling to enable a divide-and-conquer approach to sample the conformational landscape. Three rounds of sampling were performed on each system. This was followed by similar rounds of adaptive sampling on the distributed computing project Folding@Home (http://foldingathome.org). OpenMM 7.5.1 (Eastman et al., 2017) was used for running simulations on Folding@Home.

In all simulations, 4 fs was the chosen integration timestep. Particle Mesh Ewald (PME; Darden et al., 1993) method was used to account for long-range electrostatics. The cutoff for considering non-bonded interactions was set to 10 Å. Temperature was maintained using the Langevin Thermostat (Davidchack et al., 2009). Pressure was maintained using the Monte Carlo Barostat (Åqvist et al., 2004). All hydrogen bonds were constrained using SHAKE (Andersen, 1983).

Steered MD

Steered MD was performed to steer cholesterol from the membrane into the protein, for generating the starting frames for Pathway 1 and Pathway 2. This was done by first finding frames from the existing data, where cholesterols were closest to the respective starting points. Then, cholesterols were steered towards the center of TMD with the end point being the cholesterol binding site deep in the TMD, as resolved in the structure 6XBL. This was done using a steering force of 20 kcal/(mol Å2), over a course of 500 ns. The entire protein, except the helices involved at the entry (TM2-TM3: Pathway 1; TM5-TM6: Pathway 2), was constrained using two RMSD restraints during simulations to prevent any unphysical effects. The force constants used for RMSD restraints are as follows: 10 kcal/(mol Å2) for restraining residues from CRD to ICL1, and 35 kcal/(mol Å2) for restraining residues from ICL2 to helix 8 for Pathway 1. For Pathway 2, an RMSD restraint of 35 kcal/(mol e the effect of mutations on the choleste) was used to restrain residues from CRD to ECL2, and 10 kcal/(mol Å2) to restrain residues from ECL3 to helix8. Three replicates of steered MD were performed to ensure that the pathway explored converged. The frames generated from these runs were then used as seed frames to start simulations for exploring the entry of cholesterol into SMO. Steered MD was performed using NAMD (Phillips et al., 2020; Phillips et al., 2005). Each frame generated by Steered MD was minimized for 50000 steps, and then equilibrated for 40 ns, using AMBER using the same methodology for these seed frames as described in section Pre-Production MD.

Adaptive biasing force-based sampling for PMF generation

To elucidate the effect of mutations on the cholesterol translocation barriers, we used an adaptive-biasing force (ABF)-based sampling for generating the potential of mean force (PMF) profiles for each case and compared them to the WT translocation barriers. For generating the starting files for every mutant system, psfgen (Humphrey et al., 1996; Stone, 1998) was used. SMO WT PMF profiles were generated for each pathway, and mutant profiles were generated for their respective pathways. A biasing potential of 45 kcal/(mol Å2) was used for both lower wall and upper wall of the ABF potential. Each mutant was run for 3 replicates, and 107 samples were generated for each pathway to compute the PMF. NAMD was used for this purpose (Phillips et al., 2020; Phillips et al., 2005).

Adaptive sampling, feature selection, and clustering

The cholesterol translocation process in SMO was simulated in a stage-wise process, with the four starting points having cholesterol present at different points along the translocation pathway (Figure 1). To accelerate the sampling of the entire translocation process, simulations were performed using an approach that parallelizes the exploration of conformational space. Adaptive sampling (Hinrichs and Pande, 2007; Bowman et al., 2010; Kleiman et al., 2023) was utilized to achieve this acceleration, which uses an iterative sampling approach involving picking the next round of simulation starting points based on the current data. Several different types of machine learning and heuristic-based adaptive sampling approaches have been proposed in the literature (Kleiman et al., 2023; Kleiman and Shukla, 2023; Kleiman and Shukla, 2022; Shamsi et al., 2018; Shamsi et al., 2017; Zimmerman and Bowman, 2015; Weber and Pande, 2011). However, least count-based sampling, where the least visited states are chosen as the starting points for the next set of simulations, is among the best sampling strategies for exploration of the conformational free energy landscapes (Weber and Pande, 2011; Nadeem and Shukla, 2025). The following approach was used to collect the data:

  1. Initially, sampling was started from the four starting points with cholesterol at different points along the tunnel (Figure 1). Each starting point was simulated as a separate system.

  2. All systems were subject to Pre-Production MD (Refer Pre-Production MD).

  3. Following Pre-Production, all systems were subject to 200 ns of Production MD (Refer Production MD).

  4. The production data until this point was combined and clustered on the basis of Adaptive Sampling metrics (Appendix 1—table 3) using k-means clustering (for the first four rounds) and then mini-batch k-means later on. pyEMMA (Scherer et al., 2015) was used for this purpose.

  5. Frames were chosen from the clusters with the least populations and were used as seeds for the next round of simulations. This enabled a parallel iterative approach for sampling the conformational landscape (Appendix 1—table 4, Appendix 2—figure 1)

  6. Steps 4–5 were repeated until the entire landscape was explored.

The entire process of cholesterol translocation - from cholesterol present in the membrane to cholesterol bound to the CRD sterol-binding site - was simulated. A total of 2 ms of unbiased simulation data was collected using this approach.

Dimensionality reduction using tICA

Dimensionality reduction was performed on the high-dimensional data set using Time-Lagged Independent Component Analysis (tICA) (Schwantes and Pande, 2013). tICA uses a linear transformation to project the input data into a lower-dimensional basis set, the components of which approximate the slowest processes observed in the simulations. In our case, the slowest process simulated was identified by tICA as the translocation of cholesterol (Figure 2—figure supplement 2). The dataset was divided into 2 different groups, separately for Pathways 1 and 2. To further gain insights from the simulations, we constructed a Markov state model from the tICA-projected simulation data.

Markov State model construction

Markov State Model (MSM) is a kinetic modeling technique that uses short trajectory data that sample local transitions to provide a global estimate of the thermodynamics and kinetics of the physical process (Pande et al., 2010; Noé and Rosta, 2019; Prinz et al., 2011; Husic and Pande, 2018; Wang et al., 2018; Shukla et al., 2015). A MSM discretizes the dataset into kinetically distinct microstates and calculates the rates of transitions among such microstates. This methodology has been used extensively to investigate the conformational dynamics of membrane proteins, including GPCRs (Deganutti et al., 2024; Fleetwood et al., 2021; Dutta et al., 2022a; Dutta et al., 2022b; Chan et al., 2022; Kohlhoff et al., 2014; Kim et al., 2024; Bansal et al., 2023). To construct the Markov state model, the data collected from simulations were first featurized (Appendix 1—table 3). Pathways 1 and 2 were treated independently of each other. tICA was performed on the data to reduce the dimensionality of the input data and identify the slowest processes observed in the simulations. A Markov state model was then constructed on the 42 and 28 components from the tIC space for Pathways 1 and 2, respectively. To construct the models for both pathways, the following approach was used:

  1. The data was clustered into different numbers of clusters, ranging from 100 to 1000, and the implied timescales were calculated as a function of the MSM lag time. The lag time after which the implied timescales converged (30 ns) in both cases was chosen as the MSM lag time (Appendix 2—figure 2a, Appendix 2—figure 3a).

  2. Once the MSM lag times were chosen, a grid-search-based approach was chosen to compute the optimal number of clusters. Each constructed MSM was evaluated using a VAMP2 score (Wu and Noé, 2020), where the sum of squares of the top five eigenvalues was computed. Each pathway’s dataset was clustered into several clusters (200 - Pathway 1, 400 - Pathway 2) that gave the highest VAMP2 score (Appendix 2—figure 2b, Appendix 2—figure 3b).

  3. Once the MSMs were constructed, bootstrapping was performed, with 200 rounds and 80% of the data in each round, to compute the error associated with the probabilities.

  4. To validate the final MSM, a Chapman-Kolmogorov test was performed (Appendix 2—figures 4 and 5) to show the long-timescale validity of the Markovian property followed by the constructed models.

This information can be used to calculate the probability of each state, which can be used to recover the thermodynamic and kinetic properties of the entire ensemble. One of the use cases of the probabilities is that they can be used to reweigh the projected free energy of each datapoint along a said reaction coordinate, which has been used in Figures 2, 4 and 7. The errors in free energies were computed using the errors in the projected probabilities from the bootstrapped MSMs (Figure 2—figure supplement 1, and Figure 7—figure supplement 1).

Trajectory analysis and visualization

Trajectories were stripped of water and imaged before analysis to allow faster computation. cpptraj (Roe and Cheatham, 2013) was used for this purpose. For constructing the figures, VMD (Humphrey et al., 1996; Stone, 1998) and open-source PyMOL (Schroedinger, 2022) (rendering), MDTraj (McGibbon et al., 2015) (computing observables from trajectories), matplotlib (Hunter, 2007) and seaborn (Waskom, 2021) (Plot rendering), Numpy (Harris et al., 2020) (numerical computations), HOLE (Smart et al., 1993) (tunnel diameter calculations) were used.

Cell culture and cell line generation

Mouse embryonic fibroblasts (MEFs) lacking Smoothened (Smo-⁄-) were tested to ensure lack of endogenous SMO protein using immunoblotting, as described previously (Rohatgi et al., 2007). These Smo-⁄- MEF cells were used to generate stable cell lines expressing SMO mutants, which were then authenticated by immunoblotting to ensure stable expression of the transgene (Nachtergaele et al., 2013). Cell lines were confirmed to be negative for mycoplasma infection.

MEF cells were grown in high-glucose DMEM (Thermo Fisher Scientific, catalog no. SH30081FS) containing 10% FBS (Sigma-Aldrich, catalog no. S11150) and the following supplements: 1 mM sodium pyruvate (Gibco, catalog no. 11-360-070), 2 mM L-glutamine (GeminiBio, catalog no. 400106), 1× minimum essential medium NEAA solution (Gibco, catalog no. 11140076), penicillin (40 U/ml), and streptomycin (40 μg/ml) (GeminiBio, catalog no. 400109). This media, hereafter referred to as supplemented DMEM, was sterilized through a 0.2-μm filter and stored at 4°C.

To measure Hedgehog responsiveness by quantitative polymerase chain reaction (PCR) or western blotting, cells were seeded in 10% FBS-supplemented DMEM and grown to confluence. To induce ciliation, a requirement for Hedgehog signaling, cells were serum-starved in 0.5% FBS-supplemented DMEM and simultaneously treated with Sonic Hedgehog (SHH) for 24 hr before analysis.

Western blotting was carried out to assess SMO protein expression for all mutants. Briefly, whole-cell extracts were prepared in lysis buffer containing 150 mM NaCl, 50 mM Tris-HCl (pH 8), 1% NP-40, 1× protease inhibitor (SigmaFast Protease inhibitor cocktail, EDTA-free; Sigma-Aldrich, catalog no. S8830), 1 mM MgCl2, and 10% glycerol. After lysate clarification by centrifugation at 20,000 × g, samples were resuspended in 50 mM tris(2-carboxyethyl)phosphine and 1× Laemmli buffer for 30 min at 37°C. Samples were then subjected to SDS-polyacrylamide gel electrophoresis, followed by immunoblotting with antibodies against GLI1 [anti-GLI1 mouse monoclonal (clone L42B10); Cell Signaling Technology, catalog no. 2643, RRID:AB_2294746], SMO (rabbit polyclonal) (Rohatgi et al., 2007), or GAPDH [anti-GAPDH mouse monoclonal (clone 1E6D9); Protein tech, catalog no. 60004–1-Ig, RRID:AB_2107436].

Measuring Hedgehog signaling with quantitative PCR

Gli1 mRNA transcript levels were measured using the Power SYBR Green Cells-to-CT kit (Thermo Fisher Scientific). Gli1 levels relative to Gapdh were calculated using the Delta-Ct method CT(Gli1) - CT(Gapdh). The RT-PCR was carried out using custom primers for Gli1 (forward primer: 5′-ccaagccaactttatgtcaggg-3′ and reverse primer: 5′-agcccgcttctttgttaatttga-3′), and Gapdh (forward primer: 5′-agtggcaaagtggagatt-3′ and reverse primer: 5′-gtggagtcatactggaaca-3′).

Appendix 1

Appendix 1—table 1
Modeled residues in 5L7D-inactive-Apo-SMO starting structure.

The helical content of K440–I445 was modeled based on the structure of SANT1-bound SMO (PDB: 4N4W; Wang et al., 2014).

Modeled residues in 5L7D-inac-Apo-SMOConstraintsLocation
I429NoneICL3
K430NoneICL3
S431NoneICL3
N432NoneICL3
H433NoneICL3
P434NoneICL3
G435NoneICL3
L436NoneICL3
L437NoneICL3
S438NoneICL3
E439NoneICL3
K440α-helicalTM6
A441α-helicalTM6
A442α-helicalTM6
S443α-helicalTM6
K444α-helicalTM6
I445α-helicalTM6

Lipid composition

Appendix 1—table 2
Composition of the membrane used for embedding the protein in simulations.
Lipid NameUpper LeafletLower Leaflet
Cholesterol (CHL1)2121
1-palmitoyl-2-oleoylphosphatidylcholine (POPC)7676
Palmitoylsphingomyelin (PSM)44
Total101101

Distances used for trajectory featurization in adaptive sampling

Appendix 1—table 3
Adaptive Sampling metrics used for clustering during the iterative landscape exploration process.
W281M525V321M525W281V321
S278F526L325M525T241F274
D473E518D384S387D384Y394
Y322F391G277V321W281V321
V319Y323V276I320I316V319
N219P513L221I509Y487I509
S483N511Y207K395I215M301
N219W480W109Y130W109R161
E160R485I156E160L112I156
N114V210D209A492E208V488
CHLH470CHLN521CHLF391
CHLY394CHLS387CHLW281
CHLL522CHLN521CHLL325
CHLV404CHLT466CHLI408
CHLT528CHLV329CHLF462
CHLV463CHLF274CHLF332
CHLE518CHLD473CHLN521
CHLF391CHLY394CHLR400
CHLN521CHLL522CHLW281
CHLM525CHLP220CHLF484
CHLS387CHLH470CHLD384
CHLL515CHLV386CHLK395
CHLN219CHLF222CHLW480
CHLP513CHLL221CHLW109
CHLR161CHLI156CHLD95
CHLY130CHLI496CHLL112
CHLN114CHLK105CHLF166
CHLV210CHLL489CHLV157
CHLE160CHLL108CHLP164

Round-wise data collection cholesterol translocation

Appendix 1—table 4
Round-wise data collection for Cholesterol transport of SMO.
Simulation RoundAmount of Data
Round 120.38 μS
Round 2460.15 μS
Round 371.12 μS
Round 482.17 μS
Round 543.26 μS
Round 626.70 μS
Round 7508.16 μS
Round 8184.63 μS
Round 9118.69 μS
Round 10550.59 μS
Total2066.227 μS

Appendix 2

Roundwise data collection for SMO

Appendix 2—figure 1
Roundwise data collection for SMO Cholesterol transport as projected along the first two time-lagged independent components (tICs).

Round numbers are specified in the respective plots.

MSM Construction

Appendix 2—figure 2
MSM Construction for SMO Cholesterol transport - Pathway 1.

(A) Implied Timescales versus MSM Lagtime plot shows the convergence of timescales. A lag time of 30 ns was chosen for the MSM construction. (B) VAMP2 score versus Number of Clusters used for clustering the TICA-reduced data at three different variational cutoffs. The final MSM was made using 200 clusters and a 95% cutoff (corresponding to 42 tIC dimensions).

Appendix 2—figure 3
MSM Construction for SMO Cholesterol transport - Pathway 2.

(A) The Implied Timescales versus MSM Lagtime plot shows the convergence of timescales. A lag time of 30 ns was chosen for the MSM construction. (B) VAMP2 score versus Number of Clusters used for clustering the TICA-reduced data at three different variational cutoffs. The final MSM was made using 400 clusters and a 65% cutoff (corresponding to 28 tIC dimensions).

Appendix 2—figure 4
Chapman Kolmogorov Test for MSM validation - Pathway 1.

Chapman-Kolmogorov test was performed using the deeptime library (Hoffmann et al., 2022).

Appendix 2—figure 5
Chapman-Kolmogorov Test for MSM validation - Pathway 2.

The Chapman-Kolmogorov test was performed using the deeptime library (Hoffmann et al., 2022).

Appendix 2—figure 6
Distance of the binding site of cholesterol (marked by D95) from the nearest lipid headgroups in the membrane.

Kernel density plotted for 2.1 ms of simulation data. The binding site is at least 20 Å away from the nearest lipid headgroup in simulations.

Data availability

Python scripts used for analysis are available on GitHub (https://github.com/ShuklaGroup/Bansal_et_al_Cholesterol_Smoothened_2024, copy archived at Bansal, 2024). Relevant .npy files and .pdb files from each microstate of the MSM, used for the generation of plots, have been submitted to a Dryad Repository (https://doi.org/10.5061/dryad.76hdr7t4w). Due to the large size (>2 TB) of the molecular dynamics simulation trajectories generated for this study, they are available by contacting the corresponding author upon reasonable request.

The following data sets were generated
    1. Bansal P
    2. Kinnebrew M
    3. Rohatgi R
    4. Shukla D
    (2025) Dryad Digital Repository
    Multiple modes of cholesterol translocation in the human Smoothened receptor.
    https://doi.org/10.5061/dryad.76hdr7t4w

References

  1. Thesis
    1. Stone J
    (1998)
    An Efficient Library for Parallel Ray Tracing and Animation
    Master’s thesis, Computer Science Department, University of Missouri-Rolla.

Article and author information

Author details

  1. Prateek D Bansal

    Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, Urbana, United States
    Contribution
    Conceptualization, Resources, Data curation, Software, Formal analysis, Investigation, Visualization, Methodology, Writing – original draft, Writing – review and editing
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-1887-2320
  2. Maia Kinnebrew

    Department of Biochemistry, Stanford University School of Medicine, Stanford, United States
    Contribution
    Validation, Writing – review and editing
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-7344-8231
  3. Rajat Rohatgi

    1. Department of Biochemistry, Stanford University School of Medicine, Stanford, United States
    2. Department of Medicine, Stanford University School of Medicine, Stanford, United States
    Contribution
    Resources, Supervision, Funding acquisition, Validation, Writing – review and editing
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-7609-8858
  4. Diwakar Shukla

    1. Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, Urbana, United States
    2. Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, United States
    3. Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, United States
    4. Cancer Center at Illinois, University of Illinois at Urbana-Champaign, Urbana, United States
    Contribution
    Conceptualization, Resources, Supervision, Funding acquisition, Validation, Investigation, Methodology, Writing – original draft, Project administration, Writing – review and editing
    For correspondence
    diwakar@illinois.edu
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-4079-5381

Funding

National Science Foundation (OCI-0725070)

  • Prateek D Bansal
  • Diwakar Shukla

National Science Foundation (ACI-1238993)

  • Prateek D Bansal
  • Diwakar Shukla

National Institutes of Health (R35GM142745)

  • Prateek D Bansal
  • Diwakar Shukla

National Institutes of Health (GM118082)

  • Maia Kinnebrew
  • Rajat Rohatgi

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

The authors thank The Blue Waters Petascale Computing Facility and National Center for Supercomputing Applications, which is supported by the National Science Foundation (awards OCI-0725070 and ACI-1238993) and the state of Illinois. Blue Waters is a joint effort of the University of Illinois at Urbana-Champaign and its National Center for Supercomputing Applications. DS acknowledges support from NIH grant R35GM142745 and the Cancer Center at Illinois. RR acknowledges support from NIH grant GM118082. The authors also acknowledge support from citizen scientists providing computing hours for simulations performed on Folding@Home, which have enabled us to collect data on a large scale for this project.

Version history

  1. Preprint posted:
  2. Sent for peer review:
  3. Reviewed Preprint version 1:
  4. Reviewed Preprint version 2:
  5. Version of Record published:

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.108030. This DOI represents all versions, and will always resolve to the latest one.

Copyright

© 2025, Bansal et al.

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 663
    views
  • 36
    downloads
  • 0
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Prateek D Bansal
  2. Maia Kinnebrew
  3. Rajat Rohatgi
  4. Diwakar Shukla
(2026)
Multiple modes of cholesterol translocation in the human Smoothened receptor
eLife 14:RP108030.
https://doi.org/10.7554/eLife.108030.3

Share this article

https://doi.org/10.7554/eLife.108030