Single amino acid residue mediates reciprocal specificity in two mosquito odorant receptors
Abstract
The southern house mosquito, Culex quinquefasciatus, utilizes two odorant receptors, CquiOR10 and CquiOR2, narrowly tuned to oviposition attractants and well conserved among mosquito species. They detect skatole and indole, respectively, with reciprocal specificity. We swapped the transmembrane (TM) domains of CquiOR10 and CquiOR2 and identified TM2 as a specificity determinant. With additional mutations, we showed that CquiOR10A73L behaved like CquiOR2. Conversely, CquiOR2L74A recapitulated CquiOR10 specificity. Next, we generated structural models of CquiOR10 and CquiOR10A73L using RoseTTAFold and AlphaFold and docked skatole and indole using RosettaLigand. These modeling studies suggested space-filling constraints around A73. Consistent with this hypothesis, CquiOR10 mutants with a bulkier residue (Ile, Val) were insensitive to skatole and indole, whereas CquiOR10A73G retained the specificity to skatole and showed a more robust response than the wildtype receptor CquiOR10. On the other hand, Leu to Gly mutation of the indole receptor CquiOR2 reverted the specificity to skatole. Lastly, CquiOR10A73L, CquiOR2, and CquiOR2L74I were insensitive to 3-ethylindole, whereas CquiOR2L74A and CquiOR2L74G gained activity. Additionally, CquiOR10A73G gave more robust responses to 3-ethylindole than CquiOR10. Thus, we suggest the specificity of these receptors is mediated by a single amino acid substitution, leading to finely tuned volumetric space to accommodate specific oviposition attractants.
Editor's evaluation
This article addresses the mechanism of ligand specificity of odorant receptors (OR) through mutational analyses and structure prediction. Through solid data, the authors identify a single amino acid substitution that switches ligand specificity between two olfactory receptors. Obtaining structures of OR complexes has been challenging, so such an approach is valuable and will be of interest to scientists within the fields of chemical ecology and sensory neuroscience.
https://doi.org/10.7554/eLife.82922.sa0Introduction
Insects perceive the world with a sophisticated olfactory system essential for survival and reproduction. Their antennae are biosensors par excellence, allowing detection of a plethora of compounds, some with extraordinary sensitivity and selectivity (Kaissling, 2014). The insect olfactory system is comprised mainly of odorant-binding proteins, odorant-degrading enzymes, ionotropic receptors, and the ultimate gatekeepers of selectivity (Leal, 2013; Leal, 2016; Leal, 2020) – the odorant receptors (ORs) (Clyne et al., 1999; Gao and Chess, 1999; Vosshall et al., 1999). The ORs are the binding units in functional heteromeric cation channels (Neuhaus et al., 2005) formed with an odorant receptor coreceptor (Orco) (Larsson et al., 2004). Unlike mammalian olfactory receptors, insect ORs and Orco have inverse topologies compared to G-protein-coupled receptors (GPCRs), with a cytosolic N-terminus and an extracellular C-terminus (Benton et al., 2006; Lundin et al., 2007). One of the major breakthroughs in the fields of insect olfaction in the last two decades since the discovery of ORs (Leal, 2020) was the determination of the cryo-electron microscopy (cryo-EM) structure of the Orco homomer from the parasitic fig wasp, Apocrypta bakeri, AbakOrco (Butterwick et al., 2018). Subsequently, the structure for a promiscuous OR from the evolutionarily primitive (Apterygota, wingless) jumping bristletail, Machilis hrabei, MhraOR5, was solved (DelMarmol et al., 2021). Although structures of ORs from winged insects (Pterygota) have not been solved to date, amino acid residues critical for OR specificity have been reported (Auer et al., 2020; Cao et al., 2021; Hughes et al., 2014; Leary et al., 2012; Pellegrino et al., 2011; Yang et al., 2017; Yuvaraj et al., 2021). These studies focused on one-way alteration of specificity but did not examine how an insect detects two odorants with reverse specificity.
Mosquitoes are vectors of pathogens that cause tremendous harm to public health. Male and female mosquitoes visit plants to obtain nutrients for flight. For reproduction and survival of the species, females must acquire a blood meal to fertilize their eggs and, subsequently, oviposit in an aquatic environment suitable for the offspring to flourish. While feeding on hosts, females transmit viruses and other pathogens. The southern house mosquito, Culex quinquefasciatus, transmits pathogens causing filariasis and various encephalitis (Nasci and Miller, 1996). In the United States, mosquitoes belonging to the Culex pipiens complex transmit the West Nile virus (Andreadis, 2012). Due to its opportunistic feeding on avian and mammalian hosts, Cx. quinquefasciatus is a significant bridge vector in urbanized centers in the Western United States, particularly southern California (Andreadis, 2012; Syed and Leal, 2009). Female mosquitoes rely on multiple sensory modalities, including olfaction, to find plants, vertebrate hosts, and suitable environments for oviposition.
The genome of the southern house mosquito, Cx. quinquefasciatus (Arensburger et al., 2010), has the most extensive repertoire of OR genes (Leal et al., 2013) among mosquito species. The genomes of Anopheles darlingi, the malaria mosquito, Anopheles gambiae, the yellow fever mosquito, Aedes aegypti, the Asian tiger mosquito, Aedes albopictus, and the southern house mosquito contain 18 (Marinotti et al., 2013), 79 (Hill et al., 2002), 117–131 (Bohbot et al., 2007; Nene et al., 2007), 158 (Chen et al., 2015), and 180 (Arensburger et al., 2010) OR genes, respectively. Of those, 61 transcripts were found in An. gambiae (Pitts et al., 2011), 107 in Ae. aegypti (Matthews et al., 2018), and 177 in Cx. quinquefasciatus (Leal et al., 2013). Given that the number of odorants in the environment is larger than the number of OR genes even in Cx. quinquefasciatus, it is not surprising that many ORs from insects are promiscuous (Leal, 2013; Pask, 2020). However, ORs detecting behaviorally critical compounds (semiochemicals) may be narrowly tuned (Hughes et al., 2010; Nakagawa et al., 2005; Stensmyr et al., 2012).
ORs narrowly tuned to 3-methylindole (=skatole) and indole have been found in mosquito species in the subfamilies Culicinae and Anophelinae. Specifically, OR10 and OR2 have been de-orphanized in the southern house mosquito (Hughes et al., 2010; Pelletier et al., 2010), the yellow fever mosquito (Bohbot et al., 2011), and the malaria mosquito (Carey et al., 2010; Wang et al., 2010). Recently, these so-called indolergic receptors Bohbot and Pitts, 2015 have also been found in the housefly, Musca domestica (Pitts et al., 2021). Skatole and indole are fecal products that have been identified as oviposition attractants for the southern house mosquito (Blackwell et al., 1993; Mboera et al., 2000; Millar et al., 1992). Skatole is a potent oviposition attractant in minute doses, whereas indole is active only at high doses (Millar et al., 1992). To the human nose, skatole has a pungent fecal odor. In contrast, indole has an almost floral odor when highly purified and presented at low doses (Fenaroli, 1975), but a fecal odor at high doses. Remarkably, in all mosquito species and the housefly, OR10s are narrowly tuned to skatole and respond with lower sensitivity to indole, whereas indole is the most potent ligand for OR2s, which give lower responses to skatole. In Cx. quinquefasciatus, these receptor genes were initially named CquiOR10 and CquiOR2, respectively, renamed CquiOR21 and CquiOR121, but a proposition to restore the original names is under consideration (Carolyn McBride, personal communication). Here, we refer to these receptors from Cx. quinquefasciatus as CquiOR10 (VectorBase and GenBank IDs, CPIJ002479 and GU945397, respectively) and CquiOR2 (CPIJ014392, GU945396.1).
These oviposition attractant-detecting (Chen and Luetje, 2014) receptors in Cx. quinquefasciatus, CquiOR10 and CquiOR2, provide a suitable model to identify specific determinants as they respond to skatole and indole with reverse specificity. CquiOR10 and CquiOR2 proteins have 377 and 375 amino acid residues, respectively, and share only 49.5% amino acid identity, whereas 61.9% of the amino acids in the predicted transmembrane domains are identical (Supplementary file 1, Table 1). To identify the specificity determinants of these receptors, we swapped transmembrane (TM) domains and tested the chimeric receptors using the Xenopus oocyte recording system. Given that CquiOR10 is the second most sensitive Cx. quinquefasciatus OR (second only to CquiOR36; Choo et al., 2018), we replaced the predicted (Reynolds et al., 2008) transmembrane domains from CquiOR2 into CquiOR10 and measured the specificity of the chimeric receptors. With this approach, we identified TM2 as a specificity determinant. Next, we tested mutations of chimeric CquiOR10 and identified a single amino acid residue in TM2 (Ala-73) that determines the receptor’s specificity. We then directly mutated the wildtype receptor and observed that CquiOR10A73L is specific to indole as CquiOR2. Additionally, CquiOR2L74A emulated the response profile of CquiOR10. To better understand the structural basis of this single-point specificity determinant, we generated structural models of CquiOR10 and CquiOR10A73L using RoseTTAFold (Baek et al., 2021) and AlphaFold (Jumper et al., 2021). We identified the binding poses for skatole and indole using RosettaLigand molecular docking and observed a finely tuned volumetric space to accommodate specific oviposition attractants.
Results
Chimeric OR with reversed specificity
We envisioned that studying a pair of ORs with reverse specificities, like CquiOR10 and CquiOR2, could lead us to specificity determinants. CquiOR10 is activated by the oviposition attractant skatole (Blackwell et al., 1993; Mboera et al., 2000; Millar et al., 1992) with high specificity (Figure 1A), whereas CquiOR2 is specific to indole (Figure 1B).
Our approach was designed to swap TM domains using the more sensitive receptor, CquiOR10, as the acceptor. Specifically, we generated chimeric receptors by replacing CquiOR10 TM domains with related domains from CquiOR2 (Figure 2A). During the life of this project, the cryo-EM structure of an odorant receptor coreceptor AbakOrco from the parasitic fig wasp, Aprocrypta bakeri, was reported (Butterwick et al., 2018). We then compared the experimental structure (Butterwick et al., 2018) with the predicted topology for AbakOrco using the same OCTOPUS method (Viklund and Elofsson, 2008) we used to identify CquiOR10 and CquiOR2 TMs (Figure 2A; Viklund and Elofsson, 2008). The almost perfect overlap between OCTOPUS prediction and the AbakOrco structure (Figure 2B) validated not only our TM predictions (Figure 2A), but also the 21 chimeric ORs already tested when the structure of the coreceptor AbakOrco (Butterwick et al., 2018) was reported.
We referred to these chimeric receptors as CquiOR10Mx, where Mx refers to TMx from CquiOR2. We performed functional assays of these CquiOR10Mx receptors using the Xenopus oocyte recording system. This long-term project could require as many as 127 possible chimeric receptors. We started by swapping all seven TM domains. We envisioned that this chimeric receptor would have a reverse specificity. If so, we would restore one TM at a time to identify critical domains. It turned out that CquiOR10M1,2,3,4,5,6,7 was silent (see Appendix 1, Supplementary file 1, Table 2). To minimize the number of tested mutants, we changed the strategy to start from single mutations to obtain educated guess for the subsequent design of mutants. With this approach, we generated and tested only 8 of the required 99 mutants with 7–3 TMs swapped. We tested 36 chimeric receptors (see Supplementary file 1, Table 2, and Figure 1—figure supplement 1). Fourteen chimeric receptors did not respond to skatole or indole, and 21 receptors retained the specificity to skatole (Supplementary file 1, Table 2, and Figure 1—figure supplement 1). Lastly, CquiOR10M2,7/CquiOrco-expressing oocytes responded to both skatole and indole with a reverse profile (Figure 3A). This dataset shows that CquiOR10M2,7 emulated the profile of the indole receptor CquiOR2 (Figure 1B).
A single-point mutation that reverses the specificity of the skatole and indole receptors
We proceeded to identify the amino acid residues in the swapped domains of CquiOR10M2,7, directly affecting the specificity of the chimeric and wildtype receptors. Given the observation that, by and large, chimeric receptors with TM5 and TM6 from CquiOR2 gave stronger responses (Figure 1—figure supplement 1), we asked whether CquiOR10M2,7 responses with these two additional TM domains swapped would give more robust responses while keeping the same specificity to indole. CquiOR10M2,5,6,7/CquiOrco-expressing oocytes were indeed more sensitive while maintaining the selectivity to indole (Figure 3B). We then used CquiOR10M2,5,6,7 and designed various mutants to rescue single or multiple residues in TM2 at a time (Figure 4). We focused on TM2 because swapping TM7 did not affect the specificity of the receptor (Figure 1—figure supplement 1E). We divided TM2 into outer, middle, and inner segments based on the topology predicted by OCTOPUS (Viklund and Elofsson, 2008).
It has been postulated that the extracellular halves of TM domains form an odorant-binding pocket (Guo and Kim, 2010); thus, we first examined a mutant (CquiOR10M2,5,6,7_Outer) having the residues in the outer segment at 59, 60, 63, and 64 restored to Glu, Val, Asn, and Ala, respectively, as in the wildtype receptor (Figure 4). CquiOR10M2,5,6,7_Outer/CquiOrco-expressing oocytes retained the specificity of CquiOR10M2,5,6,7 with a more robust response to indole than skatole (Figure 3C). These findings indicated that the residues in the outer segment of TM2 are not specificity determinants. After that, we tested a chimeric receptor with the residues in the middle and inner part of the TM2 domain rescued to match those in the wildtype receptor (Figure 4). CquiOR10M2,5,6,7_Mid;Inner restored the skatole-specific profile of CquiOR10 (Figure 3D), thus suggesting that amino acid residues in these segments are specificity determinants. Then, we tested CquiOR10M2,5,6,7_Inner (=CquiOR10M2,5,6,7L73A;T78I), which had only residues at 73 and 78 rescued to the wildtype Ala and Ile, respectively. Oocytes co-expressing CquiOR10M2,5,6,7_Inner and CquiOrco reverted the specificity to skatole (Figure 3E). As a result, we concluded that residues in the predicted inner part of TM2 are critical for the chimeric receptor’s specificity. We further probed the chimeric receptor CquiOR10M2,5,6,7 with single-point mutations to identify the residue(s) determining specificity. CquiOR10M2,5,6,7T78I/CquiOrco-expressing oocytes showed the same specificity as the chimeric receptor CquiOR10M2,5,6,7 (Figure 3F). Specifically, CquiOR10M2,5,6,7T78I gave a more robust response to indole than skatole, suggesting that rescuing the residue at 78 did not affect CquiOR10M2,5,6,7 specificity. By contrast, CquiOR10M2,5,6,7L73A/CquiOrco-expressing oocytes reverted the specificity to skatole (Figure 3G), thus behaving like the wildtype receptor CquiOR10 (Figure 1A). To further examine the role of Ala-73 as a specificity determinant residue, we obtained a single-point mutation of CquiOR10M7, which is specific to skatole (Figure 1—figure supplement 1E). The responses recorded from CquiOR10M7A73L/CquiOrco-expressing oocytes showed a reverse, indole-specific profile (Figure 3H), like the CquiOR2 profile (Figure 1B). Having identified a single amino acid residue in the chimeric receptor that switches the skatole/indole specificity, we tested the effect of single-point mutation on the specificity of the wildtype receptor CquiOR10 (EC50: skatole, 3.6 µM; indole 29.9 µM). CquiOR10A73L showed a reverse specificity, with dose-dependent responses to indole (Figure 3I) (EC50: indole, 3.4 µM; skatole 53.7 µM). Collectively, these findings suggest that a single amino acid residue in CquiOR10 determines the specificity of this receptor. Additionally, we obtained an equivalent single-point mutation in the indole-specific CquiOR2 (Figure 1B) (EC50: indole, 7.7 µM; skatole 16.4 µM). Thus, CquiOR2L74A/CquiOrco-expressing oocytes gave robust and specific responses to skatole (Figure 3J) (EC50: skatole, 8.5 µM; indole 27.6 µM).
As summarized in a graphical representation (Figure 3—figure supplement 1), these findings demonstrate that these two mosquito odorant receptors, CquiOR10 and CquiOR2, have reciprocal specificity mediated by a single amino acid residue, Ala-73 and Leu-74, respectively.
We also recorded the response of these ORs to other phenolic ligands that activate indolic receptors, albeit generating small currents. While CquiOR10 and CquiOR2 responded to phenol, 3,5-dimethylphenol activated only CquiOR10 (Figure 5A). A single-point mutation in CquiOR10 rendered the chimeric receptor insensitive to 3,5-dimethylphenol. By contrast, an equivalent mutation in CquiOR2 recapitulated the profile of CquiOR10 (Figure 5A).
Additionally, we recorded responses elicited by methylindoles. Specifically, we challenged oocytes with 1-methylindole, 2-methylindole, 4-methylindole, 5-methylindole, 6-methylindole, and 7-methylindole. In these analyses, we did not stimulate the oocyte preparations with 3-methylindole to avoid possible desensitization. CquiOR10/CquiOrco-expressing oocytes elicited stronger responses when challenged with 1-methylindole and 5-methylindole than when stimulated with the other methylindoles (Figure 5B). By contrast, CquiOR2/CquiOrco-expressing oocytes elicited similarly lower responses when stimulated with methylindoles (Figure 5C). CquiOR2L74A with a single-point mutation to mimic OR10 receptor recapitulated CquiOR10 response profile (Figure 5D). These data suggest that a single amino acid residue determines a receptor’s specificity toward ligands eliciting robust or small responses.
CquiOR10 computational modeling suggests space-filling constraints for indole-based odorants around A73
To structurally hypothesize the above-described reciprocal specificity, we generated structural models of CquiOr10, CquiOR2, CquiOR10A73L, and CquiOR2L74A using RoseTTAFold (Baek et al., 2021) and a structural model of CquiOR10 using AlphaFold (Jumper et al., 2021; Figure 6).
CquiOR10 models of one homotetramer subunit were generated with AlphaFold and RoseTTAFold, each producing five models. RoseTTAFold models of CquiOR10, CquiOR10A73L, CquiOR2, and CquiOR2A73L produced a transmembrane helix root mean square deviation (RMSD) between α-Carbon atoms less than 1 Å across all 20 models (five models per odorant receptor); this suggests that the homologous CquiOR10 and CquiOR2 are structurally similar and that single-point mutations should not cause great structural deviation from wildtype. Comparing CquiOR10 models, RoseTTAFold and AlphaFold produced a transmembrane helix RMSD of 1.7 Å, with the transmembrane helices at the extracellular membrane face having the largest structural deviation. Considering there is little structural knowledge of insect odorant receptors, their binding mechanisms, and their conformational changes, we suspected that homologous odorant receptors would have similar binding modes. Further, pairwise sequence alignment suggests that a series of residues in MhraOR5 TM4 aligns with CquiOR10 TM2, which contains CquiOR10A73 (CquiOR10: 59EVI-INAYFAMIFFNAV74. MhraOR5: 199EVIAIYEAVAMIFLITA215.; Figure 6—figure supplement 1) while AlphaFold and RoseTTAFold models of CquiOR10 were broadly similar to MhraOR5 (Figure 6—figure supplement 2). Using transmembrane helix 7b (TM7b), we superimposed the top-ranking CquiOR10 RoseTTAFold and AlphaFold models with an experimentally resolved structure, M. hrabei (MhraOR5) in complex with eugenol (PDB ID: 7LID; DelMarmol et al., 2021) to identify which of our models resembled an odorant-bound conformation. With this selection criteria, we proceeded with RoseTTAFold models of the odorant receptors for Rosetta-based small-molecule docking method RosettaLigand (Davis and Baker, 2009; DeLuca et al., 2015) as the structural similarity around the hypothesized binding pocket was greater than the AlphaFold models of the odorant receptors compared with the MhraOR5 structure. We chose to select conformationally similar models over modeling and docking an apo structure into a bound conformation because it is a more cautious approach when there is little structural information. We perceived modeling an apo structure into a bound conformation to potentially yield more biologically implausible conformations than docking of a structurally comparative model.
To verify that RosettaLigand could effectively sample odorants in receptors homologous to CquiOR10 and CquiOR2, we used the structure of eugenol in complex with the insect odorant receptor OR5 from MhraOR5 (PDB ID: 7LID) as a control (DelMarmol et al., 2021). Rosetta protein-ligand docking employs energy-based analyses, such as the interface energy between protein and ligand, to select the representative models (Bidula et al., 2022). With this selection method, the RMSD of our MhraOR5-eugenol models relative to the experimental structure ranged from 0.5 to 5.0 Å (Supplementary file 1, Table 3). RMSD values equal or below 2.0 Å are considered an appropriate range for validation (Park et al., 2021). After using hdbscan cluster analysis (McInnes et al., 2017) to group structurally similar models, the largest cluster had a RMSD of 0.75 Å while the lowest interface-energy model had a RMSD of 2.4 Å. Collectively, these data demonstrate that RosettaLigand paired with the hdbscan clustering method can recapitulate the MhraOR5 structure and blindly select a near-native model, thus is suitable for structural predictions of odorants with CquiOR10A73L and CquiOR2L74A (Figure 6—figure supplements 3 and 4).
For each receptor–ligand complex (CquiOR10-skatole, CquiOR10A73L-skatole, CquiOR10-indole, and CquiOR10A73L-indole), we generated 100,000 docking models using RosettaLigand, clustered the 10,000 lowest interface-energy models, and selected the lowest interface-energy model from the 10 largest clusters, resulting in 10 models per receptor–ligand complex from which to draw structural hypotheses (Supplementary file 1, Tables 4 and 5). Our modeling suggests that both indole and skatole can readily reorient themselves in a similar pore depth near residue 73, regardless of mutant or wildtype receptor. Comparing the lowest interface-scoring model from each receptor–ligand complex, indole and skatole are positioned in the membrane-embedded pore, flanked by transmembrane helices S2, S4, S5, and S6, and show positional overlap in both and CquiOR10-A73L CquiOR10 (Figure 7, Figure 7—figure supplements 1–5).
In most models, skatole and indole form contacts with CquiOR10 and CquiOr10A73L in a similar plane about a center of rotation. These observations are supported by skatole and indole not containing rotatable bonds, thus relying on rigid translational movements and rotation to form favorable contacts with the rotatable and repackable receptor residues. Additionally, our models position indole and skatole within a series of nonpolar, polar-uncharged, and aromatic amino acids. Protein–ligand interaction profiler (PLIP) analysis (Adasme et al., 2021) suggests that the bulk of favorable interactions are nonpolar, occasional hydrogen bonding with the odorant NH group, and occasional parallel pi stacking, with the ligand-binding pocket formed by TMs 2, 4, 5, and 6 (Figure 7—figure supplement 6). Akin to eugenol forming hydrophobic contacts with MhraOR5/Ile-213 from TM4 (Figure 6—figure supplement 3), skatole and indole formed hydrophobic contacts with CquiOR10/Asn-72 from TM2 (Supplementary file 1, Table 8), which are matched pairs form Needleman–Wunsch pairwise alignment (Figure 6—figure supplement 1). We find of most importance skatole and indole not forming contacts with Ala-73 in CquiOR10 models (Supplementary file 1, Tables 6–14). By contrast, in the CquiOR17A73L models, skatole formed hydrophobic contacts with Leu-73 in 5 of the 10 representative models, while indole formed contacts with Leu-73 in 2 representative models (Supplementary file 1, Table 8). This suggests that Ala-73 may indirectly affect specificity by modulating the volume of the binding pocket (see Appendix 3).
Structurally aligning CquiOR10 and CquiOR10A73L receptors by TM7b (Supplementary file 1, Table 15), demonstrates approximately a 1 Å α-carbon outward shift of A73L (Supplementary file 1, Table 16), suggesting a tightly constrained space in CquiOR10 and an expanded space in CquiOR10A73L relative to the protein backbone (Figure 8).
This difference in binding pocket volume is not ligand-induced, but rather independent of skatole or indole binding. Consistent with our structural hypothesis of space constraints, mutation of Ala-73 in CquiOR10 to Ile or Val negatively affected receptor function, whereas CquiOR10A73G retained specificity and showed higher sensitivity. Specifically, CquiOR10A73I/CquiOrco- and CquiOR10A73V/CquiOrco-expressing oocytes did not respond to skatole or indole (Figure 8—figure supplement 1A and B). We concluded that mutations with these bulkier residues caused loss of binding to indole or skatole, given that these receptors were functional, as indicated by the potent responses elicited by the Orco ligand candidate OLC12, 2-{[4-Ethyl-5-(4-pyridinyl)–4 H-1,2,4-triazol-3-yl]sulfanyl}-N-(4-isopropylphenyl)acetamide (Chen and Luetje, 2012), also known as VUAA-3 (Taylor et al., 2012). As previously demonstrated, OR-Orco complexes are more sensitive to activation by Orco agonists than are the Orco homomers (Chen and Luetje, 2012; Chen and Luetje, 2013; Choo et al., 2018; Hughes et al., 2017; Chen and Luetje, 2014). It is, therefore, conceivable that the complexes (Figure 8—figure supplement 1A, B) were expressed but the binding sites were defective.
On the other hand, CquiOR10A73G/CquiOrco-expressing oocytes showed the same skatole specificity as the wildtype receptor (Figure 8—figure supplement 1C). Interestingly, skatole elicited stronger currents recorded from CquiOR10A73G than the wildtype receptor CquiOR10 (Figure 8—figure supplement 2), further implying a higher affinity for skatole when there is a reduced constraint, or increased volume, of the binding pocket.
Since CquiOR2 is homologous to CquiOR10, we also propose that our space constraint structural hypothesis can be applied to CquiOR2. CquiOR2 has 49.5% sequence identity and 71.7% sequence similarity to CquiOR10. CquiOR2 also has physiochemically matched pairs to CquiOR10 residues speculated to form contacts with skatole/indole in our study (Supplementary file 1, Table 17). While we did not perform CquiOR2 docking, we indirectly examined space constraints in CquiOR2 by mutating Leu-74 to Ile, Val, or Gly. We then tested oocytes co-expressing CquiOrco with CquiOR2L74I, CquiOR2L74V, or CquiOR2L74G (Figure 8—figure supplement 1D–F). Consistent with the relaxation of the space constraints, CquiOR2L74I retained the specificity of the wildtype receptor to indole, whereas CquiOR2L74G showed a reverse specificity profile (Figure 8—figure supplement 1F), that is, more robust response to skatole than indole. Interestingly, CquiOR2L74V/CquiOrco-expressing oocytes generated nearly equal, albeit small, currents when stimulated with the two oviposition attractants (skatole, 8.7 ± 2.7 nA; indole, 9.0 ± 2.5 nA at 100 µM; n = 3, p>0.9999, Wilcoxon matched-pairs signed-rank test).
Next, we tested the space constraints hypothesis with a bulkier ligand, 3-ethylindole. A CquiOR10 mutant with a less space-filling residue, CquiOR10A73G, elicited responses to 3-ethylindole (526 ± 110 nA) higher than the responses to indole (205 ± 81 nA, at 100 µM), although less robust than the skatole responses (1939 ± 142 nA; all at 100 µM) (Figure 9—figure supplement 1). By contrast, receptors with a bulkier residue (CquiOR2WT, CquiO10A73L, and CquiOR2L74I) did not respond to 3-ethylindole (Figure 9). However, CquiOR2 mutant with less space-filling residues, CquiOR2L74A and CquiOR2L74G, responded to 3-ethylindole in a dose-dependent manner (Figure 9). Additionally, CquiOR10A73G elicited dose-dependent strong responses to 3-ethylindole than CquiOR10 (Figure 9).
In summary, the findings that a bulkier, non-natural ligand, elicited more robust responses when CquiOR10 and CquiOR2 residues at 73 and 74, respectively, were mutated into Gly are consistent with the space constraints hypothesis.
Discussion
Our mutation studies demonstrated that a single amino acid residue substitution in two narrowly tuned ORs from the southern house mosquito can revert their specificity to the oviposition attractants skatole and indole. Amino acid residues leading to one-way alterations of insect ORs have been previously reported (Auer et al., 2020; Cao et al., 2021; Hughes et al., 2014; Leary et al., 2012; Pellegrino et al., 2011; Yang et al., 2017; Yuvaraj et al., 2021). Here, we demonstrated that switching a single amino acid residue in two mosquito ORs reverses the specificity of these receptors to two physiologically and ecologically significant odorants. Understanding how mosquito odorant receptors detect oviposition attractants may lead to the development of potent lures. Trapping female that already had a blood meal is an invaluable tool for surveillance given that these gravid females are likely to carry virus circulating in an area.
It has been proven challenging to obtain cryo-EM structure of insect ORs. Hitherto, the only experimentally solved structures of insect receptors are AbakOrco, a co-receptor from the parasitic fig wasp, A. bakeri (Butterwick et al., 2018), and MhraOR5, an OR from the jumping bristle, M. hrabei (DelMarmol et al., 2021). While structures of ORs from more evolved winged insects (Pterygota) are yet to be experimentally determined, the most accurate structural modeling methods, such as Rosetta, RoseTTAFold, and AlphaFold, allow us to get a better understanding on how these receptors interact with ligands. Here, we obtained unambiguous experimental evidence that Ala-73 plays a crucial role in CquiOR10 specificity to skatole. We then resorted to modeling to put forward structural hypotheses that can be tested using available experimental approaches and validated by high-resolution structures in the future. Using RoseTTAFold and AlphaFold, we generated models of CquiOR10 followed by RosettaLigand docking of skatole and indole to generate structural hypotheses (Figure 6, Figure 7, Figure 7—figure supplements 1–8). RoseTTAFold (Baek and Baker, 2022) and AlphaFold (Jumper et al., 2021) are highly accurate and state-of-the-art tools for protein structure prediction. On the other hand, RosettaLigand (Davis and Baker, 2009, DeLuca et al., 2015) is a competitive protein docking method (Smith and Meiler, 2020) that has been used previously to predict the binding position of ligands within protein pores (Craig et al., 2020; Nguyen et al., 2017; Nguyen et al., 2019; Pressly et al., 2022). Of note, a structure of an OR-Orco heterocomplex has yet to be elucidated. It has been postulated that they adopt the same overall architecture as the Orco homomeric channel (Butterwick et al., 2018), with one or more Orco subunits being replaced by an OR. Additionally, the structure of the ‘stand alone’ MhraOR5 is remarkably similar in the fold of the helical subunits and the quaternary structure formed by the four subunits within the membrane plane (DelMarmol et al., 2021). Therefore, it is reasonable to assume that ligand–receptor interactions analyzed with homomers reflect the same interactions with heteromers.
RoseTTAFold modeling and RosettaLigand docking studies suggest that the specificity determinant amino acid residue (Ala-73) did not form direct contacts with ligands but rather provided a finely tuned, sensitive volumetric space to accommodate odorants. Interpreting CquiOR10-A73L as an expanded space is counterintuitive compared with our oocyte recordings; the wild-type CquiOR10 receptor responded better to the bulkier skatole (Figure 1A), while CquiOR10-A73L responded better to the smaller indole (Figure 2I).
Considering both experimental and modeling data, the reason for differing receptor responses at residue 73 could be due to the repacking of this position to accommodate more carbon sidechain atoms that are reducing the binding pocket volume, or shift key, nearby residues required for receptor response. The fact that indole is a rigid molecule, and that the methyl group of skatole is the only rotamer, could also suggest that the binding pocket around position 73 is tightly regulated. The addition of a methyl group could prevent skatole from occupying the appropriate configuration for receptor response with Leu-73 constrained volume (Appendix 4). Combined, our results suggest that the residue at 73 provides a finely tuned volumetric space to accommodate specific oviposition attractants. Future structures can test the space constraints hypothesis proposed here by validating the specific residues interacting with odorants, quantifying the binding pocket volume, and providing structural insight into how position 73 modulates receptor response to specific odorants.
Taken together, our findings shed light on a possible path to design more potent oviposition attracts to trap mosquitoes, which may pave the way to novel strategies for vector-borne virus surveillance and, possibly, mosquito control.
Materials and methods
Construction of chimeric receptors
Request a detailed protocolWe used a previously obtained pGEMHE-CquiOR10 (Hughes et al., 2010) plasmid to amplify the sequences of the desired chimeric receptors. To obtain the full length of chimeric OR genes, we used specific primers, with overlapping sequences at each end to target sequences (adaptor underlined) for cloning into pGEMHE vector, which was linearized at restriction sites XmaI (5′-CCCGGG-3') and XbaI (5′-TCTAGA-3'):CqOR10_Fw 5′-GATCAATTCCCCGGGACCATGACCGCGGCACCCATTTT-3' and CqOR10_Rv 5′-CAAGCTTGCTCTAGATCAATTATAAACGCGTCTCAGCAGGGT-3'. To generate a chimeric OR by PCR amplification, we designed specific primers for the desired CquiOR2 TM domain. Simultaneously, we prepared the recipient fragments of CquiOR10 to receive the CquiOR2 TM domains. The fragments were then assembled using In-Fusion HD cloning kit (Clontech) to obtain the desired chimeric OR. Each CquiOR2 TM domain was divided into two parts, each part was synthesized with a specific primer, except for CquiOR2 TM7 domain, which used only one primer with the full domain. These primers contained an overlap with CquiOR10 sequence followed by one part of the CquiOR2 TM sequence and an overlap with the other part of the CquiOR2 TM domain sequence, which was synthesized in another PCR. The underlined sequences represent CquiOR2 TM domains and, unless otherwise specified, sequences in italic are overlaps for CquiOR2 TM domains. CqOR10M1_Fw: 5′-GGTGACCGTGCTGAACGTGTTCCAGTTTATGAACCTGTTTCGAGCCTGGGGCAACATC-3′; CqOR10M1_Rv: 5′-TCAGCACGGTCACCGGAATGCAGCCGAGGAGGTAACTGAGGACGTTGCCTACTGTGATCTCAAGG-3′; CqOR10M2_Fw: 5′-CCGTGCTGTACTTCAACCTTGTGTTGAGAACCACGTTTATACTGTGCAATCGTCAGGATTATGAGG-3′; CqOR10M2_Rv: 5′- AAGTACAGCACGGTGAAATATCCGTCGATGATGATTTTGTCGATGTTGCCCCAGGCT-3′; CqOR10M3_Fw: 5′- GTTCATCAGTGCGTGCTTCGTGACGTATCCGCTTTTTTCACCGACACGTAGCCTCCCGTACG-3′; CqOR10M3_Rv: 5′- ACGCACTGATGAACGCTCCCAGCCAGAGGTTCGATTTGGACAGCAGTCGGGCACGTTTGGTGA-3′; CqOR10M4_Fw: 5′- CACGTTTCCGGCGTGCTGCATGTACATTCCGTTTACCAGCTTCTTCGCCACGACTACTTTG-3′; CqOR10M4_Rv: 5′- ACGCCGGAAACGTGAGGTACACTTGCAGAAAAAACACAACCTGGTACAGGGGCGTC-3′; CqOR10M5_Fw: 5′- GCTATGCGCCTTGCTGTTTCTACTTAGCACCAGCAATAATCCCGCGCAAATTATCATCGTGG-3′; CqOR10M5_Rv: 5′- GCAAGGCGCATAGCATCATCCCAAACGATAGCAACTCAATCAGACAGATGTAGGTCACCAGCG-3′; CqOR10M6_Fw: 5′- TCTTTATGATTCTGTCCCAGATGTACGCCCTGTACTGGCACGCCAACGAGCTGCG-3′; CqOR10M6_Rv: 5′- AGAATCATAAAGATGTACGATCCGATCATCACCATCTGCGCGGGATTTTCGATAATGTTCAGC-3′; CqOR10M7_Rv: 5′- CAAGCTTGCTCTAGATCAATTATAAACGCGTCTCAGCAACGTAAAGTACGAATACGAGGCATTGATCAACTTTTGAAACATTTCCAAGGTCATCGGATAGACGTTGCCTACTGTGATCTCAAGG-3′ (here the italic represents the pGEMHE adaptor). The two fragments were amplified using a combination of CqOR10_Fw with CqOR10Mx_Rv primers and CqOR10_Rv with CqOR10Mx_Fw primers. Then, we cloned the two fragments amplified by PCR into pGEMHE vector using the In Fusion system as described below. Chimeric plasmid with a single TM swapped was used as a template to generate chimeric OR with two TM swapped and subsequently chimeric ORs with multiple CquiOR2 TM domains.
Chimeric OR cloning and subcloning into pGEMHE
Request a detailed protocolThe fragments amplification was performed using Platinum Taq DNA Polymerase High Fidelity (Thermo Fisher Scientific) with the following conditions: 95°C for 5 min, followed by 30 cycles of 95°C for 20 s, 57°C 30 s for annealing and 68°C for 1.5 min, and extension at 68°C for 5 min. PCR products were purified by QIAquick gel extraction kit (QIAGEN). The target pGEMHE plasmid was cut by XmaI and XbaI in separate reactions. Ligation was done with In Fusion (Takara Bio USA) system and the transformation was performed using Stellar competent cells (Takara Bio USA) in heat-shock. After selecting the cells, plasmids were extracted with QIAprep Spin Miniprep kit (QIAGEN). The cloned gene was verified by DNA sequencing (Berkeley Sequencing Facility).
Site-directed point mutagenesis and fragment replacement
Request a detailed protocolPhusion Site-Directed Mutagenesis Kit (Thermo Scientific, West Palm Beach, FL) was used to generate point mutations and TM fragment replacements. Mutations were created with mismatched 5′-phosphorylated mutagenic primers and PCR amplification. The chimeric sequences (CquiOR10M2,5,6,7) in pGEMHE vector were used as templates for rescues, whereas the wildtype sequences in pGEMHE vector served as templates for point-mutations. Rescue primers: CqOR10M2_M2567OUTERup: 5′-GATGATGACCTCGTCGATGTTGCCCCAGGC-3′; CqOr10M2_M2567OUTERdn: 5′-AACGCATATTTCACCGTGCTGTACTTCAACC-3′; CqOr10M2_M2567INNERup: 5′-cgcagcaccgcgttgaagtacagcacggtg-3′; CqOr10M2_M2567INNERdn: 5′-aacaattttcatactgtgcaatcgtcagga-3′; CqOR10M2_M2567MID-INNERup: 5′-catcgacggCtactttgcgatgattttcttcaacgcg-3′; CqOR10M2_M2567MID-INNERdn: 5′-atgattttgtcgatgttgccccaggctc-3′; CqOr10M2_M2567_L73Aup: 5′-CGTTGAAGTACAGCACGGTGAAATAT-3′; CqOr10M2_M2567_L73Adn: 5′-CTGTGTTGAGAACCACGTTTATACTGT-3′; CqOr10M2_M2567_T78Iup: 5′-ATGGTTCTCAACACAAGGTTGAAGTA-3′; CqOr10M2_M2567_T78Idn: 5′-CTTTATACTGTGCAATCGTCAGGATTATG-3′; point mutation primers: CqOR10A73up: 5′-Gttgaagaaaatcatcgcaaagta-3′; CqOR10A73Ldn: 5′-Ctggtgctgcgaacaattttc-3′; CqOR10A73Adn: 5′-Gcggtgctgcgaacaattttc-3′; CqOR10A73Gdn: 5′-Ggggtgctgcgaacaattttc-3′; CqOR10A73Vdn: 5′-Gtggtgctgcgaacaattttc-3′; CqOR10A73Idn: 5′-ATCgtgctgcgaacaattttc-3′; CqOR2L74up: 5′-gttgaagtacagcacggtgaaata-3′; CqOr2L74Adn: 5′-Gctgtgttgagaaccacgtttatac-3′; CqOr2L74Idn: 5′-atcGTGTTGAGAaccacgtttatac-3′; CqOr2L74Gdn: 5′-gggGTGTTGAGAaccacgtttatac-3′; CqOr2L74Vdn: 5′-gtgGTGTTGAGAaccacgtttatac-3′. The amplified linear PCR products containing the desired modification were ligated and transformed into Stellar Competent Cells (Takara Bio USA). All sequences were confirmed by DNA sequencing (UC Berkeley DNA Sequencing Facility).
In vitro transcription, oocyte microinjection, and two-electrode voltage-clamp assay (TEVC)
Request a detailed protocolCapped OR cRNA was prepared using mMESSAGE mMACHINE T7 Kit (Ambion) as previously described (Xu et al., 2019). Purified OR cRNA was resuspended in nuclease-free water at 200 ng/μL and microinjected into Xenopus laevis oocytes on stage V or VI (EcoCyte Bioscience) along with the same amount of CquiOrco. Injected oocytes were incubated at 18°C for 3–7 days in a modified Barth’s solution (88 mM NaCl, 1 mM KCl, 2.4 mM NaHCO3, 0.82 mM MgSO4, 0.33 mM Ca(NO3)2, 0.41 mM CaCl2, and 10 mM HEPES at pH 7.4) supplemented with 10 μg/mL gentamycin, 10 μg/mL streptomycin, and 1.8 mM sodium pyruvate. Two-electrode voltage-clamp technique (TEVC) was used to measure odorant-induced currents at a holding potential of −80 mV. Signals were amplified with an OC-725C amplifier (Warner Instruments), low-pass filtered at 50 Hz, and digitized at 1 kHz. Data acquisition and analysis were performed with Digidata 1440A and software pCLAMP 10 (Molecular Devices). Skatole (98%, CAS# 83-34-1), indole (98%, CAS# 120-72-9), and 3-ethylindole (97%, CAS# 1484-19-1) were provided by Sigma-Aldrich (Milwaukee, WI), ACROS Organics (Geel, Belgium), and AmBeed (Arlington Hts, IL), respectively. Phenol (99.5%, CAS# 108-95-2), 3,5-dimethylphenol (99%, CAS# 108-68-9), 1-methylindole (97%, CAS# 603-76-9), 2-methylindole (98%, CAS# 95-20-5), 5-methylindole (99%, CAS# 614-96-0), 6-methylindole (97%, CAS# 3420-02-8), and 7-methylindole (97%, CAS# 933-67-5) were acquired from Sigma-Aldrich, and 4-methylindole (99%, CAS# 16096-32-5) was provided by ACROS.
Rosetta structural modeling and docking
Request a detailed protocolWe used RoseTTAFold (Baek et al., 2021) and AlphaFold (Jumper et al., 2021) to generate predicted structures of CquiOR10, CquiOR2, CquiOR10A73L, and CquiOR2L74A monomers. The OpenEye Omega toolkit (Hawkins et al., 2010) was used to generate conformer libraries. Molecular docking was performed using RosettaLigand (Davis and Baker, 2009; DeLuca et al., 2015). The top models were visually analyzed using UCSF Chimera (Pettersen et al., 2004).
To verify RosettaLigand could effectively sample homologous receptors in complex with odorants, we used the structure of eugenol in complex with the insect olfactory receptor OR5 from M. hrabei (MhraOR5) as a control test case (DelMarmol et al., 2021). After verifying RosettaLigand could recapitulate the MhraOR5–eugenol complex from structural studies, we docked indole and skatole to the RoseTTAFold monomer models of CquiOR10 and CquiOR10A73L receptors with the same method. Briefly, the initial placement of all ligands into respective structures was guided by the position of eugenol in a complex with MhraOR5 as a basis since it is the only homologous structure with an odorant. For an unbiased receptor sampling, an initial transformation was performed on the ligand relative to the receptor site using Rosetta’s Transform mover. Within a 7 Å sphere, the ligand underwent a Monte Carlo simulation, whereby the ligand was allowed to translate up to 0.2 Å and rotate up to 20°. This was performed 500 times on the ligand, with the lowest scoring pose used as the starting pose for docking. This form of docking is termed local docking and is commonplace when there is experimental evidence and homologous structures as templates. physiologically relevant binding modes can be identified with local, high-resolution ligand docking (Kaufmann et al., 2009). For additional methods, see Appendix 2.
Appendix 1
Preliminary screening of chimeric receptors
CquiOR10M1,2,3,4,5,6,7/CquiOrco-expressing oocytes did not respond to skatole or indole. Likewise, the chimeric receptor with six transmembrane domains replaced, CquiOR10M1,2,3,4,5,6 was also silent. Next, we replaced one TM at a time. CquiOR10M1 and CquiOR10M2 were silent, whereas CquiOR10M3 (Figure 1—figure supplement 1A), CquiOR10M4 (Figure 1—figure supplement 1B), CquiOR10M5 (Figure 1—figure supplement 1C), CquiOR10M6 (Figure 1—figure supplement 1D), and CquiOR10M7 (Figure 1—figure supplement 1E) responded to both skatole and indole with specificity similar to that of the wildtype receptor. Although these chimeric receptors differed from CquiOR10 in terms of sensitivity, they showed the same specificity to skatole as CquiOR10 (Figure 1A). While CquiOR10M3 and CquiOR10M4 were less sensitive than CquiOR10, CquiOR10M7 showed comparable sensitivity to CquiOR10, and CquiOR10M5 and CquiOR10M6 were more sensitive to skatole than the wildtype receptor. We remained focused on specificity and did not attempt to quantify the different responses.
We tested a chimeric receptor with the other five TM domains swapped. CquiOR10M3,4,5,6,7/CquiOrco-expressing oocytes mirrored the profiles obtained with CquiOR10 in terms of sensitivity and specificity (Figure 1—figure supplement 1F). We then kept the wildtype TM7, in addition to TM1 and TM2, in the next chimeric ORs tested. Albeit less sensitive than CquiOR10, CquiOR10M3,4,5,6 showed the same specificity of the wildtype receptor (Figure 1—figure supplement 1G). Following, we tested chimeric receptors with three TM domains swapped. CquiOR10M4,5,6 (Figure 1—figure supplement 1H), CquiOR10M3,5,6 (Figure 1—figure supplement 1I), CquiOR10M3,4,6 (Figure 1—figure supplement 1J), and CquiOR10M3,4,5 (Figure 1—figure supplement 1K) did not differ from CquiOR10 in terms of specificity or sensitivity. Next, we focused on chimeric receptors with two TMs swapped. Thus, we tested CquiOR10M3,4 (Figure 1—figure supplement 1L), CquiOR10M3,5 (Figure 1—figure supplement 1M), CquiOR10M3,6 (Figure 1—figure supplement 1N), CquiOR10M3,7 (Figure 1—figure supplement 1O), CquiOR10M4,5 (Figure 1—figure supplement 1P), CquiOR10M4,6 (Figure 1—figure supplement 1Q), CquiOR10M4,7 (Figure 1—figure supplement 1R), CquiOR10M5,6 (Figure 1—figure supplement 1S), CquiOR10M5,7 (Figure 1—figure supplement 1T), and CquiOR10M6,7 (Figure 1—figure supplement 1U). They all showed skatole-specific profiles, most of them with comparable sensitivity, whereas CquiOR10M4,5, CquiOR10M5,7, and CquiOR10M6,7 were more sensitive and CquiOR10M3,6 less sensitive than CquiOR10. Interestingly, none of the chimeric receptors having TM1 as one of the two domains swapped was functional. Specifically, oocytes co-expressing CquiOrco with either CquiOR10M1,2, CquiOR10M1,3, CquiOR10M1,4, CquiOR10M1,5, CquiOR10M1,6, or CquiOR10M1,7 did not respond to skatole or indole (Supplementary file 1, Table 2). Likewise, CquiOR10M2,3, CquiOR10M2,4, CquiOR10M2,5, and CquiOR10M2,6 did not respond to these oviposition attractants (Supplementary file 1, Table 2).
Appendix 2
Ligand conformer library generation
The Protein DataBank file (.pdb) of eugenol was extracted from the MhraOR5 structure (PDB ID: 7LID), while indole and skatole were extracted as Structure-Data Files files (.sdf) from PubChem (Kim et al., 2021). Each ligand structure was bond-corrected, protonated at pH 7.4, energy minimized using the Merck Molecular Force Field (Halgren, 1996), and saved as Tripos Mol2 files (.mol2) with the Avogadro software (Hanwell et al., 2012). Next, partial charge, atom, and bond-type assignment were performed with Antechamber AM1BCC correction using Amber Tools (Case et al., 2021; Salomon-Ferrer et al., 2013); AM1BCC partial-charge correction is common in Rosetta ligand docking protocols (Park et al., 2021) and has been shown to be similarly ranked with other Rosetta-based partial-charge methods for RosettaLigand (Smith and Meiler, 2020). An in-house script using the OpenEye Omega toolkit and Rosetta was used to generate the conformer library and associated Rosetta-readable ligand parameters file (Supplementary file 2, Protocol Capture).
Protein receptor preparation
The MhraOR5 structure was relaxed with backbone constraints using the RosettaRelax protocol (Nivón et al., 2013) prior to RosettaLigand docking. This is commonplace to allow the repacking of protein sidechains and minimization of the structure into the Rosetta score function for comparison between structures. For RoseTTAFold-generated structures, the RosettaRelax protocol was not performed since the Rosetta score function was used to generate the structures.
RosettaLigand docking
RosettaLigand docking was performed using RosettaScripts protocols described previously (Davis and Baker, 2009; DeLuca et al., 2015). Briefly, initial placement of all ligands into respective structures was guided by the position of eugenol in complex with MhraOR5 as a basis. For an unbiased sampling of the receptor, an initial transformation was performed on the ligand relative to the receptor site using Rosetta’s Transform mover. Within a 7 Å sphere, the ligand underwent a Monte Carlo simulation, whereby the ligand was allowed to translate up to 0.2 Å and rotate up to 20°. This was performed 500 times on the ligand, with the lowest scoring pose used as the starting pose for docking. Six cycles of high-resolution docking were performed with repacking of sidechains every third iteration using Rosetta’s HighResDocker mover. Lastly, the protein/ligand complex was minimized using Rosetta’s FinalMinimizer mover and interface scores reported using the InterfaceScoreCalculator.
For every test case, 100,000 docking poses were generated. The top 10,000 models were selected by Rosetta interface energy (Interface_delta_X) metric, followed by ligand clustering using Biomol2Clust (Timonina et al., 2021), the hdbscan algorithm (McInnes et al., 2017), and a minimum cluster threshold of 50 poses. The 10 most frequently sampled clusters were visually analyzed using UCSF Chimera (Pettersen et al., 2004) by using the lowest interface energy pose in each cluster as the representative for that cluster. The Protein Ligand Interaction Profiler (Salentin et al., 2015) was performed on these representative poses to identify potential hydrophobic interactions, hydrogen bonds, π-stacking, and salt bridges. Hydrogen bond and pi-stacking interactions were filtered by previously reported bond distances (Bissantz et al., 2010) to present a conservative estimate of potential contacts. Atom types from PLIP analysis are reported using IDATM nomenclature (Meng and Lewis, 1991).
Appendix 3
The ligand-binding pocket is formed by TMs 2, 4, 5, and 6. The most frequent hydrophobic contacts were formed by Tyr-185 (38/40 models; TM4) and Ile-69 (32/40 models; TM2) (Supplementary file 1, Tables 6–8). Phe-261 (TM5) also formed hydrophobic interactions in all four receptor–ligand complex sets of models, but less frequently (22/40 models). Asn-72 (TM2), Tyr-287 (TM6), and Ile-291 (TM6) formed hydrophobic contacts more frequently with skatole structures compared to indole structures (Asn-72: 12/20 vs 9/20, Tyr-287: 14/20 vs 5/20, Ile-291: 8/20 vs 3/20). Notably, Leu-73 (TM2) in CquiOR10A73L formed hydrophobic contacts with indole (2/10) and skatole (5/10), but Ala-73 (TM2) did not form hydrophobic contacts with indole or skatole. Tyr-138 (TM3) formed hydrophobic contacts with skatole in CquiOR10A73L (2/10), but not in CquiOR10. Gln-294 (TM6) formed hydrophobic contacts with skatole and indole in these structures, except CquiOR10A73L-skatole; however, Gln-294 does form a hydrogen bond in 1–2 models for each structure. Of note, the range of PLIP-reported hydrophobic contacts among the 10 representative models for each structure changed from an average of 6.3 ± 0.5 contacts (mean ± SEM) in CquiOR10 to 5.2 ± 0.3 contacts in CquiOR10A73L (p=0.035, Mann–Whitney test). Considering the number of unique residues forming hydrophobic contacts, the averages were 4.9 ± 0.4 and 4.9 ± 0.2 contacts, respectively (p=0.97, Mann–Whitney test).
There were at most two models in each structure with hydrogen bonds (Supplementary file 1, Tables 9–11). For CquiOR10-skatole, two models formed hydrogen bonds with Gln-294. For CquiOR10A73L-skatole, one model formed hydrogen bonds with Asn-72, and one other with Gln-294 (Supplementary file 1, Table 11). For CquiOr10-indole, one model formed a hydrogen bond with the backbone of Ile-69, another model with Tyr-138, another with Tyr-287, and a fourth model with Gln-294. For CquiOR10A73L-indole, only one model formed a hydrogen bond at Gln-294.
Parallel pi-stacking interactions (Supplementary file 1, Tables 12–14) were identified for CquiOR10A73L-skatole, CquiOR10-indole, and CquiOR10A73L-indole. For CquiOR10A73L-skatole, two models each formed a parallel pi-stacking interaction with Tyr-185 (Supplementary file 1, Table 14). For CquiOR10-indole, a model formed a parallel pi-stacking interaction with Tyr-138 and another model formed with Tyr-152. For CquiOR10A73L-indole, eight parallel pi-stacking interactions with Tyr-185 were identified across seven representative models.
From the PLIP analysis of our docking study, we find of most importance skatole and indole not forming contacts with Ala-73 in CquiOR10 models. By contrast, skatole formed hydrophobic contacts with Leu-73 in 5 of the 10 representative models, while indole formed contacts with Leu-73 in two representative models. This suggests that Ala-73 may indirectly affect specificity by modulating the volume of the binding pocket.
Appendix 4
Considering both experimental and modeling data, the reason for differing receptor responses at residue 73 could be due to the repacking of this position to accommodate more carbon sidechain atoms that are reducing the binding pocket volume. Leu-73 contains three additional carbon sidechain atoms: γ-carbon, δ1-carbon, and δ2-carbon. Adding these atoms increases the sidechain rotatable bond count from 0 (for Ala-73)–2 (for Leu-73), causing an increase in repackable states for the binding pocket. From our models, the distance of Leu-73 α-carbon to its δ1-carbon or δ2-carbon is approximately 2.8–3.9 Å. The Ala-73 α-carbon to its β-carbon is approximately 1.5 Å. Since the Ala-73 α-carbon is approximately 0.9–1.1 Å inward relative to Leu-73 α-carbon, Leu-73 C-δ atoms could be further inward by approximately 0.2–1.5 Å relative to Ala-73 C-β. This could cause repacking of surrounding sidechains while reducing the volume of the binding pocket, or shift key, nearby residues required for receptor response. Further, from our models, the carbon–carbon length of skatole’s methyl group is approximately 1.5 Å. This distance discrepancy, the fact that indole is a rigid molecule, and that the methyl group of skatole is the only rotamer, could also suggest that the binding pocket around position 73 is tightly regulated. The 1.5 Å increase from skatole’s methyl group could prevent skatole from occupying the appropriate configuration for receptor response with Leu-73. Conversely, the additional inward 0.2–1.5 Å increase from Leu-73’s sidechain could provide indole more contacts or a more favorable configuration with surrounding CquiOR10 sidechains to prompt increased receptor response. Combined, our results suggest that the residue at 73 provide a finely tuned volumetric space to accommodate specific oviposition attractants.
Data availability
All data generated or analysed during this study are included in the manuscript and supporting files. Source data files have been provided for Figures 1, 3, 5, and 9.
References
-
PLIP 2021: expanding the scope of the protein-ligand interaction profiler to DNA and RNANucleic Acids Research 49:W530–W534.https://doi.org/10.1093/nar/gkab294
-
The contribution of culex pipiens complex mosquitoes to transmission and persistence of west nile virus in north americaJournal of the American Mosquito Control Association 28:137–151.https://doi.org/10.2987/8756-971X-28.4s.137
-
Deep learning and protein structure modelingNature Methods 19:13–14.https://doi.org/10.1038/s41592-021-01360-8
-
A medicinal chemist’s guide to molecular interactionsJournal of Medicinal Chemistry 53:5061–5084.https://doi.org/10.1021/jm100112j
-
A behavioural and electrophysiological study of ovi position cues for culex quinquefasciatusPhysiological Entomology 18:343–348.https://doi.org/10.1111/j.1365-3032.1993.tb00607.x
-
Molecular characterization of the aedes aegypti odorant receptor gene familyInsect Molecular Biology 16:525–537.https://doi.org/10.1111/j.1365-2583.2007.00748.x
-
The narrowing olfactory landscape of insect odorant receptorsFrontiers in Ecology and Evolution 3:e39.https://doi.org/10.3389/fevo.2015.00039
-
Veratridine: a janus-faced modulator of voltage-gated sodium ion channelsACS Chemical Neuroscience 11:418–426.https://doi.org/10.1021/acschemneuro.9b00621
-
RosettaLigand docking with full ligand and receptor flexibilityJournal of Molecular Biology 385:381–392.https://doi.org/10.1016/j.jmb.2008.11.010
-
BookFenaroli’s Handbook of Flavor Ingredients: Adapted from the Italian Language Works of Giovanni Fenaroli (2d ed)CRC Press.
-
Merck molecular force field. I. basis, form, scope, parameterization, and performance of MMFF94Journal of Computational Chemistry 17:490–519.https://doi.org/10.1002/(SICI)1096-987X(199604)17:5/6<490::AID-JCC1>3.0.CO;2-P
-
Avogadro: an advanced semantic chemical editor, visualization, and analysis platformJournal of Cheminformatics 4:17.https://doi.org/10.1186/1758-2946-4-17
-
Conformer generation with omega: algorithm and validation using high quality structures from the protein databank and cambridge structural databaseJournal of Chemical Information and Modeling 50:572–584.https://doi.org/10.1021/ci100031x
-
Odorant receptor from the southern house mosquito narrowly tuned to the oviposition attractant skatoleJournal of Chemical Ecology 36:797–800.https://doi.org/10.1007/s10886-010-9828-9
-
Neurobiology of Chemical CommunicationPheromone reception in insects: The example of silk moths, Neurobiology of Chemical Communication, Boca Raton (FL), CRC Press/Taylor & Francis.
-
PubChem in 2021: new data content and improved web interfacesNucleic Acids Research 49:D1388–D1395.https://doi.org/10.1093/nar/gkaa971
-
Odorant reception in insects: roles of receptors, binding proteins, and degrading enzymesAnnual Review of Entomology 58:373–391.https://doi.org/10.1146/annurev-ento-120811-153635
-
BookMolecular biology of receptionIn: Allison JD, Carde RT, editors. Pheromone Communication in Moths: Evolution, Behavior, and Application. University of California Press. pp. 127–138.
-
BookMechanism of action of insect pheromones and other semiochemicalsIn: Liu HW, Begley TP, editors. Comprehensive Natural Products III. Elsevier. pp. 222–236.https://doi.org/10.1016/B978-0-12-409547-2.14735-3
-
Membrane topology of the Drosophila or83b odorant receptorFEBS Letters 581:5601–5604.https://doi.org/10.1016/j.febslet.2007.11.007
-
Search and sequence analysis tools services from EMBL-EBI in 2022Nucleic Acids Research 50:W276–W279.https://doi.org/10.1093/nar/gkac240
-
The genome of anopheles darlingi, the main neotropical malaria vectorNucleic Acids Research 41:7387–7400.https://doi.org/10.1093/nar/gkt484
-
Oviposition and behavioral responses of culex quinquefasciatus to skatole and synthetic oviposition pheromone in tanzaniaJournal of Chemical Ecology 26:1193–1203.https://doi.org/10.1023/A:1005432010721
-
ConferenceAccelerated Hierarchical Density Based Clustering2017 IEEE International Conference on Data Mining Workshops (ICDMW. pp. 33–42.https://doi.org/10.1109/ICDMW.2017.12
-
Hdbscan: hierarchical density based clusteringThe Journal of Open Source Software 2:205.https://doi.org/10.21105/joss.00205
-
Determination of molecular topology and atomic hybridization states from heavy atom coordinatesJournal of Computational Chemistry 12:891–898.https://doi.org/10.1002/jcc.540120716
-
Identification of oviposition attractants for Culex quinquefasciatus from fermented bermuda grass infusionsJournal of the American Mosquito Control Association 8:11–17.
-
Culicine mosquitoes and the agents they transmitThe Biology of Disease Vectors 1:85–97.
-
Force field optimization guided by small molecule crystal lattice data enables consistent sub-angstrom protein-ligand dockingJournal of Chemical Theory and Computation 17:2000–2010.https://doi.org/10.1021/acs.jctc.0c01184
-
BookInvertebrate olfactory receptorsIn: Fritzsch B, editors. The Senses: A Comprehensive Reference (Second Edition). Elsevier. pp. 453–465.https://doi.org/10.1016/B978-0-12-809324-5.23814-5
-
UCSF chimera -- a visualization system for exploratory research and analysisJournal of Computational Chemistry 25:1605–1612.https://doi.org/10.1002/jcc.20084
-
Identification and functional characterization of olfactory indolergic receptors in musca domesticaInsect Biochemistry and Molecular Biology 139:103653.https://doi.org/10.1016/j.ibmb.2021.103653
-
The seizure-inducing plastic explosive RDX inhibits the alpha1beta2gamma2 GABAA receptorAnnals of Clinical and Translational Neurology 9:600–609.https://doi.org/10.1002/acn3.51536
-
Transmembrane topology and signal peptide prediction using dynamic bayesian networksPLOS Computational Biology 4:e1000213.https://doi.org/10.1371/journal.pcbi.1000213
-
PLIP: fully automated protein-ligand interaction profilerNucleic Acids Research 43:W443–W447.https://doi.org/10.1093/nar/gkv315
-
An overview of the amber biomolecular simulation packageWiley Interdisciplinary Reviews 3:198–210.https://doi.org/10.1002/wcms.1121
-
Structure-activity relationship of a broad-spectrum insect odorant receptor agonistACS Chemical Biology 7:1647–1652.https://doi.org/10.1021/cb300331z
-
Bioinformatic analysis of subfamily-specific regions in 3D-structures of homologs to study functional diversity and conformational plasticity in protein superfamiliesComputational and Structural Biotechnology Journal 19:1302–1311.https://doi.org/10.1016/j.csbj.2021.02.005
Article and author information
Author details
Funding
National Institute of Allergy and Infectious Diseases (R01AI095514)
- Walter S Leal
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank Drs. David K Wilson, Enoch Baldwin (UC Davis), and members of our research groups for enlightening discussions. The Orco ligand candidate OLC12 was provided by the Vanderbilt Institute of Chemical Biology, Chemical Synthesis Core, Vanderbilt University. Research reported in this publication was supported by the National Institute of Allergy and Infectious Diseases (NIAID) of the National Institutes of Health under award number R01AI095514. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.
Copyright
© 2022, Franco et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,521
- views
-
- 224
- downloads
-
- 3
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Neuroscience
Chronic pain is a prevalent and debilitating condition whose neural mechanisms are incompletely understood. An imbalance of cerebral excitation and inhibition (E/I), particularly in the medial prefrontal cortex (mPFC), is believed to represent a crucial mechanism in the development and maintenance of chronic pain. Thus, identifying a non-invasive, scalable marker of E/I could provide valuable insights into the neural mechanisms of chronic pain and aid in developing clinically useful biomarkers. Recently, the aperiodic component of the electroencephalography (EEG) power spectrum has been proposed to represent a non-invasive proxy for E/I. We, therefore, assessed the aperiodic component in the mPFC of resting-state EEG recordings in 149 people with chronic pain and 115 healthy participants. We found robust evidence against differences in the aperiodic component in the mPFC between people with chronic pain and healthy participants, and no correlation between the aperiodic component and pain intensity. These findings were consistent across different subtypes of chronic pain and were similarly found in a whole-brain analysis. Their robustness was supported by preregistration and multiverse analyses across many different methodological choices. Together, our results suggest that the EEG aperiodic component does not differentiate between people with chronic pain and healthy individuals. These findings and the rigorous methodological approach can guide future studies investigating non-invasive, scalable markers of cerebral dysfunction in people with chronic pain and beyond.
-
- Neuroscience
Biological memory networks are thought to store information by experience-dependent changes in the synaptic connectivity between assemblies of neurons. Recent models suggest that these assemblies contain both excitatory and inhibitory neurons (E/I assemblies), resulting in co-tuning and precise balance of excitation and inhibition. To understand computational consequences of E/I assemblies under biologically realistic constraints we built a spiking network model based on experimental data from telencephalic area Dp of adult zebrafish, a precisely balanced recurrent network homologous to piriform cortex. We found that E/I assemblies stabilized firing rate distributions compared to networks with excitatory assemblies and global inhibition. Unlike classical memory models, networks with E/I assemblies did not show discrete attractor dynamics. Rather, responses to learned inputs were locally constrained onto manifolds that ‘focused’ activity into neuronal subspaces. The covariance structure of these manifolds supported pattern classification when information was retrieved from selected neuronal subsets. Networks with E/I assemblies therefore transformed the geometry of neuronal coding space, resulting in continuous representations that reflected both relatedness of inputs and an individual’s experience. Such continuous representations enable fast pattern classification, can support continual learning, and may provide a basis for higher-order learning and cognitive computations.