Main text

Honey bees play a critical role as pollinators for nearly a third of food crops (1). However, over the last decade, beekeepers reported increasingly unsustainable losses (>40%) of their managed hives every year. Several factors have been identified contributing to these declines, amongst them parasites such as Varroa mites, climate change, and pesticides (2-4).

When honey bee workers forage in pesticide-treated fields, they bring back pesticide residues into their hive, which negatively impacts the survival of their colony, endangering the pollination of high-value crops, even at sublethal dosages. We propose a novel solution for this problem: identifying honey bee repellents that can be added to pesticides, thus reducing their contact with treated crops. Honey bees have a powerful sense of smell and use their antennae to detect numerous volatile chemicals which they sense using a sophisticated olfactory system of >180 odorant receptor genes in the genome (5, 6). However, the complexity of the insect olfactory system makes it a significant challenge to experimentally identify repellent odorants. In fact, in the 20 years since the odorant receptor families in insects have been identified, not a single new repellent odorant was registered for use in a new insect repellent product by the U.S. Environmental Protection Agency.

Recently we used Machine Learning to successfully create structure-based models to predict mosquito repellents from chemical structure (7) and for human olfactory behavior allowing us to predict ∼140 odor characters with great success (8, 9). While there exists a very large set of training repellent odorants for mosquitoes from empirical studies, data for honeybees are far less (7). Here we used a related Machine Learning approach, scanning 3D molecular structures of odorants in silico, to predict their potential for repelling honey bees (Figure 1). We overcome the challenges of a small set of training odorants by performing a second round of modelling using data from an initial round of testing to improve the predictive ability (Figure 3A).

A chemical informatics method to predict honey bee repellence

Overview of the steps for the Machine Learning-based cheminformatics pipeline used to model honey bee repellence from the 3D structure of a training set of known repellents, and subsequent in-silico screening for novel repellents from a large chemical space.

To do so, we first reviewed the existing literature on honey bee olfactory behavior and assembled a training set of volatile chemicals including a subset with a negative behavioral valence (10-14). We generated energy-minimized 3D structures for each chemical and calculated 5,290 physicochemical features: initially for 184 training chemicals and later for 203 chemicals (Table S1). From the initial training set we identified 45 features that were considered important for predicting aversive valence. We then evaluated a collection of Machine Learning (ML) algorithms including regularized Random Forest, Gradient Boosted Decision Trees, and a non-linear Support Vector Machine (e.g., incorporating the Radial Basis Function kernel) with these features to predict repellency of the test chemicals To cross-validate each algorithm’s performance, we divided the training data into multiple smaller training data sets and test data sets and calculated average predictive success, graphically depicted as the area under the Receiver Operating Characteristic curve (ROC AUC) in our model. Our computational validation (avg AUC = 0.88) indicated that our model was able to distinguish volatiles with aversive valence from non-repellents (Figure 1).

Using this model, we subsequently evaluated a library of ∼45M small molecules from the MolPort Database. The top predicted repellents (∼7% of hits with computed repellency scores >=0.9), represented a structurally diverse group of 139 chemicals, of which some showed similarities to the known repellents in the training set. The notion that Machine Learning can predict the behavioral valence of small molecules towards honey bees from just 3D chemical structure would be revolutionary and inform follow up behavioral validation.

Because our training set included a few known repellents for other insects as well, such as benzaldehyde and geraniol, we anticipated that the predictions likely included broad-spectrum insect repellents alongside honey bee-selective ones. If repellents were to be mixed with insecticides to repel bees, it might be desirable not to repel pest insects. To evaluate predicted compounds for broad-spectrum insect repellent activity we performed behavior assays using another insect, the fruit fly Drosophila melanogaster (Figure 2A). We randomly selected a set of compounds from the predictions and used a trap-based assay to quantify fly aversion. Several compounds showed low levels of repellency, with a <50% reduction in numbers of flies entering traps relative to the solvent control (Figure 2A). Subsequently, we only used those compounds for further tests in honey bees.

Predicted honey bee repellents

A. Testing Chamber containing a one-choice trap to determine, whether an odorant will repel fruit flies, and the mean percentage of fruit flies caught in a trap treated with predicted repellent odorants (10% v/v in Paraffin oil) and baited with 10% apple cider vinegar. N=5-8 trials (∼20 flies/trial). Error bars= s.e.m.

B. Photograph of the two-choice Petri dish arenas used to test the behavior of honey bees to repellent candidates individually, and

C. Table with preference indexes in the two-choice assay for the first choices of honey bee workers offered two containers of honey water (50 μl of 50%) placed on filter paper discs treated with a test chemical or treated with solvent alone. The index was calculated for each compound as (total number of repellent choices minus total number of solvent choices) divided by the sum of all tests. We used McNemar tests for pairwise comparisons.

We modified a honey bee repellent testing assay in Petri dishes in the lab to test the 15 candidate compounds (15). We placed honey-water (50μl) filled microfuge lids onto treated filter paper disks on two sides of a Petri dish and placed ∼five hungry honey bee workers in the center. From video recordings we evaluated whether the first bee navigated to a honey pot on the repellent treated or the control solvent side (Figure 2B). For many of the tested compounds the bees preferred to visit the honey-water pots on the control side versus the repellent side, (Figure 2C).

The initial training data set used for Machine Learning was limited in size and lacked high quality quantitative values for structurally related compounds. The predictive models can be improved substantially by adding new compounds with quantitative data that are structurally related to the training set. We therefore developed an iterative pipeline where the previous round of behavioral validation data is added to the training data and used to rebuild a more refined training set (Figure 3A). Specifically, we added the quantitative data from the 15 newly tested compounds to the training set. The 3D structures were analyzed, and a physicochemical feature set was selected as before. As expected, the computational validation results based on the AUC values, show an improvement. We next screened a ∼10M compound library in silico with the updated repellent predictive ML model and identified a wide array of natural and synthetic honey bee repellent candidates (Figure 3A). Because the rate of evaporation may affect how long a repellent will be effective in the field, we estimated the vapor pressure for each candidate. To do this, we used Machine Learning on a newly created training data set of 1,931 compounds with known vapor pressures.

Reiterative training of ML predictive models and testing honey bee repellents

A. Schematic pipeline for the overall computational-behavior hybrid pipeline showing the two rounds.

B. Mean Preference index of honey bees in making the first choice to move to the repellent treated side in a two-choice plate assay. 70 μl of pure honey was used in the two containers to attract the bees.

C. Average consumption of honey from containers placed on each side in the two-choice plate assays (N = 460 plates). The index is calculated for each compound as which honey pot has greater consumption (total number of repellent side with greater consumption minus total number of solvent side with greater consumption) divided by sum of all choices).

D. Mean percentage of fruit flies caught in a trap treated with predicted repellent odorants at application rates to test in field tests (0.1mg/cm2) and baited with 10% apple cider vinegar. N=5-8 trials (∼20 flies/trial). Error bars= s.e.m.

To validate the second round of predictions and identify compounds with the strongest behavioral responses in honey bees, we selected 28 molecules with high predicted repellency values that fell into one of three vapor pressure (V.P.) categories: very low V.P solids, very low V.P liquids (10-4 - 10-3 mmHg @25C) and low V.P. liquids (10-2 mmHg @25C). We tested the compounds on honey bee behavior using the 2-choice arena as before, but used pure honey (70μl) instead of honey water. Using honey increased the bees’ participation from 68% in round 1 to 81% in round two, and the increased volume allowed us to evaluate honey consumption on each side of the arena after one hour. Most of the predicted repellent compounds showed strong repellency (Figures 3B). The repellent effects were conserved across three or more different honey bee colonies when tested in batches demonstrating a robust and reproducible effect (Figure S1). For 15 of the 28 test compounds, we found that the bees drank significantly less honey on the treatment side compared to the control side as well (Figure 3C).

We next tested seven compounds that showed the strongest repellency in honey bees on Drosophila melanogaster using the 24-hr trap assay as before, at a concentration relevant for field applications as done later (∼0.1 mg/cm2). The flies showed no behavioral aversion to the candidate repellent chemicals, compared to controls (Figure 3G). This suggests that the new honey bee repellents were not aversive to other insects like fruit flies.

Next, we tested the seven repellents on freely flying foraging honey bees in a field assay. Honey bees are readily attracted to wax honeycomb foundations that have been sprayed with sugar-water. We sprayed sugar-water on three equally spaced wax-foundations placed near hives and treated them with either (1) solvent (acetone), (2) a candidate honey bee repellent (rate of ∼0.1 mg/cm2) or (3) a positive control (DEET) at the same rate (Figure 4A). The honey bees flew to the wax foundations and were offered a choice between the different treatments. We filmed the assays and counted the bees on each foundation in five-minute increments. Each of the seven compounds showed repellency to honey bees, with three of them showing aversion up until the end of the assay. We ended the assay counts when the honey bees stopped being attracted to the untreated controls (Figures 4A,B,C). Kruskal-Wallis tests revealed significantly lower numbers of bees on treated wax foundations compared to controls (Figure 4C), In addition, we tested the two top-scoring compounds at half their original concentration (0.05 mg/cm2) and still observed significant repellency (Figure 4D).

Field testing of top repellents in the lab using robbing assays

A. Representative photo of a robbing assay.

B. Time-course of the mean numbers of foraging honey bees visiting for each indicated compound tested (0.1 mg/cm2). DEET was used as a positive control and acetone as a negative control in each assay. N=6-7. Error bars=s.e.m.

C. Mean number of honey bees for each compound summed over the time points shown in B. * p<0.05, ** p<0.005, *** p<0.001. We used Kruskal Wallis tests for pairwise comparisons.

Mean number of honey bees for each compound tested at the regular dose (0.1mg/cm2) and half dose (0.05mg/cm2), summed over the time points over the duration of the experiment as above. N=6, Error bars=s.e.m.

Taken together our results from the lab and field assays demonstrate that a computational approach can successfully model honey bee aversive valence from chemical structure alone and can be used to efficiently screen millions of chemicals to find strong repellents.

Discussion

Most odorants reaching a bee’s antennae activate or inhibit several of the >180 olfactory receptors at the same time. Each odorant-receptor interaction elicits a specific neuronal activity in the bee brain, consequently allowing for a limited number of receptors to encode a practically infinite number of olfactory combinations (16). Even though honey bees are a well-studied model system for olfactory coding and learning (16), (17), we still know little about the pathways between olfactory stimulus evaluation and decision-making (16). The olfactory system therefore poses an extreme challenge to modeling. In spite of what is already known about the honey bee olfactory system, it would take several years and significant funding before generating a near complete knowledge of the olfactory system that allows to do predictive modelling of valence. The 3D structure based computational approach fills a very important void in identifying novel volatiles triggering a specific olfactory behavioral responses in honey bees. This machine learning pipeline can accelerate research into insect repellency and its physicochemical basis. The overall species-selective approach we have presented will help identify additional novel repellent chemicals for honey bees as well as inspire an exploration of the vastness of chemical space for use in repellent strategies for other animals.

Pesticides have been identified as a significant contributor to the global decline in honey bee populations (18, 19). Numerous studies have demonstrated the detrimental effects of various pesticides alone or in combination with other environmental stressors on honey bee health, including neonicotinoids, organophosphates, pyrethroids (20, 21) and bifenazate (22). These chemicals can cause acute toxicity, leading to immediate bee mortality, as well as sublethal effects that impair bee behavior, navigation, immune function and population growth (22, 23). Chronic exposure to pesticides, such as imidacloprid, has been linked to reduced colony growth, queen failure, and increased susceptibility to pathogens and parasites (24). The widespread use of pesticides in agriculture has raised concerns about the sustainability of honey bee populations and the potential impacts on pollination services and food security (19).

Recent research has explored the potential of using bee repellent odors to mitigate the harmful effects of pesticides on honey bees. The concept involves applying non-toxic, natural compounds that emit odors that are unappealing to bees, thereby deterring them from foraging in treated areas during pesticide application. By reducing bee exposure to pesticides, this approach aims to minimize the direct and indirect impacts on bee health. Studies have identified several promising bee repellent compounds, such as 2-heptanone and essential oils from plants like lavender and thyme but their low efficacy does not make them practical in the agricultural setting (25, 26). Our field behavior assays strongly suggest that the use of bee repellent odors can significantly reduce bee activity in treated areas, potentially lowering the risk of pesticide exposure. This study also provides a foundation for further research to optimize the effectiveness of bee repellent odors, assess their long-term impacts on bee behavior and colony health, and develop practical application strategies for integration into pest management practices. Moreover, our approach for a rational and high-throughput discovery of bee repellents can be extended to any other species of interest, thus providing a very powerful and affordable computational tool for chemical ecologists.

Materials and methods

Machine learning repellency

Each chemical structure tested for attraction/repellency on bees was prepared for machine learning (ML) using RDKit in Python. The 2D structures were first embedded in 3D coordinate space; the 3D geometries were then optimized using the Merck molecular force field (MMFF94). For each optimized 3D structure, AlvaDesc computed 5,290 diverse 2D and 3D chemical features, describing large-scale properties such as molecular shape that affect sensory receptor docking in addition to forces describing potentially more abstract interactions leading to graded activation or inhibition of the sensory receptors of the bee.

Bee behavioral data, presented as a preference score (5 = max repellency), was converted into a binary label, repel/inactive, using > 1 as the cutoff between repellent and neutral or attractant behaviors. In other circumstances, bee behavior to compounds was reported without a repellency score. For attractive compounds, an inactive label was assigned based on previous data (11-14, 27, 28). The data were subsequently split into training and testing partitions (90:10) and any further processing was done exclusively on the training partition.

The large chemical feature matrix was next reduced to a subset of features that optimally classified compounds according to their repellent activity on bees. This involved using the Recursive Feature Elimination (RFE) algorithm alongside a Support Vector Machine (SVM) with a Radial Basis Function (RBF) kernel to evaluate the feature importance over 250 partitions of the training data (e.g., 10-fold cross validation, repeated 25 times). An intermediate subset >100 features was next passed to a series of algorithms (e.g., Random Forest and Boosted Decision Trees) to re-rank the features, selecting the final, consensus set.

Top features were evaluated across a diverse set of algorithms including SVMs, Decision Trees, and Neural Networks (Multilayer Perceptron), and the cross validated classification metrics suggested high classification success rates (AUC, Sensitivity, and Specificity) when aggregating the predictions of three algorithms: SVM with a Radial Basis Function (RBF) kernel, a Gradient Boosting Machine (built with Decision Trees), Random Forest with regularization. Final Evaluation was done on the test set. Trained models predicted 60,000 metabolites and other naturally sourced compounds. The MolPort database, which contains 47 million compounds in total, was filtered to a smaller subset for repellency predictions, based on commercial availability and structural similarity to known bee repellents in the training set. After the first round of experimental validation tests with bees, the models were retrained. Additional prospective repellent compounds were scored using the updated models.

Machine learning Vapor pressure

Training and testing data were obtained from EPI Suites, Environmental Protection Agency (EPA). Methods for fitting these models are as outlined for bee repellency; namely, chemical features were computed using AlvaDesc, followed by processing the feature values and identifying a small predictive subset through the recursive feature elimination algorithm (RFE). To compare the vapor pressure model predictions with respect to different machine learning methods as well as EPI suite, data were split according to the train/test partitions defined in a previous study using the vapor pressure data for Machine Learning (ML) (29).

Fly trap assay

Drosophila melanogaster stocks maintained in the lab were grown in standard media bottles. Flies of a similar age (4-6 days) were collected in groups of 20 (10 females +10 males), and wet-starved for 24hrs in starvation vials. Testing chambers (Dram vials with centrifuge tube traps containing lure (10% Apple Cider Vinegar) plus test compound of desired dilutions) were assembled and flies quickly transferred to them by gentle tapping. The assays were left on the bench and data on number of flies entering traps recorded after18 hours.

2-choice honey bee plate assays

Honey bee (Apis mellifera) workers for our experiments were bred in apiaries at the University of California Riverside, between August 2021 and April 2022. To age them, we placed a frame of capped worker brood at the red-eye stage into an observation frame in a humidified incubator at 34°C. After 2-3 days, we collected groups of 80 newly hatched workers into specially 3D-printed experimental cages (12x10x7.5cm), where we kept them until they were reached an age of 13 to 19 days old, corresponding to foraging age). The bees received sugar water (50%) and water ad libitum, as well as a paste of protein powder (https://megabee.com/) mixed with 50% sugar water every second day until day eight.

The behavior arenas consisted of plastic Petri dishes with a diameter of 15 cm with two filter paper circles on each side (1 cm diameter) treated with 20 μl of solvent (acetone evaporated for 30 min) on one side, and 20 μl of test compound (20 μl, 5% in acetone, evaporated for 30 min) on the opposite side of the arena. To stimulate bees to participate, 50 μl of 50% honey-water in cut caps of PCR tubes was carefully taped to the center each of the two treated papers (Figure 2). For round two (Figure 3), 70 μl of pure (slightly warmed) honey was used in each cap (the honey surface formed an upward meniscus). The night before the experiment, the bees were water-starved, then cooled down at 4 °C for approximately 20 minutes before placing five bees into the middle of the arena. The spacing of the filter papers was kept consistent by taping a template (round piece of paper) to the bottom of each petri-dish and taping the filter papers on top of the correct spot onto the template. An Apple iPad or iPhone was used to film six arenas at once for one hour in a custom built PVC tube cage covered with a cotton sheet to create a uniform visual surrounding. To wake up the bees, the choice arenas were placed onto a heating pad (20’’x24’’, Model No PEHPWIDE-SG, Pure Enrichment, purenrichment.com, set to level 1).

Videos were analyzed offline. For each plate, the side the first bee chose to drink honey from first was recorded (first choice). An additional semi-quantitative metric was noted for Figure 3: an estimate of the percentage of honey consumed in each container 95% (meniscus gone, only), 75%, 50%, 25%, or 0%. Plates in which the bees had not consumed any honey from either side (non-participating plates) were excluded from further evaluations. For each batch of six plates, workers from the same colony were used. For repellent candidates with the most promise, bees from at least three different colonies were tested (at least six participating plates each) (Table S3).

Robbing assays in the field

For this assay we elicited the robbing behavior of honey bees, which means that they steal unguarded honey or honey guarded by weaker colonies (30). To do so, we sprayed standardized wax foundations (44 x 25.5 cm) with 50 puffs of 50% honey diluted in water as an attractant. Chemicals were sprayed on top at 0.1mg/cm2 (15 puffs of 5% in acetone). Three foundations, solvent, DEET and test compound, were affixed on a standing poster board (92x122 cm) placed on a table five meters in front of the first row of 40 hives and filmed for robbing behavior by the bees for 30 min, using an iPad or an iPhone. Prior to the start of the days experiments, three honey frames and honey lures were placed on a table for ∼15-30 min to trigger robbing behavior of bees from nearby colonies, and then removed before trials. The position of the three wax foundations to each other was randomly shifted. After 25 min, the positive control sugar-water sprayed wax foundations were no longer attractive to honey bees and the analysis was restricted to this time point. We repeated each assay a minimum of six times. Moreover, we repeated the assays with the two most promising candidates at half the first concentration (∼0.05 mg/cm2) four times each.

Statistics

To compare the proportion of plates per test batch in which the bees chose to drink honey from the control side versus the treated sides, we used non-parametric, pairwise McNemar tests, assuming the null hypothesis of an equal distribution between choices or 50/50. The non-parametric, pairwise Wilcoxon Signed-Rank tests were used to compare the percentage of honey consumption between treatments. Plates in which bees did not drink any honey were removed (non-participating plates). For the assays on Drosophila behavior, we performed ANOVA followed by Tukey’s multiple comparison tests to compare the number of flies entering the trap between treatments and control. For the robbing assays, we summed up the counts of visiting bees on each wax foundation at 5, 10, 15 and 20 minutes, then used non-parametric Kruskal Wallis tests for pairwise comparisons between each of the three treatments. We adjusted the p-values for multiple testing with Bonferroni corrections.

Supplementary figure and tables

(related to Figure 3). Average percentage of two-choice plate assays with honey bees making the first choice to move to the repellent treated side or solvent treated side across different honey bee colonies, N=3-6 colonies (∼6 plates/colony). Error bars =s.e.m., P*<0.05, **<0.001.

Optimized DRAGON physicochemical descriptor set from iterative training in Figure 3A used to predict bee repellent compounds.

Repellency testing in the 2-choice plate assay from initial round of predictions.

Repellency testing in the 2-choice plate assay from second iteratively trained round of predictions.

Acknowledgements

This work was supported by funding provided by the California Research Alliance (CARA) by BASF.