Supplementary file 1.
(A) Optimized descriptor sets for each Drosophila Or. Optimized descriptors occurrences, symbol, brief description, class, and dimensionality are listed. A summary of the total number of descriptors selected for the receptor repertoire is provided at the beginning. Descriptors are listed in ascending order of when they were selected into the optimized set, such that the descriptors selected first are more important. Weights indicate the number of times a descriptor was selected in an optimized descriptor set. (B) Top 100 predicted compounds for each Drosophila Or. Chemical name or Pubchem compound ID (CIDs), SMILES strings, and distances, of the top ∼100 predicted compounds for each Or. All distances represent the minimum distance based on optimized descriptors to the previously known strongest active compound listed in the gray cells for that particular Or.