Automating an insect biodiversity metric using distributed optical sensors: an evaluation across Kansas, USA cropping systems

Klas Rydhmer; James O. Eckberg; Jonathan G. Lundgren; Samuel Jansson; Laurence Still; John E. Quinn; Ralph Washington Jr.; Jesper Lemmich; Thomas Nikolajsen; Nikolaj Sheller; Alex M. Michels; Michael M. Bredeson; Steven T. Rosenzweig; Emily N. Bick

doi:10.7554/eLife.92227.1

eLife assessment

This study presents useful work comparing different techniques for monitoring insect species in agricultural settings, including a brand new one using optical sensors. That said, the data were analysed using an inadequately-described -- or potentially inadequate -- framework, and more careful thought must be given to the interpretation of the results before the new methodology can be used as a starting point for insect studies in agricultural fields and beyond.

https://doi.org/10.7554/eLife.92227.1.sa2

Significance of findings

useful: Findings that have focused importance and scope

landmark
fundamental
important
valuable
useful

Strength of evidence

inadequate: Methods, data and analyses do not support the primary claims

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

Global ecosystems and food supply depend on insect biodiversity for key functions such as pollination and decomposition. High-resolution, accurate data on invertebrate populations and communities across scales are critical for informing conservation efforts. However, conventional data collection methodologies for invertebrates are expensive, labor intensive, and require substantial taxonomic expertise, limiting researchers, practitioners, and policymakers. Novel optical techniques show promise for automating such data collection across scales as they operate unsupervised in remote areas. In this work, optical insect sensors were deployed in 20 agricultural fields in Kansas, USA. Measurements were compared to conventional assessments of insect diversity from sweep nets and Malaise traps. Species richness was estimated on optical insect data by applying a clustering algorithm to the optical insect sensor’s signal features of wing-beat frequency and body-to-wing ratio. Species richness correlated more strongly between the optical richness estimate and each of the conventional methods than between the two conventional methods, suggesting sensors can be a reliable indicator of invertebrate richness. Shannon- and Simpson indices were calculated for all three methods but were largely uncorrelated including between conventional methods. Although the technology is relatively new, optical sensors may provide next-generation insight into the spatiotemporal dynamics of invertebrate biodiversity and their conservation.

Significance Statement

The implications of this research extend from the field level to the regional level. Much of what scientists understand about the decline of invertebrates comes from a small number of long-term studies that can be coarse and correlational in nature. High-resolution biodiversity data sets on fields to landscapes may provide the insight needed for the successful management and accounting of biodiversity by practitioners and policymakers. Such high-resolution data has the potential to support global efforts and coordination of biodiversity conservation.

Introduction

Invertebrate biodiversity is fundamental to ecosystem processes, functions, and services (Yang & Gratton, 2014). Monitoring invertebrate populations and communities can inform management and policy at multiple scales. Such data are critical to agricultural production and sustainability (Landis, 2017). However, invertebrate biodiversity is difficult to quantify (Geiger et al., 2016; Shortall et al., 2009) and monitor at broad spatial and temporal scales (Sánchez-Bayo & Wyckhuys, 2019; Tilman et al., 1994). The difficulty is largely due to the necessity of skilled labor required for taxa identification on which biodiversity quantification relies (Wägele et al., 2022), and is both limited and prohibitively costly (Gardner et al., 2008). Common approaches to collecting insect inventories include sweep netting as well as Malaise-, pan-, and light traps. Each method has its own bias toward certain insect groups (Bick et al., 2020) often resulting in the concurrent use of techniques in studies and practice (LaCanne & Lundgren, 2018).

New technology is needed to monitor invertebrate biodiversity in real time for agricultural systems. Such a tool would provide data to support biodiversity-focused management at field to landscape scales (LaCanne & Lundgren, 2018) and allow for tracking of the impact of conservation measures, or the lack thereof. Automation of systems has the potential to reduce labor, time, costs, and human error. While many automated insect monitoring tools are available for agricultural pest monitoring (Bick et al., 2023; Silva et al., 2013), overall, these approaches are not suitable for assessing biodiversity as they focus on the identification of indicator species, not communities (J. G. Lundgren & Fausti, 2015a). The automatic quantification of invertebrate biodiversity could improve the data available for monitoring and evaluation of conservation efforts but currently, no method exists at scale (Wägele et al., 2022), despite calls for such data and analytics to inform the assessment and management of ecosystems (García et al., 2023).

Real-time data on invertebrate biodiversity likely would improve our understanding of insect population changes at a regional or even global scale, filling a gap in the tracking of insect change. The incorporation of ‘big data’ has been shown to help mitigate some methodological biases (Geiger et al., 2016). One such effort is the global malaise project that is using automated taxonomic identification from traps using DNA, addressing the most labor-intensive part of this method (Krishna Krishnamurthy & Francis, 2012). It is a highly promising ‘big data’ approach; unfortunately, the method over-represents known species, has an inherent sampling bias towards flying insects and emphasizes species with large mitochondrial differences. Optical entomological methods such as lidar, where an optical signal is recorded from insects flying through a beam of emitted light, can record large numbers of insect flights without using a lure. However, it is unclear how optical sensors compare to conventional methods in measuring populations and communities (Garcia et al., 2023; Rydhmer et al., 2022).

The goal of this study is to determine if the measurement of an insect biodiversity metric can be automated with the use of optical near-infrared insect sensors. In this work, we deployed sensors (Rydhmer et al., 2021) in 20 agricultural fields across six crops in Kansas, USA. The sensors were deployed alongside Malaise traps and the sites were sampled with sweep nets. Each site was evaluated on two different occasions to capture seasonal changes. Specifically, we compared conventional methods to each other and with the automated biodiversity metric utilizing unsupervised clustering of data collected by a lidar-based sensing method.

Materials and Methods

Data collection

Insect populations were monitored at 20 sites (Figure 1) in June and July of 2020 using sensors alongside conventional methods (Malaise traps and sweep nets). Representative agricultural crops of central Kansas were sampled including three corn, three sorghum, six soybean, one alfalfa, two pasture, and five complex cover crops. The complex cover crops consisted of approximately eight species of annual grass and forb cover crops. An autonomous near-infrared sensor (described in (Rydhmer et al., 2021) and produced by FaunaPhotonics ApS., Copenhagen SV, Denmark) was placed ∼50 m from the field margin and was monitored continuously for two periods of three days in June and in July. The sensor uses light-emitting diodes to transmit infrared light (810 nm & 970 nm), creating a measurement volume between 5 and 70 L, depending on insect size (Rydhmer et al., 2021). Insects flying in front of the sensor back-scatter light, which is recorded by a photodiode as a time signal.

Map of 20 field site locations distributed around central Kansas. Fields are color-coded by crop type. Field dots are enlarged and shifted to maintain the anonymity of the participating farms. Map data from www.openstreetmap.org.

Insect recordings are automatically separated from noise originating from other sources (e.g. rain drops or plant interference) using proprietary cloud-based neural network software, as used in Bick et al., 2023 and Rydhmer et al., 2021. Additionally, observations without clearly identified wingbeats or body-to-wing ratios were discarded. A total of 1,057,115 observations were recorded, of which 106,083 remained after filtering and were included in the study. A recorded observation consists of time series data from which information pertaining to the physical features of the individual insect can be obtained (Rydhmer et al., 2021).

Sensors were compared with conventional sampling of invertebrates (sweep nets and Malaise traps) in the same fields. Foliar and low flying insects were captured using a sweep net (38 cm diam., Bioquip™, Rancho Dominguez, CA, USA). Insects were collected at 50, 100, and 150 m from the field edge along a linear transect. Sweeps (n = 50 per location) were performed perpendicular to the transect, parallel to the field edge. Insects were transferred to a sealed plastic bag and were frozen until processed. In the laboratory, insects were thawed, sorted from the plant material, and identified.

Malaise traps were deployed at each site to capture the aerial insect community. A single bi-directional, Townes-style trap (dimensions 1.8 long; 1.8 m at its tallest height, and 1.2 m at its shortest height) was placed 100 m from the margin and adjacent to the ecosystem service sampling areas. The wall of the trap was parallel with the field margin. The traps were allowed to operate for 24 h, and the insects captured in the collection vials were preserved in ethanol.

All specimens collected by sweep net and malaise traps were identified to the lowest possible taxonomic unit (i.e., species or morphospecies). Due to a lack of species identification knowledge and time limitations, thrips (Insecta: Thysanoptera) were not identified beyond the family level and were not included in community metrics analyses (abundance, species richness, and diversity).

All immature insects were identified to family and grouped together, except for lepidopteran larvae, which were categorized as morphospecies independent of the adult stage due to their functional differences. All other specimens were identified to species using written and online taxonomic keys. Specimens that were not able to be positively identified to species were separated into distinct morphospecies. Voucher specimens of all taxa are housed in the Mark F. Longfellow Biological Collection at Blue Dasher Farm, Estelline, SD.

Ecosystem services were evaluated for insect and weed seed predation. First, invertebrate predators were isolated from both the soil and foliar communities. Additionally, predation rates in each field were assessed using sentinel wax moths (Galleria mellonella L. [Lepidoptera: Pyralidae]) larvae following the Lundgren et al., 2006 protocol, using 15 sentinels per plot arranged in three 5 × 3 7.5 m grid orientations (n = 45 per field). Weed seed predation was assessed from isolating soil and foliar granivore communities and their services using seed cards as described in Lundgren et al., 2006. Granivore services were measured on three abundant weed species (Johnsongrass (Sorghum halapense (L.) Pers.; Poaceae), lambsquarters (Chenopodium album L.; Amaranthaceae) and redroot pigweed (Amaranthus retroflexus L.; Amaranthaceae), V & J Seed Farms, Woodstock, IL, USA). Seeds were attached to 10 × 8 cm plastic cards (Avery™ insertable plastic dividers; #11200; Brea, CA, USA) using 6 cm strips of double-sided tape (Scotch, 3M, St Paul, MN, USA). Each species (n = 20 seeds of each species; 60 seeds per card) were placed on a 2 × 10 pattern on each card. Fine quartz sand was spread over exposed areas of the tape to exclude visiting invertebrates. To exclude granivorous vertebrates, a wire cage (14 × 12 cm cage, 1.4 × 1.4 cm mesh opening) was placed over the card and placed >3 cm above it. Control cards were used to account for seed loss from environmental factors such as wind or rain and contained 1.5 × 1.5 mm black glass beads (Cousin™ DIY, #AJM61215021, Largo, FL, USA) of comparable size as the weed seeds (Lundgren et al., 2006). Each plot received three seed cards and one control card (n = 9 seed cards and three control cards per field), placed on the soil surface in the four corners of each plot. Granivory was measured as the number of seeds removed or damaged per card after a 3-dayosure.

Data analysis

The wing-beat frequency (WBF) and body-to-wing ratio (BWR) were calculated from all observed insect flights similarly to previous groups (Gebru et al., 2018; A. Genoud et al., 2020; A. P. Genoud et al., 2019). The signal from the insect body (σ_b) and the diffuse and specular signal contributions from the insect wings (σ_dw and σ_sw) are estimated and separated using sliding minimum, sliding median and sliding maximum filters with a filter width corresponding to the wing beat period of the insect. The BWR is defined as the closed ratio between the body and wing contributions according to equation (1). An example of an insect signal is shown in Figure 1a.

Insects of the same species exhibit similar physical properties, and therefore also similar signal features (Kirkeby et al., 2021). Normalization of the feature space is a standard procedure prior to clustering. While BWR values are bound between 0 and 1 by definition (equation 1), WBF values frequencies typically vary between 20 Hz and 1 kHz (Jansson et al., 2019). WBFs were therefore divided by 1000 to produce values falling predominantly between 0 and 1. For clustering, we used the DBSCAN (Density-based spatial clustering of applications with noise) algorithm (Ram et al., 2010) due to its suitability in identifying clusters without a Gaussian distribution assumption (Ester et al., 1996). DBSCAN uses two parameters, the minimum number of insects needed to form a cluster (min_samples) and the merge distance ϵ, to determine which observations to merge into clusters. Data points too far away from any cluster and too sparsely distributed to form a new cluster are defined as outliers. This method was used to calculate the number of clusters or distinct groups (i.e. richness) and a diversity index of cluster groups based on Shannon and Simpson indices.

All insects collected with Malaise traps and sweep nets were classified by order, family, and species when possible. Then species richness (defined as the number of distinct taxonomic species present, independent of abundance), Shannon index, and Simpson index were calculated on the insect samples from both conventional methods for each field in June and July.

The data from the capture methods were randomly divided into two data sets: one used to optimize the DBSCAN clustering algorithm, and the other used for testing. To have a sufficiently large test set, the optimization set was limited to 30% of the data collected. During the optimization, ϵ and min samples were tuned to maximize the Spearman correlation between biodiversity metrics from the sensors and conventional metrics using stochastic gradual descent. This process was repeated for the richness and Shannon and Simpson indices for each of the trapping methods, plus an additional model fitted to the combined species richness from both conventional methods. Shannon and Simpson indices were not calculated on the combined dataset since these indices rely on the relative abundance of species, which are not comparable between the two methods.

Optimal parameters could be found that produced significant correlation (p < 0.05) for four of the seven comparative measures; however, no parameters could be found that satisfactorily modeled the Shannon index from the sweep netting nor the Simpson index for either trapping method.

Spearman-rank correlations between the clustering results calculated from the optical sensor data and the biodiversity measures obtained with the two physical insect field-sampling methods were calculated. Additionally, Analyses of Variance (ANOVA)and TukeyHSD post hoc analyses were conducted to evaluate the impact of sampling month, crop type, and field on richness estimates.

Results

In total, 106,083 insect flight events were recorded by the sensors. The Malaise traps collected 14,641 insects, whereas sweep nets collected 15,858 insects (Figure 3). The optical sensors recorded approximately one order of magnitude more insect flights compared to the number of insects collected with each of the conventional methods (Figure 3). Insect abundance was uncorrelated between all methods, including both conventional methods and the automated method (Figure 4; Malaise trap counts and sweep net counts r = 0.25, p =0.16, sensor events and sweep net counts r = 0.05, p = 0.78, sensor events and Malaise trap counts r = 0.05, = <0.88).

The number of insects collected using sweep nets (top panel) and Malaise traps (middle panel), and insect flight events recorded with the sensor (bottom panel) per field. Insect numbers are separated by month with insects observed in June visualized with blue bars and in July with red bars.

Scatter plots of measured insect abundance comparing the monitoring methods on a logarithmic scale. a) Scatter plot of the number of insects captured with sweep nets and Malaise traps. b) Scatter plot of the number of insects captured with sweep nets and the number of insect flight events recorded by the sensor. c) Scatter plot of the number of insects captured with Malaise traps and the number of insect flight events recorded by the sensor. No correlations were found on insect abundance for any of these methods.

Comparing the relative insect abundance between orders collected with conventional methods (Figure 5) depicts differences in capturing biases due to methodology. Diptera were most frequently collected from the Malaise traps, whereas Hemiptera, then Diptera and Psocoptera were more frequently captured with sweep nets. In general, less flight-active insects were more prominent in the sweep net data.

The number of insects collected with sweep net sampling and Malaise trap monitoring, aggregated by order.

There were no discernible differences in variation between time points from sensors (F = 6.091, Pr = 0.0191) and sweep nets (F = 1.326, Pr = 0.258). However, Malaise trap abundance showed significantly greater insect densities in July (μ = 76.9, F = 9.71, Pr = 0.0037) than June (μ = 43.3). Crop type was found to impact sweep net abundance (F = 3.369, Pr = 0.008) but not sensor (F = 1.644, Pr = 0.17) or Malaise trap (F = 1.692, Pr = 0.152) abundance estimates. A series of TukeyHSD post hoc analyses found no differences in abundance estimates between sample time points for each field.

Insect species richness was estimated from sensor-recorded insect flight events using a set of seven DBSCAN parameters (models) to cluster the held-out test data, yielding the number of clusters per field – the novel richness metric (Figure 2b). The correlation between the number of clusters and each of the comparative biodiversity metrics are shown in Table 1.

Correlations between the automated biodiversity metrics calculated from sensed insect data, and those obtained from Malaise trap and sweep net collections. Rows in the table denote which data was used to fit the clustering algorithm, whereas columns indicate which parameters the obtained correlations refer to. Correlations with a p-value below 0.05 are significant and marked in bold.

Example of an insect event’s signal and clustering. a) An example of an insect event recorded from the sensor. The wing beats are visible as modulations on top of the signal. The dashed red, solid magenta and dash-dotted blue curves show the body, diffuse- and specular wing signals respectively. The BWR is the ratio between the magnitude of the body- and diffuse wing signal. b) Clustered insect events recorded by the sensor in a soybean field (Field R) in July. The grey events are too sparse to form clusters and are therefore discarded.

Per field, the mean number of clusters that approximate richness was 41.1 (N = 34, SE = 3.29). The Malaise traps had a mean field richness of 60.5 species (N = 37, SE = 6.43) containing 10 orders, 146 families, and 709 species. The mean richness observed in the sweep nets was 47.4 species (N = 35, SE = 5.53), containing 11 orders, 149 families, and 664 species. Combined, the collected samples with both field-sampling methods contained 941 species distributed over 183 insect families in 11 orders.

The three models fit on species richness are generally comparable (fit on sweep net, Malaise trap, and their combined data). Identical DBSCAN parameters were calculated when the models were fit on the combined sweep net and Malaise trap richness. We therefore used this model termed the ‘automated biodiversity metric’ to evaluate the relationships between sensors, and the conventionally measured species richness and ecosystem services.

All species richness metrics were correlated (Figure 6a-d). The weakest correlation was between Malaise trap and sweep net richness metrics (R = 0.36, p = 0.046). The correlation between the number of clusters calculated in the sensor data and the conventional models was strongest for the combined richness, which was what the model was fitted on (R= 0.55, p = 0.012; Figure 6d). Significant yet weaker correlations were also found between the model and the Malaise trap and sweep net richness respectively (R = 0.52, p = 0.014; R = 0.48, p = 0.028).

Scatter plots and Spearman correlations for the species richness estimations across all models. The sensor results are from the model fitted to the total richness in both Malaise traps and sweep nets. a) Species richness calculated from Malaise traps vs. sweep net samples, b) species richness calculated from sensor clusters vs. sweep net samples, c) species richness calculated from sensor clusters vs. Malaise trap samples, and d) species richness calculated from sensor clusters vs. total richness across traps and sweeps.

No correlations were found when comparing sensor richness to any ecosystem services (Table 2). Conventional sampling methods were typically not correlated with ecosystem services with one exception. Sweep net species richness was positively correlated with the percent of waxworms predated (Table 2).

Spearman’s correlation table between richness metrics calculated from sweep nets, Malaise traps, combined conventional methods, and sensor clusters (automated biodiversity metric) compared to ecosystem services of percent waxworm predation, total number of predators, Johnsongrass predation, Pigweed predation, Lambsquarter predation, and all seed predation.

Discussion

This work serves as the first field-validated insect biodiversity metric using autonomous distributed optical sensors (Kouakou et al., 2020). The automated biodiversity metric was calculated and validated from flight events of sufficiently high quality to be able to extract a wingbeat frequency and body-wing ratio. Our results suggest that the sensor-derived metric is correlated with conventional estimates of biodiversity. Specifically, the sensor-derived biodiversity metric optimized for insect species richness is more correlated with each conventional sampling method than these methods are to one other (Figure a-d) (Malaise trap – sweep net R = 0.36; Malaise trap – sensor R = 0.52; sweep net – sensor R = 0.48). This indicates that metrics derived from optical sensors are likely generalizable and thus have the potential to provide accurate and autonomous measurements of insect species richness. Ecosystem generalizability was demonstrated by deriving and testing the biodiversity metric across six major crops in central Kansas. Still, future work is needed to evaluate the extent to which the metric may be generalized across agroecosystems outside our study area and to other terrestrial ecosystems. Current results indicate that the insect diversity metric may be calculated for a variety of functionally different crops without the need to classify insects into taxonomic groups. This approach is beneficial as such classification at present requires skilled labor and significant time. This metric may provide new insight into the management of ecosystems as significant and growing evidence suggests that biodiversity is correlated with greater ecosystem functions such as pest control (Lundgren & Fausti, 2015). Future work may focus on characterizing the composition of insect communities and species to address specific needs of managing ecosystems beyond biodiversity.

The automated biodiversity metric and Malaise trap species richness significantly correlated across all method iterations, save one (Table 1). Sweep net richness only correlated with two of the method iterations. The stronger correlation between the sensors and Malaise traps is hypothesized to result from these methods monitoring flying insects continuously, compared to sweep net sampling. Results were less clear for the correlations with Shannon and Simpson species diversity indices. The models fitted on Malaise trap richness also was significantly correlated with for Malaise trap Shannon index (Table 1). This is likely due to the co-correlation between the richness and Shannon index in the malaise trap (R=0.6, p=0.01, Supplementary Table 3). Other similar curiosities, such as the negative correlation with the Malaise trap Shannon index achieved when fitting on the sweep net Simpson index are also assumed to be the results of co-correlations between the conventional methods. A full table of all co-correlations is included as supplementary material (Supplementary Table 3). However, when fitting on Shannon index from the sweep-net data no correlation was found. This is likely due to overfitting, where the model performed well on the fitting data but did not generalize to the rest of the dataset. A larger fitting dataset is needed to resolve this issue. No model resulted in significant correlations in Simpson indices between any of the sampling methods. The lack of consistent correlations between biodiversity metrics may also reflect the nature of the biodiversity indexes which considers species evenness, a characteristic not fully accounted for in the DBSCAN algorithm based on minimum thresholds for classification of clusters.

The sensor observed the greatest number of insects, recording almost one order of magnitude more than both the Malaise traps and sweep nets (7.25 and 6.69 times, respectively). This difference is likely explained first by the observational period and then by methodology. Both sensors and Malaise traps continuously monitored each field, unlike sweep nets which collect insects at discrete time points. The sensors’ monitoring period was three times longer than the Malaise traps. However, even after accounting for the greater measurement period, the sensor methodology was still 2.42 times more efficient than Malaise traps at observing all insects. A previous study reported sensors observed 19 times the number of insects compared to those collected in water traps, another continuous monitoring method (Rydhmer et al., 2021). Specific insect species are also detected more efficiently. For example, the sensor was reported to be 18.6 and 6.7 times more efficient than plant counts (discrete sampling) and water traps (continuous monitoring) at observing pollen beetles (Brassicogethes aeneus; Bick et al., 2023). Greater sampling efficiency may be associated with measurement volume, a potential correlate of insect counts. Optical sensors with greater measurement volumes report recording tens of thousands of insect flights per day (Brydegaard et al., 2020). A potential confounding effect of sensors is the possible ‘double counting’ bias, as an individual insect can be recorded repeatedly. While ‘double counting’ possibly explains the greater number of observed insects, such biases are a common limitation of count-based inferences on population dynamics (Elphick, 2008). Despite its potential limitations, our understanding of complex species and community dynamics can benefit greatly from automation that significantly increases sampling intensity across space and time.

The lack of correlation of abundance across all three methods (Figure 4) is surprising as previous work has shown correlations between sensor measurements and water traps for insect abundance (Rydhmer et al., 2021). Disparities in sampling timing may be contributing to the lack of correlation. While sweep netting occurred in conjunction with setting up or taking down the Malaise traps, these efforts were substantially less correlated with the setup of the optical sensors: to the nearest 3 days in June and the nearest 22 days in July. The lack of correlation between the Malaise traps and the sensors may be due to the long period between the monitoring sessions at each site. Insect flight activity is heavily influenced by the weather, or the seasonal differences between the beginning and end of July – both of which may also explain the significance of the month on Malaise trap data. An additional factor may be the high noise composition of the recorded signals due to plant interference. During cleaning of this dataset, it is possible that variations in the relative degree of noise signals between fields (e.g. as a result of different crop heights and stiffness) resulted in more data loss from noisier fields, thus introducing a systematic error in abundance measurements for the sensor data. However, it should be noted that we observed no statistical correlation in abundance collected with Malaise traps versus sweep nets.

One challenge with the sensor’s dataset was the high proportion of noise signals, thought to result from plant interference. Of the total 1,057,115 signals recorded by the sensor, only ∼10% were classified as insects and included in the analysis. While we believe the noise classification filter is highly accurate, misclassified events may alter the total count. The signals generated by insects and plants moving through the sensor’s measurement volume are very different. Most non-insect events show no high-frequency components and are therefore correctly removed by the noise filter. However, plants modulating in front of the sensor may appear to have a wing beat frequency and would be misclassified as insects. It is also hypothesized that strong signals generated by vegetation interference may obscure weaker signals generated by small insects.

Regardless, misclassification is likely low enough to substantially alter the representation of the insect population or the automated biodiversity metric.

There is a need for ‘big data’ in entomology, and more particularly for measuring biodiversity to inform conservation. Autonomous optical sensors, such as the ones used in this study, provide one such solution that offers continuous, potentially real-time monitoring to support next generation, big data insight to the field of entomology. Moreover, the success in leveraging sensor data to calculate a biodiversity species richness metric indicates great promise in the use of autonomous sensors to monitor biodiversity. The use of sensors for insect and insect diversity monitoring is faster, in this case potentially more representative of richness, and likely cost-effective due to a decrease in labor compared to conventional methods. Sensors complement conventional sampling methods by allowing for real-time estimation of biodiversity, reducing time lags associated with traditional species inventory. Automated methods presented in this paper, once a generalizable calibration has been determined, offer faster estimates of biodiversity which will support time-critical decision making and conservation planning efforts. Methods such as the one described in this paper do not rely on identifying taxonomic groups and remove human error (a major concern for insect identification). Furthermore, standardized sensors are not prone to local and regional variations in sampling methods and may therefore be able to facilitate comparative biodiversity monitoring on a global scale.

There is a need to scale up and scale down sensor monitoring to understand species dynamics. The current technology complements the entomological radar group ‘BioDAR’ which is aiming to use libraries of insect radar signals for functional group classifications for high flying migratory insects at a regional scale (Rhodes et al., 2022). Similar estimates of insect functional groups might be similarly inferred from optical sensor recordings for all flying insects on a field scale.

Similarly, vertical looking radar is used to classify insects into higher level taxonomic groups such as Order or even Genus (Chapman et al., 2002; Stefanescu et al., 2013; Wood et al., 2009). It seems likely that similar or even higher precision can be achieved by including taxonomic information in clustering algorithms, such as specific orders (e.g. Lepidoptera, Coleoptera, Diptera). Future work could focus on identifying these groups, determining functional biodiversity, and quantifying their contribution to ecosystem services.

The current study shows a single instance of correlation between richness and a measure of ecosystem services. Greater species richness does not always translate into an increase in functional biodiversity or ecosystem services, as there is often ecological redundancy (Greenop et al., 2018). The lack of a relationship may also reflect different ecological interactions among species in the upper canopy versus above canopy level. These questions can be further explored in future work when the sensor’s ability to estimate functional biodiversity has been developed.

Conclusion

Conservation of biodiversity is gaining recognition as a global challenge with similar significance to climate change. However, unlike global climate, species populations and biodiversity function across different local to regional spatiotemporal scales. Detailed data on insect diversity across these scales is needed to assess the decline and inform conservation efforts. There is a call to automate the collection of in situ data and integrate such data with remote sensing-based models to accelerate conservation of global ecosystems (Garcia et al 2023). These integrated technology and big data approaches are especially needed for the conservation of invertebrates and could expand upon and accelerate the long-term and detailed monitoring efforts of invertebrate biodiversity (Sánchez-Bayo & Wyckhuys, 2019) The current study demonstrates successful development of in situ data collection able to be integrated with remote sensing models as described by Garcia et al 2023. Such approaches are poised to support the Intergovernmental Science-Policy Platform on Biodiversity and Ecosystem Services with rich, real-time data and help inform global biodiversity models that have had to rely on coarse, low-resolution data sets in some cases (Schipper et al., 2020).

Real-time feedback of simple metrics of biodiversity would greatly benefit agriculture by demonstrating the association between management and biodiversity in real-time, without time lag to process samples. The combination of pest detection with optical sensors (Bick et al., 2023; Kirkeby et al., 2021) and monitoring of biodiversity may inform integrated pest management and reduction of pesticide applications. Farming informed by such data analytics may deliver significant benefits to farmers including substantial reductions in pests (Lundren and Fausti 2015) and significant economic benefits at both farm and regional levels (Landis et al 2008, LaCanne and Lundgren 2018). Networks of local monitoring sensors may scale up to infer regional biodiversity and inform its management. The current technology could complement the global Malaise trap initiative by increasing the number of observations by an order of magnitude and providing earlier warning signs of regional pest movement or species decline. Our results suggest AI supported human expertise may provide the most efficient, robust inference on biodiversity.

Thus, we are not advocating such technology replaces conventional monitoring, but rather that this automation enhances the state of the art. Autonomous monitoring has the potential to revolutionize the field of entomology by forming the basis for a next generation of species to community insect models predicting the dynamics of invertebrates.

Acknowledgements

We thank the USDA Cheney Lake Conservation District (Lisa French), Understanding Ag (Ray Archuleta) for logistical support and field assistance. Tom Rabaey (General Mills) provided helpful feedback and advice to guide this project. We thank Kevin James Knagg and Mads Fogtmann from FaunaPhotonics A/S for facilitating this work. We thank Ecdysis Foundation field and lab technicians for collecting, processing, and Dr. Kelton D. Welch for identifying invertebrate specimens. We thank the farmers cooperating in the General Mills regenerative Agriculture Programs who granted us access and permission to sample their farm fields.

Supplementary information

A table describing the crop type and number of insects observed in each field in June and July across all three methods.

Measured insect abundance per crop and monitoring method. Mean and standard deviation.

Co-correlations of all biodiversity metrics.

Model parameters for each fitted metric.

A box plot depicting richness metrics from the Malaise traps, sweep nets, and sensors by field type.

A scatterplot depicting the correlation of the species richness metrics at each field, separated by the June and July timepoints.

References

1. Bick E.
2. Dryden D. M.
3. Nguyen H. D.
4. Kim H.
2020A Novel CO2-Based Insect Sampling Device and Associated Field Method Evaluated in a Strawberry AgroecosystemJournal of Economic Entomology https://doi.org/10.1093/jee/toz359
1. Bick E.
2. Sigsgaard L.
3. Torrance M. T.
4. Helmreich S.
5. Still L.
6. Beck B.
7. Rashid R. El
8. Lemmich J.
9. Nikolajsen T.
10. Cook S. M.
2023Dynamics of pollen beetle (Brassicogethes aeneus) immigration and colonisation of oilseed rape (Brassica napus) in EuropePest Management Science https://doi.org/10.1002/ps.7538
1. Brydegaard M.
2. Jansson S.
3. Malmqvist E.
4. Mlacha Y. P.
5. Gebru A.
6. Okumu F.
7. Killeen G. F.
8. Kirkeby C.
2020Lidar reveals activity anomaly of malaria vectors during pan-African eclipseScience Advances 6https://doi.org/10.1126/sciadv.aay5487 Google Scholar
1. Chapman J. W.
2. Reynolds D. R.
3. Smith A. D.
4. Riley J. R.
5. Pedgley D. E.
6. Woiwod I. P.
2002High-altitude migration of the diamondback moth Plutella xylostella to the U.K.: a study using radar, aerial netting, and ground trappingEcological Entomology 27:641–650https://doi.org/10.1046/j.1365-2311.2002.00472.x Google Scholar
1. Elphick C. S.
2008How you count counts: The importance of methods research in applied ecologyJournal of Applied Ecology 45:1313–1320https://doi.org/10.1111/J.1365-2664.2008.01545.X Google Scholar
1. Ester M.
2. Kriegel H.
3. Sander J.
4. Xu X.
1996A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with NoiseKDD’96: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining 96:226–231Google Scholar
1. García G. C.
2. Bagstad K. J.
3. Brun J.
4. Chaplin-Kramer R.
5. Dhu T.
6. Murray N. J.
7. Nolan C. J.
8. Ricketts T. H.
9. Sosik H. M.
10. Sousa D.
11. Willard G.
12. Halpern B. S.
2023The future of ecosystem assessments is automation, collaboration, and artificial intelligenceEnviron. Res. Lett 18:11003https://doi.org/10.1088/1748-9326/acab19 Google Scholar
1. Garcia K.
2. Olimpi E. M.
3. M’Gonigle L.
4. Karp D. S.
5. Wilson-Rankin E. E.
6. Kremen C.
7. Gonthier D. J.
2023Semi-natural habitats on organic strawberry farms and in surrounding landscapes promote bird biodiversity and pest control potentialAgriculture, Ecosystems & Environment 347:108353https://doi.org/10.1016/J.AGEE.2023.108353 Google Scholar
1. Gardner T. A.
2. Barlow J.
3. Araujo I. S.
4. Ávila-Pires T. C.
5. Bonaldo A. B.
6. Costa J. E.
7. Esposito M. C.
8. Ferreira L. V.
9. Hawes J.
10. Hernandez M. I. M.
11. Hoogmoed M. S.
12. Leite R. N.
13. Lo-Man-Hung N. F.
14. Malcolm J. R.
15. Martins M. B.
16. Mestre L. A. M.
17. Miranda-Santos R.
18. Overal W. L.
19. Parry L.
20. …Peres C. A.
2008The cost-effectiveness of biodiversity surveys in tropical forestsEcology Letters 11:139–150https://doi.org/10.1111/J.1461-0248.2007.01133.X Google Scholar
1. Gebru A.
2. Jansson S.
3. Ignell R.
4. Kirkeby C.
5. Prangsma J. C.
6. Brydegaard M.
2018Multiband modulation spectroscopy for the determination of sex and species of mosquitoes in flightJournal of Biophotonics 11https://doi.org/10.1002/jbio.201800014 Google Scholar
1. Geiger M. F.
2. Moriniere J.
3. Hausmann A.
4. Haszprunar G.
5. Wägele W.
6. Hebert P. D. N.
7. Rulik B.
2016Testing the Global Malaise Trap Program – How well does the current barcode reference library identify flying insects in Germany?Biodiversity Data Journal 4https://doi.org/10.3897/BDJ.4.E10671 Google Scholar
1. Genoud A.
2. Gao Y.
3. Williams G.
4. Thomas B.
2020A comparison of supervised machine learning algorithms for mosquito identification from backscattered optical signalsEcological Informatics 58Google Scholar
1. Genoud A. P.
2. Gao Y.
3. Williams G. M.
4. Thomas B. P.
2019Identification of gravid mosquitoes from changes in spectral and polarimetric backscatter cross sectionsJournal of Biophotonics 12https://doi.org/10.1002/JBIO.201900123 Google Scholar
1. Greenop A.
2. Woodcock B. A.
3. Wilby A.
4. Cook S. M.
5. Pywell R. F.
2018Functional diversity positively affects prey suppression by invertebrate predators: a meta-analysisWiley Online Library 99:1771–1782https://doi.org/10.1002/ecy.2378 Google Scholar
1. Jansson S.
2. Gebru A.
3. Ignell R.
4. Abbott J.
2019Correlation of mosquito wing-beat harmonics to aid in species classification and flight heading assessmentEuropean Conference on Biomedical Optic Google Scholar
1. Kirkeby C.
2. Rydhmer K.
3. Cook S.
4. Strand A.
2021Advances in automatic identification of flying insects using optical sensors and machine learningScientific Reports 11:1555Google Scholar
1. Kouakou B. K.
2. Jansson S.
3. Brydegaard M.
4. Zoueu J. T.
2020Entomological Scheimpflug lidar for estimating unique insect classes in-situ field test from Ivory CoastOSA Continuum 3:2362–2371Google Scholar
1. Krishna Krishnamurthy P.
2. Francis R. A.
2012A critical review on the utility of DNA barcoding in biodiversity conservationBiodiversity and Conservation 2012 21:8 21:1901–1919https://doi.org/10.1007/S10531-012-0306-2 Google Scholar
1. LaCanne C. E.
2. Lundgren J. G.
2018Regenerative agriculture: Merging farming and natural resource conservation profitablyPeerJ 2018:e4428https://doi.org/10.7717/PEERJ.4428/SUPP-1 Google Scholar
1. Landis D. A.
2017Designing agricultural landscapes for biodiversity-based ecosystem servicesBasic and Applied Ecology 18:1–12https://doi.org/10.1016/J.BAAE.2016.07.005 Google Scholar
1. Lundgren J. G.
2. Fausti S. W.
2015aTrading biodiversity for pest problemsScience Advances 1https://doi.org/10.1126/SCIADV.1500558 Google Scholar
1. Lundgren J. G.
2. Fausti S. W.
2015bTrading biodiversity for pest problemsScience Advances 1https://doi.org/10.1126/SCIADV.1500558/SUPPL_FILE/1500558_SM.PDF Google Scholar
1. Lundgren J.
2. Shaw J.
3. Zaborski E.
4. Eastman C.
2006The influence of organic transition systems on beneficial ground-dwelling arthropods and predation of insects and weed seedsRenewable Agriculture and Food Systems 21:227–237https://pubag.nal.usda.gov/catalog/49820 Google Scholar
1. Ram A.
2. Jalal S.
3. Kumar M.
2010A density based algorithm for discovering density varied clusters in large spatial databasesInternational Journal of Computer Applications 3:1–4https://doi.org/10.13140/RG.2.1.4420.1448 Google Scholar
1. Rhodes M. W.
2. Bennie J. J.
3. Spalding A.
4. ffrench-Constant R. H.
5. Maclean I. M. D.
2022Recent advances in the remote sensing of insectsBiological Reviews 97:343–360https://doi.org/10.1111/BRV.12802 Google Scholar
1. Rydhmer K.
2. Bick E.
3. Still L.
4. Strand A.
5. Luciano R.
6. Helmreich S.
7. Beck B.
8. Grønne C.
9. Malmros L.
10. Poulsen K.
11. Elbæk F.
12. Brydegaard M.
13. Lemmich J.
14. Nikolajsen T.
2021Automating insect monitoring using unsupervised near-infrared sensorsScientific Reports 12:2603https://arxiv.org/abs/2108.05435v1 Google Scholar
1. Rydhmer K.
2. Prangsma J.
3. Brydegaard M.
4. Smith H. G.
5. Kirkeby C.
6. Kappel Schmidt I.
7. Boelt B.
2022Scheimpflug lidar range profiling of bee activity patterns and spatial distributionsAnimal Biotelemetry 10https://doi.org/10.1186/S40317-022-00285-Z Google Scholar
1. Sánchez-Bayo F.
2. Wyckhuys K. A. G.
2019Worldwide decline of the entomofauna: A review of its driversBiological Conservation 232:8–27https://doi.org/10.1016/J.BIOCON.2019.01.020 Google Scholar
1. Schipper A. M.
2. Hilbers J. P.
3. Meijer J. R.
4. Antão L. H.
5. Benítez-López A.
6. de Jonge M. M. J.
7. Leemans L. H.
8. Scheper E.
9. Alkemade R.
10. Doelman J. C.
11. Mylius S.
12. Stehfest E.
13. van Vuuren D. P.
14. van Zeist W. J.
15. Huijbregts M. A. J.
2020Projecting terrestrial biodiversity intactness with GLOBIO 4Global Change Biology 26:760–771https://doi.org/10.1111/GCB.14848 Google Scholar
1. Shortall R. C.
2. Moore A.
3. Smith E.
4. Hall J. M.
5. Woiwod P. I.
6. Harrington R.
2009Long-term changes in the abundance of flying insectsInsect Conservation and Diversity 2:251–260https://doi.org/10.1111/J.1752-4598.2009.00062.X Google Scholar
1. Silva D. F.
2. De Souza V. M. A.
3. Batista GEAPA K. E.
4. Ellis D. P. W.
2013Applying machine learning and audio analysis techniques to insect recognition in intelligent trapsProceedings—2013 12th International Conference on Machine Learning and Applications, ICMLA 2013. 2013 Google Scholar
1. Stefanescu C.
2. Páramo F.
3. Åkesson S.
4. Alarcón M.
5. Ávila A.
6. Brereton T.
7. Carnicer J.
8. Cassar L. F.
9. Fox R.
10. Heliölä J.
11. Hill J. K.
12. Hirneisen N.
13. Kjellén N.
14. Kühn E.
15. Kuussaari M.
16. Leskinen M.
17. Liechti F.
18. Musche M.
19. Regan E. C.
20. …Chapman J. W.
2013Multi-generational long-distance migration of insects: studying the painted lady butterfly in the Western PalaearcticWiley Online Library 140:474–486https://doi.org/10.1111/j.1600-0587.2012.07738.x Google Scholar
1. Tilman D.
2. May R. M.
3. Lehman C. L.
4. Nowak M. A.
1994Habitat destruction and the extinction debtNature 1994 371:6492 371:65–66https://doi.org/10.1038/371065a0 Google Scholar
1. Wägele J. W.
2. Bodesheim P.
3. Bourlat S. J.
4. Denzler J.
5. Diepenbroek M.
6. Fonseca V.
7. Frommolt K. H.
8. Geiger M. F.
9. Gemeinholzer B.
10. Glöckner F. O.
11. Haucke T.
12. Kirse A.
13. Kölpin A.
14. Kostadinov I.
15. Kühl H. S.
16. Kurth F.
17. Lasseck M.
18. Liedke S.
19. Losch F.
20. Wildermann S.
2022Towards a multisensor station for automated biodiversity monitoringBasic and Applied Ecology 59:105–138https://doi.org/10.1016/J.BAAE.2022.01.003 Google Scholar
1. Wood C. R.
2. Reynolds D. R.
3. Wells P. M.
4. Barlow J. F.
5. Woiwod I. P.
6. Chapman J. W.
2009Flight periodicity and the vertical distribution of high-altitude moth migration over southern BritainBulletin of Entomological Research 99:525–535https://doi.org/10.1017/S0007485308006548 Google Scholar
1. Yang L. H.
2. Gratton C.
2014Insects as drivers of ecosystem processesCurrent Opinion in Insect Science 2:26–32https://doi.org/10.1016/J.COIS.2014.06.004 Google Scholar

Article and author information

Author information

Klas Rydhmer
Department of Geosciences and Natural Resource Management, Copenhagen University, Rolighedsvej 23, Fredriksberg C, 1958, Denmark, FaunaPhotonics, Støberigade 14, Copenhagen, 2450, Denmark
ORCID iD: 0000-0002-5845-6313
James O. Eckberg
Agriculture and Food Solutions, General Mills, Minneapolis, MN 55427, United States
ORCID iD: 0000-0003-1961-9455
Jonathan G. Lundgren
Ecdysis Foundation, 46958 188, St, Estelline, SD 57234, United States
ORCID iD: 0000-0002-9860-3613
Samuel Jansson
FaunaPhotonics, Støberigade 14, Copenhagen, 2450, Denmark
ORCID iD: 0000-0003-4142-6334
Laurence Still
FaunaPhotonics, Støberigade 14, Copenhagen, 2450, Denmark
ORCID iD: 0000-0002-9741-8176
John E. Quinn
Department of Biology, Furman University, 3300 Poinsett Hwy, Greenville, SC 29613, United States
Ralph Washington Jr.
FaunaPhotonics, Støberigade 14, Copenhagen, 2450, Denmark
Jesper Lemmich
FaunaPhotonics, Støberigade 14, Copenhagen, 2450, Denmark
Thomas Nikolajsen
FaunaPhotonics, Støberigade 14, Copenhagen, 2450, Denmark
ORCID iD: 0000-0002-7541-449X
Nikolaj Sheller
FaunaPhotonics, Støberigade 14, Copenhagen, 2450, Denmark
Alex M. Michels
Ecdysis Foundation, 46958 188, St, Estelline, SD 57234, United States
ORCID iD: 0000-0002-3353-4987
Michael M. Bredeson
Ecdysis Foundation, 46958 188, St, Estelline, SD 57234, United States
ORCID iD: 0000-0002-7174-4133
Steven T. Rosenzweig
Agriculture and Food Solutions, General Mills, Minneapolis, MN 55427, United States
Emily N. Bick
FaunaPhotonics, Støberigade 14, Copenhagen, 2450, Denmark, Department of Entomology, University of Wisconsin-Madison, 1630 Linden Dr, Madison, WI 53706, United States, Department of Plant and Environmental Sciences, University of Copenhagen, Thorvaldsensvej 40, 1871 Frederiksberg C, Denmark
ORCID iD: 0000-0002-0014-8342
- Emily N. Bick Email:⠀ebick@wisc.edu

Author Notes

Author Contributions: Klas Rydhmer aggregated the data, performed the analysis, produced figures and tables, and co-wrote the first draft of the manuscript. James O. Eckberg planned and supervised the experiment and co-wrote the introduction, discussion, and conclusion. Emily N. Bick drafted the introduction, discussion, and conclusion, contributed to the methods, and coordinated edits to the manuscript. Samuel Jansson contributed to the data analysis, the results, and the methods sections and co-wrote the draft of the manuscript. Laurence Still assisted in planning, editing, and contributed to the data analysis. Ralph Washington Jr. and Nikolaj Sheller deployed optical sensors. Jesper Lemmich and Thomas Nikolajsen assisted with planning and coordination. Mike Bredeson, Alex M. Michels, and Jonathan Lundgren collected and supervised the analysis of the invertebrate inventory data. Jonathan G. Lundgren and Steven T. Rosenzweig contributed to the planning of the experiment. Steven Rosenzweig coordinated the recruitment of cooperating farmers and farm fields to host the experiment. John E. Quinn assisted with manuscript framing and editing.

Competing Interest Statement: Klas Rydhmer, Samuel Jansson, Laurence Still, Ralph Washington Jr., Nikolaj Sheller, Jesper Lemmich, and Thomas Nikolajsen are or were affiliated with FaunaPhotonics, who developed the sensor used in the study, as employees or stakeholders. Emily N. Bick was funded in part by FaunaPhotonics as part of a Postdoctoral Fellowship granted by the Danish Innovation Fund.

Version history

Preprint posted: August 17, 2023
Sent for peer review: September 6, 2023
Reviewed Preprint version 1: January 25, 2024
Reviewed Preprint version 2: December 3, 2024

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.92227. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

views: 889
downloads: 54
citations: 0

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Significance of findings

Strength of evidence

Abstract

Significance Statement

Introduction

Materials and Methods

Data collection

Data analysis

Results

Discussion

Conclusion

Acknowledgements

Supplementary information

A table describing the crop type and number of insects observed in each field in June and July across all three methods.

Measured insect abundance per crop and monitoring method. Mean and standard deviation.

Co-correlations of all biodiversity metrics.

Model parameters for each fitted metric.

References

Article and author information

Author information

Klas Rydhmer

James O. Eckberg

Jonathan G. Lundgren

Samuel Jansson

Laurence Still

John E. Quinn

Ralph Washington Jr.

Jesper Lemmich

Thomas Nikolajsen

Nikolaj Sheller

Alex M. Michels

Michael M. Bredeson

Steven T. Rosenzweig

Emily N. Bick

Author Notes

Version history

Cite all versions

Copyright

Metrics