Flexible use of memory by food-caching birds

  1. Marissa C Applegate
  2. Dmitriy Aronov  Is a corresponding author
  1. Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, United States

Abstract

Animals use memory-guided and memory-independent strategies to make navigational decisions. Disentangling the contribution of these strategies to navigation is critical for understanding how memory influences behavioral output. To address this issue, we studied spatial behaviors of the chickadee, a food-caching bird. Chickadees hide food in concealed, scattered locations and retrieve their caches later in time. We designed an apparatus that allows birds to cache and retrieve food at many sites while navigating in a laboratory arena. This apparatus enabled automated tracking of behavioral variables – including caches, retrievals, and investigations of different sites. We built probabilistic models to fit these behavioral data using a combination of mnemonic and non-mnemonic factors. We found that chickadees use some navigational strategies that are independent of cache memories, including opportunistic foraging and spatial biases. They combine these strategies with spatially precise memories of which sites contain caches and which sites they have previously checked. A single memory of site contents is used in a context-dependent manner: during caching chickadees avoid sites that contain food, while during retrieval they instead preferentially access occupied sites. Our approach is a powerful way to investigate navigational decisions in a natural behavior, including flexible contributions of memory to these decisions.

Editor's evaluation

The extreme memory capacities of food-caching birds provide untapped opportunities for studying mechanisms of memory formation and retrieval. Here, Applegate and Aronov develop an automated animal and cache-site tracking system in which moments of seed deposits, retrievals, and checks are measured continuously alongside the animal's spatial positions. Probabilistic models reveal idiosyncratic spatial preferences in individual birds and also identify flexible memory usage – in which a single memory of past seed deposition can differentially guide spatial trajectories depending on if the bird is in engaged in retrieving or storing seeds. The rigorous behavioral tracking and modeling sets the stage for dissection of neural mechanisms underlying memory storage and retrieval.

https://doi.org/10.7554/eLife.70600.sa0

eLife digest

Humans form new memories about what is happening in their lives every day. These autobiographical memories depend on a part of the brain called the hippocampus. But how these memories are recorded remains unclear. Studying certain birds may help to provide more insight.

Black-capped chickadees, for example, are memory specialists. They stash thousands of food items and use their memories to recover these hidden food stores. This behavior also relies on these birds’ hippocampus. Studying these animals' behavior in the laboratory may help scientists decode how the birds use their memories and to gain more insight about the brain processes underlying memory.

Now, Applegate and Aronov show that chickadees use memory not only to retrieve food but also to decide where to hide it in the first place. In the experiments, chickadees were placed in a specialized enclosure with a grid of holes covered by silicone rubber flaps on the floor. The birds lifted the flaps with their toes or beak to hide a piece of sunflower seed underneath. Applegate and Aronov recorded and analyzed the animals’ seed hiding and retrieving behavior with a video camera to determine whether the birds were remembering the sites or happening on them by chance.

This revealed that black-capped chickadees use the same memories of where they had hidden food in two different ways. When they were hiding new morsels, the birds remembered where they had stashed food and avoided those flaps. When they were retrieving food, the birds knew exactly which flaps to look under. Future experiments using this special enclosure may help scientists monitor what happens in the chickadees’ brains during these activities.

Introduction

The study of episodic memory in animals has been greatly benefitted by experiments that involve spatial navigation (DeVito and Eichenbaum, 2010; Morris et al., 1982; Salwiczek et al., 2010). Whereas memory use is internal to the animal and not directly observable, navigational decisions that require memory are overt and trackable. In addition, memory-related neural circuits, like the hippocampus, exhibit well-described firing patterns that correlate to navigational variables – including place and head direction (O’Keefe and Dostrovsky, 1971; Taube et al., 1990). However, navigation is a rich behavior that includes both mnemonic and non-mnemonic spatial strategies. For example, an animal using memory to obtain a reward may also engage in opportunistic foraging, use memory-independent search strategies, and exhibit spatial biases (Dally et al., 2006; Kamil and Roitblat, 1985; Krebs et al., 1974; Lamprea et al., 2008; Pravosudov, 2001). It is critical to tease apart these contributions to behavior in order to eventually understand the underlying neural mechanisms.

Food-caching birds offer an opportunity to study memory in the context of navigation. These birds hide food in many distinct, concealed locations throughout their environment and later retrieve their caches. Food caching has been studied in two general types of settings well-suited for characterizing different components of the behavior. In some experiments, spatial navigation is minimized, and birds choose from a small number of available options to obtain food (Brodbeck et al., 1992; Clayton and Dickinson, 1998; Clayton and Krebs, 1994a; Salwiczek et al., 2010). These tasks have been ideal for characterizing various contributions to memory, including location, content, and even the relative time of different caches. In contrast, other experiments have been performed in the wild or in large, naturalistic settings (Cowie et al., 1981; Herz et al., 1994; Krushinskaya, 1966; Sherry, 1984a; Stevens and Krebs, 2008). These studies have been well-suited for measuring spatial properties of the food-caching behavior, including the distribution of caches, large-scale biases, and the spatial specificity of cache memories. However, it is unknown how mnemonic and other spatial strategies are coordinated by the animal within a single behavior.

Addressing this question may be possible in reduced spatial settings, like those typical in rodent experiments (Muller et al., 1987; Poucet et al., 2003; Tolman, 1948). Such a setting must retain key spatial aspects of bird navigation, while also incorporating food-caching behavior and memory use. Ideally, it would also permit detailed tracking of the animal’s behavior and be compatible with neural recordings. We developed an experimental setup to fulfill these requirements for a food-caching species, the black-capped chickadee. We then used this setup to dissect the contributions of mnemonic and non-mnemonic navigational strategies to food caching, using probabilistic modeling of behavioral choices (Brunton et al., 2013; Raposo et al., 2012; Scott et al., 2017).

Results

Design of the behavioral paradigm

Our first goal was to engineer a behavioral setup in which chickadees navigate, cache food, and retrieve caches. We designed individual cache sites as holes in the floor of an arena covered by silicone rubber flaps (Figure 1). Dimensions and materials were chosen to allow chickadees to pull the flap open with little effort – using either their toes or their beak – and to deposit a piece of a sunflower seed underneath. Once released, the flap obstructed any visual access to the contents of the site from above. Flaps ensured that no visually guided, memory-independent strategy could be used by the chickadee to determine site contents. Sixty-four cache sites were arranged in an 8 × 8 grid inside a 61 × 61 cm square arena. In addition, the arena contained a food source in each of the quadrants: a feeder with a motorized cover that could be individually opened or closed. This setup allowed navigation on a similar spatial scale to that typically studied in rodents (Poucet et al., 2003) while also offering birds a large number of concealed cache sites.

Figure 1 with 2 supplements see all
Behavioral paradigm for food caching and retrieval in chickadees.

(A) Rendering of the behavioral setup. For clarity, the front wall that contains doors of the arena is removed. Pink box highlights one cache site.(B) Illustration of a chickadee at a cache site. Top: chickadee prior to caching. Middle: cross-section of the cache site, with a chickadee pulling open the silicone flap to deposit a seed. Bottom: cross-section of the cache site after the seed was cached. (C) Left: video frames from the bottom camera showing all 64 cache sites at four timepoints within a behavioral session. Right: real-time detection of cached seeds. Red area indicates shape enclosed by the detected seed contour. (D) Detection of events at one example cache site in one behavioral session. Top: cartoon of the bird’s interactions with the site at several time points. Middle: video frames from the bottom camera at the corresponding time points. Bottom: Pearson correlation of each video frame with the image of the same cache site when empty. Caches create sustained decreases in the correlation, whereas site checks create transient decreases. (E) Ethogram of all behavioral events in an example session of the Caching task. Colored circles correspond to caches, retrievals, and site checks, as in (D). (F) Same as (E), for the Retrieval task. Colored regions indicate the phase of the trial. Black vertical lines denote trial boundaries.

We next developed methods to automatically track chickadee behavior in the caching arena. Automated behavioral tracking was necessary to allow closed-loop manipulations for some of the experiments described below. To achieve this, we used a transparent material for the bottom layer of the arena and positioned a video camera underneath. We applied a real-time contour detection algorithm (Lopes et al., 2015, see Materials and methods) to determine whether each of the sites was empty or occupied by a seed (Figure 1C). During offline analysis, we also detected instances of the bird opening the cover flap of a site, which we call ‘checks’. To do this, we calculated the Pearson correlation between the image of each site when empty and every video frame of that site in the session. Periods of time when a site was occupied corresponded to sustained periods of low correlation, while checks corresponded to transient decreases in correlation (Figure 1D). Thus, caches, retrievals, and checks could all be detected in the behavior using a single-camera video recording. We also trained a deep neural network (Mathis et al., 2018; Nath et al., 2019) to track the bird’s location for this analysis.

We designed two tasks, each suited for analyzing different aspects of behavior (for details see Materials and methods). In the Caching task (Figure 1E, Figure 1—video 1), chickadees were free to cache without any imposed trial structure for 1 hr. Individual feeders opened and closed to motivate movement through the arena and caching, but the schedule of these openings and closings was unrelated to the bird’s behavior. Our term ‘Caching task’ does not mean that birds only cached in this task – in fact, birds also ate seeds, checked sites, and retrieved their caches. However, because this task placed no limit on the number of caches, it was particularly well-suited for investigating the chickadee’s choice of where to cache each seed. This task was performed by 17 chickadees. In 10 of these, we recorded enough caches (> 64; see Materials and methods) for inclusion in the remainder of the Results section.

In the Retrieval task (Figure 1F, Figure 1—video 2), a trial structure was imposed. At the beginning of each trial, one of the feeders was open (‘feeder-open phase’), allowing chickadees to eat or cache seeds. Once the bird cached 1–3 seeds, lights were turned off for a 2 min delay phase. After the delay, all feeders were closed (‘feeder-closed phase’), and the only sources of food in the arena were previously cached seeds. In the feeder-closed phase, birds were generally motivated to find and retrieve their caches. This task was therefore suitable for investigating which sites the chickadee choses to check during cache retrieval. This task was performed by seven chickadees, all of which are included in the analyses.

Chickadees have unique and stable spatial biases

We first asked whether chickadees cached into some sites of the arena more than into others. In the Caching task, we calculated the cache distribution (pbias) by dividing the number of caches into each of the 64 sites across sessions by the total number of caches (Figure 2A). Birds used most of the arena for caching (between 41–64 sites for each of the 10 birds), but in many cases cache distributions appeared non-uniform. To quantify this, we measured the entropy of pbias for each bird and compared its value to simulations in which the same number of caches were sampled from a uniform distribution (Figure 2B). In all birds, entropy was lower than expected by chance (p < 0.001). Therefore, birds exhibited spatial biases, even though they cached widely throughout the arena.

Figure 2 with 2 supplements see all
Chickadees exhibit idiosyncratic and stable biases in the locations of caches.

(A) Probability distributions of cache locations across all sessions for four example birds. Distributions are denoted by pbias in the text. (B) Red lines: entropy values of the spatial distributions shown in (A). Gray histograms: entropy values for simulated caches drawn from a uniform distribution. Number of simulated caches was the same as in the observed data. (C) Principal component analysis of pbias values. Top: coefficients of the first two principal components for all birds. Red circles and numbers indicate birds shown in (A) and (B). Bottom: the first principal component. Birds cluster into center-preferring and edge-preferring groups. (D) Pearson correlation of pbias between subsets of all sessions paired within bird, between different birds, and between birds selected from the same PC1 cluster shown in (C). (E) Pearson correlation of pbias between pairs of trials at different trial lags. Values are medians across birds. Error bars: sem. Dashed line: linear regression. Gray shade: 95% confidence interval of linear regression fit to bootstrapped birds.

There are several possible explanations for the observed biases. One possibility is that biases are broadly shared across individuals of the species – akin to the preference for locations near walls in rodents (Lamprea et al., 2008). Another possibility is that biases are shared across some subgroups of individuals, but not all members of the species. Finally, biases could be unique to individual birds. To distinguish between these possibilities, we first asked whether there were common types of cache distributions across birds. We calculated principal components of all pbias values (Figure 2C). Remarkably, birds formed two clusters that were well-separated along the first principal component (Figure 2—figure supplement 1A). The first principal component resembled a bump in the center of the arena (Figure 2C), and accounted for 59% of the variance across individuals (Figure 2—figure supplement 1B). Thus, there were two groups of birds: those that preferred to cache in the center of the arena, and those that preferred the edges.

We next asked whether membership in one of these two groups was sufficient to explain the observed spatial biases. For each bird, we computed cache distributions separately for two randomly chosen, equal-sized subsets containing the same number of behavioral sessions. We then measured the Pearson correlation between the two distributions from the same bird (‘within-bird correlation’) and between distributions from different, randomly paired birds (‘across-bird correlation’). Within-bird correlation values were significantly higher than across-bird correlation values (p < 0.001, Figure 2D), even when birds were paired exclusively with birds from the same group (center-preferring or edge-preferring). Therefore, birds exhibited individual-specific biases that could not be entirely explained by common biases across a group of birds. An example of this is evident in Figure 2A: although birds 3 and 4 were both center-preferring birds, bird 3 had some additional bias for centrally-positioned rows and columns of the arena.

Finally, we asked whether spatial biases in individual birds were stable or drifted over time. We measured the Pearson correlation between cache distributions computed on pairs of sessions separated by different lags (Figure 2E). The slope of the relationship between lag and correlation was not significantly different from zero (p = 0.49, Bootstrap test), indicating that there was no detectable drift in bias.

We repeated all of the above analyses for the distribution of site checks in the Retrieval task. In this case, pbias was computed by dividing the number of times a particular site was checked by the total number of all site checks. All the results were similar to the ones obtained for caches in the Caching task (Figure 2—figure supplement 2). Collectively, our results suggest that chickadees exhibit idiosyncratic but stable biases, both in the sites that they check and those that they choose for caching. In both cases, the pbias distribution therefore serves as a valid baseline for subsequent models, described below.

Probabilistic models of chickadee behaviors

How do chickadees choose where to cache? One model consistent with our analysis so far is that birds choose each cache location by drawing it randomly from the distribution pbias. However, this is not the only possibility because pbias is computed by pooling across all time points. On a moment-by-moment basis, the likelihood of caching into a particular site may differ from the value given by pbias. For example, the choice of cache site could depend on the proximity of a site to the bird: chickadees may tend to cache close to the location they last interacted with or, conversely, they may avoid the vicinity of that location. The choice of a cache site may also depend on which sites are already occupied by seeds, since birds may tend to either cluster or to spread out their caches. The bird’s knowledge of which sites are occupied might be updated either by the act of caching, or by the act of checking the site to view its contents.

To model these possibilities, we considered three types of ‘special’ places in the arena at each moment in the Caching task: (1) the last site or feeder that the bird interacted with (‘previous site’), which includes checking a site or retrieving a seed, (2) sites that are currently occupied by cached seeds (‘occupied sites’), and (3) sites that have been previously checked by the bird and are currently empty (‘checked-empty sites’) (Figure 3A). We then constructed three scaling factors whose values changed with proximity to each of these special places (Figure 3B, defined in Materials and methods). The likelihood of caching into a particular site was computed by multiplying all three scaling factors by the corresponding value of pbias and normalizing the product to a sum of 1 across the arena. In other words, our model took each bird’s individual biases into account and then asked whether caching probabilities were additionally influenced, on a moment-by-moment basis, by special places in the arena.

Figure 3 with 3 supplements see all
Schematic of the probabilistic behavioral models.

Models here compute the probability of caching into a particular site in the Caching task. Equivalent models are used for the probability of checking a particular site in the Retrieval task. (A) Schematic of the arena at an example time point in the Caching task depicting ‘special locations’ in the arena: previous site interaction, occupied sites, and checked-empty sites. (B) Examples of scaling factors that change with proximity to each of the special locations shown in (A). Probability distribution of caching into different sites is computed by multiplying all scaling factors by pbias. (C) Example models. For each model, some parameters are fixed at 0, while others are free and fit using maximum-likelihood estimation. For free parameters, examples shown here use the same values as in (B). All four models are plotted on the same color scale.

Each scaling factor was a Gaussian function across space, defined by two parameters. The first parameter was the amplitude of the Gaussian (γprv, γocc, or γemp for the previous, occupied, and checked-empty sites, respectively), and indicated the strength and direction of the corresponding effect on probability, measured on a logarithmic base-10 scale. For example, γocc=1 meant that the act of caching into a site decreased the likelihood of caching there again by a factor of 10. The second parameter was the width of the Gaussian (σprv , σocc , or σemp, respectively) and indicated the spatial extent of each effect. In other words, a large value of σocc meant that the act of caching into a site affected the likelihood of subsequently caching into neighboring sites as well. This model was first applied to the data obtained in the Caching task. For each bird, we optimized the values of the model parameters in order to maximize the likelihood of the experimentally observed caches (Brunton et al., 2013; Raposo et al., 2012; Scott et al., 2017).

We also asked how chickadees chose which sites to check in the Retrieval task. Much like the choice of cache location, the choice of check location may be affected by proximity to the previous site, to occupied sites, and to checked-empty sites. We therefore used the same model as above, but fit to locations of checks instead of caches. At first, we fit the model only to the checks that the bird made up until and including finding a cache in the Retrieval task. We did this because after finding the first cache, birds often recached food, which confounded the analysis by mixing caching and retrieval behaviors in the same time period. We will analyze recaching separately later.

Our approach was to start with a baseline model (Model #0), in which the likelihood of choosing a particular site for caching or checking was given by pbias (Figure 3C). In this model, there were no free parameters, and the values of γprv, γocc, γemp, σprv, σocc, and σemp were all set to 0. We then gradually introduced free parameters to the model, one or two at a time. With each parameter introduction, we tested whether model performance was improved by measuring the Akaike Information Criterion (AIC), which quantifies model likelihood accounting for the number of free parameters (Dziak et al., 2020; Lubke et al., 2017).

Our model makes several assumptions. First, it uses a Gaussian function for each of the scaling factors. Second, it assumes a uniform spatial extent of this Gaussian across the arena. Finally, by multiplying scaling factors together it assumes that they have independent effects on probability. In separate models, we tested these assumptions and showed that chickadee behavior can be explained even better by including nonlinear interactions between scaling factors (Figure 3—figure supplement 1) by using functions that have more complex shapes than a Gaussian (Figure 3—figure supplement 2), and by allowing different spatial extents in different parts of the arena (Figure 3—figure supplement 3). However, these improvements were relatively modest and did not affect the main results described below.

Chickadee behaviors exhibit a strong proximity effect

We first asked how the location of the previous site affected the chickadee’s behavior. For this analysis we constructed Model #1, in which γprv and σprv were introduced as free parameters (Figure 4). We found that this model performed significantly better than Model #0 (see Table 1 for statistics of all model comparisons). The best-fit value of γprv was positive in all birds (0.94 ± 0.21), and the best-fit value of σprv was 15.5 ± 0.9 cm (see Table 2 for statistics of all model parameters). Thus, there was roughly a 10-fold (i.e., 100.94) increase in probability of caching next to the previous site, and the spatial extent of this effect was about 1/4 of the arena. To verify this result independently of the model, we compared the observed and expected distances of caches to previous sites and indeed found that birds cached closer to previous sites than expected by chance given each bird’s spatial bias (Figure 4—figure supplement 1).

Table 1
Summary of model performance.

Statistical analysis for all 12 models (0 through 11) presented in the Results section. Each model is compared to one of the previous models using bootstrap analysis of AIC values (see Materials and methods). p-values < 0.05 are emphasized in bold. Model #0 does not have corresponding p-values because it was the baseline model that was not compared to anything. Some models were only fit to data from the Caching or Retrieval task; in these cases, a hypen indicates that the corresponding fit was not performed.

Model #Free parametersCompared toModel #Model improvement?
Caching task p-valueRetrieval task p-value
0none- - -
1γprv;σprv0< 0.001< 0.001
2γprv;σprv;γocc1< 0.001< 0.001
3γprv;σprv;γocc;γemp2< 0.0010.015
4γprv;σprv;γocc;γemp;σocc30.480.35
5γprv;σprv;γocc;γemp;σemp30.930.42
6γprv;σprv;γoccc1 -< 0.001
7γprv;σprv;γoccc;γoccr6 -< 0.001
8γprv;σprv;γocc;γemp;τocc30.48 -
9γprv;σprv;γocc;γemp;τemp30.24 -
10γprv;σprv;γocc;γemp;νocc30.97 -
11γprv;σprv;γocc;γemp;νemp30.29 -
Figure 4 with 1 supplement see all
Effect of proximity to the previous site in Caching and Retrieval tasks.

(A) Comparison of Model #1 to Model #0 applied to the Caching task. Here and in subsequent figures, ΔAIC indicates the difference in AIC between the model being considered (in this case Model #1, labeled on the x-axis) and the baseline model (in this case Model #0, labeled in the white box). Compared to Model #0, Model #1 uses two additional free parameters (γprv and σprv), specified in the colored box. Horizontal black line: ΔAIC value for caches pooled from all birds. Shaded area: distribution of 1000 ΔAIC values on data bootstrapped across birds. Values less than 0 indicate model improvement. Asterisk indicates statistically significant improvement. (B) Best-fit values of model parameters prv and σprv for Model #1 applied to the Caching task. Symbols indicate values for individual birds. Black line: median values across birds. (C, D) Same as (A, B), but for models applied to the Retrieval task.

Table 2
Contributions of different factors to behavior.

Statistical analysis of all γ parameters mentioned in the Results section, which indicate the contribution of different factors to behavior. Best fit parameter values are indicated as median ± standard error of the median across all birds (N = 10 in the Caching task and N = 7 in the Retrieval task). Significance across birds is calculated using a one-sided Wilcoxon rank sum test to determine whether values across the population are significantly different from 0. The fraction of birds that are individually significant is calculated by bootstrapping the behavioral sessions of each bird. A p-value of 0.05 is used to assign significance in this case. See Materials and methods for details.

ParameterBest-fit parameter valueSignificance across birdsFraction of individually significant birds
Caching taskRetrieval taskCaching task p-valueRetrieval task p-valueCaching taskRetrieval task
γprv0.94 ± 0.211.83 ± 0.42< 0.001< 0.0018/107/7
γocc–0.32 ± 0.170.26 ± 0.09< 0.0010.0088/107/7
γemp0.13 ± 0.05–0.14 ± 0.070.0050.0086/105/7
γocc,first check–0.28 ± 0.170.18 ± 0.100.0050.0088/102/7
γemp,first check0.12 ± 0.05-0.042-6/10-
γoccc-0.42 ± 0.05-0.008-7/7
γoccr-0.26 ± 0.07-0.008-6/7

We applied the same model to site checks in the Retrieval task. Again, Model #1 performed better than Model #0. The value of γprv was positive in all birds (1.83 ± 0.42), indicating a tendency to check sites close to the previous sites, and σprv was 12.9 ± 1.7 cm. This result is consistent with our qualitative observations of chickadees in the Retrieval task: as birds moved through the arena – sometimes in the direction of a hidden cache – they often checked sites on their path. In fact, the value of γprv was significantly higher in the Retrieval task (p < 0.05, unpaired t-test), but the best-fit σprv was not significantly different between the two tasks (p = 0.54, unpaired t-test). These results show that birds are influenced by proximity during retrieval more than during caching, even though the spatial extent of this effect is similar during the two behaviors.

Chickadee behaviors are affected by site content

We next asked whether chickadee behaviors were additionally influenced by the contents of individual sites (Figure 5A–B). In the Caching task, we fit Model #2, which included γocc as a third free parameter to model the effect of occupied sites. Model #2 performed even better than Model #1, indicating that occupied sites indeed affected the behavior. We then fit Model #3, in which γemp was additionally introduced as a fourth free parameter to model the effect of checked-empty sites. Model #3 performed better yet, indicating that checked-empty sites also affected the behavior. Across birds, the value of γocc was negative (–0.32 ± 0.17), whereas the value of γemp was positive (0.13 ± 0.05). These values indicate roughly a twofold decrease in probability of caching into occupied sites and a 1.3-fold increase in probability of caching into checked-empty sites. Both results are consistent with chickadees spreading their caches throughout the arena, rather than pooling them into the same sites. Indeed, chickadees tended to cache into occupied sites less often than expected by chance (Figure 5—figure supplement 1A).

Figure 5 with 1 supplement see all
Site-specific, opposing effects of site content on behavior in Caching and Retrieving tasks.

(A) Performance of Models #2 and #3 applied to the Caching task, plotted as in Figure 4A. These models introduce free parameters to quantify the effect of occupied and checked-empty sites on behavior. (B) Best-fit values of γocc and γemp applied to the Caching task, plotted as in Figure 4B. (C, D) Same as (A, B) but for models applied to the Retrieval task. (E) Performance of Models #4 and #5 applied to the Caching task, plotted as in Figure 4A. These models introduce free parameters to quantify the spatial extent of the effect of occupied and checked-empty sites on behavior. (F) Best-fit values of σocc and σemp applied to individual birds in the Caching task, plotted as in Figure 4B. (G, H) Same as (E, F), but for models applied to the Retrieval task.

We again applied the same models to site checks in the Retrieval task (Figure 5C–D). As in the Caching task, introducing γocc and γemp significantly improved the fit of the model. However, the signs of the two parameters were reversed compared to the Caching task: γocc was positive in all birds (0.26 ± 0.09), whereas γemp was negative in all birds (–0.14 ± 0.07). These results show that birds were attracted to occupied sites during the feeder-closed phase of the task. This is consistent with chickadees searching for their caches. Indeed, chickadees found their caches after fewer checks than expected by chance: 8.5 ± 5.6 checks in the observed data, compared to 20.8 ± 4.5 checks in data where trajectories were shuffled between trials (Figure 5—figure supplement 1B).

Our analyses so far show that chickadees can be attracted to or avoid specific sites depending on the content of these sites. We asked whether these behavioral effects were site-specific, or whether neighboring locations were affected as well. We introduced either σocc or σemp as an additional free parameter (Model #4 and Model #5 respectively, Figure 5E–H). Neither model was an improvement over Model #3. In fact, the best-fit values of σocc and σemp in both models were between 0.01 and 0.02 cm, much smaller than the distance between neighboring sites (7.1 cm). Thus, the effects of site contents on chickadee behavior were site-specific, and were small even at neighboring sites of the arena. We verified this result independently of the model by measuring the fraction of caches and checks at various distances away from occupied sites (Figure 5—figure supplement 1C–D). This site specificity is consistent with that demonstrated under different conditions by previous work (Cowie et al., 1981).

Chickadees have memories of site contents

We have shown that chickadees avoid caching into sites that are occupied, but increase their probability of caching into checked-empty sites. An intriguing explanation of these results is that chickadees use memory of site contents to influence their caching behavior. However, after obtaining a seed, a chickadee may simply check multiple sites in the arena and preferentially cache into an empty one that it encounters. This would be a visually-guided strategy that does not require memory. We considered this possibility unlikely because chickadees usually cached into the first site that they opened after visiting the food source (79% of caches). However, we wanted to ensure that the remaining 21% of caches were not responsible for the observed effects. We implemented the same models as before (Models #2 and #3), but instead of fitting them to the observed caches, we fit them to the locations of the first sites opened by the bird after obtaining a seed (‘first check’). We found that γocc in Model #2 was still significantly negative, and γemp in Model #3 was still significantly positive (Figure 6A–B). Therefore, chickadees did not merely check sites until finding an empty one. Rather, they actually used memories of which sites were occupied and which were empty.

Figure 6 with 2 supplements see all
Results are best explained by long-lasting memories.

(A, B) Values of parameters across birds, compared between models applied to caches or first checks in the Caching task. Values of γocc are from Model #2, and values of γemp are from Model #3. (C) Values of γocc across birds, compared between Model #2 applied to checks or the same model applied to first checks in the Retrieval task. (D) Performance of Models #6 and #7 applied to the Retrieval task, plotted as in Figure 4A. These models introduce free parameters to separately quantify the effects of caches and recaches on the site checking behavior. (E) Best-fit values of γoccc and γoccr applied to individual birds in the Retrieval task, plotted as in Figure 4B. (F) Performance of Models #8 and #9 applied to the Caching task, plotted as in Figure 4A. These models introduce free parameters τocc and τemp to quantify the decay of γocc and γemp over time since the site was last checked by the bird. (G) Best-fit values of τocc and τemp applied to individual birds in the Caching task, plotted as in Figure 4B.

We also fit Model #2 to first checks in the Retrieval task. In this case, γocc was still positive. In other words, chickadees had an increased probability of checking an occupied site even on the first attempt. This implies that chickadees did not search the arena until finding a cache, but used memories of occupied sites. Note that γemp was not computed for this analysis because, by definition, there could not be any checked-empty sites before the first check.

Finally, we wanted to make sure that chickadees did not use olfactory cues to detect seeds in cache sites. Chickadees are not known to use olfaction for finding food (Cowie et al., 1981; Herz et al., 1994; Sherry, 1984a), and previous studies have excluded this possibility by removing caches during the delay period of the task. We repeated this experiment in our arena. Consistent with published results, even when caches were removed from the arena, chickadees tended to spend more time at the locations of the missing caches after a delay period of up to 1 hr (Figure 6—figure supplement 1). Collectively, our results show that chickadees have memories of site contents, as in previous work (Clayton and Dickinson, 1999; Sherry, 1984a). They use these memory of site contents in flexible ways, both during caching and during retrieval.

Memories of site contents last the duration of the session

We next asked for how long memories lasted in our arena. In the feeder-closed phase of the Retrieval task, chickadees often ‘recached’ seeds after retrieving them (44% of all retrievals). Therefore, there were two types of caches in this task: recent recaches and older caches made prior to the delay period. We first analyzed how these different types of caches affected behavior. For this analysis, we introduced models with separate parameters γoccc and γoccr instead of γocc to fit the contribution of older caches and recaches, respectively. These models were fit to the entire feeder-closed phase of the Retrieval task, which contained both the retrieval and the recaching behaviors. Model #6 introduced γoccc as a free parameter, while Model #7 introduced both γoccc and γoccr. We found that Model #6 fit the data significantly better than Model #1, and Model #7 fit the data even better than Model #6 (Figure 6D). The best-fit values were significantly positive for both parameters, but higher for γoccr than for γoccc (0.42 ± 0.05 vs 0.26 ± 0.07, respectively, p < 0.01 for the comparison). These results indicate that birds were attracted to all of their caches during retrieval, but had a stronger preference for the recent recaches.

One possible explanation of this result is that birds had a memory decay that reduced the effect of older caches on behavior. However, birds often recache a single seed multiple times, and may simply have a preference for revisiting recache locations. We therefore performed a different analysis that modeled a continuous memory decay. The Caching task was ideally suited for this analysis because it did not have a trial structure; caches could be retrieved at any later time in the session. We defined a ‘memory decay factor’ for each site, which was reset to one whenever that site was checked and decayed exponentially to 0 afterwards. In Model #8, this factor was multiplied by γocc and decayed with a timescale τocc. In Model #9, this factor was multiplied by γemp and decayed with a timescale τemp. Both models were compared to Model #3, in which neither γocc nor γemp decayed. We found that neither model improved the fit to the data (Figure 6F). The best-fit values of τocc and τemp were very large (both >44 min, Figure 6G). Therefore, there did not appear to be a significant memory decay at the timescale of our experiments. This additionally implies that the preference for retrieving recaches may be a behavioral preference unrelated to a memory decay.

Chickadees often recheck sites, with a median of 2.4 min between consecutive checks of the same site. Our results suggest that these rechecks are not necessary for maintaining memory, since the memory decay in the above analysis is reset at every site check. To verify this independently of the model, we measured the fraction of caches that were made into empty sites. In this analysis, we only included sites that had not been recently checked by the bird. We found that chickadees prefer to cache into empty sites even after not checking sites for a long period of time (p < 0.001 after 30 min; Figure 6—figure supplement 2). Our results do not indicate whether memory remains stable over periods much longer than the behavioral session (1 hr), and whether rechecks become necessary at those timescales. However, prior work has showed some memories lasting up to a month, even when birds are not allowed to recheck their sites (Roth et al., 2012). Additionally even 1 hr delays are ethologically important for chickadees, since they typically retrieve their caches over the course of a single day in the wild (Cowie et al., 1981).

Memories of site contents are of high-capacity and accessed in an arbitrary order

Food-caching birds are famous for storing and likely remembering large quantities of food items in the wild (Pravosudov, 1985). So far, it is unclear if chickadees exhibit a high memory capacity in our arena. For example, the statistical effects we observed can be explained by chickadees remembering a small number of caches and having no memory of other caches. To test this possibility, we allowed γocc and γemp to decay exponentially with the total number of occupied sites and checked-empty sites in the arena. The decay constants of these exponentials were two additional parameters, νocc and νemp, respectively. As before, we fit two models (Model #10 and Model #11), each introducing one of these two parameters. Neither model fit the data better than Model #3, in which γocc and γemp did not decay (Figure 7A). The best-fit values of νocc and νemp were >15 sites in both cases (Figure 7B) – comparable to the maximum number caches present in the arena at any single moment in the session (15.0 ± 1.0, N = 101 sessions). Therefore, memory did not appear to decay at capacities tested by our experiments.

Memories are high capacity, and accessed in arbitrary order.

(A) Performance of Models #10 and #11 applied to the Caching task, plotted as in Figure 4A. These models introduce free parameters vocc and vemp to quantify the decay of γocc and γemp as either the number of occupied sites or the number of checked-empty sites in the arena increases, respectively. (B) Best-fit values of vocc and vemp applied to individual birds in the Caching task, plotted as in Figure 4B. (C) Fraction of retrievals as a function of order in which seeds were caches. Order is aligned to the first cache. Red marker: mean observed fraction across birds. Black line: average value expected from retrieving seeds in a random order. Grey: 95% confidence interval of retrievals made in a random order. (D) Same as (C), but aligned to the most recent cache. Arrow indicates the only point that was significantly outside the 95% confidence interval of shuffled data. (E) Same as (D), but with transient caches eliminated. Transient caches are those that are retrieved without leaving the perch.

All of our models so far have used spatial location to explain the choice of sites for caching and checking. We also asked whether the order in which seeds were cached was predictive of the order in which birds chose to retrieve them. For this analysis, we used the Caching task, in which birds often cached and retrieved large numbers of seeds. Across all retrievals, we asked how often the nth cached seed was chosen by the bird (Figure 7C). There was no difference between observed data and shuffled data, in which seeds were chosen randomly from the available caches. Thus, there was no detectable ‘primacy’ effect in the data (p = 0.96). We then asked how often the nth most recent seed was chosen. In this case, the most recently cached seed was retrieved more often than expected by chance (p < 0.001, Figure 7D). However, this ‘recency’ effect was largely caused by caches that the bird retrieved without leaving the site of the cache, which accounted for 14% of all retrievals (Figure 7E). We often observed chickadees making several of these ‘transient caches’ and retrievals with a single seed, before finally settling on a cache location. Once transient caches were excluded from the data, there was no detectable primacy or recency effect in the order of retrieval (p = 0.44 and p = 0.24, respectively).

Discussion

We designed an apparatus to explore how chickadees make navigational decisions during food caching and retrieval. We built quantitative models to understand the relative contributions of mnemonic and non-mnemonic strategies to these behaviors. Our setup allowed birds to move within an arena similar to those used for rodent experiments (Lever et al., 2002; Muller et al., 1987; Wilson and McNaughton, 1993). We found that several natural features of chickadee spatial navigation were preserved in this environment. Our apparatus also allowed birds to cache and retrieve food from a relatively large number of concealed sites within the arena. Tracking behavior at these sites revealed that chickadees rely on multiple strategies for caching and retrieval and use memory in flexible ways during these behaviors.

We found two spatial strategies that did not require birds to remember the contents of individual sites: the use of spatial biases and a proximity effect. First, chickadees had spatial biases that were stable over time and unique to individual birds. Previous work has shown spatial biases in the wild. For example, individual birds can cache in different zones within a shared territory or on different parts of a tree (Cowie et al., 1981; Dally et al., 2006; Lahti et al., 1998). This strategy has a clear ethological advantage: biases reduce the memory load of cache locations, whereas choosing biases idiosyncratically minimizes unwanted overlap between individuals. The second contribution to behavior was the effect of proximity: chickadees tended to cache and check sites close to their previous site interaction. These effects have also been observed in the wild and have ethological advantages. Caching close to the food source minimizes travel distance and exposure to dangers (Clarkson et al., 1986; Scott et al., 2017; Sherry et al., 2010; Stapanian and Smith, 1978). Checking locations in close proximity is an efficient opportunistic method of exploring the environment (Krebs et al., 1974; van der Vaart et al., 2011). Although the patterns we find are on a dramatically smaller scale, it is intriguing to speculate that biases and proximity effects are driven by shared mechanisms across conditions.

We do not know whether these effects are truly non-mnemonic on timescales longer than our behavioral sessions. For example, a bird’s experience over long periods of time may influence its biases or its typical trajectories within an environment (Hampton and Sherry, 1994; Raby et al., 2007). Regardless, explaining away biases and proximity effects was critical for isolating the remaining memory-guided components in our experiments.

Our analysis has revealed several features of chickadee memory. Memories were spatially specific, lasted the duration of the hour-long session, had high capacity, and could be retrieved in an order different from their storage order. These features have been studied before – usually in experiments specifically designed to look at each feature of memory in isolation. For example, radioactive tagging of food in the wild has shown that birds search for caches with centimeter precision (Cowie et al., 1981). In both field and laboratory experiments, birds continue to remember locations on both hour-long timescales (Clayton and Krebs, 1994b; Cowie et al., 1981; Sherry and Vaccarino, 1989) and at least a month later (Ekman et al., 1996; Roth et al., 2012). Chickadees can cache thousands of seeds per day and are suspected to remember a large fraction of them (Brodin, 2010; Pravosudov, 1985). Finally, previous studies have also not detected a relationship between the order of caching and the order of retrieval, suggesting that chickadees have independent memories of caches rather than a single memory of the action sequence (Balda and Kamil, 1989; Sherry, 1984a; Sherry, 1984b). Our models capture these features on smaller spatial and temporal scales, and notably combine them within a single laboratory paradigm.

Classic work has shown that food-caching birds use memory to retrieve food (Krushinskaya, 1966; Sherry, 1984a; Clayton and Dickinson, 1998). They can even remember the contents of individual sites – for example, to choose caches that contain a certain type of food (Clayton and Dickinson, 1998; Sherry, 1984a). We also observed memories of site contents in our arena. Chickadees were attracted to occupied sites during retrieval, but avoided empty sites that they had previously checked. These results show that memories of site contents in chickadees not only affect a single behavior, but even influence the choice between different behaviors – in our case, site attraction or avoidance.

Site content influenced not only retrieval, but also caching itself. When caching, birds avoided occupied sites, but preferred sites that they had checked and found to be empty. These behaviors resulted in a ‘spreading’ of caches throughout an environment. Cache spreading is observed on a larger scale in the wild and is thought to be a defensive strategy against pilferers (Alpern et al., 2012; Cowie et al., 1981). However, it was not previously known that memory plays a role in this behavior. The implication of our results is that a single memory of a cached seed can be used by the bird for entirely different behaviors depending on context – in our case, caching or retrieval.

It has been shown that food-caching birds use memory in flexible ways. For example, they choose caches with certain types of food instead of others depending on selective satiety (Clayton and Dickinson, 1999) or knowledge about food perishability (Clayton and Dickinson, 1998; Feeney et al., 2011). Our results further show that memories might even be used for actions that are unrelated to immediate feeding needs – in our case to spread caches throughout an environment. The ability of the same memory to flexibly drive different behaviors that have different goals is a hallmark of episodic memories in humans (Allen and Fortin, 2013; Clayton et al., 2003; Tolman, 1948; Tulving, 1972). Our findings lends support to the idea that memories of food-caching birds are similarly general-purpose in nature.

Materials and methods

Resource availability

Lead contact

Request a detailed protocol

Further information and requests for resources and reagents should be directed to and will be fulfilled by the Lead Contact, Dmitriy Aronov (da2006@columbia.edu).

Experimental model and subject details

Request a detailed protocol

All animal procedures were approved by the Columbia University Institutional Animal Care and Use Committee and carried out in accordance with the US National Institutes of Health guidelines. The subjects were 21 black-capped chickadees (Poecile atricapillus) collected from multiple sites in New York State using Federal and State scientific collection licenses. Subjects were at least 4 months old at the time of the experiment, but age was not determined more precisely. Chickadees are not visibly sexually dimorphic, and all experiments were performed blindly to sex. Sex was only determined on birds that were subsequently used for other procedures in the lab. For the Caching task, 17 chickadees were used (seven male, 3 female, 7 unknown). Of these birds, 7 made fewer than 64 caches across all sessions and were excluded from all analyses except those in Figure 5—figure supplement 1A,C (see reasoning in Caching task below). Of the remaining birds, six were subsequently used in the Retrieval task and four were used in experiments not described in this paper. One additional bird (sex unknown) was used in both the Caching and the Retrieval task, but the data for the Caching task was not included due to technical problems.

For the Retrieval task, 7 chickadees were used, 6 of which had been previously used in the Caching task (three male, 1 female, 3 unknown). The olfactory control experiment (Figure 6—figure supplement 1) was performed on three birds, none of which were used in other tasks (sex unknown). These birds could not be used for any further experiments because they experienced their caches being removed from the experimental arena, which has been shown in prior literature to affect subsequent caching (Hampton and Sherry, 1994). Prior to experiments, all birds were housed in groups of 1–4 on a ‘winter’ light cycle (9 h:15 hr light:dark).

Behavioral apparatus

Design of caching apparatus

Request a detailed protocol

All custom parts of the behavioral arena were designed in Autodesk Inventor. Part files are available upon request. Two identical enclosed square arenas were constructed and used for all experiments. Each arena was a 61 cm x 61 cm square and contained 64 perches, 64 ‘cache sites’, and 4 ‘feeder sites’.

The arena was constructed from five laser-cut layers. The top surface layer was a white polystyrene sheet (1.6 mm thick, McMaster-Carr, Atlanta, GA, USA, 8734K32). The second layer was a sheet of silicone rubber (0.8 mm thick, 60 A durometer, McMaster-Carr 1460N41). The third and fourth layers were black cast acrylic sheets (3.2 mm and 4.8 mm thick, McMaster-Carr, 8505K742; 8505K748, respectively). Finally, the fifth (bottom-most) layer of the arena was a transparent cast acrylic sheet (3.2 mm thick, McMaster-Carr 8560K257).

All five layers contained cutouts for the perches. These included space directly in front of and behind the perch, allowing birds to wrap their toes around the perch. The fourth layer included small ledges on which the ends of the perch rested. Perches were 9.5 mm diameter, 50.8 mm length wooden dowels (McMaster-Carr, 97195A434).

Cache sites were created by making square cutouts in the third and fourth layer. These cutouts formed a cavity 6.4 mm wide x 4.8 mm long x 7.9 mm deep. A 3D-printed ramp was inserted into this cavity and helped direct seeds into the bottom of the site (Clear resin, Formlabs, Somerville, MA, USA, RS-F2-GPCL-04). The bottom (fifth) layer contained no cutouts at the cache sites and therefore provided a floor for seeds to rest on. The second (silicone rubber) layer was cut on three sides above each cache site, creating a soft flap that could be pulled open. The top layer had a cutout above each site, providing access to the silicone flap. Finally, a white 3D-printed cap was attached to the top level next to each site (White Resin, Formlabs RS-F2-GPWH-04). This cap created a barrier that blocked visual access of site contents from anywhere in the arena.

The top four layers were screwed together. The bottom layer was attached using small magnets (K&J Magnetics, Pipersville, PA, USA, D22), and could be temporarily removed in order to clean out all cache sites between sessions.

Each feeder consisted of a 3D-printed basket (White Resin, Formlabs RS-F2-GPWH-04) attached to a stepper motor (Mouser, Mansfield, TX, USA, 108990003) that could open or close the feeder. The motor was controlled by an Arduino (Mouser, 713–102990189) via a stepper motor driver (Mouser, 474-ROB-12779). A ledge surrounding each feeder was 3D printed from colored PLA using a Taz 6 3D printer (Lulzbot, Fargo, ND, USA). Each feeder’s ledge was a different color (red, blue, green, and yellow). Lights in the arena were turned on and off using a solid-state relay (Mouser, 558-D1D12), also controlled by the Arduino.

Each arena was placed inside a black plastic box with doors on one side. Walls of the box were positioned 2.5 cm from the edge of an arena, creating a ‘moat’ that almost completely surrounded the arena (except the corners). The box was 51 high. Bright shapes ~ 15 cm across (blue star, orange triangle, purple circle, and green tree) were positioned on the center of each wall. The arena was illuminated both from above and from below, using LED lights (Super Bright LEDs, St. Louis, MO, USA, VTL -x1515; NFLS-WW300X2-LC2). Natural sound of rushing water was played in the background to mask inadvertent room noises.

Birds were monitored using three cameras. A camera in the center of the ceiling was used to track bird’s position (Amazon, Seattle, WA, USA, 180 degree Fisheye lens 1,080 p Wide Angle Pc Web USB camera, 30 fps). A camera positioned 90 cm below the arena (bottom camera) monitored the contents of cache sites and the bird’s site checks (Edmund Optics, Barrington, NJ, USA, acA2500-60uc; #63–245, 1” python 5000 CMOS sensor, 57 fps). A third camera mounted in the center of one wall 24 cm above the floor of the arena (side camera) was used for manual verification of bird’s behavior (Edmund Optics, EO-2323; #62–274, 2/3” Progressive Scan CMOS sensor, 50 fps).

Actuation of the behavioral arena

Request a detailed protocol

For the Caching task, where no closed-loop manipulations were needed, all actuation (control of lights and motors) was performed by standalone Arduino code. For the Retrieval task, all task-relevant information (contents of cache sites and the bird’s location) were monitored in real time by software written in Bonsai (Lopes et al., 2015). This software communicated with the Arduino to actuate events in the arena. Bonsai received real-time inputs from the bottom camera and the top camera. All codes are available upon request.

To detect occupied sites, an ROI was drawn around each cache site in the green channel of the video recorded by the bottom camera. The green channel was used because it provided high contrast with the red silicone material. At the beginning of each trial, the background of each ROI was subtracted using BackgroundSubtraction (subtractionMethod Absolute, ThresholdType ToZero, ThresholdValue 6). In each frame, novel objects were then detected using FindContours (Method ChainApproxNone, minArea 4) and counted using BinaryRegionAnalysis. The number of occupied sites was then passed through a lowpass filter (yn=0.01xn+0.99yn-1 , where xn is the count and yn is the filtered count). The value of yn rounded to the nearest integer was used by Bonsai as the current number of seeds in the arena. This filter introduced a delay of ~1 s between a cache occurring and being detected, ensuring that checks were not counted as caches.

To detect visits to feeders, an ROI was drawn around each feeder in the video recorded by the top camera. The sum of pixel values in this ROI was computed, and a threshold was chosen separately for each feeder to reliably detect the chickadee’s head entering the ROI. Because chickadees are black-capped, all feeders were lighter in color, and ROI entrances corresponded to negative threshold crossing.

Behavioral protocol

Habituation

Request a detailed protocol

Birds selected for experiments were singly housed and weighed daily. Primary feathers were trimmed to prevent flight. Initially, birds were given an ad-libitum supply of small bird diet (Mazuri, St. Louis, MO, USA). Due to changes in animal husbandry protocols, for some birds, the diet was supplemented with dried mealworms. This change in diet had no significant effect on any results reported in the paper. Each bird’s cage contained a small platform with 4–6 ‘replica’ cache sites identical to those later used in the behavioral arena.

Birds were first given a 4-day period of acclimation and habituation to handling. After 4 days, they were gradually habituated to food restriction. First, birds were not given food for the first two hours of the lights-on period. This period was gradually increased to 4 hr (occasionally 3–5). Birds were weighed at the end of the food restriction period, and the length of this period was increased only if the bird’s weight remained stable (fluctuations less than ~0.5 g) for 4 days. Experiments began after 14–21 days, once birds had stable weight on 3–5 hr of food restriction and were observed caching into the replica cache sites in their home cages. Some birds underwent 1–3 days of sessions in the olfactory control experiment (see Experimental model and subject details). Other birds underwent several weeks (14–45 days, median 25) of sessions in the Caching task and/or several weeks (7–82 days, median 58) of sessions in Retrieval task. For birds that were used in both of these tasks, the Caching task preceded the Retrieval task. In all tasks, prior to beginning the session, cache sites were emptied between sessions to ‘reset’ the arena.

Caching task

Request a detailed protocol

Each bird was recorded in one 1-hr long behavioral session per day, starting immediately after the food restriction period. These sessions were run every day for the duration of the experiment. During these session, all four feeders in the arena were stocked with chopped sunflower seeds (~25 mg fragments). At any moment in time, the arena could be in one of five states. In state 0, all feeders were closed. In states 1–4, the corresponding feeder (1-4) was open, while the other three feeders were closed. In most sessions, the state of the arena at the beginning of the session was chosen randomly with probability 1/4 of choosing state 0 and 3/16 of choosing any of the other states. A timer was also started, with duration chosen randomly from an exponential distribution between 60 and 90 s with 15 s decay. Once the timer duration elapsed, a new state was chosen, with the same probabilities: 1/4 for state 0 and 3/16 for the others. The timer was restarted with a new randomly chosen duration. A given state could repeat more than once, with one exception: if state 0 has occurred three times in a row, states 1–4 were chosen randomly with 1/4 probability each. In some early sessions, state 0 was omitted. In these sessions, the arena could be in states 1–4, and all transition probabilities between states were 1/4. For some earlier birds, the timer duration was also chosen from smaller values: exponential distribution between 30 and 60 s with 15 s decay. In later birds, this duration was switched to 60–90 s because it was observed to promote caching. There was no difference in parameters between birds run on either protocol.

In most session, we also turned lights off periodically, which helped motivate food seeking and caching behaviors. In these cases, 5 min lights-on periods were alternated with lights-off periods, which lasted 0.5, 1, or 2 min in different sessions. In early sessions for each bird, the lights-off period was omitted.

Except for the analysis in Figure 5—figure supplement 1A,C, we only analyzed data from those 10 birds that made at least 64 caches across all sessions. This threshold was chosen conservatively as the minimum number that could allow the bird to cache at least once into each site. For the analysis in Figure 5—figure supplement 1A,C, all 17 birds were used. For all analyses, we only included sessions where birds made at least 10 caches. For each bird, sessions were pooled across all feeder opening/closing and lights on/off conditions that were used for that bird.

Retrieval task

Request a detailed protocol

Each bird was recorded in one behavioral session per day, which lasted at least 30 min (see below). These sessions were also run every day for the duration of the experiment. The session consisted of trials, each of which contained three phases: the feeder-open phase, the delay phase, and the feeder-closed phase.

The feeder-open phase consisted of two periods: the ‘eating’ period and the ‘caching’ period. In the eating period, one of the feeders was open, while the other three were closed. A timer was started at the beginning of this period, with a random duration chosen from an exponential distribution between 30 and 60 s with 15 s decay. Once the timer had elapsed, a new feeder was chosen to be open with 1/4 probability, with the other three feeders closed; the same feeder could be repeated more than once. The timer was restarted with a new randomly chosen duration.

The caching period started whenever the bird cached the first seed – that is, making one of the sites in the arena occupied. When this first cache was made, a new feeder was immediately chosen to be open with 1/4 probability. Subsequently, a new feeder was chosen with 1/4 probability whenever a new site was occupied by a cache, allowing the same feeder to be repeated more than once. This was done to motivate caching throughout the arena, rather than in the vicinity of the same feeder. When the bird visited the open feeder after caching the first seed, a 15 s timer was started. The feeder-open phase of the trial was terminated whenever this timer elapsed or three sites in the arena became occupied, whichever came first.

In the delay phase, all four feeders were closed, and lights in the arena were turned off. The delay phase lasted 2 min. After this 2 min period, lights were turned back on, and the feeder-closed phase was initiated.

In the feeder-closed phase, all four feeders remained closed at all times. If the bird retrieved all of the caches that it made earlier in the same trial, the feeder-closed phase terminated 15 sec after the removal of the last cache. This 15 s window was implemented to accommodate possible recaching of the last retrieved seed. The feeder-closed phase also terminated after 10 min if the bird had not retrieved all of the caches that were made in the same trial. After the feeder-closed phase terminated, a new trial was immediately started if the duration of the session had not yet exceeded 30 min. The session was also terminated after 30 min if the bird never made any caches.

Experiment controlling for olfactory cues

Request a detailed protocol

Birds were recorded in three sessions per day. During the first session (habituation session), all feeders were closed, and the bird had no access to food. The bird’s location in the arena was tracked for 5 min to measure the ‘baseline’ location distribution. Next, the bird was removed from the arena, then immediately reintroduced for another session (caching session). In the caching session, all feeders were open and the bird was allowed to make three caches. After three caches were made, the bird was again removed from the arena and placed into a cage with no food for a 30–60 min delay period. During this delay, the feeders were closed and all food was removed from the arena, including the bird’s caches and any scattered food. After the delay, the bird was reintroduced into the arena for the final session (test session). During this session, the bird’s location was again tracked for 5 min.

Quantification and statistical analysis

Annotating site interactions

Request a detailed protocol

Using the bottom camera, we defined an ROI around each cache site. Pixel values from the first 100 frames of the video, when all cache sites were empty, were averaged to obtain a ‘baseline’ image for each ROI. For each subsequent frame in the video, we then measured two values: the Pearson correlation with the baseline image (Rb), and the Pearson correlation with the previous frame (Rp). Rb was normalized by subtracting the median and dividing by the standard deviation. Rp was transformed by subtracting a moving average of 200 frames. The MATLAB findpeaks function was used to detect changes in correlation that occurred in either one of these values (Rb : minPeakDistance 100, minPeakHeight 10; Rp : minPeakDistance 100, minPeakHeight 0.3). All of these events were visualized with side and bottom cameras and manually classified as caches, retrievals, checks, or false positives. Any period of time where the lights of the arena were turned off was excluded from this analysis.

Location tracking with deep neural networks

Request a detailed protocol

To determine the bird’s location, we trained a deep neural network (Mathis et al., 2018; Nath et al., 2019) to track two locations on the bird’s body using the top camera: the tip of the beak and the location halfway between the two feet. We then defined an ROI around each feeder, as well as around the perch adjacent to each cache site. A feeder visit was defined as the bird’s beak entering the corresponding ROI while the feeder was open. A cache site visit was defined as the bird’s feet entering the corresponding ROI.

Spatial distribution of caches and checks

Request a detailed protocol

Spatial distributions of caches (Caching task) and checks (Retrieval task) were quantified by computing the pbias values (see Probabilistic model of behavior). To calculate whether these distributions differed from a uniform distribution, we measured the entropy of pbias as

(1) H=ipibiaslog2pibias

For our arena, H could range between 0 bits for a maximally biased bird that uses only one site to six bits for perfectly uniform behavior. To assess statistical significance, we compared this value of H to values from 10,000 simulations, in which the same number of events was drawn randomly from a uniform distribution spanning all sites in the arena.

To determine whether pbias was specific to individual birds, we divided each bird’s sessions into two groups (A and B) and assigned each session to one of these groups. The two groups were equal-sized if the number of sessions was even, or differed by one session if the number of sessions was odd. We computed distributions of cache or check locations separately for the two groups (pbias,A and pbias,B), using the same formula as for pbias. We then computed the Pearson correlation between pbias,A and pbias,B for each bird and defined ‘within-bird correlation’ as the median of these values across birds. To compute ‘across-bird correlations’, we shuffled the identities of all birds 1000 times and computed the median Pearson correlation between pbias,A and pbias,B across these 1000 shuffles. The entire process was repeated for 1000 shuffled datasets, in which the assignment into groups A and B was randomized.

For the ‘within PC1 cluster’ analyses in Figure 2D and Figure 2—figure supplement 2D, we smoothed all values of pbias with a 3 × 3 site Gaussian window (σ=7.1 cm, i.e. the distance between neighboring sites). We computed the first principal component of all smoothed pbias values across birds. We grouped all birds into two clusters, using k-means clustering. We then performed the same process as above, but with one exception: during the shuffling of bird identities, a bird was only allowed the identity of a bird from the same cluster.

To measure the stability of pbias over time, we measured the distribution of cache or check locations for each trial separately, using the same formula as for pbias. We smoothed these distributions with a 3 × 3 site Gaussian window (σ=7.1 cm). We then measured the Pearson correlation value across all pairs of smoothed distributions that were Δs sessions apart and took the median across all of these pairs of sessions. This value was computed for all Δs from one to N. Here, N was determined as the value for which in the Caching task there were at least 2 pairs of sessions available from at least five of the birds. This value was N=8 and was applied to both tasks.

Statistics

Unless otherwise stated, the following statistical methods were used throughout the paper. To test whether a given model (Model A) was a significant improvement over another model (Model B), we computed AIC values for the two models and the difference between these values (ΔAIC). A lower AIC value for Model A (ΔAIC < 0) would indicate its improvement over Model B. To determine the statistical significance of such a result, we recomputed ΔAIC for 1000 samples bootstrapped across birds. We then found the fraction of these bootstrapped ΔAIC values that were >0 and reported that fraction as the p-value. These values are summarized in Table 1.

For indicating the best-fit parameter values, we reported the median and the standard error of the median across birds. To test whether these values were significantly different from 0 across all birds, we used a one-tailed Wilcoxon signed rank test. To determine whether parameter values were individually significant for a particular bird, we bootstrapped the behavioral sessions of that bird and re-fit the model to each set of bootstrapped data. We then measured the p-value the fraction of bootstrapped parameters that were above or below 0 (depending on the direction of the effect across birds). These values are summarized in Table 2.

Probabilistic model of behavior

Consider a bird recorded in a set of behavioral sessions within the arena. This arena contains multiple discrete sites. Each site is either a cache site (i.e. location covered by a silicone flap) or a feeder. We will consider events that occur at these sites.

We first consider cache sites. At a cache site, a check is an opening of the silicone rubber flap by the bird, which allows the bird visual access to the contents of the site. A cache is a placement of seeds into the site. A retrieval is a removal of seeds from the site. Caches and retrievals necessarily require the bird to open the silicone flap; therefore, each cache and retrieval at a cache site is also considered to be a check. For feeders, we define a retrieval as a removal of a seed. A retrieval is the only type of an event that is considered at feeders.

We define a site interaction as any cache, check, or retrieval – either at a cache site or at a feeder. Visits to sites that do not include at least one of these three types of events (e.g. landing at a site) are not considered to be interactions and are not included in the analysis.

For each bird, the experiment consists of non-overlapping trials. In the Caching task, each behavioral session is considered to be one trial. In the Retrieval task, each trial is a period of time consisting of the feeder-open phase, the delay phase, and the feeder-closed phase. In this task, each behavioral session contains at least one trial, and usually multiple trials.

Each site is considered to be occupied if it contains at least one seed and empty if it does not. Feeders have a large capacity and are occupied at all times. Cache sites are considered to be empty at the beginning of each trial. If some cache sites contain seeds remaining from a previous trial, those seeds are ignored in the analysis. This occasionally happens in the Retrieval task if the bird had failed to retrieve all the caches during one of the previous trials (< 1% of all retrievals). Any withdrawal of such a seed is not considered to be a retrieval. However, any subsequent placement of that seed into a cache site is considered to be a cache.

Definitions of variables

Request a detailed protocol

The following variables are defined for each bird:

  • S is the number of sites in the arena. In the arena described in this paper, there are 64 cache sites and four feeders. Therefore, S=68.

  • R is the number of behavioral trials.

  • I is the number of site interactions performed by the bird across all R trials.

For each site interaction i, where 1iI, the following are defined:

  • si is the index of the site where the interaction occurred, where 1siS.

  • ri is the index of the trial during which the interaction occurred, where 1riR.

  • ϕi is the phase of the trial during which the interaction occurred, defined as ϕi=0 for all site interactions during the Caching task, ϕi=1 for interactions during the feeder-open phase of the Retrieval task, and ϕi=2 for interactions during the feeder-closed phase of the Retrieval task. There are no interactions during the delay phase of the Retrieval task.

  • Δi is the change in the number of seeds at site si during the interaction. I.e., Δi<0 for retrievals, Δi>0 for caches, and Δi=0 for checks that are not coincident with either a cache or a retrieval.

For each site s, where 1siS, the following is defined:

  • xs is the location of the site. Here, xs is a vector indicating location in 2-dimensional space.

  • Several additional variables are defined for convenience of notation.

The Euclidean distance between sites s and s` is:

(2) dss=xsxs

Next, we consider the time point immediately preceding interaction i. Several variables are defined to indicate the state of the behavioral arena at this point. For the following definitions, ‘current trial’ is the trial that contains interaction i.

  • nis is the occupancy of site s. The value is nis=1 if the site is occupied, or nis=0 if the site is empty.

  • Ni is the total occupancy of the arena – that is, the number of cache sites that are occupied.

  • tis is the amount of time that has elapsed since the most recent interaction with site s in the current trial. If the bird has not interacted with site s in the current trial, then tis=.

  • cis indicates whether site s is ‘checked-empty’. That is, cis=1 if the bird has checked site s in the current trial, and that site is empty. If the bird has not checked site s in the current trial, or if site s is occupied, then cis=0.

  • Ci is the number of checked-empty cache sites in the arena.

  • nisr indicates whether the latest cache into site s in the current trial has been a recache. That is, nisr=1 if the bird has cached into site s during the feeder-closed phase of the task, and site s is occupied. If the bird has only cached into site s during the feeder-open phase, or if site s is empty, then nisr=0.

  • μi indicates whether the next interaction in the current trial that involves a seed is a cache. If the next interaction involving a seed is a cache, μi=1. If the next such interaction is a retrieval, or if the bird never interacts with a seed again in the current trial, μi=0.

Model parameters

Request a detailed protocol

Twelve parameters are used by various models described in the main text. These are defined in the factors described in Equations 5–9. We define a vector of these parameters:

(3) θ=[γprvσprvγoccσoccγempσempγocccγoccr1/τocc1/τemp1/νocc1/νemp]

Inversions of the last four parameters are done for the purpose of parameter regularization, described later.

Interaction masks

Request a detailed protocol

Models described in the text are applied to specific subsets of interactions. A single model is sometimes applied to different subsets: for example, Model #1 is used to quantify the effect of proximity either on caches or on checks, which are two different subsets of interactions. To define such subsets, we use interaction masks (Table 3). For each interaction i, the value of the mask is 1 if that interaction is included in the analysis or 0 if it is not. For different formulas below, two masks will be used: Gi is used to mask interactions based on the task (Caching or Retrieval); Hi is used to further mask subsets of interactions within these tasks.

Table 3
Masks for different subsets of interactions.
Subset of interactionsGiHi
All caches in the Caching task­Gi={1,ϕi=0,Δi>00,otherwise­Hi={1,ϕi=0,Δi>00,otherwise
First checks in the Caching task. This mask includes the first check that the bird made after retrieving each seed that was subsequently cached (i.e. it does not include checks made after retrieving seeds that were subsequently eaten without caching).­Gi={1,ϕi=0,Δi>00,otherwise­Hi={1,ϕi=0,Δi1<0,μi=10,otherwise
All checks in the feeder-closed phase of the Retrieval task­Gi={1,ϕi=20,otherwise­Hi={1,ϕi=20,otherwise
All checks up to and including finding a cache (i.e. making the first check of an occupied site) in the feeder-closed phase of the Retrieval task. If no cache is found in the feeder-closed phase of the trial, this includes all checks up until the end of the trial.­Gi={1,ϕi=20,otherwise­Hi={1,ϕi=2,j<irj=rinjsj=00,otherwise
First checks in the feeder-closed phase of the Retrieval task. This mask includes the first check made during the feeder-closed phase of each trial.­Gi={1,ϕi=20,otherwise­Hik={1,ϕi1=1,ϕi=20,otherwise

Scaling factors

Request a detailed protocol

In the baseline model (Model #0), the probability of an interaction occurring at a particular site given by the distribution pbias. We compute pbias for either the Caching task or the Retrieval task using the mask G described above. For every site s:

(4) psbias=i=1IGiδsisi=1IGi

where δ is the Kronecker Delta, defined as δxy=1 if x=y, and δxy=0 if xy.

We next define scaling factors. A scaling factor is a function defined for each interaction i and site s. These factors will be used later to scale values of pbias. Positive values of these scaling factors indicate an increase in probability, and negative values indicate a decrease in probability.

The first factor is used to model the effect proximity to the previous site interaction. It is a Gaussian that decays with distance from the previous site, defined at distances > 0. Empirically, we found that models fit better if the value of this scaling factor was set to 0 at a distance of 0. This is due to the fact that birds were unlikely to interact with the exact same site twice in a row, even though they were highly likely to interact with neighboring site. Therefore, we defined the scaling factor as follows:

(5) αis(1)={γprvedsi1s2/2σprv2,ssi10,otherwise

The second factor is used to model the effect of distance from occupied sites in the arena. It is a summation of Gaussians, each centered at an occupied site. The amplitude decays in time (in order to model memory decay) and with the occupancy of the arena (in order to model memory capacity). Note that in most models in the paper these decays are not included (i.e. τocc= and/or νocc=).

(6) αis(2)=γoccs=1S(edss2/2σocc2)(etis/τocc)(eNi/νocc)nis

The third factor is analogous to the second factor, but for checked-empty sites in the arena:

(7) αis(3)=γemps=1S(edss2/2σemp2)(etis/τemp)(eCi/νemp)cis

The fourth and fifth factors are used to separately model the effects of caches made in the feeder-open phase of the trial and caches made in the feeder-closed phase (i.e. ‘recaches’):

(8) αis(4)=γoccrnisr
(9) αis(5)=γoccc(nisnisr)

Each of the scaling factors k=1,2,,5 is converted to a multiplicative form as follows:

(10) βis(k)=10αis(k)

Un-normalized probability of interaction i occurring at site s is then computed by multiplying all five scaling factors by the baseline probability. Because this value depends on all model parameters, we denote it as a function of θ:

(11) pis(θ)=psbiask=15βis(k)

This value is converted to normalized probability that sums to one across all sites of the arena:

(12) p^is(θ)=pis(θ)s=1Spis(θ)

Model definitions

Request a detailed protocol

We describe twelve different models in the main text. All of these models can be expressed using the mathematical formulation described above. The difference between these models is the parameter space: In some models, a particular parameter is fixed at a value of 0, whereas in other models, that same parameter is a free parameter permitted to take values within a certain range. Table 4 specifies the parameter space for each model. Models are numbered using index M=0,1,,11. Parameters within vector θ are numbered using index z=1,2,,12. For each model M and parameter z, the table indicates P(M,z) – the set of permitted values for that parameter. For parameters whose values are fixed at zero, P(M,z)={0} . We use fM to indicate the number of free parameters in model M; this value is also given in the table.

Table 4
Permitted ranges of parameters, P(M,z), for main models.
Model index
(M)
Parameter# free param
(fM)
γprvσprvγoccσoccγempσempγocccγoccr1/τocc1/τemp1/νocc1/νemp
Parameter index (z)
123456789101112
0{0}{0}{0}{0}{0}{0}{0}{0}{0}{0}{0}{0}0
1(-∞,∞)[0,∞){0}{0}{0}{0}{0}{0}{0}{0}{0}{0}2
2(-∞,∞)[0,∞)(-∞,∞){0}{0}{0}{0}{0}{0}{0}{0}{0}3
3(-∞,∞)[0,∞)(-∞,∞){0}(-∞,∞){0}{0}{0}{0}{0}{0}{0}4
4(-∞,∞)[0,∞)(-∞,∞)[0,∞)(-∞,∞){0}{0}{0}{0}{0}{0}{0}5
5(-∞,∞)[0,∞)(-∞,∞){0}(-∞,∞)[0,∞){0}{0}{0}{0}{0}{0}5
6(-∞,∞)[0,∞){0}{0}{0}{0}(-∞,∞){0}{0}{0}{0}{0}3
7(-∞,∞)[0,∞){0}{0}{0}{0}(-∞,∞)(-∞,∞){0}{0}{0}{0}4
8(-∞,∞)[0,∞)(-∞,∞){0}(-∞,∞){0}{0}{0}[0,∞){0}{0}{0}5
9(-∞,∞)[0,∞)(-∞,∞){0}(-∞,∞){0}{0}{0}{0}[0,∞){0}{0}5
10(-∞,∞)[0,∞)(-∞,∞){0}(-∞,∞){0}{0}{0}{0}{0}[0,∞){0}5
11(-∞,∞)[0,∞)(-∞,∞){0}(-∞,∞){0}{0}{0}{0}{0}{0}[0,∞)5

For each model, the parameter space is defined as the Cartesian product of permitted ranges across all twelve parameters:

(13) Θ(M)=P(M,1)×P(M,2)××P(M,12)

Log-likelihood computation

Request a detailed protocol

The log-likelihood of the model is computed by adding log-probabilities across all interactions observed in the data. For this purpose, the interaction mask H is used to only include the subset of the interactions that is being fit by the model:

(14) L(θ)=i=1IHilnp^isi(θ)

To pull data across birds, we denote the log-likelihood for each bird b as Lb(θ) . Here 1bB, where B is the number of birds. The pooled log-likelihood is then computed as:

(15) Lpool(θ)=b=1BLb(θ)

The ‘cost’ of the model fit is computed by negating the likelihood and adding a Ridge regularization term. Ridge regularization penalizes large magnitudes of parameters and helps prevent overfitting. For bird b, the cost is:

(16) Jb(θ)=Lb(θ)+λj=112θj2

For the pooled data, the cost is similarly:

(17) Jpool(θ)=Lpool(θ)+λj=112θj2

Here, λ is the regularization parameter. We use λ=1 for the Caching task and λ=10 for the Retrieval task. These values are different due to different numbers of data points in the two tasks.

Model fitting and evaluation

Request a detailed protocol

To fit each model M using maximum-likelihood estimation, we determine values within the parameter space of model M that minimize the cost. These best-fit parameters are given by θ^b(M) for each bird b and by θ^pool(M) for the pooled data:

(18) θ^b(M)=arg minθΘ(M) Jb(θ)
(19) θ^pool(M)=arg minθΘ(M) Jpool(θ)

We fit each model using the MATLAB fmincon function. Each model was fit five times with different initial conditions, and the solution with the minimum cost was selected.

To evaluate each model, we used the Akaike Information Criterion (AIC). Smaller AIC values indicate better model performance. AIC value is improved by increased model likelihood, but is penalized by adding extra parameters. We computed the AIC value on the pooled data:

(20) AIC(M)=2Lpool(θ^pool(M))+2fM

To determine significance, 1,000 pools of data were constructed by bootstrapping across birds. For each bootstrap, the AIC values of all models were calculated and compared to find the percentage of bootstraps where model performance improved.

Data availability

The following data is located on Dryad: For the Caching task, we include annotations of all trials. For each trial, we annotate the time and location of every cache, retrieval, and site check, as well as the bird's location at every timepoint. We also include the times and identity of any feeder sites that were open, as well as the times of any lights-off periods. For the Retrieval task, we include annotations of all trials. For each trial, we annotate the location and time of any caches made during the feeder-open phase of the trial, as well as the time that feeder-closed phase began and ended. We also included the time and location of the bird's recaches, retrievals and site checks during the trial. We finally include the bird's location at every time point.For the control experiment described in Figure 6—figure supplement 1, we include annotations of the location of the bird for every timepoint in the habituation session and the test session, as well as the location of caches from the caching session. For all behavioral models described in the paper, we include the bootstrapped AIC values as well as the best-fit values of all model parameters for each bird.

The following data sets were generated
    1. Applegate M
    2. Aronov D
    (2022) Dryad Digital Repository
    Flexible use of memory by food-caching birds.
    https://doi.org/10.5061/dryad.6wwpzgn12

References

    1. Brodin A
    (2010) The history of scatter hoarding studies
    Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences 365:869–881.
    https://doi.org/10.1098/rstb.2009.0217
    1. Clayton NS
    2. Dickinson A
    (1999)
    Memory for the content of caches by scrub jays (Aphelocoma coerulescens)
    Journal of Experimental Psychology. Animal Behavior Processes 25:82–91.
    1. Krushinskaya N
    (1966)
    Some complex forms of feeding behavior of nutcracker Nucifraga caryocatactes, after removal of old cortex
    Journal of Evolutionary Biochemistry and Physiology 10:563–568.
    1. Lamprea MR
    2. Cardenas FP
    3. Setem J
    4. Morato S
    (2008) Thigmotactic responses in an open-field
    Brazilian Journal of Medical and Biological Research = Revista Brasileira de Pesquisas Medicas e Biologicas 41:135–140.
    https://doi.org/10.1590/s0100-879x2008000200010
    1. Pravosudov VV
    (1985)
    Search for and storage of food by Parus cinctus lapponicus and P. montanus borealis (Paridae
    Zoologicheskii Zhurnal 64:1036–1043.
    1. Sherry DF
    (1984b) What food-storing birds remember
    Canadian Journal of Psychology/Revue Canadienne de Psychologie 38:304–321.
    https://doi.org/10.1037/h0080835
  1. Book
    1. Tulving E
    (1972)
    Episodic and semantic memory
    In: Tulving E, Donaldson W, editors. Organization of Memory. Cambridge University Press. pp. 381–403.

Decision letter

  1. Jesse H Goldberg
    Reviewing Editor; Cornell University, United States
  2. Laura L Colgin
    Senior Editor; University of Texas at Austin, United States

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Decision letter after peer review:

Thank you for submitting your article "Flexible use of memory by food-caching birds" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by Laura Colgin as the Senior Editor. The reviewers have opted to remain anonymous.

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions:

1) All reviewers agreed that the paper overstates the novelty of the behavioral findings. Please address issues raised below.

A main concern is to what extent this paper truly offers an original behavioral result with respect to the flexibility of memory. It is known that birds can remember seed caches and the major gains of the present manuscript are not revealing a new faculty but rather describing a method by which to study that faculty in the laboratory. Also, the sample size is rather small (N = 7 to 17) and variable across figures and analyses. For example, many readers will be familiar with decades-old studies by Sherry, Clayton, and Shettleworth that showed similar flexible memory use by caching birds (e.g. Raby et al., 2007; Shettleworth, 1990; Clayton et al., 2003). As is, the current introduction simply states that flexible memory is 'a matter of debate' in animals – but those familiar with previous studies of caching would likely disagree. The authors should re-write the introduction and abstract to be as generous as possible to past field work, to specify precisely what was previously known, and what is the truly new behavioral finding (if any) in the present work. For example, it's conceivable that this paper could be framed differently, i.e. "it has long been known that caching birds exhibit flexible memory that enable them to target specific sites differentially for depositing versus retrieving seeds, here we leverage this flexible memory capacity to quantify precisely how remembered cache locations influence foraging on a moment to moment basis." (or something like this…)

As in, a potential strength of the paper is the behavioral modeling which identifies the precise candidate considerations (parameters) that influence moment by moment aspects of foraging. This model has the potential to provide an algorithmic level description of the task, which in turn provides numerical values that could conceivably be identified in neural signals in future recording studies (see point 2 below to shore up this potential). It's one thing to show that caching birds have flexible memory (as past work has shown), but what seems uniquely new about this paper is that it leverages this past work towards a model that makes instant by instant predictions for how 'special' places influence behavior in a space- and time-dependent way. This algorithmic level description of the behavior is what could be novel, which should therefore be emphasized in the abstract/intro.

(2) If, as the reviewers suggest, the focus of the paper shifts from a descriptive account of the behavior in the new preparation to an emphasis on the validity of the statistical models, then more needs to be done to motivate the modeling decisions. It is currently not convincing that the parameters of the model are dissociable, especially since birds can continually cache and retrieve and recache.

Here are some unanswered questions that should be addressed if we are to take the quantitative parameter estimates of the models seriously, rather than the overall goodness of fit revealing the existence of some feature of the behavior (e.g. behavior is spatially autocorrelated).

(2.1.) Is log-linear interaction of the parameters appropriate or are there more complex dependencies?

(2.2) Is a Gaussian spatial autocorrelation function appropriate? Is the standard deviation fixed across space? Across strategy?

(2.3) Is time-dependent forgetting tested/modeled appropriately? There was an odd mismatch between the statement about preference for re-caching and the lack of a forgetting curve. What is the longest the animals will have a benefit from remembering caches? If only during one session, is this then perhaps a weakness in the task since one would expect birds in the wild to cache for longer time periods? Given that birds are rechecking single sites every 2.4 minutes, the authors should provide a more convincing argument how this unstructured task addresses the issue of memory decay.

(3) Please clarify how the spatial biases identified for individual birds affect caching strategy.

Currently the authors separately present first that birds have individual spatial biases and later on with a modelling approach show that previous visits and general proximity also biases the animals search and caching behaviour. But how do the two aspects relate to each other? One would assume that a general individual bias would decrease the number of possible locations the animal in general has to look at and therefore it is perhaps a mix of memories being used: the bias memory as well as the episodic memory of caching food. If one would include the bias as well as the principles that the animals have a proximity preference, how much variance will still be explained by the episodic memory? Or is the individual bias a result of the proximity bias? It would be good if the authors could create one major model which includes all identified factors to see how much which factor contributes.

(4) Please provide supplemental videos and increased clarity of the experimental timeline.

Further, it would help readers to understand the details of the behaviour if for example videos of the bird behaviour are provided with the manuscript for the different types of trials. That would help the reader understand in which times scales the animal is remembering and deciding. Further, an overall schematic of the timeline for the birds would be helpful. How long were they trained and habituated on what? How many sessions per day? How many days/weeks in total? For example days/weeks could go to the right and then down would be time within a day since multiple sessions were done per day. Then it can also be marked which data was included for analysis.

(5) Several technical points, if addressed, could make the statistics a bit stronger.

First, for the description of spatial biases, it appears that the same data is used to assign cluster membership and then quantify cluster separation. Some sort of cross-validation should be used throughout the analyses plotted in Figure 2 (Panel 2E notwithstanding).

Second, for the model fitting, a point estimate is used. It would be good to know the confidence around these point estimate and here a Bayesian framework would be helpful perhaps using Markov chain Monte Carlo methods to finds highest density intervals for likely parameter estimates (e.g. Annis and Palmeri 2017, doi: 10.1002/wcs.1458; Kruschke 2013 DOI: 10.1037/a0029146).Reviewer #1 (Recommendations for the authors):

Applegate and Aronov develop an automated animal and cache-site tracking system to study foraging strategies in chickadees. A key strength of this system is that each moment of seed deposit, retrieval, and 'checks' is marked alongside the animal's spatial position. Tracking animals and their caching presents tremendous opportunities for the study of memory storage and retrieval. The paper's main strength is the quantitative modeling of the caching behavior, which is elegant and has explanatory power over moment-to-moment navigation decisions. The paper's main weakness is overstatement of the novelty of the discovery of flexible memory use by caching birds. This paper need not 'discover' flexibly memory use to be of general interest for eLife readership. Leveraging this very unique behavioral capacity for the generation of highly predictive foraging models is sufficient on its own to make a large impact.

(1) The setup of this paper unnecessarily overstates the novelty of the behavior and understates the utility of the behavioral modeling for future mechanistic studies.

A main concern is to what extent this paper truly offers an original behavioral result with respect to the flexibility of memory. For example, many readers will be familiar with decades-old studies by Sherry, Clayton, and Shettleworth that showed similar flexible memory use by caching birds (e.g. Raby et al., 2007; Shettleworth, 1990; Clayton et al., 2003). As is, the current introduction simply states that flexible memory is 'a matter of debate' in animals – but those familiar with previous studies of caching would likely disagree. The authors should re-write the introduction and abstract to be as generous as possible to past field work, to specify precisely what was previously known, and what is the truly new behavioral finding (if any) in the present work. For example, it's conceivable that this paper could be framed differently, i.e. "it has long been known that caching birds exhibit flexible memory that enable them to target specific sites differentially for depositing versus retrieving seeds, here we leverage this flexible memory capacity to quantify precisely how remembered cache locations influence foraging on a moment to moment basis." (or something like this…)

As in, the strength of the paper is the behavioral modeling which identifies the precise candidate considerations (parameters) that influence moment by moment aspects of foraging. This model provides an algorithmic level description of the task, which in turn provides numerical values that could conceivably be identified in neural signals in future recording studies. It's one thing to show that caching birds have flexible memory (as past work has shown), but what seems uniquely new about this paper is that it leverages this past work towards a model that makes instant by instant predictions for how 'special' places influence behavior in a space- and time-dependent way. This algorithmic level description of the behavior is what's novel, which should therefore be emphasized in the abstract/intro.

Discussion paragraph lines 375-387 provides the most accurate description of the contribution of the present work.Reviewer #2 (Recommendations for the authors):

In this article the authors present a new behavioural task in birds and combine it with a modelling approach to show that birds can use food-caching memories to guide both the retrieval of cached food as well as to guide the decision where to cache new food. Overall this is an interesting study and with mainly only minor issues.

The authors claim in the abstract that they could show that a single memory can be used for at least two unrelated goals. However, in this case the goals are not really unrelated, in my view. Caching new food and retrieving old food is still part of one behavioural aspect/goal of the animal.

Currently the authors separately present first that birds have individual spatial biases and later on with a modelling approach show that previous visits and general proximity also biases the animals search and caching behaviour. But how do the two aspects relate to each other? One would assume that a general individual bias would decrease the number of possible locations the animal in general has to look at and therefore it is perhaps a mix of memories being used: the bias memory as well as the episodic memory of caching food. If one would include the bias as well as the principles that the animals have a proximity preference, how much variance will still be explained by the episodic memory? Or is the individual bias a result of the proximity bias? It would be good if the authors could create one major model which includes all identified factors to see how much which factor contributes.

Further, it would help readers to understand the details of the behaviour if for example videos of the bird behaviour are provided with the manuscript for the different types of trials. That would help the reader understand in which times scales the animal is remembering and deciding. Further, an overall schematic of the timeline for the birds would be helpful. How long were they trained and habituated on what? How many sessions per day? How many days/weeks in total? For example days/weeks could go to the right and then down would be time within a day since multiple sessions were done per day. Then it can also be marked which data was included for analysis.

The authors did not provide adequate statistical information related to each reported p-value. Example: Line 117 → "In all birds, entropy was lower than expected by chance p < 0.001", they did not clarify what is the chance probability and which statistical test they used to compute the p-value. I assume they computed it based on the shuffling analysis (lines 629 and 630) but it needs to be clearly and more adequately described.Reviewer #3 (Recommendations for the authors):

In this manuscript Applegate and Aronov used statistical modeling to study the food caching strategies of the black-capped chickadee in the laboratory setting. The authors show that in a modest sample of birds a subset prefer the center of their arena (61cm x 61cm). Birds tended to have spatially autocorrelated behaviors and remembered cached locations as evidenced by avoiding locations that contain a seed while caching and seeking baited locations when retrieving. Birds would often retrieve and recache seeds and these were preferred as were the last seeds cached. In short, the authors can replicate important aspects of natural behaviors in their experimental preparation.

The presented findings clearly set the ground work for careful behavioral analyses of future studies linked to a more mechanistic understanding of why the birds behave as they do. In the present form, the statistical modeling is rather descriptive and the insights gained will likely relate to a rather narrow audience interested in laboratory assessments of spatial memory. In addition, it is known that birds can remember seed caches and the major gains of the present manuscript are not revealing a new faculty but rather describing a method by which to study that faculty in the laboratory. Also, the sample size is rather small (N = 7 to 17) and variable across figures and analyses.

https://doi.org/10.7554/eLife.70600.sa1

Author response

Essential revisions:

(1) All reviewers agreed that the paper overstates the novelty of the behavioral findings. Please address issues raised below.

A main concern is to what extent this paper truly offers an original behavioral result with respect to the flexibility of memory. It is known that birds can remember seed caches and the major gains of the present manuscript are not revealing a new faculty but rather describing a method by which to study that faculty in the laboratory. Also, the sample size is rather small (N = 7 to 17) and variable across figures and analyses. For example, many readers will be familiar with decades-old studies by Sherry, Clayton, and Shettleworth that showed similar flexible memory use by caching birds (e.g. Raby et al., 2007; Shettleworth, 1990; Clayton et al., 2003). As is, the current introduction simply states that flexible memory is 'a matter of debate' in animals – but those familiar with previous studies of caching would likely disagree. The authors should re-write the introduction and abstract to be as generous as possible to past field work, to specify precisely what was previously known, and what is the truly new behavioral finding (if any) in the present work. For example, it's conceivable that this paper could be framed differently, i.e. "it has long been known that caching birds exhibit flexible memory that enable them to target specific sites differentially for depositing versus retrieving seeds, here we leverage this flexible memory capacity to quantify precisely how remembered cache locations influence foraging on a moment to moment basis." (or something like this…)

As in, a potential strength of the paper is the behavioral modeling which identifies the precise candidate considerations (parameters) that influence moment by moment aspects of foraging. This model has the potential to provide an algorithmic level description of the task, which in turn provides numerical values that could conceivably be identified in neural signals in future recording studies (see point 2 below to shore up this potential). It's one thing to show that caching birds have flexible memory (as past work has shown), but what seems uniquely new about this paper is that it leverages this past work towards a model that makes instant by instant predictions for how 'special' places influence behavior in a space- and time-dependent way. This algorithmic level description of the behavior is what could be novel, which should therefore be emphasized in the abstract/intro.

As the Reviewers suggested, we have significantly re-written both our Abstract and Introduction to refocus the framing of the paper on the power of our behavioral modeling approach. We agree that this is an important contribution of the paper.

We left some discussion of memory flexibility in the revised manuscript, now largely relegated to the Discussion section. We try to be very careful not to state that our paper discovers the flexibility of memory and better cite the relevant literature. “Flexibility” is a broad term, and reviewers correctly mention several published papers that show some forms of flexibility, both during caching and during retrieval. For example, birds prefer that contain specific types of food and are able to change this preference in different contexts. Birds can also choose caching locations based on knowledge of the environment.

Our results demonstrate a very specific type of flexibility: the use of the same exact memory to both cache and retrieve food. To our knowledge, this has not been shown in food-caching birds. Although chickadees are known to avoid caching into occupied sites, it has not been known that this behavior depends on memory. We feel that this result is important and will be interesting to scholars of memory because caching and retrieving are very distinct behaviors, and the use of memory during these behaviors serves different purposes. Avoidance of occupied sites during caching is likely a defense against pilfering, whereas attraction to these sites during retrieval serves immediate feeding needs.

We think that this type of flexibility beautifully complements the examples of flexibility described in previous studies of food caching, and we tried to clarify these points carefully in the revised Discussion. We do, however, now focus most of the paper on the contributions of our behavioral modeling.

(2) If, as the reviewers suggest, the focus of the paper shifts from a descriptive account of the behavior in the new preparation to an emphasis on the validity of the statistical models, then more needs to be done to motivate the modeling decisions. It is currently not convincing that the parameters of the model are dissociable, especially since birds can continually cache and retrieve and recache.

Here are some unanswered questions that should be addressed if we are to take the quantitative parameter estimates of the models seriously, rather than the overall goodness of fit revealing the existence of some feature of the behavior (e.g. behavior is spatially autocorrelated).

(2.1.) Is log-linear interaction of the parameters appropriate or are there more complex dependencies?

We addressed this question in the revised manuscript by fitting additional models (new Models #12-13) to account for interactions between pairs of parameters. In other words, rather than modeling log-probability only as a weighted sum of several factors, we also included second-order terms for the products of two of these factors. We tested each pair of factors separately, and used our standard model comparison to ask whether including an interaction term for those two factors justified the addition of an extra parameter.

We now include an additional supplementary figure (Figure 3—figure supplement 1) that shows all of the new results related to considering interaction terms between parameters. Briefly, these results are:

1) We indeed found one nonlinear interaction between parameters that improved the model fit: birds preferred to cache into sites that were both empty and close to the previously visited location. The preference for sites that had both of these characteristics was supralinear – i.e., not entirely accounted by the linear model.

2) However, when we accounted for this supralinear interaction and examined "mnemonic" parameters γocc and γemp (new Model #13 and #12 respectively), the best-fit values of these parameters were nearly identical to those in the original linear model (Model #3). Therefore, using a simpler linear model did not distort the main conclusions of the paper regarding the use of memory.

(2.2) Is a Gaussian spatial autocorrelation function appropriate?

We addressed this question by including a new supplementary figure that explores alternative autocorrelation functions (Figure 3—figure supplement 2) and by clarifying why we chose a Gaussian spatial autocorrelation to begin with. Note that the manuscript uses a notched Gaussian autocorrelation function: a function that is Gaussian for distance>0, but has a value of 0 at distance=0.

1) In the new supplementary figure, we show the actual autocorrelation functions – i.e., the probabilities of caching and checking at various distances away from the previous site. These plots indeed qualitatively look like notched Gaussians: in both cases probability decays with distance, but has small values near zero. Therefore, a notched Gaussian is qualitatively not an unreasonable function to use in our models.

2) In the same supplementary figure, we test two other functions that are often used for fitting distance-dependent probabilities: a notched exponential decay (new Model #14) and a Mexican hat function (new Model #15). We find that using a notched exponential does not improve the model fit. However, using a Mexican hat improves the fit, for both caches and checks. This is true for two reasons. First, the notch at zero in the autocorrelation function has a small, but nonzero width, and is therefore better described by the negative component of the Mexican hat function than a δ function. Second, the probability of caching or checking a site at zero distance is actually lower than expected by chance. The notch is therefore better fit by a slightly negative amplitude rather than a drop to zero.

3) However, in the same supplementary figure we show that using a Mexican hat function instead of a single Gaussian had no effect on the mnemonic parameters γocc and γemp (new Models #16 and #17). Therefore, using a simpler single-Gaussian model did not distort the main memory-related results of the paper. Given this result, we feel that our new observations about the exact shape of the spatial autocorrelation are interesting, but best presented in a supplementary figure. For clarity of the storyline, we left the simpler Gaussian model in the main figures.

Is the standard deviation fixed across space?

We addressed this question by fitting additional models in which standard deviation is not fixed across space. These models are presented in a new supplementary figure (Figure 3—figure supplement 3). Because walls and feeders are salient objects in the arena, we considered the simple possibility that standard deviation varies with distance from these objects. New Model #18 uses separate standard deviation values for different distances from walls, and Model #19 does so for different distances from feeders.

As Figure 3—figure supplement 3B-C shows, both new models improve the fit in the Caching task, and Model #18 improves the fit in the Retrieval task. However, just as above, we found that using these more complex models had no effect on our mnemonic parameters γocc and γemp (new Models #20 through #23). For clarity, we therefore kept the simpler single-σ models in the main figures of the paper and devoted a supplementary figure to exploring more complex models.

Across strategy?

As we reported in the original manuscript, the best-fit σprv in the Caching task was 15.5 ± 0.9 cm and 12.9 ± 1.7 cm in the Retrieval task. There was no statistically significant difference between these two values (unpaired t-test, p = 0.54). We had not emphasized this point in the original manuscript, but now do so. We also now report the difference in γprv values between the two tasks.

(2.3) Is time-dependent forgetting tested/modeled appropriately? There was an odd mismatch between the statement about preference for re-caching and the lack of a forgetting curve. What is the longest the animals will have a benefit from remembering caches? If only during one session, is this then perhaps a weakness in the task since one would expect birds in the wild to cache for longer time periods? Given that birds are rechecking single sites every 2.4 minutes, the authors should provide a more convincing argument how this unstructured task addresses the issue of memory decay.

We have made the following changes in the revised manuscript to address these questions:

(1) We now emphasize that chickadees exhibited a significant memory effect after an hour even in the experiment where they were removed from the arena and could not recheck the sites (Figure 6—figure supplement 1). We therefore think that memory in our experiment lasts substantially longer than the average 2.4 min between site checks.

(2) We performed an additional analysis, in which we quantified the effect of memory at different time delays. At each time point, we included in the analysis only those sites that had not been checked for a certain duration of time. With increasing duration, this analysis looked at progressively smaller subsets of the data. Yet, we still had statistical power to analyze durations up to about half of the behavioral session, and did not find any significant decrease of the memory effect during this time. This analysis supports the conclusion of point #1 above, and is shown in a new figure (Figure 6—figure supplement 2).

3) Although we did not detect a significant memory decay on the timescale of our experiment, we cannot exclude the possibility of forgetting that occurs afterwards. Our experiments only lasted an hour (in the Caching task), and there was no benefit in remembering site contents for periods longer than that. We therefore softened our language about longevity of memory and emphasize that we do not examine memory lasting longer than 1 h.

(4) We now better highlight published work, which suggest that the majority of chickadee caches in the wild are retrieved within minutes to a few hours. Unlike some other food-caching species, chickadees tend to use caching to maintain a stable food supply over relatively short periods of time. We therefore think that our experiments study memory on a timescale that is ethologically relevant.

(3) Please clarify how the spatial biases identified for individual birds affect caching strategy.

Currently the authors separately present first that birds have individual spatial biases and later on with a modelling approach show that previous visits and general proximity also biases the animals search and caching behaviour. But how do the two aspects relate to each other? One would assume that a general individual bias would decrease the number of possible locations the animal in general has to look at and therefore it is perhaps a mix of memories being used: the bias memory as well as the episodic memory of caching food. If one would include the bias as well as the principles that the animals have a proximity preference, how much variance will still be explained by the episodic memory? Or is the individual bias a result of the proximity bias? It would be good if the authors could create one major model which includes all identified factors to see how much which factor contributes.

We think that this might be a misunderstanding, since we already include spatial biases when quantifying other behavioral effects. Effects of proximity and of previous visits are modeled as factors that alter the probability given by pbias. I.e., positive values of these factors mean that the corresponding probabilities are higher than expected from the bird’s spatial bias, whereas negative factors mean lower probabilities. So the Reviewer is correct in that the bird’s behavior is driven by a mix of episodic memory and biases (which may or may not be memory-guided). We think that this issue can be fixed by adding some clarifying language to the main text, which we did in Lines 178-180.

(4) Please provide supplemental videos and increased clarity of the experimental timeline.

Further, it would help readers to understand the details of the behaviour if for example videos of the bird behaviour are provided with the manuscript for the different types of trials. That would help the reader understand in which times scales the animal is remembering and deciding.

We have now included supplemental videos featuring a 1-min fragment from the Caching task and an example trial from the Retrieval task (Supplemental videos 1,2). To give the reader a better sense of the time scales, we have also updated the language describing timing both in the main text, as well as in the Methods section. Note that annotations of the entire videos for all sessions will be included on a data sharing server (Dryad) upon publication.

Further, an overall schematic of the timeline for the birds would be helpful. How long were they trained and habituated on what? How many sessions per day? How many days/weeks in total? For example days/weeks could go to the right and then down would be time within a day since multiple sessions were done per day. Then it can also be marked which data was included for analysis.

More clarification of the experimental timeline is now found in the methods section. We now describe the duration of each habituation step in a more organized format. Briefly, birds were run daily, once per day, for a series of several weeks.

(5) Several technical points, if addressed, could make the statistics a bit stronger.

First, for the description of spatial biases, it appears that the same data is used to assign cluster membership and then quantify cluster separation. Some sort of cross-validation should be used throughout the analyses plotted in Figure 2 (Panel 2E notwithstanding).

Instead of using an arbitrary decision boundary, we now perform k-means clustering to group the data in Figure 2 (Lines 712). Additionally, we used cross-validation to show that birds reliably fall into the two clusters. Here, for every bird, we excluded that bird from the data, recomputed k-means clustering, then assigned the left-out bird to the cluster with the nearest centroid. In all 10 birds, this resulted in identical cluster assignments. The cross-validation is shown in a new supplemental figure (Figure 2—figure supplement 1).

Second, for the model fitting, a point estimate is used. It would be good to know the confidence around these point estimate and here a Bayesian framework would be helpful perhaps using Markov chain Monte Carlo methods to finds highest density intervals for likely parameter estimates (e.g. Annis and Palmeri 2017, doi: 10.1002/wcs.1458; Kruschke 2013 DOI: 10.1037/a0029146).

We believe that, for the population, this issue is addressed by showing the value of each parameter separately for each bird (as individual symbols in the figures), in addition to reporting the median and standard error. To provide some measure of confidence for each bird, we now also report the fraction of birds that were individually significant for each of the parameters we tested (Table 2). We measured this confidence by bootstrapping each bird’s individual sessions and refitting the parameters.

https://doi.org/10.7554/eLife.70600.sa2

Article and author information

Author details

  1. Marissa C Applegate

    Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, United States
    Contribution
    Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review and editing
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-3717-2477
  2. Dmitriy Aronov

    Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, United States
    Contribution
    Conceptualization, Formal analysis, Funding acquisition, Methodology, Resources, Supervision, Writing – original draft, Writing – review and editing
    For correspondence
    da2006@columbia.edu
    Competing interests
    No competing interests declared

Funding

National Science Foundation (Graduate Research Fellowship Program)

  • Marissa C Applegate

New York Stem Cell Foundation (Robertson Neuroscience Investigator Award)

  • Dmitriy Aronov

National Institutes of Health (NIH Director's New Innovator Award (DP2-AG071918))

  • Dmitriy Aronov

Arnold and Mabel Beckman Foundation (Beckman Young Investigator Award)

  • Dmitriy Aronov

National Institutes of Health (T32 EY013933)

  • Marissa C Applegate

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We thank D Scheck, S Hale, T Tabachnik, R Hormigo, and K Gutnichenko for technical assistance; the Black Rock Forest Consortium, J Scribner and the Hickory Hill Farm, and T Green for help with field work; L Abbott and members of the Aronov lab for comments on the manuscript. Illustrations in Figure 1B are by J Kuhl.

Ethics

All animal procedures were approved by the Columbia University Institutional Animal Care and Use Committee (Protocol AAAR5402) and carried out in accordance with the US National Institutes of Health guidelines. Wild birds were collected under Federal and NY State scientific collection licenses.

Senior Editor

  1. Laura L Colgin, University of Texas at Austin, United States

Reviewing Editor

  1. Jesse H Goldberg, Cornell University, United States

Version history

  1. Preprint posted: April 23, 2021 (view preprint)
  2. Received: May 22, 2021
  3. Accepted: December 2, 2021
  4. Version of Record published: April 25, 2022 (version 1)

Copyright

© 2022, Applegate and Aronov

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 1,736
    Page views
  • 197
    Downloads
  • 6
    Citations

Article citation count generated by polling the highest count across the following sources: PubMed Central, Crossref, Scopus.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Marissa C Applegate
  2. Dmitriy Aronov
(2022)
Flexible use of memory by food-caching birds
eLife 11:e70600.
https://doi.org/10.7554/eLife.70600

Further reading

    1. Neuroscience
    Jiajie Zou, Yuran Zhang ... Nai Ding
    Research Article

    The computational principles underlying attention allocation in complex goal-directed tasks remain elusive. Goal-directed reading, that is, reading a passage to answer a question in mind, is a common real-world task that strongly engages attention. Here, we investigate what computational models can explain attention distribution in this complex task. We show that the reading time on each word is predicted by the attention weights in transformer-based deep neural networks (DNNs) optimized to perform the same reading task. Eye tracking further reveals that readers separately attend to basic text features and question-relevant information during first-pass reading and rereading, respectively. Similarly, text features and question relevance separately modulate attention weights in shallow and deep DNN layers. Furthermore, when readers scan a passage without a question in mind, their reading time is predicted by DNNs optimized for a word prediction task. Therefore, we offer a computational account of how task optimization modulates attention distribution during real-world reading.

    1. Neuroscience
    Christian Brodbeck, Proloy Das ... Jonathan Z Simon
    Tools and Resources

    Even though human experience unfolds continuously in time, it is not strictly linear; instead, it entails cascading processes building hierarchical cognitive structures. For instance, during speech perception, humans transform a continuously varying acoustic signal into phonemes, words, and meaning, and these levels all have distinct but interdependent temporal structures. Time-lagged regression using temporal response functions (TRFs) has recently emerged as a promising tool for disentangling electrophysiological brain responses related to such complex models of perception. Here we introduce the Eelbrain Python toolkit, which makes this kind of analysis easy and accessible. We demonstrate its use, using continuous speech as a sample paradigm, with a freely available EEG dataset of audiobook listening. A companion GitHub repository provides the complete source code for the analysis, from raw data to group level statistics. More generally, we advocate a hypothesis-driven approach in which the experimenter specifies a hierarchy of time-continuous representations that are hypothesized to have contributed to brain responses, and uses those as predictor variables for the electrophysiological signal. This is analogous to a multiple regression problem, but with the addition of a time dimension. TRF analysis decomposes the brain signal into distinct responses associated with the different predictor variables by estimating a multivariate TRF (mTRF), quantifying the influence of each predictor on brain responses as a function of time(-lags). This allows asking two questions about the predictor variables: 1) Is there a significant neural representation corresponding to this predictor variable? And if so, 2) what are the temporal characteristics of the neural response associated with it? Thus, different predictor variables can be systematically combined and evaluated to jointly model neural processing at multiple hierarchical levels. We discuss applications of this approach, including the potential for linking algorithmic/representational theories at different cognitive levels to brain responses through computational models with appropriate linking hypotheses.