Reactivation strength during cued recall is modulated by graph distance within cognitive maps

Simon Kern; Juliane Nagel; Martin F. Gerchen; Cagatay Guersoy; Andreas Meyer-Lindenberg; Peter Kirsch; Raymond J. Dolan; Steffen Gais; Gordon B. Feld

doi:10.7554/eLife.93357.3

eLife assessment

This magnetoencephalography study reports important new findings regarding the nature of memory reactivation during cued recall. It replicates previous work showing that such reactivation can be sequential or clustered, with sequential reactivation being more prevalent in low performers. It adds convincing evidence, even though based on limited amounts of data, that high memory performers tend to show simultaneous (i.e., clustered) reactivation, varying in strength with item distance in the learned graph structure. The study will be of interest to scientists studying memory replay.

https://doi.org/10.7554/eLife.93357.3.sa2

Significance of findings

important: Findings that have theoretical or practical implications beyond a single subfield

Strength of evidence

convincing: Appropriate and validated methodology in line with current state-of-the-art

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate).

Abstract

Declarative memory retrieval is thought to involve reinstatement of neuronal activity patterns elicited and encoded during a prior learning episode. Furthermore, it is suggested that two mechanisms operate during reinstatement, dependent on task demands: individual memory items can be reactivated simultaneously as a clustered occurrence or, alternatively, replayed sequentially as temporally separate instances. In the current study, participants learned associations between images that were embedded in a directed graph network and retained this information over a brief 8-minute consolidation period. During a subsequent cued recall session, participants retrieved the learned information while undergoing magnetoencephalographic (MEG) recording. Using a trained stimulus decoder, we found evidence for clustered reactivation of learned material. Reactivation strength of individual items during clustered reactivation decreased as a function of increasing graph distance, an ordering present solely for successful retrieval but not for retrieval failure. In line with previous research, we found evidence that sequential replay was dependent on retrieval performance and was most evident in low performers. The results provide evidence for distinct performance-dependent retrieval mechanisms with graded clustered reactivation emerging as a plausible mechanism to search within abstract cognitive maps.

Introduction

Memory classically relies on three distinct stages: encoding (learning), consolidation (strengthening and transforming) and retrieval (reinstating) of information. New episodic memories are learned by encoding a representation, thought to be realized in a specific spatio-temporal neuronal firing pattern in hippocampal and neocortical networks (Frank et al., 2000; Preston & Eichenbaum, 2013). These firing patterns are reactivated during subsequent rest or sleep, sometimes in fast sequences, a process linked to memory consolidation (Born & Wilhelm, 2012; Feld & Born, 2017). Similarly, during retrieval, the same firing patterns seen during encoding are replayed in a manner that predicts retrieval success (Carr et al., 2011; Foster, 2017). Even though replay has been studied most intensely with respect to the hippocampus, the replay of memory traces in temporal succession is suggested as a general mechanism for planning, consolidation, and retrieval (Buhry et al., 2011). While a rich body of evidence exists in rodents (Ambrose et al., 2016; Chen & Wilson, 2023; Foster & Knierim, 2012; Ólafsdóttir et al., 2018), the contributions of replay to memory storage and retrieval in humans are only beginning to be examined (Brunec & Momennejad, 2022; Eichenlaub et al., 2020; Fuentemilla et al., 2010; Wimmer et al., 2020).

One obstacle has been the difficulty in measuring sequential replay or general network reactivation in humans (NB: here we follow the definition of Genzel et al., 2020, where reactivation is used as an umbrella term for any form of reoccurrence of a previously encoded neural pattern related to information-encoding, and replay refers to reactivation events with a temporally sequential nature). The most straightforward method is to use intracranial electroencephalography (iEEG), though this is generally only feasible in individuals undergoing evaluation for management of epilepsy (Axmacher et al., 2008; Engel et al., 2005; Staresina et al., 2015; Zhang et al., 2015). Another approach is to use functional magnetic resonance imaging (Schuck & Niv, 2019; Wittkuhn & Schuck, 2021), though this method is burdened by the sluggishness of the hemodynamic response. Researchers have recently started to leverage the spatio-temporal precision of magnetoencephalography (MEG), in combination with machine learning based brain decoding techniques, to reveal sequential replay in humans across a range of settings that includes memory, planning and inference (Eldar et al., 2018; Kurth-Nelson et al., 2016; Liu et al., 2019; Liu, Mattar, et al., 2021; McFadyen et al., 2023; Nour et al., 2021; Wimmer et al., 2020, 2023; Wise et al., 2021). Many of these studies deploy a novel statistical analysis technique, temporally delayed linear modeling (TDLM) (Liu, Dolan, et al., 2021). TDLM, and its variants, enable identification of sequential replay for previously experienced material during resting state (Liu et al., 2019; Liu, Mattar, et al., 2021), during planning of upcoming behavioral output (Eldar et al., 2020; Kurth-Nelson et al., 2016; McFadyen et al., 2023; Wise et al., 2021) and during memory retrieval (Wimmer et al., 2020).

Wimmer et al. (2020) reported sequential reactivation of episodic content following a single initial exposure during cued recall one day post-encoding. Specifically, they showed participants eight short, narrated stories, each consisting of four different visual story anchor elements taken from six different categories (faces, buildings, body parts, objects, animals, and cars) and a unique ending element. In a next-day recall session, participants were shown two story elements and asked whether both elements were part of the same story, and whether the second element appeared before or after the first. At retrieval, they showed that stories were replayed in reverse order to the prompt (i.e., when prompting element 3 and element 5, successful retrieval would traverse element 5 through 4 and arrive at element 3). However, this effect was only found in those with regular performance, while in high performers there was no evidence for temporal succession. Instead, the latter group simultaneously reactivated all related story elements in a clustered manner.

In memory research, declarative tasks often avail of item lists or paired associates (Barnett et al., 2016; Cho et al., 2020; Feld et al., 2013; Kolibius et al., 2021; Roux et al., 2022; Schönauer et al., 2014; Stadler et al., 1999, 1999). When studying sequential replay, the task structure must have a linear element (Liu et al., 2019; Liu, Mattar, et al., 2021; Wimmer et al., 2020; Wise et al., 2021) and such linearity is a defining feature of episodic memory (Tulving, 1993). By contrast, semantic memory is rarely organized linearly and instead involves complex and interconnected knowledge networks or cognitive maps (Behrens et al., 2018), motivating researchers to ask how memory works when organized into a complex graph structure (Eldar et al., 2020; G. Feld et al., 2021; Garvert et al., 2017; Schapiro et al., 2013; for an overview see Momennejad, 2020). However, little is currently known regarding the contribution of replay to consolidation and retrieval processes for information that is embedded in graph structures. In particular, the question remains how the brain keeps track of graph distances for successful recall and whether the previously found difference between high and low performers also holds true within a more complex graph learning context.

Here we examined the relationship between retrieval from a learned graph structure and reactivation and replay in a task where participants learned a directed, cyclic graph, represented by ten connected images. Eight nodes had exactly one direct predecessor and one direct successor, whereas the two hub nodes each had two direct predecessors and two direct successors (see Figure 2B). The task was arranged such that participants could not rely on simple pair mappings but needed to learn the context of each edge. Additionally, the graph structure was never shown to participants from a ‘bird's-eye view’, encouraging implicit learning of the underlying structure. Following a retention period consisting of an eight-minute eyes-closed resting state, participants completed a cued recall task, which is the focus of the current paper.

Methods

Participants

We recruited thirty participants (15 men and 15 women), between 19 and 32 years old (mean age 24.7 years). Inclusion criteria were right-handedness, no claustrophobic tendencies, no current or previous diagnosed mental disorder, non-smoker, fluency in German or English, age between 18 and 35 and normal or corrected-to-normal vision. Participants were asked to refrain from caffeine intake for four hours before the experiment. Participants were recruited through the institute’s website and mailing list and various local Facebook groups. A single participant was excluded due to a corrupted data file and replaced with another participant. We acquired written informed consent from all participants, including consent to share anonymized raw and processed data in an open access online repository. The study was approved by the ethics committee of the Medical Faculty Mannheim of Heidelberg University (ID: 2020-609). While we had preregistered the study design and an analysis approach for the resting state data (https://aspredicted.org/kx9xh.pdf, #68915), here we report analyses of the retrieval period. The current analysis conceptually replicates the analyses and hypotheses of Wimmer et al. (2020), focusing on the retrieval period, albeit in a much more complex and therefore naturalistic paradigm, and is therefore, despite not being preregistered, mainly confirmatory in nature. For transparency, we note that the findings from the preregistered analysis of the resting state data are being prepared for publication as a separate submission.

Procedure

Participants came to the laboratory for a single study session of approximately 2.5 hours. After filling out a questionnaire about their general health, their vigilance state (Stanford Sleepiness Scale, Hoddes et al., 1973) and mood (PANAS, Watson et al., 1988), participants performed five separate tasks while in the MEG scanner. First, an eight-minute eyes-closed resting state was recorded. This was followed by a localizer task (∼30 minutes), in which all 10 items were presented 50 times in pseudo-randomized order, using auditory and visual stimuli. Next, participants learned a sequence of the 10 visual items embedded into a graph structure until they achieved 80% accuracy or reached a maximum of six blocks (7 to 20 minutes). Following this, we recorded another eight-minute eyes-closed resting state to allow for initial consolidation and, finally, a cued retrieval session (four minutes). For an overview see Figure 1.

Stimulus material

Visual stimuli were taken from the colored version (Rossion & Pourtois, 2001) of the Snodgrass & Vanderwart (1980) stimulus dataset. To increase brain pattern discriminability, images were chosen with a focus on diversity of color, shape and category (see Figure 2B) and for having short descriptive words (one or two syllables) both in German and English. Auditory stimuli were created using the Google text-to-speech API, availing of the default male voice (SsmlVoiceGender.NEUTRAL) with the image description labels, either in German or English, based on participants' language preference. Auditory stimulus length ranged from 0.66 to 0.95 seconds.

Task description

Localizer task

In the localizer task, the ten graph stimulus items were shown to participants repeatedly in a pseudo-random order, where a De Bruijn sequence (DeBruijn, 1946) ensured the number of transitions between any two stimuli was equal. Two runs of the localizer were performed per participant, each comprising 250 trials with 25 repetitions per item. Each trial started with a fixation cross followed by an inter-trial interval of 0.75 to 1.25 seconds. Next, to encourage a multi-sensory neural representation, the name of the to-be-shown image was played through in-ear headphones (maximum 0.95 seconds) followed 1.25 to 1.75 seconds later by the corresponding stimulus image, shown for 1.0 second. As an attention check, in ∼4% of the trials the auditory stimulus did not match the image and participants were instructed to press a button as fast as possible to indicate detection of an incongruent auditory-visual pair. A short break of at most 30 seconds was scheduled every 80 trials. Between the two parts of the localizer task, another short break was allowed. Stimulus order was randomized and balanced between subjects. To familiarize the participant with the task, a short exemplar of the localizer task with dummy images was shown beforehand. All subsequent analyses were performed using the visual stimulus onset as a point of reference.
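
To illustrate how a De Bruijn sequence balances transitions, the following minimal sketch generates a cyclic sequence over ten items in which every ordered pair of items occurs exactly once. It uses the standard recursive construction; the function name is hypothetical, and how the sequence was extended to the 250 trials per run (and whether self-transitions were kept) is not specified here, so this should not be read as the stimulus code actually used.

```python
def de_bruijn(k: int, n: int) -> list[int]:
    """Cyclic De Bruijn sequence B(k, n) over the symbols 0..k-1.

    Every possible length-n subsequence occurs exactly once when the
    sequence is read cyclically (standard Lyndon-word construction).
    """
    a = [0] * (k * n)
    sequence: list[int] = []

    def db(t: int, p: int) -> None:
        if t > n:
            if n % p == 0:
                sequence.extend(a[1:p + 1])
        else:
            a[t] = a[t - p]
            db(t + 1, p)
            for j in range(a[t - p] + 1, k):
                a[t] = j
                db(t + 1, t)

    db(1, 1)
    return sequence


# Ten items, order 2: each ordered pair (including an item followed by
# itself) appears once, so all transitions are equally frequent.
order = de_bruijn(10, 2)
transitions = {(order[i], order[(i + 1) % len(order)]) for i in range(len(order))}
print(len(order), len(transitions))
```

Because each item also appears equally often in such a sequence, item exposure and transition counts are balanced at the same time.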

Graph-Learning

The exact same images deployed in the localizer task were randomly assigned to nodes of the graph, as shown in Figure 2B. Participants were instructed to learn a randomized sequence of elements with the goal of reaching 80% performance within six blocks of learning. During each block, participants were presented with each of the twelve edges of the graph exactly once, in a balanced, pseudo-randomized order. After a fixation cross of 3.0 seconds a first image (predecessor) was shown on the left of the screen. After 1.5 seconds, the second image (current image) appeared in the middle of the screen. After another 1.5 seconds, three possible choices were displayed in vertical order to the right of the two other images. One of the three choice options was the correct successor of the cued edge. Of the two distractor stimuli, one was chosen from a distal location on the graph (five to eight positions away from the current item), and one was chosen from a close location (two to four positions away from the current item). Neither distractor was directly connected to any of the other elements onscreen. Participants used a three-button controller to indicate their answer. The chosen item was then highlighted for 3.0 seconds, and the participant’s performance was indicated (“correct” or “wrong”) (see Figure 2C). No audio was played during learning. The participant was instructed to learn the sequence transitions by trial-and-error, and also instructed that there was no semantic connection between items (i.e., that the sequence did not follow any specific logic related to image content). Participants completed a minimum of two, and a maximum of six blocks of learning. To prevent ceiling effects, learning was discontinued if a participant reached 80% accuracy during any block. To familiarize participants with the task, a short example with dummy images was shown before the learning task.
Triplets were shown in a random order and choices were displayed in a pseudo-random position that ensured the on-screen position of the correct item could never be at the same position for more than three consecutive trials. Distractor choices were balanced such that exposure to each individual item was approximately equal.

Resting State

After graph learning, participants completed a resting state session of eight minutes. Here, they were instructed to close their eyes and “to not think of anything particular”. These resting state data are not reported here.

Retrieval

After the resting state, we presented subjects with a single retrieval session block, which followed the exact layout of the learning task with the exception that no feedback was provided as to whether entered choices were correct or incorrect (Figure 2D).

MEG Acquisition and Pre-Processing

MEG was recorded in a passively shielded room with a MEGIN TRIUX (MEGIN Oy, Helsinki, Finland) with 306 sensors (204 planar gradiometers and 102 magnetometers) at 1000 Hz with a 0.1–330 Hz band-pass acquisition filter at the ZIPP facility of the Central Institute for Mental Health in Mannheim, Germany. Before each recording, empty room measurements were used to verify that no malfunctioning sensors were present. Head movement was recorded using five head positioning coils. Bipolar vertical and horizontal electrooculography (EOG) as well as electrocardiography (ECG) were recorded. After recording, the MEGIN proprietary MaxFilter algorithm (version 2.2.14) was run using temporally extended signal space separation (tSSS) and movement correction with the MaxFilter default parameters (Taulu & Simola, 2006): a raw data buffer length of 10 s and a subspace correlation limit of .98. Bad channels were automatically detected at a detection limit of 7; none had to be excluded. The head movement correction algorithm used 200 ms windows and steps of 10 ms. The HPI coil fit accept limits were set at an error of 5 mm and a g-value of .98. Using the head movement correction algorithm, the signals were virtually re-positioned to the mean head position during the initial localizer task to ensure compatibility of sensor-level analysis across the recording blocks. The systematic trigger delay of our presentation system was measured: visual stimuli appeared consistently 19 milliseconds after their trigger value was written to the stimulus channel. However, to remain consistent with previous studies that do not report trigger delay, timings in this publication are reported uncorrected (i.e., ‘as is’, not corrected for this delay).

Data were pre-processed using Python-MNE (version 1.1, Gramfort, 2013). Data were down-sampled to 100 Hz using the MNE function ‘resample’ (with default settings, which applies an anti-aliasing filter before resampling with a brick-wall filter at the Nyquist frequency in the frequency domain) and ICA applied using the ‘picard’ algorithm (Ablin et al., 2018) on a 1 Hz high-pass filtered copy of the signal using 50 components. As recommended, ICA was set to ignore segments that were marked as bad by Autoreject (Jas et al., 2017) on two-second segments. Components belonging to EOG or ECG and muscle artifacts were identified and removed automatically using MNE functions ‘find_bads_eog’, ‘find_bads_ecg’ and ‘find_bads_emg’, using the EOG and ECG as reference signals. Finally, to reduce noise and drift, data were filtered with a high-pass filter of 0.5 Hz using the MNE filter default settings (hamming window FIR filter, −6 dB cutoff at 0.25 Hz, 53 dB stop-band attenuation, filter length 6.6 seconds).

Trials from the localizer and retrieval task were created from −0.1 to 0.5 seconds relative to visual stimulus onset to train decoders. For the sequenceness analysis related to the retrieval, trials were created from 0 to 1.5 seconds after onset of the second visual cue image. No baseline correction was applied. To detect artifacts, Autoreject was applied using default settings, which repaired segments by interpolation in case artifacts were present in only a limited number of channels and rejected trials otherwise (see Supplement 1). Finally, to improve numerical stability, signals were re-scaled to similar ranges by multiplying values from gradiometers by 1e¹⁰ and from magnetometers by 2e¹¹. These values were chosen empirically by matching histograms for both channel types. As outlier values can have significant influence on the computations, after re-scaling, values that were still above 1 or below −1 were “cutoff” and transformed to smaller values by multiplying with 1e⁻². Anonymized and maxfiltered raw data are openly available at Zenodo (https://doi.org/10.5281/zenodo.8001755), code is made public on GitHub (https://github.com/CIMH-Clinical-Psychology/DeSMRRest-clustered-reactivation).
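
The re-scaling and outlier handling described above can be sketched as follows. This is a minimal illustration under stated assumptions: the function name, the array layout (channels × time) and the boolean gradiometer mask are hypothetical, and only the scaling factors and the 1e⁻² shrinkage of values outside [−1, 1] are taken from the text.

```python
import numpy as np

GRAD_SCALE = 1e10  # factor for planar gradiometers (from the text)
MAG_SCALE = 2e11   # factor for magnetometers (from the text)


def rescale_and_shrink(data: np.ndarray, is_grad: np.ndarray) -> np.ndarray:
    """Bring both channel types to a similar range and dampen outliers.

    data:    (n_channels, n_times) sensor values in SI units
    is_grad: (n_channels,) boolean mask, True for gradiometers
    """
    factors = np.where(is_grad, GRAD_SCALE, MAG_SCALE)
    scaled = data * factors[:, None]
    # Values still outside [-1, 1] after re-scaling are treated as
    # outliers and shrunk by multiplying with 1e-2.
    outliers = np.abs(scaled) > 1
    scaled[outliers] *= 1e-2
    return scaled
```

Shrinking rather than discarding outliers keeps the trial count unchanged, which matters when only a few trials per participant are available.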

Decoding framework and training

In line with previous investigations (Kurth-Nelson et al., 2016; Liu et al., 2019; Wimmer et al., 2020) we applied Lasso regularized logistic regression on sensor-level data of localizer trials using the Python package Scikit-Learn (Pedregosa et al., 2011). Decoders were trained separately for each participant, and each stimulus, using liblinear as a solver with 1000 maximum iterations and an L1 regularization of C=6. This value was chosen because it gave the best average cross-validated peak accuracy across all participants when searching within the parameter space of C = 1 to 20 in steps of 0.5, using the same approach as outlined below (note that Scikit-Learn applies stronger regularization with lower C values, opposite to e.g., MATLAB). To circumvent class imbalance due to trials removed by Autoreject, localizer trials were stratified such that they contained an equal number of trials from each stimulus presentation by randomly removing trials from over-represented classes. Using a cross-validation scheme (leaving one trial out for each stimulus per fold, i.e., 10 trials left out per fold), for each participant the decoding accuracy was determined across time (Figure 3A). During cross-validation, for each fold, decoders were trained on data from each 10-millisecond time step and tested on left-out data from the same time step. Therefore, decoding accuracy reflects the separability of the stimulus classes by the sensor values for each time step independently. Decoders were trained using a one-vs-all approach, which means that for each class, a separate classifier was trained using positive examples (target class) and negative examples (all other classes) plus null examples (data from before stimulus presentation, see below). This approach allows the decoder to provide independent estimates of detected events for each class.
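
The class balancing and the leave-one-trial-per-stimulus cross-validation described above could be set up roughly as follows. This is a sketch only: both function names are hypothetical, and the published code may organize the folds differently.

```python
import numpy as np


def balance_classes(labels: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Indices of a subset with equally many trials per class.

    Trials from over-represented classes are removed at random.
    """
    classes, counts = np.unique(labels, return_counts=True)
    n_keep = counts.min()
    keep = [rng.choice(np.flatnonzero(labels == c), size=n_keep, replace=False)
            for c in classes]
    return np.sort(np.concatenate(keep))


def leave_one_per_class_folds(labels: np.ndarray, rng: np.random.Generator):
    """Yield (train_idx, test_idx); each test fold holds one trial per class.

    Assumes `labels` is already balanced (equal trial count per class).
    """
    classes = np.unique(labels)
    per_class = np.stack([rng.permutation(np.flatnonzero(labels == c))
                          for c in classes])
    for fold in range(per_class.shape[1]):
        test_idx = per_class[:, fold]
        train_idx = np.setdiff1d(np.arange(len(labels)), test_idx)
        yield train_idx, test_idx
```

With ten stimuli, every fold therefore tests on exactly ten held-out trials, one per stimulus, so chance level stays at 10% in each fold.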

Figure 3. A) Decoding accuracy of the currently displayed item during the localizer task for participants with a decoding accuracy higher than 30% (n=21). The mean peak time point across all participants corresponded to 210 ms, with an average peak decoding accuracy of 42% (n=21). Note that the displayed graph combines accuracies across participants, whereas peak values were computed on an individual level and then averaged. Therefore, the indicated mean of individual peaks does not match the peak of the group-level average. B) Memory performance of participants after completing the first block of learning, the last block (block 2 to 6, depending on speed of learning), and the retrieval performance. C) Classifier transfer within the localizer when trained and tested at different time points determined by cross validation. D) Classifier transfer from the localizer session to the retrieval session when trained at different time points during training and tested at different time points during cue presentation of the first (predecessor) image cue during retrieval. For C and D: Within the white outline, classification was significantly above chance level (cluster permutation testing, alpha<0.05).

For each participant, a final set of decoders (i.e., 10 decoders per participant, for each stimulus one decoder) were trained at 210 milliseconds after stimulus onset, a time point reflecting the average peak decoding time point computed for all participants (for individual decoding accuracy plots see Supplement 3). For the final decoders, data from before the auditory stimulus onset was added as a negative class with a ratio of 1:2, based upon results from previous publications reaching better sensitivity with higher negative class ratio (Liu, Dolan, et al., 2021). Adding null data allows decoders to report low probabilities for all classes simultaneously in absence of a matching signal, and reduces false positives while retaining relative probabilities between true classes. Together with the use of a sparsity constraint on the logistic regression coefficients, this increases the sensitivity of sequence detection by reducing spatial correlations of decoder weights (see also Liu, Dolan, et al., 2021). For a visualization of relevant sensor positions see Supplement 5.
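
A minimal sketch of such a set of one-vs-all decoders with an added null class is given below, using the Scikit-Learn settings stated above (L1 penalty, liblinear solver, C=6, 1000 iterations). The function names and array shapes are assumptions, and the amount of null data is left to the caller rather than fixed to the ratio used in the study.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression


def train_decoders(X, y, X_null, n_classes=10, C=6.0):
    """Train one binary decoder per stimulus.

    X:      (n_trials, n_sensors) sensor values at the training time point
    y:      (n_trials,) stimulus labels 0..n_classes-1
    X_null: (n_null, n_sensors) pre-stimulus data, negative for every decoder
    """
    X_all = np.vstack([X, X_null])
    null_target = np.zeros(len(X_null), dtype=bool)
    decoders = []
    for item in range(n_classes):
        target = np.concatenate([y == item, null_target])
        clf = LogisticRegression(penalty="l1", C=C, solver="liblinear",
                                 max_iter=1000)
        decoders.append(clf.fit(X_all, target))
    return decoders


def decode(decoders, X_new):
    """Return (n_samples, n_classes) probabilities, one column per decoder."""
    return np.column_stack([clf.predict_proba(X_new)[:, 1] for clf in decoders])
```

Because each decoder is fitted independently, the columns returned by `decode` need not sum to one, which is what allows all items to receive low probabilities at the same time when no matching signal is present.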

Decoders were then applied to trials of the retrieval session, starting from the time-point of onset of the second sequence cue (“current image”) and extending to just prior to onset of the selection prompt (1.5 seconds). For each trial, this resulted in ten probability vectors across the trial, one for each item, in steps of 10 milliseconds. These probabilities indicate the similarity of the current sensor-level activity to the activity pattern elicited by exposure to the stimulus and can therefore be used as a proxy for detecting active representations, akin to a representational similarity analysis approach (RSA, Grootswagers et al., 2017). As a sanity check, we confirmed that we could decode the currently on-screen image by applying the final trained decoders to the first image shown during retrieval (predecessor stimulus, see Figure 3D). Note that we only included data from the current image cue, and not from the predecessor image cue, as we assume the retrieval processes differ and should not be concatenated.

Sequential replay analysis

To test whether individual items were reactivated in sequence at a particular time lag, we applied temporally delayed linear modeling (TDLM, Liu, Dolan, et al., 2021) on the time span after the stimulus onset of the sequence cue (“current image”). In brief, this method approximates a time lagged cross-correlation of the reactivation strength in the context of a particular transition pattern, quantifying the strength of a certain activity transition pattern distributed in time. As input for the sequential analysis, we used the raw probabilities of the ten classifiers corresponding to the stimuli.

Using a linear model, we first estimate evidence for sequential activation of the decoded item representations at different time lags. For each item i at each time lag Δt up to 250 milliseconds we estimated a linear model of form:

$$Y_i = Y(\Delta t)\,\beta_i(\Delta t)$$

where Y_i contains the decoded probability output of the classifier of item i and Y(Δt) is simply Y time lagged by Δt. When solving this equation for β_i(Δt) we can estimate the predictive strength of Y(Δt) for the occurrence of Y_i at each time lag Δt. Calculated for each stimulus i, we then create an empirical transition matrix T_e(Δt) that indexes evidence for a transition of any item j to item i at time lag Δt (i.e., a 10×10 transition matrix per time lag, each column j contains the predictive strength of j for each item i at time lag Δt). These matrices are then combined with a ground truth transition matrix T (encoding the valid sequence transitions of interest) by taking the Frobenius inner product. This returns a single value Z_Δt for each time lag, indicating how strongly the detected transitions in the empirical data follow the expected task transitions, which we term “sequenceness”. Using different transition matrices to depict forward (T_f) and backward (T_b) replay, we quantified evidence for replay at different time lags for each trial separately. This process is applied to each trial individually, and resulting sequenceness values are averaged to provide a final sequenceness value per participant for each time lag Δt. To test for statistical significance, we create a baseline distribution by permuting the rows of the transition matrix 1000 times (creating transition matrices with random transitions; identity-based permutation, Liu, Dolan, et al., 2021) and calculate sequenceness across all time lags for each permutation. The null distribution is then constructed by taking the peak sequenceness across all time lags for each permutation.
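
The two-level logic described above can be sketched in a few lines of NumPy. This is a simplified illustration, not the TDLM implementation used in the study: the function name is hypothetical, each lag is fitted in a separate regression with only an intercept as nuisance regressor, and the empirical matrix is stored with rows indexing the earlier item j and columns the later item i (the transpose of the layout described in the text).

```python
import numpy as np


def sequenceness(probs: np.ndarray, T: np.ndarray, max_lag: int) -> np.ndarray:
    """Sequenceness for lags 1..max_lag (in samples).

    probs: (n_times, n_items) decoded probabilities of one trial
    T:     (n_items, n_items), T[j, i] = 1 if j -> i is a valid transition
    """
    n_items = probs.shape[1]
    Z = np.zeros(max_lag)
    for lag in range(1, max_lag + 1):
        Y_lagged = probs[:-lag]  # Y(dt): activity `lag` samples earlier
        Y = probs[lag:]          # Y: activity to be predicted
        design = np.column_stack([Y_lagged, np.ones(len(Y_lagged))])
        coef = np.linalg.lstsq(design, Y, rcond=None)[0]
        T_e = coef[:n_items]     # empirical transitions, T_e[j, i]: j -> i
        Z[lag - 1] = np.sum(T_e * T)  # Frobenius inner product
    return Z
```

At the 100 Hz sampling rate used here, `max_lag=25` would cover lags up to 250 milliseconds. A null distribution can be obtained by re-running the function with row-permuted copies of `T` and keeping the peak value across lags for each permutation.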

Differential reactivation analysis

To test for clustered, non-sequential reactivation, we adopted the approach used in Wimmer et al. (2020). Decoders were trained independently for each stimulus, and all decoders reacted to presentation of any visual stimulus to some extent. By using differences in reactivation between stimuli, this aggregated approach allowed us to examine more closely whether near items are more strongly activated than distant items, thereby quantifying non-sequential reactivation with greater sensitivity. For each trial, the mean probability of the two items following the current on-screen item was contrasted with the mean probability of all items further away by subtraction. We chose to combine the two following items for two reasons: first, this doubled the number of included trials; second, using this approach the number of trials for each category (“near” and “distant”) was more balanced. The two items currently displayed on-screen (i.e., predecessor and current image) were excluded. As only a few trials per participant were available for this analysis, the raw probabilities were noisy. To address this, we applied a Gaussian smoothing kernel (using scipy.ndimage.gaussian_filter with σ = 1, which corresponds approximately to weighting the surrounding time steps in both directions as follows: current time step: 40%, ±1 step: 25%, ±2 steps: 5%, ±3 steps: 0.5%) to the probability vectors across the time dimension. By shuffling the stimulus labels 1000 times, we constructed an empirical permutation distribution to determine at which time points the differential reactivation of close items was significantly above chance (α = 0.05).
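
The smoothing weights and the near-minus-distant contrast can be reproduced with a short NumPy sketch. The function names are hypothetical, and the smoothing here uses a zero-padded convolution, so values at the trial edges differ slightly from scipy.ndimage.gaussian_filter, which reflects the signal at the borders by default.

```python
import numpy as np


def gaussian_kernel(sigma: float = 1.0, truncate: float = 4.0) -> np.ndarray:
    """Normalized Gaussian weights, truncated at `truncate` standard deviations."""
    radius = int(truncate * sigma + 0.5)
    x = np.arange(-radius, radius + 1)
    weights = np.exp(-0.5 * (x / sigma) ** 2)
    return weights / weights.sum()


def smooth_time(probs: np.ndarray, sigma: float = 1.0) -> np.ndarray:
    """Smooth each item's probability time course; probs is (n_times, n_items)."""
    kernel = gaussian_kernel(sigma)
    return np.apply_along_axis(
        lambda v: np.convolve(v, kernel, mode="same"), 0, probs)


def differential_reactivation(probs, near_items, distant_items):
    """Mean probability of near items minus that of distant items, per time step."""
    smoothed = smooth_time(probs)
    return (smoothed[:, near_items].mean(axis=1)
            - smoothed[:, distant_items].mean(axis=1))


# Weights for the centre and the first three neighbouring time steps
print(np.round(gaussian_kernel()[4:8], 3))
```

For σ = 1 the kernel weights come out at roughly 0.40, 0.24, 0.05 and 0.004, in line with the approximate percentages given above.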

Graph reactivation analysis

To detect whether reactivation strength was modulated by underlying graph structure, we compared the raw reactivation strength of all items by distance on the directed graph. First, we calculated a time point of interest by computing the peak probability estimate of decoders across all trials, i.e., the average probability for each time point of all trials, of all distances except the previous on-screen item. Then, for each participant, for each trial we sorted all nodes based on their distance to the current on-screen item on the directed graph. Again, we smoothed probability values with a Gaussian kernel (σ = 1) and ignored the predecessor on-screen item. Following this, we evaluated the sorted decoder probabilities at the previously determined peak time point. Using a repeated measures ANOVA on the mean probability values per distance per participant, we then estimated whether reactivation strength was modulated by graph distance.
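
Sorting items by graph distance can be illustrated with a breadth-first search on a small directed graph. The edge list below is hypothetical: it only matches the stated properties (ten nodes, twelve edges, two hub nodes with two predecessors and two successors each) and is not the graph shown in Figure 2B. The function names are likewise assumptions.

```python
from collections import deque

# Hypothetical graph: a ring 0 -> 1 -> ... -> 9 -> 0 plus two extra edges
# that turn nodes 0 and 5 into hubs.
EDGES = [(i, (i + 1) % 10) for i in range(10)] + [(0, 5), (5, 0)]


def graph_distances(edges, start):
    """Shortest directed path length from `start` to every reachable node."""
    successors = {}
    for src, dst in edges:
        successors.setdefault(src, []).append(dst)
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in successors.get(node, []):
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return dist


def mean_probability_by_distance(probs_at_peak, dist, exclude=()):
    """Average decoded probability per graph distance.

    probs_at_peak: per-item probabilities at the peak time point
    exclude:       items to ignore, e.g. the predecessor shown on screen
    """
    grouped = {}
    for item, d in dist.items():
        if d == 0 or item in exclude:
            continue
        grouped.setdefault(d, []).append(probs_at_peak[item])
    return {d: sum(v) / len(v) for d, v in sorted(grouped.items())}
```

The per-distance means returned for each trial can then be averaged within participants before entering the repeated measures ANOVA.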

Exclusions

Replay analysis relies on a successive detection of stimuli where the chance of detection exponentially decreases with each step (e.g., detecting two successive stimuli with a chance of 30% each leaves a 9% chance of detecting a replay event). However, one needs to bear in mind that accuracy is a “winner-takes-all” metric indicating only whether the correct class received the highest probability, disregarding subtle, relative changes in assigned probability. As the methods used in this analysis operate on probability estimates and not class labels, one can expect that 30% is a rough lower bound and that the actual sensitivity within the analysis will be higher. Additionally, based on pilot data, we found that attentive participants were able to reach 30% decodability, allowing its use as a data quality check. Therefore, we decided a priori that participants with a peak decoding accuracy below 30%, as obtained from the cross-validation of localizer trials, would be excluded from the analysis (nine participants in all). Additionally, as successful learning was necessary for the paradigm, we ensured all remaining participants had a retrieval performance of at least 50% (see Supplement 2).

Results

Behavioral

All but one participant learned the sequence of ten images embedded into the directed graph with partial overlap (Supplement 3). On average, participants needed 5 blocks of learning (range 2 to 6, see Supplement 4) and attained a memory performance of 76% during their last block of learning (range: 50% to 100%). After eight minutes of rest, retrieval performance improved marginally to a mean of 82% (t=-2.053, p=0.053, effect size r=0.22, Figure 3B). Note that since the last learning block included feedback, this marginal increase cannot necessarily be attributed to consolidation processes. Additionally, we have included an analysis showing that the wrong answers participants provided were random in the first block and biased towards closer graph nodes in later blocks. This is consistent with participants actually learning the underlying graph structure as opposed to independent triplets (see figure and legend of Supplement 6 for details).

Decoder training

We first confirmed that we could decode brain activity elicited by the ten items using a cross-validation approach. Indeed, decoders separated the items presented during the localizer task well (see Figure 3A), with an average peak decoding accuracy of ∼42% across all participants (range: 32% to 57%; chance level: 10%; excluding participants with peak accuracy < 30%; for all participants see Supplement 3). We calculated the time point of the mean peak accuracy for each participant separately and subsequently used the average best time point across all included participants, at 206 milliseconds (rounded to 210 milliseconds), for training of our final decoders. This value is very close to time points found in previous studies (Kurth-Nelson et al., 2016; Liu et al., 2019; Liu, Mattar, et al., 2021; Wimmer et al., 2020). The decoders also transferred well to stimulus presentation during the retrieval trials and could decode the currently prompted image cue significantly above chance (cluster permutation test, see Figure 3D).
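The selection of the decoder training time point can be sketched as follows. The accuracy curves are invented for illustration, the function names are hypothetical, and rounding to a 10 ms grid is an assumption made here to reproduce the step from 206 to 210 milliseconds.

```python
def peak_time_ms(accuracy_by_time):
    """Time point (ms) with the highest cross-validated accuracy for one participant."""
    return max(accuracy_by_time, key=accuracy_by_time.get)


def group_training_time(curves, step_ms=10):
    """Average the per-participant peak times and round to the nearest grid step."""
    peaks = [peak_time_ms(curve) for curve in curves]
    mean_peak = sum(peaks) / len(peaks)
    rounded = round(mean_peak / step_ms) * step_ms
    return mean_peak, rounded


# Hypothetical accuracy curves (time in ms -> accuracy) for five participants
curves = [
    {190: 0.35, 200: 0.41, 210: 0.38},
    {190: 0.30, 200: 0.44, 210: 0.40},
    {200: 0.36, 210: 0.45, 220: 0.39},
    {200: 0.33, 210: 0.50, 220: 0.42},
    {200: 0.31, 210: 0.38, 220: 0.34},
]
mean_peak, training_time = group_training_time(curves)
```

With these invented curves the individual peaks lie at 200 or 210 ms, so the group mean falls between grid points and is rounded to the nearest one.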

Sequential forward replay in subjects with lower memory performance

Next, we assessed whether there was evidence for sequential replay of the learned sequences during cued recall. Using TDLM, we asked whether decoded reactivation probabilities followed a sequential temporal pattern, in line with transitions on the directed graph. Here we focused on all allowable graph transitions and analyzed the entire time window of 1500 milliseconds after onset of the retrieval cue (“current image”). We found positive forward sequenceness across all time lags, with a significant increase at a state-to-state lag of around 40-50 milliseconds (Figure 4A). As discussed in Liu, Dolan, et al. (2021), correction for multiple comparisons for this sequenceness measure across time is non-trivial, and the maximum of all permutations represents a highly conservative statistic. Due to this complexity, we also report the 95th percentile of sequenceness maxima across time per permutation. Nevertheless, as we did not have a pre-defined time lag of interest, and to mitigate the multiple-comparisons problem, we additionally computed the mean sequenceness across all computed time lags for each participant (similar to what was previously proposed in the context of a sliding window approach in Wise et al., 2021). This measure can help reveal an overall tendency for replay of task states that is invariant to a specific time lag. Our results show that, across all participants, there is a significant increase in task-related forward sequential reactivation of states (p=0.027, two-sided permutation test with 1000 permutations; 95% of permutation maxima reached at 40-50 ms, Figure 4B). Following up on this, in a second analysis, we asked whether mean sequential replay was associated with memory performance and found a significant negative correlation between retrieval performance and forward replay (forward: r=-0.46, p=0.031; backward: r=-0.13, p=0.56, see Figure 4C). In line with previous results (Wimmer et al., 2020), low-performing participants had higher forward sequenceness when compared to high-performing participants, whose mean sequenceness tended towards zero.
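The two significance thresholds and the lag-invariant test can be sketched on simulated data. This is not TDLM itself, only the thresholding applied to its output. The null values are random noise, the observed mean of 0.5 is invented, and the use of absolute values for the maxima is an assumption of this sketch.

```python
import random


def lag_maxima(perm_sequenceness):
    """Largest absolute sequenceness across time lags, one value per permutation."""
    return [max(abs(v) for v in lags) for lags in perm_sequenceness]


def percentile(values, q):
    """Nearest-rank percentile of a list; q must be an integer between 1 and 100."""
    ordered = sorted(values)
    rank = max(1, (q * len(ordered) + 99) // 100)  # integer ceiling of q * n / 100
    return ordered[rank - 1]


def two_sided_p(observed, null_values):
    """Share of null values that are at least as extreme as the observed value."""
    extreme = sum(1 for v in null_values if abs(v) >= abs(observed))
    return extreme / len(null_values)


random.seed(0)
n_perms, n_lags = 1000, 25  # hypothetical: 1000 permutations, lags of 10 to 250 ms
null = [[random.gauss(0, 1) for _ in range(n_lags)] for _ in range(n_perms)]

maxima = lag_maxima(null)
conservative_threshold = max(maxima)           # maximum over all permutations
percentile_threshold = percentile(maxima, 95)  # 95th percentile of the maxima

# Lag-invariant test: compare the mean across lags with the permutation means
null_means = [sum(lags) / n_lags for lags in null]
p_value = two_sided_p(0.5, null_means)
```

Averaging across lags yields a single statistic per permutation, so no correction across time lags is needed for that test.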

A) Strength of forward and backward sequenceness across different time lags up to 250 ms during the 1500 ms window after cue onset. Two significance thresholds are shown: a conservative threshold given by the maximum of 1000 permutations of classification labels across all time lags, and the 95th percentile (see Methods section for details). B) Permutation distribution of mean sequenceness values across 1000 state permutations. Observed mean sequenceness is indicated with a red line. C) Association between memory performance and mean sequenceness value computed across all trials and time lags for each participant.

Closer nodes show stronger reactivation than distant nodes

Next, in a complementary analysis, we asked whether a non-sequential clustered reactivation of items occurs after onset of a cue image (as shown previously for high performers in Wimmer et al., 2020). We compared reactivation strength of the two items following the cue image with that of all items at a distance of more than two steps, subtracting the mean decoded reactivation probabilities from each other. Using this differential reactivation, we found evidence consistent with near items being reactivated significantly more strongly than items further away within a time window of 220 ms to 260 ms after cue onset (Figure 5A, p<0.05, permutation test with 10000 shuffles).
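The differential reactivation measure amounts to a subtraction of two averaged probability time courses. A minimal sketch follows, with hypothetical item labels, invented probabilities, and only three time points; the permutation test is not included.

```python
def mean_over_items(prob_by_item, items):
    """Average the decoded probability time courses of the given items."""
    n_times = len(next(iter(prob_by_item.values())))
    return [sum(prob_by_item[i][t] for i in items) / len(items)
            for t in range(n_times)]


def differential_reactivation(prob_by_item, near_items, distant_items):
    """Near-minus-distant mean decoded probability at every time point."""
    near = mean_over_items(prob_by_item, near_items)
    distant = mean_over_items(prob_by_item, distant_items)
    return [n - d for n, d in zip(near, distant)]


# Hypothetical decoded probabilities at three time points after cue onset
prob_by_item = {
    "B": [0.10, 0.30, 0.10],  # one step ahead of the cue
    "C": [0.10, 0.20, 0.10],  # two steps ahead of the cue
    "D": [0.10, 0.10, 0.10],  # more than two steps away
    "E": [0.10, 0.10, 0.10],  # more than two steps away
}
diff = differential_reactivation(prob_by_item, ["B", "C"], ["D", "E"])
peak_index = max(range(len(diff)), key=lambda t: diff[t])
```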

A) Decoded raw probabilities for off-screen items that were up to two steps ahead of the current stimulus cue (‘near’) vs. distant items that were more than two steps away on the graph, on trials with correct answers. The median peak decoded probability was at the same time point for near and distant items. Note that the displayed lines reflect the average probability while, to eliminate the influence of outliers, the peak displays the median. B) Differential reactivation probability between off-screen items that were up to two steps ahead of the current stimulus cue vs. distant items that were more than two steps away on the graph, for trials with correct answers. Between 220 and 260 ms, the next items are simultaneously reactivated significantly more than items that are further away (p<0.05; permutation test with 10000 shuffles). C) Reactivation strength of items after retrieval cue onset by distance of items to the currently on-screen stimulus, subdivided into trials in which participants answered correctly (left) and trials in which participants did not know the correct answer (right). A correlation between reactivation strength and distance can only be seen in the case of successful retrieval (but see also limitations for a discussion of the low trial and participant number in this sub-analysis). Mean probability values are marked by black dots. D) Mean differential reactivation at peak time point (220-260 milliseconds) during all learning trials (before consolidation) compared to retrieval trials. E) Example activations of a successful retrieval (left) and a failed retrieval (right), sorted by distance to current cue. Colors indicate probability estimates of the decoders.

To further explore the relation between reactivation strength and graph distance, we analyzed the mean reactivation strength by item distance at peak classifier probabilities and found that reactivation strength was significantly related to graph distance (repeated measures ANOVA, F(4, 80)=2.98, p=0.023, Figure 5B). When subdividing trials into correct and incorrect responses, we found that this relationship was only significant for trials where a participant successfully retrieved the currently prompted sequence excerpt (repeated measures ANOVA, F(4, 80)=5.0, p=0.001 for correctly answered trials, Figure 5C). For incorrect trials we found no evidence for this relationship (F(4, 48)=1.45, p=0.230 for incorrectly answered trials), although we found no interaction between distance and response type (F(4, 48)=1.8, p=0.13). Note that the last two analyses are based on n = 14, since 7 participants had no incorrect trials.
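For readers unfamiliar with the test, a one-way repeated measures ANOVA can be computed from sums of squares as in the sketch below. The toy data are invented, the function name is hypothetical, and the p-value lookup in the F distribution is omitted; a statistics package would normally be used.

```python
def repeated_measures_anova(data):
    """One-way repeated measures ANOVA.

    data: one row per participant, one column per condition (e.g., graph distance).
    Returns (F, df_condition, df_error).
    """
    n = len(data)     # number of participants
    k = len(data[0])  # number of conditions
    grand_mean = sum(sum(row) for row in data) / (n * k)
    condition_means = [sum(row[j] for row in data) / n for j in range(k)]
    participant_means = [sum(row) / k for row in data]

    ss_condition = n * sum((m - grand_mean) ** 2 for m in condition_means)
    ss_participant = k * sum((m - grand_mean) ** 2 for m in participant_means)
    ss_total = sum((x - grand_mean) ** 2 for row in data for x in row)
    # Between-participant variance is removed from the error term
    ss_error = ss_total - ss_condition - ss_participant

    df_condition = k - 1
    df_error = (k - 1) * (n - 1)
    f_value = (ss_condition / df_condition) / (ss_error / df_error)
    return f_value, df_condition, df_error


# Invented toy data: three participants, three conditions
toy = [[1, 2, 3], [2, 3, 5], [3, 4, 4]]
f_value, df_condition, df_error = repeated_measures_anova(toy)
```

By the same formulas, 21 participants and five distance levels give (5 − 1) = 4 and (5 − 1)(21 − 1) = 80 degrees of freedom, which is consistent with the F(4, 80) reported for the analysis of all trials.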

To examine how the 8-minute consolidation period affected reactivation, we looked post hoc at relevant measures across learning trials in contrast to retrieval trials. For all learning trials, for each participant, we calculated differential reactivation for the same time window that we found significant in the previous analysis (220-260 milliseconds). On average, differential reactivation probability increased from pre to post resting state; however, the effect was non-significant (t = −1.78, p = 0.08) (Figure 5D). Raw mean probabilities between learning and retrieval blocks for near and distant items are shown in Supplement 9.

Questionnaire results

Participants were concentrated and alert as indicated by the Stanford Sleepiness Scale (M = 2.3, SD = 0.6, range 1-3). Participants’ summed positive affect score was on average 33.2 (SD = 4.5), and their summed negative affect score was on average 12.2 (SD = 1.9) (PANAS). Individual questionnaire answers for each included participant are available in the supplementary download in the code repository at GitHub.

Discussion

We combined a graph-based learning task with machine learning to study neuronal events linked to memory retrieval. Participants learned triplets of associated images by trial and error, where these were components of a simple directed graph with ten nodes and twelve edges. Using machine learning decoding of simultaneously recorded MEG data we asked what brain processes are linked to retrieval of this learned information, and how this relates to the underlying graph structure. We show that learned graph items are retrieved by a simultaneous, clustered, reactivation of items and that the associated reactivation strength relates to graph distances.

Memory retrieval is thought to involve reinstatement of previously evoked item-related neural activity patterns (Danker & Anderson, 2010; Johnson & Rugg, 2007; Staresina et al., 2012). Both spatial and abstract information is purported to be encoded into cognitive maps within the hippocampus and related structures (Behrens et al., 2018; Bellmund et al., 2018; Epstein et al., 2017; Garvert et al., 2017; O’Keefe & Nadel, 1979; Peer et al., 2021). While, for example, spatial distance within cognitive maps is encoded within hippocampal firing patterns (Theves et al., 2019), it is unclear how competing, abstract, candidate representations are accessed during retrieval (Kerrén et al., 2018, 2022; Spiers, 2020). Two separate mechanisms seem plausible. First, depth-first search might enable inference in not yet fully consolidated cognitive maps by sequential replay of potential candidates (Mattar & Daw, 2018; Nyberg et al., 2022). Second, breadth-first search could be deployed, involving simultaneous activation of candidates when these are sufficiently consolidated within maps that support non-interfering co-reactivation of competing representations (Mattar & Lengyel, 2022), or when exhaustive replay would be too expensive computationally. Indeed, consistent with this, Wimmer et al. (2020) showed that for regular memory performance, sequential and temporally spaced reactivation of items seems to ‘piece together’ individual elements. This contrasted with high performers, who showed a clustered, simultaneous reactivation profile. We replicate this clustered reactivation and show that its strength reflects distance on a graph structure. This complements previous findings of graded pattern similarity during memory search representing distance within the search space (Manning et al., 2011; Tarder-Stoll et al., 2023). As this effect was evident only for correct choices, the finding points to its importance for task performance.

As per Wimmer et al. (2020), we found that the strength of replay was related to weaker memory performance. This suggests that the expression of sequential replay or simultaneous reactivation depends on the stability of an underlying memory trace. However, we acknowledge that it remains unclear which factors enable recruitment of either of these mechanisms. A crucial step in consolidation encompasses an integration of memory representations into existing networks (Dudai et al., 2015; Sekeres et al., 2017). In Wimmer et al. (2020), participants had little exposure to the learning material and replay was measured after a substantial retention period that included sleep, where the latter is considered to strengthen and transform memories via repeated replay (Diekelmann & Born, 2010; Feld & Born, 2017). This contrasts with the current task design, which solely involved several blocks of learning and retrieval and only a relatively brief period of consolidation.

Intriguingly, it has been speculated that retrieval practice may elicit the same transformation of memory traces as offline replay (Antony et al., 2017). In line with this reasoning, it is possible that both consolidation during sleep and repeated practice have similar effects on the transformation of memories, and consequently on mechanisms that support their subsequent retrieval. This possibility is especially interesting in the light of retrieval practice enhancing memory performance more than is the case for restudy (McDermott, 2021), a finding also in line with evidence that replay during rest prioritizes weakly learned memories (Schapiro et al., 2018). It is known that retrieval practice reduces the pattern similarity of competing memory traces in the hippocampus (Hulbert & Norman, 2015) and, as in the case of our graph-based task, may enable clustered reactivation since differences in timing of reactivation are no longer required to distinguish correct from incorrect items. Therefore, we speculate that clustered reactivation may be a physiological correlate of retrieval facilitated either by repeated retrieval testing-based learning (as in our study) or by sleep dependent memory consolidation (as in Wimmer et al., 2020). This implies that there may be a switch from sequential replay to clustered reactivation corresponding to when learned material can be accessed simultaneously without interference. This suggestion could be systematically investigated by, for example, manipulating retrieval practice, retention interval, and the difficulty of a graph-based task. Nevertheless, even though our results show a nominal, non-significant increase in reactivation from learning to retrieval (see Figure 5D), due to experimental design features our data do not enable us to test for a hypothesized switch for sequential replay (see also “limitations” and Supplement 8). 
Finally, even though we primarily focused on the mean sequenceness scores across time lags, there appears to be a (non-significant) peak at 40-60 milliseconds. While simultaneous forward and backward replay is theoretically possible, we acknowledge that it is somewhat surprising and, given our paradigm, could relate to other factors such as autocorrelations (Liu, Dolan, et al., 2021).

Limitations

There are limitations to our study, many of which originate from a suboptimal study design that resulted in a relatively limited number of trials for the retrieval session per participant. Additionally, as we performed criteria learning, a sub-group analysis as in Wimmer et al. (2020) was not feasible, as median performance in our sample was 83% (mean 81%), with six participants exactly at that threshold, resulting in a very high cut-off. Our design also meant that participants had different numbers of learning blocks (two to six blocks, see Supplement 4), making a comparison of learning progress across participants difficult. While we closely follow the analysis approach taken in Wimmer et al. (2020), we did not explicitly preregister the confirmatory analysis of the retrieval data as such. We do acknowledge that only a somewhat limited number of trials was available for analysis, affecting especially the analysis of incorrect answers. In addition, the number of low-performing participants was low in our study, which would render a performance-dependent sub-analysis underpowered. Finally, we want to acknowledge that by selecting a time window for the clustered reactivation, we cannot distinguish very fast replay events (≤30 ms) from clustered reactivation if they are contained exactly within that specific reactivation analysis time window.

Conclusion

Our findings support a role for a clustered reactivation mechanism for well-learned items during memory retrieval. When interconnected semantic information is retrieved, the retrieval process seems to resemble a breadth-first search, with items sorted by neural activation strength. Additionally, we find that the presence of sequential replay is related to low memory performance. The likely coexistence of two types of retrieval process, recruited depending on the participants’ learning experience, is an important direction for future research. The use of more complex memory tasks, such as explicitly learned associations of graph networks, should enable a more systematic study of this process. Finally, we suggest that accessing information embedded in a knowledge network may benefit from recruitment of either process, replay or reactivation, on the fly.

Data availability

MaxFiltered and anonymized MEG raw data as well as behavioural results are available at Zenodo (https://doi.org/10.5281/zenodo.8001755).

Code availability

The code of the analysis as well as the experiment paradigm and the stimulus material is available at https://github.com/CIMH-Clinical-Psychology/DeSMRRest-clustered-reactivation.

Acknowledgements

This research was supported by an Emmy-Noether research grant awarded to GBF by the DFG (FE1617/2-1) and a project grant by the DGSM as well as a doctoral scholarship of the German Academic Scholarship Foundation, both awarded to SK. Additionally, we want to thank the ZIPP core facility of the Central Institute of Mental Health for their generous support of the study.

Supplement

Percentage of rejected trials for each participant. Artifacts were detected automatically by AutoReject. If possible, channels were interpolated for the affected time span, else the trial was rejected. The figure displays the ratios as well as the absolute number of rejected epochs for each participant in the study. The analysis is based on the remaining non-rejected epochs. For the retrieval, on average 11.5 epochs were available, in total 252 across the study.

Excluded participants based on decoding accuracy and memory performance during retrieval. Peak decoding accuracy was determined by a leave-one-per-class-out cross-validation across time for each participant. Memory performance was the percentage of correct responses during the twelve retrieval trials.

Decoding accuracy across time determined by a leave-one-per-class-out cross-validation per participant. For details on decoder training see the methods section.

Number of learning blocks that each participant completed. The number of learning blocks was adapted to the speed of the participant such that each participant had a similar performance at their last block. Learning was stopped if participants reached at least 80% memory performance in a block or if they reached 6 blocks. A minimum of two blocks were shown, even if participants reached above 80% in their first block (by chance).
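The stopping rule described in this legend can be expressed as a small sketch. The function name and example values are hypothetical, and the sketch assumes that the criterion is checked on the most recent block only, which the legend does not state explicitly.

```python
def should_stop_learning(block_accuracies, criterion=0.80, min_blocks=2, max_blocks=6):
    """Decide whether learning ends after the blocks completed so far.

    block_accuracies: memory performance (0 to 1) of each completed block, in order.
    """
    n_blocks = len(block_accuracies)
    if n_blocks < min_blocks:
        return False  # always show at least two blocks
    if n_blocks >= max_blocks:
        return True   # hard limit of six blocks
    # Assumption: only the latest block is compared with the criterion
    return block_accuracies[-1] >= criterion


stop_after_first = should_stop_learning([0.90])          # False: minimum not reached
stop_after_second = should_stop_learning([0.50, 0.85])   # True: criterion reached
stop_at_limit = should_stop_learning([0.5] * 6)          # True: six blocks completed
```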

Percentage of sensors relevant for each image across all participants (beta weight of sensor location unequal to zero). Larger/darker dots indicate that more participants’ decoders used information from this sensor. LASSO/L1 regularization forces individual regressor values of the classifier belonging to a specific sensor to 0, such that only a sparse set of sensors contributes information to the decision process. The plot shows the average ratio with which a sensor was included across participants, giving a rough estimate of the location of stimulus processing. The largest dot indicates that this sensor was used for all participants for this image. The smallest/lightest dot indicates that almost no participant’s decoder used information from this sensor. Please note that the MEG head positioning was not aligned between participants, such that the average dots do not indicate a specific location but only a broad region.

During the learning and retrieval blocks, participants were presented with two lures next to the correct answer to complete the triplet, one of which was closer to the target and one further away on the graph. To show that participants indeed learned the graph structure and not just triplets, the figure shows the ratio of close lures chosen vs. lures that were further away on the graph. In the first learning block, the chosen lure is random, as participants have not yet learned the graph structure. In the last learning block, many participants exclusively choose the closer lure, indicating that they are aware of the approximate distances of the presented stimuli. Note, however, that the analysis relies solely on trials with incorrect responses. Therefore, the apparent (non-significant) drop of the ratio from the last block to the retrieval block can be attributed to participants reaching ceiling performance. Additionally, the number of blocks was determined by the learning speed of the participant (with a minimum of two learning blocks), making it hard to compare between participants with different numbers of learning blocks. Therefore, we have decided to plot the first, last and retrieval blocks, as they were defined for each participant. An ANOVA indicated that the three blocks were significantly different (F=7.5, p=0.001). A post hoc t-test indicated a significant difference between the first and last block (t=-4.3, p<0.0001) and between the first block and the retrieval session (t=-2.0, p=0.046), and no difference between the last and retrieval block (t=1.4, p=0.16).

Reactivation strength of items after retrieval cue onset by distance of items to the currently on-screen stimulus. A significant negative correlation between distance on the directed graph and reactivation strength can be seen (p=0.008). The correlation is shown for both correct and incorrect answers. For a sub-analysis of correct and incorrect answers, see Figure 5C.

Sequential replay for all learning blocks. A) Strength of forward and backward sequenceness across different time lags (see Methods section and Figure 4 for details). B) Permutation distribution of mean sequenceness values across 1000 state permutations. C) Association between memory performance and mean sequenceness value computed across all trials and time lags for each participant. Note: As the paradigm applied criteria learning, participants had different numbers of blocks and hence different exposure at different time points (see Supplement 4), making a block-wise comparison between participants conceptually difficult. Therefore, to alleviate the bias of different learning speeds, we combined all trials of the learning blocks.

Mean raw probabilities of near vs. far item reactivation at peak time point (210-240 ms, see Figure 5B) during learning and retrieval blocks. Reactivation markers increased for near items while they slightly decreased for far items. However, all interactions are non-significant. Note that direct comparison of raw probabilities between different parts of the recording session might be difficult to interpret due to baseline probability shifts (e.g., due to sensor distance or head position changes).

References

Ablin P., Cardoso J.-F., Gramfort A. (2018). Faster Independent Component Analysis by Preconditioning With Hessian Approximations. IEEE Transactions on Signal Processing 66:4040–4049. https://doi.org/10.1109/TSP.2018.2844203
Ambrose R. E., Pfeiffer B. E., Foster D. J. (2016). Reverse Replay of Hippocampal Place Cells Is Uniquely Modulated by Changing Reward. Neuron 91:1124–1136. https://doi.org/10.1016/j.neuron.2016.07.047
Antony J. W., Ferreira C. S., Norman K. A., Wimber M. (2017). Retrieval as a Fast Route to Memory Consolidation. Trends in Cognitive Sciences 21:573–576. https://doi.org/10.1016/j.tics.2017.05.001
Axmacher N., Elger C. E., Fell J. (2008). Ripples in the medial temporal lobe are relevant for human memory consolidation. Brain 131:1806–1817. https://doi.org/10.1093/brain/awn103
Barnett J. H., Blackwell A. D., Sahakian B. J., Robbins T. W. (2016). The Paired Associates Learning (PAL) Test: 30 Years of CANTAB Translational Neuroscience from Laboratory to Bedside in Dementia Research. In: Robbins T. W., Sahakian B. J. (Eds.), Translational Neuropsychopharmacology. Springer International Publishing, pp. 449–474. https://doi.org/10.1007/7854_2015_5001
Behrens T. E. J., Muller T. H., Whittington J. C. R., Mark S., Baram A. B., Stachenfeld K. L., Kurth-Nelson Z. (2018). What Is a Cognitive Map? Organizing Knowledge for Flexible Behavior. Neuron 100:490–509. https://doi.org/10.1016/j.neuron.2018.10.002
Bellmund J. L. S., Gärdenfors P., Moser E. I., Doeller C. F. (2018). Navigating cognition: Spatial codes for human thinking. Science 362:eaat6766. https://doi.org/10.1126/science.aat6766
Born J., Wilhelm I. (2012). System consolidation of memory during sleep. Psychological Research 76:192–203. https://doi.org/10.1007/s00426-011-0335-6
Brunec I. K., Momennejad I. (2022). Predictive Representations in Hippocampal and Prefrontal Hierarchies. The Journal of Neuroscience 42:299–312. https://doi.org/10.1523/JNEUROSCI.1327-21.2021
Buhry L., Azizi A. H., Cheng S. (2011). Reactivation, replay, and preplay: How it might all fit together. Neural Plasticity 2011. https://doi.org/10.1155/2011/203462
Carr M. F., Jadhav S. P., Frank L. M. (2011). Hippocampal replay in the awake state: A potential substrate for memory consolidation and retrieval. Nature Neuroscience 14:147–153. https://doi.org/10.1038/nn.2732
Chen Z. S., Wilson M. A. (2023). How our understanding of memory replay evolves. Journal of Neurophysiology 129:552–580. https://doi.org/10.1152/jn.00454.2022
Cho K. W., Tse C.-S., Chan Y.-L. (2020). Normative data for Chinese-English paired associates. Behavior Research Methods 52:440–445. https://doi.org/10.3758/s13428-019-01240-2
Danker J. F., Anderson J. R. (2010). The ghosts of brain states past: Remembering reactivates the brain regions engaged during encoding. Psychological Bulletin 136:87–102. https://doi.org/10.1037/a0017937
DeBruijn N. G. (1946). A combinatorial problem. Proceedings of the Section of Sciences of the Koninklijke Nederlandse Akademie van Wetenschappen Te Amsterdam 49:758–764.
Diekelmann S., Born J. (2010). The memory function of sleep. Nature Reviews Neuroscience 11:114–126. https://doi.org/10.1038/nrn2762
Dudai Y., Karni A., Born J. (2015). The Consolidation and Transformation of Memory. Neuron 88:20–32. https://doi.org/10.1016/j.neuron.2015.09.004
Eichenlaub J.-B., Jarosiewicz B., Saab J., Franco B., Kelemen J., Halgren E., Hochberg L. R., Cash S. S. (2020). Replay of Learned Neural Firing Sequences during Rest in Human Motor Cortex. Cell Reports 31:107581. https://doi.org/10.1016/j.celrep.2020.107581
Eldar E., Bae G. J., Kurth-Nelson Z., Dayan P., Dolan R. J. (2018). Magnetoencephalography decoding reveals structural differences within integrative decision processes. Nature Human Behaviour 2:670–681. https://doi.org/10.1038/s41562-018-0423-3
Eldar E., Lièvre G., Dayan P., Dolan R. J. (2020). The roles of online and offline replay in planning. eLife 9:e56911.
Engel A. K., Moll C. K. E., Fried I., Ojemann G. A. (2005). Invasive recordings from the human brain: Clinical insights and beyond. Nature Reviews Neuroscience 6. https://doi.org/10.1038/nrn1585
Epstein R. A., Patai E. Z., Julian J. B., Spiers H. J. (2017). The cognitive map in humans: Spatial navigation and beyond. Nature Neuroscience 20:1504–1513. https://doi.org/10.1038/nn.4656
Feld G. B., Born J. (2017). Sculpting memory during sleep: Concurrent consolidation and forgetting. Current Opinion in Neurobiology 44:20–27.
Feld G. B., Lange T., Gais S., Born J. (2013). Sleep-Dependent Declarative Memory Consolidation—Unaffected after Blocking NMDA or AMPA Receptors but Enhanced by NMDA Coagonist D-Cycloserine. Neuropsychopharmacology 38. https://doi.org/10.1038/npp.2013.179
Feld G., Bernard M., Rawson A., Spiers H. (2021). Learning graph networks: Sleep targets highly connected global and local nodes for consolidation [Preprint]. Neuroscience. https://doi.org/10.1101/2021.08.04.455038
Foster D. J. (2017). Replay Comes of Age. Annual Review of Neuroscience 40:581–602. https://doi.org/10.1146/annurev-neuro-072116-031538
Foster D. J., Knierim J. J. (2012). Sequence learning and the role of the hippocampus in rodent navigation. Current Opinion in Neurobiology 22:294–300. https://doi.org/10.1016/j.conb.2011.12.005
Frank L. M., Brown E. N., Wilson M. (2000). Trajectory Encoding in the Hippocampus and Entorhinal Cortex. Neuron 27:169–178. https://doi.org/10.1016/S0896-6273(00)00018-0
Fuentemilla L., Penny W. D., Cashdollar N., Bunzeck N., Düzel E. (2010). Theta-Coupled Periodic Replay in Working Memory. Current Biology 20:606–612. https://doi.org/10.1016/j.cub.2010.01.057
Garvert M. M., Dolan R. J., Behrens T. E. (2017). A map of abstract relational knowledge in the human hippocampal–entorhinal cortex. eLife 6:e17086.
Gramfort A. (2013). MEG and EEG data analysis with MNE-Python. Frontiers in Neuroscience 7. https://doi.org/10.3389/fnins.2013.00267
Grootswagers T., Wardle S. G., Carlson T. A. (2017). Decoding Dynamic Brain Patterns from Evoked Responses: A Tutorial on Multivariate Pattern Analysis Applied to Time Series Neuroimaging Data. Journal of Cognitive Neuroscience 29:677–697. https://doi.org/10.1162/jocn_a_01068
Hoddes E., Zarcone V., Smythe H., Phillips R., Dement W. C. (1973). Quantification of Sleepiness: A New Approach. Psychophysiology 10:431–436. https://doi.org/10.1111/j.1469-8986.1973.tb00801.x
Hulbert J. C., Norman K. A. (2015). Neural Differentiation Tracks Improved Recall of Competing Memories Following Interleaved Study and Retrieval Practice. Cerebral Cortex 25:3994–4008. https://doi.org/10.1093/cercor/bhu284
Jas M., Engemann D. A., Bekhti Y., Raimondo F., Gramfort A. (2017). Autoreject: Automated artifact rejection for MEG and EEG data. NeuroImage 159:417–429. https://doi.org/10.1016/j.neuroimage.2017.06.030
Johnson J. D., Rugg M. D. (2007). Recollection and the reinstatement of encoding-related cortical activity. Cerebral Cortex 17:2507–2515. https://doi.org/10.1093/cercor/bhl156
Kerrén C., Linde-Domingo J., Hanslmayr S., Wimber M. (2018). An Optimal Oscillatory Phase for Pattern Reactivation during Memory Retrieval. Current Biology 28:3383–3392. https://doi.org/10.1016/j.cub.2018.08.065
Kerrén C., van Bree S., Griffiths B. J., Wimber M. (2022). Phase separation of competing memories along the human hippocampal theta rhythm. eLife 11:e80633. https://doi.org/10.7554/eLife.80633
Kolibius L. D., Born J., Feld G. B. (2021). Vast Amounts of Encoded Items Nullify but Do Not Reverse the Effect of Sleep on Declarative Memory. Frontiers in Psychology 11:607070. https://doi.org/10.3389/fpsyg.2020.607070
Kurth-Nelson Z., Economides M., Dolan R. J., Dayan P. (2016). Fast Sequences of Non-spatial State Representations in Humans. Neuron 91:194–204. https://doi.org/10.1016/j.neuron.2016.05.028
Liu Y., Dolan R. J., Higgins C., Penagos H., Woolrich M. W., Ólafsdóttir H. F., Barry C., Kurth-Nelson Z., Behrens T. E. (2021). Temporally delayed linear modelling (TDLM) measures replay in both animals and humans. eLife 10:e66917.
Liu Y., Dolan R. J., Kurth-Nelson Z., Behrens T. E. J. (2019). Human Replay Spontaneously Reorganizes Experience. Cell 178:640–652. https://doi.org/10.1016/j.cell.2019.06.012
Liu Y., Mattar M. G., Behrens T. E. J., Daw N. D., Dolan R. J. (2021). Experience replay is associated with efficient nonlocal learning. Science 372:eabf1357. https://doi.org/10.1126/science.abf1357
Manning J. R., Polyn S. M., Baltuch G. H., Litt B., Kahana M. J. (2011). Oscillatory patterns in temporal lobe reveal context reinstatement during memory search. Proceedings of the National Academy of Sciences 108:12893–12897. https://doi.org/10.1073/pnas.1015174108
Mattar M. G., Daw N. D. (2018). Prioritized memory access explains planning and hippocampal replay. Nature Neuroscience 21:1609–1617. https://doi.org/10.1038/s41593-018-0232-z
Mattar M. G., Lengyel M. (2022). Planning in the brain. Neuron 110:914–934. https://doi.org/10.1016/j.neuron.2021.12.018
McDermott K. B. (2021). Practicing Retrieval Facilitates Learning. Annual Review of Psychology 72:609–633. https://doi.org/10.1146/annurev-psych-010419-051019
McFadyen J., Liu Y., Dolan R. J. (2023). Differential replay of reward and punishment paths predicts approach and avoidance. Nature Neuroscience 26. https://doi.org/10.1038/s41593-023-01287-7
Momennejad I. (2020). Learning Structures: Predictive Representations, Replay, and Generalization. Current Opinion in Behavioral Sciences 32:155–166. https://doi.org/10.1016/j.cobeha.2020.02.017
Nour M. M., Liu Y., Arumuham A., Kurth-Nelson Z., Dolan R. J. (2021). Impaired neural replay of inferred relationships in schizophrenia. Cell 184:4315–4328.
1. Nyberg N.
2. Duvelle É.
3. Barry C.
4. Spiers H. J
2022Spatial goal coding in the hippocampal formationNeuron 110:394–422https://doi.org/10.1016/j.neuron.2021.12.012 Google Scholar
1. O’Keefe J.
2. Nadel L
1979Précis of O’Keefe & Nadel’s The hippocampus as a cognitive mapBehavioral and Brain Sciences 2:487–494https://doi.org/10.1017/S0140525X00063949 Google Scholar
1. Ólafsdóttir H. F.
2. Bush D.
3. Barry C
2018The Role of Hippocampal Replay in Memory and PlanningCurrent Biology 28:R37–R50https://doi.org/10.1016/j.cub.2017.10.073 Google Scholar
1. Pedregosa F.
2. Varoquaux G.
3. Gramfort A.
4. Michel V.
5. Thirion B.
6. Grisel O.
7. Blondel M.
8. Prettenhofer P.
9. Weiss R.
10. Dubourg V.
11. Vanderplas J.
12. Passos A.
13. Cournapeau D.
14. Brucher M.
15. Perrot M.
16. Duchesnay É
2011Scikit-learn: Machine Learning in PythonJournal of Machine Learning Research 12:2825–2830Google Scholar
1. Peer M.
2. Brunec I. K.
3. Newcombe N. S.
4. Epstein R. A
2021Structuring Knowledge with Cognitive Maps and Cognitive GraphsTrends in Cognitive Sciences 25:37–54https://doi.org/10.1016/j.tics.2020.10.004 Google Scholar
1. Preston A. R.
2. Eichenbaum H
2013Interplay of Hippocampus and Prefrontal Cortex in MemoryCurrent Biology 23:R764–R773https://doi.org/10.1016/j.cub.2013.05.041 Google Scholar
1. Rossion B.
2. Pourtois G
2001Revisiting snodgrass and Vanderwart’s object database: Color and texture improve object recognitionJournal of Vision 1:413https://doi.org/10.1167/1.3.413 Google Scholar
1. Roux F.
2. Parish G.
3. Chelvarajah R.
4. Rollings D. T.
5. Sawlani V.
6. Gollwitzer S.
7. Kreiselmeyer G.
8. Wimber M.
9. Self M. W.
10. Hanslmayr S
2022Oscillations support short latency co-firing of neurons during human episodic memory formationeLife 44https://doi.org/10.7554/eLife.78109 Google Scholar
1. Schapiro A. C.
2. McDevitt E. A.
3. Rogers T. T.
4. Mednick S. C.
5. Norman K. A
2018Human hippocampal replay during rest prioritizes weakly learned information and predicts memory performanceNature Communications 9https://doi.org/10.1038/s41467-018-06213-1 Google Scholar
1. Schapiro A. C.
2. Rogers T. T.
3. Cordova N. I.
4. Turk-Browne N. B.
5. Botvinick M. M
2013Neural representations of events arise from temporal community structureNature Neuroscience 16https://doi.org/10.1038/nn.3331 Google Scholar
1. Schönauer M.
2. Pawlizki A.
3. Köck C.
4. Gais S
2014Exploring the Effect of Sleep and Reduced Interference on Different Forms of Declarative MemorySleep 37:1995–2007https://doi.org/10.5665/sleep.4258 Google Scholar
1. Schuck N. W.
2. Niv Y
2019Sequential replay of nonspatial task states in the human hippocampusScience 364https://doi.org/10.1126/science.aaw5181 Google Scholar
1. Sekeres M. J.
2. Moscovitch M.
3. Winocur G.
4. Axmacher N.
5. Rasch B.
2017Mechanisms of Memory Consolidation and TransformationIn: Cognitive Neuroscience of Memory Consolidation Springer International Publishing pp. 17–44https://doi.org/10.1007/978-3-319-45066-7_2 Google Scholar
1. Snodgrass J. G.
2. Vanderwart M
1980A standardized set of 260 pictures: Norms for name agreement, image agreement, familiarity, and visual complexityJournal of Experimental Psychology: Human Learning and Memory 6:174–215https://doi.org/10.1037/0278-7393.6.2.174 Google Scholar
1. Spiers H. J
2020The Hippocampal Cognitive Map: One Space or Many?Trends in Cognitive Sciences 24:168–170https://doi.org/10.1016/j.tics.2019.12.013 Google Scholar
1. Stadler M. A.
2. Roediger H. L.
3. McDermott K. B
1999Norms for word lists that create false memoriesMemory & Cognition 27:494–500https://doi.org/10.3758/BF03211543 Google Scholar
1. Staresina B. P.
2. Bergmann T. O.
3. Bonnefond M.
4. van der Meij R.
5. Jensen O.
6. Deuker L.
7. Elger C. E.
8. Axmacher N.
9. Fell J.
2015Hierarchical nesting of slow oscillations, spindles and ripples in the human hippocampus during sleepNature Neuroscience 18https://doi.org/10.1038/nn.4119 Google Scholar
1. Staresina B. P.
2. Henson R. N. A.
3. Kriegeskorte N.
4. Alink A
2012Episodic Reinstatement in the Medial Temporal LobeJournal of Neuroscience 32:18150–18156https://doi.org/10.1523/JNEUROSCI.4156-12.2012 Google Scholar
1. Tarder-Stoll H.
2. Baldassano C.
3. Aly M
2023The brain hierarchically represents the past and future during multistep anticipationbioRxiv :2023.07.24.550399https://doi.org/10.1101/2023.07.24.550399 Google Scholar
1. Taulu S.
2. Simola J
2006Spatiotemporal signal space separation method for rejecting nearby interference in MEG measurementsPhysics in Medicine and Biology 51:1759–1768Google Scholar
1. Theves S.
2. Fernandez G.
3. Doeller C. F
2019The Hippocampus Encodes Distances in Multidimensional Feature SpaceCurrent Biology 29:1226–1231https://doi.org/10.1016/j.cub.2019.02.035 Google Scholar
1. Tulving E
1993What Is Episodic Memory?Current Directions in Psychological Science 2:67–70https://doi.org/10.1111/1467-8721.ep10770899 Google Scholar
1. Watson D.
2. Clark L. A.
3. Tellegen A
1988Development and validation of brief measures of positive and negative affect: The PANAS scalesJournal of Personality and Social Psychology 54:1063–1070https://doi.org/10.1037/0022-3514.54.6.1063 Google Scholar
1. Wimmer G. E.
2. Liu Y.
3. McNamee D. C.
4. Dolan R. J
2023Distinct replay signatures for prospective decision-making and memory preservationProceedings of the National Academy of Sciences 120:e2205211120https://doi.org/10.1073/pnas.2205211120 Google Scholar
1. Wimmer G. E.
2. Liu Y.
3. Vehar N.
4. Behrens T. E. J.
5. Dolan R. J
2020Episodic memory retrieval success is associated with rapid replay of episode contentNature Neuroscience 23https://doi.org/10.1038/s41593-020-0649-z Google Scholar
1. Wise T.
2. Liu Y.
3. Chowdhury F.
4. Dolan R. J
2021Model-based aversive learning in humans is supported by preferential task state reactivationScience Advances 7:eabf9616https://doi.org/10.1126/sciadv.abf9616 Google Scholar
1. Wittkuhn L.
2. Schuck N. W
2021Dynamics of fMRI patterns reflect sub-second activation sequences and reveal replay in human visual cortexNature Communications 12:1795Google Scholar
1. Zhang H.
2. Fell J.
3. Staresina B. P.
4. Weber B.
5. Elger C. E.
6. Axmacher N
2015Gamma Power Reductions Accompany Stimulus-Specific Representations of Dynamic EventsCurrent Biology 25:635–640https://doi.org/10.1016/j.cub.2015.01.011 Google Scholar

Article and author information

Author information

Simon Kern
Clinical Psychology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Addiction Behavior and Addiction Medicine, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
ORCID iD: 0000-0002-9050-9040
- Correspondence: Email: gordon.feld@zi-mannheim.de and simon.kern@zi-mannheim.de, Central Institute of Mental Health, J5, 68159 Mannheim, Germany, Tel. +49 621 1703 6540, Fax +49 621 1703 6505.
Juliane Nagel
Clinical Psychology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Addiction Behavior and Addiction Medicine, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
ORCID iD: 0000-0002-5310-8088
Martin F. Gerchen
Clinical Psychology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Department of Psychology, Ruprecht Karl University of Heidelberg, Germany; Bernstein Center for Computational Neuroscience Heidelberg/Mannheim, Mannheim, Germany
ORCID iD: 0000-0003-3071-5296
Cagatay Guersoy
Clinical Psychology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Addiction Behavior and Addiction Medicine, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
ORCID iD: 0000-0001-9762-7747
Andreas Meyer-Lindenberg
Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Bernstein Center for Computational Neuroscience Heidelberg/Mannheim, Mannheim, Germany
ORCID iD: 0000-0001-5619-1123
Peter Kirsch
Clinical Psychology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Department of Psychology, Ruprecht Karl University of Heidelberg, Germany; Bernstein Center for Computational Neuroscience Heidelberg/Mannheim, Mannheim, Germany
ORCID iD: 0000-0002-0817-1248
Raymond J. Dolan
Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, UK; Wellcome Centre for Human Neuroimaging, University College London, London, UK
ORCID iD: 0000-0001-9356-761X
Steffen Gais
Institute of Medical Psychology and Behavioral Neurobiology, Eberhard-Karls-University Tübingen, Germany
Gordon B. Feld
Clinical Psychology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Addiction Behavior and Addiction Medicine, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Department of Psychology, Ruprecht Karl University of Heidelberg, Germany
ORCID iD: 0000-0002-1238-9493

Version history

Preprint posted: October 12, 2023
Sent for peer review: October 12, 2023
Reviewed Preprint version 1: November 28, 2023
Reviewed Preprint version 2: April 18, 2024
Reviewed Preprint version 3: May 9, 2024
Version of Record published: May 29, 2024

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.93357. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
