Episodic long-term memory formation during slow-wave sleep

Flavio Schmidig; Simon Ruch; Katharina Henke

doi:10.7554/eLife.89601.1

Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, public reviews, and a provisional response from the authors.

Reviewing Editor
Anna Schapiro
University of Pennsylvania, Philadelphia, United States of America
Senior Editor
Timothy Behrens
University of Oxford, Oxford, United Kingdom

Reviewer #1 (Public Review):

The authors show that concurrently presenting foreign words and their translations during sleep leads to the ability to semantically categorize the foreign words above chance. Specifically, this procedure was successful when stimuli were delivered during slow oscillation troughs as opposed to peaks, which has been the focus of many recent investigations into the learning & memory functions of sleep. Finally, further analyses showed that larger and more prototypical slow oscillation troughs led to better categorization performance, which offers hints to others on how to improve or predict the efficacy of this intervention. The strength here is the novel behavioral finding and supporting physiological analyses, whereas the biggest weakness is the interpretation of the peak vs. trough effect.

Major importance:

I believe the authors could attempt to address this question: What do the authors believe is the largest implication of this studies? How far can this technique be pushed, and how can it practically augment real-world learning?

Lines 155-7: How do the authors argue that the words fit well within the half-waves when the sounds lasted 540 ms and didn't necessarily start right at the beginning of each half-wave? This is a major point that should be discussed, as part of the down-state sound continues into the up-state. Looking at Figure 3A, it is clear that stimulus presented in the slow oscillation trough ends at a time that is solidly into the upstate, and would not neurolinguists argue that a lot of sound processing occurs after the end of the sound? It's not a problem for their findings, which is about when is the best time to start such a stimulus, but it's a problem for the interpretation. Additionally, the authors could include some discussion on whether possibly presenting shorter sounds would help to resolve the ambiguities here.

Medium importance:

Throughout the paper, another concern relates to the term 'closed-loop'. It appears this term has been largely misused in the literature, and I believe the more appropriate term here is 'real-time' (Bergmann, 2018, Frontiers in Psychology; Antony et al., 2022, Journal of Sleep Research). For instance, if there were some sort of algorithm that assessed whether each individual word was successfully processed by the brain during sleep and then the delivery of words was subsequently changed, that could be more accurately labeled as 'closed-loop'.

Figure 5 and corresponding analyses: Note that the two conditions end up with different sounds with likely different auditory complexities. That is, one word vs. two words simultaneously likely differ on some low-level acoustic characteristics, which could explain the physiological differences. Either the authors should address this via auditory analyses or it should be added as a limitation.

Line 562-7 (and elsewhere in the paper): "episodic" learning is referenced here and many times throughout the paper. But episodic learning is not what was enhanced here. Please be mindful of this wording, as it can be confusing otherwise.

https://doi.org/10.7554/eLife.89601.1.sa2

Reviewer #2 (Public Review):

In this project, Schmidig, Ruch and Henke examined whether word pairs that were presented during slow-wave sleep would leave a detectable memory trace 12 and 36 hours later. Such an effect was found, as participants showed a bias to categorize pseudowords according to a familiar word that they were paired with during slow-wave sleep. This behavior was not accompanied by any sign of conscious understanding of why the judgment was made, and so demonstrates that long-term memory can be formed even without conscious access to the presented content. Unconscious learning occurred when pairs were presented during troughs but not during peaks of slow-wave oscillations. Differences in brain responses to the two types of presentation schemes, and between word pairs that were later correctly- vs. incorrectly-judged, suggest a potential mechanism for how such deep-sleep learning can occur.

The results are very interesting, and they are based on solid methods and analyses. Results largely support the authors' conclusions, but I felt that there were a few points in which conclusions were not entirely convincing:

As a control for the critical stimuli in this study, authors used a single pseudoword simultaneously played to both ears. This control condition (CC) differs from the experimental condition (EC) in a few dimensions, among them: amount of information provided, binaural coherence and word familiarity. These differences make it hard to conclude that the higher theta and spindle power observed for EC over CC trials indicate associative binding, as claimed in the paper. Alternative explanations can be made, for instance, that they reflect word recognition, as only EC contains familiar words.
The entire set of EC pairs were tested both following 12 hours and following 36 hours. Exposure to the pairs during test #1 can be expected to have an effect over memory one day later, during test #2, and so differences between the tests could be at least partially driven by the additional activation and rehearsal of the material during test #1. Therefore, it is hard to draw conclusions regarding automatic memory reorganization between 12 and 36 hours after unconscious learning. Specifically, a claim is made regarding a third wave of plasticity, but we cannot be certain that the improvement found in the 36 hour test would have happened without test #1.
Authors claim that perceptual and conceptual processing during sleep led to increased neural complexity in troughs. However, neural complexity was not found to differ between EC and CC, nor between remembered and forgotten pairs. It is therefore not clear to me why the increased complexity that was found in troughs should be attributed to perceptual and conceptual word processing, as CC contains meaningless vowels. Moreover, from the evidence presented in this work at least, I am not sure there is room to infer causation - that the increase in HFD is driven by the stimuli - as there is no control analysis looking at HFD during troughs that did not contain stimulation.

https://doi.org/10.7554/eLife.89601.1.sa1

Reviewer #3 (Public Review):

The study aims at creating novel episodic memories during slow wave sleep, that can be transferred in the awake state. To do so, participants were simultaneously presented during sleep both foreign words and their arbitrary translations in their language (one word in each ear), or as a control condition only the foreign word alone, binaurally. Stimuli were presented either at the trough or the peak of the slow oscillation using a closed-loop stimulation algorithm. To test for the creation of a flexible association during sleep, participant were then presented at wake with the foreign words alone and had (1) to decide whether they had the feeling of having heard that word before, (2) to attribute this word to one out of three possible conceptual categories (to which translations word actually belong), and (3) to rate their confidence about their decision.

The paper is well written, the protocol ingenious and the methods are robust. However, the results do not really add conceptually to a prior publication of this group showing the possibility to associate in slow wave sleep pairs of words denoting large or small object and non words, and then asking during ensuing wakefulness participant to categorise these non words to a "large" or "small" category. In both cases, the main finding is that this type of association can be formed during slow wave sleep if presented at the trough (versus the peak) of the slow oscillation. Crucially, whether these associations truly represent episodic memory formation during sleep, as claimed by the authors, is highly disputable as there is no control condition allowing to exclude the alternative, simpler hypothesis that mere perceptual associations between two elements (foreign word and translation) have been created and stored during sleep (which is already in itself an interesting finding). In this latter case, it would be only during the awake state when the foreign word is presented that its presentation would implicitly recall the associated translation, which in turn would "ignite" the associative/semantic association process eventually leading to the observed categorisation bias (i.e., foreign words tending to be put in the same conceptual category than their associated translation). In the absence of a dis-confirmation of this alternative and more economical hypothesis, and if we follow Ocam's razor assumption, the claim that there is episodic memory formation during sleep is speculative and unsupported, which is a serious limitation irrespective of the merits of the study. The title and interpretations should be toned down in this respect

Other remarks:

Lines 43-45 : the assumption that the sleeping brain decides whether external events can be disregarded, requires awakening or should be stored for further consideration in the waking state is dubious, and the supporting references date from a time (the 60') during which hypnopedia was investigated in badly controlled sleep conditions (leaving open the doubt about the possibility that it occurred during micro awakenings)

1st paragraph, lines 48-53 , the authors should be more specific about what kind of new associations and at which level they can be stored during sleep according to recent reports, as a wide variety of associations (mostly elementary levels) are shown in the cited references. Limitations in information processing during sleep should also be acknowledged.

The authors ran their main behavioural analyses on delayed retrieval at 36h rather than 12h with the argument that retrieval performance was numerically larger at 36 than 12h but the difference was non-significant (line 181-183), and that effects were essentially similar. Looking at Figure 2, is the trough effect really significant at 12h ? In any case, the fact that it is (numerically) higher at 36 than 12h might suggest that the association created at the first 12h retrieval (considering the alternative hypothesis proposed above) has been reinforced by subsequent sleep.

In the discussion section lines 419-427, the argument is somehow circular in claiming episodic memory mechanisms based on functional neuroanatomical elements that are not tested here, and the supporting studies conducted during sleep were in a different setting (e.g. TMR)

Supplementary Material: in the EEG data the differentiation between correct and incorrect ulterior classifications when presented at the peak of the slow oscillation is only significant in association with 36h delayed retrieval but not at 12h, how do the authors explain this lack of effect at 12 hour?

https://doi.org/10.7554/eLife.89601.1.sa0

Author Response

We thank the reviewers for their careful reading of our manuscript and for their constructive and positive comments. We will revise the manuscript to address their key points. Here, we address the reviewer’s scepticism of sleep-learning being mediated by the episodic memory system. We agree that the reported unconscious learning of novel verbal associations during sleep may not match textbook definitions of episodic memory. However, the traditional definitions of episodic memory have long been criticized (e.g, Henke, 2010; Hannula, Minor, Slabbekoorn, 2023; Shohamy & Turk-Browne, 2013; Dew & Cabeza, 2011; Reder et al, 2009). We stand by our claim that sleep-learning was of episodic nature. Here, we provide arguments for this claim:

In the introduction and the discussion, we are reporting that we use a computational definition of episodic memory (Cohen & Eichenbaum, 1993; Henke, 2010; O’Reilly et al., 2014; O’Reilly & Rudy, 2000), and not the traditional definition of episodic memory that ties episodic memory to wakefulness and conscious awareness (Gabrieli, 1998; Moscovitch, 2008; Schacter, 1998; Squire & Dede, 2015; Tulving, 2002). Consciousness and wakefulness are no properties of episodic memory according to the computational definition of episodic memory. Instead, the core computational features of episodic memory according to the computational definition are 1) rapid learning, 2) association formation, and 3) a compositional and flexible representation of the associations in long-term memory. We designed the retrieval task in the current study to assess only the retention of sleep-formed flexibly and compositionally stored word-word associations. Reviewer 3 suggests that sound-sound associations may have been formed during sleep and may have been reactivated at test resulting in the translation of the sound pattern of the translation word to the meaning of the translation word and further to the correct superordinate semantic category of the translation word. Although these processing steps during sleep and during the wake retrieval are possible, the rapid sound-sound associative encoding, long-term storage, and the flexible sound retrieval would still require hippocampal processing and hence computations in the episodic memory system. The interpretation in terms of associative auditory learning with a double semantic translation at wake testing is laborious and inefficient and hence a less parsimonious interpretation of sleep-learning than conceptual associative encoding during sleep. Our view resonates the findings by Andrillon et al. (2017) that mere auditory perceptual learning during slow-wave sleep was not possible at all or led to suppressive memory traces that could not be retrieved following awakening.

Importantly, Züst et al. (Current Biology, 2019) had also presented pseudowords and translation words for paired-associative word encoding during slow-wave sleep. Retrieval testing was performed in the waking state following sleep by use of a cued-recall task, as in the current study. During retrieval testing, Züst et al. recorded brain blood oxygenation using functional magnetic resonance imaging. Importantly, the hippocampus was activated during correctly, but not during incorrectly retrieved memories that had been formed during sleep. Crucially, activation resulting from this contrast within the posterior and anterior hippocampus and within lexical-semantic storage sites in the left temporal pole correlated between participants with retrieval performance (Züst et al., 2019). These correlation results demonstrate that those participants, who learned the vocabulary best during slow-wave sleep activated the hippocampus and lexical-semantic storage sites the most during wake retrieval testing. Because the learning and retrieval tasks in the current study were similar to Züst et al. (2019), the hippocampus was likely mediating the retrieval of the sleep-formed associations in the current study. We have also measured the brain oxygenation using functional magnetic resonance imaging in five persons while they learned pairs of pseudowords and translation words during slow-wave sleep and found the hippocampus activated (besides language areas) in all persons (unpublished).

For these reasons, we believe that vocabulary presentations during sleep had triggered a hippocampus-mediated rapid conceptual-associative encoding process that provided for flexible representations of combinations of pseudowords and translation words in episodic memory.

https://doi.org/10.7554/eLife.89601.1.sa4

Episodic long-term memory formation during slow-wave sleep

Peer review process

Editors

Be the first to read new articles from eLife