Ramping-up hippocampal ripples and their neocortical coupling support human visual short-term memory

Jing Liu; Xianhui He; Can Yang; Nikolai Axmacher; Gui Xue; Shaoming Zhang; Ying Cai

doi:10.7554/eLife.111304.1

Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, and public reviews.

Reviewing Editor
Anna Schapiro
University of Pennsylvania, Philadelphia, United States of America
Senior Editor
Huan Luo
Peking University, Beijing, China

Reviewer #1 (Public review):

Summary:

Cai et al. investigated the role of ripples in the hippocampus and coupled between the hippocampus and the neocortex in visual short-term memory (VSTM) using a similar lures match-to-sample task. The main findings are that hippocampal, but not neocortical ripples, ramp up during the maintenance period, peaking shortly before the memory response is given. This ramping-up effect was stronger for correct compared to incorrect trials. Furthermore, the authors show that stimulus category could be better decoded during coupled hippocampo-neocortical ripples compared to uncoupled ripples. These results provide compelling novel evidence for a role of ripples in supporting human visual short-term memory.

Strengths:

(1) State-of-the-art intracranial EEG in 13 patients during a well-designed visual short-term memory task, with simultaneous hippocampal and neocortical recordings.

(2) Thorough analysis pipeline with validation to detect ripple events, and distinguish them from spurious ripple activity (i.e., as induced by IEDs).

(3) Use of multivariate classifiers to resolve the neural representation of the stimuli.

Weaknesses:

It is difficult to find clear weaknesses in this paper, as the analyses are thorough, the results are clear, and the writing is excellent. However, some more sanity checks on the validity of ripples could have been conducted (i.e., making sure that ripple events have multiple peaks in the unfiltered raw signal at the ripple frequency). Also, the time window for coupled ripples appears to be a bit long, which makes it questionable to what degree these ripples are coupled (i.e., the time window is ~5 times longer than the duration of a ripple event). Lastly, the ramping-up effect could have been more clearly depicted in the figures, but that's a fairly minor point.

https://doi.org/10.7554/eLife.111304.1.sa2

Reviewer #2 (Public review):

Summary:

Liu et al. record intracranial EEG from the hippocampus and lateral temporal lobe in thirteen neurosurgical patients while they perform a delayed match-to-sample visual short-term memory task. The central question is whether hippocampal sharp-wave ripples (brief high-frequency oscillations well established in the long-term memory consolidation literature) also contribute to the active maintenance of visual representations over a short delay. The authors report three main findings: hippocampal ripple rates progressively ramp up across the 7-second maintenance period, hippocampal ripples temporally co-occur with ripples in the lateral temporal lobe, and these coupled events coincide with above-chance category-level decoding of the memorized stimulus in the lateral temporal lobe. The findings are interpreted within the dynamic coding framework of working memory, which predicts discrete reactivation bursts rather than sustained firing during maintenance. The question is timely, and the use of intracranial recordings affords a level of temporal and spatial resolution unavailable to non-invasive methods.

Strengths:

The study addresses a genuinely important and underexplored question: whether a neural mechanism best characterized in the context of offline memory consolidation is also engaged during active online maintenance. The use of intracranial recordings in humans is well suited to this question, providing the millisecond temporal resolution and regional specificity needed to detect transient high-frequency events. The dissociation from long-term memory, tested by splitting remembered trials according to whether the item was later recalled in a cued-recall test, directly addresses what would otherwise be a significant confound, and the finding that ripple dynamics during maintenance are unrelated to subsequent long-term memory performance adds specificity to the interpretation. The coupled ripple analysis is methodologically grounded, and the finding that coupled but not isolated ripples coincide with elevated memory decoding is mechanistically informative. The multivariate decoding approach applied to lateral temporal lobe spectral power provides a meaningful index of memory reactivation that goes beyond simple univariate rate measures. The control analysis and the alternative ripple detection method provide useful robustness checks. The public availability of preprocessed data and analysis code on OSF is commendable.

Weaknesses:

(1) Theoretical motivation for examining ripples in visual short-term memory.

A fundamental question that the paper does not adequately address is why hippocampal ripples, a mechanism strongly associated with offline memory consolidation during sleep, where they coordinate the transfer of hippocampal representations to cortex through temporally compressed replay, should be recruited for the online maintenance of visual information over a seconds-long delay. The Introduction acknowledges this gap but does not close it. The dynamic coding framework is used to motivate the ramping-up prediction, but this framework is agnostic about the specific neural mechanism responsible for reactivation bursts. In particular, the literature cited by the authors predicts high-frequency population activity or gamma bursts, but not specifically hippocampal ripples. The reasoning that "ripples share key properties with postulated reactivation bursts" risks being circular: it amounts to saying that ripples could be the relevant mechanism because the relevant mechanism has properties that ripples also have. A stronger theoretical motivation would require either evidence that the replay or reactivation computations that ripples support during offline states are also engaged during active short-term maintenance, or a mechanistic account of how the circuit processes underlying ripple generation are recruited differently across these two contexts.

This concern is compounded by what the authors present as one of their main controls. The finding that ripple dynamics during maintenance are not associated with subsequent long-term memory performance is treated as a reassurance that the observed effects are specific to short-term memory. But if ripples are canonically a long-term memory consolidation mechanism, the observation that they are engaged by a short-term memory task while appearing disengaged from concurrent long-term memory encoding is itself a finding that demands explanation. Resolving this tension is important for the paper's contribution to be correctly interpreted by the field.

(2) Ripple detection and specificity.

Even granting that ripples could in principle contribute to short-term memory maintenance, the study does not establish that the detected events are physiological sharp-wave ripples rather than broadband high-frequency activity. The detection band (70-180 Hz) substantially overlaps with the high-gamma range, which is a well-established proxy for local neural population activity and coding, and is broader than the 80-120 Hz band used by several of the cited papers, including Vaz et al. (2019), Ngo et al. (2020), Chen et al. (2021), Staresina et al. (2023), and Kunz et al. (2024). Without demonstrating that detected events have the hallmark features of physiological sharp-wave ripples, a clear narrowband spectral peak, and characteristic waveform morphology, it is difficult to conclude that the observed effects reflect a ripple-specific mechanism rather than a more general high-frequency population activity phenomenon. The reported mean rate of 0.29 Hz is somewhat higher than rates reported in some recent work, such as Chen et al. (2021, ref 74) and Kunz et al. (2024, ref 15). It is worth noting that van Schalkwijk and Helfrich (2026, Nature Communications) demonstrated that a large proportion of awake ripple detections in the human medial temporal lobe reflect false positives arising from aperiodic 1/f noise, with task-related modulations of this noise floor producing spurious detections. The authors present an 80-120 Hz control analysis as a robustness check, but this inverts the appropriate logic: if 80-120 Hz is the more validated band, as the cited literature suggests, it should serve as the primary analysis rather than a supplementary one.

(3) Internal inconsistency with the dynamic coding framework.

The authors invoke the dynamic coding framework, which predicts that reactivation bursts should ramp up toward the end of the retention interval in the region where memory representations are actively maintained. The hippocampal ramping-up result is presented as confirming this prediction. However, the lateral temporal lobe, the region where above-chance category decoding is found and memory reactivation is attributed, shows no corresponding ramp-up. The authors acknowledge this asymmetry but do not offer a mechanistically satisfying explanation, and the suggestion that the effect might exist in unsampled subregions cannot be evaluated with the current data. This leaves the framework's core prediction unconfirmed in the region that is claimed to maintain the representations.

(4) Coupled ripples, directionality of hippocampal-lateral temporal coupling, and the ramping-up paradox.

The conclusion that coupled hippocampal-lateral temporal ripples coordinate memory reactivation creates a logical tension that the paper does not resolve. If hippocampal ripples drive lateral temporal reactivation only when co-occurring with lateral temporal ripples, and hippocampal ripples ramp up in a memory-predictive fashion, then the absence of lateral temporal ripple ramping up implies that the hippocampal ramp-up is not primarily expressed through the coupled ripple mechanism, undermining the coherence of the two main findings. The coupled ripple analysis further quantifies only temporal co-occurrence and provides no evidence about the direction of influence. Without demonstrating that hippocampal ripples systematically precede lateral temporal ripples (i.e., the expected signature of hippocampus-to-cortex information flow), the central claim that hippocampal ripples drive lateral temporal reactivation remains an interpretive assumption. Directly testing whether lateral temporal ripples specifically coupled to hippocampal ripples show a ramping temporal profile during maintenance (even if overall lateral temporal ripple rates do not) is necessary to establish whether the lateral temporal lobe engages in hippocampally-gated reactivation bursts in the manner the framework predicts. Additionally, reporting the distribution of peak lags between hippocampal and lateral temporal ripple peaks, and testing whether hippocampal ripples systematically precede lateral temporal ripples, is similarly necessary to support the directional interpretation.

(5) Trial-level analysis clarity.

The paper reports that ripples occurred in 54%, 79%, and 27% of trials during encoding, maintenance, and retrieval, respectively, but does not state whether subsequent analyses were conducted on trials thresholded by ripple occurrence. Given that occurrence rates vary substantially across stages and conditions, this inclusion criterion has implications for interpreting rate differences and should be stated explicitly.

(6) Statistical model specification.

The methods describe the ramping-up analysis using both a "logistic" link function and a "Poisson link function" in different places, with the dependent variable described inconsistently as ripple occurrence and ripple count. These are not equivalent, and the distinction matters for interpreting the reported coefficients. Additionally, the regional dissociation in Figure 3 appears to be assessed by fitting separate models to each region and comparing results informally. This does not constitute a direct test of whether slopes differ between regions and risks the well-known error of inferring a difference based on one p-value being significant while another is not. A direct region × time interaction test would more cleanly support the claimed dissociation.

https://doi.org/10.7554/eLife.111304.1.sa1

Reviewer #3 (Public review):

Summary:

Liu, He, et al. present results suggesting hippocampal ripples support short-term working memory. The basic finding that hippocampal ripples increase during a 7s working memory maintenance period is intriguing and previously not shown as far as I know, but a lack of control analyses within the task, across brain regions, or as compared to alternative oscillatory signals makes the overall evidence weak. The author needs to more thoroughly evidence this signal via several analyses (suggested below) to strengthen their finding. The paper moves on to a hippocampal-cortical ripple coupling analysis that needs further methodological details and corrected statistics to make a meaningful contribution. As is, the ripple coupling results don't seem to necessarily relate to the hippocampal ripples found in the maintenance period, making the manuscript somewhat incoherent and of low impact in its current form.

Major issues:

(1) The framing sets up "visual short term memory" (VSTM) and "long term memory" (LTM) as two different things. A long line of research with humans possessing MTL/hippocampus damage shows the hippocampal memory system contributes to working memory only when the task is difficult enough to warrant its recruitment (see Hannula et al. 2006 J. of Neuroscience, Pertzov et al. 2013 Brain, or particularly Jeneson et al. 2012 Learning & Memory and J. of Neuroscience). This theory therefore, suggests that the hippocampus contributes to working memory via LTM mechanisms, as opposed to it possessing two different roles (VSTM and LTM). While the authors might disagree with this framing, at a minimum, they should describe this line of work. As is, it's difficult to know how their task fits into this literature since it's a cross between a pattern separation probe (identify repeats from lures), working memory (7 s delays), and subsequent cued associate recognition. Addressing why they used this combination of task features would help frame its place in the literature.

(2) The basic idea of looking for hippocampal ripples as a marker for working memory maintenance is new, with no prior literature (that I know of in rodents or in the handful of human intracranial ripple papers) to build on. That said, I suspect hippocampal ripples act as a proxy for hippocampal activation, providing a possible explanation for the hippocampal ripple increase shown during the Maintenance period. The effect they show is well supported by the mixed effects modeling (MEM), making it a potentially meaningful finding, but considering the novelty, it's rather important that control analyses rule out alternative possibilities. I suggest two important ones and a third related to the lack of parametric manipulations in the next paragraph. First, the authors frame the paper by suggesting hippocampal ripples share features with beta/gamma burst theories of working memory maintenance. In that case, the obvious question is why use a ripple detector instead of measuring gamma (or beta) activity as in this previous work? Some work has suggested hippocampal ripples act differently than high-frequency activity (see Sakon et al. 2024 J. of Neuroscience), so an analysis contrasting ripples and gamma seems rather important. Second, and relatedly, the authors only compare the hippocampus and lateral temporal cortex (LTC), likely because these tend to be sites with strong coverage in epilepsy cases. That's ok, but typically there is also reasonable coverage in other MTL areas like entorhinal cortex and amygdala, which would serve as important controls to show what they're measuring likely relates to sharp-wave ripples (a hippocampal phenomenon) and not something more generic like gamma or HFA (as shown in Sakon et al. 2024, Howard et al. 2003 Cerebral Cortex, Axmacher et al. 2007 reference 26, Meltzer et al. 2008 Cerebral Cortex, etc.).

(3) Related to the last point, since there are no parametric manipulations (e.g., different delay durations, different set sizes, varying lure difficulties) there's no way to assess increased hippocampal ripples with stronger loads, which would be important for determining the hippocampal dependence of their task in the first place. Do the authors have any justification for this task as an assessment of hippocampal working memory? I could imagine using a top vs. bottom tercile of lure discrimination difficulty (as assessed across all participants or control non-patients) to compare hippocampal activity. But only after the first trial, each pair is used since only then would the patient have awareness of the difficulty of the upcoming comparison. Or maybe something could be done by comparing VSTM performance by splitting patients based on how they performed at the LTM test.

(4) Also related to the VSTM vs. LTM framing, the authors use an "LTM" cued category recognition task--presumably done at the end of the repeat/lure recognition task--as a way to argue that the hippocampal ripple effects they see relate to VSTM and not LTM. The LTM task is disappointingly underdescribed, where even in the methods (lines 588-592) I cannot figure out when this task was probed, how many trials were done in comparison to the VSTM task, etc. Considering they use the LTM task to support their VSTM interpretation, it's rather crucial to understand precisely what they did. As is, the comparison they do present relies on a statistical error, where they compare p-values (n.b. https://www.nature.com/articles/nn.2886) instead of performing a direct interaction test (lines 177-180). Specifically, if they want to say their signal relates more to VSTM subsequent memory rather than LTM subsequent memory, they need to run a model of the form: ripple_rates ~ remembered + test_type + remembered*test_type (where test_type is either their VSTM or LTM task).

(5) As noted, the increase in hippocampal ripples during maintenance seems substantial, and the MEM confirms a significant increase over time. That said, the presentation of the data is atypical, with an example raster from one channel followed by average time courses of ALL participants below it. Why not show full raster plots for all participants? Ripples are so sparse that all the data in the task can be visualized in a single raster easily. A swarm plot indicating inter-patient variability in the maintenance signal also seems crucial. As is, there is no way to assess how much of the signal depends on a small subset of channels or patients.

(6) To compare ripple rates across task phases, they average over the bounds of each phase (lines 657-660) and input these into their MEMs. This approach makes sense for quantifying what we see in the ripple plots (Figure 2), except for Encoding, where they average over the entire 3 s window, even though there is clear tuning only from ~0-1 s. Using the tuned region and not the entire window is standard and would be more appropriate for the comparisons to maintenance, retrieval, etc (e.g., line 147-148 doesn't check out when looking at the figure), otherwise you are averaging over a seeming ripple inhibition from 1-2 s. They perform a cluster-based permutation test as is, so that a window or something a bit wider would be appropriate.

(7) The authors pivot to a hippocampal-cortical ripple coupling analysis to build the argument that the hippocampal ripples shown in Figure 2 support memory maintenance in the cortex. They use a window of -500 to 500 ms from hippocampal ripples to assess coupling. This is quite wide, since it doesn't seem plausible that a cortical ripple 500 ms from a hippocampal ripple means they synchronize. They cite two papers to justify the analysis, both of which use {plus minus}500 ms windows, but for spindle-ripple coupling, not ripple-ripple, so are miscited. Later in the paper, they switch to {plus minus}50 ms for another coupling analysis, raising the question of why they used {plus minus}500 ms in the previous analysis to begin with. If they want to claim cortical ripples are tuned by hippocampal ripples all the way up to 500 ms away, they should show the rasters (as in Figure 4a) and timecourse ripple rates, but going beyond {plus minus}500 ms to show that ripples in the {plus minus}50-500 ms range are above, say 500-1000 ms to justify their window selection. I will point out that there IS previous work that used {plus minus}500 ms to measure cortical-cortical ripple coupling (Dickey et al 2022 PNAS, which should be cited regardless, as I believe the first hippocampal-cortical ripple paper showing memory effects), although the figures in that paper suggest anything beyond {plus minus}250 ms returns to baseline (see Figure 2A-B).

(8) Lines 239 to 243 comparing p-values instead of an interaction test.

(9) I don't understand what "Further analysis based on the identified cluster" means (line 271). I see in Figure 5c that their broadband classifier identified a window of optimal decoding, but did they use only activity in this cluster to train the subsequent classifier (Figure 5d)? If so, this is not described in the methods. And if it is done that way, I don't think the logic makes sense. As mentioned in comment 6, the ripples during encoding tune to 0-1s after image presentation. So it doesn't make sense to use a 1.85-2.25 s window for ripple-locked decoding-they should just be using the 0-1 s window (or whatever their cluster-based permutation test shows in Figure 2b). Otherwise, it would appear they are studying two different phenomena.

(10) As is, the results in Figure 5d need to be redone. First, the results described on lines 271-275 once again suffer from comparing p-values. They need to run an interaction model if they want to claim Maintenance shows stronger ripple-locked decoding than Encoding (it almost certainly will not, since Encoding appears to show some evidence of decoding (p=0.118)). Second, even if they do change the framing to say Encoding and Maintenance show significant decoding, is it meaningful if Retrieval fails to? If you cannot decode the same information at the time of retrieval as is theoretically being held in working memory during the delay, the coupled ripple reactivation story wouldn't appear to make sense. They do show significant Retrieval decoding in Figure 5a-b, but since I don't really understand how they settled on the "identified cluster" in Figure 5c, I'm not sure what to make of the difference between these decoders.

(11) Finally, as mentioned in the summary, the analyses in Figures 2-3 seem disjointed from those in Figures 4-5. Part of this has to do with the switch to a broadband classifier, then a switch back to coupled ripples, and then, as I already mentioned, decoding results with time windows that don't align with the hippocampal ripple effects they showed earlier. Further, since the main point of Figures 2-3 is to establish a ramp in hippocampal ripples across maintenance, shouldn't they be trying to show how the decoding changes over the course of the Maintenance period? It would also help the interpretation of Figure 5 to see how the coupled ripples change over time in Figure 4 (as they showed them in Figure 2).

Minor issues:

(1) Instead of citing a software package like Emmeans, the statistical test being performed should be explained.

(2) Decoding % accuracy in the heatmaps in Figure 5 and supplementary would be more intuitive, particularly since Figure 5b uses accuracy anyway.

(3) Figure 2b is misleading with an unnecessary change in the y-axis for retrieval.

(4) In Figure 2d, a significant cluster is mentioned, but not drawn onto the figure as in Figure 2b.

https://doi.org/10.7554/eLife.111304.1.sa0

Ramping-up hippocampal ripples and their neocortical coupling support human visual short-term memory

Peer review process

Editors

Be the first to read new articles from eLife