Neural Correlates and Reinstatement of Recent and Remote Memory: A Comparison Between Children and Young Adults

Iryna Schommartz; Philip F Lembcke; Javier Ortiz-Tudela; Martin Bauer; Angela M Kaindl; Claudia Buss; Yee Lee Shing

doi:10.7554/eLife.89908.3

Introduction

Every day we form new memories that may become long-lasting through memory consolidation, a complex process in flux between encoding and retrieval (Dudai, 2012; Josselyn et al., 2015; Moscovitch & Gilboa, 2022; Semon, 1921).During systems-level consolidation, memory representations and traces are reorganized across medial temporal lobe and neocortical brain networks (Ranganath & Ritchey, 2012; Ritchey & Cooper, 2020). These networks include brain regions that are involved both in initial encoding and in integration of new memories as time passes (Axmacher & Rasch, 2017; Dudai, 2012; Moscovitch & Gilboa, 2022; Squire et al., 2015). While decades of work have shed light on general neural mechanisms of memory consolidation in adults (Moscovitch & Gilboa, 2022; Sekeres et al., 2017a; Winocur & Moscovitch, 2011), much less is known about neural mechanisms that support memory consolidation in children – a knowledge gap that we aimed to address with the current study.

Neural correlates of memory consolidation

Learning through repeated activation and reinstatement is one way to rapidly stabilize memory traces and make them accessible upon retrieval (Dudai, 2004; Nader & Hardt, 2009; Teyler & Rudy, 2007). For instance, in young adults, repeated exposure to word-image pairs during encoding, compared to single exposure, was shown to accelerate memory consolidation. This is achieved through enhanced replay of repeated events in the retrosplenial cortex (RSC) and the medial prefrontal cortex (PFC), as well as via increased hippocampal (HC)-cortical replay that promotes the associative word-object memories (Yu et al., 2022). In another study by Brodt et al. (2016), it was found that during repeated spatial navigation in a virtual environment, activation in the posterior parietal cortex (PPC), especially the precuneus, increased and remained elevated after 24 hours, while HC activity and HC-PPC connectivity declined with repeated encoding rounds (Brodt et al., 2016). In addition, neocortical plasticity measured by diffusion-weighted magnetic resonance imaging in the PPC (Brodt et al., 2018) and the cerebellum (Stroukov et al., 2022) supported rapid cortical storage of memory traces for object-location associations after repeated exposure in young adults 1 hour and 12 hours post-learning. Taken together, these findings indicate that repeated learning in young adults promotes fast creation of neural memory representations, which can remain stable for at least 24 hours and predict behavioural mnemonic performance.

Memory consolidation of well-learnt information does not end with the last learning cycle, but undergoes further neural reorganizing and modification over time (Roüast & Schönauer, 2023; Sekeres et al., 2017). For example, during cued recall of face-location associations, young adults who were tested 24 hours after learning, compared to 15 minutes, showed increased activation in the precuneus, inferior frontal gyrus (IFG), and fusiform gyrus, whereas the hippocampus showed a decrease in activation (Takashima et al., 2009). Similarly, increased activation in the anterior temporal cortex during the retrieval of studied figure pairs eight weeks prior was observed, while increased activation in the HC was shown for pairs learned immediately before retrieval (Yamashita et al., 2009). Furthermore, delayed retrieval of naturalistic video clips after a delay of seven days in young adults was associated with increased activations in the lateral and medial PFC and decreases in HC and parahippocampal (PHG) activations over time (Sekeres et al., 2021). This is convergent with the notion that the role of the prefrontal cortex increases during recollection as consolidation progresses over time (Milton et al., 2011). Moreover, subsequently recollected memories showed higher post-rest HC- lateral occipital cortex (LOC) connectivity specifically related to scene-related mnemonic content, indicating the role of LOC in associative memory consolidation (Tambini et al., 2010). On the other hand, HC activation has been reported to remain stable after seven days (Sekeres, Winocur, Moscovitch, et al., 2018) three months (Harand et al., 2012) or even years (Söderlund et al., 2012) for consistent episodic memories that retained contextual details.

To summarize, in alignment with the Multiple Trace Theory (Nadel et al., 2000; Nadel & Moscovitch, 1997), studies have shown that memories of well-learned information increasingly engage cortical regions over time. There regions include the prefrontal, parietal, occipital, and anterior temporal brain areas, supporting the retrieval of general and schematic memories, as well as complex associative information. In line with the Standard Consolidation Theory, some studies have demonstrated a decrease in the recruitment of the HC over time (Squire & Alvarez, 1995). Conversely, and converging with the Contextual Binding Theory (Yonelinas et al., 2019) and the Multiple Trace Theory, some studies have shown that hippocampal involvement lingers over time, particularly for detailed and contextual memories. However, most research has focused on only a selected delay window and solely on young adults.

Mnemonic transformation and reinstatement across consolidation

In addition to changes in neural activation during mnemonic retrieval over time, it is important to characterize the transformations and reinstatement of neural representations – that is, distinctive pattern of neural activity generated by a specific memory (Averbeck et al., 2006; Kriegeskorte, 2008; Kriegeskorte & Kievit, 2013) – as these multivariate patterns of neural activity may change over time. For example, memory for perceptual details often declines over time, while memory for gist may tends to remain more stable, suggesting differential temporal trajectories of transformation (Sekeres et al., 2016). According to Fuzzy Trace Theory (Reyna & Brainerd, 1995, 1998) and Trace Transformation Theory (Moscovitch & Gilboa, 2022), detailed and gist- like memories may be uniquely present or coexist, depending on the strength of formed memories. For instance, Diamond et al. (2020) showed that the specific accurate nature may be preserved for correctly recalled memories. In other instances, initially weak detailed memories may be reorganized over time, with lingering specific memories and parallel creation of gist-like generic memories. Further research supports suggest the idea that memory traces undergo transformation and abstraction beyond simple perceptual reinstatement (Chen et al., 2017; St-Laurent & Buchsbaum, 2019; Ye et al., 2020), pointing to the development of schematic, generic representations. However, relatively little is known about how the neural representation of well- learned memories change over the consolidation period – particularly how similar patterns of neural activity are reactivated upon retrieved again (Clarke et al., 2022; Deng et al., 2021).

Using representational similarity analysis (RSA; Kriegeskorte, 2008), Tompary & Davachi (2017) showed that a one-week delay led to differential memory reorganisation in HC and mPFC for memories with and without overlapping features. Specifically, after a one-week mnemonic representations became more similar for memories with overlapping features, indicating consolidation-related gist-like neural reorganization. Moreover, the authors showed memory- specific reinstatement of neural patterns for specific memories in the right HC, indicated by significant encoding-retrieval similarity for remote but not recent memories. Comparing neural reinstatement of visual clips during encoding, immediate, and delayed recall (after 1-week- period), Oedekoven et al. (2017) showed reliable reinstatement in core retrieval networks, including the precuneus, medial temporal gyrus, occipital gyrus, HC, and PHG among others. In contrast to Tompary and Davachi (2018), this study found no time-related differences in reinstatement effects. Therefore, the findings on memory reinstatement are mixed, and, to date, no study have directly tracked the neural representations of memory traces for perceptual together with more abstract, gist-like features (e.g., semantic categories).

Neural correlates of memory consolidation and mnemonic transformation and reinstatement in middle childhood

Brain regions involved in memory consolidation show protracted developmental trajectories from early to late childhood (Badre & Wagner, 2007; Ghetti & Bunge, 2012c; Gogtay et al., 2004; Keresztes et al., 2022; Lenroot & Giedd, 2006; Mills et al., 2016; Ofen et al., 2007; Shing et al., 2008), which could lead to differences in neural activity and/or patterns and subsequently mnemonic reinstatement between children and adults. For instance, univariate selectivity was reduced in children, while fine-grained neural representational similarity along the ventral visual stream was similar in 5-11 years old children and adults (Cohen et al., 2019; Golarai et al., 2015). Fandakova et al. (2019) also showed that the neural representational distinctiveness of information during encoding was similar in 8-to-15-year-old children and adults in the RSC, LOC and PHG. The fidelity of neural representations was also associated with subsequent memory in a similar way between children and adults. Overall, although these findings did not address the question of neural reinstatement directly in children, they suggest that mnemonic reinstatement may develop prior to univariate selectivity. However, it is yet to be investigated. Moreover, it is unclear whether the age-related differences in neural activation and reinstatement mentioned above are similar for memory consolidation. Specifically, to what extent does consolidation-related transformation of neural representations occur, and how does it impact neural reinstatement of mnemonic content in the developing brain?

In middle childhood, the trade-off between retaining vivid, detail-rich memories and their transformation into vague, gist-like memories due to delay may be more pronounced. Brainerd et al., (2002) demonstrated that, during development, specific memory and gist-memory for events emerge together. However, as children mature, they exhibit more false memories based on gist in the absence of exact memories for the events. On the other hand, Keresztes et al. (2018) postulated that younger children tend to rely more on generalization when forming new memories, while older children and adults use more specific detail-rich information, suggesting a shift from generalization to specificity as children mature. Hence, there are some inconsistencies in the theoretical postulations and findings regarding item-specific and gist-based memories that may impact memory consolidation in middle childhood. Investigation on the neural reinstatement patterns of item-specific and gist-like memories across time may add to the understanding of these inconsistencies in children.

Aim of the current study

In this study, we examined the univariate neural activation and multivariate neural reinstatement patterns of memories for object-location associations across a short delay (after one night of sleep) and a long delay (after a 2-week period), relative to recently consolidated memories (after 30 minutes). Children (5-to-7-year-old) were compared to young adults serving as a reference group with a mature memory consolidation system. We selected 5 to 7 years as the age range of interest because previous studies showed a large improvement in associative memory around this age (Sluzenski, Newcombe, & Kovacs, 2006). Practically, this is also the youngest age range in which MRI scanning coupled with active task execution could be applied relatively successfully.

We hypothesized (i) according to the Multiple Trace Theory, an increasing involvement of prefrontal, parietal, cerebellar, occipital and PHG brain regions over time in adults in comparison to children, as these regions are still maturing in preschool and early school-aged children (Ghetti & Bunge, 2012b; Keresztes et al., 2022; Lebel et al., 2012; Shing et al., 2008, 2010a); (ii) according to the Contextual Binding Theory, the Multiple Trace Theory, and supported by the evidence from Sekeres, Winocur, & Moscovitch (2018), a stable involvement of HC over time in adults and children due to relative maturity of the HC in middle childhood and detailed contextual nature of the repeatedly learned information, as our task emphasizes spatial- contextual binding of objects within scenes (Keresztes et al., 2017; Nadel et al., 2000; Sekeres, Winocur, Moscovitch, et al., 2018; Shing et al., 2008; Sluzenski et al., 2006; Yonelinas et al., 2019); (iii) a decreasing neural reinstatement in all ROIs over time, with this decrease being more pronounced in children compared to young adults (Cohen et al., 2019; Golarai et al., 2015); (iv) qualitative differences in representational format between age groups. Specifically, we expected more generic category-level gist-like memory representations in children, whereas adults would retain more detailed item-specific reinstatement patterns over time due to differences in the strength of formed memories and differences in underlying associative and strategic components of memories (Reyna & Brainerd, 1995; Shing et al., 2008, 2010). This assumption aligns with the Fuzzy Trace Theory (Brainerd & Reyna, 2002), which posits that verbatim and gist representations are encoded in parallel and that verbatim memories can be created without the extraction of gist. Our task design – involving repeated strategic learning – may foster the emergence of consolidation-driven, categorical gist-like neural representation in children. These are to be distinguished from mature semantic gist as defined by the FTT for verbal material. Due to ongoing maturation of associative and strategic memory components and their underlying neural substrates, children may be more inclined to extract generic category- based gist information at the expense of detailed information.

Results

Behavioural results

Final Learning Performance

Unique sets of object-location association pairs were learned on Day 0, Day 1, and Day 14. During each initial encoding trial, participants were presented with an object within a congruent scene (e.g., a fox in a spring pine tree forest), and were asked to memorize the exact location of the object within the scene by creating a story and making “mental” pictures of the scene. The choices for locations varied across scenes while they remained constant across time within individuals. There were 18 unique key locations among which object could be distributed, resulting in a heterogenous set of locations for objects. We employed an adaptive, repetitive learning-to-criteria procedure to ensure initially strong memories (see Fig. 1A for the task overview and Fig. 1B for experimental procedure overview).

(A) Trial Structures in the Experimental Task.
(i) *Initial Encoding*: Participants memorized object-location pairs by creating a story or forming a “mental photo” of each scene, focusing on the exact location of the object within the scene. (ii) *Learning Phase*: Participants selected one of three possible object locations and received feedback: a happy face for correct responses, a sad face for incorrect ones, and a sleeping face for missed response. The correct object-location pairing was then displayed again. (iii) *Retrieval Phase:* Conducted inside the MR scanner, participants chose the object’s location in the scene from three options without receiving feedback. **(B) Experimental Procedure**. Testing took place across three days. On Day 0, participants learned 60 object-location associations (*remote items*). On Day 1 (*short delay*), they learned 30 new object-location associations (*recent items*) and retrieved 30 remote and 30 recent items. On Day 14 (*long delay*), participants learned another 30 new associations and retrieved 30 remote and 30 recent items. Throughout all sessions, participants also completed socio- demographic and psychometric questionnaires, which were distributed across sessions. *Note*: RT – reaction time; s – second, fMRI – functional magnetic resonance imaging.

Before the learning phase began, participants were instructed to create stories to help them memorize the locations of objects within scenes. To familiarize themselves with this strategy, they first practiced this strategy on two unique sets of five object-location associations. Learning then commenced with the initial encoding block, followed by adaptively repeated retrieval-encoding cycles to strengthen memory for the object locations. During these learning cycles, participants were presented with the same scenes again, now with three rectangles indicating possible locations for each previously learned object. The task followed a three- alternative forced-choice task (3AFC) format, with the correct location randomly appearing on the left, middle, or right. The rectangles were presented in close proximity within each scene, requiring participants to recall location details with high precision. Participants were asked to choose one rectangle that corresponded to the correct location of the object within the scene (Fig. 1A “Learning Cycles”). After each response – regardless of accuracy – the object was shown in its correct location to reinforce learning. The learning cycles were repeated for a minimum of two times and a maximum of four times, or until participants achieved at least 83% accuracy in one cycle. This 83% threshold, established through pilot testing, served as a guideline for starting the next learning cycle rather than as a strict learning criterion to exclude participants. Participants who did not reach this threshold after four cycles were still included in the analysis if their performance exceeded chance level (33%). All participants demonstrated at least average cognitive abilities, as determined by a standardized intelligence test (see Table 1).

Concerning number of learning cycles, the linear mixed effects (LME) model revealed a significant Group effect, F_(1,563) = 7.09, p .008, w² = .01, with children needing more learning cycles to reach the learning criteria in comparison to adults, b = -.43, t₍₅₆₃₎ = -2.66, p = .008, within the defined minimum and maximum of learning cycles (Fig. 2A). Five child participants did not reach the learning criteria after the fourth learning cycle and their final performance ranged between 70% and 80%. The number of learning cycles did not differ between sessions as revealed by non-significant Session effect and Group x Session interaction (all p > .40).

**(A) Overview of Learning Performance**. Individual learning trajectories across up to four encoding-retrieval cycles for children and young adults on Day 0, Day 1 and Day 14. Each coloured dot represents a participant’s accuracy (percentage of correct responses) at a given cycle. Transparent connecting lines illustrate within-person changes in accuracy across cycles. Across all sessions, children needed on average between two to four learning- retrieval cycles to reach the criterion of 83% correct responses, while young adults typically reached it within two cycles. **(B) Final Learning Performance.** Final learning accuracy is calculated as the percentage of correct responses during the last learning cycle for both children and young adults. For each group and session, distributions are visualized using half-eye plots (smoothed density estimates), overlaid with boxplots indicating the median and interquartile range. The shape and spread of density plot reflect individual data variability. Grey dashed line indicates the criteria of 83% correctly learned items.

Final learning accuracy, operationalized as the percentage of correctly identified object locations, was significantly higher in young adults than in children, F_(1,79) = 94.31, p < .001, w² = .53, t₍₁₈₅₎ = 7.55, p < .001 (Fig. 2B), as revealed by the LME model. There was no significant effect of Session (p = .79) and no Session x Group interaction (p = .96), indicating that the learning accuracy was stable across sessions with different stimuli sets. Although the learning procedure was adaptive, children showed consistently lower learning performance compared to young adults.

Memory Retention Across Time

Changes in memory retention were assessed during the retrieval part of the memory task (Fig. 1A “Retrieval (fMRI)”). Participants were cued with the object and were instructed to recall as vividly as possible the associated scene and the location of the object within the scene during the fixation window, when no visual input was presented on the screen. The associated scene was then presented with three choices and participants had to select a rectangle indicating the correct location of the object in the scene (see Methods for more details).

First, we investigated whether retention rates for recently learned items (initially correctly encoded on Day 1 and Day 14) differed between sessions in children and adults. The Session x Group interaction was not significant, F_(1,75) = 1.77, p =.187, w² = .001, indicating that retention rate differences across sessions did not vary significantly between groups (see Table S1A). Based on that, we averaged recent retention rates across sessions within each group for subsequent analysis.

Second, we examined changes in memory retention rates for items that were initially correctly learned (i.e., strong initial memories), focusing on group differences in recent and remote (short- and long-delay) memory retention relative to a 100% baseline (see Fig. 3 and Table S1B for details). The linear mixed-effects model predicting retrieval accuracy for learned object-location pairs explained a substantial portion of variance, R² = .77, 95% CI [.73 – .81]. We observed a significant main effects of Item Type, F_(3,250) = 229.18, p <.001, w² = .73. Post hoc comparisons revealed no significant difference between recent memory retention and short- delay remote memory retention, b = 1.49, t₍₂₅₉₎ = 1.26, p = .754. However, recent memory retention was significantly higher than long-delay remote retention, b = 21.36, t₍₂₅₉₎ = 17.59, p < .001, and short-delay remote retention was significantly higher than long-delay remote memory retention, b = 19.88, t₍₂₆₀₎ = 16.16, p < .00. Further, we observed a significant main effect of Group, F_(1,85) = 55.00, p <.001, w² = .38. Post hoc comparisons revealed an overall lower memory retention in children compared to young adults, b = -11.1, t₍₉₁₎ = -7.20, p < .001. Additionally, we observed a significant Item Type x Group interaction, F_(3,250) = 17.35, p < .001, w² = .16. Model-based Sidak post hoc comparisons showed that the slope of memory retention decline was significantly steeper in children compared to adults for recent items, b = 15.26, t₍₂₅₄₎ = 6.56, p < .001, for short delay remote items, b = 11.41, t₍₂₅₅₎ = 4.84, p < .001, and for long delay remote items, b = 13.08, t₍₂₅₈₎ = 5.38, p < .001. In addition, memory retention rates significantly increased (corrected for multiple comparisons with False Discovery Rate (FDR)) with age in the child group for recent items, b = .89, t = 2.62, p = .016, for short delay remote items, b = .91, t = 2.67, p = .016, but not for long delay remote items, b = .15, t = .326, p = .747.

Retention rates for initially correctly learned items.
Memory accuracy is operationalized as the percentage of correct responses in the retrieval task conducted during the MRI scanning sessions for items that were initially correctly learned, indicating strong initial memories. Memory accuracy for recently consolidated items did not differ between sessions in young adults and children and was collapsed across sessions. Overall, young adults show higher and more stable memory accuracy than children, with memory declining over time for both groups, particularly for long delay. All tests used Sidak correction for multiple comparisons. *p < .05; **p < .01; ***p < .001(significant difference); non-significant differences were not specifically highlighted. The boxplot summarizes the distribution of accuracy scores across sessions and delay conditions. In each boxplot, the central line indicates the median, the box represents the interquartile range (25^th - 75^th percentile), and the whiskers extend to the range of values within 1.5 times the variability. The red dashed line at 34% indicates the threshold for chance-level performance.

Of note, we conducted an additional analysis on a subsample that included only participants who needed two learning cycles to reach the learning criterion (see Table S1.1, Fig. S1, Table S1.2 for details). Twenty-one child participants were excluded, resulting in the final subsample of n = 28 children. The results from this subsample fully replicated the findings from the full sample, indicating that the amount of re-exposure to stimuli during encoding did not affect consolidation-related changes in memory retrieval at the behavioral level.

Taken together, both age groups showed a decline in memory performance over time. However, compared to young adults, children showed a steeper slope of memory decline for both immediate recent and remote short- and long-delay memories. In sum, the results showed that children had overall worse memory retention rates compared young adults, indicating less robust memory consolidation in children.

fMRI Results

Mean activation for remote > recent memory in ROIs

To investigate how neural activation for correctly recalled memories varied across different time delays, we examined the contrast of remote > recent correct trials during object presentation at retrieval (Fig. 4 “Retrieval fMRI).

Mean Signal Differences Between Correct Remote and Recent Memories.
The figure presents mean signal difference for remote > recent contrast across sessions and groups during the object presentation time window in **(A)** Anterior and Posterior Hippocampus; (B) Anterior and Posterior Parahippocampal Gyrus; **(C)** Cerebellum; **(D)** Medial Prefrontal Cortex; **(E)** Ventrolateral Prefrontal Cortex; **(F)** Precuneus; **(G)** Retrosplenial Cortex; **(H)** Lateral Occipital Cortex. *Note:* Bars indicate the group mean for each session (solid lines for Day 1, dashed lines for Day 14), plotted separately for children and young adults. Error bars represent ±1 standard error of the mean. The colour indicated the age groups: purple for children and khaki yellow for young adults. Across all panels, mean of individual subject data are shown with transparent points. The connecting faint lines reflect within-subject differences across sessions. Orange asterisks denote significant difference of **remote > recent** contrast from zero. An upward orange arrow indicates that this difference is greater than zero, while a downward arrow indicates that this is less than zero. *p < .05; **p < .01; ***p < .001(significant difference); non- significant differences were not specifically highlighted. Significant main and interaction effects are highlighted by the corresponding asterisks. All main and interaction p-values were FDR-adjusted for multiple comparisons.

We first tested whether the remote > recent contrast significantly differed from zero in each age group and session (Day 1 and Day 14), as an indicator of differential engagement during memory retrieval. FDR-adjusted results showed no significant results in the anterior and posterior HC (Fig. 4A), anterior PHG (Fig. 4B), and RSC (Fig. 4G) across sessions and age groups (all p > .054; see Table S6 for details). To rule out the possibility that these non- significant differences reflect an overall absence of retrieval-related activation, we tested whether mean activation for recent and remote items – each relative to the implicit baseline – was significantly above zero. FDR-adjusted results revealed that activation in these ROIs was significantly greater than zero (all p < .031), except in the recent Day 1 condition in children for the posterior HC (p > .141) and the precuneus (p > .056, see Table S7 and Fig. S3 for details). These findings indicate that the anterior and posterior HC, anterior PHG, and RSC are similarly engaged during successful retrieval of both recent and remote memories, regardless of delay or age group. (As a control analysis, we tested whether the anterior and posterior HC, anterior PHG, RSC were similarly engaged during retrieval of recent and remote items over time using the LME models. These models included mean activation relative to the implicit baseline, a Session x Delay x Group interaction, and Subject as a random intercept. The results were consistent with the earlier findings, showing no significant main effect of Delay (all p > .106), Group (all p > .060), or Session x Delay interaction (all p > .340), indicating comparable engagement of these ROIs across delays and age groups (see Table S8 for full statistical details)). Other ROIs showed more differentiated patterns, which are discussed below. (In contrast, the vlPFC, CE, posterior PHG and LOC, precuneus, and mPFC showed a significant main effect of Delay (all p < .009, see Table S8 for details), indicating time-related changes in the remote > recent contrast. These effects are examined in more detail below. Notably, these findings are consistent with results from the whole-brain analyses (Tables S3.1, 4.1, 5.1)).

To further explore the more differentiated patterns observed in other ROIs, we examined changes in the remote > recent contrast across age groups and sessions (Day 1 and Day 14) using LME models, controlling for sex, handedness, general intelligence, and mean reaction time. All main and interaction effects were FDR-adjusted, and all post hoc tests were Sidak- corrected (see Table S9 for details).

For the posterior PHG (Fig. 4B), a significant Session x Group interaction, F_(1,83) = 9.54, p = .020, w² = .09, indicated a more pronounced increase in remote > recent mean signal difference over time in young adults compared to children, b = .11, t₍₈₃₎ = 3.09, p = .003.

Similarly, also for the cerebellum (Fig. 4C) a significant Session x Group interaction, F_(1,161) = 7.68, p = .020, w² = .04, indicated a stronger increase in remote > recent mean signal difference over time in young adults compared to children, b = .09, t₍₁₆₀₎ = 2.77, p = .006.

For the mPFC (Fig. 4D), a significant main effect of Group, F_(1,86) = 7.61, p = .023, w² = .07, denoted that the overall remote > recent mean signal difference in children was higher than in young adults, b = -.10, t₍₈₆₎ = -2.76, p = .007.

For the vlPFC (Fig. 4E), a significant main effect of Group, F_(1,82) = 31.35, p = <.001, w² = .13, indicated an overall lower remote > recent mean signal difference in children compared to young adults, b = -.125, t₍₁₀₈₎ = -3.91, p < .001. In addition, a significant main effect of Session, F_(1,99) = 10.68, p = .005, w² = .09, pointed out overall higher remote > recent mean signal difference on Day 14 compared to Day 1, b = .08, t₍₉₉₎ = 3.27, p = .001.

For the precuneus (Fig. 4F), a significant main effect of Group, F_(1,161) = 5.09, p = .027, w² = .02, indicated an overall lower remote > recent mean signal difference in adults compared to children, b = -.05, t₍₁₆₀₎ = -2.26, p = .037. In addition, a significant main effect of Session, F_(1,161) = 6.50, p = .036, w² = .03, denoted an overall lower remote > recent contrast for Day 14 compared to Day 1, b = -.05, t₍₁₆₀₎ = -2.55, p = .012. Although the remote > recent contrasts were mostly negative, the mean activation for recent and remote items — each relative to the implicit baseline — was significantly greater than zero for all delays and group (all p < .023), except for children’s recent items on Day 1 (p = .056; see Table S7 for details).

For the LOC (Fig. 4H), a significant main effect of Group, F_(1,82) = 9.12, p = .015, w² = .09, indicated a higher remote > recent mean signal difference in young adults compared to children, b = .07, t₍₈₂₎ = 3.02, p = .003. Additionally, a significant main effect of Session, F_(1,97) = 16.76, p = <.001, w² = .14, showed an overall increase in remote > recent mean signal difference on Day 14 compared Day 1, b = .07, t₍₉₇₎ = 4.10, p = <.001. Furthermore, a significant Session x Group interaction, F_(1,81) = 6.42, p = .032, w² = .06, demonstrated higher increase in remote > recent mean signal difference over time in adults compared to children, b = .09, t(81) = 2.53, p = .013.

Of note, we conducted an additional univariate analysis using a subsample that included only participants who needed two learning cycles to reach the learning criteria (see Table S9.1 for details). The subsampled results fully replicated the findings from the full sample and demonstrated that the amount of re-exposure to stimuli during encoding did not affect consolidation-related changes in memory retrieval at the neural level.

In summary, our findings revealed distinct consolidation-related neural upregulation for remote memory between children and adults. From Day 1 to Day 14, adults showed higher increase in remote > recent signal difference for remembered items in the posterior PHG, LOC, and cerebellum than children. Adults showed overall higher remote > recent difference in the vlPFC than children, while children showed overall higher remote > recent difference in the mPFC than adults. Furthermore, we observed a constant activation of anterior and posterior HC, anterior PHG, and RSC in memory retrieval across age groups irrespective of memory type or delay.

Neural-behavioural Correlation

We further investigated whether neural upregulation (i.e., remote > recent univariate signal difference) is related to memory performance. Specifically, considering all ROIs simultaneously and differential directionality of remote > recent signal differences, we investigated whether any specific profile of ROI constellation of neural upregulation is related to variations in memory performance. For this purpose, we employed the partial least square correlation analysis (PLSC; Abdi, 2010; Abdi & Williams, 2013). With regard to the interconnectedness of the predefined ROIs, the PLSC is a well-suited method to address multivariate associations between neural measures and memory measures. Consequently, latent variables that represent differential profiles of ROI’s neural upregulations with robust relation with either short- or long-delay variations in memory performance were extracted (for more detailed description of the PLSC method, refer to Method section). In addition, we derived for each subject a value that denotes a within-person robust expression of either short- or long-delay brain profile.

For each delay, the permutation test of significance resulted in a single latent variable that reliably and optimally represents across age groups (i) the associations of short delay ROI neural upregulations with variations in short-delay memory accuracy (Fig. 5A; r = .536, p = .0026); and (ii) the associations of long delay ROI neural upregulations with variations in long-delay memory accuracy (Fig. 5B; r = .542, p = .0024). With further bootstrapping, we identified Z- scores estimates of robustness (larger/smaller than ± _1.96 (a < 0.05)) of the components within the multivariate brain profiles across all participants. Thus, for short delay, we observed that higher memory accuracy was robustly associated with greater neural upregulations in the anterior PHG (Z-score = 2.161, r = .347) and vlPFC (Z-score = 3.457, r = .640), as well as with lesser neural upregulation in precuneus (Z-score = -2.133, r = -.323) and cerebellum (Z-score = -2.166, r = -.371) across age groups. In contrast, for long delay, we observed that higher memory accuracy was robustly associated with greater neural upregulation in the vlPFC (Z-score = 3.702, r = .492), RSC (Z-score = 4.048, r = .524), and LOC (Z-score = 3.568, r = .455), and with lesser neural upregulation in mPFC (Z-score = -2.958, r = -.394) across age groups. The identified latent variables indicate that substantial amount of variance (short delay: r = .536 and long delay: r = .542) in either short- or long-delay memory performance was accounted by the identified differential functional profiles of brain regions.

Multivariate short- and long-delay brain profiles of neural upregulation (remote versus recent neural activation differences) are associated with variations in memory accuracy.
A) *Short Delay Brain Profile*. Latent variables weights or saliences for each ROI build up one latent variable that expresses a composite short-delay brain profile across both age groups . B) *Long Delay Brain Profile*. Latent variables weights or saliences for each ROI build up one latent variable that expresses a composite long-delay brain profile across both age groups. The bar plot shows the bootstrap ration (BSR) values for the latent variable, reflecting the stability of the relationship between brain activation and memory performance. Stability of salience elements is defined by Z-scores (depicted as red line: a value larger/smaller than ± _1.96 is treated as reliably robust at (a < .05). C) Short Delay Brain Scores by Group. D) Long Delay Brain Scores by Group. Each box represents the distribution of brain scores within a group, with central lines indicating the median and boxes showing the interquartile range. Whiskers represent the full range of non-outlier values. *Note:* PHGa – anterior parahippocampal gyrus; PHGp – posterior parahippocampal gyrus; HCa – anterior hippocampus ; HCp – posterior hippocampus; PC– precuneus; vlPFC – ventrolateral prefrontal cortex; mPFC – medial prefrontal cortex; RSC – retrosplenial cortex; LOC – lateral occipital cortex; CE – cerebellum; r – Spearman’s rank order correlation index.

Identified brain profiles across groups suggest shared patterns between neural mean signal differences in differential sets of ROIs and memory accuracy are consistent across children and adults. As this approach optimizes for consistent covariance pattern across the full sample, it does not test for group-specific profiles. When conducting the same PLS models within each group, no stable latent profile emerged (all p > .069). The reduced within-group sample may have affected the bootstrap-based stability. To address this, we explored whether groups differ in their expression of the common LV (i.e., brain scores). This analysis revealed that children showed significantly lower brain scores than adults both in short delay, t₍₈₃₎ = -4.227, p = .0001 (Fig. 5C), and long delay, t₍₇₄₎ = -5.653, p < .001 (Fig. 5D), suggesting that while the brain- behaviour profile was shared, its expression varied by group.

Taken together, differential short- and long-delay brain profiles of neural upregulation were related to variations in memory accuracy. Despite age-related differences in the derived brain scores, higher expression of within-participant brain score was associated with higher memory retention rates in short and long delay similarly in children and young adults.

Representational similarity results

In addition to distinct univariate neural upregulation for recent and remote memories, children and adults may exhibit differences in neural representations of these memories. Over time, these representations could also undergo consolidation-related transformations. To address this further, we investigated both more differentiated detailed scene-specific reinstatement and more generic category-based neural representations in children and adult.

3.2.2.1 Corrected scene-specific reinstatement

To measure how scene-specific reinstatement at retrieval during fixation time window (after short cue by object presentation; see Fig. 6A) changes over time as memories decay, we computed a scene-specific reinstatement index for each neural representational similarity matrix. We hypothesized that neural patterns evoked by reinstatement of a specific scene without any visual input during fixation time window would be similar to neural patterns evoked by actual presentation of the scene during the scene time window. Therefore, the scene time window was used as a template against which the fixation period can be compared to. Participants were explicitly instructed to recall and visualize the scene and location of the object during fixation time window after being cued by the object. Since the locations were contextually bound to the scene and each object had a unique location in each scene, the location of the object was always embedded in the specific scene context.

Scene-Specific Reinstatement (A) Index Computation (Scene).
A representational similarity index was computed by calculating the averag similarity between activation patterns in the fixation and scene time windows, separately for recent scenes, remote scenes on Day 1, and remote scenes on Day 14. **(B) Scene-Specific Reinstatement**. A corrected scene-specific reinstatement index was computed by assessing the average similarity within-trial similarity between the fixatio and scene time windows and subtracting the average between-trial (set-based) similarity across all other trials. This controls for baseline similarity unrelated to specific scene content. S – scene time window; F – fixation time window; r – Pearson’s correlation index; Δ z – difference between two Fisher transformed r values. * - Activatio patterns.

To investigate how scene-specific reinstatement changes over time with memory consolidation, all analyses were restricted to correctly remembered items. For each specific scene, the correlation between neural patterns during fixation “fixation period” and neural patterns when viewing the scene “scene period” was conducted (Fisher-transformed Pearson’s r; Fig. 6B). A set-based reinstatement index was calculated as an average distance between “fixation” and “scene” period for a scene and every other scene within the stimuli set (Deng et al., 2021; Ritchey et al., 2013; Wing et al., 2015). The set-based reinstatement index reflects the baseline level of non-specific neural activation patterns during reinstatement. We then calculated the corrected scene-specific reinstatement index as the difference between scene-specific and set-based Fisher-transformed Pearson’s r (Deng et al., 2021; Ritchey et al., 2013; Wing et al., 2015). Given the temporal proximity of the fixation and scene time window, we refrain from interpreting the absolute values of the observed scene-specific reinstatement index. However, given that the retrieval procedure is the same over time and presumably similarly influenced by the temporal autocorrelations, we focus primarily on the changes in reinstatement index for correctly retrieved memories across immediate, short, and long delays. In other words, the focus in the following analysis lies on the time-related change in the scene-specific reinstatement index.

First, scene-specific reinstatement indices for recent items — tested on different days — did not significantly differ, as indicated by non-significant main effects of Session (all p > .323) and Session x ROI interactions (all p > .817) in either age group. This indicates that temporal autocorrelation was consistent across scanning sessions. Based on that, we averaged the scene- specific reinstatement indices for recent items across sessions. To investigate time-dependent change in scene-specific reinstatement in children and young adults in the predefined ROIs, we conducted LME models, with delay (recent, remote short and remote long delays), group (children and young adults) for each ROI, controlling for ROI BOLD activation (Varga et al., 2023) during corresponding sessions. All main and interaction effects were FDR-adjusted, and all post hoc tests were Sidak-corrected for multiple comparisons.

Generally, in all predefined ROIs, we observed a significant main effect of Session (all p <

.001) and a significant effect of Group (all p <.004, Fig.7A-G), except for the LOC (p = .271, Fig. 7J). The pattern of time-related decline was similar across age groups, as indicated by no significant Session x Group interactions in all ROIs (all p > .159). There was no significant effect of BOLD activation (all p > .136). The full statistical report on the LME-model is in Supplementary Material in Table S10. A more detailed overview of the observed main effects and their Sidak-corrected post-hoc tests are summarized in the Table 2.

Corrected scene-specific neural reinstatement.
Scene-specific neural reinstatement defined as the difference between Fisher-transformed scene-specific and set- specific representational similarity. Scene-specific neural reinstatement index by group (children vs. adults) and session (Day 0 – recent, Day 1 – remote short delay, Day 14 – remote long delay). Bars represent the mean reinstatement index for each session within each group, with error bars indicating standard error of the mean. Transparent dors show individual participant data points, jittered horizontally for visibility. The x-axis is grouped by group and displayes (A) Hippocampus Anterior; (B) Hippocampus Posterior; (C) Parahippocampal Gyrus Anterior; (D) Parahippocampal Gyrus Posterior; (E) Cerebellum; (F) Medial Prefrontal Cortex; (G) Ventrolateral Prefrontal Cortex; (H) Precuneus; (I) Retrosplenial Cortex; (J) Lateral Occipital Cortex. *p < .05; **p < .01; ***p < .001(significant difference). Error bars indicate standard error.

Statistical overview of LME-model based Sidak corrected post hoc comparisons for scene-specific reinstatement analysis (based on LME-model described in Table S10).

To ensure that the observed scene-specific reinstatement effects were not driven by general signal properties or artefacts unrelated to memory retrieval, we conducted several control analyses.

First, we repeated the reinstatement analysis using the “object period” instead of the “scene period”. The rationale was that the object and the reinstated scene during fixation period are expected to rely on distinct neural representations. In line with this, we did not expect a delay-related decline in reinstatement. The derived object-specific similarity index, which is also subject to temporal autocorrelation, showed no significant effect of Session or Delay in any ROI (all p > .059; see Table S10.1, S10.2), supporting the specificity of the original reinstatement effect.

Second, we tested whether the observed group and delay effects might reflect global or non-specific BOLD signal fluctuations by analysing three control regions within the corpus callosum (genu, body, and splenium), where no memory-related reinstatement is expected. The LME models revealed no significant Group effects in any of the white matter ROIs (all p > .426), indicating no difference between children and adults. Although we observed significant main effects of Session (all p < .001), post hoc comparisons showed that these effects were driven by differences between the recent (Day 0) and most remote (Day 14) sessions. Crucially, the key contrasts of interest — recent vs. Day 1 remote and Day 1 remote vs. Day 14 remote — were not significant (all p > .080; see Table S10.3, S10.4), in contrast to the robust decline observed in key ROIs for scene-specific reinstatement.

Finally, we assessed whether the observed reinstatement effects were specific to successful memory retrieval by examining item-based reinstatement for incorrectly remembered trials. This analysis revealed no session-related decline in any ROI, further supporting the interpretation that the reinstatement effects observed in correctly remembered trials are memory-related rather than driven by unspecific signal changes (see Fig S5).

Taken together, scene-specific reinstatement declined significantly for overnight compared to immediate memories and declined further after a 2-week delay across all ROIs. These results indicate that the main decrease in scene-specific neural reinstatement for successfully consolidated memories occurs already after a short overnight delay and continues with further decline after a longer, fortnight delay.

Gist-like neural representations

Another way to evaluate the quality of neural representations during the post-cue fixation time window is to examine potential shifts in the ongoing balance between differentiated detailed (“verbatim”) and generalized generic (“gist”) memory, as described by the Fuzzy Trace Theory (Brainerd & Reyna, 2002). Although our associative memory paradigm was designed to foster precise, detailed retrieval, it inherently also permits more generic, gist-like retrieval – for example, some participants may recall “a field” without its unique details (yielding a generic field representation), whereas others reinstate the full, specific features of the original scene.

Accordingly, to quantify gist-like representations of the scenes sharing the same category (e.g., field, forest, etc.) during the fixation time window following the object cueing (see Fig. 1A Retrieval; Fig. 8), we computed a gist-like representation index.

Representational Similarity Analysis.
(A) Index Computation (Gist). A representational similarity index was computed by assessing the average similarity for fixation time window for within-category and between-category scenes separately for recent, remote (Day 1), and remote (Day 14) scenes based on both within-run and cross-run comparisons. The diagonal (similarity of fixation time window with itself) was excluded from the analysis. **(B) Gist-like Representation.** A gist-like representation index was computed by assessing the average similarity in fixation time window for the same- category pairs and subtracting from it the any-other-category pairs. S – scene time window; F – fixation time window; r – Pearson’s correlation index. Δ z – difference between two Fisher transformed r values.

First, a within-category similarity indices were computed by correlating the multivoxel patterns during the fixation time window for all correctly remembered scene pairs from the same category (i.e., field, water, housing, forest, infrastructure, indoor, farming), excluding self- correlations. Category exemplars were evenly and randomly distributed across runs, preventing clusters of temporally adjacent trials. By including only correctly recalled trials and building representational similarity matrices from both within-run and cross-run scene pairs, we substantially increased the number of independent pairwise comparisons in the representational similarity analysis – and hence our sensitivity to detect effects. Next, a between-category similarity indices were computed in the same way, but for scene pairs drawn from different categories. Finally, a gist-like representation index was defined as the difference between Fischer-transformed within- and between-category correlations (i.e., [within category_recent r – between category_recent r] and [within category_remote r – between category_remote r] for each session, Fig. 8). Thus, the gist-like representation index reflects the extent to which neural patterns during the fixation window reactivate a generalized category representation (i.e., forest) – over and above any nonspecific similarity to scenes from other categories.

The non-zero values in this index reflect gist-like representation, as the similarity distance would be higher for pairs of trials within the same category, indicating more generic representation (e.g., during representation of scenes belonging to a category “forest”, participants may tend to recall a generic image of some forest without any specific details). In other words, the representation of a more generic, gist-like image of a forest across multiple trials should yield more similar neural activation patterns. Not significant gist-like representation would indicate that even within the same category, representation of specific scenes is sufficiently differential and rich in details, rendering them dissimilar (e.g., participants may tend to recall detailed image of forests: fall forest with yellow trees, dark pike-tree forest, light summer forest with young birch trees, etc.).

First, we aggregated the gist-like representation indices for recent items on Day 1 and Day 14, as there were no significant differences between sessions in ROIs (all p > .231). Then we applied a one-sample permutation t-test to test for significance of all gist-like indices against zero in each ROI (for full overview see Table S12). FDR-corrected values revealed that young adults showed only remote Day 1 gist-like representation in LOC (p = .024), while significant remote gist-like representation was observed in children on Day 1 in precuneus (p = .044) and LOC (p = .024), and on Day 14 in the mPFC (p = .013) and vlPFC (p = .007). Following this, we further analysed group differences separately for each ROI that showed significant gist-like representation, controlling for the BOLD mean activation in each ROI during corresponding sessions.

Second, we investigated the time-dependent change in gist-like representation in ROIs that showed gist-like representation. For the mPFC (Fig. 9A), we observed a significant main effect of Group, F_(1,244) = 6.55, p = .011, ω² = .02, indicating significantly higher gist-like representation in the mPFC in children compared to young adult, b = .011, t₍₈₂₎ = 2.52, p = .013. Additionally, a significant main effect of Session, F_(1,244) = 3.89, p = .022, ω² = .02, indicated higher remote Day 14 compared to remote Day 1 gist-like representation, b = .014, t₍₁₈₀₎ = 2.64, p = .027. For the vlPFC (Fig. 9B), we observed a significant effect of Session, F_(1,174) = 4.45, p = .013, ω² = .04, indicating higher remote Day 14 gist-like representation compared to recent one, b = .013, t₍₁₉₅₎ = 2.91, p = .012. A significant Session x Group interaction, F_(1,167) = 3.04, p = .05, ω² = .02, highlighting significantly higher remote Day 14 gist-like representation in children compared to young adults, b = .017, t₍₂₄₉₎ = 2.52, p = .037. Neither LOC nor precuneus showed any significant main or interaction effects (all p > .062; Fig. 9C and 9D). Taken together, only the child group showed gist-like representation in the medial and ventrolateral prefrontal brain regions that was significantly higher during retrieval of long delay remote memories, indicating a reorganization of memory representations in children.

Gist-like Representations Bar plots show mean gist-like representation index (difference in Fisher’s z-transformed (Δ z) similarity: *within- category* –– *between-category*) in each group (Children, Young Adults) and session (Day 0, Day 1, Day 14), computed from combined within- and cross-run comparisons.
Error bars indicate ±1 standard error of the mean. A representation value above zero (denoted by red asterisks) reflect greater neural pattern similarity during fixation time window between item from the same category than across categories. Bar positions are grouped by age group (x-axis). Session-specific estimates (Day 0, 1, 14) are differentiated by line of bar border. (A) Medial Prefrontal Cortex; (B) Ventrolateral Prefrontal Cortex; (C) Lateral Occipital Cortex (D) Precuneus; *p < .05; **p < .01; ***p < .001 (significant difference; FDR corrected for multiple comparisons); non-significant difference was not specifically highlighted.

Neural-behavioural Correlations

Further, we also explored whether over time, recent, short- and long-delay scene-specific reinstatement and gist-like representations are beneficial or detrimental for memory performance by correlating the indices with memory retention rates. We derived, with a PLSC analysis, latent brain pattern across implicated ROIs that share the most variance with delay-related variations in memory accuracy.

Neural-behavioural correlations (scene-specific reinstatement)

For the scene-specific reinstatement all predefined ROIs in both age groups were included. With further bootstrapping we identified Z-scores estimates of robustness (larger/smaller than ± _1.96 (a < 0.05)) of the components within the multivariate brain profile.

First, for recent delay (30 minutes after learning), the permutation test of significance resulted in a single latent variable that robustly represents the association of scene-specific reinstatement brain profile and memory accuracy across both age groups (Fig. 10B, r = .293, p = .007). Higher recent memory accuracy was robustly associated with greater scene-specific reinstatement in the anterior PHG (Z-score = 3.010, r = .819), posterior PHG (Z-score = 2.575, r = .367), anterior HC (Z-score = 2.629, r = .3713), posterior HC (Z-score = 3.009, r = .417), and precuneus (Z-score = 2.206, r = .318) across age groups.

Multivariate short- and long-delay brain profiles of scene-specific reinstatement are associated with variations in memory accuracy.
A) *Recent Delay Brain Profile.* Latent variables weights or saliences for each ROI build up one latent variable that expresses a composite immediate-delay scene-specific reinstatement brain profile. B) *Short Delay Brain Profile*. Latent variables weights or saliences for each ROI build up one latent variable that expresses a composite short-delay scene-specific reinstatement brain profile. C) *Long Delay Brain Profile*. Latent variables weights or saliences for each ROI build up one latent variable that expresses a composite long-delay scene- specific reinstatement brain profile. Stability of salience elements is defined by Z-scores (depicted as red line: a value larger/smaller than ± _1.96 is treated as reliably robust at (a < .05). The bar plot shows the bootstrap ration (BSR) values for the latent variable, reflecting the stability of the relationship between brain scene-specific neural reinstatement and memory performance. D) Recent Delay Brain Scores. E) Short Delay Brain Scores. F) Long Delay Brain Scores. Each box represents the distribution of brain scores within a group, with central lines indicating the median and boxes showing the interquartile range. Whiskers represent the full range of non-outlier values. *Note:* PHGa – anterior parahippocampal gyrus; PHGp – posterior parahippocampal gyrus; HCa – anterior hippocampus ; HCp – posterior hippocampus; PC– precuneus; vlPFC – ventrolateral prefrontal cortex; mPFC – medial prefrontal cortex; RSC – retrosplenial cortex; LOC – lateral occipital cortex; CE – cerebellum; r – Spearman’s rank order correlation index.

Second, for short delay, the permutation test of significance resulted in a single latent variable that robustly represents the association of scene-specific reinstatement brain profile and memory accuracy across both age groups (Fig. 10B, r = .339, p = .0017). Higher memory accuracy was robustly associated with greater scene-specific reinstatement in the anterior PHG (Z-score = 2.885, r = .371), posterior PHG (Z-score = 2.597, r = .342), anterior HC (Z- score = 3.126, r = .399), posterior HC (Z-score = 2.844, r = .375), vlPFC (Z-score = 2.434, r = .317), mPFC (Z-score = 2.753, r = .333), and LOC (Z-score = 2.176, r = .298) across age groups.

Third, for long delay, the permutation test of significance resulted in a single latent variable that robustly represents the association of scene-specific reinstatement brain profile and memory accuracy across both age groups (Fig. 10C, r = .455, p = <.001). Higher memory accuracy was robustly associated with greater scene-specific reinstatement in the anterior PHG (Z-score = 6.213, r = .414), posterior PHG (Z-score = 4.810, r = .334), anterior HC (Z- score = 5.353, r = .389), posterior HC (Z-score = 4.707, r = .354), precuneus (Z-score = 3.404, r = .281), vlPFC (Z-score = 3.291, r = .266), RSC (Z-score = 3.72, r = .293), LOC (Z-score = 3.288, r = .282), and cerebellum (Z-score = 3.842, r = .308) across age groups.

Identified brain profiles across groups suggest shared patterns between neural mean signal differences in differential sets of ROIs and memory accuracy are consistent across children and adults. As this approach optimizes for consistent covariance pattern across the full sample, it does not test for group-specific profiles. When conducting the same PLS models within each group, no stable latent profile emerged (all p > .069). The reduced within-group sample may have affected the bootstrap-based stability. To address this, we explored whether groups differ in their expression of the common LV (i.e., brain scores). This analysis revealed that children showed significantly lower brain scores than adults both in immediate delay, t₍₈₅₎ = -3.971, p = .0001 (Fig. 10C), in short delay, t₍₈₁₎ = -2.973, p = .004 (Fig. 10C), and long delay, t₍₇₀₎ = - 2.659, p = .01 (Fig. 10D), suggesting that while the brain-behaviour profile was shared, its expression varied by group.

These results underscore the importance of scene-specific reinstatement in positively contributing to memory performance for detailed associative information both in children and adult.

Neural-behavioural correlations (gist-like representations)

For gist-like representations, we included only those ROIs that showed significant gist-like representations (i.e., for short delay: LOC, and precuneus across both age group; for long delay: vlPFC and mPFC in child group).

First, for short delay, the permutation test of significance did not result in a single latent variable that robustly represents the association gist-like representations brain profile and memory accuracy in children (Fig. 11A, r = .221, p = .065). Second, for long delay, the permutation test of significance resulted in a single latent variable that robustly represented the association of gist-like representations brain profile and memory accuracy in the child group (Fig. 11B, r = .516, p = .0014). The higher long delay Day 14 memory accuracy was robustly associated with lower gist-like representations in the mPFC (Z-score = -2.396, r = -.498), and vlPFC (Z-score = -5.918, r = -.876) in children.

Multivariate short- and long-delay brain profiles of gist-like representations are associated with variations in memory accuracy.
A) *Short Delay Brain Profile*. Latent variables weights or saliences for each ROI build up one latent variable that expresses a composite short-delay gist-like representations brain profile across age groups. B) *Long Delay Brain Profile*. Latent variables weights or saliences for each ROI build up one latent variable that expresses a composite long-delay gist-like representations brain profile in child group. The bar plot shows the bootstrap ration (BSR) values for the latent variable, reflecting the stability of the relationship between brain gist- like neural representations and memory performance. Stability of salience elements is defined by Z-scores (depicted as red line: a value larger/smaller than ± 1.96 is treated as reliably robust at (a < .05). *Note:* vlPFC – ventrolateral prefrontal cortex; mPFC – medial prefrontal cortex; LOC – lateral occipital cortex; r – Spearman’s rank order correlation index.

The significant negative association between long delay gist-like representations in both prefrontal brain regions and memory accuracy observed in children underscores that gist-like representations was detrimental to memory performance for detailed associative information in children in long delay. Taken together, more differentiated detail-rich neural reinstatement was related to better memory retrieval in both children and young adults. On the other hand, more gist-like neural representations, uniquely found in children, was related to worse memory retrieval.

Discussion

In the present study, we investigated system-level memory consolidation of object-location associations across three delays: immediately after learning, after one night of sleep (short delay) and after two weeks (long delay). We tracked changes in neural activation and multivariate activation patterns over time, comparing 5-to-7-year-old children and young adults. Our main findings are as follows: (i) Children showed greater decline in memory retention compared to young adults at both short and long delays. (ii) Regarding neural upregulation – reflected as the mean difference in activation between remote and recent retrieval – the two age groups showed distinct changes over time. Young adults exhibited an increase in neural upregulation over time in the posterior PHG, cerebellum and LOC, and an overall higher neural upregulation in the vlPFC compared to children. In contrast, only children showed a decrease in neural upregulation over time in the RSC and overall greater neural upregulation in the mPFC than adults. Distinct neural upregulation profiles with specific sets of brain regions were related to immediate, short and long delay memory accuracy. (iii) Using RSA, we found that differentiated scene-specific reinstatement declined over time in both age group. Notably, more generic gist-like representations were observed only in children, particularly in medial and ventrolateral prefrontal regions. Importantly, higher scene-specific reinstatement was related to better memory retention in both age groups, whereas greater gist-like representations were related to lower memory retention at the long delay only in children.

Our study extends previous adult-based findings and, for the first time, demonstrates that the memory retrieval after consolidation in children is accompanied by differential patterns of neural activation of some of the core retrieval regions, attenuated neural reinstatement of detailed memories, and the emergence of generic gist-like representations. These findings suggest that adults leverage mature neural memory systems and extensive existing knowledge base to encode and consolidate new complex information with detailed accuracy. In contrast, children – whose neural system is still developing – may rely more on consolidating gist information as a foundational scaffold for their still sketchy knowledge base, possibly at the cost of episodic detailedness. At this developmental stage, focusing on precise detail may not be yet prioritized (Keresztes et al., 2018). Each of these findings is discussed in detail in the following sections.

Less robust short and long delay memory retention in children compared to young adults

Our findings indicate that preschool children (5 to 7 year old) can encode and retain complex associative and highly contextualized information successfully over extended periods following adaptive learning. However, their overall learning and retrieval performance was lower compared to young adults. Moreover, children exhibited more pronounced declines in memory retention over both short and long delays for correctly learned information, suggesting less robust memory consolidation compared to young adults.

Regarding learning, children needed more cycles to memorize object-scene associations and showed lower learning performance after initial strategic encoding compared to young adults. Although we did not expect children to reach adult-like learning rates given the complex and associative nature of the task (Pressley et al., 1981), we aimed to maximize children’s learning capacities through adaptive learning procedures. To support this, attention allocation and motivation during encoding and learning were closely monitored through the constant presence of the experimenter and the use of feedback questionnaires. In addition, all participants underwent training in elaborative encoding strategies to support later retrieval.

Overall, our findings on learning suggests that children were less adept that adults at utilizing strategic control over encoding, such as creating and maintaining stories to aid their retrieval. This is consistent with previous literature, showing continuous improvement in children’s ability to use elaborative strategies between ages 4 and 8 (Bjorkund et al., 2009; Crowley & Siegler, 1999; Pressley, 1982). Additionally, children at this age may experience difficulties in controlling (Ruggeri et al., 2019) and effectively using their learning strategies over time (Brod, 2021; Shing et al., 2010). Observed lower learning rates may also be attributed to less efficient binding processes in children compared to young adults (Shing et al., 2010; Sluzenski et al., 2006). Although we included only stimuli from the primary school curriculum to reduce age differences in knowledge availability, ongoing maturation of the memory brain network in 5-to-7-year-old children may have attenuated their benefit from pre-existing knowledge and memory aid through strategic elaboration (Ghetti & Bunge, 2012; Lenroot & Giedd, 2006; Nishimura et al., 2015; Ofen, 2012; Shing et al., 2008). Despite these challenges, 5- to-7-year-old children were capable of learning complex associative information to a considerable extent, which aligns with their ability to gradually accumulate world knowledge (Bauer, 2021; Brod & Shing, 2022; Wagner, 2010).

Concerning memory consolidation, our results are in line with previous studies that reported worse memory retention for associative information in school age children compared to adults (Østby et al., 2012; Schommartz et al., 2023, 2024). On the other hand, our results are not in line with sleep-related beneficial effects on mnemonic performance of 7-to-12-year-old children after one night delay (Peiffer et al., 2020; Wang et al., 2018) that were shown for novel stimuli not related to any prior knowledge (in the sense of arbitrary stimuli). As we opted for well-learned information that should allow for rapid creation of new schemas or integration of new associations into already existing schemas, our findings indicate that the beneficial role of sleep on memory consolidation in children compared to adults may not apply for repeatedly and strategically learned information. Deliberate learning is potentially more advantageous for subsequent memory retention in young adults, as this information may be integrated into pre- existing knowledge structures faster (van Kesteren et al., 2013), with higher strategic control of memories upon retrieval and therefore greater accessibility of consolidated memories (Fandakova et al., 2017; Gaudreau et al., 2001). Taken together, our findings indicate that compared to young adults, 5-to-7-year-old children exhibit less robust memory consolidation for well-learned information, suggesting an overall reduced ability to retain detailed memories in children.

To better understand whether observed age-related differences in memory performance reflect changes or differences in memory strength per se, we conducted exploratory analysis using drift diffusion modelling (DDM; Lerche & Voss, 2019; Palada et al., 2016; Ratcliff et al., 2011, 2012; Ratcliff & McKoon, 2008; Zhou et al., 2021). DDM estimates the underlying cognitive mechanisms driving decision-making by jointly modelling accuracy and reaction time, offering a nuanced index of memory strength via the drift rate parameter. This approach allowed us to quantify trial-wise memory accessibility beyond raw performance measures (see Fig. S1 and section S2.1 in Supplementary Materials for detailed overview). The results revealed that children had significantly lower drift rates compared to young adults across all delays, indicating slower and noisier evidence accumulation – possibly due to weaker memory representations. As drift rate closely correlates with memory accuracy (Ratcliff et al., 2011), our findings on the memory strength align with those on memory accuracy during retrieval in both age groups. Crucially, drift rate decreased systematically from recent to remote conditions in both groups, but this decline was steeper in adults. This finding suggests that while adults started with stronger memory traces, these detailed, differentiated traces were also more susceptible to decay. In contrast, children’s already lower drift rates remained relatively more stable over time. This pattern points to qualitative group differences in how memories are initially encoded and subsequently consolidated. The DDM analysis helped us to dissociate group differences in retrieval dynamics from surface-level accuracy. It revealed how response patterns varied as a function of memory strength across time, supporting the conclusion that developmental differences in memory consolidation cannot be fully explained by initial performance alone. Our neural findings suggest that differences in functional engagement of the retrieval network and the characteristics of memory representations being created and retained may underlie the observed behavioural differences.

Differential upregulation of remote > recent neural activation over time in children in comparison to young adults

Analyses of neural upregulation (i.e., remote > recent difference in neural activation) over time allowed us to control for the effects of rapid consolidation during repeated learning, while examining changes in short- or long-delay neural activation (Brodt et al., 2016b, 2018; Yu et al., 2022). First, we observed increased upregulation in the vlPFC over time in both age groups, with young adults showing greater vlPFC upregulation overall. Furthermore, we observed stable greater upregulation in the mPFC over time in children. On the one hand, this may indicate a stronger strategic control over retrieval processes in young adult, given the vlPFC’s role in strategic remembering and retrieval of stored memories (Badre & D’Esposito, 2009; Kuhl et al., 2012). Such vlPFC upregulation was beneficial for memory retention. On the other hand, the observed higher mPFC upregulation in children may reflect less efficient suppression of the default mode network during effortful memory search (Chai et al., 2014; Fair et al., 2008). Over time, cognitive control during memory retrieval may increase as it requires greater effort to recollect elaborative stories to remember the associated spatial context. Strategic control over memories may be present but less pronounced in children due to the more protracted developmental trajectories of prefrontal cortex maturation (Ghetti & Bunge, 2012c; Gogtay et al., 2004; Ofen, 2012; Shing et al., 2010b).

In addition, our results indicate that a more pronounced schema-related retrieval that may be mediated by mPFC to a greater extent in children than in young adults. This extends previous findings on the involvement of mPFC in structured and schema-related retrieval of long-term memories (Takashima et al., 2006; Yamashita et al., 2009) to a child developmental cohort. Higher mPFC upregulation in long delay was negatively related to memory performance, suggesting that it is detrimental to the retention of detailed associative memories. In addition, it may suggest consolidation-related transformation of memory traces into less differentiated, more generic and gist-like memories (Gilboa & Marlatte, 2017; Gilboa & Moscovitch, 2021).

Second, in other constituents of the recollection network (Ranganath & Ritchey, 2012), we observed increased upregulation from short to long delay in the posterior PHG and overall lower upregulation in precuneus (i.e., remote > recent) in young adults. As young adults showed higher memory retention rates for more detail-rich information, this superior memory may be mediated by higher upregulation in the posterior PHG involved in contextual associations and scene memory (Aminoff et al., 2013). In children, PHG undergoes prolonged maturation (Golarai et al., 2007), and its increased functional maturation is related to long-term scene recollection (Chai, 2010). In addition, higher mnemonic distinctiveness of more recent memories (i.e., higher retention rates for detailed information) may also be mediated by RSC and precuneus activation profiles, as these regions are involved in mnemonic vividness, spatial, and associative memory as indicated by other findings from immediate delays (Brodt et al., 2016b; Hebscher et al., 2019; Mitchell et al., 2018; Richter et al., 2016; Tambini & D’Esposito, 2020; Vann et al., 2009). Moreover, lower precuneus upregulation after a short delay and higher RSC upregulation after a long delay was related to better memory performance. Time-related decrease in activation of the posterior brain regions observed in children aligns with previous findings of DeMaster and Ghetti (2013), who reported that the engagement of parietal regions during recollection of correct memories increased with age in 8-to-11-year-old children. Therefore, the continuing maturation of parietal regions in 5-to-7-year-old children (Sowell et al., 2002) presumably contributes to the age-related differences in consolidation-related upregulation observed in these regions.

Third, the observed increase in neural upregulation from short to long delay in the LOC and the cerebellum in young adults is also in line with previous findings suggesting that the cerebellum supports rapid cortical storage of memory traces after repeated exposure – even after 24 hours (Stroukov et al., 2022) – and shows upregulation of neural activation for long-term episodic memory retrieval (Andreasen et al., 1999). Concerning the LOC, prior studies have linked HC-LOC activation to scene-related associative memory consolidation (Tambini et al., 2010) and to human object recognition (Grill-Spector et al., 2001). Moreover, the network comprising the angular gyrus and LOC has been shown to enhance overnight retention of schema-related memories in young adults (van der Linden et al., 2017). Consistent with these findings, we also observed that greater LOC upregulation after a long delay was related to better memory performance. The more pronounced upregulation from short to long delay in these regions in adults suggests that the cerebellum and LOC support long-delay memory retention – one that appears to be functionally immature in middle childhood.

Finally, our findings on age-group and delay-invariant activation in the anterior HC and PHG, and posterior HC during the retrieval of detail-rich memories (i.e., the exact location of an object within a scene) are in line with Nadel & Moscovitch (1997),who postulated that the hippocampal formation and related structures remain involved in detail-rich memories upon their retrieval, irrespective of memory age. For example, Du et al. (2019) reported stable hippocampal involvement during retrieval of associative memory across delays of one day, one week and one month in young adults. Tanrıverdi et al. (2022) also demonstrated that post-encoding coactivation of hippocampal and cortical brain regions may lead to experience-dependent change in memories, highlighting the importance of hippocampal involvement during consolidation. Furthermore, the absence of age-related differences in HC and anterior PHG involvement are also in line with developmental studies that have reported the relative maturity of the HC in middle childhood (Keresztes et al., 2017; Lee et al., 2014; Shing et al., 2010b), which is concomitant with an improvement in the ability to bind event features together into a coherent representation around the age of six years (Sluzenski et al., 2006). Specifically, our finding on hippocampal engagement being robust in children and adults extends the Multiple Trace Theory and the Trace Transformation Theory to a child developmental cohort (Moscovitch & Gilboa, 2022; Nadel et al., 2000). Taken together, the similar engagement of medial-temporal cortex over time in children and adults indicated that the retrieval of well-learned detail-rich memories is mediated by these brain structures already in middle childhood.

To summarize, we provide novel evidence about changes in neural upregulation for successfully consolidated memories over short and long delay, relative to immediately learned memories. While children exhibited adult-like stable neural activation for recent and remote memories in medial-temporal brain regions, young adults relied more on prefrontal, occipital, cerebellar, and parietal brain regions over time, compared to more pronounced reliance on medial prefrontal regions in children. Adults show more mature neocortical consolidation-related engagement, resulting in stronger and more durable detailed memories over time while in children immature neocortical engagement may lead to consequent reduction in memory retention of detailed memories.

Reduced scene-specific reinstatement over time in children and young adults

We found that scene-specific reinstatement decreased over time both in children and young adults, aligning with delay-related decrease in memory retention. Additionally, higher scene- specific neural reinstatement was related to better memory performance in short and long delay in both age groups.

Our findings contribute to the memory consolidation literature by demonstrating that scene-specific neural reinstatement observed in neocortical, medial temporal and cerebellar brain regions supports reinstatement of detailed specific contextual memories. This observation is consistent with the Contextual Binding Theory (Yonelinas et al., 2019), which posits that stronger reinstatement of contextual details can enhance memory retention. The similar decay of these processes over time in both children and adults suggests that the basic mechanisms of contextual binding are present early in development. Additionally, in line with the Trace Transformation Theory (Moscovitch & Gilboa, 2022), our findings suggest that reinstatement patterns continuously transform over time. This transformation, observed across all considered memory-related regions, indicates a consistent and systematic consolidation-related reshaping of the unique scene-specific memory representations over time (Chen et al., 2017).

Our findings on scene-specific reinstatement align with and add to the previous literature that show reliable reinstatement of unique events. For example, our findings align with the effects observed by Masís-Obando et al. (2022) for the immediate recall of story details in key memory regions. Consistent with Oedekoven et al. (2017), our results show that memory representations for unique events can be reliably detected through scene-specific reinstatement even after extended delays. Furthermore, we build on Guo & Yang (2022) by demonstrating how specific ROI-related profiles of neural reinstatement during retrieval correlate with long-term memory retention. Unlike Oedekoven et al. (2017), who reported no time-related differences in reinstatement effects and used the same video clips for immediate and delayed recall – which could have inadvertently reinforced memory through reactivation – our study employed unique stimulus sets for each retrieval sessions, preventing any reconsolidation of mnemonic representations. This approach revealed a significant attenuation of reinstatement patterns after an overnight delay, which further diminished after two weeks, highlighting the importance of intentional reactivation for maintaining the specificity of neural reinstatement.

Our findings indicate similar patterns of scene-specific neural reinstatement between children and young adults. Building on Fandakova et al. (2019), who found similar distinctiveness of neural representations during encoding in 8-to-15-year-old children and adults, our results suggest that this similarity extends to younger ages, showing comparable distinctiveness of neural representations for unique memories from middle to late childhood and early adolescence. Additionally, our research supports the presence of delay-related change in scene-specific reinstatement in 5-to-7-year-old children, albeit at a lower level compared to adults, aligning with and extending previous studies (Benear et al., 2022; Cohen et al., 2019; Golarai et al., 2015), which demonstrated reliable mnemonic reinstatement for visual input (i.e., faces, movie clips) in 5-to-11-year-old children. Furthermore, we extend these findings, by showing that successful of long-term memory retrieval is associated with more differentiated neural reinstatement in both children and young adults, indicating similar mechanisms of detail- rich memory consolidation present as early as 5-to-7 year.

Our results indicate that higher scene-specific neural reinstatement over time correlated with better memory retention in both children and adults. This is in line with the neural fidelity hypothesis (Xue, 2018), suggesting that more similar neural reinstatement reflect less noisy representations of mnemonic information. Convergent evidence showed that higher fidelity of neural representation across study episodes leads to successful memory (Xue et al., 2010, 2013). Similarly, Masís-Obando et al. (2022) reported that more specific neural representations predicted subsequent memory performance in young adults.

Of note, our study design, which resulted in temporal autocorrelation in the BOLD signal between memory retrieval (i.e., fixation time window) and scene observation and response (i.e., scene time window), was consistent across all three delay windows. Since the retrieval procedure remained unchanged over time and was similarly influenced by temporal autocorrelations in addition to several control analyses, we attribute our RSA findings to differences in reinstatement between recent and remote trials. Given that the scene time window for the 3AFC task was constant, the brain signals should exhibit similar perception-based but variably memory-based patterns across all delays.

Furthermore, all items, regardless of retrieval delay, underwent extensive learning and showed successful consolidation, as evidenced by correct recall. This suggests that both the fixation and scene time windows engaged memory-related neural processes. According to Brodt et al., (2016, 2018), rapid consolidation-related neural reorganization can occur immediately after learning, indicating that even during recent retrieval, scenes are processed in a memory- oriented manner. Additionally, during the scene time window, participants engaged in retrieval by selecting the correct object location within the scene. Thus, while the scene time window involved perceptual processing, its impact is consistent across all items due to uniform exposure to repeated learning, making them equally familiar to participants. Although our paradigm per se cannot arbitrate between perception-based and memory-based nature of retrieval during scene presentation, our exploratory univariate analysis during the scene presentations time window (see Figure S6, Table S11 in Supplementary Materials) revealed higher neural engagement in the key memory regions with passing time, supporting memory-related processing during the scene time window.

Taken together, our findings provide novel evidence that although children exhibit more attenuated scene-specific reinstatement compared to young adults, the consolidation-related decrease in differentiated reinstatement follows similar patterns as in adults. This highlights that despite less robust memory consolidation and lower memory strength, children’s neural transformations of distinct memories over time may share the same mechanisms as adults, with scene-specific reinstatement proving beneficial for memory retention in both groups.

Unique Gist-like Representations in Children

The results showed that only children demonstrated the emergence of generic gist-like representations in medial and ventrolateral prefrontal brain regions during successful long-delay retrieval. Furthermore, greater long-delay gist-like representations were associated with poorer long-delay memory accuracy in children. With these findings, we provide the first neural empirical evidence to support the Fuzzy Trace Theory (Brainerd & Reyna, 2002; Reyna & Brainerd, 1995), showing neural reorganization of memory representations in children.

The Fuzzy Trace Theory aims to characterize the shifts in ongoing balance between precise, detailed “verbatim” memory and more generalize, simplified “gist” memory (Brainerd & Reyna, 2002) from a developmental perspective. Our associative object-location task allowed the investigation of these “dichotomy” as it was aimed to cultivate detailed, precise memories for retrieval. Simultaneously, it enabled generalization by creating of more generic representations due to the presence of related category-based information. Adults were able to build upon solid pre-existing knowledge by embellishing them with details and integrating them into these structures. Children, in contrast, with their sparser knowledge, may have focused more on solidifying the structure with overlapping information. Aligning with the Fuzzy Trace Theory, our results suggest that reliance on gist-like memory representations is less effective for long- term retention of complex associative information compared to detailed verbatim memory, which seems to be characteristic of adults.

We also observed short-delay gist-like representations in posterior brain regions: the precuneus in children and the LOC in both age groups. These representations were not directly related to memory accuracy. The LOC is involved in object and scene recognition (Golarai et al., 2007; Grill-Spector et al., 2001) and has been shown to participate in schema-related consolidation or in durable but less specific memories (van der Linden et al., 2017). Its involvement in gist-like representations across age groups suggests occipital areas already engage in some degree of categorical abstraction, especially when stimuli share common visual or contextual features. In contrast, the additional precuneus involvement in children might reflect broader and less differentiated cortical engagement. The precuneus is involved in mental imagery, integration of visuospatial and self-referential information, and episodic simulation (Hebscher et al., 2019; Plachti et al., 2023). In young children, where functional specialization is still developing, memory representations may be less tightly constrained, leading to more diffuse activation patterns across associative and imagery-related regions (Plachti et al., 2023). At short delays, memories may still be relatively strong — as supported by our drift diffusion modelling showing higher drift rates — thus allowing these emerging gist-like signals to coexist with detailed memory traces, perhaps as a by-product of early consolidation.

The emergence of long-delay gist-like neural representations in both the mPFC and vlPFC in children may reflect consolidation-related integration of memory representations into more abstract, generic forms over time. This aligns with the mPFC’s known role in integrating across memories (Schlichting et al., 2015), the increase in semantically transformed representations for related information over time in adults (Krenz et al., 2023), and the integration of new information into schema (Gilboa & Marlatte, 2017; Preston & Eichenbaum, 2013). While gist- like neural representations may support the generalization of information to bolster the sparse knowledge structures in children, this occurs at the costs of memory precision (Reyna et al., 2016). The involvement of the vlPFC in gist-like representations was also stable. The vlPFC has been implicated in controlled semantic retrieval, selection among competing memory traces, and integration of overlapping information (Badre & Wagner, 2007; Simons & Spiers, 2003). In the context of developing memory systems, children’s engagement of the vlPFC may reflect an effort to resolve interference among overlapping scene-object memories by drawing on more abstracted or semantically reduced representations. This aligns with the findings that the vlPFC may support the selection and organization of relevant features, especially under cognitive load or when representations are weak (Bunge et al., 2004; Sanefuji et al., 2011; Trelle et al., 2019). Thus, the coactivation of mPFC and vlPFC in children during long-delay memory retrieval suggests that gist-based retrieval strategies are not only present but possibly compensatory, reflecting an adaptive but less precise means of accessing complex memories.

Importantly, we found that gist-like neural representations in the mPFC and vlPFC at long delay were negatively associated with memory accuracy in children. This suggests that, while gist representations may serve a generalizing function, they are less effective for supporting retrieval of detailed object-location associations. In contrast to our findings, Masís-Obando et al. (2022) demonstrated that more schema-based representations in the mPFC were associated with better subsequent memory performance in adults. However, the study utilized stimuli with clearly differentiable schema and details components, whereas our design required the retention of both contextual details and object associations. It is important to note that in our study, gist- like representations were observed only for correctly remembered items, suggesting that children retained some core aspects of the memory trace, even if details were compromised. This aligns with the idea that gist-based representations preserve the overall meaning or category, but not the specific spatial or contextual bindings necessary for high-fidelity retrieval (Reyna et al., 2016). Thus, the negative correlation may not reflect an entirely detrimental effect, but rather a trade-off between generalization and detail preservation.

Overall, our results are in line with Brainerd et al. (2002), showing that in middle childhood, precise mnemonic representations (i.e., scene-specific reinstatement) and gist-like mnemonic representations can co-exist at the neural level. These findings also extend the adult literature, supporting the notion of qualitative transformations of memory traces, whereby detailed and more schematic, generic memories may simultaneously be present (Chen et al., 2017; St-Laurent & Buchsbaum, 2019; Ye et al., 2020). Building on the postulations from Keresztes et al. (2018) and Ngo et al. (2021), who showed that 5-to-7-year-old children tend to rely more on generalization, our findings suggest that retaining memories with viewer specific details may allow for faster integration of overlapping features into emerging knowledge structures (Bauer, 2021; Gilboa & Marlatte, 2017). In contrast, adults may form strong, highly detailed memories supported by effective strategic retrieval mechanisms, without the need to form gist-like representations. Although category-level reinstatement has been documented in adults (e.g., Kuhl & Chun, 2014; Tompary et al., 2020; Tompary & Davachi, 2017), the absence of such effects in our adult group may reflect differences in study design, particularly our use of non-repeated, cross-trial comparisons based on fixation events. It may also reflect different consolidation strategies, with adults preserving more differentiated or item-specific representations, while children tend to form more schematic or generalizable representations — a pattern consistent with our interpretation and supported by prior work (Fandakova et al., 2019b; Sekeres, Winocur, & Moscovitch, 2018) .

Taken together, our findings provide novel evidence children’s memory consolidation is characterized by a shift toward gist-based representations, supported by mPFC and vlPFC engagement. While these representations may aid generalization and schema-building, they appear less effective for detailed retrieval, especially over long delays. With this we provide the first empirical evidence to support Fuzzy Trace Theory at the level of gist-like neural representations in children. Future research may build on this approach to further explore conditions under which schema-based representations enhance memory performance and how these process differ across development.

Limitations

Several limitations of the current study should be noted. First, our test for memory was based on a 3-alternative forced choice procedure, which was intended to reduce the need for strategic search (e.g., in free recall). As reorganization and stabilization in consolidation depend on the psychological nature of mnemonic representations (Moscovitch & Gilboa, 2022), future studies may employ more demanding recall-based memories to characterize memory consolidation more comprehensively. Particularly, future studies may differentiate mnemonic accessibility vs. precision (Murray et al., 2015; Richter et al., 2016), as they may show differential temporal dynamics in the developing brain and involve differential neural mechanisms. Second, as we included only stimuli congruent with prior knowledge, future studies may introduce knowledge-incongruent information to investigate the beneficial effect of prior knowledge on memory consolidation more directly. Prior knowledge may impact learning and consolidation of information over time differentially by development (McKenzie & Eichenbaum, 2011; van Kesteren et al., 2013; Wang & Morris, 2010). Third, we concentrated on a limited age range in middle childhood. To characterize how neural mechanisms of memory consolidation evolve over time, future studies should include other developmental cohorts. Fourth, we acknowledge that our study design leads to temporal autocorrelation in the BOLD signal when calculating RSA between fixation and scene time windows. Although we argue that our results, given the identical procedure over time, are more attributed to the delay-related changes in the neural reinstatement, future studies should tailor the design of the retrieval procedure to warrant cross-run comparisons. This could be achieved by introducing the same items repeatedly across different runs. Fifth, our task may not have been demanding enough for young adults to fully challenge their memory retention and encourage the formation of more gist-like representations. Future studies could explore this further by using more challenging conditions to enhance the formation of more generic memories in adults, avoid bias related to prior knowledge. Sixth, although we focused on ROIs associated with the recollection network and implicated in retrieval of visual information, we did not investigate the connectivity between these brain regions and how it changes as memories age. Future studies should investigate consolidation-related neural connectivity patterns and their temporal dynamics in the developing brain. Finally, children in our sample were positively biased in socio-demographical score and IQ compared to young adults, which may restrict the generalizability of our results.

Conclusions

In this study, we present novel empirical evidence on the neural mechanisms underlying the less robust memory retention of intentionally learned object-location associations in 5-to-7-year-old children compared to young adults. Our findings reveal that, over time, children show attenuated consolidation-related upregulation in neocortical and cerebellar brain regions during successful retrieval. Furthermore, they appear to form different types of memory representations than young adults: while both groups show delay-related change in detailed scene-specific reinstatement, only children exhibit the emergence of more generic gist-like representations, particularly after a longer delay. Our results suggest that, unlike the mature consolidation systems in young adults, the developing brains of early school-age children supports only partially adult-like neural changes over time. Children show less pronounced neural upregulation in core retrieval regions. At the same time, they appear to rely more on gist-like representations, possibly as a developmental mechanism to scaffold and accumulate schema-relevant knowledge despite weaker detailed memory.

Materials and methods

Participants

Sixty-three typically developing children and 46 young adults were recruited to participate in the study through advertisement in newspapers, on the university campus, word-of-mouth, and city registry. All participants had normal vision with or without correction, no history of psychological or neurological disorders or head trauma, average IQ > 85, and were term-born (i.e., born after 37 weeks of pregnancy). Fourteen children were excluded due to : (i) incomplete task execution and missing data (n = 2); (ii) poor quality of the data (n = 7); (iii) technical issues during data acquisition (n = 5). Seven young adult participants were excluded due to incomplete task execution and missing data (n = 5) or being identified as extreme outlier (n = 2) based on interquartile range (IQR; above Q3upper quartile(75th percentile) + 3xIQR or below Q1lower quartile(25th _percentile) – 3xIQR (Hawkins, 1980)) for memory behavioural measures. The excluded participants were comparable in terms of age, sex, and socio-economic status to the final sample. The final total sample consisted of 49 children (22 female, mean age: 6.34 years, age range: 5.3 – 7.1 years), and 39 young adults (19 female, mean age: 25.60 years, age range: 21.3 – 30.8 years; see Table 1 for more details).

All participants or their legal guardians gave written informed consent prior to participation. The study was approved by the ethics committee of the Goethe University Frankfurt am Main (approval E 145/18). The participants received 100 Euro as compensation for taking part in the study.

Materials and Procedure

Object-location associations task

Stimuli for the object-location association task were chosen based on the social studies and science curriculum for German primary school first and second graders (see similar procedure in Brod & Shing, 2019). The themes were chosen based on ratings provided by four primary school teachers on the familiarity of first graders with the topics. 60 different themes (e.g., classroom, farm, etc.) were chosen, each belonging to one of seven categories (i.e., field, water, housing, forest, infrastructure, indoor, farming). Four scene stimuli and four thematically congruent object pictures were selected for each theme (see Fig. 1 for an example), resulting in 240 individual scenes and 240 individual objects. The 240 object-scene pairs were assigned to versions A and B, each containing 120 object-scene pairs. Each participant was randomly assigned either version A or version B. There were six possible object locations across all scenes. Around each location, there were three possible object placements. The distribution of locations across scenes was controlled to ensure realistic placement of the objects within the scenes (for more detailed information see Supplementary Methods section). The object-location association task consisted of three phases (see Fig. 1):

Initial encoding phase _(Day _0, _Day _1, _Day ₁₄₎. A total of 120 object-location pairs were used to create the trials in this phase, with 60 pairs presented on Day 0, 30 pairs on Day 1, and 30 pairs on Day 14. The initially learned object-scene associations on Day 0 were split in two halves based on their categories. Specifically, half of the pairs from the first set and half of the pairs from the second set of 30 object-scene associations were used to create the set 30 remote pair for Day 1 testing. A similar procedure was repeated for the remaining pairs to create a set of remote object-scene associations for Day 14 retrieval. We tried to equally distribute the categories of pairs between the testing sets. During each trial, participants viewed an object in isolation for 2 seconds, followed by the same object superimposed on a scene at a particular location for 10 seconds. After this, a blank screen with a fixation cross was presented for 1 second. Participants were instructed to memorize the object-location pairs and to remember the exact location of the object within the scene using elaborative encoding strategies, such as creating a story or making a “mental photo” of the scene. Such elaborative encoding strategies have been shown to improve memory performance in both children and adults (Craik & Tulving, 1975; Pressley, 1982; Pressley et al., 1981; Shing et al., 2008);
(ii) Learning phase _(Day _0, _Day _1, _Day ₁₄₎. Following the initial encoding phase, participants continued learning the correct location of the object within the scene through adaptively repeated retrieval-encoding cycles. The cycles continued until participants achieved at least 83% correct responses or until the maximum of four cycles had been completed. The number of cycles therefore ranged from a minimum of two to a maximum of four. The 83% threshold was established during piloting as a guideline to determine the appropriate number of learning-retrieval cycles, rather than as a strict learning criterion. It served to standardize task continuation, not to serve as a basis for excluding participants post hoc. Children who did not reach the 83% threshold after the fourth cycle were still included in the analysis if their performance exceeded chance level (33%). Excluding them would have biased the sample toward higher-performing children and reduced the ecological validity of our findings. Including them ensures a more representative view of children’s performance under extended learning conditions.
(iii) During each trial, participants were first presented with an isolated object for 2 seconds, followed by a one-second blank screen with a fixation cross. They were then shown a scene containing three red-framed rectangles, indicating possible location choices. Participants had to select the correct location by choosing one of the rectangles within 12 seconds, and the chosen rectangle was highlighted for 0.5 seconds. After this, feedback in the form of a smiley face was given, with the happy face for a correct answer, a sad face for an incorrect answer, and a sleeping face for a missed answer. Following the feedback, correct object-location associations were displayed for two seconds if the choice was correct and for three seconds if the choice was incorrect or missed.
(iv) Retrieval phase _(Day ₁ _and _Day ₁₄₎. The retrieval phase was conducted inside the MRI scanner. Participants were presented with 30 recently learnt item and 30 remote items learned on Day 0. The remote item sets for Day 1 and Day 14 based on items from Day 0 did not differ in the learning accuracy in either age group (all p > .06 as based on the analysis of variance).

Participants were instructed to recollect and visualize (“put in front of their mental eyes”) as vividly as possible the location of the object within the scene. In this way we prompted the recall of the scene and the location of the object within this scene.

Each trial began with a fixation cross jittered between 3 to 7 seconds (mean of 5 seconds). Participants were then presented with an isolated object for 2 seconds, followed by the presentation of another fixation cross jittered between 2 to 8 seconds (mean of 5 seconds). Following the fixation cross, participants were prompted with the associated scene and were required to recall the location of the object by selecting one of the three red rectangles on the scene within 7.5 seconds. If participants failed to respond within the deadline, the trial was terminated. No time-outs were recorded for young adults, while 5,4 % of time-out trials were recorded for children and these trials were excluded for analysis. After a choice was made or the response deadline was reached, the scene remained on the screen for an additional 0.5 second. The jitters and the order of presentation of recent and remote items were determined using OptimizeXGUI (Spunt, 2016)which followed an exponential distribution (Dale, 1999). Ten unique recently learned items (from the same testing day) and ten unique remotely learned items (from Day 0) were distributed withing each run (in total three runs) in the order as suggested by the software as the most optimal. There were three runs with unique sets of stimuli, each resulting in thirty unique recent and thirty unique remote stimuli overall.

Assessment of demographic and cognitive covariates

IQ scores were assessed using the German version of the “Kaufman Assessment Battery for Children – Second Edition” (K-ABC II; Kaufman & Kaufman, 2015) in children and the “Wechsler Adult Intelligence Scale – Fourth Edition” (WAIS -IV; Wechsler, 2015) in young adults. General socio-demographic questionnaires to assess socio-demographic characteristics of the participants were administered as well.

Experimental Procedure

The testing was conducted over three days (see Fig. 1B). On Day 0, the experiment began with a short training session aimed at familiarizing participants with the object-location associations task and elaborative encoding strategy, using five object-location pairs. The experimental task started with the initial encoding of unique sets of object-location associations. Participants had to learn two unique sets comprised of 30 object-location associations each. After encoding each set, participants engaged in a brief distraction task where they listened to and had to recall a string of numbers. Next, they underwent a learning phase with retrieval-encoding cycles until they reached a criterion of 83% (or a maximum of four cycles). This was done to minimize variances attributed to encoding, allowing for more accurate comparison of subsequent memory consolidation. Afterwards, the children visited a mock scanner to become familiar with the MRI scanning environment. This procedure involved teaching the children the sounds of MRI scanning and training them to stay still during scanning.

On Day 1, participants first learned a new set of 30 object-location associations, using the same learning procedure as on Day 0. This was followed by retrieval in the MRI scanner, during which they were required to recall 30 object-location associations learnt on Day 0 (short-delay, remote) and another 30 learnt on Day 1 (recent). On Day 14, the same procedure was followed as on Day 1, with a new set of 30 object-location associations. They were again required to recall 30 object-location associations learnt on Day 0 (long-delay, remote) and another 30 learnt on Day 14 (recent). In total, participants completed 60 retrieval trials in the MR scanner on Day 1 and Day 14 each, which took approximately 15-20 minutes. Besides the primary task, participants also completed other psychometric tests across all testing sessions. Additionally, socio-demographic questionnaires were administered to young adults and legal guardians of children.

Data acquisition

Behavioural data acquisition

The task paradigm during all phases was presented using Psychtoolbox (Kleiner et al., 2007) software in MATLAB 9.5, R2018b (MATLAB, 2018). During the encoding and learning phases, stimuli were presented on a computer screen with the resolution of 1920x1080 pixels. During the retrieval phase, an MR-compatible screen with identical resolution was used, and participants used an MR-compatible button box with three buttons. To minimize head movements, foam cushions were placed inside the head coil, and MR-compatible headsets and ear plugs were used to reduce the scanner noise.

Magnetic resonance imaging data acquisition

MR images were acquired on a 3 Tesla SIEMENS PRISMA MRI scanner (Siemens Medical Solutions, Erlangen, Germany) using a 64-channel head coil at Berlin Center for Advanced Neuroimaging, Charité Universitätsmedizin Berlin. Each session started with the acquisition of a localizer and head scout sequences for field-of-view-alignment (FoV) based on anatomical landmarks. T1-weighted structural images were obtained with the magnetization prepared rapid gradient echo (MP-RAGE) pulse sequence (TR = 2500 ms, echo time = 2.9 ms, flip angle = 8°, FoV = 256 mm, voxel size = 1x1x1 mm³, 176 slices). Functional images were acquired using echo-planar imaging sequences (TR = 800 ms, echo time = 37 ms, flip angle = 52°, FoV = 208 mm, 72 slices, voxel size = 2x2x2 mm³, maximally 588 volumes). In addition, gradient echo images (field maps) were acquired before each functional run for correction of magnetic field inhomogeneities.

Behavioural data analysis

Learning and Consolidation

The behavioural analyses were performed with R packages (R Core Team, 2022) in RStudio 2022.07.0 (RStudio, Inc.). Throughout the analyses, statistical significance level was set at < .05.

All p-values were FDR-adjusted for multiple comparisons due to multiple ROIs. As a measure of baseline memory performance, final learning accuracy was defined as the percentage of correctly learned locations in relation to the total number of items at the end of the learning phase of each day. To examine memory consolidation, we quantified memory retention across delays, focusing on trials that were correctly learned on Day 0. From these trials, we calculated the percentage of correct responses, separately for Day 1 and Day 14. We conducted a linear mixed-effect model (LME model) for memory measures using the lmer function from the lme4 package in R (Bates et al., 2015) and lmerTest (Kuznetsova et al., 2017). All LME models were calculated with maximum-likelihood estimation and Subject as the random intercept to account for between-subject variability in retention accuracy.

First, to investigate baseline memory performance, we analysed whether final learning accuracy in all three sessions differed between groups. For that, we included the within-subject factor of Session (Day 0, Day 1, and Day 14) and the between-subject factor of Group (children and young adults) in the LME model. Second, for memory retention rates, we included Session (Day 1, Day 14), Item Type (recent, remote), and Group (children, young adults) as fixed factors in the LME model. In addition, we added Subjects as random factor, as well as IQ, Sex, and Handedness (Kang et al., 2017; Willems et al., 2014) as covariates. Degrees of freedom were adjusted using the Satterthwaite’s method (Kuznetsova et al., 2017) if the assumptions of homogeneity of variances were violated. Significant effects were followed up with Sidak post- hoc multiple comparisons. For further group differences in socio-demographic measures, we performed one-way independent analysis of variance (ANOVA) or Games-Howell test (S. Lee & Lee, 2018). The effect size estimation was performed using omega squared (w²) as a less biased estimate for reporting practical significance of observed effects (Okada, 2013). To determine the amount of variance explained by the model, we used partR2 package (Stoffel et al., 2021).

fMRI data pre-processing

Anatomical and functional MR data was pre-processed using fMRIPrep 22.0.0 (Esteban et al., 2019), based on Nipype 1.8.3 (Gorgolewski et al., 2011). Detailed description of the anatomical and functional data pre-processing can be found in Supplementary Methods section.

fMRI data analysis

FMRI data analysis was conducted with FEAT in FSL (Version 6.0.1, FMRIB’s Software Library, Jenkinson et al., 2012; Smith et al., 2004; Woolrich et al., 2009). Prior to that, single runs were excluded if there was (i) root-mean-square realignment estimates(Jenkinson et al., 2002) exceeding 1mm; and (ii) framewise displacement (FD) > 1, and (iii) less than two correct trials in the entire run. Based on these criteria, 14 single runs and two complete sessions in children were excluded from further analysis.

General Linear Model for Mean Activation

For each participant’s fMRI data, a first-level analysis was performed separately for each run using a general linear model (GLM) with eight experimental regressors. The regressors represented the onset and duration of the following events: (i) object recent_correct, (ii) object remote_correct, (iii) scene recent_correct, (iv) scene remote_correct, (v) object recent_incorrect, (vi) object remote_incorrect, (vii) scene recent_incorrect, (viii) scene remote_incorrect. The duration of object events was two seconds, while the duration of scene events was dependent on the reaction time (RT). The regressors were convolved with a hemodynamic response function, modelled with a double- gamma function with first and second derivatives. Confounding regressors were also included in the GLM and were calculated with fMRIPrep, namely six rigid body realignment parameters, framewise displacement, and standardised DVARS (D, temporal derivatives over time courses; VARS, variance over voxels). In addition, six anatomic component-based noise correction (CompCor, a combination of cerebrospinal fluid and white matter) regressors, global signal (we re-run the entire first-level univariate analysis using the pipeline that excluded the global signal. The resulting activation maps (see Supplementary Figure S3.2, S4.2, S5.2, S9.2) differed notably from those obtained with the original pipeline. Specifically, group differences in cortical regions such as mPFC, cerebellum, and posterior PHG no longer reached significance, and the overall pattern of results appeared noisier. Additional analyses revealed that: (i) the global signal was not dependent on group or session in our sample; (ii) the global signal reduced inter-subject variability in children, likely reflecting improved signal quality; (iii) the global signal stabilized the signal and attenuated non-neuronal variability) and cosine drift terms were included, based on previous methodological studies (Ciric et al., 2017; Esteban et al., 2020; Jones et al., 2021; Satterthwaite et al., 2013). The functional images were spatially smoothed with SUSAN (Smallest Univalue Segment Assimilating Nucleus, Smith & Brady, (1997)), applying a Gaussian kernel with a full-width at half-maximum of 6 mm. A high- pass Gaussian filter with a cut-off period of 80 s was applied. Contrasts were defined for each run per subject, and within-subject fixed-effects averaging across runs within each session was conducted per subject. Group-level analysis was performed with FLAME1 (Woolrich et al., 2004) within each session, based on the statistical maps obtained from the first-level analysis. The main contrast of interest was object remote > object recent, as we were primarily interested in the reinstatement of object-scene association before the scene was shown. Univariate analysis was performed with statistical tests voxel-wise and corrected for multiple comparisons with cluster-based thresholding using a z threshold of z > 3.1 and a two-tailed probability of .001.

Several a priori regions of interest (ROI) were selected based on anatomical masks: bilateral anterior/posterior hippocampus (HC), bilateral anterior/posterior parahippocampal gyrus (PHG), and RSC. The masks for the medio-temporal lobe ROIs were taken from the Harvard- Oxford Cortical and Subcortical Atlases (threshold at 30% probability; (Desikan et al., 2006)), and the mask for RSC was taken from the Talairich Atlas (threshold at 30% probability; Lancaster et al., 2000; Talairich & Tournoux, 1988) . For further ROIs in large cortical regions (namely mPFC, precuneus, LOC, vlPFC, and cerebellum), anatomical masks derived from Harvard-Oxford Cortical and Subcortical Atlases or Juelich Atlas (Amunts et al., 2020) were combined with a functional task-related map, based on mean activation across recent and remote objects across all participants and sessions, at voxel-wise threshold of z > 3.1 and a two-tailed probability of .001. With these masks, the mean percent signal change (from the contrast of object remote_correct > object recent_correct) was extracted using FEAT in FSL for each session of each participant, which were then submitted to statistical analysis in R. A linear mixed-effect model was set up to model percent signal change. The linear mixed effect model was calculated with maximum-likelihood estimation and Subject as random intercept to account for between- subject variability. As fixed factors, we included Session (Day 1, Day 14) and Group (children, young adults). We also added IQ and sex and handedness and mean reaction time as covariates to the model.

Representational similarity analysis for neural reinstatement

For the multivariate analysis, single-event (i.e., for every event on each trial) β (beta) estimates (Beta estimates were obtained from a Least Square Separate (LSS) regression model. Each event was modeled with their respective onset and duration and, as such, one beta value was estimated per event (with the lags between events differing from trial to trial). The jitter was included to enable an estimation of the patterns evoked by the events and all subsequent RSA analyses were conducted normally on these estimates without further controls.) were first computed by modelling BOLD time course with a series of Generalized Linear Models (GLM) using the Least Square Separate method (LSS; Abdulrahman & Henson, 2016; Mumford et al., 2012). Each trial contained three events (i.e., object, fixation, and scene), hence a total of 30 GLMs (i.e., ten for objects, ten for fixations, and ten for scenes) were computed for each run, session, and participant. Each of the GLMs contained four experimental regressors: for instance, one for the single fixation of interest and three more for the rest of the events (i.e., for all other fixations except the fixation of interest, for all objects, and for all scenes). The same set up was followed for the object GLMs and the scene GLMs. The regressors were convolved with the hemodynamic response function, which was modelled with a double-gamma function with first and second derivatives. Additionally, the same confounding regressors as the ones for mean- activation analysis were included.

Next, to assess whether mnemonic reinstatement during the fixation period, during which participants were supposed to recollect the scenes associated with the objects, was more item- specific or gist-like, we used the single-event beta estimates of each trial to compute two types of Representational Similarity Matrices (RSMs; Kriegeskorte, 2008). Each RSM was computed separately for each previously identified ROI. All subsequent analyses were performed with homebrew scripts available at https://github.com/iryna1schommartz/memokid_fmri.

Scene-specific reinstatement: To measure the extent of scene reinstatement following object presentation, we computed a scene-specific reinstatement index for each neural RSM, separately for correctly remembered recent and correctly remembered remote scenes of each session (see Figure 5A-B). For each specific scene, we computed the index as the average distance between the “fixation” and “scene period” (Fisher-transformed Pearson’s r; Fig. 5B), which was the correlation between neural patterns during fixation and neural patterns when viewing the scene. We averaged the index across all items, all runs within a session, and then within subjects, resulting in a single value per predefined ROIs and sessions. In addition to scene-specific reinstatement, we also calculated a set-based reinstatement index as a control analysis, which was calculated as an average distance between “fixation” and “scene period” for a scene and every other scene within the stimuli set (Deng et al., 2021; Ritchey et al., 2013; Wing et al., 2015). The set-based reinstatement index reflects the baseline level of non-specific neural activation patterns during reinstatement. We then calculated the corrected scene-specific reinstatement index as the difference between set-based and scene-specific Fisher-transformed Pearson’s values (Deng et al., 2021; Ritchey et al., 2013; Wing et al., 2015). A higher value in this index denotes more distinct scene reinstatement patterns. Only correctly retrieved items were included for this analysis. We obtained the corrected scene-specific reinstatement indices for recent items on Day 1 and Day 14 and tested them for session-related differences. If no differences were observed, the set-corrected scene-specific reinstatement indices for recent scenes on Day 1 and 14 were averaged to obtain a single value per ROI and participant. We then conducted a final LME model, separately for each ROI, with Subject as the random factor and Delay (recent, remote Day 1, remote Day 14) and Group (children, young adults) as fixed factors. In addition, mean neural activation was added as a covariate into the model.

Gist-like representations: Seven overarching thematic categories were identified during stimuli selection (i.e., field, water, housing, forest, infrastructure, indoor, farming). A within-category similarity indices were computed based on fixation time window of correctly remembered items belonging to the same category and excluding the similarity computation for the fixation time windows of correctly remembered items with itself. A between-category similarity indices were computed based on fixation time window of correctly remembered items belonging to different categories. These indices were computed for each run and across runs, Z-standardized and then averaged. A gist-like representations index was computed by subtracting between-categories from within-categories Z-transformed distances ([within category_recent – between category_recent] and [within category_remote – between category_remote] for each session, Fig. 7A-B) . The non-zero values in this corrected index reflect gist-like representations, as the similarity distance would be higher for pairs of trials with the same categories than for pairs with different categories. We applied a one-sample permutation t-test to test for significance in each ROI. Similar to the procedure described above, gist-like representations indices for recent items on Day 1 and Day 14 were averaged when no difference was found, obtaining a single value per ROI and participant. We then conducted a final LME model, separately for each ROI, with Subject as the random factor and Delay (recent, remote Day 1, remote Day 14) and Group (children, young adults) as fixed factors and mean neural activation as a covariate, to analyse any delay-related differences in gist-like representations index for successfully retrieved trials. Finally, we also explored whether over time, long-delay item-specific and representations is beneficial or detrimental for memory performance by correlating the index with memory retention rates. We tested whether this correlation within each group differs based on ROI. If no differences were observed, we averaged representations indices across ROIs that showed significant representations in long delay.

Brain-behavioural relations

To examine the connections between brain function and behavior, we utilized brain metrics generated via the application of a multivariate method known as Partial Least Square Correlation (PLSC) (Abdi & Williams, 2013; McIntosh et al., 1996; Schommartz et al., 2023). This approach focuses on multivariate links between specified neural measures in Regions of Interest (ROIs) and fluctuations in memory performance over short and long delays across different age cohorts. We argue that this multivariate strategy offers a more comprehensive understanding of the relationships between brain metrics across various ROIs and memory performance, given their mutual dependence and connectivity (refer to Genon et al. (2022) for similar discussions).

Initially, we established a cross-subject correlation matrix that included (i) a matrix (n x 10) comprising short and long delay brain indices (encompassing both neural upregulation, scene-specific and gist-like indices) for all specified ROIs, and (ii) a vector (n-sized) that represents a continuous assessment of either short-delay or long-delay memory performance (RR): R = CORR (RR, ROIs). Prior to the correlation, all metrics were standardized. The decomposition of this correlation matrix, R = USV’, was performed using singular value decomposition, yielding singular vectors U and V, or saliences. Here, the left singular vector symbolizes the weights for short- or long-delay memory accuracy (U), while the right singular vector represents ROI weights (V) indicating specific neural indices that optimally represent R, with S being a matrix of singular values.

Subsequently, PLSC identifies a singular estimable latent variable (LV), uncovering pairs of latent vectors with maximal covariance that best describe the association between memory retention rates and ROI neural indices. Therefore, LV delineates distinct patterns of neural indices across ROIs closely linked to either short- or long-delay retention rates. Moreover, we computed a singular value for each participant, termed a within-person “profile,” summarizing the robust expression of the defined LV’s pattern. This was achieved by multiplying the model- derived ROI weight vector (V) with the within-person estimates of ROI neural metrics.

To verify the generalizability and significance of the saliences or LV, we performed 5000 permutation tests to derive a p-value. We also determined the stability of the within-LV weights by bootstrapping with 5000 resamples, calculating a bootstrap ratio (BSRs) by dividing each ROI’s salience by its bootstrap standard error. BSRs, analogous to Z-scores, serve as normalized robustness estimates; hence, values exceeding 1.96 (p < .05) indicate statistically stable saliences. Utilizing PLSC for multivariate statistical analysis in one step eliminates the need for multiple comparisons correction across all ROIs (McIntosh et al., 1996).

To avoid multicollinearity and redundancy, which might diminish the power to uncover neural-behavioral links through conventional statistical approaches, we initially derived a single metric per participant—a participant’s expression of the latent brain pattern (i.e., brain score) for neural indices that share the most variance with either short-delay or long-delay memory accuracy variations. We further explored how these brain patterns correlate with memory performance.

Data availability statement

The datasets generated and analysed during the current study are available from the corresponding authors upon reasonable request.

Acknowledgements

We thank all the children and parents who participated in the study. This work conducted at the Berlin Center for Advanced Neuroimaging (BCAN) and was supported by the Deutsche Forschungsgemeinschaft (DFG; German Research Foundation, Project-ID 327654276, SFB 1315, “Mechanisms and Disturbances in Memory Consolidation: from Synapses to Systems”). The work of YLS was also supported by the European Union (ERC-2018-StG-PIVOTAL- 758898) and the Hessisches Ministerium für Wissenschaft und Kunst (HMWK; project ’The Adaptive Mind’). We also thank Henriette Schultz and Nina Wald de Chamorro for their assistance with study management and data collection.

Additional information

Author contributions

Y.L.S, C.B., A.K secured funding. I.S and Y.L.S, C.B., A.K contributed to conception and design of the study. I.S. and P.L. performed data collection and data curation. I.S., P.L., and J.O.-T. performed the statistical analysis. I.S. wrote the first draft of the manuscript. All authors contributed to manuscript revision, read, and approved the submitted version.

Funding

DFG; German Research Foundation, SFB 1315, “Mechanisms and Disturbances in Memory Consolidation: from Synapses to Systems” (Project-ID 327654276)

Additional files

Supplementary Materials

Note

This reviewed preprint has been updated to correct level headings.

Significance of findings

Strength of evidence

Abstract

Research highlights

Introduction

Neural correlates of memory consolidation

Mnemonic transformation and reinstatement across consolidation

Neural correlates of memory consolidation and mnemonic transformation and reinstatement in middle childhood

Aim of the current study

Results

Behavioural results

Final Learning Performance

(A) Trial Structures in the Experimental Task.

Sample characteristics by age group

Memory Retention Across Time

Retention rates for initially correctly learned items.

fMRI Results

Mean activation for remote > recent memory in ROIs

Mean Signal Differences Between Correct Remote and Recent Memories.

Neural-behavioural Correlation

Multivariate short- and long-delay brain profiles of neural upregulation (remote versus recent neural activation differences) are associated with variations in memory accuracy.

Representational similarity results

3.2.2.1 Corrected scene-specific reinstatement

Scene-Specific Reinstatement (A) Index Computation (Scene).

Corrected scene-specific neural reinstatement.

Statistical overview of LME-model based Sidak corrected post hoc comparisons for scene-specific reinstatement analysis (based on LME-model described in Table S10).

Gist-like neural representations

Representational Similarity Analysis.

Neural-behavioural Correlations

Neural-behavioural correlations (scene-specific reinstatement)

Multivariate short- and long-delay brain profiles of scene-specific reinstatement are associated with variations in memory accuracy.

Neural-behavioural correlations (gist-like representations)

Multivariate short- and long-delay brain profiles of gist-like representations are associated with variations in memory accuracy.

Discussion

Less robust short and long delay memory retention in children compared to young adults

Differential upregulation of remote > recent neural activation over time in children in comparison to young adults

Reduced scene-specific reinstatement over time in children and young adults

Unique Gist-like Representations in Children

Limitations

Conclusions

Materials and methods

Participants

Materials and Procedure

Object-location associations task

Assessment of demographic and cognitive covariates

Experimental Procedure

Data acquisition

Behavioural data acquisition

Magnetic resonance imaging data acquisition

Behavioural data analysis

Learning and Consolidation

fMRI data pre-processing

fMRI data analysis

General Linear Model for Mean Activation

Representational similarity analysis for neural reinstatement

Brain-behavioural relations

Data availability statement

Acknowledgements

Additional information

Author contributions

Funding

Additional files

Note

References

Article and author information

Author information

Iryna Schommartz

Philip F Lembcke

Javier Ortiz-Tudela

Martin Bauer

Angela M Kaindl

Claudia Buss*

Yee Lee Shing*

Author Notes

Version history

Cite all versions

Copyright

Metrics

Claudia Buss

Yee Lee Shing