Abstract
Alzheimer’s disease (AD), the leading cause of dementia, could potentially be mitigated through early detection and interventions. However, it remains challenging to assess subtle cognitive changes in the early AD continuum. Computational modeling is a promising approach to explain a generative process underlying subtle behavioral changes with a number of putative variables. Nonetheless, internal models of the patient’s reasoning process remain underexplored in AD. Determining the states of an internal model between measurable pathological states and behavioral phenotypes would advance explanations about the generative process in earlier disease stages beyond assessing behavior alone. In this study, we assumed the latent cause model as an internal model and estimated internal states defined by the model parameters being in conjunction with measurable behavioral phenotypes. The 6- and 12-month-old AppNL-G-F knock-in AD model mice and the age-matched control mice underwent memory modification learning, which consisted of classical fear conditioning, extinction, and reinstatement. The results showed that AppNL-G-F mice exhibited a lower extent of reinstatement of fear memory. Computational modeling revealed that the deficit in the AppNL-G-F mice would be due to their internal states being biased toward overgeneralization or overdifferentiation of their observations, and consequently the competing memories were not retained. This deficit was replicated in another type of memory modification learning in the reversal Barnes maze task. Following reversal learning, AppNL-G-F mice, given spatial cues, failed to infer coexisting memories for two goal locations during the trial. We concluded that the altered internal states of AppNL-G-F mice illustrated their misclassification in the memory modification process. This novel approach highlights the potential of investigating internal states to precisely assess cognitive changes in early AD and multidimensionally evaluate how early interventions may work.
Introduction
Alzheimer’s disease (AD) is the leading cause of dementia, which severely disturbs daily lives in the aging population and causes a tremendous social burden (Nandi et al., 2022). Neuropathological hallmarks, including amyloid-β (Aβ) plaques and neurofibrillary tangles (NFT), define AD biologically, and the advance in biomarkers have enabled early diagnosis (Jack Jr. et al., 2018, 2024). Abnormal biomarkers can appear years before symptoms arise, a stage referred to as preclinical AD (Sperling et al., 2011, 2014). As the disease progresses, the cognitive symptom gets severe, and its degree defines mild cognitive impairment (MCI) and the onset of AD dementia. Cognitive decline has been reported in a wide range of domains, including memory, language, visuospatial function, and executive function, leading to behavioral change as early as a decade before dementia (Amieva et al., 2008; Knopman et al., 2021). The early stage of AD provides a time window to intervene in disease progression, yet there is an explanatory gap and inconclusive consensus on what particular behavioral phenotype can be directly linked to neuropathology in early AD. Given the present circumstance, the subtle and heterogeneous cognitive change before the onset of dementia poses a challenge to assess clinical outcomes with traditional measurement (Jutten et al., 2023).
To precisely assess clinical symptoms and effects of early intervention for AD pathology, determining universal behavioral phenotypes as well as understanding how such behaviors change from asymptomatic to symptomatic is needed. Memory impairment is the predominant first symptom of AD (J. Barnes et al., 2015). The symptoms are commonly identified as a deficit in recalling previously learned or recognized items (Grober et al., 2018). Previous studies suggested that the performance of prioritized recall, in which memories are recalled in order of assigned value, is a sensitive measure in the early stages of AD (for review, see Knowlton & Castel, 2022). It remains elusive how memory impairments may arise from biased information processing that could affect the encoding, maintaining, or retrieval process of memory function. Using genetic mouse models is a common approach to dissecting the effects of pathological changes on cognitive symptoms measured in behavioral tasks while controlling for potential confounding factors (Webster et al., 2014; Zhong et al., 2024). However, the symptoms at the early stage of AD are relatively mild in traditional measurements, and phenotypes may be inconsistent among studies even when using the same AD mouse model (Jankowsky & Zheng, 2017). As behavioral phenotypes in conjunction with brain pathology state may not be captured by current assessments robustly, it is crucial to seek more sensitive and comprehensive approaches.
Recently proposed computational models provide generative processes underlying observed behavior, potentially bridging the gap between behavioral phenotype and AD pathology. Indeed, such models not only replicated the memory task performance of AD patients in different disease stages (M. D. Lee et al., 2020; M. D. Lee & Stark, 2023; Pooley et al., 2011) but also were valuable in detecting cognitive decline in preclinical AD patients (Bock et al., 2021; Vanderlip et al., 2024). However, previous approaches did not suppose an internal model formulating the patient’s reasoning process for current observation given prior knowledge. Internal models with sufficient explanatory power can replicate various behavioral phenotypes in specific tasks through a common generative process. Estimating an internal model in the early stages of AD patients would elucidate how AD alters their internal states and thereby their behaviors along with disease progression, which is informative for precise diagnosis and assessment of early intervention effects. To our knowledge, only a few studies tried to estimate internal states in amnestic MCI and AD patients and reported that a certain retention rate underlying motor learning (Sutter et al., 2024) but not learning rate in associative learning (Wessa et al., 2016) differed from healthy control. This circumstance demands us to investigate internal models illustrating not only learning but also its interaction with memory with higher explanatory power.
In this study, we assumed the latent cause model as an internal model explaining the memory modification process to solve a problem such that a novel observation should be classified into previously acquired memory or novel memory (Gershman et al., 2017). We hypothesized that the internal state of the latent cause model is altered along with disease progression from the preclinical stage of AD. To test this, we used the AppNL-G-F knock-in AD mouse model (Saito et al., 2014). This model mice carry the Swedish KM670/671NL, Iberian I716F, and Arctic E693G mutations on the gene of amyloid-β precursor protein (APP) and display Aβ plaque accumulation and neuroinflammation across age, reproducing neuropathological change in preclinical AD without artifacts caused by APP overexpression (Saito et al., 2014; Sasaguri et al., 2017). In this study, the 6- and 12-month-old AppNL-G-F knock-in mice and the age-matched control mice were engaged in a memory modification task consisting of Pavlovian fear conditioning, extinction, and reinstatement. Internal states across trials were estimated for each mouse, simultaneously with determining a set of model parameters such that the simulated behavior under the latent cause model given the parameters fits enough to the observed behavior. Because we found a deficit in the AppNL-G-F mice as overgeneralization or overdifferentiation of their observations into certain memories indicated by parameters in the latent cause model, we further confirmed whether this deficit could be replicated in another kind of memory modification task, the reversal Barnes maze paradigm. In the latent cause framework, we discussed how the internal states in the AppNL-G-F mice diverged from those in the control mice during the memory modification process.
Results
The latent cause model, an internal model, explains memory modification processes in the classical fear conditioning paradigm
The latent cause model proposed by Gershman et al. (2010) is an internal model such that an agent infers a latent cause as a source of the co-occurrence of events in the environment. It was later adapted to model the memory modification process, a problem to infer previously acquired latent causes or a novel latent cause for every observation in the Pavlovian fear conditioning paradigm in rodents and humans (Gershman et al., 2017). In the latent cause model, an agent learning an association between a conditioned stimulus (CS) and an unconditioned stimulus (US) has an internal model such that stimuli x (i.e., d-dimensional vector involving the CS or context) and outcome r (i.e., US) at time t are generated at an associative strength w (i.e., d-dimensional vector) from k-th latent cause ztk of total K latent causes by a likelihood p(x|ztk) and p(r|ztk), respectively (Fig. 1A). p(x|ztk) and p(r|ztk) follow Gaussian distributions in which each mean is calculated from previous observations, while each variance (σx 2 and σr2) was a priori defined as a hyperparameter. Since latent causes are unobservable, the agent given an observation infers which latent cause is likely to generate current observation by Bayes’ rule. The prior of z follows the Chinese restaurant process, that is
Prior of ztk refers to the frequency that zk was inferred so far, where 𝕀 [・] = 1 if its argument is true; otherwise, 0. This prior is biased by a temporal kernel 𝒦, as an analogy of the power law of the forgetting curve,
to exponentially decay the probability that zk inferred at previous trial t’ is inferred again at current trial t by a temporal scaling factor g if the time interval between trial t’ and t increases. A new latent cause is inferred at a probability governed by concentration parameter α. Higher α or lower likelihood of the current observation biases the agent to infer a new latent cause given an observation (i.e., differentiation) rather than to infer an old one (i.e., generalization) (Fig. 1B). Although the latent cause model allows for having an infinite number of latent causes, agents with appropriate α and g favor to infer a smaller number of latent causes for observations instead of inferring a unique latent cause for every observation.

Internal state in the latent cause model in the Pavlovian fear conditioning, extinction, and reinstatement.
(A) Schematic diagram of latent cause framework in a simple Pavlovian fear conditioning paradigm. An experimental mouse observes a tone, a context, and an electrical shock, which are set down in a parallelogram. Among the stimuli, the tone is designed as the conditioned stimulus (CS), and the electrical shock is designed as the unconditioned stimulus (US) with additional consideration of context effect. The mouse has an internal model such that a latent cause z generates the observation where the tone and the context induce the shock at an associative weight wcs and wcontext, respectively. The latent cause is a latent variable represented by a white square. The posterior probability of z is represented by a black arrow from z to the observation. The stimuli (i.e., tone, context, and shock) are observed variables represented in shaded circles, and the associative weight w is represented by a black arrow between the two. Since latent causes are unobservable, the mouse infers which latent cause is more likely from posterior distribution over latent causes given the observation following Bayes’ rule. (B) Schematic diagram of internal state posited in the latent cause model demonstrating the memory modification process in the fear conditioning paradigm. The mouse has acquired zA through two observations of the CS accompanied by the US in the same context. Now, the mouse observes the tone alone, then infers a new latent cause zB generating the CS and context without US as wcs and wcontext equal to 0. The internal state at the time consists of zA with largely decreased posterior probability and thereby slightly decreased wcs and wcontext, in addition to zB. The mouse in this panel is likely to differentiate the current observation from zA, as it assigns a higher probability for zB. The posterior probability of zA and zB is represented by the relative size of their diagram. If the mouse generalizes zA to the current observation without inferring zB, it would largely decrease wcs and wcontext of zA to minimize prediction error. (C) Observed and simulated conditioned response (CR) during reinstatement in fear conditioning. In the acquisition phase, a CS was accompanied by a US, inducing associative fear memory in the animal, as it exhibits a higher CR to the CS. In the extinction phase, the CS was presented without the US, inducing extinction memory with decreased CR. After an unsignaled shock without the preceding CS, the CR increases again, and this is called reinstatement. The observed CR was sampled from a 2-month-old male wild-type C57BL/6J mouse (magenta line plot with square markers), where the vertical and horizontal axis indicate CR and cumulative trial, respectively. The latent cause model parameters were estimated so as to minimize the prediction error between the observed and the simulated CR given the parameter (gray line with square markers). The estimated parameter value: α = 2.4, g = 0.316, η = 0.64, max. no. of iteration = 9, w0 = -0.006, σr2 = 4.50, σx2 = 4.39, θ = 0.03, λ = 0.02, K = 6. Trials with red, blue, and yellow backgrounds correspond to the acquisition, extinction, and unsignaled shock, where the first and second extinction consists of 19 and 10 trials, respectively (see Material and Methods). Three trials with white backgrounds are tests 1, 2, and 3 to evaluate if conditioning, extinction, and reinstatement are established. The vertical dashed lines separate the phases. (D) The evolution of the posterior probability and (E) the associative weight of each latent cause across the reinstatement experiment, where wcs (top panel) and wcontext (bottom panel) were computed. (D and E) Each latent cause is indicated by a unique subscript number and color, wcs and wcontext. The latent causes acquired during the acquisition phase and their associative weight are termed zA and wA, respectively. Meanwhile, latent causes acquired during the extinction phase and their associative weight are termed zB and wB, respectively.
According to the latent cause model, the agent solves two computational problems during Pavlovian fear conditioning learning. One is to compute the posterior probability of each latent cause given its associative weight between stimuli x and shock r with a history of observations, and the other is to compute the associative weight of each latent cause given its posterior. Both computations are achieved through the expectation-maximization (EM) algorithm, as the former and the latter computation correspond to E-step and M-step, respectively. In the E-step, the posterior of each latent cause is computed from its prior and the likelihood of current observation given the latent cause following Bayes’ rule, consequently determining the most likely latent cause at the current trial as the one with the maximal posterior, then moves to the M-step. In the M-step, the agent predicts shock by a linear combination of stimuli and associative weight in each latent cause. The weight is updated based on the Rescorla-Wagner rule (Rescorla & Wagner, 1972) to minimize the prediction error between observed and predicted shock. The amount of the weight change depends on learning rate η and prediction error, which is biased by the posterior probability of the latent causes. The amount of weight change will be smaller even in larger prediction errors if the latent cause is unlikely. The agent estimates weights for all latent causes and then returns to E-step. This procedure is iterated a preset number of times, and thereby, the posterior probability over latent causes and their associative weights are determined. Finally, the agent concludes the expected shock in the current trial by a linear combination of observed stimuli and an expectation of weight, which is weighted by the posterior distribution over latent causes. The conditioned response (CR) for the expected shock is determined as an integral of Gaussian distribution (mean is the expected shock, variance is λ) above threshold θ, where λ and θ are hyperparameters. Thus, these parameters of the latent cause model determine an internal state of the agent leading to particular behavioral outcomes given an observation.
Reinstatement exemplifies the memory modification process and indeed is well explained by the latent cause model. Reinstatement occurs when agents who have learned and then extinguished an association between a CS and a US exhibit CR again after being exposed to the US alone following the extinction. The latent cause model explains the reinstatement of fear memory with the changes of agent’s internal state as follows. First, an agent repeatedly observed a CS accompanied with a US in a Pavlovian fear conditioning procedure (Fig. 1C, “Acquisition”). Because these observations are novel for the agent, it inferred a set of latent causes zA (z1, z2, and z3 in Fig. 1D) where the CS is associated with the US by associative weight wA (w1, w2, and w3 in Fig. 1E) during the acquisition. Considering the effect of the context, w contains both wcs and wcontext (Fig. 1E). After the acquisition, the agent inferred zA and then predicted the upcoming US, leading to increased CR when it observed the CS (Fig. 1C, “Test 1” after “Acquisition”). Second, the agent repeatedly observed the CS alone in an extinction procedure. Early in the extinction, the agent would infer zA and then exhibit higher CR to the CS, while the US was absent (Fig. 1C, left “Extinction”). This state increases prediction error, then prompts the agent either to decrease wA toward 0 or to infer a novel set of latent causes zB where the CS is not accompanied with the US, that is, associative strength wB = 0 (Fig. 1B); both processes are run in the mouse in Fig. 1CDE. Later in the extinction, zB rather than zA is inferred, leading to the decreased CR (Fig. 1C, right “Extinction” or “Test 2”). Third, when the agent observes the US alone after the extinction (Fig. 1C, “Unsignaled shock”), the probability of zA or associative weight of some latent causes increases, hence it predicts the US and thereby the CR increases again, that is reinstatement (Fig. 1C, “Reinstatement” at “Test 3”); in the mouse in Fig. 1CDE, the unsignaled shock increases not only the posterior probability of zA that has maintained higher wcs but also wcontext in w6 interpreted as a modification in the association between context and US in z6. Thus, it is rational to expect that certain parameters of the latent cause model are in conjunction with particular behaviors in the reinstatement paradigm. As demonstrated in Gershman & Hartley (2015), the likelihood that α > 0 in human participants correlates with the degree of their spontaneous recovery, that is a return of the CR after a long hiatus following an extinction procedure. Furthermore, a recent study has shown that the latent cause model explains maladaptive decision-making in post-traumatic stress disorder patients as a perseverated inference of previously acquired latent causes even under situations where they should infer novel latent causes (Norbury et al., 2022). By estimating such parameters from the behaviors of each experimental mouse, how the internal state of the AD group diverges from that of the control group could be determined.
AppNL-G-F mice showed unimpaired associative fear and extinction learning but a lower extent of reinstatement
To examine the age and AppNL-G-F knock-in effects on the reinstatement, all cohorts of control and homozygous AppNL-G-F mice were tested at the age of either 6 months or 12 months in the reinstatement paradigm based on auditory-cued fear conditioning (Fig. 2A and Table S1). According to previous studies, the AppNL-G-F mice displayed an age-dependent Aβ deposition from 4 months old and memory impairments could be detected at 6 months of age (Masuda et al., 2016; Saito et al., 2014; Sakakibara et al., 2018). The wide spread of Aβ accumulation was confirmed in this study, and results were provided in Figure 2 – Supplement 1. Given that age affects the Aβ accumulation caused by AppNL-G- F knock-in, a one-way ANCOVA was used to evaluate the AppNL-G-F knock-in effects on behavioral measures controlling for age.

AppNL-G-F mice exhibited successful associative learning and extinction but a lower extent of reinstatement.
(A) Schematic diagram of reinstatement paradigm in this study. In the acquisition phase, the CS was accompanied with the US 3 times. In the following phase, either the CS or the US was presented at certain times. The context was the same throughout the experiment. (B) Freezing rate during CS presentation across reinstatement paradigm. The markers and lines show the median freezing rate of each group in each trial. Red, blue, and yellow backgrounds represent acquisition, extinction, and unsignaled shock in (A). The dashed vertical line separates the extinction 1 and extinction 2 phases. Freezing rate during CS presentation in test 1 (C), test 2 (D), and test 3 (E). Discrimination index (DI) between test 2 and test 1 (F), between test 3 and test 2 (G), and between test 3 and test 1 (H) calculated from freezing rate during CS presentation.
(C to E) The data are shown as median with interquartile range. *p < 0.05, **p < 0.01, and ***p < 0.001 by one-way ANCOVA with age as covariate; #p < 0.05, ##p < 0.01, and ###p < 0.001 by Student’s t-test comparing control and AppNL-G-F mice within the same age. Detailed statistical results are provided in Table S2. (F to H) The dashed horizontal line indicates DI = 0.5, which means no discrimination between the two phases. †p < 0.05 by one-sample Student’s t-test, and the alternative hypothesis specifies that the mean differs from 0.5; *p < 0.05, **p < 0.01, and ***p < 0.001 by one-way ANCOVA with age as a covariate; #p < 0.05, ##p < 0.01, and ###p < 0.001 by Student’s t-test comparing control and AppNL-G-F mice within the same age. Detailed statistical results are provided in Tables S3 and S4. (B to H) Colors indicate different groups: orange represents 6-month-old control (n = 24), light blue represents 6-month-old AppNL-G-F mice (n = 25), pink represents 12-month-old control (n = 24), and dark blue represents 12-month-old AppNL-G-F mice (n = 25). Each black dot represents one animal.
In the acquisition phase of fear conditioning, the freezing rate gradually increased with the number of CS-US pairings (Fig. 2B, leftmost red panel). In the test 1 phase, 24 h after the acquisition, a significant genotype effect was observed in freezing rates during CS presentation [F(1,95) = 9.679, p = 0.002], where the AppNL-G-F mice showed a higher freezing rate than in the control (Fig. 2C and Table S2), indicating the associative fear memory was established in AppNL-G-F mice. As the increasing trials of CS alone presented, both groups showed a gradually decreased freezing rate in the extinction phase (Fig 2B, blue panel). In the test 2 phase, 24 h after the last trial of the extinction, there was a significant age effect in freezing rates during CS presentation [F(1,95) = 4.698, p = 0.033], where 12-month-old mice showed a higher freezing rate than in the 6-month-old mice (Fig. 2D and Table S2). In the test 3 phase, 24 h after given an unsignaled shock, mice showed an elevated freezing rate, and no significant genotype or age effect was detected (Fig. 2B, 2E and Table S2).
To examine the phase differences, we calculated the discrimination index (DI) using freezing rates during the CS presentation. If the mouse shows the same freezing rate at the two test phases, the value of DI will be 0.5, while if the mouse shows a higher freezing rate in one phase compared to the other, DI will be far from 0.5. The results showed that the DIs between test 2 and test 1 were significantly lower than 0.5 in all groups (Fig. 2F and Table S3), indicating the successful establishment of extinction memory. The genotype effect was detected [F(1,95) = 5.013, p = 0.027] (Fig. 2F and Table S4), where AppNL-G-F mice showed lower values due to higher freezing in the test 1 phase. These results suggested that there was no markable fear and extinction learning deficit in AppNL-G-F mice at the behavioral level. The DIs between the test 3 and 2 phases in four groups were significantly higher than 0.5, indicating that the reinstatement was successfully induced after the unsignaled shock (Fig. 2G and Table S3). Six-month-old mice showed higher DIs, as a significant age effect was detected [F(1,95) = 7.480, p = 0.007] (Fig. 2G and Table S4). The DIs between the test 3 and 1 phases did not deviate from 0.5 for both 6- and 12-month-old control mice, suggesting that they displayed comparable freezing rates in tests 1 and 3 (Fig. 2H and Table S3). In contrast, the DIs between test 3 and 1 phases were significantly lower than 0.5 in 6-month-old AppNL-G-F mice [t(24) = - 3.245, p = 0.003] and 12-month-old AppNL-G-F mice [t(24) = -6.165, p < 0.001] (Fig. 2H and Table S3). Moreover, a significant genotype effect was detected [F(1,95) = 15.393, p < 0.001] (Fig. 2H and Table S4), indicating AppNL-G-F mice exhibited a lower extent of reinstatement. Thus, these results suggest that mice retrieved fear memory in the test 1 phase and retrieved extinction memory in the test 2 phase, respectively. The fear memory and the extinction memory compete with each other: the US is present in the former, while it is absent in the latter. In the test 3 phase, after they observed the unsignaled shock, mice had to decide which memory was more relevant given the CS. The same level of freezing rate in tests 1 and 3 observed in control mice suggests that they could retain both fear and extinction memory and thereby infer the fear memory more preferentially. In contrast, AppNL-G-F mice displayed a lower freezing rate in test 3 compared to test 1, suggesting that they might still infer extinction memory even after the unsignaled shock or a completely new memory, while the initial fear memory might be suppressed or eliminated.
The internal state underlying reinstatement simulated with the latent cause model differs between the ages and APP genotype
To seek the internal state of each mouse generating the behaviors in the reinstatement paradigm, we estimated the parameters of the latent cause model, minimizing prediction errors between the observed CR and simulated CR in the latent cause model given a set of parameters (see also Material and Methods). Figure 3 demonstrates the traces of observed CR, simulated CR, and the changes in the internal state in each group. We initially confirmed that the DIs in the simulated CRs in each group well replicated those in the observed CRs, except that the DI between test 3 and test 1 in the 6-month-old AppNL-G-F mice did not significantly deviate from 0.5 in simulated results (c.f. Fig. 2F, 2G, 2H and Fig. 3 – Supplement 1). In the 6-month-old group, two latent causes (z1 and z2) were generated in the acquisition phase with increasing associative weight (rows 2 to 4 in Fig. 3A, 3B). During the extinction phase, the same latent causes were inferred, and the corresponding associative weights declined, indicating the devaluation of previously acquired fear memory (rows 2 to 4 in Fig. 3A and 3B). Notably, the wcs in w2 in control mice remained at a higher value compared to that in AppNL-G-F mice. At trial 35 given an unsignaled shock, wcontext in w1 was elevated in control mice (row 4 in Fig. 3A), while both wcontext in w1 and w2 were elevated in AppNL-G-F mice (row 4 in Fig. 3B). At trial 36 (test 3 phase), in control mice, higher values in wcontext in w1 and wcs in w2 contributed to increased CR (Fig. 3A), whereas wcontext in w1 and w2 were the main components for reinstatement in AppNL-G-F mice (Fig. 3B). This discrepancy suggests that the expectation of the US is different between 6-month-old control and AppNL-G-F mice.

The divergence of internal states between control and AppNL-G-F mice differed in age.
(A) Simulation of reinstatement in the 6-month-old control mice given a set of estimated parameters of the latent cause model. The estimated parameter value: α = 2.4, g = 0.05, η = 0.42, max. no. of iteration = 2, w0 = 0.009, σr2 = 2.90, σx2 = 0.49, θ = 0.008, λ = 0.018, K = 10. (B) Simulation of reinstatement in 6-month-old AppNL-G-F mice. The estimated parameter value: α = 1.1, g = 0.61, η = 0.51, max. no. of iteration = 2, w0 = 0.004, σr2 = 1.51, σx2 = 0.32, θ = 0.015, λ = 0.019, K = 30. (C) Simulation in 12-month-old control mice. The estimated parameter value: α = 1.1, g = 0.92, η = 0.82, max. no. of iteration = 5, w0 = 0.009, σr2 = 1.88, σx2 = 0.51, θ = 0.010, λ = 0.017, K = 4. (D) Simulation in 12-month-old AppNL-G-F mice. The estimated parameter value for: α = 1.0, g = 1.10, η = 0.04, max. no. of iteration = 2, w0 = 0.0025, σr2 = 1.51, σx2 = 0.18, θ = 0.011, λ = 0.011, K = 36.
(A to D) The first row shows the trace of observed CR and simulated CR. The observed CR is the median freezing rate during the CS presentation over the mice within each group; the observed CR of each group was divided by its maximum over all trials. The second row shows the posterior probability of each latent cause in each trial. The third and fourth rows show the associative weight of tone to shock (wcs) and that of context to shock (wcontext) in each trial. Each marker and color corresponds to a latent cause up to 5. Each latent cause is represented by the same color as that in the second row and contains wcs and wcontext. The vertical dashed lines indicate the boundaries of phases.
The transition of internal state in the 12-month-old group was similar to that in the 6-month-old group until the test 1 phase. Unlike the 6-month-old group, new latent causes (z3 and z4) were inferred during the extinction phase in the 12-month-old group (rows 2 to 4 in Fig. 3C, 3D). At trial 35 given an unsignaled shock, the control mice inferred the latent causes z3 and z4 acquired in the extinction phase so that their weights wcontext in w3 and w4 were updated (row 4 in Fig. 3C). In contrast, there was virtually no update of weights in AppNL-G-F mice (row 4 in Fig. 3D). At trial 36 (test 3 phase), the elevated CR in control mice was attributed to a successful update of associative weight after the unsignaled shock (Fig. 3C), whereas the CR in AppNL-G-F mice did not increase due to a redundant latent cause z5, which was newly inferred and reduced the posterior probability of latent causes having higher associative weight (Fig. 3D).
The unsuccessful state inference at reinstatement was caused by misclassification in AppNL-G-F mice
Due to the different evolution of latent causes between ages (Fig. 3), we separately discuss the contribution of differences in estimated parameters within the same age and their correlations with DI between test 3 (after unsignaled shock) and test 1 (after acquisition) (Fig. 2H). We first investigated the individual internal state differences in the 12-month-old group where the impairment was apparent at the behavioral level (Fig. 2H). AppNL-G-F mice had significantly lower α (Mann-Whitney U = 100, p = 0.002) and lower σx2 (Mann-Whitney U = 88.5, p = 0.008) than control mice (Fig. 4A). In addition, these two parameters were significantly correlated with the DI (Fig. 4B). Statistical results for remaining parameters are shown in Table S7 and S8. While the maximal number of inferable latent causes (K) was significantly higher in AppNL-G-F mice than in control mice (Table S7), the total number of acquired latent causes was comparable between the groups (Ktotal in Table S9). Although the number of never inferred latent causes was higher in AppNL-G-F mice, they have virtually no effects on internal states, as the priors and posteriors of them were 0 except those of the next candidate of new latent cause. The higher K in AppNL-G-F mice could be because their sample chains converged without the need to explore the parameter space, as those stayed at the same level as the initial guess (see Material & Methods). It should be noted that these estimated parameters and internal state might not be unique but one of the possible solutions.

Individual parameter estimation and internal state in the 12-month-old group.
(A) The estimated latent cause model parameter. (B) Correlation between discrimination index (DI) and estimated parameter values in the 12-month-old group. The count of latent causes initially inferred during the acquisition trials (C, left), extinction trials (i.e., test 1, extinction, test 2) (D, left), and trials after extinction (i.e., the unsignaled shock and test 3) (E, left), with the sum of posterior probabilities (C, D, E, right), and the sum of associative weights at test 3 in these latent causes (F, G, H).
(A) Each black dot represents one animal. *p < 0.05, and **p < 0.01 by the Mann-Whitney U test. (B) Spearman’s correlation coefficient (ρ) was labeled with significance, where *p < 0.05 and ***p < 0.001. The blue line represents the linear regression model fit, and the shaded area indicates the confidence interval. Each dot represents one animal. (C to E) In the first column, the histogram of the number of latent causes (z) acquired in each phase is shown. The maximum number of latent causes that can be inferred is 3, 31, and 2 in panels C, D, and E. In the second column, the histogram of the sum of the posterior probabilities of the latent causes is shown. The horizontal axis indicates the proportion of the value in each group. (F to H) In the first and second columns, the sum of wcs and wcontext of latent causes are shown in the boxplot, respectively. Note that the initial value of associative weight could take non-zero values, though those were comparable between groups (Table S7). Each black dot represents one animal. *p < 0.05, and **p < 0.01 by the Mann-Whitney U test comparing control and AppNL-G-F mice, and p-values greater than 0.1 were not labeled on the plot. (A to H) Pink represents 12-month-old control (n = 20), and dark blue represents 12-month-old AppNL-G-F mice (n = 18). Detailed statistical results are provided in Tables S7, S8, and S9.
We further characterized how the latent causes acquired in different phases contribute to the distinct internal states between control and AppNL-G-F mice. Lower α would bias AppNL-G-F mice toward overgeneralization as they favor inferring a small number of latent causes even for novel observations. Indeed, the number of latent causes initially inferred at the acquisition phase was significantly lower in AppNL-G-F mice, where most of them inferred two latent causes, and more than half of the control mice inferred three latent causes (Fig. 4C and Table S9). Furthermore, when they initially observed the CS in the absence of the US in test 1, the posterior probabilities of acquisition latent causes in test 1 were significantly higher in the AppNL-G-F mice than in the control mice, leading to the higher CRs in test 1 in the simulation (Fig. 4 – Supplement 1A), consistent with the observed CR (Fig. 2C).
According to the latent cause model, lower σx2 would bias AppNL-G-F mice toward overdifferentiation. In other words, they prefer to infer new potential causes when similar observations are presented instead of interpreting them generated from the same distribution. The likelihood of current stimuli given latent causes was calculated from a Gaussian distribution with the mean of stimulus values under the latent cause so far and fixed variance σx2. Repeated observation of the CS alone during the extinction would sufficiently increase the likelihood of stimuli given the latent causes inferred during the extinction in both AppNL-G-F and control mice. This attenuates the need to infer new latent causes even if AppNL-G-F mice had lower σx2, as the number of latent causes acquired at the extinction phase was comparable between groups (Fig. 4D and Table S9). However, the CS was absent in the unsignaled shock and then presented again in test 3. These observations would decrease the likelihood of stimuli given past latent causes, and consequently increase the probability to infer the new latent causes. This volatility would be more obvious in AppNL-G-F mice with lower σx2. As a result, the number of latent causes acquired after the extinction (i.e., the unsignaled shock and test 3) as well as the sum of posterior probabilities of them were greater in AppNL-G-F mice than those in the control mice, despite that the mice received either the US or the CS alone that were the same with those they observed so far (Fig. 4E and Table S9).
Such internal states in AppNL-G-F mice would diverge the update of associative weight from those in the control mice in test 3, which was observed in the latent cause acquired during the extinction phase, not the acquisition phase (Fig. 4F, 4G and Table S9). Both AppNL-G-F and control mice would still infer the extinction latent causes at the unsignaled shock with higher posterior probabilities. Since the update of the associative weight of each latent cause is modulated by both prediction error and posterior probability of the latent cause, the associative weights in the extinction latent causes were increased by the unsignaled shock in the control mice (Fig. 3C, 4G and Table S9). In AppNL-G-F mice, however, they inferred new latent causes with low associative weight after the extinction (Fig. 4E, 4H and Table S9), which decreased the posterior probabilities of other latent causes and impeded update of their weight (Fig. 3D, 4G). This distinct internal state therefore led to the lower CR and DI in test 3 in AppNL-G-F mice (Fig. 2E, 2H). These results suggest that the α and σx2 would differentiate the internal state and thereby behavioral phenotypes between AppNL-G-F and control mice. In the Chinese restaurant process, the probability that a new latent cause is inferred decreases with trials, preventing that too many latent causes are inferred (Gershman & Blei, 2011). This could be the reason why the overgeneralization by lower α was less effective than the overdifferentiation by lower σx2 in test 3 in AppNL-G-F mice.
As shown previously in Fig. 2 and Fig. 2– Supplement 1, Aβ aggregation has already widely spread in the brain of 6-month-old AppNL-G-F mice, but no clear behavioral impairments were detected by the conventional analysis. Similarly to the 12-month-old group, we investigated the individual internal state differences in the 6-month-old group and tested whether the discrepancy in α and σx2 emerged earlier without behavioral impairment. We found that AppNL-G-F mice had significantly lower α (Mann-Whitney U = 38, p = 0.012), and it was significantly correlated with DI (Fig. 5A, 5B and Table S10, S11). The σx2 was comparable between control and AppNL-G-F mice, but its significant correlation with DI was found (Fig. 5A, 5B and Table S10, S11). The hyperparameter θ, a threshold to emit CR for stimulus inputs, was significantly higher in AppNL-G-F mice than in control mice. Since the λ was comparable between control and AppNL-G-F mice (Table S10), the CR in AppNL-G-F mice could rapidly decrease with expected US during the extinction phase, even if their CR at the test 1 was higher than those in the control. The effect of higher θ was subtle when the expected US was far from its value, and therefore had little contribution to the DI between test 3 and test 1, which is in line with the correlation result in Table S11. Statistical results for the remaining parameters are shown in Tables S10 and S11.

Individual parameter estimation and internal state in the 6-month-old group.
(A) The estimated parameters of the latent cause model. (B) Correlation between discrimination index (DI) and estimated parameter values in the 6-month-old group. The count of latent causes initially inferred during the acquisition trials (C, left), extinction trials (i.e., test 1, extinction, test 2) (D, left), and trials after extinction (i.e., the unsignaled shock and test 3) (E, left), with the sum of posterior probabilities (C, D, E, right), and the sum of associative weights at test 3 in these latent causes (F, G, H).
(A) Each black dot represents one animal. *p < 0.05 by the Mann-Whitney U test. (B) Spearman’s correlation coefficient (ρ) was labeled with significance, where *p < 0.05 and **p < 0.01. The blue line represents the linear regression model fit, and the shaded area indicates the confidence interval. Each dot represents one animal. (C to H) The configuration of the figure is the same as in Figure 4. All the p-values of the Mann-Whitney U test comparing control and AppNL-G-F mice were greater than 0.05 and were not labeled on the plot. (A to H) Orange represents 6-month-old control (n = 15), and light blue represents 6-month-old AppNL-G-F mice (n = 12). Detailed statistical results are provided in Tables S10, S11, and S12.
Similar to the 12-month-old group, the influence of α was observed in the acquisition phase, where most AppNL-G-F mice inferred two latent causes, while more than half of control mice inferred three latent causes (Fig 5C). In addition, lower α in AppNL-G-F mice affect the posterior probabilities of acquisition latent cause and CR in test 1 phase (Fig. 4 – Supplement 1B). No significant difference in the internal states was found between control and AppNL-G-F mice (Fig. 5C to 5H and Table S12).
In summary, α and σx2 are the main contributors to differential internal states and memory modification processes between control and AppNL-G-F mice. Regardless of age, AppNL-G-F mice have a defect in forming a new memory even if the observation is novel, suggesting overgeneralization due to significantly lower α (Fig. 4A, 5A). With a more severe phenotype, 12-month-old AppNL-G-F mice were biased to classify similar observations into different causes due to lower variance of the stimulus, σx2 (Fig. 4A). This overdifferentiation would eventuate the lower reinstatement of fear memory (Fig. 2H). Thus, these results suggest that AppNL-G-F mice failed to retain competing memories because of the misclassification of observation into memories, even though each memory accounts for past and present observations.
AppNL-G-F mice failed to infer coexisting memories in the reversal Barnes maze task
To confirm whether the deficit in the AppNL-G-F mice is replicated in another experimental paradigm, cohorts 4, 5, 6, and 7 were subjected to a reversal Barnes maze task (C. A. Barnes, 1979) two weeks after the reinstatement experiment (Table. S1). Barnes maze is a commonly used behavioral task to examine spatial learning and memory in Alzheimer’s disease model mice (Webster et al., 2014). Unlike fear conditioning, it requires goal-directed learning to compute an optimal path from the current position to a destination while modifying the spatial memory of an environment through trial and error. The first stage contains a 6-day training and 1-day probe test to establish the first spatial memory of the target hole (Fig. 6A). During the training phase, the mice explored a circular field and were reinforced to find an escape box under a target hole as any one of 12 holes equally spaced around the perimeter of the field (Fig. 6A). In the 12-month-old group, the number of errors, latency, and travel distance from start to goal decreased across days in the training phase, suggesting successful initial learning (Fig. 6 – Supplement 1A, and Table S13). A significant interaction between day and genotype [F(5, 170) = 2.447, p = 0.036] was observed in the latency where control mice took a longer time to reach the target hole on training days 4, 5, and 6 (Fig. 6 – Supplement 1A, and Table S13). In order to estimate what kind of strategies mice used to solve the task relying on their memory, we performed algorithm-based strategy analysis (Suzuki & Imayoshi, 2017; Tachiki et al., 2023). A moving trajectory in each trial was categorized into one of five spatial strategies following the definition (Fig. 6 – Supplement 2). As in the conventional analysis above, the usages of the strategies were virtually comparable between AppNL-G-F and control mice (Fig. 6B and Table S14).

AppNL-G-F mice failed to infer coexisting spatial memories in the reversal Barnes maze task.
(A) Schematic diagram of the reversal Barnes maze task. The largest circle represents the Barnes maze field. The filled and open small circles represent the target hole with the escape box and the remaining holes in the field, respectively. The small arrow pointing to the hole indicated the position of the target hole without an escape box in the probe test. The same color of the filled circle and arrow indicates the same position. (B) Strategy usage in initial training (days 1 to 6), the first reversal training (days 9 to 11), and the second reversal training (days 12 to 14) in the 12-month-old group. Time spent around each hole at probe test 1 (C) and probe test 2 (D) in the 12-month-old group. (E) Discrimination index (DI) of the second target hole between probe test 2 and probe test 1 in the 12-month-old group. (F) Strategy usage in initial training (days 1 to 6), the first reversal training (days 9 to 11), and the second reversal training (days 12 to 14) in the 6-month-old group. Time spent around each hole at probe test 1 (G) and probe test 2 (H) in the 6-month-old group. (I) Discrimination index (DI) of the second target hole between probe test 2 and probe test 1 in the 6-month-old group.
(B and F) *p < 0.05 of perimeter strategy, #p < 0.05 of confirmatory strategy, †p < 0.05 of serial strategy, ‡p < 0.05 of spatial strategy by Wilcoxon rank-sum test comparing control and AppNL-G-F mice at the same age. (C, D, G, and H) The data are shown as mean with 95% confidence interval. *p < 0.05, **p < 0.01, and ***p < 0.001 by mixed-design two-way [Genotype (control, AppNL-G-F) × Hole (1-12)] ANOVA; †p < 0.05 by Tukey’s HSD test at the specific hole to compare control and AppNL-G- F mice at the same age. (E and I) The dashed horizontal line indicates DI = 0.5, which means no discrimination between the two phases. †p < 0.05 by the Wilcoxon signed-rank test, and the alternative hypothesis specifies that the mean differs from 0.5; *p < 0.05 by Mann-Whitney U test comparing control and AppNL-G-F mice. (C-E, G-I) Each black dot represents one animal. Colors indicate the different groups: pink represents 12-month-old control (n = 18), dark blue represents 12-month-old AppNL-G-F mice (n = 18), orange represents 6-month-old control (n = 24), and light blue represents 6-month-old AppNL-G-F mice (n = 17). Detailed statistical results are provided in Tables S14 to S16 for the 12-month-old group results and Tables S18 to S20 for the 6-month-old group results.
In the probe test phase, the mice explored the field for 5 min without the escape box, which generated a certain prediction error (Fig. 6A). While the mice retrieved the spatial memory of the target hole in spite of the novel observation, they would stay around the target hole. Consequently, AppNL-G- F mice spent significantly less time around the target hole than control mice did in the probe test 1, as a significant interaction between genotype and holes [F(11, 374) = 4.22, p < 0.001] was detected (Fig. 6C and Table S15). These results suggest that AppNL-G-F mice could successfully form a spatial memory of the target hole, while the memory was less likely to be retrieved by a novel observation such as the absence of the escape box under the target hole at the probe test 1.
In the first reversal learning phase, the position of the target hole with the escape box was moved to the opposite from day 9 to 11, then returned to the original position from day 12 to day 14 (Fig. 6A). Through the first reversal learning, we expected that the second spatial memory of the target hole is formed that competes with the spatial memory of the first target hole. In other words, this reversal training phase allowed the mice to acknowledge that both two holes were possible to be the target hole. The improved performance across reversal training days in the conventional analysis was similar to that of the initial training phase, suggesting intact behavioral flexibility (Brown & Tait, 2010; Gawel et al., 2019) (Fig. 6 – Supplement 1B and Table S13). Unexpectedly, 12-month-old AppNL-G-F mice reached the goal significantly faster than the control mice [Genotype effect from day 9 to 11, F(1,68) = 10.53, p = 0.003; Genotype effect from day 12 to 14, F(1,68) = 10.27, p = 0.003] (Fig. 6 – Supplement 1B, and Table S13). This would be due to the difference in the strategies primarily taken in each group of mice. During the first reversal training (days 9 to 11), more than half of AppNL-G-F mice used the perimeter strategy, which is significantly higher than control mice (Fig. 6B and Table S14). The mice taking this strategy could reach the novel target hole by exploring holes sequentially without spending time around the previous target hole (Fig. 6 – Supplement 2). On the other hand, the use of perimeter strategy decreased, and spatial strategy increased in control mice from days 9 to 11 (Fig. 6B). Moreover, the use of the confirmatory strategy was around 30% in control mice, which is significantly higher than AppNL-G-F mice (Fig. 6B and Table S14). This strategy is the one such that the mice visit the first and second target holes as if they relied on the two memories of the target holes where the escape box should exist (Fig. 6 – Supplement 2). During the second reversal training (days 12 to 14), AppNL-G-F mice constantly used the perimeter strategy more than the control mice. Meanwhile, the control mice used the confirmatory strategy more than AppNL-G-F mice, as in the first reversal learning phase (Fig. 6B and Table S14). These results suggest that the search strategy diverged between AppNL- G-F mice and control mice since the reversal learning. The control mice could retrieve two competing spatial memories of the target hole during each trial. In contrast, AppNL-G-F mice were less likely to retrieve these two memories within a trial even though the use of spatial strategy was comparable with control mice on the same day.
As expected, the AppNL-G-F mice explored significantly shorter time for the second target hole as well as the first target hole, compared to the control mice (Fig. 6D and Table S15). A significant interaction between genotype and hole was detected [Genotype × Hole, F(11, 374) = 3.24, p < 0.001]. The exploration time was uniform over holes in AppNL-G-F mice, while control mice showed hole preferences around two target holes as if the AppNL-G-F mice and the control mice took the perimeter and the confirmatory strategy, respectively. At last, we calculated the discrimination index (DI) from the exploration time for the first and second target holes in probe test 2 and test 1. As in the reinstatement experiment, we expected that if the mice did not retrieve a spatial memory of the second target hole during probe test 2, the exploration time of this hole would be the same between probe tests 1 and 2, resulting in DI around 0.5. The DI for the second target hole was significantly higher than 0.5 in control mice, indicating that the mice successfully modified the spatial memories from probe tests 1 to 2 at a discriminable level (Fig. 6E and Table S16). On the contrary, the DI in AppNL-G-F mice was comparable to 0.5 and significantly lower than that of control mice, indicating that their spatial memory for the second target hole was not modified after the reversal learning (Fig. 6E and Table S16). We confirmed that the DIs for the first target hole were close to 0.5 and comparable between groups, suggesting that the exploration time for the first target hole remained similarly long in both probe tests (Fig. 6 – Supplement 1C, and Table S16).
The distinct learning process observed in the 12-month-old group was also present in the 6-month-old group. The performance shown in the conventional analysis was comparable between the 6-month-old control and AppNL-G-F mice (Fig. 6 – Supplement 2A, 2B, and Table S17). The use of perimeter strategy was over 30% throughout two reversal training phases in 6-month-old AppNL-G- F mice, while control mice used significantly higher confirmatory strategy in the second reversal training (Fig. 6F, and Table S18). During the probe tests, the difference in exploration time around the target hole was less prominent in the 6-month-old group, while AppNL-G-F mice uniformly explored the holes as in the 12-month-old AppNL-G-F mice, especially in the probe test 2 (Fig. 6G, 6H, and Table S19). Unlike 12-month-old control mice, the 6-month-old control mice did not show a strong preference for the second target hole in probe test 2 (Fig. 6H). However, their DIs for the first and second target holes were significantly higher than 0.5, suggesting an increased confidence of spatial memory for each target hole (Fig. 6I, Fig. 6 – Supplement 3C, and Table S20). In AppNL-G-F mice, the DIs for the first and second target hole were comparable to 0.5, as those in 12-month-old AppNL-G-F mice, suggesting few modifications of the spatial memories (Fig. 6I, Fig. 6 – Supplement 3C, and Table S20).
In summary, these results suggest that the control mice retained competing spatial memories for the first and the second target holes following reversal learning. In contrast, the AppNL-G-F mice retained at most one spatial memory during a trial. In other words, the AppNL-G-F mice, given the spatial cues, struggled to infer the spatial memory of either the first or second target hole, leading to the rare usage of the confirmatory strategy. Thus, we confirmed the misclassification in the AppNL-G-F mice in the reversal Barnes maze paradigm. Although both DIs in the reinstatement (Fig. 2H) and the Barnes maze experiment (Fig. 6E, 6I) suggest that inferred memories in each time point would diverge between the AppNL-G-F mice and the control mice, no significant correlations were detected (12-month-old group: Spearman’s correlation coefficient = 0.062, p = 0.738, N = 32; 6-month-old group: Spearman’s correlation coefficient = -0.029, p = 0.862, N = 39), potentially due to differential effects of Aβ accumulation on associative and instrumental learning.
Discussion
Recent advances in disease-modifying intervention have shed light on AD treatment and prevention. Identifying the earliest manifestation of cognitive decline is critical for intervention. However, the behavioral phenotypes and altering trajectories are heterogeneous in predementia AD patients (Duara & Barker, 2022; Jutten et al., 2023), yet standard neuropsychological tests have been optimized for detecting stereotypical symptoms near or after the onset of dementia (Snyder et al., 2014). Recently proposed generative models successfully contributed to predicting nature-nurture factors conjugate with Aβ and behavioral outcomes from the preclinical to severe stage of AD (Hwang et al., 2023; Petrella et al., 2019; Yada & Honda, 2023), while only a few studies explicitly assume an internal model of the patients (Sutter et al., 2024). Assuming an internal model would broaden and deepen interpretations of various cognitive deficits in terms of common computational account (Kocagoncu et al., 2021).
In line with this, our perspective is that states of an internal model would be one candidate of factors to generate a wide variety of behavioral phenotypes in AD, and hence probing the internal state would contribute not only to detecting early signs of AD but also to multidimensionally evaluating how the disease gets worse and how early intervention works. In this study, we assumed the latent cause model proposed by Gershman et al. (2017) as an internal model and unveiled the internal state in 6- and 12-month-old AppNL-G-F mice, a preclinical mouse model of AD, from their behavioral data by estimating the parameters of the latent cause model well replicating the behaviors.
The misclassification in the memory modification process underlies the impairment of AppNL-G- F mice in the reinstatement of conditioned fear memory
Initially, we confirmed a significant accumulation of Aβ increased along aging in the AppNL-G-F mice (Fig. 1 – Supplement 1). We then tested whether the memory modification process was behaviorally impaired in the AppNL-G-F mice across the auditory-cued fear conditioning, extinction, and reinstatement. First, the mice learned an association between a tone as a CS and an electrical shock as a US, as they exhibited increased CR in test 1 (Fig. 2B, 2C). Next, they observed the CS alone repeatedly through the extinction, and then they exhibited decreased CR in test 2 (Fig. 2B, 2D). After they observed the US alone, the mice exhibited increased CR again in test 3, suggesting that the reinstatement was established (Fig. 2B, 2E). However, the AppNL-G-F mice exhibited a lower extent of reinstatement compared to the control (Fig. 2H). The DI between test 1 and 3 in the AppNL-G-F mice was significantly lower than 0.5, while that in the control was comparable with 0.5 (Fig. 2H), suggesting that the AppNL-G-F mice would have lower expectations of upcoming shock in the reinstatement than immediately after the acquisition. These results were the first to report the deficits in the reinstatement in the AppNL-G-F mice, suggesting that Aβ accumulation induces the defect in the memory modification process after the extinction.
Although our results support that the Aβ accumulation in AppNL-G-F mice would have little effect on associative fear learning and extinction, the results have been mixed in studies using fear conditioning tasks. Two studies found normal contextual or auditory-cued fear conditioning responses up to 18 months old (Kundu et al., 2021; Sakakibara et al., 2018), which aligns with our results. On the contrary, another two studies reported that AppNL-G-F mice have defects in both contextual and cued fear conditioning from the age of five months (Emre et al., 2022; Mehla et al., 2019). The inconsistency suggests the impact of Aβ may be complicated and cannot be simply detected by common behavioral tasks, highlighting the need for novel approaches to probe in-depth behavioral changes.
Recent advances in computational psychiatry offer a wealth of internal models to interpret a patient’s internal states (Hauser et al., 2022; Huys et al., 2016; Montague et al., 2012). For instance, Norbury et al. (2022) simulated a memory modification process in severe post-traumatic stress disorder (PTSD) patients by the latent cause model and found that they were biased to infer old memories rather than to acquire new memories for novel observations, as they have lower concentration parameter α. In this study, we found that AppNL-G-F mice exhibited lower α both at 6 months and 12 months of age (Fig. 4A, 5A). As the Aβ accumulation increased with age (Fig. 1 – Supplement 1), lower σx2 was observed in 12-month-old AppNL-G-F mice (Fig. 4A). As α and σx2 significantly correlated with DI between test 3 and 1 (Fig. 4B, 5B), we considered both as critical parameters that shaped the diverged internal states underlying reinstatement between control and AppNL-G-F mice.
AppNL-G-F mice with lower α favored to generalize the observation as if it arose from the old cause instead of differentiating it as a new memory. Indeed, their posterior probabilities of acquisition latent causes were higher than those of the control mice when a novel observation was given in test 1 (Fig. 4 – Supplement 1), contributing to the higher CR in AppNL-G-F mice (Fig. 2C, Fig. 4 – supplement 1). Such overgeneralization due to lower α may, in part, explain the deficits in discriminating highly similar visual stimuli reported in AD mouse model (Ding et al., 2023; Saifullah et al., 2020; Zhu et al., 2017), preclinical AD patients (Leal et al., 2019; H. Lee et al., 2020), and MCI patients (Ally et al., 2013; Belliart-Guérin & Planche, 2023; Laczó et al., 2021; Parizkova et al., 2020; Wesnes et al., 2014; Yassa et al., 2010), as they classify the lure as previous learned object.
Following the Chinese restaurant process, the impact of α on internal states decreases with trials, and meanwhile that of σx2 relatively increases in our reinstatement paradigm. AppNL-G-F mice in 12-month-old with lower σx2 favored to differentiate similar observations into unique memories, especially in later phases, due to the decreased posterior probabilities of existing latent causes. Indeed, AppNL-G-F mice had a larger number of latent causes acquired after the extinction with posterior probabilities (Fig. 4E), resulting in a limited update of the associative weight of the existing latent causes by unsignaled shock (Fig. 4D), and thereby lower CR in test 3 (Fig. 2E, 2H). These results may explain the impaired ability to transfer previously learned association rules could be due to misclassified them as separated knowledge in preclinical autosomal dominant AD mutation carriers and mild AD patients (Bódi et al., 2009; Petok et al., 2018). Such overdifferentiation due to lower σx2 might underlie delusions, a common neuropsychiatric symptom reported in AD dementia (Kumfor et al., 2022), as an extended latent cause model could simulate the emergence of delusional thinking (Erdmann & Mathys, 2022). Thus, the deficit in reinstatement of conditioned fear memory in the 12-month-old AppNL-G-F mice could be due to their overgeneralization or overdifferentiation of observations into memories, and consequently the competing memories were not retained through the memory modification process (Fig. 4 and 5).
Our study demonstrated that estimating internal states with the parameters of the latent cause model provided additional explanations for the cognitive differences between control and AppNL-G-F mice. It is undoubted that conventional behavioral tests for AD patients or AD mouse models have provided evidence of cognitive decline and are useful for stratification in clinical trials. Nonetheless, it is also possible that certain deficits have antecedently emerged in internal states, even if behavioral deficits are not obvious as in 6-month-old AppNL-G-F mice. The computational phenotype was proposed by Montague et al. (2012) and defined as parameters of a computational model being in conjunction with measurable behavioral or biological phenotypes. In this sense, α and σx2 in the latent cause model might satisfy this definition in this study. Computational phenotype was established to fill explanatory gaps between biological and psychological evidence in psychiatry, which would be informative likewise in AD studies.
The misclassification prevents AppNL-G-F mice from retaining competing memories
We further tested whether the deficit in the AppNL-G-F mice observed in the reinstatement experiments can be replicated in the reversal Barnes maze paradigm. The AppNL-G-F and control mice exhibited similar learning curves in the initial training phase regardless of age (Fig. 6 – Supplement 1A and 3A). These data align with previous studies that did not find a significant spatial learning deficit in AppNL-G-F mice up to 10 months old by using the Morris water maze, a common spatial learning task relevant to the Barnes maze (Latif-Hernandez et al., 2019; Saifullah et al., 2020; Whyte et al., 2018). Although AppNL-G-F mice were previously reported to have longer latency and make more errors during training in the Barnes maze (Broadbelt et al., 2022; Sakakibara et al., 2018), these studies haven’t described the mice’s navigation strategies to explain such deficits.
During the reversal training, AppNL-G-F mice displayed behavioral flexibility as control mice did based on the conventional analysis (Fig. 6 – Supplement 2B and 3B), similar to findings in Sakakibara et al. (2018). However, our strategy analysis revealed that this comparable performance was achieved in a sub-optimal way. The usage of confirmatory strategy such that mice selectively explored the first and the second target hole was significantly higher in the control mice during the reversal training phase, especially in the 12-month-old (Fig. 6B, 6F). After the reversal learning, the confirmatory strategy would become the optimal strategy at the first trial in each day to know today’s goal location. In contrast, the AppNL-G-F mice primarily took the perimeter search strategy such that mice explored the target hole in a “brute-force” manner (Fig. 6B, 6F), consistent with deficits in navigation ability found in preclinical AD patients (Allison et al., 2016; Coughlan et al., 2018). These results suggest that the control mice solved the task by inferring the two competing spatial memories representing the two possible target holes, whereas the AppNL-G-F mice solved the task not relying on selective spatial memories but the perimeter strategy, while both mice were given the same spatial cues. Thus, our hypothesis that the AppNL-G-F mice have a deficit in classifying observed cues into two or more competing memories in the same context would be supported by the results of the reversal Barnes maze experiment.
As the latent cause model was built on the fear conditioning paradigm and does not explicitly assume the contribution of other cognitive functions such as attention or working memory, this study provides no direct evidence as to which model parameters are in conjunction with the behavioral outcomes in the reversal Barnes maze and which parameters involve other the cognitive functions that are presumably affected by AD pathology (Finke et al., 2013; Kirova et al., 2015; Malhotra, 2019). To evaluate them, further studies would be required to integrate computational models encompassing different cognitive functions, such as multiple successor representations (Madarasz & Behrens, 2019) or hidden state inference (Sanders et al., 2020) for spatial learning, upon more fundamental principles, such as the predictive coding in a hierarchical neural network (Kocagoncu et al., 2021). We believe that such models would not only explain a wide variety of symptoms in dementia and AD in a common framework but also predict disease progression or effects of interventions for various cognitive functions.
Putative brain regions involving the altered internal states in the AppNL-G-F mice
This study did not explicitly provide evidence about neural mechanisms underlying the defect in memory classification in AppNL-G-F mice. Nonetheless, we speculated that possible neural circuits are presumably involved in the results reported here according to the findings of the previous studies.
It is well known that the amygdala, the hippocampus, and the medial prefrontal cortex are critical in regulating fear memory and its extinction (Maren et al., 2013). In addition, the ventral tegmental area (VTA) plays a modulatory role with its wide projection to these brain regions (Beier et al., 2015; Cai & Tong, 2022). The dysfunction in the abovementioned brain regions may cause the defect of reinstatement. Indeed, previous studies have shown that the inhibition of dopaminergic signal from VTA to the infralimbic cortex (Hitora-Imamura et al., 2015) or inhibition of glutamatergic signal from VTA to the dorsal hippocampus (Han et al., 2020) during unsignaled shock reduced the reinstatement of fear memory in mice.
Although the neural implementation of the latent cause model has not been demonstrated yet, contributions of certain neuronal populations are expected. It is well established that prediction error in classical or operant conditioning learning is computed in the VTA (Lerner et al., 2021). The prediction error is signaled from the VTA to the hippocampal CA1 via the dopaminergic projections, while the VTA receives a feedback signal from the CA1 (Lisman & Grace, 2005). The pattern separation and completion were implemented in the dentate gyrus (DG) and the CA3 in the hippocampus, respectively (Neunuebel & Knierim, 2014). As Sanders et al. (2020) pointed out, one might find an analogy between the process of pattern completion/separation and the inference of old/new latent cause, in the sense that both consist of a generalization/differentiation process to assign a current observation to either the same class with the previous observations or a novel class. The dysfunction in this circuit has long been implicated in memory problems in aging and AD (Palmer & Good, 2011; Wilson et al., 2006). Gershman et al. (2017) predicted that activities of newborn neurons in the DG and neurons in the CA3 correlate with the prior probability over latent causes, while the dopaminergic projection from the VTA to the hippocampal CA1 represents the posterior probability over latent causes. In other words, prediction error signals from the VTA to the CA1 might serve to compute the likelihood of the observation given the latent causes. The feedback from the CA1 to the VTA might signal posterior probability over latent causes and then modulate the extent of associative weight update.
Different lines of evidence suggest that hippocampus functions were affected in AppNL-G-F mice, including lower neuronal firing rates in the CA1 region (Inayat et al., 2023), degradation of gamma oscillation power in the CA3 region (Arroyo-García et al., 2021), and disrupted place cell remapping in CA1 region (Jun et al., 2020). While VTA dysfunction was not reported in AppNL-G-F mice, studies using Tg2576 mice, which overexpress mutated APP, have shown that neurodegeneration in the VTA occurred at an early age and consequent dysfunction was correlated with impairments in the hippocampus (Nobili et al., 2017; Spoleti et al., 2024). Given the evidence above, dysfunctions caused by pathological Aβ in the hippocampus and VTA are potential candidates underlying unsuccessful latent cause inference in AppNL-G-F mice, leading to a lower extent of reinstatement. Although this study considered limited disease factors, cognitive impairments in AD patients result from a combination of neuropathological factors and cannot be ascribed solely to Aβ deposition in clinical practice. Examining whether the interaction of neuropathology can be seen in the internal state with a mouse model including tau pathology, e.g., AppNL-G-F/MAPT double knock-in mice (Hashimoto et al., 2019; Saito et al., 2019), would improve the translational interpretation in this study.
Conclusion
Uncovering the underdeveloped symptoms in the AD continuum is critical to early detection and intervention by filling the explanatory gap between the pathological states and the cognitive decline in AD. We expect that seeking internal state as well as parameters in an internal model in conjunction with behavioral or biological phenotypes could provide in-depth explanations about behavioral outcomes in AD beyond assessing behavior alone. Our study is the first to report that the alteration in the internal state of AppNL-G-F mice biased their memory modification process toward overgeneralization or overdifferentiation.
Material and methods
Animals
Male AppNL-G-F homozygous knock-in mice (abbreviated as AppNL-G-F mice in this study) and age-matched control mice were subjected to behavioral tests at 6 and 12 months old. Cohort and genotype information was provided in Table S1. We decided to assign at least 20 mice for each group (6-month-old AppNL-G-F mice, 6-month-old control mice, 12-month-old AppNL-G-F mice, and 12-month-old control mice) on average when designing the reinstatement experiment (see below). Animals were housed under a 12-h light/dark cycle in a temperature-controlled environment (23–24°C temperature and 40%–50% relative humidity) with food pellets (Japan SLC) and water provided ad libitum. Behavioral experiments were performed during the dark cycle (8 am–8 pm). After the behavioral experiments, mice were sacrificed by cervical dislocation unless their brains were sampled. All animal procedures were performed in accordance with the Kyoto University animal care committee’s regulations (approval number: Lif-K24014).
Behavioral test
Reinstatement paradigm in fear conditioning
The reinstatement of conditioned fear response after a reminder of unconditioned stimulus is observed in different species (Hermans et al., 2005; Hitora-Imamura et al., 2015; Monfils et al., 2009). In the present study, the four-chamber fear conditioning testing system (O’HARA & CO., LTD, Tokyo, Japan) was used to deliver tone and electrical foot shock, and the top view camera in the chamber recorded mouse behavior at 2 Hz, as previously described (Tachiki et al., 2023). The procedure contained the following phases (Fig. 2A): acquisition (day 1), test 1 (day 2), extinction (day 3 and day 4), test 2 (day 5), unsignaled shock (day 8), and test 3 (day 9). The context (acrylic box) was the same throughout the test. Mice were transferred to the experimental room from the breeding room and then habituated for 30 min in their homecage in a rack until the phase started. In each phase, mice were moved between their homecage and the chamber via a transporting cage filled with woodchips. In the acquisition phase, a 60-s tone (65 dB, 10 kHz) as the conditioned stimulus (CS) was presented 300 s later since the mice were released in the chamber. The last 2 seconds of the CS were accompanied by an electrical foot shock (0.3 mA) as the unconditioned stimulus (US). This CS-US association was repeated 3 times with a 90-s interval. After the last US, the mice were left in the chamber for 140 s and then returned to the homecage. In the test phases, a 60-s tone was given 120 s after the mice entry. In the extinction phase, only a 60-s tone was given 19 times on day 3 and 10 times on day 4 with a 200-s interval. In the unsignaled shock phase, an electrical foot shock (0.7 mA) was given 10 min after the mice were placed into the chamber. After the electrical shock, the mice were left in the chamber for 5 min and then transferred back to the homecage. The chamber and grid were cleaned with 70% propanol before trials started. The freezing behavior was measured as a conditioned response (CR), and the criterion for freezing is defined as the change of detected mouse area being less than 10 pixels within 1 second. The freezing rate was calculated as the percentage of time spent freezing during the tone presentation (CS). To evaluate how belief in the mice changed in each test phase, the discrimination index (DI) is calculated as CR(t) / [CR(t) + CR(t’)], where CR indicates freezing rate during CS presentation; t and t’ indicate any two test phases.
Barnes maze
Barnes maze is a commonly used behavioral task to examine spatial learning and reference memory in Alzheimer’s disease model mice (Webster et al., 2014). The apparatus and setting were the same as previously reported (Suzuki & Imayoshi, 2017; Tachiki et al., 2023)(Bio-Medica, Osaka, Japan). The maze consisted of a circular open arena (98 cm in diameter) with 12 holes equally spaced around the perimeter. A black iron escape box was placed under one of the holes as a goal during the training phase. The location of the goal stayed constant for a given mouse but was randomized across mice. Four unique visual cues were located at the outside of the arena. Behavior during the trials was recorded using a GigE Vision camera (UI-5240SE-NIR; IDS Imaging Development Systems GmbH, Obersulm, Germany) mounted on the top of the maze to estimate the mouse’s position in each frame. All programs used for data acquisition, processing, saving, and synchronized device controls were written in LabVIEW 2013 (National Instruments, TX, USA). The procedure contained the following phases (Fig. 6A): habituation (day 0), initial training (day 1 to day 6), probe test 1 (day 7), retraining (day 8), reversal training 1 (day 9 to day 11), reversal training 2 (day 12 to day 14) and probe test 2 (day15). Mice were transferred to the experimental room from the breeding room and then waited for 30 min in their homecage in a rack by the phase started. In the habituation phase, mice were released to the center of the field and could freely explore the field for 5 min. After the trial, the mice were moved into the escape box for another 5 min, then back to the homecage. In the training phase, the mice had to explore the field until they successfully got into the escape box within 10 min. Then, they were returned to the homecage and on standby for the next trial. In the probe test, the procedure was the same as the habituation phase except that the mouse was returned to the homecage directly after the field exploration. The field was cleaned with 70% ethanol thoroughly, and the paper bedding in the escape box was changed per trial to reduce the effect of possible olfactory cues in the maze.
The analysis of the Barnes maze was performed on the custom code based on our previous study (Suzuki & Imayoshi, 2017) with some modifications in strategy analysis. The code for strategy analysis is freely available online at https://github.com/suzuki-yusuke/BM_analysis.git/. In brief, the conventional analysis included the number of errors (count to visit holes other than the goal), latency, and travel distance from start to goal in each trial in the training phase, and time spent around each hole in the probe test. The strategy analysis was performed to qualitatively evaluate what kind of exploratory strategies were used by mice based on their belief in each trial (Suzuki & Imayoshi, 2017; Tachiki et al., 2023). For each mouse, the trajectory from start to goal in each trial was automatically assigned to any one of the exploration strategies following the definition described in Figure 6 – Supplement 2. In this study, confirmatory and perimeter strategies were assumed in addition to the strategies previously reported. The former is the one such that the mice visit two holes, the current or past goal located opposite in the field, as if they have two memories such that the goal exists in each of the two holes. The latter is the one such that the mice find the goal simply running along with the holes, as if they do not have memories that the goal exists in a specific hole. To evaluate how the belief in the mice changed after the reversal training, the discrimination index (DI) is calculated as ER(t2) / [ER(t1) + ER(t2)], where ER indicates the exploration rate; t1 and t2 indicate probe test 1 and probe test 2. The ER was calculated as time spent around the hole divided by the maximum exploration time across all holes in each mouse. The DI is calculated for the first target hole (hole 1) and the second target hole (hole 7).
Simulating reinstatement and estimating parameters in the latent cause model
As the latent cause model serves as a generative model, it allows us to simulate the reinstatement experiment while exploring a set of model parameters, minimizing the prediction error between traces of simulated and observed CR for each mouse. The observed trace of CR across all trials in each mouse was initially normalized. In the simulation, the reinstatement experiment was replicated as follows. The number of trials was the same with our experiment: trials 1 to 3 was the acquisition phase, trial 4 was the test 1 phase, trials 5 to 23 was the first extinction phase, trials 24 to 33 was the second extinction phase, trial 34 was the test 2 phase, trial 35 was the unsignaled shock phase, and trial 36 was the test 3 phase. All trials contained stimuli, while the acquisition and the unsignaled shock trials contained the US. The stimulus given a trial was represented as a two-dimensional vector, where the first and the second dimensions indicated the CS and the context, respectively. If the CS was presented at the phases except for the unsignaled shock, the value got 1; otherwise, 0. Because the context was always presented, the value of context was given 0.2 for all trials. The US was represented as a binary value, either 1 at the acquisition and the unsignaled shock or 0 otherwise. The temporal distance of 24 h between each phase was set to be 20, the same as in the original study (Gershman et al., 2017).
Ten model parameters to be estimated were concentration parameter (α), temporal scaling parameter (g), learning rate (η), maximum number of iterations in the EM algorithm (max. no. of iteration), initial associative weight (w0), variance of the US (σr2) and the stimulus (σx2), response threshold (θ), response gain (λ), and maximum number of latent causes (K), where g is not explicitly described in the original model (Gershman et al., 2017) while this is implemented in their source code (https://github.com/sjgershm/memory-modification.git). They were estimated through the slice sampling algorithm in the Markov chain Monte Carlo method (Neal, 2003). In implementation, we used the MATLAB function, slicesample. The initial guess and upper and lower bounds of each parameter were listed in Table S21. The initial size of the sampling window for each parameter was the difference between the upper and lower bounds of the parameter divided by 50. In each sampling step, a set of parameters was proposed, and then a CR trace was simulated in the virtual reinstatement experiment. For each parameter, one sample was drawn for every 100 steps in 200000 times sampling, then the first 1000 samples were discarded as burn-in, and eventually 1000 samples were left. The convergence of the sample chain was determined if the Gelman-Rubin statistic was smaller than 1.1. The expected value of each parameter was considered to be the maximum bin in the sample histogram, for which the bin width was statistically optimized by the method proposed by Shimazaki & Shinomoto (2007). As max. no. of iteration and K are integers, each expected value was rounded toward infinity. Given the expected parameters, the likelihood of squared errors between the simulated and the observed CR trace was evaluated on a half-normal distribution (μ = 0, σ = 0.3) so that increasing error decreased the likelihood. If the likelihoods of errors were smaller than the thresholds (1 for acquisition trials; 0.25 for tests 1, 2, and 3 trials; 0.5 for remaining trials) in all trials and the Gelman-Rubin statistic < 1.1, the sampling procedure above was terminated. Otherwise, it was continued while keeping the sample chain and setting the initial guess as the last value of the sampling chain. If the procedure was not terminated within 20 iterations, the observed CR trace was classified as an anomaly. The code described above is freely available online at https://github.com/suzuki-yusuke/comp_modeling_lcm.git.
Given the estimated parameters for each mouse, their internal state and CR were simulated for every trial. The internal states consisted of latent causes, their posterior probabilities, and associative weight. The latent causes were categorized based on which phase they were initially inferred: acquisition trials, extinction trials (i.e., test 1, extinction, test 2), and trials after extinction (i.e., the unsignaled shock and test 3).
Immunohistochemistry
After the designated behavioral test, mice were anesthetized by intraperitoneal injection with a cocktail of 0.3 mg/kg medetomidine (Nippon Zenyaku Kogyo, Koriyama, Japan), 4.0 mg/kg midazolam (Sandoz, Minato, Japan), and 5.0 mg/kg butorphanol (Meiji Seika Pharma, Chuo, Japan) and transcardially perfused phosphate buffer saline (PBS) followed by 4% paraformaldehyde (PFA). Brain samples were preserved in 4% PFA overnight, and the solution was replaced with 30% sucrose in PBS. Then, the sample was embedded in the OCT compound (Tissue TEK, Sakura finetek Japan, Chuo, Japan) and sectioned at 30 μm (CM1950, Leica Biosystems, Tokyo, Japan). Every six sections were collected. Ten coronal brain sections from the anterior part (AP = +2.2 ∼ +1 mm from Bregma) and the posterior part (AP = -1.5∼-2.5 mm from Bregma) were subjected to staining in a 12-well plate. Sections were washed in PBS with 0.3% Triton X-100 (PBST) to remove excessive OCT compound, then blocked in 5% normal donkey serum (NDS) in PBST for an hour at room temperature and incubated with primary antibodies on a shaker at 4°C overnight. The following day, sections were washed in PBST and reacted with second antibodies and DAPI for 2 hours at room temperature. Finally, the sections were washed in PBST, mounted on slides, and coverslipped with antifade reagent (FluoroMount-G®, SoutherBiotech, Cat#0100-01). Primary antibodies, including mouse anti-β-Amyloid (1:1000, Biolegend, Cat#803001, RRID:AB_2564653), rabbit anti-Iba1 (1:1000, FUJIFILM Wako Pure Chemical Corporation, Cat#019-19741, RRID:AB_839504), and rat anti-GFAP (1:1000, Invitrogen, Cat#13-0300, RRID:AB_2532994), were prepared in PBST with 1% NDS. Second antibodies against mouse Alexa Fluor-568nm (Invitrogen, Cat#A10037, RRID:AB_11180865), rabbit Alexa Fluor-647nm (Invitrogen, Cat#A31573, RRID:AB_2536183), and rat Alexa Fluor-488nm (Invitrogen, Cat#A21208, RRID:AB_2535794) were used in a dilution of 1:500 in PBST with 1% NDS. Fluorescent immunohistochemistry slides were imaged on a fluorescence microscope (BZ-X800, Keyence, Japan) with a 10x objective, and the stitching function was used to cover the entire section. Images were stitched in built-in analysis software (Keyence, Japan). The Aβ plaque area of each brain was quantified under QUINT workflow (Yates et al., 2019). First, a series of brain section images were registered to Allen Brain Atlas CCFv3 on QuickNII (Puchades et al., 2019) and fine-adjusted on VisuAlign (https://www.nitrc.org/projects/visualign). Next, the Aβ staining channel images were segmented in Ilastik (Berg et al., 2019). Finally, the integration of registration and segmented results was performed in Nutil (Groeneboom et al., 2020) to quantify Aβ plaque in selected brain regions. The Aβ plaque area (%) is calculated as the percentage of positive Aβ staining area covering the whole coronal brain section.
Statistical analysis
For the fear conditioning, extinction, and reinstatement results, samples with freezing rates in test phases exceeding the 1.5 times interquartile range of the group were treated as outliers and excluded for further statistical analysis. The one-way ANCOVA was performed to analyze genotype effects on the freezing rate and DI, with age as a covariate. The Student’s t-test was performed to compare the freezing rate and the DI between control and AppNL-G-F mice within the same age. To test whether the DI differs from 0.5, the one-sample t-test was performed within the group. To compare the estimated parameters and properties of latent causes between control and AppNL-G-F mice within the same age, the Mann-Whitney U test was performed, as the Shapiro-Wilk normality test reported a violation of normality assumption in the data. The Spearman’s rank correlation coefficient between the estimated model parameters and the DIs between test 3 and test 1 phase were calculated.
For the reversal Barnes maze results, a mixed-designed two-way [Genotype (control, AppNL-G-F) × Day (1-6, 9-11, or 12-14)] ANOVA was performed for the number of errors, the latency, and the travel distance in the initial and reversal training. A mixed-designed two-way [Genotype (control, AppNL-G-F) × Hole (1-12)] ANOVA was performed for the time spent around each hole in the probe test 1 and 2. If a significant interaction between genotype and hole was detected, Tukey’s HSD test at a specific hole between control and AppNL-G-F was performed. The Wilcoxon rank sum test was performed to compare the use of each strategy on each training day between control and AppNL-G-F within the same age. To test whether the DI between probe tests 1 and 2 differs from 0.5, the Wilcoxon signed-rank test was performed within the group, as the Shapiro-Wilk normality test reported a violation of the normality assumption in the data. The Mann-Whitney U test was performed to compare the DI between control and AppNL-G-F mice within the same age.
All statistical analyses were done in JASP 0.18.3.0 (JASP team, 2024) and MATLAB R2022b (MathWorks Inc., MA, USA). The significance level for all statistical tests was set at 0.05.
Acknowledgements
We thank Adam T. Guy, Ph.D. and Shigeru Shinomoto, Ph.D. for informative comments on the manuscript. We also thank all the members of the Imayoshi lab for their support. We acknowledge ChatGPT-4 and Google Gemini for their contributions, which were selectively referenced during the Python code refactoring process for visualization and drawing mice artwork in Figure 1.
Additional information
Author Contributions
M.H.: Writing – original draft, Conceptualization, Formal analysis, Investigation, Methodology, Visualization
Y.S.: Writing – review & editing, Conceptualization, Formal analysis, Methodology, Software, Supervision
H.S.: Writing – review & editing, Resource
T.S.: Writing – review & editing, Resource
T.C.S.: Resource
I.I.: Writing – review & editing, Funding acquisition, Resource, Supervision
Funding sources
This work was supported by the Japan Society for the Promotion of Science (JSPS), Grant-in-Aid for Early-Career Scientists JSPS 19K16292 (to Y.S.), Grant-in-Aid for Scientific Research (B) JSPS 21H02485 (to I.I.) from the Ministry of Education, Culture, Sports, Science and the Technology of Japan (MEXT); by Japan Science and Technology Agency (JST) CREST program Grants JPMJCR1921 (to I.I.) and JPMJCR1752 (to I.I.); and by the Program for Technological Innovation of Regenerative Medicine Grant 21bm0704060h0001 and 24bm1123049h0001 (to I.I.) and Brain/MINDS Grant 21dm0207090h0003 (to I.I.) from the Japanese Agency for Medical research and Development (AMED).
Additional files
References
- 1.Spatial Navigation in Preclinical Alzheimer’s DiseaseJournal of Alzheimer’s Disease : JAD 52:77–90https://doi.org/10.3233/JAD-150855
- 2.Pattern separation and pattern completion in Alzheimer’s disease: Evidence of rapid forgetting in amnestic mild cognitive impairmentHippocampus 23:1246–1258https://doi.org/10.1002/hipo.22162
- 3.Prodromal Alzheimer’s disease: Successive emergence of the clinical symptomsAnnals of Neurology 64:492–498https://doi.org/10.1002/ana.21509
- 4.Impaired spike-gamma coupling of area CA3 fast-spiking interneurons as the earliest functional impairment in the AppNL-G-F mouse model of Alzheimer’s diseaseMolecular Psychiatry 26:5557–5567https://doi.org/10.1038/s41380-021-01257-0
- 5.Memory deficits associated with senescence: A neurophysiological and behavioral study in the ratJournal of Comparative and Physiological Psychology 93:74–104https://doi.org/10.1037/h0077579
- 6.Alzheimer’s disease first symptoms are age dependent: Evidence from the NACC datasetAlzheimer’s & Dementia : The Journal of the Alzheimer’s Association 11:1349–1357https://doi.org/10.1016/j.jalz.2014.12.007
- 7.Circuit Architecture of VTA Dopamine Neurons Revealed by Systematic Input–Output MappingCell 162:622–634https://doi.org/10.1016/j.cell.2015.07.015
- 8.Mnemonic Discrimination Performance in a Memory Clinic: A Pilot StudyJournal of Alzheimer’s Disease 94:1527https://doi.org/10.3233/JAD-230221
- 9.ilastik: Interactive machine learning for (bio)image analysisNature Methods 16:1226–1232https://doi.org/10.1038/s41592-019-0582-9
- 10.Application of Digital Cognitive Biomarkers for Alzheimer’s Disease: Identifying Cognitive Process Changes and Impending Cognitive DeclineThe Journal of Prevention of Alzheimer’s Disease 8:123–126https://doi.org/10.14283/jpad.2020.63
- 11.Associative Learning, Acquired Equivalence, and Flexible Generalization of Knowledge in Mild Alzheimer DiseaseCognitive and Behavioral Neurology 22:89https://doi.org/10.1097/WNN.0b013e318192ccf0
- 12.Impairment in novelty-promoted memory via behavioral tagging and capture before apparent memory loss in a knock-in model of Alzheimer’s diseaseScientific Reports 12:22298https://doi.org/10.1038/s41598-022-26113-1
- 13.Behavioral Flexibility: Attentional Shifting, Rule Switching, and Response ReversalIn: Encyclopedia of Psychopharmacology Springer pp. 1–7https://doi.org/10.1007/978-3-642-27772-6_340-2
- 14.Anatomy and Function of Ventral Tegmental Area Glutamate NeuronsFrontiers in Neural Circuits 16:867053https://doi.org/10.3389/fncir.2022.867053
- 15.Spatial navigation deficits—Overlooked cognitive marker for preclinical Alzheimer disease?Nature Reviews Neurology 14:496–506https://doi.org/10.1038/s41582-018-0031-x
- 16.Electroacupuncture promotes neurogenesis in the dentate gyrus and improves pattern separation in an early Alzheimer’s disease mouse modelBiological Research 56:65https://doi.org/10.1186/s40659-023-00472-z
- 17.Heterogeneity in Alzheimer’s Disease Diagnosis and Progression Rates: Implications for Therapeutic TrialsNeurotherapeutics 19:8–25https://doi.org/10.1007/s13311-022-01185-z
- 18.Intranasal delivery of pro-resolving lipid mediators rescues memory and gamma oscillation impairment in AppNL-G-F/NL-G-F miceCommunications Biology 5:245https://doi.org/10.1038/s42003-022-03169-3
- 19.A generative framework for the study of delusionsSchizophrenia Research 245:42–49https://doi.org/10.1016/j.schres.2020.11.048
- 20.A biased competition account of attention and memory in Alzheimer’s diseasePhilosophical Transactions of the Royal Society B: Biological Sciences 368:20130062https://doi.org/10.1098/rstb.2013.0062
- 21.Assessment of spatial learning and memory in the Barnes maze task in rodents—Methodological considerationNaunyn-Schmiedeberg’s Archives of Pharmacology 392:1–18https://doi.org/10.1007/s00210-018-1589-y
- 22.A Tutorial on Bayesian Nonparametric Models (No. arXiv:1106.2697)arXiv https://doi.org/10.48550/arXiv.1106.2697
- 23.Context, learning, and extinctionPsychological Review 117:197–209https://doi.org/10.1037/a0017808
- 24.Individual differences in learning predict the return of fearLearning & Behavior 43:243–250https://doi.org/10.3758/s13420-015-0176-z
- 25.The computational nature of memory modificationeLife 6:e23763https://doi.org/10.7554/eLife.23763
- 26.Temporal unfolding of declining episodic memory on the Free and Cued Selective Reminding Test in the predementia phase of Alzheimer’s disease: Implications for clinical trialsAlzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring 10:161–171https://doi.org/10.1016/j.dadm.2017.12.004
- 27.Nutil: A Pre- and Post-processing Toolbox for Histological Rodent Brain Section ImagesFrontiers in Neuroinformatics 14https://doi.org/10.3389/fninf.2020.00037
- 28.Excitatory VTA to DH projections provide a valence signal to memory circuitsNature Communications 11:1466https://doi.org/10.1038/s41467-020-15035-z
- 29.Tau binding protein CAPON induces tau aggregation and neurodegenerationNature Communications 10:2394https://doi.org/10.1038/s41467-019-10278-x
- 30.The promise of a model-based psychiatry: Building computational models of mental ill healthThe Lancet Digital Health 4:e816–e828https://doi.org/10.1016/S2589-7500(22)00152-2
- 31.Reinstatement of fear responses in human aversive conditioningBehaviour Research and Therapy 43:533–551https://doi.org/10.1016/j.brat.2004.03.013
- 32.Prefrontal dopamine regulates fear reinstatement through the downregulation of extinction circuitseLife 4:e08274https://doi.org/10.7554/eLife.08274
- 33.Computational psychiatry as a bridge from neuroscience to clinical applicationsNature Neuroscience 19:404–413https://doi.org/10.1038/nn.4238
- 34.Real-world prediction of preclinical Alzheimer’s disease with a deep generative modelArtificial Intelligence in Medicine 144:102654https://doi.org/10.1016/j.artmed.2023.102654
- 35.Weak-hyperactive hippocampal CA1 neurons in the prodromal stage of Alzheimer’s disease in hybrid App NL-G-F/NL-G-F × Thy1-GCaMP6s+/− mice suggest disrupted plasticityNeurobiology of Aging 130:154–171https://doi.org/10.1016/j.neurobiolaging.2023.06.002
- 36.Revised criteria for diagnosis and staging of Alzheimer’s disease: Alzheimer’s Association WorkgroupAlzheimer’s & Dementia 20:5143–5169https://doi.org/10.1002/alz.13859
- 37.NIA-AA Research Framework: Toward a biological definition of Alzheimer’s diseaseAlzheimer’s & Dementia 14:535–562https://doi.org/10.1016/j.jalz.2018.02.018
- 38.Practical considerations for choosing a mouse model of Alzheimer’s diseaseMolecular Neurodegeneration 12:89https://doi.org/10.1186/s13024-017-0231-7
- 39.Disrupted Place Cell Remapping and Impaired Grid Cells in a Knockin Model of Alzheimer’s DiseaseNeuron 107:1095–1112https://doi.org/10.1016/j.neuron.2020.06.023
- 40.Why a clinical trial is as good as its outcome measure: A framework for the selection and use of cognitive outcome measures for clinical trials of Alzheimer’s diseaseAlzheimer’s & Dementia : The Journal of the Alzheimer’s Association 19:708–720https://doi.org/10.1002/alz.12773
- 41.Working Memory and Executive Function Decline across Normal Aging, Mild Cognitive Impairment, and Alzheimer’s DiseaseBioMed Research International 2015:748212https://doi.org/10.1155/2015/748212
- 42.Alzheimer diseaseNature Reviews Disease Primers 7:1–21https://doi.org/10.1038/s41572-021-00269-y
- 43.Memory and Reward-Based Learning: A Value-Directed Remembering PerspectiveAnnual Review of Psychology 73:25–52https://doi.org/10.1146/annurev-psych-032921-050951
- 44.Evidence and implications of abnormal predictive coding in dementiaBrain 144:3311https://doi.org/10.1093/brain/awab254
- 45.Examining the presence and nature of delusions in Alzheimer’s disease and frontotemporal dementia syndromesInternational Journal of Geriatric Psychiatry 37https://doi.org/10.1002/gps.5692
- 46.Integrated analysis of behavioral, epigenetic, and gut microbiome analyses in AppNL-G-F, AppNL-F, and wild type miceScientific Reports 11:4678https://doi.org/10.1038/s41598-021-83851-4
- 47.Spatial Pattern Separation Testing Differentiates Alzheimer’s Disease Biomarker-Positive and Biomarker-Negative Older Adults With Amnestic Mild Cognitive ImpairmentFrontiers in Aging Neuroscience 13https://doi.org/10.3389/fnagi.2021.774600
- 48.Subtle behavioral changes and increased prefrontal-hippocampal network synchronicity in APPNL−G−F mice before prominent plaque depositionBehavioural Brain Research 364:431–441https://doi.org/10.1016/j.bbr.2017.11.017
- 49.Development of a mnemonic discrimination task using naturalistic stimuli with applications to aging and preclinical Alzheimer’s diseaseLearning & Memory 26:219–228https://doi.org/10.1101/lm.048967.118
- 50.Genetic Alzheimer’s Disease Risk Affects the Neural Mechanisms of Pattern Separation in Hippocampal SubfieldsCurrent Biology 30:4201–4212https://doi.org/10.1016/j.cub.2020.08.042
- 51.An application of multinomial processing tree models and Bayesian methods to understanding memory impairmentJournal of Mathematical Psychology 95:102328https://doi.org/10.1016/j.jmp.2020.102328
- 52.Bayesian Modeling of the Mnemonic Similarity Task Using Multinomial Processing TreesBehaviormetrika 50:517–539https://doi.org/10.1007/s41237-023-00193-3
- 53.Dopamine, Updated: Reward Prediction Error and BeyondCurrent Opinion in Neurobiology 67:123–130https://doi.org/10.1016/j.conb.2020.10.012
- 54.The Hippocampal-VTA Loop: Controlling the Entry of Information into Long-Term MemoryNeuron 46:703–713https://doi.org/10.1016/j.neuron.2005.05.002
- 55.Better Transfer Learning with Inferred Successor MapsAdvances in Neural Information Processing Systems 32https://papers.nips.cc/paper_files/paper/2019/hash/274a10ffa06e434f2a94df765cac6bf4-Abstract.html
- 56.Impairments of attention in Alzheimer’s diseaseCurrent Opinion in Psychology 29:41–48https://doi.org/10.1016/j.copsyc.2018.11.002
- 57.The contextual brain: Implications for fear conditioning, extinction and psychopathologyNature Reviews. Neuroscience 14:417–428https://doi.org/10.1038/nrn3492
- 58.Cognitive deficits in single App knock-in mouse modelsNeurobiology of Learning and Memory 135:73–82https://doi.org/10.1016/j.nlm.2016.07.001
- 59.Age-dependent behavioral and biochemical characterization of single APP knock-in mouse (APPNL-G-F/NL-G-F) model of Alzheimer’s diseaseNeurobiology of Aging 75:25–37https://doi.org/10.1016/j.neurobiolaging.2018.10.026
- 60.Extinction-Reconsolidation Boundaries: Key to Persistent Attenuation of Fear MemoriesScience 324:951–955https://doi.org/10.1126/science.1167975
- 61.Computational psychiatryTrends in Cognitive Sciences 16:72–80https://doi.org/10.1016/j.tics.2011.11.018
- 62.Global and regional projections of the economic burden of Alzheimer’s disease and related dementias from 2019 to 2050: A value of statistical life approacheClinicalMedicine 51https://doi.org/10.1016/j.eclinm.2022.101580
- 63.CA3 Retrieves Coherent Representations from Degraded Input: Direct Evidence for CA3 Pattern Completion and Dentate Gyrus Pattern SeparationNeuron 81:416–427https://doi.org/10.1016/j.neuron.2013.11.017
- 64.Dopamine neuronal loss contributes to memory and reward dysfunction in a model of Alzheimer’s diseaseNature Communications 8:14727https://doi.org/10.1038/ncomms14727
- 65.Latent cause inference during extinction learning in trauma-exposed individuals with and without PTSDPsychological Medicine 52:3834–3845https://doi.org/10.1017/S0033291721000647
- 66.Hippocampal synaptic activity, pattern separation and episodic-like memory: Implications for mouse models of Alzheimer’s disease pathologyBiochemical Society Transactions 39:902–909https://doi.org/10.1042/BST0390902
- 67.Spatial Pattern Separation in Early Alzheimer’s DiseaseJournal of Alzheimer’s Disease: JAD 76:121–138https://doi.org/10.3233/JAD-200093
- 68.Impairment of memory generalization in preclinical autosomal dominant Alzheimer’s disease mutation carriersNeurobiology of Aging 65:149–157https://doi.org/10.1016/j.neurobiolaging.2018.01.022
- 69.Computational Causal Modeling of the Dynamic Biomarker Cascade in Alzheimer’s DiseaseComputational and Mathematical Methods in Medicine 2019:e6216530https://doi.org/10.1155/2019/6216530
- 70.Understanding memory impairment with memory models and hierarchical Bayesian analysisJournal of Mathematical Psychology 55:47–56https://doi.org/10.1016/j.jmp.2010.08.003
- 71.Spatial registration of serial microscopic brain images to three-dimensional reference atlases with the QuickNII toolPLOS One 14:e0216796https://doi.org/10.1371/journal.pone.0216796
- 72.A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcementClassical Conditioning II: Current Research and Theory 2
- 73.Touchscreen-based location discrimination and paired associate learning tasks detect cognitive impairment at an early stage in an App knock-in mouse model of Alzheimer’s diseaseMolecular Brain 13:147https://doi.org/10.1186/s13041-020-00690-6
- 74.Single App knock-in mouse models of Alzheimer’s diseaseNature Neuroscience 17:5https://doi.org/10.1038/nn.3697
- 75.Humanization of the entire murine Mapt gene provides a murine model of pathological human tau propagationJournal of Biological Chemistry 294:12754–12765https://doi.org/10.1074/jbc.RA119.009487
- 76.Cognitive and emotional alterations in App knock-in mouse models of Aβ amyloidosisBMC Neuroscience 19:46https://doi.org/10.1186/s12868-018-0446-8
- 77.Hippocampal remapping as hidden state inferenceeLife 9:e51140https://doi.org/10.7554/eLife.51140
- 78.APP mouse models for Alzheimer’s disease preclinical studiesThe EMBO Journal 36:2473–2487https://doi.org/10.15252/embj.201797397
- 79.A Method for Selecting the Bin Size of a Time HistogramNeural Computation 19:1503–1527https://doi.org/10.1162/neco.2007.19.6.1503
- 80.Assessing cognition and function in Alzheimer’s disease clinical trials: Do we have the right tools?Alzheimer’s & Dementia 10:853–860https://doi.org/10.1016/j.jalz.2014.07.158
- 81.Toward defining the preclinical stages of Alzheimer’s disease: Recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s diseaseAlzheimer’s & Dementia 7:280–292https://doi.org/10.1016/j.jalz.2011.03.003
- 82.The Evolution of Preclinical Alzheimer’s Disease: Implications for Prevention TrialsNeuron 84:608–622https://doi.org/10.1016/j.neuron.2014.10.038
- 83.Dopamine neuron degeneration in the Ventral Tegmental Area causes hippocampal hyperexcitability in experimental Alzheimer’s DiseaseMolecular Psychiatry :1–16https://doi.org/10.1038/s41380-024-02408-9
- 84.Early-Stage Alzheimer’s Disease Affects Fast But Not Slow Adaptive Processes in Motor LearningeNeuro 11https://doi.org/10.1523/ENEURO.0108-24.2024
- 85.Network analysis of exploratory behaviors of mice in a spatial learning and memory taskPLoS ONE 12:e0180789https://doi.org/10.1371/journal.pone.0180789
- 86.Scale Space Calibrates Present and Subsequent Spatial Learning in Barnes Maze in MiceeNeuro 10https://doi.org/10.1523/ENEURO.0505-22.2023
- 87.Cognitive modeling of the Mnemonic Similarity Task as a digital biomarker for Alzheimer’s diseaseAlzheimer’s & Dementia 1https://doi.org/10.1002/alz.14163
- 88.Using mice to model Alzheimer’s dementia: An overview of the clinical disease and the preclinical behavioral changes in 10 mouse modelsFrontiers in Genetics 5https://doi.org/10.3389/fgene.2014.00088
- 89.Performance on a pattern separation task by Alzheimer’s patients shows possible links between disrupted dentate gyrus activity and apolipoprotein E ∈4 status and cerebrospinal fluid amyloid-β42 levelsAlzheimer’s Research & Therapy 6:20https://doi.org/10.1186/alzrt250
- 90.Impaired and preserved aspects of feedback learning in aMCI: Contributions of structural connectivityBrain Structure and Function 221:2831–2846https://doi.org/10.1007/s00429-015-1075-y
- 91.Reduction in open field activity in the absence of memory deficits in the AppNL−G−F knock-in mouse model of Alzheimer’s diseaseBehavioural Brain Research 336:177–181https://doi.org/10.1016/j.bbr.2017.09.006
- 92.Neurocognitive aging: Prior memories hinder new hippocampal encodingTrends in Neurosciences 29:662–670https://doi.org/10.1016/j.tins.2006.10.002
- 93.Few-shot prediction of amyloid β accumulation from mainly unpaired data on biomarker candidatesNPJ Systems Biology and Applications 9:59https://doi.org/10.1038/s41540-023-00321-5
- 94.High-resolution structural and functional MRI of hippocampal CA3 and dentate gyrus in patients with amnestic Mild Cognitive ImpairmentNeuroImage 51:1242–1252https://doi.org/10.1016/j.neuroimage.2010.03.040
- 95.QUINT: Workflow for Quantification and Spatial Analysis of Features in Histological Images From Rodent BrainFrontiers in Neuroinformatics 13https://doi.org/10.3389/fninf.2019.00075
- 96.Updates on mouse models of Alzheimer’s diseaseMolecular Neurodegeneration 19:23https://doi.org/10.1186/s13024-024-00712-0
- 97.Impairments of spatial memory in an Alzheimer’s disease model via degeneration of hippocampal cholinergic synapsesNature Communications 8:1676https://doi.org/10.1038/s41467-017-01943-0
Article and author information
Author information
Version history
- Sent for peer review:
- Preprint posted:
- Reviewed Preprint version 1:
Copyright
© 2025, Huang et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 39
- downloads
- 0
- citations
- 0
Views, downloads and citations are aggregated across all versions of this paper published by eLife.